From dde at twn.tuv.com  Wed Apr  1 00:20:40 2009
From: dde at twn.tuv.com (Denis Anthony Dowling/Twn/TUV)
Date: Wed, 1 Apr 2009 08:20:40 +0800
Subject: [Linux-cluster] Disable fencing in a non-shared storage cluster 
Message-ID: <OFC5A33424.3B3076FC-ON4825758B.00020E5A-4825758B.00021835@twn.tuv.com>

I'm trying to configure a high availability cluster for Squid. There will 
be no shared storage device. The problem relates to the time required for 
starting and stopping the fencing daemon. Is it possible just to disable 
this? 
I've tried the "clean_start=0" and "post_join_delay=-1" but without 
success.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090401/0bb9909c/attachment.htm>

From gianluca.cecchi at gmail.com  Wed Apr  1 15:07:06 2009
From: gianluca.cecchi at gmail.com (Gianluca Cecchi)
Date: Wed, 1 Apr 2009 17:07:06 +0200
Subject: [Linux-cluster] Can same cluster name in same subnet?
Message-ID: <561c252c0904010807v16e3d88av78ba899e5aa6c5d@mail.gmail.com>

Conversely, how is it dangerous to have two two-node-clusters with
different names sharing the intra-cluster network?
In particular if one is in production and the other is for testing?
And what about relative multicast-adresses for these two clusters? Can
I safely use same multicast if the names are different or do I have to
change? Ant rule in this case?

Thanks,
Gianluca


From ccaulfie at redhat.com  Wed Apr  1 15:13:20 2009
From: ccaulfie at redhat.com (Chrissie Caulfield)
Date: Wed, 01 Apr 2009 16:13:20 +0100
Subject: [Linux-cluster] Can same cluster name in same subnet?
In-Reply-To: <561c252c0904010807v16e3d88av78ba899e5aa6c5d@mail.gmail.com>
References: <561c252c0904010807v16e3d88av78ba899e5aa6c5d@mail.gmail.com>
Message-ID: <49D38490.6070300@redhat.com>

Gianluca Cecchi wrote:
> Conversely, how is it dangerous to have two two-node-clusters with
> different names sharing the intra-cluster network?
> In particular if one is in production and the other is for testing?
> And what about relative multicast-adresses for these two clusters? Can
> I safely use same multicast if the names are different or do I have to
> change? Ant rule in this case?
>

I would strongly advise against using the same multicast address for two
different cluster in the same subnet. Ideally all clusters should use
different multicast addresses.
-- 

Chrissie


From sdake at redhat.com  Wed Apr  1 17:29:37 2009
From: sdake at redhat.com (Steven Dake)
Date: Wed, 01 Apr 2009 10:29:37 -0700
Subject: [Linux-cluster] Can same cluster name in same subnet?
In-Reply-To: <561c252c0904010807v16e3d88av78ba899e5aa6c5d@mail.gmail.com>
References: <561c252c0904010807v16e3d88av78ba899e5aa6c5d@mail.gmail.com>
Message-ID: <1238606977.22887.15.camel@sdake-laptop>

On Wed, 2009-04-01 at 17:07 +0200, Gianluca Cecchi wrote:
> Conversely, how is it dangerous to have two two-node-clusters with
> different names sharing the intra-cluster network?
> In particular if one is in production and the other is for testing?
> And what about relative multicast-adresses for these two clusters? Can
> I safely use same multicast if the names are different or do I have to
> change? Ant rule in this case?
> 
> Thanks,
> Gianluca
> 

The multicast address/port always should be unique for each cluster or
bad things will happen.  It uniquely identifies the cluster and nothing
else matters for unique identification.

regards
-steve


> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From rhurst at bidmc.harvard.edu  Wed Apr  1 17:50:38 2009
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Wed, 01 Apr 2009 13:50:38 -0400
Subject: [Linux-cluster] Disable fencing in a non-shared storage
 cluster
In-Reply-To: <OFC5A33424.3B3076FC-ON4825758B.00020E5A-4825758B.00021835@twn.tuv.com>
References: <OFC5A33424.3B3076FC-ON4825758B.00020E5A-4825758B.00021835@twn.tuv.com>
Message-ID: <1238608238.18992.2.camel@WSBID06223.bidmc.harvard.edu>

Yes, if you are not using GFS, then why start the fencing daemon at all?
It's only required for GFS.


________________________________________________________________________


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.


On Wed, 2009-04-01 at 08:20 +0800, Denis Anthony Dowling/Twn/TUV wrote:

> 
> I'm trying to configure a high availability cluster for Squid. There
> will be no shared storage device. The problem relates to the time
> required for starting and stopping the fencing daemon. Is it possible
> just to disable this? 
> I've tried the "clean_start=0" and "post_join_delay=-1" but without
> success.
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090401/3cdb8c9a/attachment.htm>

From jakub.suchy at enlogit.cz  Wed Apr  1 18:04:46 2009
From: jakub.suchy at enlogit.cz (Jakub Suchy)
Date: Wed, 1 Apr 2009 20:04:46 +0200
Subject: [Linux-cluster] Disable fencing in a non-shared storage cluster
In-Reply-To: <1238608238.18992.2.camel@WSBID06223.bidmc.harvard.edu>
References: <OFC5A33424.3B3076FC-ON4825758B.00020E5A-4825758B.00021835@twn.tuv.com>
	<1238608238.18992.2.camel@WSBID06223.bidmc.harvard.edu>
Message-ID: <20090401180446.GC28473@galatea>

Robert,
even for HA cluster, you usually need Virtual IP to run your service.
This IP can have the same problem as GFS - you can't know if failed node
is really down or it's just a glitch -> you need fencing.

=> you need fencing everytime.

Best,
Jakub Suchy

> Yes, if you are not using GFS, then why start the fencing daemon at all?
> It's only required for GFS.
> 
> 
> On Wed, 2009-04-01 at 08:20 +0800, Denis Anthony Dowling/Twn/TUV wrote:
> 
> > 
> > I'm trying to configure a high availability cluster for Squid. There
> > will be no shared storage device. The problem relates to the time
> > required for starting and stopping the fencing daemon. Is it possible
> > just to disable this? 
> > I've tried the "clean_start=0" and "post_join_delay=-1" but without
> > success.
> > 


From teigland at redhat.com  Wed Apr  1 19:03:48 2009
From: teigland at redhat.com (David Teigland)
Date: Wed, 1 Apr 2009 14:03:48 -0500
Subject: [Linux-cluster] Disable fencing in a non-shared storage cluster
In-Reply-To: <20090401180446.GC28473@galatea>
References: <OFC5A33424.3B3076FC-ON4825758B.00020E5A-4825758B.00021835@twn.tuv.com>
	<1238608238.18992.2.camel@WSBID06223.bidmc.harvard.edu>
	<20090401180446.GC28473@galatea>
Message-ID: <20090401190348.GA28414@redhat.com>

On Wed, Apr 01, 2009 at 08:04:46PM +0200, Jakub Suchy wrote:
> Robert,
> even for HA cluster, you usually need Virtual IP to run your service.
> This IP can have the same problem as GFS - you can't know if failed node
> is really down or it's just a glitch -> you need fencing.
> 
> => you need fencing everytime.

You may want something to terminate the IP, but I wouldn't use the word
"fencing" to describe it, just to avoid confusion.  Fencing is explicitly
defined as disabling access to shared storage devices.

That said, you may be able to use the fencing capabilities to implement what
you need.

Dave


From crosa at redhat.com  Wed Apr  1 19:27:16 2009
From: crosa at redhat.com (Cleber Rodrigues)
Date: Wed, 01 Apr 2009 16:27:16 -0300
Subject: [Linux-cluster] Disable fencing in a non-shared storage cluster
In-Reply-To: <20090401190348.GA28414@redhat.com>
References: <OFC5A33424.3B3076FC-ON4825758B.00020E5A-4825758B.00021835@twn.tuv.com>
	<1238608238.18992.2.camel@WSBID06223.bidmc.harvard.edu>
	<20090401180446.GC28473@galatea>  <20090401190348.GA28414@redhat.com>
Message-ID: <1238614036.5711.4.camel@localhost.localdomain>

On Wed, 2009-04-01 at 14:03 -0500, David Teigland wrote:
> You may want something to terminate the IP, but I wouldn't use the word
> "fencing" to describe it, just to avoid confusion.  Fencing is explicitly
> defined as disabling access to shared storage devices.
> 

IMHO, "fencing" now means the pratical use of it. It might mean
disabling access to storage, but most of the time it means STONITH.

> That said, you may be able to use the fencing capabilities to implement what
> you need.
> 

And that would be...? Connecting to the (probably unreachable) machine
and shutting down its network interface? 

Call me paranoid, but I would always go for power cycling *first*.

> Dave
> 

-- 
Cleber Rodrigues <crosa at redhat.com>
Solutions Architect - Red Hat, Inc.


From Ed.Sanborn at genband.com  Thu Apr  2 03:51:23 2009
From: Ed.Sanborn at genband.com (Ed Sanborn)
Date: Wed, 1 Apr 2009 23:51:23 -0400
Subject: [Linux-cluster] Trouble after Openais upgrade to 0.80.3-22.el5
In-Reply-To: <alpine.DEB.2.00.0903311119100.23085@lxserv0.kfki.hu>
References: <alpine.DEB.2.00.0903262333350.4377@lxserv1.kfki.hu><20090330180739.GB6135@redhat.com>
	<alpine.DEB.2.00.0903311119100.23085@lxserv0.kfki.hu>
Message-ID: <593E210EDC38444DA1C17E9E9F5E264B98FC9A@GBMDMail01.genband.com>

I have RHEL 5.2  
I tried upgrading openais on a few of my nodes. Original version was
0.80.3-15.el5  and I upgraded to version  0.80.3-22.el5.
Now the node will not connect to the cluster.  I get the following error
in
/var/log/messages:

"unable to connect to cluster infrastructure"

Has anyone else run into this issue?  Is there a way around this besides
going back to 
the old version?

Ed


From corey.kovacs at gmail.com  Thu Apr  2 05:37:05 2009
From: corey.kovacs at gmail.com (Corey Kovacs)
Date: Thu, 2 Apr 2009 06:37:05 +0100
Subject: [Linux-cluster] Trouble after Openais upgrade to 0.80.3-22.el5
In-Reply-To: <593E210EDC38444DA1C17E9E9F5E264B98FC9A@GBMDMail01.genband.com>
References: <alpine.DEB.2.00.0903262333350.4377@lxserv1.kfki.hu>
	<20090330180739.GB6135@redhat.com>
	<alpine.DEB.2.00.0903311119100.23085@lxserv0.kfki.hu>
	<593E210EDC38444DA1C17E9E9F5E264B98FC9A@GBMDMail01.genband.com>
Message-ID: <7d6e8da40904012237r3c4fb8d0ga77e6aacc425f497@mail.gmail.com>

Ed, your upgrade requires you to downgrade your 5.3 machines to use
the 5.2 openais, or you upgrade all the nodes at the same time and
reboot the whole thing. There is an incompatibility between the two
versions of openais.

Have fun,,,

-Corey

To the list... Is this in a FAQ somewhere? This question seems to come
up quite often?


On Thu, Apr 2, 2009 at 4:51 AM, Ed Sanborn <Ed.Sanborn at genband.com> wrote:
> I have RHEL 5.2
> I tried upgrading openais on a few of my nodes. Original version was
> 0.80.3-15.el5 ?and I upgraded to version ?0.80.3-22.el5.
> Now the node will not connect to the cluster. ?I get the following error
> in
> /var/log/messages:
>
> "unable to connect to cluster infrastructure"
>
> Has anyone else run into this issue? ?Is there a way around this besides
> going back to
> the old version?
>
> Ed
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From kirbyzhou at sohu-rd.com  Thu Apr  2 06:53:20 2009
From: kirbyzhou at sohu-rd.com (Kirby Zhou)
Date: Thu, 2 Apr 2009 14:53:20 +0800
Subject: [Linux-cluster] How to resolve "Open Disconnected Pending" state?
Message-ID: <07bc01c9b35f$b5a675a0$20f360e0$@com>

The machine which exported gnbd is power off. The client machine fall into
the state 'Open Disconnected Pending'.
Any process access the dead gnbd fall into state 'D'. 
How can I destroy the gnbd block device on the client machine?

[root at xen-727057 ~]# gnbd_import -n -l
Device name : 63.131.xvdb
----------------------
    Minor # : 0
 sysfs name : /block/gnbd0
     Server : 10.10.63.131
       Port : 14567
      State : Open Disconnected Pending
   Readonly : No
    Sectors : 16777216

[root at xen-727057 ~]# pvs &
[1] 4561
[root at xen-727057 ~]# ps aux | fgrep pvs
root      4561  0.6  0.1  79364  1544 pts/0    D    14:50   0:00 pvs
root      4563  0.0  0.0  61112   608 pts/0    S+   14:50   0:00 fgrep pvs

[root at xen-727057 ~]# gnbd_import -n -R
gnbd_import: ERROR cannot disconnect device #0 : Device or resource busy


From kadlec at mail.kfki.hu  Thu Apr  2 11:07:54 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Thu, 2 Apr 2009 13:07:54 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
Message-ID: <alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>

On Tue, 31 Mar 2009, Kadlecsik Jozsef wrote:

> I'll restore the kernel on a not so critical node and will try to find out 
> how to trigger the bug without mailman. If that succeeds then I'll remove 
> the patch in question and re-run the test. It'll need a few days, surely, 
> but I'll report the results.

I had been unsuccesful to find a reliable way to trigger the freeze 
without mailman. So I created a backup mailman directory by which I can 
test the system. The following has been verified so far:

- Removed commit 17968b0fe87829edff1af7fa9ffbbc92540159fb (Remove 
  splice_read file op for jdata files) and commit
  4787e11dc7831f42228b89ba7726fd6f6901a1e3 (gfs-kmod: workaround for 
  potential deadlock. Prefault user pages), the system freezes.
- Removed commit 5e83cdb08b423478a0b6cc8f6de396ab8328d47a (gfs-kernel: Bug 
  466645 - reproduceable gfs (dlm) hanger with simple stresstest),
  the system freezes.

(Please note, the volumes are mounted with noatime).

If you have any idea what to do next, please write it.

Best regards,   
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From s.wendy.cheng at gmail.com  Thu Apr  2 14:37:51 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Thu, 02 Apr 2009 09:37:51 -0500
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
Message-ID: <49D4CDBF.6040203@gmail.com>

Kadlecsik Jozsef wrote:
>
> If you have any idea what to do next, please write it.
>
>   
Do you have your kernel source somewhere (in tar ball format) so people 
can look into it ?

-- Wendy


From stevan.colaco at gmail.com  Thu Apr  2 15:11:43 2009
From: stevan.colaco at gmail.com (Stevan Colaco)
Date: Thu, 2 Apr 2009 18:11:43 +0300
Subject: [Linux-cluster] Unable to mount GFS File System in RHEL5.2 (32bit)
Message-ID: <56bb44d0904020811u118479fdgbd6f0b004581f095@mail.gmail.com>

Dear All,

I have setup 2 node cluster on RHEL5.2 (32bit) + Quorum Partition +
GFS Partition.
i could make gfs file system but issues while trying to mount it. is
it due to GFS module not loaded?
unable to load GFS module.

below are the details, anyone has faced this issue before, please
suggest........

[root at quod-core1-uat ~]# mount -t gfs /dev/quoduat/rv /rv
/sbin/mount.gfs: error mounting /dev/mapper/quoduat-rv on /rv: No such device
[root at quod-core1-uat ~]#

GFS rpms are installed
[root at quod-core1-uat ~]# rpm -qa | grep -i gfs
kmod-gfs2-1.92-1.1.el5
kmod-gfs-0.1.23-5.el5
gfs-utils-0.1.17-1.el5
gfs2-utils-0.1.44-1.el5
[root at quod-core1-uat ~]#

couldn't find the gfs module loaded
[root at quod-core1-uat ~]# lsmod | grep -i gfs
gfs2                  346344  1 lock_dlm
configfs               28753  2 dlm
[root at quod-core1-uat ~]#

modinfo gfs throws below error:-
[root at quod-core1-uat ~]# modinfo gfs
modinfo: could not find module gfs
[root at quod-core1-uat ~]#

manually locating module, list the module
[root at quod-core1-uat ~]# modinfo /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
filename:       /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
license:        GPL
author:         Red Hat, Inc.
description:    Global File System 0.1.23-5.el5
srcversion:     F36BE93709E650F2BEC45A5
depends:        gfs2
vermagic:       2.6.18-92.el5 SMP mod_unload 686 REGPARM 4KSTACKS gcc-4.1
[root at quod-core1-uat ~]#

unable to install the module gfs
[root at quod-core1-uat ~]# modprobe /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
FATAL: Module /lib/modules/2.6.18_92.el5/extra/gfs/gfs.ko not found.
[root at quod-core1-uat ~]#

[root at quod-core1-uat ~]# clustat
Cluster Status for quod-clust-uat @ Thu Apr  2 18:09:45 2009
Member Status: Quorate

 Member Name                                                     ID   Status
 ------ ----                                                     ---- ------
 quod-core2-uat.kmefic.com.kw                                        1
Online, rgmanager
 quod-core1-uat.kmefic.com.kw                                        2
Online, Local, rgmanager
 /dev/sdc1                                                           0
Online, Quorum Disk

 Service Name                                             Owner (Last)
                                            State
 ------- ----                                             ----- ------
                                            -----
 service:quod-uat-ip
quod-core1-uat.kmefic.com.kw                             started
[root at quod-core1-uat ~]#

Thanks in Advance,
-Stevan Colaco


From mrugeshkarnik at gmail.com  Thu Apr  2 15:33:38 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Thu, 2 Apr 2009 21:03:38 +0530
Subject: [Linux-cluster] Network Interface Binding for cman
Message-ID: <200904022103.38273.mrugeshkarnik@gmail.com>

Hi,

How do I specify which network interfaces to listen on, to cman? I specifically 
need it to listen on two interfaces. The system has four interfaces in total.

I'm on CentOS 5.2.

Thanks,
Mrugesh


From jeff.sturm at eprize.com  Thu Apr  2 16:04:12 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Thu, 2 Apr 2009 12:04:12 -0400
Subject: [Linux-cluster] Network Interface Binding for cman
In-Reply-To: <200904022103.38273.mrugeshkarnik@gmail.com>
References: <200904022103.38273.mrugeshkarnik@gmail.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB6D8@hugo.eprize.local>

It binds to a multicast address.  That address is bound to one interface
normally.

If you need two interfaces, look into ethernet bonding. 

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Mrugesh Karnik
> Sent: Thursday, April 02, 2009 11:34 AM
> To: linux-cluster at redhat.com
> Subject: [Linux-cluster] Network Interface Binding for cman
> 
> Hi,
> 
> How do I specify which network interfaces to listen on, to 
> cman? I specifically need it to listen on two interfaces. The 
> system has four interfaces in total.
> 
> I'm on CentOS 5.2.
> 
> Thanks,
> Mrugesh
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 


From arwin.tugade at csun.edu  Thu Apr  2 16:31:31 2009
From: arwin.tugade at csun.edu (Arwin L Tugade)
Date: Thu, 2 Apr 2009 09:31:31 -0700
Subject: [Linux-cluster] rgmanager stop just hangs,
	clurgmgrd never terminates
Message-ID: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C38F@CSUN-EX-V01.csun.edu>

Hey all,

I ran into an issue where my cluster was quorate but none of the services were showing up via the clustat command.  When I tried to do a /sbin/service rgmanager stop, it hangs indefinitely.  The sigterm is sent but the clurgmgrd processes don't stop.  What I ended up doing was manually kill off clurgmgrd, remove the pid file from /var/run/, restart cman and ultimately had to restart clvmd.  I'm on RHEL5U3 (x86_64), 2 node with a qdisk.  I'm also having this same rgmanager hang on RHEL5U2 (x86_64) 3 node.  Am I doing something wrong here?

Thanks,
Arwin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090402/a53a59fd/attachment.htm>

From fernando at lozano.eti.br  Thu Apr  2 17:38:06 2009
From: fernando at lozano.eti.br (Fernando Lozano)
Date: Thu, 02 Apr 2009 14:38:06 -0300
Subject: [Linux-cluster] rgmanager stop just hangs,
	clurgmgrd never terminates
In-Reply-To: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C38F@CSUN-EX-V01.csun.edu>
References: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C38F@CSUN-EX-V01.csun.edu>
Message-ID: <49D4F7FE.5060906@lozano.eti.br>

Hi Arwin,

I have the same problem on a two-node cluster (two KVM vitual machines)
and on another two-node cluster with real Dell servers. If I flush
iptables rules BEFORE starting cman, everything works fine. But if I
start cman and rgmanager with iptables rules, I see no services and
rgmanager hangs. Flusing iptables rules after starting cman changes
anything. :-(

I have all ports open as stated by RHCS manual, but it wasn't enough. I
still cannot find why rgmanager hangs and which rules my iptables setup
is missing, but I have the same behaviour on another setup with two
VMware virtual machines.

I don't use qdisk, clvmd nor gfs. My clustert setup has clean_start="1"
on fenced. I'm on RHEL5.2, tried both 32 and 64-bits.

Have you tried starting your cluster with no firewall?


[]s, Fernando Lozano

> Hey all,
>
>  
>
> I ran into an issue where my cluster was quorate but none of the
> services were showing up via the clustat command.  When I tried to do
> a /sbin/service rgmanager stop, it hangs indefinitely.  The sigterm is
> sent but the clurgmgrd processes don?t stop.  What I ended up doing
> was manually kill off clurgmgrd, remove the pid file from /var/run/,
> restart cman and ultimately had to restart clvmd.  I?m on RHEL5U3
> (x86_64), 2 node with a qdisk.  I?m also having this same rgmanager
> hang on RHEL5U2 (x86_64) 3 node.  Am I doing something wrong here?
>
>  
>
> Thanks,
>
> Arwin
>
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From arwin.tugade at csun.edu  Thu Apr  2 18:38:33 2009
From: arwin.tugade at csun.edu (Arwin L Tugade)
Date: Thu, 2 Apr 2009 11:38:33 -0700
Subject: [Linux-cluster] rgmanager stop just hangs,	clurgmgrd never
	terminates
In-Reply-To: <49D4F7FE.5060906@lozano.eti.br>
References: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C38F@CSUN-EX-V01.csun.edu>
	<49D4F7FE.5060906@lozano.eti.br>
Message-ID: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C390@CSUN-EX-V01.csun.edu>

Yup, matter of fact, I disabled iptables altogether.  The cluster comes up fine and I have services running once again (this is a test setup btw). Just to let you know I managed to get the cluster in this state when I was doing some failover testing.  I'm just wondering why when I do a /sbin/service rgmanager {stop|restart} it hangs indefinitely.

Btw, a question about that clean_start directive.  I'm reading the fenced man page and will the value of "1" prevent a fencing loop at startup.  I've seen it where I bring up 1 node, and then bring up node 2 and node 2 fences node1 and I see this in the log:

Apr  1 22:47:14 oilfish openais[4643]: [CPG  ] got joinlist message from node 1
Apr  1 22:47:14 oilfish openais[4643]: [CPG  ] got joinlist message from node 2
Apr  1 22:47:15 oilfish openais[4643]: [CMAN ] cman killed by node 2 because we rejoined the cluster without a full restart

Arwin

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Fernando Lozano
Sent: Thursday, April 02, 2009 10:38 AM
To: linux clustering
Subject: Re: [Linux-cluster] rgmanager stop just hangs, clurgmgrd never terminates

Hi Arwin,

I have the same problem on a two-node cluster (two KVM vitual machines)
and on another two-node cluster with real Dell servers. If I flush
iptables rules BEFORE starting cman, everything works fine. But if I
start cman and rgmanager with iptables rules, I see no services and
rgmanager hangs. Flusing iptables rules after starting cman changes
anything. :-(

I have all ports open as stated by RHCS manual, but it wasn't enough. I
still cannot find why rgmanager hangs and which rules my iptables setup
is missing, but I have the same behaviour on another setup with two
VMware virtual machines.

I don't use qdisk, clvmd nor gfs. My clustert setup has clean_start="1"
on fenced. I'm on RHEL5.2, tried both 32 and 64-bits.

Have you tried starting your cluster with no firewall?


[]s, Fernando Lozano

> Hey all,
>
>  
>
> I ran into an issue where my cluster was quorate but none of the
> services were showing up via the clustat command.  When I tried to do
> a /sbin/service rgmanager stop, it hangs indefinitely.  The sigterm is
> sent but the clurgmgrd processes don?t stop.  What I ended up doing
> was manually kill off clurgmgrd, remove the pid file from /var/run/,
> restart cman and ultimately had to restart clvmd.  I?m on RHEL5U3
> (x86_64), 2 node with a qdisk.  I?m also having this same rgmanager
> hang on RHEL5U2 (x86_64) 3 node.  Am I doing something wrong here?
>
>  
>
> Thanks,
>
> Arwin
>
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From sdake at redhat.com  Thu Apr  2 18:59:15 2009
From: sdake at redhat.com (Steven Dake)
Date: Thu, 02 Apr 2009 11:59:15 -0700
Subject: [Linux-cluster] Trouble after Openais upgrade to 0.80.3-22.el5
In-Reply-To: <593E210EDC38444DA1C17E9E9F5E264B98FC9A@GBMDMail01.genband.com>
References: <alpine.DEB.2.00.0903262333350.4377@lxserv1.kfki.hu>
	<20090330180739.GB6135@redhat.com>
	<alpine.DEB.2.00.0903311119100.23085@lxserv0.kfki.hu>
	<593E210EDC38444DA1C17E9E9F5E264B98FC9A@GBMDMail01.genband.com>
Message-ID: <1238698755.4602.17.camel@sdake-laptop>

Likely you ran into the segfault that happens during the upgrade process
from some 5.2 to 5.3 nodes.  You can reboot your cluster with all either
5.2 or alternatively 5.3 nodes or wait until the 5.3.z stream release
becomes available which resolves this problem.

regards
-steve

On Wed, 2009-04-01 at 23:51 -0400, Ed Sanborn wrote:
> I have RHEL 5.2  
> I tried upgrading openais on a few of my nodes. Original version was
> 0.80.3-15.el5  and I upgraded to version  0.80.3-22.el5.
> Now the node will not connect to the cluster.  I get the following error
> in
> /var/log/messages:
> 
> "unable to connect to cluster infrastructure"
> 
> Has anyone else run into this issue?  Is there a way around this besides
> going back to 
> the old version?
> 
> Ed
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From kadlec at mail.kfki.hu  Thu Apr  2 19:29:45 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Thu, 2 Apr 2009 21:29:45 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49D4CDBF.6040203@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<49D4CDBF.6040203@gmail.com>
Message-ID: <alpine.DEB.2.00.0904021906180.31982@lxserv1.kfki.hu>

Hi,

On Thu, 2 Apr 2009, Wendy Cheng wrote:

> > If you have any idea what to do next, please write it.
> >   
> Do you have your kernel source somewhere (in tar ball format) so people can
> look into it ?

I have created the tarballs, you can find them at 
http://www.kfki.hu/~kadlec/gfs/:

- Kernel is vanilla 2.6.27.21, the '.config' file is preserved in the 
  tarball as 'config'.

- On top of that I installed vanilla e1000-8.0.6, e1000e-0.5.8.2 and
  aoe6-69. The same e1000-8.0.6 and e1000e-0.5.8.2 are used with the 
  working cluster-2.01.00 to which the earlier aoe6-59 was added.

- The cluster-2.03.11 is also the vanilla version, except that since
  this thread started I have added two small corrections:
  - fence/fenced/agent.c fixed, see
    https://www.redhat.com/archives/linux-cluster/2009-March/msg00222.html
  - gfs2/mount/umount.gfs2.c, '-l' flag support added, see
    https://www.redhat.com/archives/cluster-devel/2009-April/msg00000.html

  The configure options are in 'configure-options', the locally used init
  scripts can be found under deb/DEBIAN/etc ;-)

The GFS volumes are mounted with noatime, quota is enabled (can't leave 
that off). The volumes are tuned with the values:

statfs_slots 128
statfs_fast 1
demote_secs 30
glock_purge 50
scand_secs 3

Those are mostly remnants of the time when Maildir was in use instead 
of plain mailbox format and we tried to cure the terrible performance.

Probably it's worth to note that 'statfs_fast 1' takes a lot of time to 
complete (usually around 15-20 seconds) which is, at least for me, 
surprising.

I think that's all. 

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From kadlec at mail.kfki.hu  Thu Apr  2 21:09:45 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Thu, 2 Apr 2009 23:09:45 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
Message-ID: <alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>

On Thu, 2 Apr 2009, Kadlecsik Jozsef wrote:

> If you have any idea what to do next, please write it.

Spent again some time looking through the git commits and that 
triggered some wild guessing:

- commit ddebb0c3dc7d0b87c402ba17731ad41abdd43f2d ?
  It is a temporary fix for 2.6.26, which is additionally based on
  a kludge and I'm trying 2.6.27/28. Might be not appropriate anymore 
  for these kernels?

- commit d9c3e59e90437567d063144bcfdbbc9fe6e8d615 ?
  (and other noatime related commits)
  Noatime handling and I do use noatime.
  Hm, I could try to start mailman without noatime and we'll see
  what happens.

- commit ff7d89bfe60ed041d9342c8c9d91815c1f3d3bef ?
  gfs1-specific lock module, a huge patch.
  I could restore the gfs2_*lock* functions and check whether it helps.

- commit 82d176ba485f2ef049fd303b9e41868667cebbdb
  gfs_drop_inode as .drop_inode replacing .put_inode.
  .put_inode was called without holding a lock, but .drop_inode
  is called under inode_lock held. Might it be a problem?

What do you think?

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From s.wendy.cheng at gmail.com  Thu Apr  2 21:37:19 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Thu, 02 Apr 2009 16:37:19 -0500
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
Message-ID: <49D5300F.9010805@gmail.com>

Kadlecsik Jozsef wrote:
> - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
>   gfs_drop_inode as .drop_inode replacing .put_inode.
>   .put_inode was called without holding a lock, but .drop_inode
>   is called under inode_lock held. Might it be a problem?
>
>   
I was planning to take a look over the weekend .. but this one looks 
very promising. Give it a try and let us know !

-- Wendy


From kadlec at mail.kfki.hu  Thu Apr  2 21:45:32 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Thu, 2 Apr 2009 23:45:32 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49D5300F.9010805@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
	<49D5300F.9010805@gmail.com>
Message-ID: <alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>

On Thu, 2 Apr 2009, Wendy Cheng wrote:

> Kadlecsik Jozsef wrote:
> > - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
> >   gfs_drop_inode as .drop_inode replacing .put_inode.
> >   .put_inode was called without holding a lock, but .drop_inode
> >   is called under inode_lock held. Might it be a problem?
> >   
> I was planning to take a look over the weekend .. but this one looks very
> promising. Give it a try and let us know !

But - how? .put_inode was eliminated, cannot be used anymore in recent 
kernels. And I have no idea what should be changed in gfs_drop_inode.

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From s.wendy.cheng at gmail.com  Thu Apr  2 22:07:41 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Thu, 02 Apr 2009 17:07:41 -0500
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
Message-ID: <49D5372D.9090709@gmail.com>

Kadlecsik Jozsef wrote:
> On Thu, 2 Apr 2009, Wendy Cheng wrote:
>
>   
>> Kadlecsik Jozsef wrote:
>>     
>>> - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
>>>   gfs_drop_inode as .drop_inode replacing .put_inode.
>>>   .put_inode was called without holding a lock, but .drop_inode
>>>   is called under inode_lock held. Might it be a problem?
>>>   
>>>       
>> I was planning to take a look over the weekend .. but this one looks very
>> promising. Give it a try and let us know !
>>     
>
> But - how? .put_inode was eliminated, cannot be used anymore in recent 
> kernels. And I have no idea what should be changed in gfs_drop_inode.
>
>   
I see :) ... let me move your tar ball over. Know about cluster IRC 
(check cluster wiki for instruction if you don't know how) ? Go there - 
maybe some IRC folks will be able to work this with you.

-- Wendy


From kbphillips80 at gmail.com  Thu Apr  2 22:23:51 2009
From: kbphillips80 at gmail.com (Kaerka Phillips)
Date: Thu, 2 Apr 2009 18:23:51 -0400
Subject: [Linux-cluster] RHEL5.3 Cluster - backup fencing methods
Message-ID: <e558921b0904021523u76d4607an99c22a7700721c2a@mail.gmail.com>

Hi - I've got an issue with a 4-node cluster, and I'm hoping to get some
good advice or best-practices for this.  The 4-node cluster is on dell
hardware, using DRAC cards as the primary fencing device, but I'd like to
eliminate the single-point of failure introduced with the cabling for this
method.

I attempted to use Fence_ipmilan, but once i got the fence_drac5 working,
this no longer works for unknown reasons, but even if it did work, the DRACs
are on a private VLAN, as is the cluster and cluster multicast address.  I'm
concerned that a failure of the switch which hosts that vlan and drac
ethernet connections would cause an outright cluster failure.  The point of
this cluster is to share GFS2 filesystems amongst the 4-nodes.

My network setup is this:
2x GigE cables plugged into PCI-E cards (two physical cards), bonded
ethernet config, public network
1x GigE cable plugged into port #1 on dell server, in "shared" mode in DRAC5
card, on private network:
     DRAC5 card maps to this with an assigned ip address, OS also maps to
this with a different assigned ip address.

All cluster communication between nodes passes over private network over
GigE shared port.  I've not been able to determine the correct solution to
eliminate this last single-point of failure, aside from adding an additional
network connection to another on-board ethernet card, and mapping this to
the private network, to a 2nd switch.  I have no guarantee that this will
work, nor much documentation to indicate what setup would be required for
this.

Any thoughts?

Thanks,

Kaerka
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090402/d8e87000/attachment.htm>

From erickson.jon at gmail.com  Fri Apr  3 00:53:34 2009
From: erickson.jon at gmail.com (Jon Erickson)
Date: Thu, 2 Apr 2009 20:53:34 -0400
Subject: [Linux-cluster] Unable to mount GFS File System in RHEL5.2 
	(32bit)
In-Reply-To: <56bb44d0904020811u118479fdgbd6f0b004581f095@mail.gmail.com>
References: <56bb44d0904020811u118479fdgbd6f0b004581f095@mail.gmail.com>
Message-ID: <6a90e4da0904021753o16a9d996rcf8e3cb30035b796@mail.gmail.com>

I'm having the same problem...

When running the mount command with the '-v' option it says something
about errno 19?  I don't remember exactly, I can post more info
tomorrow.


On Thu, Apr 2, 2009 at 11:11 AM, Stevan Colaco <stevan.colaco at gmail.com> wrote:
> Dear All,
>
> I have setup 2 node cluster on RHEL5.2 (32bit) + Quorum Partition +
> GFS Partition.
> i could make gfs file system but issues while trying to mount it. is
> it due to GFS module not loaded?
> unable to load GFS module.
>
> below are the details, anyone has faced this issue before, please
> suggest........
>
> [root at quod-core1-uat ~]# mount -t gfs /dev/quoduat/rv /rv
> /sbin/mount.gfs: error mounting /dev/mapper/quoduat-rv on /rv: No such device
> [root at quod-core1-uat ~]#
>
> GFS rpms are installed
> [root at quod-core1-uat ~]# rpm -qa | grep -i gfs
> kmod-gfs2-1.92-1.1.el5
> kmod-gfs-0.1.23-5.el5
> gfs-utils-0.1.17-1.el5
> gfs2-utils-0.1.44-1.el5
> [root at quod-core1-uat ~]#
>
> couldn't find the gfs module loaded
> [root at quod-core1-uat ~]# lsmod | grep -i gfs
> gfs2 ? ? ? ? ? ? ? ? ?346344 ?1 lock_dlm
> configfs ? ? ? ? ? ? ? 28753 ?2 dlm
> [root at quod-core1-uat ~]#
>
> modinfo gfs throws below error:-
> [root at quod-core1-uat ~]# modinfo gfs
> modinfo: could not find module gfs
> [root at quod-core1-uat ~]#
>
> manually locating module, list the module
> [root at quod-core1-uat ~]# modinfo /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
> filename: ? ? ? /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
> license: ? ? ? ?GPL
> author: ? ? ? ? Red Hat, Inc.
> description: ? ?Global File System 0.1.23-5.el5
> srcversion: ? ? F36BE93709E650F2BEC45A5
> depends: ? ? ? ?gfs2
> vermagic: ? ? ? 2.6.18-92.el5 SMP mod_unload 686 REGPARM 4KSTACKS gcc-4.1
> [root at quod-core1-uat ~]#
>
> unable to install the module gfs
> [root at quod-core1-uat ~]# modprobe /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
> FATAL: Module /lib/modules/2.6.18_92.el5/extra/gfs/gfs.ko not found.
> [root at quod-core1-uat ~]#
>
> [root at quod-core1-uat ~]# clustat
> Cluster Status for quod-clust-uat @ Thu Apr ?2 18:09:45 2009
> Member Status: Quorate
>
> ?Member Name ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ID ? Status
> ?------ ---- ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ---- ------
> ?quod-core2-uat.kmefic.com.kw ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?1
> Online, rgmanager
> ?quod-core1-uat.kmefic.com.kw ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?2
> Online, Local, rgmanager
> ?/dev/sdc1 ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? 0
> Online, Quorum Disk
>
> ?Service Name ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Owner (Last)
> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?State
> ?------- ---- ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ----- ------
> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?-----
> ?service:quod-uat-ip
> quod-core1-uat.kmefic.com.kw ? ? ? ? ? ? ? ? ? ? ? ? ? ? started
> [root at quod-core1-uat ~]#
>
> Thanks in Advance,
> -Stevan Colaco
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Jon


From fernando at lozano.eti.br  Fri Apr  3 01:07:10 2009
From: fernando at lozano.eti.br (Fernando Lozano)
Date: Thu, 02 Apr 2009 22:07:10 -0300
Subject: [Linux-cluster] rgmanager stop just hangs,
	clurgmgrd never	terminates
In-Reply-To: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C390@CSUN-EX-V01.csun.edu>
References: <6708F96BBF31F846BFA56EC0AE37D62281A6E3C38F@CSUN-EX-V01.csun.edu>	<49D4F7FE.5060906@lozano.eti.br>
	<6708F96BBF31F846BFA56EC0AE37D62281A6E3C390@CSUN-EX-V01.csun.edu>
Message-ID: <49D5613E.1050403@lozano.eti.br>

Arwin,

Doesn't you log shows one node trying to fence the other? Clean_start
prevents that at cluster startup, but on failover the survivor wants to
fence the other. You may need to use fence_ack to let one node belive
the other was fenced if you do not have a real fence device, for example
Dell DRAC or a Network APS.


[]s, Fernando Lozano

> Yup, matter of fact, I disabled iptables altogether.  The cluster comes up fine and I have services running once again (this is a test setup btw). Just to let you know I managed to get the cluster in this state when I was doing some failover testing.  I'm just wondering why when I do a /sbin/service rgmanager {stop|restart} it hangs indefinitely.
>
> Btw, a question about that clean_start directive.  I'm reading the fenced man page and will the value of "1" prevent a fencing loop at startup.  I've seen it where I bring up 1 node, and then bring up node 2 and node 2 fences node1 and I see this in the log:
>   
[]s, Fernando Lozano


From kbphillips80 at gmail.com  Fri Apr  3 01:44:49 2009
From: kbphillips80 at gmail.com (Kaerka Phillips)
Date: Thu, 2 Apr 2009 21:44:49 -0400
Subject: [Linux-cluster] Unable to mount GFS File System in RHEL5.2 
	(32bit)
In-Reply-To: <6a90e4da0904021753o16a9d996rcf8e3cb30035b796@mail.gmail.com>
References: <56bb44d0904020811u118479fdgbd6f0b004581f095@mail.gmail.com>
	<6a90e4da0904021753o16a9d996rcf8e3cb30035b796@mail.gmail.com>
Message-ID: <e558921b0904021844y5783d09cuf56bffa945686b28@mail.gmail.com>

It looks like there is a mix between the gfs and gfs2 filesystem and modules
on your system -- your loaded module is GFS2, so perhaps try mounting with
"-t gfs2", except that you will need to have made the filesystem with GFS2
as well.  All of my mounted GFS2 filesystems show "gfs2" as the FS type
currently mounted.

You may want to remove all of the GFS items, and leave the GFS2 components
(since that is the supported FS)

On a RHEL5.3 system with only GFS2:
# lsmod |grep gfs
gfs2                  524204  12 lock_dlm

On a RHEL5.3 system with GFS2 only (64bit system):
# modinfo gfs2
filename:       /lib/modules/2.6.18-128.1.1.el5/weak-updates/gfs2/gfs2.ko
license:        GPL
author:         Red Hat, Inc.
description:    Global File System
srcversion:     3E318153BB4A45EAE38B903
depends:
vermagic:       2.6.18-92.el5 SMP mod_unload gcc-4.1
parm:           scand_secs:The number of seconds between scand runs (uint)

On a RHEL5.2 system with GFS2:
# modinfo gfs2
filename:       /lib/modules/2.6.18-92.1.13.el5PAE/kernel/fs/gfs2/gfs2.ko
license:        GPL
author:         Red Hat, Inc.
description:    Global File System
srcversion:     B09BC266DD032D7FCEA51E5
depends:
vermagic:       2.6.18-92.1.13.el5PAE SMP mod_unload 686 REGPARM 4KSTACKS
gcc-4.1
parm:           scand_secs:The number of seconds between scand runs (uint)
module_sig:
883e35048bf9999c45df68fce924fd711286f509771f47a50c8a08af053af2d178c427da1e6788409f6ae5853585f2f14ddf7f78d9fb259eac8236bd9


On Thu, Apr 2, 2009 at 8:53 PM, Jon Erickson <erickson.jon at gmail.com> wrote:

> I'm having the same problem...
>
> When running the mount command with the '-v' option it says something
> about errno 19?  I don't remember exactly, I can post more info
> tomorrow.
>
>
> On Thu, Apr 2, 2009 at 11:11 AM, Stevan Colaco <stevan.colaco at gmail.com>
> wrote:
> > Dear All,
> >
> > I have setup 2 node cluster on RHEL5.2 (32bit) + Quorum Partition +
> > GFS Partition.
> > i could make gfs file system but issues while trying to mount it. is
> > it due to GFS module not loaded?
> > unable to load GFS module.
> >
> > below are the details, anyone has faced this issue before, please
> > suggest........
> >
> > [root at quod-core1-uat ~]# mount -t gfs /dev/quoduat/rv /rv
> > /sbin/mount.gfs: error mounting /dev/mapper/quoduat-rv on /rv: No such
> device
> > [root at quod-core1-uat ~]#
> >
> > GFS rpms are installed
> > [root at quod-core1-uat ~]# rpm -qa | grep -i gfs
> > kmod-gfs2-1.92-1.1.el5
> > kmod-gfs-0.1.23-5.el5
> > gfs-utils-0.1.17-1.el5
> > gfs2-utils-0.1.44-1.el5
> > [root at quod-core1-uat ~]#
> >
> > couldn't find the gfs module loaded
> > [root at quod-core1-uat ~]# lsmod | grep -i gfs
> > gfs2                  346344  1 lock_dlm
> > configfs               28753  2 dlm
> > [root at quod-core1-uat ~]#
> >
> > modinfo gfs throws below error:-
> > [root at quod-core1-uat ~]# modinfo gfs
> > modinfo: could not find module gfs
> > [root at quod-core1-uat ~]#
> >
> > manually locating module, list the module
> > [root at quod-core1-uat ~]# modinfo
> /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
> > filename:       /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
> > license:        GPL
> > author:         Red Hat, Inc.
> > description:    Global File System 0.1.23-5.el5
> > srcversion:     F36BE93709E650F2BEC45A5
> > depends:        gfs2
> > vermagic:       2.6.18-92.el5 SMP mod_unload 686 REGPARM 4KSTACKS gcc-4.1
> > [root at quod-core1-uat ~]#
> >
> > unable to install the module gfs
> > [root at quod-core1-uat ~]# modprobe
> /lib/modules/2.6.18-92.el5/extra/gfs/gfs.ko
> > FATAL: Module /lib/modules/2.6.18_92.el5/extra/gfs/gfs.ko not found.
> > [root at quod-core1-uat ~]#
> >
> > [root at quod-core1-uat ~]# clustat
> > Cluster Status for quod-clust-uat @ Thu Apr  2 18:09:45 2009
> > Member Status: Quorate
> >
> >  Member Name                                                     ID
> Status
> >  ------ ----                                                     ----
> ------
> >  quod-core2-uat.kmefic.com.kw                                        1
> > Online, rgmanager
> >  quod-core1-uat.kmefic.com.kw                                        2
> > Online, Local, rgmanager
> >  /dev/sdc1                                                           0
> > Online, Quorum Disk
> >
> >  Service Name                                             Owner (Last)
> >                                            State
> >  ------- ----                                             ----- ------
> >                                            -----
> >  service:quod-uat-ip
> > quod-core1-uat.kmefic.com.kw                             started
> > [root at quod-core1-uat ~]#
> >
> > Thanks in Advance,
> > -Stevan Colaco
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
>
> --
> Jon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090402/d8856f42/attachment.htm>

From mrugeshkarnik at gmail.com  Fri Apr  3 03:09:07 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Fri, 3 Apr 2009 08:39:07 +0530
Subject: [Linux-cluster] Network Interface Binding for cman
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB6D8@hugo.eprize.local>
References: <200904022103.38273.mrugeshkarnik@gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB6D8@hugo.eprize.local>
Message-ID: <200904030839.08056.mrugeshkarnik@gmail.com>

On Thursday 02 Apr 2009 21:34:12 Jeff Sturm wrote:
> It binds to a multicast address.  That address is bound to one interface
> normally.

Well, how do I specify which interface to bind that multicast address to? I 
see the `bindnetaddr' directive in openais.conf man page. The cman man page 
tells me that the parameter from the <clusternodes> section will overwrite. 
Now, I haven't been able to find any reference as to the syntax of the 
clusternodes directive.

Also, according to the openais.conf man page, the bindnetaddr directive is a 
subdirective of the interface directive, which in itself a subdirective of the 
totem directive. So I'm wondering if it goes something like what follows:

<cluster>
  <totem>
    <interface>
      <bindnetaddr/>
    </interface>
  </totem>
</cluster>

> If you need two interfaces, look into ethernet bonding.

Can't, in this setup.

Though, is it not at all possible to use multiple multicast addresses to bind 
to on different interfaces? Heartbeat, for instance, allows it.

Thanks,
Mrugesh


From mrugeshkarnik at gmail.com  Fri Apr  3 03:32:17 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Fri, 3 Apr 2009 09:02:17 +0530
Subject: [Linux-cluster] Network Interface Binding for cman
In-Reply-To: <200904030839.08056.mrugeshkarnik@gmail.com>
References: <200904022103.38273.mrugeshkarnik@gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB6D8@hugo.eprize.local>
	<200904030839.08056.mrugeshkarnik@gmail.com>
Message-ID: <200904030902.17952.mrugeshkarnik@gmail.com>

On Friday 03 Apr 2009 08:39:07 Mrugesh Karnik wrote:
> On Thursday 02 Apr 2009 21:34:12 Jeff Sturm wrote:
> > It binds to a multicast address.  That address is bound to one interface
> > normally.
>
> Well, how do I specify which interface to bind that multicast address to? I
> see the `bindnetaddr' directive in openais.conf man page. The cman man page
> tells me that the parameter from the <clusternodes> section will overwrite.
> Now, I haven't been able to find any reference as to the syntax of the
> clusternodes directive.
>
> Also, according to the openais.conf man page, the bindnetaddr directive is
> a subdirective of the interface directive, which in itself a subdirective
> of the totem directive. So I'm wondering if it goes something like what
> follows:
>
> <cluster>
>   <totem>
>     <interface>
>       <bindnetaddr/>
>     </interface>
>   </totem>
> </cluster>

I guess this is what I was looking for:
http://sources.redhat.com/cluster/doc/cluster_schema_rhel5.html

I'm reading through the wiki now.

Mrugesh


From s.wendy.cheng at gmail.com  Fri Apr  3 03:59:52 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Thu, 02 Apr 2009 22:59:52 -0500
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
Message-ID: <49D589B8.1050702@gmail.com>


>> Kadlecsik Jozsef wrote:
>>     
>>> - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
>>>   gfs_drop_inode as .drop_inode replacing .put_inode.
>>>   .put_inode was called without holding a lock, but .drop_inode
>>>   is called under inode_lock held. Might it be a problem
>>>       
Based on code reading ...
1. iput() gets inode_lock (a spin lock)
2. iput() calls iput_final()
3. iput_final() calls filesystem drop_inode(), followed by 
generic_drop_inode()
4. generic_drop_inode() unlock inode_lock after doing all sorts of fun 
things with the inode

So look to me that generic_drop_inode() statement within 
gfs_drop_inode() should be removed. Otherwise you would get double 
unlock and double list free.

In short, *remove* line #73 from gfs-kernel/src/gfs/ops_super.c in your 
source and let us know how it goes.

-- Wendy


From kadlec at mail.kfki.hu  Fri Apr  3 06:38:12 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Fri, 3 Apr 2009 08:38:12 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49D589B8.1050702@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
	<49D589B8.1050702@gmail.com>
Message-ID: <alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>

On Thu, 2 Apr 2009, Wendy Cheng wrote:

> > > Kadlecsik Jozsef wrote:
> > >     
> > > > - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
> > > >   gfs_drop_inode as .drop_inode replacing .put_inode.
> > > >   .put_inode was called without holding a lock, but .drop_inode
> > > >   is called under inode_lock held. Might it be a problem
> > > >       
> Based on code reading ...
> 1. iput() gets inode_lock (a spin lock)
> 2. iput() calls iput_final()
> 3. iput_final() calls filesystem drop_inode(), followed by
> generic_drop_inode()
> 4. generic_drop_inode() unlock inode_lock after doing all sorts of fun things
> with the inode
> 
> So look to me that generic_drop_inode() statement within 
> gfs_drop_inode() should be removed. Otherwise you would get double 
> unlock and double list free.

I think those function calls are right: iput_final calls either the 
filesystem drop_inode function (in this case gfs_drop_inode) or 
generic_drop_inode. There's no double call of generic_drop_inode. However 
gfs_sync_page_i (and in turn filemap_fdatawrite and filemap_fdatawait) is 
now called under inode_lock held and that was not so in previous versions.
But I'm just speculating.
 
> In short, *remove* line #73 from gfs-kernel/src/gfs/ops_super.c in your 
> source and let us know how it goes.

I won't get a chance to start a test before Monday, sorry. 

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From reggaestar at gmail.com  Fri Apr  3 09:16:41 2009
From: reggaestar at gmail.com (remi doubi)
Date: Fri, 3 Apr 2009 09:16:41 +0000
Subject: [Linux-cluster] Virtualization on top of Centos cluster
Message-ID: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>

Hi everyone,
i apologize for my bad english.
i'm a familiar with Linux environement  ( fedora 10 user ) and i got a
project in a training where i have to create a cluster with two nodes where
i have to set up a number of VMs that will run applications such as ( Samba,
Ldap, Zimbra, ...)
but i don't know how to virtualize on top of a cluster !!
i would like to know how that can be done, and how is it possible to let the
VMs get ressources ( RAM & CPU ) from the two nodes ??
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090403/351668fe/attachment.htm>

From binder.christian at gmx.de  Fri Apr  3 09:42:38 2009
From: binder.christian at gmx.de (Christian Binder)
Date: Fri, 03 Apr 2009 11:42:38 +0200
Subject: [Linux-cluster] resetting a fence - Device (ILOM) during a running
	Cluster (RHEL 5.2)
Message-ID: <20090403094238.320830@gmx.net>

Hello,

we are comfortable with our 2-Node RHEL-5.2 - Cluster (some Oracle-DBs),
we have subscribed for several months and which is running stable.

We use two SunXFires 4200 M2 server and do the fencing with the integrated ILOMs.
Unfortunatly, -  because of errors on the ILOM the ILOM of the one node has to be reset, which is the advice of our hardware-vendor.
The reset of the ILOM can be done in production (tested on a single machine)
transparent for the OS on this machine (means: without the need of rebooting the OS of the machine.)
The only thing, I noticed during the test on the single - Server is,
that the ILOM is down (network not available) for about 30 sec.

Does this short downtime of the fencedevice affect the Redhat Clustersoftware
or can we do that action without problems in procution -time ?

Thank you for your answer.

Christian
-- 
Neu: GMX FreeDSL Komplettanschluss mit DSL 6.000 Flatrate + Telefonanschluss f?r nur 17,95 Euro/mtl.!* http://dsl.gmx.de/?ac=OM.AD.PD003K11308T4569a


From kirbyzhou at sohu-rd.com  Fri Apr  3 10:00:56 2009
From: kirbyzhou at sohu-rd.com (Kirby Zhou)
Date: Fri, 3 Apr 2009 18:00:56 +0800
Subject: [Linux-cluster] How to recover from gnbd "Open Disconnected
	Pending" state?
In-Reply-To: <07bc01c9b35f$b5a675a0$20f360e0$@com>
References: <07bc01c9b35f$b5a675a0$20f360e0$@com>
Message-ID: <0c3701c9b443$14cede80$3e6c9b80$@com>

How to recover from gnbd "Open Disconnected Pending" state?

When the machine which exported gnbd is broken or shutdown, the client
machine would fall into the state 'Open Disconnected Pending'.
Any process accessing the dead gnbd will fall into state 'D'. 
How can I remove the dead gnbd block device on the client machine?

#On the client machine
[root at xen-727057 ~]# gnbd_import -n -l
Device name : 63.131.xvdb
----------------------
    Minor # : 0
 sysfs name : /block/gnbd0
     Server : 10.10.63.131
       Port : 14567
      State : Open Disconnected Pending
   Readonly : No
    Sectors : 16777216

[root at xen-727057 ~]# pvs &
[1] 4561
[root at xen-727057 ~]# ps aux | fgrep pvs
root      4561  0.6  0.1  79364  1544 pts/0    D    14:50   0:00 pvs
root      4563  0.0  0.0  61112   608 pts/0    S+   14:50   0:00 fgrep pvs

[root at xen-727057 ~]# gnbd_import -n -R
gnbd_import: ERROR cannot disconnect device #0 : Device or resource busy


From jdong at redhat.com  Fri Apr  3 10:18:38 2009
From: jdong at redhat.com (jdong)
Date: Fri, 03 Apr 2009 18:18:38 +0800
Subject: [Linux-cluster] Virtualization on top of Centos cluster
In-Reply-To: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
References: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
Message-ID: <49D5E27E.1040803@redhat.com>

Hey remi,
    When you create VMs, you can assign hardware sources to VMs.
    Did you send the mail to ask how to create vm?
    If you use fedora 10,you can use qemu tool to add kvm.There is a GUI 
named virt-manager.It has bugs about creating kvm before.You also can 
use qemu-img to create image file and use qemu-kvm to assign resource.
    You can get details from man page.After setting up VMs,you can login 
them to install applications.


remi doubi wrote:
> Hi everyone,
> i apologize for my bad english.
> i'm a familiar with Linux environement  ( fedora 10 user ) and i got a 
> project in a training where i have to create a cluster with two nodes 
> where i have to set up a number of VMs that will run applications such 
> as ( Samba, Ldap, Zimbra, ...)
> but i don't know how to virtualize on top of a cluster !!
> i would like to know how that can be done, and how is it possible to 
> let the VMs get ressources ( RAM & CPU ) from the two nodes ??
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From reggaestar at gmail.com  Fri Apr  3 10:27:46 2009
From: reggaestar at gmail.com (remi doubi)
Date: Fri, 3 Apr 2009 10:27:46 +0000
Subject: Fwd: [Linux-cluster] Virtualization on top of Centos cluster
In-Reply-To: <49D5E27E.1040803@redhat.com>
References: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
	<49D5E27E.1040803@redhat.com>
Message-ID: <3c88c73a0904030327g15876027g231105c6b860331e@mail.gmail.com>

 When you create VMs, you can assign hardware sources to VMs.

but how can i do that, do i have to assign hadware sources manually ( which
when i want to create a VM, i have to set for example the memory from the
first server and the CPU from the second or the opposite) or do i have just
to specify that the VM would have an amout of memory and CPU and the cluster
will choose from which servers the sources will be taken ???


i will probably choose Xen instead of Qemu and the servers OS will be Centos
because they told me that i have ti use GFS for shared storage.
what do you think about it ?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090403/fb3c45e2/attachment.htm>

From neuroticimbecile at yahoo.com  Fri Apr  3 10:51:40 2009
From: neuroticimbecile at yahoo.com (eric rosel)
Date: Fri, 3 Apr 2009 03:51:40 -0700 (PDT)
Subject: [Linux-cluster] Virtualization on top of Centos cluster
In-Reply-To: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
Message-ID: <310778.21764.qm@web53209.mail.re2.yahoo.com>


Hi Remi,

I was able to configure such a beast last year.  I used OpenVZ for virtualization, and a separate iSCSI SAN storage server (which was also a RHCS cluster) for the /vz directory where most of the OpenVZ stuff are stored.   Configuring it was not very complicated, I basically just had to add the resources (the openvz startup script, the iscsi device, etc.) through luci.  I used CentOS 5.2 for that project, but it didn't involve getting RAM and CPU resources from the standby node.

HTH,
-eric


--- On Fri, 4/3/09, remi doubi <reggaestar at gmail.com> wrote:

> From: remi doubi <reggaestar at gmail.com>
> Subject: [Linux-cluster] Virtualization on top of Centos cluster
> To: linux-cluster at redhat.com
> Date: Friday, April 3, 2009, 5:16 PM
> Hi everyone,
> i apologize for my bad english.
> i'm a familiar with Linux environement  ( fedora 10
> user ) and i got a
> project in a training where i have to create a cluster with
> two nodes where
> i have to set up a number of VMs that will run applications
> such as ( Samba,
> Ldap, Zimbra, ...)
> but i don't know how to virtualize on top of a cluster
> !!
> i would like to know how that can be done, and how is it
> possible to let the
> VMs get ressources ( RAM & CPU ) from the two nodes ??
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From reggaestar at gmail.com  Fri Apr  3 11:03:48 2009
From: reggaestar at gmail.com (remi doubi)
Date: Fri, 3 Apr 2009 11:03:48 +0000
Subject: [Linux-cluster] Virtualization on top of Centos cluster
In-Reply-To: <310778.21764.qm@web53209.mail.re2.yahoo.com>
References: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
	<310778.21764.qm@web53209.mail.re2.yahoo.com>
Message-ID: <3c88c73a0904030403q486b836ej34f5265568f25261@mail.gmail.com>

Thakns eric & jdong

that's what i tought, there will be a lot of bugs due to sources
synchronisation for sure.

But i read yesterday in a topic that a guy did that with Xen,
he assigned the cluster as Dom0, and then all VMs as DomU.

and probably all the hadware sources "Cluster" will be shared. is this can
work ??

Remi
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090403/a5558027/attachment.htm>

From hlawatschek at atix.de  Fri Apr  3 11:11:00 2009
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Fri, 3 Apr 2009 13:11:00 +0200
Subject: [Linux-cluster] resetting a fence - Device (ILOM) during a
	running Cluster (RHEL 5.2)
In-Reply-To: <20090403094238.320830@gmx.net>
References: <20090403094238.320830@gmx.net>
Message-ID: <200904031311.00393.hlawatschek@atix.de>

> we are comfortable with our 2-Node RHEL-5.2 - Cluster (some Oracle-DBs),
> we have subscribed for several months and which is running stable.
>
> We use two SunXFires 4200 M2 server and do the fencing with the integrated
> ILOMs. Unfortunatly, -  because of errors on the ILOM the ILOM of the one
> node has to be reset, which is the advice of our hardware-vendor. The reset
> of the ILOM can be done in production (tested on a single machine)
> transparent for the OS on this machine (means: without the need of
> rebooting the OS of the machine.) The only thing, I noticed during the test
> on the single - Server is, that the ILOM is down (network not available)
> for about 30 sec.
>
> Does this short downtime of the fencedevice affect the Redhat
> Clustersoftware or can we do that action without problems in procution
> -time ?
If there's no need for fencing the node during the 30secs period, it can be 
done during production time.

-Mark


From reggaestar at gmail.com  Fri Apr  3 11:37:06 2009
From: reggaestar at gmail.com (remi doubi)
Date: Fri, 3 Apr 2009 11:37:06 +0000
Subject: [Linux-cluster] linux clustering <linux-cluster@redhat.com>
Message-ID: <3c88c73a0904030437o20924ff1n48c63926520a633@mail.gmail.com>

here's the article :

http://www.mail-archive.com/linux-cluster at redhat.com/msg05169.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090403/5f5b16d9/attachment.htm>

From binder.christian at gmx.de  Fri Apr  3 12:59:06 2009
From: binder.christian at gmx.de (Christian Binder)
Date: Fri, 03 Apr 2009 14:59:06 +0200
Subject: [Linux-cluster] resetting a fence - Device (ILOM) during a	running
	Cluster (RHEL 5.2)
In-Reply-To: <200904031311.00393.hlawatschek@atix.de>
References: <20090403094238.320830@gmx.net>
	<200904031311.00393.hlawatschek@atix.de>
Message-ID: <20090403125906.262190@gmx.net>

Thank you Mark,

the reset was sucessful with no affect to the OS
(PS: I gladly remember me on your solution-day in Neuss at February)

Christian

-------- Original-Nachricht --------
> Datum: Fri, 3 Apr 2009 13:11:00 +0200
> Von: Mark Hlawatschek <hlawatschek at atix.de>
> An: linux clustering <linux-cluster at redhat.com>
> Betreff: Re: [Linux-cluster] resetting a fence - Device (ILOM) during a	running Cluster (RHEL 5.2)

> > we are comfortable with our 2-Node RHEL-5.2 - Cluster (some Oracle-DBs),
> > we have subscribed for several months and which is running stable.
> >
> > We use two SunXFires 4200 M2 server and do the fencing with the
> integrated
> > ILOMs. Unfortunatly, -  because of errors on the ILOM the ILOM of the
> one
> > node has to be reset, which is the advice of our hardware-vendor. The
> reset
> > of the ILOM can be done in production (tested on a single machine)
> > transparent for the OS on this machine (means: without the need of
> > rebooting the OS of the machine.) The only thing, I noticed during the
> test
> > on the single - Server is, that the ILOM is down (network not available)
> > for about 30 sec.
> >
> > Does this short downtime of the fencedevice affect the Redhat
> > Clustersoftware or can we do that action without problems in procution
> > -time ?
> If there's no need for fencing the node during the 30secs period, it can
> be 
> done during production time.
> 
> -Mark
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Neu: GMX FreeDSL Komplettanschluss mit DSL 6.000 Flatrate + Telefonanschluss f?r nur 17,95 Euro/mtl.!* http://dsl.gmx.de/?ac=OM.AD.PD003K11308T4569a


From reggaestar at gmail.com  Fri Apr  3 13:43:14 2009
From: reggaestar at gmail.com (remi doubi)
Date: Fri, 3 Apr 2009 13:43:14 +0000
Subject: Fwd: [Linux-cluster] Virtualization on top of Centos cluster
Message-ID: <3c88c73a0904030643r3cf07114wa66a922eb6b02d3d@mail.gmail.com>

here's the article :

http://www.mail-archive.com/linux-cluster at redhat.com/msg05169.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090403/c00863a0/attachment.htm>

From burton at simondsfamily.com  Fri Apr  3 13:44:35 2009
From: burton at simondsfamily.com (Burton Simonds)
Date: Fri, 3 Apr 2009 09:44:35 -0400
Subject: [Linux-cluster] Service behavior when migration fails.
Message-ID: <77f48c0d0904030644n3b87e83drfd0c75b7b2a645c3@mail.gmail.com>

I have a 2 node cluster, and I would like for the following behavior to happen:

Node 1 is running apache

Node 2 is in standby, but has a bad apache config (I know it should be
tested before going into production, but lets pretend I am a moron)

Node 1's apache is killed.

Tries to migrate to node 2, but fails

Tries to migrage back to node 1 and succeeds.


What is happening is when it tries to go back to node one, it says
failed, and states the ip address is in use.

I have the service set up as follows:

        <service autostart="1" domain="dom1" exclusive="0" name="web"
recovery="relocate">
            <script ref="httpd"/>
            <ip ref="xx.xx.xx.xx"/>
        </service>

Also, Could someone send me some links or search terms for details on
the structure for the cluster.conf file?  Everything I find on redhats
site is conga or system config cluster based, but would like an
explanation of the possible values for the conf file.

Thanks,
B


From burton at simondsfamily.com  Fri Apr  3 13:51:11 2009
From: burton at simondsfamily.com (Burton Simonds)
Date: Fri, 3 Apr 2009 09:51:11 -0400
Subject: [Linux-cluster] cluster email
Message-ID: <77f48c0d0904030651m1f502869gad6c8fcb2dc7e6e1@mail.gmail.com>

We are running a 2 node cluster in an active / passive configuration.
When running a sendmail on the passive host, we get a message saying
it can not bind to the vip address (because that address is on the
active.)  Is there a way to configure sendmail / postfix to still be
able to send outgoing messages on the passive node.

B


From s.wendy.cheng at gmail.com  Fri Apr  3 14:15:39 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Fri, 03 Apr 2009 10:15:39 -0400
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>	<49D5300F.9010805@gmail.com>	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>	<49D589B8.1050702@gmail.com>
	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
Message-ID: <49D61A0B.9010001@gmail.com>

Kadlecsik Jozsef wrote:
> On Thu, 2 Apr 2009, Wendy Cheng wrote:
>
>   
>>>> Kadlecsik Jozsef wrote:
>>>>     
>>>>         
>>>>> - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
>>>>>   gfs_drop_inode as .drop_inode replacing .put_inode.
>>>>>   .put_inode was called without holding a lock, but .drop_inode
>>>>>   is called under inode_lock held. Might it be a problem
>>>>>       
>>>>>           
>> Based on code reading ...
>> 1. iput() gets inode_lock (a spin lock)
>> 2. iput() calls iput_final()
>> 3. iput_final() calls filesystem drop_inode(), followed by
>> generic_drop_inode()
>> 4. generic_drop_inode() unlock inode_lock after doing all sorts of fun things
>> with the inode
>>
>> So look to me that generic_drop_inode() statement within 
>> gfs_drop_inode() should be removed. Otherwise you would get double 
>> unlock and double list free.
>>     
>
> I think those function calls are right: iput_final calls either the 
> filesystem drop_inode function (in this case gfs_drop_inode) or 
> generic_drop_inode. There's no double call of generic_drop_inode. However 
> gfs_sync_page_i (and in turn filemap_fdatawrite and filemap_fdatawait) is 
> now called under inode_lock held and that was not so in previous versions.
> But I'm just speculating.
>   

It *is* called twice unless my eyes deceive me

static inline void iput_final(struct inode *inode)
{
const struct super_operations *op = inode->i_sb->s_op;
void (*drop)(struct inode *) = generic_drop_inode;

if (op && op->drop_inode)
drop = op->drop_inode; /* gfs call generic_drop_inode() */
drop(inode); /* second call into generic_drop_inode() again. */
}

>  
>   
>> In short, *remove* line #73 from gfs-kernel/src/gfs/ops_super.c in your 
>> source and let us know how it goes.
>>     
>
> I won't get a chance to start a test before Monday, sorry. 
>
>   

I'll be traveling next week as well. However, a few cautious words here:

Even this "fix" eventually solves your hang, running GFS on newer 
kernels with production system simply is *not* a good idea.

-- Wendy


From bpejakov at fina.hr  Fri Apr  3 14:15:44 2009
From: bpejakov at fina.hr (Branimir)
Date: Fri, 03 Apr 2009 16:15:44 +0200
Subject: [Linux-cluster] Service behavior when migration fails.
In-Reply-To: <77f48c0d0904030644n3b87e83drfd0c75b7b2a645c3@mail.gmail.com>
References: <77f48c0d0904030644n3b87e83drfd0c75b7b2a645c3@mail.gmail.com>
Message-ID: <49D61A10.8070908@fina.hr>

Burton Simonds wrote:
> I have a 2 node cluster, and I would like for the following behavior to happen:
> 
> Node 1 is running apache
> 
> Node 2 is in standby, but has a bad apache config (I know it should be
> tested before going into production, but lets pretend I am a moron)
> 
> Node 1's apache is killed.
> 
> Tries to migrate to node 2, but fails
> 
> Tries to migrage back to node 1 and succeeds.
> 
> 
> What is happening is when it tries to go back to node one, it says
> failed, and states the ip address is in use.
> 
> I have the service set up as follows:
> 
>         <service autostart="1" domain="dom1" exclusive="0" name="web"
> recovery="relocate">
>             <script ref="httpd"/>
>             <ip ref="xx.xx.xx.xx"/>
>         </service>
> 
> Also, Could someone send me some links or search terms for details on
> the structure for the cluster.conf file?  Everything I find on redhats
> site is conga or system config cluster based, but would like an
> explanation of the possible values for the conf file.


Hi, Burton!

concerning cluster.conf file here is a useful link:

http://sources.redhat.com/cluster/doc/cluster_schema.html

that contains links to cluster schema structure for RHEL4 and 5.


Cheers,

Branimir


From burton at simondsfamily.com  Fri Apr  3 14:32:27 2009
From: burton at simondsfamily.com (Burton Simonds)
Date: Fri, 3 Apr 2009 10:32:27 -0400
Subject: [Linux-cluster] Service behavior when migration fails.
In-Reply-To: <49D61A10.8070908@fina.hr>
References: <77f48c0d0904030644n3b87e83drfd0c75b7b2a645c3@mail.gmail.com>
	<49D61A10.8070908@fina.hr>
Message-ID: <77f48c0d0904030732w23e9c4cehd2b8f927d3d21480@mail.gmail.com>

Thanks for that, I am looking for a little more detail, for example
somewhere i found that I can add an <action\> tag to individual
resources to manage when the are checked.

So i want to know what I can add to a service definition and what I
can manipulate in it.  This does give the basic structure of it, and I
appreciate the link!

Thanks,
B

On Fri, Apr 3, 2009 at 10:15 AM, Branimir <bpejakov at fina.hr> wrote:
> Burton Simonds wrote:
>>
>> I have a 2 node cluster, and I would like for the following behavior to
>> happen:
>>
>> Node 1 is running apache
>>
>> Node 2 is in standby, but has a bad apache config (I know it should be
>> tested before going into production, but lets pretend I am a moron)
>>
>> Node 1's apache is killed.
>>
>> Tries to migrate to node 2, but fails
>>
>> Tries to migrage back to node 1 and succeeds.
>>
>>
>> What is happening is when it tries to go back to node one, it says
>> failed, and states the ip address is in use.
>>
>> I have the service set up as follows:
>>
>> ? ? ? ?<service autostart="1" domain="dom1" exclusive="0" name="web"
>> recovery="relocate">
>> ? ? ? ? ? ?<script ref="httpd"/>
>> ? ? ? ? ? ?<ip ref="xx.xx.xx.xx"/>
>> ? ? ? ?</service>
>>
>> Also, Could someone send me some links or search terms for details on
>> the structure for the cluster.conf file? ?Everything I find on redhats
>> site is conga or system config cluster based, but would like an
>> explanation of the possible values for the conf file.
>
>
> Hi, Burton!
>
> concerning cluster.conf file here is a useful link:
>
> http://sources.redhat.com/cluster/doc/cluster_schema.html
>
> that contains links to cluster schema structure for RHEL4 and 5.
>
>
> Cheers,
>
> Branimir
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From npf-mlists at eurotux.com  Fri Apr  3 15:08:41 2009
From: npf-mlists at eurotux.com (Nuno Fernandes)
Date: Fri, 3 Apr 2009 16:08:41 +0100
Subject: [Linux-cluster] cluster email
In-Reply-To: <77f48c0d0904030651m1f502869gad6c8fcb2dc7e6e1@mail.gmail.com>
References: <77f48c0d0904030651m1f502869gad6c8fcb2dc7e6e1@mail.gmail.com>
Message-ID: <200904031608.42725.npf-mlists@eurotux.com>

On Friday 03 April 2009 14:51:11 Burton Simonds wrote:
> We are running a 2 node cluster in an active / passive configuration.
> When running a sendmail on the passive host, we get a message saying
> it can not bind to the vip address (because that address is on the
> active.)  Is there a way to configure sendmail / postfix to still be
> able to send outgoing messages on the passive node.
>
> B
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


echo "net.ipv4.ip_nonlocal_bind=1" >> /etc/sysctl.conf
sysctl -p

Now postfix/sendmail should be able to bind to ips that the server does not 
have.

Best regards,
./npf


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090403/6dcae4da/attachment.htm>

From kadlec at mail.kfki.hu  Fri Apr  3 15:12:11 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Fri, 3 Apr 2009 17:12:11 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49D61A0B.9010001@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280057480.20524@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
	<49D589B8.1050702@gmail.com>
	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
	<49D61A0B.9010001@gmail.com>
Message-ID: <alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>

On Fri, 3 Apr 2009, Wendy Cheng wrote:

> Kadlecsik Jozsef wrote:
> > On Thu, 2 Apr 2009, Wendy Cheng wrote:
> > > > > Kadlecsik Jozsef wrote:
> > > > >             
> > > > > > - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
> > > > > >   gfs_drop_inode as .drop_inode replacing .put_inode.
> > > > > >   .put_inode was called without holding a lock, but .drop_inode
> > > > > >   is called under inode_lock held. Might it be a problem
> > > > > >                 
> > > Based on code reading ...
> > > 1. iput() gets inode_lock (a spin lock)
> > > 2. iput() calls iput_final()
> > > 3. iput_final() calls filesystem drop_inode(), followed by
> > > generic_drop_inode()
> > > 4. generic_drop_inode() unlock inode_lock after doing all sorts of fun
> > > things
> > > with the inode
> > > 
> > > So look to me that generic_drop_inode() statement within gfs_drop_inode()
> > > should be removed. Otherwise you would get double unlock and double list
> > > free.
> > 
> > I think those function calls are right: iput_final calls either the
> > filesystem drop_inode function (in this case gfs_drop_inode) or
> > generic_drop_inode. There's no double call of generic_drop_inode. However
> > gfs_sync_page_i (and in turn filemap_fdatawrite and filemap_fdatawait) is
> > now called under inode_lock held and that was not so in previous versions.
> > But I'm just speculating.
> 
> It *is* called twice unless my eyes deceive me
> 
> static inline void iput_final(struct inode *inode)
> {
> const struct super_operations *op = inode->i_sb->s_op;
> void (*drop)(struct inode *) = generic_drop_inode;
> 
> if (op && op->drop_inode)
> drop = op->drop_inode; /* gfs call generic_drop_inode() */
> drop(inode); /* second call into generic_drop_inode() again. */
> }

No, the line 'drop = op->drop_inode;' is just an assignment (there's no 
function argument): if there's a filesystem-specific drop_inode 
function, that is assigned to 'drop', overwriting thus the originally 
initialized 'generic_drop_inode' value of the 'drop' variable as function 
pointer. And the last line is only the function call, with the proper 
argument.
 
> > I won't get a chance to start a test before Monday, sorry. 
> 
> I'll be traveling next week as well. However, a few cautious words here:
> 
> Even this "fix" eventually solves your hang, running GFS on newer 
> kernels with production system simply is *not* a good idea.

That might be so, but this is a catch: at least we must test GFS2 before 
migrating from GFS1 to GFS2. That requires a recent kernel, with working 
GFS1 and GFS2 support. A migration of an in-production system cannot be 
started lightheartedly, and transforming GFS1 to GFS2 wont' be easy: that 
needs downtime, fresh backups created after bringing down the systems, 
backup verification, converting/creating the GFS2 volumes, restoring the 
data. And crossing fingers that it must not be undone if something goes 
wrong. It's not the same as just replacing an older kernel and an older 
package.

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From s.wendy.cheng at gmail.com  Fri Apr  3 15:41:05 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Fri, 03 Apr 2009 11:41:05 -0400
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>	<49D5300F.9010805@gmail.com>	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>	<49D589B8.1050702@gmail.com>	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>	<49D61A0B.9010001@gmail.com>
	<alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
Message-ID: <49D62E11.5050609@gmail.com>

Kadlecsik Jozsef wrote:
> On Fri, 3 Apr 2009, Wendy Cheng wrote:
>
>   
>> Kadlecsik Jozsef wrote:
>>     
>>> On Thu, 2 Apr 2009, Wendy Cheng wrote:
>>>       
>>>>>> Kadlecsik Jozsef wrote:
>>>>>>             
>>>>>>             
>>>>>>> - commit 82d176ba485f2ef049fd303b9e41868667cebbdb
>>>>>>>   gfs_drop_inode as .drop_inode replacing .put_inode.
>>>>>>>   .put_inode was called without holding a lock, but .drop_inode
>>>>>>>   is called under inode_lock held. Might it be a problem
>>>>>>>                 
>>>>>>>               
>>>> Based on code reading ...
>>>> 1. iput() gets inode_lock (a spin lock)
>>>> 2. iput() calls iput_final()
>>>> 3. iput_final() calls filesystem drop_inode(), followed by
>>>> generic_drop_inode()
>>>> 4. generic_drop_inode() unlock inode_lock after doing all sorts of fun
>>>> things
>>>> with the inode
>>>>
>>>> So look to me that generic_drop_inode() statement within gfs_drop_inode()
>>>> should be removed. Otherwise you would get double unlock and double list
>>>> free.
>>>>         
>>> I think those function calls are right: iput_final calls either the
>>> filesystem drop_inode function (in this case gfs_drop_inode) or
>>> generic_drop_inode. There's no double call of generic_drop_inode. However
>>> gfs_sync_page_i (and in turn filemap_fdatawrite and filemap_fdatawait) is
>>> now called under inode_lock held and that was not so in previous versions.
>>> But I'm just speculating.
>>>       
>> It *is* called twice unless my eyes deceive me
>>
>> static inline void iput_final(struct inode *inode)
>> {
>> const struct super_operations *op = inode->i_sb->s_op;
>> void (*drop)(struct inode *) = generic_drop_inode;
>>
>> if (op && op->drop_inode)
>> drop = op->drop_inode; /* gfs call generic_drop_inode() */
>> drop(inode); /* second call into generic_drop_inode() again. */
>> }
>>     
>
> No, the line 'drop = op->drop_inode;' is just an assignment (there's no 
>   

ok, I see ... my eyes do deceive me :) - actually it is my brain that 
was not working ...

Then don't remove it yet. The ramification needs more thoughts ...

-- Wendy


From lhh at redhat.com  Fri Apr  3 19:17:57 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 03 Apr 2009 15:17:57 -0400
Subject: [Linux-cluster] linux clustering <linux-cluster@redhat.com>
In-Reply-To: <3c88c73a0904030437o20924ff1n48c63926520a633@mail.gmail.com>
References: <3c88c73a0904030437o20924ff1n48c63926520a633@mail.gmail.com>
Message-ID: <1238786277.17913.42.camel@ayanami>

On Fri, 2009-04-03 at 11:37 +0000, remi doubi wrote:
> here's the article :
> 
> http://www.mail-archive.com/linux-cluster at redhat.com/msg05169.html


http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=a97856264104f0fe2156daed7a4c637dd1bacf8e

?

-- Lon


From jeff.sturm at eprize.com  Fri Apr  3 20:42:03 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Fri, 3 Apr 2009 16:42:03 -0400
Subject: [Linux-cluster] Network Interface Binding for cman
In-Reply-To: <200904030839.08056.mrugeshkarnik@gmail.com>
References: <200904022103.38273.mrugeshkarnik@gmail.com><64D0546C5EBBD147B75DE133D798665F02FDB6D8@hugo.eprize.local>
	<200904030839.08056.mrugeshkarnik@gmail.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB6FE@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Mrugesh Karnik
> Sent: Thursday, April 02, 2009 11:09 PM
> To: linux-cluster at redhat.com
> Subject: Re: [Linux-cluster] Network Interface Binding for cman
> 
> On Thursday 02 Apr 2009 21:34:12 Jeff Sturm wrote:
> > It binds to a multicast address.  That address is bound to one 
> > interface normally.
> 
> Well, how do I specify which interface to bind that multicast 
> address to?

Generally speaking you would create a route entry, e.g.:

ip route add multicast 224.0.0.0/4 dev eth1 scope link

However I believe CMAN does not require this, and will automatically
bind to the same interface as the unicast address associated with the
node names you specify in cluster.conf.

> I see the `bindnetaddr' directive in openais.conf man page.

The multicast address (not interface) is automatically determined based
on your cluster name.  Probably best to leave it that way unless you
have a specific reason to override it.

Jeff


From sdake at redhat.com  Fri Apr  3 20:46:00 2009
From: sdake at redhat.com (Steven Dake)
Date: Fri, 03 Apr 2009 13:46:00 -0700
Subject: [Linux-cluster] Network Interface Binding for cman
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB6FE@hugo.eprize.local>
References: <200904022103.38273.mrugeshkarnik@gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB6D8@hugo.eprize.local>
	<200904030839.08056.mrugeshkarnik@gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB6FE@hugo.eprize.local>
Message-ID: <1238791560.4602.54.camel@sdake-laptop>

On Fri, 2009-04-03 at 16:42 -0400, Jeff Sturm wrote:
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com 
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Mrugesh Karnik
> > Sent: Thursday, April 02, 2009 11:09 PM
> > To: linux-cluster at redhat.com
> > Subject: Re: [Linux-cluster] Network Interface Binding for cman
> > 
> > On Thursday 02 Apr 2009 21:34:12 Jeff Sturm wrote:
> > > It binds to a multicast address.  That address is bound to one 
> > > interface normally.
> > 
> > Well, how do I specify which interface to bind that multicast 
> > address to?
> 
> Generally speaking you would create a route entry, e.g.:
> 
> ip route add multicast 224.0.0.0/4 dev eth1 scope link
> 
> However I believe CMAN does not require this, and will automatically
> bind to the same interface as the unicast address associated with the
> node names you specify in cluster.conf.
> 
> > I see the `bindnetaddr' directive in openais.conf man page.
> 
> The multicast address (not interface) is automatically determined based
> on your cluster name.  Probably best to leave it that way unless you
> have a specific reason to override it.
> 
> Jeff

Multicast and unicast sockets are explicitly bound to the interface that
matches the ip address in the configuration.  This removes the need for
any default routes or problems with multicast binding to one interface
while the unicast socket binds to another.

regards
-steve

> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From dist-list at LEXUM.UMontreal.CA  Fri Apr  3 20:46:57 2009
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Fri, 03 Apr 2009 16:46:57 -0400
Subject: [Linux-cluster] Conga : cluster creation problem
Message-ID: <49D675C1.5020902@lexum.umontreal.ca>

Hello,
I just installed 5.3 to create a new cluster

I have :
1 luci server
3 ricci servers

iptables and selinux are off on all servers. luci and ricci are started

When I tried to create a cluster inside http interface :

An error occurred when trying to contact any of the nodes in the cluster.

I used tcpdump and I can see traffic on the wire.
Any idea ?
Regards


From rmccabe at redhat.com  Fri Apr  3 20:58:08 2009
From: rmccabe at redhat.com (Ryan McCabe)
Date: Fri, 3 Apr 2009 16:58:08 -0400
Subject: [Linux-cluster] Conga : cluster creation problem
In-Reply-To: <49D675C1.5020902@lexum.umontreal.ca>
References: <49D675C1.5020902@lexum.umontreal.ca>
Message-ID: <20090403205807.GA2209@redhat.com>

On Fri, Apr 03, 2009 at 04:46:57PM -0400, FM wrote:
> An error occurred when trying to contact any of the nodes in the cluster.

Try either upgrading to luci-0.12.1-7.3 or unticking the "Enable shared
storage" box while creating the cluster.


Ryan


From vu at sivell.com  Sat Apr  4 01:59:58 2009
From: vu at sivell.com (Vu Pham)
Date: Fri, 03 Apr 2009 20:59:58 -0500
Subject: [Linux-cluster] Conga : cluster creation problem
In-Reply-To: <49D675C1.5020902@lexum.umontreal.ca>
References: <49D675C1.5020902@lexum.umontreal.ca>
Message-ID: <49D6BF1E.9040600@sivell.com>


FM wrote:
> Hello,
> I just installed 5.3 to create a new cluster
> 
> I have :
> 1 luci server
> 3 ricci servers
> 
> iptables and selinux are off on all servers. luci and ricci are started
> 
> When I tried to create a cluster inside http interface :
> 
> An error occurred when trying to contact any of the nodes in the cluster.
> 
> I used tcpdump and I can see traffic on the wire.
> Any idea ?
> Regards


I believe luci that comes with RHEL5.3 media is broken. yum to get the 
lastest version luci-0.12.1-7.3.el5_3 of luci from RHN. I tried that one 
a week ago and could use luci to create cluster fine.

Vu


From mrugeshkarnik at gmail.com  Sat Apr  4 07:47:59 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Sat, 4 Apr 2009 13:17:59 +0530
Subject: [Linux-cluster] Network Interface Binding for cman
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB6FE@hugo.eprize.local>
References: <200904022103.38273.mrugeshkarnik@gmail.com>
	<200904030839.08056.mrugeshkarnik@gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB6FE@hugo.eprize.local>
Message-ID: <200904041317.59947.mrugeshkarnik@gmail.com>

On Saturday 04 Apr 2009 02:12:03 Jeff Sturm wrote:
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Mrugesh Karnik
> > Sent: Thursday, April 02, 2009 11:09 PM
> > To: linux-cluster at redhat.com
> > Subject: Re: [Linux-cluster] Network Interface Binding for cman
> >
> > On Thursday 02 Apr 2009 21:34:12 Jeff Sturm wrote:
> > > It binds to a multicast address.  That address is bound to one
> > > interface normally.
> >
> > Well, how do I specify which interface to bind that multicast
> > address to?
>
> Generally speaking you would create a route entry, e.g.:
>
> ip route add multicast 224.0.0.0/4 dev eth1 scope link
>
> However I believe CMAN does not require this, and will automatically
> bind to the same interface as the unicast address associated with the
> node names you specify in cluster.conf.
>
> > I see the `bindnetaddr' directive in openais.conf man page.
>
> The multicast address (not interface) is automatically determined based
> on your cluster name.  Probably best to leave it that way unless you
> have a specific reason to override it.

Didn't need to change anything manually. I just specified the FQDN for that 
specific interface as the nodename in cluster.conf. Worked perfectly.

Earlier I was using the hostname given by `uname -n` as is done with 
heartbeat. The reference link from the wiki said that I could use either the 
DNS name or the IP address as well.

Thanks,
Mrugesh Karnik
Executive | Technical Services
Geodesic Ltd. | www.geodesic.com
Tel: +91 22 4031 5800
Mobile: +91 9892286688


From reggaestar at gmail.com  Sat Apr  4 10:21:26 2009
From: reggaestar at gmail.com (remi doubi)
Date: Sat, 4 Apr 2009 10:21:26 +0000
Subject: [Linux-cluster] Fwd: Virtualization on top of Centos cluster
In-Reply-To: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
References: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
Message-ID: <3c88c73a0904040321qdcc8d8u39e81009c80605c4@mail.gmail.com>

Isn't virtualization made to virtualise nodes sources ( for me they're two
), in order to let the VMs turn in a virtual environement without knowing on
which physical node they're running ???

Please i need to know if this is possible, because on Monday i have to tell
my trainer how i'm going to deal with this !!!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090404/cd4d4700/attachment.htm>

From s.wendy.cheng at gmail.com  Mon Apr  6 04:12:34 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Sun, 05 Apr 2009 23:12:34 -0500
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49D62E11.5050609@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<alpine.DEB.2.00.0903280131570.20524@lxserv0.kfki.hu>	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>	<49CE4B36.2000103@gmail.com>	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>	<49D03447.2000901@gmail.com>	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>	<49D0EC83.4030202@gmail.com>
	<49D11334.5030406@redhat.com>	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>	<49D5300F.9010805@gmail.com>	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>	<49D589B8.1050702@gmail.com>	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>	<49D61A0B.9010001@gmail.com>
	<alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
	<49D62E11.5050609@gmail.com>
Message-ID: <49D98132.9010201@gmail.com>


>
> Then don't remove it yet. The ramification needs more thoughts ...
>

That generic_drop_inode() can *not* be removed.

Not sure whether my head is clear enough this time ....

Based on code reading ...
1. iput() gets inode_lock (a spin lock)
2. iput() calls iput_final()
3. iput_final() calls gfs_drop_inode() that calls
    generic_drop_inode()
4. generic_drop_inode() unlocks inode_lock.

In theory, this logic violates the usage of spin lock as it is expected 
to lock for a short period of time but gfs_drop_inode() could take a 
while to finish. It has a blocking write page that need to make sure the 
data gets sync-ed to storage before it can returns. Make matter worse is 
that inode_lock is a global lock that could block non-GFS threads. One 
would think a quick fix is to drop the inode_lock at the beginning of 
gfs_drop_inode() and then re-acquires it after gfs sync the page. 
Unfortunately, inode_lock is not an exported symbol and GFS is an 
out-of-tree filesystem that has to be compiled as a kernel module. So 
this trick won't work for GFS.

With a flight to catch tomorrow and a flu-infected body, I lose the will 
to think over what the correct fix should and/or will be.

-- Wendy


From kadlec at mail.kfki.hu  Mon Apr  6 11:09:35 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Mon, 6 Apr 2009 13:09:35 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49D98132.9010201@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<1a2a6dd60903272036k7bedef6ft718cf74331f562bc@mail.gmail.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
	<49D589B8.1050702@gmail.com>
	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
	<49D61A0B.9010001@gmail.com>
	<alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
	<49D62E11.5050609@gmail.com> <49D98132.9010201@gmail.com>
Message-ID: <alpine.DEB.2.00.0904061301110.22370@lxserv1.kfki.hu>

On Sun, 5 Apr 2009, Wendy Cheng wrote:

> Based on code reading ...
> 1. iput() gets inode_lock (a spin lock)
> 2. iput() calls iput_final()
> 3. iput_final() calls gfs_drop_inode() that calls
>    generic_drop_inode()
> 4. generic_drop_inode() unlocks inode_lock.
> 
> In theory, this logic violates the usage of spin lock as it is expected 
> to lock for a short period of time but gfs_drop_inode() could take a 
> while to finish. It has a blocking write page that need to make sure the 
> data gets sync-ed to storage before it can returns. Make matter worse is 
> that inode_lock is a global lock that could block non-GFS threads. One 
> would think a quick fix is to drop the inode_lock at the beginning of 
> gfs_drop_inode() and then re-acquires it after gfs sync the page. 
> Unfortunately, inode_lock is not an exported symbol and GFS is an 
> out-of-tree filesystem that has to be compiled as a kernel module. So 
> this trick won't work for GFS.

Actually, it can work. inode_lock is not private and gfs can unlock/lock 
it:

--- gfs-orig/ops_super.c	2009-01-22 13:33:51.000000000 +0100
+++ gfs/ops_super.c	2009-04-06 13:07:06.000000000 +0200
@@ -9,6 +9,7 @@
 #include <linux/statfs.h>
 #include <linux/seq_file.h>
 #include <linux/mount.h>
+#include <linux/writeback.h>
 
 #include "gfs.h"
 #include "dio.h"
@@ -68,8 +69,11 @@
 	if (ip &&
 	    !inode->i_nlink &&
 	    S_ISREG(inode->i_mode) &&
-	    !sdp->sd_args.ar_localcaching)
+	    !sdp->sd_args.ar_localcaching) {
+	    	spin_unlock(&inode_lock);
 		gfs_sync_page_i(inode, DIO_START | DIO_WAIT);
+		spin_lock(&inode_lock);
+	}
 	generic_drop_inode(inode);
 }

Tomorrow I'll give it a try, there's no time to test it today.

> With a flight to catch tomorrow and a flu-infected body, I lose the will 
> to think over what the correct fix should and/or will be.

A speedy recover! And thank you all your efforts!

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From kadlec at mail.kfki.hu  Mon Apr  6 11:15:42 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Mon, 6 Apr 2009 13:15:42 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904061301110.22370@lxserv1.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
	<49D589B8.1050702@gmail.com>
	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
	<49D61A0B.9010001@gmail.com>
	<alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
	<49D62E11.5050609@gmail.com> <49D98132.9010201@gmail.com>
	<alpine.DEB.2.00.0904061301110.22370@lxserv1.kfki.hu>
Message-ID: <alpine.DEB.2.00.0904061313350.22370@lxserv1.kfki.hu>

On Mon, 6 Apr 2009, Kadlecsik Jozsef wrote:

> On Sun, 5 Apr 2009, Wendy Cheng wrote:
> 
> > Based on code reading ...
> > 1. iput() gets inode_lock (a spin lock)
> > 2. iput() calls iput_final()
> > 3. iput_final() calls gfs_drop_inode() that calls
> >    generic_drop_inode()
> > 4. generic_drop_inode() unlocks inode_lock.
> > 
> > In theory, this logic violates the usage of spin lock as it is expected 
> > to lock for a short period of time but gfs_drop_inode() could take a 
> > while to finish. It has a blocking write page that need to make sure the 
> > data gets sync-ed to storage before it can returns. Make matter worse is 
> > that inode_lock is a global lock that could block non-GFS threads. One 
> > would think a quick fix is to drop the inode_lock at the beginning of 
> > gfs_drop_inode() and then re-acquires it after gfs sync the page. 
> > Unfortunately, inode_lock is not an exported symbol and GFS is an 
> > out-of-tree filesystem that has to be compiled as a kernel module. So 
> > this trick won't work for GFS.
> 
> Actually, it can work. inode_lock is not private and gfs can unlock/lock 
> it:

Darn, it's not EXPORTED! Anyway, I'll patch the kernel and will test it.

Best regards,
Jzosef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From rhurst at bidmc.harvard.edu  Mon Apr  6 11:58:33 2009
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Mon, 06 Apr 2009 07:58:33 -0400
Subject: [Linux-cluster] RHEL 4.7 rgmanager-1.9.80-1 debug output?
In-Reply-To: <3c88c73a0904040321qdcc8d8u39e81009c80605c4@mail.gmail.com>
References: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
	<3c88c73a0904040321qdcc8d8u39e81009c80605c4@mail.gmail.com>
Message-ID: <1239019113.5265.6.camel@WSBID06223.bidmc.harvard.edu>

We're having a bizarre problem with rgmanager ... every now and then, it
decides to stop a service with no recorded reason as to why:

Apr  6 04:05:25 acropolis clurgmgrd[967]: <notice> Stopping service
WATSONAPP1 

Is there a way to increase debugging / verbosity with rgmanager?

I cannot find command-line options for clurgmgrd, other than the
soft-coded value in its service script, /etc/init.d/rgmanager.  I
made /etc/sysconfig/cluster and added the line:

RGMGR_OPTS="-t 30 -d"

... hoping that the "-d" will increase debug level in syslog messages,
but nothing new appeared.

Please advise, thanks!


________________________________________________________________________

Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090406/9e7d325e/attachment.htm>

From alan.zg at gmail.com  Mon Apr  6 20:17:05 2009
From: alan.zg at gmail.com (Alan A)
Date: Mon, 6 Apr 2009 15:17:05 -0500
Subject: [Linux-cluster] RHEL 5.3 Cluster - Time Zone Question
Message-ID: <fac531740904061317w1cce7f09j9b08aee4dc597ede@mail.gmail.com>

Just a quick question - could the nodes belonging to the same cluster run in
different time zones? What are the possible implications if any?

-- 
Alan A.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090406/2e09af9b/attachment.htm>

From theophanis_kontogiannis at yahoo.gr  Mon Apr  6 21:59:12 2009
From: theophanis_kontogiannis at yahoo.gr (Theophanis Kontogiannis)
Date: Tue, 7 Apr 2009 00:59:12 +0300
Subject: [Linux-cluster] Can not mount GFS2 after upgrading to 5.3?
Message-ID: <003201c9b702$ed581480$c8083d80$@gr>


Hello All,

 
I have a 2 nodes cluster that has a GFS2 filesystem, created with 5.2

 
Now I have clean installed 5.3 and gfs2-utils-0.1.53-1.el5_3.1

 
I try to mount my GFS2 and I get the following error:

 
/sbin/mount.gfs2: bad seek: Invalid argument on line 276 of file util.c

 
I hope this does not mean that I cannot have backwards compatibility with my
GFS2 created with RHEL 5.2 and also that I cannot mount it now under 5.3 (or
this is exactly what it means??!! ;) )

 
Any help to mount my fs?

 
Thank you all for your time

 
Theophanis Kontogiannis

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090407/5886df0a/attachment.htm>

From theophanis_kontogiannis at yahoo.gr  Mon Apr  6 22:20:51 2009
From: theophanis_kontogiannis at yahoo.gr (Theophanis Kontogiannis)
Date: Tue, 7 Apr 2009 01:20:51 +0300
Subject: [Linux-cluster] Can not mount GFS2 after upgrading to 5.3?
In-Reply-To: <003201c9b702$ed581480$c8083d80$@gr>
References: <003201c9b702$ed581480$c8083d80$@gr>
Message-ID: <004001c9b705$f2f016b0$d8d04410$@gr>

I reply to myself.

 
I did not observe that I had not commented out the wildcard default filter of lvm.conf and so no matter what my filter was, it was not considered.

 
So LVM was using duplicate UUIDs and so I could not mount ( I forgot to mention my setup: DRBD ? PV ? LV ? VG ? GFS2)

 
Now that I have my filter working, I can also mount my GFS2.

 
(though the error looked interesting).

 
Thank you All.

 
Theophanis Kontogiannis

 
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Theophanis Kontogiannis
Sent: Tuesday, April 07, 2009 12:59 AM
To: 'linux clustering'
Subject: [Linux-cluster] Can not mount GFS2 after upgrading to 5.3?

 
Hello All,

 
I have a 2 nodes cluster that has a GFS2 filesystem, created with 5.2

 
Now I have clean installed 5.3 and gfs2-utils-0.1.53-1.el5_3.1

 
I try to mount my GFS2 and I get the following error:

 
/sbin/mount.gfs2: bad seek: Invalid argument on line 276 of file util.c

 
I hope this does not mean that I cannot have backwards compatibility with my GFS2 created with RHEL 5.2 and also that I cannot mount it now under 5.3 (or this is exactly what it means??!! ;) )

 
Any help to mount my fs?

 
Thank you all for your time

 
Theophanis Kontogiannis

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090407/e35959d9/attachment.htm>

From muffaleta at gmail.com  Tue Apr  7 00:18:49 2009
From: muffaleta at gmail.com (Christopher Chen)
Date: Mon, 6 Apr 2009 17:18:49 -0700
Subject: [Linux-cluster] Stale Rgmanager State?
Message-ID: <7bc80d500904061718n629fa70bg7b54326c459c0d6b@mail.gmail.com>

Upon adding a new virtual machine to my cluster.conf, and submitting a
ccs_tool update command from a cluster host, I see in cman_tool that
the version has incremented. However, I have one host that doesn't
show the new services in clustat.

I'm used to seeing lines in syslog where cman notices a new config,
then clurmgrd re-reads the config. Two of three machines do this, but
the third only shows the version change.

Is there any way to ask politely for a node to sync back up?
init.d/rgmanager reload doesn't seem to do anything.

The machines are currently running Centos 5.2 Xen Dom0 kernels.

cc

-- 
Chris Chen <muffaleta at gmail.com>
"I want the kind of six pack you can't drink."
-- Micah


From linux at vdkrone.de  Tue Apr  7 12:21:39 2009
From: linux at vdkrone.de (Tobias von der Krone)
Date: Tue, 07 Apr 2009 14:21:39 +0200
Subject: [Linux-cluster] Unmounting gfs2 on reboot hangs
Message-ID: <49DB4553.9080907@vdkrone.de>

Hello,
I'm using a up2date RHEL 5.3 and I have the problem that unmounting gfs2 
with its init script on reboot/shutdown hangs. Using 'service gfs2 stop' 
while system is running works fine.
I'm testing starting / stopping gfs2 earlier and later... but till now 
no improvements.
Does anybody have an idea what is going wrong?
Does anybody know which dependencies gfs2 has?

Tobias


From lw at hygeos.com  Tue Apr  7 13:16:58 2009
From: lw at hygeos.com (Laurent Wandrebeck)
Date: Tue, 7 Apr 2009 15:16:58 +0200
Subject: [Linux-cluster] Seeking advice about solution to adopt for a cluster
Message-ID: <200904071516.58177.lw@hygeos.com>

Hi list,

our park is going to gain three new boxes, pushing storage size to 70TB.
I think it's time to get rid of nfs /net automounts, and to go for some kind 
of a cluster.
long story short:
each typical server has a local storage (1 to 8TB, up to 15 soon), that are 
sata discs connected to a 3ware card, using hard raid 10,5,6.
each of these machines is aimed at processing data from a given satellite.
there are also one pgsql server, one apache server, one nis/home (via nfs) 
server each with a 3ware and its discs. brw, the nis/nfs server is soon to be 
turned into a directory server.
gbps network, non administrable switches. /24 network class. every server run 
Centos 4.7 or 5.3 (only one 4.7 remains, to be precise). every box runs 
x86_64 software.

now, I'd like to transform that mess into:
1) have one volume for sat1...N data. So that, if needed, you can process 
whatever you want from whatever machine.
2) have a failover machine that could automagically take load for pg, apache 
and nfs/nis (the soon to be directory server) if the dedicated box fails. 
that means an efficient replication so data are identical on original 
pg/apache/etc machines and the failover one.
3) have some kind of load balancing on sat1...N, that would put processes on a 
box where processed data are local, without having the user to decide where 
to launch processes. resulting data from processes would have to be written 
on the local storage of the box. So that sat1 data and sat1 processed data 
stay on the same physical volume. That way, if a box really badly crashes, we 
know which data were lost (we can't afford to backup 70TB).

now, questions (thx for arriving down there:) :

1) what I've read and been told is GFS wouldn't do the trick. Lustre and 
hadoop/hdfs could. For now, with what I've read about lustre, it could do the 
trick, but found nothing about load balacing.
2) failover should be possible if i understood correctly doc. where i'm a bit 
stuck is the replication part. wal shipping should do the trick for pg. 
directory server has some kind of failover mechanism afaik. about apache, i'm 
a bit in the dark. could someone enlighten me ?
I've been told that drdb could solve the whole replication problem ?
3) is such a thing possible with cluster suite ? at all ? Would there be any 
better way to solve problem of boxes configuration so our DC can 
continue to grow without becoming a nightmare for me and users ?
4) right now, user homes follow them to whatever box they log on thanks to 
nfs. How to make such a thing work with Directory server ? Use another lustre 
volume ? what if servers are hidden by a load balancer ?

You'll find attached some kind of ascii art trying to describe what i'd like 
to get :) (open it with fixed size font)
Thanks a lot for helping.
Best Regards,
-- 
Laurent
-------------- next part --------------
        _____
|S1|----|L  |----|U1|--------|
        |U  |                |
|S2|----|S  |----|U2|--------|
        |T  |                |
|S3|----|R  |----|U3|--------|
        |E  |                |
|S4|----|   |----|U4|--------|
        |V  |                |
.       |O  |    .           |
.       |L  |    .           |
.       |U  |    .           |
|Sn|----|M  |----|Un|--------| |--home Lustre volume accessible by every box ?
        |E  |                | |
        |   |---------------|Ds|--|
        |   |                     |
        |   |----|Pg|--|----------|
        |   |          |
        |   |----|Ap|--|
        |   |          |
        |   |----|Fo|--|
        |___|

|Sx|: boxes with dedicated storage for satellite images processing.
|Ux|: user boxes.
|Ds|: Directory server (serves /home to user machines)
|Pg|: PostgreSQL server
|Ap|: Apache server
|Fo|: Failover server (can take Pg, Ds, Ap load)

From kadlec at mail.kfki.hu  Tue Apr  7 14:01:08 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Tue, 7 Apr 2009 16:01:08 +0200 (CEST)
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <alpine.DEB.2.00.0904061301110.22370@lxserv1.kfki.hu>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<alpine.DEB.2.00.0903280802360.20524@lxserv0.kfki.hu>
	<49CE4B36.2000103@gmail.com>
	<alpine.DEB.2.00.0903292015190.9853@lxserv1.kfki.hu>
	<49D03447.2000901@gmail.com>
	<alpine.DEB.2.00.0903300936290.18190@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0903300948450.18190@lxserv0.kfki.hu>
	<49D0EC83.4030202@gmail.com> <49D11334.5030406@redhat.com>
	<alpine.DEB.2.00.0903311151120.23085@lxserv0.kfki.hu>
	<alpine.DEB.2.00.0904021301290.15724@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904022248120.7334@lxserv0.kfki.hu>
	<49D5300F.9010805@gmail.com>
	<alpine.DEB.2.00.0904022339000.7334@lxserv0.kfki.hu>
	<49D589B8.1050702@gmail.com>
	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
	<49D61A0B.9010001@gmail.com>
	<alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
	<49D62E11.5050609@gmail.com> <49D98132.9010201@gmail.com>
	<alpine.DEB.2.00.0904061301110.22370@lxserv1.kfki.hu>
Message-ID: <alpine.DEB.2.00.0904071552510.15283@lxserv1.kfki.hu>

Hi, 

On Mon, 6 Apr 2009, Kadlecsik Jozsef wrote:

> On Sun, 5 Apr 2009, Wendy Cheng wrote:
> 
> > Based on code reading ...
> > 1. iput() gets inode_lock (a spin lock)
> > 2. iput() calls iput_final()
> > 3. iput_final() calls gfs_drop_inode() that calls
> >    generic_drop_inode()
> > 4. generic_drop_inode() unlocks inode_lock.
> > 
> > In theory, this logic violates the usage of spin lock as it is expected 
> > to lock for a short period of time but gfs_drop_inode() could take a 
> > while to finish. It has a blocking write page that need to make sure the 
> > data gets sync-ed to storage before it can returns. Make matter worse is 
> > that inode_lock is a global lock that could block non-GFS threads. One 
> > would think a quick fix is to drop the inode_lock at the beginning of 
> > gfs_drop_inode() and then re-acquires it after gfs sync the page. 
> > Unfortunately, inode_lock is not an exported symbol and GFS is an 
> > out-of-tree filesystem that has to be compiled as a kernel module. So 
> > this trick won't work for GFS.
> 
> Actually, it can work. inode_lock is not private and gfs can unlock/lock 
> it:
> 
> --- gfs-orig/ops_super.c	2009-01-22 13:33:51.000000000 +0100
> +++ gfs/ops_super.c	2009-04-06 13:07:06.000000000 +0200
> @@ -9,6 +9,7 @@
>  #include <linux/statfs.h>
>  #include <linux/seq_file.h>
>  #include <linux/mount.h>
> +#include <linux/writeback.h>
>  
>  #include "gfs.h"
>  #include "dio.h"
> @@ -68,8 +69,11 @@
>  	if (ip &&
>  	    !inode->i_nlink &&
>  	    S_ISREG(inode->i_mode) &&
> -	    !sdp->sd_args.ar_localcaching)
> +	    !sdp->sd_args.ar_localcaching) {
> +	    	spin_unlock(&inode_lock);
>  		gfs_sync_page_i(inode, DIO_START | DIO_WAIT);
> +		spin_lock(&inode_lock);
> +	}
>  	generic_drop_inode(inode);
>  }
> 
> Tomorrow I'll give it a try, there's no time to test it today.

I added the required 

EXPORT_SYMBOL(inode_lock);

line to fs/inode.c, recompiled the kernel and the modules.

Starting mailman in the test environment did not produce the almost 
instant freeze. I started/stopped mailman several times and the system 
worked just fine. So I believe the patch above and the plus line in 
fs/inode.c fix the reported problem. I dunno whether modifying 
fs/inode.c is acceptable or not...

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From lst103 at hotmail.com  Tue Apr  7 15:53:15 2009
From: lst103 at hotmail.com (John Adams)
Date: Tue, 7 Apr 2009 11:53:15 -0400
Subject: [Linux-cluster] Clustering with Redhat
Message-ID: <COL107-W20E771EB9707EE73896D296850@phx.gbl>


Hello all,I just bought Linux Redhat,not cluster and I am requesting help in setting up a cluster with my i7 and phenom desktops.I have no experience in this field and I am asking for help which would be greatly appreciated.I would like to combine the processors to work on one application,we can talk over the phone also if needed I am in USA eastcoast.I also have an external harddrive and was wondereing if I could use that.Do I need any special software to make this work or will all be included?
 
John T.

_________________________________________________________________
Rediscover Hotmail?: Get e-mail storage that grows with you. 
http://windowslive.com/RediscoverHotmail?ocid=TXT_TAGLM_WL_HM_Rediscover_Storage1_042009
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090407/ebb3c7d2/attachment.htm>

From virginian at blueyonder.co.uk  Tue Apr  7 17:30:19 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Tue, 7 Apr 2009 18:30:19 +0100
Subject: [Linux-cluster] Clustering with Redhat
References: <COL107-W20E771EB9707EE73896D296850@phx.gbl>
Message-ID: <123CD25CD0984F6AA713C99BFADBAF51@Desktop>

Hi John,

I would begin by reading the following:

http://www.centos.org/docs/5/html/Cluster_Suite_Overview/

This will give you an overview of cluster suite. You will need to have your two PC's on the same network / subnet. If you want to share storage, read up on GFS or GFS2:

https://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/en-US/RHEL510/Global_File_System/index.html

To get yourself going, install the following on both RHEL 5 machines:

yum groupinstall Clustering "Cluster Storage"

The above will download the relevant packages for you (assuming you are connected to the internet and have a support contract for Red Hat Network)

If you haven't paid for a support contract with Red Hat, I would recommend installing Centos 5.2 or 5.3 on your two machines (Centos is a RHEL binary compatible Linux distribution, no charge and you get all the updates and applications / cluster suite / GFS etc. for free).

In order to set up and maintain your cluster, one of your two machines will need to be running the luci admin software. Choose one machine then do the following:

luci_admin init

you will be prompted to enter a password (twice). Once complete you can access the luci GUI by a web browser:

https://nnn.nnn.nnn.nnn:8084

(where nnn.nnn.nnn.nnn is the IP address of the machine running luci)

Login to luci and then have a browse through the various options. 

It's difficult to say exactly how you need to configure your cluster as there are many options. You could, for example, use GFS / GFS2 to share your storage if it can be directly attached to both machines or you could use GNBD / NFS. I would read up on the Cluster Suite and GFS / GFS 2 docs to familiarise yourself with the technology before formulating a plan of how you would like to implement your set up. 

There is a lot to learn, I won't pretend that there isn't. The key is to read as much as you can, have a play and be prepared for the odd set back or frustration. You can always come back to the list when you need more help on a specific problem.

Regards

John
  ----- Original Message ----- 
  From: John Adams 
  To: linux-cluster at redhat.com 
  Sent: Tuesday, April 07, 2009 4:53 PM
  Subject: [Linux-cluster] Clustering with Redhat


  Hello all,I just bought Linux Redhat,not cluster and I am requesting help in setting up a cluster with my i7 and phenom desktops.I have no experience in this field and I am asking for help which would be greatly appreciated.I would like to combine the processors to work on one application,we can talk over the phone also if needed I am in USA eastcoast.I also have an external harddrive and was wondereing if I could use that.Do I need any special software to make this work or will all be included?
   
  John T.


------------------------------------------------------------------------------
  Rediscover Hotmail?: Get e-mail storage that grows with you. Check it out. 


------------------------------------------------------------------------------


  --
  Linux-cluster mailing list
  Linux-cluster at redhat.com
  https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090407/b36d5109/attachment.htm>

From alan.zg at gmail.com  Tue Apr  7 18:21:16 2009
From: alan.zg at gmail.com (Alan A)
Date: Tue, 7 Apr 2009 13:21:16 -0500
Subject: [Linux-cluster] Re: RHEL 5.3 Cluster - Time Zone Question
In-Reply-To: <fac531740904061317w1cce7f09j9b08aee4dc597ede@mail.gmail.com>
References: <fac531740904061317w1cce7f09j9b08aee4dc597ede@mail.gmail.com>
Message-ID: <fac531740904071121j75808495xf5fb933f04d1d07e@mail.gmail.com>

Can anyone answer this simple question?

On Mon, Apr 6, 2009 at 3:17 PM, Alan A <alan.zg at gmail.com> wrote:

> Just a quick question - could the nodes belonging to the same cluster run
> in different time zones? What are the possible implications if any?
>
> --
> Alan A.
>


-- 
Alan A.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090407/c3237a5a/attachment.htm>

From muffaleta at gmail.com  Tue Apr  7 18:27:35 2009
From: muffaleta at gmail.com (Christopher Chen)
Date: Tue, 7 Apr 2009 11:27:35 -0700
Subject: [Linux-cluster] Re: RHEL 5.3 Cluster - Time Zone Question
In-Reply-To: <fac531740904071121j75808495xf5fb933f04d1d07e@mail.gmail.com>
References: <fac531740904061317w1cce7f09j9b08aee4dc597ede@mail.gmail.com>
	<fac531740904071121j75808495xf5fb933f04d1d07e@mail.gmail.com>
Message-ID: <7bc80d500904071127s44be3dfeq469463a0b1ce6b0@mail.gmail.com>

If the machines choose to use local time instead of UTC for cluster
management, I'd say that's a bug.

But then again...

On Tue, Apr 7, 2009 at 11:21 AM, Alan A <alan.zg at gmail.com> wrote:
> Can anyone answer this simple question?
>
> On Mon, Apr 6, 2009 at 3:17 PM, Alan A <alan.zg at gmail.com> wrote:
>>
>> Just a quick question - could the nodes belonging to the same cluster run
>> in different time zones? What are the possible implications if any?
>>
>> --
>> Alan A.
>
>
>
> --
> Alan A.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Chris Chen <muffaleta at gmail.com>
"I want the kind of six pack you can't drink."
-- Micah


From cryptogrid at gmail.com  Tue Apr  7 18:54:03 2009
From: cryptogrid at gmail.com (crypto grid)
Date: Tue, 7 Apr 2009 15:54:03 -0300
Subject: [Linux-cluster] GFS2 on RHEL5.3
Message-ID: <a9f464b80904071154l46018a99qe5e43393594d5cd1@mail.gmail.com>

Is gfs2 fully supported (for production systems) on RHEL5.3?

Reading the RELEASE-NOTES-U3-en, I found: "In Red Hat Enterprise Linux 5.2,
*GFS2* was provided as a kernel module for evaluation purposes. In Red Hat
Enterprise Linux 5.3, *GFS2* is now part of the kernel package."

I've also noticed that GFS2 has been removed from the "Technology Previews"
section in the file RELEASE-NOTES-U3-en.

Is there any official announcement?

Thanks,
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090407/3a1a7426/attachment.htm>

From jos at xos.nl  Tue Apr  7 19:00:02 2009
From: jos at xos.nl (Jos Vos)
Date: Tue, 7 Apr 2009 21:00:02 +0200
Subject: [Linux-cluster] GFS2 on RHEL5.3
In-Reply-To: <a9f464b80904071154l46018a99qe5e43393594d5cd1@mail.gmail.com>
References: <a9f464b80904071154l46018a99qe5e43393594d5cd1@mail.gmail.com>
Message-ID: <20090407190002.GA6733@jasmine.xos.nl>

On Tue, Apr 07, 2009 at 03:54:03PM -0300, crypto grid wrote:

> Reading the RELEASE-NOTES-U3-en, I found: "In Red Hat Enterprise Linux 5.2,
> *GFS2* was provided as a kernel module for evaluation purposes. In Red Hat
> Enterprise Linux 5.3, *GFS2* is now part of the kernel package."

The fact that GFS2 is mentioned as "Feature Update" (chapter 3 of the
release notes) implies that it is now a supported feature, yes.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From ffv at tjpr.jus.br  Wed Apr  8 03:31:25 2009
From: ffv at tjpr.jus.br (Fabiano Fantini Vitale)
Date: Wed, 08 Apr 2009 00:31:25 -0300
Subject: [Linux-cluster] qdiskd problem
Message-ID: <20090408003125.15982f3n8bbji5cd@webmail01.tjpr.jus.br>


Hello,


I use centos 5.2 updated and cman cman-2.0.98-1.el5
When I start qdiskd (qdisk -q) I receive the following  errors

qdisk_validate: open of /dev/cdrom-hda for RDWR failed: No medium found
qdisk_verify: No medium found
qdisk_validate: open of /dev/cdrom for RDWR failed: No medium found
qdisk_verify: No medium found
open_partition: seek: Invalid argument
qdisk_validate: open of /dev/md0 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram0 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram10 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram11 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram12 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram13 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram14 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram15 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram2 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram3 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram4 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram5 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram6 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram7 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram8 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument
qdisk_validate: open of /dev/ram9 for RDWR failed: Invalid argument
qdisk_verify: Invalid argument

and in /var/log/messages

Apr  7 20:04:59 smtp02 qdiskd[6656]: <info> Quorum Partition:  Label:  
smtpqdisk
Apr  7 20:04:59 smtp02 qdiskd[6657]: <info> Quorum Daemon Initializing
Apr  7 20:04:59 smtp02 qdiskd[6657]: <crit> Initialization failed
Apr  7 20:05:34 smtp02 qdiskd[6686]: <info> Quorum Partition:  Label


Anybody have a idea what happend?
thanks for any help


From dgmorales at gmail.com  Wed Apr  8 04:39:24 2009
From: dgmorales at gmail.com (Diego Morales)
Date: Wed, 8 Apr 2009 01:39:24 -0300
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck
Message-ID: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com>

Hello,

I'm looking for some pointers and advice. Is anybody around here using
the Cluster Suite and GFS on Ubuntu? If so, which versions? Ubuntu
packages or compiled from sources?

I'm using packages from Ubuntu Hardy (not the latest, but a Long Term
Support edition of Ubuntu), the rhcs related packages all have a
version "2.20080227". I'm having some "BUG: soft lockup - CPU#1 stuck
for Xs" problems with GFS (version 1).

Any strong recommendations to use the latest source from generation 2,
instead of the distro packages?


( --- if you're feeling like reading some logs, here is more --- )

It's a two node cluster, using brocade SAN switch fencing (no network
power switch or IPMI for doing some power fencing). By now I don't
have any services (rgmanager) configured, just the nodes.
Unfortunally, it's already production use, so I'm very limited in
playing and testing right now.

The said lockup usually happens when I try to umount GFS (by normally
halting the system) in a system which was fenced (the brocade fence).
But monday it happened when I tried to mount GFS in a clean boot
state, with the fc switch ports enabled, and AFAICT a perfect running
cluster manager /fence domain up to that point.

Related logging (stripped down, can send full log if anybody wants to see it):
============================
**** ====  In the machine that I did the mount, from that point on...

Apr  6 18:41:31 <node2> kernel: [ 9101.559599] Trying to join cluster
"lock_dlm", "srvwebGFS:www"
Apr  6 18:41:31 <node2> kernel: [ 9101.559857] dlm: Using TCP for communications
Apr  6 18:41:31 <node2> kernel: [ 9101.564627] dlm: connecting to 1
Apr  6 18:41:31 <node2> kernel: [ 9101.618713] dlm: got connection from 1
Apr  6 18:41:31 <node2> kernel: [ 9101.618914] Joined cluster. Now
mounting FS...
Apr  6 18:41:51 <node2> snmpd[5965]: Connection from UDP: [172.16.8.10]:53683
Apr  6 18:41:59 <node2> kernel: [ 9129.492405] GFS:
fsid=srvwebGFS:www.1: jid=1: Trying to acquire journal lock...
Apr  6 18:41:59 <node2> kernel: [ 9129.492414] GFS:
fsid=srvwebGFS:www.1: jid=1: Looking at journal...
Apr  6 18:41:59 <node2> kernel: [ 9129.554362] GFS:
fsid=srvwebGFS:www.1: jid=1: Done
Apr  6 18:41:59 <node2> kernel: [ 9129.557633] GFS:
fsid=srvwebGFS:www.1: Scanning for log elements...
Apr  6 18:42:15 <node2> kernel: [ 9140.874965] BUG: soft lockup -
CPU#1 stuck for 11s! [mount.gfs:7188]
Apr  6 18:42:15 <node2> kernel: [ 9140.874997] CPU 1:
(... long module listing ...)
Apr  6 18:42:15 <node2> kernel: [ 9140.875078] Pid: 7188, comm:
mount.gfs Not tainted 2.6.24-23-server #1
Apr  6 18:42:15 <node2> kernel: [ 9140.875082] RIP:
0010:[gfs:gfs_unlinked_get+0x5c/0x190]
[gfs:gfs_unlinked_get+0x5c/0x190] :gfs:gfs_unlinked_get+0x5c/0x190
(... registers sutff ...)
Apr  6 18:42:15 <node2> kernel: [ 9140.875147] Call Trace:
Apr  6 18:42:15 <node2> kernel: [ 9140.875171]
[gfs:gfs_unlinked_get+0x35/0x190] :gfs:gfs_unlinked_get+0x35/0x190
Apr  6 18:42:15 <node2> kernel: [ 9140.875216]
[gfs:gfs_unlinked_merge+0x47/0x1e0] :gfs:gfs_unlinked_merge+0x47/0x1e0
Apr  6 18:42:15 <node2> kernel: [ 9140.875252]
[gfs:unlinked_scan_elements+0xc9/0x1b0]
:gfs:unlinked_scan_elements+0xc9/0x1b0
Apr  6 18:42:15 <node2> kernel: [ 9140.875312]
[gfs:foreach_descriptor+0x155/0x340]
:gfs:foreach_descriptor+0x155/0x340
Apr  6 18:42:15 <node2> kernel: [ 9140.875455]
[gfs:gfs_recover_dump+0x121/0x1d0] :gfs:gfs_recover_dump+0x121/0x1d0
Apr  6 18:42:15 <node2> kernel: [ 9140.875543]
[gfs:gfs_make_fs_rw+0x114/0x150] :gfs:gfs_make_fs_rw+0x114/0x150
Apr  6 18:42:15 <node2> kernel: [ 9140.875657]
[gfs:init_journal+0x3fe/0x4b0] :gfs:init_journal+0x3fe/0x4b0
Apr  6 18:42:15 <node2> kernel: [ 9140.875727]
[gfs:fill_super+0x554/0x6a0] :gfs:fill_super+0x554/0x6a0
Apr  6 18:42:15 <node2> kernel: [ 9140.875767]
[set_bdev_super+0x0/0x10] set_bdev_super+0x0/0x10
Apr  6 18:42:15 <node2> kernel: [ 9140.875791]
[fuse:get_sb_bdev+0x14b/0x180] get_sb_bdev+0x14b/0x180
Apr  6 18:42:15 <node2> kernel: [ 9140.875817]
[gfs:fill_super+0x0/0x6a0] :gfs:fill_super+0x0/0x6a0
Apr  6 18:42:15 <node2> kernel: [ 9140.875853]
[vfs_kern_mount+0xcc/0x160] vfs_kern_mount+0xcc/0x160
Apr  6 18:42:15 <node2> kernel: [ 9140.875877]
[do_kern_mount+0x53/0x110] do_kern_mount+0x53/0x110
Apr  6 18:42:15 <node2> kernel: [ 9140.875902]  [do_mount+0x546/0x810]
do_mount+0x546/0x810
Apr  6 18:42:15 <node2> kernel: [ 9140.875934]
[n_tty_receive_buf+0x335/0xf30] n_tty_receive_buf+0x335/0xf30
Apr  6 18:42:15 <node2> kernel: [ 9140.875953]
[n_tty_receive_buf+0x335/0xf30] n_tty_receive_buf+0x335/0xf30
Apr  6 18:42:15 <node2> kernel: [ 9140.875993]
[zone_statistics+0x7d/0x80] zone_statistics+0x7d/0x80
Apr  6 18:42:15 <node2> kernel: [ 9140.876043]
[tty_default_put_char+0x1d/0x30] tty_default_put_char+0x1d/0x30
Apr  6 18:42:15 <node2> kernel: [ 9140.876128]
[configfs:__get_free_pages+0x1b/0x40] __get_free_pages+0x1b/0x40
Apr  6 18:42:15 <node2> kernel: [ 9140.876152]  [sys_mount+0x9b/0x100]
sys_mount+0x9b/0x100
Apr  6 18:42:15 <node2> kernel: [ 9140.876183]
[system_call+0x7e/0x83] system_call+0x7e/0x83
(... more and more of CPUs 0-3 soft locking and stuck for x seconds,
and then...)
Apr  6 18:43:39 <node2> kernel: [ 9229.653782] dlm: closing connection to node 1
Apr  6 18:43:39 <node2> openais[7119]: cman killed by node 1 because
we rejoined the cluster without a full restart
Apr  6 18:43:39 <node2> groupd[7128]: cman_get_nodes error -1 104
Apr  6 18:43:39 <node2> gfs_controld[7162]: cluster is down, exiting
Apr  6 18:43:39 <node2> dlm_controld[7160]: cluster is down, exiting
Apr  6 18:43:39 <node2> kernel: [ 9229.897402] dlm: closing connection to node 2
Apr  6 18:43:41 <node2> fenced[7135]: Unable to connect to to cman:
Connection refused
Apr  6 18:43:41 <node2> fenced[7135]: fence "<node1>" success
Apr  6 18:44:05 <node2> ccsd[7115]: Unable to connect to cluster
infrastructure after 60 seconds.
Apr  6 18:44:14 <node2> kernel: [ 9264.560626]  rport-4:0-0: blocked
FC remote port time out: saving binding
Apr  6 18:44:14 <node2> kernel: [ 9264.560677]  rport-4:0-1: blocked
FC remote port time out: saving binding
Apr  6 18:44:14 <node2> kernel: [ 9264.574363] sd 4:0:1:0: [sdd]
Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Apr  6 18:44:14 <node2> kernel: [ 9264.574371] end_request: I/O error,
dev sdd, sector 157284528
Apr  6 18:44:14 <node2> kernel: [ 9264.574386] GFS:
fsid=srvwebGFS:www.1: fatal: I/O error
Apr  6 18:44:14 <node2> kernel: [ 9264.574388] GFS:
fsid=srvwebGFS:www.1:   block = 19660562
Apr  6 18:44:14 <node2> kernel: [ 9264.574390] GFS:
fsid=srvwebGFS:www.1:   function = gfs_dreread
Apr  6 18:44:14 <node2> kernel: [ 9264.574391] GFS:
fsid=srvwebGFS:www.1:   file =
/build/buildd/linux-ubuntu-modules-2.6.24-2.6.24/debian/build/build-server/fs/gfs/dio.c,
line = 576
Apr  6 18:44:14 <node2> kernel: [ 9264.574394] GFS:
fsid=srvwebGFS:www.1:   time = 1239057854
Apr  6 18:44:14 <node2> kernel: [ 9264.574399] GFS:
fsid=srvwebGFS:www.1: about to withdraw from the cluster
Apr  6 18:44:14 <node2> kernel: [ 9264.574401] GFS:
fsid=srvwebGFS:www.1: telling LM to withdraw
Apr  6 18:44:35 <node2> ccsd[7115]: Unable to connect to cluster
infrastructure after 90 seconds.
(...)
(power manually cut off)

**** ==== From the machine that had the GFS already mounted:

Apr  6 18:42:31 <node1> kernel: [246196.908621] dlm: closing
connection to node 2
Apr  6 18:42:31 <node1> fenced[5611]: <node2> not a cluster member
after 0 sec post_fail_delay
Apr  6 18:42:31 <node1> fenced[5611]: fencing node "<node2>"
Apr  6 18:42:33 <node1> fenced[5611]: fence "<node2>" success
Apr  6 18:42:33 <node1> kernel: [246198.585885] GFS:
fsid=srvwebGFS:www.0: jid=1: Trying to acquire journal lock...
Apr  6 18:43:19 <node1> kernel: [246244.692100] GFS:
fsid=srvwebGFS:www.0: jid=1: Looking at journal...
Apr  6 18:43:19 <node1> kernel: [246244.741659] GFS:
fsid=srvwebGFS:www.0: jid=1: Done
Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 100 MSG_PLOCK
Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 200 MSG_PLOCK
Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 300 MSG_PLOCK
Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 400 MSG_PLOCK
(... several like these ...)
Apr  6 18:43:41 <node1> gfs_controld[5615]: cpg_mcast_joined retry
1300 MSG_PLOCK
Apr  6 18:43:41 <node1> kernel: [246266.801054] qla2xxx 0000:02:01.0:
LOOP DOWN detected (2).
Apr  6 18:43:41 <node1> gfs_controld[5615]: cpg_mcast_joined retry
1400 MSG_PLOCK
(... again until ...)
Apr  6 18:43:54 <node1> gfs_controld[5615]: cpg_mcast_joined retry
14200 MSG_PLOCK
Apr  6 18:44:16 <node1> kernel: [246301.772188]  rport-4:0-0: blocked
FC remote port time out: saving binding
Apr  6 18:44:16 <node1> kernel: [246301.772236]  rport-4:0-1: blocked
FC remote port time out: saving binding
Apr  6 18:44:16 <node1> kernel: [246301.790639] sd 4:0:1:0: [sdd]
Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Apr  6 18:44:16 <node1> kernel: [246301.790647] end_request: I/O
error, dev sdd, sector 11453568
Apr  6 18:44:16 <node1> kernel: [246301.790667] GFS:
fsid=srvwebGFS:www.0: fatal: I/O error
Apr  6 18:44:16 <node1> kernel: [246301.790672] GFS:
fsid=srvwebGFS:www.0:   block = 1431692
Apr  6 18:44:16 <node1> kernel: [246301.790676] GFS:
fsid=srvwebGFS:www.0:   function = gfs_dreread
Apr  6 18:44:16 <node1> kernel: [246301.790680] GFS:
fsid=srvwebGFS:www.0:   file =
/build/buildd/linux-ubuntu-modules-2.6.24-2.6.24/debian/build/build-server/fs/gfs/dio.c,
line = 576
Apr  6 18:44:16 <node1> kernel: [246301.790688] GFS:
fsid=srvwebGFS:www.0:   time = 1239057856
Apr  6 18:44:16 <node1> kernel: [246301.790694] sd 4:0:1:0: [sdd]
Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Apr  6 18:44:16 <node1> kernel: [246301.790700] end_request: I/O
error, dev sdd, sector 141890640
Apr  6 18:44:16 <node1> kernel: [246301.790705] GFS:
fsid=srvwebGFS:www.0: about to withdraw from the cluster
Apr  6 18:44:16 <node1> kernel: [246301.790711] GFS:
fsid=srvwebGFS:www.0: telling LM to withdraw
Apr  6 18:44:16 <node1> kernel: [246301.790718] sd 4:0:1:0: [sdd]
Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Apr  6 18:44:16 <node1> kernel: [246301.790731] end_request: I/O
error, dev sdd, sector 3611616
Apr  6 18:44:16 <node1> kernel: [246301.790751] sd 4:0:1:0: [sdd]
Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Apr  6 18:44:16 <node1> kernel: [246301.790765] end_request: I/O
error, dev sdd, sector 64424648
Apr  6 18:44:16 <node1> kernel: [246301.790774] Buffer I/O error on
device sdd1, logical block 8053077
(...)
Apr  6 18:45:22 <node1> kernel: [246367.692933] dlm: www: group leave
failed -512 0
Apr  6 18:45:22 <node1> kernel: [246368.160042] GFS:
fsid=srvwebGFS:www.0: withdrawn
(...)
(power manually cut off)

==============================

So, both machines fenced, GFS down, people screaming :)
Up in the first log there is the line:
Apr  6 18:43:39 <node2> openais[7119]: cman killed by node 1 because
we rejoined the cluster without a full restart

I wonder if it is consequence (it happened after the soft lockup) or
cause. It was a clean boot without any of the cluster daemons going up
before a I manually start each one of them. *Noticing now, I did
unfenced the switch port when the machine was already up (but cman
still down)*. Could that be the culprit?

Thanks for any help or advice.


Diego Morales


From sdake at redhat.com  Wed Apr  8 04:55:58 2009
From: sdake at redhat.com (Steven Dake)
Date: Tue, 07 Apr 2009 21:55:58 -0700
Subject: [Linux-cluster] qdiskd problem
In-Reply-To: <20090408003125.15982f3n8bbji5cd@webmail01.tjpr.jus.br>
References: <20090408003125.15982f3n8bbji5cd@webmail01.tjpr.jus.br>
Message-ID: <1239166558.8905.0.camel@sdake-laptop>

I don't know anything about qdisk but chrissie may be able to help you.

regards
-steve

On Wed, 2009-04-08 at 00:31 -0300, Fabiano Fantini Vitale wrote:
> Hello,
> 
> 
> I use centos 5.2 updated and cman cman-2.0.98-1.el5
> When I start qdiskd (qdisk -q) I receive the following  errors
> 
> qdisk_validate: open of /dev/cdrom-hda for RDWR failed: No medium found
> qdisk_verify: No medium found
> qdisk_validate: open of /dev/cdrom for RDWR failed: No medium found
> qdisk_verify: No medium found
> open_partition: seek: Invalid argument
> qdisk_validate: open of /dev/md0 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram0 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram10 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram11 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram12 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram13 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram14 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram15 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram2 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram3 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram4 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram5 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram6 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram7 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram8 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> qdisk_validate: open of /dev/ram9 for RDWR failed: Invalid argument
> qdisk_verify: Invalid argument
> 
> and in /var/log/messages
> 
> Apr  7 20:04:59 smtp02 qdiskd[6656]: <info> Quorum Partition:  Label:  
> smtpqdisk
> Apr  7 20:04:59 smtp02 qdiskd[6657]: <info> Quorum Daemon Initializing
> Apr  7 20:04:59 smtp02 qdiskd[6657]: <crit> Initialization failed
> Apr  7 20:05:34 smtp02 qdiskd[6686]: <info> Quorum Partition:  Label
> 
> 
> 
> Anybody have a idea what happend?
> thanks for any help
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From swhiteho at redhat.com  Wed Apr  8 08:46:59 2009
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 08 Apr 2009 09:46:59 +0100
Subject: [Linux-cluster] GFS2 on RHEL5.3
In-Reply-To: <20090407190002.GA6733@jasmine.xos.nl>
References: <a9f464b80904071154l46018a99qe5e43393594d5cd1@mail.gmail.com>
	<20090407190002.GA6733@jasmine.xos.nl>
Message-ID: <1239180419.3359.0.camel@localhost.localdomain>

Hi,

On Tue, 2009-04-07 at 21:00 +0200, Jos Vos wrote:
> On Tue, Apr 07, 2009 at 03:54:03PM -0300, crypto grid wrote:
> 
> > Reading the RELEASE-NOTES-U3-en, I found: "In Red Hat Enterprise Linux 5.2,
> > *GFS2* was provided as a kernel module for evaluation purposes. In Red Hat
> > Enterprise Linux 5.3, *GFS2* is now part of the kernel package."
> 
> The fact that GFS2 is mentioned as "Feature Update" (chapter 3 of the
> release notes) implies that it is now a supported feature, yes.
> 

It is indeed fully supported now,

Steve.


From ccaulfie at redhat.com  Wed Apr  8 09:54:56 2009
From: ccaulfie at redhat.com (Chrissie Caulfield)
Date: Wed, 08 Apr 2009 10:54:56 +0100
Subject: [Linux-cluster] Re: RHEL 5.3 Cluster - Time Zone Question
In-Reply-To: <7bc80d500904071127s44be3dfeq469463a0b1ce6b0@mail.gmail.com>
References: <fac531740904061317w1cce7f09j9b08aee4dc597ede@mail.gmail.com>	<fac531740904071121j75808495xf5fb933f04d1d07e@mail.gmail.com>
	<7bc80d500904071127s44be3dfeq469463a0b1ce6b0@mail.gmail.com>
Message-ID: <49DC7470.9000003@redhat.com>

Christopher Chen wrote:
> If the machines choose to use local time instead of UTC for cluster
> management, I'd say that's a bug.

Indeed it would be a bug.

I don't think its ever been explicitly tested but it really should be
fine, and if it isn't we want to know about it.

> 
> On Tue, Apr 7, 2009 at 11:21 AM, Alan A <alan.zg at gmail.com> wrote:
>> Can anyone answer this simple question?
>>
>> On Mon, Apr 6, 2009 at 3:17 PM, Alan A <alan.zg at gmail.com> wrote:
>>> Just a quick question - could the nodes belonging to the same cluster run
>>> in different time zones? What are the possible implications if any?
>>>


-- 

Chrissie


From kadlec at mail.kfki.hu  Wed Apr  8 11:28:42 2009
From: kadlec at mail.kfki.hu (Kadlecsik Jozsef)
Date: Wed, 8 Apr 2009 13:28:42 +0200 (CEST)
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck
In-Reply-To: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com>
References: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com>
Message-ID: <alpine.DEB.2.00.0904081324110.10743@lxserv0.kfki.hu>

On Wed, 8 Apr 2009, Diego Morales wrote:

> I'm looking for some pointers and advice. Is anybody around here using
> the Cluster Suite and GFS on Ubuntu? If so, which versions? Ubuntu
> packages or compiled from sources?

We run the GFS cluster suite on Ubuntu hardy. But it is compiled from 
source, together with openais and LVM2.

I know about another GFS cluster running on Ubuntu dapper and they use the 
GFS packages from the distribution.
 
Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From rhurst at bidmc.harvard.edu  Wed Apr  8 12:08:48 2009
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Wed, 08 Apr 2009 08:08:48 -0400
Subject: [Linux-cluster] RHEL 4.7 rgmanager-1.9.80-1 debug output?
In-Reply-To: <1239019113.5265.6.camel@WSBID06223.bidmc.harvard.edu>
References: <3c88c73a0904030216k351345b7pad2528ec28e9dd38@mail.gmail.com>
	<3c88c73a0904040321qdcc8d8u39e81009c80605c4@mail.gmail.com>
	<1239019113.5265.6.camel@WSBID06223.bidmc.harvard.edu>
Message-ID: <1239192528.5243.42.camel@WSBID06223.bidmc.harvard.edu>

Red Hat Network support provided the following, and I thought I would
pass this along... 


        Following website and kbase article will help you to configure
        the "cluster.conf"  to have rgmanager log to different places
        and a different level.
        
        http://sources.redhat.com/cluster/wiki/FAQ/RGManager#rgm_logging
        
        http://kbase.redhat.com/faq/docs/DOC-5953
        

________________________________________________________________________


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.


On Mon, 2009-04-06 at 07:58 -0400, Robert Hurst wrote:

> We're having a bizarre problem with rgmanager ... every now and then,
> it decides to stop a service with no recorded reason as to why:
> 
> Apr  6 04:05:25 acropolis clurgmgrd[967]: <notice> Stopping service
> WATSONAPP1 
> 
> Is there a way to increase debugging / verbosity with rgmanager?
> 
> I cannot find command-line options for clurgmgrd, other than the
> soft-coded value in its service script, /etc/init.d/rgmanager.  I
> made /etc/sysconfig/cluster and added the line:
> 
> RGMGR_OPTS="-t 30 -d"
> 
> ... hoping that the "-d" will increase debug level in syslog messages,
> but nothing new appeared.
> 
> Please advise, thanks!
> 
> 
> ______________________________________________________________________
> 
> 
> Robert Hurst, Sr. Cach? Administrator
> Beth Israel Deaconess Medical Center
> 1135 Tremont Street, REN-7
> Boston, Massachusetts   02120-2140
> 617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
> Any technology distinguishable from magic is insufficiently advanced.
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090408/cc1c1425/attachment.htm>

From linux at vdkrone.de  Wed Apr  8 12:43:12 2009
From: linux at vdkrone.de (Tobias von der Krone)
Date: Wed, 08 Apr 2009 14:43:12 +0200
Subject: [Linux-cluster] Unmounting gfs2 on reboot hangs
In-Reply-To: <49DB4553.9080907@vdkrone.de>
References: <49DB4553.9080907@vdkrone.de>
Message-ID: <49DC9BE0.3070805@vdkrone.de>

Hello,

found the problem... the VMware Tools stopped network before gfs2 tried 
to unmount volumes.

Tobias


Tobias von der Krone wrote:
> Hello,
> I'm using a up2date RHEL 5.3 and I have the problem that unmounting 
> gfs2 with its init script on reboot/shutdown hangs. Using 'service 
> gfs2 stop' while system is running works fine.
> I'm testing starting / stopping gfs2 earlier and later... but till now 
> no improvements.
> Does anybody have an idea what is going wrong?
> Does anybody know which dependencies gfs2 has?
>
> Tobias
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From ffv at tjpr.jus.br  Wed Apr  8 15:18:30 2009
From: ffv at tjpr.jus.br (Fabiano Fantini Vitale)
Date: Wed, 08 Apr 2009 12:18:30 -0300
Subject: [Linux-cluster] qdiskd problem
In-Reply-To: <1239166558.8905.0.camel@sdake-laptop>
References: <20090408003125.15982f3n8bbji5cd@webmail01.tjpr.jus.br>
	<1239166558.8905.0.camel@sdake-laptop>
Message-ID: <20090408121830.39525ivjy2qwgabq@webmail02.tjpr.jus.br>

Hello,

I try to start qdisk using:
strace -o /tmp/qdisk.log qdiskd -Q and on end of file /tmp/qdisk.log presents:

sendto(5, "<30>Apr  8 11:33:24 qdiskd[8205]"..., 77, MSG_NOSIGNAL,  
NULL, 0) = 77
getuid()                                = 0
stat("/var/run/qdiskd.pid", {st_mode=S_IFREG|0644, st_size=4, ...}) = 0
open("/var/run/qdiskd.pid", O_RDONLY)   = 6
fstat(6, {st_mode=S_IFREG|0644, st_size=4, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,  
0) = 0x2afc29418000
read(6, "8069", 4096)                   = 4
read(6, "", 4096)                       = 0
close(6)                                = 0
munmap(0x2afc29418000, 4096)            = 0
open("/proc/8069", O_RDONLY|O_NONBLOCK|O_DIRECTORY) = -1 ENOENT (No  
such file or directory)
rt_sigprocmask(SIG_BLOCK, ~[QUIT ILL TRAP ABRT BUS FPE SEGV TERM CHLD  
RTMIN RT_1], NULL, 8) = 0
clone(child_stack=0,  
flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,  
child_tidptr=0x2afc29421950) = 8206
exit_group(0)                           = ?


this shows that qdiskd is running on 8205 process and  
/var/run/qdiskd.pid caught the 8069. However, when it tries to have  
access /proc/8069 returns error. It must not have attemped to access  
/proc/8205?

thanks for any help


Citando Steven Dake <sdake at redhat.com>:

> I don't know anything about qdisk but chrissie may be able to help you.
>
> regards
> -steve
>
> On Wed, 2009-04-08 at 00:31 -0300, Fabiano Fantini Vitale wrote:
>> Hello,
>>
>>
>> I use centos 5.2 updated and cman cman-2.0.98-1.el5
>> When I start qdiskd (qdisk -q) I receive the following  errors
>>
>> qdisk_validate: open of /dev/cdrom-hda for RDWR failed: No medium found
>> qdisk_verify: No medium found
>> qdisk_validate: open of /dev/cdrom for RDWR failed: No medium found
>> qdisk_verify: No medium found
>> open_partition: seek: Invalid argument
>> qdisk_validate: open of /dev/md0 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram0 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram10 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram11 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram12 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram13 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram14 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram15 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram2 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram3 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram4 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram5 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram6 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram7 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram8 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>> qdisk_validate: open of /dev/ram9 for RDWR failed: Invalid argument
>> qdisk_verify: Invalid argument
>>
>> and in /var/log/messages
>>
>> Apr  7 20:04:59 smtp02 qdiskd[6656]: <info> Quorum Partition:  Label:
>> smtpqdisk
>> Apr  7 20:04:59 smtp02 qdiskd[6657]: <info> Quorum Daemon Initializing
>> Apr  7 20:04:59 smtp02 qdiskd[6657]: <crit> Initialization failed
>> Apr  7 20:05:34 smtp02 qdiskd[6686]: <info> Quorum Partition:  Label
>>
>>
>>
>> Anybody have a idea what happend?
>> thanks for any help
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From gianluca.cecchi at gmail.com  Wed Apr  8 15:19:20 2009
From: gianluca.cecchi at gmail.com (Gianluca Cecchi)
Date: Wed, 8 Apr 2009 17:19:20 +0200
Subject: [Linux-cluster] Info on lvm setup for cluster without clvmd
Message-ID: <561c252c0904080819m542212f8me04de1d73db15a77@mail.gmail.com>

Hello,
I would like to setup a two-node cluster where I will have some
services relying on filesystems on lvm resources.
I'm using rh el 5U3 but I only have entitlements for RHEL Clustering
and not for Cluster-Storage, so that I cannot use clvmd as in other
clusters I set up previously.
I don't need GFS so I think this is the correct setup for me, also
from a legal point of view
>From a documentation point of view I see:
Note:
Shared storage for use in Red Hat Cluster Suite requires that you be
running the cluster
logical volume manager daemon (clvmd) or the High Availability Logical Volume
Management agents (HA-LVM). If you are not able to use either the
clvmd daemon or
HA-LVM for operational reasons or because you do not have the correct
entitlements, you
must not use single-instance LVM on the shared disk as this may result
in data corruption.
If you have any concerns please contact your Red Hat service representative.

I suppose that with HA-LVM here we are talking about lvm.sh script
(with the other two lvm_by_lv.sh and lvm_by_vg.sh referred into it)
inside /usr/share/cluster/ directory.

At the moment the cluster is not formed at all; I only have setup a
basic cluster.conf without the lvm resources in it.
I have setup volume groups and logical volumes on one node. The disks
are seen by the other node too, but correctly it is not seeing the LVM
parts
In lvm.conf of both I have
locking_type = 1
and I think it should remain this way in my setup, correct?

Which is the correct approach in my situation if I want to go straight
with command line and not graphical tools?
In old days when lvm was not cluster-aware I had to:
vgchange -an VG on the first node and then vgchange -ay VG on the
second to let it create the devices and the lvm cache for the first
time after creation.

After this, in lvm.sh script it seems I have to populate the
"activation" section filling up the volume_list part, to prevent
concurrent activation of volume groups used by the cluster.

Any good documentation reference for this? It seems I didn't find anything....

I thank all the gui creators (Conga and system-config-cluster), and I
think there is a good audience for them, but I would like to be able
at least to do it manually too.... I think it helps very much to
understand internal mechanisms and eventually debugging when problems
arise...

Thanks in advance,
Gianluca


From Norbert.Nemeth at mscibarra.com  Wed Apr  8 17:37:51 2009
From: Norbert.Nemeth at mscibarra.com (Nemeth, Norbert)
Date: Wed, 8 Apr 2009 19:37:51 +0200
Subject: [Linux-cluster] Info on lvm setup for cluster without clvmd
In-Reply-To: <561c252c0904080819m542212f8me04de1d73db15a77@mail.gmail.com>
References: <561c252c0904080819m542212f8me04de1d73db15a77@mail.gmail.com>
Message-ID: <CD5897CBBD759345BB46FDE6A5616C2C165E9B18DC@EU-MAILBOX001.int.msci.com>

Hi,

Have you checked this? : http://sources.redhat.com/cluster/wiki/LVMFailover

Best regards,
Norbert Nemeth

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Gianluca Cecchi
Sent: Wednesday, April 08, 2009 5:19 PM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Info on lvm setup for cluster without clvmd

Hello,
I would like to setup a two-node cluster where I will have some
services relying on filesystems on lvm resources.
I'm using rh el 5U3 but I only have entitlements for RHEL Clustering
and not for Cluster-Storage, so that I cannot use clvmd as in other
clusters I set up previously.
I don't need GFS so I think this is the correct setup for me, also
from a legal point of view
>From a documentation point of view I see:
Note:
Shared storage for use in Red Hat Cluster Suite requires that you be
running the cluster
logical volume manager daemon (clvmd) or the High Availability Logical Volume
Management agents (HA-LVM). If you are not able to use either the
clvmd daemon or
HA-LVM for operational reasons or because you do not have the correct
entitlements, you
must not use single-instance LVM on the shared disk as this may result
in data corruption.
If you have any concerns please contact your Red Hat service representative.

I suppose that with HA-LVM here we are talking about lvm.sh script
(with the other two lvm_by_lv.sh and lvm_by_vg.sh referred into it)
inside /usr/share/cluster/ directory.

At the moment the cluster is not formed at all; I only have setup a
basic cluster.conf without the lvm resources in it.
I have setup volume groups and logical volumes on one node. The disks
are seen by the other node too, but correctly it is not seeing the LVM
parts
In lvm.conf of both I have
locking_type = 1
and I think it should remain this way in my setup, correct?

Which is the correct approach in my situation if I want to go straight
with command line and not graphical tools?
In old days when lvm was not cluster-aware I had to:
vgchange -an VG on the first node and then vgchange -ay VG on the
second to let it create the devices and the lvm cache for the first
time after creation.

After this, in lvm.sh script it seems I have to populate the
"activation" section filling up the volume_list part, to prevent
concurrent activation of volume groups used by the cluster.

Any good documentation reference for this? It seems I didn't find anything....

I thank all the gui creators (Conga and system-config-cluster), and I
think there is a good audience for them, but I would like to be able
at least to do it manually too.... I think it helps very much to
understand internal mechanisms and eventually debugging when problems
arise...

Thanks in advance,
Gianluca

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

NOTICE: If received in error, please destroy and notify sender. Sender does not intend to waive confidentiality or privilege. Use of this email is prohibited when received in error.

Local registered entity: MSCI KFT
Metropolitan Court acting as the Court of Registry
Registered office: 1138 Budapest, N?pf?rd? utca 22, Hungary
Registration No. 01-09-885383


From virginian at blueyonder.co.uk  Wed Apr  8 17:47:25 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Wed, 8 Apr 2009 18:47:25 +0100
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck
References: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com>
Message-ID: <5C88FA2A4F4E42418581BA1C38CB76D8@Desktop>

I'm not entirely sure that the CPU lockup message is actually caused by 
RHCS.What hardware are you running on, how many CPUs, how much RAM? I've see 
this soft lockup error before on IBM 3850 machines running RHEL 5 (but not 
RHCS) I think....


----- Original Message ----- 
From: "Diego Morales" <dgmorales at gmail.com>
To: <linux-cluster at redhat.com>
Sent: Wednesday, April 08, 2009 5:39 AM
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck


> Hello,
>
> I'm looking for some pointers and advice. Is anybody around here using
> the Cluster Suite and GFS on Ubuntu? If so, which versions? Ubuntu
> packages or compiled from sources?
>
> I'm using packages from Ubuntu Hardy (not the latest, but a Long Term
> Support edition of Ubuntu), the rhcs related packages all have a
> version "2.20080227". I'm having some "BUG: soft lockup - CPU#1 stuck
> for Xs" problems with GFS (version 1).
>
> Any strong recommendations to use the latest source from generation 2,
> instead of the distro packages?
>
>
> ( --- if you're feeling like reading some logs, here is more --- )
>
> It's a two node cluster, using brocade SAN switch fencing (no network
> power switch or IPMI for doing some power fencing). By now I don't
> have any services (rgmanager) configured, just the nodes.
> Unfortunally, it's already production use, so I'm very limited in
> playing and testing right now.
>
> The said lockup usually happens when I try to umount GFS (by normally
> halting the system) in a system which was fenced (the brocade fence).
> But monday it happened when I tried to mount GFS in a clean boot
> state, with the fc switch ports enabled, and AFAICT a perfect running
> cluster manager /fence domain up to that point.
>
> Related logging (stripped down, can send full log if anybody wants to see 
> it):
> ============================
> **** ====  In the machine that I did the mount, from that point on...
>
> Apr  6 18:41:31 <node2> kernel: [ 9101.559599] Trying to join cluster
> "lock_dlm", "srvwebGFS:www"
> Apr  6 18:41:31 <node2> kernel: [ 9101.559857] dlm: Using TCP for 
> communications
> Apr  6 18:41:31 <node2> kernel: [ 9101.564627] dlm: connecting to 1
> Apr  6 18:41:31 <node2> kernel: [ 9101.618713] dlm: got connection from 1
> Apr  6 18:41:31 <node2> kernel: [ 9101.618914] Joined cluster. Now
> mounting FS...
> Apr  6 18:41:51 <node2> snmpd[5965]: Connection from UDP: 
> [172.16.8.10]:53683
> Apr  6 18:41:59 <node2> kernel: [ 9129.492405] GFS:
> fsid=srvwebGFS:www.1: jid=1: Trying to acquire journal lock...
> Apr  6 18:41:59 <node2> kernel: [ 9129.492414] GFS:
> fsid=srvwebGFS:www.1: jid=1: Looking at journal...
> Apr  6 18:41:59 <node2> kernel: [ 9129.554362] GFS:
> fsid=srvwebGFS:www.1: jid=1: Done
> Apr  6 18:41:59 <node2> kernel: [ 9129.557633] GFS:
> fsid=srvwebGFS:www.1: Scanning for log elements...
> Apr  6 18:42:15 <node2> kernel: [ 9140.874965] BUG: soft lockup -
> CPU#1 stuck for 11s! [mount.gfs:7188]
> Apr  6 18:42:15 <node2> kernel: [ 9140.874997] CPU 1:
> (... long module listing ...)
> Apr  6 18:42:15 <node2> kernel: [ 9140.875078] Pid: 7188, comm:
> mount.gfs Not tainted 2.6.24-23-server #1
> Apr  6 18:42:15 <node2> kernel: [ 9140.875082] RIP:
> 0010:[gfs:gfs_unlinked_get+0x5c/0x190]
> [gfs:gfs_unlinked_get+0x5c/0x190] :gfs:gfs_unlinked_get+0x5c/0x190
> (... registers sutff ...)
> Apr  6 18:42:15 <node2> kernel: [ 9140.875147] Call Trace:
> Apr  6 18:42:15 <node2> kernel: [ 9140.875171]
> [gfs:gfs_unlinked_get+0x35/0x190] :gfs:gfs_unlinked_get+0x35/0x190
> Apr  6 18:42:15 <node2> kernel: [ 9140.875216]
> [gfs:gfs_unlinked_merge+0x47/0x1e0] :gfs:gfs_unlinked_merge+0x47/0x1e0
> Apr  6 18:42:15 <node2> kernel: [ 9140.875252]
> [gfs:unlinked_scan_elements+0xc9/0x1b0]
> :gfs:unlinked_scan_elements+0xc9/0x1b0
> Apr  6 18:42:15 <node2> kernel: [ 9140.875312]
> [gfs:foreach_descriptor+0x155/0x340]
> :gfs:foreach_descriptor+0x155/0x340
> Apr  6 18:42:15 <node2> kernel: [ 9140.875455]
> [gfs:gfs_recover_dump+0x121/0x1d0] :gfs:gfs_recover_dump+0x121/0x1d0
> Apr  6 18:42:15 <node2> kernel: [ 9140.875543]
> [gfs:gfs_make_fs_rw+0x114/0x150] :gfs:gfs_make_fs_rw+0x114/0x150
> Apr  6 18:42:15 <node2> kernel: [ 9140.875657]
> [gfs:init_journal+0x3fe/0x4b0] :gfs:init_journal+0x3fe/0x4b0
> Apr  6 18:42:15 <node2> kernel: [ 9140.875727]
> [gfs:fill_super+0x554/0x6a0] :gfs:fill_super+0x554/0x6a0
> Apr  6 18:42:15 <node2> kernel: [ 9140.875767]
> [set_bdev_super+0x0/0x10] set_bdev_super+0x0/0x10
> Apr  6 18:42:15 <node2> kernel: [ 9140.875791]
> [fuse:get_sb_bdev+0x14b/0x180] get_sb_bdev+0x14b/0x180
> Apr  6 18:42:15 <node2> kernel: [ 9140.875817]
> [gfs:fill_super+0x0/0x6a0] :gfs:fill_super+0x0/0x6a0
> Apr  6 18:42:15 <node2> kernel: [ 9140.875853]
> [vfs_kern_mount+0xcc/0x160] vfs_kern_mount+0xcc/0x160
> Apr  6 18:42:15 <node2> kernel: [ 9140.875877]
> [do_kern_mount+0x53/0x110] do_kern_mount+0x53/0x110
> Apr  6 18:42:15 <node2> kernel: [ 9140.875902]  [do_mount+0x546/0x810]
> do_mount+0x546/0x810
> Apr  6 18:42:15 <node2> kernel: [ 9140.875934]
> [n_tty_receive_buf+0x335/0xf30] n_tty_receive_buf+0x335/0xf30
> Apr  6 18:42:15 <node2> kernel: [ 9140.875953]
> [n_tty_receive_buf+0x335/0xf30] n_tty_receive_buf+0x335/0xf30
> Apr  6 18:42:15 <node2> kernel: [ 9140.875993]
> [zone_statistics+0x7d/0x80] zone_statistics+0x7d/0x80
> Apr  6 18:42:15 <node2> kernel: [ 9140.876043]
> [tty_default_put_char+0x1d/0x30] tty_default_put_char+0x1d/0x30
> Apr  6 18:42:15 <node2> kernel: [ 9140.876128]
> [configfs:__get_free_pages+0x1b/0x40] __get_free_pages+0x1b/0x40
> Apr  6 18:42:15 <node2> kernel: [ 9140.876152]  [sys_mount+0x9b/0x100]
> sys_mount+0x9b/0x100
> Apr  6 18:42:15 <node2> kernel: [ 9140.876183]
> [system_call+0x7e/0x83] system_call+0x7e/0x83
> (... more and more of CPUs 0-3 soft locking and stuck for x seconds,
> and then...)
> Apr  6 18:43:39 <node2> kernel: [ 9229.653782] dlm: closing connection to 
> node 1
> Apr  6 18:43:39 <node2> openais[7119]: cman killed by node 1 because
> we rejoined the cluster without a full restart
> Apr  6 18:43:39 <node2> groupd[7128]: cman_get_nodes error -1 104
> Apr  6 18:43:39 <node2> gfs_controld[7162]: cluster is down, exiting
> Apr  6 18:43:39 <node2> dlm_controld[7160]: cluster is down, exiting
> Apr  6 18:43:39 <node2> kernel: [ 9229.897402] dlm: closing connection to 
> node 2
> Apr  6 18:43:41 <node2> fenced[7135]: Unable to connect to to cman:
> Connection refused
> Apr  6 18:43:41 <node2> fenced[7135]: fence "<node1>" success
> Apr  6 18:44:05 <node2> ccsd[7115]: Unable to connect to cluster
> infrastructure after 60 seconds.
> Apr  6 18:44:14 <node2> kernel: [ 9264.560626]  rport-4:0-0: blocked
> FC remote port time out: saving binding
> Apr  6 18:44:14 <node2> kernel: [ 9264.560677]  rport-4:0-1: blocked
> FC remote port time out: saving binding
> Apr  6 18:44:14 <node2> kernel: [ 9264.574363] sd 4:0:1:0: [sdd]
> Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
> Apr  6 18:44:14 <node2> kernel: [ 9264.574371] end_request: I/O error,
> dev sdd, sector 157284528
> Apr  6 18:44:14 <node2> kernel: [ 9264.574386] GFS:
> fsid=srvwebGFS:www.1: fatal: I/O error
> Apr  6 18:44:14 <node2> kernel: [ 9264.574388] GFS:
> fsid=srvwebGFS:www.1:   block = 19660562
> Apr  6 18:44:14 <node2> kernel: [ 9264.574390] GFS:
> fsid=srvwebGFS:www.1:   function = gfs_dreread
> Apr  6 18:44:14 <node2> kernel: [ 9264.574391] GFS:
> fsid=srvwebGFS:www.1:   file =
> /build/buildd/linux-ubuntu-modules-2.6.24-2.6.24/debian/build/build-server/fs/gfs/dio.c,
> line = 576
> Apr  6 18:44:14 <node2> kernel: [ 9264.574394] GFS:
> fsid=srvwebGFS:www.1:   time = 1239057854
> Apr  6 18:44:14 <node2> kernel: [ 9264.574399] GFS:
> fsid=srvwebGFS:www.1: about to withdraw from the cluster
> Apr  6 18:44:14 <node2> kernel: [ 9264.574401] GFS:
> fsid=srvwebGFS:www.1: telling LM to withdraw
> Apr  6 18:44:35 <node2> ccsd[7115]: Unable to connect to cluster
> infrastructure after 90 seconds.
> (...)
> (power manually cut off)
>
> **** ==== From the machine that had the GFS already mounted:
>
> Apr  6 18:42:31 <node1> kernel: [246196.908621] dlm: closing
> connection to node 2
> Apr  6 18:42:31 <node1> fenced[5611]: <node2> not a cluster member
> after 0 sec post_fail_delay
> Apr  6 18:42:31 <node1> fenced[5611]: fencing node "<node2>"
> Apr  6 18:42:33 <node1> fenced[5611]: fence "<node2>" success
> Apr  6 18:42:33 <node1> kernel: [246198.585885] GFS:
> fsid=srvwebGFS:www.0: jid=1: Trying to acquire journal lock...
> Apr  6 18:43:19 <node1> kernel: [246244.692100] GFS:
> fsid=srvwebGFS:www.0: jid=1: Looking at journal...
> Apr  6 18:43:19 <node1> kernel: [246244.741659] GFS:
> fsid=srvwebGFS:www.0: jid=1: Done
> Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 100 
> MSG_PLOCK
> Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 200 
> MSG_PLOCK
> Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 300 
> MSG_PLOCK
> Apr  6 18:43:40 <node1> gfs_controld[5615]: cpg_mcast_joined retry 400 
> MSG_PLOCK
> (... several like these ...)
> Apr  6 18:43:41 <node1> gfs_controld[5615]: cpg_mcast_joined retry
> 1300 MSG_PLOCK
> Apr  6 18:43:41 <node1> kernel: [246266.801054] qla2xxx 0000:02:01.0:
> LOOP DOWN detected (2).
> Apr  6 18:43:41 <node1> gfs_controld[5615]: cpg_mcast_joined retry
> 1400 MSG_PLOCK
> (... again until ...)
> Apr  6 18:43:54 <node1> gfs_controld[5615]: cpg_mcast_joined retry
> 14200 MSG_PLOCK
> Apr  6 18:44:16 <node1> kernel: [246301.772188]  rport-4:0-0: blocked
> FC remote port time out: saving binding
> Apr  6 18:44:16 <node1> kernel: [246301.772236]  rport-4:0-1: blocked
> FC remote port time out: saving binding
> Apr  6 18:44:16 <node1> kernel: [246301.790639] sd 4:0:1:0: [sdd]
> Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
> Apr  6 18:44:16 <node1> kernel: [246301.790647] end_request: I/O
> error, dev sdd, sector 11453568
> Apr  6 18:44:16 <node1> kernel: [246301.790667] GFS:
> fsid=srvwebGFS:www.0: fatal: I/O error
> Apr  6 18:44:16 <node1> kernel: [246301.790672] GFS:
> fsid=srvwebGFS:www.0:   block = 1431692
> Apr  6 18:44:16 <node1> kernel: [246301.790676] GFS:
> fsid=srvwebGFS:www.0:   function = gfs_dreread
> Apr  6 18:44:16 <node1> kernel: [246301.790680] GFS:
> fsid=srvwebGFS:www.0:   file =
> /build/buildd/linux-ubuntu-modules-2.6.24-2.6.24/debian/build/build-server/fs/gfs/dio.c,
> line = 576
> Apr  6 18:44:16 <node1> kernel: [246301.790688] GFS:
> fsid=srvwebGFS:www.0:   time = 1239057856
> Apr  6 18:44:16 <node1> kernel: [246301.790694] sd 4:0:1:0: [sdd]
> Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
> Apr  6 18:44:16 <node1> kernel: [246301.790700] end_request: I/O
> error, dev sdd, sector 141890640
> Apr  6 18:44:16 <node1> kernel: [246301.790705] GFS:
> fsid=srvwebGFS:www.0: about to withdraw from the cluster
> Apr  6 18:44:16 <node1> kernel: [246301.790711] GFS:
> fsid=srvwebGFS:www.0: telling LM to withdraw
> Apr  6 18:44:16 <node1> kernel: [246301.790718] sd 4:0:1:0: [sdd]
> Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
> Apr  6 18:44:16 <node1> kernel: [246301.790731] end_request: I/O
> error, dev sdd, sector 3611616
> Apr  6 18:44:16 <node1> kernel: [246301.790751] sd 4:0:1:0: [sdd]
> Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
> Apr  6 18:44:16 <node1> kernel: [246301.790765] end_request: I/O
> error, dev sdd, sector 64424648
> Apr  6 18:44:16 <node1> kernel: [246301.790774] Buffer I/O error on
> device sdd1, logical block 8053077
> (...)
> Apr  6 18:45:22 <node1> kernel: [246367.692933] dlm: www: group leave
> failed -512 0
> Apr  6 18:45:22 <node1> kernel: [246368.160042] GFS:
> fsid=srvwebGFS:www.0: withdrawn
> (...)
> (power manually cut off)
>
> ==============================
>
> So, both machines fenced, GFS down, people screaming :)
> Up in the first log there is the line:
> Apr  6 18:43:39 <node2> openais[7119]: cman killed by node 1 because
> we rejoined the cluster without a full restart
>
> I wonder if it is consequence (it happened after the soft lockup) or
> cause. It was a clean boot without any of the cluster daemons going up
> before a I manually start each one of them. *Noticing now, I did
> unfenced the switch port when the machine was already up (but cman
> still down)*. Could that be the culprit?
>
> Thanks for any help or advice.
>
>
>
> Diego Morales
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From dgmorales at gmail.com  Wed Apr  8 19:23:34 2009
From: dgmorales at gmail.com (Diego Morales)
Date: Wed, 8 Apr 2009 16:23:34 -0300
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck
In-Reply-To: <5C88FA2A4F4E42418581BA1C38CB76D8@Desktop>
References: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com>
	<5C88FA2A4F4E42418581BA1C38CB76D8@Desktop>
Message-ID: <70e191b70904081223s44135bb0v64261b8af889b705@mail.gmail.com>

Well, it's a Supermicro server hardware (seems a little like "generic
server hardware with glorified name", never had seen one before), Xeon
3Ghz, two CPUs + HT, 8GB RAM.

Well, maybe. We recently had some overheating problems and power
outages on that room too.

But the lockup only happened when doing gfs mounting/umounting. And
those machines were up using the gfs file system for several days with
no problem.


On Wed, Apr 8, 2009 at 2:47 PM, Virginian <virginian at blueyonder.co.uk> wrote:
> I'm not entirely sure that the CPU lockup message is actually caused by
> RHCS.What hardware are you running on, how many CPUs, how much RAM? I've see
> this soft lockup error before on IBM 3850 machines running RHEL 5 (but not
> RHCS) I think....
>


From dist-list at LEXUM.UMontreal.CA  Wed Apr  8 20:17:49 2009
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Wed, 08 Apr 2009 16:17:49 -0400
Subject: [Linux-cluster] fence_ilo problem
Message-ID: <49DD066D.5060507@lexum.umontreal.ca>

I am testing a new 5.3 cluster.
After the creation of my fence method : ILO I tried to fence a node but  
received that in the log  :
Apr  8 16:14:55 cluster01-node1 fence_node[14298]: agent "fence_ilo" 
reports: Traceback (most recent call last):   File "/sbin/fence_ilo", 
line 18, in ?     from OpenSSL import SSL ImportError: No module named 
OpenSSL
Apr  8 16:14:55 cluster01-node1 fence_node[14298]: Fence of 
"cluster01-node2.cluster.lexum.pri" was unsuccessful

Any idea ?


From bmarzins at redhat.com  Wed Apr  8 21:16:38 2009
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Wed, 8 Apr 2009 16:16:38 -0500
Subject: [Linux-cluster] How to recover from gnbd "Open Disconnected
	Pending" state?
In-Reply-To: <0c3701c9b443$14cede80$3e6c9b80$@com>
References: <07bc01c9b35f$b5a675a0$20f360e0$@com>
	<0c3701c9b443$14cede80$3e6c9b80$@com>
Message-ID: <20090408211637.GD15911@ether.msp.redhat.com>

On Fri, Apr 03, 2009 at 06:00:56PM +0800, Kirby Zhou wrote:
> How to recover from gnbd "Open Disconnected Pending" state?
> 
> When the machine which exported gnbd is broken or shutdown, the client
> machine would fall into the state 'Open Disconnected Pending'.
> Any process accessing the dead gnbd will fall into state 'D'. 
> How can I remove the dead gnbd block device on the client machine?
> 
> #On the client machine
> [root at xen-727057 ~]# gnbd_import -n -l
> Device name : 63.131.xvdb
> ----------------------
>     Minor # : 0
>  sysfs name : /block/gnbd0
>      Server : 10.10.63.131
>        Port : 14567
>       State : Open Disconnected Pending
>    Readonly : No
>     Sectors : 16777216
> 
> [root at xen-727057 ~]# pvs &
> [1] 4561
> [root at xen-727057 ~]# ps aux | fgrep pvs
> root      4561  0.6  0.1  79364  1544 pts/0    D    14:50   0:00 pvs
> root      4563  0.0  0.0  61112   608 pts/0    S+   14:50   0:00 fgrep pvs
> 
> [root at xen-727057 ~]# gnbd_import -n -R
> gnbd_import: ERROR cannot disconnect device #0 : Device or resource busy

Did you try

#gnbd_import -n -RO

The override option should allow you to remove the device and fail the
IO.

-Ben


From redhat at insanegeeks.com  Wed Apr  8 21:32:03 2009
From: redhat at insanegeeks.com (Thomas Suiter)
Date: Wed, 8 Apr 2009 17:32:03 -0400
Subject: [Linux-cluster] GFS network and fencing questions
Message-ID: <001101c9b891$7eb9e310$7c2da930$@com>

 
I'm going to be building a 6 node cluster with blade servers that only have
2x network connections attached to EMC DMX storage.  The application we are
running has it's own cluster layer so we won't be using the failover
services (they just want the filesystem to be visible to all nodes).  Each
node should be reading/writing only in it's own directory with a single
filesystem size ~15TB.

 
Questions I have are this:

 
      1) The documentation is unclear as to this, I'm assuming that I should
I bond the 2x interfaces rather than have one interface for public and one
for private.  I'm thinking this will make the system much more available in
general, but I don't know if the public/private is a hard requirement (or if
what I'm thinking is even better) Best case would be to get 2x more but
unfortunately I don't have that luxury. If this is preferred, would I need
to use 2x ip addresses in this configuration, or can I use just the 1x per
node.

 
      2) I have the capabilities to support scsi3 reservations inside the
DMX, should I be using scsi3 instead of power based fencing (or both).  It
seems like a relatively option, is it ready for use or should it bake a bit
longer. I've used Veritas VCS with scsi3 previously and it was sometimes
semi-annoying.  But the reality is that availability and data protection is
more important than not being annoyed.

 
      3) Since I have more than 2x nodes should I use qdiskd or not (or is
it even needed in this type of configuration with no failover); looking
around it appears that it's caused some problems in the past.

 
      4) Any other tips for a first time GFS user

 
Thanks

      Thomas Suiter

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090408/1f2ed78e/attachment.htm>

From jon.skelton at gmail.com  Thu Apr  9 05:19:47 2009
From: jon.skelton at gmail.com (Jon Skelton)
Date: Wed, 8 Apr 2009 22:19:47 -0700
Subject: [Linux-cluster] fence_ilo problem
In-Reply-To: <49DD066D.5060507@lexum.umontreal.ca>
References: <49DD066D.5060507@lexum.umontreal.ca>
Message-ID: <b637d2000904082219r3525fb9eka1987297ffb4fd82@mail.gmail.com>

fence_ilo requires the following package:

perl-Crypt-SSLeay

and its dependencies.  This package is not installed automatically (well, in
RHEL5.[012]).

hope this helps,
Jon

On Wed, Apr 8, 2009 at 1:17 PM, FM <dist-list at lexum.umontreal.ca> wrote:

> I am testing a new 5.3 cluster.
> After the creation of my fence method : ILO I tried to fence a node but
>  received that in the log  :
> Apr  8 16:14:55 cluster01-node1 fence_node[14298]: agent "fence_ilo"
> reports: Traceback (most recent call last):   File "/sbin/fence_ilo", line
> 18, in ?     from OpenSSL import SSL ImportError: No module named OpenSSL
> Apr  8 16:14:55 cluster01-node1 fence_node[14298]: Fence of
> "cluster01-node2.cluster.lexum.pri" was unsuccessful
>
> Any idea ?
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090408/0d49fd82/attachment.htm>

From s.wendy.cheng at gmail.com  Thu Apr  9 06:23:38 2009
From: s.wendy.cheng at gmail.com (Wendy Cheng)
Date: Thu, 9 Apr 2009 01:23:38 -0500
Subject: [Linux-cluster] Freeze with cluster-2.03.11
In-Reply-To: <49DD9246.7000709@gmail.com>
References: <1404804625.1710261238184677530.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<49D589B8.1050702@gmail.com>
	<alpine.DEB.2.00.0904030828080.17892@lxserv0.kfki.hu>
	<49D61A0B.9010001@gmail.com>
	<alpine.DEB.2.00.0904031651040.10568@lxserv0.kfki.hu>
	<49D62E11.5050609@gmail.com> <49D98132.9010201@gmail.com>
	<alpine.DEB.2.00.0904061301110.22370@lxserv1.kfki.hu>
	<alpine.DEB.2.00.0904071552510.15283@lxserv1.kfki.hu>
	<49DD9246.7000709@gmail.com>
Message-ID: <1a2a6dd60904082323l6b17fe4eud510216e09ff9366@mail.gmail.com>

 Kadlecsik Jozsef wrote:
>
>
>
> EXPORT_SYMBOL(inode_lock);
>
> line to fs/inode.c, recompiled the kernel and the modules.
>
> Starting mailman in the test environment did not produce the almost
> instant freeze. I started/stopped mailman several times and the system
> worked just fine. So I believe the patch above and the plus line in
> fs/inode.c fix the reported problem. I dunno whether modifying
> fs/inode.c is acceptable or not...
>
>
>
>  Glad to learn you got it working! Send your patch to cluster-devel to see
> whether GFS team could take it. I personally think any other solution,
> though doable, would not worth the effort. And people using upstream kernel
> needs to do recompiling anyway so that EXPORT_SYMBOL thing should not be too
> bad.
>
> -- Wendy
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090409/e39398a4/attachment.htm>

From virginian at blueyonder.co.uk  Thu Apr  9 12:16:46 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Thu, 9 Apr 2009 13:16:46 +0100
Subject: [Linux-cluster] fence_ilo problem
References: <49DD066D.5060507@lexum.umontreal.ca>
Message-ID: <0CB19652BF244B7BA501CCCD43A93CBE@Desktop>

Install the pyOpenSSL package:

yum install pyOpenSSL

Then fence_ilo should work for you.


----- Original Message ----- 
From: "FM" <dist-list at LEXUM.UMontreal.CA>
To: "Redhat Cluster" <linux-cluster at redhat.com>
Sent: Wednesday, April 08, 2009 9:17 PM
Subject: [Linux-cluster] fence_ilo problem


>I am testing a new 5.3 cluster.
> After the creation of my fence method : ILO I tried to fence a node but  
> received that in the log  :
> Apr  8 16:14:55 cluster01-node1 fence_node[14298]: agent "fence_ilo" 
> reports: Traceback (most recent call last):   File "/sbin/fence_ilo", 
> line 18, in ?     from OpenSSL import SSL ImportError: No module named 
> OpenSSL
> Apr  8 16:14:55 cluster01-node1 fence_node[14298]: Fence of 
> "cluster01-node2.cluster.lexum.pri" was unsuccessful
> 
> Any idea ?
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From virginian at blueyonder.co.uk  Thu Apr  9 12:38:32 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Thu, 9 Apr 2009 13:38:32 +0100
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck
References: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com><5C88FA2A4F4E42418581BA1C38CB76D8@Desktop>
	<70e191b70904081223s44135bb0v64261b8af889b705@mail.gmail.com>
Message-ID: <6FE6F438AC544FAFBD3454D55E11FB90@Desktop>

This could be a kernel bug. A quick google search revealed this:

http://www.linuxquestions.org/questions/linux-server-73/bug-soft-lockup-cpu3-stuck-for-10s-648946/

Probably worth googling some more but I don't think this is a RHCS issue to 
be honest.


----- Original Message ----- 
From: "Diego Morales" <dgmorales at gmail.com>
To: "linux clustering" <linux-cluster at redhat.com>
Sent: Wednesday, April 08, 2009 8:23 PM
Subject: Re: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU 
stuck


> Well, it's a Supermicro server hardware (seems a little like "generic
> server hardware with glorified name", never had seen one before), Xeon
> 3Ghz, two CPUs + HT, 8GB RAM.
>
> Well, maybe. We recently had some overheating problems and power
> outages on that room too.
>
> But the lockup only happened when doing gfs mounting/umounting. And
> those machines were up using the gfs file system for several days with
> no problem.
>
>
> On Wed, Apr 8, 2009 at 2:47 PM, Virginian <virginian at blueyonder.co.uk> 
> wrote:
>> I'm not entirely sure that the CPU lockup message is actually caused by
>> RHCS.What hardware are you running on, how many CPUs, how much RAM? I've 
>> see
>> this soft lockup error before on IBM 3850 machines running RHEL 5 (but 
>> not
>> RHCS) I think....
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From crosa at redhat.com  Thu Apr  9 14:01:53 2009
From: crosa at redhat.com (Cleber Rodrigues)
Date: Thu, 09 Apr 2009 11:01:53 -0300
Subject: [Linux-cluster] fence_ilo problem
In-Reply-To: <49DD066D.5060507@lexum.umontreal.ca>
References: <49DD066D.5060507@lexum.umontreal.ca>
Message-ID: <1239285713.6626.1.camel@localhost.localdomain>

You should install the 'pyOpenSSL' package.

On Wed, 2009-04-08 at 16:17 -0400, FM wrote:
> I am testing a new 5.3 cluster.
> After the creation of my fence method : ILO I tried to fence a node but  
> received that in the log  :
> Apr  8 16:14:55 cluster01-node1 fence_node[14298]: agent "fence_ilo" 
> reports: Traceback (most recent call last):   File "/sbin/fence_ilo", 
> line 18, in ?     from OpenSSL import SSL ImportError: No module named 
> OpenSSL
> Apr  8 16:14:55 cluster01-node1 fence_node[14298]: Fence of 
> "cluster01-node2.cluster.lexum.pri" was unsuccessful
> 
> Any idea ?
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-- 
Cleber Rodrigues <crosa at redhat.com>
Solutions Architect - Red Hat, Inc.


From crosa at redhat.com  Thu Apr  9 16:32:33 2009
From: crosa at redhat.com (Cleber Rodrigues)
Date: Thu, 09 Apr 2009 13:32:33 -0300
Subject: [Linux-cluster] fence_ilo problem
In-Reply-To: <b637d2000904082219r3525fb9eka1987297ffb4fd82@mail.gmail.com>
References: <49DD066D.5060507@lexum.umontreal.ca>
	<b637d2000904082219r3525fb9eka1987297ffb4fd82@mail.gmail.com>
Message-ID: <1239294753.6626.9.camel@localhost.localdomain>

AFAIK, there's been a rewrite of fence_ilo, now in Python
(whoooraaay!). 

On Wed, 2009-04-08 at 22:19 -0700, Jon Skelton wrote:
> fence_ilo requires the following package:
> 
> perl-Crypt-SSLeay 
> 
> and its dependencies.  This package is not installed automatically
> (well, in RHEL5.[012]).
> 
> hope this helps,
> Jon
> 
> On Wed, Apr 8, 2009 at 1:17 PM, FM <dist-list at lexum.umontreal.ca>
> wrote:
>         I am testing a new 5.3 cluster.
>         After the creation of my fence method : ILO I tried to fence a
>         node but  received that in the log  :
>         Apr  8 16:14:55 cluster01-node1 fence_node[14298]: agent
>         "fence_ilo" reports: Traceback (most recent call last):   File
>         "/sbin/fence_ilo", line 18, in ?     from OpenSSL import SSL
>         ImportError: No module named OpenSSL
>         Apr  8 16:14:55 cluster01-node1 fence_node[14298]: Fence of
>         "cluster01-node2.cluster.lexum.pri" was unsuccessful
>         
>         Any idea ?
>         
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com
>         https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-- 
Cleber Rodrigues <crosa at redhat.com>
Solutions Architect - Red Hat, Inc.


From pavlos.parissis at gmail.com  Thu Apr  9 22:35:06 2009
From: pavlos.parissis at gmail.com (Pavlos Parissis)
Date: Fri, 10 Apr 2009 00:35:06 +0200
Subject: [Linux-cluster] prevent ping-pong
Message-ID: <fb143d260904091535l26030d31nf1679075ba6ce7cc@mail.gmail.com>

Hi,

Is there a way to configure cluster to not ping-pong a service more than N
times?

I have an application where sometimes cluster gets a no zero status on the
check and keeps failover back and forth the service.
I want the cluster to try only N times to fail over the service and then
disables it if it can't start it or gets a no zero status on checks after N
times.

System-info
# cat /etc/redhat-release
Red Hat Enterprise Linux AS release 4 (Nahant Update 4)
# rpm -q cman
cman-1.0.11-0
# rpm -q rgmanager-1.9.53-0
rgmanager-1.9.53-0


Cheers,
Pavlos
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090410/461acaaf/attachment.htm>

From cthulhucalling at gmail.com  Fri Apr 10 00:07:58 2009
From: cthulhucalling at gmail.com (Ian Hayes)
Date: Thu, 9 Apr 2009 20:07:58 -0400
Subject: [Linux-cluster] Fenced failing continuously
Message-ID: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com>

I've been testing a newly built 2-node cluster. The cluster resources are a
virtual IP and squid, so in a node failure, the VIP would go to the
surviving node and start up Squid. I'm running a modified fencing agent that
will SSH into the failing node and firewall it off via IPtables (not my
choice).

This all works fine for graceful shutdowns, but when I do something nasty
like pulling the power cord on the node that is currently running the
service, the surviving node never assumes the service and spends all its
time trying to fire off the fence agent, which obviously will not work
because the server is completely offline. The only way I can get the
surviving node to assume the VIP and start Squid is to fence_ack_manual,
which sort of runs counter to running a cluster to begin with. The logs are
filled with

Apr 12 00:01:44 <hostname> fenced[3223]: fencing node "<otherhost>"
 Could not disable xx.xx.xx.xx on    23]: agent "fence_iptables" reports:
ssh: connect to host xx.xx.xx.xx port 22: No route to host

Is this a misconfiguration, or is there an option I can include somewhere to
tell the nodes to give it up after a certain number of tries?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090409/cb7ecbe9/attachment.htm>

From virginian at blueyonder.co.uk  Fri Apr 10 06:36:58 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Fri, 10 Apr 2009 07:36:58 +0100
Subject: [Linux-cluster] Fenced failing continuously
References: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com>
Message-ID: <6D2741530D9041139E81EFDCCF9EE46B@Desktop>

Hi Ian,

I think there is a flaw in the design. For example, say the network card fails on machine A. Machine B detects this and tries to fence machine A. The problem with doing it via ssh to modify iptables is that there is no network connectivity to Machine A and hence this mechanism will never work. What you need is a solution that works independently of the OS such as a power switch or remote management interface such as IBM RSA II, HP iLO etc. With fencing, the solution has to be absolute and ruthless in that, in this example, machine B needs to be able to fence Machine A absolutely every time there is a problem and as soon as there is a problem.

Regards

John

 
  ----- Original Message ----- 
  From: Ian Hayes 
  To: linux-cluster at redhat.com 
  Sent: Friday, April 10, 2009 1:07 AM
  Subject: [Linux-cluster] Fenced failing continuously


  I've been testing a newly built 2-node cluster. The cluster resources are a virtual IP and squid, so in a node failure, the VIP would go to the surviving node and start up Squid. I'm running a modified fencing agent that will SSH into the failing node and firewall it off via IPtables (not my choice).

  This all works fine for graceful shutdowns, but when I do something nasty like pulling the power cord on the node that is currently running the service, the surviving node never assumes the service and spends all its time trying to fire off the fence agent, which obviously will not work because the server is completely offline. The only way I can get the surviving node to assume the VIP and start Squid is to fence_ack_manual, which sort of runs counter to running a cluster to begin with. The logs are filled with 

  Apr 12 00:01:44 <hostname> fenced[3223]: fencing node "<otherhost>"
   Could not disable xx.xx.xx.xx on    23]: agent "fence_iptables" reports: ssh: connect to host xx.xx.xx.xx port 22: No route to host

  Is this a misconfiguration, or is there an option I can include somewhere to tell the nodes to give it up after a certain number of tries?


------------------------------------------------------------------------------


  --
  Linux-cluster mailing list
  Linux-cluster at redhat.com
  https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090410/40362838/attachment.htm>

From Alain.Moulle at bull.net  Fri Apr 10 06:51:27 2009
From: Alain.Moulle at bull.net (Alain.Moulle)
Date: Fri, 10 Apr 2009 08:51:27 +0200
Subject: [Linux-cluster] Question about performance on gfs2
Message-ID: <49DEEC6F.2050509@bull.net>

Hi

We have executed some benchs on gfs2, gfs compared to ext3 and ocfs2
and the results are (just some examples):

* Read on 1 file   :
        quite the same performance

* Write on 1 file :
        
	quite the same performance

* Creation of many many files and repertories :
        gfs2 is 10 times less good than ext3 and ocfs2

* Write of a file tree followed by 4 reading process : 
        gfs and gfs2 : 40 mn
        ocfs2 : 3 mn

Does it seem normal for you ? or is there some hidden tuning somewhere 
to increase
performances when using gfs2 (or gfs)  ?

Thanks a lot.
Regards.
Alain
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090410/8b0aa6f2/attachment.htm>

From dgmorales at gmail.com  Sat Apr 11 23:33:24 2009
From: dgmorales at gmail.com (Diego Morales)
Date: Sat, 11 Apr 2009 20:33:24 -0300
Subject: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU stuck
In-Reply-To: <6FE6F438AC544FAFBD3454D55E11FB90@Desktop>
References: <70e191b70904072139n1cc43be2pf218edf3f5db0e9e@mail.gmail.com>
	<5C88FA2A4F4E42418581BA1C38CB76D8@Desktop>
	<70e191b70904081223s44135bb0v64261b8af889b705@mail.gmail.com>
	<6FE6F438AC544FAFBD3454D55E11FB90@Desktop>
Message-ID: <70e191b70904111633l43d2033y7a02749bc5789ce1@mail.gmail.com>

Well just for the record... After trying again to mount the GFS on the
second and having the exact same problem, I did:

* A gfs_fsck, which found many inode errors.
* Paid more attention so that when the second node comes up, the fc
switch port is already up.

After that I got up the cluster two times without any problem (had a
scheduled shutdown in between).
Unfortunaly, I didn't have the time to rule one of those out, or do
any more tests.

Thanks for  the anwsers...


Diego Morales


On Thu, Apr 9, 2009 at 9:38 AM, Virginian <virginian at blueyonder.co.uk> wrote:
> This could be a kernel bug. A quick google search revealed this:
>
> http://www.linuxquestions.org/questions/linux-server-73/bug-soft-lockup-cpu3-stuck-for-10s-648946/
>
> Probably worth googling some more but I don't think this is a RHCS issue to
> be honest.
>
>
> ----- Original Message ----- From: "Diego Morales" <dgmorales at gmail.com>
> To: "linux clustering" <linux-cluster at redhat.com>
> Sent: Wednesday, April 08, 2009 8:23 PM
> Subject: Re: [Linux-cluster] GFS on Ubuntu hardy: BUG soft lockup - CPU
> stuck
>
>
>> Well, it's a Supermicro server hardware (seems a little like "generic
>> server hardware with glorified name", never had seen one before), Xeon
>> 3Ghz, two CPUs + HT, 8GB RAM.
>>
>> Well, maybe. We recently had some overheating problems and power
>> outages on that room too.
>>
>> But the lockup only happened when doing gfs mounting/umounting. And
>> those machines were up using the gfs file system for several days with
>> no problem.
>>
>>
>> On Wed, Apr 8, 2009 at 2:47 PM, Virginian <virginian at blueyonder.co.uk>
>> wrote:
>>>
>>> I'm not entirely sure that the CPU lockup message is actually caused by
>>> RHCS.What hardware are you running on, how many CPUs, how much RAM? I've
>>> see
>>> this soft lockup error before on IBM 3850 machines running RHEL 5 (but
>>> not
>>> RHCS) I think....
>>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From cthulhucalling at gmail.com  Mon Apr 13 16:19:16 2009
From: cthulhucalling at gmail.com (Ian Hayes)
Date: Mon, 13 Apr 2009 12:19:16 -0400
Subject: [Linux-cluster] Fenced failing continuously
In-Reply-To: <6D2741530D9041139E81EFDCCF9EE46B@Desktop>
References: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com>
	<6D2741530D9041139E81EFDCCF9EE46B@Desktop>
Message-ID: <36df569a0904130919jd1862dct313bc309ce789bd7@mail.gmail.com>

I realize that the ssh option is not optimal, but I'm stuck with the design
requirements. I'm hoping I can get them changed.

But, this got me thinking... conventional fencing is not failsafe. I can
think of quite a number of less than optimal but entirely real-world
situations where a node can die and not be able to be absolutely fenced off.
iLO only works of the victim node still has power. I've only been in 1 shop
that had the APC managed power, and they didn't even have that set up.
Brocade fencing doesn't always apply, especially if you're just doing a
virtual IP. So sometimes having a second fencing method as a backup may not
always be feasible.

So even with more traditional fences, this may not work unless I start
modding fence scripts to return a success code even if they fail.

On Fri, Apr 10, 2009 at 2:36 AM, Virginian <virginian at blueyonder.co.uk>wrote:

>  Hi Ian,
>
> I think there is a flaw in the design. For example, say the network card
> fails on machine A. Machine B detects this and tries to fence machine A. The
> problem with doing it via ssh to modify iptables is that there is no network
> connectivity to Machine A and hence this mechanism will never work. What you
> need is a solution that works independently of the OS such as a power switch
> or remote management interface such as IBM RSA II, HP iLO etc. With fencing,
> the solution has to be absolute and ruthless in that, in this example,
> machine B needs to be able to fence Machine A absolutely every time there is
> a problem and as soon as there is a problem.
>
> Regards
>
> John
>
>
>
> ----- Original Message -----
> *From:* Ian Hayes <cthulhucalling at gmail.com>
> *To:* linux-cluster at redhat.com
> *Sent:* Friday, April 10, 2009 1:07 AM
> *Subject:* [Linux-cluster] Fenced failing continuously
>
> I've been testing a newly built 2-node cluster. The cluster resources are a
> virtual IP and squid, so in a node failure, the VIP would go to the
> surviving node and start up Squid. I'm running a modified fencing agent that
> will SSH into the failing node and firewall it off via IPtables (not my
> choice).
>
> This all works fine for graceful shutdowns, but when I do something nasty
> like pulling the power cord on the node that is currently running the
> service, the surviving node never assumes the service and spends all its
> time trying to fire off the fence agent, which obviously will not work
> because the server is completely offline. The only way I can get the
> surviving node to assume the VIP and start Squid is to fence_ack_manual,
> which sort of runs counter to running a cluster to begin with. The logs are
> filled with
>
> Apr 12 00:01:44 <hostname> fenced[3223]: fencing node "<otherhost>"
>  Could not disable xx.xx.xx.xx on    23]: agent "fence_iptables" reports:
> ssh: connect to host xx.xx.xx.xx port 22: No route to host
>
> Is this a misconfiguration, or is there an option I can include somewhere
> to tell the nodes to give it up after a certain number of tries?
>
> ------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/6d315ecd/attachment.htm>

From rhurst at bidmc.harvard.edu  Mon Apr 13 18:08:00 2009
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Mon, 13 Apr 2009 18:08:00 +0000
Subject: [Linux-cluster] Fenced failing continuously
In-Reply-To: <36df569a0904130919jd1862dct313bc309ce789bd7@mail.gmail.com>
References: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com>
	<6D2741530D9041139E81EFDCCF9EE46B@Desktop>
	<36df569a0904130919jd1862dct313bc309ce789bd7@mail.gmail.com>
Message-ID: <1239646080.3339.54.camel@WSBID06223.bidmc.harvard.edu>

You're right about there is no such thing as fail-safe ... but I would
worry more if I just hard-code a return value of SUCCESS in my scripts.
Management cards are supposed to work, even if they are powered down --
not that there is a loss of power to both lines.  If that is the case,
no electricity == no servers == no cluster, which means you are doing a
cold boot regardless.

We have both fence_ilo and fence_bladecenter in effect.  As good as the
iLO cards have performed to date, we are still moving off HP DL385s into
IBM BladeCenter because its management processors are closer to fault
tolerant than anything else we have experienced.  I have had HP iLO
cards "crash" and not reset themselves -- although later firmware
revisions have reduced those outages greatly.  Monitoring its https and
ssh ports for availability are a requirement!

There is user-contributed fence_ilo patch listed somewhere in this list
worth investigating -- it runs A LOT FASTER than the stock one.  AFAIK,
the fence_ilo does not use ssh, but a sort of web soap services call via
https.  We have seen in production and testing that a typical fencing
operation using fence_ilo is 42-seconds, and a good percentage of time,
up to twice as long as that.  The bladecenter fencing operations we have
seen occur in under 7-seconds.

We are starting to roll-out smart APC switches to allow for remote power
control, and I will consider adding them in as secondary fence devices.
But I figure if highly-available dual AMMs in a blade chassis fail, I
probably have a lot more problems to deal with -- and I would rather
have my clustered apps suspend before incurring any further harm.


________________________________________________________________________


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.


On Mon, 2009-04-13 at 12:19 -0400, Ian Hayes wrote:

> I realize that the ssh option is not optimal, but I'm stuck with the
> design requirements. I'm hoping I can get them changed.
> 
> But, this got me thinking... conventional fencing is not failsafe. I
> can think of quite a number of less than optimal but entirely
> real-world situations where a node can die and not be able to be
> absolutely fenced off. iLO only works of the victim node still has
> power. I've only been in 1 shop that had the APC managed power, and
> they didn't even have that set up. Brocade fencing doesn't always
> apply, especially if you're just doing a virtual IP. So sometimes
> having a second fencing method as a backup may not always be feasible.
> 
> So even with more traditional fences, this may not work unless I start
> modding fence scripts to return a success code even if they fail.
> 
> 
> On Fri, Apr 10, 2009 at 2:36 AM, Virginian
> <virginian at blueyonder.co.uk> wrote:
> 
>         Hi Ian,
>          
>         I think there is a flaw in the design. For example, say the
>         network card fails on machine A. Machine B detects this and
>         tries to fence machine A. The problem with doing it via ssh to
>         modify iptables is that there is no network connectivity to
>         Machine A and hence this mechanism will never work. What you
>         need is a solution that works independently of the OS such as
>         a power switch or remote management interface such as IBM RSA
>         II, HP iLO etc. With fencing, the solution has to be absolute
>         and ruthless in that, in this example, machine B needs to be
>         able to fence Machine A absolutely every time there is a
>         problem and as soon as there is a problem.
>          
>         Regards
>          
>         John
>          
>          
>                 
>                 ----- Original Message ----- 
>                 From: Ian Hayes 
>                 To: linux-cluster at redhat.com 
>                 Sent: Friday, April 10, 2009 1:07 AM
>                 Subject: [Linux-cluster] Fenced failing continuously
>                 
>                 
>                 
>                 I've been testing a newly built 2-node cluster. The
>                 cluster resources are a virtual IP and squid, so in a
>                 node failure, the VIP would go to the surviving node
>                 and start up Squid. I'm running a modified fencing
>                 agent that will SSH into the failing node and firewall
>                 it off via IPtables (not my choice).
>                 
>                 This all works fine for graceful shutdowns, but when I
>                 do something nasty like pulling the power cord on the
>                 node that is currently running the service, the
>                 surviving node never assumes the service and spends
>                 all its time trying to fire off the fence agent, which
>                 obviously will not work because the server is
>                 completely offline. The only way I can get the
>                 surviving node to assume the VIP and start Squid is to
>                 fence_ack_manual, which sort of runs counter to
>                 running a cluster to begin with. The logs are filled
>                 with 
>                 
>                 Apr 12 00:01:44 <hostname> fenced[3223]: fencing node
>                 "<otherhost>"
>                  Could not disable xx.xx.xx.xx on    23]: agent
>                 "fence_iptables" reports: ssh: connect to host
>                 xx.xx.xx.xx port 22: No route to host
>                 
>                 Is this a misconfiguration, or is there an option I
>                 can include somewhere to tell the nodes to give it up
>                 after a certain number of tries?
>                 
>                 
>                 
>                 ______________________________________________________
>                 
>                 --
>                 Linux-cluster mailing list
>                 Linux-cluster at redhat.com
>                 https://www.redhat.com/mailman/listinfo/linux-cluster
>         
>         
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com
>         https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/9ea95c07/attachment.htm>

From cryptogrid at gmail.com  Mon Apr 13 19:11:09 2009
From: cryptogrid at gmail.com (crypto grid)
Date: Mon, 13 Apr 2009 16:11:09 -0300
Subject: [Linux-cluster] Question about performance on gfs2
In-Reply-To: <49DEEC6F.2050509@bull.net>
References: <49DEEC6F.2050509@bull.net>
Message-ID: <a9f464b80904131211m5a40e5dcv5d07856f38c2da2a@mail.gmail.com>

Could you mention please, the benchmarks you are using?

Thanks,

On Fri, Apr 10, 2009 at 3:51 AM, Alain.Moulle <Alain.Moulle at bull.net> wrote:

>  Hi
>
> We have executed some benchs on gfs2, gfs compared to ext3 and ocfs2
> and the results are (just some examples):
>
> * Read on 1 file   :
>         quite the same performance
>
> * Write on 1 file :
>
> 	quite the same performance
>
> * Creation of many many files and repertories :
>         gfs2 is 10 times less good than ext3 and ocfs2
>
> * Write of a file tree followed by 4 reading process :
>         gfs and gfs2 : 40 mn
>         ocfs2 : 3 mn
>
> Does it seem normal for you ? or is there some hidden tuning somewhere to
> increase
> performances when using gfs2 (or gfs)  ?
>
> Thanks a lot.
> Regards.
> Alain
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/1a0bd917/attachment.htm>

From virginian at blueyonder.co.uk  Mon Apr 13 19:28:42 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Mon, 13 Apr 2009 20:28:42 +0100
Subject: [Linux-cluster] Fenced failing continuously
References: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com><6D2741530D9041139E81EFDCCF9EE46B@Desktop>
	<36df569a0904130919jd1862dct313bc309ce789bd7@mail.gmail.com>
Message-ID: <FF6BAFFDFC4F4AADBD8B811E7B0E8889@Desktop>

If you lose power to the server (ie the victim node does not have power) then it is effectively fenced and cannot cause I/O data corruption. More than one fencing method will increase the effectiveness of cluster fencing but only in so far as the errant machine can be prevented from performing I/O to the data. Sadly, whoever designed your cluster is clearly not experienced enough to be designing HA / Cluster solutions. You really do need to convince the sponsors of your project that attempting to fence a server via ssh / iptables is not the way to go and a more robust solution is called for. 
  ----- Original Message ----- 
  From: Ian Hayes 
  To: linux clustering 
  Sent: Monday, April 13, 2009 5:19 PM
  Subject: Re: [Linux-cluster] Fenced failing continuously


  I realize that the ssh option is not optimal, but I'm stuck with the design requirements. I'm hoping I can get them changed.

  But, this got me thinking... conventional fencing is not failsafe. I can think of quite a number of less than optimal but entirely real-world situations where a node can die and not be able to be absolutely fenced off. iLO only works of the victim node still has power. I've only been in 1 shop that had the APC managed power, and they didn't even have that set up. Brocade fencing doesn't always apply, especially if you're just doing a virtual IP. So sometimes having a second fencing method as a backup may not always be feasible.

  So even with more traditional fences, this may not work unless I start modding fence scripts to return a success code even if they fail.


  On Fri, Apr 10, 2009 at 2:36 AM, Virginian <virginian at blueyonder.co.uk> wrote:

    Hi Ian,

    I think there is a flaw in the design. For example, say the network card fails on machine A. Machine B detects this and tries to fence machine A. The problem with doing it via ssh to modify iptables is that there is no network connectivity to Machine A and hence this mechanism will never work. What you need is a solution that works independently of the OS such as a power switch or remote management interface such as IBM RSA II, HP iLO etc. With fencing, the solution has to be absolute and ruthless in that, in this example, machine B needs to be able to fence Machine A absolutely every time there is a problem and as soon as there is a problem.

    Regards

    John


      ----- Original Message ----- 
      From: Ian Hayes 
      To: linux-cluster at redhat.com 
      Sent: Friday, April 10, 2009 1:07 AM
      Subject: [Linux-cluster] Fenced failing continuously


      I've been testing a newly built 2-node cluster. The cluster resources are a virtual IP and squid, so in a node failure, the VIP would go to the surviving node and start up Squid. I'm running a modified fencing agent that will SSH into the failing node and firewall it off via IPtables (not my choice).

      This all works fine for graceful shutdowns, but when I do something nasty like pulling the power cord on the node that is currently running the service, the surviving node never assumes the service and spends all its time trying to fire off the fence agent, which obviously will not work because the server is completely offline. The only way I can get the surviving node to assume the VIP and start Squid is to fence_ack_manual, which sort of runs counter to running a cluster to begin with. The logs are filled with 

      Apr 12 00:01:44 <hostname> fenced[3223]: fencing node "<otherhost>"
       Could not disable xx.xx.xx.xx on    23]: agent "fence_iptables" reports: ssh: connect to host xx.xx.xx.xx port 22: No route to host

      Is this a misconfiguration, or is there an option I can include somewhere to tell the nodes to give it up after a certain number of tries?


--------------------------------------------------------------------------


      --
      Linux-cluster mailing list
      Linux-cluster at redhat.com
      https://www.redhat.com/mailman/listinfo/linux-cluster

    --
    Linux-cluster mailing list
    Linux-cluster at redhat.com
    https://www.redhat.com/mailman/listinfo/linux-cluster


------------------------------------------------------------------------------


  --
  Linux-cluster mailing list
  Linux-cluster at redhat.com
  https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/d6ccad49/attachment.htm>

From cthulhucalling at gmail.com  Mon Apr 13 20:26:49 2009
From: cthulhucalling at gmail.com (Ian Hayes)
Date: Mon, 13 Apr 2009 16:26:49 -0400
Subject: [Linux-cluster] Fenced failing continuously
In-Reply-To: <1239646080.3339.54.camel@WSBID06223.bidmc.harvard.edu>
References: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com>
	<6D2741530D9041139E81EFDCCF9EE46B@Desktop>
	<36df569a0904130919jd1862dct313bc309ce789bd7@mail.gmail.com>
	<1239646080.3339.54.camel@WSBID06223.bidmc.harvard.edu>
Message-ID: <36df569a0904131326q7d030cccy8bcc20d6756b6881@mail.gmail.com>

Thanks for the info, I will be sure to have our monitor watch the iLO
ports...

I've done some testing with fence_ilo and haven't seen a lengthy failover
time. I'm running the Python script that is part of the Clustering group. Is
that the user contributed one? My testing right now has failover done in a
few seconds.


On Mon, Apr 13, 2009 at 2:08 PM, Robert Hurst <rhurst at bidmc.harvard.edu>wrote:

>  You're right about there is no such thing as fail-safe ... but I would
> worry more if I just hard-code a return value of SUCCESS in my scripts.
> Management cards are supposed to work, even if they are powered down -- not
> that there is a loss of power to both lines.  If that is the case, no
> electricity == no servers == no cluster, which means you are doing a cold
> boot regardless.
>
> We have both fence_ilo and fence_bladecenter in effect.  As good as the iLO
> cards have performed to date, we are still moving off HP DL385s into IBM
> BladeCenter because its management processors are closer to fault tolerant
> than anything else we have experienced.  I have had HP iLO cards "crash" and
> not reset themselves -- although later firmware revisions have reduced those
> outages greatly.  Monitoring its https and ssh ports for availability are a
> requirement!
>
> There is user-contributed fence_ilo patch listed somewhere in this list
> worth investigating -- it runs A LOT FASTER than the stock one.  AFAIK, the
> fence_ilo does not use ssh, but a sort of web soap services call via https.
> We have seen in production and testing that a typical fencing operation
> using fence_ilo is 42-seconds, and a good percentage of time, up to twice as
> long as that.  The bladecenter fencing operations we have seen occur in
> under 7-seconds.
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/b70d9295/attachment.htm>

From sjpark at rondaandspencer.info  Mon Apr 13 21:23:54 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Mon, 13 Apr 2009 15:23:54 -0600
Subject: [Linux-cluster] conga nfs problems...
Message-ID: <c7a040fa0904131423o53b41388pc261994f79f88bae@mail.gmail.com>

I am trying to setup a cluster that has an NFS mount in addition to
additional services, but the NFS does not appear to want to work.  I am
using conga to admin the cluster.  I have mysql and apache as additional
services running on the cluster just fine.  Whenever I go to add an nfs
mount to my cluster it won't keep the export field.  I give it the info it
needs and the export always comes back blank.  If I try to add it within a
service on its own...it always returns an error...any idea?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/9fd1d93b/attachment.htm>

From lmb at suse.de  Mon Apr 13 22:53:53 2009
From: lmb at suse.de (Lars Marowsky-Bree)
Date: Tue, 14 Apr 2009 00:53:53 +0200
Subject: [Linux-cluster] Linux Plumbers Conference mini-conf on clustering?
Message-ID: <20090413225353.GF6147@suse.de>

Hi all,

what do you think of a half-day / day long miniconference on clustering
along LPC? (http://lwn.net/Articles/319215/)

It's a bit late, but if noone objects, I'll submit a proposal tomorrow.

What topics would you like to see discussed?

We probably don't have enough mass to justify our completely own summit,
but on-going integration, (lack of) progress since the meeting in
Prague, etc should provide enough meat for a miniconf.


Regards,
    Lars

-- 
Teamlead Kernel, SuSE Labs, Research and Development
SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG N?rnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde


From Joel.Becker at oracle.com  Mon Apr 13 23:31:35 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Mon, 13 Apr 2009 16:31:35 -0700
Subject: [Linux-cluster] Re: [Ocfs2-devel] Linux Plumbers Conference
	mini-conf on clustering?
In-Reply-To: <20090413225353.GF6147@suse.de>
References: <20090413225353.GF6147@suse.de>
Message-ID: <20090413233135.GC8778@mail.oracle.com>

On Tue, Apr 14, 2009 at 12:53:53AM +0200, Lars Marowsky-Bree wrote:
> what do you think of a half-day / day long miniconference on clustering
> along LPC? (http://lwn.net/Articles/319215/)
> 
> It's a bit late, but if noone objects, I'll submit a proposal tomorrow.
> 
> What topics would you like to see discussed?
> 
> We probably don't have enough mass to justify our completely own summit,
> but on-going integration, (lack of) progress since the meeting in
> Prague, etc should provide enough meat for a miniconf.

	I think its a great idea.  We actually have progress, with a lot
of work that we talked about in Prague starting to see the light of day
in STABLE3, so it would be good to nail that down.

Joel

-- 

Life's Little Instruction Book #313

	"Never underestimate the power of love."

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From Danny.Wall at health-first.org  Tue Apr 14 01:39:08 2009
From: Danny.Wall at health-first.org (Danny Wall)
Date: Mon, 13 Apr 2009 21:39:08 -0400
Subject: [Linux-cluster] GFS file size/performance question
Message-ID: <49E3B0FC020000C8000020FB@mail-int.health-first.org>

I realize that GFS is not the most optimized filesystem for a lot of small files, but at what point does that become a concern? Is it when you have a lot of files less than 1MB? Less than 10MB?

We have five Red Hat clusters. The original system is RHEL 4.4 with RHCS and GFS. Most filesystems are 2TB LUNs on a Fibre Channel SAN, with hundreds of thousands of folders (50,000 at root) and millions of files.  The files range from 60KB to about 5MB each, with 20-100 files in each folder. 

We are having a problem with the new cluster. After a while of being on a node, the performance is horrible until we migrate the users to a different node. There are no issues with RAM, CPU, disk or NIC IO. 

The new cluster servers are RHEL 5.1 with RHCS and GFS. There are currently 4 folders off root, with 5,000-7,000 sub-folders each. The folders generally hold approx. 10-100 files, ranging in size from 1k to 5MB. Several of these folders have 100 files all less than 1MB. The three servers in one cluster have a single 2TB FC LUN attached using GFS. Service to the files is generally only provided from one node at a time, except off hours during backups, so there should not be a lot of locking issues.

Both clusters are running samba that comes with the respective versions of RHEL, for WinXP and Win2003 workstations in a Win2003 AD domain.

Is it possible that GFS performance is worse on the newer, more powerful cluster nodes because there are so many files under 100k? At what point does GFS performance really start taking a hit due to smaller file sizes?

The newer servers all have 32GB RAM and 8 CPU cores.

Thanks
Danny


#####################################
This message is for the named person's use only.  It may 
contain private, proprietary, or legally privileged information.  
No privilege is waived or lost by any mistransmission.  If you 
receive this message in error, please immediately delete it and 
all copies of it from your system, destroy any hard copies of it, 
and notify the sender.  You must not, directly or indirectly, use, 
disclose, distribute, print, or copy any part of this message if you 
are not the intended recipient.  Health First reserves the right to 
monitor all e-mail communications through its networks.  Any views 
or opinions expressed in this message are solely those of the 
individual sender, except (1) where the message states such views 
or opinions are on behalf of a particular entity;  and (2) the sender 
is authorized by the entity to give such views or opinions.
#####################################
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/e19f495b/attachment.htm>

From Danny.Wall at health-first.org  Tue Apr 14 01:46:02 2009
From: Danny.Wall at health-first.org (Danny Wall)
Date: Mon, 13 Apr 2009 21:46:02 -0400
Subject: [Linux-cluster] GFS file size/performance question
Message-ID: <49E3B29A020000C80000210D@mail-int.health-first.org>

I realize that GFS is not the most optimized filesystem for a lot of small files, but at what point does that become a concern? Is it when you have a lot of files less than 1MB? Less than 10MB?

We have five Red Hat clusters. The original system is RHEL 4.4 with RHCS and GFS. Most filesystems are 2TB LUNs on a Fibre Channel SAN, with hundreds of thousands of folders (50,000 at root) and millions of files.  The files range from 60KB to about 5MB each, with 20-100 files in each folder. 

We are having a problem with the new cluster. After a while of being on a node, the performance is horrible until we migrate the users to a different node. There are no issues with RAM, CPU, disk or NIC IO. 

The new cluster servers are RHEL 5.1 with RHCS and GFS. There are currently 4 folders off root, with 5,000-7,000 sub-folders each. The folders generally hold approx. 10-100 files, ranging in size from 1k to 5MB. Several of these folders have 100 files all less than 1MB. The three servers in one cluster have a single 2TB FC LUN attached using GFS. Service to the files is generally only provided from one node at a time, except off hours during backups, so there should not be a lot of locking issues.

Both clusters are running samba that comes with the respective versions of RHEL, for WinXP and Win2003 workstations in a Win2003 AD domain.

Is it possible that GFS performance is worse on the newer, more powerful cluster nodes because there are so many files under 100k? At what point does GFS performance really start taking a hit due to smaller file sizes?

The newer servers all have 32GB RAM and 8 CPU cores.


#####################################
This message is for the named person's use only.  It may 
contain private, proprietary, or legally privileged information.  
No privilege is waived or lost by any mistransmission.  If you 
receive this message in error, please immediately delete it and 
all copies of it from your system, destroy any hard copies of it, 
and notify the sender.  You must not, directly or indirectly, use, 
disclose, distribute, print, or copy any part of this message if you 
are not the intended recipient.  Health First reserves the right to 
monitor all e-mail communications through its networks.  Any views 
or opinions expressed in this message are solely those of the 
individual sender, except (1) where the message states such views 
or opinions are on behalf of a particular entity;  and (2) the sender 
is authorized by the entity to give such views or opinions.
#####################################
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/3a5ac111/attachment.htm>

From jumanjiman at gmail.com  Tue Apr 14 02:16:58 2009
From: jumanjiman at gmail.com (Paul Morgan)
Date: Tue, 14 Apr 2009 02:16:58 +0000
Subject: [Linux-cluster] GFS file size/performance question
In-Reply-To: <49E3B29A020000C80000210D@mail-int.health-first.org>
References: <49E3B29A020000C80000210D@mail-int.health-first.org>
Message-ID: <117457784-1239675440-cardhu_decombobulator_blackberry.rim.net-2108675065-@bxe1239.bisx.prod.on.blackberry>

Top-posting due to blackberry...

Have you looked at your slab cache?
There was a memory leak long ago (rhel 4 timeframe iirc) that results in slab growing and growing and... 

Not saying this is the problem, but worth looking at on your slow node. Also, what happens if you migrate the users back to the original, slow node?

-paul
 
-----Original Message-----
From: "Danny Wall" <Danny.Wall at health-first.org>

Date: Mon, 13 Apr 2009 21:46:02 
To: <linux-cluster at redhat.com>
Subject: [Linux-cluster] GFS file size/performance question


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From jeff.sturm at eprize.com  Tue Apr 14 02:38:15 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Mon, 13 Apr 2009 22:38:15 -0400
Subject: [Linux-cluster] GFS file size/performance question
In-Reply-To: <49E3B29A020000C80000210D@mail-int.health-first.org>
References: <49E3B29A020000C80000210D@mail-int.health-first.org>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB83B@hugo.eprize.local>

Danny,
 
Have you tried any "gfs_tool settune" options with your setup?  Aside
from defaults, on GFS1 we use:
 
                        gfs_tool settune $fs demote_secs 7200
                        gfs_tool settune $fs glock_purge 50
                        gfs_tool settune $fs statfs_fast 1

You might want to experiment a bit, especially with the first two.  It's
hard to guess at the cause of the slowdown you are experiencing without
more information, but gfs_scand thrashing is one possibility.
 
There have been many prior articles posted on GFS performance.  Search
for e.g. "glock trimming patch".
 
If possible, mount your GFS filesystems with noatime as well.
 
Jeff


________________________________

	From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Danny Wall
	Sent: Monday, April 13, 2009 9:46 PM
	To: linux-cluster at redhat.com
	Subject: [Linux-cluster] GFS file size/performance question
	
	
	I realize that GFS is not the most optimized filesystem for a
lot of small files, but at what point does that become a concern? Is it
when you have a lot of files less than 1MB? Less than 10MB?
	
	We have five Red Hat clusters. The original system is RHEL 4.4
with RHCS and GFS. Most filesystems are 2TB LUNs on a Fibre Channel SAN,
with hundreds of thousands of folders (50,000 at root) and millions of
files.  The files range from 60KB to about 5MB each, with 20-100 files
in each folder. 
	
	We are having a problem with the new cluster. After a while of
being on a node, the performance is horrible until we migrate the users
to a different node. There are no issues with RAM, CPU, disk or NIC IO. 
	
	The new cluster servers are RHEL 5.1 with RHCS and GFS. There
are currently 4 folders off root, with 5,000-7,000 sub-folders each. The
folders generally hold approx. 10-100 files, ranging in size from 1k to
5MB. Several of these folders have 100 files all less than 1MB. The
three servers in one cluster have a single 2TB FC LUN attached using
GFS. Service to the files is generally only provided from one node at a
time, except off hours during backups, so there should not be a lot of
locking issues.
	
	Both clusters are running samba that comes with the respective
versions of RHEL, for WinXP and Win2003 workstations in a Win2003 AD
domain.
	
	Is it possible that GFS performance is worse on the newer, more
powerful cluster nodes because there are so many files under 100k? At
what point does GFS performance really start taking a hit due to smaller
file sizes?
	
	The newer servers all have 32GB RAM and 8 CPU cores.
	
	
	#####################################
	This message is for the named person's use only.  It may 
	contain private, proprietary, or legally privileged information.

	No privilege is waived or lost by any mistransmission.  If you 
	receive this message in error, please immediately delete it and 
	all copies of it from your system, destroy any hard copies of
it, 
	and notify the sender.  You must not, directly or indirectly,
use, 
	disclose, distribute, print, or copy any part of this message if
you 
	are not the intended recipient.  Health First reserves the right
to 
	monitor all e-mail communications through its networks.  Any
views 
	or opinions expressed in this message are solely those of the 
	individual sender, except (1) where the message states such
views 
	or opinions are on behalf of a particular entity;  and (2) the
sender 
	is authorized by the entity to give such views or opinions.
	#####################################

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090413/3410bcaf/attachment.htm>

From teigland at redhat.com  Tue Apr 14 15:46:58 2009
From: teigland at redhat.com (David Teigland)
Date: Tue, 14 Apr 2009 10:46:58 -0500
Subject: [Linux-cluster] GFS file size/performance question
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB83B@hugo.eprize.local>
References: <49E3B29A020000C80000210D@mail-int.health-first.org>
	<64D0546C5EBBD147B75DE133D798665F02FDB83B@hugo.eprize.local>
Message-ID: <20090414154657.GC17186@redhat.com>

On Mon, Apr 13, 2009 at 10:38:15PM -0400, Jeff Sturm wrote:

> gfs_tool settune $fs glock_purge 50

> There have been many prior articles posted on GFS performance.  Search
> for e.g. "glock trimming patch".

That purge/trimming thing was a hack for a very particular corner case (backup
process walking over the entire fs and caching all locks on the fs).  Enabling
it under other circumstances will most likely *hurt* your performance by
artificially reducing normal gfs caching.

Dave


From golharam at umdnj.edu  Tue Apr 14 16:17:44 2009
From: golharam at umdnj.edu (Ryan Golhar)
Date: Tue, 14 Apr 2009 12:17:44 -0400
Subject: [Linux-cluster] really reliable?
Message-ID: <49E4B728.8020906@umdnj.edu>

Hi all,

Is redhat cluster suite really reliable?  I've been having so much 
trouble getting a cluster up and running, I'm beginning to second guess 
my decision to use this software stack.

I have 3 nodes (eventually 10) running and set up.  The fencing method 
is by a brocade fibre switch.  The ultimate goal of this cluster is to 
shared a SAN connected by fibre.

I've installed just the bare minimum (before even getting to GFS) to 
test the cluster software.  Just starting cman cluster services fails on 
two of the nodes.

Even when I try to reboot the nodes, I can't because the whole system 
hangs on various processes that don't ever shut down.  I have to 
physically reboot these boxes.

The logs fill up with errors about not being able to connect to cman, etc.

I've been at it for awhile now and am not sure this is the best route 
anymore.

Ryan
-------------- next part --------------
A non-text attachment was scrubbed...
Name: golharam.vcf
Type: text/x-vcard
Size: 438 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/b5c278ae/attachment.vcf>

From finnzi at finnzi.com  Tue Apr 14 16:31:37 2009
From: finnzi at finnzi.com (=?ISO-8859-1?Q?Finnur_=D6rn_Gu=F0mundsson?=)
Date: Tue, 14 Apr 2009 16:31:37 +0000
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E4B728.8020906@umdnj.edu>
References: <49E4B728.8020906@umdnj.edu>
Message-ID: <49E4BA69.7030301@finnzi.com>

Ryan Golhar wrote:
> Hi all,
>
> Is redhat cluster suite really reliable?  I've been having so much 
> trouble getting a cluster up and running, I'm beginning to second 
> guess my decision to use this software stack.
>
> I have 3 nodes (eventually 10) running and set up.  The fencing method 
> is by a brocade fibre switch.  The ultimate goal of this cluster is to 
> shared a SAN connected by fibre.
>
> I've installed just the bare minimum (before even getting to GFS) to 
> test the cluster software.  Just starting cman cluster services fails 
> on two of the nodes.
>
> Even when I try to reboot the nodes, I can't because the whole system 
> hangs on various processes that don't ever shut down.  I have to 
> physically reboot these boxes.
>
> The logs fill up with errors about not being able to connect to cman, 
> etc.
>
> I've been at it for awhile now and am not sure this is the best route 
> anymore.
>
> Ryan
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
Hi Ryan,

I've got around 5 RHEL 5.{1 2 3} clusters running (3 of them are running 
Oracle, other NFS/MySQL) with out any issues at all. However all of 
these are 2 node clusters.

If you are having some issues you might want to talk to Red Hat 
regarding those and get them sorted (if you have a support contract 
anyway).

Most of my errors have been with iptables and mis-configured TKO for my 
Quorum heuristics ;)

Bgrds,
Finnur


From sjpark at rondaandspencer.info  Tue Apr 14 16:40:55 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 10:40:55 -0600
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E4BA69.7030301@finnzi.com>
References: <49E4B728.8020906@umdnj.edu> <49E4BA69.7030301@finnzi.com>
Message-ID: <c7a040fa0904140940y653870elc856f13fc843d9c4@mail.gmail.com>

I have a 2 node cluster dedicated to just mysql...the only problem I have is
when I want to add an NFS mount into the mix.  I just can't seem to get the
NFS mount to work properly.  When it does a status check it error out every
time on me and then stops.  Any insight into this?

2009/4/14 Finnur ?rn Gu?mundsson <finnzi at finnzi.com>

> Ryan Golhar wrote:
>
>> Hi all,
>>
>> Is redhat cluster suite really reliable?  I've been having so much trouble
>> getting a cluster up and running, I'm beginning to second guess my decision
>> to use this software stack.
>>
>> I have 3 nodes (eventually 10) running and set up.  The fencing method is
>> by a brocade fibre switch.  The ultimate goal of this cluster is to shared a
>> SAN connected by fibre.
>>
>> I've installed just the bare minimum (before even getting to GFS) to test
>> the cluster software.  Just starting cman cluster services fails on two of
>> the nodes.
>>
>> Even when I try to reboot the nodes, I can't because the whole system
>> hangs on various processes that don't ever shut down.  I have to physically
>> reboot these boxes.
>>
>> The logs fill up with errors about not being able to connect to cman, etc.
>>
>> I've been at it for awhile now and am not sure this is the best route
>> anymore.
>>
>> Ryan
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
> Hi Ryan,
>
> I've got around 5 RHEL 5.{1 2 3} clusters running (3 of them are running
> Oracle, other NFS/MySQL) with out any issues at all. However all of these
> are 2 node clusters.
>
> If you are having some issues you might want to talk to Red Hat regarding
> those and get them sorted (if you have a support contract anyway).
>
> Most of my errors have been with iptables and mis-configured TKO for my
> Quorum heuristics ;)
>
> Bgrds,
> Finnur
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/0c6dc137/attachment.htm>

From gordan at bobich.net  Tue Apr 14 16:44:07 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 14 Apr 2009 17:44:07 +0100
Subject: [Linux-cluster] really =?UTF-8?Q?reliable=3F?=
In-Reply-To: <49E4B728.8020906@umdnj.edu>
References: <49E4B728.8020906@umdnj.edu>
Message-ID: <6296bab6d0ecbbe92334969f026dc9f8@localhost>

What distro are you using? I've found that:

1) Distros other than RHEL/CentOS can be quirky when it comes to using
RHCS. I've even run into problems on Fedora more than once (not to mention
that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
production stable until last month - and we're up to FC10 now).

2) Starting RHCS components using anything except the intended init scripts
tends to cause problems.

3) Source of 99% of problems in the rest of the cases (i.e. not covered by
1) and 2) above) is incorrectly configured fencing.

Does your setup fall under either of the first two categories?
Have you verified beyond doubt that your fencing is configured correctly
and that the fencing script gets verification upon success?

Gordan

On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu> wrote:
> Hi all,
> 
> Is redhat cluster suite really reliable?  I've been having so much 
> trouble getting a cluster up and running, I'm beginning to second guess 
> my decision to use this software stack.
> 
> I have 3 nodes (eventually 10) running and set up.  The fencing method 
> is by a brocade fibre switch.  The ultimate goal of this cluster is to 
> shared a SAN connected by fibre.
> 
> I've installed just the bare minimum (before even getting to GFS) to 
> test the cluster software.  Just starting cman cluster services fails on 
> two of the nodes.
> 
> Even when I try to reboot the nodes, I can't because the whole system 
> hangs on various processes that don't ever shut down.  I have to 
> physically reboot these boxes.
> 
> The logs fill up with errors about not being able to connect to cman,
etc.
> 
> I've been at it for awhile now and am not sure this is the best route 
> anymore.
> 
> Ryan


From sjpark at rondaandspencer.info  Tue Apr 14 16:56:32 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 10:56:32 -0600
Subject: [Linux-cluster] really reliable?
In-Reply-To: <6296bab6d0ecbbe92334969f026dc9f8@localhost>
References: <49E4B728.8020906@umdnj.edu>
	<6296bab6d0ecbbe92334969f026dc9f8@localhost>
Message-ID: <c7a040fa0904140956r70bc72d6x91564260392442b0@mail.gmail.com>

My fencing is setup properly as all of my services start and stop
correctly.  No modifications with scripts and done according to redhat
standards.  I'm using RHEL.  The NFS share gets mounted properly, but when
it runs a status on the nfs mount it fails.  I have checked to see that it
is mounting it as needed(rw access, etc.).  This basically causes all of my
other services to fail since they rely on this nfs mount to be done first.
This is where all of my MySQL data is living at.  It also likes to mount the
nfs mount multiple times as well.  It never actually unmounts the share.

On Tue, Apr 14, 2009 at 10:44 AM, Gordan Bobic <gordan at bobich.net> wrote:

> What distro are you using? I've found that:
>
> 1) Distros other than RHEL/CentOS can be quirky when it comes to using
> RHCS. I've even run into problems on Fedora more than once (not to mention
> that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
> production stable until last month - and we're up to FC10 now).
>
> 2) Starting RHCS components using anything except the intended init scripts
> tends to cause problems.
>
> 3) Source of 99% of problems in the rest of the cases (i.e. not covered by
> 1) and 2) above) is incorrectly configured fencing.
>
> Does your setup fall under either of the first two categories?
> Have you verified beyond doubt that your fencing is configured correctly
> and that the fencing script gets verification upon success?
>
> Gordan
>
> On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu>
> wrote:
> > Hi all,
> >
> > Is redhat cluster suite really reliable?  I've been having so much
> > trouble getting a cluster up and running, I'm beginning to second guess
> > my decision to use this software stack.
> >
> > I have 3 nodes (eventually 10) running and set up.  The fencing method
> > is by a brocade fibre switch.  The ultimate goal of this cluster is to
> > shared a SAN connected by fibre.
> >
> > I've installed just the bare minimum (before even getting to GFS) to
> > test the cluster software.  Just starting cman cluster services fails on
> > two of the nodes.
> >
> > Even when I try to reboot the nodes, I can't because the whole system
> > hangs on various processes that don't ever shut down.  I have to
> > physically reboot these boxes.
> >
> > The logs fill up with errors about not being able to connect to cman,
> etc.
> >
> > I've been at it for awhile now and am not sure this is the best route
> > anymore.
> >
> > Ryan
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/61fab3f0/attachment.htm>

From vu at sivell.com  Tue Apr 14 17:03:08 2009
From: vu at sivell.com (vu pham)
Date: Tue, 14 Apr 2009 12:03:08 -0500
Subject: [Linux-cluster] really reliable?
In-Reply-To: <c7a040fa0904140956r70bc72d6x91564260392442b0@mail.gmail.com>
References: <49E4B728.8020906@umdnj.edu><6296bab6d0ecbbe92334969f026dc9f8@localhost>
	<c7a040fa0904140956r70bc72d6x91564260392442b0@mail.gmail.com>
Message-ID: <49E4C1CC.8090309@sivell.com>

Spencer Parker wrote:
> My fencing is setup properly as all of my services start and stop 
> correctly.  No modifications with scripts and done according to redhat 
> standards.  I'm using RHEL.  The NFS share gets mounted properly, but 
> when it runs a status on the nfs mount it fails.  I have checked to see 
> that it is mounting it as needed(rw access, etc.).  This basically 
> causes all of my other services to fail since they rely on this nfs 
> mount to be done first.  This is where all of my MySQL data is living 
> at.  It also likes to mount the nfs mount multiple times as well.  It 
> never actually unmounts the share.


Spencer,

Could you please post your problem in a separate thread ? I think it 
really confuses when having two different problems in one thread.
And more information such as log, if you use NFS over GFS ... will be 
more helpful in troubleshooting.


Thanks,

Vu

> 
> On Tue, Apr 14, 2009 at 10:44 AM, Gordan Bobic <gordan at bobich.net 
> <mailto:gordan at bobich.net>> wrote:
> 
>     What distro are you using? I've found that:
> 
>     1) Distros other than RHEL/CentOS can be quirky when it comes to using
>     RHCS. I've even run into problems on Fedora more than once (not to
>     mention
>     that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
>     production stable until last month - and we're up to FC10 now).
> 
>     2) Starting RHCS components using anything except the intended init
>     scripts
>     tends to cause problems.
> 
>     3) Source of 99% of problems in the rest of the cases (i.e. not
>     covered by
>     1) and 2) above) is incorrectly configured fencing.
> 
>     Does your setup fall under either of the first two categories?
>     Have you verified beyond doubt that your fencing is configured correctly
>     and that the fencing script gets verification upon success?
> 
>     Gordan
> 
>     On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu
>     <mailto:golharam at umdnj.edu>> wrote:
>      > Hi all,
>      >
>      > Is redhat cluster suite really reliable?  I've been having so much
>      > trouble getting a cluster up and running, I'm beginning to second
>     guess
>      > my decision to use this software stack.
>      >
>      > I have 3 nodes (eventually 10) running and set up.  The fencing
>     method
>      > is by a brocade fibre switch.  The ultimate goal of this cluster
>     is to
>      > shared a SAN connected by fibre.
>      >
>      > I've installed just the bare minimum (before even getting to GFS) to
>      > test the cluster software.  Just starting cman cluster services
>     fails on
>      > two of the nodes.
>      >
>      > Even when I try to reboot the nodes, I can't because the whole system
>      > hangs on various processes that don't ever shut down.  I have to
>      > physically reboot these boxes.
>      >
>      > The logs fill up with errors about not being able to connect to cman,
>     etc.
>      >
>      > I've been at it for awhile now and am not sure this is the best route
>      > anymore.
>      >
>      > Ryan
> 
>     --
>     Linux-cluster mailing list
>     Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From lmb at suse.de  Tue Apr 14 16:47:28 2009
From: lmb at suse.de (Lars Marowsky-Bree)
Date: Tue, 14 Apr 2009 18:47:28 +0200
Subject: [Linux-cluster] Re: [Openais] [Ocfs2-devel] Linux Plumbers
	Conference mini-conf on clustering?
In-Reply-To: <20090413233135.GC8778@mail.oracle.com>
References: <20090413225353.GF6147@suse.de>
	<20090413233135.GC8778@mail.oracle.com>
Message-ID: <20090414164728.GK6147@suse.de>

On 2009-04-13T16:31:35, Joel Becker <Joel.Becker at oracle.com> wrote:

> 	I think its a great idea.

Ok, all respondents were in favor, so I will submit a miniconf proposal.

> We actually have progress, with a lot
> of work that we talked about in Prague starting to see the light of day
> in STABLE3, so it would be good to nail that down.

Right, I know we have impressive progress in many(!) areas - the (lack
of) was only mentioned to note that we aren't done yet.


Regards,
    Lars

-- 
Teamlead Kernel, SuSE Labs, Research and Development
SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG N?rnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde


From teigland at redhat.com  Tue Apr 14 17:50:34 2009
From: teigland at redhat.com (David Teigland)
Date: Tue, 14 Apr 2009 12:50:34 -0500
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E4B728.8020906@umdnj.edu>
References: <49E4B728.8020906@umdnj.edu>
Message-ID: <20090414175034.GE17186@redhat.com>

On Tue, Apr 14, 2009 at 12:17:44PM -0400, Ryan Golhar wrote:
> Is redhat cluster suite really reliable?  I've been having so much 
> trouble getting a cluster up and running,

Problems getting a cluster up are common, usually come down to network issues,
and are very difficult to diagnose.  The cluster software produces almost
indecipherable errors and strange behaviors when the network isn't behaving as
expected.

My usual suggestion is to disable the cman init script, and just run
"ccsd; cman_tool join" on the nodes.  Then watch the output of
"cman_tool nodes", and "cman_tool status", observing how long it takes
the nodes to recognize each other.  Any delay over a few seconds for
a steady-state cluster membership to form, and you may have some network
problems.

To successfully administer a cluster, you really need to be proficient in
using cman_tool to start up, monitor and shut down the nodes.  The cman init
script does a bunch of things for you, which is great when everything is
working, but when something doesn't work the init script can leave a big
complicated mess that's impossible to sort out.


> I've installed just the bare minimum (before even getting to GFS) to 
> test the cluster software.  Just starting cman cluster services fails on 
> two of the nodes.

That's the right approach, but as mentioned above, you probably need to pare
things down to just using cman_tool if it's network problems at the root.

> Even when I try to reboot the nodes, I can't because the whole system 
> hangs on various processes that don't ever shut down.  I have to 
> physically reboot these boxes.

If something has gone wrong, it's often impossible to shutdown without a hard
reboot.  Even when things are working, rebooting can be a delicate task
because the system may easily be configured to stop things in the wrong order,
and one thing out of place can cause a wreck.

Dave


From sjpark at rondaandspencer.info  Tue Apr 14 18:13:35 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 12:13:35 -0600
Subject: [Linux-cluster] NFS mount problem..
Message-ID: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>

I am running a MySQL cluster using cluster services and I have one issue
when it comes to NFS.  The MySQl services run fine until add in an NFS
mount.  The NFS mount is where all of the MySQl databases live at.  I can
get the NFS share to mount properly on the cluster machines, but the log
files keep telling it errors out.  Once it errors out the service then
stops.  I have tried restarting the service, but that has it remounting the
share over the top of the old one.  It never unmounts the NFS share upon
failure.  I can manually mount it and it works fine...I can read-write to it
just fine...I have added in options and taken them away.  All of this
results in the same end.  Any ideas?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/1c710d1a/attachment.htm>

From vu at sivell.com  Tue Apr 14 18:23:06 2009
From: vu at sivell.com (vu pham)
Date: Tue, 14 Apr 2009 13:23:06 -0500
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
Message-ID: <49E4D48A.7060307@sivell.com>

Spencer Parker wrote:
> I am running a MySQL cluster using cluster services and I have one issue 
> when it comes to NFS.  The MySQl services run fine until add in an NFS 
> mount.  The NFS mount is where all of the MySQl databases live at.  I 
> can get the NFS share to mount properly on the cluster machines, but the 
> log files keep telling it errors out.  Once it errors out the service 
> then stops.  I have tried restarting the service, but that has it 
> remounting the share over the top of the old one.  It never unmounts the 
> NFS share upon failure.  I can manually mount it and it works fine...I 
> can read-write to it just fine...I have added in options and taken them 
> away.  All of this results in the same end.  Any ideas?

Spencer,

How do you share the NFS ? Do you use GSF ?  What are the error messages 
? Is the storage-device iscsi ?


Vu


From sjpark at rondaandspencer.info  Tue Apr 14 18:26:12 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 12:26:12 -0600
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <49E4D48A.7060307@sivell.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
	<49E4D48A.7060307@sivell.com>
Message-ID: <c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mail.gmail.com>

The NFS share is located on a NetApp box not running GFS.  The NFS share is
only there to share the database information for the MySQL resource.  The
failure comes when it goes to check the status of the NFS mount.

Jan  9 13:50:40 shadowhawk clurgmgrd[4212]: <notice> status on netfs
"mysql_data" returned 1 (generic error)

That is the error coming out of my log file.  It mounts the NFS share just
fine...and leaves it mounted as well.  When it checks the status it then
errors out.

On Tue, Apr 14, 2009 at 12:23 PM, vu pham <vu at sivell.com> wrote:

> Spencer Parker wrote:
>
>> I am running a MySQL cluster using cluster services and I have one issue
>> when it comes to NFS.  The MySQl services run fine until add in an NFS
>> mount.  The NFS mount is where all of the MySQl databases live at.  I can
>> get the NFS share to mount properly on the cluster machines, but the log
>> files keep telling it errors out.  Once it errors out the service then
>> stops.  I have tried restarting the service, but that has it remounting the
>> share over the top of the old one.  It never unmounts the NFS share upon
>> failure.  I can manually mount it and it works fine...I can read-write to it
>> just fine...I have added in options and taken them away.  All of this
>> results in the same end.  Any ideas?
>>
>
> Spencer,
>
> How do you share the NFS ? Do you use GSF ?  What are the error messages ?
> Is the storage-device iscsi ?
>
>
> Vu
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/f2f865f6/attachment.htm>

From vu at sivell.com  Tue Apr 14 18:30:55 2009
From: vu at sivell.com (vu pham)
Date: Tue, 14 Apr 2009 13:30:55 -0500
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mail.gmail.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com><49E4D48A.7060307@sivell.com>
	<c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mail.gmail.com>
Message-ID: <49E4D65F.2020905@sivell.com>


Spencer Parker wrote:
> The NFS share is located on a NetApp box not running GFS.  The NFS share 
> is only there to share the database information for the MySQL resource.  
> The failure comes when it goes to check the status of the NFS mount.
> 
> Jan  9 13:50:40 shadowhawk clurgmgrd[4212]: <notice> status on netfs 
> "mysql_data" returned 1 (generic error)
> 
> That is the error coming out of my log file.  It mounts the NFS share 
> just fine...and leaves it mounted as well.  When it checks the status it 
> then errors out.

What is your cluster.conf ?


> 
> On Tue, Apr 14, 2009 at 12:23 PM, vu pham <vu at sivell.com 
> <mailto:vu at sivell.com>> wrote:
> 
>     Spencer Parker wrote:
> 
>         I am running a MySQL cluster using cluster services and I have
>         one issue when it comes to NFS.  The MySQl services run fine
>         until add in an NFS mount.  The NFS mount is where all of the
>         MySQl databases live at.  I can get the NFS share to mount
>         properly on the cluster machines, but the log files keep telling
>         it errors out.  Once it errors out the service then stops.  I
>         have tried restarting the service, but that has it remounting
>         the share over the top of the old one.  It never unmounts the
>         NFS share upon failure.  I can manually mount it and it works
>         fine...I can read-write to it just fine...I have added in
>         options and taken them away.  All of this results in the same
>         end.  Any ideas?
> 
> 
>     Spencer,
> 
>     How do you share the NFS ? Do you use GSF ?  What are the error
>     messages ? Is the storage-device iscsi ?
> 
> 
>     Vu
> 
>     --
>     Linux-cluster mailing list
>     Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From sjpark at rondaandspencer.info  Tue Apr 14 18:41:18 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 12:41:18 -0600
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <49E4D65F.2020905@sivell.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
	<49E4D48A.7060307@sivell.com>
	<c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mail.gmail.com>
	<49E4D65F.2020905@sivell.com>
Message-ID: <c7a040fa0904141141t2d00b744s7f0ee0f5f5bb2210@mail.gmail.com>

<?xml version="1.0"?>
<cluster alias="cluster" config_version="43" name="cluster">
        <fence_daemon clean_start="0" post_fail_delay="0"
post_join_delay="3"/>
        <clusternodes>
                <clusternode name="shadowhawk" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="shadowhawk-ilo"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="darkhawk" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="darkhawk-ilo"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_ilo" hostname="darkhawk-ilo"
login="cluster" name="darkhawk-ilo" passwd="*******"/>
                <fencedevice agent="fence_ilo" hostname="shadowhawk-ilo"
login="cluster" name="shadowhawk-ilo" passwd="*******"/>
        </fencedevices>
        <rm log_level="7">
                <failoverdomains>
                        <failoverdomain name="failover" nofailback="0"
ordered="1" restricted="0">
                                <failoverdomainnode name="shadowhawk"
priority="1"/>
                                <failoverdomainnode name="darkhawk"
priority="2"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="10.10.200.25" monitor_link="1"/>
                        <script file="/etc/init.d/mysqld" name="mysqld"/>
                        <script file="/etc/init.d/httpd" name="httpd"/>
                        <netfs export="/vol/test_mysql/mysql"
exportpath="/vol/test_mysql/mysql" force_unmount="1" fstype="nfs"
host="netapp" mountpoint="/mnt/mysql/" name="mysql_data" nfstype="nfs"
options="defaults,rw,async,nfsvers=3,mountvers=3,proto=tcp"/>
                </resources>
                <service autostart="1" domain="failover" exclusive="0"
name="cluster" recovery="relocate">
                        <ip ref="10.10.200.25">
                                <script ref="mysqld"/>
                                <script ref="httpd"/>
                        </ip>
                </service>
                <service autostart="1" domain="failover" exclusive="0"
name="nfs" recovery="relocate">
                        <netfs ref="mysql_data"/>
                </service>
        </rm>
</cluster>


On Tue, Apr 14, 2009 at 12:30 PM, vu pham <vu at sivell.com> wrote:

>
> Spencer Parker wrote:
>
>> The NFS share is located on a NetApp box not running GFS.  The NFS share
>> is only there to share the database information for the MySQL resource.  The
>> failure comes when it goes to check the status of the NFS mount.
>>
>> Jan  9 13:50:40 shadowhawk clurgmgrd[4212]: <notice> status on netfs
>> "mysql_data" returned 1 (generic error)
>>
>> That is the error coming out of my log file.  It mounts the NFS share just
>> fine...and leaves it mounted as well.  When it checks the status it then
>> errors out.
>>
>
> What is your cluster.conf ?
>
>
>
>> On Tue, Apr 14, 2009 at 12:23 PM, vu pham <vu at sivell.com <mailto:
>> vu at sivell.com>> wrote:
>>
>>    Spencer Parker wrote:
>>
>>        I am running a MySQL cluster using cluster services and I have
>>        one issue when it comes to NFS.  The MySQl services run fine
>>        until add in an NFS mount.  The NFS mount is where all of the
>>        MySQl databases live at.  I can get the NFS share to mount
>>        properly on the cluster machines, but the log files keep telling
>>        it errors out.  Once it errors out the service then stops.  I
>>        have tried restarting the service, but that has it remounting
>>        the share over the top of the old one.  It never unmounts the
>>        NFS share upon failure.  I can manually mount it and it works
>>        fine...I can read-write to it just fine...I have added in
>>        options and taken them away.  All of this results in the same
>>        end.  Any ideas?
>>
>>
>>    Spencer,
>>
>>    How do you share the NFS ? Do you use GSF ?  What are the error
>>    messages ? Is the storage-device iscsi ?
>>
>>
>>    Vu
>>
>>    --
>>    Linux-cluster mailing list
>>    Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>>    https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>
>>
>> ------------------------------------------------------------------------
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/4d6603a1/attachment.htm>

From sjpark at rondaandspencer.info  Tue Apr 14 19:46:33 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 13:46:33 -0600
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <c7a040fa0904141141t2d00b744s7f0ee0f5f5bb2210@mail.gmail.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
	<49E4D48A.7060307@sivell.com>
	<c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mail.gmail.com>
	<49E4D65F.2020905@sivell.com>
	<c7a040fa0904141141t2d00b744s7f0ee0f5f5bb2210@mail.gmail.com>
Message-ID: <c7a040fa0904141246p3cc99cf1w297602139c25e84d@mail.gmail.com>

I found my problem.  It was the trailing slash on /mnt/mysql

On Tue, Apr 14, 2009 at 12:41 PM, Spencer Parker <
sjpark at rondaandspencer.info> wrote:

> <?xml version="1.0"?>
> <cluster alias="cluster" config_version="43" name="cluster">
>         <fence_daemon clean_start="0" post_fail_delay="0"
> post_join_delay="3"/>
>         <clusternodes>
>                 <clusternode name="shadowhawk" nodeid="1" votes="1">
>                         <fence>
>                                 <method name="1">
>                                         <device name="shadowhawk-ilo"/>
>                                 </method>
>                         </fence>
>                 </clusternode>
>                 <clusternode name="darkhawk" nodeid="2" votes="1">
>                         <fence>
>                                 <method name="1">
>                                         <device name="darkhawk-ilo"/>
>                                 </method>
>                         </fence>
>                 </clusternode>
>         </clusternodes>
>         <cman expected_votes="1" two_node="1"/>
>         <fencedevices>
>                 <fencedevice agent="fence_ilo" hostname="darkhawk-ilo"
> login="cluster" name="darkhawk-ilo" passwd="*******"/>
>                 <fencedevice agent="fence_ilo" hostname="shadowhawk-ilo"
> login="cluster" name="shadowhawk-ilo" passwd="*******"/>
>         </fencedevices>
>         <rm log_level="7">
>                 <failoverdomains>
>                         <failoverdomain name="failover" nofailback="0"
> ordered="1" restricted="0">
>                                 <failoverdomainnode name="shadowhawk"
> priority="1"/>
>                                 <failoverdomainnode name="darkhawk"
> priority="2"/>
>                         </failoverdomain>
>                 </failoverdomains>
>                 <resources>
>                         <ip address="10.10.200.25" monitor_link="1"/>
>                         <script file="/etc/init.d/mysqld" name="mysqld"/>
>                         <script file="/etc/init.d/httpd" name="httpd"/>
>                         <netfs export="/vol/test_mysql/mysql"
> exportpath="/vol/test_mysql/mysql" force_unmount="1" fstype="nfs"
> host="netapp" mountpoint="/mnt/mysql/" name="mysql_data" nfstype="nfs"
> options="defaults,rw,async,nfsvers=3,mountvers=3,proto=tcp"/>
>                 </resources>
>                 <service autostart="1" domain="failover" exclusive="0"
> name="cluster" recovery="relocate">
>                         <ip ref="10.10.200.25">
>                                 <script ref="mysqld"/>
>                                 <script ref="httpd"/>
>                         </ip>
>                 </service>
>                 <service autostart="1" domain="failover" exclusive="0"
> name="nfs" recovery="relocate">
>                         <netfs ref="mysql_data"/>
>                 </service>
>         </rm>
> </cluster>
>
>
>
> On Tue, Apr 14, 2009 at 12:30 PM, vu pham <vu at sivell.com> wrote:
>
>>
>> Spencer Parker wrote:
>>
>>> The NFS share is located on a NetApp box not running GFS.  The NFS share
>>> is only there to share the database information for the MySQL resource.  The
>>> failure comes when it goes to check the status of the NFS mount.
>>>
>>> Jan  9 13:50:40 shadowhawk clurgmgrd[4212]: <notice> status on netfs
>>> "mysql_data" returned 1 (generic error)
>>>
>>> That is the error coming out of my log file.  It mounts the NFS share
>>> just fine...and leaves it mounted as well.  When it checks the status it
>>> then errors out.
>>>
>>
>> What is your cluster.conf ?
>>
>>
>>
>>> On Tue, Apr 14, 2009 at 12:23 PM, vu pham <vu at sivell.com <mailto:
>>> vu at sivell.com>> wrote:
>>>
>>>    Spencer Parker wrote:
>>>
>>>        I am running a MySQL cluster using cluster services and I have
>>>        one issue when it comes to NFS.  The MySQl services run fine
>>>        until add in an NFS mount.  The NFS mount is where all of the
>>>        MySQl databases live at.  I can get the NFS share to mount
>>>        properly on the cluster machines, but the log files keep telling
>>>        it errors out.  Once it errors out the service then stops.  I
>>>        have tried restarting the service, but that has it remounting
>>>        the share over the top of the old one.  It never unmounts the
>>>        NFS share upon failure.  I can manually mount it and it works
>>>        fine...I can read-write to it just fine...I have added in
>>>        options and taken them away.  All of this results in the same
>>>        end.  Any ideas?
>>>
>>>
>>>    Spencer,
>>>
>>>    How do you share the NFS ? Do you use GSF ?  What are the error
>>>    messages ? Is the storage-device iscsi ?
>>>
>>>
>>>    Vu
>>>
>>>    --
>>>    Linux-cluster mailing list
>>>    Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>>>    https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>>
>>>
>>> ------------------------------------------------------------------------
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/c9e1fd8a/attachment.htm>

From vu at sivell.com  Tue Apr 14 20:39:36 2009
From: vu at sivell.com (vu pham)
Date: Tue, 14 Apr 2009 15:39:36 -0500
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <c7a040fa0904141246p3cc99cf1w297602139c25e84d@mail.gmail.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com><49	E4D48A.7060307@sivell.com><c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mai	l.gmail.com><49E4D65F.2020905@sivell.com><c7a040fa0904141141t2d00b744s7f0ee0f5f5bb2210@mail.gmail.com>
	<c7a040fa0904141246p3cc99cf1w297602139c25e84d@mail.gmail.com>
Message-ID: <49E4F488.6010709@sivell.com>


Spencer Parker wrote:
> I found my problem.  It was the trailing slash on /mnt/mysql

Glad that you found the problem. Just curious that your log didn't show 
any clear clue. Below is my log in a similar situation. It says pretty 
clear that there is something wrong in my mount parameters.

Jan 16 15:33:22 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs 
started
Jan 16 15:33:26 xen2vm1 clurgmgrd[1790]: <notice> status on netfs 
"nfsdata" returned 1 (generic error)
Jan 16 15:33:26 xen2vm1 clurgmgrd[1790]: <notice> Stopping service 
service:nfs
Jan 16 15:33:26 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs is 
recovering
Jan 16 15:33:27 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs is 
now running on member 1
Jan 16 15:33:36 xen2vm1 clurgmgrd[1790]: <notice> Recovering failed 
service service:nfs
Jan 16 15:33:37 xen2vm1 clurgmgrd: [1790]: <err> 'mount  -o 
sync,soft,noac 172.16.254.14:/data /mnt/nfsdata/' failed, error=32
Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <notice> start on netfs 
"nfsdata" returned 2 (invalid argument(s))
Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <warning> #68: Failed to start 
service:nfs; return value: 1
Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <notice> Stopping service 
service:nfs
Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs is 
recovering

Btw, I use RHEL 5.2, just plain 5.2 from the DVD without any RHN update.

Vu
> 
> On Tue, Apr 14, 2009 at 12:41 PM, Spencer Parker 
> <sjpark at rondaandspencer.info <mailto:sjpark at rondaandspencer.info>> wrote:
> 
>     <?xml version="1.0"?>
>     <cluster alias="cluster" config_version="43" name="cluster">
>             <fence_daemon clean_start="0" post_fail_delay="0"
>     post_join_delay="3"/>
>             <clusternodes>
>                     <clusternode name="shadowhawk" nodeid="1" votes="1">
>                             <fence>
>                                     <method name="1">
>                                             <device name="shadowhawk-ilo"/>
>                                     </method>
>                             </fence>
>                     </clusternode>
>                     <clusternode name="darkhawk" nodeid="2" votes="1">
>                             <fence>
>                                     <method name="1">
>                                             <device name="darkhawk-ilo"/>
>                                     </method>
>                             </fence>
>                     </clusternode>
>             </clusternodes>
>             <cman expected_votes="1" two_node="1"/>
>             <fencedevices>
>                     <fencedevice agent="fence_ilo"
>     hostname="darkhawk-ilo" login="cluster" name="darkhawk-ilo"
>     passwd="*******"/>
>                     <fencedevice agent="fence_ilo"
>     hostname="shadowhawk-ilo" login="cluster" name="shadowhawk-ilo"
>     passwd="*******"/>
>             </fencedevices>
>             <rm log_level="7">
>                     <failoverdomains>
>                             <failoverdomain name="failover"
>     nofailback="0" ordered="1" restricted="0">
>                                     <failoverdomainnode
>     name="shadowhawk" priority="1"/>
>                                     <failoverdomainnode name="darkhawk"
>     priority="2"/>
>                             </failoverdomain>
>                     </failoverdomains>
>                     <resources>
>                             <ip address="10.10.200.25" monitor_link="1"/>
>                             <script file="/etc/init.d/mysqld"
>     name="mysqld"/>
>                             <script file="/etc/init.d/httpd" name="httpd"/>
>                             <netfs export="/vol/test_mysql/mysql"
>     exportpath="/vol/test_mysql/mysql" force_unmount="1" fstype="nfs"
>     host="netapp" mountpoint="/mnt/mysql/" name="mysql_data"
>     nfstype="nfs"
>     options="defaults,rw,async,nfsvers=3,mountvers=3,proto=tcp"/>
>                     </resources>
>                     <service autostart="1" domain="failover"
>     exclusive="0" name="cluster" recovery="relocate">
>                             <ip ref="10.10.200.25">
>                                     <script ref="mysqld"/>
>                                     <script ref="httpd"/>
>                             </ip>
>                     </service>
>                     <service autostart="1" domain="failover"
>     exclusive="0" name="nfs" recovery="relocate">
>                             <netfs ref="mysql_data"/>
>                     </service>
>             </rm>
>     </cluster>
> 
> 
> 
>     On Tue, Apr 14, 2009 at 12:30 PM, vu pham <vu at sivell.com
>     <mailto:vu at sivell.com>> wrote:
> 
> 
>         Spencer Parker wrote:
> 
>             The NFS share is located on a NetApp box not running GFS.
>              The NFS share is only there to share the database
>             information for the MySQL resource.  The failure comes when
>             it goes to check the status of the NFS mount.
> 
>             Jan  9 13:50:40 shadowhawk clurgmgrd[4212]: <notice> status
>             on netfs "mysql_data" returned 1 (generic error)
> 
>             That is the error coming out of my log file.  It mounts the
>             NFS share just fine...and leaves it mounted as well.  When
>             it checks the status it then errors out.
> 
> 
>         What is your cluster.conf ?
> 
> 
> 
>             On Tue, Apr 14, 2009 at 12:23 PM, vu pham <vu at sivell.com
>             <mailto:vu at sivell.com> <mailto:vu at sivell.com
>             <mailto:vu at sivell.com>>> wrote:
> 
>                Spencer Parker wrote:
> 
>                    I am running a MySQL cluster using cluster services
>             and I have
>                    one issue when it comes to NFS.  The MySQl services
>             run fine
>                    until add in an NFS mount.  The NFS mount is where
>             all of the
>                    MySQl databases live at.  I can get the NFS share to
>             mount
>                    properly on the cluster machines, but the log files
>             keep telling
>                    it errors out.  Once it errors out the service then
>             stops.  I
>                    have tried restarting the service, but that has it
>             remounting
>                    the share over the top of the old one.  It never
>             unmounts the
>                    NFS share upon failure.  I can manually mount it and
>             it works
>                    fine...I can read-write to it just fine...I have added in
>                    options and taken them away.  All of this results in
>             the same
>                    end.  Any ideas?
> 
> 
>                Spencer,
> 
>                How do you share the NFS ? Do you use GSF ?  What are the
>             error
>                messages ? Is the storage-device iscsi ?
> 
> 
>                Vu
> 
>                --
>                Linux-cluster mailing list
>                Linux-cluster at redhat.com
>             <mailto:Linux-cluster at redhat.com>
>             <mailto:Linux-cluster at redhat.com
>             <mailto:Linux-cluster at redhat.com>>
> 
>                https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
>             ------------------------------------------------------------------------
> 
> 
>             --
>             Linux-cluster mailing list
>             Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>             https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>         https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From sjpark at rondaandspencer.info  Tue Apr 14 20:47:19 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Tue, 14 Apr 2009 14:47:19 -0600
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <49E4F488.6010709@sivell.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
	<49E4D65F.2020905@sivell.com>
	<c7a040fa0904141141t2d00b744s7f0ee0f5f5bb2210@mail.gmail.com>
	<c7a040fa0904141246p3cc99cf1w297602139c25e84d@mail.gmail.com>
	<49E4F488.6010709@sivell.com>
Message-ID: <c7a040fa0904141347n70246derff4d88df6d15cb50@mail.gmail.com>

I did kick up the logging on the node and it did give me that same info.
The way figured it out was by modifying the netfs.sh script to log what it
was looking for and what it was actually finding.  It was looking for
/mnt/mysql/ not /mnt/mysql.  This is according to the awk statement at the
end of the isMounted function in that script.  Once I found out what it was
looking for, I was able to get it to work.  This sounds like a bug to me
more than anything.

On Tue, Apr 14, 2009 at 2:39 PM, vu pham <vu at sivell.com> wrote:

>
>
> Spencer Parker wrote:
>
>> I found my problem.  It was the trailing slash on /mnt/mysql
>>
>
> Glad that you found the problem. Just curious that your log didn't show any
> clear clue. Below is my log in a similar situation. It says pretty clear
> that there is something wrong in my mount parameters.
>
> Jan 16 15:33:22 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs
> started
> Jan 16 15:33:26 xen2vm1 clurgmgrd[1790]: <notice> status on netfs "nfsdata"
> returned 1 (generic error)
> Jan 16 15:33:26 xen2vm1 clurgmgrd[1790]: <notice> Stopping service
> service:nfs
> Jan 16 15:33:26 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs is
> recovering
> Jan 16 15:33:27 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs is
> now running on member 1
> Jan 16 15:33:36 xen2vm1 clurgmgrd[1790]: <notice> Recovering failed service
> service:nfs
> Jan 16 15:33:37 xen2vm1 clurgmgrd: [1790]: <err> 'mount  -o sync,soft,noac
> 172.16.254.14:/data /mnt/nfsdata/' failed, error=32
> Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <notice> start on netfs "nfsdata"
> returned 2 (invalid argument(s))
> Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <warning> #68: Failed to start
> service:nfs; return value: 1
> Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <notice> Stopping service
> service:nfs
> Jan 16 15:33:37 xen2vm1 clurgmgrd[1790]: <notice> Service service:nfs is
> recovering
>
> Btw, I use RHEL 5.2, just plain 5.2 from the DVD without any RHN update.
>
> Vu
>
>>
>> On Tue, Apr 14, 2009 at 12:41 PM, Spencer Parker <
>> sjpark at rondaandspencer.info <mailto:sjpark at rondaandspencer.info>> wrote:
>>
>>    <?xml version="1.0"?>
>>    <cluster alias="cluster" config_version="43" name="cluster">
>>            <fence_daemon clean_start="0" post_fail_delay="0"
>>    post_join_delay="3"/>
>>            <clusternodes>
>>                    <clusternode name="shadowhawk" nodeid="1" votes="1">
>>                            <fence>
>>                                    <method name="1">
>>                                            <device name="shadowhawk-ilo"/>
>>                                    </method>
>>                            </fence>
>>                    </clusternode>
>>                    <clusternode name="darkhawk" nodeid="2" votes="1">
>>                            <fence>
>>                                    <method name="1">
>>                                            <device name="darkhawk-ilo"/>
>>                                    </method>
>>                            </fence>
>>                    </clusternode>
>>            </clusternodes>
>>            <cman expected_votes="1" two_node="1"/>
>>            <fencedevices>
>>                    <fencedevice agent="fence_ilo"
>>    hostname="darkhawk-ilo" login="cluster" name="darkhawk-ilo"
>>    passwd="*******"/>
>>                    <fencedevice agent="fence_ilo"
>>    hostname="shadowhawk-ilo" login="cluster" name="shadowhawk-ilo"
>>    passwd="*******"/>
>>            </fencedevices>
>>            <rm log_level="7">
>>                    <failoverdomains>
>>                            <failoverdomain name="failover"
>>    nofailback="0" ordered="1" restricted="0">
>>                                    <failoverdomainnode
>>    name="shadowhawk" priority="1"/>
>>                                    <failoverdomainnode name="darkhawk"
>>    priority="2"/>
>>                            </failoverdomain>
>>                    </failoverdomains>
>>                    <resources>
>>                            <ip address="10.10.200.25" monitor_link="1"/>
>>                            <script file="/etc/init.d/mysqld"
>>    name="mysqld"/>
>>                            <script file="/etc/init.d/httpd" name="httpd"/>
>>                            <netfs export="/vol/test_mysql/mysql"
>>    exportpath="/vol/test_mysql/mysql" force_unmount="1" fstype="nfs"
>>    host="netapp" mountpoint="/mnt/mysql/" name="mysql_data"
>>    nfstype="nfs"
>>    options="defaults,rw,async,nfsvers=3,mountvers=3,proto=tcp"/>
>>                    </resources>
>>                    <service autostart="1" domain="failover"
>>    exclusive="0" name="cluster" recovery="relocate">
>>                            <ip ref="10.10.200.25">
>>                                    <script ref="mysqld"/>
>>                                    <script ref="httpd"/>
>>                            </ip>
>>                    </service>
>>                    <service autostart="1" domain="failover"
>>    exclusive="0" name="nfs" recovery="relocate">
>>                            <netfs ref="mysql_data"/>
>>                    </service>
>>            </rm>
>>    </cluster>
>>
>>
>>
>>    On Tue, Apr 14, 2009 at 12:30 PM, vu pham <vu at sivell.com
>>    <mailto:vu at sivell.com>> wrote:
>>
>>
>>        Spencer Parker wrote:
>>
>>            The NFS share is located on a NetApp box not running GFS.
>>             The NFS share is only there to share the database
>>            information for the MySQL resource.  The failure comes when
>>            it goes to check the status of the NFS mount.
>>
>>            Jan  9 13:50:40 shadowhawk clurgmgrd[4212]: <notice> status
>>            on netfs "mysql_data" returned 1 (generic error)
>>
>>            That is the error coming out of my log file.  It mounts the
>>            NFS share just fine...and leaves it mounted as well.  When
>>            it checks the status it then errors out.
>>
>>
>>        What is your cluster.conf ?
>>
>>
>>
>>            On Tue, Apr 14, 2009 at 12:23 PM, vu pham <vu at sivell.com
>>            <mailto:vu at sivell.com> <mailto:vu at sivell.com
>>
>>            <mailto:vu at sivell.com>>> wrote:
>>
>>               Spencer Parker wrote:
>>
>>                   I am running a MySQL cluster using cluster services
>>            and I have
>>                   one issue when it comes to NFS.  The MySQl services
>>            run fine
>>                   until add in an NFS mount.  The NFS mount is where
>>            all of the
>>                   MySQl databases live at.  I can get the NFS share to
>>            mount
>>                   properly on the cluster machines, but the log files
>>            keep telling
>>                   it errors out.  Once it errors out the service then
>>            stops.  I
>>                   have tried restarting the service, but that has it
>>            remounting
>>                   the share over the top of the old one.  It never
>>            unmounts the
>>                   NFS share upon failure.  I can manually mount it and
>>            it works
>>                   fine...I can read-write to it just fine...I have added
>> in
>>                   options and taken them away.  All of this results in
>>            the same
>>                   end.  Any ideas?
>>
>>
>>               Spencer,
>>
>>               How do you share the NFS ? Do you use GSF ?  What are the
>>            error
>>               messages ? Is the storage-device iscsi ?
>>
>>
>>               Vu
>>
>>               --
>>               Linux-cluster mailing list
>>               Linux-cluster at redhat.com
>>            <mailto:Linux-cluster at redhat.com>
>>            <mailto:Linux-cluster at redhat.com
>>            <mailto:Linux-cluster at redhat.com>>
>>
>>               https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>
>>
>>
>>  ------------------------------------------------------------------------
>>
>>
>>            --
>>            Linux-cluster mailing list
>>            Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>>            https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>
>>        --
>>        Linux-cluster mailing list
>>        Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>>        https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>
>>
>>
>> ------------------------------------------------------------------------
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/bf3deb5c/attachment.htm>

From golharam at umdnj.edu  Tue Apr 14 21:01:24 2009
From: golharam at umdnj.edu (Ryan Golhar)
Date: Tue, 14 Apr 2009 17:01:24 -0400
Subject: [Linux-cluster] really reliable?
In-Reply-To: <6296bab6d0ecbbe92334969f026dc9f8@localhost>
References: <49E4B728.8020906@umdnj.edu>
	<6296bab6d0ecbbe92334969f026dc9f8@localhost>
Message-ID: <49E4F9A4.4060306@umdnj.edu>

I'm running RHEL 5.3 64-bit.  So far, I only want to see that the
cluster can run.  I'll worry about getting GFS after I'm confident this
works.

I've got three nodes: pico, vail, and whistler.  They each have two NIC
cards, one that provides a public IP address, and another that provides
private communications.  All cluster traffic will go over the private
network, 192.168.20.0.

I've installed only the following components:
system-config-cluster-1.0.52-1.1, cman-2.0.98-1, and rgmanager-2.0.38-2.

I've created my cluster.conf file to include these three nodees and
fence them using a brocade fibre switch (for GFS).

When I start the cluster services on all 3 nodes using the manually 
method of:

/sbin/ccsd; /usr/sbin/cman_tool join

The nodes successfully form a cluster.  I am able to leave the cluster 
and kill ccsd as well.

If I try to start the cman service I see:

[root at pico cluster]# /sbin/service cman start
Starting cluster:
    Loading modules... done
    Mounting configfs... done
    Starting ccsd... done
    Starting cman... done
    Starting daemons... done
    Starting fencing...


And it just hangs.  I know my fencing is set up correctly because I've 
had nodes fence other nodes before (when I was trying with 6 members). 
If I let it sit for long enough sometimes it finishes successfully.  I'm 
not sure what its doing because fence_tool is called and its a binary...

Ryan


Gordan Bobic wrote:
> What distro are you using? I've found that:
> 
> 1) Distros other than RHEL/CentOS can be quirky when it comes to using
> RHCS. I've even run into problems on Fedora more than once (not to mention
> that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
> production stable until last month - and we're up to FC10 now).
> 
> 2) Starting RHCS components using anything except the intended init scripts
> tends to cause problems.
> 
> 3) Source of 99% of problems in the rest of the cases (i.e. not covered by
> 1) and 2) above) is incorrectly configured fencing.
> 
> Does your setup fall under either of the first two categories?
> Have you verified beyond doubt that your fencing is configured correctly
> and that the fencing script gets verification upon success?
> 
> Gordan
> 
> On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu> wrote:
>> Hi all,
>>
>> Is redhat cluster suite really reliable?  I've been having so much 
>> trouble getting a cluster up and running, I'm beginning to second guess 
>> my decision to use this software stack.
>>
>> I have 3 nodes (eventually 10) running and set up.  The fencing method 
>> is by a brocade fibre switch.  The ultimate goal of this cluster is to 
>> shared a SAN connected by fibre.
>>
>> I've installed just the bare minimum (before even getting to GFS) to 
>> test the cluster software.  Just starting cman cluster services fails on 
>> two of the nodes.
>>
>> Even when I try to reboot the nodes, I can't because the whole system 
>> hangs on various processes that don't ever shut down.  I have to 
>> physically reboot these boxes.
>>
>> The logs fill up with errors about not being able to connect to cman,
> etc.
>> I've been at it for awhile now and am not sure this is the best route 
>> anymore.
>>
>> Ryan
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: golharam.vcf
Type: text/x-vcard
Size: 438 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090414/c1437c1f/attachment.vcf>

From gordan at bobich.net  Tue Apr 14 21:48:35 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 14 Apr 2009 22:48:35 +0100
Subject: [Linux-cluster] really reliable?
In-Reply-To: <c7a040fa0904140956r70bc72d6x91564260392442b0@mail.gmail.com>
References: <49E4B728.8020906@umdnj.edu>	<6296bab6d0ecbbe92334969f026dc9f8@localhost>
	<c7a040fa0904140956r70bc72d6x91564260392442b0@mail.gmail.com>
Message-ID: <49E504B3.50603@bobich.net>

Spencer Parker wrote:
> My fencing is setup properly as all of my services start and stop 
> correctly.

That doesn't follow. Your fencing doesn't get tested until it's needed. 
Have you verified that when you yank the power cable on one of the nodes 
that the fencing gets completed successfully? The cluster will lock up 
until fencing is successful, completed, and verified.

> No modifications with scripts and done according to redhat 
> standards.  I'm using RHEL.  The NFS share gets mounted properly, but 
> when it runs a status on the nfs mount it fails.  I have checked to see 
> that it is mounting it as needed(rw access, etc.).  This basically 
> causes all of my other services to fail since they rely on this nfs 
> mount to be done first.  This is where all of my MySQL data is living 
> at.  It also likes to mount the nfs mount multiple times as well.  It 
> never actually unmounts the share.

Have you tried mounting NFS via UDP?

Gordan


From gordan at bobich.net  Tue Apr 14 21:52:03 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 14 Apr 2009 22:52:03 +0100
Subject: [Linux-cluster] really reliable?
In-Reply-To: <20090414175034.GE17186@redhat.com>
References: <49E4B728.8020906@umdnj.edu> <20090414175034.GE17186@redhat.com>
Message-ID: <49E50583.6060202@bobich.net>

David Teigland wrote:

>> Even when I try to reboot the nodes, I can't because the whole system 
>> hangs on various processes that don't ever shut down.  I have to 
>> physically reboot these boxes.
> 
> If something has gone wrong, it's often impossible to shutdown without a hard
> reboot.  Even when things are working, rebooting can be a delicate task
> because the system may easily be configured to stop things in the wrong order,
> and one thing out of place can cause a wreck.

If things don't come down in the correct order using the standard init 
scripts, you should file a bug report about this. I've never seen it 
happen on any of my clusters.

Gordan


From sdake at redhat.com  Tue Apr 14 22:22:04 2009
From: sdake at redhat.com (Steven Dake)
Date: Tue, 14 Apr 2009 15:22:04 -0700
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E50583.6060202@bobich.net>
References: <49E4B728.8020906@umdnj.edu> <20090414175034.GE17186@redhat.com>
	<49E50583.6060202@bobich.net>
Message-ID: <1239747724.29777.8.camel@sdake-laptop>

On Tue, 2009-04-14 at 22:52 +0100, Gordan Bobic wrote:
> David Teigland wrote:
> 
> >> Even when I try to reboot the nodes, I can't because the whole system 
> >> hangs on various processes that don't ever shut down.  I have to 
> >> physically reboot these boxes.
> > 
> > If something has gone wrong, it's often impossible to shutdown without a hard
> > reboot.  Even when things are working, rebooting can be a delicate task
> > because the system may easily be configured to stop things in the wrong order,
> > and one thing out of place can cause a wreck.
> 
> If things don't come down in the correct order using the standard init 
> scripts, you should file a bug report about this. I've never seen it 
> happen on any of my clusters.
> 

I believe he means the administrator makes a configuration change to the
init scripts which result in a shutdown not working.  RHCS amplifies
this problem somewhat so more care must be taken in these situations.

regards
-steve

> Gordan
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From golharam at umdnj.edu  Tue Apr 14 23:26:34 2009
From: golharam at umdnj.edu (Ryan Golhar)
Date: Tue, 14 Apr 2009 19:26:34 -0400
Subject: [Linux-cluster] really reliable?
In-Reply-To: <1239747724.29777.8.camel@sdake-laptop>
References: <49E4B728.8020906@umdnj.edu> <20090414175034.GE17186@redhat.com>
	<49E50583.6060202@bobich.net> <1239747724.29777.8.camel@sdake-laptop>
Message-ID: <49E51BAA.4090402@umdnj.edu>

No no no, I haven't modified any of the init files...these are out of 
the box...

Steven Dake wrote:
> On Tue, 2009-04-14 at 22:52 +0100, Gordan Bobic wrote:
>> David Teigland wrote:
>>
>>>> Even when I try to reboot the nodes, I can't because the whole system 
>>>> hangs on various processes that don't ever shut down.  I have to 
>>>> physically reboot these boxes.
>>> If something has gone wrong, it's often impossible to shutdown without a hard
>>> reboot.  Even when things are working, rebooting can be a delicate task
>>> because the system may easily be configured to stop things in the wrong order,
>>> and one thing out of place can cause a wreck.
>> If things don't come down in the correct order using the standard init 
>> scripts, you should file a bug report about this. I've never seen it 
>> happen on any of my clusters.
>>
> 
> I believe he means the administrator makes a configuration change to the
> init scripts which result in a shutdown not working.  RHCS amplifies
> this problem somewhat so more care must be taken in these situations.
> 
> regards
> -steve
> 
>> Gordan
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From vu at sivell.com  Wed Apr 15 01:16:43 2009
From: vu at sivell.com (Vu Pham)
Date: Tue, 14 Apr 2009 20:16:43 -0500
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E4F9A4.4060306@umdnj.edu>
References: <49E4B728.8020906@umdnj.edu><6296bab6d0ecbbe92334969f026dc9f8@localhost>
	<49E4F9A4.4060306@umdnj.edu>
Message-ID: <49E5357B.8050503@sivell.com>

Ryan Golhar wrote:
> I'm running RHEL 5.3 64-bit.  So far, I only want to see that the
> cluster can run.  I'll worry about getting GFS after I'm confident this
> works.
> 
> I've got three nodes: pico, vail, and whistler.  They each have two NIC
> cards, one that provides a public IP address, and another that provides
> private communications.  All cluster traffic will go over the private
> network, 192.168.20.0.
> 
> I've installed only the following components:
> system-config-cluster-1.0.52-1.1, cman-2.0.98-1, and rgmanager-2.0.38-2.
> 
> I've created my cluster.conf file to include these three nodees and
> fence them using a brocade fibre switch (for GFS).
> 
> When I start the cluster services on all 3 nodes using the manually 
> method of:
> 
> /sbin/ccsd; /usr/sbin/cman_tool join
> 
> The nodes successfully form a cluster.  I am able to leave the cluster 
> and kill ccsd as well.
> 
> If I try to start the cman service I see:
> 
> [root at pico cluster]# /sbin/service cman start
> Starting cluster:
>    Loading modules... done
>    Mounting configfs... done
>    Starting ccsd... done
>    Starting cman... done
>    Starting daemons... done
>    Starting fencing...
> 
> 
> And it just hangs.  I know my fencing is set up correctly because I've 
> had nodes fence other nodes before (when I was trying with 6 members). 
> If I let it sit for long enough sometimes it finishes successfully.  I'm 
> not sure what its doing because fence_tool is called and its a binary...
> 

Ryan,

Anything suspicious in the log when it hangs at fencing ?
Could you show your cluster.conf ?

Vu

> Ryan
> 
> 
> Gordan Bobic wrote:
>> What distro are you using? I've found that:
>>
>> 1) Distros other than RHEL/CentOS can be quirky when it comes to using
>> RHCS. I've even run into problems on Fedora more than once (not to 
>> mention
>> that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
>> production stable until last month - and we're up to FC10 now).
>>
>> 2) Starting RHCS components using anything except the intended init 
>> scripts
>> tends to cause problems.
>>
>> 3) Source of 99% of problems in the rest of the cases (i.e. not 
>> covered by
>> 1) and 2) above) is incorrectly configured fencing.
>>
>> Does your setup fall under either of the first two categories?
>> Have you verified beyond doubt that your fencing is configured correctly
>> and that the fencing script gets verification upon success?
>>
>> Gordan
>>
>> On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu> 
>> wrote:
>>> Hi all,
>>>
>>> Is redhat cluster suite really reliable?  I've been having so much 
>>> trouble getting a cluster up and running, I'm beginning to second 
>>> guess my decision to use this software stack.
>>>
>>> I have 3 nodes (eventually 10) running and set up.  The fencing 
>>> method is by a brocade fibre switch.  The ultimate goal of this 
>>> cluster is to shared a SAN connected by fibre.
>>>
>>> I've installed just the bare minimum (before even getting to GFS) to 
>>> test the cluster software.  Just starting cman cluster services fails 
>>> on two of the nodes.
>>>
>>> Even when I try to reboot the nodes, I can't because the whole system 
>>> hangs on various processes that don't ever shut down.  I have to 
>>> physically reboot these boxes.
>>>
>>> The logs fill up with errors about not being able to connect to cman,
>> etc.
>>> I've been at it for awhile now and am not sure this is the best route 
>>> anymore.
>>>
>>> Ryan
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From sdake at redhat.com  Wed Apr 15 01:19:59 2009
From: sdake at redhat.com (Steven Dake)
Date: Tue, 14 Apr 2009 18:19:59 -0700
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E5357B.8050503@sivell.com>
References: <49E4B728.8020906@umdnj.edu>
	<6296bab6d0ecbbe92334969f026dc9f8@localhost>
	<49E4F9A4.4060306@umdnj.edu> <49E5357B.8050503@sivell.com>
Message-ID: <1239758399.29777.22.camel@sdake-laptop>

On Tue, 2009-04-14 at 20:16 -0500, Vu Pham wrote:
> Ryan Golhar wrote:
> > I'm running RHEL 5.3 64-bit.  So far, I only want to see that the
> > cluster can run.  I'll worry about getting GFS after I'm confident this
> > works.
> > 
> > I've got three nodes: pico, vail, and whistler.  They each have two NIC
> > cards, one that provides a public IP address, and another that provides
> > private communications.  All cluster traffic will go over the private
> > network, 192.168.20.0.
> > 
> > I've installed only the following components:
> > system-config-cluster-1.0.52-1.1, cman-2.0.98-1, and rgmanager-2.0.38-2.
> > 
> > I've created my cluster.conf file to include these three nodees and
> > fence them using a brocade fibre switch (for GFS).
> > 
> > When I start the cluster services on all 3 nodes using the manually 
> > method of:
> > 
> > /sbin/ccsd; /usr/sbin/cman_tool join
> > 
> > The nodes successfully form a cluster.  I am able to leave the cluster 
> > and kill ccsd as well.
> > 
> > If I try to start the cman service I see:
> > 
> > [root at pico cluster]# /sbin/service cman start
> > Starting cluster:
> >    Loading modules... done
> >    Mounting configfs... done
> >    Starting ccsd... done
> >    Starting cman... done
> >    Starting daemons... done
> >    Starting fencing...
> > 
> > 
> > And it just hangs.  I know my fencing is set up correctly because I've 
> > had nodes fence other nodes before (when I was trying with 6 members). 
> > If I let it sit for long enough sometimes it finishes successfully.  I'm 
> > not sure what its doing because fence_tool is called and its a binary...
> > 
> 
> Ryan,
> 
> Anything suspicious in the log when it hangs at fencing ?
> Could you show your cluster.conf ?
> 

A hang in fencing may indicate that the cluster does not have quorum.
run cman_tool nodes to see a list of nodes and see if half+1 are in the
cluster.

Regards
-steve

> Vu
> 
> > Ryan
> > 
> > 
> > Gordan Bobic wrote:
> >> What distro are you using? I've found that:
> >>
> >> 1) Distros other than RHEL/CentOS can be quirky when it comes to using
> >> RHCS. I've even run into problems on Fedora more than once (not to 
> >> mention
> >> that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
> >> production stable until last month - and we're up to FC10 now).
> >>
> >> 2) Starting RHCS components using anything except the intended init 
> >> scripts
> >> tends to cause problems.
> >>
> >> 3) Source of 99% of problems in the rest of the cases (i.e. not 
> >> covered by
> >> 1) and 2) above) is incorrectly configured fencing.
> >>
> >> Does your setup fall under either of the first two categories?
> >> Have you verified beyond doubt that your fencing is configured correctly
> >> and that the fencing script gets verification upon success?
> >>
> >> Gordan
> >>
> >> On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu> 
> >> wrote:
> >>> Hi all,
> >>>
> >>> Is redhat cluster suite really reliable?  I've been having so much 
> >>> trouble getting a cluster up and running, I'm beginning to second 
> >>> guess my decision to use this software stack.
> >>>
> >>> I have 3 nodes (eventually 10) running and set up.  The fencing 
> >>> method is by a brocade fibre switch.  The ultimate goal of this 
> >>> cluster is to shared a SAN connected by fibre.
> >>>
> >>> I've installed just the bare minimum (before even getting to GFS) to 
> >>> test the cluster software.  Just starting cman cluster services fails 
> >>> on two of the nodes.
> >>>
> >>> Even when I try to reboot the nodes, I can't because the whole system 
> >>> hangs on various processes that don't ever shut down.  I have to 
> >>> physically reboot these boxes.
> >>>
> >>> The logs fill up with errors about not being able to connect to cman,
> >> etc.
> >>> I've been at it for awhile now and am not sure this is the best route 
> >>> anymore.
> >>>
> >>> Ryan
> >>
> >> -- 
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From mgrac at redhat.com  Wed Apr 15 07:45:25 2009
From: mgrac at redhat.com (Marek Grac)
Date: Wed, 15 Apr 2009 09:45:25 +0200
Subject: [Linux-cluster] Fenced failing continuously
In-Reply-To: <1239646080.3339.54.camel@WSBID06223.bidmc.harvard.edu>
References: <36df569a0904091707t684f13f5s564f8f0a65d4a361@mail.gmail.com>	<6D2741530D9041139E81EFDCCF9EE46B@Desktop>	<36df569a0904130919jd1862dct313bc309ce789bd7@mail.gmail.com>
	<1239646080.3339.54.camel@WSBID06223.bidmc.harvard.edu>
Message-ID: <49E59095.7010409@redhat.com>

Hi,

Robert Hurst wrot
>
> There is user-contributed fence_ilo patch listed somewhere in this 
> list worth investigating -- it runs A LOT FASTER than the stock one.  
> AFAIK, the fence_ilo does not use ssh, but a sort of web soap services 
> call via https.  We have seen in production and testing that a typical 
> fencing operation using fence_ilo is 42-seconds, and a good percentage 
> of time, up to twice as long as that.  The bladecenter fencing 
> operations we have seen occur in under 7-seconds.
>
Fence_ilo script available in newer updates is in python as it does not 
have these problems. If you have one, please report it into bugzilla.

Fence_ilo is not using ssh as there were problems with several firmware 
version and their ability to do hard-reboot. Their RIBCL (http/ssl) 
language did not have such problems.


marx,


From Alain.Moulle at bull.net  Wed Apr 15 08:19:19 2009
From: Alain.Moulle at bull.net (Alain.Moulle)
Date: Wed, 15 Apr 2009 10:19:19 +0200
Subject: [Linux-cluster] CS5/ How to reduce the checkinterval ?
Message-ID: <49E59887.80407@bull.net>

Hi

I would like to force the CS5 to check one specific service every 10s and
and try restart it if status is failed . I 've tried to set 
checkinterval="10"
on the service line :
<service domain="testHA1" name="testHA1" autostart="0" checkinterval="10">
but it seems to have none effect, the status is done every xx seconds, 
the "xx" value
being the one I set for every services in /usr/share/cluster/script.sh 
(I think so) :
        <action name="status" interval="180s" timeout="0"/>

What would be the solution ?

And by the way, is it possible to force the CS to execute "n" start 
retries on failed status ?

Thanks
Regards,
Alain Moull?


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090415/fd13d105/attachment.htm>

From schlegel at riege.com  Wed Apr 15 08:32:19 2009
From: schlegel at riege.com (Gunther Schlegel)
Date: Wed, 15 Apr 2009 10:32:19 +0200
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E51BAA.4090402@umdnj.edu>
References: <49E4B728.8020906@umdnj.edu>
	<20090414175034.GE17186@redhat.com>	<49E50583.6060202@bobich.net>
	<1239747724.29777.8.camel@sdake-laptop>
	<49E51BAA.4090402@umdnj.edu>
Message-ID: <49E59B93.1090704@riege.com>

The cluster suite has a well known problem establishing a fence domain , 
established in 5.3. Updated packages have been released a couple of days 
ago.

Ryan Golhar schrieb:
> No no no, I haven't modified any of the init files...these are out of 
> the box...
> 
> Steven Dake wrote:
>> On Tue, 2009-04-14 at 22:52 +0100, Gordan Bobic wrote:
>>> David Teigland wrote:
>>>
>>>>> Even when I try to reboot the nodes, I can't because the whole 
>>>>> system hangs on various processes that don't ever shut down.  I 
>>>>> have to physically reboot these boxes.
>>>> If something has gone wrong, it's often impossible to shutdown 
>>>> without a hard
>>>> reboot.  Even when things are working, rebooting can be a delicate task
>>>> because the system may easily be configured to stop things in the 
>>>> wrong order,
>>>> and one thing out of place can cause a wreck.
>>> If things don't come down in the correct order using the standard 
>>> init scripts, you should file a bug report about this. I've never 
>>> seen it happen on any of my clusters.
>>>
>>
>> I believe he means the administrator makes a configuration change to the
>> init scripts which result in a shutdown not working.  RHCS amplifies
>> this problem somewhat so more care must be taken in these situations.
>>
>> regards
>> -steve
>>
>>> Gordan
>>>
>>> -- 
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 

-- 
.............................................................
Riege Software International GmbH  Fon: +49 (2159) 9148 0
Mollsfeld 10                       Fax: +49 (2159) 9148 11
40670 Meerbusch                    Web: www.riege.com
Germany                            E-Mail: schlegel at riege.com
---                                ---
Handelsregister:                   Managing Directors:
Amtsgericht Neuss HRB-NR 4207      Christian Riege
USt-ID-Nr.: DE120585842            Gabriele  Riege
                                   Johannes  Riege
.............................................................
           YOU CARE FOR FREIGHT, WE CARE FOR YOU          


-------------- next part --------------
A non-text attachment was scrubbed...
Name: schlegel.vcf
Type: text/x-vcard
Size: 344 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090415/39cde194/attachment.vcf>

From macbogucki at gmail.com  Wed Apr 15 09:39:18 2009
From: macbogucki at gmail.com (Maciej Bogucki)
Date: Wed, 15 Apr 2009 11:39:18 +0200
Subject: [Linux-cluster] GFS network and fencing questions
In-Reply-To: <001101c9b891$7eb9e310$7c2da930$@com>
References: <001101c9b891$7eb9e310$7c2da930$@com>
Message-ID: <49E5AB46.8050402@gmail.com>

Thomas Suiter wrote:
>
> I?m going to be building a 6 node cluster with blade servers that only 
> have 2x network connections attached to EMC DMX storage. The 
> application we are running has it's own cluster layer so we won't be 
> using the failover services (they just want the filesystem to be 
> visible to all nodes). Each node should be reading/writing only in 
> it's own directory with a single filesystem size ~15TB.
>
>
Are You going to connect to DMX via FC or iSCSI?
>
> Questions I have are this:
>
> 1) The documentation is unclear as to this, I'm assuming that I should 
> I bond the 2x interfaces rather than have one interface for public and 
> one for private. I'm thinking this will make the system much more 
> available in general, but I don't know if the public/private is a hard 
> requirement (or if what I'm thinking is even better) Best case would 
> be to get 2x more but unfortunately I don't have that luxury. If this 
> is preferred, would I need to use 2x ip addresses in this 
> configuration, or can I use just the 1x per node.
>
Bonding allows You to achieve High Availability and with VLAN on it You 
could have public/private interfaces. You could also use on interface 
for public and the second for private.
>
> 2) I have the capabilities to support scsi3 reservations inside the 
> DMX, should I be using scsi3 instead of power based fencing (or both). 
> It seems like a relatively option, is it ready for use or should it 
> bake a bit longer. I've used Veritas VCS with scsi3 previously and it 
> was sometimes semi-annoying. But the reality is that availability and 
> data protection is more important than not being annoyed.
>
If You don't use multipath it should works. But if You have multipath 
environment then You should check if it is supported(there wasn't some 
time agoe).
>
> 3) Since I have more than 2x nodes should I use qdiskd or not (or is 
> it even needed in this type of configuration with no failover); 
> looking around it appears that it?s caused some problems in the past.
>
>

Qdiskd is good option and You should use it if You can. It is like 
as(and more) another independent Etherenet interface.

Best Regards
Maciej Bogucki


From mrugeshkarnik at gmail.com  Wed Apr 15 14:06:53 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Wed, 15 Apr 2009 19:36:53 +0530
Subject: [Linux-cluster] No VM migration on node failure.. waiting to be
	fenced?
Message-ID: <200904151936.53349.mrugeshkarnik@gmail.com>

Hi,

I have a two node cluster of Xen dom0s, but configured with a qdisk. I've 
created a failover domain for each of the nodes and configured vm services.

I've tried out killing a running VM with xm destroy and its brought back up 
properly. Migration etc. works fine as well. However, when I kill the node 
running a VM, the cluster keeps on waiting for the node to be fenced and the 
virtual machine is never migrated to the other node.

When the failed node starts up again and joins the cluster, the VM service 
that was supposed to have been migrated to the other node, starts up on the 
resurrected node.

Since this is just a testing setup, I have configured manual fencing.

How can I get the VM to fail over to the second node in the event of failure, 
without admin intervention?

TIA,
Mrugesh


From jeff.sturm at eprize.com  Wed Apr 15 14:47:40 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Wed, 15 Apr 2009 10:47:40 -0400
Subject: [Linux-cluster] GFS file size/performance question
In-Reply-To: <20090414154657.GC17186@redhat.com>
References: <49E3B29A020000C80000210D@mail-int.health-first.org><64D0546C5EBBD147B75DE133D798665F02FDB83B@hugo.eprize.local>
	<20090414154657.GC17186@redhat.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB86C@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of David Teigland
> Sent: Tuesday, April 14, 2009 11:47 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS file size/performance question
> 
> On Mon, Apr 13, 2009 at 10:38:15PM -0400, Jeff Sturm wrote:
> 
> > gfs_tool settune $fs glock_purge 50
> 
> > There have been many prior articles posted on GFS 
> performance.  Search 
> > for e.g. "glock trimming patch".
> 
> That purge/trimming thing was a hack for a very particular 
> corner case (backup process walking over the entire fs and 
> caching all locks on the fs).  Enabling it under other 
> circumstances will most likely *hurt* your performance by 
> artificially reducing normal gfs caching.

In our case it was a periodic rsync process.  The OP was asking about
large numbers of files, so he may have had something similar.  The
gradual slowdown of a single node sounds a lot like what we had seen
with rsync.

Jeff


From pavlos.parissis at gmail.com  Wed Apr 15 17:12:42 2009
From: pavlos.parissis at gmail.com (Pavlos Parissis)
Date: Wed, 15 Apr 2009 19:12:42 +0200
Subject: [Linux-cluster] Failback option on failover domains
Message-ID: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>

Hello,

Does anyone know if the failback option is available on RedHat 4.X systems?

I went through all docs [1] and it seams to be that is only available on
RedHat 5.X version,
am I right?

Cheers,
Pavlos


[1]
http://www.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/5/html/Cluster_Administration/index.html
http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/index.html
http://www.redhat.com/docs/manuals/enterprise/#RHEL5
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090415/7fad2d89/attachment.htm>

From garromo at us.ibm.com  Wed Apr 15 19:57:26 2009
From: garromo at us.ibm.com (Gary Romo)
Date: Wed, 15 Apr 2009 13:57:26 -0600
Subject: [Linux-cluster] add a node to RHCS
Message-ID: <OFF3FEA100.AD2A2DD7-ON87257599.006D8463-87257599.006DA11C@us.ibm.com>


Hello.

I am looking for doucmentation that covers adding a node to an existing
cluster.
Does anyone know where this is?   Thanks.

Gary Romo
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090415/33878579/attachment.htm>

From finnzi at finnzi.com  Wed Apr 15 20:07:18 2009
From: finnzi at finnzi.com (=?ISO-8859-1?Q?Finnur_=D6rn_Gu=F0mundsson?=)
Date: Wed, 15 Apr 2009 20:07:18 +0000
Subject: [Linux-cluster] add a node to RHCS
In-Reply-To: <OFF3FEA100.AD2A2DD7-ON87257599.006D8463-87257599.006DA11C@us.ibm.com>
References: <OFF3FEA100.AD2A2DD7-ON87257599.006D8463-87257599.006DA11C@us.ibm.com>
Message-ID: <49E63E76.9050605@finnzi.com>

Gary Romo wrote:
>
> Hello.
>
> I am looking for doucmentation that covers adding a node to an 
> existing cluster.
> Does anyone know where this is? Thanks.
>
> Gary Romo
>
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
I'm pretty darn sure the Red Hat Enterprise Linux 5 Cluster 
Administration guide has a chapter for it: 
http://www.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/5/html/Cluster_Administration/index.html

Bgrds,
Finnur


From raju.rajsand at gmail.com  Thu Apr 16 04:08:36 2009
From: raju.rajsand at gmail.com (Rajagopal Swaminathan)
Date: Thu, 16 Apr 2009 09:38:36 +0530
Subject: [Linux-cluster] Failback option on failover domains
In-Reply-To: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
References: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
Message-ID: <8786b91c0904152108j2558e3a6oe05b7e79b3741939@mail.gmail.com>

Greetings,

On Wed, Apr 15, 2009 at 10:42 PM, Pavlos Parissis
<pavlos.parissis at gmail.com> wrote:
> Hello,
>
> Does anyone know if the failback option is available on RedHat 4.X systems?
>
> I went through all docs [1] and it seams to be that is only available on
> RedHat 5.X version,
> am I right?


Please note that the RHCS is very different for 4.x and 5.x

Rajagopal


From JBayles at readytechs.com  Thu Apr 16 14:47:49 2009
From: JBayles at readytechs.com (Jonathan Bayles)
Date: Thu, 16 Apr 2009 09:47:49 -0500
Subject: [Linux-cluster] Question about gnbd_import
Message-ID: <9EB99FF0783ACA4F8B4B8A666D8462E20C875A@mail-11ps.atlarge.net>

I am a gnbd newbie who is using gnbd to export LV's from a one server to
another. I have two servers exporting unique gnbd devices, I would like
to import both sets of block devices onto the same server. Are there any
considerations? At the moment it looks like there is some hidden unique
identifier problem, wherein, when I start to import from server B at a
certain point it stops and throws an error(names and such altered to
appease management), is this a problem with some kind of UID or
major/minor numbers? Any insight would be appreciated:

[root at serverC ~]# gnbd_import -ni serverA
gnbd_import: created gnbd device deviceA
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceB
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceC
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceD
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceE
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceF
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceG
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceH
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceI
gnbd_recvd: gnbd_recvd started

[root at serverC ~]# gnbd_import -ni serverB
gnbd_import: created gnbd device deviceX
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceY
gnbd_recvd: gnbd_recvd started
gnbd_import: created gnbd device deviceZ
gnbd_recvd: ERROR login refused by the server, quitting : No such file
or directory
gnbd_import: ERROR gnbd_recvd failed

Jonathan Bayles
Network Engineer
Readytechs LLC
jbayles at readytechs.com


From alan.zg at gmail.com  Thu Apr 16 15:56:37 2009
From: alan.zg at gmail.com (Alan A)
Date: Thu, 16 Apr 2009 10:56:37 -0500
Subject: [Linux-cluster] GFS statistics - what is the best way to get some
	statistics from GFS
Message-ID: <fac531740904160856j587013j622b954cd438cbcd@mail.gmail.com>

Hello!

Just a quick question - what is the best way to get GFS statistics. I used
'gfs_tool counters /directory' to obtain statistics. Is there a better way
to get running, dynamic, statistical output?

-- 
Alan A.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090416/4a1dda0a/attachment.htm>

From esggrupos at gmail.com  Fri Apr 17 10:57:07 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Fri, 17 Apr 2009 12:57:07 +0200
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
Message-ID: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>

Hello all,

I have a doubt about using the same service ip with two diferent services in
two diferent nodes of my two-nodes-cluster.


I?ll explain it a litle:

I have two services: BBDD and HTTPD

I have configured a shared IP: 192.168.1.100

two nodes: node1 and node2.

When I run the two services on node1 all runs ok. If I try to relocate one
service, with

clusvcadm -r BBDD -m node2

it fails with the error:
 in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use
 in.rdiscd[4805]: Failed joining addresses

I suposse that is because one services is on node1 with the ip and the other
trys to run in node2 with the same ip.

So my question is if it is possible to run the services this way or I need
an ip per service?
(I have tested that with 2 diferent ips I can run BBDD on node1 an HTTPD on
node2)

Thanks in advance

ESG
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/65adf844/attachment.htm>

From robejrm at gmail.com  Fri Apr 17 11:03:53 2009
From: robejrm at gmail.com (Juan Ramon Martin Blanco)
Date: Fri, 17 Apr 2009 13:03:53 +0200
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>
Message-ID: <8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>

On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com> wrote:

> Hello all,
>
> I have a doubt about using the same service ip with two diferent services
> in two diferent nodes of my two-nodes-cluster.


>
> I?ll explain it a litle:
>
> I have two services: BBDD and HTTPD
>
> I have configured a shared IP: 192.168.1.100
>
Hi,
You _must_ use a different  IP, cannot have the same IP on different
machines.

Greetings,
Juanra

>
> two nodes: node1 and node2.
>
> When I run the two services on node1 all runs ok. If I try to relocate one
> service, with
>
> clusvcadm -r BBDD -m node2
>
> it fails with the error:
>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use
>  in.rdiscd[4805]: Failed joining addresses
>
> I suposse that is because one services is on node1 with the ip and the
> other trys to run in node2 with the same ip.
>
> So my question is if it is possible to run the services this way or I need
> an ip per service?
> (I have tested that with 2 diferent ips I can run BBDD on node1 an HTTPD on
> node2)
>
> Thanks in advance
>
> ESG
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/8cc5a431/attachment.htm>

From esggrupos at gmail.com  Fri Apr 17 11:23:30 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Fri, 17 Apr 2009 13:23:30 +0200
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>
	<8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>
Message-ID: <3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>

Hi,

thanks for your answer

I suspected  so,

can I say as an axiom: One service needs One IP? ( One man, One vote ;-) )

and if is this true, has it sense to configure ip resources as shared? it
must be better to configure as a private resource, isnt it?

Greetings,

ESG

2009/4/17 Juan Ramon Martin Blanco <robejrm at gmail.com>

>
>
> On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com> wrote:
>
>> Hello all,
>>
>> I have a doubt about using the same service ip with two diferent services
>> in two diferent nodes of my two-nodes-cluster.
>
>
>>
>> I?ll explain it a litle:
>>
>> I have two services: BBDD and HTTPD
>>
>> I have configured a shared IP: 192.168.1.100
>>
> Hi,
> You _must_ use a different  IP, cannot have the same IP on different
> machines.
>
> Greetings,
> Juanra
>
>>
>> two nodes: node1 and node2.
>>
>> When I run the two services on node1 all runs ok. If I try to relocate one
>> service, with
>>
>> clusvcadm -r BBDD -m node2
>>
>> it fails with the error:
>>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use
>>  in.rdiscd[4805]: Failed joining addresses
>>
>> I suposse that is because one services is on node1 with the ip and the
>> other trys to run in node2 with the same ip.
>>
>> So my question is if it is possible to run the services this way or I need
>> an ip per service?
>> (I have tested that with 2 diferent ips I can run BBDD on node1 an HTTPD
>> on node2)
>>
>> Thanks in advance
>>
>> ESG
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/f9efb7b4/attachment.htm>

From gordan at bobich.net  Fri Apr 17 11:52:10 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Fri, 17 Apr 2009 12:52:10 +0100
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>	<8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>
	<3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>
Message-ID: <06d2fdcb2614a82906c6ae34ca8440f6@localhost>

You can't have the same IP on multiple machines at the same time. That
much should be obvious before even thinking about clustering.

You can, however configure floating IPs as resources that RHCS will
fail over between cluster nodes. Only one node will have a particular
IP address at any one time, but if that node fails, the floating IP
will get migrated to one of the surviving nodes.

Any number of services can run on a floating IP. There is no need
to have one floating IP per service.

Gordan

On Fri, 17 Apr 2009 13:23:30 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> Hi,
> 
> thanks for your answer
> 
> I suspected  so,
> 
> can I say as an axiom: One service needs One IP? ( One man, One vote ;-)
)
> 
> and if is this true, has it sense to configure ip resources as shared? it
> must be better to configure as a private resource, isnt it?
> 
> Greetings,
> 
> ESG
> 
> 2009/4/17 Juan Ramon Martin Blanco <robejrm at gmail.com>
> 
>>
>>
>> On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com> wrote:
>>
>>> Hello all,
>>>
>>> I have a doubt about using the same service ip with two diferent
>>> services
>>> in two diferent nodes of my two-nodes-cluster.
>>
>>
>>>
>>> I?ll explain it a litle:
>>>
>>> I have two services: BBDD and HTTPD
>>>
>>> I have configured a shared IP: 192.168.1.100
>>>
>> Hi,
>> You _must_ use a different  IP, cannot have the same IP on different
>> machines.
>>
>> Greetings,
>> Juanra
>>
>>>
>>> two nodes: node1 and node2.
>>>
>>> When I run the two services on node1 all runs ok. If I try to relocate
>>> one
>>> service, with
>>>
>>> clusvcadm -r BBDD -m node2
>>>
>>> it fails with the error:
>>>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in
use
>>>  in.rdiscd[4805]: Failed joining addresses
>>>
>>> I suposse that is because one services is on node1 with the ip and the
>>> other trys to run in node2 with the same ip.
>>>
>>> So my question is if it is possible to run the services this way or I
>>> need
>>> an ip per service?
>>> (I have tested that with 2 diferent ips I can run BBDD on node1 an
HTTPD
>>> on node2)
>>>
>>> Thanks in advance
>>>
>>> ESG
>>>
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>


From esggrupos at gmail.com  Fri Apr 17 12:01:05 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Fri, 17 Apr 2009 14:01:05 +0200
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <06d2fdcb2614a82906c6ae34ca8440f6@localhost>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>
	<8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>
	<3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>
	<06d2fdcb2614a82906c6ae34ca8440f6@localhost>
Message-ID: <3128ba140904170501m60d17d5ch9c6901ac3cec3e82@mail.gmail.com>

Hi Gordan,

I?m talking about the floating IP. (I called it service IP, is wrog the
name?)

The real ip of the two nodes is diferent and share the floating IP but in
that scenario when one service fails (no the whole node) I get one service
in one node and one service on the other node but I get the error I posted.

for example
floating IP + BBDD is runing on node1
floating IP + HTTPD is running on node2

is this possible?

thanks

ESG


2009/4/17 Gordan Bobic <gordan at bobich.net>

> You can't have the same IP on multiple machines at the same time. That
> much should be obvious before even thinking about clustering.
>
> You can, however configure floating IPs as resources that RHCS will
> fail over between cluster nodes. Only one node will have a particular
> IP address at any one time, but if that node fails, the floating IP
> will get migrated to one of the surviving nodes.
>
> Any number of services can run on a floating IP. There is no need
> to have one floating IP per service.
>
> Gordan
>
> On Fri, 17 Apr 2009 13:23:30 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> > Hi,
> >
> > thanks for your answer
> >
> > I suspected  so,
> >
> > can I say as an axiom: One service needs One IP? ( One man, One vote ;-)
> )
> >
> > and if is this true, has it sense to configure ip resources as shared? it
> > must be better to configure as a private resource, isnt it?
> >
> > Greetings,
> >
> > ESG
> >
> > 2009/4/17 Juan Ramon Martin Blanco <robejrm at gmail.com>
> >
> >>
> >>
> >> On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com> wrote:
> >>
> >>> Hello all,
> >>>
> >>> I have a doubt about using the same service ip with two diferent
> >>> services
> >>> in two diferent nodes of my two-nodes-cluster.
> >>
> >>
> >>>
> >>> I?ll explain it a litle:
> >>>
> >>> I have two services: BBDD and HTTPD
> >>>
> >>> I have configured a shared IP: 192.168.1.100
> >>>
> >> Hi,
> >> You _must_ use a different  IP, cannot have the same IP on different
> >> machines.
> >>
> >> Greetings,
> >> Juanra
> >>
> >>>
> >>> two nodes: node1 and node2.
> >>>
> >>> When I run the two services on node1 all runs ok. If I try to relocate
> >>> one
> >>> service, with
> >>>
> >>> clusvcadm -r BBDD -m node2
> >>>
> >>> it fails with the error:
> >>>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in
> use
> >>>  in.rdiscd[4805]: Failed joining addresses
> >>>
> >>> I suposse that is because one services is on node1 with the ip and the
> >>> other trys to run in node2 with the same ip.
> >>>
> >>> So my question is if it is possible to run the services this way or I
> >>> need
> >>> an ip per service?
> >>> (I have tested that with 2 diferent ips I can run BBDD on node1 an
> HTTPD
> >>> on node2)
> >>>
> >>> Thanks in advance
> >>>
> >>> ESG
> >>>
> >>>
> >>> --
> >>> Linux-cluster mailing list
> >>> Linux-cluster at redhat.com
> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>>
> >>
> >>
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/38bf7c5b/attachment.htm>

From gordan at bobich.net  Fri Apr 17 12:10:25 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Fri, 17 Apr 2009 13:10:25 +0100
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <3128ba140904170501m60d17d5ch9c6901ac3cec3e82@mail.gmail.com>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>	<8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>	<3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>	<06d2fdcb2614a82906c6ae34ca8440f6@localhost>
	<3128ba140904170501m60d17d5ch9c6901ac3cec3e82@mail.gmail.com>
Message-ID: <167d516bc744af67606fd4ce89a68750@localhost>

Right, I see what you mean now. No, you can't do that - the service always
has to run on the node where it's IP is. But you can fail over the whole
resource group (IP + services) together if required.

Gordan

On Fri, 17 Apr 2009 14:01:05 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> Hi Gordan,
> 
> I?m talking about the floating IP. (I called it service IP, is wrog the
> name?)
> 
> The real ip of the two nodes is diferent and share the floating IP but in
> that scenario when one service fails (no the whole node) I get one
service
> in one node and one service on the other node but I get the error I
posted.
> 
> for example
> floating IP + BBDD is runing on node1
> floating IP + HTTPD is running on node2
> 
> is this possible?
> 
> thanks
> 
> ESG
> 
> 
> 
> 2009/4/17 Gordan Bobic <gordan at bobich.net>
> 
>> You can't have the same IP on multiple machines at the same time. That
>> much should be obvious before even thinking about clustering.
>>
>> You can, however configure floating IPs as resources that RHCS will
>> fail over between cluster nodes. Only one node will have a particular
>> IP address at any one time, but if that node fails, the floating IP
>> will get migrated to one of the surviving nodes.
>>
>> Any number of services can run on a floating IP. There is no need
>> to have one floating IP per service.
>>
>> Gordan
>>
>> On Fri, 17 Apr 2009 13:23:30 +0200, ESGLinux <esggrupos at gmail.com>
wrote:
>> > Hi,
>> >
>> > thanks for your answer
>> >
>> > I suspected  so,
>> >
>> > can I say as an axiom: One service needs One IP? ( One man, One vote
>> > ;-)
>> )
>> >
>> > and if is this true, has it sense to configure ip resources as shared?
>> > it
>> > must be better to configure as a private resource, isnt it?
>> >
>> > Greetings,
>> >
>> > ESG
>> >
>> > 2009/4/17 Juan Ramon Martin Blanco <robejrm at gmail.com>
>> >
>> >>
>> >>
>> >> On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com>
>> >> wrote:
>> >>
>> >>> Hello all,
>> >>>
>> >>> I have a doubt about using the same service ip with two diferent
>> >>> services
>> >>> in two diferent nodes of my two-nodes-cluster.
>> >>
>> >>
>> >>>
>> >>> I?ll explain it a litle:
>> >>>
>> >>> I have two services: BBDD and HTTPD
>> >>>
>> >>> I have configured a shared IP: 192.168.1.100
>> >>>
>> >> Hi,
>> >> You _must_ use a different  IP, cannot have the same IP on different
>> >> machines.
>> >>
>> >> Greetings,
>> >> Juanra
>> >>
>> >>>
>> >>> two nodes: node1 and node2.
>> >>>
>> >>> When I run the two services on node1 all runs ok. If I try to
>> >>> relocate
>> >>> one
>> >>> service, with
>> >>>
>> >>> clusvcadm -r BBDD -m node2
>> >>>
>> >>> it fails with the error:
>> >>>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in
>> use
>> >>>  in.rdiscd[4805]: Failed joining addresses
>> >>>
>> >>> I suposse that is because one services is on node1 with the ip and
>> >>> the
>> >>> other trys to run in node2 with the same ip.
>> >>>
>> >>> So my question is if it is possible to run the services this way or
I
>> >>> need
>> >>> an ip per service?
>> >>> (I have tested that with 2 diferent ips I can run BBDD on node1 an
>> HTTPD
>> >>> on node2)
>> >>>
>> >>> Thanks in advance
>> >>>
>> >>> ESG
>> >>>
>> >>>
>> >>> --
>> >>> Linux-cluster mailing list
>> >>> Linux-cluster at redhat.com
>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> >>>
>> >>
>> >>
>> >> --
>> >> Linux-cluster mailing list
>> >> Linux-cluster at redhat.com
>> >> https://www.redhat.com/mailman/listinfo/linux-cluster
>> >>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>


From esggrupos at gmail.com  Fri Apr 17 12:23:40 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Fri, 17 Apr 2009 14:23:40 +0200
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <167d516bc744af67606fd4ce89a68750@localhost>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>
	<8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>
	<3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>
	<06d2fdcb2614a82906c6ae34ca8440f6@localhost>
	<3128ba140904170501m60d17d5ch9c6901ac3cec3e82@mail.gmail.com>
	<167d516bc744af67606fd4ce89a68750@localhost>
Message-ID: <3128ba140904170523g243c2206lcc73f8755f3d7a6c@mail.gmail.com>

Hi again,

now we speak the same language ;-)  what do your think about my question?:

Has it sense to configure ip resources as shared?
  I think it must be better to configure always as a private resource, isnt
it?

ESG


2009/4/17 Gordan Bobic <gordan at bobich.net>

> Right, I see what you mean now. No, you can't do that - the service always
> has to run on the node where it's IP is. But you can fail over the whole
> resource group (IP + services) together if required.
>
> Gordan
>
> On Fri, 17 Apr 2009 14:01:05 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> > Hi Gordan,
> >
> > I?m talking about the floating IP. (I called it service IP, is wrog the
> > name?)
> >
> > The real ip of the two nodes is diferent and share the floating IP but in
> > that scenario when one service fails (no the whole node) I get one
> service
> > in one node and one service on the other node but I get the error I
> posted.
> >
> > for example
> > floating IP + BBDD is runing on node1
> > floating IP + HTTPD is running on node2
> >
> > is this possible?
> >
> > thanks
> >
> > ESG
> >
> >
> >
> > 2009/4/17 Gordan Bobic <gordan at bobich.net>
> >
> >> You can't have the same IP on multiple machines at the same time. That
> >> much should be obvious before even thinking about clustering.
> >>
> >> You can, however configure floating IPs as resources that RHCS will
> >> fail over between cluster nodes. Only one node will have a particular
> >> IP address at any one time, but if that node fails, the floating IP
> >> will get migrated to one of the surviving nodes.
> >>
> >> Any number of services can run on a floating IP. There is no need
> >> to have one floating IP per service.
> >>
> >> Gordan
> >>
> >> On Fri, 17 Apr 2009 13:23:30 +0200, ESGLinux <esggrupos at gmail.com>
> wrote:
> >> > Hi,
> >> >
> >> > thanks for your answer
> >> >
> >> > I suspected  so,
> >> >
> >> > can I say as an axiom: One service needs One IP? ( One man, One vote
> >> > ;-)
> >> )
> >> >
> >> > and if is this true, has it sense to configure ip resources as shared?
> >> > it
> >> > must be better to configure as a private resource, isnt it?
> >> >
> >> > Greetings,
> >> >
> >> > ESG
> >> >
> >> > 2009/4/17 Juan Ramon Martin Blanco <robejrm at gmail.com>
> >> >
> >> >>
> >> >>
> >> >> On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com>
> >> >> wrote:
> >> >>
> >> >>> Hello all,
> >> >>>
> >> >>> I have a doubt about using the same service ip with two diferent
> >> >>> services
> >> >>> in two diferent nodes of my two-nodes-cluster.
> >> >>
> >> >>
> >> >>>
> >> >>> I?ll explain it a litle:
> >> >>>
> >> >>> I have two services: BBDD and HTTPD
> >> >>>
> >> >>> I have configured a shared IP: 192.168.1.100
> >> >>>
> >> >> Hi,
> >> >> You _must_ use a different  IP, cannot have the same IP on different
> >> >> machines.
> >> >>
> >> >> Greetings,
> >> >> Juanra
> >> >>
> >> >>>
> >> >>> two nodes: node1 and node2.
> >> >>>
> >> >>> When I run the two services on node1 all runs ok. If I try to
> >> >>> relocate
> >> >>> one
> >> >>> service, with
> >> >>>
> >> >>> clusvcadm -r BBDD -m node2
> >> >>>
> >> >>> it fails with the error:
> >> >>>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already in
> >> use
> >> >>>  in.rdiscd[4805]: Failed joining addresses
> >> >>>
> >> >>> I suposse that is because one services is on node1 with the ip and
> >> >>> the
> >> >>> other trys to run in node2 with the same ip.
> >> >>>
> >> >>> So my question is if it is possible to run the services this way or
> I
> >> >>> need
> >> >>> an ip per service?
> >> >>> (I have tested that with 2 diferent ips I can run BBDD on node1 an
> >> HTTPD
> >> >>> on node2)
> >> >>>
> >> >>> Thanks in advance
> >> >>>
> >> >>> ESG
> >> >>>
> >> >>>
> >> >>> --
> >> >>> Linux-cluster mailing list
> >> >>> Linux-cluster at redhat.com
> >> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
> >> >>>
> >> >>
> >> >>
> >> >> --
> >> >> Linux-cluster mailing list
> >> >> Linux-cluster at redhat.com
> >> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >> >>
> >>
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/255b13bd/attachment.htm>

From sunhux at gmail.com  Fri Apr 17 12:54:05 2009
From: sunhux at gmail.com (sunhux G)
Date: Fri, 17 Apr 2009 20:54:05 +0800
Subject: [Linux-cluster] OT: Error installing Acronis Enterprise agent :
	snapapi module not found
Message-ID: <60f08e700904170554y766776et2dabc500d950b4a5@mail.gmail.com>

Hi,
Sorry this is OT but I'm stuck.

Anyone has installed Acronis True Image Echo Enterprise agent on
Linux.

I got the following error after running the Acronis  *.i686 install file :

      ?????????????????????????????? Message ??????????????????????????????
      ?                                                                  ?
      ? Acronis True Image Echo Enterprise Server has not found          ?
      ? precompiled snapapi module for running kernel. It is             ?
      ? suggested to compile it now.                                     ?
      ?                                                                  ?
      ? You may choose "Skip" and compile snapapi module later. Please   ?
      ? read readme.txt and HOWTO.INSTALL for compile instructions.      ?
      ?                                                                  ?
      ?            ????????????                      ????????            ?
      ?            ? Continue ?                      ? Skip ?            ?
      ?            ????????????                      ????????            ?
      ?                                                                  ?
      ?                                                                  ?
      ????????????????????????????????????????????????????????????????????
Here's a list of RPMs listed at the time I hit above error and I pause at
above screen and pending a solution, will not proceed :

# rpm -qa | grep -i gcc-
compat-libgcc-296-2.96-132.7.2
gcc-g77-3.4.6-9
libgcc-3.4.6-9
gcc-c++-3.4.6-9
gcc-objc-3.4.6-9
gcc-java-3.4.6-9
gcc-3.4.6-9

# rpm -qa | grep -i glibc-
glibc-devel-2.3.4-2.39
acronis-glibc-1.0-0
glibc-common-2.3.4-2.39
glibc-headers-2.3.4-2.39
glibc-2.3.4-2.39
glibc-kernheaders-2.4-9.1.100.EL
# rpm -qa | grep -i libgomp-
libgomp-4.1.2-14.EL4

What should I do next?
Nothing much in readme.txt but I'm sorry to append below the lengthy
HOWTO.INSTALL file  :

=========================================================

Table of content
Introduction
I.    Common procedure to build  and  install  kernel module
. . . . .
VIII. Installing on RedHat 8.0
IX.   Installing on TurboLinux 8.0
X .FAQ:
. . . . .
Introduction
Sometimes  Acronis True Image Echo  Enterprise Server  setup
can not  compile  the necessary  kernel  modules or  prepare
required   execution   environment   for   "trueimage"   and
"trueimagecmd" utilities. Usually it prompts you about  such
problem and refers you to this file.
Section I describes common "how to build and install module"
procedure. Most offten you will have to read it if you have
custom kernel or non-stantard kernel sources location.
Sections II,III and others provide necessary information for
specific distributions.
Please note that Redhat 9.0, Fedora Core 1, 2, 3, 4, 5, 6, 7,
Redhat Advanced Server 3.0, 4.0, 5.0,SuSE 8.2, 9.0, 9.1, 9.2,
9.3, 10.1, 10.2, Mandrake 10, Slackware 10, 11, ASPLinux 9.2,
ASPLinux 10, ASPLinux Server II and Gentoo with stock kernels
do not  have  problems  with  Acronis  True  Image Echo Linux
Server and are not mentioned below.
I. Building  and  installing  kernel module  in general case
If the setup cannot  compile the necessary kernel module you
will  have to do it manually. Please install kernel  sources,
apropriate config  file  and all  required  for kernel build
packages (like  gcc, glibc-devel, etc). You will be prompted
about necessary packages while kernel sources install.
Most  often  the snapapi  kernel module  should be built and
installed by "dkms" command. It may be done by the following
commands:
# dkms build -m <MODULE_NAME> -v <MODULE_VERSION> \
 --config <CONFIG_FILE> --arch <KERNEL_ARCH> \
 --kernelsourcedir <PATH_TO_KERNEL_SOURCES>
# dkms install -m <MODULE_NAME> -v <MODULE_VERSION> \
 --config <CONFIG_FILE> --arch <KERNEL_ARCH> \
 --kernelsourcedir <PATH_TO_KERNEL_SOURCES>
<MODULE_NAME> must be "snapapi" for 2.4.x kernels or
"snapapi26" for 2.6.x kernels.
<MODULE_VERSION> could be detected by
# ls /usr/src/snapapi*
<CONFIG_FILE> is  your  kernel config filename. Usually this
file may be found in /boot directory.
<KERNEL_ARCH> may be detected by
# rpm -q --queryformat "%{ARCH}\n" kernel
for RPM based distrubutions or by
# uname -m
for non-RPM based distributions.
For details please refer to dkms man page.
After succesful module  build  and  install  you  may try to
launch "trueimage"  or  "trueimagecmd"  utilities and  check
their  functionality.  Appropriate  kernel  modules  will be
loaded automatically.
II. Installing on Mandrake 9.2 with kernel 2.4.X
1. Please install kernel sources and prepare kernel to build
by:
# make -C /usr/src/linux-2.4.22-37mdk/ mrproper
# cp /boot/config-2.4.22-37mdksmp /usr/src/linux-2.4.22-37mdk/.config
# make -C /usr/src/linux-2.4.22-37mdk/ oldconfig
# make -C /usr/src/linux-2.4.22-37mdk/ dep
2. Please build and install snapapi module by the  following
commands:
# dkms build -m snapapi -v 0.6.4 -k 2.4.22-37mdksmp --arch i686 \
 --config /boot/config-2.4.22-37mdksmp --kernelsourcedir \
 /usr/src/linux-2.4.22-37mdk/ --no-prepare-kernel
# dkms install -m snapapi -v 0.6.4 -k 2.4.22-37mdksmp \
 --arch i686
It is supposed that you have kernel 2.4.22-37mdksmp,  kernel
architecture is i686 and module version is 0.6.4.
2. Activate devfs support by commands:
# mkdir /devfs
# mount -t devfs devfs /devfs
3. Make devfs support permanent by adding
"devfs /devfs devfs defaults 0 0" to your  /etc/fstab  file.
III. Installing on Mandrake 10 with kernel 2.6.3-4mdk
Mandrake  Linux kernel  2.6.3-4 is not supported by  Acronis
True Image Echo Linux Server. Please upgrade  kernel  up  to
2.6.3-7mdk or  later  and   repeate  install.  Some Mandrake
10 kernels (i.e.2.6.3-19mdk) have broken "build" and "source"
link in  /lib/modules/... directory so dkms  can  not  build
snapapi26 kernel module. In such a case please locate kernel
sources or includes in /usr/src directory and build snapapi26
kernel module manually according to the section I.
IV. Common installation procedure on Debian
a) If you are installing the software on a 2.4 kernel based
distribution:
1. Install kernel sources.
2. Build and install kernel module.
# dkms build -m snapapi -v <SNAPAPI_VERSION> -k 2.4.25-1 \
 --config /boot/config-2.4.25-1-386 --arch i686 \
 --kernelsourcedir /usr/src/kernel-sources-2.4.25
# dkms install -m snapapi -v <SNAPAPI_VERSION> -k 2.4.25-1 \
 --config /boot/config-2.4.25-1-386 --arch i686 \
 --kernelsourcedir /usr/src/kernel-sources-2.4.25
<SNAPAPI_VERSION> is current snapapi version.
It   is  supposed  that  you  have  kernel  2.4.25-1, kernel
architecture is i686.
3. Most  probably raw-devices were not created during Debian
3.0 installation/configuring. Following simple script can be
used  to check  and  create  the  devices  if  needed  (root
permissions required):
#!/bin/bash
mkdir -p /dev/raw/
if [ ! -e /dev/rawctl ] ;then
 mknod /dev/rawctl c 162 0
fi
for i in `seq 1 128`; do
 if [ ! -e /dev/raw/raw${i}  ] ;then
  mknod /dev/raw/raw${i} c 162 ${i}
 fi
done
4. Activate devfs support by commands:
# mkdir /devfs
# mount -t devfs devfs /devfs
5. Make devfs support permanent by adding the following line
to your /etc/fstab file:
devfs /devfs devfs defaults 0 0

VIII. Installing on RedHat 8.0.
On stock RedHat 8.0 kernel (2.4.18-14) installer can't build
kernel  module.  It  is  recommended to upgrade kernel up to
2.4.20-28.8 (or higher) located on RedHat  site  in  updates
for RedHat  8.0.
If  you  want  to keep old kernel you can follow section  II
("Installing on Mandrake 9.2 with kernel 2.4.X") subsections
I and II and build kernel module.
IX. Installing on TurboLinux 8.0.
On  stock  TurboLinux 8.0  if  the  installer  cannot  build
kernel modules please build them manually:
1.  Install  the  below  packages from CD 2 if they are have
not been installed earlier:
kernel-source-2.4.18-5.i586.rpm
kernel-headers-2.4.18-5.i586.rpm
gcc-2.96-9.i586.rpm cpp-2.96-9.i586.rpm
glibc-devel-2.2.5-13.i586.rpm
2. Detect your kernel release by
# uname -r
3. Build snapapi modules by
# dkms build -m snapapi -v 0.6.4 --config \
 /usr/src/linux/configs/kernel-2.4.18-5smp-i586.config
Please use correct config file according your kernel release.
In our example kernel release was 2.4.18-5smp.
X. FAQ:
    Q: How do I run *.i686 installation file?
    A: This is a standard binary file. In order to start the
installation, do:
# chmod +x <file_name>.i686
# ./<file_name>.i686
where <file_name>.i686 is the name of your installation file.
   Q: Is  *.i686 installation file compatible with my x86_64
   arch?
   A: Yes.In order to install Acronis True Echo Linux Server
software  on  x86_64  Linux distribution follow the standard
installation procedure.
   Q: Can I install Acronis True Image Echo Linux Servert on
   an ia64 system?
   A:  No, this architecture is not supported by the current
versions of Acronis True Image Echo Linux Server.
  Q:  The  installer  complains  that  it cannot find kernel
   sources in /lib/modules/<MY_KERNEL>/build or in
   /lib/modules/<MY_KERNEL>/source directories. What  should
   I do?
  A:  This usually means that you do not have the sources of
your  running  kernel installed. We suggest you to check the
documentation   for  your  Linux  distribution  to find  the
correct sources.
Note:  kernel sources should 100% correspond to your running
kernel!
Some examples are below:
If you have Fedora Core Linux distribution:
You may install the sources using rpm manager or using yum.
Find your kernel version by entering:
# uname -r
Let us suppose it is 2.6.12-1.1372_FC3
And your kernel architecture by entering:
# uname -m
Suppose it is i686
So you should install kernel-devel-2.6.12-1.1372_FC3.i686.rpm
package. You may find it on  your Fedora Core installation CD
or  download  from  Fedora  ftp server.In order to use yum to
install  the  kernel  sources,  consult  with the appropriate
documentation.
If you have RHEL4/CentOS 4.x Linux:
Find your kernel version by entering:
# uname -r
FSuppose it is 2.6.9-34.ELsmp
Detect your kernel package architecture:
# uname -m
Suppose it is i686.
So you should install the kernel-smp-devel-2.6.9-34.EL.i686.rpm
package.  You  should  be  able  to find it  on  your Linux
distribution CD.
Note:  on RHEL3/CentOS the name of the kernel source package
should be kernel-source-<your_version>, where <your_version>
is the name of your running kernel.
If you are using SUSE Linux distribution:
Find your kernel version:
# uname -r
Suppose it is 2.6.5-7.244
Detect your kernel package architecture:
# uname -m
Suppose it is i686
Note, most  of  all SUSE packages have i586 architecture, so
you should install the package kernel-source-2.6.5-7.244.i586.rpm.
You  can find it on your Linux distribution CD  or  download
from SUSE Linux ftp server.
If you are using Debian distribution.
The  easiest way is to install the sources is to use apt-get
utility.
Find your kernel version:
# uname -r
Suppose it is 2.6.18-3-686
So, to install the sources of your kernel you should use the
command below:
# apt-get install linux-source-2.6.18-3-686
Note:  "apt-get  install  linux-source"  will  download  the
sources of the most recent kernel. Use it only  if  you  are
completely sure that you are using the  most  recent  kernel.
Note:  If you  are using Debian Sarge, the command should be
similar to:
# apt-get install kernel-source-<your_kernel>
where <your_kernel> is the name of your kernel image.
Q: I cannot  connect to Acronis True  Image Echo  Enterprise
Linux  Agent  installed  on  64 bit Debian (or Debian-based)
system. What should I do?
A: At  the moment Debian  x86_64  does  not  support running
32-bit applications to its full extent.
Acronis True Image Echo  Enterprise Linux Agent is  a 32-bit
application and it tries to authenticate against  64-bit PAM
module  by default. To solve this issue, you should  install
a minimal set of 32-bit libraries:
# apt-get install ia32-libs
Download  i386  package  libpam-modules  and the packages it
depends on:
libselinux1 libsepol1 from packages.debian.org
For example, packages for Debian Etch can be downloaded from:
http://packages.debian.org/stable/libs/libpam-modules
http://packages.debian.org/stable/libs/libselinux1
http://packages.debian.org/stable/libs/libsepol1
Extract the downloaded packages. For example:
# dpkg --extract libpam-modules_0.79-4_i386.deb <TEMPORARY_DIRECTORY>
# dpkg --extract libselinux1_1.32-3_i386.deb <TEMPORARY_DIRECTORY>
# dpkg --extract libsepol1_1.14-2_i386.deb <TEMPORARY_DIRECTORY>
Put the extracted shared libraries to /lib32/
# mkdir /lib32/security
# cp <TEMPORARY_DIRECTORY>/lib/security/* /lib32/security/
# cp <TEMPORARY_DIRECTORY>/lib/libselinux.so.1 /lib32/
# cp <TEMPORARY_DIRECTORY>/lib/libsepol.so.1 /lib32/
Add the path to the extracted libraries to /etc/pam.d/acronisagent
-----------
#%PAM-1.0
auth    required        /lib32/security/pam_unix.so
account required        /lib32/security/pam_unix.so
-----------
Restart Acronis True Image Echo Enterprise Linux Agent:
# /etc/init.d/acronis_trueimage_agent restart
  Q: I'm   trying   to  install   Acronis  True  Image  Echo
  Enterprise Linux Agent on my  Fedora Core 4,  but  getting
  the message that the software cannot be started. How can I
  solve this problem?
A:There is a incompatibility between Acronis True Image Echo
Enterprise Linux Agent and SELINUX included to Fedora Core 4.
To solve the problem  we suggest you to  disable  SELINUX in
/etc/selinux/config and restart the computer.Then you should
be able to use  Acronis  True  Image  Echo Enterprise  Linux
Agent successfully.
  Q: I'm trying to install Acronis True Image Echo Enterprise
  on  my Gentoo Linux, but getting the message  that snapapi
  modules cannot be loaded. How can I solve this problem?
A: Acronis True Image Echo Enterprise requires that you have
mounted  boot  partition  to  be  able  to  resolve   module
dependencies. To solve the problem, please  make  sure  that
your  boot  partition  is mounted then run the below command:
depmode -a
Then  you  should  be  able  to  use  Acronis True Image Echo
Enterprise successfully.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/8646a3c1/attachment.htm>

From sunhux at gmail.com  Fri Apr 17 13:00:07 2009
From: sunhux at gmail.com (sunhux G)
Date: Fri, 17 Apr 2009 21:00:07 +0800
Subject: [Linux-cluster] Re: OT: Error installing Acronis Enterprise agent :
	snapapi module not found
In-Reply-To: <60f08e700904170554y766776et2dabc500d950b4a5@mail.gmail.com>
References: <60f08e700904170554y766776et2dabc500d950b4a5@mail.gmail.com>
Message-ID: <60f08e700904170600v3e059c26k85eb03febdf03a4d@mail.gmail.com>

Some additional information :


 This is what I have :

# pwd
/usr/src
# ls
kernels  lpfcdfc  redhat  snapapi26-0.7.42

# ls kernels
2.6.9-42.EL-i686  2.6.9-67.0.7.EL-hugemem-i686  2.6.9-67.0.7.EL-i686
2.6.9-67.0.7.EL-smp-i686  2.6.9-67.EL-i686
# pwd
/lib/modules/2.6.9-67.0.7.ELsmp
# ls -l build
lrwxrwxrwx  1 root root 41 Mar 31  2008 build ->
/usr/src/kernels/2.6.9-67.0.7.EL-smp-i686
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/ced4db34/attachment.htm>

From gordan at bobich.net  Fri Apr 17 13:59:20 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Fri, 17 Apr 2009 14:59:20 +0100
Subject: [Linux-cluster] sharing same ip with 2 services on two nodes
In-Reply-To: <3128ba140904170523g243c2206lcc73f8755f3d7a6c@mail.gmail.com>
References: <3128ba140904170357v4f3c4528o5b1d6de4d0b21e04@mail.gmail.com>	<8a5668960904170403r3bc9c474xfcabed93832d3dc5@mail.gmail.com>	<3128ba140904170423x18cdeac0o851f3cbe43e9de05@mail.gmail.com>	<06d2fdcb2614a82906c6ae34ca8440f6@localhost>	<3128ba140904170501m60d17d5ch9c6901ac3cec3e82@mail.gmail.com>	<167d516bc744af67606fd4ce89a68750@localhost>
	<3128ba140904170523g243c2206lcc73f8755f3d7a6c@mail.gmail.com>
Message-ID: <b616fd31027236ffcc368d95beb70bff@localhost>

If you want reasonably transparent failover, you should always float the IP
address to the surviving node. If the service fails, deal with it locally
with a monitoring/watchdog script of some sort. If the service isn't
recoverable locally, have the machine fail itself, and the services will
float to the surviving one.

If you split each service to a separate IP, though, you can do partial
service/IP failover, as I think you're saying. That means you don't have to
fail the whole machine if a service fails and won't restart. It should also
provide some load balancing between the nodes since not all services will
run on all nodes. Depending on where the application bottlenecks, this may
be advantageous.

On Fri, 17 Apr 2009 14:23:40 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> Hi again,
> 
> now we speak the same language ;-)  what do your think about my
question?:
> 
> Has it sense to configure ip resources as shared?
>   I think it must be better to configure always as a private resource,
isnt
> it?
> 
> ESG
> 
> 
> 2009/4/17 Gordan Bobic <gordan at bobich.net>
> 
>> Right, I see what you mean now. No, you can't do that - the service
>> always
>> has to run on the node where it's IP is. But you can fail over the whole
>> resource group (IP + services) together if required.
>>
>> Gordan
>>
>> On Fri, 17 Apr 2009 14:01:05 +0200, ESGLinux <esggrupos at gmail.com>
wrote:
>> > Hi Gordan,
>> >
>> > I?m talking about the floating IP. (I called it service IP, is wrog
>> > the
>> > name?)
>> >
>> > The real ip of the two nodes is diferent and share the floating IP but
>> > in
>> > that scenario when one service fails (no the whole node) I get one
>> service
>> > in one node and one service on the other node but I get the error I
>> posted.
>> >
>> > for example
>> > floating IP + BBDD is runing on node1
>> > floating IP + HTTPD is running on node2
>> >
>> > is this possible?
>> >
>> > thanks
>> >
>> > ESG
>> >
>> >
>> >
>> > 2009/4/17 Gordan Bobic <gordan at bobich.net>
>> >
>> >> You can't have the same IP on multiple machines at the same time.
That
>> >> much should be obvious before even thinking about clustering.
>> >>
>> >> You can, however configure floating IPs as resources that RHCS will
>> >> fail over between cluster nodes. Only one node will have a particular
>> >> IP address at any one time, but if that node fails, the floating IP
>> >> will get migrated to one of the surviving nodes.
>> >>
>> >> Any number of services can run on a floating IP. There is no need
>> >> to have one floating IP per service.
>> >>
>> >> Gordan
>> >>
>> >> On Fri, 17 Apr 2009 13:23:30 +0200, ESGLinux <esggrupos at gmail.com>
>> wrote:
>> >> > Hi,
>> >> >
>> >> > thanks for your answer
>> >> >
>> >> > I suspected  so,
>> >> >
>> >> > can I say as an axiom: One service needs One IP? ( One man, One
vote
>> >> > ;-)
>> >> )
>> >> >
>> >> > and if is this true, has it sense to configure ip resources as
>> >> > shared?
>> >> > it
>> >> > must be better to configure as a private resource, isnt it?
>> >> >
>> >> > Greetings,
>> >> >
>> >> > ESG
>> >> >
>> >> > 2009/4/17 Juan Ramon Martin Blanco <robejrm at gmail.com>
>> >> >
>> >> >>
>> >> >>
>> >> >> On Fri, Apr 17, 2009 at 12:57 PM, ESGLinux <esggrupos at gmail.com>
>> >> >> wrote:
>> >> >>
>> >> >>> Hello all,
>> >> >>>
>> >> >>> I have a doubt about using the same service ip with two diferent
>> >> >>> services
>> >> >>> in two diferent nodes of my two-nodes-cluster.
>> >> >>
>> >> >>
>> >> >>>
>> >> >>> I?ll explain it a litle:
>> >> >>>
>> >> >>> I have two services: BBDD and HTTPD
>> >> >>>
>> >> >>> I have configured a shared IP: 192.168.1.100
>> >> >>>
>> >> >> Hi,
>> >> >> You _must_ use a different  IP, cannot have the same IP on
>> >> >> different
>> >> >> machines.
>> >> >>
>> >> >> Greetings,
>> >> >> Juanra
>> >> >>
>> >> >>>
>> >> >>> two nodes: node1 and node2.
>> >> >>>
>> >> >>> When I run the two services on node1 all runs ok. If I try to
>> >> >>> relocate
>> >> >>> one
>> >> >>> service, with
>> >> >>>
>> >> >>> clusvcadm -r BBDD -m node2
>> >> >>>
>> >> >>> it fails with the error:
>> >> >>>  in.rdiscd[4805]: setsockopt (IP_ADD_MEMBERSHIP): Address already
>> >> >>>  in
>> >> use
>> >> >>>  in.rdiscd[4805]: Failed joining addresses
>> >> >>>
>> >> >>> I suposse that is because one services is on node1 with the ip
and
>> >> >>> the
>> >> >>> other trys to run in node2 with the same ip.
>> >> >>>
>> >> >>> So my question is if it is possible to run the services this way
>> >> >>> or
>> I
>> >> >>> need
>> >> >>> an ip per service?
>> >> >>> (I have tested that with 2 diferent ips I can run BBDD on node1
an
>> >> HTTPD
>> >> >>> on node2)
>> >> >>>
>> >> >>> Thanks in advance
>> >> >>>
>> >> >>> ESG
>> >> >>>
>> >> >>>
>> >> >>> --
>> >> >>> Linux-cluster mailing list
>> >> >>> Linux-cluster at redhat.com
>> >> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> >> >>>
>> >> >>
>> >> >>
>> >> >> --
>> >> >> Linux-cluster mailing list
>> >> >> Linux-cluster at redhat.com
>> >> >> https://www.redhat.com/mailman/listinfo/linux-cluster
>> >> >>
>> >>
>> >> --
>> >> Linux-cluster mailing list
>> >> Linux-cluster at redhat.com
>> >> https://www.redhat.com/mailman/listinfo/linux-cluster
>> >>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>


From jweismann at gmail.com  Fri Apr 17 20:24:13 2009
From: jweismann at gmail.com (Jonathan Weismann)
Date: Fri, 17 Apr 2009 16:24:13 -0400
Subject: [Linux-cluster] CVS in a cluster
Message-ID: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>

All,
        I've got a quick question that perhaps someone can answer as my
googlefu and searches haven't been fruitful:

Is it possible to create a cluster of RHEL5 servers and place CVS servers
inside them using shared storage of some sort (Fibre GNBD ISCSI etc). If so
how would one go about it and are there any docs, wikis, web sites dedicated
to this setup.

I know it CAN be done, but I can't find any information on HOW. Or is it
just as simple as building the servers, installing clustering software, then
throwing the CVS services at it and saying "Use this shared resource"?


Thanks in advance and if this question was already asked, just point me in
the right direction as I didn't find it in the last hours of searching :)

J.Weismann
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090417/ba964d60/attachment.htm>

From jeff.sturm at eprize.com  Sat Apr 18 14:54:55 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Sat, 18 Apr 2009 10:54:55 -0400
Subject: [Linux-cluster] CVS in a cluster
In-Reply-To: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>
References: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB8BD@hugo.eprize.local>

Place your CVSROOT on a mounted GFS filesystem, then the files can be
access from 2 or more cluster nodes at once.  The rest depends on how
you use it.  Do you use pserver protocol, ssh, or access your CVS
repositories locally?
 
What are you trying to achieve with cluster services?  E.g. high
availability?


________________________________

	From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jonathan Weismann
	Sent: Friday, April 17, 2009 4:24 PM
	To: linux-cluster at redhat.com
	Subject: [Linux-cluster] CVS in a cluster
	
	
	All,
	        I've got a quick question that perhaps someone can
answer as my googlefu and searches haven't been fruitful:
	
	Is it possible to create a cluster of RHEL5 servers and place
CVS servers inside them using shared storage of some sort (Fibre GNBD
ISCSI etc). If so how would one go about it and are there any docs,
wikis, web sites dedicated to this setup.
	
	I know it CAN be done, but I can't find any information on HOW.
Or is it just as simple as building the servers, installing clustering
software, then throwing the CVS services at it and saying "Use this
shared resource"?
	
	
	Thanks in advance and if this question was already asked, just
point me in the right direction as I didn't find it in the last hours of
searching :)
	
	J.Weismann
	
	
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090418/01f136e5/attachment.htm>

From 123.oleg at gmail.com  Sat Apr 18 16:01:18 2009
From: 123.oleg at gmail.com (OlegG)
Date: Sat, 18 Apr 2009 20:01:18 +0400
Subject: [Linux-cluster] Updating Quorum votes after adding a new node.
Message-ID: <af57f1ac0904180901y7303ce69r38bdefc5f4abdef5@mail.gmail.com>

Hello!

I have 2-node CMAN cluster with 1 vote for each node
Also I have Quorum disk - it has 1 vote (on 1 less then number of nodes).

I added then 1 node - and update configuration so that Expected votes=3
 But Quorum votes didnot change - it 's equal to 1, but in config i have
increased number of Qdisk votes to 2 votes.

So for success update i need to reboot all nodes after adding new node to
cluster.

The question is - how to propagate new number of Qdisk votes of cluster
without rebooting cluster?

Thank you.


From jweismann at gmail.com  Sat Apr 18 20:04:06 2009
From: jweismann at gmail.com (Jonathan Weismann)
Date: Sat, 18 Apr 2009 16:04:06 -0400
Subject: [Linux-cluster] CVS in a cluster
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB8BD@hugo.eprize.local>
References: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8BD@hugo.eprize.local>
Message-ID: <caaac3a70904181304y6d15ec0el4468bbacaf4d79d0@mail.gmail.com>

Pserver: Yes we use Pserver configuration to access our cvs repositories. I
wasn't the inital creator of this setup but I know we have to edit that
pserver file to add in repos.

We have several hundred repos scattered over 5 servers and the goal is to
consolodate them down to one single source that's highly available with
failover and scalability. Where if we need more storage/cpu/IO we just throw
another server at it. If one dies another picks it back up.

Hence I turned to Clustering with RHEL and my questions herein. If there's a
better solution to consolodate all my server down to one set that would do
all above I'm all ears but the 'net is very barren when it comes to CVS ,
High availability, Failover, Redundancy, etc.


On Sat, Apr 18, 2009 at 10:54 AM, Jeff Sturm <jeff.sturm at eprize.com> wrote:

>  Place your CVSROOT on a mounted GFS filesystem, then the files can be
> access from 2 or more cluster nodes at once.  The rest depends on how you
> use it.  Do you use pserver protocol, ssh, or access your CVS repositories
> locally?
>
> What are you trying to achieve with cluster services?  E.g. high
> availability?
>
>  ------------------------------
> *From:* linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat.com] *On Behalf Of *Jonathan Weismann
> *Sent:* Friday, April 17, 2009 4:24 PM
> *To:* linux-cluster at redhat.com
> *Subject:* [Linux-cluster] CVS in a cluster
>
> All,
>         I've got a quick question that perhaps someone can answer as my
> googlefu and searches haven't been fruitful:
>
> Is it possible to create a cluster of RHEL5 servers and place CVS servers
> inside them using shared storage of some sort (Fibre GNBD ISCSI etc). If so
> how would one go about it and are there any docs, wikis, web sites dedicated
> to this setup.
>
> I know it CAN be done, but I can't find any information on HOW. Or is it
> just as simple as building the servers, installing clustering software, then
> throwing the CVS services at it and saying "Use this shared resource"?
>
>
> Thanks in advance and if this question was already asked, just point me in
> the right direction as I didn't find it in the last hours of searching :)
>
> J.Weismann
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Jonathan Weismann
CCNA
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090418/b2fbf17a/attachment.htm>

From sunhux at gmail.com  Sun Apr 19 04:03:39 2009
From: sunhux at gmail.com (sunhux G)
Date: Sun, 19 Apr 2009 12:03:39 +0800
Subject: [Linux-cluster] How to extract an rpm for use on another server?
Message-ID: <60f08e700904182103i5cb1ae64me74915fcc9aca142@mail.gmail.com>

 Hi,


I've just installed an rpm ( VMWare-esx-vmx-3.5.0-158869.i386.rpm )
and this caused the entire ESX/VMWare to panic/crash whenever a
VM in this VMWare is started.

I've just done "rpm -e --nodeps -allmatches VMware-esx-vms-3.5.-158869)
and would like to put back the old rpm 64607 :

   VMWare-esx-3.5.0-64607


However, I could not find 64607 from anywhere (requires a code), so I'll
need to extract this 64607 rpm from my other ESX servers : is this possible
& what's the command.

If it's not possible to build the 64607 rpm from an existing ESX which has
not been patched yet, appreciate if someone can point me to where I could
download it


Tks
U
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090419/f02d56d9/attachment.htm>

From Joel.Becker at oracle.com  Sun Apr 19 06:30:14 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Sat, 18 Apr 2009 23:30:14 -0700
Subject: [Linux-cluster] CVS in a cluster
In-Reply-To: <caaac3a70904181304y6d15ec0el4468bbacaf4d79d0@mail.gmail.com>
References: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8BD@hugo.eprize.local>
	<caaac3a70904181304y6d15ec0el4468bbacaf4d79d0@mail.gmail.com>
Message-ID: <20090419063014.GB15686@ca-server1.us.oracle.com>

On Sat, Apr 18, 2009 at 04:04:06PM -0400, Jonathan Weismann wrote:
> Pserver: Yes we use Pserver configuration to access our cvs repositories. I
> wasn't the inital creator of this setup but I know we have to edit that
> pserver file to add in repos.

	CVS uses lock files.  So ocfs2 and gfs2 should both work just
fine with pserver processes running on multiple nodes.  I think you just
set up your cluster and run pserver on all the nodes, and you are good
to go.  Use a round-robin CNAME to make one server address speak to all
nodes.

Joel

-- 

"The nice thing about egotists is that they don't talk about other
 people."
         - Lucille S. Harper

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From Luis.Cerezo at pgs.com  Sun Apr 19 15:58:29 2009
From: Luis.Cerezo at pgs.com (Luis Cerezo)
Date: Sun, 19 Apr 2009 10:58:29 -0500
Subject: [Linux-cluster] How to extract an rpm for use on another server?
In-Reply-To: <60f08e700904182103i5cb1ae64me74915fcc9aca142@mail.gmail.com>
References: <60f08e700904182103i5cb1ae64me74915fcc9aca142@mail.gmail.com>
Message-ID: <8413932B-C62F-4D78-BB2F-EFADC82FBD77@pgs.com>

A little off topic isn't it?

On Apr 18, 2009, at 23:04, "sunhux G" <sunhux at gmail.com> wrote:

> Hi,
>
>
> I've just installed an rpm ( VMWare-esx-vmx-3.5.0-158869.i386.rpm )
> and this caused the entire ESX/VMWare to panic/crash whenever a
> VM in this VMWare is started.
>
> I've just done "rpm -e --nodeps -allmatches VMware-esx- 
> vms-3.5.-158869)
> and would like to put back the old rpm 64607 :
>
>    VMWare-esx-3.5.0-64607
>
>
> However, I could not find 64607 from anywhere (requires a code), so  
> I'll
> need to extract this 64607 rpm from my other ESX servers : is this  
> possible
> & what's the command.
>
> If it's not possible to build the 64607 rpm from an existing ESX  
> which has
> not been patched yet, appreciate if someone can point me to where I  
> could
> download it
>
>
> Tks
> U
>
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


This email and any files contained therein is confidential and may contain privileged information.  If you are not the named addressee(s) or you have otherwise received this in error, you should not distribute or copy this e-mail or use any of its content for any purpose. Please notify the sender immediately by e-mail if you have received this e-mail in error and delete it from your system


From ccaulfie at redhat.com  Mon Apr 20 07:10:57 2009
From: ccaulfie at redhat.com (Chrissie Caulfield)
Date: Mon, 20 Apr 2009 08:10:57 +0100
Subject: [Linux-cluster] Updating Quorum votes after adding a new node.
In-Reply-To: <af57f1ac0904180901y7303ce69r38bdefc5f4abdef5@mail.gmail.com>
References: <af57f1ac0904180901y7303ce69r38bdefc5f4abdef5@mail.gmail.com>
Message-ID: <49EC2001.8040903@redhat.com>

OlegG wrote:
> Hello!
> 
> I have 2-node CMAN cluster with 1 vote for each node
> Also I have Quorum disk - it has 1 vote (on 1 less then number of nodes).
> 
> I added then 1 node - and update configuration so that Expected votes=3
>  But Quorum votes didnot change - it 's equal to 1, but in config i have
> increased number of Qdisk votes to 2 votes.
> 
> So for success update i need to reboot all nodes after adding new node to
> cluster.
> 
> The question is - how to propagate new number of Qdisk votes of cluster
> without rebooting cluster?
> 

I would have thought that simply restarting qdiskd would update the
votes it has.
-- 

Chrissie


From beekhof at gmail.com  Mon Apr 20 07:38:15 2009
From: beekhof at gmail.com (Andrew Beekhof)
Date: Mon, 20 Apr 2009 09:38:15 +0200
Subject: [Linux-cluster] CVS in a cluster
In-Reply-To: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>
References: <caaac3a70904171324j1e11778cn6b6a3825822a1033@mail.gmail.com>
Message-ID: <26ef5e70904200038i30140bd3haf58296adcaf631e@mail.gmail.com>

On Fri, Apr 17, 2009 at 22:24, Jonathan Weismann <jweismann at gmail.com> wrote:
> All,
> ??????? I've got a quick question that perhaps someone can answer as my
> googlefu and searches haven't been fruitful:
>
> Is it possible to create a cluster of RHEL5 servers and place CVS servers
> inside them using shared storage of some sort (Fibre GNBD ISCSI etc). If so
> how would one go about it and are there any docs, wikis, web sites dedicated
> to this setup.
>
> I know it CAN be done, but I can't find any information on HOW. Or is it
> just as simple as building the servers, installing clustering software, then
> throwing the CVS services at it and saying "Use this shared resource"?

The question that really needs to be asked, now that tools like Git
and Mercurial are out there, is "SHOULD it be done at all?" :-)


From sergey.skipidarov at gmail.com  Mon Apr 20 13:27:57 2009
From: sergey.skipidarov at gmail.com (Sergey Skipidarov)
Date: Mon, 20 Apr 2009 17:27:57 +0400
Subject: [Linux-cluster] gfs and ext3
Message-ID: <e3327c360904200627m227a11a3t268566cff4c89fdb@mail.gmail.com>

HI,

i have a problem with gfs..
3 nodes: (rhel5.2 x64) connected to san network to eva4400.
and i have one gfs1 file system (about 1TB)shared through three nodes
if i place php-code on gfs fs - i have performance in 10 times lower
than on ext3 fs in the same storage

test i do from one node (1), second and third node doing nothing..
only mounted a gfs partition

when php-code is being on gfs vmstat look like this

procs -----------memory---------- ---swap-- -----io---- --system--
-----cpu------
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us
sy id wa st
 7  0    308 7488148   3032 176300    0    0   248   152 1080 347400
3 28 68  1  0
 5  0    308 7487032   3032 176312    0    0     1     1 1014 387835
2 31 67  0  0
 1  0    308 7486504   3032 176692    0    0     0     0 1011 386325
2 30 68  0  0
13  0    308 7486380   3032 176700    0    0     0     0 1017 391721
2 31 67  0  0
 5  0    308 7485884   3032 176784    0    0     0     0 1011 378043
3 29 68  0  0
 5  0    308 7485512   3056 176760    0    0     5   321 1030 388592
2 32 66  0  0
 8  0    308 7485388   3056 176864    0    0     0     0 1011 375568
2 30 68  0  0

and on ext3 fs

procs -----------memory---------- ---swap-- -----io---- --system--
-----cpu------
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
60  0    308 6731144  57744 184440    0    0     0     0 21343 56952
53 42  5  0  0
44  0    308 6730152  57756 184540    0    0     0    60 20975 56834
53 41  6  0  0
48  0    308 6728912  57760 184620    0    0     4     0 21094 60327
52 41  6  0  0
27  0    308 6729284  57764 184792    0    0     1     1 21464 58993
48 36 16  0  0
 8  0    308 6730028  57792 184852    0    0     0  1004 21048 52217
42 29 29  0  0
11  0    308 6730152  57792 184960    0    0     0     0 20631 48064
33 21 46  0  0
 4  0    308 6729656  57804 185040    0    0     0    52 19673 38394
22 13 65  0  0
 4  0    308 6729284  57804 185116    0    0     5     1 18023 32104
19 10 71  0  0
 4  0    308 6729284  57804 185184    0    0     0     0 17620 30895
18 10 73  0  0


context switches on gfs about 380000/s, but on ext3 about 50000/s

a tried tune gfs:
- mount noatime,noquota,nodiratime
- set statfs_fast = 1
- increasing statfs_slots

how i can increase performance ?

--
wbr, Sergey Skipidarov


From dist-list at LEXUM.UMontreal.CA  Mon Apr 20 14:26:42 2009
From: dist-list at LEXUM.UMontreal.CA (dist-list at LEXUM.UMontreal.CA)
Date: Mon, 20 Apr 2009 10:26:42 -0400
Subject: [Linux-cluster] Xen interface and clustering
Message-ID: <49EC8622.3050400@lexum.umontreal.ca>

Hello,
I just installed a 4 nodes cluster with RH 5.3. Each node have 6 
interfaces. It will be our Xen cluster.

I create a new VLAN and all eth0 are connected to it for cluster 
communication only. They will be no VM connected to eth0.

Can I still use eth0 as my multicast interface in cluster.conf or I 
should use peth0 ?

I suppose I could also remove eth0 from xen and use eth0 ?

Regards,


From jumanjiman at gmail.com  Mon Apr 20 14:51:37 2009
From: jumanjiman at gmail.com (Paul Morgan)
Date: Mon, 20 Apr 2009 14:51:37 +0000
Subject: [Linux-cluster] Xen interface and clustering
Message-ID: <370960437-1240239135-cardhu_decombobulator_blackberry.rim.net-746573884-@bxe1239.bisx.prod.on.blackberry>

You should be fine with eth0.
I prefer using xenbr0 instead of virbr0 so that I don't have to mess with the virtual routes or nat.
-paul

------Original Message------
From: dist-list at LEXUM.UMontreal.CA
Sender: linux-cluster-bounces at redhat.com
To: linux clustering
ReplyTo: linux clustering
Subject: [Linux-cluster] Xen interface and clustering
Sent: Apr 20, 2009 10:26

Hello,
I just installed a 4 nodes cluster with RH 5.3. Each node have 6 
interfaces. It will be our Xen cluster.

I create a new VLAN and all eth0 are connected to it for cluster 
communication only. They will be no VM connected to eth0.

Can I still use eth0 as my multicast interface in cluster.conf or I 
should use peth0 ?

I suppose I could also remove eth0 from xen and use eth0 ?

Regards,

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From esggrupos at gmail.com  Mon Apr 20 15:29:13 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Mon, 20 Apr 2009 17:29:13 +0200
Subject: [Linux-cluster] iscsi doubt
Message-ID: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>

Hello,

I have a 2-nodes cluster. I have a shared partition mounted via iscsi. Only
one node is active at a time but I see that when I write in one node in the
other node I cant see the files created,

ie:

node1
  /dev/sda                972404    107592    815416  12% /iscsivol

node2
/dev/sda                972404    107596    815412  12% /iscsivol

 first the numbers are different and /dev/sda is the same partition. When I
create a file with

#touch test
root at node2 iscsivol]# ll
total 24
drwx------ 2 root  root  16384 mar 23 17:25 lost+found
-rw-r--r-- 1 root  root      0 abr 20 10:39 test

in the other node:
[root at node1 iscsivol]# ll
total 24
drwx------ 2 root  root  16384 mar 23 17:25 lost+found

and after a time i get:
[root at node1 iscsivol]# ll
total 24
drwx------ 2 root  root  16384 mar 23 17:25 lost+found
?--------- ? ?     ?         ?            ? test


the filesystem on it is ext3 and I dont want to implement gfs.

Is necessary any special configuration to update de iscsi targets when one
client writes to it and with correct info?

Thanks in advance

ESG
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/456576df/attachment.htm>

From jeff.sturm at eprize.com  Mon Apr 20 15:45:56 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Mon, 20 Apr 2009 11:45:56 -0400
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>

You can't mount ext3 from more than one node (unless all mounts are
read-only).  You will immediately corrupt the volume.
 
You have to use a clustered filesystem.  If for some reason you don't
want to implement GFS, try another:
http://en.wikipedia.org/wiki/Shared_disk_file_system


________________________________

	From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of ESGLinux
	Sent: Monday, April 20, 2009 11:29 AM
	To: linux clustering
	Subject: [Linux-cluster] iscsi doubt
	
	
	Hello, 
	
	I have a 2-nodes cluster. I have a shared partition mounted via
iscsi. Only one node is active at a time but I see that when I write in
one node in the other node I cant see the files created, 
	
	ie:
	
	node1
	  /dev/sda                972404    107592    815416  12%
/iscsivol
	
	node2
	/dev/sda                972404    107596    815412  12%
/iscsivol
	
	 first the numbers are different and /dev/sda is the same
partition. When I create a file with
	
	#touch test
	root at node2 iscsivol]# ll
	total 24
	drwx------ 2 root  root  16384 mar 23 17:25 lost+found
	-rw-r--r-- 1 root  root      0 abr 20 10:39 test
	
	in the other node:
	[root at node1 iscsivol]# ll
	total 24
	drwx------ 2 root  root  16384 mar 23 17:25 lost+found
	
	and after a time i get:
	[root at node1 iscsivol]# ll
	total 24
	drwx------ 2 root  root  16384 mar 23 17:25 lost+found
	?--------- ? ?     ?         ?            ? test
	
	
	the filesystem on it is ext3 and I dont want to implement gfs. 
	
	Is necessary any special configuration to update de iscsi
targets when one client writes to it and with correct info?
	
	Thanks in advance
	
	ESG
	
	
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/faaeb707/attachment.htm>

From vikash at netvigator.com  Mon Apr 20 15:22:41 2009
From: vikash at netvigator.com (Vikash Khatuwala)
Date: Mon, 20 Apr 2009 23:22:41 +0800
Subject: [Linux-cluster] GFS performance.
Message-ID: <200904201549.n3KFndL0008930@mx3.redhat.com>

Hello,

OS : CentOS 5.2
FS : GFS
Journals : 4
Nodes : 1 (currently testing)
iSCSI Target : Dell MD3000i
Disks : 5 x SAS 15K RPM 300GB.

I would like to know what is the expected performance penalty for using GFS.

Currently I have a single node cluster for testing using the lock_dlm 
over an iSCSI RAID 5 disk system. Ive create 2 volumes on the same 
RAID 5 disk set 1st one is GFS and the second one is ext3. Ive used 
"fio" for my performance testing and found that ext3 is about 4 times 
faster then GFS. The test is using random reads of 4KB. If instead I 
use sequential reads of 256K then the performance is very close to 
the ext3 filesystem, but thats far from our practical environment.

This seems to be quite extreme and so would like to know how to tune 
the performance on GFS. Ive tried guidelines from:
http://www.redhat.com/promo/summit/2008/downloads/pdf/Thursday/Summit08presentation_GFSBestPractices_Final.pdf

However they only improve CPU utilization but disk IO performance 
does not improve by much.

Now I need to make a decision to go with GFS or not, clearly at 4 
times less performance we cannot afford it, also it doesn't sound 
right so would like to find out whats wrong.

Thanks and hope to get some pointers.

Regards,
Vikash.


From gordan at bobich.net  Mon Apr 20 16:02:22 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Mon, 20 Apr 2009 17:02:22 +0100
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <200904201549.n3KFndL0008930@mx3.redhat.com>
References: <200904201549.n3KFndL0008930@mx3.redhat.com>
Message-ID: <56b092abeda5f69b36548c68060815d1@localhost>

Are you mounting with noatime parameter? That's the only thing I've found
that makes any significant difference.

4x slowdown may be on the slow side for a single node, but it's in the
right ball park. It's not going to get close to ext3 in terms of
performance. Also expect a further slow-down of a few factors as you up the
number of nodes for concurrent small random reads case.

In general - a clustered shared disk fs will never be as fast as a
non-shared file system. There is no way around the fact that it has to do
more work to arbitrate the data access between the nodes.

Gordan

On Mon, 20 Apr 2009 23:22:41 +0800, Vikash Khatuwala
<vikash at netvigator.com>
wrote:
> Hello,
> 
> OS : CentOS 5.2
> FS : GFS
> Journals : 4
> Nodes : 1 (currently testing)
> iSCSI Target : Dell MD3000i
> Disks : 5 x SAS 15K RPM 300GB.
> 
> I would like to know what is the expected performance penalty for using
> GFS.
> 
> Currently I have a single node cluster for testing using the lock_dlm 
> over an iSCSI RAID 5 disk system. Ive create 2 volumes on the same 
> RAID 5 disk set 1st one is GFS and the second one is ext3. Ive used 
> "fio" for my performance testing and found that ext3 is about 4 times 
> faster then GFS. The test is using random reads of 4KB. If instead I 
> use sequential reads of 256K then the performance is very close to 
> the ext3 filesystem, but thats far from our practical environment.
> 
> This seems to be quite extreme and so would like to know how to tune 
> the performance on GFS. Ive tried guidelines from:
>
http://www.redhat.com/promo/summit/2008/downloads/pdf/Thursday/Summit08presentation_GFSBestPractices_Final.pdf
> 
> However they only improve CPU utilization but disk IO performance 
> does not improve by much.
> 
> Now I need to make a decision to go with GFS or not, clearly at 4 
> times less performance we cannot afford it, also it doesn't sound 
> right so would like to find out whats wrong.
> 
> Thanks and hope to get some pointers.
> 
> Regards,
> Vikash.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From esggrupos at gmail.com  Mon Apr 20 16:18:22 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Mon, 20 Apr 2009 18:18:22 +0200
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
Message-ID: <3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>

Hello,

first, thanks for your answer,

I suspected it but why can i do it with NFS. ?

the nodes never are going to be active at the same time so I can mount the
shares via NFS. With NFS when I create a file in a share automatically i got
it in the shared mounted by all the clients.

In this case I don?t need to write to the share concurrently

can be this configuration a problem?

Thanks again

ESG


2009/4/20 Jeff Sturm <jeff.sturm at eprize.com>

>  You can't mount ext3 from more than one node (unless all mounts are
> read-only).  You will immediately corrupt the volume.
>
> You have to use a clustered filesystem.  If for some reason you don't want
> to implement GFS, try another:
> http://en.wikipedia.org/wiki/Shared_disk_file_system
>
>  ------------------------------
> *From:* linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat.com] *On Behalf Of *ESGLinux
> *Sent:* Monday, April 20, 2009 11:29 AM
> *To:* linux clustering
> *Subject:* [Linux-cluster] iscsi doubt
>
> Hello,
>
> I have a 2-nodes cluster. I have a shared partition mounted via iscsi. Only
> one node is active at a time but I see that when I write in one node in the
> other node I cant see the files created,
>
> ie:
>
> node1
>   /dev/sda                972404    107592    815416  12% /iscsivol
>
> node2
> /dev/sda                972404    107596    815412  12% /iscsivol
>
>  first the numbers are different and /dev/sda is the same partition. When I
> create a file with
>
> #touch test
> root at node2 iscsivol]# ll
> total 24
> drwx------ 2 root  root  16384 mar 23 17:25 lost+found
> -rw-r--r-- 1 root  root      0 abr 20 10:39 test
>
> in the other node:
> [root at node1 iscsivol]# ll
> total 24
> drwx------ 2 root  root  16384 mar 23 17:25 lost+found
>
> and after a time i get:
> [root at node1 iscsivol]# ll
> total 24
> drwx------ 2 root  root  16384 mar 23 17:25 lost+found
> ?--------- ? ?     ?         ?            ? test
>
>
> the filesystem on it is ext3 and I dont want to implement gfs.
>
> Is necessary any special configuration to update de iscsi targets when one
> client writes to it and with correct info?
>
> Thanks in advance
>
> ESG
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/4b59a7e1/attachment.htm>

From gordan at bobich.net  Mon Apr 20 16:25:14 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Mon, 20 Apr 2009 17:25:14 +0100
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
	<3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>
Message-ID: <c63a83b07142b32f75eab466919321e3@localhost>

On Mon, 20 Apr 2009 18:18:22 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> Hello,
> 
> first, thanks for your answer,
> 
> I suspected it but why can i do it with NFS. ?

Not sure I understand your question. NFS is a network file system like CIFS
specifically designed to be mounted from multiple clients simultaneously.
ext3 is only designed with a single accessor in mind.

> the nodes never are going to be active at the same time so I can mount
the
> shares via NFS. With NFS when I create a file in a share automatically i
> got it in the shared mounted by all the clients.

I still don't understand your question - that is what NFS is designed for.

> In this case I don?t need to write to the share concurrently
> 
> can be this configuration a problem?

No, it's fundamentally impossible. In order to have a FS that can be
mounted simultaneously from multiple nodes, it has to be aware of multiple
nodes accessing it, which means that it needs coherent caching. Local file
systems like ext3 don't have this. When one node writes to the ext3 file
system, the other node will have cached the inodes as they were originally,
and it won't bother hitting the disk to re-read the contents, it'll just
return what it already has cached. And almost certainly corrupt the file
system in the process.

You cannot have a shared ext3 volume with writing enabled. Period.

Gordan


From esggrupos at gmail.com  Mon Apr 20 16:56:23 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Mon, 20 Apr 2009 18:56:23 +0200
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <c63a83b07142b32f75eab466919321e3@localhost>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
	<3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>
	<c63a83b07142b32f75eab466919321e3@localhost>
Message-ID: <3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>

Hello,


2009/4/20 Gordan Bobic <gordan at bobich.net>

> On Mon, 20 Apr 2009 18:18:22 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> > Hello,
> >
> > first, thanks for your answer,
> >
> > I suspected it but why can i do it with NFS. ?
>
> Not sure I understand your question. NFS is a network file system like CIFS
> specifically designed to be mounted from multiple clients simultaneously.
> ext3 is only designed with a single accessor in mind.


I?ll try to explain myself

I have a partition /dev/sda

/dev/sda on /iscsivol type ext3 (rw)

but this partition is a target iscsi on another server. I format the
partition with ext3 but its not a local disk, is a target iscsi.

with this configuration I have the filesystem corrupted.

second scenario

I have
192.168.1.198:/nfsexport/                       6983168   2839168   3783552
43% /mnt

but the parttion 192.168.1.198:/nfsexport/ is again ext3 the diference is
that I use nfs as network protocol instead of iscsi.


>
>
> > the nodes never are going to be active at the same time so I can mount
> the
> > shares via NFS. With NFS when I create a file in a share automatically i
> > got it in the shared mounted by all the clients.
>
> I still don't understand your question - that is what NFS is designed for.


Yes I agree with you, but I thought with iscsi i can do the same as with
NFS.


>
>
> > In this case I don?t need to write to the share concurrently
> >
> > can be this configuration a problem?
>
> No, it's fundamentally impossible. In order to have a FS that can be
> mounted simultaneously from multiple nodes, it has to be aware of multiple
> nodes accessing it, which means that it needs coherent caching. Local file
> systems like ext3 don't have this. When one node writes to the ext3 file
> system, the other node will have cached the inodes as they were originally,
> and it won't bother hitting the disk to re-read the contents, it'll just
> return what it already has cached. And almost certainly corrupt the file
> system in the process.
>
> You cannot have a shared ext3 volume with writing enabled. Period.


ok understand it,

but (always there is a but ...)

I only want to share a directory in which one node writes at one and when it
fails the other node has the diretory mounted with the data and can write to
it.

Before I have known about cluster my decission would been to mount the
shares with NFS. Now I want to be more sofisticated and want to use cluster
tools, so I thought to mount it with iSCSI instead of NFS, but always with
the ext3 as the underlying filesystem.

Perphaps this is my mistake.

any suggestion that makes me see the light ;.)

TIA

ESG


>
>
> Gordan
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/229b376e/attachment.htm>

From jeff.sturm at eprize.com  Mon Apr 20 17:00:48 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Mon, 20 Apr 2009 13:00:48 -0400
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <200904201549.n3KFndL0008930@mx3.redhat.com>
References: <200904201549.n3KFndL0008930@mx3.redhat.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash 
> Khatuwala
> Sent: Monday, April 20, 2009 11:23 AM
> To: linux-cluster at redhat.com
> Subject: [Linux-cluster] GFS performance.
> 
> OS : CentOS 5.2
> FS : GFS

Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to have some
performance improvements over GFS1.

> Now I need to make a decision to go with GFS or not, clearly 
> at 4 times less performance we cannot afford it, also it 
> doesn't sound right so would like to find out whats wrong.

Be careful with benchmarks, as they often do not give you a good
indication of real-world performance.

Are you more concerned with latency or throughput?  Any single read will
almost certainly take longer to complete over GFS than EXT3.  There's
simply more overhead involved with any cluster filesystem.  However,
that's not to say you're limited as to how many reads you can execute in
parallel.  So the overall number of reads you can perform in a given
time interval may not be 4x at all (are you running a parallel
benchmark?)

Jeff


From cthulhucalling at gmail.com  Mon Apr 20 17:07:42 2009
From: cthulhucalling at gmail.com (Ian Hayes)
Date: Mon, 20 Apr 2009 13:07:42 -0400
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
	<3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>
	<c63a83b07142b32f75eab466919321e3@localhost>
	<3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
Message-ID: <36df569a0904201007s2d1a3978u7c3600c081820915@mail.gmail.com>

On Mon, Apr 20, 2009 at 12:56 PM, ESGLinux <esggrupos at gmail.com> wrote:

> Hello,
>
>
>
> 2009/4/20 Gordan Bobic <gordan at bobich.net>
>
>> On Mon, 20 Apr 2009 18:18:22 +0200, ESGLinux <esggrupos at gmail.com> wrote:
>> > Hello,
>> >
>> > first, thanks for your answer,
>> >
>> > I suspected it but why can i do it with NFS. ?
>>
>> Not sure I understand your question. NFS is a network file system like
>> CIFS
>> specifically designed to be mounted from multiple clients simultaneously.
>> ext3 is only designed with a single accessor in mind.
>
>
> I?ll try to explain myself
>
> I have a partition /dev/sda
>
> /dev/sda on /iscsivol type ext3 (rw)
>
> but this partition is a target iscsi on another server. I format the
> partition with ext3 but its not a local disk, is a target iscsi.
>
> with this configuration I have the filesystem corrupted.
>
> second scenario
>
> I have
> 192.168.1.198:/nfsexport/                       6983168   2839168
> 3783552  43% /mnt
>
> but the parttion 192.168.1.198:/nfsexport/ is again ext3 the diference is
> that I use nfs as network protocol instead of iscsi.
>

(overly simplified explanation)
This is because with iSCSI you are mounting the filesystem directly as a
block device. Through NFS, the filesystem isn't accessed directly. The NFS
server handles all the actual I/O, so only one system is actually touching
the filesystem.


>
>> > the nodes never are going to be active at the same time so I can mount
>> the
>> > shares via NFS. With NFS when I create a file in a share automatically i
>> > got it in the shared mounted by all the clients.
>>
>> I still don't understand your question - that is what NFS is designed for.
>
>
> Yes I agree with you, but I thought with iscsi i can do the same as with
> NFS.
>

No, iSCSI and NFS are completely different. One is SCSI encapsulated in
Ethernet- the volumes are going to look local to the system. The other is a
conventional file sharing protocol.


>
>> > In this case I don?t need to write to the share concurrently
>> >
>> > can be this configuration a problem?
>>
>> No, it's fundamentally impossible. In order to have a FS that can be
>> mounted simultaneously from multiple nodes, it has to be aware of multiple
>> nodes accessing it, which means that it needs coherent caching. Local file
>> systems like ext3 don't have this. When one node writes to the ext3 file
>> system, the other node will have cached the inodes as they were
>> originally,
>> and it won't bother hitting the disk to re-read the contents, it'll just
>> return what it already has cached. And almost certainly corrupt the file
>> system in the process.
>>
>> You cannot have a shared ext3 volume with writing enabled. Period.
>
>
> ok understand it,
>
> but (always there is a but ...)
>
> I only want to share a directory in which one node writes at one and when
> it fails the other node has the diretory mounted with the data and can write
> to it.
>
> Before I have known about cluster my decission would been to mount the
> shares with NFS. Now I want to be more sofisticated and want to use cluster
> tools, so I thought to mount it with iSCSI instead of NFS, but always with
> the ext3 as the underlying filesystem.
>
> Perphaps this is my mistake.
>
> any suggestion that makes me see the light ;.)
>

If you want to have multiple systems access the same volume via iSCSI, you
need to use a filesystem that is multiple-node aware such as GFS.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/dee1fb8b/attachment.htm>

From gwood at dragonhold.org  Mon Apr 20 17:09:08 2009
From: gwood at dragonhold.org (Graham Wood)
Date: Mon, 20 Apr 2009 18:09:08 +0100
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
	<3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>
	<c63a83b07142b32f75eab466919321e3@localhost>
	<3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
Message-ID: <20090420170908.GA6748@dragonhold.org>

On Mon, Apr 20, 2009 at 06:56:23PM +0200, ESGLinux wrote:
> Yes I agree with you, but I thought with iscsi i can do the same as with
> NFS.
No. iSCSI is a way of remotely getting access to a block device.  NFS is a way of accessing 
a network filesystem.  They server completely different purposes, and run at different parts 
of the "stack" as well.

> any suggestion that makes me see the light ;.)
As has already been stated, you cannot do it.

You cannot mount ext3 on two machines at the same time, and write data.  If you want to 
write from one at a time, don't mount it on the other.  It's THAT simple.

There is NO other answer with ext3.  If you want to have it mounted twice your choices are:
- NFS
- GFS/OCFS2/etc.

The standard reason for wanting to avoid GFS and the like is the complexity - unfortunately 
the complexity is required because (as previously stated) of cache coherency.

As for your example of NFS, I can tell you from experience that it doesn't work as well as 
you seem to think - it is still quite possible for NFS to get things wrong, and give you the 
wrong data - however, the protocol has been designed with that in mind, and you get 
"sensible" errors when it detects things.

We've got NFS servers that are accessed by 1000+ clients - and creating a file on one can 
take a few minutes to appear on the others, however the way NFS works means that this is 
done "safely", and it will recover.  Ext3 has NO functionality to do this.


So, in summary, "No".

Graham


From jeff.sturm at eprize.com  Mon Apr 20 17:16:49 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Mon, 20 Apr 2009 13:16:49 -0400
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com><64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local><3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com><c63a83b07142b32f75eab466919321e3@localhost>
	<3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB8ED@hugo.eprize.local>

 
________________________________

	From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of ESGLinux
	Sent: Monday, April 20, 2009 12:56 PM
	To: linux clustering
	Subject: Re: [Linux-cluster] iscsi doubt
	
	I only want to share a directory in which one node writes at one
and when it fails the other node has the diretory mounted with the data
and can write to it.  

The ONLY safe way to do this with ext3 is to ensure the fs is mounted on
one node at a time.
 
That means:
 
- The failover node cannot have the ext3 fs mounted until a failover is
performed
- The failed node must not write to the fs after it fails (i.e. you
fence the node)
- Prior to failing back to the primary node, the failover node must
unmount the fs
 
I believe you can automate such a failover with rgmanager, but I have
not done so myself.

	 Before I have known about cluster my decission would been to
mount the shares with NFS. Now I want to be more sofisticated and want
to use cluster tools, so I thought to mount it with iSCSI instead of
NFS, but always with the ext3 as the underlying filesystem.  

 iSCSI isn't a cluster tool, it's a block-level storage protocol.  Using
iSCSI is neither necessary nor sufficient to implement a cluster.  You
can use other software such as DRBD to get the failover you describe.
 
Jeff
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/57b25ea5/attachment.htm>

From esggrupos at gmail.com  Mon Apr 20 17:19:16 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Mon, 20 Apr 2009 19:19:16 +0200
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <20090420170908.GA6748@dragonhold.org>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>
	<3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>
	<c63a83b07142b32f75eab466919321e3@localhost>
	<3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
	<20090420170908.GA6748@dragonhold.org>
Message-ID: <3128ba140904201019j2f1fcd4drf46c6c8e5b36b1f1@mail.gmail.com>

Thank you ALL


Finally the light came to me,

I thought that iscsi was a file sharing protocol and now I see  it isn?t.

I think I?m going to implement GFS on it, (its a good oportunity to learn
something new :-)

Thaks all again for your help

ESG


2009/4/20 Graham Wood <gwood at dragonhold.org>

> On Mon, Apr 20, 2009 at 06:56:23PM +0200, ESGLinux wrote:
> > Yes I agree with you, but I thought with iscsi i can do the same as with
> > NFS.
> No. iSCSI is a way of remotely getting access to a block device.  NFS is a
> way of accessing
> a network filesystem.  They server completely different purposes, and run
> at different parts
> of the "stack" as well.
>
> > any suggestion that makes me see the light ;.)
> As has already been stated, you cannot do it.
>
> You cannot mount ext3 on two machines at the same time, and write data.  If
> you want to
> write from one at a time, don't mount it on the other.  It's THAT simple.
>
> There is NO other answer with ext3.  If you want to have it mounted twice
> your choices are:
> - NFS
> - GFS/OCFS2/etc.
>
> The standard reason for wanting to avoid GFS and the like is the complexity
> - unfortunately
> the complexity is required because (as previously stated) of cache
> coherency.
>
> As for your example of NFS, I can tell you from experience that it doesn't
> work as well as
> you seem to think - it is still quite possible for NFS to get things wrong,
> and give you the
> wrong data - however, the protocol has been designed with that in mind, and
> you get
> "sensible" errors when it detects things.
>
> We've got NFS servers that are accessed by 1000+ clients - and creating a
> file on one can
> take a few minutes to appear on the others, however the way NFS works means
> that this is
> done "safely", and it will recover.  Ext3 has NO functionality to do this.
>
>
> So, in summary, "No".
>
> Graham
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/b5fef018/attachment.htm>

From gordan at bobich.net  Mon Apr 20 18:57:26 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Mon, 20 Apr 2009 19:57:26 +0100
Subject: [Linux-cluster] iscsi doubt
In-Reply-To: <3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
References: <3128ba140904200829v77cc9437h6ceec67e0378c84e@mail.gmail.com>	<64D0546C5EBBD147B75DE133D798665F02FDB8E4@hugo.eprize.local>	<3128ba140904200918q32ffb61al263fe02eb8b08bce@mail.gmail.com>	<c63a83b07142b32f75eab466919321e3@localhost>
	<3128ba140904200956g7df33851l8692ad7a3abf15a3@mail.gmail.com>
Message-ID: <49ECC596.205@bobich.net>

ESGLinux wrote:

>     On Mon, 20 Apr 2009 18:18:22 +0200, ESGLinux <esggrupos at gmail.com
>     <mailto:esggrupos at gmail.com>> wrote:
>      > Hello,
>      >
>      > first, thanks for your answer,
>      >
>      > I suspected it but why can i do it with NFS. ?
> 
>     Not sure I understand your question. NFS is a network file system
>     like CIFS
>     specifically designed to be mounted from multiple clients
>     simultaneously.
>     ext3 is only designed with a single accessor in mind.
> 
> 
> I?ll try to explain myself
> 
> I have a partition /dev/sda
> 
> /dev/sda on /iscsivol type ext3 (rw)
> 
> but this partition is a target iscsi on another server. I format the 
> partition with ext3 but its not a local disk, is a target iscsi.
> 
> with this configuration I have the filesystem corrupted.
> 
> second scenario
> 
> I have
> 192.168.1.198:/nfsexport/                       6983168   2839168   
> 3783552  43% /mnt
> 
> but the parttion 192.168.1.198:/nfsexport/ is again ext3 the diference 
> is that I use nfs as network protocol instead of iscsi.

You first need to understand how iSCSI works and what it is. It is a 
block device (like a physical hard disk, only virtualized). There is no 
arbitration of file access that NFS has, it is up to the file system 
layer to sort it out. ext3 (or any other non-shared non-clustered file 
system) cannot do this, because it wasn't designed to do it. What your 
iSCSI volume file is backed on is irrelevant. You can export a raw 
partition using iSCSI, it's all the same to it.

>      > the nodes never are going to be active at the same time so I can
>     mount
>     the
>      > shares via NFS. With NFS when I create a file in a share
>     automatically i
>      > got it in the shared mounted by all the clients.
> 
>     I still don't understand your question - that is what NFS is
>     designed for.
> 
> 
> Yes I agree with you, but I thought with iscsi i can do the same as with 
> NFS.

No. The two are about as different in concept as you can get.

>      > In this case I don?t need to write to the share concurrently
>      >
>      > can be this configuration a problem?
> 
>     No, it's fundamentally impossible. In order to have a FS that can be
>     mounted simultaneously from multiple nodes, it has to be aware of
>     multiple
>     nodes accessing it, which means that it needs coherent caching.
>     Local file
>     systems like ext3 don't have this. When one node writes to the ext3 file
>     system, the other node will have cached the inodes as they were
>     originally,
>     and it won't bother hitting the disk to re-read the contents, it'll just
>     return what it already has cached. And almost certainly corrupt the file
>     system in the process.
> 
>     You cannot have a shared ext3 volume with writing enabled. Period.
> 
> 
> ok understand it,
> 
> but (always there is a but ...)
> 
> I only want to share a directory in which one node writes at one and 
> when it fails the other node has the diretory mounted with the data and 
> can write to it.

You can't share it simultaneously. You can have it fail over - primary 
node gets fenced (powered off), and the secondary can then mount the FS 
on it's own. But they can't both mount it at the same time. That will 
instantly trash your file system.

> Before I have known about cluster my decission would been to mount the 
> shares with NFS. Now I want to be more sofisticated and want to use 
> cluster tools, so I thought to mount it with iSCSI instead of NFS, but 
> always with the ext3 as the underlying filesystem.

If you want shared storage, you can export NFS from a single node 
(backed with whatever you want, including ext3) or use GFS on top of iSCSI.

If you are doing this for redundancy, ask yourself what the point is in 
shifting the single point of failure around. If you don't have mirrored 
SANs (and even most of the high end SANs with a 6-figure price tag can't 
handle that feature), you might want to consider something like 
DRBD+GFS. If you just want performance and redundancy is less relevant, 
you'll probably find that NFS beats any other solution out there for use 
cases involding lots of small files and lots of rear-write load.

Gordan


From golharam at umdnj.edu  Mon Apr 20 19:03:56 2009
From: golharam at umdnj.edu (Ryan Golhar)
Date: Mon, 20 Apr 2009 15:03:56 -0400
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>
References: <200904201549.n3KFndL0008930@mx3.redhat.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>
Message-ID: <49ECC71C.2000309@umdnj.edu>

This brings up an interesting question for me....We can 6 machines that 
host a bunch of virtual machines.  I'd like to put the virtual machines 
on a shared SAN disk.  If one of the physical machines goes down, 
another one will take over and host a virtual machine.

Does it make sense to use GFS to manage the SAN then?  IF the 4x 
slowdown is there, then this may not be the way to go.

Jeff Sturm wrote:
>> -----Original Message-----
>> From: linux-cluster-bounces at redhat.com 
>> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash 
>> Khatuwala
>> Sent: Monday, April 20, 2009 11:23 AM
>> To: linux-cluster at redhat.com
>> Subject: [Linux-cluster] GFS performance.
>>
>> OS : CentOS 5.2
>> FS : GFS
> 
> Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to have some
> performance improvements over GFS1.
> 
>> Now I need to make a decision to go with GFS or not, clearly 
>> at 4 times less performance we cannot afford it, also it 
>> doesn't sound right so would like to find out whats wrong.
> 
> Be careful with benchmarks, as they often do not give you a good
> indication of real-world performance.
> 
> Are you more concerned with latency or throughput?  Any single read will
> almost certainly take longer to complete over GFS than EXT3.  There's
> simply more overhead involved with any cluster filesystem.  However,
> that's not to say you're limited as to how many reads you can execute in
> parallel.  So the overall number of reads you can perform in a given
> time interval may not be 4x at all (are you running a parallel
> benchmark?)
> 
> Jeff
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From golharam at umdnj.edu  Mon Apr 20 19:15:06 2009
From: golharam at umdnj.edu (Ryan Golhar)
Date: Mon, 20 Apr 2009 15:15:06 -0400
Subject: [Linux-cluster] How to extract an rpm for use on another server?
In-Reply-To: <60f08e700904182103i5cb1ae64me74915fcc9aca142@mail.gmail.com>
References: <60f08e700904182103i5cb1ae64me74915fcc9aca142@mail.gmail.com>
Message-ID: <49ECC9BA.9090102@umdnj.edu>

If you don't have the rpm, you'll have to do a 'rpm -qf ...' on every 
file on your system to determine which one belong to VMware.  Try this 
perl script:

#!/usr/bin/perl

foreach $file (`find / -type f`) {
         $_ = `rpm -qf $file`;
         if ($_ =~ m/VMware/) {
                 print "$file ... $_\n";
         }
}


sunhux G wrote:
>  Hi,
> 
> 
> I've just installed an rpm ( VMWare-esx-vmx-3.5.0-158869.i386.rpm )
> and this caused the entire ESX/VMWare to panic/crash whenever a
> VM in this VMWare is started.
> 
> I've just done "rpm -e --nodeps -allmatches VMware-esx-vms-3.5.-158869)
> and would like to put back the old rpm 64607 :
> 
>    VMWare-esx-3.5.0-64607
> 
> 
> However, I could not find 64607 from anywhere (requires a code), so I'll
> need to extract this 64607 rpm from my other ESX servers : is this possible
> & what's the command.
> 
> If it's not possible to build the 64607 rpm from an existing ESX which has
> not been patched yet, appreciate if someone can point me to where I could
> download it
> 
> 
> Tks
> U
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From gordan at bobich.net  Mon Apr 20 19:22:44 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Mon, 20 Apr 2009 20:22:44 +0100
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <49ECC71C.2000309@umdnj.edu>
References: <200904201549.n3KFndL0008930@mx3.redhat.com>	<64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>
	<49ECC71C.2000309@umdnj.edu>
Message-ID: <49ECCB84.3090304@bobich.net>

Ryan Golhar wrote:
> This brings up an interesting question for me....We can 6 machines that 
> host a bunch of virtual machines.  I'd like to put the virtual machines 
> on a shared SAN disk.  If one of the physical machines goes down, 
> another one will take over and host a virtual machine.
> 
> Does it make sense to use GFS to manage the SAN then?  IF the 4x 
> slowdown is there, then this may not be the way to go.

I would expect the performance on few large files (VM disk images) to 
suffer much, much less than this. You have quite a few chices for this 
sort of thing (GFS/GFS2, OCFS/OCFS2, VMFS), but generally, performance 
between them is pretty similar. And on VMs you'll be so virtual I/O 
bound that the speed of the underlying storage won't make any difference 
anyway.

Gordan


From luis.cerezo at pgs.com  Mon Apr 20 19:25:41 2009
From: luis.cerezo at pgs.com (Luis Cerezo)
Date: Mon, 20 Apr 2009 14:25:41 -0500
Subject: [Linux-cluster] How to extract an rpm for use on another server?
In-Reply-To: <49ECC9BA.9090102@umdnj.edu>
References: <60f08e700904182103i5cb1ae64me74915fcc9aca142@mail.gmail.com>
	<49ECC9BA.9090102@umdnj.edu>
Message-ID: <1240255541.29777.6.camel@hou58375.onshore.pgs.com>

rpm -qlf somefile will list all the files owned by a package too. You
don't need to do it on every file.


[luis.cerezo at houts38 ~]$ rpm -qlf /etc/redhat-release 
/etc/issue
/etc/issue.net
/etc/pki/rpm-gpg
/etc/pki/rpm-gpg/RPM-GPG-KEY-redhat-auxiliary
/etc/pki/rpm-gpg/RPM-GPG-KEY-redhat-beta
/etc/pki/rpm-gpg/RPM-GPG-KEY-redhat-former
/etc/pki/rpm-gpg/RPM-GPG-KEY-redhat-release
/etc/pki/rpm-gpg/RPM-GPG-KEY-redhat-rhx
/etc/redhat-release
/etc/yum.repos.d/rhel-debuginfo.repo
/usr/share/doc/redhat-release-5Server
/usr/share/doc/redhat-release-5Server/EULA
*****SNIP****

-----Original Message-----
From: Ryan Golhar <golharam at umdnj.edu>
Reply-To: golharam at umdnj.edu, linux clustering
<linux-cluster at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] How to extract an rpm for use on another
server?
Date: Mon, 20 Apr 2009 15:15:06 -0400


If you don't have the rpm, you'll have to do a 'rpm -qf ...' on every 
file on your system to determine which one belong to VMware.  Try this 
perl script:

#!/usr/bin/perl

foreach $file (`find / -type f`) {
         $_ = `rpm -qf $file`;
         if ($_ =~ m/VMware/) {
                 print "$file ... $_\n";
         }
}


sunhux G wrote:
>  Hi,
> 
> 
> I've just installed an rpm ( VMWare-esx-vmx-3.5.0-158869.i386.rpm )
> and this caused the entire ESX/VMWare to panic/crash whenever a
> VM in this VMWare is started.
> 
> I've just done "rpm -e --nodeps -allmatches VMware-esx-vms-3.5.-158869)
> and would like to put back the old rpm 64607 :
> 
>    VMWare-esx-3.5.0-64607
> 
> 
> However, I could not find 64607 from anywhere (requires a code), so I'll
> need to extract this 64607 rpm from my other ESX servers : is this possible
> & what's the command.
> 
> If it's not possible to build the 64607 rpm from an existing ESX which has
> not been patched yet, appreciate if someone can point me to where I could
> download it
> 
> 
> Tks
> U
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

Luis E. Cerezo
PGS Global Shared Services - IT
281.509.8397
***********


This email and any files contained therein is confidential and may contain privileged information.  If you are not the named addressee(s) or you have otherwise received this in error, you should not distribute or copy this e-mail or use any of its content for any purpose. Please notify the sender immediately by e-mail if you have received this e-mail in error and delete it from your system
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090420/87729ef2/attachment.htm>

From jeff.sturm at eprize.com  Mon Apr 20 19:31:34 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Mon, 20 Apr 2009 15:31:34 -0400
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <49ECC71C.2000309@umdnj.edu>
References: <200904201549.n3KFndL0008930@mx3.redhat.com><64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>
	<49ECC71C.2000309@umdnj.edu>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB904@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ryan Golhar
> Sent: Monday, April 20, 2009 3:04 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS performance.
> 
> This brings up an interesting question for me....We can 6 
> machines that host a bunch of virtual machines.  I'd like to 
> put the virtual machines on a shared SAN disk.  If one of the 
> physical machines goes down, another one will take over and 
> host a virtual machine.

Nice.  If your virtualization is based on Xen, you can do live
migrations over shared storage.  I've done it and it works well.

> Does it make sense to use GFS to manage the SAN then?

Perhaps I misunderstand, but I don't think there's a need for GFS unless
you also have shared filesystems.

In our setup each virtual machine has a (dedicated) root filesystem on a
SAN.  They are all formatted with ext3, because they are not shared.
(We also have shared non-root volumes that are GFS.)

The decisions to virtualize your infrastructure with Xen (or equivalent)
and to share your filesystem storage with GFS (or equivalent) are
orthogonal considerations.  You can successfully implement either, or
both.  However, shared storage is useful to have either way.

Jeff


From criley at erad.com  Mon Apr 20 19:40:33 2009
From: criley at erad.com (Charles Riley)
Date: Mon, 20 Apr 2009 15:40:33 -0400
Subject: [Linux-cluster] RHEL4 - loss of iscsi connectivity causes rgmanager
	crash
Message-ID: <49ECCFB1.3060701@erad.com>

Hi all,

With the "iscsi doubt" thread in mind, I thought I'd share an experience
I've had twice now with
iscsi and RHEL 4 cluster manager.

What happens is that an iscsi filesystem which is part of a resource
group will become unavailable
(in dmesg, you see iscsid lose connection, then attempt to reconnect
over and over). 
However, rgmanager does not seem to detect that the filesystem has
disappeared, even though the
filesystem is configured in the resource group using the built in "fs"
resource agent.
When I try to fail the resource group over to another node, rgmanager
gets all out of whack and starts
reporting bogus information.  During the most recent failure,  rgmanager
crashed on all but two of six
total nodes.  On the two nodes where it was still running, resource
groups showed as starting, stopping,
or running on nodes that I'd manually fenced five minutes before.  I
ended up rebooting all of the servers
and bringing them up clean.

I also found that the rest of the resource group will start even if
iscsid is not running.
Which is really weird since all of the rest of the resource group are
attached to the iscsi filesystem.
e.g.  all of the other resource agents/scripts are nested/indented
within the "fs" block in cluster.conf.
If I understand correctly, that shouldn't be able to happen.

I'm going to try a few things:
Setting "Continuous=no" for all the iscsi targets in iscsi.conf
(disables continuous discovery)
Setting self_fence=1 in cluster.conf
Setting the recovery policy to "relocate"

Any recommendations from the experts?

I have a support ticket open with Redhat, but they are still combing
through six nodes worth of sosreport files.

Cheers

-- 
Charles Riley


From criley at erad.com  Mon Apr 20 19:48:22 2009
From: criley at erad.com (Charles Riley)
Date: Mon, 20 Apr 2009 15:48:22 -0400
Subject: [Linux-cluster] Failback option on failover domains
In-Reply-To: <8786b91c0904152108j2558e3a6oe05b7e79b3741939@mail.gmail.com>
References: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
	<8786b91c0904152108j2558e3a6oe05b7e79b3741939@mail.gmail.com>
Message-ID: <49ECD186.2040503@erad.com>

If you configure your failover domain as "ordered", it will do what  you
want in rhel4.
I have several clusters running in that configuration now.

Charles

Rajagopal Swaminathan wrote:
> Greetings,
>
> On Wed, Apr 15, 2009 at 10:42 PM, Pavlos Parissis
> <pavlos.parissis at gmail.com> wrote:
>   
>> Hello,
>>
>> Does anyone know if the failback option is available on RedHat 4.X systems?
>>
>> I went through all docs [1] and it seams to be that is only available on
>> RedHat 5.X version,
>> am I right?
>>     
>
>
> Please note that the RHCS is very different for 4.x and 5.x
>
> Rajagopal
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   


-- 
Charles Riley


From arwin.tugade at csun.edu  Mon Apr 20 20:21:15 2009
From: arwin.tugade at csun.edu (Arwin L Tugade)
Date: Mon, 20 Apr 2009 13:21:15 -0700
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E51BAA.4090402@umdnj.edu>
References: <49E4B728.8020906@umdnj.edu> <20090414175034.GE17186@redhat.com>
	<49E50583.6060202@bobich.net> <1239747724.29777.8.camel@sdake-laptop>
	<49E51BAA.4090402@umdnj.edu>
Message-ID: <6708F96BBF31F846BFA56EC0AE37D62281E8B38EEC@CSUN-EX-V01.csun.edu>

Ryan,

Are you talking about when you try to do "/sbin/service rgmanger stop", it will hang there indefinitely?

Check out this bug report:

https://bugzilla.redhat.com/show_bug.cgi?id=485026

Arwin

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ryan Golhar
Sent: Tuesday, April 14, 2009 4:27 PM
To: linux clustering
Subject: Re: [Linux-cluster] really reliable?

No no no, I haven't modified any of the init files...these are out of 
the box...

Steven Dake wrote:
> On Tue, 2009-04-14 at 22:52 +0100, Gordan Bobic wrote:
>> David Teigland wrote:
>>
>>>> Even when I try to reboot the nodes, I can't because the whole system 
>>>> hangs on various processes that don't ever shut down.  I have to 
>>>> physically reboot these boxes.
>>> If something has gone wrong, it's often impossible to shutdown without a hard
>>> reboot.  Even when things are working, rebooting can be a delicate task
>>> because the system may easily be configured to stop things in the wrong order,
>>> and one thing out of place can cause a wreck.
>> If things don't come down in the correct order using the standard init 
>> scripts, you should file a bug report about this. I've never seen it 
>> happen on any of my clusters.
>>
> 
> I believe he means the administrator makes a configuration change to the
> init scripts which result in a shutdown not working.  RHCS amplifies
> this problem somewhat so more care must be taken in these situations.
> 
> regards
> -steve
> 
>> Gordan
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From arwin.tugade at csun.edu  Mon Apr 20 20:55:35 2009
From: arwin.tugade at csun.edu (Arwin L Tugade)
Date: Mon, 20 Apr 2009 13:55:35 -0700
Subject: [Linux-cluster] really reliable?
In-Reply-To: <49E5357B.8050503@sivell.com>
References: <49E4B728.8020906@umdnj.edu><6296bab6d0ecbbe92334969f026dc9f8@localhost>
	<49E4F9A4.4060306@umdnj.edu> <49E5357B.8050503@sivell.com>
Message-ID: <6708F96BBF31F846BFA56EC0AE37D62281E8B38EEE@CSUN-EX-V01.csun.edu>

How long is long when waiting for the fencing part to complete?  For you guys, what is the normal amount of time that it takes for "Starting fencing..." to complete?  For me, it takes anywhere from 30-45 seconds to complete.  Would this be because I'm on Cisco switches?  If this is normal then I'm just going to leave it be because it does complete and my cluster forms just fine, it just sits there with these messages repeated for 30-45 seconds:

Apr 16 21:01:47 oilfish openais[4648]: [TOTEM] entering GATHER state from 11. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] entering GATHER state from 0. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] Creating commit token because I am the rep. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] Saving state aru 37 high seq received 37 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] Storing new sequence id for ring 13c 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] entering COMMIT state. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] entering RECOVERY state. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] position [0] member 172.31.37.2: 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] previous ring seq 312 rep 172.31.37.2 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] aru 37 high delivered 37 received flag 1 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] Did not need to originate any messages in recover
y. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] Sending initial ORF token 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] CLM CONFIGURATION CHANGE 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] New Configuration: 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] 	r(0) ip(172.31.37.2)  
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] Members Left: 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] Members Joined: 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] CLM CONFIGURATION CHANGE 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] New Configuration: 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] 	r(0) ip(172.31.37.2)  
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] Members Left: 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] Members Joined: 
Apr 16 21:01:52 oilfish openais[4648]: [SYNC ] This node is within the primary component and wil
l provide service. 
Apr 16 21:01:52 oilfish openais[4648]: [TOTEM] entering OPERATIONAL state. 
Apr 16 21:01:52 oilfish openais[4648]: [CLM  ] got nodejoin message 172.31.37.2 
Apr 16 21:01:52 oilfish openais[4648]: [CPG  ] got joinlist message from node 1

Then eventually:

Apr 16 21:02:19 oilfish openais[4648]: [TOTEM] entering OPERATIONAL state. 
Apr 16 21:02:19 oilfish openais[4648]: [CLM  ] got nodejoin message 172.31.37.2 
Apr 16 21:02:19 oilfish openais[4648]: [CLM  ] got nodejoin message 172.31.37.4 
Apr 16 21:02:19 oilfish openais[4648]: [CPG  ] got joinlist message from node 2 
Apr 16 21:02:19 oilfish openais[4648]: [CPG  ] got joinlist message from node 1 
Apr 16 21:02:23 oilfish kernel: dlm: connecting to 2
Apr 16 21:02:23 oilfish kernel: dlm: got connection from 2


-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vu Pham
Sent: Tuesday, April 14, 2009 6:17 PM
To: golharam at umdnj.edu; linux clustering
Subject: Re: [Linux-cluster] really reliable?

Ryan Golhar wrote:
> I'm running RHEL 5.3 64-bit.  So far, I only want to see that the
> cluster can run.  I'll worry about getting GFS after I'm confident this
> works.
> 
> I've got three nodes: pico, vail, and whistler.  They each have two NIC
> cards, one that provides a public IP address, and another that provides
> private communications.  All cluster traffic will go over the private
> network, 192.168.20.0.
> 
> I've installed only the following components:
> system-config-cluster-1.0.52-1.1, cman-2.0.98-1, and rgmanager-2.0.38-2.
> 
> I've created my cluster.conf file to include these three nodees and
> fence them using a brocade fibre switch (for GFS).
> 
> When I start the cluster services on all 3 nodes using the manually 
> method of:
> 
> /sbin/ccsd; /usr/sbin/cman_tool join
> 
> The nodes successfully form a cluster.  I am able to leave the cluster 
> and kill ccsd as well.
> 
> If I try to start the cman service I see:
> 
> [root at pico cluster]# /sbin/service cman start
> Starting cluster:
>    Loading modules... done
>    Mounting configfs... done
>    Starting ccsd... done
>    Starting cman... done
>    Starting daemons... done
>    Starting fencing...
> 
> 
> And it just hangs.  I know my fencing is set up correctly because I've 
> had nodes fence other nodes before (when I was trying with 6 members). 
> If I let it sit for long enough sometimes it finishes successfully.  I'm 
> not sure what its doing because fence_tool is called and its a binary...
> 

Ryan,

Anything suspicious in the log when it hangs at fencing ?
Could you show your cluster.conf ?

Vu

> Ryan
> 
> 
> Gordan Bobic wrote:
>> What distro are you using? I've found that:
>>
>> 1) Distros other than RHEL/CentOS can be quirky when it comes to using
>> RHCS. I've even run into problems on Fedora more than once (not to 
>> mention
>> that FC hasn't shipped GFS1 since FC5 and GFS2 hasn't been deemed
>> production stable until last month - and we're up to FC10 now).
>>
>> 2) Starting RHCS components using anything except the intended init 
>> scripts
>> tends to cause problems.
>>
>> 3) Source of 99% of problems in the rest of the cases (i.e. not 
>> covered by
>> 1) and 2) above) is incorrectly configured fencing.
>>
>> Does your setup fall under either of the first two categories?
>> Have you verified beyond doubt that your fencing is configured correctly
>> and that the fencing script gets verification upon success?
>>
>> Gordan
>>
>> On Tue, 14 Apr 2009 12:17:44 -0400, Ryan Golhar <golharam at umdnj.edu> 
>> wrote:
>>> Hi all,
>>>
>>> Is redhat cluster suite really reliable?  I've been having so much 
>>> trouble getting a cluster up and running, I'm beginning to second 
>>> guess my decision to use this software stack.
>>>
>>> I have 3 nodes (eventually 10) running and set up.  The fencing 
>>> method is by a brocade fibre switch.  The ultimate goal of this 
>>> cluster is to shared a SAN connected by fibre.
>>>
>>> I've installed just the bare minimum (before even getting to GFS) to 
>>> test the cluster software.  Just starting cman cluster services fails 
>>> on two of the nodes.
>>>
>>> Even when I try to reboot the nodes, I can't because the whole system 
>>> hangs on various processes that don't ever shut down.  I have to 
>>> physically reboot these boxes.
>>>
>>> The logs fill up with errors about not being able to connect to cman,
>> etc.
>>> I've been at it for awhile now and am not sure this is the best route 
>>> anymore.
>>>
>>> Ryan
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From vikash at netvigator.com  Tue Apr 21 03:18:21 2009
From: vikash at netvigator.com (Vikash Khatuwala)
Date: Tue, 21 Apr 2009 11:18:21 +0800
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local
 >
References: <200904201549.n3KFndL0008930@mx3.redhat.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>
Message-ID: <200904210318.n3L3ISxb009791@mx3.redhat.com>

Hi,

I am using Virtuozzo OS visualization which does not have a single 
file for the entire VM's filesystem. All VMs are simply 
sub-directories and OS files are stored in a common templates 
directory which is sym linked to back to the VM's directory, so if an 
OS file is changed inside the VM then the symlink breaks and a new 
file is put in the VM's private directory. I cant use GFS2 because it 
is not supported by Virtuozzo. All VMs are simply running web/db/ftp.

So this basically means that there are a lot of symbolic links (small 
files). The GFS has a block size of 4K so I also chose 4K as my block 
size for my performance testing to asses the worst case scenario. If 
I change the block size to 256K then the performance difference 
between ext3 and GFS are minimal. Also when I migrate the VM out from 
GFS(RAID5 SAS 15K) to ext3(single disk SATA), there is a significant 
noticeable performance gain!

Below tests are on the same disk set (5 disk RAID5 SAS 15K) with 2 
partitions, GFS and ext3.
Results at 4K random reads:
GFS : about 1500K/s
ext3 : about 7000K/s

Results at 256K random reads:
GFS : about 45000K/s
ext3 : about 50000K/s

Results at 256K sequential reads:
GFS : over 110,000K/s (my single GB NIC maxes out)
ext3 : over 110,000K/s (my single GB NIC maxes out)

fio test file as below only rw and blocksize were changed for the 3 
different scenarios above.
[random-read1]
rw=randread
size=10240m
directory=/vz/tmp
ioengine=libaio
iodepth=16
direct=1
invalidate=1
blocksize=4k

[random-read2]
rw=randread
size=10240m
directory=/vz/tmp
ioengine=libaio
iodepth=16
direct=1
invalidate=1
blocksize=4k

Thanks,
Vikash.


At 01:00 AM 21-04-09, Jeff Sturm wrote:
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash
> > Khatuwala
> > Sent: Monday, April 20, 2009 11:23 AM
> > To: linux-cluster at redhat.com
> > Subject: [Linux-cluster] GFS performance.
> >
> > OS : CentOS 5.2
> > FS : GFS
>
>Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to have some
>performance improvements over GFS1.
>
> > Now I need to make a decision to go with GFS or not, clearly
> > at 4 times less performance we cannot afford it, also it
> > doesn't sound right so would like to find out whats wrong.
>
>Be careful with benchmarks, as they often do not give you a good
>indication of real-world performance.
>
>Are you more concerned with latency or throughput?  Any single read will
>almost certainly take longer to complete over GFS than EXT3.  There's
>simply more overhead involved with any cluster filesystem.  However,
>that's not to say you're limited as to how many reads you can execute in
>parallel.  So the overall number of reads you can perform in a given
>time interval may not be 4x at all (are you running a parallel
>benchmark?)
>
>Jeff
>
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster


From p_pavlos at freemail.gr  Tue Apr 21 03:48:38 2009
From: p_pavlos at freemail.gr (Pavlos Parissis)
Date: Tue, 21 Apr 2009 05:48:38 +0200
Subject: [Linux-cluster] Failback option on failover domains
In-Reply-To: <49ECD186.2040503@erad.com>
References: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
	<8786b91c0904152108j2558e3a6oe05b7e79b3741939@mail.gmail.com>
	<49ECD186.2040503@erad.com>
Message-ID: <fb143d260904202048s6435738bu1037fcd02aa68dc3@mail.gmail.com>

Hi Charles,

My failover domains are configured with ordered [1] and a service will
automatically fail back
on the primary node when that node is alive again.

Cheers,
Pavlos


[1]

<failoverdomains>
      <failoverdomain restricted="1" ordered="1" name="FirstDomain">
        <failoverdomainnode priority="0" name="node1-cluster"/>
        <failoverdomainnode priority="1" name="node3-cluster"/>
      </failoverdomain>
      <failoverdomain restricted="1" ordered="1" name="SecondDomain">
        <failoverdomainnode priority="0" name="node2-cluster"/>
        <failoverdomainnode priority="1" name="node3-cluster"/>
      </failoverdomain>
      <failoverdomain restricted="1" ordered="1" name="ThirdDomain">
        <failoverdomainnode priority="0" name="node3-cluster"/>
        <failoverdomainnode priority="1" name="node1-cluster"/>
        <failoverdomainnode priority="1" name="node2-cluster"/>
      </failoverdomain>


2009/4/20 Charles Riley <criley at erad.com>

> If you configure your failover domain as "ordered", it will do what  you
> want in rhel4.
> I have several clusters running in that configuration now.
>
> Charles
>
> Rajagopal Swaminathan wrote:
> > Greetings,
> >
> > On Wed, Apr 15, 2009 at 10:42 PM, Pavlos Parissis
> > <pavlos.parissis at gmail.com> wrote:
> >
> >> Hello,
> >>
> >> Does anyone know if the failback option is available on RedHat 4.X
> systems?
> >>
> >> I went through all docs [1] and it seams to be that is only available on
> >> RedHat 5.X version,
> >> am I right?
> >>
> >
> >
> > Please note that the RHCS is very different for 4.x and 5.x
> >
> > Rajagopal
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
> --
> Charles Riley
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090421/9e388dd6/attachment.htm>

From mrugeshkarnik at gmail.com  Tue Apr 21 03:51:11 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Tue, 21 Apr 2009 09:21:11 +0530
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F02FDB904@hugo.eprize.local>
References: <200904201549.n3KFndL0008930@mx3.redhat.com>
	<49ECC71C.2000309@umdnj.edu>
	<64D0546C5EBBD147B75DE133D798665F02FDB904@hugo.eprize.local>
Message-ID: <200904210921.11701.mrugeshkarnik@gmail.com>

On Tuesday 21 Apr 2009 01:01:34 Jeff Sturm wrote:
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ryan Golhar
> > Sent: Monday, April 20, 2009 3:04 PM
> > To: linux clustering
> > Subject: Re: [Linux-cluster] GFS performance.
> >
> > This brings up an interesting question for me....We can 6
> > machines that host a bunch of virtual machines.  I'd like to
> > put the virtual machines on a shared SAN disk.  If one of the
> > physical machines goes down, another one will take over and
> > host a virtual machine.

I've described my setup for the same below.

> Nice.  If your virtualization is based on Xen, you can do live
> migrations over shared storage.  I've done it and it works well.

+1

> > Does it make sense to use GFS to manage the SAN then?
>
> Perhaps I misunderstand, but I don't think there's a need for GFS unless
> you also have shared filesystems.
>
> In our setup each virtual machine has a (dedicated) root filesystem on a
> SAN.  They are all formatted with ext3, because they are not shared.
> (We also have shared non-root volumes that are GFS.)

I use clvm volumes as xvd? devices used with Xen domUs and format them with 
standard ext3. GFS2 is used for sharing the xen configuration files between 
the dom0 nodes so that rgmanager can be used to look after the domUs.

So, basically there are two clusters, the dom0 cluster and the domU cluster. 
In the dom0 cluster, I'm running clvmd, fence_xvmd, gfs2 and rgmanager. The 
domU cluster only runs fence_xvm.

Each dom0 node is an unrestricted, unordered <failoverdomainnode/>. Each 
<vm/> is treated directly as a service under <rm/> in cluster.conf. The domain 
argument to each <vm/> specifies which node to run the domU on, by default. 
Also, be sure to specify migrate="live" to enable Live Migration of the domUs. 
The path argument to <vm/> specifies the path to the domU config files. In my 
setup, they reside on the GFS2 mount point.

The setup works pretty well. The only issue is that when the failed dom0 comes 
back up, the domUs associated with it are automatically migrated back. Its not 
a problem for me as of yet, but I'd still like to know how I can override this 
behaviour, if I want to.

Btw, if you do setup things this way, make sure you use the following in 
/etc/sysconfig/cluster:

RGMGR_OPTS="-N"

This'll prevent rgmanager from stopping the services before starting them.

Mrugesh


From erkrishna at gmail.com  Tue Apr 21 04:52:13 2009
From: erkrishna at gmail.com (er krishna)
Date: Tue, 21 Apr 2009 10:22:13 +0530
Subject: [Linux-cluster] GFS + NFS FAILOVER IN CLUSTERING
Message-ID: <80321e0e0904202152v49d79124o469ed0be1c018438@mail.gmail.com>

Dear All,

I am trying to setup NFS failover and its recovery via GFS (using CMAN)
tool. Of course, I have created a gfs file system over my logical volume (
basically its on exported block devices ). I have three nodes centos,
centos1 and centos2. I am attaching my cluster.conf file for further
reference. Everything seems fine, but I am not able to migrate my NFS
services when I poweroff my first node. Anybody has any idea about it ?


Objective of experiment : (Please tell weather its possible to achieve)

 Suppose I have my nfstest directory exported on network via nfs from the
node1 ( here centos ) and it is mounted on node3 (centos2 machine). There is
the similar directory (nfstest) available on node two (centos1 machine ). Is
it possible that if node1 get crashed then the nfstest directory from
centos1 machine (node2) , will be exported on the network and will be
automatically mounted on node3 (centos2 ). That i was trying to achieve.
Please tell if its possible to achieve this result through this experiment.


Thanks & Best Regards,
Krishna
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090421/6d38d14a/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: application/octet-stream
Size: 1752 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090421/6d38d14a/attachment.obj>

From ralphzukeb at gmail.com  Tue Apr 21 07:02:22 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Tue, 21 Apr 2009 08:02:22 +0100
Subject: [Linux-cluster] Ethernet devices and spofs
Message-ID: <6e4a03690904210002sbcfe460l4b5e6987479d893b@mail.gmail.com>

Hello!

I have multicast running over eth2, with this device configured for
communication in my cluster.conf file. I've seen that this only works
then the node resolves to the ip address on eth2. Fair enough.

This raises a question:
I would like to use eth0 for multicast in the case that eth2 dies.

What is the best way to approach this?

Thanks

Ralph


From robejrm at gmail.com  Tue Apr 21 07:45:31 2009
From: robejrm at gmail.com (Juan Ramon Martin Blanco)
Date: Tue, 21 Apr 2009 09:45:31 +0200
Subject: [Linux-cluster] Ethernet devices and spofs
In-Reply-To: <6e4a03690904210002sbcfe460l4b5e6987479d893b@mail.gmail.com>
References: <6e4a03690904210002sbcfe460l4b5e6987479d893b@mail.gmail.com>
Message-ID: <8a5668960904210045r2a9e0553p562441b23278d8e3@mail.gmail.com>

On Tue, Apr 21, 2009 at 9:02 AM, Ralph Zukeb <ralphzukeb at gmail.com> wrote:

> Hello!
>
> I have multicast running over eth2, with this device configured for
> communication in my cluster.conf file. I've seen that this only works
> then the node resolves to the ip address on eth2. Fair enough.


> This raises a question:
> I would like to use eth0 for multicast in the case that eth2 dies.
>
Consider configuring eth0 and eth2 as a bonding device bond0.
Greetings,
Juanra


>
> What is the best way to approach this?
>
> Thanks
>
> Ralph
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090421/cf554c1e/attachment.htm>

From ralphzukeb at gmail.com  Tue Apr 21 08:10:13 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Tue, 21 Apr 2009 09:10:13 +0100
Subject: [Linux-cluster] Ethernet devices and spofs
In-Reply-To: <8a5668960904210045r2a9e0553p562441b23278d8e3@mail.gmail.com>
References: <6e4a03690904210002sbcfe460l4b5e6987479d893b@mail.gmail.com>
	<8a5668960904210045r2a9e0553p562441b23278d8e3@mail.gmail.com>
Message-ID: <6e4a03690904210110n6160657fl9b5bed352ba3e285@mail.gmail.com>

2009/4/21 Juan Ramon Martin Blanco <robejrm at gmail.com>
>
>
> On Tue, Apr 21, 2009 at 9:02 AM, Ralph Zukeb <ralphzukeb at gmail.com> wrote:
>>
>> Hello!
>>
>> I have multicast running over eth2, with this device configured for
>> communication in my cluster.conf file. I've seen that this only works
>> then the node resolves to the ip address on eth2. Fair enough.
>>
>> This raises a question:
>> I would like to use eth0 for multicast in the case that eth2 dies.
>
> Consider configuring eth0 and eth2 as a bonding device bond0.

Thanks for this.

My problem is that I have two separate networks on eth0 and eth2, with
different ip ranges...

> Greetings,
> Juanra
>
>>
>> What is the best way to approach this?
>>
>> Thanks
>>
>> Ralph
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From moya at latertulia.org  Wed Apr 22 05:40:58 2009
From: moya at latertulia.org (Maykel Moya)
Date: Wed, 22 Apr 2009 01:40:58 -0400
Subject: [Linux-cluster] Service not relocated after successful fence of its
	owner
Message-ID: <1240378858.15385.9.camel@localhost>

I still can't get my service automatically relocated after
_successfully_ fencing its owner node.

I have a 4 node cluster n{1,2,3,4} and 4 services s{1,2,3,4}. My fence
device uses 'off' as action, so a successful fence means the node is
off.

Say, s4 running on n4 and I do a 'ip link set eth0 down' on n4. n4 get
successfully fenced but s4 is never relocated to one of the other
available nodes which means s4 is not available.

Find attached the cluster.conf.

Please help...

Regards,
maykel

-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: application/xml
Size: 3743 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/3d6e6aff/attachment.wsdl>

From esggrupos at gmail.com  Wed Apr 22 07:04:16 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Wed, 22 Apr 2009 09:04:16 +0200
Subject: [Linux-cluster] fence method for vmware images
Message-ID: <3128ba140904220004l729dc944lbeaee06a0130df41@mail.gmail.com>

Hello all,

I?m testing a two node cluster using  a VMWare Server with two images
running the nodes.

I have seen that there is a fence method for virtual machines (I suspect
that only for Xen images, but I?m not sure).

So, my question is which is the best fence method for this scenario (I have
not fence hardware at all)

Thanks

ESG
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/d3703d6f/attachment.htm>

From ralphzukeb at gmail.com  Wed Apr 22 07:11:40 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 08:11:40 +0100
Subject: [Linux-cluster] Custom fence agents
Message-ID: <6e4a03690904220011l2748f214l5c818ed7745b0342@mail.gmail.com>

Greetings,

What is the recommended way of adding custom fence agents to a cluster
setup, so that changes will not be lost on a "yum upgrade"?

Ralph


From jfriesse at redhat.com  Wed Apr 22 08:00:03 2009
From: jfriesse at redhat.com (Jan Friesse)
Date: Wed, 22 Apr 2009 10:00:03 +0200
Subject: [Linux-cluster] Custom fence agents
In-Reply-To: <6e4a03690904220011l2748f214l5c818ed7745b0342@mail.gmail.com>
References: <6e4a03690904220011l2748f214l5c818ed7745b0342@mail.gmail.com>
Message-ID: <49EECE83.1010408@redhat.com>

Ralph,
fence agents are just files which lives in /sbin. So
1. if you add new agent, yum upgrade will not touch your added new files
(of course, there can be special situations, like we add agent with same
name, where this is not true, but ...)
2. if you change existing agent, you can just rename it, and you are in
same situation as in 1.

Of course, in both cases, we will be happy, if you can send that
agent/fixes to us (for example me), and we can make it part of our GIT
tree and maybe officially support.

Regards,
  Honza

Ralph Zukeb wrote:
> Greetings,
> 
> What is the recommended way of adding custom fence agents to a cluster
> setup, so that changes will not be lost on a "yum upgrade"?
> 
> Ralph
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From jfriesse at redhat.com  Wed Apr 22 08:03:06 2009
From: jfriesse at redhat.com (Jan Friesse)
Date: Wed, 22 Apr 2009 10:03:06 +0200
Subject: [Linux-cluster] fence method for vmware images
In-Reply-To: <3128ba140904220004l729dc944lbeaee06a0130df41@mail.gmail.com>
References: <3128ba140904220004l729dc944lbeaee06a0130df41@mail.gmail.com>
Message-ID: <49EECF3A.6040002@redhat.com>

ESG,
in RHEL 5.3 is included vmware_fence agent. Please try to look to man
page and/or http://sources.redhat.com/cluster/wiki/VMware_FencingConfig
(<--- includes new version of agent for ESX/ESXi and Server 2 (not
supported in RHEL 5.3 default version)).

Regards,
  Honza

ESGLinux wrote:
> Hello all,
> 
> I?m testing a two node cluster using  a VMWare Server with two images
> running the nodes.
> 
> I have seen that there is a fence method for virtual machines (I suspect
> that only for Xen images, but I?m not sure).
> 
> So, my question is which is the best fence method for this scenario (I have
> not fence hardware at all)
> 
> Thanks
> 
> ESG
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From ralphzukeb at gmail.com  Wed Apr 22 08:21:12 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 09:21:12 +0100
Subject: [Linux-cluster] Custom fence agents
In-Reply-To: <49EECE83.1010408@redhat.com>
References: <6e4a03690904220011l2748f214l5c818ed7745b0342@mail.gmail.com>
	<49EECE83.1010408@redhat.com>
Message-ID: <6e4a03690904220121g145fbb92qc27494581f2ec247@mail.gmail.com>

2009/4/22 Jan Friesse <jfriesse at redhat.com>:
> Ralph,
> fence agents are just files which lives in /sbin. So
> 1. if you add new agent, yum upgrade will not touch your added new files
> (of course, there can be special situations, like we add agent with same
> name, where this is not true, but ...)
> 2. if you change existing agent, you can just rename it, and you are in
> same situation as in 1.
>
> Of course, in both cases, we will be happy, if you can send that
> agent/fixes to us (for example me), and we can make it part of our GIT
> tree and maybe officially support.
>
> Regards,
> ?Honza

Thanks Honza.

To get the fencing agents to show up in luci, must I patch luci, or is
there a mechanism for adding new agents?
I don't want to patch luci and lose changes on an update.


From esggrupos at gmail.com  Wed Apr 22 09:14:59 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Wed, 22 Apr 2009 11:14:59 +0200
Subject: [Linux-cluster] fence method for vmware images
In-Reply-To: <49EECF3A.6040002@redhat.com>
References: <3128ba140904220004l729dc944lbeaee06a0130df41@mail.gmail.com>
	<49EECF3A.6040002@redhat.com>
Message-ID: <3128ba140904220214i4c0ea256k5b206205d13fa916@mail.gmail.com>

Hello,

Thanks for your answer,

I?m using conga tu manage my cluster and the fence devices it allows me to
configure are this:

        <option name="fence_apc" value="fence_apc">APC Power Switch</option>
	<option name="fence_wti" value="fence_wti">WTI Power Switch</option>

	<option name="fence_brocade" value="fence_brocade">Brocade Fabric
Switch</option>
	<option name="fence_mcdata" value="fence_mcdata">McData SAN Switch</option>
	<option name="fence_sanbox2" value="fence_sanbox2">QLogic SANbox2</option>
	<option name="fence_vixel" value="fence_vixel">Vixel SAN Switch</option>
	<option name="fence_gnbd" value="fence_gnbd">GNBD</option>
	<option name="fence_egenera" value="fence_egenera">Egenera SAN
Controller</option>

	<option name="fence_bladecenter" value="fence_bladecenter">IBM Blade
Center</option>
	<option name="fence_bullpap" value="fence_bullpap">Bull PAP</option>
	<option name="fence_xvm" value="fence_xvm">Virtual Machine Fencing</option>
	<option name="fence_scsi" value="fence_scsi">SCSI Fencing</option>


how can I configure vmware_fence?


greetings

ESG

2009/4/22 Jan Friesse <jfriesse at redhat.com>

> ESG,
> in RHEL 5.3 is included vmware_fence agent. Please try to look to man
> page and/or http://sources.redhat.com/cluster/wiki/VMware_FencingConfig
> (<--- includes new version of agent for ESX/ESXi and Server 2 (not
> supported in RHEL 5.3 default version)).
>
> Regards,
>  Honza
>
> ESGLinux wrote:
> > Hello all,
> >
> > I?m testing a two node cluster using  a VMWare Server with two images
> > running the nodes.
> >
> > I have seen that there is a fence method for virtual machines (I suspect
> > that only for Xen images, but I?m not sure).
> >
> > So, my question is which is the best fence method for this scenario (I
> have
> > not fence hardware at all)
> >
> > Thanks
> >
> > ESG
> >
> >
> >
> > ------------------------------------------------------------------------
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/51fae40b/attachment.htm>

From ralphzukeb at gmail.com  Wed Apr 22 09:55:32 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 10:55:32 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
Message-ID: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>

Hello again! Thanks for the tips so far.

How can I ensure that the node holding the services and resources in a
two node cluster keeps holding them if the two nodes cannot see each
other?

I am not using a quorum disk.

Ralph


From mrugeshkarnik at gmail.com  Wed Apr 22 09:59:34 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Wed, 22 Apr 2009 15:29:34 +0530
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
Message-ID: <200904221529.34421.mrugeshkarnik@gmail.com>

On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
> Hello again! Thanks for the tips so far.
>
> How can I ensure that the node holding the services and resources in a
> two node cluster keeps holding them if the two nodes cannot see each
> other?
>
> I am not using a quorum disk.

<cman two_nodes="1" expected_votes="1"/>

Make sure you configure fencing properly though. Also, getting rid of the 
two_node attribute requires a full cluster restart.

Mrugesh


From ralphzukeb at gmail.com  Wed Apr 22 10:14:49 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 11:14:49 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <200904221529.34421.mrugeshkarnik@gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
Message-ID: <6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>

2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
> On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
>> Hello again! Thanks for the tips so far.
>>
>> How can I ensure that the node holding the services and resources in a
>> two node cluster keeps holding them if the two nodes cannot see each
>> other?
>>
>> I am not using a quorum disk.
>
> <cman two_nodes="1" expected_votes="1"/>
>
> Make sure you configure fencing properly though. Also, getting rid of the
> two_node attribute requires a full cluster restart.
>
> Mrugesh
>

I tried this, but when I killed the switch for the cluster traffic,
BOTH nodes got fenced! Can I avoid this?


From m.watts at eris.qinetiq.com  Wed Apr 22 10:27:36 2009
From: m.watts at eris.qinetiq.com (Mark Watts)
Date: Wed, 22 Apr 2009 11:27:36 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
Message-ID: <200904221127.40332.m.watts@eris.qinetiq.com>


On Wednesday 22 April 2009 11:14:49 Ralph Zukeb wrote:
> 2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
> > On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
> >> Hello again! Thanks for the tips so far.
> >>
> >> How can I ensure that the node holding the services and resources in a
> >> two node cluster keeps holding them if the two nodes cannot see each
> >> other?
> >>
> >> I am not using a quorum disk.
> >
> > <cman two_nodes="1" expected_votes="1"/>
> >
> > Make sure you configure fencing properly though. Also, getting rid of the
> > two_node attribute requires a full cluster restart.
> >
> > Mrugesh
>
> I tried this, but when I killed the switch for the cluster traffic,
> BOTH nodes got fenced! Can I avoid this?

Don't kill the switch like that - you're causing a split brain.
Use two switches and multiple interfaces (bonding) to provide network 
redundancy.

Mark.


-- 
Mark Watts BSc RHCE MBCS
Senior Systems Engineer
QinetiQ Applied Technologies
GPG Key: http://www.linux-corner.info/mwatts.gpg
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/c1477a61/attachment.sig>

From ralphzukeb at gmail.com  Wed Apr 22 10:39:13 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 11:39:13 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <200904221127.40332.m.watts@eris.qinetiq.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
	<200904221127.40332.m.watts@eris.qinetiq.com>
Message-ID: <6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>

2009/4/22 Mark Watts <m.watts at eris.qinetiq.com>:
>
> On Wednesday 22 April 2009 11:14:49 Ralph Zukeb wrote:
>> 2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
>> > On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
>> >> Hello again! Thanks for the tips so far.
>> >>
>> >> How can I ensure that the node holding the services and resources in a
>> >> two node cluster keeps holding them if the two nodes cannot see each
>> >> other?
>> >>
>> >> I am not using a quorum disk.
>> >
>> > <cman two_nodes="1" expected_votes="1"/>
>> >
>> > Make sure you configure fencing properly though. Also, getting rid of the
>> > two_node attribute requires a full cluster restart.
>> >
>> > Mrugesh
>>
>> I tried this, but when I killed the switch for the cluster traffic,
>> BOTH nodes got fenced! Can I avoid this?
>
> Don't kill the switch like that - you're causing a split brain.
> Use two switches and multiple interfaces (bonding) to provide network
> redundancy.
>
> Mark.

I have bonding, but both network paths go over the same physical
cable. I want to simulate a cable cut or a switch dying along this
stretch of cable.


From ralphzukeb at gmail.com  Wed Apr 22 10:41:29 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 11:41:29 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
	<200904221127.40332.m.watts@eris.qinetiq.com>
	<6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
Message-ID: <6e4a03690904220341q4543654cu48be4bc78a00a1bf@mail.gmail.com>

2009/4/22 Ralph Zukeb <ralphzukeb at gmail.com>:
> 2009/4/22 Mark Watts <m.watts at eris.qinetiq.com>:
>>
>> On Wednesday 22 April 2009 11:14:49 Ralph Zukeb wrote:
>>> 2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
>>> > On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
>>> >> Hello again! Thanks for the tips so far.
>>> >>
>>> >> How can I ensure that the node holding the services and resources in a
>>> >> two node cluster keeps holding them if the two nodes cannot see each
>>> >> other?
>>> >>
>>> >> I am not using a quorum disk.
>>> >
>>> > <cman two_nodes="1" expected_votes="1"/>
>>> >
>>> > Make sure you configure fencing properly though. Also, getting rid of the
>>> > two_node attribute requires a full cluster restart.
>>> >
>>> > Mrugesh
>>>
>>> I tried this, but when I killed the switch for the cluster traffic,
>>> BOTH nodes got fenced! Can I avoid this?
>>
>> Don't kill the switch like that - you're causing a split brain.
>> Use two switches and multiple interfaces (bonding) to provide network
>> redundancy.
>>
>> Mark.
>
> I have bonding, but both network paths go over the same physical
> cable. I want to simulate a cable cut or a switch dying along this
> stretch of cable.
>

Maybe I can specify a different fencing delay per host?


From ralphzukeb at gmail.com  Wed Apr 22 10:57:00 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 11:57:00 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220341q4543654cu48be4bc78a00a1bf@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
	<200904221127.40332.m.watts@eris.qinetiq.com>
	<6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
	<6e4a03690904220341q4543654cu48be4bc78a00a1bf@mail.gmail.com>
Message-ID: <6e4a03690904220357v72c9c3b1l75e4fdfbad4e89f9@mail.gmail.com>

>>>> I tried this, but when I killed the switch for the cluster traffic,
>>>> BOTH nodes got fenced! Can I avoid this?
>>>
>>> Don't kill the switch like that - you're causing a split brain.
>>> Use two switches and multiple interfaces (bonding) to provide network
>>> redundancy.

Can I use a tie-breaker ip without a quorum disk?


From gordan at bobich.net  Wed Apr 22 11:01:32 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 22 Apr 2009 12:01:32 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
Message-ID: <71a66eeca860911f0e95e2aa78a46abf@localhost>

On Wed, 22 Apr 2009 11:14:49 +0100, Ralph Zukeb <ralphzukeb at gmail.com>
wrote:
> 2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
>> On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
>>> Hello again! Thanks for the tips so far.
>>>
>>> How can I ensure that the node holding the services and resources in a
>>> two node cluster keeps holding them if the two nodes cannot see each
>>> other?
>>>
>>> I am not using a quorum disk.
>>
>> <cman two_nodes="1" expected_votes="1"/>
>>
>> Make sure you configure fencing properly though. Also, getting rid of
the
>> two_node attribute requires a full cluster restart.
>>
>> Mrugesh
>>
> 
> I tried this, but when I killed the switch for the cluster traffic,
> BOTH nodes got fenced! Can I avoid this?

One option would be to edit the fencing agents to include a different
delay on both nodes. (e.g. put something like sleep 10 in the fencing
agent on the secondary node). That will make sure that the secondary
node loses the fencing shootout.

Gordan


From ralphzukeb at gmail.com  Wed Apr 22 11:09:26 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 12:09:26 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <71a66eeca860911f0e95e2aa78a46abf@localhost>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
	<71a66eeca860911f0e95e2aa78a46abf@localhost>
Message-ID: <6e4a03690904220409v4d37a048h3c54e410e43b7d@mail.gmail.com>

2009/4/22 Gordan Bobic <gordan at bobich.net>:
> On Wed, 22 Apr 2009 11:14:49 +0100, Ralph Zukeb <ralphzukeb at gmail.com>
> wrote:
>> 2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
>>> On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
>>>> Hello again! Thanks for the tips so far.
>>>>
>>>> How can I ensure that the node holding the services and resources in a
>>>> two node cluster keeps holding them if the two nodes cannot see each
>>>> other?
>>>>
>>>> I am not using a quorum disk.
>>>
>>> <cman two_nodes="1" expected_votes="1"/>
>>>
>>> Make sure you configure fencing properly though. Also, getting rid of
> the
>>> two_node attribute requires a full cluster restart.
>>>
>>> Mrugesh
>>>
>>
>> I tried this, but when I killed the switch for the cluster traffic,
>> BOTH nodes got fenced! Can I avoid this?
>
> One option would be to edit the fencing agents to include a different
> delay on both nodes. (e.g. put something like sleep 10 in the fencing
> agent on the secondary node). That will make sure that the secondary
> node loses the fencing shootout.
>
> Gordan
>

Thanks. How do I ensure a certain node is secondary?


From mrugeshkarnik at gmail.com  Wed Apr 22 11:26:09 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Wed, 22 Apr 2009 16:56:09 +0530
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221127.40332.m.watts@eris.qinetiq.com>
	<6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
Message-ID: <200904221656.09304.mrugeshkarnik@gmail.com>

On Wednesday 22 Apr 2009 16:09:13 Ralph Zukeb wrote:
> I have bonding, but both network paths go over the same physical
> cable. I want to simulate a cable cut or a switch dying along this
> stretch of cable.

Why not have two switches cascaded with the cable from each of two NICs 
plugged into a different switch, on both nodes?

Mrugesh


From ralphzukeb at gmail.com  Wed Apr 22 11:31:00 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Wed, 22 Apr 2009 12:31:00 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <200904221656.09304.mrugeshkarnik@gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221127.40332.m.watts@eris.qinetiq.com>
	<6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
	<200904221656.09304.mrugeshkarnik@gmail.com>
Message-ID: <6e4a03690904220431u25b617d9u583c328f5b4497aa@mail.gmail.com>

2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
> On Wednesday 22 Apr 2009 16:09:13 Ralph Zukeb wrote:
>> I have bonding, but both network paths go over the same physical
>> cable. I want to simulate a cable cut or a switch dying along this
>> stretch of cable.
>
> Why not have two switches cascaded with the cable from each of two NICs
> plugged into a different switch, on both nodes?
>
> Mrugesh

Because the connection between the two switches is still one physical route.


From gordan at bobich.net  Wed Apr 22 11:40:39 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 22 Apr 2009 12:40:39 +0100
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220409v4d37a048h3c54e410e43b7d@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>	<200904221529.34421.mrugeshkarnik@gmail.com>	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>	<71a66eeca860911f0e95e2aa78a46abf@localhost>
	<6e4a03690904220409v4d37a048h3c54e410e43b7d@mail.gmail.com>
Message-ID: <1394543b216e50c7dd356cd72b20bb5d@localhost>

On Wed, 22 Apr 2009 12:09:26 +0100, Ralph Zukeb <ralphzukeb at gmail.com>
wrote:
> 2009/4/22 Gordan Bobic <gordan at bobich.net>:
>> On Wed, 22 Apr 2009 11:14:49 +0100, Ralph Zukeb <ralphzukeb at gmail.com>
>> wrote:
>>> 2009/4/22 Mrugesh Karnik <mrugeshkarnik at gmail.com>:
>>>> On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
>>>>> Hello again! Thanks for the tips so far.
>>>>>
>>>>> How can I ensure that the node holding the services and resources in
a
>>>>> two node cluster keeps holding them if the two nodes cannot see each
>>>>> other?
>>>>>
>>>>> I am not using a quorum disk.
>>>>
>>>> <cman two_nodes="1" expected_votes="1"/>
>>>>
>>>> Make sure you configure fencing properly though. Also, getting rid of
>> the
>>>> two_node attribute requires a full cluster restart.
>>>>
>>>> Mrugesh
>>>>
>>>
>>> I tried this, but when I killed the switch for the cluster traffic,
>>> BOTH nodes got fenced! Can I avoid this?
>>
>> One option would be to edit the fencing agents to include a different
>> delay on both nodes. (e.g. put something like sleep 10 in the fencing
>> agent on the secondary node). That will make sure that the secondary
>> node loses the fencing shootout.
>>
> 
> Thanks. How do I ensure a certain node is secondary?

You don't. You arbitrarily pick with one you want to be the primary
and doctor the fencing agent to add a delay on the other one, which
will thus be made to lose a fencing shootout.

Gordan


From jfriesse at redhat.com  Wed Apr 22 13:14:48 2009
From: jfriesse at redhat.com (Jan Friesse)
Date: Wed, 22 Apr 2009 15:14:48 +0200
Subject: [Linux-cluster] Custom fence agents
In-Reply-To: <6e4a03690904220121g145fbb92qc27494581f2ec247@mail.gmail.com>
References: <6e4a03690904220011l2748f214l5c818ed7745b0342@mail.gmail.com>	<49EECE83.1010408@redhat.com>
	<6e4a03690904220121g145fbb92qc27494581f2ec247@mail.gmail.com>
Message-ID: <49EF1848.6060804@redhat.com>

Ralph,
Luci currently doesn't support mechanism like that. So yes, you must
patch Luci. And yes, it's highly probable, that you will lost your changes.

Regards,
  Honza


Ralph Zukeb wrote:
> 2009/4/22 Jan Friesse <jfriesse at redhat.com>:
>> Ralph,
>> fence agents are just files which lives in /sbin. So
>> 1. if you add new agent, yum upgrade will not touch your added new files
>> (of course, there can be special situations, like we add agent with same
>> name, where this is not true, but ...)
>> 2. if you change existing agent, you can just rename it, and you are in
>> same situation as in 1.
>>
>> Of course, in both cases, we will be happy, if you can send that
>> agent/fixes to us (for example me), and we can make it part of our GIT
>> tree and maybe officially support.
>>
>> Regards,
>>  Honza
> 
> Thanks Honza.
> 
> To get the fencing agents to show up in luci, must I patch luci, or is
> there a mechanism for adding new agents?
> I don't want to patch luci and lose changes on an update.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From jfriesse at redhat.com  Wed Apr 22 13:16:12 2009
From: jfriesse at redhat.com (Jan Friesse)
Date: Wed, 22 Apr 2009 15:16:12 +0200
Subject: [Linux-cluster] fence method for vmware images
In-Reply-To: <3128ba140904220214i4c0ea256k5b206205d13fa916@mail.gmail.com>
References: <3128ba140904220004l729dc944lbeaee06a0130df41@mail.gmail.com>	<49EECF3A.6040002@redhat.com>
	<3128ba140904220214i4c0ea256k5b206205d13fa916@mail.gmail.com>
Message-ID: <49EF189C.5040008@redhat.com>

ESG,
vmware is currently not supported in Luci. You can change it by direct
edit of your cluster.conf configuration file. Please refer
http://sources.redhat.com/cluster/wiki/VMware_FencingConfig, how to do
it (there is example).

Regards,
  Honza


ESGLinux wrote:
> Hello,
> 
> Thanks for your answer,
> 
> I?m using conga tu manage my cluster and the fence devices it allows me to
> configure are this:
> 
>         <option name="fence_apc" value="fence_apc">APC Power Switch</option>
> 	<option name="fence_wti" value="fence_wti">WTI Power Switch</option>
> 
> 	<option name="fence_brocade" value="fence_brocade">Brocade Fabric
> Switch</option>
> 	<option name="fence_mcdata" value="fence_mcdata">McData SAN Switch</option>
> 	<option name="fence_sanbox2" value="fence_sanbox2">QLogic SANbox2</option>
> 	<option name="fence_vixel" value="fence_vixel">Vixel SAN Switch</option>
> 	<option name="fence_gnbd" value="fence_gnbd">GNBD</option>
> 	<option name="fence_egenera" value="fence_egenera">Egenera SAN
> Controller</option>
> 
> 	<option name="fence_bladecenter" value="fence_bladecenter">IBM Blade
> Center</option>
> 	<option name="fence_bullpap" value="fence_bullpap">Bull PAP</option>
> 	<option name="fence_xvm" value="fence_xvm">Virtual Machine Fencing</option>
> 	<option name="fence_scsi" value="fence_scsi">SCSI Fencing</option>
> 
> 
> 
> how can I configure vmware_fence?
> 
> 
> greetings
> 
> ESG
> 
> 2009/4/22 Jan Friesse <jfriesse at redhat.com>
> 
>> ESG,
>> in RHEL 5.3 is included vmware_fence agent. Please try to look to man
>> page and/or http://sources.redhat.com/cluster/wiki/VMware_FencingConfig
>> (<--- includes new version of agent for ESX/ESXi and Server 2 (not
>> supported in RHEL 5.3 default version)).
>>
>> Regards,
>>  Honza
>>
>> ESGLinux wrote:
>>> Hello all,
>>>
>>> I?m testing a two node cluster using  a VMWare Server with two images
>>> running the nodes.
>>>
>>> I have seen that there is a fence method for virtual machines (I suspect
>>> that only for Xen images, but I?m not sure).
>>>
>>> So, my question is which is the best fence method for this scenario (I
>> have
>>> not fence hardware at all)
>>>
>>> Thanks
>>>
>>> ESG
>>>
>>>
>>>
>>> ------------------------------------------------------------------------
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From billpp at gmail.com  Wed Apr 22 13:46:39 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 10:46:39 -0300
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220357v72c9c3b1l75e4fdfbad4e89f9@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
	<200904221127.40332.m.watts@eris.qinetiq.com>
	<6e4a03690904220339v346422fdi25d95f350a43088f@mail.gmail.com>
	<6e4a03690904220341q4543654cu48be4bc78a00a1bf@mail.gmail.com>
	<6e4a03690904220357v72c9c3b1l75e4fdfbad4e89f9@mail.gmail.com>
Message-ID: <58aa8d780904220646q148ca8b8y7581e18b7ac75207@mail.gmail.com>

Yes, you can.

http://sources.redhat.com/cluster/wiki/FAQ/CMAN#tie_breaker

Some examples by qdisk manpage:

3.3. Examples
3.3.1. 3 cluster nodes & 3 routers
        <cman expected_votes="6" .../>
        <clusternodes>
            <clusternode name="node1" votes="1" ... />
            <clusternode name="node2" votes="1" ... />
            <clusternode name="node3" votes="1" ... />
        </clusternodes>
        <quorumd interval="1" tko="10" votes="3" label="testing">
            <heuristic program="ping A -c1 -t1" score="1" interval="2" tko="3"/>
            <heuristic program="ping B -c1 -t1" score="1" interval="2" tko="3"/>
            <heuristic program="ping C -c1 -t1" score="1" interval="2" tko="3"/>
        </quorumd>


--

Fl?vio do Carmo J?nior aka waKKu

On Wed, Apr 22, 2009 at 7:57 AM, Ralph Zukeb <ralphzukeb at gmail.com> wrote:
>>>>> I tried this, but when I killed the switch for the cluster traffic,
>>>>> BOTH nodes got fenced! Can I avoid this?
>>>>
>>>> Don't kill the switch like that - you're causing a split brain.
>>>> Use two switches and multiple interfaces (bonding) to provide network
>>>> redundancy.
>
> Can I use a tie-breaker ip without a quorum disk?
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From billpp at gmail.com  Wed Apr 22 13:53:26 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 10:53:26 -0300
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
Message-ID: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>

Hi folks,

I'm setting up an IMAP server using postfix and dovecot.
I've a IBM DS4700 storage configured with RDAC I/O multipath and using
OCFS2 as filesystem, everything works fine.

The problem comes when my first I/O path goes down, apparently this
generate a kernel oops and I got my two servers fenced. Is there a way
to configure a good fence but still make use of multipath IO using
OCFS2?


Thanks in advance.


--

Fl?vio do Carmo J?nior aka waKKu


From esggrupos at gmail.com  Wed Apr 22 14:55:19 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Wed, 22 Apr 2009 16:55:19 +0200
Subject: [Linux-cluster] fence method for vmware images
In-Reply-To: <49EF189C.5040008@redhat.com>
References: <3128ba140904220004l729dc944lbeaee06a0130df41@mail.gmail.com>
	<49EECF3A.6040002@redhat.com>
	<3128ba140904220214i4c0ea256k5b206205d13fa916@mail.gmail.com>
	<49EF189C.5040008@redhat.com>
Message-ID: <3128ba140904220755ha3578a5x4b71f014275a7f25@mail.gmail.com>

Hello,

I?ll try it,

thanks for your help

ESG

2009/4/22 Jan Friesse <jfriesse at redhat.com>

> ESG,
> vmware is currently not supported in Luci. You can change it by direct
> edit of your cluster.conf configuration file. Please refer
> http://sources.redhat.com/cluster/wiki/VMware_FencingConfig, how to do
> it (there is example).
>
> Regards,
>  Honza
>
>
>
> ESGLinux wrote:
> > Hello,
> >
> > Thanks for your answer,
> >
> > I?m using conga tu manage my cluster and the fence devices it allows me
> to
> > configure are this:
> >
> >         <option name="fence_apc" value="fence_apc">APC Power
> Switch</option>
> >       <option name="fence_wti" value="fence_wti">WTI Power
> Switch</option>
> >
> >       <option name="fence_brocade" value="fence_brocade">Brocade Fabric
> > Switch</option>
> >       <option name="fence_mcdata" value="fence_mcdata">McData SAN
> Switch</option>
> >       <option name="fence_sanbox2" value="fence_sanbox2">QLogic
> SANbox2</option>
> >       <option name="fence_vixel" value="fence_vixel">Vixel SAN
> Switch</option>
> >       <option name="fence_gnbd" value="fence_gnbd">GNBD</option>
> >       <option name="fence_egenera" value="fence_egenera">Egenera SAN
> > Controller</option>
> >
> >       <option name="fence_bladecenter" value="fence_bladecenter">IBM
> Blade
> > Center</option>
> >       <option name="fence_bullpap" value="fence_bullpap">Bull
> PAP</option>
> >       <option name="fence_xvm" value="fence_xvm">Virtual Machine
> Fencing</option>
> >       <option name="fence_scsi" value="fence_scsi">SCSI Fencing</option>
> >
> >
> >
> > how can I configure vmware_fence?
> >
> >
> > greetings
> >
> > ESG
> >
> > 2009/4/22 Jan Friesse <jfriesse at redhat.com>
> >
> >> ESG,
> >> in RHEL 5.3 is included vmware_fence agent. Please try to look to man
> >> page and/or http://sources.redhat.com/cluster/wiki/VMware_FencingConfig
> >> (<--- includes new version of agent for ESX/ESXi and Server 2 (not
> >> supported in RHEL 5.3 default version)).
> >>
> >> Regards,
> >>  Honza
> >>
> >> ESGLinux wrote:
> >>> Hello all,
> >>>
> >>> I?m testing a two node cluster using  a VMWare Server with two images
> >>> running the nodes.
> >>>
> >>> I have seen that there is a fence method for virtual machines (I
> suspect
> >>> that only for Xen images, but I?m not sure).
> >>>
> >>> So, my question is which is the best fence method for this scenario (I
> >> have
> >>> not fence hardware at all)
> >>>
> >>> Thanks
> >>>
> >>> ESG
> >>>
> >>>
> >>>
> >>>
> ------------------------------------------------------------------------
> >>>
> >>> --
> >>> Linux-cluster mailing list
> >>> Linux-cluster at redhat.com
> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>
> >
> >
> > ------------------------------------------------------------------------
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/9f1a733c/attachment.htm>

From esggrupos at gmail.com  Wed Apr 22 15:58:49 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Wed, 22 Apr 2009 17:58:49 +0200
Subject: [Linux-cluster] help understanding How service ip works
Message-ID: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>

Hello all

This is not a support question. Is about understanding how the service ip
works.

I don?t understand where this ghost ip is. I can't see it with ifconfig in
any of the nodes but it's there because it answers to my packets.

I think it?s related to multicasting but I don?t see how. I can get the
multicast ip of my nodes with, for example, cman_tool status. But I can' t
get the service IP.

I have read about multicast ( http://www.firewall.cx/multicast-intro.php,
http://www.tldp.org/HOWTO/Multicast-HOWTO.html) but dont see the light.

any resource(man page, howto, explanation...) is wellcome

Thanks in advance

ESG
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/6558c756/attachment.htm>

From gordan at bobich.net  Wed Apr 22 16:03:40 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 22 Apr 2009 17:03:40 +0100
Subject: [Linux-cluster] help understanding How service ip works
In-Reply-To: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>
References: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>
Message-ID: <483a01714f156482053afca26939718f@localhost>

It has nothing to do with multicasting.
Try "ip addr".

On Wed, 22 Apr 2009 17:58:49 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> Hello all
> 
> This is not a support question. Is about understanding how the service ip
> works.
> 
> I don?t understand where this ghost ip is. I can't see it with ifconfig
in
> any of the nodes but it's there because it answers to my packets.
> 
> I think it?s related to multicasting but I don?t see how. I can get the
> multicast ip of my nodes with, for example, cman_tool status. But I can'
t
> get the service IP.
> 
> I have read about multicast ( http://www.firewall.cx/multicast-intro.php,
> http://www.tldp.org/HOWTO/Multicast-HOWTO.html) but dont see the light.
> 
> any resource(man page, howto, explanation...) is wellcome
> 
> Thanks in advance
> 
> ESG


From pavlos.parissis at gmail.com  Wed Apr 22 16:14:08 2009
From: pavlos.parissis at gmail.com (Pavlos Parissis)
Date: Wed, 22 Apr 2009 18:14:08 +0200
Subject: [Linux-cluster] help understanding How service ip works
In-Reply-To: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>
References: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>
Message-ID: <fb143d260904220914w4fb3d7e2tfa935f991c11625a@mail.gmail.com>

2009/4/22 ESGLinux <esggrupos at gmail.com>

> Hello all
>
> This is not a support question. Is about understanding how the service ip
> works.
>
> I don?t understand where this ghost ip is. I can't see it with ifconfig in
> any of the nodes but it's there because it answers to my packets.
>
>
You need to use ip(8) tool to see the service ip, the reason is that the
/usr/share/cluster/ip.sh uses that tool to assign IPs. You can also read
that file to get  a better understanding of the Service IP management in a
cluster environment.


Cheers,
Pavlos
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/af65ce8c/attachment.htm>

From esggrupos at gmail.com  Wed Apr 22 16:15:34 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Wed, 22 Apr 2009 18:15:34 +0200
Subject: [Linux-cluster] help understanding How service ip works
In-Reply-To: <483a01714f156482053afca26939718f@localhost>
References: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>
	<483a01714f156482053afca26939718f@localhost>
Message-ID: <3128ba140904220915p5f7a0a10oaaa34992a3c271e9@mail.gmail.com>

WOW!

2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast qlen
1000
    link/ether 00:0c:29:2a:02:db brd ff:ff:ff:ff:ff:ff
    inet 192.168.1.195/24 brd 192.168.1.255 scope global eth0
    inet 192.168.1.184/24 scope global secondary eth0
    inet6 fe80::20c:29ff:fe2a:2db/64 scope link
       valid_lft forever preferred_lft forever

here is the litle B.....!!!

2 question brings to my mind.

first, why ipconfig dont show it?

second, why hell I thought it was related to multicast???? I thought I have
read about it anywhere...


Thank you ALL for your help (again....)

ESG


2009/4/22 Gordan Bobic <gordan at bobich.net>

> It has nothing to do with multicasting.
> Try "ip addr".
>
> On Wed, 22 Apr 2009 17:58:49 +0200, ESGLinux <esggrupos at gmail.com> wrote:
> > Hello all
> >
> > This is not a support question. Is about understanding how the service ip
> > works.
> >
> > I don?t understand where this ghost ip is. I can't see it with ifconfig
> in
> > any of the nodes but it's there because it answers to my packets.
> >
> > I think it?s related to multicasting but I don?t see how. I can get the
> > multicast ip of my nodes with, for example, cman_tool status. But I can'
> t
> > get the service IP.
> >
> > I have read about multicast ( http://www.firewall.cx/multicast-intro.php
> ,
> > http://www.tldp.org/HOWTO/Multicast-HOWTO.html) but dont see the light.
> >
> > any resource(man page, howto, explanation...) is wellcome
> >
> > Thanks in advance
> >
> > ESG
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/f505fb66/attachment.htm>

From jumanjiman at gmail.com  Wed Apr 22 16:48:14 2009
From: jumanjiman at gmail.com (Paul Morgan)
Date: Wed, 22 Apr 2009 16:48:14 +0000
Subject: [Linux-cluster] help understanding How service ip works
In-Reply-To: <3128ba140904220915p5f7a0a10oaaa34992a3c271e9@mail.gmail.com>
References: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com><483a01714f156482053afca26939718f@localhost><3128ba140904220915p5f7a0a10oaaa34992a3c271e9@mail.gmail.com>
Message-ID: <1823429546-1240418928-cardhu_decombobulator_blackberry.rim.net-116853864-@bxe1239.bisx.prod.on.blackberry>

ifconfig only works on labeled interfaces. Labeled interfaces must include the device name in the label, which means labels would force your cluster to be homogeneous. Besides, the ip command is just better ;-)

-paul

-----Original Message-----
From: ESGLinux <esggrupos at gmail.com>

Date: Wed, 22 Apr 2009 18:15:34 
To: linux clustering<linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] help understanding How service ip works


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From mad at wol.de  Wed Apr 22 16:51:03 2009
From: mad at wol.de (Marc - A. Dahlhaus [ Administration | Westermann GmbH ])
Date: Wed, 22 Apr 2009 18:51:03 +0200
Subject: [Linux-cluster] help understanding How service ip works
In-Reply-To: <3128ba140904220915p5f7a0a10oaaa34992a3c271e9@mail.gmail.com>
References: <3128ba140904220858m6824146di1c987ff52023b8c7@mail.gmail.com>
	<483a01714f156482053afca26939718f@localhost>
	<3128ba140904220915p5f7a0a10oaaa34992a3c271e9@mail.gmail.com>
Message-ID: <1240419063.30149.33.camel@marc>

Hello,

Am Mittwoch, den 22.04.2009, 18:15 +0200 schrieb ESGLinux:
> WOW!
> 
> 2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast
> qlen 1000
>     link/ether 00:0c:29:2a:02:db brd ff:ff:ff:ff:ff:ff
>     inet 192.168.1.195/24 brd 192.168.1.255 scope global eth0
>     inet 192.168.1.184/24 scope global secondary eth0
>     inet6 fe80::20c:29ff:fe2a:2db/64 scope link
>        valid_lft forever preferred_lft forever
> 
> here is the litle B.....!!!
> 
> 2 question brings to my mind.
> 
> first, why ipconfig dont show it?

Simple answer:

ipconfig is deprecated, doesn't support all functions of the linux
network stack and is in your distro just for stoneaged shellscript
compatibility purposes.

iproute2 is the interface one should use begining with kernel 2.4 if i
recall correctly. learn to use it and stick to it, its by far a better
pet than ipconfig ever was.

> 
> second, why hell I thought it was related to multicast???? I thought I
> have read about it anywhere...
> 
> 
> Thank you ALL for your help (again....)
> 
> ESG
> 
> 
> 2009/4/22 Gordan Bobic <gordan at bobich.net>
>         It has nothing to do with multicasting.
>         Try "ip addr".
>         
>         
>         On Wed, 22 Apr 2009 17:58:49 +0200, ESGLinux
>         <esggrupos at gmail.com> wrote:
>         > Hello all
>         >
>         > This is not a support question. Is about understanding how
>         the service ip
>         > works.
>         >
>         > I don?t understand where this ghost ip is. I can't see it
>         with ifconfig
>         in
>         > any of the nodes but it's there because it answers to my
>         packets.
>         >
>         > I think it?s related to multicasting but I don?t see how. I
>         can get the
>         > multicast ip of my nodes with, for example, cman_tool
>         status. But I can'
>         t
>         > get the service IP.
>         >
>         > I have read about multicast
>         ( http://www.firewall.cx/multicast-intro.php,
>         > http://www.tldp.org/HOWTO/Multicast-HOWTO.html) but dont see
>         the light.
>         >
>         > any resource(man page, howto, explanation...) is wellcome
>         >
>         > Thanks in advance
>         >
>         > ESG
>         
>         
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com
>         https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From dhayes501 at gmail.com  Wed Apr 22 17:00:30 2009
From: dhayes501 at gmail.com (Dan Hayes)
Date: Wed, 22 Apr 2009 12:00:30 -0500
Subject: [Linux-cluster] DRAC5 fencing problem with Redhat 5.3 cluster
Message-ID: <55bfc63d0904221000i6f2804f7t3586a6087ca6b407@mail.gmail.com>

I have 8 Dell servers all loaded with Redhat 5.3.  The machines have an
external and internal IP address.  The cluster is configured on the internal
ip address.

The cluster appears to be working fine except for fencing.  I try to
manually fence a node, but get a connection error
[root at w002 ~]# fence_node lb001.domain.com
agent "fence_drac5" reports: Unable to connect/login to fencing device

I was able to get the fence_drac5 command to work directly by adding the
"-x" option for using SSH.  It says the connection times out, but it does
shut down the other machine
[root at w002 ~]# fence_drac5 -a 165.289.178.221 -l root -p pass -x
lb001.domain.com
Connection timed out

Why does the connection time out?  And how do I add the "-x" option to the
cluster.conf file so that the fencing agent can connect?

Here are my /etc/hosts and /etc/cluster/cluster.conf files.

##############################################/etc/hosts
127.0.0.1               localhost.localdomain localhost
::1             localhost6.localdomain6 localhost6

#eth0  --  p=public interface
165.289.178.111    w001p w001p.domain.com
165.289.178.112    w002p w002p.domain.com
165.289.178.113    w003p w003p.domain.com
165.289.178.121    lb001p lb001p.domain.com
165.289.178.122    lb002p lb002p.domain.com
165.289.178.124    db001p db001p.domain.com
165.289.178.125    db002p db002p.domain.com
165.289.178.126    dev001p dev001p.domain.com
165.289.178.211    w001-drac
165.289.178.212    w002-drac
165.289.178.213    w003-drac
165.289.178.221    lb001-drac
165.289.178.222    lb002-drac
165.289.178.224    db001-drac
165.289.178.225    db002-drac
165.289.178.226    dev001-drac

#eth1 --  this is the cluster
192.168.178.111    w001 w001.domain.com
192.168.178.112    w002 w002.domain.com
192.168.178.113    w003 w003.domain.com
192.168.178.121    lb001 lb001.domain.com
192.168.178.122    lb002 lb002.domain.com
192.168.178.124    db001 db001.domain.com
192.168.178.125    db002 db002.domain.com
192.168.178.126    dev001 dev001.domain.com
#########################################################


##################################/etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="cluster1" config_version="1" name="cluster1">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="dev001.domain.com" nodeid="1" votes="1">
                     <fence><method name="1"> <device name="dev001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="w001.domain.com" nodeid="2" votes="1">
                     <fence><method name="1"> <device name="w001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="w002.domain.com" nodeid="3" votes="1">
                     <fence><method name="1"> <device name="w002-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="w003.domain.com" nodeid="4" votes="1">
                     <fence><method name="1"> <device name="w003-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="lb001.domain.com" nodeid="5" votes="1">
                     <fence><method name="1"> <device name="lb001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="lb002.domain.com" nodeid="6" votes="1">
                     <fence><method name="1"> <device name="lb002-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="db001.domain.com" nodeid="7" votes="1">
                     <fence><method name="1"> <device name="db001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="db002.domain.com" nodeid="8" votes="1">
                     <fence><method name="1"> <device name="db002-drac"/>
</method></fence>
                </clusternode>
        </clusternodes>
        <cman/>
        <fencedevices>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.211"
login="root" name="w001-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.212"
login="root" name="w002-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.213"
login="root" name="w003-drac" passwd="pass/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.221"
login="root" name="lb001-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.222"
login="root" name="lb002-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.224"
login="root" name="db001-drac" passwd="pass/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.225"
login="root" name="db002-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.226"
login="root" name="dev001-drac" passwd="pass"/>
        </fencedevices>
        <rm/>
</cluster>
########################################################
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/9783ed82/attachment.htm>

From fernando at lozano.eti.br  Wed Apr 22 17:29:30 2009
From: fernando at lozano.eti.br (fernando at lozano.eti.br)
Date: Wed, 22 Apr 2009 14:29:30 -0300
Subject: [Linux-cluster] Two node cluster without quorum disk
Message-ID: <49ef53fa.28c.4518.2073376346@lozano.eti.br>

Hi there,


> > On Wednesday 22 Apr 2009 15:25:32 Ralph Zukeb wrote:
> >> Hello again! Thanks for the tips so far.
> >>
> >> How can I ensure that the node holding the services and resources in a
> >> two node cluster keeps holding them if the two nodes cannot see each
> >> other?
(...)
> I tried this, but when I killed the switch for the cluster traffic,
> BOTH nodes got fenced! Can I avoid this?


You mean you killed the switch for *both* nodes?

RHCS checks the signal on any interface bound to a virtual IP, so if that interface looses signal,
it'll try to relocate all services that include that ip resource. After all, if the virtual ip is on
an interface without signal, clients cannot reach the server and so it's as good as dead for any
practical purpose.

So if my assumption is correct your problem may is related to fence at all, but instead to the with
the way you tested failover. Try unpluging the cable on just one node at a time.


[]s, Fernando Lozano


From jakub.suchy at enlogit.cz  Wed Apr 22 17:50:28 2009
From: jakub.suchy at enlogit.cz (Jakub Suchy)
Date: Wed, 22 Apr 2009 19:50:28 +0200
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
References: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
Message-ID: <20090422175028.GB3896@galatea>

> > I tried this, but when I killed the switch for the cluster traffic,
> > BOTH nodes got fenced! Can I avoid this?

Hi,
i accidentally deleted previous messages so I am not sure if you have
sent your cluster.conf? Please resend it...


Jakub


From Joel.Becker at oracle.com  Wed Apr 22 17:50:49 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 22 Apr 2009 10:50:49 -0700
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
Message-ID: <20090422175049.GD9042@ca-server1.us.oracle.com>

On Wed, Apr 22, 2009 at 10:53:26AM -0300, Flavio Junior wrote:
> I'm setting up an IMAP server using postfix and dovecot.
> I've a IBM DS4700 storage configured with RDAC I/O multipath and using
> OCFS2 as filesystem, everything works fine.
> 
> The problem comes when my first I/O path goes down, apparently this
> generate a kernel oops and I got my two servers fenced. Is there a way
> to configure a good fence but still make use of multipath IO using
> OCFS2?

	What's the oops look like?  You shouldn't get an oops from a
path going down.  If you do, of course the machine is going to reboot -
it's in a broken state.  You're using RDAC, not dm-multipath, so I'm
betting you have to ask IBM why their multipath code is oopsing.
	As a short sanity check, make sure ocfs2 is mounted on the
multipath device, not a component path.

Joel

-- 

"Nothing is wrong with California that a rise in the ocean level
 wouldn't cure."
        - Ross MacDonald

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From sjpark at rondaandspencer.info  Wed Apr 22 18:30:11 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Wed, 22 Apr 2009 12:30:11 -0600
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <20090422175028.GB3896@galatea>
References: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
	<20090422175028.GB3896@galatea>
Message-ID: <c7a040fa0904221130m624beec5n62f57d271c3ebea1@mail.gmail.com>

I have this same problem with the 2 node cluster that I am working on.  I
have mysql running with an NFS that holds all of the data for it.  If I
unplug one of the node to test the network, it will move the service to the
other node...no problem.  When that node comes back up, it will then kill
the services on the node that it failed back too.  This goes on and on
resulting in a loop.  They'll start up and then begin to fence each other.
One comes up...starts the services...then it dies...the other one takes
over...starts the services and then dies...over and over again.

I have found that I need to start one node up...and let it run for a short
period of time...then start up the second node...and sometimes it won't
start the loop and sometimes it will.

2009/4/22 Jakub Suchy <jakub.suchy at enlogit.cz>

> > > I tried this, but when I killed the switch for the cluster traffic,
> > > BOTH nodes got fenced! Can I avoid this?
>
> Hi,
> i accidentally deleted previous messages so I am not sure if you have
> sent your cluster.conf? Please resend it...
>
>
> Jakub
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/d9281cc0/attachment.htm>

From cthulhucalling at gmail.com  Wed Apr 22 18:33:12 2009
From: cthulhucalling at gmail.com (Ian Hayes)
Date: Wed, 22 Apr 2009 14:33:12 -0400
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <c7a040fa0904221130m624beec5n62f57d271c3ebea1@mail.gmail.com>
References: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
	<20090422175028.GB3896@galatea>
	<c7a040fa0904221130m624beec5n62f57d271c3ebea1@mail.gmail.com>
Message-ID: <36df569a0904221133y3ff668c7yd357b19677ec03ce@mail.gmail.com>

have you tried the clean_start="1" option in the fence_daemon line of your
cluster.conf file?

On Wed, Apr 22, 2009 at 2:30 PM, Spencer Parker <sjpark at rondaandspencer.info
> wrote:

> I have this same problem with the 2 node cluster that I am working on.  I
> have mysql running with an NFS that holds all of the data for it.  If I
> unplug one of the node to test the network, it will move the service to the
> other node...no problem.  When that node comes back up, it will then kill
> the services on the node that it failed back too.  This goes on and on
> resulting in a loop.  They'll start up and then begin to fence each other.
> One comes up...starts the services...then it dies...the other one takes
> over...starts the services and then dies...over and over again.
>
> I have found that I need to start one node up...and let it run for a short
> period of time...then start up the second node...and sometimes it won't
> start the loop and sometimes it will.
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/da9b6ff6/attachment.htm>

From philjones85 at hotmail.com  Wed Apr 22 18:41:39 2009
From: philjones85 at hotmail.com (Phil Jones)
Date: Wed, 22 Apr 2009 18:41:39 +0000
Subject: [Linux-cluster] Fencing with Dell Remote Access Cards
Message-ID: <BAY117-W48BC3877335411EAC2DE52A2740@phx.gbl>


Can someone tell me the proper syntax for the fencedevices section of the cluster.conf file using DRAC5?

_________________________________________________________________
Rediscover Hotmail?: Get e-mail storage that grows with you. 
http://windowslive.com/RediscoverHotmail?ocid=TXT_TAGLM_WL_HM_Rediscover_Storage2_042009
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/aa3d5d70/attachment.htm>

From billpp at gmail.com  Wed Apr 22 19:10:37 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 16:10:37 -0300
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <20090422175049.GD9042@ca-server1.us.oracle.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
Message-ID: <58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>

On Wed, Apr 22, 2009 at 2:50 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
>
> ? ? ? ?What's the oops look like? ?You shouldn't get an oops from a
> path going down. ?If you do, of course the machine is going to reboot -
> it's in a broken state. ?You're using RDAC, not dm-multipath, so I'm
> betting you have to ask IBM why their multipath code is oopsing.
> ? ? ? ?As a short sanity check, make sure ocfs2 is mounted on the
> multipath device, not a component path.
>

Hi Joel, thanks for the answer...

OK, but i'm not sure about a kernel oops being executed.

I'd change panic_on_oops to 0 and it work "as expected" (well, you
will see that I wasnt totally sincere about my setup. I'm using RHCS
and gfs too :X)

Here is the paste of /var/log/messages from the remaining node
(panic_on_oops=0) when I turn off the fiber switch for primary path.

http://rafb.net/p/TgLoWH71.html


> Joel
>


Thanks again,

--

Fl?vio do Carmo J?nior aka waKKu


From dhayes501 at gmail.com  Wed Apr 22 19:28:01 2009
From: dhayes501 at gmail.com (Dan Hayes)
Date: Wed, 22 Apr 2009 14:28:01 -0500
Subject: [Linux-cluster] Re: DRAC5 fencing problem with Redhat 5.3 cluster
In-Reply-To: <55bfc63d0904221000i6f2804f7t3586a6087ca6b407@mail.gmail.com>
References: <55bfc63d0904221000i6f2804f7t3586a6087ca6b407@mail.gmail.com>
Message-ID: <55bfc63d0904221228r7c8dfa20h64856849cac430c0@mail.gmail.com>

For some reason, the last time I tried to post this, it didn't work (if it
did, I apologize for the repeat), so I'm posting it again...


I have 8 Dell servers all loaded with Redhat 5.3.  The machines have an
external and internal IP address.  The cluster is configured on the internal
ip address.

The cluster appears to be working fine except for fencing.  I try to
manually fence a node, but get a connection error
[root at w002 ~]# fence_node lb001.domain.com
agent "fence_drac5" reports: Unable to connect/login to fencing device

I was able to get the fence_drac5 command to work directly by adding the
"-x" option for using SSH.  It says the connection times out, but it does
shut down the other machine
[root at w002 ~]# fence_drac5 -a 165.289.178.221 -l root -p pass -x
lb001.domain.com
Connection timed out

Why does the connection time out?  And how do I add the "-x" option to the
cluster.conf file so that the fencing agent can connect?

Here are my /etc/hosts and /etc/cluster/cluster.conf files.

##############################################/etc/hosts
127.0.0.1               localhost.localdomain localhost
::1             localhost6.localdomain6 localhost6

#eth0  --  p=public interface
165.289.178.111    w001p w001p.domain.com
165.289.178.112    w002p w002p.domain.com
165.289.178.113    w003p w003p.domain.com
165.289.178.121    lb001p lb001p.domain.com
165.289.178.122    lb002p lb002p.domain.com
165.289.178.124    db001p db001p.domain.com
165.289.178.125    db002p db002p.domain.com
165.289.178.126    dev001p dev001p.domain.com
165.289.178.211    w001-drac
165.289.178.212    w002-drac
165.289.178.213    w003-drac
165.289.178.221    lb001-drac
165.289.178.222    lb002-drac
165.289.178.224    db001-drac
165.289.178.225    db002-drac
165.289.178.226    dev001-drac

#eth1 --  this is the cluster
192.168.178.111    w001 w001.domain.com
192.168.178.112    w002 w002.domain.com
192.168.178.113    w003 w003.domain.com
192.168.178.121    lb001 lb001.domain.com
192.168.178.122    lb002 lb002.domain.com
192.168.178.124    db001 db001.domain.com
192.168.178.125    db002 db002.domain.com
192.168.178.126    dev001 dev001.domain.com
#########################################################


##################################/etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="cluster1" config_version="1" name="cluster1">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="dev001.domain.com" nodeid="1" votes="1">
                     <fence><method name="1"> <device name="dev001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="w001.domain.com" nodeid="2" votes="1">
                     <fence><method name="1"> <device name="w001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="w002.domain.com" nodeid="3" votes="1">
                     <fence><method name="1"> <device name="w002-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="w003.domain.com" nodeid="4" votes="1">
                     <fence><method name="1"> <device name="w003-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="lb001.domain.com" nodeid="5" votes="1">
                     <fence><method name="1"> <device name="lb001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="lb002.domain.com" nodeid="6" votes="1">
                     <fence><method name="1"> <device name="lb002-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="db001.domain.com" nodeid="7" votes="1">
                     <fence><method name="1"> <device name="db001-drac"/>
</method></fence>
                </clusternode>
                <clusternode name="db002.domain.com" nodeid="8" votes="1">
                     <fence><method name="1"> <device name="db002-drac"/>
</method></fence>
                </clusternode>
        </clusternodes>
        <cman/>
        <fencedevices>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.211"
login="root" name="w001-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.212"
login="root" name="w002-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.213"
login="root" name="w003-drac" passwd="pass/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.221"
login="root" name="lb001-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.222"
login="root" name="lb002-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.224"
login="root" name="db001-drac" passwd="pass/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.225"
login="root" name="db002-drac" passwd="pass"/>
            <fencedevice agent="fence_drac5" ipaddr="165.289.178.226"
login="root" name="dev001-drac" passwd="pass"/>
        </fencedevices>
        <rm/>
</cluster>
########################################################
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/34e08a15/attachment.htm>

From sjpark at rondaandspencer.info  Wed Apr 22 19:31:03 2009
From: sjpark at rondaandspencer.info (Spencer Parker)
Date: Wed, 22 Apr 2009 13:31:03 -0600
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <36df569a0904221133y3ff668c7yd357b19677ec03ce@mail.gmail.com>
References: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
	<20090422175028.GB3896@galatea>
	<c7a040fa0904221130m624beec5n62f57d271c3ebea1@mail.gmail.com>
	<36df569a0904221133y3ff668c7yd357b19677ec03ce@mail.gmail.com>
Message-ID: <c7a040fa0904221231u2c07238dxe6a51e59d78967a5@mail.gmail.com>

Bing! that did it!

Is there a place that lists these options?  I have looked in a lot of
places, but am not able to find what I want.  If I knew these options I
could build an interface to manage this for me.  This option isn't listed in
conga.

On Wed, Apr 22, 2009 at 12:33 PM, Ian Hayes <cthulhucalling at gmail.com>wrote:

> have you tried the clean_start="1" option in the fence_daemon line of your
> cluster.conf file?
>
>
> On Wed, Apr 22, 2009 at 2:30 PM, Spencer Parker <
> sjpark at rondaandspencer.info> wrote:
>
>> I have this same problem with the 2 node cluster that I am working on.  I
>> have mysql running with an NFS that holds all of the data for it.  If I
>> unplug one of the node to test the network, it will move the service to the
>> other node...no problem.  When that node comes back up, it will then kill
>> the services on the node that it failed back too.  This goes on and on
>> resulting in a loop.  They'll start up and then begin to fence each other.
>> One comes up...starts the services...then it dies...the other one takes
>> over...starts the services and then dies...over and over again.
>>
>> I have found that I need to start one node up...and let it run for a short
>> period of time...then start up the second node...and sometimes it won't
>> start the loop and sometimes it will.
>>
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/106c38aa/attachment.htm>

From cthulhucalling at gmail.com  Wed Apr 22 19:37:54 2009
From: cthulhucalling at gmail.com (Ian Hayes)
Date: Wed, 22 Apr 2009 15:37:54 -0400
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <c7a040fa0904221231u2c07238dxe6a51e59d78967a5@mail.gmail.com>
References: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
	<20090422175028.GB3896@galatea>
	<c7a040fa0904221130m624beec5n62f57d271c3ebea1@mail.gmail.com>
	<36df569a0904221133y3ff668c7yd357b19677ec03ce@mail.gmail.com>
	<c7a040fa0904221231u2c07238dxe6a51e59d78967a5@mail.gmail.com>
Message-ID: <36df569a0904221237v50e4bcdboa9248cea11d125fa@mail.gmail.com>

I found that in the manpage for fenced. There are a few more options in
there that you can throw that deal with fencing. I had a similar problem a
short while back. Once the comedic value of seeing two nodes killing each
other wore off, I had to figure out how to stop it. At least I knew that the
fencing worked.


On Wed, Apr 22, 2009 at 3:31 PM, Spencer Parker <sjpark at rondaandspencer.info
> wrote:

> Bing! that did it!
>
> Is there a place that lists these options?  I have looked in a lot of
> places, but am not able to find what I want.  If I knew these options I
> could build an interface to manage this for me.  This option isn't listed in
> conga.
>
> On Wed, Apr 22, 2009 at 12:33 PM, Ian Hayes <cthulhucalling at gmail.com>wrote:
>
>> have you tried the clean_start="1" option in the fence_daemon line of your
>> cluster.conf file?
>>
>>
>> On Wed, Apr 22, 2009 at 2:30 PM, Spencer Parker <
>> sjpark at rondaandspencer.info> wrote:
>>
>>> I have this same problem with the 2 node cluster that I am working on.  I
>>> have mysql running with an NFS that holds all of the data for it.  If I
>>> unplug one of the node to test the network, it will move the service to the
>>> other node...no problem.  When that node comes back up, it will then kill
>>> the services on the node that it failed back too.  This goes on and on
>>> resulting in a loop.  They'll start up and then begin to fence each other.
>>> One comes up...starts the services...then it dies...the other one takes
>>> over...starts the services and then dies...over and over again.
>>>
>>> I have found that I need to start one node up...and let it run for a
>>> short period of time...then start up the second node...and sometimes it
>>> won't start the loop and sometimes it will.
>>>
>>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/85f54150/attachment.htm>

From billpp at gmail.com  Wed Apr 22 19:58:25 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 16:58:25 -0300
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
Message-ID: <58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>

Ok.. seems like I found my real problem..

I'm using OCFS2 Threshold as 30 seconds (value 61) and my multipath
RDAC devloss timeout was 60 seconds.

Adjust these values seems to solve the problem with incorrect fencing.


Any news I report here, thanks :)

--

Fl?vio do Carmo J?nior aka waKKu

On Wed, Apr 22, 2009 at 4:10 PM, Flavio Junior <billpp at gmail.com> wrote:
> On Wed, Apr 22, 2009 at 2:50 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
>>
>> ? ? ? ?What's the oops look like? ?You shouldn't get an oops from a
>> path going down. ?If you do, of course the machine is going to reboot -
>> it's in a broken state. ?You're using RDAC, not dm-multipath, so I'm
>> betting you have to ask IBM why their multipath code is oopsing.
>> ? ? ? ?As a short sanity check, make sure ocfs2 is mounted on the
>> multipath device, not a component path.
>>
>
> Hi Joel, thanks for the answer...
>
> OK, but i'm not sure about a kernel oops being executed.
>
> I'd change panic_on_oops to 0 and it work "as expected" (well, you
> will see that I wasnt totally sincere about my setup. I'm using RHCS
> and gfs too :X)
>
> Here is the paste of /var/log/messages from the remaining node
> (panic_on_oops=0) when I turn off the fiber switch for primary path.
>
> http://rafb.net/p/TgLoWH71.html
>
>
>> Joel
>>
>
>
> Thanks again,
>
> --
>
> Fl?vio do Carmo J?nior aka waKKu
>


From kbphillips80 at gmail.com  Wed Apr 22 20:10:06 2009
From: kbphillips80 at gmail.com (Kaerka Phillips)
Date: Wed, 22 Apr 2009 16:10:06 -0400
Subject: [Linux-cluster] Fencing with Dell Remote Access Cards
In-Reply-To: <BAY117-W48BC3877335411EAC2DE52A2740@phx.gbl>
References: <BAY117-W48BC3877335411EAC2DE52A2740@phx.gbl>
Message-ID: <e558921b0904221310s2ee1c2aas2d95ead282e73327@mail.gmail.com>

Here is the fence configuration that I used, as in my experience if you have
drac5 cards with greater than version 1.20 firmware you will need to use the
lesser known fence_drac5 module rather than the fence_drac module or else
you'll have drac hangs and the cluster won't know that it successfully
fenced the node.  Dell firmware versions 1.0 through 1.20 seem to work okay
with the original fence_drac module.

I don't know how good the delay settings here are, and this setup still
relies on telnet control over the drac5 rather than SSH.  With this you'll
need to enable telnet commands on the drac card before this can be used, and
also set a password for the root user on the drac as well.

<?xml version="1.0"?>
<cluster alias="cluster_name" config_version="01" name="cluster_name">
        <fence_daemon clean_start="0" post_fail_delay="0"
post_join_delay="20"/>
        <clusternodes>
                <clusternode name="node1name" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="drac1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node2name" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="drac2"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node3name" nodeid="3" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="drac3"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node4name" nodeid="4" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="drac4"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman/>
        <fencedevices>
                <fencedevice agent="fence_drac5" ipaddr="ipaddr"
login="root" name="drac1" passwd="pass"/>
                <fencedevice agent="fence_drac5" ipaddr="ipaddr"
login="root" name="drac2" passwd="pass"/>
                <fencedevice agent="fence_drac5" ipaddr="ipaddr"
login="root" name="drac3" passwd="pass"/>
                <fencedevice agent="fence_drac5" ipaddr="ipaddr"
login="root" name="drac4" passwd="pass"/>
                </fencedevices>
        <rm>
                <failoverdomains/>
                <resources/>
        </rm>
        <totem consensus="200" join="60" token="10000"
token_retransmits_before_loss_const="4"/>
</cluster>


On Wed, Apr 22, 2009 at 2:41 PM, Phil Jones <philjones85 at hotmail.com> wrote:

>  Can someone tell me the proper syntax for the fencedevices section of the
> cluster.conf file using DRAC5?
>
> ------------------------------
> Rediscover Hotmail?: Get e-mail storage that grows with you. Check it out.<http://windowslive.com/RediscoverHotmail?ocid=TXT_TAGLM_WL_HM_Rediscover_Storage2_042009>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/e8856729/attachment.htm>

From billpp at gmail.com  Wed Apr 22 20:27:03 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 17:27:03 -0300
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <c7a040fa0904221231u2c07238dxe6a51e59d78967a5@mail.gmail.com>
References: <49ef53fa.28c.4518.2073376346@lozano.eti.br>
	<20090422175028.GB3896@galatea>
	<c7a040fa0904221130m624beec5n62f57d271c3ebea1@mail.gmail.com>
	<36df569a0904221133y3ff668c7yd357b19677ec03ce@mail.gmail.com>
	<c7a040fa0904221231u2c07238dxe6a51e59d78967a5@mail.gmail.com>
Message-ID: <58aa8d780904221327w732f166k17e6dafc91c42c8a@mail.gmail.com>

This is a good place to start looking for:
http://sources.redhat.com/cluster/doc/cluster_schema_rhel5.html

--

Fl?vio do Carmo J?nior aka waKKu

On Wed, Apr 22, 2009 at 4:31 PM, Spencer Parker
<sjpark at rondaandspencer.info> wrote:
> Bing! that did it!
>
> Is there a place that lists these options?? I have looked in a lot of
> places, but am not able to find what I want.? If I knew these options I
> could build an interface to manage this for me.? This option isn't listed in
> conga.
>
> On Wed, Apr 22, 2009 at 12:33 PM, Ian Hayes <cthulhucalling at gmail.com>
> wrote:
>>
>> have you tried the clean_start="1" option in the fence_daemon line of your
>> cluster.conf file?
>>
>> On Wed, Apr 22, 2009 at 2:30 PM, Spencer Parker
>> <sjpark at rondaandspencer.info> wrote:
>>>
>>> I have this same problem with the 2 node cluster that I am working on.? I
>>> have mysql running with an NFS that holds all of the data for it.? If I
>>> unplug one of the node to test the network, it will move the service to the
>>> other node...no problem.? When that node comes back up, it will then kill
>>> the services on the node that it failed back too.? This goes on and on
>>> resulting in a loop.? They'll start up and then begin to fence each other.
>>> One comes up...starts the services...then it dies...the other one takes
>>> over...starts the services and then dies...over and over again.
>>>
>>> I have found that I need to start one node up...and let it run for a
>>> short period of time...then start up the second node...and sometimes it
>>> won't start the loop and sometimes it will.
>>>
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From Joel.Becker at oracle.com  Wed Apr 22 20:49:11 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 22 Apr 2009 13:49:11 -0700
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
	<58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>
Message-ID: <20090422204911.GA5058@mail.oracle.com>

On Wed, Apr 22, 2009 at 04:58:25PM -0300, Flavio Junior wrote:
> I'm using OCFS2 Threshold as 30 seconds (value 61) and my multipath
> RDAC devloss timeout was 60 seconds.

	Aha!  That would do it.

> Adjust these values seems to solve the problem with incorrect fencing.

	Good, glad it worked.

> Any news I report here, thanks :)

	Please do so.  Also, as a question, what version of ocfs2 are
you running?  Is this a production environment?

Joel

-- 

"There is shadow under this red rock.
 (Come in under the shadow of this red rock)
 And I will show you something different from either
 Your shadow at morning striding behind you
 Or your shadow at evening rising to meet you.
 I will show you fear in a handful of dust."

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From Joel.Becker at oracle.com  Wed Apr 22 20:50:15 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 22 Apr 2009 13:50:15 -0700
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
Message-ID: <20090422205015.GB5058@mail.oracle.com>

On Wed, Apr 22, 2009 at 04:10:37PM -0300, Flavio Junior wrote:
> On Wed, Apr 22, 2009 at 2:50 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
> >
> > ? ? ? ?What's the oops look like? ?You shouldn't get an oops from a
> > path going down. ?If you do, of course the machine is going to reboot -
> > it's in a broken state. ?You're using RDAC, not dm-multipath, so I'm
> > betting you have to ask IBM why their multipath code is oopsing.
> > ? ? ? ?As a short sanity check, make sure ocfs2 is mounted on the
> > multipath device, not a component path.
> >
> 
> Hi Joel, thanks for the answer...
> 
> OK, but i'm not sure about a kernel oops being executed.

	You mean there isn't an oops after all - I can see that from
your paste.  As your following email points out, it's actual fencing due
to the timeout configuration.

> I'd change panic_on_oops to 0 and it work "as expected" (well, you
> will see that I wasnt totally sincere about my setup. I'm using RHCS
> and gfs too :X)

	You should be able to use RHCS and gfs as well, as long as the
timeout and failure modes are managed correctly.

Joel

-- 

"You can get more with a kind word and a gun than you can with
 a kind word alone."
         - Al Capone

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From billpp at gmail.com  Wed Apr 22 21:23:17 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 18:23:17 -0300
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <20090422204911.GA5058@mail.oracle.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
	<58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>
	<20090422204911.GA5058@mail.oracle.com>
Message-ID: <58aa8d780904221423j6e2d1736ja3ea07556bdf20d0@mail.gmail.com>

On Wed, Apr 22, 2009 at 5:49 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
> On Wed, Apr 22, 2009 at 04:58:25PM -0300, Flavio Junior wrote:
>> I'm using OCFS2 Threshold as 30 seconds (value 61) and my multipath
>> RDAC devloss timeout was 60 seconds.
>
> ? ? ? ?Aha! ?That would do it.
>
>> Adjust these values seems to solve the problem with incorrect fencing.
>
> ? ? ? ?Good, glad it worked.
>
>> Any news I report here, thanks :)
>
> ? ? ? ?Please do so. ?Also, as a question, what version of ocfs2 are
> you running? ?Is this a production environment?

Hi Joel, i'm using 1.4.1
[root at pinky ~]# ocfs2console --version
OCFS2Console version 1.4.1

Yes, i'm interested in use it at production. There is a flow of 4, 5k
mails by day with 300 maildir's.
What do you think about it? I was trying to use GFS2 but a simple ls
/home/* or sending a mail for everyone at mydomain.com hangs GFS2 or even
GFS (without tunning).

>
> Joel
>

--

Fl?vio do Carmo J?nior aka waKKu


From Joel.Becker at oracle.com  Wed Apr 22 21:40:30 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 22 Apr 2009 14:40:30 -0700
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <58aa8d780904221423j6e2d1736ja3ea07556bdf20d0@mail.gmail.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
	<58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>
	<20090422204911.GA5058@mail.oracle.com>
	<58aa8d780904221423j6e2d1736ja3ea07556bdf20d0@mail.gmail.com>
Message-ID: <20090422214030.GD5058@mail.oracle.com>

On Wed, Apr 22, 2009 at 06:23:17PM -0300, Flavio Junior wrote:
> On Wed, Apr 22, 2009 at 5:49 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
> > On Wed, Apr 22, 2009 at 04:58:25PM -0300, Flavio Junior wrote:
> >> I'm using OCFS2 Threshold as 30 seconds (value 61) and my multipath
> >> RDAC devloss timeout was 60 seconds.
> >
> > ? ? ? ?Aha! ?That would do it.
> >
> >> Adjust these values seems to solve the problem with incorrect fencing.
> >
> > ? ? ? ?Good, glad it worked.
> >
> >> Any news I report here, thanks :)
> >
> > ? ? ? ?Please do so. ?Also, as a question, what version of ocfs2 are
> > you running? ?Is this a production environment?
> 
> Hi Joel, i'm using 1.4.1
> [root at pinky ~]# ocfs2console --version
> OCFS2Console version 1.4.1

	The version of ocfs2console is not the same as the version of
the ocfs2 driver.  Is this on EL?  What does 'rpm -qa | grep ocfs2' say?
Or if it is mainline, what version of the mainline kernel?

> Yes, i'm interested in use it at production. There is a flow of 4, 5k
> mails by day with 300 maildir's.
> What do you think about it? I was trying to use GFS2 but a simple ls
> /home/* or sending a mail for everyone at mydomain.com hangs GFS2 or even
> GFS (without tunning).

	It most certainly will not hang ocfs2, but maildir might cause
some speed concerns.  Indexed directories are only just landing in
mainline ocfs2 - they are not in the production version.  So any
directory with more than 1000 entries will be a little slow (just like
ext3 without hcache).
	It'll work - people are using it just fine.  But large mail dirs
will be a little slow, so you should watch out for that.

Joel

-- 

Life's Little Instruction Book #347

	"Never waste the oppourtunity to tell someone you love them."

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From billpp at gmail.com  Wed Apr 22 22:21:58 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 19:21:58 -0300
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <20090422214030.GD5058@mail.oracle.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
	<58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>
	<20090422204911.GA5058@mail.oracle.com>
	<58aa8d780904221423j6e2d1736ja3ea07556bdf20d0@mail.gmail.com>
	<20090422214030.GD5058@mail.oracle.com>
Message-ID: <58aa8d780904221521w5316d562xf617403e6d6c706f@mail.gmail.com>

On Wed, Apr 22, 2009 at 6:40 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
> On Wed, Apr 22, 2009 at 06:23:17PM -0300, Flavio Junior wrote:
>> On Wed, Apr 22, 2009 at 5:49 PM, Joel Becker <Joel.Becker at oracle.com> wrote:
>> > On Wed, Apr 22, 2009 at 04:58:25PM -0300, Flavio Junior wrote:
>> >> I'm using OCFS2 Threshold as 30 seconds (value 61) and my multipath
>> >> RDAC devloss timeout was 60 seconds.
>> >
>> > ? ? ? ?Aha! ?That would do it.
>> >
>> >> Adjust these values seems to solve the problem with incorrect fencing.
>> >
>> > ? ? ? ?Good, glad it worked.
>> >
>> >> Any news I report here, thanks :)
>> >
>> > ? ? ? ?Please do so. ?Also, as a question, what version of ocfs2 are
>> > you running? ?Is this a production environment?
>>
>> Hi Joel, i'm using 1.4.1
>> [root at pinky ~]# ocfs2console --version
>> OCFS2Console version 1.4.1
>
> ? ? ? ?The version of ocfs2console is not the same as the version of
> the ocfs2 driver. ?Is this on EL? ?What does 'rpm -qa | grep ocfs2' say?
> Or if it is mainline, what version of the mainline kernel?

Ok, I deserve that ;).
[root at cerebro ~]# rpm -qa | grep ocfs
ocfs2-2.6.18-128.1.6.el5-1.4.1-1.el5
ocfs2console-1.4.1-1.el5
ocfs2-tools-devel-1.4.1-1.el5
ocfs2-tools-1.4.1-1.el5

I'm using CentOS 5.3 with the last release from oss.oracle.com i've found.

PS: Just a ponit, to use o2cb init script with centos I need to
commend exit 1 line for lsb-functions check. Everything works fine,
maybe lsb-functions "script" isnt a fatal dependency.


>
>> Yes, i'm interested in use it at production. There is a flow of 4, 5k
>> mails by day with 300 maildir's.
>> What do you think about it? I was trying to use GFS2 but a simple ls
>> /home/* or sending a mail for everyone at mydomain.com hangs GFS2 or even
>> GFS (without tunning).
>
> ? ? ? ?It most certainly will not hang ocfs2, but maildir might cause
> some speed concerns. ?Indexed directories are only just landing in
> mainline ocfs2 - they are not in the production version. ?So any
> directory with more than 1000 entries will be a little slow (just like
> ext3 without hcache).
> ? ? ? ?It'll work - people are using it just fine. ?But large mail dirs
> will be a little slow, so you should watch out for that.

OK, I can accept that. I'm using ext3 by now, so for some users this
can just "dont change". But I dont believe that have most users with
thousand mails per folder. Probably just a bit, not something to
really worry now.

Really thanks for info :).

>
> Joel
>

--

Fl?vio do Carmo J?nior aka waKKu


From andy at linxit.eclipse.co.uk  Wed Apr 22 23:11:45 2009
From: andy at linxit.eclipse.co.uk (Andy Wallace)
Date: Thu, 23 Apr 2009 00:11:45 +0100
Subject: [Linux-cluster] GFS2 performance on large files
Message-ID: <1240441905.19981.19.camel@phobos.andyw.net>

Hi,

I've just set up a GFS2 filesystem for a client, but have some serious
issues with the performance on large files (i.e. > 2GB). This is a real
problem, as the files we're going to be using are going to be approx.
20GB up to 170GB.

Hardware setup is:
2 x IBM X3650 servers with 2 x Dual Xeon, 4GB RAM, 2 x 2GB/s HBAs per
server;
Storage on IBM DS4700 - 48 x 1TB SATA disks

Files will be written to the storage via FTP, read via NFS mounts, both
on an LVS virtual IP address.

Although it's not as quick as I'd like, I'm getting about 150MB/s on
average when reading/writing files in the 100MB - 1GB range. However, if
I try to write a 10GB file, this goes down to about 50MB/s. That's just
doing dd to the mounted gfs2 on an individual node. If I do a get from
an ftp client, I'm seeing half that; cp from an NFS mount is more like
1/5.

I've spent a lot of time reading up on GFS2 performance, but I haven't
found anything useful for improving throughput with large files. Has
anyone got any suggestions or managed to solve a similar problem?

-- 
Andy Wallace


From Joel.Becker at oracle.com  Wed Apr 22 23:11:50 2009
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 22 Apr 2009 16:11:50 -0700
Subject: [Linux-cluster] OCFS2 and SAN MultiPath I/O
In-Reply-To: <58aa8d780904221521w5316d562xf617403e6d6c706f@mail.gmail.com>
References: <58aa8d780904220653g2fbb2bc5k9471ceb9c7540727@mail.gmail.com>
	<20090422175049.GD9042@ca-server1.us.oracle.com>
	<58aa8d780904221210o41714240v6fc6281e6ad660b7@mail.gmail.com>
	<58aa8d780904221258s77f23085o2695ae5d664961c9@mail.gmail.com>
	<20090422204911.GA5058@mail.oracle.com>
	<58aa8d780904221423j6e2d1736ja3ea07556bdf20d0@mail.gmail.com>
	<20090422214030.GD5058@mail.oracle.com>
	<58aa8d780904221521w5316d562xf617403e6d6c706f@mail.gmail.com>
Message-ID: <20090422231150.GA18997@mail.oracle.com>

On Wed, Apr 22, 2009 at 07:21:58PM -0300, Flavio Junior wrote:
> I'm using CentOS 5.3 with the last release from oss.oracle.com i've found.

	Just wanted to check if you were mainline or 1.4.

> PS: Just a ponit, to use o2cb init script with centos I need to
> commend exit 1 line for lsb-functions check. Everything works fine,
> maybe lsb-functions "script" isnt a fatal dependency.

	lsb-functions is imperative - it's the only way to have
cross-distro init calls.  You should have the file.  If not, you need
the lsb RPM.  On EL it's called redhat-lsb, so it's probably that name
or centos-lsb or something on centos.

> Really thanks for info :).

	You're welcome.  Enjoy!

Joel

-- 

Life's Little Instruction Book #252

	"Take good care of those you love."

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From billpp at gmail.com  Wed Apr 22 23:31:44 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 20:31:44 -0300
Subject: [Linux-cluster] Quorum Disk and I/O MultiPath problem
Message-ID: <58aa8d780904221631o5c8f6354l7e13780266c45157@mail.gmail.com>

Hi folks,

I'm trying to configure a 2-node cluster using quorum disk as tie-breaker.

I'm getting a problem when my active I/O path for quorum disk goes
down (I'm testing turn off one (of two) SAN fiber switches), so one
node is being fenced.
I believe this is not right or can have a better way to, so I'll post
my configs here and wait for comments :).

# cluster.conf, cman status/services/nodes -f
http://rafb.net/p/J4D5UD76.html

# Log from messages when one switch is turned off
http://rafb.net/p/SA8Y0A83.html

Any help, sugest or comment is appreciate :).

Thanks.

--

Fl?vio do Carmo J?nior aka waKKu


From billpp at gmail.com  Wed Apr 22 23:37:50 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 20:37:50 -0300
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <1240441905.19981.19.camel@phobos.andyw.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>
Message-ID: <58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>

On Wed, Apr 22, 2009 at 8:11 PM, Andy Wallace <andy at linxit.eclipse.co.uk> wrote:

> Although it's not as quick as I'd like, I'm getting about 150MB/s on
> average when reading/writing files in the 100MB - 1GB range. However, if
> I try to write a 10GB file, this goes down to about 50MB/s. That's just
> doing dd to the mounted gfs2 on an individual node. If I do a get from
> an ftp client, I'm seeing half that; cp from an NFS mount is more like
> 1/5.
>

Have you tried the same thing with another filesystem? Ext3 maybe ?
Are you using RAID right? Did you check about RAID and LVM/partition alignment?

If you will try ext3, see about -E stride and -E stripe_width values
on mkfs.ext3 manpage.
This calc should helps: http://busybox.net/~aldot/mkfs_stride.html


> --
> Andy Wallace
>

--

Fl?vio do Carmo J?nior aka waKKu


From barbos at gmail.com  Thu Apr 23 00:45:49 2009
From: barbos at gmail.com (Alex Kompel)
Date: Wed, 22 Apr 2009 17:45:49 -0700
Subject: [Linux-cluster] Quorum Disk and I/O MultiPath problem
In-Reply-To: <58aa8d780904221631o5c8f6354l7e13780266c45157@mail.gmail.com>
References: <58aa8d780904221631o5c8f6354l7e13780266c45157@mail.gmail.com>
Message-ID: <3ae027040904221745j7d8356auc3f2a2a4310a94f9@mail.gmail.com>

It appears that it took 20 sec for path to fail over. quorumd tko is 10 sec
by default. You may want to reduce HBA timeout or tweak tko for quorumd.
Basically you want to set all cluster timeouts to exceed expected failover
time of lower-level systems.
-Alex

On Wed, Apr 22, 2009 at 4:31 PM, Flavio Junior <billpp at gmail.com> wrote:

> Hi folks,
>
> I'm trying to configure a 2-node cluster using quorum disk as tie-breaker.
>
> I'm getting a problem when my active I/O path for quorum disk goes
> down (I'm testing turn off one (of two) SAN fiber switches), so one
> node is being fenced.
> I believe this is not right or can have a better way to, so I'll post
> my configs here and wait for comments :).
>
> # cluster.conf, cman status/services/nodes -f
> http://rafb.net/p/J4D5UD76.html
>
> # Log from messages when one switch is turned off
> http://rafb.net/p/SA8Y0A83.html
>
> Any help, sugest or comment is appreciate :).
>
> Thanks.
>
> --
>
> Fl?vio do Carmo J?nior aka waKKu
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090422/a18eff7c/attachment.htm>

From billpp at gmail.com  Thu Apr 23 01:35:21 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 22 Apr 2009 22:35:21 -0300
Subject: [Linux-cluster] Quorum Disk and I/O MultiPath problem
In-Reply-To: <3ae027040904221745j7d8356auc3f2a2a4310a94f9@mail.gmail.com>
References: <58aa8d780904221631o5c8f6354l7e13780266c45157@mail.gmail.com>
	<3ae027040904221745j7d8356auc3f2a2a4310a94f9@mail.gmail.com>
Message-ID: <58aa8d780904221835w532214d9p308e4be5b986a0d8@mail.gmail.com>

On Wed, Apr 22, 2009 at 9:45 PM, Alex Kompel <barbos at gmail.com> wrote:
> It appears that it took 20 sec for path to fail over. quorumd tko is 10 sec
> by default. You may want to reduce HBA timeout or tweak tko for quorumd.
> Basically you want to set all cluster timeouts to exceed expected failover
> time of lower-level systems.
> -Alex
> On Wed, Apr 22, 2009 at 4:31 PM, Flavio Junior <billpp at gmail.com> wrote:
>>


Hi Alex, thanks..

I was expecting that is it, but not sure.
I'll try to bring multipath timeout down to 10 seconds and increase
tko timeout to 15, tomorrow :)

Any news I report here.

Thanks again.

--

Fl?vio do Carmo J?nior aka waKKu


From andy at linxit.eclipse.co.uk  Thu Apr 23 05:45:49 2009
From: andy at linxit.eclipse.co.uk (Andy Wallace)
Date: Thu, 23 Apr 2009 06:45:49 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>
References: <1240441905.19981.19.camel@phobos.andyw.net>
	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>
Message-ID: <1240465549.19981.28.camel@phobos.andyw.net>

On Wed, 2009-04-22 at 20:37 -0300, Flavio Junior wrote:
> On Wed, Apr 22, 2009 at 8:11 PM, Andy Wallace <andy at linxit.eclipse.co.uk> wrote:
> 
> > Although it's not as quick as I'd like, I'm getting about 150MB/s on
> > average when reading/writing files in the 100MB - 1GB range. However, if
> > I try to write a 10GB file, this goes down to about 50MB/s. That's just
> > doing dd to the mounted gfs2 on an individual node. If I do a get from
> > an ftp client, I'm seeing half that; cp from an NFS mount is more like
> > 1/5.
> >
> 
> Have you tried the same thing with another filesystem? Ext3 maybe ?
> Are you using RAID right? Did you check about RAID and LVM/partition alignment?
> 
> If you will try ext3, see about -E stride and -E stripe_width values
> on mkfs.ext3 manpage.
> This calc should helps: http://busybox.net/~aldot/mkfs_stride.html
> 
> 

Yes, I have (by the way, do you know how long ext3 takes to create a 6TB
filesystem???).

I've aligned the RAID and LVM stripes using various different values,
and found slight improvements in performance as a result. My main
problem is that when the file size hits a certain point, performance
degrades alarmingly. For example, on NFS moving a 100M file is about 20%
slower than direct access, with a 5GB file it's 80% slower (and the
direct access itself is 50% slower).

As I said before, I'll be working with 20G-170G files, so I really have
to find a way around this!

-- 
Andy


From mockey.chen at nsn.com  Thu Apr 23 05:51:29 2009
From: mockey.chen at nsn.com (Chen Ming)
Date: Thu, 23 Apr 2009 13:51:29 +0800
Subject: [Linux-cluster] cman_init error (nil) 111
Message-ID: <49F001E1.105@nsn.com>

I install a new two nodes cluster, one node can start successfully. 
while the other node can not start cman.
There is the logs. Any comment is appreciated.

Apr 23 12:37:22 notice AS-2 openais[8307]: [SERV ] Initialising service 
handler \'openais message service B.01.01\'
Apr 23 12:37:22 notice AS-2 openais[8307]: [SERV ] Initialising service 
handler \'openais configuration service\'
Apr 23 12:37:22 notice AS-2 openais[8307]: [SERV ] Initialising service 
handler \'openais cluster closed process group service
 v1.01\'
Apr 23 12:37:22 notice AS-2 openais[8307]: [SERV ] Initialising service 
handler \'openais CMAN membership service 2.01\'
Apr 23 12:37:22 info AS-2 openais[8307]: [CMAN ] CMAN 2.0.98 (built Dec  
3 2008 16:32:34) started
Apr 23 12:37:22 notice AS-2 openais[8307]: [SYNC ] Not using a virtual 
synchrony filter.
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] Creating commit token 
because I am the rep.
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] Saving state aru 0 
high seq received 0
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] Storing new sequence 
id for ring 38
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] entering COMMIT state.
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] entering RECOVERY state.
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] position [0] member 
10.56.150.139:
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] previous ring seq 52 
rep 10.56.150.139
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] aru 0 high delivered 
0 received flag 1
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] Did not need to 
originate any messages in recovery.
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] Sending initial ORF token
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] CLM CONFIGURATION CHANGE
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] New Configuration:
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] Members Left:
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] Members Joined:
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] CLM CONFIGURATION CHANGE
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] New Configuration:
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ]      r(0) 
ip(10.56.150.139)
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] Members Left:
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] Members Joined:
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ]      r(0) 
ip(10.56.150.139)
Apr 23 12:37:22 notice AS-2 openais[8307]: [SYNC ] This node is within 
the primary component and will provide service.
Apr 23 12:37:22 notice AS-2 openais[8307]: [TOTEM] entering OPERATIONAL 
state.
Apr 23 12:37:22 info AS-2 openais[8307]: [CMAN ] quorum regained, 
resuming activity
Apr 23 12:37:22 notice AS-2 openais[8307]: [CLM  ] got nodejoin message 
10.56.150.139
Apr 23 12:37:22 err AS-2 syslog-ng[7516]: Error connecting to remote 
host AF_INET(logserver:601), reattempting in 5 seconds
Apr 23 12:37:22 err AS-2 groupd[8319]: found uncontrolled kernel object 
rgmanager in /sys/kernel/dlm
Apr 23 12:37:22 err AS-2 groupd[8319]: local node must be reset to clear 
1 uncontrolled instances of gfs and/or dlm
Apr 23 12:37:22 info AS-2 openais[8307]: [CMAN ] cman killed by node 2 
because we were killed by cman_tool or other applicatio
n
Apr 23 12:37:22 err AS-2 fenced[8327]: cman_init error (nil) 111
Apr 23 12:37:22 err AS-2 fence_node[8320]: Fence of \"AS-2\" was 
unsuccessful
Apr 23 12:37:22 err AS-2 gfs_controld[8339]: cman_init error 111
Apr 23 12:37:30 err AS-2 syslog-ng[7516]: Error connecting to remote 
host AF_INET(logserver:601), reattempting in 5 seconds
Apr 23 12:37:32 err AS-2 dlm_controld[8333]: group_init error (nil) 111


From ccaulfie at redhat.com  Thu Apr 23 07:15:25 2009
From: ccaulfie at redhat.com (Chrissie Caulfield)
Date: Thu, 23 Apr 2009 08:15:25 +0100
Subject: [Linux-cluster] cman_init error (nil) 111
In-Reply-To: <49F001E1.105@nsn.com>
References: <49F001E1.105@nsn.com>
Message-ID: <49F0158D.8080904@redhat.com>

Chen Ming wrote:
> I install a new two nodes cluster, one node can start successfully.
> while the other node can not start cman.
> There is the logs. Any comment is appreciated.
> 


> Apr 23 12:37:22 err AS-2 groupd[8319]: found uncontrolled kernel object
> rgmanager in /sys/kernel/dlm
> Apr 23 12:37:22 err AS-2 groupd[8319]: local node must be reset to clear
> 1 uncontrolled instances of gfs and/or dlm
> Apr 23 12:37:22 info AS-2 openais[8307]: [CMAN ] cman killed by node 2
> because we were killed by cman_tool or other applicatio
> n

Those messages indicate that cman has been restarted without previously
shutting down GFS or the DLM. If you need to restart the cluster, I
really recommend you reboot the entire node. It is possible to restart
cman without this but it requires very careful shutting down of all of
its dependant subsystems.

-- 

Chrissie


From mockey.chen at nsn.com  Thu Apr 23 07:22:41 2009
From: mockey.chen at nsn.com (Chen Ming)
Date: Thu, 23 Apr 2009 15:22:41 +0800
Subject: [Linux-cluster] cman_init error (nil) 111
In-Reply-To: <49F0158D.8080904@redhat.com>
References: <49F001E1.105@nsn.com> <49F0158D.8080904@redhat.com>
Message-ID: <49F01741.7060700@nsn.com>

ext Chrissie Caulfield wrote:
> Chen Ming wrote:
>   
>> I install a new two nodes cluster, one node can start successfully.
>> while the other node can not start cman.
>> There is the logs. Any comment is appreciated.
>>
>>     
>
>
>   
>> Apr 23 12:37:22 err AS-2 groupd[8319]: found uncontrolled kernel object
>> rgmanager in /sys/kernel/dlm
>> Apr 23 12:37:22 err AS-2 groupd[8319]: local node must be reset to clear
>> 1 uncontrolled instances of gfs and/or dlm
>> Apr 23 12:37:22 info AS-2 openais[8307]: [CMAN ] cman killed by node 2
>> because we were killed by cman_tool or other applicatio
>> n
>>     
>
> Those messages indicate that cman has been restarted without previously
> shutting down GFS or the DLM. If you need to restart the cluster, I
> really recommend you reboot the entire node. It is possible to restart
> cman without this but it requires very careful shutting down of all of
> its dependant subsystems.
>
>   
Thanks.

I have tried to manually reboot the entire node as-2. but the issue
still exist.


From ralphzukeb at gmail.com  Thu Apr 23 07:58:02 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Thu, 23 Apr 2009 08:58:02 +0100
Subject: [Linux-cluster] Restart children but relocate parent
Message-ID: <6e4a03690904230058v7d853c12qfc0f98ef3fc2bcde@mail.gmail.com>

Hello,

I have two init scripts:
 /etc/init.d/noBugs
 /etc/init.d/buggy

buggy is dependent on noBugs.

If buggy dies, I want to restart it. If noBugs dies, I want to
relocate both noBugs and buggy to the other node.

How can I represent this? Would a failover domain achieve this?

Ralph


From swhiteho at redhat.com  Thu Apr 23 08:18:34 2009
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 23 Apr 2009 09:18:34 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <1240441905.19981.19.camel@phobos.andyw.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>
Message-ID: <1240474714.3396.3.camel@localhost.localdomain>

Hi,

On Thu, 2009-04-23 at 00:11 +0100, Andy Wallace wrote:
> Hi,
> 
> I've just set up a GFS2 filesystem for a client, but have some serious
> issues with the performance on large files (i.e. > 2GB). This is a real
> problem, as the files we're going to be using are going to be approx.
> 20GB up to 170GB.
> 

If you are talking about the write side of things, then yes, we know
there is an issue which is related to the "page at a time" architecture
of the Linux VFS helpers. It is not easy to work around this because its
an area thats rather prone to deadlocks. We do know there is an issue
though with large streaming writes and we will look at solutions as soon
as we can.

On the read side though, I would expect performance to be pretty good,
so if you are having trouble there, then that is something that we
should look into.

You don't mention which kernel version you are using. Thats always
helpful to know in diagnosing issues like this,

Steve.


> Hardware setup is:
> 2 x IBM X3650 servers with 2 x Dual Xeon, 4GB RAM, 2 x 2GB/s HBAs per
> server;
> Storage on IBM DS4700 - 48 x 1TB SATA disks
> 
> Files will be written to the storage via FTP, read via NFS mounts, both
> on an LVS virtual IP address.
> 
> Although it's not as quick as I'd like, I'm getting about 150MB/s on
> average when reading/writing files in the 100MB - 1GB range. However, if
> I try to write a 10GB file, this goes down to about 50MB/s. That's just
> doing dd to the mounted gfs2 on an individual node. If I do a get from
> an ftp client, I'm seeing half that; cp from an NFS mount is more like
> 1/5.
> 
> I've spent a lot of time reading up on GFS2 performance, but I haven't
> found anything useful for improving throughput with large files. Has
> anyone got any suggestions or managed to solve a similar problem?
> 


From gordan at bobich.net  Thu Apr 23 09:41:20 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 23 Apr 2009 10:41:20 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <1240465549.19981.28.camel@phobos.andyw.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>
	<1240465549.19981.28.camel@phobos.andyw.net>
Message-ID: <00eaf66dfa170d255e8dd4d63e94cc52@localhost>

On Thu, 23 Apr 2009 06:45:49 +0100, Andy Wallace
<andy at linxit.eclipse.co.uk>
wrote:
> On Wed, 2009-04-22 at 20:37 -0300, Flavio Junior wrote:
>> On Wed, Apr 22, 2009 at 8:11 PM, Andy Wallace
<andy at linxit.eclipse.co.uk>
>> wrote:
>> 
>> > Although it's not as quick as I'd like, I'm getting about 150MB/s on
>> > average when reading/writing files in the 100MB - 1GB range. However,
>> > if
>> > I try to write a 10GB file, this goes down to about 50MB/s. That's
just
>> > doing dd to the mounted gfs2 on an individual node. If I do a get from
>> > an ftp client, I'm seeing half that; cp from an NFS mount is more like
>> > 1/5.
>> >
>> 
>> Have you tried the same thing with another filesystem? Ext3 maybe ?
>> Are you using RAID right? Did you check about RAID and LVM/partition
>> alignment?
>> 
>> If you will try ext3, see about -E stride and -E stripe_width values
>> on mkfs.ext3 manpage.
>> This calc should helps: http://busybox.net/~aldot/mkfs_stride.html
>> 
> 
> Yes, I have (by the way, do you know how long ext3 takes to create a 6TB
> filesystem???).

It depends on what you consider to be "long". The last box I built had
8x 1TB 7200rpm disks in software md RAID6 = 6 TB usable, DAS, consumer
grade motherboard, and 2 of the 8 ports were massively bottlenecked by
a 32-bit 33MHz PCI SATA controller, but all 8 ports had NCQ. This took
about 10-20 minutes (I haven't timed it exactly, about a cup of coffee
long ;)) to mkfs ext3 when the parameters were properly set up. With
default settings, it was taking around 10x longer, maybe even more.

My findings are that the default settings and old wisdom often taken
as axiomatically true are actually completely off the mark.

Here is the line of reasoning that I have found to lead to best results.

RAID block size of 64-256KB is way, way too big. It will kill
the performance of small IOs without yielding a noticeable increase
in performance for large IOs, and sometimes in fact hurting large
IOs, too.

To pick the optimum RAID block size, look at the disks. What is the
multi-sector transfer size they can handle? I have not seen any disks
to date that have this figure at anything other than 16, and
16sectors * 512 bytes/sector = 8KB.

So set the RAID block size to 8KB.

Make sure your stride parameter is set so that
ext3 block size (usually 4KB) * stride = block size,
in this case ext3 block size = 4, stride = 2, RAID block = 8KB.

So far so good, but we're not done yet. The last thing to consider
is the extent / block group size. The beginning of each block group
contains a superblock for that group. It is the top of that inode
tree, and needs to be checked to find any file/block in that group.
That means the beginning block of a block group is a major hot-spot
for I/O, as it has to be read for every read and written for every
write to that group. This, in turn, means that for anything like
reasonable performance you need to have the block group beginnings
distributed evenly across all the disks in your RAID array, or else
you'll hammer one disk while the others are sitting idle.

For example, the default for ext3 is 32768 blocks in a block group.
IIRC, on ext3, the adjustment can only be made in increments of 8
blocks (32KB assuming 4KB blocks). In GFS, IIRC, the minimum
adjustment increment is 1MB(!) (not a FS limitation but a mkfs.gfs
limitation, I'm told).

The optimum number of blocks in a group will depend on the RAID
level and the number of disks in the array, but you can simplify
it into a RAID0 equivalent for the purpose of this exercise
e.g. 8 disks in RAID6 can be considered to be 6 disks in RAID0.
Ideally you want the block group size to align to the stripe
width +/- 1 stride width so that the block group beginnings
rotate among the disks (upward for +1 stride, downward for
-1 stride, both will achieve the same purpose).

The stripe width in the case described is 8KB * 6 disks = 48KB.
So, you want block group to align to a multiple of
8KB * 7 disks = 56KB. But be careful here - you should aim for
a number that is a multiple of 56KB, but not a multiple of
48KB because if they line up, you haven't achieved anything
and you're back where you started!

56KB is 14 4KB blocks. Without getting involved in a major
factoring exercise, 28,000 blocks sounds good (default is
32768 for ext3, which is in a reasonable ball park).
28,000*4KB is a multiple of 56KB but not 48KB, so it looks
like a good choice in this example.

Obviously, you'll have to work out the optimal numbers for
your specific RAID configuration, the above example is for:
disk multi-sector = 16
ext3 block size = 4KB
RAID block size = 8KB
ext3 stride = 2
RAID = 6
disks = 8

This is one of the key reasons why I think LVM is evil. It abstracts
things and encourages no forward thinking. Adding a new volume is
the same as adding a new disk to a software RAID to stretch it.
It'll upset the block group size calculation and in one fell swoop
take away the advantage of load balancing across all the disks you
have. By doing this you can cripple the performance on some
operations from scaling linearly with the number of disks to being
bogged down to the performance of just one disk.

This can make a massive difference to IOPS figures you get out
of a storage system, but I suspect that enterprise storage vendors
are much happier being able to sell more (and more expensive)
equipment when the performance gets crippled through misconfiguration,
or even just lack of consideration of parameters such as the above.
This is also why quite frequently a cheap box made of COTS components
can complete blow away a similar enterprise grade box with 10-100x
the price tag.

> I've aligned the RAID and LVM stripes using various different values,
> and found slight improvements in performance as a result. My main
> problem is that when the file size hits a certain point, performance
> degrades alarmingly. For example, on NFS moving a 100M file is about 20%
> slower than direct access, with a 5GB file it's 80% slower (and the
> direct access itself is 50% slower).
> 
> As I said before, I'll be working with 20G-170G files, so I really have
> to find a way around this!

Have you tried increasing the resource group sizes? IIRC the default is
256MB (-r parameter to mkfs.gfs), which may well have a considerable
impact when you are shifting huge files around. Try upping it,
potentially by a large factor, and see how that affects your large file
performance.

Note - "resource group" in GFS is the same as the "block group" described
above for ext3 in the performance optimization example. However, while
ext3 is adjustable in multiples of 8 blocks (32KB), gfs is only adjustable
in increments of 1MB.

HTH

Gordan


From m.radzewicz at tech.crmedia.pl  Thu Apr 23 10:01:18 2009
From: m.radzewicz at tech.crmedia.pl (=?ISO-8859-2?Q?Miko=B3aj_Radzewicz?=)
Date: Thu, 23 Apr 2009 12:01:18 +0200
Subject: [Linux-cluster] Two questions about GFS2?
Message-ID: <49F03C6E.80400@tech.crmedia.pl>

Hello all,
I have two simple questions considering GFS2?
1. I would like to now if the GFS2 version which has appeared with
Centos 5.3 is really production ready or it is better to wait for a some
time??

2. Is is possibile/advised to use GFS and GFS2 in the same cluster?

Thanks a lot.

-- 
Mikolaj


From swhiteho at redhat.com  Thu Apr 23 10:05:53 2009
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 23 Apr 2009 11:05:53 +0100
Subject: [Linux-cluster] Two questions about GFS2?
In-Reply-To: <49F03C6E.80400@tech.crmedia.pl>
References: <49F03C6E.80400@tech.crmedia.pl>
Message-ID: <1240481153.3396.10.camel@localhost.localdomain>

Hi,

On Thu, 2009-04-23 at 12:01 +0200, Miko?aj Radzewicz wrote:
> Hello all,
> I have two simple questions considering GFS2?
> 1. I would like to now if the GFS2 version which has appeared with
> Centos 5.3 is really production ready or it is better to wait for a some
> time??
> 
Its ready in the sense that it is based upon the supported RHEL 5.3

> 2. Is is possibile/advised to use GFS and GFS2 in the same cluster?
> 
> Thanks a lot.
> 
There should be no issues in using a mixed gfs/gfs2 cluster,

Steve.


From pasik at iki.fi  Thu Apr 23 10:16:28 2009
From: pasik at iki.fi (Pasi =?iso-8859-1?Q?K=E4rkk=E4inen?=)
Date: Thu, 23 Apr 2009 13:16:28 +0300
Subject: [Linux-cluster] GFS2 performance on large files / mkfs tuning
In-Reply-To: <00eaf66dfa170d255e8dd4d63e94cc52@localhost>
References: <1240441905.19981.19.camel@phobos.andyw.net>
	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>
	<1240465549.19981.28.camel@phobos.andyw.net>
	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>
Message-ID: <20090423101628.GU24960@edu.joroinen.fi>

On Thu, Apr 23, 2009 at 10:41:20AM +0100, Gordan Bobic wrote:
> On Thu, 23 Apr 2009 06:45:49 +0100, Andy Wallace
> <andy at linxit.eclipse.co.uk>
> wrote:
> > On Wed, 2009-04-22 at 20:37 -0300, Flavio Junior wrote:
> >> On Wed, Apr 22, 2009 at 8:11 PM, Andy Wallace
> <andy at linxit.eclipse.co.uk>
> >> wrote:
> >> 
> >> > Although it's not as quick as I'd like, I'm getting about 150MB/s on
> >> > average when reading/writing files in the 100MB - 1GB range. However,
> >> > if
> >> > I try to write a 10GB file, this goes down to about 50MB/s. That's
> just
> >> > doing dd to the mounted gfs2 on an individual node. If I do a get from
> >> > an ftp client, I'm seeing half that; cp from an NFS mount is more like
> >> > 1/5.
> >> >
> >> 
> >> Have you tried the same thing with another filesystem? Ext3 maybe ?
> >> Are you using RAID right? Did you check about RAID and LVM/partition
> >> alignment?
> >> 
> >> If you will try ext3, see about -E stride and -E stripe_width values
> >> on mkfs.ext3 manpage.
> >> This calc should helps: http://busybox.net/~aldot/mkfs_stride.html
> >> 

Oh.. nice tool. I've been always calculating manually. Then again the math
is pretty simple..

> > 
> > Yes, I have (by the way, do you know how long ext3 takes to create a 6TB
> > filesystem???).
> 
> It depends on what you consider to be "long". The last box I built had
> 8x 1TB 7200rpm disks in software md RAID6 = 6 TB usable, DAS, consumer
> grade motherboard, and 2 of the 8 ports were massively bottlenecked by
> a 32-bit 33MHz PCI SATA controller, but all 8 ports had NCQ. This took
> about 10-20 minutes (I haven't timed it exactly, about a cup of coffee
> long ;)) to mkfs ext3 when the parameters were properly set up. With
> default settings, it was taking around 10x longer, maybe even more.
> 

Interesting information.. what settings did you tweak for faster mkfs? Just
the things mentioned below?

> My findings are that the default settings and old wisdom often taken
> as axiomatically true are actually completely off the mark.
> 
> Here is the line of reasoning that I have found to lead to best results.
> 
> RAID block size of 64-256KB is way, way too big. It will kill
> the performance of small IOs without yielding a noticeable increase
> in performance for large IOs, and sometimes in fact hurting large
> IOs, too.
> 
> To pick the optimum RAID block size, look at the disks. What is the
> multi-sector transfer size they can handle? I have not seen any disks
> to date that have this figure at anything other than 16, and
> 16sectors * 512 bytes/sector = 8KB.
> 

Hmm.. how can you determine multi-sector transfer size from a disk? 


> So set the RAID block size to 8KB.
> 
> Make sure your stride parameter is set so that
> ext3 block size (usually 4KB) * stride = block size,
> in this case ext3 block size = 4, stride = 2, RAID block = 8KB.
> 
> So far so good, but we're not done yet. The last thing to consider
> is the extent / block group size. The beginning of each block group
> contains a superblock for that group. It is the top of that inode
> tree, and needs to be checked to find any file/block in that group.
> That means the beginning block of a block group is a major hot-spot
> for I/O, as it has to be read for every read and written for every
> write to that group. This, in turn, means that for anything like
> reasonable performance you need to have the block group beginnings
> distributed evenly across all the disks in your RAID array, or else
> you'll hammer one disk while the others are sitting idle.
> 
> For example, the default for ext3 is 32768 blocks in a block group.
> IIRC, on ext3, the adjustment can only be made in increments of 8
> blocks (32KB assuming 4KB blocks). In GFS, IIRC, the minimum
> adjustment increment is 1MB(!) (not a FS limitation but a mkfs.gfs
> limitation, I'm told).
> 
> The optimum number of blocks in a group will depend on the RAID
> level and the number of disks in the array, but you can simplify
> it into a RAID0 equivalent for the purpose of this exercise
> e.g. 8 disks in RAID6 can be considered to be 6 disks in RAID0.
> Ideally you want the block group size to align to the stripe
> width +/- 1 stride width so that the block group beginnings
> rotate among the disks (upward for +1 stride, downward for
> -1 stride, both will achieve the same purpose).
> 
> The stripe width in the case described is 8KB * 6 disks = 48KB.
> So, you want block group to align to a multiple of
> 8KB * 7 disks = 56KB. But be careful here - you should aim for
> a number that is a multiple of 56KB, but not a multiple of
> 48KB because if they line up, you haven't achieved anything
> and you're back where you started!
> 
> 56KB is 14 4KB blocks. Without getting involved in a major
> factoring exercise, 28,000 blocks sounds good (default is
> 32768 for ext3, which is in a reasonable ball park).
> 28,000*4KB is a multiple of 56KB but not 48KB, so it looks
> like a good choice in this example.
> 
> Obviously, you'll have to work out the optimal numbers for
> your specific RAID configuration, the above example is for:
> disk multi-sector = 16
> ext3 block size = 4KB
> RAID block size = 8KB
> ext3 stride = 2
> RAID = 6
> disks = 8
> 
> This is one of the key reasons why I think LVM is evil. It abstracts
> things and encourages no forward thinking. Adding a new volume is
> the same as adding a new disk to a software RAID to stretch it.
> It'll upset the block group size calculation and in one fell swoop
> take away the advantage of load balancing across all the disks you
> have. By doing this you can cripple the performance on some
> operations from scaling linearly with the number of disks to being
> bogged down to the performance of just one disk.
> 
> This can make a massive difference to IOPS figures you get out
> of a storage system, but I suspect that enterprise storage vendors
> are much happier being able to sell more (and more expensive)
> equipment when the performance gets crippled through misconfiguration,
> or even just lack of consideration of parameters such as the above.
> This is also why quite frequently a cheap box made of COTS components
> can complete blow away a similar enterprise grade box with 10-100x
> the price tag.
> 
> > I've aligned the RAID and LVM stripes using various different values,
> > and found slight improvements in performance as a result. My main
> > problem is that when the file size hits a certain point, performance
> > degrades alarmingly. For example, on NFS moving a 100M file is about 20%
> > slower than direct access, with a 5GB file it's 80% slower (and the
> > direct access itself is 50% slower).
> > 
> > As I said before, I'll be working with 20G-170G files, so I really have
> > to find a way around this!
> 
> Have you tried increasing the resource group sizes? IIRC the default is
> 256MB (-r parameter to mkfs.gfs), which may well have a considerable
> impact when you are shifting huge files around. Try upping it,
> potentially by a large factor, and see how that affects your large file
> performance.
> 
> Note - "resource group" in GFS is the same as the "block group" described
> above for ext3 in the performance optimization example. However, while
> ext3 is adjustable in multiples of 8 blocks (32KB), gfs is only adjustable
> in increments of 1MB.
> 

Very good information you have here.. 

Thanks for posting it.

-- Pasi


From gordan at bobich.net  Thu Apr 23 11:16:53 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 23 Apr 2009 12:16:53 +0100
Subject: [Linux-cluster] GFS2 performance on large files / mkfs tuning
In-Reply-To: <20090423101628.GU24960@edu.joroinen.fi>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>
	<20090423101628.GU24960@edu.joroinen.fi>
Message-ID: <9fbf42bf4d10d8b36517b6950264cb14@localhost>

On Thu, 23 Apr 2009 13:16:28 +0300, "Pasi K?rkk?inen" <pasik at iki.fi>
wrote:


>> > Yes, I have (by the way, do you know how long ext3 takes to create a
>> > 6TB filesystem???).
>> 
>> It depends on what you consider to be "long". The last box I built had
>> 8x 1TB 7200rpm disks in software md RAID6 = 6 TB usable, DAS, consumer
>> grade motherboard, and 2 of the 8 ports were massively bottlenecked by
>> a 32-bit 33MHz PCI SATA controller, but all 8 ports had NCQ. This took
>> about 10-20 minutes (I haven't timed it exactly, about a cup of coffee
>> long ;)) to mkfs ext3 when the parameters were properly set up. With
>> default settings, it was taking around 10x longer, maybe even more.
>> 
> 
> Interesting information.. what settings did you tweak for faster mkfs?
Just
> the things mentioned below?

Yes, all of the ones mentioned.

>> My findings are that the default settings and old wisdom often taken
>> as axiomatically true are actually completely off the mark.
>> 
>> Here is the line of reasoning that I have found to lead to best results.
>> 
>> RAID block size of 64-256KB is way, way too big. It will kill
>> the performance of small IOs without yielding a noticeable increase
>> in performance for large IOs, and sometimes in fact hurting large
>> IOs, too.
>> 
>> To pick the optimum RAID block size, look at the disks. What is the
>> multi-sector transfer size they can handle? I have not seen any disks
>> to date that have this figure at anything other than 16, and
>> 16sectors * 512 bytes/sector = 8KB.
>> 
> 
> Hmm.. how can you determine multi-sector transfer size from a disk? 

hdparm -i /dev/[hs]d[a-z]

Gordan


From csmith at nighthawkrad.net  Thu Apr 23 13:09:23 2009
From: csmith at nighthawkrad.net (Christopher Smith)
Date: Thu, 23 Apr 2009 15:09:23 +0200
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <00eaf66dfa170d255e8dd4d63e94cc52@localhost>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>
	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>
Message-ID: <49F06883.5010000@nighthawkrad.net>

Gordan Bobic wrote:

> To pick the optimum RAID block size, look at the disks. What is the
> multi-sector transfer size they can handle? I have not seen any disks
> to date that have this figure at anything other than 16, and
> 16sectors * 512 bytes/sector = 8KB.
> 
> So set the RAID block size to 8KB.

Is this "chunk size" ?  Or would chunk size be #disks*8KB ?

> This is also why quite frequently a cheap box made of COTS components
> can complete blow away a similar enterprise grade box with 10-100x
> the price tag.

In fairness, those enterprise boxes typically have dual redundant 
controllers with mirrored cache, and other failure-resistant goodies you 
can't really do with COTS hardware. ;)

-- 
Christopher Smith

UNIX Team Leader
Nighthawk Radiology Services
Limmatquai 4, 6th Floor
8001 Zurich, Switzerland
http://www.nighthawkrad.net
Sydney Fax:    +61 2 8211 2333
Zurich Fax:    +41 43 497 3301
USA Toll free:  866 241 6635

Email:         csmith at nighthawkrad.net
IP Extension:  8163
Sydney Phone:  +61 2 8211 2363
Sydney Mobile: +61 4 0739 7563
Zurich Phone:  +41 44 267 3363
Zurich Mobile: +41 79 550 2715

All phones forwarded to my current location, however, please consider 
the local time in Zurich before calling from abroad.


CONFIDENTIALITY NOTICE:   This email, including any attachments, 
contains information from NightHawk Radiology Services, which may be 
confidential or privileged. The information is intended to be for the 
use of the individual or entity named above. If you are not the intended 
recipient, be aware that any disclosure, copying, distribution or use of 
the contents of this information is prohibited. If you have received 
this email in error, please notify NightHawk Radiology Services 
immediately by forwarding message to postmaster at nighthawkrad.net and 
destroy all electronic and hard copies of the communication, including 
attachments.


From gordan at bobich.net  Thu Apr 23 13:14:28 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 23 Apr 2009 14:14:28 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <49F06883.5010000@nighthawkrad.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>
	<49F06883.5010000@nighthawkrad.net>
Message-ID: <390883fef8a5add08e84ca9d9aa9e144@localhost>

On Thu, 23 Apr 2009 15:09:23 +0200, Christopher Smith
<csmith at nighthawkrad.net> wrote:
> Gordan Bobic wrote:
> 
>> To pick the optimum RAID block size, look at the disks. What is the
>> multi-sector transfer size they can handle? I have not seen any disks
>> to date that have this figure at anything other than 16, and
>> 16sectors * 512 bytes/sector = 8KB.
>> 
>> So set the RAID block size to 8KB.
> 
> Is this "chunk size" ?  Or would chunk size be #disks*8KB ?

In the case of software RAID, yes, it is referred to as chunk size.

>> This is also why quite frequently a cheap box made of COTS components
>> can complete blow away a similar enterprise grade box with 10-100x
>> the price tag.
> 
> In fairness, those enterprise boxes typically have dual redundant 
> controllers with mirrored cache, and other failure-resistant goodies you 
> can't really do with COTS hardware. ;)

Maybe so, but you can still build two complete COTS boxes with
no internal redundancy for a fraction of the cost and deal with
mirroring and fail-over on server level. Enter RHCS and DRBD. :)
Why compromise?

Gordan


From divi at divinet.pl  Thu Apr 23 13:19:38 2009
From: divi at divinet.pl (Piotr Baranowski)
Date: Thu, 23 Apr 2009 15:19:38 +0200
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <49F06883.5010000@nighthawkrad.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>
	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>
	<1240465549.19981.28.camel@phobos.andyw.net>
	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>
	<49F06883.5010000@nighthawkrad.net>
Message-ID: <1240492778.5599.3.camel@dezerter>

Dnia 2009-04-23, czw o godzinie 15:09 +0200, Christopher Smith pisze:
> Gordan Bobic wrote:
> 
> > To pick the optimum RAID block size, look at the disks. What is the
> > multi-sector transfer size they can handle? I have not seen any disks
> > to date that have this figure at anything other than 16, and
> > 16sectors * 512 bytes/sector = 8KB.
> > 
> > So set the RAID block size to 8KB.
> 
> Is this "chunk size" ?  Or would chunk size be #disks*8KB ?
> 
> > This is also why quite frequently a cheap box made of COTS components
> > can complete blow away a similar enterprise grade box with 10-100x
> > the price tag.
> 
> In fairness, those enterprise boxes typically have dual redundant 
> controllers with mirrored cache, and other failure-resistant goodies you 
> can't really do with COTS hardware. ;)

I don't want to stir the hornet's nest but just look at that:
http://linux.yyz.us/why-software-raid.html

There is no clear winner in that bet, but knowing pros and cons of both
approaches helps choose the best.

peace
-- 
Piotr Baranowski
Open Source Education Center www.osec.pl

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: To jest cz??? wiadomo?ci podpisana cyfrowo
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090423/8d7f70a6/attachment.sig>

From gordan at bobich.net  Thu Apr 23 13:28:44 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 23 Apr 2009 14:28:44 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <1240492778.5599.3.camel@dezerter>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>	<49F06883.5010000@nighthawkrad.net>
	<1240492778.5599.3.camel@dezerter>
Message-ID: <ee07579a8be1eb8eb98cdd0dbb9d8538@localhost>

On Thu, 23 Apr 2009 15:19:38 +0200, Piotr Baranowski <divi at divinet.pl>
wrote:

>> > To pick the optimum RAID block size, look at the disks. What is the
>> > multi-sector transfer size they can handle? I have not seen any disks
>> > to date that have this figure at anything other than 16, and
>> > 16sectors * 512 bytes/sector = 8KB.
>> > 
>> > So set the RAID block size to 8KB.
>> 
>> Is this "chunk size" ?  Or would chunk size be #disks*8KB ?
>> 
>> > This is also why quite frequently a cheap box made of COTS components
>> > can complete blow away a similar enterprise grade box with 10-100x
>> > the price tag.
>> 
>> In fairness, those enterprise boxes typically have dual redundant 
>> controllers with mirrored cache, and other failure-resistant goodies you

>> can't really do with COTS hardware. ;)
> 
> I don't want to stir the hornet's nest but just look at that:
> http://linux.yyz.us/why-software-raid.html
> 
> There is no clear winner in that bet, but knowing pros and cons of both
> approaches helps choose the best.

I don't see how that contradicts what I said. If we assume that a decent
RAID controller will take full advantage of the disk's multi-sector
transfer capability (which I have to assume is the case or else my
opinion of RAID controller vendors would mostly be limited to pondering
how they have managed to stay in business so far), than everything I said
still applies. Just because you're pushing the multi-sector and chunk size
down to a lower level doesn't mean that everything else doesn't still
stack.

Oh, and I consider RAID controllers to be included in COTS. At no point
did I mention advantages of hardware vs. software RAID - either will do,
and the benefits of knowing what one is doing when it comes to file
system layout are still the same.

Gordan


From csmith at nighthawkrad.net  Thu Apr 23 13:41:45 2009
From: csmith at nighthawkrad.net (Christopher Smith)
Date: Thu, 23 Apr 2009 15:41:45 +0200
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <390883fef8a5add08e84ca9d9aa9e144@localhost>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>	<49F06883.5010000@nighthawkrad.net>
	<390883fef8a5add08e84ca9d9aa9e144@localhost>
Message-ID: <49F07019.7010703@nighthawkrad.net>

Gordan Bobic wrote:
> On Thu, 23 Apr 2009 15:09:23 +0200, Christopher Smith
> <csmith at nighthawkrad.net> wrote:
>>> This is also why quite frequently a cheap box made of COTS components
>>> can complete blow away a similar enterprise grade box with 10-100x
>>> the price tag.
>> In fairness, those enterprise boxes typically have dual redundant 
>> controllers with mirrored cache, and other failure-resistant goodies you 
>> can't really do with COTS hardware. ;)
> 
> Maybe so, but you can still build two complete COTS boxes with
> no internal redundancy for a fraction of the cost and deal with
> mirroring and fail-over on server level. Enter RHCS and DRBD. :)
> Why compromise?

Not every service is failover-friendly at the server level. ;)

Don't get me wrong, I'm 100% behind using COTS stuff wherever possible, 
and setups with DRBD, et al, have worked very well for us in several 
locations.  But there are some situations where it just doesn't (eg: SAN 
LUNs shared between multiple servers - unless you want to forego the 
performance benefits of write caching and DIY with multiple machines, 
DRBD and iscsi-target).

(There's also the labour, maintenance and support costs of DIY vs 
plug-in-and-go to consider, as well.)

-- 
Christopher Smith

UNIX Team Leader
Nighthawk Radiology Services
Limmatquai 4, 6th Floor
8001 Zurich, Switzerland
http://www.nighthawkrad.net
Sydney Fax:    +61 2 8211 2333
Zurich Fax:    +41 43 497 3301
USA Toll free:  866 241 6635

Email:         csmith at nighthawkrad.net
IP Extension:  8163
Sydney Phone:  +61 2 8211 2363
Sydney Mobile: +61 4 0739 7563
Zurich Phone:  +41 44 267 3363
Zurich Mobile: +41 79 550 2715

All phones forwarded to my current location, however, please consider 
the local time in Zurich before calling from abroad.


CONFIDENTIALITY NOTICE:   This email, including any attachments, 
contains information from NightHawk Radiology Services, which may be 
confidential or privileged. The information is intended to be for the 
use of the individual or entity named above. If you are not the intended 
recipient, be aware that any disclosure, copying, distribution or use of 
the contents of this information is prohibited. If you have received 
this email in error, please notify NightHawk Radiology Services 
immediately by forwarding message to postmaster at nighthawkrad.net and 
destroy all electronic and hard copies of the communication, including 
attachments.


From gordan at bobich.net  Thu Apr 23 14:09:57 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 23 Apr 2009 15:09:57 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <49F07019.7010703@nighthawkrad.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>	<49F06883.5010000@nighthawkrad.net>	<390883fef8a5add08e84ca9d9aa9e144@localhost>
	<49F07019.7010703@nighthawkrad.net>
Message-ID: <12ca2dc9875ac0c72cbb92e4bdc85dd3@localhost>

On Thu, 23 Apr 2009 15:41:45 +0200, Christopher Smith
<csmith at nighthawkrad.net> wrote:
> Gordan Bobic wrote:
>> On Thu, 23 Apr 2009 15:09:23 +0200, Christopher Smith
>> <csmith at nighthawkrad.net> wrote:
>>>> This is also why quite frequently a cheap box made of COTS components
>>>> can complete blow away a similar enterprise grade box with 10-100x
>>>> the price tag.
>>> In fairness, those enterprise boxes typically have dual redundant 
>>> controllers with mirrored cache, and other failure-resistant goodies
you
>>>
>>> can't really do with COTS hardware. ;)
>> 
>> Maybe so, but you can still build two complete COTS boxes with
>> no internal redundancy for a fraction of the cost and deal with
>> mirroring and fail-over on server level. Enter RHCS and DRBD. :)
>> Why compromise?
> 
> Not every service is failover-friendly at the server level. ;)

Sure, but then again, there aren't many and unless you are talking
about the equipment on the order of 1000x more expensive, some downtime
will eventually happen anyway (some things are rather hard to work
around). So best protect yourself from that on a higher level where
things are cheaper, easier to support and easier to do something about.

> Don't get me wrong, I'm 100% behind using COTS stuff wherever possible, 
> and setups with DRBD, et al, have worked very well for us in several 
> locations.  But there are some situations where it just doesn't (eg: SAN 
> LUNs shared between multiple servers - unless you want to forego the 
> performance benefits of write caching and DIY with multiple machines, 
> DRBD and iscsi-target).

I'm not sure write-caching is that big a deal - your SAN will be caching
all the writes anyway. Granted, the cache will be about 0.05ms further
away than it would be on a local controller, but then again, the
clustering overheads will relegate that into the realm of irrelevance.
I have yet to see a shared-SAN file system that doesn't introduce
performance penalties big enough to make the ping time to SAN a drop
in the ocean.

> (There's also the labour, maintenance and support costs of DIY vs 
> plug-in-and-go to consider, as well.)

That's another contentious point. You'll pay for that labour
several times over in increased hardware costs, vendor support
contracts (I've yet to find one that actually provides anything
more useful than the purely imaginary backside covering),
reduced performance, less transparency and the consequences
of a misguided belief of a growing number of individuals that
they can control and maintain a complex system by clicking
on pretty pictures.

Just being a devil's advocate. ;)

Gordan


From m.radzewicz at tech.crmedia.pl  Thu Apr 23 15:09:14 2009
From: m.radzewicz at tech.crmedia.pl (=?ISO-8859-2?Q?Miko=B3aj_Radzewicz?=)
Date: Thu, 23 Apr 2009 17:09:14 +0200
Subject: [Linux-cluster] GFS block size
Message-ID: <49F0849A.1000109@tech.crmedia.pl>

Hello again,
I'm planning to run GFS cluster of two nodes on some SAN storage over
fabric - 4TB ( 5 disk in RAID 5 - with stripe 128k). I'm having some
mixed feelings about setting the GFS block size for my fs...

The nodes are going to hold some WWW files (let say for video streaming)
which can be divided in 3 main groups:
1/3 files smaller 100K
1/3 files about 1M
1/3 files about 2M

I used to use a GFS block size of 4K - the default one but now I'm
wondering of changing it to some greater value like 16K. Does it make
sense and I will be able to notice some performance benefits?? On the
other hand I have read that changing default 4K for GFS isn't advisable.

btw: if it makes some difference some directoris must contain about 50K
files...

-- 
Miko?aj


From swhiteho at redhat.com  Thu Apr 23 15:11:54 2009
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 23 Apr 2009 16:11:54 +0100
Subject: [Linux-cluster] GFS block size
In-Reply-To: <49F0849A.1000109@tech.crmedia.pl>
References: <49F0849A.1000109@tech.crmedia.pl>
Message-ID: <1240499514.29604.108.camel@localhost.localdomain>

Hi,

On Thu, 2009-04-23 at 17:09 +0200, Miko?aj Radzewicz wrote:
> Hello again,
> I'm planning to run GFS cluster of two nodes on some SAN storage over
> fabric - 4TB ( 5 disk in RAID 5 - with stripe 128k). I'm having some
> mixed feelings about setting the GFS block size for my fs...
> 
> The nodes are going to hold some WWW files (let say for video streaming)
> which can be divided in 3 main groups:
> 1/3 files smaller 100K
> 1/3 files about 1M
> 1/3 files about 2M
> 
> I used to use a GFS block size of 4K - the default one but now I'm
> wondering of changing it to some greater value like 16K. Does it make
> sense and I will be able to notice some performance benefits?? On the
> other hand I have read that changing default 4K for GFS isn't advisable.
> 
> btw: if it makes some difference some directoris must contain about 50K
> files...
> 

4k is the largest block size available as we don't support block sizes
greater than the page size, so I'd stick with 4k for now. 50k files
should be ok in a single directory. Of more importance is the access
pattern (particularly creates and deletes) from the different nodes
within single directories,

Steve.


From csmith at nighthawkrad.net  Thu Apr 23 15:24:18 2009
From: csmith at nighthawkrad.net (Christopher Smith)
Date: Thu, 23 Apr 2009 17:24:18 +0200
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <12ca2dc9875ac0c72cbb92e4bdc85dd3@localhost>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>	<49F06883.5010000@nighthawkrad.net>	<390883fef8a5add08e84ca9d9aa9e144@localhost>	<49F07019.7010703@nighthawkrad.net>
	<12ca2dc9875ac0c72cbb92e4bdc85dd3@localhost>
Message-ID: <49F08822.1090104@nighthawkrad.net>

Gordan Bobic wrote:
> On Thu, 23 Apr 2009 15:41:45 +0200, Christopher Smith

>> Don't get me wrong, I'm 100% behind using COTS stuff wherever possible, 
>> and setups with DRBD, et al, have worked very well for us in several 
>> locations.  But there are some situations where it just doesn't (eg: SAN 
>> LUNs shared between multiple servers - unless you want to forego the 
>> performance benefits of write caching and DIY with multiple machines, 
>> DRBD and iscsi-target).
> 
> I'm not sure write-caching is that big a deal - your SAN will be caching
> all the writes anyway. Granted, the cache will be about 0.05ms further
> away than it would be on a local controller, but then again, the
> clustering overheads will relegate that into the realm of irrelevance.
> I have yet to see a shared-SAN file system that doesn't introduce
> performance penalties big enough to make the ping time to SAN a drop
> in the ocean.

I was actually thinking of the DIY-SAN scenario.  Eg: you get a couple 
of 2U machines with a few TB of internal disk, mirror them with DRBD, 
then export the disk as an iSCSI target.  Setup something (we used 
heartbeat) to failover between the two and voila, you have your own 
redundant iSCSI SAN.

Unfortunately you then can't get the best benefit from the dirt cheap 
gigabytes of RAM you can stuff into those machines for write caching, 
since there's no way to synchronise between the two - so if one machines 
dies there's data loss.

The same applies if you want to DIY an SMB or NFS NAS - either no write 
caching, or a high risk of data corruption.

(Unless I'm missing something ?)

> Just being a devil's advocate. ;)

Me too, to a degree.  We have a couple of SANs, primarily to keep 
higher-ups feeling warm and fuzzy, and I'm not convinced any of them 
have delivered anything close to proportionally better performance and 
reliability than something we could have built ourselves.

With that said, I wouldn't want to be the guy who (for example) DIYed an 
NFS NAS to run crtical Oracle DBs on, when Oracle support comes back 
with "until you're running on supported storage, we won't help you with 
your Oracle problems".

-- 
Christopher Smith

UNIX Team Leader
Nighthawk Radiology Services
Limmatquai 4, 6th Floor
8001 Zurich, Switzerland
http://www.nighthawkrad.net
Sydney Fax:    +61 2 8211 2333
Zurich Fax:    +41 43 497 3301
USA Toll free:  866 241 6635

Email:         csmith at nighthawkrad.net
IP Extension:  8163
Sydney Phone:  +61 2 8211 2363
Sydney Mobile: +61 4 0739 7563
Zurich Phone:  +41 44 267 3363
Zurich Mobile: +41 79 550 2715

All phones forwarded to my current location, however, please consider 
the local time in Zurich before calling from abroad.


CONFIDENTIALITY NOTICE:   This email, including any attachments, 
contains information from NightHawk Radiology Services, which may be 
confidential or privileged. The information is intended to be for the 
use of the individual or entity named above. If you are not the intended 
recipient, be aware that any disclosure, copying, distribution or use of 
the contents of this information is prohibited. If you have received 
this email in error, please notify NightHawk Radiology Services 
immediately by forwarding message to postmaster at nighthawkrad.net and 
destroy all electronic and hard copies of the communication, including 
attachments.


From gordan at bobich.net  Thu Apr 23 16:04:50 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 23 Apr 2009 17:04:50 +0100
Subject: [Linux-cluster] GFS2 performance on large files
In-Reply-To: <49F08822.1090104@nighthawkrad.net>
References: <1240441905.19981.19.camel@phobos.andyw.net>	<58aa8d780904221637m2aa93544q1432e22e8b358f8e@mail.gmail.com>	<1240465549.19981.28.camel@phobos.andyw.net>	<00eaf66dfa170d255e8dd4d63e94cc52@localhost>	<49F06883.5010000@nighthawkrad.net>	<390883fef8a5add08e84ca9d9aa9e144@localhost>	<49F07019.7010703@nighthawkrad.net>	<12ca2dc9875ac0c72cbb92e4bdc85dd3@localhost>
	<49F08822.1090104@nighthawkrad.net>
Message-ID: <84cc569851beeec402362db5840a55a1@localhost>

On Thu, 23 Apr 2009 17:24:18 +0200, Christopher Smith
<csmith at nighthawkrad.net> wrote:

>>> Don't get me wrong, I'm 100% behind using COTS stuff wherever possible,

>>> and setups with DRBD, et al, have worked very well for us in several 
>>> locations.  But there are some situations where it just doesn't (eg:
SAN
>>>
>>> LUNs shared between multiple servers - unless you want to forego the 
>>> performance benefits of write caching and DIY with multiple machines, 
>>> DRBD and iscsi-target).
>> 
>> I'm not sure write-caching is that big a deal - your SAN will be caching
>> all the writes anyway. Granted, the cache will be about 0.05ms further
>> away than it would be on a local controller, but then again, the
>> clustering overheads will relegate that into the realm of irrelevance.
>> I have yet to see a shared-SAN file system that doesn't introduce
>> performance penalties big enough to make the ping time to SAN a drop
>> in the ocean.
> 
> I was actually thinking of the DIY-SAN scenario.  Eg: you get a couple 
> of 2U machines with a few TB of internal disk, mirror them with DRBD, 
> then export the disk as an iSCSI target.  Setup something (we used 
> heartbeat) to failover between the two and voila, you have your own 
> redundant iSCSI SAN.
> 
> Unfortunately you then can't get the best benefit from the dirt cheap 
> gigabytes of RAM you can stuff into those machines for write caching, 
> since there's no way to synchronise between the two - so if one machines 
> dies there's data loss.

Using replication protocol B instead of C in DRBD offsets some of that.

> The same applies if you want to DIY an SMB or NFS NAS - either no write 
> caching, or a high risk of data corruption.

It depends on how much of your stack is fsync() safe, and even then
you'll still get data loss if the client machine crashes (everything
cached there gets lost), the most fsync() will buy you is a FS consistent
enough for a journal replay/rollback. The only way to ensure 100% recovery
is to make sure that any failed transactions can be replayed from the very
top of the application stack.

>> Just being a devil's advocate. ;)
> 
> Me too, to a degree.  We have a couple of SANs, primarily to keep 
> higher-ups feeling warm and fuzzy, and I'm not convinced any of them 
> have delivered anything close to proportionally better performance and 
> reliability than something we could have built ourselves.
> 
> With that said, I wouldn't want to be the guy who (for example) DIYed an 
> NFS NAS to run crtical Oracle DBs on, when Oracle support comes back 
> with "until you're running on supported storage, we won't help you with 
> your Oracle problems".

Database on NFS? That's just asking for trouble any way you look at it.

The point is that people who are likely to DIY the solution are the ones
that will do it for performance reasons before cost reasons, and they
are also the ones that are likely to be more knowledgeable about all
the components in the application stack than the vendor support are
ever going to. When something goes disastrously wrong, passing the buck
to the vendor is usually not going to help one's job prospects or
reputation - fixing the problem will. The more opaque the components
in the stack are, the lower the chance that it can actually be fixed
in a timely manner.

On a separate note, I think that we have strayed off the topic
sufficiently far to be told off by the list admins, so I'm going
to withdraw from this thread.

Gordan


From andrew at ntsg.umt.edu  Thu Apr 23 18:02:37 2009
From: andrew at ntsg.umt.edu (Andrew A. Neuschwander)
Date: Thu, 23 Apr 2009 12:02:37 -0600
Subject: [Linux-cluster] nfs/smb slow on gfs2
Message-ID: <49F0AD3D.5060406@ntsg.umt.edu>

I have a local only (lock_nolock) gfs2 filesystem which is served to 
clients via smb and nfs v3. Crawling around the filesystem is extremely 
slow over smb and nfs. Stat, ls and find over the network take many 
orders of magnitude longer than I'd expect.

The same tasks are very fast locally on the gfs2 filesystem. Also, for 
comparison, I created an ext3 on the same server and exported it via smb 
and nfs. The same task over nfs or samba on ext3 is nearly as fast as 
doing the task locally.

Is there any way to optimized the interaction between smb/nfs and gfs2?

I'm using gfs2 because I needed a filesystem over 8TB. I'm on CentOS5.3 
(2.6.18-128.1.6.el5).

The storage is locally attached. It is two hardware RAID6 (11 disks each 
+ 1 hotspare) arrays striped together with software raid, then managed 
with lvm2. Both the ext3 and gfs2 are on this storage. Large file 
sequential reads are fast. For example, dd if=file-on-nfs of=/dev/null 
will where file-on-nfs is an 80GB file stored on the gfs2 will max out a 
GigE connection.

Any insight or advice on speeding up these operations would be appreciated.

Thanks,
-Andrew
--
Andrew A. Neuschwander, RHCE
Systems/Software Engineer
College of Forestry and Conservation
The University of Montana
http://www.ntsg.umt.edu
andrew at ntsg.umt.edu - 406.243.6310


From ralphzukeb at gmail.com  Fri Apr 24 06:39:14 2009
From: ralphzukeb at gmail.com (Ralph Zukeb)
Date: Fri, 24 Apr 2009 07:39:14 +0100
Subject: [Linux-cluster] How often are services monitored
Message-ID: <6e4a03690904232339x3e8e47beh617a06b084774173@mail.gmail.com>

Hello!

My last question ever.

How often is /etc/init.d/ScriptName status run, and is this configurable?

Thanks a lot again,

Ralph


From gianluca.cecchi at gmail.com  Fri Apr 24 08:58:37 2009
From: gianluca.cecchi at gmail.com (Gianluca Cecchi)
Date: Fri, 24 Apr 2009 10:58:37 +0200
Subject: [Linux-cluster] expected behaviour when qdisk heuristic fails for
	both nodes
Message-ID: <561c252c0904240158v30311eb8nfc2b601fc1253341@mail.gmail.com>

I'm testing a situation where the gateway (192.168.1.1) fails for some
time (a minute or so) for both nodes of a two-node cluster with qdisk,
and qdiskd heuristic is set up with this gateway as the ping device.
fencing is provided by iLO
cluster version is what provided with rhel 5.3 and with the cman
unofficial patch released before 2.0.98-1.el5_3.1 patch
(see https://bugzilla.redhat.com/show_bug.cgi?id=485026 )
I'm going to update openais rgmanager and cman as for the 9th of April
advise, but at the moment I would like to stay with this, to simulate
a production cluster I have.
relevant parts of cluster.conf are:

        <cman expected_votes="3" two_node="0"/>

        <quorumd device="/dev/mapper/mpath3" interval="3"
label="cluquorum" log_facility="local4" log_level="7" tko="5"
votes="1">
                <heuristic interval="2" program="ping -c1 -w1
192.168.1.1" score="1" tko="3"/>
        </quorumd>

So, when the gateway becomes unreachable by both the nodes I have this
situation:

1) node2 (srv02) has the services (vip, filesystems and an Oracle RDBMS) running
2) In qdiskd.log of it I get:
Apr 23 18:06:24 srv02 qdiskd[10702]: <debug> Heuristic: 'ping -c1 -w1
192.168.1.1' missed (1/3)
Apr 23 18:06:27 srv02 qdiskd[10702]: <debug> Heuristic: 'ping -c1 -w1
192.168.1.1' missed (2/3)
Apr 23 18:06:30 srv02 qdiskd[10702]: <info> Heuristic: 'ping -c1 -w1
192.168.1.1' DOWN (3/3)
Apr 23 18:06:31 srv02 qdiskd[10702]: <notice> Score insufficient for
master operation (0/1; required=1); downgrading

In messages:
Apr 23 18:06:30 srv02 qdiskd[10702]: <info> Heuristic: 'ping -c1 -w1
192.168.1.1' DOWN (3/3)
Apr 23 18:06:31 srv02 qdiskd[10702]: <notice> Score insufficient for
master operation (0/1; required=1); downgrading
Apr 23 18:06:31 srv02 kernel: md: stopping all md devices.

and then I get the following message about the reboot (see below)
Apr 23 18:08:50 srv02 syslogd 1.4.1: restart (remote reception).
Apr 23 18:08:50 srv02 kernel: klogd 1.4.1, log source = /proc/kmsg started.


3) In qdiskd.log of node1 (srv01) I get:
Apr 23 18:06:25 srv01 qdiskd[5946]: <debug> Heuristic: 'ping -c1 -w1
192.168.1.1' missed (1/3)
Apr 23 18:06:28 srv01 qdiskd[5946]: <debug> Heuristic: 'ping -c1 -w1
192.168.1.1' missed (2/3)
Apr 23 18:06:31 srv01 qdiskd[5946]: <info> Heuristic: 'ping -c1 -w1
192.168.1.1' DOWN (3/3)
Apr 23 18:06:32 srv01 qdiskd[5946]: <notice> Score insufficient for
master operation (0/1; required=1); downgrading

In mesages:
Apr 23 18:06:31 srv01 qdiskd[5946]: <info> Heuristic: 'ping -c1 -w1
192.168.1.1' DOWN (3/3)
Apr 23 18:06:32 srv01 qdiskd[5946]: <notice> Score insufficient for
master operation (0/1; required=1); downgrading
Apr 23 18:06:32 srv01 kernel: md: stopping all md devices.


and then I get the following message about the reboot (see below)
Apr 23 18:08:50 srv01 syslogd 1.4.1: restart (remote reception).
Apr 23 18:08:50 srv01 kernel: klogd 1.4.1, log source = /proc/kmsg started.

4) from iLO logs of the two nodes, it seems they fenced each other
srv02 iLO log (iLO is 1 hour behind)
Informational iLO 2 04/23/2009 17:07 04/23/2009 17:07 1 Server power restored.
Caution iLO 2 04/23/2009 17:07 04/23/2009 17:07 1 Server reset.

srv01 iLO log (iLO is 1 hour behind)
Informational iLO 2 04/23/2009 17:06 04/23/2009 17:06 1 Server power restored.
Caution iLO 2 04/23/2009 17:06 04/23/2009 17:06 1 Server reset.

the final setup is that srv01 starts a little before srv02, the
gateway is up at this time, they form the quorum because are both up,
and then also qdisk comes available and they reach 3 votes.

the question is:
is it reciprocal fencing the expected behaviour in this case? Or
should they remain both in a sort of waiting state till the network
becomes available again and
writing a sort of
 Cluster is not quorate.  Refusing connection.
message?
What exactly means "downgrading" inside the phrase "Score insufficient
for master operation (0/1; required=1); downgrading"?  That it will
stoip all the services or that it will be fenced?

Based on my cluster.conf, is it correct to say that in my case if
network gateway remains unreachable for something between 7 and 9
seconds I will get this behaviour?
Any suggestions?

Thanks
Gianluca


From gianluca.cecchi at gmail.com  Fri Apr 24 11:15:03 2009
From: gianluca.cecchi at gmail.com (Gianluca Cecchi)
Date: Fri, 24 Apr 2009 13:15:03 +0200
Subject: [Linux-cluster] Re: Info on lvm setup for cluster without clvmd
In-Reply-To: <561c252c0904080819m542212f8me04de1d73db15a77@mail.gmail.com>
References: <561c252c0904080819m542212f8me04de1d73db15a77@mail.gmail.com>
Message-ID: <561c252c0904240415x5d8613d3pa373f368542f6a64@mail.gmail.com>

On Wed, 8 Apr 2009 19:37:51 +0200 Nemeth, Norber wrote:
> Have you checked this? : http://sources.redhat.com/cluster/wiki/LVMFailover
Thanks, I then checked it and also the "official" RH EL knowledge base at
http://kbase.redhat.com/faq/docs/DOC-3068
that describes just the same things.

I've done some tests and it seems to work quite well.
only thing I would post a bugzilla entry because there is a "unhappy"
check inside /usr/share/cluster/lvm.sh
so that if you edit on a node the file lvm.conf (for example I edited
it to set activation=1 for more debugging)
then you will not be able to relocate an HALVM based service on it (at
least till your next kernel update....)
The code is:

        # Next, we need to ensure that their initrd has been updated
        # If not, the machine could boot and activate the VG outside
        # the control of rgmanager
        ##
        # Fixme: we might be able to perform a better check...
        if [ "$(find /boot -name *.img -newer /etc/lvm/lvm.conf)" == "" ]; then
                ocf_log err "HA LVM:  Improper setup detected"
                ocf_log err "- initrd image needs to be newer than lvm.conf"
                return $OCF_ERR_GENERIC
        fi

Apart from the unhappy check itself, it seems to not write the second
debug line in /var/log/messages.....
In fact I only get
Apr 24 13:01:19 orastud2 clurgmgrd[6809]: <notice> Starting stopped
service service:DWHSRV
Apr 24 13:01:19 orastud2 clurgmgrd: [6809]: <err> HA LVM:  Improper
setup detected
Apr 24 13:01:19 orastud2 clurgmgrd[6809]: <notice> start on lvm
"DWH_APPL" returned 1 (generic error)
Apr 24 13:01:19 orastud2 clurgmgrd[6809]: <warning> #68: Failed to
start service:DWHSRV; return value: 1
Apr 24 13:01:19 orastud2 clurgmgrd[6809]: <notice> Stopping service
service:DWHSRV

and not the message regarding
- initrd image needs to be newer than lvm.conf

I presume that the leading "minus" could be the problem, as if I
change lvm.sh and put
ocf_log err "initrd image needs to be newer than lvm.conf"

at least corectly I get
Apr 24 13:13:19 orastud2 clurgmgrd[6809]: <notice> Starting stopped
service service:DWHSRV
Apr 24 13:13:20 orastud2 clurgmgrd: [6809]: <err> HA LVM:  Improper
setup detected
Apr 24 13:13:20 orastud2 clurgmgrd: [6809]: <err> initrd image needs
to be newer than lvm.conf
Apr 24 13:13:20 orastud2 clurgmgrd[6809]: <notice> start on lvm
"DWH_APPL" returned 1 (generic error)
Apr 24 13:13:20 orastud2 clurgmgrd[6809]: <warning> #68: Failed to
start service:DWHSRV; return value: 1
Apr 24 13:13:20 orastud2 clurgmgrd[6809]: <notice> Stopping service
service:DWHSRV

At this point it means that the second line should change for all the
other tests inside the script......

You can test this using the touch command on lvm.conf for example and
trying a relocation of a service to that node.
Then if you touch any .img file inside /boot directory, you are able
to relocate again....
umh...

Thanks anyway for the original pointer....

Gianluca
On Wed, Apr 8, 2009 at 5:19 PM, Gianluca Cecchi
<gianluca.cecchi at gmail.com> wrote:
> Hello,
> I would like to setup a two-node cluster where I will have some
> services relying on filesystems on lvm resources.
> I'm using rh el 5U3 but I only have entitlements for RHEL Clustering
> and not for Cluster-Storage, so that I cannot use clvmd as in other
> clusters I set up previously.
> I don't need GFS so I think this is the correct setup for me, also
> from a legal point of view
> From a documentation point of view I see:
> Note:
> Shared storage for use in Red Hat Cluster Suite requires that you be
> running the cluster
> logical volume manager daemon (clvmd) or the High Availability Logical Volume
> Management agents (HA-LVM). If you are not able to use either the
> clvmd daemon or
> HA-LVM for operational reasons or because you do not have the correct
> entitlements, you
> must not use single-instance LVM on the shared disk as this may result
> in data corruption.
> If you have any concerns please contact your Red Hat service representative.
>
> I suppose that with HA-LVM here we are talking about lvm.sh script
> (with the other two lvm_by_lv.sh and lvm_by_vg.sh referred into it)
> inside /usr/share/cluster/ directory.
>
> At the moment the cluster is not formed at all; I only have setup a
> basic cluster.conf without the lvm resources in it.
> I have setup volume groups and logical volumes on one node. The disks
> are seen by the other node too, but correctly it is not seeing the LVM
> parts
> In lvm.conf of both I have
> locking_type = 1
> and I think it should remain this way in my setup, correct?
>
> Which is the correct approach in my situation if I want to go straight
> with command line and not graphical tools?
> In old days when lvm was not cluster-aware I had to:
> vgchange -an VG on the first node and then vgchange -ay VG on the
> second to let it create the devices and the lvm cache for the first
> time after creation.
>
> After this, in lvm.sh script it seems I have to populate the
> "activation" section filling up the volume_list part, to prevent
> concurrent activation of volume groups used by the cluster.
>
> Any good documentation reference for this? It seems I didn't find anything....
>
> I thank all the gui creators (Conga and system-config-cluster), and I
> think there is a good audience for them, but I would like to be able
> at least to do it manually too.... I think it helps very much to
> understand internal mechanisms and eventually debugging when problems
> arise...
>
> Thanks in advance,
> Gianluca
>


From mgrac at redhat.com  Fri Apr 24 11:24:01 2009
From: mgrac at redhat.com (=?ISO-8859-1?Q?Marek_=27marx=27_Gr=E1c?=)
Date: Fri, 24 Apr 2009 13:24:01 +0200
Subject: [Linux-cluster] Fencing with Dell Remote Access Cards
In-Reply-To: <BAY117-W48BC3877335411EAC2DE52A2740@phx.gbl>
References: <BAY117-W48BC3877335411EAC2DE52A2740@phx.gbl>
Message-ID: <49F1A151.6000901@redhat.com>

Hi,

Phil Jones wrote:
> Can someone tell me the proper syntax for the fencedevices section of 
> the cluster.conf file using DRAC5?

fence_drac5 uses same standard options as other fence agents. So most 
important for you are:

ipaddr="name of machine"
login="login"
passwd="password"
secure="1"   [ existence of 'secure' itself should be enough to use ssh 
instead of telnet ]

Currently I'm working on updating manual pages for all new fence agents. 
They should be ready soon

m,


From mgrac at redhat.com  Fri Apr 24 11:31:54 2009
From: mgrac at redhat.com (=?ISO-8859-1?Q?Marek_=27marx=27_Gr=E1c?=)
Date: Fri, 24 Apr 2009 13:31:54 +0200
Subject: [Linux-cluster] DRAC5 fencing problem with Redhat 5.3 cluster
In-Reply-To: <55bfc63d0904221000i6f2804f7t3586a6087ca6b407@mail.gmail.com>
References: <55bfc63d0904221000i6f2804f7t3586a6087ca6b407@mail.gmail.com>
Message-ID: <49F1A32A.7090705@redhat.com>

Dan Hayes wrote:
> I have 8 Dell servers all loaded with Redhat 5.3.  The machines have 
> an external and internal IP address.  The cluster is configured on the 
> internal ip address.
>
> The cluster appears to be working fine except for fencing.  I try to 
> manually fence a node, but get a connection error
> [root at w002 ~]# fence_node lb001.domain.com <http://lb001.domain.com>
> agent "fence_drac5" reports: Unable to connect/login to fencing device
>
> I was able to get the fence_drac5 command to work directly by adding 
> the "-x" option for using SSH.  It says the connection times out, but 
> it does shut down the other machine
> [root at w002 ~]# fence_drac5 -a 165.289.178.221 -l root -p pass -x 
> lb001.domain.com <http://lb001.domain.com>
> Connection timed out
>
> Why does the connection time out?  And how do I add the "-x" option to 
> the cluster.conf file so that the fencing agent can connect?
Can you send me verbose output (-v) ? It is possible that the machine 
needs more time than POWER_TIMEOUT (20 seconds) or output of status is 
not parsed correctly ('Starting machine' instead of ON/OFF).

-x is equal to option secure="1" (we don't care about inserted value)

m,


From m.radzewicz at tech.crmedia.pl  Fri Apr 24 15:32:21 2009
From: m.radzewicz at tech.crmedia.pl (=?ISO-8859-2?Q?Miko=B3aj_Radzewicz?=)
Date: Fri, 24 Apr 2009 17:32:21 +0200
Subject: [Linux-cluster] accessing GFS with fail of one cman
Message-ID: <49F1DB85.3070502@tech.crmedia.pl>

hello,
I'm setting two node cluster but for now I need only one of them...
I have configured the cluster for two nodes and with "two node" option
but the cluster doesn't want to start... (I'm not able to turn the cman
demon on the second cluster node). It is some why to mount the GFS file
system locally with cman demon or without it? I have tried to mount it
with lock_nolock option but it doesn't help. I have below error:

/sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device

I found out that there is no problem with mounting it when the cluster
start properly(and later one of node will die) but what if I need access
to resource and I am not able to get the second cman working??? Please
help me to solve this issue!

Is it some way to made 4-nodes cluster to keep on working when 2 of its
member will fail?


Is it generally some way to access files on the GFS with one single server?
-- 
Miko?aj


From dhayes501 at gmail.com  Fri Apr 24 16:13:07 2009
From: dhayes501 at gmail.com (Dan Hayes)
Date: Fri, 24 Apr 2009 11:13:07 -0500
Subject: [Linux-cluster] DRAC5 fencing problem with Redhat 5.3 cluster
In-Reply-To: <49F1A32A.7090705@redhat.com>
References: <55bfc63d0904221000i6f2804f7t3586a6087ca6b407@mail.gmail.com> 
	<49F1A32A.7090705@redhat.com>
Message-ID: <55bfc63d0904240913gd2ed864k85044396b666a035@mail.gmail.com>

adding secure="1" to cluster.conf fixed the problem! thank you!

On Fri, Apr 24, 2009 at 6:31 AM, Marek 'marx' Gr?c <mgrac at redhat.com> wrote:

> Dan Hayes wrote:
>
>> I have 8 Dell servers all loaded with Redhat 5.3.  The machines have an
>> external and internal IP address.  The cluster is configured on the internal
>> ip address.
>>
>> The cluster appears to be working fine except for fencing.  I try to
>> manually fence a node, but get a connection error
>> [root at w002 ~]# fence_node lb001.domain.com <http://lb001.domain.com>
>> agent "fence_drac5" reports: Unable to connect/login to fencing device
>>
>> I was able to get the fence_drac5 command to work directly by adding the
>> "-x" option for using SSH.  It says the connection times out, but it does
>> shut down the other machine
>> [root at w002 ~]# fence_drac5 -a 165.289.178.221 -l root -p pass -x
>> lb001.domain.com <http://lb001.domain.com>
>> Connection timed out
>>
>> Why does the connection time out?  And how do I add the "-x" option to the
>> cluster.conf file so that the fencing agent can connect?
>>
> Can you send me verbose output (-v) ? It is possible that the machine needs
> more time than POWER_TIMEOUT (20 seconds) or output of status is not parsed
> correctly ('Starting machine' instead of ON/OFF).
>
> -x is equal to option secure="1" (we don't care about inserted value)
>
> m,
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090424/b5652bd7/attachment.htm>

From m.radzewicz at tech.crmedia.pl  Fri Apr 24 17:10:13 2009
From: m.radzewicz at tech.crmedia.pl (=?ISO-8859-2?Q?Miko=B3aj_Radzewicz?=)
Date: Fri, 24 Apr 2009 19:10:13 +0200
Subject: [Linux-cluster] accessing GFS with fail of one cman
In-Reply-To: <49F1DB85.3070502@tech.crmedia.pl>
References: <49F1DB85.3070502@tech.crmedia.pl>
Message-ID: <49F1F275.9030404@tech.crmedia.pl>

so although i managed to set run this cluster - i don't know why i
didn't try this earlier, i am still not able to mount the GFS, I am
still geting the same error message:
/sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device

although the device is seen....

Miko?aj Radzewicz pisze:
> hello,
> I'm setting two node cluster but for now I need only one of them...
> I have configured the cluster for two nodes and with "two node" option
> but the cluster doesn't want to start... (I'm not able to turn the cman
> demon on the second cluster node). It is some why to mount the GFS file
> system locally with cman demon or without it? I have tried to mount it
> with lock_nolock option but it doesn't help. I have below error:
> 
> /sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device
> 
> I found out that there is no problem with mounting it when the cluster
> start properly(and later one of node will die) but what if I need access
> to resource and I am not able to get the second cman working??? Please
> help me to solve this issue!
> 
> Is it some way to made 4-nodes cluster to keep on working when 2 of its
> member will fail?
> 
> 
> Is it generally some way to access files on the GFS with one single server?


-- 

Miko?aj


From jumanjiman at gmail.com  Fri Apr 24 17:14:08 2009
From: jumanjiman at gmail.com (Paul Morgan)
Date: Fri, 24 Apr 2009 17:14:08 +0000
Subject: [Linux-cluster] accessing GFS with fail of one cman
In-Reply-To: <49F1F275.9030404@tech.crmedia.pl>
References: <49F1DB85.3070502@tech.crmedia.pl><49F1F275.9030404@tech.crmedia.pl>
Message-ID: <259837434-1240593291-cardhu_decombobulator_blackberry.rim.net-1027547547-@bxe1239.bisx.prod.on.blackberry>

Do you have a /dev/sdf?
And if so does it have a gfs or gfs2 filesystem, or is the filesystem on /dev/sdf1 (a partition)?

If in doubt, use gfs_tool &/or gfs2_tool to examine the sb (superblock) of the device you're trying to mount.

Hth,
-paul

-----Original Message-----
From: Miko?aj Radzewicz <m.radzewicz at tech.crmedia.pl>

Date: Fri, 24 Apr 2009 19:10:13 
To: linux clustering<linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] accessing GFS with fail of one cman


so although i managed to set run this cluster - i don't know why i
didn't try this earlier, i am still not able to mount the GFS, I am
still geting the same error message:
/sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device

although the device is seen....

Miko?aj Radzewicz pisze:
> hello,
> I'm setting two node cluster but for now I need only one of them...
> I have configured the cluster for two nodes and with "two node" option
> but the cluster doesn't want to start... (I'm not able to turn the cman
> demon on the second cluster node). It is some why to mount the GFS file
> system locally with cman demon or without it? I have tried to mount it
> with lock_nolock option but it doesn't help. I have below error:
> 
> /sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device
> 
> I found out that there is no problem with mounting it when the cluster
> start properly(and later one of node will die) but what if I need access
> to resource and I am not able to get the second cman working??? Please
> help me to solve this issue!
> 
> Is it some way to made 4-nodes cluster to keep on working when 2 of its
> member will fail?
> 
> 
> Is it generally some way to access files on the GFS with one single server?


-- 

Miko?aj


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Fri Apr 24 17:29:35 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:29:35 -0400
Subject: [Linux-cluster] prevent ping-pong
In-Reply-To: <fb143d260904091535l26030d31nf1679075ba6ce7cc@mail.gmail.com>
References: <fb143d260904091535l26030d31nf1679075ba6ce7cc@mail.gmail.com>
Message-ID: <1240594175.27154.14.camel@ayanami>

On Fri, 2009-04-10 at 00:35 +0200, Pavlos Parissis wrote:
> Hi,
> 
> Is there a way to configure cluster to not ping-pong a service more
> than N times?
> 
> I have an application where sometimes cluster gets a no zero status on
> the check and keeps failover back and forth the service.
> I want the cluster to try only N times to fail over the service and
> then disables it if it can't start it or gets a no zero status on
> checks after N times.

On most-current RHEL 4 + errata, you can do a 'max_restarts' combined
with 'restart_expire_time' to achieve a maximum restart before failover,
but there's no current limit to the # of restarts / failovers before a
disable.

-- Lon


From lhh at redhat.com  Fri Apr 24 17:30:47 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:30:47 -0400
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
Message-ID: <1240594247.27154.16.camel@ayanami>

On Tue, 2009-04-14 at 12:13 -0600, Spencer Parker wrote:
> I am running a MySQL cluster using cluster services and I have one
> issue when it comes to NFS.  The MySQl services run fine until add in
> an NFS mount.  The NFS mount is where all of the MySQl databases live
> at.  I can get the NFS share to mount properly on the cluster
> machines, but the log files keep telling it errors out.  Once it
> errors out the service then stops.  I have tried restarting the
> service, but that has it remounting the share over the top of the old
> one.  It never unmounts the NFS share upon failure.

Can you paste your service configuration somewhere so I can look at it?


-- Lon


From lhh at redhat.com  Fri Apr 24 17:31:24 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:31:24 -0400
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <1240594247.27154.16.camel@ayanami>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
	<1240594247.27154.16.camel@ayanami>
Message-ID: <1240594284.27154.18.camel@ayanami>

On Fri, 2009-04-24 at 13:30 -0400, Lon Hohberger wrote:
> On Tue, 2009-04-14 at 12:13 -0600, Spencer Parker wrote:
> > I am running a MySQL cluster using cluster services and I have one
> > issue when it comes to NFS.  The MySQl services run fine until add in
> > an NFS mount.  The NFS mount is where all of the MySQl databases live
> > at.  I can get the NFS share to mount properly on the cluster
> > machines, but the log files keep telling it errors out.  Once it
> > errors out the service then stops.  I have tried restarting the
> > service, but that has it remounting the share over the top of the old
> > one.  It never unmounts the NFS share upon failure.
> 
> Can you paste your service configuration somewhere so I can look at it?

Nevermind, it was farther down the thread... got it.

-- Lon


From lhh at redhat.com  Fri Apr 24 17:34:50 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:34:50 -0400
Subject: [Linux-cluster] NFS mount problem..
In-Reply-To: <c7a040fa0904141246p3cc99cf1w297602139c25e84d@mail.gmail.com>
References: <c7a040fa0904141113u27bcfdc9u4b2ffbd9892d83af@mail.gmail.com>
	<49E4D48A.7060307@sivell.com>
	<c7a040fa0904141126s2dbeaa08r7994bf3227b27da4@mail.gmail.com>
	<49E4D65F.2020905@sivell.com>
	<c7a040fa0904141141t2d00b744s7f0ee0f5f5bb2210@mail.gmail.com>
	<c7a040fa0904141246p3cc99cf1w297602139c25e84d@mail.gmail.com>
Message-ID: <1240594490.27154.22.camel@ayanami>

On Tue, 2009-04-14 at 13:46 -0600, Spencer Parker wrote:
> I found my problem.  It was the trailing slash on /mnt/mysql

>                                 <netfs export="/vol/test_mysql/mysql"
>         exportpath="/vol/test_mysql/mysql" force_unmount="1"
>         fstype="nfs" host="netapp" mountpoint="/mnt/mysql/"
>         name="mysql_data" nfstype="nfs"
>         options="defaults,rw,async,nfsvers=3,mountvers=3,proto=tcp"/>

exportpath doesn't appear to mean anything in the resource agent, by the
way (though it won't cause any problems either).

-- Lon


From lhh at redhat.com  Fri Apr 24 17:37:41 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:37:41 -0400
Subject: [Linux-cluster] Failback option on failover domains
In-Reply-To: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
References: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
Message-ID: <1240594661.27154.24.camel@ayanami>

On Wed, 2009-04-15 at 19:12 +0200, Pavlos Parissis wrote:
> Hello,
> 
> Does anyone know if the failback option is available on RedHat 4.X
> systems?
> 
> I went through all docs [1] and it seams to be that is only available
> on RedHat 5.X version,
> am I right?

It's in 4.y.latest too.

-- Lon


From lhh at redhat.com  Fri Apr 24 17:41:21 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:41:21 -0400
Subject: [Linux-cluster] Service not relocated after successful fence
	of its owner
In-Reply-To: <1240378858.15385.9.camel@localhost>
References: <1240378858.15385.9.camel@localhost>
Message-ID: <1240594881.27154.28.camel@ayanami>

On Wed, 2009-04-22 at 01:40 -0400, Maykel Moya wrote:
> I still can't get my service automatically relocated after
> _successfully_ fencing its owner node.
> 
> I have a 4 node cluster n{1,2,3,4} and 4 services s{1,2,3,4}. My fence
> device uses 'off' as action, so a successful fence means the node is
> off.
> 
> Say, s4 running on n4 and I do a 'ip link set eth0 down' on n4. n4 get
> successfully fenced but s4 is never relocated to one of the other
> available nodes which means s4 is not available.
> 
> Find attached the cluster.conf.

Conf looks okay, what do the logs say?  Any other errors?  It looks like
things should be working correctly.

'cman_tool services' and 'cman_tool nodes' output would be helpful,
too.  

-- Lon


From lhh at redhat.com  Fri Apr 24 17:47:41 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:47:41 -0400
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
Message-ID: <1240595261.27154.36.camel@ayanami>

On Wed, 2009-04-22 at 10:55 +0100, Ralph Zukeb wrote:
> Hello again! Thanks for the tips so far.
> 
> How can I ensure that the node holding the services and resources in a
> two node cluster keeps holding them if the two nodes cannot see each
> other?
> 
> I am not using a quorum disk.

You can't - at least not easily.

You could use a complicated qdiskd heuristic to do this, or you could
simply use a simple application which bumps quorum counts based on
resource location(s) (such an app doesn't really exist today, but is not
difficult to implement).

Basically, the infrastructure (membership/locking/fencing/etc.) is a
separate entity which is not tied to service management.  Put another
way, while the rgmanager relies on events from CMAN, the reverse is not
true.

-- Lon


From lhh at redhat.com  Fri Apr 24 17:49:44 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:49:44 -0400
Subject: [Linux-cluster] Two node cluster without quorum disk
In-Reply-To: <6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
References: <6e4a03690904220255l4b30f983rcfb23069f48ebc6a@mail.gmail.com>
	<200904221529.34421.mrugeshkarnik@gmail.com>
	<6e4a03690904220314n10f5bbcegf14bc9e7df749a@mail.gmail.com>
Message-ID: <1240595384.27154.40.camel@ayanami>

On Wed, 2009-04-22 at 11:14 +0100, Ralph Zukeb wrote:

> I tried this, but when I killed the switch for the cluster traffic,
> BOTH nodes got fenced! Can I avoid this?

Put fencing on the same network as cluster traffic.

Two_node is designed to operate this way - it protects against cable
pulls and so forth.  Separating it while not using a fencing device
which only permits one login at a time (or using multiple fencing
devices) can end you with a dead cluster.

-- Lon


From lhh at redhat.com  Fri Apr 24 17:55:42 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Apr 2009 13:55:42 -0400
Subject: [Linux-cluster] Restart children but relocate parent
In-Reply-To: <6e4a03690904230058v7d853c12qfc0f98ef3fc2bcde@mail.gmail.com>
References: <6e4a03690904230058v7d853c12qfc0f98ef3fc2bcde@mail.gmail.com>
Message-ID: <1240595742.27154.47.camel@ayanami>

On Thu, 2009-04-23 at 08:58 +0100, Ralph Zukeb wrote:
> Hello,
> 
> I have two init scripts:
>  /etc/init.d/noBugs
>  /etc/init.d/buggy
> 
> buggy is dependent on noBugs.
> 
> If buggy dies, I want to restart it. If noBugs dies, I want to
> relocate both noBugs and buggy to the other node.
> 
> How can I represent this? Would a failover domain achieve this?

<resources>
  <script file="/etc/init.d/noBugs" name="noBugs"/>
  <script file="/etc/init.d/buggy" name="buggy"/>
</resources>

<service name="the_awesome_service">
  <script ref="noBugs"/>
  <script ref="buggy" __independent_subtree="1" />
</service>

-- Lon


From arwin.tugade at csun.edu  Fri Apr 24 19:36:43 2009
From: arwin.tugade at csun.edu (Arwin L Tugade)
Date: Fri, 24 Apr 2009 12:36:43 -0700
Subject: [Linux-cluster] nfs/smb slow on gfs2
In-Reply-To: <49F0AD3D.5060406@ntsg.umt.edu>
References: <49F0AD3D.5060406@ntsg.umt.edu>
Message-ID: <6708F96BBF31F846BFA56EC0AE37D62281E8B38F10@CSUN-EX-V01.csun.edu>

For me at least on gfs, I'm using lock_dlm and I was getting like 1.3-1.5 MB/s transfer rates on samba.  I know you said you using lock_nolock but I say give ping_pong a shot:

http://wiki.samba.org/index.php/Ping_pong


Another thing to try to see if locking is the culprit is in smb.conf set posix locking = no and try a transfer.  That's why my transfer rates were so bad.  With ping_pong I noticed 94-100 locks per seconds.  Turns out gfs_controld has 100 locks per/s as the default and can be lifted by -l0 flag.  With that on, I then got like 25-30k locks per/s and my transfer rates went up with samba to like 8 - 9 MB/s (on 100mbit lan).  I also tried the plock optimization that I believe was introduced not to many versions ago and the rate when up to like 100k and transfer rate got a little better.  It's probably best to put some lower limit as a side effect is more network load and lock coordination amongst the nodes, I'm trying to find a good balance with my setup.  I tried all this in a test environment btw and I'm still tweaking/learning about it :).  I mean that was my personal experience with gfs (not gfs2, don't know if it's the same).  Hope this helps.

Regards,
Arwin

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Andrew A. Neuschwander
Sent: Thursday, April 23, 2009 11:03 AM
To: linux clustering
Subject: [Linux-cluster] nfs/smb slow on gfs2

I have a local only (lock_nolock) gfs2 filesystem which is served to 
clients via smb and nfs v3. Crawling around the filesystem is extremely 
slow over smb and nfs. Stat, ls and find over the network take many 
orders of magnitude longer than I'd expect.

The same tasks are very fast locally on the gfs2 filesystem. Also, for 
comparison, I created an ext3 on the same server and exported it via smb 
and nfs. The same task over nfs or samba on ext3 is nearly as fast as 
doing the task locally.

Is there any way to optimized the interaction between smb/nfs and gfs2?

I'm using gfs2 because I needed a filesystem over 8TB. I'm on CentOS5.3 
(2.6.18-128.1.6.el5).

The storage is locally attached. It is two hardware RAID6 (11 disks each 
+ 1 hotspare) arrays striped together with software raid, then managed 
with lvm2. Both the ext3 and gfs2 are on this storage. Large file 
sequential reads are fast. For example, dd if=file-on-nfs of=/dev/null 
will where file-on-nfs is an 80GB file stored on the gfs2 will max out a 
GigE connection.

Any insight or advice on speeding up these operations would be appreciated.

Thanks,
-Andrew
--
Andrew A. Neuschwander, RHCE
Systems/Software Engineer
College of Forestry and Conservation
The University of Montana
http://www.ntsg.umt.edu
andrew at ntsg.umt.edu - 406.243.6310

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From sunhux at gmail.com  Sat Apr 25 05:39:45 2009
From: sunhux at gmail.com (sunhux G)
Date: Sat, 25 Apr 2009 13:39:45 +0800
Subject: [Linux-cluster] Autostart of cman/fencing,
	qdiskd & cluster services when server is 
	rebooted : can't see/mount SAN disk partition occasionally
Message-ID: <60f08e700904242239u3bc2d261l68a893cd8704a68b@mail.gmail.com>

Hi,


My colleague has difficulty with this : he said he could only start up the
various
Redhat (RHES 5.1) cluster services manually after the Redhat LInux OS is
rebooted/booted up :

a) cd /etc/init.d
b) ./cman start
c) ./clvmd ...
d) ./qdiskd ...
e) ./rgmanager ...

I'm not sure if the sequence of d or e should be interchanged.

Anyway, we would like all the above to be automatic started when the
pair of linux servers (call it lnx3 and lnx4) are booted up.

Q1 :
Would the following actually help or what we're missing  :

   1. Enable cluster software to start upon reboot. At each node run
   /sbin/chkconfig as follows:

   # *chkconfig --level 2345 rgmanager on*
   # *chkconfig --level 2345 gfs on*
   # *chkconfig --level 2345 clvmd on*
   # *chkconfig --level 2345 cman on*


We're not using GFS as the Netapp SAN partition is probably mounted as
Unix file system (UFS)


Q2:
Also, occasionally, when both servers lose connection to the network
together, the SAN partition would be lost and despite rebooting the
servers and running those commands given in a-e above on the primary
node alone (or both nodes), we are not able to mount the SAN partition.
Sometimes after waiting for about ten minutes, it's able to mount.  So
what did we miss, or is this just a matter of waiting or between the a-e
services, we need to pause for a while before proceeding to the next
command?

Q3:
How do I know if our current SAN partition is GFS or not?  (As the
previous person has left after setting up the cluster which I felt is
still missing something)

Q4:
How can I convert it to GFS if it's not on GFS yet?


Q5:
We're not using Oracle RAC, how can I set up Oracle instances as
part of the cluster failover service?


Q6:
We have a pair of cross cables (as fencing) connecting NIC ports of
lnx3 & lnx4 servers : at all times, only one of the fencing NIC port's LED
on lnx3's is on while on lnx4, it's also one of the NIC port's LED is on.
Is this normal?


Sorry, rather green to this, so need step by step instruction

Thanks
U
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090425/28baa361/attachment.htm>

From vikash at netvigator.com  Sat Apr 25 08:26:06 2009
From: vikash at netvigator.com (Vikash Khatuwala)
Date: Sat, 25 Apr 2009 16:26:06 +0800
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <7.1.0.9.2.20090421105051.05f51860@netvigator.com>
References: <200904201549.n3KFndL0008930@mx3.redhat.com>
	<64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local>
	<7.1.0.9.2.20090421105051.05f51860@netvigator.com>
Message-ID: <200904250826.n3P8QJwo032632@mx3.redhat.com>

Hi,

Can I downgrade the lock manage from lock_dlm to no_lock? Do I need 
to un-mount the gfs partition before changing? I want to see if it 
makes any performance improvements.

Thanks,
Vikash.


At 11:18 AM 21-04-09, Vikash Khatuwala wrote:
>Hi,
>
>I am using Virtuozzo OS visualization which does not have a single 
>file for the entire VM's filesystem. All VMs are simply 
>sub-directories and OS files are stored in a common templates 
>directory which is sym linked to back to the VM's directory, so if 
>an OS file is changed inside the VM then the symlink breaks and a 
>new file is put in the VM's private directory. I cant use GFS2 
>because it is not supported by Virtuozzo. All VMs are simply running 
>web/db/ftp.
>
>So this basically means that there are a lot of symbolic links 
>(small files). The GFS has a block size of 4K so I also chose 4K as 
>my block size for my performance testing to asses the worst case 
>scenario. If I change the block size to 256K then the performance 
>difference between ext3 and GFS are minimal. Also when I migrate the 
>VM out from GFS(RAID5 SAS 15K) to ext3(single disk SATA), there is a 
>significant noticeable performance gain!
>
>Below tests are on the same disk set (5 disk RAID5 SAS 15K) with 2 
>partitions, GFS and ext3.
>Results at 4K random reads:
>GFS : about 1500K/s
>ext3 : about 7000K/s
>
>Results at 256K random reads:
>GFS : about 45000K/s
>ext3 : about 50000K/s
>
>Results at 256K sequential reads:
>GFS : over 110,000K/s (my single GB NIC maxes out)
>ext3 : over 110,000K/s (my single GB NIC maxes out)
>
>fio test file as below only rw and blocksize were changed for the 3 
>different scenarios above.
>[random-read1]
>rw=randread
>size=10240m
>directory=/vz/tmp
>ioengine=libaio
>iodepth=16
>direct=1
>invalidate=1
>blocksize=4k
>
>[random-read2]
>rw=randread
>size=10240m
>directory=/vz/tmp
>ioengine=libaio
>iodepth=16
>direct=1
>invalidate=1
>blocksize=4k
>
>Thanks,
>Vikash.
>
>
>At 01:00 AM 21-04-09, Jeff Sturm wrote:
>> > -----Original Message-----
>> > From: linux-cluster-bounces at redhat.com
>> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash
>> > Khatuwala
>> > Sent: Monday, April 20, 2009 11:23 AM
>> > To: linux-cluster at redhat.com
>> > Subject: [Linux-cluster] GFS performance.
>> >
>> > OS : CentOS 5.2
>> > FS : GFS
>>
>>Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to have some
>>performance improvements over GFS1.
>>
>> > Now I need to make a decision to go with GFS or not, clearly
>> > at 4 times less performance we cannot afford it, also it
>> > doesn't sound right so would like to find out whats wrong.
>>
>>Be careful with benchmarks, as they often do not give you a good
>>indication of real-world performance.
>>
>>Are you more concerned with latency or throughput?  Any single read will
>>almost certainly take longer to complete over GFS than EXT3.  There's
>>simply more overhead involved with any cluster filesystem.  However,
>>that's not to say you're limited as to how many reads you can execute in
>>parallel.  So the overall number of reads you can perform in a given
>>time interval may not be 4x at all (are you running a parallel
>>benchmark?)
>>
>>Jeff
>>
>>
>>--
>>Linux-cluster mailing list
>>Linux-cluster at redhat.com
>>https://www.redhat.com/mailman/listinfo/linux-cluster


From jeff.sturm at eprize.com  Sat Apr 25 19:56:27 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Sat, 25 Apr 2009 15:56:27 -0400
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <200904250826.n3P8QJwo032632@mx3.redhat.com>
References: <200904201549.n3KFndL0008930@mx3.redhat.com><64D0546C5EBBD147B75DE133D798665F02FDB8EB@hugo.eprize.local><7.1.0.9.2.20090421105051.05f51860@netvigator.com>
	<200904250826.n3P8QJwo032632@mx3.redhat.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB9C6@hugo.eprize.local>

Yes, and yes.  Use the "gfs_tool sb <device> proto no_lock" command on
an unmounted filesystem, and remount.  (Obviously, you cannot mount the
fs on more than one node after you do this.)

Jeff 

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash 
> Khatuwala
> Sent: Saturday, April 25, 2009 4:26 AM
> To: linux clustering
> Subject: RE: [Linux-cluster] GFS performance.
> 
> Hi,
> 
> Can I downgrade the lock manage from lock_dlm to no_lock? Do 
> I need to un-mount the gfs partition before changing? I want 
> to see if it makes any performance improvements.
> 
> Thanks,
> Vikash.
> 
> 
> At 11:18 AM 21-04-09, Vikash Khatuwala wrote:
> >Hi,
> >
> >I am using Virtuozzo OS visualization which does not have a 
> single file 
> >for the entire VM's filesystem. All VMs are simply 
> sub-directories and 
> >OS files are stored in a common templates directory which is 
> sym linked 
> >to back to the VM's directory, so if an OS file is changed 
> inside the 
> >VM then the symlink breaks and a new file is put in the VM's private 
> >directory. I cant use GFS2 because it is not supported by Virtuozzo. 
> >All VMs are simply running web/db/ftp.
> >
> >So this basically means that there are a lot of symbolic 
> links (small 
> >files). The GFS has a block size of 4K so I also chose 4K as 
> my block 
> >size for my performance testing to asses the worst case 
> scenario. If I 
> >change the block size to 256K then the performance 
> difference between 
> >ext3 and GFS are minimal. Also when I migrate the VM out 
> from GFS(RAID5 
> >SAS 15K) to ext3(single disk SATA), there is a significant 
> noticeable 
> >performance gain!
> >
> >Below tests are on the same disk set (5 disk RAID5 SAS 15K) with 2 
> >partitions, GFS and ext3.
> >Results at 4K random reads:
> >GFS : about 1500K/s
> >ext3 : about 7000K/s
> >
> >Results at 256K random reads:
> >GFS : about 45000K/s
> >ext3 : about 50000K/s
> >
> >Results at 256K sequential reads:
> >GFS : over 110,000K/s (my single GB NIC maxes out)
> >ext3 : over 110,000K/s (my single GB NIC maxes out)
> >
> >fio test file as below only rw and blocksize were changed for the 3 
> >different scenarios above.
> >[random-read1]
> >rw=randread
> >size=10240m
> >directory=/vz/tmp
> >ioengine=libaio
> >iodepth=16
> >direct=1
> >invalidate=1
> >blocksize=4k
> >
> >[random-read2]
> >rw=randread
> >size=10240m
> >directory=/vz/tmp
> >ioengine=libaio
> >iodepth=16
> >direct=1
> >invalidate=1
> >blocksize=4k
> >
> >Thanks,
> >Vikash.
> >
> >
> >At 01:00 AM 21-04-09, Jeff Sturm wrote:
> >> > -----Original Message-----
> >> > From: linux-cluster-bounces at redhat.com 
> >> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash 
> >> > Khatuwala
> >> > Sent: Monday, April 20, 2009 11:23 AM
> >> > To: linux-cluster at redhat.com
> >> > Subject: [Linux-cluster] GFS performance.
> >> >
> >> > OS : CentOS 5.2
> >> > FS : GFS
> >>
> >>Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to 
> have some 
> >>performance improvements over GFS1.
> >>
> >> > Now I need to make a decision to go with GFS or not, 
> clearly at 4 
> >> > times less performance we cannot afford it, also it 
> doesn't sound 
> >> > right so would like to find out whats wrong.
> >>
> >>Be careful with benchmarks, as they often do not give you a good 
> >>indication of real-world performance.
> >>
> >>Are you more concerned with latency or throughput?  Any single read 
> >>will almost certainly take longer to complete over GFS than EXT3.  
> >>There's simply more overhead involved with any cluster filesystem.  
> >>However, that's not to say you're limited as to how many 
> reads you can 
> >>execute in parallel.  So the overall number of reads you 
> can perform 
> >>in a given time interval may not be 4x at all (are you running a 
> >>parallel
> >>benchmark?)
> >>
> >>Jeff
> >>
> >>
> >>--
> >>Linux-cluster mailing list
> >>Linux-cluster at redhat.com
> >>https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 


From moya at latertulia.org  Sun Apr 26 06:19:47 2009
From: moya at latertulia.org (Maykel Moya)
Date: Sun, 26 Apr 2009 02:19:47 -0400
Subject: [Linux-cluster] Service not relocated after successful fence
	of its owner
In-Reply-To: <1240594881.27154.28.camel@ayanami>
References: <1240378858.15385.9.camel@localhost>
	<1240594881.27154.28.camel@ayanami>
Message-ID: <1240726787.12595.19.camel@localhost>

El vie, 24-04-2009 a las 13:41 -0400, Lon Hohberger escribi?:

> On Wed, 2009-04-22 at 01:40 -0400, Maykel Moya wrote:
> > I still can't get my service automatically relocated after
> > _successfully_ fencing its owner node.
> > 
> > I have a 4 node cluster n{1,2,3,4} and 4 services s{1,2,3,4}. My fence
> > device uses 'off' as action, so a successful fence means the node is
> > off.
> > 
> > Say, s4 running on n4 and I do a 'ip link set eth0 down' on n4. n4 get
> > successfully fenced but s4 is never relocated to one of the other
> > available nodes which means s4 is not available.
> > 
> > Find attached the cluster.conf.
> 
> Conf looks okay, what do the logs say?  Any other errors?  It looks like
> things should be working correctly.

The relevant part
----
Apr 26 02:08:29 e1b01 kernel: [345031.041719] dlm: closing connection to
node 4
Apr 26 02:08:29 e1b01 clurgmgrd[3880]: <debug> Membership Change Event
Apr 26 02:08:29 e1b01 clurgmgrd[3880]: <info> State change: e1b04 DOWN
Apr 26 02:08:29 e1b01 clurgmgrd[3880]: <debug> Membership Change Event
Apr 26 02:08:29 e1b01 clurgmgrd[3880]: <debug> Membership Change Event
Apr 26 02:08:29 e1b01 clurgmgrd[3880]: <debug> Membership Change Event
Apr 26 02:08:29 e1b01 fenced[3850]: e1b04 not a cluster member after 0
sec post_fail_delay
Apr 26 02:08:29 e1b01 fenced[3850]: fencing node "e1b04"
Apr 26 02:08:40 e1b01 fenced[3850]: can't get node number for node
??#010P?#010#020
Apr 26 02:08:40 e1b01 fenced[3850]: fence "e1b04" success
----

> 'cman_tool services' and 'cman_tool nodes' output would be helpful,
> too.  

It's a bit odd, clustat saying that node e1b04 is offline but service4
is owned by e1b04 and started.

e1b01:/var/log# clustat 
Cluster Status for cinfomed @ Sun Apr 26 02:09:07 2009
Member Status: Quorate

 Member Name               ID   Status
 ------ ----               ---- ------
 e1b01                         1 Online, Local, rgmanager
 e1b02                         2 Online, rgmanager
 e1b03                         3 Online, rgmanager
 e1b04                         4 Offline

 Service Name              Owner (Last)             State         
 ------- ----              ----- ------             -----         
 service:vmail1_svc        e1b01                    started       
 service:vmail2_svc        e1b04                    started       
 service:vmail3_svc        e1b03                    started       
 service:vmail4_svc        e1b04                    started       

e1b01:/var/log# cman_tool services
type             level name       id       state       
fence            0     default    00010001 none        
[1 2 3]
dlm              1     rgmanager  00010004 none        
[1 2 3]

e1b01:/var/log# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M   1404   2009-04-22 02:17:09  e1b01
   2   M   1432   2009-04-22 02:51:31  e1b02
   3   M   1412   2009-04-22 02:17:11  e1b03
   4   X   1408                        e1b04


Forgot to mention

e1b01:/var/log# lsb_release -a
No LSB modules are available.
Distributor ID:	Debian
Description:	Debian GNU/Linux 5.0.1 (lenny)
Release:	5.0.1
Codename:	lenny

e1b01:/var/log# cman_tool -V
cman_tool 2.03.09 (built Nov  3 2008 18:22:25)
Copyright (C) Red Hat, Inc.  2004-2008  All rights reserved.

e1b01:/var/log# uname -r
2.6.26-2-686

----
This is the only thing I'm missing to deploy, have tried fencing with
'reboot', with 'off', setting service recovery policy and 'relocate' and
nothing solves it. If a node goes down, the service is not migrated
after fence it.

Regards,
maykel


From gianluca.cecchi at gmail.com  Mon Apr 27 11:47:41 2009
From: gianluca.cecchi at gmail.com (Gianluca Cecchi)
Date: Mon, 27 Apr 2009 13:47:41 +0200
Subject: [Linux-cluster] dynamically reconfigure qdiskd heuristic: possible?
Message-ID: <561c252c0904270447w328acbbta758e4cd89d7a3cc@mail.gmail.com>

Is there any command for this? I'm on rh el 5.3
For example if I want change the line

<heuristic interval="2" program="ping -c1 -w1 10.4.5.250" score="1" tko="3"/>

so that it becomes
<heuristic interval="2" program="ping -c1 -w1 10.4.5.250" score="1" tko="30"/>

It seems that with
1) ccs_tool update /etc/cluster/cluster.conf
2) cman_tool version -r new_vers_number

it doesn't work dynamically....

thanks,
Gianluca


From m.radzewicz at tech.crmedia.pl  Mon Apr 27 12:40:29 2009
From: m.radzewicz at tech.crmedia.pl (=?UTF-8?B?TWlrb8WCYWogUmFkemV3aWN6?=)
Date: Mon, 27 Apr 2009 14:40:29 +0200
Subject: [Linux-cluster] accessing GFS with fail of one cman
In-Reply-To: <259837434-1240593291-cardhu_decombobulator_blackberry.rim.net-1027547547-@bxe1239.bisx.prod.on.blackberry>
References: <49F1DB85.3070502@tech.crmedia.pl><49F1F275.9030404@tech.crmedia.pl>
	<259837434-1240593291-cardhu_decombobulator_blackberry.rim.net-1027547547-@bxe1239.bisx.prod.on.blackberry>
Message-ID: <49F5A7BD.7070204@tech.crmedia.pl>

yep, there is a device. I have found out that the gfs module displayed
same errors during the system start. I updated the kernel and downloaded
the lastest rpm-s and it is ok.

thanks for helping.
Paul Morgan pisze:
> Do you have a /dev/sdf?
> And if so does it have a gfs or gfs2 filesystem, or is the filesystem on /dev/sdf1 (a partition)?
> 
> If in doubt, use gfs_tool &/or gfs2_tool to examine the sb (superblock) of the device you're trying to mount.
> 
> Hth,
> -paul
> 
> -----Original Message-----
> From: Miko?aj Radzewicz <m.radzewicz at tech.crmedia.pl>
> 
> Date: Fri, 24 Apr 2009 19:10:13 
> To: linux clustering<linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] accessing GFS with fail of one cman
> 
> 
> so although i managed to set run this cluster - i don't know why i
> didn't try this earlier, i am still not able to mount the GFS, I am
> still geting the same error message:
> /sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device
> 
> although the device is seen....
> 
> Miko?aj Radzewicz pisze:
>> hello,
>> I'm setting two node cluster but for now I need only one of them...
>> I have configured the cluster for two nodes and with "two node" option
>> but the cluster doesn't want to start... (I'm not able to turn the cman
>> demon on the second cluster node). It is some why to mount the GFS file
>> system locally with cman demon or without it? I have tried to mount it
>> with lock_nolock option but it doesn't help. I have below error:
>>
>> /sbin/mount.gfs: error mounting /dev/sdf on /mnt/tmp: No such device
>>
>> I found out that there is no problem with mounting it when the cluster
>> start properly(and later one of node will die) but what if I need access
>> to resource and I am not able to get the second cman working??? Please
>> help me to solve this issue!
>>
>> Is it some way to made 4-nodes cluster to keep on working when 2 of its
>> member will fail?
>>
>>
>> Is it generally some way to access files on the GFS with one single server?
> 
> 


-- 
Miko?aj


From robertofratelli at yahoo.com  Mon Apr 27 13:17:47 2009
From: robertofratelli at yahoo.com (Roberto Fratelli)
Date: Mon, 27 Apr 2009 06:17:47 -0700 (PDT)
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
Message-ID: <970694.34996.qm@web33308.mail.mud.yahoo.com>


Hello everyone !

I'm trying to set up a quorum disk in a 2-node redhat cluster (RHEL 4.7). Both nodes are attached to a SAN Clariion over qlogic HBA's and i'm using EMC's powerpath.

The problem i'm facing is that the disks's device names are different across the nodes. For example, the lun i elected to be my quorum disk is called '/dev/emcpowerf' at node-a and '/dev/emcpowerd' at node-b, so i can't use the device name (/dev/emcpowerXX) in my cluster.conf file. In a attempt to overcome this i've tried to use the disk label instead :

<quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677" interval="2" tko="10" votes="1"/>

But did not work also, because this way it finds '/dev/sdXX' before '/dev/emcpowerXX':

root at node-a:~ # cman_tool nodes
Node  Votes Exp Sts  Name
   0    1    0   M   /dev/sdf1
   1    1    2   M   node-a
   2    1    2   M   node-b
root at node-a:~ #


Since this '/dev/sdf' is just one of the paths i have under powerpath (it has 4 paths per HBA, 8 paths per lun total) i can't keep it this way because in a event of lun trespass (clariion does it very often) i would loose quorum. My understanding is that i need some kind of "persistent binding" so i can use Powerpath's pseudo-device '/dev/emcpowerXX', so i tried to create a UDEV rule like this:

[root at node-a RSM]# cat 75-quorum.rules
KERNEL=="emcpower[a-z]1, SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677", SYMLINK+="discodequorum%n"
[root at node-a RSM]#


Which did not worked, i think because UDEV starts way before powerpath. Does anyone here faced this problem before ? how can i keep the device names persistent across the nodes (at least for the quorum disk) ?


Below some more info on my quorum's disk:


root at node-a:~ # mkqdisk -L
mkqdisk v0.5.2
/dev/sdf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdo1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdx1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdag1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdap1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sday1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbh1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbq1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/emcpowerf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

root at node-a:~ #


[root at node-b ~]# mkqdisk -L
mkqdisk v0.5.2
/dev/sdf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdo1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdx1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdag1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdap1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sday1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbh1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbq1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/emcpowerd1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

[root at node-b ~]#


From stevan.colaco at gmail.com  Mon Apr 27 13:52:45 2009
From: stevan.colaco at gmail.com (Stevan Colaco)
Date: Mon, 27 Apr 2009 16:52:45 +0300 (AST)
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
In-Reply-To: <18505152.141240840000890.JavaMail.SYSTEM@sysadmin>
Message-ID: <32111280.161240840360468.JavaMail.SYSTEM@sysadmin>

Hello,

There are different ways one may write the persistent naming udev rules.

Instead of SYSFS{serial}, it would be better if you can use scsi_id

first try to get the scsi id of the quorum device

#/sbin/scsi_id -g -s /block/emcpowerf

you should receive a lengthy number. 

Then you can use the rule something like below 

[root at node-a RSM]# cat 75-quorum.rules
KERNEL=="emcpower[a-z]1, PROGRAM=="/sbin/scsi_id -g -s /block/emcpowerf", RESULT=="Paste the output of scsi_id here" SYMLINK+="discodequorum"


regards,
-Steve

[root at node-a RSM]#

----- Original Message -----
From: "Roberto Fratelli" <robertofratelli at yahoo.com>
To: linux-cluster at redhat.com
Sent: Monday, April 27, 2009 4:17:47 PM GMT +03:00 Iraq
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath


Hello everyone !

I'm trying to set up a quorum disk in a 2-node redhat cluster (RHEL 4.7). Both nodes are attached to a SAN Clariion over qlogic HBA's and i'm using EMC's powerpath.

The problem i'm facing is that the disks's device names are different across the nodes. For example, the lun i elected to be my quorum disk is called '/dev/emcpowerf' at node-a and '/dev/emcpowerd' at node-b, so i can't use the device name (/dev/emcpowerXX) in my cluster.conf file. In a attempt to overcome this i've tried to use the disk label instead :

<quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677" interval="2" tko="10" votes="1"/>

But did not work also, because this way it finds '/dev/sdXX' before '/dev/emcpowerXX':

root at node-a:~ # cman_tool nodes
Node  Votes Exp Sts  Name
   0    1    0   M   /dev/sdf1
   1    1    2   M   node-a
   2    1    2   M   node-b
root at node-a:~ #


Since this '/dev/sdf' is just one of the paths i have under powerpath (it has 4 paths per HBA, 8 paths per lun total) i can't keep it this way because in a event of lun trespass (clariion does it very often) i would loose quorum. My understanding is that i need some kind of "persistent binding" so i can use Powerpath's pseudo-device '/dev/emcpowerXX', so i tried to create a UDEV rule like this:

[root at node-a RSM]# cat 75-quorum.rules
KERNEL=="emcpower[a-z]1, SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677", SYMLINK+="discodequorum%n"
[root at node-a RSM]#


Which did not worked, i think because UDEV starts way before powerpath. Does anyone here faced this problem before ? how can i keep the device names persistent across the nodes (at least for the quorum disk) ?


Below some more info on my quorum's disk:


root at node-a:~ # mkqdisk -L
mkqdisk v0.5.2
/dev/sdf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdo1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdx1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdag1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdap1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sday1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbh1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbq1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/emcpowerf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

root at node-a:~ #


[root at node-b ~]# mkqdisk -L
mkqdisk v0.5.2
/dev/sdf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdo1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdx1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdag1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdap1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sday1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbh1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbq1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/emcpowerd1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

[root at node-b ~]#


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From rhurst at bidmc.harvard.edu  Mon Apr 27 14:03:21 2009
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Mon, 27 Apr 2009 10:03:21 -0400
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's
 Powerpath
In-Reply-To: <970694.34996.qm@web33308.mail.mud.yahoo.com>
References: <970694.34996.qm@web33308.mail.mud.yahoo.com>
Message-ID: <1240841001.4270.6.camel@WSBID06223.bidmc.harvard.edu>

Don't know if this will help your particular issue, but did you
change /etc/lvm/lvm.conf to only look at PowerPath devices, and not SCSI
disks?  I don't use a quorum disk, so I would like to see what the final
solution is, too.

For example, on an HP DL385:

    filter = [ "a/cciss/", "a/emcpower/", "r/.*/" ]

This tells LVM to only use PVs on the internal Compaq SCSI RAID
controller and under PowerPath control.  If you do not do specify this,
and if a path goes down, you are at risk of losing the entire VG.


________________________________________________________________________


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.


On Mon, 2009-04-27 at 06:17 -0700, Roberto Fratelli wrote:

> Hello everyone !
> 
> I'm trying to set up a quorum disk in a 2-node redhat cluster (RHEL 4.7). Both nodes are attached to a SAN Clariion over qlogic HBA's and i'm using EMC's powerpath.
> 
> The problem i'm facing is that the disks's device names are different across the nodes. For example, the lun i elected to be my quorum disk is called '/dev/emcpowerf' at node-a and '/dev/emcpowerd' at node-b, so i can't use the device name (/dev/emcpowerXX) in my cluster.conf file. In a attempt to overcome this i've tried to use the disk label instead :
> 
> <quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677" interval="2" tko="10" votes="1"/>
> 
> But did not work also, because this way it finds '/dev/sdXX' before '/dev/emcpowerXX':
> 
> root at node-a:~ # cman_tool nodes
> Node  Votes Exp Sts  Name
>    0    1    0   M   /dev/sdf1
>    1    1    2   M   node-a
>    2    1    2   M   node-b
> root at node-a:~ #
> 
> 
> Since this '/dev/sdf' is just one of the paths i have under powerpath (it has 4 paths per HBA, 8 paths per lun total) i can't keep it this way because in a event of lun trespass (clariion does it very often) i would loose quorum. My understanding is that i need some kind of "persistent binding" so i can use Powerpath's pseudo-device '/dev/emcpowerXX', so i tried to create a UDEV rule like this:
> 
> [root at node-a RSM]# cat 75-quorum.rules
> KERNEL=="emcpower[a-z]1, SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677", SYMLINK+="discodequorum%n"
> [root at node-a RSM]#
> 
> 
> Which did not worked, i think because UDEV starts way before powerpath. Does anyone here faced this problem before ? how can i keep the device names persistent across the nodes (at least for the quorum disk) ?
> 
> 
> Below some more info on my quorum's disk:
> 
> 
> root at node-a:~ # mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdo1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdx1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdag1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdap1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sday1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdbh1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdbq1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/emcpowerf1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> root at node-a:~ #
> 
> 
> 
> [root at node-b ~]# mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdo1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdx1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdag1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdap1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sday1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdbh1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/sdbq1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> /dev/emcpowerd1:
>         Magic:                eb7a62c2
>         Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
>         Created:              Mon Dec 24 11:47:11 2007
>         Host:                 node-a
>         Kernel Sector Size:   512
> 
> [root at node-b ~]#
> 
> 
>       
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090427/d80dad3d/attachment.htm>

From berthiaume_wayne at emc.com  Mon Apr 27 14:43:02 2009
From: berthiaume_wayne at emc.com (berthiaume_wayne at emc.com)
Date: Mon, 27 Apr 2009 10:43:02 -0400
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
In-Reply-To: <970694.34996.qm@web33308.mail.mud.yahoo.com>
References: <970694.34996.qm@web33308.mail.mud.yahoo.com>
Message-ID: <8F08A56613D77044BD153E2AC5DA0F8401CC1131@CORPUSMX40A.corp.emc.com>

Hi Roberto.

	 You can use the emcpadm() command to rename the device on each
node so they are all the same. The syntax is "emcpadm rename[pseudo] -s
src pseudodevice -t tgt pseudodevice". For example, if on node A the
device is emcpowerg and you wish it to be be emcpowerq you would issue:

# emcpadm rename -s emcpowerg -t emcpowerq 

	For each node run the "powermt display dev=all" and review the
Logical device ID that is displayed for each LUN on each node. Find your
quorom disk on each - for example, Logical device
ID=6006016018C80800B5069570B8A9D911

# powermt display dev=emcpowerg

Pseudo name=emcpowerg
CLARiiON ID=WRE00100100415
Logical device ID=6006016018C80800B5069570B8A9D911
state=alive; policy=CLAROpt; priority=0; queued-IOs=0
Owner: default=SP A, current=SP A
======================================================================
------------ Host ----------- - Stor - -- I/O Path - -- Stats ---
### HW Path I/O Paths Interf. Mode State Q-IOs Errors
======================================================================
3 lpfc sdaa SP A0 active alive 0 0
2 lpfc sdj SP A1 active alive 0 0

Then rename it....

# emcpadm rename -s emcpowerg -t emcpowerq 

...and your in business.

Regards,
Wayne.

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Roberto Fratelli
Sent: Monday, April 27, 2009 9:18 AM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath


Hello everyone !

I'm trying to set up a quorum disk in a 2-node redhat cluster (RHEL
4.7). Both nodes are attached to a SAN Clariion over qlogic HBA's and
i'm using EMC's powerpath.

The problem i'm facing is that the disks's device names are different
across the nodes. For example, the lun i elected to be my quorum disk is
called '/dev/emcpowerf' at node-a and '/dev/emcpowerd' at node-b, so i
can't use the device name (/dev/emcpowerXX) in my cluster.conf file. In
a attempt to overcome this i've tried to use the disk label instead :

<quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677" interval="2"
tko="10" votes="1"/>

But did not work also, because this way it finds '/dev/sdXX' before
'/dev/emcpowerXX':

root at node-a:~ # cman_tool nodes
Node  Votes Exp Sts  Name
   0    1    0   M   /dev/sdf1
   1    1    2   M   node-a
   2    1    2   M   node-b
root at node-a:~ #


Since this '/dev/sdf' is just one of the paths i have under powerpath
(it has 4 paths per HBA, 8 paths per lun total) i can't keep it this way
because in a event of lun trespass (clariion does it very often) i would
loose quorum. My understanding is that i need some kind of "persistent
binding" so i can use Powerpath's pseudo-device '/dev/emcpowerXX', so i
tried to create a UDEV rule like this:

[root at node-a RSM]# cat 75-quorum.rules
KERNEL=="emcpower[a-z]1,
SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677",
SYMLINK+="discodequorum%n"
[root at node-a RSM]#


Which did not worked, i think because UDEV starts way before powerpath.
Does anyone here faced this problem before ? how can i keep the device
names persistent across the nodes (at least for the quorum disk) ?


Below some more info on my quorum's disk:


root at node-a:~ # mkqdisk -L
mkqdisk v0.5.2
/dev/sdf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdo1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdx1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdag1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdap1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sday1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbh1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbq1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/emcpowerf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

root at node-a:~ #


[root at node-b ~]# mkqdisk -L
mkqdisk v0.5.2
/dev/sdf1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdo1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdx1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdag1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdap1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sday1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbh1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/sdbq1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

/dev/emcpowerd1:
        Magic:                eb7a62c2
        Label:                97196ef0-da2f-4b79-a5c5-999dce70d677
        Created:              Mon Dec 24 11:47:11 2007
        Host:                 node-a
        Kernel Sector Size:   512

[root at node-b ~]#


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From vikash at netvigator.com  Mon Apr 27 14:46:22 2009
From: vikash at netvigator.com (Vikash Khatuwala)
Date: Mon, 27 Apr 2009 22:46:22 +0800
Subject: [Linux-cluster] GFS performance.
Message-ID: <200904271513.n3RFDJkX019235@mx3.redhat.com>

Hi Jeff,

I tried that but I could not mount the partition anymore.

# gfs_tool sb /dev/mapper/cvg-vz01 proto no_lock
You shouldn't change any of these values if the filesystem is mounted.

Are you sure? [y/n] y

current lock protocol name = "lock_dlm"
new lock protocol name = "no_lock"
Done
# mount /vz
/sbin/mount.gfs: error mounting /dev/mapper/cvg-vz01 on /vz: No such 
file or directory

After I change it back to lock_dlm, I can mount the volume as usual.

Is there anything else I need to do?

Regards,
Vikash.

At 03:56 AM 26-04-09, Jeff Sturm wrote:
>Yes, and yes.  Use the "gfs_tool sb <device> proto no_lock" command on
>an unmounted filesystem, and remount.  (Obviously, you cannot mount the
>fs on more than one node after you do this.)
>
>Jeff
>
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash
> > Khatuwala
> > Sent: Saturday, April 25, 2009 4:26 AM
> > To: linux clustering
> > Subject: RE: [Linux-cluster] GFS performance.
> >
> > Hi,
> >
> > Can I downgrade the lock manage from lock_dlm to no_lock? Do
> > I need to un-mount the gfs partition before changing? I want
> > to see if it makes any performance improvements.
> >
> > Thanks,
> > Vikash.
> >
> >
> > At 11:18 AM 21-04-09, Vikash Khatuwala wrote:
> > >Hi,
> > >
> > >I am using Virtuozzo OS visualization which does not have a
> > single file
> > >for the entire VM's filesystem. All VMs are simply
> > sub-directories and
> > >OS files are stored in a common templates directory which is
> > sym linked
> > >to back to the VM's directory, so if an OS file is changed
> > inside the
> > >VM then the symlink breaks and a new file is put in the VM's private
> > >directory. I cant use GFS2 because it is not supported by Virtuozzo.
> > >All VMs are simply running web/db/ftp.
> > >
> > >So this basically means that there are a lot of symbolic
> > links (small
> > >files). The GFS has a block size of 4K so I also chose 4K as
> > my block
> > >size for my performance testing to asses the worst case
> > scenario. If I
> > >change the block size to 256K then the performance
> > difference between
> > >ext3 and GFS are minimal. Also when I migrate the VM out
> > from GFS(RAID5
> > >SAS 15K) to ext3(single disk SATA), there is a significant
> > noticeable
> > >performance gain!
> > >
> > >Below tests are on the same disk set (5 disk RAID5 SAS 15K) with 2
> > >partitions, GFS and ext3.
> > >Results at 4K random reads:
> > >GFS : about 1500K/s
> > >ext3 : about 7000K/s
> > >
> > >Results at 256K random reads:
> > >GFS : about 45000K/s
> > >ext3 : about 50000K/s
> > >
> > >Results at 256K sequential reads:
> > >GFS : over 110,000K/s (my single GB NIC maxes out)
> > >ext3 : over 110,000K/s (my single GB NIC maxes out)
> > >
> > >fio test file as below only rw and blocksize were changed for the 3
> > >different scenarios above.
> > >[random-read1]
> > >rw=randread
> > >size=10240m
> > >directory=/vz/tmp
> > >ioengine=libaio
> > >iodepth=16
> > >direct=1
> > >invalidate=1
> > >blocksize=4k
> > >
> > >[random-read2]
> > >rw=randread
> > >size=10240m
> > >directory=/vz/tmp
> > >ioengine=libaio
> > >iodepth=16
> > >direct=1
> > >invalidate=1
> > >blocksize=4k
> > >
> > >Thanks,
> > >Vikash.
> > >
> > >
> > >At 01:00 AM 21-04-09, Jeff Sturm wrote:
> > >> > -----Original Message-----
> > >> > From: linux-cluster-bounces at redhat.com
> > >> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash
> > >> > Khatuwala
> > >> > Sent: Monday, April 20, 2009 11:23 AM
> > >> > To: linux-cluster at redhat.com
> > >> > Subject: [Linux-cluster] GFS performance.
> > >> >
> > >> > OS : CentOS 5.2
> > >> > FS : GFS
> > >>
> > >>Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to
> > have some
> > >>performance improvements over GFS1.
> > >>
> > >> > Now I need to make a decision to go with GFS or not,
> > clearly at 4
> > >> > times less performance we cannot afford it, also it
> > doesn't sound
> > >> > right so would like to find out whats wrong.
> > >>
> > >>Be careful with benchmarks, as they often do not give you a good
> > >>indication of real-world performance.
> > >>
> > >>Are you more concerned with latency or throughput?  Any single read
> > >>will almost certainly take longer to complete over GFS than EXT3.
> > >>There's simply more overhead involved with any cluster filesystem.
> > >>However, that's not to say you're limited as to how many
> > reads you can
> > >>execute in parallel.  So the overall number of reads you
> > can perform
> > >>in a given time interval may not be 4x at all (are you running a
> > >>parallel
> > >>benchmark?)
> > >>
> > >>Jeff
> > >>
> > >>
> > >>--
> > >>Linux-cluster mailing list
> > >>Linux-cluster at redhat.com
> > >>https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> >
>
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster


From rpeterso at redhat.com  Mon Apr 27 15:33:32 2009
From: rpeterso at redhat.com (Bob Peterson)
Date: Mon, 27 Apr 2009 11:33:32 -0400 (EDT)
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <200904271513.n3RFDJkX019235@mx3.redhat.com>
Message-ID: <1348303238.97841240846412505.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- "Vikash Khatuwala" <vikash at netvigator.com> wrote:
| Hi Jeff,
| 
| I tried that but I could not mount the partition anymore.
| 
| # gfs_tool sb /dev/mapper/cvg-vz01 proto no_lock
| You shouldn't change any of these values if the filesystem is
| mounted.
| 
| Are you sure? [y/n] y
| 
| current lock protocol name = "lock_dlm"
| new lock protocol name = "no_lock"

Hi,

That should be "lock_nolock" not "no_lock".

Regards,

Bob Peterson
Red Hat GFS


From jeff.sturm at eprize.com  Mon Apr 27 15:41:28 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Mon, 27 Apr 2009 11:41:28 -0400
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <1348303238.97841240846412505.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <200904271513.n3RFDJkX019235@mx3.redhat.com>
	<1348303238.97841240846412505.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDB9D9@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bob Peterson
> Sent: Monday, April 27, 2009 11:34 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS performance.
> 
> | current lock protocol name = "lock_dlm"
> | new lock protocol name = "no_lock"
> 
> Hi,
> 
> That should be "lock_nolock" not "no_lock".

Yes--sorry for the confusion.

Jeff


From vikash at netvigator.com  Mon Apr 27 16:10:11 2009
From: vikash at netvigator.com (Vikash Khatuwala)
Date: Tue, 28 Apr 2009 00:10:11 +0800
Subject: [Linux-cluster] GFS performance.
Message-ID: <200904271610.n3RGAa0d022305@mx3.redhat.com>

Hi,

I am using Virtuozzo OS visualization which does not have a single 
file for the entire VM's filesystem. All VMs are simply 
sub-directories and OS files are stored in a common templates 
directory which is sym linked to back to the VM's directory, so if an 
OS file is changed inside the VM then the symlink breaks and a new 
file is put in the VM's private directory. I cant use GFS2 because it 
is not supported by Virtuozzo. All VMs are simply running web/db/ftp.

So this basically means that there are a lot of symbolic links (small 
files). The GFS has a block size of 4K so I also chose 4K as my block 
size for my performance testing to asses the worst case scenario. If 
I change the block size to 256K then the performance difference 
between ext3 and GFS are minimal. Also when I migrate the VM out from 
GFS(RAID5 SAS 15K) to ext3(single disk SATA), there is a significant 
noticeable performance gain!

Below tests are on the same disk set (5 disk RAID5 SAS 15K) with 2 
partitions, GFS and ext3.
Results at 4K random reads:
GFS : about 1500K/s
ext3 : about 7000K/s

Results at 256K random reads:
GFS : about 45000K/s
ext3 : about 50000K/s

Results at 256K sequential reads:
GFS : over 110,000K/s (my single GB NIC maxes out)
ext3 : over 110,000K/s (my single GB NIC maxes out)

fio test file as below only rw and blocksize were changed for the 3 
different scenarios above.
[random-read1]
rw=randread
size=10240m
directory=/vz/tmp
ioengine=libaio
iodepth=16
direct=1
invalidate=1
blocksize=4k

[random-read2]
rw=randread
size=10240m
directory=/vz/tmp
ioengine=libaio
iodepth=16
direct=1
invalidate=1
blocksize=4k

Thanks,
Vikash.


At 01:00 AM 21-04-09, Jeff Sturm wrote:
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Vikash
> > Khatuwala
> > Sent: Monday, April 20, 2009 11:23 AM
> > To: linux-cluster at redhat.com
> > Subject: [Linux-cluster] GFS performance.
> >
> > OS : CentOS 5.2
> > FS : GFS
>
>Can you easily install CentOS 5.3 and GFS2?  GFS2 claims to have some
>performance improvements over GFS1.
>
> > Now I need to make a decision to go with GFS or not, clearly
> > at 4 times less performance we cannot afford it, also it
> > doesn't sound right so would like to find out whats wrong.
>
>Be careful with benchmarks, as they often do not give you a good
>indication of real-world performance.
>
>Are you more concerned with latency or throughput?  Any single read will
>almost certainly take longer to complete over GFS than EXT3.  There's
>simply more overhead involved with any cluster filesystem.  However,
>that's not to say you're limited as to how many reads you can execute in
>parallel.  So the overall number of reads you can perform in a given
>time interval may not be 4x at all (are you running a parallel
>benchmark?)
>
>Jeff
>
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster


From robertofratelli at yahoo.com  Mon Apr 27 16:59:46 2009
From: robertofratelli at yahoo.com (Roberto Fratelli)
Date: Mon, 27 Apr 2009 09:59:46 -0700 (PDT)
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
Message-ID: <86526.924.qm@web33301.mail.mud.yahoo.com>


Excellent ! thanks a million ! this emcpadm command was unknown to me. I just tested it and works great ! 

Just another question: Is it necessary to issue 'powermt save' after renaming the pseudo-devices ?

Regards,

Roberto


--- On Mon, 4/27/09, berthiaume_wayne at emc.com <berthiaume_wayne at emc.com> wrote:

> From: berthiaume_wayne at emc.com <berthiaume_wayne at emc.com>
> Subject: RE: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
> To: linux-cluster at redhat.com
> Date: Monday, April 27, 2009, 9:43 AM
> Hi Roberto.
> 
> ?????You can use the emcpadm()
> command to rename the device on each
> node so they are all the same. The syntax is "emcpadm
> rename[pseudo] -s
> src pseudodevice -t tgt pseudodevice". For example, if on
> node A the
> device is emcpowerg and you wish it to be be emcpowerq you
> would issue:
> 
> # emcpadm rename -s emcpowerg -t emcpowerq 
> 
> ??? For each node run the "powermt display
> dev=all" and review the
> Logical device ID that is displayed for each LUN on each
> node. Find your
> quorom disk on each - for example, Logical device
> ID=6006016018C80800B5069570B8A9D911
> 
> # powermt display dev=emcpowerg
> 
> Pseudo name=emcpowerg
> CLARiiON ID=WRE00100100415
> Logical device ID=6006016018C80800B5069570B8A9D911
> state=alive; policy=CLAROpt; priority=0; queued-IOs=0
> Owner: default=SP A, current=SP A
> ======================================================================
> ------------ Host ----------- - Stor - -- I/O Path - --
> Stats ---
> ### HW Path I/O Paths Interf. Mode State Q-IOs Errors
> ======================================================================
> 3 lpfc sdaa SP A0 active alive 0 0
> 2 lpfc sdj SP A1 active alive 0 0
> 
> Then rename it....
> 
> # emcpadm rename -s emcpowerg -t emcpowerq 
> 
> ...and your in business.
> 
> Regards,
> Wayne.
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Roberto Fratelli
> Sent: Monday, April 27, 2009 9:18 AM
> To: linux-cluster at redhat.com
> Subject: [Linux-cluster] Persistent naming with UDEV and
> EMC's Powerpath
> 
> 
> Hello everyone !
> 
> I'm trying to set up a quorum disk in a 2-node redhat
> cluster (RHEL
> 4.7). Both nodes are attached to a SAN Clariion over qlogic
> HBA's and
> i'm using EMC's powerpath.
> 
> The problem i'm facing is that the disks's device names are
> different
> across the nodes. For example, the lun i elected to be my
> quorum disk is
> called '/dev/emcpowerf' at node-a and '/dev/emcpowerd' at
> node-b, so i
> can't use the device name (/dev/emcpowerXX) in my
> cluster.conf file. In
> a attempt to overcome this i've tried to use the disk label
> instead :
> 
> <quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677"
> interval="2"
> tko="10" votes="1"/>
> 
> But did not work also, because this way it finds
> '/dev/sdXX' before
> '/dev/emcpowerXX':
> 
> root at node-a:~ # cman_tool nodes
> Node? Votes Exp Sts? Name
> ???0? ? 1? ?
> 0???M???/dev/sdf1
> ???1? ? 1? ?
> 2???M???node-a
> ???2? ? 1? ?
> 2???M???node-b
> root at node-a:~ #
> 
> 
> Since this '/dev/sdf' is just one of the paths i have under
> powerpath
> (it has 4 paths per HBA, 8 paths per lun total) i can't
> keep it this way
> because in a event of lun trespass (clariion does it very
> often) i would
> loose quorum. My understanding is that i need some kind of
> "persistent
> binding" so i can use Powerpath's pseudo-device
> '/dev/emcpowerXX', so i
> tried to create a UDEV rule like this:
> 
> [root at node-a RSM]# cat 75-quorum.rules
> KERNEL=="emcpower[a-z]1,
> SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677",
> SYMLINK+="discodequorum%n"
> [root at node-a RSM]#
> 
> 
> Which did not worked, i think because UDEV starts way
> before powerpath.
> Does anyone here faced this problem before ? how can i keep
> the device
> names persistent across the nodes (at least for the quorum
> disk) ?
> 
> 
> Below some more info on my quorum's disk:
> 
> 
> root at node-a:~ # mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdo1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdx1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdag1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdap1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sday1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbh1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbq1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/emcpowerf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> root at node-a:~ #
> 
> 
> 
> [root at node-b ~]# mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdo1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdx1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdag1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdap1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sday1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbh1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbq1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/emcpowerd1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> [root at node-b ~]#
> 
> 
> ? ? ? 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From vikash at netvigator.com  Mon Apr 27 17:02:01 2009
From: vikash at netvigator.com (Vikash Khatuwala)
Date: Tue, 28 Apr 2009 01:02:01 +0800
Subject: [Linux-cluster] GFS performance.
In-Reply-To: <1348303238.97841240846412505.JavaMail.root@zmail06.collab.
	prod.int.phx2.redhat.com>
References: <200904271513.n3RFDJkX019235@mx3.redhat.com>
	<1348303238.97841240846412505.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <200904271702.n3RH2ARI022528@mx3.redhat.com>

Hi,

That worked. thanks a lot.

Yes it does improve performance, however still not as good as ext3 itself.

Regards,
Vikash.

At 11:33 PM 27-04-09, Bob Peterson wrote:
>----- "Vikash Khatuwala" <vikash at netvigator.com> wrote:
>| Hi Jeff,
>|
>| I tried that but I could not mount the partition anymore.
>|
>| # gfs_tool sb /dev/mapper/cvg-vz01 proto no_lock
>| You shouldn't change any of these values if the filesystem is
>| mounted.
>|
>| Are you sure? [y/n] y
>|
>| current lock protocol name = "lock_dlm"
>| new lock protocol name = "no_lock"
>
>Hi,
>
>That should be "lock_nolock" not "no_lock".
>
>Regards,
>
>Bob Peterson
>Red Hat GFS
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster


From robertofratelli at yahoo.com  Mon Apr 27 17:09:46 2009
From: robertofratelli at yahoo.com (Roberto Fratelli)
Date: Mon, 27 Apr 2009 10:09:46 -0700 (PDT)
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
Message-ID: <95098.35928.qm@web33306.mail.mud.yahoo.com>


Hi Stevan,

I saw this in some examples, but somehow it just don't work for me :

# /sbin/scsi_id -g -s /block/emcpowerf
Cannot find sysfs device associated with /sys/block/emcpowerf
#


Weird.. A guy from EMC just sent me something that will fit just perfect. I'll share with the entire list.

Regards,


--- On Mon, 4/27/09, Stevan Colaco <stevan.colaco at gmail.com> wrote:

> From: Stevan Colaco <stevan.colaco at gmail.com>
> Subject: Re: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
> To: "linux clustering" <linux-cluster at redhat.com>
> Date: Monday, April 27, 2009, 8:52 AM
> Hello,
> 
> There are different ways one may write the persistent
> naming udev rules.
> 
> Instead of SYSFS{serial}, it would be better if you can use
> scsi_id
> 
> first try to get the scsi id of the quorum device
> 
> #/sbin/scsi_id -g -s /block/emcpowerf
> 
> you should receive a lengthy number. 
> 
> Then you can use the rule something like below 
> 
> [root at node-a RSM]# cat 75-quorum.rules
> KERNEL=="emcpower[a-z]1, PROGRAM=="/sbin/scsi_id -g -s
> /block/emcpowerf", RESULT=="Paste the output of scsi_id
> here" SYMLINK+="discodequorum"
> 
> 
> regards,
> -Steve
> 
> [root at node-a RSM]#
> 
> ----- Original Message -----
> From: "Roberto Fratelli" <robertofratelli at yahoo.com>
> To: linux-cluster at redhat.com
> Sent: Monday, April 27, 2009 4:17:47 PM GMT +03:00 Iraq
> Subject: [Linux-cluster] Persistent naming with UDEV and
> EMC's Powerpath
> 
> 
> Hello everyone !
> 
> I'm trying to set up a quorum disk in a 2-node redhat
> cluster (RHEL 4.7). Both nodes are attached to a SAN
> Clariion over qlogic HBA's and i'm using EMC's powerpath.
> 
> The problem i'm facing is that the disks's device names are
> different across the nodes. For example, the lun i elected
> to be my quorum disk is called '/dev/emcpowerf' at node-a
> and '/dev/emcpowerd' at node-b, so i can't use the device
> name (/dev/emcpowerXX) in my cluster.conf file. In a attempt
> to overcome this i've tried to use the disk label instead :
> 
> <quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677"
> interval="2" tko="10" votes="1"/>
> 
> But did not work also, because this way it finds
> '/dev/sdXX' before '/dev/emcpowerXX':
> 
> root at node-a:~ # cman_tool nodes
> Node? Votes Exp Sts? Name
> ???0? ? 1? ?
> 0???M???/dev/sdf1
> ???1? ? 1? ?
> 2???M???node-a
> ???2? ? 1? ?
> 2???M???node-b
> root at node-a:~ #
> 
> 
> Since this '/dev/sdf' is just one of the paths i have under
> powerpath (it has 4 paths per HBA, 8 paths per lun total) i
> can't keep it this way because in a event of lun trespass
> (clariion does it very often) i would loose quorum. My
> understanding is that i need some kind of "persistent
> binding" so i can use Powerpath's pseudo-device
> '/dev/emcpowerXX', so i tried to create a UDEV rule like
> this:
> 
> [root at node-a RSM]# cat 75-quorum.rules
> KERNEL=="emcpower[a-z]1,
> SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677",
> SYMLINK+="discodequorum%n"
> [root at node-a RSM]#
> 
> 
> Which did not worked, i think because UDEV starts way
> before powerpath. Does anyone here faced this problem before
> ? how can i keep the device names persistent across the
> nodes (at least for the quorum disk) ?
> 
> 
> Below some more info on my quorum's disk:
> 
> 
> root at node-a:~ # mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdo1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdx1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdag1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdap1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sday1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbh1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbq1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/emcpowerf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> root at node-a:~ #
> 
> 
> 
> [root at node-b ~]# mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdo1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdx1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdag1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdap1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sday1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbh1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbq1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/emcpowerd1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> [root at node-b ~]#
> 
> 
> ? ? ? 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From pavlos.parissis at gmail.com  Mon Apr 27 17:35:53 2009
From: pavlos.parissis at gmail.com (Pavlos Parissis)
Date: Mon, 27 Apr 2009 19:35:53 +0200
Subject: [Linux-cluster] Failback option on failover domains
In-Reply-To: <1240594661.27154.24.camel@ayanami>
References: <fb143d260904151012t262e87d3n97bd9f121efdd6e1@mail.gmail.com>
	<1240594661.27154.24.camel@ayanami>
Message-ID: <fb143d260904271035p54505be5y580c5fbfee4c2924@mail.gmail.com>

2009/4/24 Lon Hohberger <lhh at redhat.com>

> On Wed, 2009-04-15 at 19:12 +0200, Pavlos Parissis wrote:
> > Hello,
> >
> > Does anyone know if the failback option is available on RedHat 4.X
> > systems?
> >
> > I went through all docs [1] and it seams to be that is only available
> > on RedHat 5.X version,
> > am I right?
>
> It's in 4.y.latest too.
>


I have installed rgmanager-1.9.80-1 and when I run rg_test on my conf I
didn't see it.
May be I need to do a full upgrade to 4.u7, I am still on 4.u4.


Thanks,
Pavlos
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090427/216f4e29/attachment.htm>

From pavlos.parissis at gmail.com  Mon Apr 27 17:36:40 2009
From: pavlos.parissis at gmail.com (Pavlos Parissis)
Date: Mon, 27 Apr 2009 19:36:40 +0200
Subject: [Linux-cluster] prevent ping-pong
In-Reply-To: <1240594175.27154.14.camel@ayanami>
References: <fb143d260904091535l26030d31nf1679075ba6ce7cc@mail.gmail.com>
	<1240594175.27154.14.camel@ayanami>
Message-ID: <fb143d260904271036v38807d89qcbd5a8967f7f5e34@mail.gmail.com>

2009/4/24 Lon Hohberger <lhh at redhat.com>

> On Fri, 2009-04-10 at 00:35 +0200, Pavlos Parissis wrote:
> > Hi,
> >
> > Is there a way to configure cluster to not ping-pong a service more
> > than N times?
> >
> > I have an application where sometimes cluster gets a no zero status on
> > the check and keeps failover back and forth the service.
> > I want the cluster to try only N times to fail over the service and
> > then disables it if it can't start it or gets a no zero status on
> > checks after N times.
>
> On most-current RHEL 4 + errata, you can do a 'max_restarts' combined
> with 'restart_expire_time' to achieve a maximum restart before failover,
> but there's no current limit to the # of restarts / failovers before a
> disable.


Thanks

Pavlos
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090427/dec25623/attachment.htm>

From berthiaume_wayne at emc.com  Mon Apr 27 18:12:00 2009
From: berthiaume_wayne at emc.com (berthiaume_wayne at emc.com)
Date: Mon, 27 Apr 2009 14:12:00 -0400
Subject: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
In-Reply-To: <86526.924.qm@web33301.mail.mud.yahoo.com>
References: <86526.924.qm@web33301.mail.mud.yahoo.com>
Message-ID: <8F08A56613D77044BD153E2AC5DA0F8401CC131D@CORPUSMX40A.corp.emc.com>

Hi Roberto.

	It won't hurt to do it and would be advisable.

Regards,
Wayne. 

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Roberto Fratelli
Sent: Monday, April 27, 2009 1:00 PM
To: linux clustering
Subject: RE: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath


Excellent ! thanks a million ! this emcpadm command was unknown to me. I just tested it and works great ! 

Just another question: Is it necessary to issue 'powermt save' after renaming the pseudo-devices ?

Regards,

Roberto


--- On Mon, 4/27/09, berthiaume_wayne at emc.com <berthiaume_wayne at emc.com> wrote:

> From: berthiaume_wayne at emc.com <berthiaume_wayne at emc.com>
> Subject: RE: [Linux-cluster] Persistent naming with UDEV and EMC's Powerpath
> To: linux-cluster at redhat.com
> Date: Monday, April 27, 2009, 9:43 AM
> Hi Roberto.
> 
> ?????You can use the emcpadm()
> command to rename the device on each
> node so they are all the same. The syntax is "emcpadm
> rename[pseudo] -s
> src pseudodevice -t tgt pseudodevice". For example, if on
> node A the
> device is emcpowerg and you wish it to be be emcpowerq you
> would issue:
> 
> # emcpadm rename -s emcpowerg -t emcpowerq 
> 
> ??? For each node run the "powermt display
> dev=all" and review the
> Logical device ID that is displayed for each LUN on each
> node. Find your
> quorom disk on each - for example, Logical device
> ID=6006016018C80800B5069570B8A9D911
> 
> # powermt display dev=emcpowerg
> 
> Pseudo name=emcpowerg
> CLARiiON ID=WRE00100100415
> Logical device ID=6006016018C80800B5069570B8A9D911
> state=alive; policy=CLAROpt; priority=0; queued-IOs=0
> Owner: default=SP A, current=SP A
> ======================================================================
> ------------ Host ----------- - Stor - -- I/O Path - --
> Stats ---
> ### HW Path I/O Paths Interf. Mode State Q-IOs Errors
> ======================================================================
> 3 lpfc sdaa SP A0 active alive 0 0
> 2 lpfc sdj SP A1 active alive 0 0
> 
> Then rename it....
> 
> # emcpadm rename -s emcpowerg -t emcpowerq 
> 
> ...and your in business.
> 
> Regards,
> Wayne.
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Roberto Fratelli
> Sent: Monday, April 27, 2009 9:18 AM
> To: linux-cluster at redhat.com
> Subject: [Linux-cluster] Persistent naming with UDEV and
> EMC's Powerpath
> 
> 
> Hello everyone !
> 
> I'm trying to set up a quorum disk in a 2-node redhat
> cluster (RHEL
> 4.7). Both nodes are attached to a SAN Clariion over qlogic
> HBA's and
> i'm using EMC's powerpath.
> 
> The problem i'm facing is that the disks's device names are
> different
> across the nodes. For example, the lun i elected to be my
> quorum disk is
> called '/dev/emcpowerf' at node-a and '/dev/emcpowerd' at
> node-b, so i
> can't use the device name (/dev/emcpowerXX) in my
> cluster.conf file. In
> a attempt to overcome this i've tried to use the disk label
> instead :
> 
> <quorumd label="97196ef0-da2f-4b79-a5c5-999dce70d677"
> interval="2"
> tko="10" votes="1"/>
> 
> But did not work also, because this way it finds
> '/dev/sdXX' before
> '/dev/emcpowerXX':
> 
> root at node-a:~ # cman_tool nodes
> Node? Votes Exp Sts? Name
> ???0? ? 1? ?
> 0???M???/dev/sdf1
> ???1? ? 1? ?
> 2???M???node-a
> ???2? ? 1? ?
> 2???M???node-b
> root at node-a:~ #
> 
> 
> Since this '/dev/sdf' is just one of the paths i have under
> powerpath
> (it has 4 paths per HBA, 8 paths per lun total) i can't
> keep it this way
> because in a event of lun trespass (clariion does it very
> often) i would
> loose quorum. My understanding is that i need some kind of
> "persistent
> binding" so i can use Powerpath's pseudo-device
> '/dev/emcpowerXX', so i
> tried to create a UDEV rule like this:
> 
> [root at node-a RSM]# cat 75-quorum.rules
> KERNEL=="emcpower[a-z]1,
> SYSFS{label}=="97196ef0-da2f-4b79-a5c5-999dce70d677",
> SYMLINK+="discodequorum%n"
> [root at node-a RSM]#
> 
> 
> Which did not worked, i think because UDEV starts way
> before powerpath.
> Does anyone here faced this problem before ? how can i keep
> the device
> names persistent across the nodes (at least for the quorum
> disk) ?
> 
> 
> Below some more info on my quorum's disk:
> 
> 
> root at node-a:~ # mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdo1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdx1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdag1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdap1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sday1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbh1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbq1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/emcpowerf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> root at node-a:~ #
> 
> 
> 
> [root at node-b ~]# mkqdisk -L
> mkqdisk v0.5.2
> /dev/sdf1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdo1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdx1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdag1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdap1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sday1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbh1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/sdbq1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> /dev/emcpowerd1:
> ? ? ? ? Magic:? ? ?
> ? ? ? ? ? eb7a62c2
> ? ? ? ? Label:? ? ?
> ? ? ? ? ?
> 97196ef0-da2f-4b79-a5c5-999dce70d677
> ? ? ? ? Created:? ? ?
> ? ? ? ? Mon Dec 24 11:47:11 2007
> ? ? ? ? Host:? ? ?
> ? ? ? ? ???node-a
> ? ? ? ? Kernel Sector
> Size:???512
> 
> [root at node-b ~]#
> 
> 
> ? ? ? 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From JBayles at readytechs.com  Mon Apr 27 18:13:08 2009
From: JBayles at readytechs.com (Jonathan Bayles)
Date: Mon, 27 Apr 2009 13:13:08 -0500
Subject: [Linux-cluster] Seeking blueprint
Message-ID: <9EB99FF0783ACA4F8B4B8A666D8462E20C8AAC@mail-11ps.atlarge.net>

Right now I have two boxes using drbd to replicate LV's in a
primary/secondary role setup. GNBD is serving the block devices out to
xen domU's located on other servers. I am interested making better use
of the second machine. Is there any way I can re-engineer this setup to
exploit the second machine but continue to serve block devices to my
clients? 

Jonathan Bayles


From esggrupos at gmail.com  Tue Apr 28 15:21:13 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Tue, 28 Apr 2009 17:21:13 +0200
Subject: [Linux-cluster] nodes halted with net lost
Message-ID: <3128ba140904280821m2bb71a03pe6b2c295a1be173a@mail.gmail.com>

Hello All,

I have a two nodes cluster on HP Servers. I have fencing configured using
the ILO cards that cames with this servers.

The cluster works fine but I have detected an strange behaviour.

The nodes are connected through a single switcher (I know, this is a single
point of failure...). If I reboot the switcher, the two nodes halt. (through
fencing it can be done because the go through the same switcher)

I don?t know if this behaviour is normal and if its possible to control it.
I want that when this happens the nodes dont do nothing  or at least they
reboot, not halt.

anyone can tell me if this is normal and where it can be configured?

Thanks in advance

ESG
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090428/aff371b0/attachment.htm>

From gordan at bobich.net  Tue Apr 28 15:41:17 2009
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 28 Apr 2009 16:41:17 +0100
Subject: [Linux-cluster] nodes halted with net lost
In-Reply-To: <3128ba140904280821m2bb71a03pe6b2c295a1be173a@mail.gmail.com>
References: <3128ba140904280821m2bb71a03pe6b2c295a1be173a@mail.gmail.com>
Message-ID: <d91927b69e2d38ed8edeb0a6d6fa5ec9@localhost>

On Tue, 28 Apr 2009 17:21:13 +0200, ESGLinux <esggrupos at gmail.com> wrote:

> The nodes are connected through a single switcher (I know, this is a
single
> point of failure...). If I reboot the switcher, the two nodes halt.
> (through
> fencing it can be done because the go through the same switcher)

If they can't fence each other, cluster services will pause until fencing
can
be performed and verified. If this isn't happening (because the only path
between them with also covers fencing, is gone), then the behaviour you are
seeing is expected. But when the switch comes back up, they should resume.

If they don't resume when the switch comes back up, then that sounds like a
fencing configuration issue. Have you verified that fencing works and that
each node can successfully fence the other?

It is normally a good idea to isolate the cluster communication to a
dedicated interface. If you only have 2 nodes, you could just connect them
directly on a dedicated interface, without a switch.

> I don?t know if this behaviour is normal and if its possible to control
> it.
> I want that when this happens the nodes dont do nothing  or at least they
> reboot, not halt.

You can configure the action you want the fencing agent to perform. Look
up the man page for the fencing agent you are using. I thought the default
was to reboot (at least it is for the DRAC agent).

Gordan


From theophanis_kontogiannis at yahoo.gr  Tue Apr 28 15:49:10 2009
From: theophanis_kontogiannis at yahoo.gr (Theophanis Kontogiannis)
Date: Tue, 28 Apr 2009 18:49:10 +0300
Subject: [Linux-cluster] Question about controlling the start of services
	with RIND
Message-ID: <006901c9c818$e0731ae0$a15950a0$@gr>

Hello All,

 
>From you experience (cause my experience on the issue is none) could I use RIND for the following?

 
I have DRBD ? CLVMD ? GFS2 (that eventually gets mounted) and it is configured as a parent ? child hierarchy service (with that sequence) on a two nodes cluster (RHEL5.3).

 
Each node starts the service for itself.

 
Services like Apache, that have all the files under the file system to be mounted, also try to start at the same time.

 
They do not find their files (since the service filesystem takes some time to start) and fail.

 
Could I use RIND somehow to make the rest of the clustered services to start only if the filesystem service has started?

 
Thank you All for your time,

 
Theophanis Kontogiannis

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090428/63b5de06/attachment.htm>

From theophanis_kontogiannis at yahoo.gr  Tue Apr 28 16:48:04 2009
From: theophanis_kontogiannis at yahoo.gr (Theophanis Kontogiannis)
Date: Tue, 28 Apr 2009 19:48:04 +0300
Subject: [Linux-cluster] How to add the new drbd resource agent to
	system-config-cluster?
Message-ID: <007401c9c821$1a83c8d0$4f8b5a70$@gr>

Hello All,

 
I have RHEL5.3 and DBRD 8.3.1

 
How can I add the new DRBD resource agent (already installed under
/usr/share/cluster) on the drop down menus of system-config-cluster?

 
Obviously this is a subset of the more general question on how to add new
resource agents to be expoited with system-config-cluster.

 
Thank you All for your time.

 
Theophanis Kontogiannis

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090428/1ce89725/attachment.htm>

From esggrupos at gmail.com  Tue Apr 28 17:07:17 2009
From: esggrupos at gmail.com (ESGLinux)
Date: Tue, 28 Apr 2009 19:07:17 +0200
Subject: [Linux-cluster] nodes halted with net lost
In-Reply-To: <d91927b69e2d38ed8edeb0a6d6fa5ec9@localhost>
References: <3128ba140904280821m2bb71a03pe6b2c295a1be173a@mail.gmail.com>
	<d91927b69e2d38ed8edeb0a6d6fa5ec9@localhost>
Message-ID: <3128ba140904281007x6f0edf29g3ff23f2cca0cc15a@mail.gmail.com>

Hello Gordan,

Fencing works fine when the switcher is ok. I can fence a node from the
other, and when the communication is lost the cluster fence the node that
needs to fence. (I have a funny issue when I start the firewall and the
communication was lost, the nodes fences each other...)

You say its normal that the nodes halt until the switch comes up, but I
prefer that they reboots. I use fence_ipmilan agent to fence
but I dont know how to configure to do what I want

any idea?

thanks

ESG


2009/4/28 Gordan Bobic <gordan at bobich.net>

> On Tue, 28 Apr 2009 17:21:13 +0200, ESGLinux <esggrupos at gmail.com> wrote:
>
> > The nodes are connected through a single switcher (I know, this is a
> single
> > point of failure...). If I reboot the switcher, the two nodes halt.
> > (through
> > fencing it can be done because the go through the same switcher)
>
> If they can't fence each other, cluster services will pause until fencing
> can
> be performed and verified. If this isn't happening (because the only path
> between them with also covers fencing, is gone), then the behaviour you are
> seeing is expected. But when the switch comes back up, they should resume.
>
> If they don't resume when the switch comes back up, then that sounds like a
> fencing configuration issue. Have you verified that fencing works and that
> each node can successfully fence the other?
>
> It is normally a good idea to isolate the cluster communication to a
> dedicated interface. If you only have 2 nodes, you could just connect them
> directly on a dedicated interface, without a switch.
>
> > I don?t know if this behaviour is normal and if its possible to control
> > it.
> > I want that when this happens the nodes dont do nothing  or at least they
> > reboot, not halt.
>
> You can configure the action you want the fencing agent to perform. Look
> up the man page for the fencing agent you are using. I thought the default
> was to reboot (at least it is for the DRAC agent).
>
> Gordan
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090428/2af00edb/attachment.htm>

From mrugeshkarnik at gmail.com  Wed Apr 29 10:17:44 2009
From: mrugeshkarnik at gmail.com (Mrugesh Karnik)
Date: Wed, 29 Apr 2009 15:47:44 +0530
Subject: [Linux-cluster] qdiskd error `does match kernel's reported sector
	size'
Message-ID: <200904291547.44205.mrugeshkarnik@gmail.com>

Hi,

I have a two node cluster configured with qdisk. The original setup is with 
both nodes being CentOS 5.3. Today, I decided to see if I could get GFS2 
working with Debian. So I installed Lenny on one of the nodes. Made it a dual 
boot system with the same cluster configuration. The other node wasn't touched 
at all and was always online with the qdisk making the cluster quorate.

On the Debian node, I can join the quorate cluster and the fence domain as 
well. However, qdiskd spits out the following in the logs and doesn't start 
up:
Apr 28 19:05:32 eru qdiskd[3552]: <crit> Specified device /dev/mpath/dom0qdisk 
does match kernel's reported sector size (0 != -1)

Here's what I've done wrt the qdisk device. Debian creates the multipath 
device links somewhere under /dev/disk/by-id/. The cluster configuration is 
for /dev/mpath/dom0qdisk and I didn't want to change it, so I added a udev 
rule and had the /dev/mpath/ symlinks created. Other than this, everything 
else has been configured exactly the same as the CentOS setup.

After this whole shebang, I again booted into CentOS and everything works 
fine.

The only difference I've noticed is that the major number of the mpath device 
is 253 on CentOS and 254 on Debian. Though both 253 and 254 fall under the 
reserved range for local/experimental block devices, so I don't think that'd 
be a problem.

I later updated the config and changed the identity of the Debian node and 
made it a third member of the cluster. Still, no go. I even tried to use 
/dev/mapper/dom0qdisk directly as well, but still, the problem persists.

Could anyone please help me with the issue?

Thanks,
Mrugesh


From billpp at gmail.com  Wed Apr 29 20:21:56 2009
From: billpp at gmail.com (Flavio Junior)
Date: Wed, 29 Apr 2009 17:21:56 -0300
Subject: [Linux-cluster] RHCS 4-node cluster: Networking/Membership issues
Message-ID: <58aa8d780904291321i21a8914fl956db51d61664a51@mail.gmail.com>

Hi folks,

I've been trying to set up a 4-node RHCS+GFS cluster for awhile. I've
another 2-node cluster using CentOS 5.3 without problem.

Well.. My scenario is as follow:

* System configuration and info: http://pastebin.com/f41d63624

* Network: http://www.uploadimagens.com/upload/2ac9074fbb10c2479c59abe419880dc8.jpg
  * Switches on loop are 3Com 2924 (or 2948)-SFP
  * Have STP enabled (RSTP auto)
  * IGMP Snooping Disabled as:
http://magazine.redhat.com/2007/08/23/automated-failover-and-recovery-of-virtualized-guests-in-advanced-platform/
comment 32
  * Yellow lines are a fiber link 990ft (330mts) single-mode
  * I'm using a dedicated tagged VLAN for cluster-heartbeat
  * I'm using 2 NIC's with bonding mode=1 (active/backup) for
heartbeat and 4 NIC's to "public"
  * Every node has your public four cables plugged on same switch and
Link-Aggregation on it
  * Looking to the picture, that 2 switches with below fiber link is
where the nodes are plugged. 2 nodes each build.

SAN: http://img139.imageshack.us/img139/642/clusters.jpg
  * Switches: Brocade TotalStorage 16SAN-B
  * Storages: IBM DS4700 72A (using ERM for sync replication (storage level))

My problem is:

I can't get the 4 nodes up. Every time the fourth (sometimes even the
third) node becomes online i got one or two of them fenced. I keep
getting messages about openais/cman, cpg_mcast_joined very often:
--- snipped ---
Apr 29 16:08:23 athos groupd[5393]: cpg_mcast_joined retry 1098900
Apr 29 16:08:23 athos groupd[5393]: cpg_mcast_joined retry 1099000
--- snipped ---

Is really seldom the times I can get a node to boot up and join on
fence domain, almost every time it hangs and i need to reboot and try
again or either reboot, enter single mode, disable cman, reboot, keep
trying to service cman start/stop. Sometimes another nodes can see the
node in domain but boot keeps hangs on "Starting fenced..."

########
[root at athos ~]# cman_tool services
type             level name     id       state
fence            0     default  00010001 none
[1 3 4]
dlm              1     clvmd    00020001 none
[1 3 4]
[root at athos ~]# cman_tool nodes -f
Node  Sts   Inc   Joined               Name
   0   M      0   2009-04-29 15:16:47
/dev/disk/by-id/scsi-3600a0b800048834e000014fb49dcc47b
   1   M   7556   2009-04-29 15:16:35  athos-priv
       Last fenced:   2009-04-29 15:13:49 by athos-ipmi
   2   X   7820                        porthos-priv
       Last fenced:   2009-04-29 15:31:01 by porthos-ipmi
       Node has not been fenced since it went down
   3   M   7696   2009-04-29 15:27:15  aramis-priv
       Last fenced:   2009-04-29 15:24:17 by aramis-ipmi
   4   M   8232   2009-04-29 16:12:34  dartagnan-priv
       Last fenced:   2009-04-29 16:09:53 by dartagnan-ipmi
[root at athos ~]# ssh root at aramis-priv
ssh: connect to host aramis-priv port 22: Connection refused
[root at athos ~]# ssh root at dartagnan-priv
ssh: connect to host dartagnan-priv port 22: Connection refused
[root at athos ~]#
#########

(I know how unreliable is ssh, but I'm seeing the console screen
hanged.. Just trying to show it)


The BIG log file: http://pastebin.com/f453c220
Every entry on this log after 16:54h is when node2 (porthos-priv
172.16.1.2) was booting and hanged on "Starting fenced..."


I've no more ideias to try solve this problem, any hints is
appreciated. If you need any other info, just tell me how to get it
and I'll post just after I read.


Very thanks, in advance.

--

Fl?vio do Carmo J?nior aka waKKu


From a.alawi at auckland.ac.nz  Thu Apr 30 01:09:19 2009
From: a.alawi at auckland.ac.nz (Abraham Alawi)
Date: Thu, 30 Apr 2009 13:09:19 +1200
Subject: [Linux-cluster] Question about controlling the start of services
	with RIND
In-Reply-To: <006901c9c818$e0731ae0$a15950a0$@gr>
References: <006901c9c818$e0731ae0$a15950a0$@gr>
Message-ID: <5113263A-4FD0-468B-B72B-ABA4E60F86EF@auckland.ac.nz>

I don't know what RIND is but is there any reason for not changing the  
sequence of startup services in the runlevel? (i.e. S30gfs, S40httpd)

On 29/04/2009, at 3:49 AM, Theophanis Kontogiannis wrote:

> Hello All,
>
> From you experience (cause my experience on the issue is none) could  
> I use RIND for the following?
>
> I have DRBD ? CLVMD ? GFS2 (that eventually gets mounted) and it is  
> configured as a parent ? child hierarchy service (with that  
> sequence) on a two nodes cluster (RHEL5.3).
>
> Each node starts the service for itself.
>
> Services like Apache, that have all the files under the file system  
> to be mounted, also try to start at the same time.
>
> They do not find their files (since the service filesystem takes  
> some time to start) and fail.
>
> Could I use RIND somehow to make the rest of the clustered services  
> to start only if the filesystem service has started?
>
> Thank you All for your time,
>
> Theophanis Kontogiannis
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

''''''''''''''''''''''''''''''''''''''''''''''''''''''
Abraham Alawi

Unix/Linux Systems Administrator
Science IT
University of Auckland
e: a.alawi at auckland.ac.nz
p: +64-9-373 7599, ext#: 87572

''''''''''''''''''''''''''''''''''''''''''''''''''''''

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090430/d5999547/attachment.htm>

From a.alawi at auckland.ac.nz  Thu Apr 30 01:19:42 2009
From: a.alawi at auckland.ac.nz (Abraham Alawi)
Date: Thu, 30 Apr 2009 13:19:42 +1200
Subject: [Linux-cluster] RHCS 4-node cluster: Networking/Membership issues
In-Reply-To: <58aa8d780904291321i21a8914fl956db51d61664a51@mail.gmail.com>
References: <58aa8d780904291321i21a8914fl956db51d61664a51@mail.gmail.com>
Message-ID: <1DA7B2AF-E920-4701-A49B-9806478144F7@auckland.ac.nz>

If not tried already, the following settings in cluster.conf might  
help especially "clean_start"

<fence_daemon clean_start="1" post_fail_delay="5" post_join_delay="15"/>
clean_start --> assume the cluster is in healthy state upon startup
post_fail_delay --> seconds to wait before fencing a node that thinks  
it should be fenced (i.e. lost connection with)
post_join_delay --> seconds to wait before fencing any node that  
should be fenced upon startup (right after joining)

On 30/04/2009, at 8:21 AM, Flavio Junior wrote:

> Hi folks,
>
> I've been trying to set up a 4-node RHCS+GFS cluster for awhile. I've
> another 2-node cluster using CentOS 5.3 without problem.
>
> Well.. My scenario is as follow:
>
> * System configuration and info: http://pastebin.com/f41d63624
>
> * Network: http://www.uploadimagens.com/upload/2ac9074fbb10c2479c59abe419880dc8.jpg
>  * Switches on loop are 3Com 2924 (or 2948)-SFP
>  * Have STP enabled (RSTP auto)
>  * IGMP Snooping Disabled as:
> http://magazine.redhat.com/2007/08/23/automated-failover-and-recovery-of-virtualized-guests-in-advanced-platform/
> comment 32
>  * Yellow lines are a fiber link 990ft (330mts) single-mode
>  * I'm using a dedicated tagged VLAN for cluster-heartbeat
>  * I'm using 2 NIC's with bonding mode=1 (active/backup) for
> heartbeat and 4 NIC's to "public"
>  * Every node has your public four cables plugged on same switch and
> Link-Aggregation on it
>  * Looking to the picture, that 2 switches with below fiber link is
> where the nodes are plugged. 2 nodes each build.
>
> SAN: http://img139.imageshack.us/img139/642/clusters.jpg
>  * Switches: Brocade TotalStorage 16SAN-B
>  * Storages: IBM DS4700 72A (using ERM for sync replication (storage  
> level))
>
> My problem is:
>
> I can't get the 4 nodes up. Every time the fourth (sometimes even the
> third) node becomes online i got one or two of them fenced. I keep
> getting messages about openais/cman, cpg_mcast_joined very often:
> --- snipped ---
> Apr 29 16:08:23 athos groupd[5393]: cpg_mcast_joined retry 1098900
> Apr 29 16:08:23 athos groupd[5393]: cpg_mcast_joined retry 1099000
> --- snipped ---
>
> Is really seldom the times I can get a node to boot up and join on
> fence domain, almost every time it hangs and i need to reboot and try
> again or either reboot, enter single mode, disable cman, reboot, keep
> trying to service cman start/stop. Sometimes another nodes can see the
> node in domain but boot keeps hangs on "Starting fenced..."
>
> ########
> [root at athos ~]# cman_tool services
> type             level name     id       state
> fence            0     default  00010001 none
> [1 3 4]
> dlm              1     clvmd    00020001 none
> [1 3 4]
> [root at athos ~]# cman_tool nodes -f
> Node  Sts   Inc   Joined               Name
>   0   M      0   2009-04-29 15:16:47
> /dev/disk/by-id/scsi-3600a0b800048834e000014fb49dcc47b
>   1   M   7556   2009-04-29 15:16:35  athos-priv
>       Last fenced:   2009-04-29 15:13:49 by athos-ipmi
>   2   X   7820                        porthos-priv
>       Last fenced:   2009-04-29 15:31:01 by porthos-ipmi
>       Node has not been fenced since it went down
>   3   M   7696   2009-04-29 15:27:15  aramis-priv
>       Last fenced:   2009-04-29 15:24:17 by aramis-ipmi
>   4   M   8232   2009-04-29 16:12:34  dartagnan-priv
>       Last fenced:   2009-04-29 16:09:53 by dartagnan-ipmi
> [root at athos ~]# ssh root at aramis-priv
> ssh: connect to host aramis-priv port 22: Connection refused
> [root at athos ~]# ssh root at dartagnan-priv
> ssh: connect to host dartagnan-priv port 22: Connection refused
> [root at athos ~]#
> #########
>
> (I know how unreliable is ssh, but I'm seeing the console screen
> hanged.. Just trying to show it)
>
>
> The BIG log file: http://pastebin.com/f453c220
> Every entry on this log after 16:54h is when node2 (porthos-priv
> 172.16.1.2) was booting and hanged on "Starting fenced..."
>
>
> I've no more ideias to try solve this problem, any hints is
> appreciated. If you need any other info, just tell me how to get it
> and I'll post just after I read.
>
>
> Very thanks, in advance.
>
> --
>
> Fl?vio do Carmo J?nior aka waKKu
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

''''''''''''''''''''''''''''''''''''''''''''''''''''''
Abraham Alawi

Unix/Linux Systems Administrator
Science IT
University of Auckland
e: a.alawi at auckland.ac.nz
p: +64-9-373 7599, ext#: 87572

''''''''''''''''''''''''''''''''''''''''''''''''''''''


From theophanis_kontogiannis at yahoo.gr  Thu Apr 30 06:44:19 2009
From: theophanis_kontogiannis at yahoo.gr (Theophanis Kontogiannis)
Date: Thu, 30 Apr 2009 09:44:19 +0300
Subject: [Linux-cluster] Question about controlling the start of
	services	with RIND
In-Reply-To: <5113263A-4FD0-468B-B72B-ABA4E60F86EF@auckland.ac.nz>
References: <006901c9c818$e0731ae0$a15950a0$@gr>
	<5113263A-4FD0-468B-B72B-ABA4E60F86EF@auckland.ac.nz>
Message-ID: <00f201c9c95f$188957e0$499c07a0$@gr>

Hello Abraham and All,

 
Thank you for your answer.

 
RIND is event scripting for rgmanager as described here http://sources.redhat.com/cluster/wiki/EventScripting

 
How would the startup sequence under runlevel affect the startup of a clustered service, since this is controlled by rgmanager? I do not see the link.

 
After all, should we not do chkconfig ?del <service> before configuring a service as clustered service?

 
Thank you All for your time,

 
Theophanis Kontogiannis

 
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Abraham Alawi
Sent: Thursday, April 30, 2009 4:09 AM
To: linux clustering
Subject: Re: [Linux-cluster] Question about controlling the start of services with RIND

 
I don't know what RIND is but is there any reason for not changing the sequence of startup services in the runlevel? (i.e. S30gfs, S40httpd)

 
On 29/04/2009, at 3:49 AM, Theophanis Kontogiannis wrote:


Hello All,

 
>From you experience (cause my experience on the issue is none) could I use RIND for the following?

 
I have DRBD ? CLVMD ? GFS2 (that eventually gets mounted) and it is configured as a parent ? child hierarchy service (with that sequence) on a two nodes cluster (RHEL5.3).

 
Each node starts the service for itself.

 
Services like Apache, that have all the files under the file system to be mounted, also try to start at the same time.

 
They do not find their files (since the service filesystem takes some time to start) and fail.

 
Could I use RIND somehow to make the rest of the clustered services to start only if the filesystem service has started?

 
Thank you All for your time,

 
Theophanis Kontogiannis

 
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

 
''''''''''''''''''''''''''''''''''''''''''''''''''''''
Abraham Alawi

Unix/Linux Systems Administrator
Science IT
University of Auckland
e: a.alawi at auckland.ac.nz
p: +64-9-373 7599, ext#: 87572

''''''''''''''''''''''''''''''''''''''''''''''''''''''

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090430/df036b8b/attachment.htm>

From billpp at gmail.com  Thu Apr 30 16:41:46 2009
From: billpp at gmail.com (Flavio Junior)
Date: Thu, 30 Apr 2009 13:41:46 -0300
Subject: [Linux-cluster] RHCS 4-node cluster: Networking/Membership issues
In-Reply-To: <1DA7B2AF-E920-4701-A49B-9806478144F7@auckland.ac.nz>
References: <58aa8d780904291321i21a8914fl956db51d61664a51@mail.gmail.com>
	<1DA7B2AF-E920-4701-A49B-9806478144F7@auckland.ac.nz>
Message-ID: <58aa8d780904300941l6146d881lfe102d8ebd6c8e32@mail.gmail.com>

Hi Abraham, thanks for your answer.

I'd configured your suggestion to cluster.conf but still gets the same problem.

Here is what I did:
* Disable cman init script on boot for all nodes
* Edit config file and copy it for all nodes
* reboot all
* start cman on node1 (OK)
* start cman on node2 (OK)
* start cman on node3 (problems to become member, fence node2)

Here is the log file with this process 'til the fence:
http://pastebin.com/f477e7114

PS: node1 and node2 as on the same switch at site1. node3 and node4 as
on the same switch at site2.

Thanks again, any other suggestions ?

I dont know if it would help but, is corosync a feasible option for
production use?

--

Fl?vio do Carmo J?nior aka waKKu

On Wed, Apr 29, 2009 at 10:19 PM, Abraham Alawi <a.alawi at auckland.ac.nz> wrote:
> If not tried already, the following settings in cluster.conf might help
> especially "clean_start"
>
> <fence_daemon clean_start="1" post_fail_delay="5" post_join_delay="15"/>
> clean_start --> assume the cluster is in healthy state upon startup
> post_fail_delay --> seconds to wait before fencing a node that thinks it
> should be fenced (i.e. lost connection with)
> post_join_delay --> seconds to wait before fencing any node that should be
> fenced upon startup (right after joining)
>
> On 30/04/2009, at 8:21 AM, Flavio Junior wrote:
>
>> Hi folks,
>>
>> I've been trying to set up a 4-node RHCS+GFS cluster for awhile. I've
>> another 2-node cluster using CentOS 5.3 without problem.
>>
>> Well.. My scenario is as follow:
>>
>> * System configuration and info: http://pastebin.com/f41d63624
>>
>> * Network:
>> http://www.uploadimagens.com/upload/2ac9074fbb10c2479c59abe419880dc8.jpg
>> ?* Switches on loop are 3Com 2924 (or 2948)-SFP
>> ?* Have STP enabled (RSTP auto)
>> ?* IGMP Snooping Disabled as:
>>
>> http://magazine.redhat.com/2007/08/23/automated-failover-and-recovery-of-virtualized-guests-in-advanced-platform/
>> comment 32
>> ?* Yellow lines are a fiber link 990ft (330mts) single-mode
>> ?* I'm using a dedicated tagged VLAN for cluster-heartbeat
>> ?* I'm using 2 NIC's with bonding mode=1 (active/backup) for
>> heartbeat and 4 NIC's to "public"
>> ?* Every node has your public four cables plugged on same switch and
>> Link-Aggregation on it
>> ?* Looking to the picture, that 2 switches with below fiber link is
>> where the nodes are plugged. 2 nodes each build.
>>
>> SAN: http://img139.imageshack.us/img139/642/clusters.jpg
>> ?* Switches: Brocade TotalStorage 16SAN-B
>> ?* Storages: IBM DS4700 72A (using ERM for sync replication (storage
>> level))
>>
>> My problem is:
>>
>> I can't get the 4 nodes up. Every time the fourth (sometimes even the
>> third) node becomes online i got one or two of them fenced. I keep
>> getting messages about openais/cman, cpg_mcast_joined very often:
>> --- snipped ---
>> Apr 29 16:08:23 athos groupd[5393]: cpg_mcast_joined retry 1098900
>> Apr 29 16:08:23 athos groupd[5393]: cpg_mcast_joined retry 1099000
>> --- snipped ---
>>
>> Is really seldom the times I can get a node to boot up and join on
>> fence domain, almost every time it hangs and i need to reboot and try
>> again or either reboot, enter single mode, disable cman, reboot, keep
>> trying to service cman start/stop. Sometimes another nodes can see the
>> node in domain but boot keeps hangs on "Starting fenced..."
>>
>> ########
>> [root at athos ~]# cman_tool services
>> type ? ? ? ? ? ? level name ? ? id ? ? ? state
>> fence ? ? ? ? ? ?0 ? ? default ?00010001 none
>> [1 3 4]
>> dlm ? ? ? ? ? ? ?1 ? ? clvmd ? ?00020001 none
>> [1 3 4]
>> [root at athos ~]# cman_tool nodes -f
>> Node ?Sts ? Inc ? Joined ? ? ? ? ? ? ? Name
>> ?0 ? M ? ? ?0 ? 2009-04-29 15:16:47
>> /dev/disk/by-id/scsi-3600a0b800048834e000014fb49dcc47b
>> ?1 ? M ? 7556 ? 2009-04-29 15:16:35 ?athos-priv
>> ? ? ?Last fenced: ? 2009-04-29 15:13:49 by athos-ipmi
>> ?2 ? X ? 7820 ? ? ? ? ? ? ? ? ? ? ? ?porthos-priv
>> ? ? ?Last fenced: ? 2009-04-29 15:31:01 by porthos-ipmi
>> ? ? ?Node has not been fenced since it went down
>> ?3 ? M ? 7696 ? 2009-04-29 15:27:15 ?aramis-priv
>> ? ? ?Last fenced: ? 2009-04-29 15:24:17 by aramis-ipmi
>> ?4 ? M ? 8232 ? 2009-04-29 16:12:34 ?dartagnan-priv
>> ? ? ?Last fenced: ? 2009-04-29 16:09:53 by dartagnan-ipmi
>> [root at athos ~]# ssh root at aramis-priv
>> ssh: connect to host aramis-priv port 22: Connection refused
>> [root at athos ~]# ssh root at dartagnan-priv
>> ssh: connect to host dartagnan-priv port 22: Connection refused
>> [root at athos ~]#
>> #########
>>
>> (I know how unreliable is ssh, but I'm seeing the console screen
>> hanged.. Just trying to show it)
>>
>>
>> The BIG log file: http://pastebin.com/f453c220
>> Every entry on this log after 16:54h is when node2 (porthos-priv
>> 172.16.1.2) was booting and hanged on "Starting fenced..."
>>
>>
>> I've no more ideias to try solve this problem, any hints is
>> appreciated. If you need any other info, just tell me how to get it
>> and I'll post just after I read.
>>
>>
>> Very thanks, in advance.
>>
>> --
>>
>> Fl?vio do Carmo J?nior aka waKKu
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> ''''''''''''''''''''''''''''''''''''''''''''''''''''''
> Abraham Alawi
>
> Unix/Linux Systems Administrator
> Science IT
> University of Auckland
> e: a.alawi at auckland.ac.nz
> p: +64-9-373 7599, ext#: 87572
>
> ''''''''''''''''''''''''''''''''''''''''''''''''''''''
>
>


From lhh at redhat.com  Thu Apr 30 16:53:46 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 30 Apr 2009 12:53:46 -0400
Subject: [Linux-cluster] dynamically reconfigure qdiskd heuristic:
	possible?
In-Reply-To: <561c252c0904270447w328acbbta758e4cd89d7a3cc@mail.gmail.com>
References: <561c252c0904270447w328acbbta758e4cd89d7a3cc@mail.gmail.com>
Message-ID: <1241110426.5206.155.camel@ayanami>

On Mon, 2009-04-27 at 13:47 +0200, Gianluca Cecchi wrote:
> Is there any command for this? I'm on rh el 5.3
> For example if I want change the line
> 
> <heuristic interval="2" program="ping -c1 -w1 10.4.5.250" score="1" tko="3"/>
> 
> so that it becomes
> <heuristic interval="2" program="ping -c1 -w1 10.4.5.250" score="1" tko="30"/>
> 
> It seems that with
> 1) ccs_tool update /etc/cluster/cluster.conf
> 2) cman_tool version -r new_vers_number
> 
> it doesn't work dynamically....

Correct, heuristics and timings require a restart.  STABLE3 allows
reconfiguring some things dynamically, but not heuristics because
changing them is very hard to synchronize cluster-wide (particularly
when for example qdiskd is blocked waiting for a write or something).

Instead, start up the cluster (on all nodes) make the update, then
restart qdiskd on each node in turn.

-- Lon


From virginian at blueyonder.co.uk  Thu Apr 30 17:22:53 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Thu, 30 Apr 2009 18:22:53 +0100
Subject: [Linux-cluster] Hardware options
Message-ID: <B291C4D226224501B126BC9C52E59248@Desktop>

I was hit by a rather large electricity bill recently (at home). My current cluster set up comprises 2 x HP Proliant DL380 G3s and an MSA 500 storage array (all three are very heavy on the juice!). I decided that if I want to continue playing with RHCS at home I needed to look for a cheaper, greener option. I can easily get a couple of PC's (dual or quad core cpus and plenty of RAM for running a virtualised cluster) but the stumbling block has been a cheap low power shared storage solution. The best that I have come up with so far is an offering from Maxtor and from LaCie, which basically comprises an esata disk enclosure that has two firewire 800 ports. I believe that Linux will support these dual firewire 800 enclosures but I am a little concerned about the speed (91MB/s) in comparison to a SCSI disk array. Ideally, I would prefer a disk enclosure / array with dual esata ports but I haven't been able to find anything. 

My question is, does anybody have a low cost hardware specification that they are running xen and RHCS on with shared storage that won't cost the earth and won't hit me in the wallet when it comes to paying the electricity bill? 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20090430/611be5dc/attachment.htm>

From lhh at redhat.com  Thu Apr 30 18:26:19 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 30 Apr 2009 14:26:19 -0400
Subject: [Linux-cluster] Hardware options
In-Reply-To: <B291C4D226224501B126BC9C52E59248@Desktop>
References: <B291C4D226224501B126BC9C52E59248@Desktop>
Message-ID: <1241115979.5206.161.camel@ayanami>

On Thu, 2009-04-30 at 18:22 +0100, Virginian wrote:
>  
> My question is, does anybody have a low cost hardware specification
> that they are running xen and RHCS on with shared storage that won't
> cost the earth and won't hit me in the wallet when it comes to paying
> the electricity bill? 

If you're not trying to migrate VMs between physical machines and are
primarily concerned with tinkering with other parts of the cluster, you
can use a notebook or desktop w/ VT support, Fedora 10, and KVM.  From
there, you can install RHEL5 or the operating system of your choice in
to the VMs and cluster the VMs.

It doesn't protect you against hardware failures at all, but it's a good
low-cost development platform.

Obviously, you can't put a VM in a VM, so you can't tinker with VM live
migration or anything like that.

-- Lon


From lhh at redhat.com  Thu Apr 30 18:27:25 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 30 Apr 2009 14:27:25 -0400
Subject: [Linux-cluster] qdiskd error `does match kernel's reported
	sector size'
In-Reply-To: <200904291547.44205.mrugeshkarnik@gmail.com>
References: <200904291547.44205.mrugeshkarnik@gmail.com>
Message-ID: <1241116045.5206.163.camel@ayanami>

On Wed, 2009-04-29 at 15:47 +0530, Mrugesh Karnik wrote:
> Hi,
> 
> I have a two node cluster configured with qdisk. The original setup is with 
> both nodes being CentOS 5.3. Today, I decided to see if I could get GFS2 
> working with Debian. So I installed Lenny on one of the nodes. Made it a dual 
> boot system with the same cluster configuration. The other node wasn't touched 
> at all and was always online with the qdisk making the cluster quorate.

What tree did you build on the Debian node?  This was a problem awhile
ago but (I thought) has been fixed for some time.

-- Lon


From lhh at redhat.com  Thu Apr 30 18:29:18 2009
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 30 Apr 2009 14:29:18 -0400
Subject: [Linux-cluster] Question about controlling the start of
	services with RIND
In-Reply-To: <006901c9c818$e0731ae0$a15950a0$@gr>
References: <006901c9c818$e0731ae0$a15950a0$@gr>
Message-ID: <1241116158.5206.165.camel@ayanami>

On Tue, 2009-04-28 at 18:49 +0300, Theophanis Kontogiannis wrote:

> Could I use RIND somehow to make the rest of the clustered services to
> start only if the filesystem service has started?

Yeah, just set:

<service name="foo" depend="file_system_service_name" ...>
  ...
</service>

-- Lon


From virginian at blueyonder.co.uk  Thu Apr 30 22:21:46 2009
From: virginian at blueyonder.co.uk (Virginian)
Date: Thu, 30 Apr 2009 23:21:46 +0100
Subject: [Linux-cluster] Hardware options
References: <B291C4D226224501B126BC9C52E59248@Desktop>
	<1241115979.5206.161.camel@ayanami>
Message-ID: <D152B2CA863449249689A77F7E87B2F2@Desktop>

Hi Lon,

That is a good solution and could be a possible second cluster for me to 
tinker with. However, I was rather hoping to use two nodes though and shared 
storage to have two physical nodes clustered and then tinker with a virtual 
cluster too (same as my existing cluster set up but a lot less heavy on the 
electricity bill!!). I haven't tried kvm yet but it is something I would 
like to have a play with (I've used xen and vmware up to now).

Regards

John

----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "linux clustering" <linux-cluster at redhat.com>
Sent: Thursday, April 30, 2009 7:26 PM
Subject: Re: [Linux-cluster] Hardware options


> On Thu, 2009-04-30 at 18:22 +0100, Virginian wrote:
>>
>> My question is, does anybody have a low cost hardware specification
>> that they are running xen and RHCS on with shared storage that won't
>> cost the earth and won't hit me in the wallet when it comes to paying
>> the electricity bill?
>
> If you're not trying to migrate VMs between physical machines and are
> primarily concerned with tinkering with other parts of the cluster, you
> can use a notebook or desktop w/ VT support, Fedora 10, and KVM.  From
> there, you can install RHEL5 or the operating system of your choice in
> to the VMs and cluster the VMs.
>
> It doesn't protect you against hardware failures at all, but it's a good
> low-cost development platform.
>
> Obviously, you can't put a VM in a VM, so you can't tinker with VM live
> migration or anything like that.
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From jeff.sturm at eprize.com  Thu Apr 30 22:52:09 2009
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Thu, 30 Apr 2009 18:52:09 -0400
Subject: [Linux-cluster] Hardware options
In-Reply-To: <D152B2CA863449249689A77F7E87B2F2@Desktop>
References: <B291C4D226224501B126BC9C52E59248@Desktop><1241115979.5206.161.camel@ayanami>
	<D152B2CA863449249689A77F7E87B2F2@Desktop>
Message-ID: <64D0546C5EBBD147B75DE133D798665F02FDBA7E@hugo.eprize.local>

I wonder if you can host a small cluster on Amazon's EC2 infrastructure?
Anyone tried it?

It's not free, but may be very inexpensive for small deployments, and
can be dynamically provisioned to save $$$.  It could be less than your
current electricity bill.

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Virginian
> Sent: Thursday, April 30, 2009 6:22 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Hardware options
> 
> Hi Lon,
> 
> That is a good solution and could be a possible second 
> cluster for me to tinker with. However, I was rather hoping 
> to use two nodes though and shared storage to have two 
> physical nodes clustered and then tinker with a virtual 
> cluster too (same as my existing cluster set up but a lot 
> less heavy on the electricity bill!!). I haven't tried kvm 
> yet but it is something I would like to have a play with 
> (I've used xen and vmware up to now).
> 
> Regards
> 
> John
> 
> ----- Original Message -----
> From: "Lon Hohberger" <lhh at redhat.com>
> To: "linux clustering" <linux-cluster at redhat.com>
> Sent: Thursday, April 30, 2009 7:26 PM
> Subject: Re: [Linux-cluster] Hardware options
> 
> 
> > On Thu, 2009-04-30 at 18:22 +0100, Virginian wrote:
> >>
> >> My question is, does anybody have a low cost hardware specification
> >> that they are running xen and RHCS on with shared storage 
> that won't
> >> cost the earth and won't hit me in the wallet when it 
> comes to paying
> >> the electricity bill?
> >
> > If you're not trying to migrate VMs between physical 
> machines and are
> > primarily concerned with tinkering with other parts of the 
> cluster, you
> > can use a notebook or desktop w/ VT support, Fedora 10, and 
> KVM.  From
> > there, you can install RHEL5 or the operating system of 
> your choice in
> > to the VMs and cluster the VMs.
> >
> > It doesn't protect you against hardware failures at all, 
> but it's a good
> > low-cost development platform.
> >
> > Obviously, you can't put a VM in a VM, so you can't tinker 
> with VM live
> > migration or anything like that.
> >
> > -- Lon
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> > 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
>