From wcheng at redhat.com Tue Jan 1 17:00:32 2008
From: wcheng at redhat.com (Wendy Cheng)
Date: Tue, 01 Jan 2008 12:00:32 -0500
Subject: [Linux-cluster] GFS performance
In-Reply-To:
References: <1198770380.4932.23.camel@WSBID06223>
Message-ID: <477A71B0.1080804@redhat.com>

Kamal Jain wrote:
> A challenge we're dealing with is a massive number of small files, so
> there is a lot of file-level overhead, and as you saw in the
> charts...the random reads and writes were not friends of GFS.
>
It is expected that GFS2 would do better in this area but this does
*not* imply GFS(1) is not fixable. One thing would be helpful is sending
us the benchmark (or test program that can reasonably represent your
application IO patterns) you used to generate the performance data. Then
we'll see what can be done from there ....

-- Wendy

From jos at xos.nl Tue Jan 1 17:34:18 2008
From: jos at xos.nl (Jos Vos)
Date: Tue, 1 Jan 2008 18:34:18 +0100
Subject: [Linux-cluster] GFS performance
In-Reply-To: <477A71B0.1080804@redhat.com>
References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com>
Message-ID: <20080101173418.GB27030@jasmine.xos.nl>

On Tue, Jan 01, 2008 at 12:00:32PM -0500, Wendy Cheng wrote:
> It is expected that GFS2 would do better in this area but this does
> *not* imply GFS(1) is not fixable. One thing would be helpful is sending
> us the benchmark (or test program that can reasonably represent your
> application IO patterns) you used to generate the performance data. Then
> we'll see what can be done from there ....

Take a typical public mirror tree (like Fedora, but FreeBSD gives you
even more fun, as it has *huge* directories), start the rsync service
and let a bunch of clients rsync some trees.

--
-- Jos Vos
-- X/OS Experts in Open Systems BV | Phone: +31 20 6938364
-- Amsterdam, The Netherlands | Fax: +31 20 6948204

From wcheng at redhat.com Tue Jan 1 17:56:12 2008
From: wcheng at redhat.com (Wendy Cheng)
Date: Tue, 01 Jan 2008 12:56:12 -0500
Subject: [Linux-cluster] gfs2 hang
In-Reply-To: <20071228085734.GA23405@jasmine.xos.nl>
References: <477419CF.1040002@cgl.ucsf.edu> <20071227214025.GB16736@jasmine.xos.nl> <47741D14.7020109@cgl.ucsf.edu> <20071228085734.GA23405@jasmine.xos.nl>
Message-ID: <477A7EBC.7060201@redhat.com>

Jos Vos wrote:
>
>The one thing that's horribly wrong in some applications is performance.
>If you need to have large amounts of files and frequent directory scans
>(i.e. rsync etc.), you're lost.
>
>
>
On GFS(1) part, the glock trimming patch
(http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4)
was developed for customers with rsync issues. Field data have shown
positive results. It is released on RHEL 5.1, as well on RHEL 4.6. Check
out the usage part of above write-up.

-- Wendy

From jos at xos.nl Tue Jan 1 17:44:22 2008
From: jos at xos.nl (Jos Vos)
Date: Tue, 1 Jan 2008 18:44:22 +0100
Subject: [Linux-cluster] gfs2 hang
In-Reply-To: <477A7EBC.7060201@redhat.com>
References: <477419CF.1040002@cgl.ucsf.edu> <20071227214025.GB16736@jasmine.xos.nl> <47741D14.7020109@cgl.ucsf.edu> <20071228085734.GA23405@jasmine.xos.nl> <477A7EBC.7060201@redhat.com>
Message-ID: <20080101174422.GD27030@jasmine.xos.nl>

On Tue, Jan 01, 2008 at 12:56:12PM -0500, Wendy Cheng wrote:
> On GFS(1) part, the glock trimming patch
> (http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4)
> was developed for customers with rsync issues. Field data have shown
> positive results. It is released on RHEL 5.1, as well on RHEL 4.6. Check
> out the usage part of above write-up.

Checking the 5.1 behavior is on my todo-list... will post the results
afterwards. Current experiences are based on 5.0, yes.

--
-- Jos Vos
-- X/OS Experts in Open Systems BV | Phone: +31 20 6938364
-- Amsterdam, The Netherlands | Fax: +31 20 6948204

From raycharles_man at yahoo.com Wed Jan 2 03:09:21 2008
From: raycharles_man at yahoo.com (Ray Charles)
Date: Tue, 1 Jan 2008 19:09:21 -0800 (PST)
Subject: [Linux-cluster] clvmd fails to start on second node.
Message-ID: <580534.86509.qm@web32111.mail.mud.yahoo.com>

Hi,

I am following the directions of the VMClusterCookbook and have the
early makings of a virtualized two node cluster. On both guest-nodes I
can initialize cman and achieve quorate. However, things go sideways
when I try to initialize clvmd on both nodes. The first node brings up
clvmd cleanly but the second reports FAILED upon coming up and the
volumes I wish to mount become inaccessible. Further, I've no
firewalling on the guests. I've read a similar post but that was from
many months ago, on older version not much use. Logs and output below..

Thanks in advance for any responses!

From the consoles..

[root at vsp07 ~]# /etc/init.d/clvmd start
Starting clvmd: dlm: Using TCP for communications [ OK ]
Activating VGs: 2 logical volume(s) in volume group "VolGroup00" now active [ OK ]
[root at vsp07 ~]#

[root at vsp08 ~]# /etc/init.d/clvmd start
Starting clvmd: clvmd startup timed out [FAILED]

From the /var/log/messages...

[root at vsp07 ~]# tail -f /var/log/messages
Jan 1 19:28:13 vsp07 kernel: dlm: Using TCP for communications
Jan 1 19:28:14 vsp07 clvmd: Cluster LVM daemon started - connected to CMAN
Jan 1 19:28:27 vsp07 kernel: dlm: connecting to 2
Jan 1 19:28:27 vsp07 kernel: dlm: connect from non cluster node

[root at vsp08 ~]# tail -f /var/log/messages
Jan 1 19:28:27 vsp08 kernel: dlm: Using TCP for communications
Jan 1 19:28:27 vsp08 kernel: dlm: connect from non cluster node
Jan 1 19:28:27 vsp08 kernel: dlm: connecting to 1

Here is my cluster.conf file for the guests CentOS-5.1..

____________________________________________________________________________________
Never miss a thing. Make Yahoo your home page.
http://www.yahoo.com/r/hs

From pcaulfie at redhat.com Wed Jan 2 07:55:39 2008
From: pcaulfie at redhat.com (Patrick Caulfeld)
Date: Wed, 02 Jan 2008 07:55:39 +0000
Subject: [Linux-cluster] clvmd fails to start on second node.
In-Reply-To: <580534.86509.qm@web32111.mail.mud.yahoo.com>
References: <580534.86509.qm@web32111.mail.mud.yahoo.com>
Message-ID: <477B437B.6020503@redhat.com>

Ray Charles wrote:
>
> Hi,
>
> I am following the directions of the VMClusterCookbook
> and have the early makings of a virtualized two node
> cluster. On both guest-nodes I can initialize cman and
> achieve quorate. However, things go sideways when I
> try to initialize clvmd on both nodes. The first node
> brings up clvmd cleanly but the second reports FAILED
> upon coming up and the volumes I wish to mount become
> inaccessible. Further, I've no firewalling on the
> guests. I've read a similar post but that was from
> many months ago, on older version not much use. Logs
> and output below..
>
> Thanks in advance for any responses!
>
> From the consoles..
> > [root at vsp07 ~]# /etc/init.d/clvmd start > Starting clvmd: dlm: Using TCP for communications > [ OK ] > Activating VGs: 2 logical volume(s) in volume group > "VolGroup00" now active > [ OK ] > [root at vsp07 ~]# > > > [root at vsp08 ~]# /etc/init.d/clvmd start > Starting clvmd: clvmd startup timed out > > [FAILED] > >>From the /var/log/messages... > > [root at vsp07 ~]# tail -f /var/log/messages > Jan 1 19:28:13 vsp07 kernel: dlm: Using TCP for > communications > Jan 1 19:28:14 vsp07 clvmd: Cluster LVM daemon > started - connected to CMAN > Jan 1 19:28:27 vsp07 kernel: dlm: connecting to 2 > Jan 1 19:28:27 vsp07 kernel: dlm: connect from non > cluster node > > [root at vsp08 ~]# tail -f /var/log/messages > Jan 1 19:28:27 vsp08 kernel: dlm: Using TCP for > communications > Jan 1 19:28:27 vsp08 kernel: dlm: connect from non > cluster node > Jan 1 19:28:27 vsp08 kernel: dlm: connecting to 1 > That looks like an old and buggy dlm kernel module. I don't know off-hand what the version numbers are, but see if you can find an updated version. Patrick From swhiteho at redhat.com Wed Jan 2 09:21:22 2008 From: swhiteho at redhat.com (Steven Whitehouse) Date: Wed, 02 Jan 2008 09:21:22 +0000 Subject: [Linux-cluster] gfs2 hang In-Reply-To: <477419CF.1040002@cgl.ucsf.edu> References: <477419CF.1040002@cgl.ucsf.edu> Message-ID: <1199265682.22038.15.camel@quoit> Hi, On Thu, 2007-12-27 at 13:31 -0800, Scooter Morris wrote: > Greetings, > We've got a two-node cluster running RHEL 5.1 that we've been > experimenting with and have discovered a problem with gfs2. As part of > our build environment, we have some find scripts that walk a directory tree: > > #! /bin/sh > for html in `/usr/bin/find curGenerated -name \*.html -print` ; do \ > cat $html > tmpCR.html ; \ > /bin/mv tmpCR.html $html ; \ > done > > The curGenerated directory has about 141 subdirectories, each of which > has from 2-10 subdirectories. What we find is that this find script > will hang the operating system when it is executed within a gfs2 > partition that is shared between the two nodes. Fencing is configured > and detects the hung node and restarts it, but that's not much of a > consolation. The gfs2 partition lives on a fibreChannel array (HP > EVA5000), and quotas are not turned on. The gfs2 filesystem continues > to operate normally on the other node. > > Is this a known bug in gfs2? Is there something we could do to help > find this problem? > > Thanks! > > -- scooter > I think this is probably a known bug, bz #404711 which is fixed in upstream and also for 5.2. It triggers when rename is called in the situation where it needs to allocate an extra block for the directory and also there is a target file being unlinked, and also where both of these operations happen to occur in the same resource group. If this doesn't turn out to be the case, then please file a bugzilla, Steve. From janne.peltonen at helsinki.fi Wed Jan 2 11:37:35 2008 From: janne.peltonen at helsinki.fi (Janne Peltonen) Date: Wed, 2 Jan 2008 13:37:35 +0200 Subject: [Linux-cluster] #48: Unable to obtain cluster lock: Invalid argument Message-ID: <20080102113734.GV19197@helsinki.fi> Hi. 
After running a cluster node in a production cluster since July, I got the folllowing error: #48: Unable to obtain cluster lock: Invalid argument Which resulted in a reboot: --clip-- Dec 27 02:50:31 pcn1 clurgmgrd[6217]: #48: Unable to obtain cluster lock: Invalid argument Dec 27 02:50:31 pcn1 clurgmgrd[6217]: Stopping service service:p01 Dec 27 02:50:34 pcn1 in.rdiscd[30325]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use Dec 27 02:50:34 pcn1 in.rdiscd[30325]: Failed joining addresses Dec 27 02:50:38 pcn1 snmpd[15929]: error on subcontainer 'ia_addr' insert (-1) Dec 27 02:50:38 pcn1 snmpd[15929]: error on subcontainer 'ia_addr' insert (-1) Dec 27 02:50:38 pcn1 snmpd[15929]: error on subcontainer '' insert (-1) Dec 27 02:50:38 pcn1 snmpd[15929]: error on subcontainer '' insert (-1) Dec 27 02:50:45 pcn1 clurgmgrd[6217]: Service service:p01 is recovering Dec 27 02:50:45 pcn1 clurgmgrd[6217]: Recovering failed service service:p01 Dec 27 02:50:45 pcn1 kernel: dlm: add_to_waiters error 1 Dec 27 02:50:45 pcn1 kernel: dlm: remove_from_waiters error Dec 27 02:50:45 pcn1 kernel: dlm: rgmanager: receive_unlock_reply not on waiters Dec 27 02:50:45 pcn1 clurgmgrd[6216]: Watchdog: Daemon died, rebooting... Dec 27 02:50:45 pcn1 kernel: md: stopping all md devices. Dec 27 02:55:23 pcn1 syslogd 1.4.1: restart. --clip-- Other members of the cluster noticed the missing member, fenced it, failed services over, and back (when the missing node had rejoined): --clip-- Dec 27 02:50:56 pcn2 openais[4588]: [TOTEM] The token was lost in the OPERATIONAL state. Dec 27 02:50:56 pcn2 openais[4588]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes). Dec 27 02:50:56 pcn2 openais[4588]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes). Dec 27 02:50:56 pcn2 openais[4588]: [TOTEM] entering GATHER state from 2. Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] entering GATHER state from 11. Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] Saving state aru 6a4 high seq received 6a4 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] entering COMMIT state. Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] entering RECOVERY state. 
Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] position [0] member 10.3.0.10: Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] previous ring seq 324 rep 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] aru 6a4 high delivered 6a4 received flag 0 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] position [1] member 10.3.0.12: Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] previous ring seq 324 rep 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] aru 6a4 high delivered 6a4 received flag 0 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] position [2] member 10.3.0.13: Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] previous ring seq 324 rep 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] aru 6a4 high delivered 6a4 received flag 0 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] position [3] member 10.3.0.14: Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] previous ring seq 324 rep 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] aru 6a4 high delivered 6a4 received flag 0 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] position [4] member 10.3.0.15: Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] previous ring seq 324 rep 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] aru 6a4 high delivered 6a4 received flag 1 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] position [5] member 10.3.0.16: Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] previous ring seq 324 rep 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] aru 6a4 high delivered 6a4 received flag 1 Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] Did not need to originate any messages in recovery. Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] Storing new sequence id for ring 14c Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] New Configuration: Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.10) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.12) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.13) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.14) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.15) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.16) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] Members Left: Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] Members Joined: Dec 27 02:51:01 pcn2 openais[4588]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] New Configuration: Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.10) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.12) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.13) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.14) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.15) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.16) Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] Members Left: Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] Members Joined: Dec 27 02:51:01 pcn2 openais[4588]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:51:01 pcn2 openais[4588]: [TOTEM] entering OPERATIONAL state. 
Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.10 Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.12 Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.13 Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.14 Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.15 Dec 27 02:51:01 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.16 Dec 27 02:51:01 pcn2 openais[4588]: [CPG ] got joinlist message from node 3 Dec 27 02:51:01 pcn2 openais[4588]: [CPG ] got joinlist message from node 4 Dec 27 02:51:01 pcn2 openais[4588]: [CPG ] got joinlist message from node 5 Dec 27 02:51:01 pcn2 openais[4588]: [CPG ] got joinlist message from node 6 Dec 27 02:51:01 pcn2 openais[4588]: [CPG ] got joinlist message from node 100 Dec 27 02:51:01 pcn2 openais[4588]: [CPG ] got joinlist message from node 2 Dec 27 02:51:01 pcn2 kernel: dlm: closing connection to node 1 Dec 27 02:51:01 pcn2 fenced[4614]: pcn1-hb not a cluster member after 0 sec post_fail_delay Dec 27 02:51:01 pcn2 fenced[4614]: fencing node "pcn1-hb" Dec 27 02:52:13 pcn2 fenced[4614]: fence "pcn1-hb" success Dec 27 02:52:18 pcn2 ccsd[4541]: Attempt to close an unopened CCS descriptor (799075500). Dec 27 02:52:18 pcn2 ccsd[4541]: Error while processing disconnect: Invalid request descriptor Dec 27 02:52:20 pcn2 clurgmgrd[6262]: Taking over service service:p01 from down member pcn1-hb Dec 27 02:52:20 pcn2 clurgmgrd[6262]: Taking over service service:i01 from down member pcn1-hb Dec 27 02:52:20 pcn2 kernel: kjournald starting. Commit interval 5 seconds Dec 27 02:52:20 pcn2 kernel: EXT3 FS on dm-65, internal journal Dec 27 02:52:20 pcn2 kernel: EXT3-fs: mounted filesystem with ordered data mode. Dec 27 02:52:21 pcn2 clurgmgrd[6262]: Taking over service service:i13 from down member pcn1-hb Dec 27 02:52:21 pcn2 in.rdiscd[2158]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use Dec 27 02:52:21 pcn2 in.rdiscd[2158]: Failed joining addresses Dec 27 02:52:22 pcn2 kernel: kjournald starting. Commit interval 5 seconds Dec 27 02:52:22 pcn2 kernel: EXT3-fs warning: maximal mount count reached, running e2fsck is recommended Dec 27 02:52:22 pcn2 kernel: EXT3 FS on dm-14, internal journal Dec 27 02:52:22 pcn2 kernel: EXT3-fs: recovery complete. Dec 27 02:52:22 pcn2 kernel: EXT3-fs: mounted filesystem with ordered data mode. Dec 27 02:52:24 pcn2 clurgmgrd[6262]: Service service:p01 started Dec 27 02:52:25 pcn2 last message repeated 2 times Dec 27 02:52:27 pcn2 kernel: kjournald starting. Commit interval 5 seconds Dec 27 02:52:27 pcn2 kernel: EXT3 FS on dm-2, internal journal Dec 27 02:52:27 pcn2 kernel: EXT3-fs: dm-2: 3 orphan inodes deleted Dec 27 02:52:27 pcn2 kernel: EXT3-fs: recovery complete. Dec 27 02:52:27 pcn2 kernel: EXT3-fs: mounted filesystem with ordered data mode. Dec 27 02:52:29 pcn2 kernel: kjournald starting. Commit interval 5 seconds Dec 27 02:52:29 pcn2 kernel: EXT3-fs warning: maximal mount count reached, running e2fsck is recommended Dec 27 02:52:29 pcn2 kernel: EXT3 FS on dm-38, internal journal Dec 27 02:52:29 pcn2 kernel: EXT3-fs: recovery complete. Dec 27 02:52:29 pcn2 kernel: EXT3-fs: mounted filesystem with ordered data mode. Dec 27 02:52:30 pcn2 in.rdiscd[3313]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use Dec 27 02:52:30 pcn2 in.rdiscd[3313]: Failed joining addresses Dec 27 02:52:32 pcn2 clurgmgrd[6262]: Service service:i13 started Dec 27 02:52:35 pcn2 kernel: kjournald starting. 
Commit interval 5 seconds Dec 27 02:52:35 pcn2 kernel: EXT3-fs warning: maximal mount count reached, running e2fsck is recommended Dec 27 02:52:35 pcn2 kernel: EXT3 FS on dm-26, internal journal Dec 27 02:52:35 pcn2 kernel: EXT3-fs: recovery complete. Dec 27 02:52:35 pcn2 kernel: EXT3-fs: mounted filesystem with ordered data mode. Dec 27 02:52:37 pcn2 in.rdiscd[3833]: setsockopt (IP_ADD_MEMBERSHIP): Address already in use Dec 27 02:52:37 pcn2 in.rdiscd[3833]: Failed joining addresses Dec 27 02:52:38 pcn2 clurgmgrd[6262]: Service service:i01 started Dec 27 02:53:25 pcn2 last message repeated 2 times Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] entering GATHER state from 11. Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] Saving state aru c8 high seq received c8 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] entering COMMIT state. Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] entering RECOVERY state. Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [0] member 10.3.0.10: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [1] member 10.3.0.11: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 288 rep 10.3.0.11 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru 9 high delivered 9 received flag 0 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [2] member 10.3.0.12: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [3] member 10.3.0.13: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [4] member 10.3.0.14: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [5] member 10.3.0.15: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru c8 high delivered c8 received flag 1 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] position [6] member 10.3.0.16: Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] aru c8 high delivered c8 received flag 1 Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] Did not need to originate any messages in recovery. Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] Storing new sequence id for ring 150 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] New Configuration: Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.10) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.12) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.13) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.14) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.15) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] Members Left: Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] Members Joined: Dec 27 02:55:26 pcn2 openais[4588]: [SYNC ] This node is within the primary component and will provide se rvice. 
Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] New Configuration: Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.10) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.12) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.13) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.14) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.15) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.16) Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] Members Left: Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] Members Joined: Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:55:26 pcn2 openais[4588]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:55:26 pcn2 openais[4588]: [TOTEM] entering OPERATIONAL state. Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.10 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.11 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.12 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.13 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.14 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.15 Dec 27 02:55:26 pcn2 openais[4588]: [CLM ] got nodejoin message 10.3.0.16 Dec 27 02:55:26 pcn2 openais[4588]: [CPG ] got joinlist message from node 100 Dec 27 02:55:26 pcn2 openais[4588]: [CPG ] got joinlist message from node 2 Dec 27 02:55:26 pcn2 openais[4588]: [CPG ] got joinlist message from node 3 Dec 27 02:55:26 pcn2 openais[4588]: [CPG ] got joinlist message from node 4 Dec 27 02:55:26 pcn2 openais[4588]: [CPG ] got joinlist message from node 5 Dec 27 02:55:26 pcn2 openais[4588]: [CPG ] got joinlist message from node 6 Dec 27 02:55:35 pcn2 kernel: dlm: connecting to 1 --clip-- --clip-- Dec 27 02:55:24 pcn1 ccsd[4132]: Starting ccsd 2.0.69: Dec 27 02:55:24 pcn1 ccsd[4132]: Built: Jun 27 2007 15:21:32 Dec 27 02:55:24 pcn1 ccsd[4132]: Copyright (C) Red Hat, Inc. 2004 All rights reserved. Dec 27 02:55:24 pcn1 ccsd[4132]: cluster.conf (cluster name = mappi-primary, version = 109) found. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] AIS Executive Service RELEASE 'subrev 1324 version 0.80.2' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contribu tors. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Copyright (C) 2006 Red Hat, Inc. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] AIS Executive Service: started and ready to provide service. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Using default multicast address of 239.192.46.199 Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_cpg loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais cluster closed process g roup service v1.01' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_cfg loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais configuration service' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_msg loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais message service B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_lck loaded. 
Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais distributed locking serv ice B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_evt loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais event service B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_ckpt loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais checkpoint service B.01. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_amf loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais availability management framework B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_clm loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais cluster membership servi ce B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_evs loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais extended virtual synchro ny service' Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] openais component openais_cman loaded. Dec 27 02:55:26 pcn1 openais[4143]: [MAIN ] Registering service handler 'openais CMAN membership service 2.01' Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] join (60 ms) send_join (0 ms) consensus (4800 ms) merge (200 ms) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1500 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] window size per rotation (50 messages) maximum messages per r otation (17 messages) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] send threads (0 threads) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] RRP token expired timeout (495 ms) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] RRP token problem counter (2000 ms) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] RRP threshold (10 problem count) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] RRP mode set to none. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] heartbeat_failures_allowed (0) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] max_network_delay (50 ms) Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allow ed > 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes). Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes). Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] The network interface [10.3.0.11] is now up. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Created or loaded sequence id 284.10.3.0.11 for this ring. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering GATHER state from 15. 
Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais extended virtual synchr ony service' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais cluster membership serv ice B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais availability management framework B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais checkpoint service B.01 .01' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais event service B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais distributed locking ser vice B.01.01' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais message service B.01.01 ' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais configuration service' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais cluster closed process group service v1.01' Dec 27 02:55:26 pcn1 openais[4143]: [SERV ] Initialising service handler 'openais CMAN membership service 2.01' Dec 27 02:55:26 pcn1 openais[4143]: [CMAN ] CMAN 2.0.69 (built Jun 27 2007 15:21:36) started Dec 27 02:55:26 pcn1 openais[4143]: [SYNC ] Not using a virtual synchrony filter. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Creating commit token because I am the rep. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Saving state aru 0 high seq received 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering COMMIT state. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering RECOVERY state. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [0] member 10.3.0.11: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 284 rep 10.3.0.11 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru 0 high delivered 0 received flag 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Did not need to originate any messages in recovery. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Storing new sequence id for ring 120 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Sending initial ORF token Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] New Configuration: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Left: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Joined: Dec 27 02:55:26 pcn1 openais[4143]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] New Configuration: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Left: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Joined: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:55:26 pcn1 openais[4143]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering OPERATIONAL state. Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.11 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering GATHER state from 11. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Saving state aru 9 high seq received 9 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering COMMIT state. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering RECOVERY state. 
Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [0] member 10.3.0.10: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [1] member 10.3.0.11: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 288 rep 10.3.0.11 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru 9 high delivered 9 received flag 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [2] member 10.3.0.12: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [3] member 10.3.0.13: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [4] member 10.3.0.14: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru c8 high delivered c8 received flag 0 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [5] member 10.3.0.15: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru c8 high delivered c8 received flag 1 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] position [6] member 10.3.0.16: Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] previous ring seq 332 rep 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] aru c8 high delivered c8 received flag 1 Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Did not need to originate any messages in recovery. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] Storing new sequence id for ring 150 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] New Configuration: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Left: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Joined: Dec 27 02:55:26 pcn1 openais[4143]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] CLM CONFIGURATION CHANGE Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] New Configuration: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.10) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.11) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.12) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.13) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.14) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.15) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.16) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Left: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] Members Joined: Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.10) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.12) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.13) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.14) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.15) Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] r(0) ip(10.3.0.16) Dec 27 02:55:26 pcn1 openais[4143]: [SYNC ] This node is within the primary component and will provide se rvice. Dec 27 02:55:26 pcn1 openais[4143]: [TOTEM] entering OPERATIONAL state. 
Dec 27 02:55:26 pcn1 openais[4143]: [CMAN ] quorum regained, resuming activity Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.10 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.11 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.12 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.13 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.14 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.15 Dec 27 02:55:26 pcn1 openais[4143]: [CLM ] got nodejoin message 10.3.0.16 Dec 27 02:55:26 pcn1 openais[4143]: [CPG ] got joinlist message from node 100 Dec 27 02:55:26 pcn1 openais[4143]: [CPG ] got joinlist message from node 2 Dec 27 02:55:26 pcn1 openais[4143]: [CPG ] got joinlist message from node 3 Dec 27 02:55:26 pcn1 openais[4143]: [CPG ] got joinlist message from node 4 Dec 27 02:55:26 pcn1 openais[4143]: [CPG ] got joinlist message from node 5 Dec 27 02:55:26 pcn1 openais[4143]: [CPG ] got joinlist message from node 6 Dec 27 02:55:26 pcn1 ccsd[4132]: Initial status:: Quorate Dec 27 02:55:35 pcn1 kernel: dlm: got connection from 100 Dec 27 02:55:35 pcn1 kernel: dlm: got connection from 2 Dec 27 02:55:35 pcn1 kernel: dlm: got connection from 3 Dec 27 02:55:35 pcn1 kernel: dlm: got connection from 5 Dec 27 02:55:35 pcn1 kernel: dlm: got connection from 6 Dec 27 02:55:35 pcn1 kernel: dlm: got connection from 4 Dec 27 02:55:35 pcn1 clvmd: Cluster LVM daemon started - connected to CMAN Dec 27 03:01:04 pcn1 clurgmgrd[5515]: Starting stopped service service:i03 Dec 27 03:01:04 pcn1 clurgmgrd[5515]: Starting stopped service service:i15 [etc] --clip-- Now I tried googling around for the mysterious error message #48, and couldn't find any info. What might've been up? --Janne -- Janne Peltonen From kjain at aurarianetworks.com Wed Jan 2 14:31:13 2008 From: kjain at aurarianetworks.com (Kamal Jain) Date: Wed, 2 Jan 2008 09:31:13 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: <477A71B0.1080804@redhat.com> References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> Message-ID: Hi Wendy, IOZONE v3.283 was used to generate the results I posted. An example invocation line [for the IOPS result]: ./iozone -O -l 1 -u 8 -T -b /root/iozone_IOPS_1_TO_8_THREAD_1_DISK_ISCSI_DIRECT.xls -F /mnt/iscsi_direct1/iozone/iozone1.tmp ... It's for 1 to 8 threads, and I provided 8 file names through I'm only showing one in the line above. The file destinations were on the same disk for a single disk test, and on alternating disks for a 2-disk test. I believe IOZONE uses a simple random string, repeated in certain default record sizes, when performing its various operations. - K -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Wendy Cheng Sent: Tuesday, January 01, 2008 12:01 PM To: linux clustering Subject: Re: [Linux-cluster] GFS performance Kamal Jain wrote: > A challenge we're dealing with is a massive number of small files, so > there is a lot of file-level overhead, and as you saw in the > charts...the random reads and writes were not friends of GFS. > It is expected that GFS2 would do better in this area butt this does *not* imply GFS(1) is not fixable. One thing would be helpful is sending us the benchmark (or test program that can reasonably represent your application IO patterns) you used to generate the performance data. Then we'll see what can be done from there .... 
-- Wendy -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster From lhh at redhat.com Wed Jan 2 16:08:57 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 02 Jan 2008 11:08:57 -0500 Subject: [Linux-cluster] #48: Unable to obtain cluster lock: Invalid argument In-Reply-To: <20080102113734.GV19197@helsinki.fi> References: <20080102113734.GV19197@helsinki.fi> Message-ID: <1199290137.5980.24.camel@ayanami.boston.devel.redhat.com> On Wed, 2008-01-02 at 13:37 +0200, Janne Peltonen wrote: > Hi. > > After running a cluster node in a production cluster since July, I got > the folllowing error: > > #48: Unable to obtain cluster lock: Invalid argument What version of rgmanager was it? -- Lon From lhh at redhat.com Wed Jan 2 16:12:28 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 02 Jan 2008 11:12:28 -0500 Subject: [Linux-cluster] RH Cluster issue-Network Failover not happening In-Reply-To: <002901c8451a$95738020$4e030196@mhd.co.om> References: <002901c8451a$95738020$4e030196@mhd.co.om> Message-ID: <1199290348.5980.29.camel@ayanami.boston.devel.redhat.com> On Sun, 2007-12-23 at 08:16 +0400, Harun wrote: > Issue: When network cable is disconnected from the Primary, primary restart > unclean and the failover to secondary do not happens. The shared drives > don't get mounted automatically for secondary neither gets it mounted on > primary, after the primary restarts. I have to then manually shut down both > Primary and Secondary, and start primary first and then secondary for the > setup to work fine again. > I want to test a live production setup... a Linux Cluster with 2 nodes, in > Linux Advanced Server (Linux DB-Primary 2.4.21-37.ELsmp #1 SMP Wed Sep 7 > 13:28:55 EDT 2005 i686 i686 i386 GNU/Linux ), Oracle data base is running on > this setup. > > The clumanager version is 1.2.28 and redhat-config-cluster version is 1.0.8 > on both primary and secondary.I want to resolve the issue with out any > upgradations. Do you think that updating can resolve the issue? If > upgradation is required please guide how to go ahead. I am trying to resolve > this issue with out any patch update. > Is this a configuration problem or some knows issue with the version used. > > Cluster.xml looks like this. > > > - > multicast_ipaddress="225.0.0.11" thread="yes" tko_count="25" /> > Set a tiebreaker_ip if you don't want it to survive network splits. This IP needs to be on the same network as the IPs which map to the hosts "DB-Primary" and "DB-Secondary", but must not reside on the hosts themselves (use a switch IP, another host, or gateway) Also, set monitor_link to 1 in the service_ipaddress ... -- Lon From lhh at redhat.com Wed Jan 2 16:14:48 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 02 Jan 2008 11:14:48 -0500 Subject: [Linux-cluster] Channel Bonding issue in Cluster Suite Setup In-Reply-To: <839293.38853.qm@web50606.mail.re2.yahoo.com> References: <839293.38853.qm@web50606.mail.re2.yahoo.com> Message-ID: <1199290488.5980.31.camel@ayanami.boston.devel.redhat.com> On Thu, 2007-12-27 at 06:29 -0800, Roger Pe?a wrote: > --- Balaji wrote: > both servers thinks that the other one are death, so I > guess you have a problem with comunication between the > nodes after the bonding is set > > What I wonder now is why fencing do not work :-( > it could be dangerus to have both nodes accessing the > same storage without know it :-( Right. Why did both become quorate? Is fencing "not there"? 
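(A minimal sketch, not from the original exchange, assuming the stock cman
tools are installed on both nodes: comparing what each node believes about
membership while the bond is up usually shows whether this is a network
split rather than a real node failure.)

   cman_tool status   # quorum state and vote counts as this node sees them
   cman_tool nodes    # which members this node can actually reach
   clustat            # whether the peer is still listed as online

If each node reports itself quorate while showing the other as down, the
heartbeat traffic is not crossing the bonded interface, and the next thing
to check is why the fence operation never completed.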
-- Lon From lhh at redhat.com Wed Jan 2 16:17:31 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 02 Jan 2008 11:17:31 -0500 Subject: [Linux-cluster] GFS without RHCM but with Heartbeat V2 and drbd ? In-Reply-To: <4774D07E.6030701@arcor.de> References: <20071227170006.AE68F733D5@hormel.redhat.com> <4774D07E.6030701@arcor.de> Message-ID: <1199290651.5980.34.camel@ayanami.boston.devel.redhat.com> On Fri, 2007-12-28 at 11:31 +0100, Holger Woehle wrote: > Hi, > at the moment i am evaluating RHCM and Heartbeat V2 to refine our > Heartbeat V1 Cluster. > My question as in the subject: > Is it possible to use GFS without the RHCM ? Not sure. You don't need rgmanager, but you need fencing/membership/etc. Nothing precludes you from running HBv2 in addition to RHCM+GFS. > I want to build a 2 node cluster with drbd activ/activ and RHCM/GFS or > Heartbeat V2 with filesystem OCFS or GFS. Here's how to get a 2-node DRBD setup going w/ RHCM (without HBv2): http://sources.redhat.com/cluster/wiki/DRBD_Cookbook -- Lon From janne.peltonen at helsinki.fi Wed Jan 2 16:25:02 2008 From: janne.peltonen at helsinki.fi (Janne Peltonen) Date: Wed, 2 Jan 2008 18:25:02 +0200 Subject: [Linux-cluster] #48: Unable to obtain cluster lock: Invalid argument In-Reply-To: <1199290137.5980.24.camel@ayanami.boston.devel.redhat.com> References: <20080102113734.GV19197@helsinki.fi> <1199290137.5980.24.camel@ayanami.boston.devel.redhat.com> Message-ID: <20080102162502.GA4504@helsinki.fi> On Wed, Jan 02, 2008 at 11:08:57AM -0500, Lon Hohberger wrote: > On Wed, 2008-01-02 at 13:37 +0200, Janne Peltonen wrote: > > Hi. > > > > After running a cluster node in a production cluster since July, I got > > the folllowing error: > > > > #48: Unable to obtain cluster lock: Invalid argument > > What version of rgmanager was it? [jmmpelto at pcn1 log]$ rpm -q rgmanager rgmanager-2.0.27-2.1lhh.el5 There were also a couple nodes with a newer rgmanager in the same cluster: [jmmpelto at pcn5 mappi2]$ rpm -q rgmanager rgmanager-2.0.31-1.el5.centos --Janne -- Janne Peltonen From wcheng at redhat.com Wed Jan 2 17:27:05 2008 From: wcheng at redhat.com (Wendy Cheng) Date: Wed, 02 Jan 2008 12:27:05 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> Message-ID: <477BC969.6050506@redhat.com> Kamal Jain wrote: > Hi Wendy, > > IOZONE v3.283 was used to generate the results I posted. > > An example invocation line [for the IOPS result]: > > > ./iozone -O -l 1 -u 8 -T -b /root/iozone_IOPS_1_TO_8_THREAD_1_DISK_ISCSI_DIRECT.xls -F /mnt/iscsi_direct1/iozone/iozone1.tmp ... > > > It's for 1 to 8 threads, and I provided 8 file names through I'm only showing one in the line above. The file destinations were on the same disk for a single disk test, and on alternating disks for a 2-disk test. I believe IOZONE uses a simple random string, repeated in certain default record sizes, when performing its various operations. > > Intuitively (by reading your iozone command), this is a locking issue. There are lots to say on your setup, mostly because all data and lock traffic are funneling thru the same network. Remember locking is mostly to do with *latency*, not bandwidth. So even your network is not saturated, the performance can go down. It is different from the rsync issue (as described by Jos Vos) so the glock trimming patch is not helpful in this case. However, I won't know for sure until we get the data analyzed. Thanks for the input. 
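(As a rough illustration of the latency point, not taken from the thread:
assuming a GFS1 mount at /mnt/gfs, a peer node reachable as node2 over the
same network that carries the DLM traffic, and iperf installed as an extra
package, the following compares the round trip each remote lock request
pays against the raw bandwidth a small-file workload never gets close to
using.)

   ping -c 100 -q node2        # round-trip latency; roughly what a remote DLM lock request costs
   iperf -c node2 -t 30        # raw TCP bandwidth, largely irrelevant to small-file locking
   gfs_tool counters /mnt/gfs  # GFS glock/lock counters; rerun while iozone runs to watch lock churn

Even sub-millisecond round trips add up when a benchmark opens, locks and
unlocks thousands of small files per second.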
-- Wendy From jparsons at redhat.com Wed Jan 2 20:34:39 2008 From: jparsons at redhat.com (James Parsons) Date: Wed, 02 Jan 2008 15:34:39 -0500 Subject: [Linux-cluster] Error messages during Fence operation In-Reply-To: <4779380A.6000706@noaa.gov> References: <47793613.9000304@noaa.gov> <4779380A.6000706@noaa.gov> Message-ID: <477BF55F.5000900@redhat.com> Randy Brown wrote: > I forgot....I'm using Centos 5 with latest patches and kernel. > > Randy Brown wrote: > >> I am using an APC Masterswitch Plus as my fencing device. I am >> seeing this in my logs now when fencing occurs: >> >> Dec 31 11:36:26 nfs1-cluster fenced[3848]: agent "fence_apc" reports: >> Traceback (most recent call last): File "/sbin/fence_apc", line >> 829, in ? main() File "/sbin/fence_apc", line 289, in main >> do_login(sock) File "/sbin/fence_apc", line 444, in do_login i, >> mo, txt = sock.expect(regex_list, TELNET_TIMEOUT) >> Dec 31 11:36:26 nfs1-cluster fenced[3848]: agent "fence_apc" >> reports: File "/usr/lib/python2.4/telnetlib.py", line 620, in >> expect text = self.read_very_lazy() File >> "/usr/lib/python2.4/telnetlib.py", line 400, in read_very_lazy >> raise EOFError, 'telnet connection closed' EOFError: telnet >> connection closed >> Dec 31 11:36:26 nfs1-cluster fenced[3848]: fence >> "nfs2-cluster.nws.noaa.gov" failed >> >> This used to work just fine. If I run `fence_apc -a 192.168.42.30 -l >> cluster -n 1:7 -o Reboot -p ` from the command line, >> fencing works as expected. The relevant lines from my cluster.conf >> file are below. I will gladly provide more information as necessary. > Is it possible that you are already telnet'ed into the switch from a terminal or somesuch when the fence attempt takes place? APC switches allow only one login at a time. I should/will add a log comment that mentions this as a possible reason. If this is not the issue, well, we can keep digging... -J From jamesc at exa.com Wed Jan 2 22:35:23 2008 From: jamesc at exa.com (James Chamberlain) Date: Wed, 2 Jan 2008 17:35:23 -0500 (EST) Subject: [Linux-cluster] Instability troubles Message-ID: Hi all, I'm having some major stability problems with my three-node CS/GFS cluster. Every two or three days, one of the nodes fences another, and I have to hard-reboot the entire cluster to recover. I have had this happen twice today. I don't know what's triggering the fencing, since all the nodes appear to me to be up and running when it happens. In fact, I was logged on to node3 just now, running 'top', when node2 fenced it. When they come up, they don't automatically mount their GFS filesystems, even with "_netdev" specified as a mount option; however, the node which comes up first mounts them all as part of bringing all the services up. I did notice a couple of disconcerting things earlier today. First, I was running "watch clustat". (I prefer to see the time updating, where I can't with "clustat -i") At one point, "clustat" crashed as follows: Jan 2 15:19:54 node2 kernel: clustat[17720]: segfault at 0000000000000024 rip 0000003629e75bc0 rsp 00007fff18827178 error 4 Fairly shortly thereafter, clustat reported that node3 as "Online, Estranged, rgmanager". Can anyone shed light on what that means? Google's not telling me much. At the moment, all three nodes are running CentOS 5.1, with kernel 2.6.18-53.1.4.el5. Can anyone point me in the right direction to resolve these problems? I wasn't having trouble like this when I was running a CentOS 4 CS/GFS cluster. 
Is it possible to downgrade, likely via a full rebuild of all the nodes,
from CentOS 5 CS/GFS to 4? Should I instead consider setting up a single
node to mount the GFS filesystems and serve them out, to get around these
fencing issues?

Thanks,

James

From williamottley at gmail.com Thu Jan 3 00:58:41 2008
From: williamottley at gmail.com (William Ottley)
Date: Wed, 2 Jan 2008 19:58:41 -0500
Subject: [Linux-cluster] Lars' method???
Message-ID: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com>

Hello all,
I'm hoping that this layout here will make it easy for anyone to figure
out what booboo I've done. I'm attempting to use Lars' method, since I
really don't know how to set up with only 1 (gateways have confused me).
I just can't get anything working...

client: 192.168.2.10 -> 192.168.2.1 via crossover cable (ping OK)
web browser is pointed to 192.168.2.100

LVS (centos 5.1, pulse, piranha):
eth0: 192.168.2.1/gw {none}
eth1: 192.168.0.111/gw 192.168.0.1
eth0:1 - 192.168.2.100 {VIP}

echo 1 > /proc/sys/net/ipv4/ip_forward

no iptables is running, httpd default port is 8080, and piranha_GUI is
listening at 3636

RIP1:
eth0: 192.168.0.15 / gw 192.168.0.1
/etc/sysconfig/network-scripts/lo:0
lo:0 - 192.168.2.100
ifup lo:0
ping 192.168.2.100 and 192.168.0.111 {OK}
echo 1 > /proc/sys/net/ipv4/ip_forward

RIP2:
eth0: 192.168.0.11 / gw 192.168.0.1
/etc/sysconfig/network-scripts/lo:0
lo:0 - 192.168.2.100
ifup lo:0
ping 192.168.2.100 and 192.168.0.111 {OK}
echo 1 > /proc/sys/net/ipv4/ip_forward

/etc/sysconfig/ha/lvs.cf:

serial_no = 19
primary = 192.168.2.1
service = lvs
backup = 0.0.0.0
heartbeat = 1
heartbeat_port = 539
keepalive = 6
deadtime = 18
network = nat
nat_router = 192.168.0.111 eth1
nat_nmask = 255.255.255.0
debug_level = NONE
virtual all-web {
    active = 1
    address = 192.168.2.100 eth0:1
    vip_nmask = 255.255.255.0
    port = 80
    send = "GET / HTTP/1.0\r\n\r\n"
    expect = "HTTP"
    use_regex = 0
    load_monitor = none
    scheduler = rr
    protocol = tcp
    timeout = 6
    reentry = 15
    quiesce_server = 0
    server offsite1 {
        address = 192.168.0.11
        active = 1
        weight = 1
    }
    server offsite2 {
        address = 192.168.0.15
        active = 1
        weight = 1
    }
}

[root at localhost ha]# ipvsadm
IP Virtual Server version 1.2.1 (size=4096)
Prot LocalAddress:Port Scheduler Flags
  -> RemoteAddress:Port Forward Weight ActiveConn InActConn
/etc/init.d/pulse start
TCP 192.168.2.100:http rr
localhost pulse[4676]: STARTING PULSE AS MASTER
  -> 192.168.0.15:http Masq 1 0 0
localhost pulse[4676]: partner dead: activating lvs
  -> 192.168.0.11:http Masq 1 0 0
localhost lvs[4678]: starting virtual service all-web active: 80
localhost nanny[4684]: starting LVS client monitor for 192.168.2.100:80
localhost lvs[4678]: create_monitor for all-web/offsite1 running as pid 4684
localhost nanny[4685]: starting LVS client monitor for 192.168.2.100:80
localhost lvs[4678]: create_monitor for all-web/offsite2 running as pid 4685
localhost kernel: eth1: setting full-duplex.
localhost pulse[4681]: gratuitous lvs arps finished
localhost nanny[4684]: making 192.168.0.11:80 available
localhost nanny[4685]: making 192.168.0.15:80 available

PIRANHA CONFIGURATION TOOL
CURRENT LVS ROUTING TABLE

IP Virtual Server version 1.2.1 (size=4096)
Prot LocalAddress:Port Scheduler Flags
  -> RemoteAddress:Port Forward Weight ActiveConn InActConn
TCP 192.168.2.100:80 rr
  -> 192.168.0.15:80 Masq 1 0 0
  -> 192.168.0.11:80 Masq 1 0 0

CURRENT LVS PROCESSES

root 4676 0.0 0.0 1868 448 ? Ss 14:42 0:00 pulse
root 4678 0.0 0.1 1848 620 ?
Ss 14:42 0:00 /usr/sbin/lvsd --nofork -c /etc/sysconfig/ha/lvs.cf root 4684 0.0 0.1 1836 668 ? Ss 14:42 0:00 /usr/sbin/nanny -c -h 192.168.0.11 -p 80 -s GET / HTTP/1.0\r\n\r\n -x HTTP -a 15 -I /sbin/ipvsadm -t 6 -w 1 -V 192.168.2.100 -M m -U none --lvs root 4685 0.0 0.1 1832 664 ? Ss 14:42 0:00 /usr/sbin/nanny -c -h 192.168.0.15 -p 80 -s GET / HTTP/1.0\r\n\r\n -x HTTP -a 15 -I /sbin/ipvsadm -t 6 -w 1 -V 192.168.2.100 -M m -U none --lvs -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From randy.brown at noaa.gov Thu Jan 3 13:17:00 2008 From: randy.brown at noaa.gov (Randy Brown) Date: Thu, 03 Jan 2008 08:17:00 -0500 Subject: [Linux-cluster] Error messages during Fence operation In-Reply-To: <477BF55F.5000900@redhat.com> References: <47793613.9000304@noaa.gov> <4779380A.6000706@noaa.gov> <477BF55F.5000900@redhat.com> Message-ID: <477CE04C.4000002@noaa.gov> Thanks. That makes sense and I hadn't thought of that. I don't see any other connections. However, it appears to have properly fenced one of the nodes last night and I don't believe I've changed anything in the config. Maybe I did have another connection and something I did cleared it without me realizing it. As long as it's working. :) I'm still pretty "green" when it comes to clustering and SANS and sincerely appreciate the quality responses and willingness to help on this list. Randy James Parsons wrote: > Randy Brown wrote: > >> I forgot....I'm using Centos 5 with latest patches and kernel. >> >> Randy Brown wrote: >> >>> I am using an APC Masterswitch Plus as my fencing device. I am >>> seeing this in my logs now when fencing occurs: >>> >>> Dec 31 11:36:26 nfs1-cluster fenced[3848]: agent "fence_apc" >>> reports: Traceback (most recent call last): File >>> "/sbin/fence_apc", line 829, in ? main() File >>> "/sbin/fence_apc", line 289, in main do_login(sock) File >>> "/sbin/fence_apc", line 444, in do_login i, mo, txt = >>> sock.expect(regex_list, TELNET_TIMEOUT) >>> Dec 31 11:36:26 nfs1-cluster fenced[3848]: agent "fence_apc" >>> reports: File "/usr/lib/python2.4/telnetlib.py", line 620, in >>> expect text = self.read_very_lazy() File >>> "/usr/lib/python2.4/telnetlib.py", line 400, in read_very_lazy >>> raise EOFError, 'telnet connection closed' EOFError: telnet >>> connection closed >>> Dec 31 11:36:26 nfs1-cluster fenced[3848]: fence >>> "nfs2-cluster.nws.noaa.gov" failed >>> >>> This used to work just fine. If I run `fence_apc -a 192.168.42.30 >>> -l cluster -n 1:7 -o Reboot -p ` from the command line, >>> fencing works as expected. The relevant lines from my cluster.conf >>> file are below. I will gladly provide more information as necessary. >> > Is it possible that you are already telnet'ed into the switch from a > terminal or somesuch when the fence attempt takes place? APC switches > allow only one login at a time. I should/will add a log comment that > mentions this as a possible reason. > > If this is not the issue, well, we can keep digging... > > -J > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- A non-text attachment was scrubbed... 
Name: randy_brown.vcf Type: text/x-vcard Size: 313 bytes Desc: not available URL: From beekhof at gmail.com Thu Jan 3 13:17:45 2008 From: beekhof at gmail.com (Andrew Beekhof) Date: Thu, 3 Jan 2008 14:17:45 +0100 Subject: [Linux-cluster] GFS without RHCM but with Heartbeat V2 and drbd ? In-Reply-To: <1199290651.5980.34.camel@ayanami.boston.devel.redhat.com> References: <20071227170006.AE68F733D5@hormel.redhat.com> <4774D07E.6030701@arcor.de> <1199290651.5980.34.camel@ayanami.boston.devel.redhat.com> Message-ID: <5CD2757B-950A-4C20-86C9-EDA4331B95E6@gmail.com> On Jan 2, 2008, at 5:17 PM, Lon Hohberger wrote: > On Fri, 2007-12-28 at 11:31 +0100, Holger Woehle wrote: >> Hi, >> at the moment i am evaluating RHCM and Heartbeat V2 to refine our >> Heartbeat V1 Cluster. >> My question as in the subject: >> Is it possible to use GFS without the RHCM ? > > Not sure. You don't need rgmanager, but you need > fencing/membership/etc. > > Nothing precludes you from running HBv2 in addition to RHCM+GFS. Providing you arrange for "non-overlapping areas of concern" - otherwise you could get the two cluster managers trying to pull the cluster in opposite directions. A third option is to run the CRM^ (the part that is new with Heartbeat v2) on top of OpenAIS so that the CRM and GFS are sharing the same "membership/etc" infrastructure. ^ Now its own project called Pacemaker and with support for both cluster stacks. For more details, see: http://clusterlabs.org >> I want to build a 2 node cluster with drbd activ/activ and RHCM/GFS >> or >> Heartbeat V2 with filesystem OCFS or GFS. > > Here's how to get a 2-node DRBD setup going w/ RHCM (without HBv2): > > http://sources.redhat.com/cluster/wiki/DRBD_Cookbook > > -- Lon > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From kjain at aurarianetworks.com Thu Jan 3 14:40:25 2008 From: kjain at aurarianetworks.com (Kamal Jain) Date: Thu, 3 Jan 2008 09:40:25 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: <477BC969.6050506@redhat.com> References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> Message-ID: Hi Wendy, Thanks for looking into this, and for your preliminary feedback. I am surprised that handling locking for 8 files might cause major performance degradation with GFS versus iSCSI-direct. As for latency, all the devices are directly connected to a Cisco 3560G switch and on the same VLAN, so I expect Ethernet/layer-2 latencies to be sub-millisecond. Also, note that the much faster iSCSI performance was on the same GbE connections between the same devices and systems, so network throughput and latency are the same. GFS overhead, in handling locking (most likely) and any GFS filesystem overhead are the likely causes IMO. Looking forward to any analysis and guidance you may be able to provide on getting GFS performance closer to iSCSI-direct. - K -----Original Message----- Intuitively (by reading your iozone command), this is a locking issue. There are lots to say on your setup, mostly because all data and lock traffic are funneling thru the same network. Remember locking is mostly to do with *latency*, not bandwidth. So even your network is not saturated, the performance can go down. It is different from the rsync issue (as described by Jos Vos) so the glock trimming patch is not helpful in this case. However, I won't know for sure until we get the data analyzed. Thanks for the input. 
-- Wendy From lhh at redhat.com Thu Jan 3 15:38:40 2008 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 03 Jan 2008 10:38:40 -0500 Subject: [Linux-cluster] Instability troubles In-Reply-To: References: Message-ID: <1199374720.9564.20.camel@ayanami.boston.devel.redhat.com> On Wed, 2008-01-02 at 17:35 -0500, James Chamberlain wrote: > Hi all, > > I'm having some major stability problems with my three-node CS/GFS cluster. > Every two or three days, one of the nodes fences another, and I have to > hard-reboot the entire cluster to recover. I have had this happen twice > today. I don't know what's triggering the fencing, since all the nodes > appear to me to be up and running when it happens. In fact, I was logged > on to node3 just now, running 'top', when node2 fenced it. > > When they come up, they don't automatically mount their GFS filesystems, > even with "_netdev" specified as a mount option; however, the node which > comes up first mounts them all as part of bringing all the services up. > > I did notice a couple of disconcerting things earlier today. First, I was > running "watch clustat". (I prefer to see the time updating, where I > can't with "clustat -i") The time is displayed in RHEL5 CVS version, and will go out with 5.2. > At one point, "clustat" crashed as follows: > > Jan 2 15:19:54 node2 kernel: clustat[17720]: segfault at 0000000000000024 > rip 0000003629e75bc0 rsp 00007fff18827178 error 4 A clustat crash is not a cause for a fence operation. That is, this might be related, but is definitely not the cause of a node being evicted. > Fairly shortly thereafter, clustat reported that node3 as "Online, > Estranged, rgmanager". Can anyone shed light on what that means? > Google's not telling me much. Ordinarily, this happens when you have a node join the cluster manually w/o giving it the configuration file. CMAN would assign it a node ID - but the node is not in the cluster configuration - so clustat would display the node as 'Estranged'. In your case, I'm not sure what the problem would be. > At the moment, all three nodes are running CentOS 5.1, with kernel > 2.6.18-53.1.4.el5. Can anyone point me in the right direction to resolve > these problems? I wasn't having trouble like this when I was running a > CentOS 4 CS/GFS cluster. Is it possible to downgrade, likely via a full > rebuild of all the nodes, from CentOS 5 CS/GFS to 4? Should I instead > consider setting up a single node to mount the GFS filesystems and serve > them out, to get around these fencing issues? I'd be interested a core file. Try to reproduce your clustat crash with 'ulimit -c unlimited' set before running clustat. I haven't seen clustat crash in a very long time, so I'm interested in the cause. (Also, after the crash, check to see if ccsd is running...) Maybe it will uncover some other hints as to the cause of the behavior you saw. If ccsd indeed failed for some reason, it would cause fencing to fail as well because the fence daemon would be unable to read fencing actions. Even given all of this, this doesn't explain why the node needed to be fenced in the first place. Were there any log messages indicating why the node needed to be fenced? The RHEL5 / CentOS5 release of Cluster Suite has a fairly aggressive node death timeout (5 seconds); maybe increasing it would help. ... -- Lon From lhh at redhat.com Thu Jan 3 15:49:09 2008 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 03 Jan 2008 10:49:09 -0500 Subject: [Linux-cluster] Lars' method??? 
In-Reply-To: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> References: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> Message-ID: <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> On Wed, 2008-01-02 at 19:58 -0500, William Ottley wrote: > Hello all, > I'm hoping that this layout here will make it easy for anyone to > figure out what booboo i've done? > I'm attempting to use Lars' method, since i really don't know how to > setup with only 1 (gateways have confused me) > > i just can't get anything working... > > client: 192.168.2.10 -> 192.168.2.1 via crossover cable (ping OK) > web browser is pointed to 192.168.2.100 > > LVS (centos 5.1, pulse, piranha): > eth0: 192.168.2.1/gw {none} > eth1: 192.168.0.111/gw 192.168.0.1 > eth0:1 - 192.168.2.100 {VIP} > > echo 1 > /proc/sys/net/ipv4/ip_forward > > no iptables is running, httpd default port is 8080, and piranha_GUI is > listening at 3636 When using NAT, do not do the lo:0 hack. That's for direct routing only. You should not be using a nat_router ip/device unless you have two LVS directors. https://www.redhat.com/docs/manuals/enterprise/RHEL-3-Manual/cluster-suite/s1-piranha-globalset.html -- Lon From williamottley at gmail.com Thu Jan 3 15:54:04 2008 From: williamottley at gmail.com (William Ottley) Date: Thu, 3 Jan 2008 10:54:04 -0500 Subject: [Linux-cluster] Lars' method??? In-Reply-To: <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> References: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> Message-ID: <8108f4850801030754u2419b466i74262e70e4b89804@mail.gmail.com> thanks Lon for the pointers. I've tried so many different methods just to get this to work: lvs-dr, lvs-nat with 2 nics, no go.. I can't seem to get anything to work. so i'm doing serious google searches, and there's sooo many how-tos that tell you to do this, or do that, etc. and i'm like well which one? I'm on one now, that says for nat, create a copy of the eth1 (private IP nic) eth1:1 and assign 192.168.0.254, and use that as the default gateway for all the RIP... is this true? Thing is, i have to manually copy the eth1 file to eth1:1 and assign the IP, yet with the virtual IP, eth0:1 is created automatically... so this makes me believe something is wrong. I use pulse / piranha: what tools can I use to test and see if web traffic IS going to the RIP or not??? thanks! Will On Jan 3, 2008 10:49 AM, Lon Hohberger wrote: > On Wed, 2008-01-02 at 19:58 -0500, William Ottley wrote: > > Hello all, > > I'm hoping that this layout here will make it easy for anyone to > > figure out what booboo i've done? > > I'm attempting to use Lars' method, since i really don't know how to > > setup with only 1 (gateways have confused me) > > > > i just can't get anything working... > > > > client: 192.168.2.10 -> 192.168.2.1 via crossover cable (ping OK) > > web browser is pointed to 192.168.2.100 > > > > LVS (centos 5.1, pulse, piranha): > > eth0: 192.168.2.1/gw {none} > > eth1: 192.168.0.111/gw 192.168.0.1 > > eth0:1 - 192.168.2.100 {VIP} > > > > echo 1 > /proc/sys/net/ipv4/ip_forward > > > > no iptables is running, httpd default port is 8080, and piranha_GUI is > > listening at 3636 > > When using NAT, do not do the lo:0 hack. That's for direct routing > only. > > You should not be using a nat_router ip/device unless you have two LVS > directors. 
> > https://www.redhat.com/docs/manuals/enterprise/RHEL-3-Manual/cluster-suite/s1-piranha-globalset.html > > -- Lon > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From williamottley at gmail.com Thu Jan 3 16:54:20 2008 From: williamottley at gmail.com (William Ottley) Date: Thu, 3 Jan 2008 11:54:20 -0500 Subject: [Linux-cluster] Lars' method??? In-Reply-To: <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> References: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> Message-ID: <8108f4850801030854q73bff5c9y6bcb0feace6867cc@mail.gmail.com> Hey Lon! woohoo: that's what was causing the problem: lo:1 which was my VIP! i removed it and now my test site is working! thanks so much. now if anyone knows of good diagnostic tools? because I now have to figure out how to do LVS-TUN..... william On Jan 3, 2008 10:49 AM, Lon Hohberger wrote: > On Wed, 2008-01-02 at 19:58 -0500, William Ottley wrote: > > Hello all, > > I'm hoping that this layout here will make it easy for anyone to > > figure out what booboo i've done? > > I'm attempting to use Lars' method, since i really don't know how to > > setup with only 1 (gateways have confused me) > > > > i just can't get anything working... > > > > client: 192.168.2.10 -> 192.168.2.1 via crossover cable (ping OK) > > web browser is pointed to 192.168.2.100 > > > > LVS (centos 5.1, pulse, piranha): > > eth0: 192.168.2.1/gw {none} > > eth1: 192.168.0.111/gw 192.168.0.1 > > eth0:1 - 192.168.2.100 {VIP} > > > > echo 1 > /proc/sys/net/ipv4/ip_forward > > > > no iptables is running, httpd default port is 8080, and piranha_GUI is > > listening at 3636 > > When using NAT, do not do the lo:0 hack. That's for direct routing > only. > > You should not be using a nat_router ip/device unless you have two LVS > directors. > > https://www.redhat.com/docs/manuals/enterprise/RHEL-3-Manual/cluster-suite/s1-piranha-globalset.html > > -- Lon > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From lhh at redhat.com Thu Jan 3 17:40:48 2008 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 03 Jan 2008 12:40:48 -0500 Subject: [Linux-cluster] Lars' method??? In-Reply-To: <8108f4850801030754u2419b466i74262e70e4b89804@mail.gmail.com> References: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> <8108f4850801030754u2419b466i74262e70e4b89804@mail.gmail.com> Message-ID: <1199382048.9564.32.camel@ayanami.boston.devel.redhat.com> On Thu, 2008-01-03 at 10:54 -0500, William Ottley wrote: > thanks Lon for the pointers. I've tried so many different methods just > to get this to work: lvs-dr, lvs-nat with 2 nics, no go.. I can't seem > to get anything to work. 
> > so i'm doing serious google searches, and there's sooo many how-tos > that tell you to do this, or do that, etc. and i'm like well which > one? I'm on one now, that says for nat, create a copy of the eth1 > (private IP nic) eth1:1 and assign 192.168.0.254, and use that as the > default gateway for all the RIP... is this true? > > Thing is, i have to manually copy the eth1 file to eth1:1 and assign > the IP, yet with the virtual IP, eth0:1 is created automatically... so > this makes me believe something is wrong. You could just change the 'nat router device' to eth1:1 in the piranha-gui. > I use pulse / piranha: what tools can I use to test and see if web > traffic IS going to the RIP or not??? The web browser should work... With NAT, the real servers need no special configuration apart from the gateway being a NAT-side IP on the LVS director. That's why it should be easy to set up. -- Lon From lhh at redhat.com Thu Jan 3 17:41:47 2008 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 03 Jan 2008 12:41:47 -0500 Subject: [Linux-cluster] Lars' method??? In-Reply-To: <8108f4850801030854q73bff5c9y6bcb0feace6867cc@mail.gmail.com> References: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> <8108f4850801030854q73bff5c9y6bcb0feace6867cc@mail.gmail.com> Message-ID: <1199382107.9564.34.camel@ayanami.boston.devel.redhat.com> On Thu, 2008-01-03 at 11:54 -0500, William Ottley wrote: > Hey Lon! > woohoo: that's what was causing the problem: lo:1 which was my VIP! > i removed it and now my test site is working! thanks so much. > > now if anyone knows of good diagnostic tools? > > because I now have to figure out how to do LVS-TUN..... Piranha can do DR and NAT, but doesn't correctly set up tunneling. -- Lon From williamottley at gmail.com Thu Jan 3 17:46:02 2008 From: williamottley at gmail.com (William Ottley) Date: Thu, 3 Jan 2008 12:46:02 -0500 Subject: [Linux-cluster] Lars' method??? In-Reply-To: <1199382048.9564.32.camel@ayanami.boston.devel.redhat.com> References: <8108f4850801021658q5d7a8065i8a3d68be5cba57b5@mail.gmail.com> <1199375349.9564.25.camel@ayanami.boston.devel.redhat.com> <8108f4850801030754u2419b466i74262e70e4b89804@mail.gmail.com> <1199382048.9564.32.camel@ayanami.boston.devel.redhat.com> Message-ID: <8108f4850801030946vb12fd53p91c5d6d063620ad2@mail.gmail.com> Hey Lon, thanks for taking the time to respond. The funny thing about the piranha-gui, is that I did point the gateway IP, and I can see in the config (lvs.cf) that the nat gateway is pointing to eth1:1, BUT, no eth1:1 exists at boot up, or anything: like how the VIP is.. I had to manually copy the ifcfg-eth1 to ifcfg-eth1:1 and start it that way.... and what tools do I use to troubleshoot? my end goal, is to create a lvs-tun... can this be done, with lvs-nat (2 nics)?? I suspect so.... On Jan 3, 2008 12:40 PM, Lon Hohberger wrote: > On Thu, 2008-01-03 at 10:54 -0500, William Ottley wrote: > > thanks Lon for the pointers. I've tried so many different methods just > > to get this to work: lvs-dr, lvs-nat with 2 nics, no go.. I can't seem > > to get anything to work. > > > > so i'm doing serious google searches, and there's sooo many how-tos > > that tell you to do this, or do that, etc. and i'm like well which > > one? I'm on one now, that says for nat, create a copy of the eth1 > > (private IP nic) eth1:1 and assign 192.168.0.254, and use that as the > > default gateway for all the RIP... is this true? 
> > > > Thing is, i have to manually copy the eth1 file to eth1:1 and assign > > the IP, yet with the virtual IP, eth0:1 is created automatically... so > > this makes me believe something is wrong. > > You could just change the 'nat router device' to eth1:1 in the > piranha-gui. > > > I use pulse / piranha: what tools can I use to test and see if web > > traffic IS going to the RIP or not??? > > The web browser should work... > > With NAT, the real servers need no special configuration apart from the > gateway being a NAT-side IP on the LVS director. That's why it should > be easy to set up. > > > -- Lon > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From williamottley at gmail.com Thu Jan 3 19:51:15 2008 From: williamottley at gmail.com (William Ottley) Date: Thu, 3 Jan 2008 14:51:15 -0500 Subject: [Linux-cluster] had it working.... lvs-nat need help, $50 out of my own pocket? Message-ID: <8108f4850801031151w6e8933d7x9b77637f373d7274@mail.gmail.com> Hey all, I really am stuck with this test environment. I have followed examples from the howto's, and everything worked for a split second, and than it stopped working. There are sooo many different things that need to be done, that are conflicting from different howto's. Is there anyone willing to take the time and help, if I fork out $50 out of my own pocket? (i'm poor, but I need to get this working). I can give all the configs, etc... Thank you William -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From chrisp at tangent.co.za Thu Jan 3 21:06:38 2008 From: chrisp at tangent.co.za (Chris Picton) Date: Thu, 03 Jan 2008 23:06:38 +0200 Subject: [Linux-cluster] GNBD/GFS/cluster questions Message-ID: <477D4E5E.5050903@tangent.co.za> Hi all I have a question regarding gnbd and clustering. I currently have two servers (store1 and store2) sharing a block device (/dev/sdc) via drbd. The 'Primary' server exports this device via gnbd, and the export fails over along with the drbd primary node. A third server (gfs1) imports the gnbd device (which is part of an lvm) and mounts a gfs2 filesystem on it. I currently do not want to run drbd in primary/primary mode, as I have read that there are potentially some performance issues with this. I have written two custom scripts to handle the drbd and gnbd resource failover. If I am not going to mount the gfs2 filesystem on store1, or store2, can I create a 'private' cluster between only those two machines, with their own dlm, or do all of the other machines (only gfs1 in this example) which will be mounting the gfs filesystem have to use the same dlm, and be part of the same cluster? If I were to share the drbd device via iscsi, then I see no need for the importing devices to even be aware that the device is being exported by a cluster of machines - does this hold true for gnbd as well? Is there any benefit of have *all* my servers in the same cluster, or can I split them up into smaller logically separated clusters. 
Eg, if I add another two servers, with a failover gnbd export, do they also have to be part of the same global cluster, if they will be sharing the gnbd device into the same clvm? Or can they have their own 'private' cluster between themselves as well? Regards Chris From christopher.barry at qlogic.com Thu Jan 3 21:27:28 2008 From: christopher.barry at qlogic.com (Christopher Barry) Date: Thu, 3 Jan 2008 15:27:28 -0600 Subject: [Linux-cluster] GNBD/GFS/cluster questions References: <477D4E5E.5050903@tangent.co.za> Message-ID: -----Original Message----- From: linux-cluster-bounces at redhat.com on behalf of Chris Picton Sent: Thu 1/3/2008 4:06 PM To: linux-cluster at redhat.com Subject: [Linux-cluster] GNBD/GFS/cluster questions Hi all I have a question regarding gnbd and clustering. I currently have two servers (store1 and store2) sharing a block device (/dev/sdc) via drbd. The 'Primary' server exports this device via gnbd, and the export fails over along with the drbd primary node. A third server (gfs1) imports the gnbd device (which is part of an lvm) and mounts a gfs2 filesystem on it. I currently do not want to run drbd in primary/primary mode, as I have read that there are potentially some performance issues with this. I have written two custom scripts to handle the drbd and gnbd resource failover. If I am not going to mount the gfs2 filesystem on store1, or store2, can I create a 'private' cluster between only those two machines, with their own dlm, or do all of the other machines (only gfs1 in this example) which will be mounting the gfs filesystem have to use the same dlm, and be part of the same cluster? If I were to share the drbd device via iscsi, then I see no need for the importing devices to even be aware that the device is being exported by a cluster of machines - does this hold true for gnbd as well? Is there any benefit of have *all* my servers in the same cluster, or can I split them up into smaller logically separated clusters. Eg, if I add another two servers, with a failover gnbd export, do they also have to be part of the same global cluster, if they will be sharing the gnbd device into the same clvm? Or can they have their own 'private' cluster between themselves as well? Regards Chris -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster I'm not an expert, as I have not used gnbd, but I would say you are correct. The two nodes that export, and do not mount, do not need to be a part of the gfs cluster - and really probably should not be. Thay can simply be their own active/passive failover cluster. Another 2 nodes, that export a different volume for the gfs cluster to use should be fine as well. Think of the pairs of gnbd nodes as two slices of an array. -C -C -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3679 bytes Desc: not available URL: From jason at lexxcom.com.au Fri Jan 4 23:01:44 2008 From: jason at lexxcom.com.au (Jason Stewart) Date: Fri, 04 Jan 2008 17:01:44 -0600 Subject: [Linux-cluster] GFS not syncing Message-ID: <477EBAD8.8010508@lexxcom.com.au> I have been spending the last few weeks working on this and after a lot of trial and error have managed to get everything to appear that it is working. 
I am a newbie to all this stuff The problem that I am having is that any changes made in the mounted GFS directory are not being seen by the other node, I have hunted around but can't seen to find any errors or messages in the logs that anything is wrong. I can do and ccs_tool update to update the cluster.conf file, so I am assuming that ccs and cman are running correctly so there must be something with GFS. I am not sure where to look now, I have done some extensive research on google but could not find anything. any information would be handy. From Alain.Moulle at bull.net Fri Jan 4 08:02:58 2008 From: Alain.Moulle at bull.net (Alain Moulle) Date: Fri, 04 Jan 2008 09:02:58 +0100 Subject: [Linux-cluster] Last tuning on Quorum Disk / question Message-ID: <477DE832.3010108@bull.net> Hi Lon, Finally, I adopt this quorum disk configuration : I just wonder if the interval values for quorum disk with regard to the one for heuristic is the best choice or not ? And which are the rules to fit the good value for interval and tko on heuristic ? (I don't completely understand why your both heuristics avoids suicide if one ping get lots, it seems to be due to tko value but ... ) Thanks Regards And best whiches for 2008 ;-) Alain Moull? > Also - your heuristic should be more like one of the following: > tko="3" > program="ping -t1 -c1 " > score="1"/> > program="ping -t3 -c1 " > score="1"/> >Reason: You don't want a single ICMP packet to determine node fitness. >If that ping gets lost (network being full, or any reason really), the >node will commit suicide. (The man page probably needs updating about >that!) > Lon From orkcu at yahoo.com Fri Jan 4 13:31:03 2008 From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=) Date: Fri, 4 Jan 2008 05:31:03 -0800 (PST) Subject: [Linux-cluster] GFS not syncing In-Reply-To: <477EBAD8.8010508@lexxcom.com.au> Message-ID: <585395.47393.qm@web50609.mail.re2.yahoo.com> --- Jason Stewart wrote: > I have been spending the last few weeks working on > this and after a lot > of trial and error have managed to get everything to > appear that it is > working. I am a newbie to all this stuff > well, in your configuration I can't see where the services are declare, do you have any? also, you are using manual fencing, this is ok for testing purpose but definitly not for production so, can you send the GFS mount options for the FS you are working with? are they inthe fstab? and also can you send the command you used to create the GFS cu roger __________________________________________ RedHat Certified ( RHCE ) Cisco Certified ( CCNA & CCDA ) ____________________________________________________________________________________ Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it now. 
http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ From wferi at niif.hu Fri Jan 4 14:49:24 2008 From: wferi at niif.hu (Ferenc Wagner) Date: Fri, 04 Jan 2008 15:49:24 +0100 Subject: [Linux-cluster] GFS: assertion failure in add_to_queue Message-ID: <87y7b5odcb.fsf@tac.ki.iif.hu> Hi, I'm using a 1-node GFS1 "cluster" with DLM locking and sporadically (say once a week) get the following in the kernel logs (Linux 2.6.23): GFS: fsid=noc:cricket.0: warning: assertion "(tmp_gh->gh_flags & GL_LOCAL_EXCL) || !(gh->gh_flags & GL_LOCAL_EXCL)" failed GFS: fsid=noc:cricket.0: function = add_to_queue GFS: fsid=noc:cricket.0: file = /home/wferi/cluster/cluster-2.01.00/gfs-kernel/src/gfs/glock.c, line = 1420 GFS: fsid=noc:cricket.0: time = 1197666253 The filesystem is under constant reading/writing and seems to operate without any application-visible errors (or at least I coultn't find error messages). Maybe it's no big deal, but something is not quite right. But what? Anybody has an idea? -- Thanks, Feri. From wferi at niif.hu Fri Jan 4 15:06:17 2008 From: wferi at niif.hu (Ferenc Wagner) Date: Fri, 04 Jan 2008 16:06:17 +0100 Subject: [Linux-cluster] GFS performance In-Reply-To: (Kamal Jain's message of "Thu, 3 Jan 2008 09:40:25 -0500") References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> Message-ID: <877iipock6.fsf@tac.ki.iif.hu> Kamal Jain writes: > I am surprised that handling locking for 8 files might cause major > performance degradation with GFS versus iSCSI-direct. > > As for latency, all the devices are directly connected to a Cisco > 3560G switch and on the same VLAN, so I expect Ethernet/layer-2 > latencies to be sub-millisecond. Also, note that the much faster > iSCSI performance was on the same GbE connections between the same > devices and systems, so network throughput and latency are the same. > > GFS overhead, in handling locking (most likely) and any GFS > filesystem overhead are the likely causes IMO. > > Looking forward to any analysis and guidance you may be able to > provide on getting GFS performance closer to iSCSI-direct. I'm really interested in the outcome of this discussion. Meanwhile I can add that 'gfs_controld -l0' and 'gfs_tool settune /mnt demote_secs 600' (as recommended on this list by the kind developers) helped me tremendously dealing with lots of files. -- Regards, Feri. From kjain at aurarianetworks.com Fri Jan 4 15:15:26 2008 From: kjain at aurarianetworks.com (Kamal Jain) Date: Fri, 4 Jan 2008 10:15:26 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: <877iipock6.fsf@tac.ki.iif.hu> References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> <877iipock6.fsf@tac.ki.iif.hu> Message-ID: Feri, Thanks for the information. A number of people have emailed me expressing some level of interest in the outcome of this, so hopefully I will soon be able to do some tuning and performance experiments and report back our results. On the demote_secs tuning parameter, I see you're suggesting 600 seconds, which appears to be longer than the default 300 seconds as stated by Wendy Cheng at http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 -- we're running RHEL4.5. Wouldn't a SHORTER demote period be better for lots of files, whereas perhaps a longer demote period might be more efficient for a smaller number of files being locked for long periods of time? 
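For reference, the two knobs mentioned above are applied per node roughly like this; a minimal sketch, where /mnt/gfs is a placeholder mount point. Note that gfs_tool settune values are per mount and are lost at remount, so they are normally reapplied from an init script after the filesystem is mounted.

# Show the current value for one GFS mount
gfs_tool gettune /mnt/gfs | grep demote_secs

# Change it for that mount; takes effect immediately, not persistent
gfs_tool settune /mnt/gfs demote_secs 600

# The Posix-lock rate limit is an option of gfs_controld itself, so it is
# normally given when the daemon is started (how that is wired into the
# cluster init scripts depends on the release), e.g.:
#   gfs_controld -l0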
On a related note, I converted a couple of the clusters in our lab from GULM to DLM and while performance is not necessarily noticeably improved (though more detailed testing was done after the conversion), we did notice that both clusters became more stable in the DLM configuration. Has anyone here had a similar experience and can shed some light as to why? When we would do long-running application testing on GFS volumes with GULM, after a while many commands that in any way might touch the disks would hang, like "df", "mount" or even "ls". So far with DLM things have been much more stable. No other tuning or adjustment has been done; both times things were default settings. - K -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ferenc Wagner Sent: Friday, January 04, 2008 10:06 AM To: linux clustering Subject: Re: [Linux-cluster] GFS performance Kamal Jain writes: > I am surprised that handling locking for 8 files might cause major > performance degradation with GFS versus iSCSI-direct. > > As for latency, all the devices are directly connected to a Cisco > 3560G switch and on the same VLAN, so I expect Ethernet/layer-2 > latencies to be sub-millisecond. Also, note that the much faster > iSCSI performance was on the same GbE connections between the same > devices and systems, so network throughput and latency are the same. > > GFS overhead, in handling locking (most likely) and any GFS > filesystem overhead are the likely causes IMO. > > Looking forward to any analysis and guidance you may be able to > provide on getting GFS performance closer to iSCSI-direct. I'm really interested in the outcome of this discussion. Meanwhile I can add that 'gfs_controld -l0' and 'gfs_tool settune /mnt demote_secs 600' (as recommended on this list by the kind developers) helped me tremendously dealing with lots of files. -- Regards, Feri. -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster From wferi at niif.hu Fri Jan 4 15:34:51 2008 From: wferi at niif.hu (Ferenc Wagner) Date: Fri, 04 Jan 2008 16:34:51 +0100 Subject: [Linux-cluster] GFS performance In-Reply-To: (Kamal Jain's message of "Fri, 4 Jan 2008 10:15:26 -0500") References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> <877iipock6.fsf@tac.ki.iif.hu> Message-ID: <87wsqpmwo4.fsf@tac.ki.iif.hu> Kamal Jain writes: > On the demote_secs tuning parameter, I see you're suggesting 600 > seconds, which appears to be longer than the default 300 seconds as > stated by Wendy Cheng at > http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 > -- we're running RHEL4.5. Wouldn't a SHORTER demote period be > better for lots of files, whereas perhaps a longer demote period > might be more efficient for a smaller number of files being locked > for long periods of time? It depends on your usage pattern. I had to access lots of files repeatedly, ie. cycling over them periodically by one machine in the cluster. It helped me a LOT to keep those GFS locks cached on that machine, while the others were all right without being lock masters as they ever needed some of the files only, not all of them. 
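One way to see whether a node really is accumulating cached glocks in this way is to watch the per-mount counters while the workload runs; a small sketch, again with /mnt/gfs standing in for the real mount point:

# One-off snapshot of the lock and glock counts for a GFS mount
gfs_tool counters /mnt/gfs

# Re-sample every 10 seconds and watch the counts grow or shrink
watch -n 10 'gfs_tool counters /mnt/gfs'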
> On a related note, I converted a couple of the clusters in our lab > from GULM to DLM and while performance is not necessarily noticeably > improved (though more detailed testing was done after the > conversion), we did notice that both clusters became more stable in > the DLM configuration. I've never tried GULM, so I can't comment on this. -- Regards, Feri. From kjain at aurarianetworks.com Fri Jan 4 15:41:24 2008 From: kjain at aurarianetworks.com (Kamal Jain) Date: Fri, 4 Jan 2008 10:41:24 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: <87wsqpmwo4.fsf@tac.ki.iif.hu> References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> <877iipock6.fsf@tac.ki.iif.hu> <87wsqpmwo4.fsf@tac.ki.iif.hu> Message-ID: Well, in our applications usage we don't keep cycling over the same files over and over again, we run through lots of files and keep a handful open at any point in time, so perhaps shorter demote_secs is good for us. I have not been able to find out about 'gfs_controld -l0' -- where is that set and what does "-l0" do? Thanks, - K -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ferenc Wagner Sent: Friday, January 04, 2008 10:35 AM To: linux clustering Subject: Re: [Linux-cluster] GFS performance Kamal Jain writes: > On the demote_secs tuning parameter, I see you're suggesting 600 > seconds, which appears to be longer than the default 300 seconds as > stated by Wendy Cheng at > http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 > -- we're running RHEL4.5. Wouldn't a SHORTER demote period be > better for lots of files, whereas perhaps a longer demote period > might be more efficient for a smaller number of files being locked > for long periods of time? It depends on your usage pattern. I had to access lots of files repeatedly, ie. cycling over them periodically by one machine in the cluster. It helped me a LOT to keep those GFS locks cached on that machine, while the others were all right without being lock masters as they ever needed some of the files only, not all of them. > On a related note, I converted a couple of the clusters in our lab > from GULM to DLM and while performance is not necessarily noticeably > improved (though more detailed testing was done after the > conversion), we did notice that both clusters became more stable in > the DLM configuration. I've never tried GULM, so I can't comment on this. -- Regards, Feri. From wferi at niif.hu Fri Jan 4 16:00:43 2008 From: wferi at niif.hu (Ferenc Wagner) Date: Fri, 04 Jan 2008 17:00:43 +0100 Subject: [Linux-cluster] GFS performance In-Reply-To: (Kamal Jain's message of "Fri, 4 Jan 2008 10:41:24 -0500") References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> <877iipock6.fsf@tac.ki.iif.hu> <87wsqpmwo4.fsf@tac.ki.iif.hu> Message-ID: <87k5mpmvh0.fsf@tac.ki.iif.hu> Kamal Jain writes: > Well, in our applications usage we don't keep cycling over the same > files over and over again, we run through lots of files and keep a > handful open at any point in time, so perhaps shorter demote_secs is > good for us. It there's no single machine which does most of the accesses, then probably so. > I have not been able to find out about 'gfs_controld -l0' -- where > is that set and what does "-l0" do? Try gfs_controld -h for some help. Basically, acquiring of Posix locks (fcntl locks) is artifically throttled on GFS by default. 
If you invoke gfs_controld with the -l0 option, this throttling is turned off. It probably doesn't buy you much unless your application uses this type of locks. I hope I recalled the above correctly. Somebody told me that this default is likely to be changed in the future, tough. -- Regards, Feri. From wcheng at redhat.com Fri Jan 4 16:04:20 2008 From: wcheng at redhat.com (Wendy Cheng) Date: Fri, 04 Jan 2008 11:04:20 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> <877iipock6.fsf@tac.ki.iif.hu> Message-ID: <477E5904.6020804@redhat.com> Kamal Jain wrote: > Feri, > > Thanks for the information. A number of people have emailed me expressing some level of interest in the outcome of this, so hopefully I will soon be able to do some tuning and performance experiments and report back our results. > > On the demote_secs tuning parameter, I see you're suggesting 600 seconds, which appears to be longer than the default 300 seconds as stated by Wendy Cheng at http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 -- we're running RHEL4.5. Wouldn't a SHORTER demote period be better for lots of files, whereas perhaps a longer demote period might be more efficient for a smaller number of files being locked for long periods of time? > This demote_secs tunable is a little bit tricky :) ... What happens here is that, GFS caches glocks that could get accumulated to a huge amount of count. Unless vm releases these inodes (files) associated with these glocks, current GFS internal daemons will do *fruitless* scan trying to remove these glock (but never succeed). If you set the demote_secs to a large number, it will *reduce* the wake-up frequencies of these daemons doing these fruitless works, that, in turns, leaving more CPU cycles for real works. Without glock trimming patch in place, that is a way to tune a system that is constantly touching large amount of files (such as rsync). Ditto for "scand" wake-up internal, making it larger will help the performance in this situation. With the *new* glock trimming patch, we actually remove the memory reference count so glock can be "demoted" and subsequently removed from the system if in idle states. To demote the glock, we need gfs_scand daemon to wake up often - this implies we need smaller demote_secs for it to be effective. > On a related note, I converted a couple of the clusters in our lab from GULM to DLM and while performance is not necessarily noticeably improved (though more detailed testing was done after the conversion), we did notice that both clusters became more stable in the DLM configuration. > This is mostly because DLM is the current default lock manager (with on-going development efforts) while GULM is not actively maintained. -- Wendy From kjain at aurarianetworks.com Fri Jan 4 16:16:47 2008 From: kjain at aurarianetworks.com (Kamal Jain) Date: Fri, 4 Jan 2008 11:16:47 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: <477E5904.6020804@redhat.com> References: <1198770380.4932.23.camel@WSBID06223> <477A71B0.1080804@redhat.com> <477BC969.6050506@redhat.com> <877iipock6.fsf@tac.ki.iif.hu> <477E5904.6020804@redhat.com> Message-ID: Ah ha! I think this is starting to make sense now, Wendy. And thank you for the explanation of why we should be using DLM rather than GULM. 
So without the patch, which we do not have, it might be good to increase demote_secs [per GFS mount] to 600 or even more seconds, and scand_secs to...what's a reasonable/safe value on that? It sounds like without the patch all we're doing -- to paraphrase you -- is reducing the frequency of operations which do no good and cause harm in the form of CPU and I/O resource usage. The patch is built into RHEL 4.6 and 5.1, right? When are those expected to be available (we only care about 4.6 right now) and/or how do we get the standalone patch? Thanks again to everyone for the feedback and information. - K -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Wendy Cheng Sent: Friday, January 04, 2008 11:04 AM To: linux clustering Subject: Re: [Linux-cluster] GFS performance Kamal Jain wrote: > Feri, > > Thanks for the information. A number of people have emailed me expressing some level of interest in the outcome of this, so hopefully I will soon be able to do some tuning and performance experiments and report back our results. > > On the demote_secs tuning parameter, I see you're suggesting 600 seconds, which appears to be longer than the default 300 seconds as stated by Wendy Cheng at http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 -- we're running RHEL4.5. Wouldn't a SHORTER demote period be better for lots of files, whereas perhaps a longer demote period might be more efficient for a smaller number of files being locked for long periods of time? > This demote_secs tunable is a little bit tricky :) ... What happens here is that, GFS caches glocks that could get accumulated to a huge amount of count. Unless vm releases these inodes (files) associated with these glocks, current GFS internal daemons will do *fruitless* scan trying to remove these glock (but never succeed). If you set the demote_secs to a large number, it will *reduce* the wake-up frequencies of these daemons doing these fruitless works, that, in turns, leaving more CPU cycles for real works. Without glock trimming patch in place, that is a way to tune a system that is constantly touching large amount of files (such as rsync). Ditto for "scand" wake-up internal, making it larger will help the performance in this situation. With the *new* glock trimming patch, we actually remove the memory reference count so glock can be "demoted" and subsequently removed from the system if in idle states. To demote the glock, we need gfs_scand daemon to wake up often - this implies we need smaller demote_secs for it to be effective. > On a related note, I converted a couple of the clusters in our lab from GULM to DLM and while performance is not necessarily noticeably improved (though more detailed testing was done after the conversion), we did notice that both clusters became more stable in the DLM configuration. > This is mostly because DLM is the current default lock manager (with on-going development efforts) while GULM is not actively maintained. -- Wendy From Paul.McDowell at celera.com Fri Jan 4 17:59:10 2008 From: Paul.McDowell at celera.com (Paul n McDowell) Date: Fri, 4 Jan 2008 12:59:10 -0500 Subject: [Linux-cluster] GFS performance In-Reply-To: <477E5904.6020804@redhat.com> Message-ID: Hi all.. I feel compelled to chime in on this GFS performance thread as we have a three node GFS environment running RHEL4.6 that was suffering from severe memory utilization (100% on a 32GB system) on all nodes and unacceptably poor performance. 
The three nodes serve five GFS file systems which range from 100GB to 1.2TB in size and are home to a diverse combination of very large and very small files. The degradation in performance always coincided with backup process starting, i.e. large numbers of inodes being read and cached, and was so bad that I was considering abandoning our GFS implementation altogether. Basic Unix commands such as df, ls and mkdir either took several minutes to complete or never finished at all. The only way to resolve the problem was to reboot all three production nodes which alleviated the problem until the next backup started. With a recommendation from RedHat support I implemented the tunable GFS parameter that Wendy describes in http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 by setting glock_purge to 50 for all file systems and it has made a dramatic difference. The memory utilization is no longer apparent and overall performance is very acceptable even when backups are running. If you're are not at update 6 yet then I would urge you to upgrade as soon as possible to take advantage of this new feature. Regards, Paul McDowell Celera Wendy Cheng Sent by: linux-cluster-bounces at redhat.com 01/04/2008 11:04 AM Please respond to linux clustering To linux clustering cc Subject Re: [Linux-cluster] GFS performance Kamal Jain wrote: > Feri, > > Thanks for the information. A number of people have emailed me expressing some level of interest in the outcome of this, so hopefully I will soon be able to do some tuning and performance experiments and report back our results. > > On the demote_secs tuning parameter, I see you're suggesting 600 seconds, which appears to be longer than the default 300 seconds as stated by Wendy Cheng at http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 -- we're running RHEL4.5. Wouldn't a SHORTER demote period be better for lots of files, whereas perhaps a longer demote period might be more efficient for a smaller number of files being locked for long periods of time? > This demote_secs tunable is a little bit tricky :) ... What happens here is that, GFS caches glocks that could get accumulated to a huge amount of count. Unless vm releases these inodes (files) associated with these glocks, current GFS internal daemons will do *fruitless* scan trying to remove these glock (but never succeed). If you set the demote_secs to a large number, it will *reduce* the wake-up frequencies of these daemons doing these fruitless works, that, in turns, leaving more CPU cycles for real works. Without glock trimming patch in place, that is a way to tune a system that is constantly touching large amount of files (such as rsync). Ditto for "scand" wake-up internal, making it larger will help the performance in this situation. With the *new* glock trimming patch, we actually remove the memory reference count so glock can be "demoted" and subsequently removed from the system if in idle states. To demote the glock, we need gfs_scand daemon to wake up often - this implies we need smaller demote_secs for it to be effective. > On a related note, I converted a couple of the clusters in our lab from GULM to DLM and while performance is not necessarily noticeably improved (though more detailed testing was done after the conversion), we did notice that both clusters became more stable in the DLM configuration. > This is mostly because DLM is the current default lock manager (with on-going development efforts) while GULM is not actively maintained. 
-- Wendy -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From joseparrella at gmail.com Fri Jan 4 18:20:24 2008 From: joseparrella at gmail.com (=?ISO-8859-1?Q?Jos=E9_Miguel_Parrella_Romero?=) Date: Fri, 04 Jan 2008 13:50:24 -0430 Subject: [Linux-cluster] I/O errors and performance in GFS mounts Message-ID: <477E78E8.5010306@gmail.com> Greetings, I have a two-node cluster based on Itanium2 machines using GFS for shared storage with fibre channel as transport. The whole setup has been working OK for three months now, and I have another two-node setup which is also working OK, except for some fibre issues (see below) // other clustering applications are working OK. I'm using clvmd and I've setup two LV, mkfs'ed them with GFS and mounted them in both nodes without any problems (based, of course, on the cman cluster definitions). I've setup manual fencing since I don't have proper devices to help me with that at the time. Since a couple of days now I've seen a lot of I/O errors with the GFS mounts, for example when using df to look at the available space on local mounts, and of course when ls'ing the shares. Sometimes df also reports incorrect size information (for example only 677 MB. used when the share has circa 60 GB.) This problem only occurs in one of the two nodes at the same time, and it is mostly random. The cluster hosts IMAP (Dovecot) and SMTP (Postfix) services, which turn unusable (except for non-local mail transport in Postfix) when this I/O errors appear. Searching for errors on dmesg and syslog throws several, continual errors such as this one: GFS: can't mount proto = lock_dlm, table = mail:inbox, hostdata = ... Where ... varies from: kernel unaligned access to 0xfffffffffffffffd, ip=0xa000000100187d81 mount[2200]: error during unaligned kernel access to mount[5221]: NaT consumption 2216203124768 [4] I'm aware that unaligned kernel access are not a bug, but rather a well-handled inconsistency, but these one seems to mess with GFS way too much. I fsck'ed the filesystems and this seemed to help a little, but I'm still getting slow times when ls'ing the GFS filesystems. We've chosen GFS over HA NFS, but we're getting this kind of performance problems. Some of our problems are due to fibre issues, for example unexpected LOOP DOWN's, but this time it seems more like a software issue. I'm running kernel 2.6.18 in Debian Etch. I would like to know if some of you have run into this problem. Maybe I'm missing some critical part in my cluster setup. Greetings, Jose From lhh at redhat.com Fri Jan 4 19:23:53 2008 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 04 Jan 2008 14:23:53 -0500 Subject: [Linux-cluster] Last tuning on Quorum Disk / question In-Reply-To: <477DE832.3010108@bull.net> References: <477DE832.3010108@bull.net> Message-ID: <1199474633.16312.10.camel@ayanami.boston.devel.redhat.com> On Fri, 2008-01-04 at 09:02 +0100, Alain Moulle wrote: > Hi Lon, > > Finally, I adopt this quorum disk configuration : > > > > > > > > I just wonder if the interval values for quorum disk with regard to > the one for heuristic is the best choice or not ? * should have quotes around attr values: interval=2 -> interval="2" score=1 -> score="1" * -cX is the number of pings to send. When using -c1, you should use tko="3" or something similar. * -tX is internet time to live - usually the number of router hops. For a local gateway, X should be 1. 
> And which are the rules to fit the good value for interval and tko > on heuristic ? (I don't completely understand why your both heuristics > avoids suicide if one ping get lots, it seems to be due to tko value > but ... ) 1. "Send one ping 172.21.1.12 one time with a max IP TTL of 1. Do this every 2 seconds. If this execution fails 3 times, we're done." 2. "Ping 172.21.1.12 one time. Do this every 2 seconds. If we fail to get a response from this operation, we're done." 3. "Send 3 pings to 172.21.1.12. Do this every 2 seconds. If we fail to get a response from this operation, we are done." ... 1 and 3 are almost equivalent: 3 ping packets must be lost to decide the heuristic is dead. ... 2, however, means that if the ping packet is /ever/ lost, the heuristic is dead. -- Lon From charlieb-linux-cluster at e-smith.com Fri Jan 4 21:18:45 2008 From: charlieb-linux-cluster at e-smith.com (Charlie Brady) Date: Fri, 4 Jan 2008 16:18:45 -0500 (EST) Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) Message-ID: I'm helping a colleague to collect information on an application lockup problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. I'd appreciate advice as to what information to collect next. Packages in use are: kernel-smp-2.6.9-67.EL.i686.rpm dlm-1.0.7-1.i686.rpm dlm-kernel-smp-2.6.9-52.2.i686.rpm GFS-kernel-smp-2.6.9-75.9.i686.rpm GFS-6.1.15-1.i386.rpm ccs-1.0.11-1.i686.rpm cman-1.0.17-0.i686.rpm cman-kernel-smp-2.6.9-53.5.i686.rpm We've reduced the application code to a simple test case. The following code run on each node will soon block, and doesn't receive signals until the peer node is shutdown: ... fl.l_whence=SEEK_SET; fl.l_start=0; fl.l_len=1; while (1) { fl.l_type=F_WRLCK; retval=fcntl(filedes,F_SETLKW,&fl); if (retval==-1) { perror("lock"); exit(1); } // attempt to unlock the index file fl.l_type=F_UNLCK; retval=fcntl(filedes,F_SETLKW,&fl); if (retval==-1) { perror("unlock"); exit(1); } } ... /proc/cluster/dlm_debug on the respectives nodes showed this on most recent run: Node1: 2 FS1 send einval to 2 FS1 send einval to 2 [above line many times] FS1 send einval to 2 FS1 send einval to 2 FS1 grant lock on lockqueue 2 FS1 process_lockqueue_reply id 5400c2 state 0 Node 2: FS1 (31613) req reply einval 3de002b1 fr 1 r 1 7 FS1 (31613) req reply einval 3ea30356 fr 1 r 1 7 FS1 (31613) req reply einval 3f0100d5 fr 1 r 1 7 FS1 (31613) req reply einval 3df10367 fr 1 r 1 7 FS1 (31613) req reply einval 3fa600be fr 1 r 1 7 FS1 (31613) req reply einval 3f430355 fr 1 r 1 7 FS1 (31613) req reply einval 3fd20096 fr 1 r 1 7 FS1 (31613) req reply einval 3fc900d3 fr 1 r 1 7 FS1 (31613) req reply einval 3fe60375 fr 1 r 1 7 FS1 (31613) req reply einval 3f870143 fr 1 r 1 7 FS1 (31613) req reply einval 3f690239 fr 1 r 1 7 FS1 (31613) req reply einval 3eb40379 fr 1 r 1 7 FS1 (31613) req reply einval 3fb00352 fr 1 r 1 7 FS1 (31613) req reply einval 40a002f6 fr 1 r 1 7 FS1 (31613) req reply einval 3fb90265 fr 1 r 1 7 FS1 (31613) req reply einval 400b0326 fr 1 r 1 7 I have lockdump files from each node, but don't know how to interpret them. On shutdown, GFS unmount failed, and kernel panic followed: Turning off quotas: [ OK ] Unmounting file systems: umount2: Device or resource busy umount: /diskarray: device is busy umount2: Device or resource busy umount: /diskarray: device is busy CMAN: No functional network interfaces, leaving cluster CMAN: sendmsg failed: -22 CMAN: we are leaving the cluster. 
WARNING: dlm_emergency_shutdown SM: 00000002 sm_stop: SG still joined SM: 01000004 sm_stop: SG still joined SM: 02000006 sm_stop: SG still joined ds: 007b es: 007b ss: 0068 Process gfs_glockd (pid: 5654, threadinfo=f40d2000 task=f3c4b230) Stack: f8ade2d3 f8bb8000 00000003 f2c4ee80 f8ad98b2 f8c28ede 00000001 f33c0c7c f33c0c60 f8c1ed63 f8c55da0 d4aa4940 f33c0c60 f8c55da0 f33c0c60 f8c1e257 f33c0c60 00000001 f33c0cf4 f8c1e30e f33c0c60 f33c0c7c f8c1e431 00000001 Call Trace: [] lm_dlm_unlock+0x14/0x1c [lock_dlm] [] gfs_lm_unlock+0x2c/0x42 [gfs] [] gfs_glock_drop_th+0xf3/0x12d [gfs] [] rq_demote+0x7f/0x98 [gfs] [] run_queue+0x5a/0xc1 [gfs] [] unlock_on_glock+0x1f/0x28 [gfs] [] gfs_reclaim_glock+0xc3/0x13c [gfs] [] gfs_glockd+0x39/0xde [gfs] [] default_wake_function+0x0/0xc [] ret_from_fork+0x6/0x14 [] default_wake_function+0x0/0xc [] gfs_glockd+0x0/0xde [gfs] [] kernel_thread_helper+0x5/0xb Code: 73 34 8b 03 ff 73 2c ff 73 08 ff 73 04 ff 73 0c 56 ff 70 18 68 ef e3 ad f8 e8 de 92 64 c7 83 c4 34 68 d3 e2 ad f8 e8 d1 92 64 c7 <0f> 0b 69 01 1b e2 ad f8 68 d5 e2 ad f8 e8 8c 8a 64 c7 5b 5e 5f <0>Fatal exception: panic in 5 seconds Kernel panic - not syncing: Fatal exception --- Charlie From williamottley at gmail.com Sat Jan 5 19:15:16 2008 From: williamottley at gmail.com (William Ottley) Date: Sat, 5 Jan 2008 14:15:16 -0500 Subject: [Linux-cluster] where would the VIP be? Message-ID: <8108f4850801051115s30314637xe5f81b4646e93c0c@mail.gmail.com> I'm trying to setup a LVS-TUN, which has 3 internet connections. eth0 - public (client) eth1 - public TUN to webserver 1 eth2 - public TUN to webserver 2 and webserver 3 where would the VIP be? eth0:1?, also, do we enable forward to the webservers or just the LVS? I'm also confused about the lo:0. do we do that on the webservers or just do the: /etc/sysctl.conf: net.ipv4.conf.eth0.arp_ignore = 1 net.ipv4.conf.eth0.arp_announce = 2 net.ipv4.conf.all.arp_ignore = 1 net.ipv4.conf.all.arp_announce = 2 sysctl -p thanks for any insight! William -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From chawkins at veracitynetworks.com Sat Jan 5 19:43:52 2008 From: chawkins at veracitynetworks.com (Christopher Hawkins) Date: Sat, 5 Jan 2008 14:43:52 -0500 Subject: [Linux-cluster] where would the VIP be? In-Reply-To: <8108f4850801051115s30314637xe5f81b4646e93c0c@mail.gmail.com> Message-ID: <200801051944.m05Jio1Z017502@mxmail.leaseoptions.com> The VIP should be on eth0, like you said - the clients need to be able to reach it. And on the real servers (web servers), you would do the VIP on lo:0 AND the sysctl.conf settings. Chris -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of William Ottley Sent: Saturday, January 05, 2008 2:15 PM To: linux-cluster at redhat.com Subject: [Linux-cluster] where would the VIP be? I'm trying to setup a LVS-TUN, which has 3 internet connections. eth0 - public (client) eth1 - public TUN to webserver 1 eth2 - public TUN to webserver 2 and webserver 3 where would the VIP be? eth0:1?, also, do we enable forward to the webservers or just the LVS? I'm also confused about the lo:0. 
do we do that on the webservers or just do the: /etc/sysctl.conf: net.ipv4.conf.eth0.arp_ignore = 1 net.ipv4.conf.eth0.arp_announce = 2 net.ipv4.conf.all.arp_ignore = 1 net.ipv4.conf.all.arp_announce = 2 sysctl -p thanks for any insight! William -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster From williamottley at gmail.com Sat Jan 5 19:54:44 2008 From: williamottley at gmail.com (William Ottley) Date: Sat, 5 Jan 2008 14:54:44 -0500 Subject: [Linux-cluster] where would the VIP be? In-Reply-To: <200801051944.m05Jio1Z017502@mxmail.leaseoptions.com> References: <8108f4850801051115s30314637xe5f81b4646e93c0c@mail.gmail.com> <200801051944.m05Jio1Z017502@mxmail.leaseoptions.com> Message-ID: <8108f4850801051154k7d13e7efyb3e124ad6dcfb066@mail.gmail.com> Hey Chris! thanks! I use centos 5.1, and I have kernel 2.6.18-53.1.4.el5 on all the machines. SO I'll setup lo:0 to point to the VIP, and use the sysctl.conf... thanks sooo much!! Will On Jan 5, 2008 2:43 PM, Christopher Hawkins wrote: > The VIP should be on eth0, like you said - the clients need to be able to > reach it. And on the real servers (web servers), you would do the VIP on > lo:0 AND the sysctl.conf settings. > > Chris > > > -----Original Message----- > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of William Ottley > Sent: Saturday, January 05, 2008 2:15 PM > To: linux-cluster at redhat.com > Subject: [Linux-cluster] where would the VIP be? > > I'm trying to setup a LVS-TUN, which has 3 internet connections. > eth0 - public (client) > eth1 - public TUN to webserver 1 > eth2 - public TUN to webserver 2 and webserver 3 > > where would the VIP be? eth0:1?, also, do we enable forward to the > webservers or just the LVS? > > I'm also confused about the lo:0. do we do that on the webservers or just do > the: > > /etc/sysctl.conf: > net.ipv4.conf.eth0.arp_ignore = 1 > net.ipv4.conf.eth0.arp_announce = 2 > net.ipv4.conf.all.arp_ignore = 1 > net.ipv4.conf.all.arp_announce = 2 > sysctl -p > > > thanks for any insight! > > William > > -- > --------------- > Morpheus: After this, there is no turning back. You take the blue pill > - the story ends, you wake up in your bed and believe whatever you want to > believe. You take the red pill - you stay in Wonderland and I show you how > deep the rabbit-hole goes. > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From williamottley at gmail.com Sat Jan 5 22:51:39 2008 From: williamottley at gmail.com (William Ottley) Date: Sat, 5 Jan 2008 17:51:39 -0500 Subject: [Linux-cluster] would this configuration work for lvs-dr? Message-ID: <8108f4850801051451n1bf70ae6nb62187498cac7db8@mail.gmail.com> could someone please let me know if this setup will work for lvs-dr? 
I use pulse / piranha / ipvsadm I just can't get anything to work. and I'm thinking maybe the GW's are what causing the problems. I got help with regards to the forwarding bit, and not to use lo:0 but nothing seems to work. LVS: eth0: 192.168.2.1 / gw: 192.168.2.10 (CIP, direct connect to eth0) eth0:1 192.168.2.100 (VIP) eth1: 192.168.3.1 / gw 192.168.2.1 eth2: 192.168.0.111 / gw 192.168.2.1 sysctl.conf: ipv4.ip_forward = 1 RS#1 IP: 192.168.3.10 GW: 192.168.3.1 sysctl.conf: ipv4.ip_forward = 1 sysctl.conf: net.ipv4.conf.lo.arp_ignore = 1 sysctl.conf: net.ipv4.conf.lo.arp_announce = 2 sysctl.conf: net.ipv4.conf.all.arp_ignore = 1 sysctl.conf: net.ipv4.conf.all.arp_announce = 2 RS#2 IP: 192.168.0.10 GW: 192.168.0.1 sysctl.conf: ipv4.ip_forward = 1 sysctl.conf: net.ipv4.conf.lo.arp_ignore = 1 sysctl.conf: net.ipv4.conf.lo.arp_announce = 2 sysctl.conf: net.ipv4.conf.all.arp_ignore = 1 sysctl.conf: net.ipv4.conf.all.arp_announce = 2 -- --------------- Morpheus: After this, there is no turning back. You take the blue pill - the story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill - you stay in Wonderland and I show you how deep the rabbit-hole goes. From mrpquter at yahoo.com Sun Jan 6 14:26:37 2008 From: mrpquter at yahoo.com (Michael Harrison) Date: Sun, 6 Jan 2008 06:26:37 -0800 (PST) Subject: [Linux-cluster] RHEL4 U4 cman heartbeats on multiple interfaces Message-ID: <95038.95673.qm@web54407.mail.yahoo.com> Hi, Is cman capable of being configured with redundant network interfaces ? I have in mind using a private network as primary, and public network connection as secondary hearbeat in case the primary goes down. Thanks! -Mike ____________________________________________________________________________________ Never miss a thing. Make Yahoo your home page. http://www.yahoo.com/r/hs From mrpquter at yahoo.com Mon Jan 7 01:00:48 2008 From: mrpquter at yahoo.com (Michael Harrison) Date: Sun, 6 Jan 2008 17:00:48 -0800 (PST) Subject: [Linux-cluster] RHEL4 U4 cman heartbeats on multiple interfaces In-Reply-To: <95038.95673.qm@web54407.mail.yahoo.com> Message-ID: <465896.59881.qm@web54410.mail.yahoo.com> I missed this entry in the faq when I posted my question. Sorry! http://sources.redhat.com/cluster/faq.html#rgm_nicfailover The answer is no, it can't. --- Michael Harrison wrote: > Hi, > > Is cman capable of being configured with redundant network interfaces > ? > I have in mind using a private network as primary, and public network > connection as secondary hearbeat in case the primary goes down. > > Thanks! > -Mike > > > > > ____________________________________________________________________________________ > Never miss a thing. Make Yahoo your home page. > http://www.yahoo.com/r/hs > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > ____________________________________________________________________________________ Never miss a thing. Make Yahoo your home page. http://www.yahoo.com/r/hs From mrpquter at yahoo.com Mon Jan 7 01:45:33 2008 From: mrpquter at yahoo.com (Michael Harrison) Date: Sun, 6 Jan 2008 17:45:33 -0800 (PST) Subject: [Linux-cluster] freezing a service Message-ID: <394625.68046.qm@web54401.mail.yahoo.com> Hi, Is it possible to freeze a service, so that rgmanager effectively ignores it? In other words, when doing maintenance on a production cluster, it's sometimes necessary to stop the cluster services on that node. 
When rgmanager comes down, I'd like it to leave whatever services are running on the node alone, and not fail them over. I looked at the docs and utilities and didn't find anything like that. Don't know if I missed it somewhere. Thanks, -Mike ____________________________________________________________________________________ Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ From james at cloud9.co.uk Mon Jan 7 09:11:53 2008 From: james at cloud9.co.uk (James Fidell) Date: Mon, 07 Jan 2008 09:11:53 +0000 Subject: [Linux-cluster] GFS tuning advice sought Message-ID: <4781ECD9.6020903@cloud9.co.uk> I have a 3-node cluster built on CentOS 5.1, fully updated, providing Maildir mail spool filesystems to dovecot-based IMAP servers. As it stands GFS is in its default configuration -- no tuning has been done so far. Mostly, it's working fine. Unfortunately we do have a few people with tens of thousands of emails in single mailboxes who are seeing fairly significant performance problems when fetching their email and in this instance "make your mailbox smaller" isn't an acceptable solution :( Is there any GFS tuning I can do which might help speed up access to these mailboxes? Thanks, James From cluster at defuturo.co.uk Fri Jan 4 17:33:56 2008 From: cluster at defuturo.co.uk (Robert Clark) Date: Fri, 04 Jan 2008 17:33:56 +0000 Subject: [Linux-cluster] GFS: assertion failure in add_to_queue In-Reply-To: <87y7b5odcb.fsf@tac.ki.iif.hu> References: <87y7b5odcb.fsf@tac.ki.iif.hu> Message-ID: <1199468036.2388.5.camel@rutabaga.defuturo.co.uk> On Fri, 2008-01-04 at 15:49 +0100, Ferenc Wagner wrote: > I'm using a 1-node GFS1 "cluster" with DLM locking and sporadically > (say once a week) get the following in the kernel logs (Linux 2.6.23): > > GFS: fsid=noc:cricket.0: warning: assertion "(tmp_gh->gh_flags & GL_LOCAL_EXCL) || !(gh->gh_flags & GL_LOCAL_EXCL)" failed > GFS: fsid=noc:cricket.0: function = add_to_queue > GFS: fsid=noc:cricket.0: file = /home/wferi/cluster/cluster-2.01.00/gfs-kernel/src/gfs/glock.c, line = 1420 > GFS: fsid=noc:cricket.0: time = 1197666253 Could be this: https://bugzilla.redhat.com/show_bug.cgi?id=272301 We're seeing it too. The trigger is a process attempting multiple locks on a single file. Occasionally it seems to cause an oops & panic as well. As far as I know, there's no fix available for it at the moment. 
Robert From wferi at niif.hu Mon Jan 7 10:57:45 2008 From: wferi at niif.hu (Ferenc Wagner) Date: Mon, 07 Jan 2008 11:57:45 +0100 Subject: [Linux-cluster] GFS: assertion failure in add_to_queue In-Reply-To: <1199468036.2388.5.camel@rutabaga.defuturo.co.uk> (Robert Clark's message of "Fri, 04 Jan 2008 17:33:56 +0000") References: <87y7b5odcb.fsf@tac.ki.iif.hu> <1199468036.2388.5.camel@rutabaga.defuturo.co.uk> Message-ID: <873at9q4wm.fsf@tac.ki.iif.hu> Robert Clark writes: > On Fri, 2008-01-04 at 15:49 +0100, Ferenc Wagner wrote: > >> I'm using a 1-node GFS1 "cluster" with DLM locking and sporadically >> (say once a week) get the following in the kernel logs (Linux 2.6.23): >> >> GFS: fsid=noc:cricket.0: warning: assertion "(tmp_gh->gh_flags & GL_LOCAL_EXCL) || !(gh->gh_flags & GL_LOCAL_EXCL)" failed >> GFS: fsid=noc:cricket.0: function = add_to_queue >> GFS: fsid=noc:cricket.0: file = /home/wferi/cluster/cluster-2.01.00/gfs-kernel/src/gfs/glock.c, line = 1420 >> GFS: fsid=noc:cricket.0: time = 1197666253 > > Could be this: > > https://bugzilla.redhat.com/show_bug.cgi?id=272301 > > We're seeing it too. The trigger is a process attempting multiple locks > on a single file. Occasionally it seems to cause an oops & panic as > well. > > As far as I know, there's no fix available for it at the moment. Thanks for the reply. Though that's far from comforting... -- Regards, Feri. From wcheng at redhat.com Mon Jan 7 13:51:44 2008 From: wcheng at redhat.com (Wendy Cheng) Date: Mon, 07 Jan 2008 08:51:44 -0500 Subject: [Linux-cluster] GFS tuning advice sought In-Reply-To: <4781ECD9.6020903@cloud9.co.uk> References: <4781ECD9.6020903@cloud9.co.uk> Message-ID: <47822E70.9000507@redhat.com> James Fidell wrote: >I have a 3-node cluster built on CentOS 5.1, fully updated, providing >Maildir mail spool filesystems to dovecot-based IMAP servers. As it >stands GFS is in its default configuration -- no tuning has been done >so far. > >Mostly, it's working fine. Unfortunately we do have a few people with >tens of thousands of emails in single mailboxes who are seeing fairly >significant performance problems when fetching their email and in this >instance "make your mailbox smaller" isn't an acceptable solution :( > >Is there any GFS tuning I can do which might help speed up access to >these mailboxes? > > > > You probably need GFS2 in this case. To fix mail server issues in GFS1 would be too intrusive with current state of development cycle. -- Wendy From isplist at logicore.net Mon Jan 7 14:55:03 2008 From: isplist at logicore.net (isplist at logicore.net) Date: Mon, 7 Jan 2008 08:55:03 -0600 Subject: [Linux-cluster] GFS tuning advice sought In-Reply-To: <47822E70.9000507@redhat.com> Message-ID: <2008178553.393310@leena> >> Is there any GFS tuning I can do which might help speed up access to >> these mailboxes? >> > You probably need GFS2 in this case. To fix mail server issues in GFS1 > would be too intrusive with current state of development cycle. Wendy, I noticed you mention that GFS2 might be best for this. Would this apply for web servers as well? I've been using GFS on RHEL4 for web server cluster sharing. Would I be better to look at GFS2 for performance? 
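For either workload, the big Maildir spool or a shared web tree, there are a couple of low-risk GFS1 knobs worth trying before jumping to GFS2. This is only a sketch: the mount point and device below are placeholders, and the tunables may or may not exist at your package level, so check what gettune reports first.

# mount with noatime to cut metadata write traffic (placeholder paths)
mount -t gfs -o noatime /dev/vg00/share /gfs/share

# see which tunables this GFS build actually exposes
gfs_tool gettune /gfs/share

# if present, trim cached glocks more aggressively (example values only)
gfs_tool settune /gfs/share glock_purge 50
gfs_tool settune /gfs/share demote_secs 200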
Mike From rmaureira at solint.cl Mon Jan 7 19:36:44 2008 From: rmaureira at solint.cl (Robinson Maureira Castillo) Date: Mon, 07 Jan 2008 16:36:44 -0300 Subject: [Linux-cluster] freezing a service In-Reply-To: <394625.68046.qm@web54401.mail.yahoo.com> References: <394625.68046.qm@web54401.mail.yahoo.com> Message-ID: <47827F4C.5050108@solint.cl> Hi there, you can always stop and disable a service using: clusvcadm -d And start and re-enabling it later with: clusvcadm -e Hope it helps. Best regards, Michael Harrison wrote: > Hi, > > Is it possible to freeze a service, so that rgmanager effectively > ignores it? In other words, when doing maintenance on a production > cluster, it's sometimes necessary to stop the cluster services on that > node. When rgmanager comes down, I'd like it to leave whatever services > are running on the node alone, and not fail them over. > > I looked at the docs and utilities and didn't find anything like that. > Don't know if I missed it somewhere. > > Thanks, > -Mike > > > > ____________________________________________________________________________________ > Be a better friend, newshound, and > know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -- Robinson Maureira Castillo Jefe de Soporte SOLINT F: +56 2 4119047 C: +56 9 95994987 From mrpquter at yahoo.com Mon Jan 7 20:04:19 2008 From: mrpquter at yahoo.com (Michael Harrison) Date: Mon, 7 Jan 2008 12:04:19 -0800 (PST) Subject: [Linux-cluster] freezing a service In-Reply-To: <47827F4C.5050108@solint.cl> Message-ID: <302270.15413.qm@web54401.mail.yahoo.com> What I'd like to do is stop the cluster components, leaving the services running. So for example, stop rgmanager, fenced, cman, ccsd for upgrades or whatever, and not have rgmanager fail over an IP address that it would otherwise control. Does it make sense? Cheers, -Mike --- Robinson Maureira Castillo wrote: > Hi there, you can always stop and disable a service using: > > clusvcadm -d > > And start and re-enabling it later with: > > clusvcadm -e > > > Hope it helps. > > Best regards, > > Michael Harrison wrote: > > Hi, > > > > Is it possible to freeze a service, so that rgmanager effectively > > ignores it? In other words, when doing maintenance on a production > > cluster, it's sometimes necessary to stop the cluster services on > that > > node. When rgmanager comes down, I'd like it to leave whatever > services > > are running on the node alone, and not fail them over. > > > > I looked at the docs and utilities and didn't find anything like > that. > > Don't know if I missed it somewhere. > > > > Thanks, > > -Mike > > > > > > > > > ____________________________________________________________________________________ > > Be a better friend, newshound, and > > know-it-all with Yahoo! Mobile. Try it now. > http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ > > > > -- > > Linux-cluster mailing list > > Linux-cluster at redhat.com > > https://www.redhat.com/mailman/listinfo/linux-cluster > > -- > Robinson Maureira Castillo > Jefe de Soporte > SOLINT > > F: +56 2 4119047 > C: +56 9 95994987 > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > ____________________________________________________________________________________ Never miss a thing. Make Yahoo your home page. 
http://www.yahoo.com/r/hs From gordan at bobich.net Mon Jan 7 21:24:51 2008 From: gordan at bobich.net (Gordan Bobic) Date: Mon, 07 Jan 2008 21:24:51 +0000 Subject: [Linux-cluster] GFS tuning advice sought In-Reply-To: <47822E70.9000507@redhat.com> References: <4781ECD9.6020903@cloud9.co.uk> <47822E70.9000507@redhat.com> Message-ID: <478298A3.7010605@bobich.net> > James Fidell wrote: > >> I have a 3-node cluster built on CentOS 5.1, fully updated, providing >> Maildir mail spool filesystems to dovecot-based IMAP servers. As it >> stands GFS is in its default configuration -- no tuning has been done >> so far. >> >> Mostly, it's working fine. Unfortunately we do have a few people with >> tens of thousands of emails in single mailboxes who are seeing fairly >> significant performance problems when fetching their email and in this >> instance "make your mailbox smaller" isn't an acceptable solution :( >> >> Is there any GFS tuning I can do which might help speed up access to >> these mailboxes? I have implemented similar mail systems in the past, and I hate to tell you this, but if you really have tens of thousands of emails in a single folder and you need to sift through them frequently, even a single-user-server with ext3 won't give you any kind of a sane performance over a WAN. Even on a 100Mb LAN, things will end up timing out, even without clustering. You'll have to split the huge mail folders up into several smaller folders. Gordan From lhh at redhat.com Mon Jan 7 22:48:10 2008 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 07 Jan 2008 17:48:10 -0500 Subject: [Linux-cluster] freezing a service In-Reply-To: <394625.68046.qm@web54401.mail.yahoo.com> References: <394625.68046.qm@web54401.mail.yahoo.com> Message-ID: <1199746090.16312.12.camel@ayanami.boston.devel.redhat.com> On Sun, 2008-01-06 at 17:45 -0800, Michael Harrison wrote: > Hi, > > Is it possible to freeze a service, so that rgmanager effectively > ignores it? In other words, when doing maintenance on a production > cluster, it's sometimes necessary to stop the cluster services on that > node. When rgmanager comes down, I'd like it to leave whatever services > are running on the node alone, and not fail them over. > > I looked at the docs and utilities and didn't find anything like that. > Don't know if I missed it somewhere. In HEAD, yes. If you're not running HEAD... clusvcadm -d rg_test test /etc/cluster/cluster.conf start service [do maintenance here] rg_test test /etc/cluster/cluster.conf stop service clusvcadm -e -- Lon From lhh at redhat.com Mon Jan 7 22:50:10 2008 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 07 Jan 2008 17:50:10 -0500 Subject: [Linux-cluster] freezing a service In-Reply-To: <1199746090.16312.12.camel@ayanami.boston.devel.redhat.com> References: <394625.68046.qm@web54401.mail.yahoo.com> <1199746090.16312.12.camel@ayanami.boston.devel.redhat.com> Message-ID: <1199746210.16312.15.camel@ayanami.boston.devel.redhat.com> On Mon, 2008-01-07 at 17:48 -0500, Lon Hohberger wrote: > On Sun, 2008-01-06 at 17:45 -0800, Michael Harrison wrote: > > Hi, > > > > Is it possible to freeze a service, so that rgmanager effectively > > ignores it? In other words, when doing maintenance on a production > > cluster, it's sometimes necessary to stop the cluster services on that > > node. When rgmanager comes down, I'd like it to leave whatever services > > are running on the node alone, and not fail them over. Also, there's a BZ open about supporting teardown/start w/o doing a failover. 
The 'freeze' function in head doesn't let you stop the cluster services, but it lets you work on the service itself w/o rgmanager checking it or moving it while you're performing maintenance. -- Lon From mrpquter at yahoo.com Mon Jan 7 22:54:45 2008 From: mrpquter at yahoo.com (Michael Harrison) Date: Mon, 7 Jan 2008 14:54:45 -0800 (PST) Subject: [Linux-cluster] freezing a service In-Reply-To: <1199746090.16312.12.camel@ayanami.boston.devel.redhat.com> Message-ID: <650291.92797.qm@web54404.mail.yahoo.com> Ah, thanks. No, for the time being I'm running the bits released for RHEL4U4. Good to know that freezing will be available at sometime in the future. BTW, what will it be called ? Freezing ? -Mike --- Lon Hohberger wrote: > On Sun, 2008-01-06 at 17:45 -0800, Michael Harrison wrote: > > Hi, > > > > Is it possible to freeze a service, so that rgmanager effectively > > ignores it? In other words, when doing maintenance on a production > > cluster, it's sometimes necessary to stop the cluster services on > that > > node. When rgmanager comes down, I'd like it to leave whatever > services > > are running on the node alone, and not fail them over. > > > > I looked at the docs and utilities and didn't find anything like > that. > > Don't know if I missed it somewhere. > > In HEAD, yes. > > If you're not running HEAD... > > clusvcadm -d > rg_test test /etc/cluster/cluster.conf start service > [do maintenance here] > rg_test test /etc/cluster/cluster.conf stop service > clusvcadm -e > > -- Lon > > > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > ____________________________________________________________________________________ Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ From lhh at redhat.com Mon Jan 7 23:06:10 2008 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 07 Jan 2008 18:06:10 -0500 Subject: [Linux-cluster] would this configuration work for lvs-dr? In-Reply-To: <8108f4850801051451n1bf70ae6nb62187498cac7db8@mail.gmail.com> References: <8108f4850801051451n1bf70ae6nb62187498cac7db8@mail.gmail.com> Message-ID: <1199747170.16312.32.camel@ayanami.boston.devel.redhat.com> With direct routing, all the nodes must be visible to the outside world using the same route as the director(s). If you're trying to route *through* your director, you need to use NAT (or tun, which I've never used). Direct routing means that you are not using the director as a router, just a load balancer. That is, assuming you have a gateway for all 3 hosts that's @ 192.168.2.254... Director: eth0 192.168.2.1 eth0:0 192.168.2.100 (vip) gateway / default route 192.168.2.254 Real server #1: eth0 192.168.2.2 gateway / default route 192.168.2.254 Real server #2: eth0 192.168.2.3 gateway / gateway route 192.168.2.254 Once that's done, you need to get the real servers to process requests for 192.168.1.100. I wrote this some years ago, but here are two ways of getting it working: http://people.redhat.com/lhh/piranha-direct-routing-howto.txt Depending on how you do it, you will either place 192.168.1.100 as eth0:0 and do an arptables_jf setup, or you will not put 192.168.1.100 on *any* of the real servers and instead use an iptables hack to use a transparent proxy to rewrite outbound packets to be sourced from 192.168.1.100. Some people put the VIP on lo:0 - I have never done that nor could I tell you the advantages or disadvantages. 
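As a rough illustration of the first option (VIP as eth0:0 plus arptables_jf), the real-server side looks something like the lines below, using the example addresses from this thread (VIP 192.168.2.100, real server 192.168.2.2). Treat it as a sketch and check the flags against the arptables_jf man page before relying on it.

# suppress ARP replies for the VIP and rewrite the ARP source on the way out
arptables -A IN -d 192.168.2.100 -j DROP
arptables -A OUT -s 192.168.2.100 -j mangle --mangle-ip-s 192.168.2.2

# then bring the VIP up as an ordinary alias on the real server
ifconfig eth0:0 192.168.2.100 netmask 255.255.255.255 up

The lo:0 variant mentioned earlier in this digest (VIP on lo:0 plus the arp_ignore/arp_announce sysctls) is the other way to get the same effect.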
I've also never played with sysctl.conf settings. -- Lon From wcheng at redhat.com Tue Jan 8 04:17:19 2008 From: wcheng at redhat.com (Wendy Cheng) Date: Mon, 07 Jan 2008 23:17:19 -0500 Subject: [Linux-cluster] GFS tuning advice sought In-Reply-To: <2008178553.393310@leena> References: <2008178553.393310@leena> Message-ID: <4782F94F.4070302@redhat.com> isplist at logicore.net wrote: >>>Is there any GFS tuning I can do which might help speed up access to >>>these mailboxes? >>> >>> >>> >>You probably need GFS2 in this case. To fix mail server issues in GFS1 >>would be too intrusive with current state of development cycle. >> >> > >Wendy, > >I noticed you mention that GFS2 might be best for this. Would this apply for >web servers as well? I've been using GFS on RHEL4 for web server cluster >sharing. Would I be better to look at GFS2 for performance? > > > > Not sure about web servers though - I think it depends on access patterns. -- Wendy From gordan at bobich.net Tue Jan 8 07:23:25 2008 From: gordan at bobich.net (Gordan Bobic) Date: Tue, 08 Jan 2008 07:23:25 +0000 Subject: [Linux-cluster] GFS tuning advice sought In-Reply-To: <2008178553.393310@leena> References: <2008178553.393310@leena> Message-ID: <478324ED.2070403@bobich.net> isplist at logicore.net wrote: >>> Is there any GFS tuning I can do which might help speed up access to >>> these mailboxes? >>> >> You probably need GFS2 in this case. To fix mail server issues in GFS1 >> would be too intrusive with current state of development cycle. > > I noticed you mention that GFS2 might be best for this. Would this apply for > web servers as well? I've been using GFS on RHEL4 for web server cluster > sharing. Would I be better to look at GFS2 for performance? Web server disk I/O is likely to be mostly read-only, so I doubt disk performance will ever be your bottleneck. It's bouncing write-locks around that slows clustered file systems down. Gordan From lhh at redhat.com Tue Jan 8 14:36:00 2008 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 08 Jan 2008 09:36:00 -0500 Subject: [Linux-cluster] would this configuration work for lvs-dr? In-Reply-To: <1199747170.16312.32.camel@ayanami.boston.devel.redhat.com> References: <8108f4850801051451n1bf70ae6nb62187498cac7db8@mail.gmail.com> <1199747170.16312.32.camel@ayanami.boston.devel.redhat.com> Message-ID: <1199802960.16312.35.camel@ayanami.boston.devel.redhat.com> On Mon, 2008-01-07 at 18:06 -0500, Lon Hohberger wrote: > Once that's done, you need to get the real servers to process requests > for 192.168.1.100. typo ... 2.100, not 1.100. > Depending on how you do it, you will either place 192.168.1.100 as > eth0:0 and do an arptables_jf setup, or you will not put 192.168.1.100 > on *any* of the real servers and instead use an iptables hack to use a > transparent proxy to rewrite outbound packets to be sourced from > 192.168.1.100. Same here. -- Lon From Alain.Moulle at bull.net Tue Jan 8 14:39:30 2008 From: Alain.Moulle at bull.net (Alain Moulle) Date: Tue, 08 Jan 2008 15:39:30 +0100 Subject: [Linux-cluster] CS5 and dlm-kernel ? Message-ID: <47838B22.5000608@bull.net> Hi I thought there was always a dlm and dlm-kernel likewise in CS4, but it seems rpms don't exist anymore ? So no more kernel module at all with CS5 ? Alain Moull? 
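A quick way to check that on a RHEL5 box, assuming a stock kernel package, is to ask the kernel itself, since the dlm module now lives in-tree rather than in a separate dlm-kernel rpm:

# is the dlm module shipped with the running kernel?
modinfo dlm

# or look at the kernel config directly
grep CONFIG_DLM /boot/config-$(uname -r)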
From msebedio at invap.com.ar Tue Jan 8 15:58:22 2008 From: msebedio at invap.com.ar (Mariel Sebedio) Date: Tue, 08 Jan 2008 12:58:22 -0300 Subject: [Linux-cluster] NTP In-Reply-To: <9F22D67428CC144B93A198ECA35B85BF13B59B0097@PS-MAILBOX.Progressoft.com> References: <9F22D67428CC144B93A198ECA35B85BF13B59B0097@PS-MAILBOX.Progressoft.com> Message-ID: <47839D9E.3080709@invap.com.ar> Hello, I had the same problem and configuring this files with this settings... (Sorry mi English) The server A: Edit /etc/ntp.conf and setting this Step 1 - server XXXXX (IP o name in /etc/host from where take the date-Time) Step 2 - restrict XXXXX mask 255.255.255.255 nomodify notrap noquery Step 3 - restrict XXXXX mask 255.255.255.0 nomodify notrap (in this case restrict the net where is server B, dont put noquery because server B query for time a this server) In the file /etc/ntp/step-tickers mus be contain the IP or name in step 1 Start /etc/init.d/ntpd and in this case Sync whit the server that you defined in Step 1 In Server B Edit /etc/ntp.conf and setting this Step 1 - server XXXXX (IP o name in /etc/host SERVER A) Step 2 - restrict IP o name SERVER A mask 255.255.255.255 nomodify notrap noquery In the file /etc/ntp/step-tickers mus be contain the IP or name in step 1 (SERVER A) Start /etc/init.d/ntpd and in this case Sync whit the SERVER A Good luck!! Mariel Yazan Albakheit wrote: > Dear , > > > > Can you Help me in configuring the NTP between two nodes running > RHEL_AS_V4_U5 . > > > > I Have two Server (A,B) I want server B to take its time from > server A only. > > > > > > Thanks. > > > >------------------------------------------------------------------------ > >-- >Linux-cluster mailing list >Linux-cluster at redhat.com >https://www.redhat.com/mailman/listinfo/linux-cluster > -- Lic. Mariel Sebedio Division Computos y Sistemas INVAP S.E. - www.invap.com.ar From kanderso at redhat.com Tue Jan 8 15:09:18 2008 From: kanderso at redhat.com (Kevin Anderson) Date: Tue, 08 Jan 2008 09:09:18 -0600 Subject: [Linux-cluster] CS5 and dlm-kernel ? In-Reply-To: <47838B22.5000608@bull.net> References: <47838B22.5000608@bull.net> Message-ID: <1199804958.2750.8.camel@dhcp80-204.msp.redhat.com> On Tue, 2008-01-08 at 15:39 +0100, Alain Moulle wrote: > Hi > > I thought there was always a dlm and dlm-kernel likewise in CS4, > but it seems rpms don't exist anymore ? > So no more kernel module at all with CS5 ? > dlm kernel module is part of the upstream and base rhel5 kernel now, doesn't need a separate rpm as it is included as part of the kernel rpm. Kevin -------------- next part -------------- An HTML attachment was scrubbed... URL: From isplist at logicore.net Tue Jan 8 15:49:08 2008 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 8 Jan 2008 09:49:08 -0600 Subject: [Linux-cluster] GFS tuning advice sought In-Reply-To: <478324ED.2070403@bobich.net> Message-ID: <2008189498.774713@leena> > Web server disk I/O is likely to be mostly read-only, so I doubt disk > performance will ever be your bottleneck. It's bouncing write-locks > around that slows clustered file systems down. True and other than media, all writes are to the MySQL servers. Still, I wondered since the web servers are all sharing a GFS space for their pages. 
Mike From charlieb-linux-cluster at e-smith.com Tue Jan 8 20:51:36 2008 From: charlieb-linux-cluster at e-smith.com (Charlie Brady) Date: Tue, 8 Jan 2008 15:51:36 -0500 (EST) Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: Message-ID: On Fri, 4 Jan 2008, Charlie Brady wrote: > I'm helping a colleague to collect information on an application lockup > problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. > > I'd appreciate advice as to what information to collect next. Nobody have any advice? --- Charlie From kanderso at redhat.com Tue Jan 8 21:00:42 2008 From: kanderso at redhat.com (Kevin Anderson) Date: Tue, 08 Jan 2008 15:00:42 -0600 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: References: Message-ID: <1199826042.2774.13.camel@localhost.localdomain> On Tue, 2008-01-08 at 15:51 -0500, Charlie Brady wrote: > On Fri, 4 Jan 2008, Charlie Brady wrote: > > > I'm helping a colleague to collect information on an application lockup > > problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. > > Missed this on my first read. What do you mean by a shared scsi array? What hardware are you using for shared storage? Kevin -------------- next part -------------- An HTML attachment was scrubbed... URL: From gordan at bobich.net Tue Jan 8 21:08:37 2008 From: gordan at bobich.net (Gordan Bobic) Date: Tue, 08 Jan 2008 21:08:37 +0000 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: References: Message-ID: <4783E655.3080102@bobich.net> Charlie Brady wrote: > On Fri, 4 Jan 2008, Charlie Brady wrote: > >> I'm helping a colleague to collect information on an application lockup >> problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. >> >> I'd appreciate advice as to what information to collect next. > > Nobody have any advice? Shared SCSI as in iSCSI SAN or as in a shared SCSI bus with two machines connected via a SCSI cable? Gordan From snowhare at nihongo.org Tue Jan 8 21:31:58 2008 From: snowhare at nihongo.org (Benjamin Franz) Date: Tue, 8 Jan 2008 13:31:58 -0800 (PST) Subject: [Linux-cluster] fence_apc - Perl based CLI version Message-ID: As others have reported, the current fence_apc shipping with RHEL5.1/CentOS5.1 simply does not work reliably on newer APC firmwares. It breaks under all kinds of conditions (some as simple as 'works on some ports but not on other ports'). Since I *really* need it to work, I hacked together a Perl version (derived from the old fence_apc.pl in CVS) that uses the APC command line interface and dispenses with the 'menu scraping' interface entirely. I don't have any switches here that use a switchnum interface so I couldn't hack anything together for that. But it appears to reliably do what it is supposed to do (at least on my APC7900 switches running AOS 3.3.4): Fence. It would make a great deal of sense for someone to add it to the Luci/CMAN list of supported fences. Maybe "APC Power Device (CLI) / fence_apc_cli"? -- Benjamin Franz "It is moronic to predict without first establishing an error rate for a prediction and keeping track of one?s past record of accuracy." -- Nassim Nicholas Taleb, Fooled By Randomness -------------- next part -------------- #!/usr/bin/perl #########################################3 # CLI APC Fencing. This only works with APC AOS v2.7.0 or later # but it is a LOT simpler and more robust than the old menu scraping code. 
# use strict; use warnings; use Getopt::Std; use Net::Telnet (); # WARNING!! Do not add code bewteen "#BEGIN_VERSION_GENERATION" and # "#END_VERSION_GENERATION" It is generated by the Makefile my ($FENCE_RELEASE_NAME, $REDHAT_COPYRIGHT, $BUILD_DATE); #BEGIN_VERSION_GENERATION $FENCE_RELEASE_NAME=""; $REDHAT_COPYRIGHT=""; $BUILD_DATE=""; #END_VERSION_GENERATION ############################################################################### ############################################################################### ## ## Copyright (C) Sistina Software, Inc. 1997-2003 All rights reserved. ## Copyright (C) 2004-2006 Red Hat, Inc. All rights reserved. ## ## This copyrighted material is made available to anyone wishing to use, ## modify, copy, or redistribute it subject to the terms and conditions ## of the GNU General Public License v.2. ## ############################################################################### ############################################################################### # Get the program name from $0 and strip directory names my $Program_Name = $0; $Program_Name =~ s/.*\///; my $login_prompt = '/ : /'; my $command_prompt = '/APC> $/'; my $debug_log = '/tmp/apclog'; # Location of debugging log when in verbose mode my $telnet_timeout = 2; # Seconds to wait for matching telent response my $open_wait = 5; # Seconds to wait between each telnet attempt my $max_open_tries = 3; # How many telnet attempts to make. Because the # APC can fail repeated login attempts, this number # should be more than 1 my $reboot_duration = 30; # Number of seconds plugs are turned off during a reboot command my $power_off_delay = 0; # Number of seconds to wait before actually turning off a plug my $power_on_delay = 30; # Number of seconds to wait before actually turning on a plug our %Opts = ( 'o' => 'reboot', ); our $SwitchNum; our $Logged_In = 0; our $t = Net::Telnet->new; # Our telnet object instance ### START MAIN ####################################################### if (@ARGV > 0) { getopts("a:hl:n:o:p:qTvV", \%Opts) || fail_usage(); usage() if defined $Opts{'h'}; version() if defined $Opts{'V'}; fail_usage("Unknown parameter.") if (@ARGV > 0); fail_usage("No '-a' flag specified.") unless defined $Opts{'a'}; fail_usage("No '-n' flag specified.") unless defined $Opts{'n'}; fail_usage("No '-l' flag specified.") unless defined $Opts{'l'}; fail_usage("No '-p' flag specified.") unless defined $Opts{'p'}; fail_usage("Unrecognised action '$Opts{'o'}' for '-o' flag") unless $Opts{'o'} =~ /^(Off|On|Reboot)$/i; if ( $Opts{'n'} =~ /(\d+):(\d+)/ ) { $SwitchNum = $1; $Opts{'n'} = $2; } } else { get_options_stdin(); fail("failed: no IP address") unless defined $Opts{'a'}; fail("failed: no plug number") unless defined $Opts{'n'}; fail("failed: no login name") unless defined $Opts{'l'}; fail("failed: no password") unless defined $Opts{'p'}; fail("failed: unrecognised action: $Opts{'o'}") unless $Opts{'o'} =~ /^(Off|On|Reboot)$/i; } my $option = lc($Opts{'o'}); my $plug = $Opts{'n'}; $t->prompt($command_prompt); $t->timeout($telnet_timeout); $t->input_log($debug_log) if $Opts{'v'}; $t->errmode('return'); login(); $t->errmode(\&telnet_error); my $cmd_results = ''; my $ok; if ($option eq 'reboot') { $t->cmd( String => "rebootduration $plug:$reboot_duration", Output => \$cmd_results ); } if ($option eq 'off') { $t->cmd( String => "poweroffdelay $plug:$power_off_delay", Output => \$cmd_results ); } if ($option eq 'on') { $t->cmd( String => "powerondelay $plug:$power_on_delay", Output => \$cmd_results ); 
} $ok = $t->cmd( String => "$option $plug", Output => \$cmd_results ); #print $cmd_results; logout(); exit 0; ### END MAIN ####################################################### sub usage { print <<"EOT"; Usage: $Program_Name [options] Options: -a IP address or hostname of MasterSwitch -h usage -l Login name -n Outlet number to change: [:] -o Action: Reboot (default), Off or On -p Login password -q quiet mode -T Test mode (cancels action) -V version -v Log to file /tmp/apclog EOT exit 0; } sub fail { my ($msg)=@_; print $msg."\n" unless defined $Opts{'q'}; if (defined $t) { # make sure we don't get stuck in a loop due to errors $t->errmode('return'); if ($Logged_In) { logout(); } $t->close(); } exit 1; } sub fail_usage { my ($msg)=@_; print STDERR $msg."\n" if $msg; print STDERR "Please use '-h' for usage.\n"; exit 1; } sub version { print "$Program_Name $FENCE_RELEASE_NAME $BUILD_DATE\n"; print "$REDHAT_COPYRIGHT\n" if ( $REDHAT_COPYRIGHT ); exit 0; } sub login { for (my $i=0; $i<$max_open_tries; $i++) { $t->open($Opts{'a'}); my ($prompt) = $t->waitfor($login_prompt); # Expect 'User Name : ' if ((not defined $prompt) || ($prompt !~ /name/i)) { $t->close(); sleep($open_wait); next; } $t->print($Opts{'l'}); ($prompt) = $t->waitfor($login_prompt); # Expect 'Password : ' if ((not defined $prompt) || ($prompt !~ /password/i )) { $t->close(); sleep($open_wait); next; } # Send password $t->print("$Opts{'p'} -c"); # The appended ' -c' activates the CLI interface my ($dummy, $login_result) = $t->waitfor('/(APC>|(?i:user name|password)\s*:) /'); if ($login_result =~ m/APC> /) { $Logged_In = 1; # send newline to flush prompt $t->print(""); return; } else { fail("invalid username or password ($login_result)"); } } fail("failed: telnet failed: " . $t->errmsg."\n"); } sub logout { $t->cmd("logout"); return; } sub get_options_stdin { my $opt; my $line = 0; while( defined($opt = <>) ) { chomp $opt; # strip leading and trailing whitespace $opt =~ s/^\s*//; $opt =~ s/\s*$//; # skip comments next if ($opt =~ m/^#/); $line += 1; next if ($opt eq ''); my ($name, $val) = split(/\s*=\s*/, $opt); if ( $name eq "" ) { print STDERR "parse error: illegal name in option $line\n"; exit 2; } elsif ($name eq "agent" ) { } # DO NOTHING -- this field is used by fenced elsif ($name eq "ipaddr" ) { $Opts{'a'} = $val; } elsif ($name eq "login" ) { $Opts{'l'} = $val; } elsif ($name eq "option" ) { $Opts{'o'} = $val; } elsif ($name eq "passwd" ) { $Opts{'p'} = $val; } elsif ($name eq "port" ) { $Opts{'n'} = $val; } elsif ($name eq "switch" ) { $SwitchNum = $val; } elsif ($name eq "test" ) { $Opts{'T'} = $val; } elsif ($name eq "verbose" ) { $Opts{'v'} = $val; } } } sub telnet_error { if ($t->errmsg ne "pattern match timed-out") { fail("failed: telnet returned: " . $t->errmsg . "\n"); } else { $t->print(""); } } From swplotner at amherst.edu Tue Jan 8 21:44:05 2008 From: swplotner at amherst.edu (Steffen Plotner) Date: Tue, 8 Jan 2008 16:44:05 -0500 Subject: [Linux-cluster] fence_apc - Perl based CLI version In-Reply-To: Message-ID: <150F55E3591CD042B77ED3DB957854652B610E@mail7.amherst.edu> Hi, I currently use a different method of fencing in our clusters (using iscsi ietd and iptables currently), however we have the APC7900 PDUs and do control them using SNMP: Use SNMP set commands to turn off and on the ports: snmpset -c PowerNet-MIB::rPDUOutletControlOutletCommand. i 2 The 'i' refers to an integer and the digit 2 means to power off the port. That has worked very reliably. 
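Spelled out a little further, the three operations look roughly like this. The community string, PDU hostname and outlet number are placeholders, the command values (1 on, 2 off, 3 reboot) are from the PowerNet MIB as I remember it, and the symbolic name assumes the MIB file is installed where net-snmp can find it, so verify against your firmware first.

# immediateOff outlet 3
snmpset -v1 -c private pdu1 PowerNet-MIB::rPDUOutletControlOutletCommand.3 i 2

# immediateOn outlet 3
snmpset -v1 -c private pdu1 PowerNet-MIB::rPDUOutletControlOutletCommand.3 i 1

# immediateReboot outlet 3
snmpset -v1 -c private pdu1 PowerNet-MIB::rPDUOutletControlOutletCommand.3 i 3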
Steffen > -----Original Message----- > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Benjamin Franz > Sent: Tuesday, January 08, 2008 4:32 PM > To: linux clustering > Subject: [Linux-cluster] fence_apc - Perl based CLI version > > As others have reported, the current fence_apc shipping with > RHEL5.1/CentOS5.1 simply does not work reliably on newer APC > firmwares. It breaks under all kinds of conditions (some as > simple as 'works on some ports but not on other ports'). > > Since I *really* need it to work, I hacked together a Perl > version (derived from the old fence_apc.pl in CVS) that uses > the APC command line interface and dispenses with the 'menu > scraping' interface entirely. > > I don't have any switches here that use a switchnum interface > so I couldn't hack anything together for that. But it appears > to reliably do what it is supposed to do (at least on my > APC7900 switches running AOS > 3.3.4): Fence. > > It would make a great deal of sense for someone to add it to > the Luci/CMAN list of supported fences. Maybe "APC Power > Device (CLI) / fence_apc_cli"? > > -- > Benjamin Franz > > "It is moronic to predict without first establishing an error rate > for a prediction and keeping track of one's past record of > accuracy." > -- Nassim Nicholas Taleb, Fooled By Randomness > From teigland at redhat.com Tue Jan 8 22:56:09 2008 From: teigland at redhat.com (David Teigland) Date: Tue, 8 Jan 2008 16:56:09 -0600 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: References: Message-ID: <20080108225609.GB27979@redhat.com> On Fri, Jan 04, 2008 at 04:18:45PM -0500, Charlie Brady wrote: > We've reduced the application code to a simple test case. The following > code run on each node will soon block, and doesn't receive signals until > the peer node is shutdown: > > ... > fl.l_whence=SEEK_SET; > fl.l_start=0; > fl.l_len=1; > > while (1) > { > fl.l_type=F_WRLCK; > retval=fcntl(filedes,F_SETLKW,&fl); > if (retval==-1) > { > perror("lock"); > exit(1); > } > // attempt to unlock the index file > fl.l_type=F_UNLCK; > retval=fcntl(filedes,F_SETLKW,&fl); > if (retval==-1) > { > perror("unlock"); > exit(1); > } > } Yes, this stresses a problematic design limitation in the RHEL4 dlm where the dlm master node is ping-ponging all over the place and becomes so unstable that everything comes to a halt. One possible work-around is to modify the program to hold a lock on filedes to keep the master stable, e.g. hold a zero length lock at some unused offset like 0xFFFFFF. Dave From charlieb-linux-cluster at e-smith.com Wed Jan 9 03:39:51 2008 From: charlieb-linux-cluster at e-smith.com (Charlie Brady) Date: Tue, 8 Jan 2008 22:39:51 -0500 (EST) Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: <4783E655.3080102@bobich.net> Message-ID: On Tue, 8 Jan 2008, Gordan Bobic wrote: > Charlie Brady wrote: > > On Fri, 4 Jan 2008, Charlie Brady wrote: > > > >> I'm helping a colleague to collect information on an application lockup > >> problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. > >> > >> I'd appreciate advice as to what information to collect next. > > > > Nobody have any advice? > > Shared SCSI as in iSCSI SAN or as in a shared SCSI bus with two machines > connected via a SCSI cable? The latter. I don't have the details immediately at hand, but it's all HP gear. A pair of DL380s with an external SCSI array (MSAxx), IIRC. 
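Coming back to the question of what to collect while the two processes are stuck, the usual things on a RHEL4 GFS/DLM cluster are the glock dump, the dlm lock dump and kernel stacks. The /proc/cluster paths and the lockspace name below are from memory and are assumptions, so check they exist on your kernel before scripting around them.

# GFS side: dump glock state for the stuck mount (mount point is a placeholder)
gfs_tool lockdump /mnt/gfs > /tmp/glocks.$(hostname)

# DLM side: find the lockspace name, then dump its locks
cat /proc/cluster/services
echo "mylockspace" > /proc/cluster/dlm_locks
cat /proc/cluster/dlm_locks > /tmp/dlmlocks.$(hostname)

# kernel stacks of all tasks, including the blocked fcntl callers (output goes to dmesg/syslog)
echo t > /proc/sysrq-trigger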
--- Charlie From charlieb-linux-cluster at e-smith.com Wed Jan 9 03:43:16 2008 From: charlieb-linux-cluster at e-smith.com (Charlie Brady) Date: Tue, 8 Jan 2008 22:43:16 -0500 (EST) Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: <20080108225609.GB27979@redhat.com> Message-ID: On Tue, 8 Jan 2008, David Teigland wrote: > On Fri, Jan 04, 2008 at 04:18:45PM -0500, Charlie Brady wrote: > > We've reduced the application code to a simple test case. The following > > code run on each node will soon block, and doesn't receive signals until > > the peer node is shutdown: ... > Yes, this stresses a problematic design limitation in the RHEL4 dlm where > the dlm master node is ping-ponging all over the place and becomes so > unstable that everything comes to a halt. One possible work-around is to > modify the program to hold a lock on filedes to keep the master stable, > e.g. hold a zero length lock at some unused offset like 0xFFFFFF. Thanks. I've passed the advice on. -- Charlie From gnobal at gmail.com Wed Jan 9 08:51:52 2008 From: gnobal at gmail.com (Amit Schreiber) Date: Wed, 9 Jan 2008 10:51:52 +0200 Subject: [Linux-cluster] fence_apc - Perl based CLI version In-Reply-To: <150F55E3591CD042B77ED3DB957854652B610E@mail7.amherst.edu> References: <150F55E3591CD042B77ED3DB957854652B610E@mail7.amherst.edu> Message-ID: Hi, There's a fence_apc_snmp.py script available in the cluster code repository: http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/fence/agents/apc/?cvsroot=cluster I tested it a little (replaced /sbin/fence_apc with it - they both have the same CLI parameters) and it seems to work where the fence_apc script shipped with RHEL5 fails. Amit On Jan 8, 2008 11:44 PM, Steffen Plotner wrote: > Hi, > > I currently use a different method of fencing in our clusters (using > iscsi ietd and iptables currently), however we have the APC7900 PDUs and > do control them using SNMP: > > Use SNMP set commands to turn off and on the ports: > > snmpset -c > PowerNet-MIB::rPDUOutletControlOutletCommand. i 2 > > The 'i' refers to an integer and the digit 2 means to power off the > port. > > That has worked very reliably. > > Steffen > > > > > -----Original Message----- > > From: linux-cluster-bounces at redhat.com > > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Benjamin Franz > > Sent: Tuesday, January 08, 2008 4:32 PM > > To: linux clustering > > Subject: [Linux-cluster] fence_apc - Perl based CLI version > > > > As others have reported, the current fence_apc shipping with > > RHEL5.1/CentOS5.1 simply does not work reliably on newer APC > > firmwares. It breaks under all kinds of conditions (some as > > simple as 'works on some ports but not on other ports'). > > > > Since I *really* need it to work, I hacked together a Perl > > version (derived from the old fence_apc.pl in CVS) that uses > > the APC command line interface and dispenses with the 'menu > > scraping' interface entirely. > > > > I don't have any switches here that use a switchnum interface > > so I couldn't hack anything together for that. But it appears > > to reliably do what it is supposed to do (at least on my > > APC7900 switches running AOS > > 3.3.4): Fence. > > > > It would make a great deal of sense for someone to add it to > > the Luci/CMAN list of supported fences. Maybe "APC Power > > Device (CLI) / fence_apc_cli"? 
> > > > -- > > Benjamin Franz > > > > "It is moronic to predict without first establishing an error rate > > for a prediction and keeping track of one's past record of > > accuracy." > > -- Nassim Nicholas Taleb, Fooled By Randomness > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From Alain.Moulle at bull.net Wed Jan 9 14:04:03 2008 From: Alain.Moulle at bull.net (Alain Moulle) Date: Wed, 09 Jan 2008 15:04:03 +0100 Subject: [Linux-cluster] CS5 : clurgmgrd[28359]: segfault Message-ID: <4784D453.6050005@bull.net> Hi Testing the CS5 on a two-nodes cluster with quorum disk, when I did the test ifdown on the heart-beat interface, I got a segfault in log : Jan 9 09:45:16 s_sys at am1 avahi-daemon[3106]: Interface eth0.IPv6 no longer relevant for mDNS. Jan 9 09:45:18 s_sys at am1 qdiskd[28265]: Heuristic: 'ping -t1 -c1 172.19.1.99' missed (1/3) Jan 9 09:45:25 s_sys at am1 openais[28300]: [TOTEM] The token was lost in the OPERATIONAL state. Jan 9 09:45:25 s_sys at am1 openais[28300]: [TOTEM] Receive multicast socket recv buffer size (288000 bytes). Jan 9 09:45:25 s_sys at am1 openais[28300]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes). Jan 9 09:45:25 s_sys at am1 openais[28300]: [TOTEM] The network interface is down. Jan 9 09:45:25 s_sys at am1 openais[28300]: [TOTEM] entering GATHER state from 15. Jan 9 09:45:25 s_sys at am1 openais[28300]: [TOTEM] entering GATHER state from 2. Jan 9 09:45:28 s_sys at am1 qdiskd[28265]: Heuristic: 'ping -t1 -c1 172.19.1.99' missed (2/3) Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] entering GATHER state from 0. Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] Creating commit token because I am the rep. Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] Saving state aru 5c high seq received 5c Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] Storing new sequence id for ring 12c Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] entering COMMIT state. Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] entering RECOVERY state. Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] position [0] member 127.0.0.1: Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] previous ring seq 296 rep 172.19.1.78 Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] aru 5c high delivered 5c received flag 1 Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] Did not need to originate any messages in recovery. Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] Sending initial ORF token Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] CLM CONFIGURATION CHANGE Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] New Configuration: Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] r(0) ip(127.0.0.1) Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] Members Left: Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] r(0) ip(172.19.1.79) Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] Members Joined: Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] CLM CONFIGURATION CHANGE Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] New Configuration: Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] r(0) ip(127.0.0.1) Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] Members Left: Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] Members Joined: Jan 9 09:45:30 s_sys at am1 openais[28300]: [SYNC ] This node is within the primary component and will provide service. Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] entering OPERATIONAL state. 
Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] got nodejoin message 172.16.101.91 Jan 9 09:45:30 s_sys at am1 openais[28300]: [EVT ] recovery error node: r(0) ip(127.0.0.1) not found Jan 9 09:45:30 s_kernel at am1 kernel: clurgmgrd[28359]: segfault at 0000000000000000 rip 0000000000408c4a rsp 00007fff04a2c450 error 4 Jan 9 09:45:30 s_sys at am1 gfs_controld[28328]: cluster is down, exiting Jan 9 09:45:30 s_kernel at am1 kernel: dlm: closing connection to node 2 Jan 9 09:45:30 s_kernel at am1 kernel: dlm: closing connection to node 0 Jan 9 09:45:30 s_kernel at am1 kernel: dlm: closing connection to node 1 Jan 9 09:45:30 s_sys at am1 dlm_controld[28322]: cluster is down, exiting Jan 9 09:45:30 s_sys at am1 fenced[28316]: cman_get_nodes error -1 104 Jan 9 09:45:30 s_sys at am1 fenced[28316]: cluster is down, exiting Jan 9 09:45:30 s_sys at am1 clurgmgrd[28358]: Watchdog: Daemon died, rebooting... Jan 9 09:45:30 s_sys at am1 shutdown[18377]: shutting down for system halt Is-it already a known problem ? Thanks Regards Alain Moull? From Alexandre.Racine at mhicc.org Wed Jan 9 16:23:41 2008 From: Alexandre.Racine at mhicc.org (Alexandre Racine) Date: Wed, 9 Jan 2008 11:23:41 -0500 Subject: [Linux-cluster] scsi reservation References: <4784D453.6050005@bull.net> Message-ID: Hi all, I am currently using version 1.0.4 of GFS and the scsi reservation binairies (scsi_reserve, fence_scsi, etc) are not there. Is it suppose to be like this or this is the distro I a using playing games with me (not my choice! It's Gentoo). If it's normal that they are not there, is there a reason for this? Does it work well? Because it's still here : http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/fence/agents/scsi/?cvsroot=cluster Thanks. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2541 bytes Desc: not available URL: From Abdel.Sadek at lsi.com Wed Jan 9 16:45:10 2008 From: Abdel.Sadek at lsi.com (Sadek, Abdel) Date: Wed, 9 Jan 2008 09:45:10 -0700 Subject: [Linux-cluster] RE: scsi reservation In-Reply-To: Message-ID: I believe you may not have the sg3_utils packages installed. I'll first check for that. Thanks. Abdel.. -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Alexandre Racine Sent: Wednesday, January 09, 2008 10:24 AM To: linux clustering Subject: scsi reservation Hi all, I am currently using version 1.0.4 of GFS and the scsi reservation binairies (scsi_reserve, fence_scsi, etc) are not there. Is it suppose to be like this or this is the distro I a using playing games with me (not my choice! It's Gentoo). If it's normal that they are not there, is there a reason for this? Does it work well? Because it's still here : http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/fence/agents/scsi/? cvsroot=cluster Thanks. From Alexandre.Racine at mhicc.org Wed Jan 9 17:28:41 2008 From: Alexandre.Racine at mhicc.org (Alexandre Racine) Date: Wed, 9 Jan 2008 12:28:41 -0500 Subject: [Linux-cluster] RE: scsi reservation References: Message-ID: You are right, that package was not installed. So now I installed the package, and recompiled "fence", but "fence_scsi" is still not there in /sbin/ Any more idea? (Thanks for the first hint). 
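Independent of why the build keeps skipping those agents, it is worth confirming that the shared storage honours SCSI-3 persistent reservations at all, since that is what fence_scsi ultimately drives. A rough sanity check with sg3_utils, where the device name is a placeholder for the shared LUN:

# list registered reservation keys on the shared LUN
sg_persist --in --read-keys --device=/dev/sdb

# show the current reservation, if any
sg_persist --in --read-reservation --device=/dev/sdb

If the array reports that persistent reservations are not supported, fence_scsi will not help even once it is built and installed.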
Alexandre Racine Projets sp?ciaux 514-461-1300 poste 3304 alexandre.racine at mhicc.org -----Original Message----- From: linux-cluster-bounces at redhat.com on behalf of Sadek, Abdel Sent: Wed 2008-01-09 11:45 To: linux clustering Subject: [Linux-cluster] RE: scsi reservation I believe you may not have the sg3_utils packages installed. I'll first check for that. Thanks. Abdel.. -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Alexandre Racine Sent: Wednesday, January 09, 2008 10:24 AM To: linux clustering Subject: scsi reservation Hi all, I am currently using version 1.0.4 of GFS and the scsi reservation binairies (scsi_reserve, fence_scsi, etc) are not there. Is it suppose to be like this or this is the distro I a using playing games with me (not my choice! It's Gentoo). If it's normal that they are not there, is there a reason for this? Does it work well? Because it's still here : http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/fence/agents/scsi/? cvsroot=cluster Thanks. -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3241 bytes Desc: not available URL: From kanderso at redhat.com Wed Jan 9 19:53:17 2008 From: kanderso at redhat.com (Kevin Anderson) Date: Wed, 09 Jan 2008 13:53:17 -0600 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: References: Message-ID: <1199908397.3277.44.camel@dhcp80-204.msp.redhat.com> On Tue, 2008-01-08 at 22:39 -0500, Charlie Brady wrote: > On Tue, 8 Jan 2008, Gordan Bobic wrote: > > > Charlie Brady wrote: > > > On Fri, 4 Jan 2008, Charlie Brady wrote: > > > > > >> I'm helping a colleague to collect information on an application lockup > > >> problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. > > >> > > >> I'd appreciate advice as to what information to collect next. > > > > > > Nobody have any advice? > > > > Shared SCSI as in iSCSI SAN or as in a shared SCSI bus with two machines > > connected via a SCSI cable? > > The latter. I don't have the details immediately at hand, but it's all HP > gear. A pair of DL380s with an external SCSI array (MSAxx), IIRC. > If it is a MSA20, MSA30 or MSA500 - they won't work with GFS. Shared SCSI bus isn't really shared, accesses lock the bus such that when one node accesses the storage the other node is locked out. GFS requires the ability to do shared concurrent access to the storage devices. This probably explains the hangs you were seeing. So, either get an iSCSI or fibre channel storage array, or go strictly with a failover storage architecture, such that only one node has the filesystem mounted at any one time. In that case, you don't need gfs anymore, just cluster suite to manage the failover. Kevin -------------- next part -------------- An HTML attachment was scrubbed... URL: From comaniliut at yahoo.com Wed Jan 9 20:56:06 2008 From: comaniliut at yahoo.com (Coman ILIUT) Date: Wed, 9 Jan 2008 12:56:06 -0800 (PST) Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) Message-ID: <31596.3345.qm@web51308.mail.re2.yahoo.com> We're using an MSA500 actually, so what you're saying is that we're not using the proper hardware for GFS. Can you tell us how bad is this? 
The reason I'm asking is because we are already at the second version of our product using this solution and we did not have any issues before. So we never considered the hardware to be an issue. When we picked this solution, HP presented MSA500 as being able to do concurrent access to files (of course there's some serialization inside, there's only one set of reading heads in the hard disk). Also, HP DL360 have the ILO interface, which is supported by GFS. The difference now is that we are using file locking heavily and we're using files in multi-access mode. Everything seems to work fine, except for the locking. Coman Kevin Anderson wrote: On Tue, 2008-01-08 at 22:39 -0500, Charlie Brady wrote: On Tue, 8 Jan 2008, Gordan Bobic wrote: > Charlie Brady wrote: > > On Fri, 4 Jan 2008, Charlie Brady wrote: > > > >> I'm helping a colleague to collect information on an application lockup > >> problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. > >> > >> I'd appreciate advice as to what information to collect next. > > > > Nobody have any advice? > > Shared SCSI as in iSCSI SAN or as in a shared SCSI bus with two machines > connected via a SCSI cable? The latter. I don't have the details immediately at hand, but it's all HP gear. A pair of DL380s with an external SCSI array (MSAxx), IIRC. If it is a MSA20, MSA30 or MSA500 - they won't work with GFS. Shared SCSI bus isn't really shared, accesses lock the bus such that when one node accesses the storage the other node is locked out. GFS requires the ability to do shared concurrent access to the storage devices. This probably explains the hangs you were seeing. So, either get an iSCSI or fibre channel storage array, or go strictly with a failover storage architecture, such that only one node has the filesystem mounted at any one time. In that case, you don't need gfs anymore, just cluster suite to manage the failover. Kevin -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster Looking for a X-Mas gift? Everybody needs a Flickr Pro Account. http://www.flickr.com/gift/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From kanderso at redhat.com Wed Jan 9 21:47:21 2008 From: kanderso at redhat.com (Kevin Anderson) Date: Wed, 09 Jan 2008 15:47:21 -0600 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: <31596.3345.qm@web51308.mail.re2.yahoo.com> References: <31596.3345.qm@web51308.mail.re2.yahoo.com> Message-ID: <1199915241.3277.46.camel@dhcp80-204.msp.redhat.com> Sorry, Lon gave me updated info about the MSA500. It isn't a parallel shared scsi bus configuration, so might work with gfs. However, we have never run with it before and not sure about the performance characteristics. Kevin On Wed, 2008-01-09 at 12:56 -0800, Coman ILIUT wrote: > We're using an MSA500 actually, so what you're saying is that we're > not using the proper hardware for GFS. > Can you tell us how bad is this? The reason I'm asking is because we > are already at the second version of our product using this solution > and we did not have any issues before. So we never considered the > hardware to be an issue. > > When we picked this solution, HP presented MSA500 as being able to do > concurrent access to files (of course there's some serialization > inside, there's only one set of reading heads in the hard disk). Also, > HP DL360 have the ILO interface, which is supported by GFS. 
> > The difference now is that we are using file locking heavily and we're > using files in multi-access mode. Everything seems to work fine, > except for the locking. > > Coman > > Kevin Anderson wrote: > On Tue, 2008-01-08 at 22:39 -0500, Charlie Brady wrote: > > On Tue, 8 Jan 2008, Gordan Bobic wrote: > Charlie Brady wrote: > > On Fri, 4 Jan 2008, Charlie Brady wrote: > > > >> I'm helping a colleague to collect information on an application lockup > >> problem on a two-node DLM/GFS cluster, with GFS on a shared SCSI array. > >> > >> I'd appreciate advice as to what information to collect next. > > > > > > Nobody have any advice? > > Shared SCSI as in iSCSI SAN or as in a shared SCSI bus with two machines > connected via a SCSI cable? The latter. I don't have the details immediately at hand, but it's all HP gear. A pair of DL380s with an external SCSI array (MSAxx), IIRC. > If it is a MSA20, MSA30 or MSA500 - they won't work with GFS. > Shared SCSI bus isn't really shared, accesses lock the bus > such that when one node accesses the storage the other node is > locked out. GFS requires the ability to do shared concurrent > access to the storage devices. This probably explains the > hangs you were seeing. So, either get an iSCSI or fibre > channel storage array, or go strictly with a failover storage > architecture, such that only one node has the filesystem > mounted at any one time. In that case, you don't need gfs > anymore, just cluster suite to manage the failover. > > Kevin > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > > > > ______________________________________________________________________ > Ask a question on any topic and get answers from real people. Go to > Yahoo! Answers. > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From lhh at redhat.com Wed Jan 9 21:54:09 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 09 Jan 2008 16:54:09 -0500 Subject: [Linux-cluster] CS5 : clurgmgrd[28359]: segfault In-Reply-To: <4784D453.6050005@bull.net> References: <4784D453.6050005@bull.net> Message-ID: <1199915649.16312.149.camel@ayanami.boston.devel.redhat.com> On Wed, 2008-01-09 at 15:04 +0100, Alain Moulle wrote: > Hi > > Testing the CS5 on a two-nodes cluster with quorum disk, when I did > the test ifdown on the heart-beat interface, I got a segfault in log : > Jan 9 09:45:30 s_sys at am1 openais[28300]: [TOTEM] entering OPERATIONAL state. 
> Jan 9 09:45:30 s_sys at am1 openais[28300]: [CLM ] got nodejoin message 172.16.101.91 > Jan 9 09:45:30 s_sys at am1 openais[28300]: [EVT ] recovery error node: r(0) > ip(127.0.0.1) not found > Jan 9 09:45:30 s_kernel at am1 kernel: clurgmgrd[28359]: segfault at > 0000000000000000 rip 0000000000408c4a rsp 00007fff04a2c450 error 4 > Jan 9 09:45:30 s_sys at am1 gfs_controld[28328]: cluster is down, exiting > Jan 9 09:45:30 s_kernel at am1 kernel: dlm: closing connection to node 2 > Jan 9 09:45:30 s_kernel at am1 kernel: dlm: closing connection to node 0 > Jan 9 09:45:30 s_kernel at am1 kernel: dlm: closing connection to node 1 > Jan 9 09:45:30 s_sys at am1 dlm_controld[28322]: cluster is down, exiting > Jan 9 09:45:30 s_sys at am1 fenced[28316]: cman_get_nodes error -1 104 > Jan 9 09:45:30 s_sys at am1 fenced[28316]: cluster is down, exiting > Jan 9 09:45:30 s_sys at am1 clurgmgrd[28358]: Watchdog: Daemon died, > rebooting... > Jan 9 09:45:30 s_sys at am1 shutdown[18377]: shutting down for system halt > > Is it already a known problem? openais died, causing the dlm to go away and rgmanager to crash - the "nanny" clurgmgrd process rebooted the node. Although the segfault is probably less than ideal, the nanny process killing the node is probably fine since the node needs to be fenced at this point anyway. What should have happened with rgmanager is: * it should have seen a negative quorum transition, * halted cluster services uncleanly, and * waited to be fenced. -- Lon From lhh at redhat.com Wed Jan 9 21:59:46 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 09 Jan 2008 16:59:46 -0500 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: <1199915241.3277.46.camel@dhcp80-204.msp.redhat.com> References: <31596.3345.qm@web51308.mail.re2.yahoo.com> <1199915241.3277.46.camel@dhcp80-204.msp.redhat.com> Message-ID: <1199915986.16312.155.camel@ayanami.boston.devel.redhat.com> On Wed, 2008-01-09 at 15:47 -0600, Kevin Anderson wrote: > Sorry, Lon gave me updated info about the MSA500. It isn't a parallel > shared scsi bus configuration, so might work with gfs. However, we > have never run with it before and not sure about the performance > characteristics. It's a multi-port SCSI RAID array, but it's not a multi-initiator parallel SCSI bus (which absolutely does not work with GFS: ex: Dell PowerVault 220S). The MSA500 has an on-box RAID controller with multiple SCSI ports, which are attached using SCSI cables to CCISS controllers in the host machines. While CCISS are host-RAID controllers, as I understand it, when talking to MSA500 arrays, they act just like dumb SCSI controllers (that are 10x the cost of regular dumb SCSI controllers, of course!) - and do nothing "intelligent" at all - leaving it up to the MSA controller to handle all the RAID operations. Also, if I'm not mistaken, each port on the MSA RAID controller is actually its own SCSI (well, cciss) bus, so you shouldn't hit typical SCSI bus problems. For example, you should not see bus resets during a reboot of one of the nodes.
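Back on the fcntl hang itself: before concluding the array is at fault, it can help to capture the lock state on both nodes while the application is stuck. A rough sketch of that, assuming a GFS mount point of /mnt/gfs (a placeholder) on the RHEL4-era GFS 6.1 / dlm 1.07 stack used in this thread; the /proc path below is specific to that stack and may not exist on other releases:

  # Dump GFS glock state for the stuck filesystem (run on each node)
  gfs_tool lockdump /mnt/gfs > /tmp/lockdump.$(hostname)

  # Per-filesystem lock counters; see whether they still change while the app hangs
  gfs_tool counters /mnt/gfs

  # Recent DLM debug messages, if the RHEL4 /proc/cluster interface is present
  cat /proc/cluster/dlm_debug

Comparing the lockdumps from the two nodes is usually enough to see which node holds the contested lock and whether the DLM is still making progress.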
-- Lon From lhh at redhat.com Wed Jan 9 22:09:40 2008 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 09 Jan 2008 17:09:40 -0500 Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: <1199915986.16312.155.camel@ayanami.boston.devel.redhat.com> References: <31596.3345.qm@web51308.mail.re2.yahoo.com> <1199915241.3277.46.camel@dhcp80-204.msp.redhat.com> <1199915986.16312.155.camel@ayanami.boston.devel.redhat.com> Message-ID: <1199916580.16312.164.camel@ayanami.boston.devel.redhat.com> On Wed, 2008-01-09 at 16:59 -0500, Lon Hohberger wrote: > On Wed, 2008-01-09 at 15:47 -0600, Kevin Anderson wrote: > > Sorry, Lon gave me updated info about the MSA500. It isn't a parallel > > shared scsi bus configuration, so might work with gfs. However, we > > have never run with it before and not sure about the performance > > characteristics. > > It's a multi-port SCSI RAID array, but it's not a multi-initiator > parallel SCSI bus (which absolutely does not work with GFS: ex: Dell > PowerVault 220S). > > The MSA500 has an on-box RAID controller with multiple SCSI ports, which > are attached using SCSI cables to CCISS controllers in the host > machines. While CCISS are host-RAID controllers, as I understand it, > when talking to MSA500 arrays, they act just like dumb SCSI controllers > (that are 10x the cost of regular dumb SCSI controllers, of course!) - > and do nothing "intelligent" at all - leaving it up to the MSA > controller to handle all the RAID operations. Hmm, I have an archaic G1 -- apparently the MSA500G2 comes with two plain-Jane SCSI controllers: http://h18004.www1.hp.com/storage/disk_storage/msa_diskarrays/san_arrays/msa500g2/index.html "Two Ultra320 SCSI adapters included in MSA500 G2 package - no additional HBA purchase necessary" More important, however, is: http://h18004.www1.hp.com/storage/disk_storage/msa_diskarrays/san_arrays/index.html (Particularly - Note the "Dual Ultra320 SCSI Channels") Also in the FAQ: http://h18004.www1.hp.com/storage/disk_storage/msa_diskarrays/san_arrays/msa500g2/qa.html#9 "There is a market that is not currently being addressed by current SCSI JBOD or SAN products. There are limited products that offer such high availability features such as failover controllers on the storage enclosure and battery backed cache in an entry level product. The Modular Smart Array 500 G2 addresses this market head-on offering the entry level clustering and shared storage at an affordable price." So, in *theory* - the MSA500G2 should work, but as Kevin said, we have not tested it with GFS. -- Lon From jamesc at exa.com Wed Jan 9 22:16:11 2008 From: jamesc at exa.com (James Chamberlain) Date: Wed, 9 Jan 2008 17:16:11 -0500 (EST) Subject: [Linux-cluster] Instability troubles In-Reply-To: <1199374720.9564.20.camel@ayanami.boston.devel.redhat.com> References: <1199374720.9564.20.camel@ayanami.boston.devel.redhat.com> Message-ID: On Thu, 3 Jan 2008, Lon Hohberger wrote: > On Wed, 2008-01-02 at 17:35 -0500, James Chamberlain wrote: >> Hi all, >> >> I'm having some major stability problems with my three-node CS/GFS cluster. >> Every two or three days, one of the nodes fences another, and I have to >> hard-reboot the entire cluster to recover. I have had this happen twice >> today. I don't know what's triggering the fencing, since all the nodes >> appear to me to be up and running when it happens. In fact, I was logged >> on to node3 just now, running 'top', when node2 fenced it. 
>> >> When they come up, they don't automatically mount their GFS filesystems, >> even with "_netdev" specified as a mount option; however, the node which >> comes up first mounts them all as part of bringing all the services up. >> >> I did notice a couple of disconcerting things earlier today. First, I was >> running "watch clustat". (I prefer to see the time updating, where I >> can't with "clustat -i") > > The time is displayed in RHEL5 CVS version, and will go out with 5.2. > > >> At one point, "clustat" crashed as follows: >> >> Jan 2 15:19:54 node2 kernel: clustat[17720]: segfault at 0000000000000024 >> rip 0000003629e75bc0 rsp 00007fff18827178 error 4 > > A clustat crash is not a cause for a fence operation. That is, this > might be related, but is definitely not the cause of a node being > evicted. > > >> Fairly shortly thereafter, clustat reported that node3 as "Online, >> Estranged, rgmanager". Can anyone shed light on what that means? >> Google's not telling me much. > > Ordinarily, this happens when you have a node join the cluster manually > w/o giving it the configuration file. CMAN would assign it a node ID - > but the node is not in the cluster configuration - so clustat would > display the node as 'Estranged'. > > In your case, I'm not sure what the problem would be. I have a theory (see below). Does it give you any ideas what might have happened here? >> At the moment, all three nodes are running CentOS 5.1, with kernel >> 2.6.18-53.1.4.el5. Can anyone point me in the right direction to resolve >> these problems? I wasn't having trouble like this when I was running a >> CentOS 4 CS/GFS cluster. Is it possible to downgrade, likely via a full >> rebuild of all the nodes, from CentOS 5 CS/GFS to 4? Should I instead >> consider setting up a single node to mount the GFS filesystems and serve >> them out, to get around these fencing issues? > > I'd be interested a core file. Try to reproduce your clustat crash with > 'ulimit -c unlimited' set before running clustat. I haven't seen > clustat crash in a very long time, so I'm interested in the cause. > (Also, after the crash, check to see if ccsd is running...) I'll see what I can do for you. > Maybe it will uncover some other hints as to the cause of the behavior > you saw. > > If ccsd indeed failed for some reason, it would cause fencing to fail as > well because the fence daemon would be unable to read fencing actions. > > Even given all of this, this doesn't explain why the node needed to be > fenced in the first place. Were there any log messages indicating why > the node needed to be fenced? > > The RHEL5 / CentOS5 release of Cluster Suite has a fairly aggressive > node death timeout (5 seconds); maybe increasing it would help. > > > > > ... > I've come up with a theory on what's been going on, and so far, that theory appears to be panning out. At the very least, I haven't had any further crashes (yet). I'm hoping someone can validate it or tell me I need to keep looking. On each of the three nodes in my cluster, eth0 is used for cluster services (NFS) and the cluster's multicast group, and eth1 is used for iSCSI. I noticed that two of the three nodes were using DHCP on eth0, and that the problems always seemed to happen when the cluster was under a heavy load. My DHCP server was configured to give these nodes the same address every time, so they essentially had static addresses - they just used DHCP to get them. I think I spotted that there was a DHCP renewal going on at or just before the fencing started each time. 
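A quick way to check that correlation, assuming syslog is going to /var/log/messages and that dhclient uses the default RHEL/CentOS 5 lease file path (an assumption; the path can vary):

  # Show DHCP renewals alongside cluster membership and fencing events
  grep -E 'dhclient|openais|fenced|clurgmgrd' /var/log/messages

  # Show when the current lease was obtained and when dhclient will try to renew it
  cat /var/lib/dhclient/dhclient-eth0.leases

If the renewal timestamps line up with the evictions, that points at the DHCP client rather than the cluster stack.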
My theory is that, under heavy load, this DHCP renewal process was somehow interfering with either the primary IP address for eth0 or with the cluster's multicast traffic and was causing the affected node(s) to get booted from the cluster. I have since switched all the nodes to use truly static addressing, and have not had a problem in the intervening week. I have not yet tried the "" trick that Lon mentioned, but I'm keeping that handy should problems crop up again. Thanks, James From charlieb-linux-cluster at e-smith.com Wed Jan 9 23:20:07 2008 From: charlieb-linux-cluster at e-smith.com (Charlie Brady) Date: Wed, 9 Jan 2008 18:20:07 -0500 (EST) Subject: [Linux-cluster] fcntl locking lockup (dlm 1.07, GFS 6.1.5, kernel 2.6.9-67.EL) In-Reply-To: <1199908397.3277.44.camel@dhcp80-204.msp.redhat.com> Message-ID: On Wed, 9 Jan 2008, Kevin Anderson wrote: > If it is a MSA20, MSA30 or MSA500 - they won't work with GFS. Shared > SCSI bus isn't really shared, accesses lock the bus such that when one > node accesses the storage the other node is locked out. But only temporarily, surely. The filesystem should expect some latency, and all I/O is eventually serialised somewhere. > GFS requires the ability to do shared concurrent access to the storage > devices. This probably explains the hangs you were seeing. I doubt it. Both nodes were still able to access the file system. I also think that there shouldn't be any disk I/O behind fcntl(). Am I wrong? --- Charlie From jorge.gonzalez at degesys.com Thu Jan 10 16:18:21 2008 From: jorge.gonzalez at degesys.com (Jorge Gonzalez) Date: Thu, 10 Jan 2008 17:18:21 +0100 Subject: [Linux-cluster] Cluster fails after fencing by DRAC Message-ID: <4786454D.7030204@degesys.com> Hi all! I have a problem with a 3-node cluster. When I run "fence_node node1", node1 reboots via DRAC successfully. When node1 restarts, it then gets frozen: ------------------ starting clvmd: dlm: got connection fron 32 dlm: connecting to 33 dlm: got connection fron 33 [frozen] * cman_tool services shows: type level name id state fence 0 default 0001001f none [31 32 33] dlm 1 clvmd 00010020 none [31 32 33] dlm 1 rgmanager 00020020 none [32 33] It seems rgmanager does not have 31 (?) * clustat shows: Member Status: Quorate Member Name ID Status ------ ---- ---- ------ xenr3u1.domain.com 31 Online xenr3u2.domain.com 32 Online, Local xenr3u3.domain.com 33 Online ------------------- Then I rebooted node1 again: Starting cluster Loading modules DLM ....... done starting ccsd starting cman starting daemons starting fencing [frozen again] after long time starting fencing [done] but cman_tool services fails * cman_tool services shows: type level name id state fence 0 default 0001001f FAIL_ALL_STOPPED [31 32 33] dlm 1 clvmd 00010020 FAIL_STOP_WAIT [31 32 33] dlm 1 rgmanager 00020020 FAIL_STOP_WAIT * clustat shows: Member Status: Quorate Member Name ID Status ------ ---- ---- ------ xenr3u1.domain.com 31 Online xenr3u2.domain.com 32 Online, Local xenr3u3.domain.com 33 Online /etc/init.d/rgmanager restart Shutting down Cluster Service Manager... Waiting for services to stop: [long timeeeeeeee] ---------------------------------- I saw this page translated to English (http://translate.google.com/translate?u=http%3A%2F%2Fken-etsu-tech.blogspot.com%2F2007%2F11%2Fred-hat-cluster-kernel-xen.html&langpair=ja%7Cen&hl=es&ie=UTF-8). It's exactly the same. A kernel bug? clvmd bug?
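One thing worth capturing for the hang above is whether the rebooted node actually rejoins the fence domain before clvmd starts. A sketch of the checks, using only the stock RHEL5/CentOS 5 cluster tools and init scripts (nothing here is specific to this particular cluster):

  # On the node that hangs at "starting clvmd":
  cman_tool nodes        # does every node agree this node is a member?
  cman_tool services     # is any group stuck mid-transition?
  group_tool ls          # groupd's view of the fence, dlm and gfs groups

  # Bring the stack up by hand to see which step blocks
  service cman start       # joins the cluster and the fence domain
  service clvmd start      # only once cman_tool nodes shows full membership
  service rgmanager start

The FAIL_STOP_WAIT and FAIL_ALL_STOPPED states in the output above suggest a stalled group transition, which is worth chasing before looking at clvmd itself.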
Linux xenr3u2 2.6.18-8.1.15.el5xen #1 SMP Mon Oct 22 09:01:12 EDT 2007 x86_64 x86_64 x86_64 GNU/Linux cman-2.0.64-1.0.1.el5 rgmanager-2.0.24-1.el5.centos lvm2-cluster-2.02.16-3.el5 Sometimes the node starts ok and cman_tool is also ok. * /etc/lvm.conf: devices { dir = "/dev" scan = [ "/dev" ] filter = [ "a/.*/" ] cache = "/etc/lvm/.cache" write_cache_state = 1 sysfs_scan = 1 md_component_detection = 1 } log { verbose = 0 syslog = 1 overwrite = 0 level = 0 indent = 1 command_names = 0 prefix = " " } backup { backup = 1 backup_dir = "/etc/lvm/backup" archive = 1 archive_dir = "/etc/lvm/archive" retain_min = 10 retain_days = 30 } shell { history_size = 100 } global { library_dir = "/usr/lib64" umask = 077 test = 0 activation = 1 proc = "/proc" locking_type = 3 fallback_to_clustered_locking = 1 fallback_to_local_locking = 1 locking_dir = "/var/lock/lvm" } activation { missing_stripe_filler = "/dev/ioerror" reserved_stack = 256 reserved_memory = 8192 process_priority = -18 mirror_region_size = 512 mirror_log_fault_policy = "allocate" mirror_device_fault_policy = "remove" } That's all ;-) Thanks in advance -------------- next part -------------- A non-text attachment was scrubbed... Name: jorge.gonzalez.vcf Type: text/x-vcard Size: 350 bytes Desc: not available URL: From Mathieu.MARY at neufcegetel.fr Fri Jan 11 11:00:09 2008 From: Mathieu.MARY at neufcegetel.fr (MARY, Mathieu) Date: Fri, 11 Jan 2008 12:00:09 +0100 Subject: [Linux-cluster] Cluster fails after fencing by DRAC In-Reply-To: <4786454D.7030204@degesys.com> Message-ID: <20080111102140.157A920B0F8@smtp3.ldcom.fr> Hello, sorry to ask, but is the "none" state a normal state for services? I have issues with cluster services too, and I've been told that this state is not normal and indicates that the nodes didn't join the fence domain, which causes issues with rgmanager too. What do clustat and cman_tool services show at startup? Regards, Mathieu -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jorge Gonzalez Sent: Thursday, 10 January 2008 17:18 To: linux-cluster at redhat.com Subject: [Linux-cluster] Cluster fails after fencing by DRAC Hi all! I have a problem with a 3-node cluster. When I run "fence_node node1", node1 reboots via DRAC successfully. When node1 restarts, it then gets frozen: ------------------ starting clvmd: dlm: got connection fron 32 dlm: connecting to 33 dlm: got connection fron 33 [frozen] * cman_tool services shows: type level name id state fence 0 default 0001001f none [31 32 33] dlm 1 clvmd 00010020 none [31 32 33] dlm 1 rgmanager 00020020 none [32 33] It seems rgmanager does not have 31 (?) * clustat shows: Member Status: Quorate Member Name ID Status ------ ---- ---- ------ xenr3u1.domain.com 31 Online xenr3u2.domain.com 32 Online, Local xenr3u3.domain.com 33 Online ------------------- Then I rebooted node1 again: Starting cluster Loading modules DLM ....... done starting ccsd starting cman starting daemons starting fencing [frozen again] after long time starting fencing [done] but cman_tool services fails * cman_tool services shows: type level name id state fence 0 default 0001001f FAIL_ALL_STOPPED [31 32 33] dlm 1 clvmd 00010020 FAIL_STOP_WAIT [31 32 33] dlm 1 rgmanager 00020020 FAIL_STOP_WAIT * clustat shows: Member Status: Quorate Member Name ID Status ------ ---- ---- ------ xenr3u1.domain.com 31 Online xenr3u2.domain.com 32 Online, Local xenr3u3.domain.com 33 Online /etc/init.d/rgmanager restart Shutting down Cluster Service Manager...
Waiting for services to stop: [long timeeeeeeee] ---------------------------------- I saw this page translated to english (http://translate.google.com/translate?u=http%3A%2F%2Fken-etsu-tech.blogspot.com%2F2007%2F11%2Fred-hat-cluster-kernel-xen.html&langpair=ja%7Cen&hl=es&ie=UTF-8). It's exactly the same. A kernel bug? clvmd bug? Linux xenr3u2 2.6.18-8.1.15.el5xen #1 SMP Mon Oct 22 09:01:12 EDT 2007 x86_64 x86_64 x86_64 GNU/Linux cman-2.0.64-1.0.1.el5 rgmanager-2.0.24-1.el5.centos lvm2-cluster-2.02.16-3.el5 Sometimes the node starts ok and cman_tool is also ok. * /etc/lvm.conf: devices { dir = "/dev" scan = [ "/dev" ] filter = [ "a/.*/" ] cache = "/etc/lvm/.cache" write_cache_state = 1 sysfs_scan = 1 md_component_detection = 1 } log { verbose = 0 syslog = 1 overwrite = 0 level = 0 indent = 1 command_names = 0 prefix = " " } backup { backup = 1 backup_dir = "/etc/lvm/backup" archive = 1 archive_dir = "/etc/lvm/archive" retain_min = 10 retain_days = 30 } shell { history_size = 100 } global { library_dir = "/usr/lib64" umask = 077 test = 0 activation = 1 proc = "/proc" locking_type = 3 fallback_to_clustered_locking = 1 fallback_to_local_locking = 1 locking_dir = "/var/lock/lvm" } activation { missing_stripe_filler = "/dev/ioerror" reserved_stack = 256 reserved_memory = 8192 process_priority = -18 mirror_region_size = 512 mirror_log_fault_policy = "allocate" mirror_device_fault_policy = "remove" } That's all ;-) Thanks in advance From saza_thi at yahoo.com Fri Jan 11 11:56:03 2008 From: saza_thi at yahoo.com (sahai srichock) Date: Fri, 11 Jan 2008 03:56:03 -0800 (PST) Subject: [Linux-cluster] cluster down network Message-ID: <79685.6189.qm@web54202.mail.re2.yahoo.com> I have two node cluster . /etc/cluster/cluster.conf