From lionel.hay at agriculture.gouv.fr  Mon Oct  1 07:06:02 2007
From: lionel.hay at agriculture.gouv.fr (lionel.hay)
Date: Mon, 01 Oct 2007 09:06:02 +0200
Subject: [Linux-cluster] DRBD cs:DiskLessClient  inconsistant
Message-ID: <47009C5A.6030706@agriculture.gouv.fr>

Hi

What does that mean  cs:DiskLessClient ld:Inconsistent ?

Here are my status

SERVER 1:
---------
drbd driver loaded OK; device status:
version: 0.7.11 (api:77/proto:74)
SVN Revision: 1807 build by root at svr-dep64, 2006-11-23 16:28:05
  0: cs:Unconfigured
  1: cs:Unconfigured
  2: cs:DiskLessClient st:Primary/Secondary ld:Inconsistent
     ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
  3: cs:Connected st:Primary/Secondary ld:Consistent
     ns:80203 nr:0 dw:45187 dr:54382 al:0 bm:52 lo:0 pe:0 ua:0 ap:0
  4: cs:Unconfigured
  5: cs:Connected st:Primary/Secondary ld:Consistent
     ns:15865321 nr:0 dw:15865313 dr:1285608 al:372493 bm:2 lo:0 pe:0 
ua:0 ap:0
  6: cs:Connected st:Primary/Secondary ld:Consistent
     ns:2890920 nr:0 dw:224620 dr:8650473 al:938 bm:1088 lo:0 pe:0 ua:0 ap:0
  7: cs:Unconfigured


SERVER 2:
---------
drbd driver loaded OK; device status:
version: 0.7.11 (api:77/proto:74)
SVN Revision: 1807 build by root at svr-secours, 2006-11-17 15:16:30
  0: cs:Unconfigured
  1: cs:Unconfigured
  2: cs:ServerForDLess st:Secondary/Primary ld:Consistent
     ns:0 nr:0 dw:35800 dr:0 al:0 bm:60 lo:0 pe:0 ua:0 ap:0
  3: cs:Connected st:Secondary/Primary ld:Consistent
     ns:0 nr:80162 dw:80162 dr:0 al:0 bm:52 lo:0 pe:0 ua:0 ap:0
  4: cs:Unconfigured
  5: cs:Connected st:Secondary/Primary ld:Consistent
     ns:0 nr:15865224 dw:15865224 dr:0 al:0 bm:2 lo:0 pe:0 ua:0 ap:0
  6: cs:Connected st:Secondary/Primary ld:Consistent
     ns:0 nr:2890240 dw:2890240 dr:0 al:0 bm:1088 lo:0 pe:0 ua:0 ap:0
  7: cs:Unconfigured

Thanks for your response

Lionel HAY
DDAF 64
0559021254
FRANCE


From lhh at redhat.com  Mon Oct  1 14:09:11 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:09:11 -0400
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for a two
	nodes cluster
In-Reply-To: <20070925124750.GA28897@jasmine.xos.nl>
References: <46F8B1C5.7010007@bull.net> <1190709435.23919.11.camel@marc>
	<20070925124750.GA28897@jasmine.xos.nl>
Message-ID: <1191247751.4477.1.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-09-25 at 14:47 +0200, Jos Vos wrote:
> On Tue, Sep 25, 2007 at 10:37:15AM +0200, Marc - A. Dahlhaus [ Administration | Westermann GmbH ] wrote:
> 
> > Your problem lies here: expected_votes="3"
> 
> [...]
> 
> > You should calculate your votes like this:
> > ( votes % 2 ) + 1
> > 
> > So you should use this: expected_votes="2"
> 
> Oh... but the FAQ (# 18) explicitly says "nodes + 1" and gives "3"
> as the example for a two-node cluster.  And I tried it (accidently)
> with expected_votes="2" and the result was that the nodes started
> fencing each other in an endless loop.  This was solve by setting
> expected_votes="3".  This is on RHEL5. b.t.w., not RHEL4.
> 

two_node=0
expected_votes=3
qdisk=1 vote
nodes=1 vote each

-- Lon


From lhh at redhat.com  Mon Oct  1 14:11:30 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:11:30 -0400
Subject: [Linux-cluster] help:can't execute Add a failover Domain
In-Reply-To: <390793994.12818@ustc.edu.cn>
References: <390793994.12818@ustc.edu.cn>
Message-ID: <1191247890.4477.3.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-09-26 at 16:06 +0800, lining at mail.ustc.edu.cn wrote:
> I have a cluster with two nodes ,it has started.
> On the conga platform ,when I choose add a failover domain ,
> it returned as followed:
> 
> Network station Error
> this network station occured an error when handle your request.
> the error is :
> error type:
>     AttributeError
> error value:
>     getFdomNodes

That's a bug.

http://bugzilla.redhat.com

-- Lon


From lhh at redhat.com  Mon Oct  1 14:12:47 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:12:47 -0400
Subject: [Linux-cluster] Re: CS4 U5 / recommended quorumd values for a
	two nodes (contd.)
In-Reply-To: <46FA5AA8.4010205@bull.net>
References: <46FA5AA8.4010205@bull.net>
Message-ID: <1191247967.4477.5.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-09-26 at 15:12 +0200, Alain Moulle wrote:

> until the cman of second node is started and then :
> Sep 26 15:07:01 s_sys at bali0 ccsd[12224]: Cluster is quorate.  Allowing connections.
> 
> I have read and read again the FAQ page, especially the # you mention, but
> don't understand why it does not work for me ...
> Except if my quorum disk is not working ?
> But command mkqisk returns :
> #mkqdisk -L
> mkqdisk v0.5.1
> /dev/sdk:
>         Magic:   eb7a62c2
>         Label:   CS4QUORUMDISK
>         Created: Tue Sep 18 16:33:40 2007
>         Host:    node
> but is it sufficient to know if Quorum disk is working correctly ?

cat /tmp/qdisk_status

On RHEL4.x, you need to chkconfig --add qdiskd

On 4.4, qdiskd won't correctly wait for CMAN; this is fixed in the 4.5
version.

-- Lon


From lhh at redhat.com  Mon Oct  1 14:13:06 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:13:06 -0400
Subject: [Linux-cluster] rgmanager and qdisk in RHEL5 "behavior problems"
In-Reply-To: <OFCE45FADA.AB5EC862-ONC1257362.005B11EA-C1257362.005C847B@obi.de>
References: <OFCE45FADA.AB5EC862-ONC1257362.005B11EA-C1257362.005C847B@obi.de>
Message-ID: <1191247986.4477.7.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-09-26 at 18:50 +0200, thorsten.henrici at gfd.de wrote:
> 
> Hello List, 
> has this fix 
> 
> http://www.redhat.com/archives/cluster-devel/2007-April/msg00064.html 
> 
> Rgmanager thinks qdisk is a node (with node ID 0), so it tries to send
> VF information to node 0 - which doesn't exist, causing rgmanger to
> not
> work when qdisk is running :( 

This is a bug in 5.0; it's fixed in 5.1 beta.

-- Lon


From lhh at redhat.com  Mon Oct  1 14:14:49 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:14:49 -0400
Subject: [Linux-cluster] service can not be relocated
In-Reply-To: <9fa3c2e50709270031n187b3403nf0666fdfa9bed4e9@mail.gmail.com>
References: <9fa3c2e50709270031n187b3403nf0666fdfa9bed4e9@mail.gmail.com>
Message-ID: <1191248089.4477.9.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-09-27 at 15:31 +0800, Changer Van wrote:
> Hi all,
> 
> Httpd service can not be relocated when I performed the command as
> follows:
> 
> # clusvcadm -r httpd
> Trying to relocate service:httpd...Failure
> service:httpd is now running on node02

Hi, what release are you using?

-- Lon


From lhh at redhat.com  Mon Oct  1 14:18:55 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:18:55 -0400
Subject: [Linux-cluster] Different views
In-Reply-To: <46FCCE79.3070804@cesca.es>
References: <46FCCE79.3070804@cesca.es>
Message-ID: <1191248335.4477.13.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-09-28 at 11:50 +0200, Jordi Prats wrote:
> Hi,
> I'm getting a strange error: On one node I cannot see the other one, but 
> on the other I can see both online. Any one can help me with this?
> 
> I'm getting a lot of problems setting up this version of RH cluster (the 
> one with openais).
> 
> Thanks,
> 
> Here I paste some status messages:

Is fencing waiting for completion for inf19 on one of the other nodes?
When a cluster forms a quorum, the nodes which are not part of the
quorate partition need to be fenced.

So, it's likely that the one of the nodes is trying to fence inf19.

-- Lon


From lhh at redhat.com  Mon Oct  1 14:20:51 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 01 Oct 2007 10:20:51 -0400
Subject: [Linux-cluster] service can not be relocated
In-Reply-To: <1191248089.4477.9.camel@ayanami.boston.devel.redhat.com>
References: <9fa3c2e50709270031n187b3403nf0666fdfa9bed4e9@mail.gmail.com>
	<1191248089.4477.9.camel@ayanami.boston.devel.redhat.com>
Message-ID: <1191248451.4477.15.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-01 at 10:14 -0400, Lon Hohberger wrote:
> On Thu, 2007-09-27 at 15:31 +0800, Changer Van wrote:
> > Hi all,
> > 
> > Httpd service can not be relocated when I performed the command as
> > follows:
> > 
> > # clusvcadm -r httpd
> > Trying to relocate service:httpd...Failure
> > service:httpd is now running on node02
> 
> Hi, what release are you using?

Right, and are there any logs on node01 indicating why it might not be
started?

-- Lon


From jobot at wmdata.com  Mon Oct  1 14:33:49 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Mon, 1 Oct 2007 16:33:49 +0200
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <20070928170309.GD7239@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
	<20070928164547.GC7239@redhat.com>
	<20070928170309.GD7239@redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D02230A7D@WMRI000167.corp.wmdata.net>

-----Original Message-----
From: David Teigland [mailto:teigland at redhat.com] 
Sent: den 28 september 2007 19:03
To: Borgstr?m Jonas
Cc: linux clustering
Subject: Re: [Linux-cluster] Possible cman init script race condition

> On Fri, Sep 28, 2007 at 11:45:47AM -0500, David Teigland wrote:
> > On Fri, Sep 28, 2007 at 09:58:18AM -0500, David Teigland wrote:
> > > On Fri, Sep 28, 2007 at 04:48:18PM +0200, Borgstr?m Jonas wrote:
> > > > I must have misunderstood you or something, but didn't I already include
> > > > that info in the message I sent a few days ago?
> > > > 
> > > > http://permalink.gmane.org/gmane.linux.redhat.cluster/9999
> > > > 
> > > > (The archive inlines the "group_tool dump" output making it a bit hard
> > > > to read, but hopefully your email client shows them as attachments).
> > > 
> > > I missed that, I'll take a look, thanks.
> > 
> > You've hit a known bug that's been fixed:
> >   https://bugzilla.redhat.com/show_bug.cgi?id=251966
> > 
> > We may have to move up the release of that fix since people are seeing the
> > problem.  Be careful when reading that bz because there's a lot of
> > incorrect diagnosis that was recorded before we figured out what the real
> > bug was.  Here's the problem, it's very complex:
> > 
> > 1. when the nodes start up, they each form a 1-node openais cluster
> >    independent of the other
> > 
> >    [This shouldn't really happen, but in reality we can't prevent it
> >     100% of the time.  We try to make it rare, and then deal with it
> >     sensibly on the rare occasion when it does happen.  You've hit
> >     the "rare" occasion -- if you're actually seeing this regularly
> >     then we probably need to fix or adjust something at the openais
> >     level to make it less common.]
>
> I'd try to use some sleeps here, before running fence_tool join on either
> node, as a work-around.  We're trying to get both nodes merged together
> before they do anything else.

Strangely enough adding a "sleep 30" line directly below the "echo "Starting cluster: "" line seems to make this problem go away every time. Note that this is before any daemon is started. It works, but I'm not sure why.

>
> Also, how often are you seeing the nodes not merge together right away?
> If it's frequent, then we need to fix that.

This happens every time on this hardware (2 Dell 1955 blades). I never got fenced to work correctly until I figured out that I need to add a sleep 30 to the cman init script. So I'm obviously very interested in seeing this fixed in a 5.0 errata or in 5.1 at the very latest. I can't really wait until 5.2 is out...

And as I mentioned before, the really scary part is that I am able to mount gfs filesystems during this kind of cluster split. And if I one node is shot, the other node replays the gfs journal and makes the filesystem writable again without first fencing the shot/missing node.

Here some "group_tool -v" output with a mounted filesystem:

[root at prod-db2 pgsql]# group_tool -v
type             level name     id       state node id local_done
fence            0     default  00010002 JOIN_START_WAIT 1 100020001 1
[1 2]
dlm              1     clvmd    00020001 JOIN_START_WAIT 1 100020001 1
[1 2]
dlm              1     pg_fs    00060001 JOIN_START_WAIT 1 100020001 1
[1 2]
gfs              2     pg_fs    00050001 JOIN_START_WAIT 1 100020001 1
[1 2]


Regards,
Jonas


From teigland at redhat.com  Mon Oct  1 16:21:46 2007
From: teigland at redhat.com (David Teigland)
Date: Mon, 1 Oct 2007 11:21:46 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D02230A7D@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
	<20070928164547.GC7239@redhat.com>
	<20070928170309.GD7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230A7D@WMRI000167.corp.wmdata.net>
Message-ID: <20071001162145.GC3937@redhat.com>

> Strangely enough adding a "sleep 30" line directly below the "echo
> "Starting cluster: "" line seems to make this problem go away every
> time. Note that this is before any daemon is started. It works, but I'm
> not sure why.

Have you tried numbers less than 30?  I forget if I've asked yet, but do
you have the xend init script disabled?


> > Also, how often are you seeing the nodes not merge together right
> > away?  If it's frequent, then we need to fix that.
> 
> This happens every time on this hardware (2 Dell 1955 blades). I never
> got fenced to work correctly until I figured out that I need to add a
> sleep 30 to the cman init script. So I'm obviously very interested in
> seeing this fixed in a 5.0 errata or in 5.1 at the very latest. I can't
> really wait until 5.2 is out...

Remember, there are two problems we're talking about here.  The first is
why openais doesn't merge together for many seconds when both nodes start
up in parallel.  This should be a rare occurance.  The fact that you're
seeing it every time implies there's an openais problem, or there could be
a problem related to the networking between your nodes.  We don't have any
idea at this point.  Maybe Steve Dake could help you more with this.  Your
sleep 30 workaround is a clue -- it forces openais to start 30 seconds
apart on the two nodes.

The second problem is how we deal with the eventual merging of the two
clusters.  After we fix the first problem, you will probably never see
this second problem again.


> And as I mentioned before, the really scary part is that I am able to
> mount gfs filesystems during this kind of cluster split. And if I one
> node is shot, the other node replays the gfs journal and makes the
> filesystem writable again without first fencing the shot/missing node.

I would need to see the logs from the exact scenario you're talking about
here to determine if this is a new problem or an effect of the other one.

Dave


From linux-cluster at veggiechinese.net  Mon Oct  1 19:04:02 2007
From: linux-cluster at veggiechinese.net (William Yardley)
Date: Mon, 1 Oct 2007 12:04:02 -0700
Subject: [Linux-cluster] simplest usage case for shared storage pool
Message-ID: <20071001190402.GA9288@mitch.veggiechinese.net>

I have a Dell MD-3000 hooked up (via SAS) to 2 (will be 4 in actual
usage) Dell 2900 series servers. I would like to know the _bare_minimum_
set of stuff I need to configure to share a filesystem between the two
devices, keeping in mind the following criteria:

 * The devices will mount the filesystem ro - there will be only a
   single node mounting the filesystesm rw
 * The root filesystem or other system files will not be mounting the
   array - just a filesystem with shared data.
 * The applications that will be accessing the data will be Apache
   (read-only), as well as (on the rw host) rsync or some other standard
   utility to keep things up to date.

So given that, is fencing each host and having a full cluster setup
really necessary? Given that gfs (and the application(s) accessing the
image do POSIX compliant file locking, isn't there some simpler way to
accomplish this?

Assuming it is necessary, I should be able to use IPMI on the individual
hosts to fence the devices, correct?

Obviously NFS would be a bit simpler, but even with a NetApp or other
filer appliance, and the latest versions of NFS, I'm a bit concerned
that the performance wouldn't be as good.

w


From stefano.bossi at mediaset.it  Mon Oct  1 21:06:02 2007
From: stefano.bossi at mediaset.it (Stefano Bossi)
Date: Mon, 01 Oct 2007 23:06:02 +0200
Subject: [Linux-cluster] dlm_controld problem starting
Message-ID: <4701613A.5040301@mediaset.it>

Hi guys,

I'm trying to compile a GFS2 system from source but I found some trouble.

I'm using 2.6.23-rc7 #1 SMP PREEMPT Mon Oct 1 14:14:36 CEST 2007 x86_64
AMD Opteron(tm) Processor 285 AuthenticAMD GNU/Linux on the node where I
found the trouble.

This is a 4 nodes cluster:

[root at SAN-node1 mnt]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M  416952   2007-10-01 19:31:46  SAN-node1
   2   M  417044   2007-10-01 19:31:46  SAN-node2
   3   M  417056   2007-10-01 22:37:34  SAN-node3
   4   M  417044   2007-10-01 19:31:46  SAN-node4

[root at SAN-node1 mnt]# cman_tool status
Version: 6.0.1
Config Version: 10
Cluster Name: alpha_cluster
Cluster Id: 50356
Cluster Member: Yes
Cluster Generation: 417056
Membership state: Cluster-Member
Nodes: 4
Expected votes: 4
Total votes: 4
Quorum: 3
Active subsystems: 8
Flags:
Ports Bound: 0 11 177
Node name: SAN-node1
Node ID: 1
Multicast addresses: 239.0.1.100
Node addresses: 10.102.41.74

as you can see the cluster is quorate and the other three node are
correcly working (they are Fedora core 7 and no compilation from scratch !)

The node I'm trying to rebuild is the SAN-node3 and the error is:

Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
/sys/kernel/config/dlm, is the dlm loaded?
Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
/sys/kernel/config/dlm, is the dlm loaded?
Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
/sys/kernel/config/dlm, is the dlm loaded?
Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
/sys/kernel/config/dlm, is the dlm loaded?

of course I recompiled the kernel too.

Where I can point my attention to find out the problem?
Has some more experienced then me some useful suggestion?

Thanks,
Stefano

Le informazioni trasmesse sono destinate esclusivamente alla persona o alla societ? in indirizzo e sono da intendersi confidenziali e riservate. Ogni trasmissione, inoltro, diffusione o altro uso di queste informazioni a persone o societ? differenti dal destinatario ? proibita. Se ricevete questa comunicazione per errore, contattate il mittente e cancellate le informazioni da ogni computer.

The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer.


From teigland at redhat.com  Mon Oct  1 21:01:45 2007
From: teigland at redhat.com (David Teigland)
Date: Mon, 1 Oct 2007 16:01:45 -0500
Subject: [Linux-cluster] dlm_controld problem starting
In-Reply-To: <4701613A.5040301@mediaset.it>
References: <4701613A.5040301@mediaset.it>
Message-ID: <20071001210145.GE3937@redhat.com>

On Mon, Oct 01, 2007 at 11:06:02PM +0200, Stefano Bossi wrote:
> Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
> /sys/kernel/config/dlm, is the dlm loaded?
> Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
> /sys/kernel/config/dlm, is the dlm loaded?
> Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
> /sys/kernel/config/dlm, is the dlm loaded?
> Oct  1 22:38:00 SAN-node3 dlm_controld[12928]: No
> /sys/kernel/config/dlm, is the dlm loaded?

Either you've not loaded the dlm kernel module, or you've not mounted
configfs, mount -t configfs none /sys/kernel/config.

Dave


From maciej.bogucki at artegence.com  Tue Oct  2 09:46:36 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 02 Oct 2007 11:46:36 +0200
Subject: [Linux-cluster] DRBD cs:DiskLessClient  inconsistant
In-Reply-To: <47009C5A.6030706@agriculture.gouv.fr>
References: <47009C5A.6030706@agriculture.gouv.fr>
Message-ID: <4702137C.2080208@artegence.com>

lionel.hay napisa?(a):
> Hi
> 
> What does that mean  cs:DiskLessClient ld:Inconsistent ?

Hello,

Check Your log(and dmesg). Probably your storage failed on IO request
and DRBD deatched one.

Best Regarads
Maciej Bogucki


From jobot at wmdata.com  Tue Oct  2 15:51:40 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Tue, 2 Oct 2007 17:51:40 +0200
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <20071001162145.GC3937@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
	<20070928164547.GC7239@redhat.com>
	<20070928170309.GD7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230A7D@WMRI000167.corp.wmdata.net>
	<20071001162145.GC3937@redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D02230F81@WMRI000167.corp.wmdata.net>

-----Original Message-----
From: David Teigland [mailto:teigland at redhat.com] 
Sent: den 1 oktober 2007 18:22
To: Borgstr?m Jonas
Cc: linux clustering
Subject: Re: [Linux-cluster] Possible cman init script race condition
> 
> > Strangely enough adding a "sleep 30" line directly below the "echo
> > "Starting cluster: "" line seems to make this problem go away every
> > time. Note that this is before any daemon is started. It works, but I'm
> > not sure why.
>
> Have you tried numbers less than 30?  I forget if I've asked yet, but do
> you have the xend init script disabled?

I did try "sleep 15" but that was not enough. Maybe the HBA/lun initialization that's taking too long or something.

And no, xen is not installed on these servers.

>
>
> > > Also, how often are you seeing the nodes not merge together right
> > > away?  If it's frequent, then we need to fix that.
> > 
> > This happens every time on this hardware (2 Dell 1955 blades). I never
> > got fenced to work correctly until I figured out that I need to add a
> > sleep 30 to the cman init script. So I'm obviously very interested in
> > seeing this fixed in a 5.0 errata or in 5.1 at the very latest. I can't
> > really wait until 5.2 is out...
>
> Remember, there are two problems we're talking about here.  The first is
> why openais doesn't merge together for many seconds when both nodes start
> up in parallel.  This should be a rare occurance.  The fact that you're
> seeing it every time implies there's an openais problem, or there could be
> a problem related to the networking between your nodes.  We don't have any
> idea at this point.  Maybe Steve Dake could help you more with this.  Your
> sleep 30 workaround is a clue -- it forces openais to start 30 seconds
> apart on the two nodes.

No, I think the cman daemons are started at pretty much the same time on both nodes. At least if I reboot both machines at the same time. "sleep 30" gives the kernel and the programs started before "cman" an extra 30 seconds to do their stuff before the bulk of the cman init script is executed.

Another workaround is to run "chkconfig cman off" and start it from /etc/rc.d/rc.local. That also works, and does not require and "sleep". This probably works since rc.local is the very last thing executed by the boot-up process and that is probably at least 30 seconds later.

>
> The second problem is how we deal with the eventual merging of the two
> clusters.  After we fix the first problem, you will probably never see
> this second problem again.
>
>
> > And as I mentioned before, the really scary part is that I am able to
> > mount gfs filesystems during this kind of cluster split. And if I one
> > node is shot, the other node replays the gfs journal and makes the
> > filesystem writable again without first fencing the shot/missing node.
>
> I would need to see the logs from the exact scenario you're talking about
> here to determine if this is a new problem or an effect of the other one.

Ok, here's some log outpt:

Scenario: A gfs filesystem is mounted on two nodes in a "split cluster"

cluster.conf: http://jonas.borgstrom.se/gfs/cluster.conf

Node: prod-db1:
group_tool -v: http://jonas.borgstrom.se/gfs/prod_db1_group_tool_v.txt
group_tool dump: http://jonas.borgstrom.se/gfs/prod_db1_group_tool_dump.txt

Node: prod-db2:
group_tool -v: http://jonas.borgstrom.se/gfs/prod_db2_group_tool_v.txt
group_tool dump: http://jonas.borgstrom.se/gfs/prod_db2_group_tool_dump.txt

Node prod-db1 is now shot and prod-db2 happily replays the gfs journal without first fencing the failed node:

Node: prod-db2:
group_tool -v: http://jonas.borgstrom.se/gfs/prod_db2_group_tool_v_after_prod_db1_is_shot.txt
group_tool dump: http://jonas.borgstrom.se/gfs/prod_db2_group_tool_dump_after_prod_db1_is_shot.txt
/var/log/messages: http://jonas.borgstrom.se/gfs/prod_db2_messages_after_prod_db1_is_shot.txt

So gfs is till mounted and writable on prod-db2 even though prod-db1 was never fenced.

Expected behavior: prod-db1 should be fenced before the gfs journal is replayed. (Which happens if I add "sleep 30" to /etc/rc.d/init.d/cman).

Regards,
Jonas


From raycharles_man at yahoo.com  Tue Oct  2 16:08:25 2007
From: raycharles_man at yahoo.com (Ray Charles)
Date: Tue, 2 Oct 2007 09:08:25 -0700 (PDT)
Subject: [Linux-cluster] partition tables not sync'd
Message-ID: <694881.1462.qm@web32112.mail.mud.yahoo.com>


Hi,

I have a one year old cluster(2 nodes) in production
that is GFS 6.1 attached to an iscsi san. Its
currently in an active passive arrangement for
tomcat/apache.

While the active node was busy servicing web visitors,
I used the other node to add a lun from the san; then
a physical volume and then a logical volume to the
existing vol.group; then I put the file system on the
new partition. I also have it mounted on the
non-active node.

The active node will not recognize the new partition
schema. I've rebooted it but it still doesn't want to
see the new partition the other node has created.

What should I do?


-tia


      ____________________________________________________________________________________
Don't let your dream ride pass you by. Make it a reality with Yahoo! Autos.
http://autos.yahoo.com/index.html
 

From rhurst at bidmc.harvard.edu  Tue Oct  2 16:13:43 2007
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Tue, 02 Oct 2007 12:13:43 -0400
Subject: [Linux-cluster] partition tables not sync'd
In-Reply-To: <694881.1462.qm@web32112.mail.mud.yahoo.com>
References: <694881.1462.qm@web32112.mail.mud.yahoo.com>
Message-ID: <1191341623.19225.22.camel@xw9300.bidmc.harvard.edu>

Silly question: you also masked that lun on the active node, right?
You could also remove the /etc/lvm/.cache and lvmdiskscan / vgscan
--mknodes

On Tue, 2007-10-02 at 09:08 -0700, Ray Charles wrote:

> Hi,
> 
> I have a one year old cluster(2 nodes) in production
> that is GFS 6.1 attached to an iscsi san. Its
> currently in an active passive arrangement for
> tomcat/apache.
> 
> While the active node was busy servicing web visitors,
> I used the other node to add a lun from the san; then
> a physical volume and then a logical volume to the
> existing vol.group; then I put the file system on the
> new partition. I also have it mounted on the
> non-active node.
> 
> The active node will not recognize the new partition
> schema. I've rebooted it but it still doesn't want to
> see the new partition the other node has created.
> 
> What should I do?
> 
> 
> -tia
> 
> 
> 
> 
> 
>       ____________________________________________________________________________________
> Don't let your dream ride pass you by. Make it a reality with Yahoo! Autos.
> http://autos.yahoo.com/index.html
>  
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071002/1b387224/attachment.htm>

From raycharles_man at yahoo.com  Tue Oct  2 16:26:20 2007
From: raycharles_man at yahoo.com (Ray Charles)
Date: Tue, 2 Oct 2007 09:26:20 -0700 (PDT)
Subject: [Linux-cluster] partition tables not sync'd
In-Reply-To: <1191341623.19225.22.camel@xw9300.bidmc.harvard.edu>
Message-ID: <126009.89759.qm@web32101.mail.mud.yahoo.com>


Well, in my original attempt i did a rescan-scsi_bus,
vgscan and lvscan but the new partition didn't show
up.

I think you're on to something that i also want to try
and that's the /etc/lvm/.cache.

Any precautions? 

-tia

--- Robert Hurst <rhurst at bidmc.harvard.edu> wrote:

> Silly question: you also masked that lun on the
> active node, right?
> You could also remove the /etc/lvm/.cache and
> lvmdiskscan / vgscan
> --mknodes
> 
> On Tue, 2007-10-02 at 09:08 -0700, Ray Charles
> wrote:
> 
> > Hi,
> > 
> > I have a one year old cluster(2 nodes) in
> production
> > that is GFS 6.1 attached to an iscsi san. Its
> > currently in an active passive arrangement for
> > tomcat/apache.
> > 
> > While the active node was busy servicing web
> visitors,
> > I used the other node to add a lun from the san;
> then
> > a physical volume and then a logical volume to the
> > existing vol.group; then I put the file system on
> the
> > new partition. I also have it mounted on the
> > non-active node.
> > 
> > The active node will not recognize the new
> partition
> > schema. I've rebooted it but it still doesn't want
> to
> > see the new partition the other node has created.
> > 
> > What should I do?
> > 
> > 
> > -tia
> > 
> > 
> > 
> > 
> > 
> >      
>
____________________________________________________________________________________
> > Don't let your dream ride pass you by. Make it a
> reality with Yahoo! Autos.
> > http://autos.yahoo.com/index.html
> >  
> > 
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> >
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> > --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster


____________________________________________________________________________________
Looking for a deal? Find great prices on flights and hotels with Yahoo! FareChase.
http://farechase.yahoo.com/


From teigland at redhat.com  Tue Oct  2 16:25:14 2007
From: teigland at redhat.com (David Teigland)
Date: Tue, 2 Oct 2007 11:25:14 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D02230F81@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
	<20070928164547.GC7239@redhat.com>
	<20070928170309.GD7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230A7D@WMRI000167.corp.wmdata.net>
	<20071001162145.GC3937@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230F81@WMRI000167.corp.wmdata.net>
Message-ID: <20071002162514.GA30975@redhat.com>

On Tue, Oct 02, 2007 at 05:51:40PM +0200, Borgstr?m Jonas wrote:

> No, I think the cman daemons are started at pretty much the same time on
> both nodes. At least if I reboot both machines at the same time. "sleep
> 30" gives the kernel and the programs started before "cman" an extra 30
> seconds to do their stuff before the bulk of the cman init script is
> executed.
> 
> Another workaround is to run "chkconfig cman off" and start it from
> /etc/rc.d/rc.local. That also works, and does not require and "sleep".
> This probably works since rc.local is the very last thing executed by
> the boot-up process and that is probably at least 30 seconds later.

I've finally chatted with Steve Dake about this, and he's quite certain
that this is a result of openais bugs in the RHEL5.0 release -- fixed in
the upcoming 5.1.  It might be easiest to use your workarounds until 5.1.


> Ok, here's some log outpt:
> 
> Scenario: A gfs filesystem is mounted on two nodes in a "split cluster"

Thanks a lot, I'll take a look.

Dave


From rhurst at bidmc.harvard.edu  Tue Oct  2 17:02:40 2007
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Tue, 02 Oct 2007 13:02:40 -0400
Subject: [Linux-cluster] partition tables not sync'd
In-Reply-To: <126009.89759.qm@web32101.mail.mud.yahoo.com>
References: <126009.89759.qm@web32101.mail.mud.yahoo.com>
Message-ID: <1191344560.19225.37.camel@xw9300.bidmc.harvard.edu>

I erase that cache before invoking EMC PowerPath, because I have had
issues with changes not making it to that multipathing I/O solution.  I
have experienced no issues doing it adhoc, either.

rescan-scsi bus doesn't necessarily work with the fiber channel cards...
I have to send an equivalent scsi request command directly to the fc
driver (in our case, we are using Emulex lpfc).  An example script we
use:

#!/bin/bash
#
# rescan Emulex Fiber Channel card for new SCSI devices
#

ACTION=`basename $0`

powermt display dev=all | grep emcpower > /tmp/$ACTION.old

echo 0 > /sys/class/fc_host/host0/issue_lip
echo "- - -" > /sys/class/scsi_host/host0/scan

echo 1 > /sys/class/fc_host/host1/issue_lip
echo "- - -" > /sys/class/scsi_host/host1/scan

powermt config

powermt display dev=all | grep emcpower > /tmp/$ACTION.new

echo "Differences before & after"
echo "=========================="
diff /tmp/$ACTION.old /tmp/$ACTION.new


On Tue, 2007-10-02 at 09:26 -0700, Ray Charles wrote:

> Well, in my original attempt i did a rescan-scsi_bus,
> vgscan and lvscan but the new partition didn't show
> up.
> 
> I think you're on to something that i also want to try
> and that's the /etc/lvm/.cache.
> 
> Any precautions? 
> 
> -tia


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071002/05875cd1/attachment.htm>

From raycharles_man at yahoo.com  Tue Oct  2 17:47:41 2007
From: raycharles_man at yahoo.com (Ray Charles)
Date: Tue, 2 Oct 2007 10:47:41 -0700 (PDT)
Subject: [Linux-cluster] partition tables not sync'd
In-Reply-To: <1191344560.19225.37.camel@xw9300.bidmc.harvard.edu>
Message-ID: <807026.16460.qm@web32108.mail.mud.yahoo.com>


mea culpa-

Problem caused by an ID 10T controlling the box.  
The ID 10T didn't provision the active server to be a
host for the new lun. Sorry for the bother.


The ID 10T has been given a slap and a shake.

-Thanks everyone


--- Robert Hurst <rhurst at bidmc.harvard.edu> wrote:

> I erase that cache before invoking EMC PowerPath,
> because I have had
> issues with changes not making it to that
> multipathing I/O solution.  I
> have experienced no issues doing it adhoc, either.
> 
> rescan-scsi bus doesn't necessarily work with the
> fiber channel cards...
> I have to send an equivalent scsi request command
> directly to the fc
> driver (in our case, we are using Emulex lpfc).  An
> example script we
> use:
> 
> #!/bin/bash
> #
> # rescan Emulex Fiber Channel card for new SCSI
> devices
> #
> 
> ACTION=`basename $0`
> 
> powermt display dev=all | grep emcpower >
> /tmp/$ACTION.old
> 
> echo 0 > /sys/class/fc_host/host0/issue_lip
> echo "- - -" > /sys/class/scsi_host/host0/scan
> 
> echo 1 > /sys/class/fc_host/host1/issue_lip
> echo "- - -" > /sys/class/scsi_host/host1/scan
> 
> powermt config
> 
> powermt display dev=all | grep emcpower >
> /tmp/$ACTION.new
> 
> echo "Differences before & after"
> echo "=========================="
> diff /tmp/$ACTION.old /tmp/$ACTION.new
> 
> 
> On Tue, 2007-10-02 at 09:26 -0700, Ray Charles
> wrote:
> 
> > Well, in my original attempt i did a
> rescan-scsi_bus,
> > vgscan and lvscan but the new partition didn't
> show
> > up.
> > 
> > I think you're on to something that i also want to
> try
> > and that's the /etc/lvm/.cache.
> > 
> > Any precautions? 
> > 
> > -tia
> 
> 
> > --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster


      ____________________________________________________________________________________
Fussy? Opinionated? Impossible to please? Perfect.  Join Yahoo!'s user panel and lay it on us. http://surveylink.yahoo.com/gmrs/yahoo_panel_invite.asp?a=7 


From teigland at redhat.com  Tue Oct  2 19:03:05 2007
From: teigland at redhat.com (David Teigland)
Date: Tue, 2 Oct 2007 14:03:05 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D02230F81@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
	<20070928164547.GC7239@redhat.com>
	<20070928170309.GD7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230A7D@WMRI000167.corp.wmdata.net>
	<20071001162145.GC3937@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230F81@WMRI000167.corp.wmdata.net>
Message-ID: <20071002190305.GA4378@redhat.com>

On Tue, Oct 02, 2007 at 05:51:40PM +0200, Borgstr?m Jonas wrote:
> > > And as I mentioned before, the really scary part is that I am able to
> > > mount gfs filesystems during this kind of cluster split. And if I one
> > > node is shot, the other node replays the gfs journal and makes the
> > > filesystem writable again without first fencing the shot/missing node.
> >
> > I would need to see the logs from the exact scenario you're talking about
> > here to determine if this is a new problem or an effect of the other one.
> 
> Ok, here's some log outpt:
> 
> Scenario: A gfs filesystem is mounted on two nodes in a "split cluster"

...

> So gfs is till mounted and writable on prod-db2 even though prod-db1 was
> never fenced.

Yes, you're correct.  I've looked at the logs, and it's a side effect of
the other bug where cman should disallow the merger of the two clusters.
So, in summary, you've identified three different problems, each one is an
effect of the one before it:

1. unidentified openais bug(s) in RHEL5.0 cause the two nodes to initially
   form independent clusters -- fixed in 5.1

2. bz 251966 is triggered by (1) -- fixed in 5.2 (maybe earlier)

3. groupd/fenced don't fence the failed node; this is triggered by (2).
   once (2) is fixed this won't happen

Dave


From changerv at gmail.com  Wed Oct  3 03:33:05 2007
From: changerv at gmail.com (Changer Van)
Date: Wed, 3 Oct 2007 11:33:05 +0800
Subject: [Linux-cluster] service can not be relocated
Message-ID: <9fa3c2e50710022033u466ea0dawfea6bdcf8b278920@mail.gmail.com>

------------------------------

Message: 8
Date: Mon, 01 Oct 2007 10:20:51 -0400
From: Lon Hohberger <lhh at redhat.com>
Subject: Re: [Linux-cluster] service can not be relocated
To: linux clustering <linux-cluster at redhat.com>
Message-ID: <1191248451.4477.15.camel at ayanami.boston.devel.redhat.com>
Content-Type: text/plain
On Mon, 2007-10-01 at 10:14 -0400, Lon Hohberger wrote:
> On Thu, 2007-09-27 at 15:31 +0800, Changer Van wrote:
> > Hi all,
> >
> > Httpd service can not be relocated when I performed the command as
> > follows:
> >
> > # clusvcadm -r httpd
> > Trying to relocate service:httpd...Failure
> > service:httpd is now running on node02
>
> Hi, what release are you using?
RHEL 5 (2.6.18-8el5)
> Right, and are there any logs on node01 indicating why it might not be
> started?
No, there aren't.
But service httpd was relocated to node01 while cluster member was specified

like 'clusvcadm -r httpd  -m node01'.

Now the service was on node01.
I did a test as follows:

I unplugged network cable of node01 for a while then plugged in again.
Service cman was terminated on node02 suddenly,
and it could not stop on node02.

logs on node02:
node02 openais[2813]: [CLM  ] CLM CONFIGURATION CHANGE
node02 openais[2813]: [CLM  ] New Configuration:
node02 openais[2813]: [CLM  ]   r(0) ip(192.168.0.221)
node02 openais[2813]: [CLM  ] Members Left:
node02 openais[2813]: [CLM  ] Members Joined:
node02 openais[2813]: [SYNC ] This node is within the primary component and
will provide service.
node02 openais[2813]: [CLM  ] CLM CONFIGURATION CHANGE
node02 openais[2813]: [CLM  ] New Configuration:
node02 openais[2813]: [CLM  ]   r(0) ip(192.168.0.219)
node02 openais[2813]: [CLM  ]   r(0) ip(192.168.0.221)
node02 openais[2813]: [CLM  ] Members Left:
node02 openais[2813]: [CLM  ] Members Joined:
node02 openais[2813]: [CLM  ]   r(0) ip(192.168.0.219)
node02 openais[2813]: [SYNC ] This node is within the primary component and
will provide service.
node02 openais[2813]: [TOTEM] entering OPERATIONAL state.
node02 openais[2813]: [MAIN ] Killing node node01 because it has rejoined
the cluster without cman_tool join
node02 openais[2813]: [CMAN ] cman killed by node 2 for reason 3
node02 dlm_controld[2843]: groupd is down, exiting
node02 kernel: dlm: closing connection to node 1
node02 gfs_controld[2849]: groupd_dispatch error -1 errno 11
node02 gfs_controld[2849]: groupd connection died
node02 gfs_controld[2849]: cluster is down, exiting
node02 ccsd[2807]: Unable to connect to cluster infrastructure after 30
seconds.
node02 ccsd[2807]: Unable to connect to cluster infrastructure after 60
seconds.
node02 ccsd[2807]: Unable to connect to cluster infrastructure after 90
seconds.

Any help would be greatly appreciated.
-- 
Regards,
Changer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/ae5bd1b8/attachment.htm>

From bernard.chew at muvee.com  Wed Oct  3 07:14:25 2007
From: bernard.chew at muvee.com (Bernard Chew)
Date: Wed, 3 Oct 2007 15:14:25 +0800
Subject: [Linux-cluster] fence_xvmd in RHEL5 with a virtual domU cluster
Message-ID: <229C73600EB0E54DA818AB599482BCE901C0AB46@shadowfax.sg.muvee.net>

Hi,

I follow the steps below to use fence_xvmd in RHEL5;

(1) configure dom0 like a 1-node cluster 
(2) Add "<fence_xvmd/>" to cluster.conf in dom0 as a child of the
"<cluster>" tag.
(3) dd if=/dev/urandom of=/etc/cluster/fence_xvm.key bs=4096 count=1
(4) scp /etc/cluster/fence_xvm.key root virtual_node_1:/etc/cluster
(5) scp /etc/cluster/fence_xvm.key root virtual_node_2:/etc/cluster
(6) Start cman on dom0 - this should start fence_xvmd for you

However, I encounter the following errors while running "fence_xvmd
-fddddddddd" on the dom0 node and "fence_xvm -H <domain_name> -o null"
on one of the guests;

Hash mismatch:
PKT =
94598bef00f4bc3198032800b714bca59581404f8ca2e9c8ea8bb1119840e83c00000000
00000000000000000000000000000000000000000000000000000000
EXP =
85168eb74638ff044c31a4749dba9cc0b9c66e319398dcc8cd97ee4cf1e3936800000000
00000000000000000000000000000000000000000000000000000000
Key mismatch; dropping packet

Any idea why?

Regards,
Bernard Chew


From ben.yarwood at juno.co.uk  Wed Oct  3 11:53:28 2007
From: ben.yarwood at juno.co.uk (Ben Yarwood)
Date: Wed, 3 Oct 2007 12:53:28 +0100
Subject: [Linux-cluster] fencing using rps-10
Message-ID: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>

I think the documentation for using the rps10 fence device is incorrect
(http://sources.redhat.com/cluster/doc/cluster_schema_rhel5.html), or else there is a bug in the fence agent in rhel5:

The doc does not mention that you must specify an "option" attribute or else the agent returns an error.  Eg.

<fencedevice agent="fence_rps10" name="rps10" device="/dev/ttyS0" port="0" option="reboot" />
  
will work but without the "option" attribute you get the error:

failed: operation must be 'on', 'off', or 'reboot'


Thanks
Ben 


From Jeremyc at tasconline.com  Wed Oct  3 14:36:39 2007
From: Jeremyc at tasconline.com (Jeremy Carroll)
Date: Wed, 3 Oct 2007 09:36:39 -0500
Subject: [Linux-cluster] VMWare Fencing / RHCS 4
In-Reply-To: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
Message-ID: <BC16657A85D83746AFF99A42587CC6BC07C780F0@exchange2003.tasconline.com>

Does anybody here know of a fencing module that would work with VMWare
ESX Server 3? We utilize VMWare for our cluster infrastructure and would
like to put fencing in place to power down virtual machines.

Thanks!


From kanderso at redhat.com  Wed Oct  3 14:45:06 2007
From: kanderso at redhat.com (Kevin Anderson)
Date: Wed, 03 Oct 2007 09:45:06 -0500
Subject: [Linux-cluster] VMWare Fencing / RHCS 4
In-Reply-To: <BC16657A85D83746AFF99A42587CC6BC07C780F0@exchange2003.tasconline.com>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
	<BC16657A85D83746AFF99A42587CC6BC07C780F0@exchange2003.tasconline.com>
Message-ID: <1191422706.2718.24.camel@dhcp80-204.msp.redhat.com>

On Wed, 2007-10-03 at 09:36 -0500, Jeremy Carroll wrote:
> Does anybody here know of a fencing module that would work with VMWare
> ESX Server 3? We utilize VMWare for our cluster infrastructure and would
> like to put fencing in place to power down virtual machines.
> 
Our desire is to use the fence_xvm/fence_xvmd agent for all virtual
machine management in the clusters.  The problem with fencing virtual
machines is knowing on which physical machine the virtual instance is
executing.  With the ability to failover/restart/migrate virtual
instances, fence_xvmd maintains that status and tracks the movement.

This issue is that fence_xvmd uses libvirt interfaces to do this for xen
and other virtual engines.  However, libvirt does not have APIs to
control VMWare instances due to VMWare not providing/documenting their
control points.  Given the lack of documentation, it will be problematic
to integrate that capability into the open source products.

So, put pressure on VMWare from a customer standpoint to open up their
interfaces.

Thanks
Kevin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/3e4a9558/attachment.htm>

From jbrassow at redhat.com  Wed Oct  3 14:57:17 2007
From: jbrassow at redhat.com (Jonathan Brassow)
Date: Wed, 3 Oct 2007 09:57:17 -0500
Subject: [Linux-cluster] Re: Some ideas on changes to the lvm.sh agent (or
	new agents).
In-Reply-To: <1190996054.5802.30.camel@localhost>
References: <1190996054.5802.30.camel@localhost>
Message-ID: <3984516A-D158-487B-BC6D-AAD5850746CD@redhat.com>

Great stuff!  Much of what you are describing I've thought about in  
the past, but just haven't had the cycles to work on.  You can see in  
the script itself, the comments at the top mention the desire to  
operate on the VG level.  You can also see a couple vg_* functions  
that simply return error right now, but were intended to be filled in.

Comments in-line.

On Sep 28, 2007, at 11:14 AM, Simone Gotti wrote:

> Hi,
>
> Trying to use a non cluster vg in redhat cluster I noticed that  
> lvm.sh,
> to avoid metadata corruption, is forcing the need of only one lv  
> per vg.
>
> I was thinking that other clusters don't have this limitation as they
> let you just use a vg only on one node at a time (and also on one
> service group at a time).
>
> To test if this was possible with lvm2 I made little changes to lvm.sh
> (just variables renames, use of vgchange instead of lvchange for tag
> adding) and using the same changes needed to /etc/lvm/lvm.conf
> (volume_list = [ "rootvgname", "@my_hostname" ]) looks like this idea
> was working.
>
> I can activate the vg and all of its volume only on the node with  
> the vg
> tagged with its hostname and the start on the other nodes is refused.
>
> Now, will this idea be accepted? If so these are a list of possible
> needed changes and other ideas:
>
> *) Make <parameter name="vg_name" required="1"> also unique="1" or
> better primary="1" and remove the parameter "name" as only one service
> can use a vg.

Sounds reasonable.  Be careful when using those parameters though,  
they often result in cryptic error messages that are tough to  
follow.  I do checks in lvm.sh where possible to be able to give the  
user more information on what went wrong.

>
> *) What vg_status should do?
> a) Monitor all the LVs
> 	or
> b) Check only the VG and use ANOTHER resource agent for every lv  
> used by
> the cluster? So I can create/remove/modify lvs on that vg that aren't
> under rgmanager control without any error reported by the status
> functions of the lvm.sh agent.
> Also other clusters distinguish between vg and lv and they have 2
> different agents for them.

This is were things get difficult.  It would be ok to modify lvs on  
that vg as long as it's on the same machine that has ownership.  Tags  
should prevent otherwise, so should be ok.

User would have to be careful (or barriers would have to prevent)  
users from assigning different LVs in the same VG to different  
services.  Otherwise, if a service fails (application level) and must  
be moved to a different machine, we would have to find a way to move  
all services associated with the VG to the next machine.  I think  
there are ways to mandate this (that service A stick with service B),  
but we would have to have a way to enforce it.

> Creating two new agents will also leave the actual lvm.sh without
> changes and keep backward compatibility for who is already using it.
>
> Something like this (lets call lvm_vg and lvm_lv respectively the  
> agents
> for the vg and the lv):
>
> <service name="foo">
>   <lvm_vg vgname="vg01">
>      <lvm_lv lvname="lv01/>
>      <lvm_lv lvname="lv01/>
>      <script .... />
>   </lvm_vg>
> </service>

I'd have to be convinced of this...  I'd hate to see two 'lvm_lv's  
that refer to the same VG in different services... or would lvm_lv  
somehow require a parent lvm_vg, and make lvm_vg have a 'unique=1'?   
In any case, we are forced to move an entire VG at a time, so I'm not  
sure it would make sense to break it up.

> *) Another problem that is present just now is that lvm should be
> changed to avoid any operation on a non activable vg or lv. In these
> days you cannot be able to start a vg/lv as its not tagged with the
> hostname but you can remove/resize it without any problem. :D

Yes, tags need to be tightened-up.  You should never be able to alter  
metadata of LVM from a machine that does not have permission.

Another improvement I'd like to see is automatic VG ownership.   
Basically, tighten-up the tagging as stated above and add a unique  
machine specific tag to new VGs by default.  (For clusters, this tag  
could be the cluster name.)  This does a number of things, but  
primarily:
1) restricts access to the machine (or cluster) that created the VG.
- now if you move a hard drive to a different machine, it won't  
conflict with the other hard drives there (which might cause  
confusion over which device is the actual root volume).
2) simplifies HA LVM (lvm.sh + rgmanager)
- there would no longer be a need to edit lvm.conf to add the machine  
name, etc...
One of the questions surrounding this is how do you generate the  
unique id and when?  You would probably need it upon installation...

There are many kinds of fault scenarios.  Thinking through each one  
of them will be important when trying to implement a by-VG model vs a  
by-LV model.  I usually stumble across an issue that gives me pause  
before hastily adding a new feature, but I would welcome further  
thought/code.

  brassow


From jlauro at umflint.edu  Wed Oct  3 15:07:47 2007
From: jlauro at umflint.edu (Lauro, John)
Date: Wed, 3 Oct 2007 11:07:47 -0400
Subject: [Linux-cluster] VMWare Fencing / RHCS 4
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk><BC16657A85D83746AFF99A42587CC6BC07C780F0@exchange2003.tasconline.com>
	<1191422706.2718.24.camel@dhcp80-204.msp.redhat.com>
Message-ID: <9E5D3AC73F476945B8BC2E86AA343F0061EA31@its-emb2.umflint.edu>

Vmware has always had fairly open interfaces.

 
Have at it:  http://www.vmware.com/support/developer/

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kevin Anderson
Sent: Wednesday, October 03, 2007 10:45 AM
To: linux clustering
Subject: Re: [Linux-cluster] VMWare Fencing / RHCS 4

 
On Wed, 2007-10-03 at 09:36 -0500, Jeremy Carroll wrote: 

 
Does anybody here know of a fencing module that would work with VMWare
ESX Server 3? We utilize VMWare for our cluster infrastructure and
would
like to put fencing in place to power down virtual machines.
 

Our desire is to use the fence_xvm/fence_xvmd agent for all virtual
machine management in the clusters.  The problem with fencing virtual
machines is knowing on which physical machine the virtual instance is
executing.  With the ability to failover/restart/migrate virtual
instances, fence_xvmd maintains that status and tracks the movement.

This issue is that fence_xvmd uses libvirt interfaces to do this for
xen and other virtual engines.  However, libvirt does not have APIs to
control VMWare instances due to VMWare not providing/documenting their
control points.  Given the lack of documentation, it will be
problematic to integrate that capability into the open source
products.

So, put pressure on VMWare from a customer standpoint to open up their
interfaces.

Thanks
Kevin 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/4fff6fe8/attachment.htm>

From Jeremyc at tasconline.com  Wed Oct  3 15:22:14 2007
From: Jeremyc at tasconline.com (Jeremy Carroll)
Date: Wed, 3 Oct 2007 10:22:14 -0500
Subject: [Linux-cluster] VMWare Fencing / RHCS 4
In-Reply-To: <9E5D3AC73F476945B8BC2E86AA343F0061EA31@its-emb2.umflint.edu>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk><BC16657A85D83746AFF99A42587CC6BC07C780F0@exchange2003.tasconline.com><1191422706.2718.24.camel@dhcp80-204.msp.redhat.com>
	<9E5D3AC73F476945B8BC2E86AA343F0061EA31@its-emb2.umflint.edu>
Message-ID: <BC16657A85D83746AFF99A42587CC6BC07C780F1@exchange2003.tasconline.com>

It seems that someone at RedHat wrote a fencing agent that I found
online.

 
http://64.233.167.104/search?q=cache:DVufMqgyn_wJ:people.ubuntu.com/~fab
bione/fence_vmware+esx+fencing&hl=en&ct=clnk&cd=5&gl=us&client=firefox-a

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lauro, John
Sent: Wednesday, October 03, 2007 10:08 AM
To: linux clustering
Subject: RE: [Linux-cluster] VMWare Fencing / RHCS 4

 
Vmware has always had fairly open interfaces.

 
Have at it:  http://www.vmware.com/support/developer/

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kevin Anderson
Sent: Wednesday, October 03, 2007 10:45 AM
To: linux clustering
Subject: Re: [Linux-cluster] VMWare Fencing / RHCS 4

 
On Wed, 2007-10-03 at 09:36 -0500, Jeremy Carroll wrote: 

 
Does anybody here know of a fencing module that would work with VMWare
ESX Server 3? We utilize VMWare for our cluster infrastructure and would
like to put fencing in place to power down virtual machines.
 

Our desire is to use the fence_xvm/fence_xvmd agent for all virtual
machine management in the clusters.  The problem with fencing virtual
machines is knowing on which physical machine the virtual instance is
executing.  With the ability to failover/restart/migrate virtual
instances, fence_xvmd maintains that status and tracks the movement.

This issue is that fence_xvmd uses libvirt interfaces to do this for xen
and other virtual engines.  However, libvirt does not have APIs to
control VMWare instances due to VMWare not providing/documenting their
control points.  Given the lack of documentation, it will be problematic
to integrate that capability into the open source products.

So, put pressure on VMWare from a customer standpoint to open up their
interfaces.

Thanks
Kevin 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/ef97769c/attachment.htm>

From kanderso at redhat.com  Wed Oct  3 17:56:15 2007
From: kanderso at redhat.com (Kevin Anderson)
Date: Wed, 03 Oct 2007 12:56:15 -0500
Subject: [Linux-cluster] VMWare Fencing / RHCS 4
In-Reply-To: <9E5D3AC73F476945B8BC2E86AA343F0061EA31@its-emb2.umflint.edu>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
	<BC16657A85D83746AFF99A42587CC6BC07C780F0@exchange2003.tasconline.com>
	<1191422706.2718.24.camel@dhcp80-204.msp.redhat.com>
	<9E5D3AC73F476945B8BC2E86AA343F0061EA31@its-emb2.umflint.edu>
Message-ID: <1191434175.2720.28.camel@dhcp80-204.msp.redhat.com>

Interesting, will ask the libvirt guys.  

these apis seem to be targeted at Java and C# environments, might be the
primary problem with using them.  Are there Linux C++ interfaces
available?

Kevin

On Wed, 2007-10-03 at 11:07 -0400, Lauro, John wrote:
> Vmware has always had fairly open interfaces.
> 
>  
> 
> Have at it:  http://www.vmware.com/support/developer/
> 
>  
> 
>  
> 
>  
> 
>                                    
> ______________________________________________________________________
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kevin Anderson
> Sent: Wednesday, October 03, 2007 10:45 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] VMWare Fencing / RHCS 4
> 
> 
>  
> 
> On Wed, 2007-10-03 at 09:36 -0500, Jeremy Carroll wrote: 
> 
>  
> Does anybody here know of a fencing module that would work with VMWare
> ESX Server 3? We utilize VMWare for our cluster infrastructure and would
> like to put fencing in place to power down virtual machines.
>  
> 
> Our desire is to use the fence_xvm/fence_xvmd agent for all virtual
> machine management in the clusters.  The problem with fencing virtual
> machines is knowing on which physical machine the virtual instance is
> executing.  With the ability to failover/restart/migrate virtual
> instances, fence_xvmd maintains that status and tracks the movement.
> 
> This issue is that fence_xvmd uses libvirt interfaces to do this for
> xen and other virtual engines.  However, libvirt does not have APIs to
> control VMWare instances due to VMWare not providing/documenting their
> control points.  Given the lack of documentation, it will be
> problematic to integrate that capability into the open source
> products.
> 
> So, put pressure on VMWare from a customer standpoint to open up their
> interfaces.
> 
> Thanks
> Kevin 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/c9984545/attachment.htm>

From rhurst at bidmc.harvard.edu  Wed Oct  3 18:19:03 2007
From: rhurst at bidmc.harvard.edu (rhurst at bidmc.harvard.edu)
Date: Wed, 3 Oct 2007 14:19:03 -0400
Subject: [Linux-cluster] RHEL AS 4.5 GFS bug after removing a file?
Message-ID: <1191435543.6423.19.camel@xw9300.bidmc.harvard.edu>

This is a weird use-case:

Our /home is mounted on GFS in an 11-node cluster.
I develop a script with 755 permission bits in my ~/bin.
I test it.
After it is tested, I `cp ~/bin/mybackup.sh /usr/local/sbin` (a local
ext3 filesystem)
Later, I remove ~/bin/mybackup.sh
Now whenever I invoke mybackup.sh, bash
returns /home/rhurst/bin/mybackup.sh: No such file or directory
and `which mybackup.sh` returns /usr/local/sbin/mybackup.sh

Who's caching this wrong here, bash or GFS?


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/54860783/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 2178 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/54860783/attachment.p7s>

From Alexandre.Racine at mhicc.org  Wed Oct  3 14:55:30 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Wed, 3 Oct 2007 10:55:30 -0400
Subject: [Linux-cluster] fencing with a DRAC in a chassis
References: <229C73600EB0E54DA818AB599482BCE901C0AB46@shadowfax.sg.muvee.net>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C34D3@cumulonimbus.RG.local>

Hi all,

Are there some people with examples of fencing with DRAC in a chassis? Like the hole cluster.conf file? (remove the passwords)
For example, I need to fence the server in the slot 6 of the chassis witch use DRAC. What is the configuration for that?

Thanks.


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2585 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/717c4d4a/attachment.bin>

From jos at xos.nl  Wed Oct  3 18:57:01 2007
From: jos at xos.nl (Jos Vos)
Date: Wed, 3 Oct 2007 20:57:01 +0200
Subject: [Linux-cluster] RHEL AS 4.5 GFS bug after removing a file?
In-Reply-To: <1191435543.6423.19.camel@xw9300.bidmc.harvard.edu>
References: <1191435543.6423.19.camel@xw9300.bidmc.harvard.edu>
Message-ID: <20071003185701.GA25471@jasmine.xos.nl>

On Wed, Oct 03, 2007 at 02:19:03PM -0400, rhurst at bidmc.harvard.edu wrote:

> Our /home is mounted on GFS in an 11-node cluster.
> I develop a script with 755 permission bits in my ~/bin.
> I test it.
> After it is tested, I `cp ~/bin/mybackup.sh /usr/local/sbin` (a local
> ext3 filesystem)
> Later, I remove ~/bin/mybackup.sh
> Now whenever I invoke mybackup.sh, bash
> returns /home/rhurst/bin/mybackup.sh: No such file or directory
> and `which mybackup.sh` returns /usr/local/sbin/mybackup.sh
> 
> Who's caching this wrong here, bash or GFS?

What happens if you do "hash -r" before invoking "mybackup.sh"  after
the removal?

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From padige at yahoo.com  Wed Oct  3 20:34:54 2007
From: padige at yahoo.com (AnuSri Padige)
Date: Wed, 3 Oct 2007 13:34:54 -0700 (PDT)
Subject: [Linux-cluster] adding virtual servers through piranha not working
Message-ID: <393259.48218.qm@web36808.mail.mud.yahoo.com>

Trying to add some vitural servers through piranha is not working. When trying to click on ADD/EDIT/DELETE button, nothing happens. Tried restarting piranha-gui, didn't help. Tried to restart pulse. It shutdown but, not coming back. Got an error:
   
  Starting pulse: pulse: cannot create heartbeat socket. running as root?
                                                           [FAILED]

  Pls. advise.
   
  DJ
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/ad29a77e/attachment.htm>

From lhh at redhat.com  Wed Oct  3 20:48:06 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 03 Oct 2007 16:48:06 -0400
Subject: [Linux-cluster] fencing using rps-10
In-Reply-To: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
Message-ID: <1191444486.4477.97.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-03 at 12:53 +0100, Ben Yarwood wrote:
> I think the documentation for using the rps10 fence device is incorrect
> (http://sources.redhat.com/cluster/doc/cluster_schema_rhel5.html), or else there is a bug in the fence agent in rhel5:
> 
> The doc does not mention that you must specify an "option" attribute or else the agent returns an error.  Eg.
> 
> <fencedevice agent="fence_rps10" name="rps10" device="/dev/ttyS0" port="0" option="reboot" />
>   
> will work but without the "option" attribute you get the error:
> 
> failed: operation must be 'on', 'off', or 'reboot'

Ouch.  Could you file a bz?  It should default to reboot if unspecified.

-- Lon


From lhh at redhat.com  Wed Oct  3 20:49:57 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 03 Oct 2007 16:49:57 -0400
Subject: [Linux-cluster] fence_xvmd in RHEL5 with a virtual domU cluster
In-Reply-To: <229C73600EB0E54DA818AB599482BCE901C0AB46@shadowfax.sg.muvee.net>
References: <229C73600EB0E54DA818AB599482BCE901C0AB46@shadowfax.sg.muvee.net>
Message-ID: <1191444597.4477.102.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-03 at 15:14 +0800, Bernard Chew wrote:
> Hi,
> 
> I follow the steps below to use fence_xvmd in RHEL5;
> 
> (1) configure dom0 like a 1-node cluster 
> (2) Add "<fence_xvmd/>" to cluster.conf in dom0 as a child of the
> "<cluster>" tag.
> (3) dd if=/dev/urandom of=/etc/cluster/fence_xvm.key bs=4096 count=1
> (4) scp /etc/cluster/fence_xvm.key root virtual_node_1:/etc/cluster
> (5) scp /etc/cluster/fence_xvm.key root virtual_node_2:/etc/cluster
> (6) Start cman on dom0 - this should start fence_xvmd for you
> 
> However, I encounter the following errors while running "fence_xvmd
> -fddddddddd" on the dom0 node and "fence_xvm -H <domain_name> -o null"
> on one of the guests;
> 
> Hash mismatch:
> PKT =
> 94598bef00f4bc3198032800b714bca59581404f8ca2e9c8ea8bb1119840e83c00000000
> 00000000000000000000000000000000000000000000000000000000
> EXP =
> 85168eb74638ff044c31a4749dba9cc0b9c66e319398dcc8cd97ee4cf1e3936800000000
> 00000000000000000000000000000000000000000000000000000000
> Key mismatch; dropping packet
> 
> Any idea why?

Are you using 5.0.z errata packages?  I thought this was fixed...

-- Lon


From lhh at redhat.com  Wed Oct  3 20:55:30 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 03 Oct 2007 16:55:30 -0400
Subject: [Linux-cluster] [PATCH] fix rps-10 bug where an option was not
	specified
In-Reply-To: <1191444486.4477.97.camel@ayanami.boston.devel.redhat.com>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
	<1191444486.4477.97.camel@ayanami.boston.devel.redhat.com>
Message-ID: <1191444930.4477.104.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-03 at 16:48 -0400, Lon Hohberger wrote:
> On Wed, 2007-10-03 at 12:53 +0100, Ben Yarwood wrote:
> > I think the documentation for using the rps10 fence device is incorrect
> > (http://sources.redhat.com/cluster/doc/cluster_schema_rhel5.html), or else there is a bug in the fence agent in rhel5:
> > 
> > The doc does not mention that you must specify an "option" attribute or else the agent returns an error.  Eg.
> > 
> > <fencedevice agent="fence_rps10" name="rps10" device="/dev/ttyS0" port="0" option="reboot" />
> >   
> > will work but without the "option" attribute you get the error:
> > 
> > failed: operation must be 'on', 'off', or 'reboot'
> 
> Ouch.  Could you file a bz?  It should default to reboot if unspecified.

-- Lon

-------------- next part --------------
A non-text attachment was scrubbed...
Name: rps10-option.patch
Type: text/x-patch
Size: 1241 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071003/34654e27/attachment.bin>

From simone.gotti at email.it  Wed Oct  3 22:18:53 2007
From: simone.gotti at email.it (Simone Gotti)
Date: Thu, 04 Oct 2007 00:18:53 +0200
Subject: [Linux-cluster] Re: Some ideas on changes to the lvm.sh agent (or
	new agents).
In-Reply-To: <3984516A-D158-487B-BC6D-AAD5850746CD@redhat.com>
References: <1190996054.5802.30.camel@localhost>
	<3984516A-D158-487B-BC6D-AAD5850746CD@redhat.com>
Message-ID: <1191449933.3171.34.camel@localhost>

Hi,

thanks for the answer! I added little comments in-line too.

[sorry for this top post]
After I wrote the mail last day, I started implementing my ideas. I used
lvm.sh as a base, but after some changed I discovered more and more
corner cases and issues so in the end it changed quite a lot.

I'll write a new mail attaching this new script. I did some tests with
various scenarios. For example: All Machines that lost the same PVs;
only one machine that lost PVs; removing an LVs, removing the whole VG
etc... But to be sure that there aren't problem I have to do more and
more tests and also try to document them.

Thanks!
Bye!

On Wed, 2007-10-03 at 09:57 -0500, Jonathan Brassow wrote:
> Great stuff!  Much of what you are describing I've thought about in  
> the past, but just haven't had the cycles to work on.  You can see in  
> the script itself, the comments at the top mention the desire to  
> operate on the VG level.  You can also see a couple vg_* functions  
> that simply return error right now, but were intended to be filled in.
> 
> Comments in-line.
> 
> On Sep 28, 2007, at 11:14 AM, Simone Gotti wrote:
> 
> > Hi,
> >
> > Trying to use a non cluster vg in redhat cluster I noticed that  
> > lvm.sh,
> > to avoid metadata corruption, is forcing the need of only one lv  
> > per vg.
> >
> > I was thinking that other clusters don't have this limitation as they
> > let you just use a vg only on one node at a time (and also on one
> > service group at a time).
> >
> > To test if this was possible with lvm2 I made little changes to lvm.sh
> > (just variables renames, use of vgchange instead of lvchange for tag
> > adding) and using the same changes needed to /etc/lvm/lvm.conf
> > (volume_list = [ "rootvgname", "@my_hostname" ]) looks like this idea
> > was working.
> >
> > I can activate the vg and all of its volume only on the node with  
> > the vg
> > tagged with its hostname and the start on the other nodes is refused.
> >
> > Now, will this idea be accepted? If so these are a list of possible
> > needed changes and other ideas:
> >
> > *) Make <parameter name="vg_name" required="1"> also unique="1" or
> > better primary="1" and remove the parameter "name" as only one service
> > can use a vg.
> 
> Sounds reasonable.  Be careful when using those parameters though,  
> they often result in cryptic error messages that are tough to  
> follow.  I do checks in lvm.sh where possible to be able to give the  
> user more information on what went wrong.
My idea was that a VG can be owned by a single service. The check you
added in lvm.sh forced to a single LV in the VG, but one was able anyway
to use multiple lvm.sh with the same vg_name in different services
leading to problems anyway. Probably I missed something... :D

> 
> >
> > *) What vg_status should do?
> > a) Monitor all the LVs
> > 	or
> > b) Check only the VG and use ANOTHER resource agent for every lv  
> > used by
> > the cluster? So I can create/remove/modify lvs on that vg that aren't
> > under rgmanager control without any error reported by the status
> > functions of the lvm.sh agent.
> > Also other clusters distinguish between vg and lv and they have 2
> > different agents for them.
> 
> This is were things get difficult.  It would be ok to modify lvs on  
> that vg as long as it's on the same machine that has ownership.  Tags  
> should prevent otherwise, so should be ok.
> 
> User would have to be careful (or barriers would have to prevent)  
> users from assigning different LVs in the same VG to different  
> services.  Otherwise, if a service fails (application level) and must  
> be moved to a different machine, we would have to find a way to move  
> all services associated with the VG to the next machine.  I think  
> there are ways to mandate this (that service A stick with service B),  
> but we would have to have a way to enforce it.
> 
> > Creating two new agents will also leave the actual lvm.sh without
> > changes and keep backward compatibility for who is already using it.
> >
> > Something like this (lets call lvm_vg and lvm_lv respectively the  
> > agents
> > for the vg and the lv):
> >
> > <service name="foo">
> >   <lvm_vg vgname="vg01">
> >      <lvm_lv lvname="lv01/>
> >      <lvm_lv lvname="lv01/>
> >      <script .... />
> >   </lvm_vg>
> > </service>
> 
> I'd have to be convinced of this...  I'd hate to see two 'lvm_lv's  
> that refer to the same VG in different services... or would lvm_lv  
> somehow require a parent lvm_vg, and make lvm_vg have a 'unique=1'?   
> In any case, we are forced to move an entire VG at a time, so I'm not  
> sure it would make sense to break it up.
I thinked about the same problem so for now I tried implementing it in a
unique script.

> 
> > *) Another problem that is present just now is that lvm should be
> > changed to avoid any operation on a non activable vg or lv. In these
> > days you cannot be able to start a vg/lv as its not tagged with the
> > hostname but you can remove/resize it without any problem. :D
> 
> Yes, tags need to be tightened-up.  You should never be able to alter  
> metadata of LVM from a machine that does not have permission.
That's what I was hoping. Good to know is in program to be implemented.

> 
> Another improvement I'd like to see is automatic VG ownership.   
> Basically, tighten-up the tagging as stated above and add a unique  
> machine specific tag to new VGs by default.  (For clusters, this tag  
> could be the cluster name.)  This does a number of things, but  
> primarily:
> 1) restricts access to the machine (or cluster) that created the VG.
> - now if you move a hard drive to a different machine, it won't  
> conflict with the other hard drives there (which might cause  
> confusion over which device is the actual root volume).
> 2) simplifies HA LVM (lvm.sh + rgmanager)
> - there would no longer be a need to edit lvm.conf to add the machine  
> name, etc...
> One of the questions surrounding this is how do you generate the  
> unique id and when?  You would probably need it upon installation...
This looks a great idea! (also if I cannot understand well how this
should be implemented :D )
> 
> There are many kinds of fault scenarios.  Thinking through each one  
> of them will be important when trying to implement a by-VG model vs a  
> by-LV model.  I usually stumble across an issue that gives me pause  
> before hastily adding a new feature, but I would welcome further  
> thought/code.
> 
>   brassow
> 
-- 
Simone Gotti
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071004/07a9d197/attachment.sig>

From simone.gotti at email.it  Wed Oct  3 22:20:18 2007
From: simone.gotti at email.it (Simone Gotti)
Date: Thu, 04 Oct 2007 00:20:18 +0200
Subject: [Linux-cluster] Re: [RFC] New lvm_vg.sh agent for non-clustered
	lvm2 VGs
In-Reply-To: <3984516A-D158-487B-BC6D-AAD5850746CD@redhat.com>
References: <1190996054.5802.30.camel@localhost>
	<3984516A-D158-487B-BC6D-AAD5850746CD@redhat.com>
Message-ID: <1191450018.3171.37.camel@localhost>

Hi,

like reported in the previous mail I tried to implement a new VG based
rgmanager agent for non-clustered lvm2 VGs.

It's a sort of experiment, I just tried to implement in it some ideas
and I'd like to get some suggestions on them just before going on
working on it (it's quite probable that I had wrong assumption or missed
some problems around).

I did some tests with various scenarios. For example: All Machines that
lost the same PVs; only one machine that lost PVs; removing an LVs,
removing the whole VG etc... But to be sure that there aren't problem I
have to do more and more tests and also try to document them.

Below the most important points.

Thanks!
Bye!

===================================================================================
What you can/cannot do with it:

*) Only one service can own a VG. (this mimics other clusters)
*) In the parameter lv_name you can define a space separated list of
volumes to start/monitor. (Is this OCF/xml compliant?). If it's empty
all the LVs are started/monitored.
On stop all the volumes are stopped anyway.

=====
How its implemented:

*) Take control of the whole VG instead of a single LV by tagging the VG
and not the LVs.
*) Optional (can be removed if not wanted): I tried to let add
additional tag to the VG/LVs. The tags used to "lock" to VGs to a node
should be of type CMAN_NODE_${nodename}. So also the "volume_list"
in /etc/lvm/lvm.conf should be changed in this way.
*) Check if, for various reasons (manual intervention, race condition),
the VG has multiple node tags on it.
*) Check that the LVs aren't "node" tagged to avoid that they can be
activated also on other nodes.
*) The service is defined as unique="1" as only one service should use
one VG.
*) Shortened the status intervals (also for debugging pourposes, they
can be increased to better values).

=====
Bug found in current lvm.sh that I tried to fix:

Note: Using mirror devices dmeventd runs "vgreduce --removemissing" only
when a writes fails. During reads nothing is done by dmeventd on the VG
so all programs like vgs/lvs will fail.

*) If one or more PVs are missing when the service is started then
vgs/lvs will return an error and not provide the tags. The scripts
wrongly assumes that the vg (previously lv) isn't owned, tries to steal
it but the tagging fails and so runs "vgreduce --removemissing". But the
VG can be active on another machine.
Solution: First check for missing pv, get tags in partial mode and if
not owned try to fix it issuing a "vgreduce --removemissing"

*) If one or more PVs are missing when the service is stopped then vgs
will return an error and not provide the tags. The scripts wrongly
assumes that the vg (previously lv) isn't owned, tries to steal it but
the tagging fails and so the stop fails too putting the service in a
failed status the need manual intervention.
Solution: First check for missing pv, get tags in partial mode and if
not owned try to fix it issuing a "vgreduce --removemissing"

*) If one or more PVs are missing after the start, the status check will
assume that the node isn't owning as it cannot get the tags and will try
to change the vg, then it return an error that will bring to a service
restart.
Solution: First check for missing pv and if so return an error.
Introduce a recover action that will do the same as start. This will fix
the VG.

*)If one machine loses some disks, dmeventd or the start/stop scripts
will call vgreduce --removemissing. Then, if the service is switched to
another machine that sees all the disks or the disks come back, for
every lvm command launched a warning is issued "Inconsistent metadata
found for VG %s - updating to use version %s".
Solution: Avoid this by calling lvm_exec_resilient on all commands
except on some vgs called with --partial or that needs to get the real
state (vgs doesn't report "Inconsistent metadata found for VG %s -
updating to use version %s" anyway so no problems). As I needed to get
also some output echoed (returned) by lvm_exec_resilient I directed all
the ocf_log output of lvm_exec_resilient to stderr.

=====
Things to do:

*)Maybe increase the start/stop/status timeouts (5 seconds looks too
short for a "vgreduce --removemissing" an big VGs).
*)Implement better validation of parameters.


On Wed, 2007-10-03 at 09:57 -0500, Jonathan Brassow wrote:
> Great stuff!  Much of what you are describing I've thought about in  
> the past, but just haven't had the cycles to work on.  You can see in  
> the script itself, the comments at the top mention the desire to  
> operate on the VG level.  You can also see a couple vg_* functions  
> that simply return error right now, but were intended to be filled in.

>   brassow
> 
-- 
Simone Gotti
-------------- next part --------------
A non-text attachment was scrubbed...
Name: lvm_vg.sh.20061003-2330
Type: application/x-shellscript
Size: 21309 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071004/eccbaa92/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071004/eccbaa92/attachment.sig>

From bernard.chew at muvee.com  Thu Oct  4 02:58:46 2007
From: bernard.chew at muvee.com (Bernard Chew)
Date: Thu, 4 Oct 2007 10:58:46 +0800
Subject: [Linux-cluster] RE: [SOLVED] fence_xvmd in RHEL5 with a virtual
	domU cluster
In-Reply-To: <1191444597.4477.102.camel@ayanami.boston.devel.redhat.com>
References: <229C73600EB0E54DA818AB599482BCE901C0AB46@shadowfax.sg.muvee.net>
	<1191444597.4477.102.camel@ayanami.boston.devel.redhat.com>
Message-ID: <229C73600EB0E54DA818AB599482BCE901C96468@shadowfax.sg.muvee.net>

Thanks for the reminder Lon! Saw the RHBA-2007:0161 Bug Fix Advisory.

Regards,
Bernard

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
Sent: Thursday, October 04, 2007 4:50 AM
To: linux clustering
Subject: Re: [Linux-cluster] fence_xvmd in RHEL5 with a virtual domU
cluster

On Wed, 2007-10-03 at 15:14 +0800, Bernard Chew wrote:
> Hi,
> 
> I follow the steps below to use fence_xvmd in RHEL5;
> 
> (1) configure dom0 like a 1-node cluster 
> (2) Add "<fence_xvmd/>" to cluster.conf in dom0 as a child of the
> "<cluster>" tag.
> (3) dd if=/dev/urandom of=/etc/cluster/fence_xvm.key bs=4096 count=1
> (4) scp /etc/cluster/fence_xvm.key root virtual_node_1:/etc/cluster
> (5) scp /etc/cluster/fence_xvm.key root virtual_node_2:/etc/cluster
> (6) Start cman on dom0 - this should start fence_xvmd for you
> 
> However, I encounter the following errors while running "fence_xvmd
> -fddddddddd" on the dom0 node and "fence_xvm -H <domain_name> -o null"
> on one of the guests;
> 
> Hash mismatch:
> PKT =
>
94598bef00f4bc3198032800b714bca59581404f8ca2e9c8ea8bb1119840e83c00000000
> 00000000000000000000000000000000000000000000000000000000
> EXP =
>
85168eb74638ff044c31a4749dba9cc0b9c66e319398dcc8cd97ee4cf1e3936800000000
> 00000000000000000000000000000000000000000000000000000000
> Key mismatch; dropping packet
> 
> Any idea why?

Are you using 5.0.z errata packages?  I thought this was fixed...

-- Lon


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From Arthur.MESSNER at tilak.at  Thu Oct  4 06:19:21 2007
From: Arthur.MESSNER at tilak.at (=?ISO-8859-1?Q?Arthur=20ME=DFNER?=)
Date: Thu, 04 Oct 2007 08:19:21 +0200
Subject: [Linux-cluster] how to add lvm volume to cluster
Message-ID: <4704A209020000CF0001FC54@tilak.at>

I'm new to this list, so please excuse if this question was
asked and probably solved elsewhere.

How is the suggested way to add a new lun,
create a disk partition on, it, use this partition as
physical volume, create a volume group and create logical volume ?

I have done this on our three node (xen01-xen03) RHEL 5 cluster.
I used a shared storage over fc ( HP EVA 8100 ) with multipath.

- rescan echo "- - -" > /sys/classes/scsi_host/host0/scan on all nodes
  the devices appeared
- done multipath mapping, and set multipath alias
- on one node made the partition
- on one node made the pv
- on one node made the vg
- on onde node made the logical volume

but this volume can't be taken active:

[root at xen02 kiswatch]# vgchange -ay
Error locking on node xen01.tilak.ibk: Volume group for uuid not found:
nEh1p8PzgfJO6Dedzjis1uDO6Yan2W5vew9ZlmZ3xyxbsgKclRwLCNAulf28UPqJ
Error locking on node xen02.tilak.ibk: Volume group for uuid not found:
nEh1p8PzgfJO6Dedzjis1uDO6Yan2W5vew9ZlmZ3xyxbsgKclRwLCNAulf28UPqJ

the command lvdisplay on xen01 show the searched uuid:
[root at xen01 kiswatch]# lvdisplay
  --- Logical volume ---
  LV Name                /dev/lvm1/kiswatch_xvda
  VG Name                lvm1
  LV UUID                ew9Zlm-Z3xy-xbsg-KclR-wLCN-Aulf-28UPqJ
  LV Write Access        read/write
  LV Status              NOT available
  LV Size                5.00 GB
  Current LE             1280
  Segments               1
  Allocation             inherit
  Read ahead sectors     0


after restart of node xen02 the problem disappeared,
but the node xen01 is very busy and should not be restartet.

thanks in advance for any help


From Arthur.MESSNER at tilak.at  Thu Oct  4 06:53:51 2007
From: Arthur.MESSNER at tilak.at (=?ISO-8859-1?Q?Arthur=20ME=DFNER?=)
Date: Thu, 04 Oct 2007 08:53:51 +0200
Subject: [Linux-cluster] gfs2 withdraw while copy to filesystem
Message-ID: <4704AA1F020000CF0001FC5A@tilak.at>


First i build a xen disk image on the local storage fs,
then want "cp" this disk image to the gfs2 filesystem.


cp disk.img /xenfs/storage1/xenmachine/

this filesystem ( /xenfs/storage1 )is gfs2 with option -j 10, lock_dlm

mount option is : noatime,quota=off

I have done this sevreal times on one node
of my two node cluster, no problem.

Yesterday i added another node ( the third ), 
an on this node the same procedure ended up with this.

Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1: fatal:
assertion "x <= length" failed
Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1:   function =
rgblk_search, file = fs/gfs2/rgrp.c, line = 1116
Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1: about to
withdraw this file system
Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1: telling LM to
withdraw
Oct  4 07:44:45 xen03 kernel: GFS2: fsid=xen:storage1.1: withdrawn

On this node and this gfs2 filesystem the crash is reproduceable.
i never tried it on another node, because they are productive.

After reboot, the gfs2 can normally be accessed.
The same if i try with "dd if=xxx.img of=xxx.img"

Any suggestion, where the problem is ?
locking, gfs2 options ....


From swhiteho at redhat.com  Thu Oct  4 09:40:28 2007
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 04 Oct 2007 10:40:28 +0100
Subject: [Linux-cluster] gfs2 withdraw while copy to filesystem
In-Reply-To: <4704AA1F020000CF0001FC5A@tilak.at>
References: <4704AA1F020000CF0001FC5A@tilak.at>
Message-ID: <1191490828.1068.194.camel@quoit>

Hi,

On Thu, 2007-10-04 at 08:53 +0200, Arthur ME?NER wrote:
> First i build a xen disk image on the local storage fs,
> then want "cp" this disk image to the gfs2 filesystem.
> 
> 
> cp disk.img /xenfs/storage1/xenmachine/
> 
> this filesystem ( /xenfs/storage1 )is gfs2 with option -j 10, lock_dlm
> 
> mount option is : noatime,quota=off
> 
> I have done this sevreal times on one node
> of my two node cluster, no problem.
> 
> Yesterday i added another node ( the third ), 
> an on this node the same procedure ended up with this.
> 
> Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1: fatal:
> assertion "x <= length" failed
> Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1:   function =
> rgblk_search, file = fs/gfs2/rgrp.c, line = 1116
> Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1: about to
> withdraw this file system
> Oct  4 07:44:15 xen03 kernel: GFS2: fsid=xen:storage1.1: telling LM to
> withdraw
> Oct  4 07:44:45 xen03 kernel: GFS2: fsid=xen:storage1.1: withdrawn
> 
> On this node and this gfs2 filesystem the crash is reproduceable.
> i never tried it on another node, because they are productive.
> 
> After reboot, the gfs2 can normally be accessed.
> The same if i try with "dd if=xxx.img of=xxx.img"
> 
> Any suggestion, where the problem is ?
> locking, gfs2 options ....
> 

This message means that when GFS2 tried to allocate some blocks it
couldn't find any in the resource group it had previously selected and
in which it has previously reserved some blocks.

The reason that this appears only to affect a single node is that GFS2
tries to keep resource groups local to a single node where it can to
avoid having to pass the lock (and hence also the cache) of the resource
group about the cluster (which is inefficient). So this may show up on
the other nodes in the case that the filesystem gets closer to being
full (which increases the chance that the other nodes will search this
resource group).

I'd suggest in the first instance running GFS2's fsck in order to be
certain that its a problem on the disk, but thats what it looks like to
me. It is probably just the summary information which is out of line
with the actual bitmaps on that resource group, so I wouldn't expect to
see any data loss.

Do you know what kind of fs activity caused that in the first place?

I can't see anything else that you are doing wrong, but I wonder which
kernel version you are using?

Steve.

> 
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From celso at webbertek.com.br  Thu Oct  4 10:25:33 2007
From: celso at webbertek.com.br (Celso K. Webber)
Date: Thu, 04 Oct 2007 07:25:33 -0300
Subject: [Linux-cluster] Can a 2-node Cluster boot-up with only one active
	node?
Message-ID: <4704BF9D.3060501@webbertek.com.br>

Hello all,

Sorry for asking this basic question, I've checked the Cluster FAQ and did 
not find anything related to this.

I have set up a 2-node Cluster, with quorum/qdiskd configured, and 
everything goes fine.

But in the case where I need to shut down the whole cluster, and then for 
some reason I can boot up only one of the nodes, it does not come up, 
staying always at the "Inquorate" status.

What is the correct behaviour? Shouldn't my Cluster come up because I have 
two votes active? In this case each node counts one vote in the cluster, and 
the quorum counts another one.

By the way, I'm setting up qdiskd with no heuristics at all, I have no 
"pingable" router in this network.

Here is the relevant parts of the cluster.conf file:

<?xml version="1.0"?>
<cluster config_version="9" name="clu_NAME">
	<quorumd log_facility="local6" device="/dev/emcpowere1" interval="1" 
min_score="0" tko="10" votes="1"/>
	<fence_daemon post_fail_delay="10" post_join_delay="3"/>
	<clusternodes>
		<clusternode name="node1" votes="1">
			<fence>
				<method name="1">
					<device lanplus="" name="node1-ipmi"/>
				</method>
			</fence>
		</clusternode>
		<clusternode name="node2" votes="1">
			<fence>
				<method name="1">
					<device lanplus="" name="node2-ipmi"/>
				</method>
			</fence>
		</clusternode>
	</clusternodes>
	<cman/>
	<fencedevices>
		<fencedevice agent="fence_ipmilan" auth="none" ipaddr="node1-ipmi" 
login="root" name="node1-ipmi" passwd="PASS"/>
		<fencedevice agent="fence_ipmilan" auth="none" ipaddr="node2-ipmi" 
login="root" name="node2-ipmi" passwd="PASS"/>
	</fencedevices>
	<rm log_facility="local6">
...
	</rm>
</cluster>

-- 


-- 
Esta mensagem foi verificada pelo sistema de antiv?rus e
 acredita-se estar livre de perigo.


From randy.brown at noaa.gov  Thu Oct  4 11:18:32 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Thu, 04 Oct 2007 07:18:32 -0400
Subject: [Linux-cluster] Can a 2-node Cluster boot-up with only one active
	node?
In-Reply-To: <4704BF9D.3060501@webbertek.com.br>
References: <4704BF9D.3060501@webbertek.com.br>
Message-ID: <4704CC08.9070702@noaa.gov>

In a two node cluster I believe you want to use:
<cman expected_votes="1" two_node="1"/>
in cluster.conf.

Randy

Celso K. Webber wrote:
> Hello all,
>
> Sorry for asking this basic question, I've checked the Cluster FAQ and 
> did not find anything related to this.
>
> I have set up a 2-node Cluster, with quorum/qdiskd configured, and 
> everything goes fine.
>
> But in the case where I need to shut down the whole cluster, and then 
> for some reason I can boot up only one of the nodes, it does not come 
> up, staying always at the "Inquorate" status.
>
> What is the correct behaviour? Shouldn't my Cluster come up because I 
> have two votes active? In this case each node counts one vote in the 
> cluster, and the quorum counts another one.
>
> By the way, I'm setting up qdiskd with no heuristics at all, I have no 
> "pingable" router in this network.
>
> Here is the relevant parts of the cluster.conf file:
>
> <?xml version="1.0"?>
> <cluster config_version="9" name="clu_NAME">
>     <quorumd log_facility="local6" device="/dev/emcpowere1" 
> interval="1" min_score="0" tko="10" votes="1"/>
>     <fence_daemon post_fail_delay="10" post_join_delay="3"/>
>     <clusternodes>
>         <clusternode name="node1" votes="1">
>             <fence>
>                 <method name="1">
>                     <device lanplus="" name="node1-ipmi"/>
>                 </method>
>             </fence>
>         </clusternode>
>         <clusternode name="node2" votes="1">
>             <fence>
>                 <method name="1">
>                     <device lanplus="" name="node2-ipmi"/>
>                 </method>
>             </fence>
>         </clusternode>
>     </clusternodes>
>     <cman/>
>     <fencedevices>
>         <fencedevice agent="fence_ipmilan" auth="none" 
> ipaddr="node1-ipmi" login="root" name="node1-ipmi" passwd="PASS"/>
>         <fencedevice agent="fence_ipmilan" auth="none" 
> ipaddr="node2-ipmi" login="root" name="node2-ipmi" passwd="PASS"/>
>     </fencedevices>
>     <rm log_facility="local6">
> ...
>     </rm>
> </cluster>
>


From Arthur.MESSNER at tilak.at  Thu Oct  4 11:37:57 2007
From: Arthur.MESSNER at tilak.at (=?ISO-8859-1?Q?Arthur=20ME=DFNER?=)
Date: Thu, 04 Oct 2007 13:37:57 +0200
Subject: [Linux-cluster] how to add lvm volume to cluster
Message-ID: <4704ECB5020000CF0001FCBE@tilak.at>

Ok, i have found the right button:

after creation of the new lv my problem as described:

[root at xen02 ~]# lvcreate -L 200G -n webint01_xvdb vg_data2
  Error locking on node xen02.tilak.ibk: Volume group for uuid not
found: wP3rMbP704LoUcUawH8eMq77AyRPJ5in43a93ifh0KYJ1HfD77UPFXF52wwrLrEs
  Error locking on node xen03.tilak.ibk: Volume group for uuid not
found: wP3rMbP704LoUcUawH8eMq77AyRPJ5in43a93ifh0KYJ1HfD77UPFXF52wwrLrEs
  Error locking on node xen01.tilak.ibk: Volume group for uuid not
found: wP3rMbP704LoUcUawH8eMq77AyRPJ5in43a93ifh0KYJ1HfD77UPFXF52wwrLrEs
  Failed to activate new LV.

and then 

restart service clvmd

-> the lv is active ...

done on all nodes, xen doesnt matter, everything online


Am Donnerstag, den 04.10.2007, 08:19 +0200 schrieb Arthur ME?NER:
> I'm new to this list, so please excuse if this question was
> asked and probably solved elsewhere.
> 
> How is the suggested way to add a new lun,
> create a disk partition on, it, use this partition as
> physical volume, create a volume group and create logical volume ?
> 
> I have done this on our three node (xen01-xen03) RHEL 5 cluster.
> I used a shared storage over fc ( HP EVA 8100 ) with multipath.
> 
> - rescan echo "- - -" > /sys/classes/scsi_host/host0/scan on all nodes
>   the devices appeared
> - done multipath mapping, and set multipath alias
> - on one node made the partition
> - on one node made the pv
> - on one node made the vg
> - on onde node made the logical volume
> 
> but this volume can't be taken active:
> 
> [root at xen02 kiswatch]# vgchange -ay
> Error locking on node xen01.tilak.ibk: Volume group for uuid not found:
> nEh1p8PzgfJO6Dedzjis1uDO6Yan2W5vew9ZlmZ3xyxbsgKclRwLCNAulf28UPqJ
> Error locking on node xen02.tilak.ibk: Volume group for uuid not found:
> nEh1p8PzgfJO6Dedzjis1uDO6Yan2W5vew9ZlmZ3xyxbsgKclRwLCNAulf28UPqJ
> 
> the command lvdisplay on xen01 show the searched uuid:
> [root at xen01 kiswatch]# lvdisplay
>   --- Logical volume ---
>   LV Name                /dev/lvm1/kiswatch_xvda
>   VG Name                lvm1
>   LV UUID                ew9Zlm-Z3xy-xbsg-KclR-wLCN-Aulf-28UPqJ
>   LV Write Access        read/write
>   LV Status              NOT available
>   LV Size                5.00 GB
>   Current LE             1280
>   Segments               1
>   Allocation             inherit
>   Read ahead sectors     0
> 
> 
> after restart of node xen02 the problem disappeared,
> but the node xen01 is very busy and should not be restartet.
> 
> thanks in advance for any help
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From Santosh.Panigrahi at in.unisys.com  Thu Oct  4 11:39:36 2007
From: Santosh.Panigrahi at in.unisys.com (Panigrahi, Santosh Kumar)
Date: Thu, 4 Oct 2007 17:09:36 +0530
Subject: [Linux-cluster] adding virtual servers through piranha not working
In-Reply-To: <393259.48218.qm@web36808.mail.mud.yahoo.com>
References: <393259.48218.qm@web36808.mail.mud.yahoo.com>
Message-ID: <D566E8CF3538B54D95B925CB69CB4D2A078ED4A2@inblr-exch1.eu.uis.unisys.com>

Check your piranha version. I am using piranha-0.8.4-7 and it is working
fine for me. If you are using an older version, then download the
mentioned one and try. I think it will also work for u.

Thanks

SP

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of AnuSri Padige
Sent: Thursday, October 04, 2007 2:05 AM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] adding virtual servers through piranha not
working

 
Trying to add some vitural servers through piranha is not working. When
trying to click on ADD/EDIT/DELETE button, nothing happens. Tried
restarting piranha-gui, didn't help. Tried to restart pulse. It shutdown
but, not coming back. Got an error:

 
Starting pulse: pulse: cannot create heartbeat socket. running as root?
                                                           [FAILED]

Pls. advise.

 
DJ

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071004/bf891b30/attachment.htm>

From rhurst at bidmc.harvard.edu  Thu Oct  4 12:50:25 2007
From: rhurst at bidmc.harvard.edu (rhurst at bidmc.harvard.edu)
Date: Thu, 4 Oct 2007 08:50:25 -0400
Subject: [Linux-cluster] RHEL AS 4.5 GFS bug after removing a file?
In-Reply-To: <20071003185701.GA25471@jasmine.xos.nl>
References: <1191435543.6423.19.camel@xw9300.bidmc.harvard.edu>
	<20071003185701.GA25471@jasmine.xos.nl>
Message-ID: <1191502225.13666.1.camel@xw9300.bidmc.harvard.edu>

That worked, thanks!  I did not know about "hash -r" and its purpose for
cases like this.


On Wed, 2007-10-03 at 20:57 +0200, Jos Vos wrote:

> On Wed, Oct 03, 2007 at 02:19:03PM -0400, rhurst at bidmc.harvard.edu wrote:
> 
> > Our /home is mounted on GFS in an 11-node cluster.
> > I develop a script with 755 permission bits in my ~/bin.
> > I test it.
> > After it is tested, I `cp ~/bin/mybackup.sh /usr/local/sbin` (a local
> > ext3 filesystem)
> > Later, I remove ~/bin/mybackup.sh
> > Now whenever I invoke mybackup.sh, bash
> > returns /home/rhurst/bin/mybackup.sh: No such file or directory
> > and `which mybackup.sh` returns /usr/local/sbin/mybackup.sh
> > 
> > Who's caching this wrong here, bash or GFS?
> 
> What happens if you do "hash -r" before invoking "mybackup.sh"  after
> the removal?


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071004/bfa0dd54/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 2178 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071004/bfa0dd54/attachment.p7s>

From lhh at redhat.com  Thu Oct  4 14:35:13 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 04 Oct 2007 10:35:13 -0400
Subject: [Linux-cluster] Can a 2-node Cluster boot-up with only one
	active node?
In-Reply-To: <4704BF9D.3060501@webbertek.com.br>
References: <4704BF9D.3060501@webbertek.com.br>
Message-ID: <1191508513.4477.117.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-04 at 07:25 -0300, Celso K. Webber wrote:
> Hello all,
> 
> Sorry for asking this basic question, I've checked the Cluster FAQ and did 
> not find anything related to this.
> 
> I have set up a 2-node Cluster, with quorum/qdiskd configured, and 
> everything goes fine.
> 
> But in the case where I need to shut down the whole cluster, and then for 
> some reason I can boot up only one of the nodes, it does not come up, 
> staying always at the "Inquorate" status.

> What is the correct behaviour? Shouldn't my Cluster come up because I have 
> two votes active? In this case each node counts one vote in the cluster, and 
> the quorum counts another one.

cman_tool status / cman_tool nodes output would be helpful

Also, which version of cman do you have?

-- Lon


From lhh at redhat.com  Thu Oct  4 14:35:26 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 04 Oct 2007 10:35:26 -0400
Subject: [Linux-cluster] Can a 2-node Cluster boot-up with only one
	active node?
In-Reply-To: <4704CC08.9070702@noaa.gov>
References: <4704BF9D.3060501@webbertek.com.br>  <4704CC08.9070702@noaa.gov>
Message-ID: <1191508526.4477.119.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-04 at 07:18 -0400, Randy Brown wrote:
> In a two node cluster I believe you want to use:
> <cman expected_votes="1" two_node="1"/>
> in cluster.conf.

Not when using qdisk.

-- Lon


From celso at webbertek.com.br  Thu Oct  4 16:28:12 2007
From: celso at webbertek.com.br (Celso K. Webber)
Date: Thu, 04 Oct 2007 13:28:12 -0300
Subject: [Linux-cluster] Can a 2-node Cluster boot-up with only one active
	node?
Message-ID: <4705149C.1000307@webbertek.com.br>

On Thu, 04 Oct 2007 10:35:13 -0400, Lon Hohberger wrote:

 > > What is the correct behaviour? Shouldn't my Cluster come up because I
have
 > > two votes active? In this case each node counts one vote in the cluster,
and
 > > the quorum counts another one.

 > cman_tool status / cman_tool nodes output would be helpful

 > Also, which version of cman do you have?

 > -- Lon

Hi Lon,

Here are some relevant information from the Cluster:

** What is happening:
  If I boot node1 with node2 powered off, it stops for 5 minutes during the
start of ccsd, and after that it regains quorum, qdiskd starts successfully,
but fenced keeps trying to start for 2 minutes and then it gives up with
a "failed" message.

** Relevant log messages collected after boot:
Oct  4 11:51:13 hercules01 kernel: CMAN: Waiting to join or form a Linux-
cluster
Oct  4 11:51:13 hercules01 ccsd[9144]: Connected to cluster infrastruture
via: CMAN/SM Plugin v1.1.7.4
Oct  4 11:51:13 hercules01 ccsd[9144]: Initial status:: Inquorate
Oct  4 11:51:45 hercules01 kernel: CMAN: forming a new cluster
Oct  4 11:56:45 hercules01 cman: Timed-out waiting for cluster failed
       ^^^^^^^^
       5 minutes later
Oct  4 11:56:45 hercules01 lock_gulmd: no <gulm> section detected
in /etc/cluster/cluster.conf succeeded
Oct  4 11:56:45 hercules01 qdiskd: Starting the Quorum Disk Daemon: succeeded
Oct  4 11:57:02 hercules01 kernel: CMAN: quorum regained, resuming activity
Oct  4 11:57:02 hercules01 ccsd[9144]: Cluster is quorate.  Allowing
connections.
Oct  4 11:58:45 hercules01 fenced: startup failed
       ^^^^^^^^
       exactly 2 minutes after the qdiskd message above, I've noticed that
fenced is started in the init scripts with "fence_tool -t 120 join -w"
Oct  4 11:59:38 hercules01 rgmanager: clurgmgrd startup failed
       ^^^^^^^^
       after other service boot up ok, rgmanager fails to boot, probably
because fenced failed to start
Oct  4 11:56:45 hercules01 qdiskd[9292]: <info> Quorum Daemon Initializing
Oct  4 11:56:55 hercules01 qdiskd[9292]: <info> Initial score 1/1
Oct  4 11:56:55 hercules01 qdiskd[9292]: <info> Initialization complete
Oct  4 11:56:55 hercules01 qdiskd[9292]: <notice> Score sufficient for
master operation (1/1; required=1); upgrading
Oct  4 11:57:01 hercules01 qdiskd[9292]: <info> Assuming master role
Oct  4 11:59:08 hercules01 clurgmgrd[10548]: <notice> Resource Group Manager
Starting
Oct  4 11:59:08 hercules01 clurgmgrd[10548]: <info> Loading Service Data
Oct  4 11:59:08 hercules01 clurgmgrd[10548]: <info> Initializing Services
... <messages of stopping the services and making sure filesystems are
unmounted>
Oct  4 11:59:28 hercules01 clurgmgrd[10548]: <info> Services Initialized
--- no more cluster messages after this point ---

** Daemons status:
# service fenced status
fenced (pid 9304) is running...
# service rgmanager status
clurgmgrd (pid 10548 10547) is running...

** Clustat:
< delay of about 10 seconds >
Timed out waiting for a response from Resource Group Manager
Member Status: Quorate

Resource Group Manager not running; no service information available.

  Member Name                              Status
  ------ ----                              ------
  node1                                    Online, Local
  node2                                    Offline

** cman_tool nodes
Node  Votes Exp Sts  Name
   0    1    0   M   /dev/emcpowere1
   1    1    3   M   node1

** cman_tool status
Protocol version: 5.0.1
Config version: 12
Cluster name: clu_prosperdb
Cluster ID: 570
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 1
Expected_votes: 3
Total_votes: 2
Quorum: 2
Active subsystems: 2
Node name: node1
Node ID: 1
Node addresses: 192.168.50.1

** Kernel version (uname -r): RHEL4u4 with latest kernel approved by EMC,
the EMC eLab was done for RHEL4U4, not RHEL 4.5, so we can't upgrade the
kernel, unless we move on everything to RHEL 4.5:
2.6.9-42.0.10.ELsmp

** Installed cluster package versions (same on both nodes):
ccs-1.0.10-0.x86_64.rpm
cman-1.0.17-0.x86_64.rpm
cman-kernel-smp-2.6.9-45.15.x86_64.rpm
dlm-1.0.3-1.x86_64.rpm
dlm-kernel-smp-2.6.9-44.9.x86_64.rpm
fence-1.32.45-1.0.2.x86_64.rpm
gulm-1.0.10-0.x86_64.rpm
iddev-2.0.0-4.x86_64.rpm
magma-1.0.7-1.x86_64.rpm
magma-plugins-1.0.12-0.x86_64.rpm
perl-Net-Telnet-3.03-3.noarch.rpm
rgmanager-1.9.68-1.x86_64.rpm
system-config-cluster-1.0.45-1.0.noarch.rpm

** What happens if I boot up the other node (node2):
- ccsd comes up after just a few seconds on node2
- all other cluster daemons start successfully
- fenced and rgmanager on node1 both start
- the logs show node1 starting services when node2 came up:
Oct  4 12:51:44 hercules01 clurgmgrd[10548]: <info> Logged in
SG "usrm::manager"
Oct  4 12:51:44 hercules01 clurgmgrd[10548]: <info> Magma Event: Membership
Change
Oct  4 12:51:44 hercules01 clurgmgrd[10548]: <info> State change: Local UP
... <messages about services starting and filesystems being mounted>
Oct  4 12:52:24 hercules01 clurgmgrd[10548]: <info> Magma Event: Membership
Change
Oct  4 12:52:24 hercules01 clurgmgrd[10548]: <info> State change: node2 UP

The only packages not up to date are the kernel related ones, which I
believe are the correct ones for my kernel version.

Please, tell me if you see any mistake on this setup. The problem is that
the customer cannot boot up the systems if one node is eventually dead. If
both nodes are up and one goes down, the functionality is OK. But as it is
now, if the remaining node reboots, the services cannot come up.

Thank you very much.

Regards,

-- Celso

-- 
Esta mensagem foi verificada pelo sistema de antiv?rus e
 acredita-se estar livre de perigo.


From lhh at redhat.com  Thu Oct  4 17:50:09 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 04 Oct 2007 13:50:09 -0400
Subject: [Linux-cluster] Can a 2-node Cluster boot-up with only one
	active node?
In-Reply-To: <4705149C.1000307@webbertek.com.br>
References: <4705149C.1000307@webbertek.com.br>
Message-ID: <1191520209.4477.169.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-04 at 13:28 -0300, Celso K. Webber wrote:
> ** What is happening:
>   If I boot node1 with node2 powered off, it stops for 5 minutes during the
> start of ccsd, and after that it regains quorum, qdiskd starts successfully,
> but fenced keeps trying to start for 2 minutes and then it gives up with
> a "failed" message.

Hmmm...

> Oct  4 11:56:45 hercules01 lock_gulmd: no <gulm> section detected
> in /etc/cluster/cluster.conf succeeded

chkconfig --del lock_gulmd

> Oct  4 11:56:45 hercules01 qdiskd: Starting the Quorum Disk Daemon: succeeded
> Oct  4 11:57:02 hercules01 kernel: CMAN: quorum regained, resuming activity
> Oct  4 11:57:02 hercules01 ccsd[9144]: Cluster is quorate.  Allowing
> connections.
> Oct  4 11:58:45 hercules01 fenced: startup failed
>        ^^^^^^^^
>        exactly 2 minutes after the qdiskd message above, I've noticed that
> fenced is started in the init scripts with "fence_tool -t 120 join -w"
> Oct  4 11:59:38 hercules01 rgmanager: clurgmgrd startup failed
>        ^^^^^^^^
>        after other service boot up ok, rgmanager fails to boot, probably
> because fenced failed to start
> Oct  4 11:56:45 hercules01 qdiskd[9292]: <info> Quorum Daemon Initializing
> Oct  4 11:56:55 hercules01 qdiskd[9292]: <info> Initial score 1/1
> Oct  4 11:56:55 hercules01 qdiskd[9292]: <info> Initialization complete
> Oct  4 11:56:55 hercules01 qdiskd[9292]: <notice> Score sufficient for
> master operation (1/1; required=1); upgrading
> Oct  4 11:57:01 hercules01 qdiskd[9292]: <info> Assuming master role

[ at this point, the cluster is quorate ]


> ** Installed cluster package versions (same on both nodes):
> cman-kernel-smp-2.6.9-45.15.x86_64.rpm

>From cvs logs for cnxman.c (I know, too much information... but I can't
find a bugzilla on it):

RHEL45: 1.42.2.28.0.2
cman-kernel_2_6_9_48: 1.42.2.27
...
cman-kernel_2_6_9_45: 1.42.2.25

revision 1.42.2.27
date: 2007/01/19 10:23:14;  author: pcaulfield;  state: Exp;  lines: +2
-0
Tell SM when the quorum device comes or goes.


There's a bug in the one you have which is fixed in 4.5.  Basically, the
SM component of CMAN in the kernel wasn't getting notified when qdisk
votes were causing a quorum transition.  This caused problems with
fenced and the DLM (and thus, rgmanager - since rgmanager uses the DLM).

It's fixed in cman-kernel from 4.5 and later.  The patch is here:

http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/cman-kernel/src/Attic/cnxman.c.diff?r1=1.42.2.26&r2=1.42.2.27&cvsroot=cluster&hideattic=0&only_with_tag=RHEL4

-- Lon


From Timothy.Ward at itt.com  Thu Oct  4 21:37:02 2007
From: Timothy.Ward at itt.com (Ward, Timothy - SSD)
Date: Thu, 4 Oct 2007 17:37:02 -0400
Subject: [Linux-cluster] Cluster NFS causes kernel bug
Message-ID: <77E700AE7021314DB6CDF6D6E8F661320396FC64@ACDFWMAIL1.acd.de.ittind.com>

On Thu, 2007-09-13 at 17:35 -0500, Kevin Anderson wrote: 
-------------------------------------------------------------
We have been primarily focused on getting GFS2 stable, but one of the
guys is going to spend sometime over the next couple of weeks working on
getting FC7 rpms built for gfs.

Kevin 
-------------------------------------------------------------


Kevin,

I completely reinstalled my cluster and gotten back to Apache and Samba
working and NFS using a GFS2 filesystem making a Kernel bug and crashing
the whole cluster.

How are the GFS1/FC7 rpms coming?  Where should I look to download them?

If I can get a stable setup of GFS (1 or 2) working to show management I
will be able to get a testbed for testing 64bit versions for you.  So
basically I need some flavor of FC and GFS that are stable so I can put
it into production and show it off.

Thanks,
Tim
*****************************************************************
This e-mail and any files transmitted with it may be proprietary 
and are intended solely for the use of the individual or entity to 
whom they are addressed. If you have received this e-mail in 
error please notify the sender. Please note that any views or
opinions presented in this e-mail are solely those of the author 
and do not necessarily represent those of ITT Corporation. The 
recipient should check this e-mail and any attachments for the 
presence of viruses. ITT accepts no liability for any damage 
caused by any virus transmitted by this e-mail.
*******************************************************************


From linuxtechmails at gmail.com  Fri Oct  5 10:31:36 2007
From: linuxtechmails at gmail.com (Linux Technology Mails)
Date: Fri, 5 Oct 2007 16:01:36 +0530
Subject: [Linux-cluster] Cluster Documents - Request
Message-ID: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>

Hi,
 Could anyone please provide links to documents related to Linux clusters.
Will be of great help.
-- 
Regards,

Linux Tech Mails Group.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071005/03840fd6/attachment.htm>

From andrewxwang at yahoo.com.tw  Fri Oct  5 12:19:15 2007
From: andrewxwang at yahoo.com.tw (Andrew Wang)
Date: Fri, 5 Oct 2007 20:19:15 +0800 (CST)
Subject: [Linux-cluster] Proceedings of 3rd Grid Engine Workshop
Message-ID: <526873.28021.qm@web73506.mail.tp2.yahoo.com>

The presentations from the 3rd Grid Engine Workshop
are now available for download:
http://gridengine.sunsource.net/workshop10-12.09.07/proceedings.html

Grid Engine Project Homepage:
http://gridengine.sunsource.net

Andrew.


      ____________________________________________________________________________________
????????????? - ???? Yahoo!??????http://tw.info.yahoo.com/seal/index.html


From dist-list at LEXUM.UMontreal.CA  Fri Oct  5 15:34:10 2007
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Fri, 05 Oct 2007 11:34:10 -0400
Subject: [Linux-cluster] cluster shutdown procedure best practice ?
Message-ID: <47065972.6060704@lexum.umontreal.ca>

Hello,
Tonight I will have to shutdown our 6 nodes clusters (with a shared GFS).

What is the best practice to shut it down ?

Regards,


From Alexandre.Racine at mhicc.org  Fri Oct  5 15:10:38 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Fri, 5 Oct 2007 11:10:38 -0400
Subject: [Linux-cluster] fence and DRAC problem...
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>

So I am testing DRAC right...


# /sbin/fence_drac -a 192.168.1.2 -m ????? -l user -p ***** -o reboot
failed: module 'server' not detected


Q1 - What should I write to the module name?


Q2 - How should this be put in the cluster.conf file?


Q3 - How can I tell the cluster.conf file that the server I want to reboot is in slot #9?


Thanks


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2650 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071005/d816e06f/attachment.bin>

From Alexandre.Racine at mhicc.org  Fri Oct  5 15:02:38 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Fri, 5 Oct 2007 11:02:38 -0400
Subject: [Linux-cluster] Cluster Documents - Request
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C34D9@cumulonimbus.RG.local>

http://sourceware.org/cluster/
http://sourceware.org/cluster/faq.html


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Linux Technology Mails
Sent: Fri 2007-10-05 06:31
To: Linux-cluster at redhat.com
Subject: [Linux-cluster] Cluster Documents - Request
 
Hi,
 Could anyone please provide links to documents related to Linux clusters.
Will be of great help.
-- 
Regards,

Linux Tech Mails Group.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2800 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071005/2cf3b7f8/attachment.bin>

From gordan at bobich.net  Fri Oct  5 15:49:57 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Fri, 5 Oct 2007 16:49:57 +0100 (BST)
Subject: [Linux-cluster] GFS Feature Question
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
Message-ID: <Pine.LNX.4.64.0710051639460.7176@skynet.shatteredsilicon.net>

Hi,

I stumbled upon an old document from back in 2000 (before RedHat acquired 
Sistina), and they were talking about a number of features for the "next 
version", including shadowing/copy-on-write.

The two features I am particularly interested in are:

1) Compression
I consider this to be important both for performance reasons and the fact 
that no matter how cheap, disks will always be more expensive. 
Performance-wise, at some point I/O becomes the bottleneck. Not 
necessarily the disk I/O but network I/O of the SAN, especially when all 
the nodes in the cluster are sharing the same SAN bandwidth. At that 
point, reducing the data volume through compression becomes a performance 
win. This point isn't all that difficult to reach even on a small cluster 
on gigabit ethernet.

2) Shadowing/Copy-On-Write File Versioning
Backups have 2 purposes - retrieving a file that was lost or corrupted 
through user error, and files lost or corrupted through disk failure. High 
levels of RAID alleviate the need for backup for the latter reason, but 
they do nothing to alleviate user-error caused damage. At the same time 
SANs can get big - I don't see hundreds of TB to be an inconcievable size. 
At this size, backups become an issue. Thus, a feature to provide file 
versioning is important.

In turn, 2) increases the volume of data, which increases the need for 1).

Are either of these two features planned for GFS in the near future?

Gordan


From swhiteho at redhat.com  Fri Oct  5 16:03:49 2007
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Fri, 05 Oct 2007 17:03:49 +0100
Subject: [Linux-cluster] GFS Feature Question
In-Reply-To: <Pine.LNX.4.64.0710051639460.7176@skynet.shatteredsilicon.net>
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
	<Pine.LNX.4.64.0710051639460.7176@skynet.shatteredsilicon.net>
Message-ID: <1191600229.1068.283.camel@quoit>

Hi,

On Fri, 2007-10-05 at 16:49 +0100, Gordan Bobic wrote:
> Hi,
> 
> I stumbled upon an old document from back in 2000 (before RedHat acquired 
> Sistina), and they were talking about a number of features for the "next 
> version", including shadowing/copy-on-write.
> 
> The two features I am particularly interested in are:
> 
> 1) Compression
> I consider this to be important both for performance reasons and the fact 
> that no matter how cheap, disks will always be more expensive. 
> Performance-wise, at some point I/O becomes the bottleneck. Not 
> necessarily the disk I/O but network I/O of the SAN, especially when all 
> the nodes in the cluster are sharing the same SAN bandwidth. At that 
> point, reducing the data volume through compression becomes a performance 
> win. This point isn't all that difficult to reach even on a small cluster 
> on gigabit ethernet.
> 
There are really two issues here rather than one:
 1. Compression of data
 Has, as a prerequisite, "allocate on flush" as we would really need
"compress on flush" in order to make this a viable option. Also we'd
need hints as to what kind of data we are looking at in order to make it
worthwhile. We'd also have to look at crypto too since you can't
compress encrypted data, the compression must come first if its
required.
 2. Compression of metadata
 This might well be worth looking into. There is a considerable amount
of redundancy in typical fs metadata, and we ought to be able to reduce
the number of blocks we have to read/write in order to complete an
operation in this way. Using extents for example could be considered a
form of metadata compression. The main problem is that our "cache line"
if you like in GFS(2) is one disk block, so that sharing between nodes
is a problem (hence the one inode per block rule we have at the moment).
We'd need to address the metadata migration issue first.

Neither of the above is likely to happen soon though as they both
require on-disk format changes.

> 2) Shadowing/Copy-On-Write File Versioning
> Backups have 2 purposes - retrieving a file that was lost or corrupted 
> through user error, and files lost or corrupted through disk failure. High 
> levels of RAID alleviate the need for backup for the latter reason, but 
> they do nothing to alleviate user-error caused damage. At the same time 
> SANs can get big - I don't see hundreds of TB to be an inconcievable size. 
> At this size, backups become an issue. Thus, a feature to provide file 
> versioning is important.
> 
> In turn, 2) increases the volume of data, which increases the need for 1).
> 
> Are either of these two features planned for GFS in the near future?
> 
> Gordan
> 
This also requires on-disk format changes, but I agree that it would be
a nice thing to do. Its very much in my mind though as to what a
suitable scheme would be. We do have an ever increasing patent minefield
to walk through here too I suspect.

Potentially it would be possible to address both of the above
suggestions (minus the metadata compression) by using a stacking
filesystem. That would be potentially more flexible by introducing the
features on all filesystems not just GFS(2),

Steve.


From erickson.jon at gmail.com  Fri Oct  5 16:30:45 2007
From: erickson.jon at gmail.com (Jon Erickson)
Date: Fri, 5 Oct 2007 12:30:45 -0400
Subject: [Linux-cluster] cluster shutdown procedure best practice ?
In-Reply-To: <47065972.6060704@lexum.umontreal.ca>
References: <47065972.6060704@lexum.umontreal.ca>
Message-ID: <6a90e4da0710050930u589190dcl8a9e149c3cce4a13@mail.gmail.com>

See the following question and answer from the FAQ
http://sourceware.org/cluster/faq.html#cman_shutdown

I wrote a script called to shut down my nodes.  Basicly it runs a
bunch of service stop commands and the last line runs cman_tool
remove.  This seems to work for me.

I would recomend that redhat produces a shutdown/ startup cluster
script in the distro.

-Jon

On 10/5/07, FM <dist-list at lexum.umontreal.ca> wrote:
> Hello,
> Tonight I will have to shutdown our 6 nodes clusters (with a shared GFS).
>
> What is the best practice to shut it down ?
>
> Regards,
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Jon


From Alexandre.Racine at mhicc.org  Fri Oct  5 16:48:37 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Fri, 5 Oct 2007 12:48:37 -0400
Subject: [Linux-cluster] RE: fence and DRAC problem...
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DB@cumulonimbus.RG.local>

I'll answer my own Q1 question...


R1 
You have to see here : http://sourceware.org/cluster/faq.html#fence_devices

Then download the test_drac.sh and use it to get the modules witch is actually the name of your blades...


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Alexandre Racine
Sent: Fri 2007-10-05 11:10
To: linux clustering
Subject: fence and DRAC problem...
 
So I am testing DRAC right...


# /sbin/fence_drac -a 192.168.1.2 -m ????? -l user -p ***** -o reboot
failed: module 'server' not detected


Q1 - What should I write to the module name?


Q2 - How should this be put in the cluster.conf file?


Q3 - How can I tell the cluster.conf file that the server I want to reboot is in slot #9?


Thanks


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071005/1bd00b82/attachment.htm>

From gordan at bobich.net  Fri Oct  5 16:56:41 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Fri, 5 Oct 2007 17:56:41 +0100 (BST)
Subject: [Linux-cluster] GFS Feature Question
In-Reply-To: <1191600229.1068.283.camel@quoit>
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
	<Pine.LNX.4.64.0710051639460.7176@skynet.shatteredsilicon.net>
	<1191600229.1068.283.camel@quoit>
Message-ID: <Pine.LNX.4.64.0710051740220.7849@skynet.shatteredsilicon.net>

On Fri, 5 Oct 2007, Steven Whitehouse wrote:

>> I stumbled upon an old document from back in 2000 (before RedHat acquired
>> Sistina), and they were talking about a number of features for the "next
>> version", including shadowing/copy-on-write.
>>
>> The two features I am particularly interested in are:
>>
>> 1) Compression
>> I consider this to be important both for performance reasons and the fact
>> that no matter how cheap, disks will always be more expensive.
>> Performance-wise, at some point I/O becomes the bottleneck. Not
>> necessarily the disk I/O but network I/O of the SAN, especially when all
>> the nodes in the cluster are sharing the same SAN bandwidth. At that
>> point, reducing the data volume through compression becomes a performance
>> win. This point isn't all that difficult to reach even on a small cluster
>> on gigabit ethernet.
>>
> There are really two issues here rather than one:
> 1. Compression of data
> Has, as a prerequisite, "allocate on flush" as we would really need
> "compress on flush" in order to make this a viable option. Also we'd
> need hints as to what kind of data we are looking at in order to make it
> worthwhile. We'd also have to look at crypto too since you can't
> compress encrypted data, the compression must come first if its
> required.

Sure, but this is hardly a difficult problem. It could be based on any of:
1) file extension, perhaps listed somewhere in the /etc directory, and 
only read on boot-up, or even provided as a comma-separated list to the 
module/kernel at load-time (a file would be nicer, though).
2) Completely transparently based on a similar heuristic to what Reiser4 
uses. For each file, try to compress the first 64KB. If it yields a 
reasonable result, compress the rest, otherwise, flag as uncompressed 
and don't bother. The user could override this by the appropriate chattr 
command.
3) Leave it entirely up to the user - just inherit compression flag from 
the parent directory. If the user says to compress, then don't question 
it.

3) would be the simplest, and probably most useful. The only time 
when a block should be left uncompressed is when compressing it makes it 
get bigger.

> 2. Compression of metadata
> This might well be worth looking into. There is a considerable amount
> of redundancy in typical fs metadata, and we ought to be able to reduce
> the number of blocks we have to read/write in order to complete an
> operation in this way. Using extents for example could be considered a
> form of metadata compression. The main problem is that our "cache line"
> if you like in GFS(2) is one disk block, so that sharing between nodes
> is a problem (hence the one inode per block rule we have at the moment).
> We'd need to address the metadata migration issue first.

I'm not sure I understand what the problem is here. How is caching a 
problem any more than it would otherwise be - considering we have multiple 
nodes doing r/w ops on the same FS?

> Neither of the above is likely to happen soon though as they both
> require on-disk format changes.

Compatibility is already broken between GFS1 and GFS2. I don't see this as 
an issue. The FS will get mounted with whatever parameters it was created 
- and a new FS can be created with compression enabled.

>> 2) Shadowing/Copy-On-Write File Versioning
>> Backups have 2 purposes - retrieving a file that was lost or corrupted
>> through user error, and files lost or corrupted through disk failure. High
>> levels of RAID alleviate the need for backup for the latter reason, but
>> they do nothing to alleviate user-error caused damage. At the same time
>> SANs can get big - I don't see hundreds of TB to be an inconcievable size.
>> At this size, backups become an issue. Thus, a feature to provide file
>> versioning is important.
>>
>> In turn, 2) increases the volume of data, which increases the need for 1).
>>
>> Are either of these two features planned for GFS in the near future?

> This also requires on-disk format changes,

I don't remember implying that it wouldn't. But at the same time, why 
would this be a problem? It's not like it means that people won't be able 
to mount their GFS2 FS as they can now. And it's not like GFS2 works at 
the moment, anyway (not with the latest packaged releases on any of the 
spawns of RH (Fedore/CentOS, etc.)! :-p

> but I agree that it would be
> a nice thing to do. Its very much in my mind though as to what a
> suitable scheme would be. We do have an ever increasing patent minefield
> to walk through here too I suspect.

I very much doubt it. There are several OSS non-cluster FSs that provide 
copy-on-write file versioning, and this has been used since the days of 
VMS - which was now long enough ago that patents would have long since 
expired.

> Potentially it would be possible to address both of the above
> suggestions (minus the metadata compression) by using a stacking
> filesystem. That would be potentially more flexible by introducing the
> features on all filesystems not just GFS(2),

Can you explain what you mean by stackable? I would have thought that 
having a stacked file system on top of GFS would break GFS' ability to 
function correctly in a clustered environment (not to mention introduce 
unnecessary overheads).

Gordan


From Abdel.Sadek at lsi.com  Fri Oct  5 17:14:24 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Fri, 5 Oct 2007 11:14:24 -0600
Subject: [Linux-cluster] SCSI fencing on RHEL 4.5
Message-ID: <C776378855970A4DADE4A476447F6391EB612C@NAMAIL3.ad.lsil.com>

I have a 2-node RHEL 4.5 cluster. I am using SCSI fencing which uses
scsi-3 reservation. The 2 nodes are sharing devices on a FC storage
array. I am using a single path; there is no multipath driver involved.
I am also using GFS on top of LVM volumes.

When I shutdown one of the nodes, the cluster does not send any commands
to the storage array to clear out the persistent reservations for that
node. This causes the resources not to fail correctly to the other node
since it cannot break the reservation done by the dead node. My
questions are:

1. Is that a known issue?

2. Is there a way to turn some kind of debugging on the cluster that
will allow me to watch if any scsi PRIN/PROUT commands are sent to the
storage devices?

 
Thanks.

Abdel..

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071005/ce1eff24/attachment.htm>

From lhh at redhat.com  Fri Oct  5 18:25:49 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 05 Oct 2007 14:25:49 -0400
Subject: [Linux-cluster] Cluster Documents - Request
In-Reply-To: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
Message-ID: <1191608749.21582.54.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-05 at 16:01 +0530, Linux Technology Mails wrote:
> 
> Hi,
>  Could anyone please provide links to documents related to Linux
> clusters. Will be of great help.
> -- 
> Regards,

http://sources.redhat.com/cluster/faq.html
http://sources.redhat.com/cluster/doc/nfscookbook.pdf

http://sources.redhat.com/cluster/doc/usage.txt
http://www.redhat.com/docs/manuals/csgfs/


From rnevins at workflowbydesign.com  Fri Oct  5 19:01:41 2007
From: rnevins at workflowbydesign.com (Rob Nevins)
Date: Fri, 05 Oct 2007 15:01:41 -0400
Subject: [Linux-cluster] cluster shutdown procedure best practice ?
In-Reply-To: <6a90e4da0710050930u589190dcl8a9e149c3cce4a13@mail.gmail.com>
References: <47065972.6060704@lexum.umontreal.ca>
	<6a90e4da0710050930u589190dcl8a9e149c3cce4a13@mail.gmail.com>
Message-ID: <47068A15.2080503@workflowbydesign.com>

Just to be sure, should the filesystem be dismounted before using 
"cman_tool leave remove"?

-Rob

Jon Erickson wrote:
> See the following question and answer from the FAQ
> http://sourceware.org/cluster/faq.html#cman_shutdown
>
> I wrote a script called to shut down my nodes.  Basicly it runs a
> bunch of service stop commands and the last line runs cman_tool
> remove.  This seems to work for me.
>
> I would recomend that redhat produces a shutdown/ startup cluster
> script in the distro.
>
> -Jon
>
>   


From dist-list at LEXUM.UMontreal.CA  Fri Oct  5 19:45:48 2007
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Fri, 05 Oct 2007 15:45:48 -0400
Subject: [Linux-cluster] cluster shutdown procedure best practice ?
In-Reply-To: <47068A15.2080503@workflowbydesign.com>
References: <47065972.6060704@lexum.umontreal.ca>	<6a90e4da0710050930u589190dcl8a9e149c3cce4a13@mail.gmail.com>
	<47068A15.2080503@workflowbydesign.com>
Message-ID: <4706946C.7020307@lexum.umontreal.ca>

When I'm doing  kernel and GFS updates I do this :

stop service accessing the GFS
umount the GFS FS
service ccsd stop
service clvmd stop
service fenced stop
service rgmanager stop

cman_tool leave remove

then update and reboot

Regards,


Rob Nevins wrote:
> Just to be sure, should the filesystem be dismounted before using
> "cman_tool leave remove"?
> 
> -Rob
> 
> Jon Erickson wrote:
>> See the following question and answer from the FAQ
>> http://sourceware.org/cluster/faq.html#cman_shutdown
>>
>> I wrote a script called to shut down my nodes.  Basicly it runs a
>> bunch of service stop commands and the last line runs cman_tool
>> remove.  This seems to work for me.
>>
>> I would recomend that redhat produces a shutdown/ startup cluster
>> script in the distro.
>>
>> -Jon
>>
>>   
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From ben.yarwood at juno.co.uk  Sun Oct  7 16:44:47 2007
From: ben.yarwood at juno.co.uk (Ben Yarwood)
Date: Sun, 7 Oct 2007 17:44:47 +0100
Subject: [Linux-cluster] fencing using rps-10
In-Reply-To: <1191444486.4477.97.camel@ayanami.boston.devel.redhat.com>
References: <00ea01c805b4$039333d0$0ab99b70$@yarwood@juno.co.uk>
	<1191444486.4477.97.camel@ayanami.boston.devel.redhat.com>
Message-ID: <036201c80901$5f8afa60$1ea0ef20$@yarwood@juno.co.uk>

Bugzilla Bug 322291: rps-10 fence agent does not perform default reboot action


> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon
> Hohberger
> Sent: 03 October 2007 21:48
> To: linux clustering
> Subject: Re: [Linux-cluster] fencing using rps-10
> 
> On Wed, 2007-10-03 at 12:53 +0100, Ben Yarwood wrote:
> > I think the documentation for using the rps10 fence device is incorrect
> > (http://sources.redhat.com/cluster/doc/cluster_schema_rhel5.html), or else there is a bug in the
> fence agent in rhel5:
> >
> > The doc does not mention that you must specify an "option" attribute or else the agent returns an
> error.  Eg.
> >
> > <fencedevice agent="fence_rps10" name="rps10" device="/dev/ttyS0" port="0" option="reboot" />
> >
> > will work but without the "option" attribute you get the error:
> >
> > failed: operation must be 'on', 'off', or 'reboot'
> 
> Ouch.  Could you file a bz?  It should default to reboot if unspecified.
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From qqlka at nask.pl  Mon Oct  8 09:50:26 2007
From: qqlka at nask.pl (=?iso-8859-2?Q?Agnieszka_Kuka=B3owicz?=)
Date: Mon, 8 Oct 2007 11:50:26 +0200
Subject: [Linux-cluster] Multihome configuration in RHEL5
Message-ID: <022801c80990$a7896880$0776b5c2@gda07ak>

Hi,

I have problem with multihome cluster configuration in RHEL5.

I've read that it was possible to configure alternative cluster node
name 
for servers who had configured two different nets. There are a few of
examples how to do it on that mailinig list. 

Now, I have RHEL5 Beta with cman version 2.0.70 and it doesn't work. 
I've checked that "ccs_tool addnode node1 -n 1 -a node11" doesn't add
the altname tag to cluster.conf file. And validating cluster.conf with
"altname" tag inside says:

element altname: Relax-NG validity error : Element clusternode has extra
content: altname

So my question is what happened with this tag in RHEL5? Is it
"deprecated"? And how to achieve HA in case of failure of ethernet
interface. Now there is no redundant channel for communication between
cluster members.

Agnieszka Kukalowicz


From pcaulfie at redhat.com  Mon Oct  8 10:00:59 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 08 Oct 2007 11:00:59 +0100
Subject: [Linux-cluster] Multihome configuration in RHEL5
In-Reply-To: <022801c80990$a7896880$0776b5c2@gda07ak>
References: <022801c80990$a7896880$0776b5c2@gda07ak>
Message-ID: <4709FFDB.1010909@redhat.com>

Agnieszka Kuka?owicz wrote:
> Hi,
> 
> I have problem with multihome cluster configuration in RHEL5.
> 
> I've read that it was possible to configure alternative cluster node
> name 
> for servers who had configured two different nets. There are a few of
> examples how to do it on that mailinig list. 
> 
> Now, I have RHEL5 Beta with cman version 2.0.70 and it doesn't work. 
> I've checked that "ccs_tool addnode node1 -n 1 -a node11" doesn't add
> the altname tag to cluster.conf file. And validating cluster.conf with
> "altname" tag inside says:

ccs_tool doesn't support adding altnames to a node....but you guessed that :)

> element altname: Relax-NG validity error : Element clusternode has extra
> content: altname
> 
> So my question is what happened with this tag in RHEL5? Is it
> "deprecated"? And how to achieve HA in case of failure of ethernet
> interface. Now there is no redundant channel for communication between
> cluster members. 

The tag is there but it's not well tested, and the DLM only supports multi-home
when using SCTP (also not well tested).

The recommended way to use redundant ethernets with cluster software is the
ethernet bonding driver.

-- 
Patrick


From swhiteho at redhat.com  Mon Oct  8 13:58:19 2007
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Mon, 08 Oct 2007 14:58:19 +0100
Subject: [Linux-cluster] GFS Feature Question
In-Reply-To: <Pine.LNX.4.64.0710051740220.7849@skynet.shatteredsilicon.net>
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
	<Pine.LNX.4.64.0710051639460.7176@skynet.shatteredsilicon.net>
	<1191600229.1068.283.camel@quoit>
	<Pine.LNX.4.64.0710051740220.7849@skynet.shatteredsilicon.net>
Message-ID: <1191851899.1068.335.camel@quoit>

Hi,

On Fri, 2007-10-05 at 17:56 +0100, Gordan Bobic wrote:
> On Fri, 5 Oct 2007, Steven Whitehouse wrote:
> 
> >> I stumbled upon an old document from back in 2000 (before RedHat acquired
> >> Sistina), and they were talking about a number of features for the "next
> >> version", including shadowing/copy-on-write.
> >>
> >> The two features I am particularly interested in are:
> >>
> >> 1) Compression
> >> I consider this to be important both for performance reasons and the fact
> >> that no matter how cheap, disks will always be more expensive.
> >> Performance-wise, at some point I/O becomes the bottleneck. Not
> >> necessarily the disk I/O but network I/O of the SAN, especially when all
> >> the nodes in the cluster are sharing the same SAN bandwidth. At that
> >> point, reducing the data volume through compression becomes a performance
> >> win. This point isn't all that difficult to reach even on a small cluster
> >> on gigabit ethernet.
> >>
> > There are really two issues here rather than one:
> > 1. Compression of data
> > Has, as a prerequisite, "allocate on flush" as we would really need
> > "compress on flush" in order to make this a viable option. Also we'd
> > need hints as to what kind of data we are looking at in order to make it
> > worthwhile. We'd also have to look at crypto too since you can't
> > compress encrypted data, the compression must come first if its
> > required.
> 
> Sure, but this is hardly a difficult problem. It could be based on any of:
I would argue that it is a difficult problem for a number of reasons...

> 1) file extension, perhaps listed somewhere in the /etc directory, and 
> only read on boot-up, or even provided as a comma-separated list to the 
> module/kernel at load-time (a file would be nicer, though).
Which requires not only loading the list into the kernel, but what
happens if someone creates a link to foo.png called foo.txt or foo.ps?

> 2) Completely transparently based on a similar heuristic to what Reiser4 
> uses. For each file, try to compress the first 64KB. If it yields a 
> reasonable result, compress the rest, otherwise, flag as uncompressed 
> and don't bother. The user could override this by the appropriate chattr 
> command.
That assumes that you always have the "first 64k" available (in cache)
and that its not a hole in the file for example.

> 3) Leave it entirely up to the user - just inherit compression flag from 
> the parent directory. If the user says to compress, then don't question 
> it.
> 
Ok, but that still leaves a number of problems to resolve: firstly we
need to be able to ensure that the compression doesn't result in
expansion of the data. Whatever system we use we'd have to be able to
turn off compression in that case, otherwise block allocation would
become almost impossible as we'd not be able to put a reasonable max
bound on the number of blocks used.

Also the compression isn't likely to be very good if it can't tune
itself to the particular file content.

> 3) would be the simplest, and probably most useful. The only time 
> when a block should be left uncompressed is when compressing it makes it 
> get bigger.
> 
The other thing that would have to be decided is the size of a "block"
in this case. Too large and random access will be slow and cumbersome,
too small and the benefit from compression will be less.

So although all of those problems are solvable, given time, I would
still say that that it is not an easy thing to do.

> > 2. Compression of metadata
> > This might well be worth looking into. There is a considerable amount
> > of redundancy in typical fs metadata, and we ought to be able to reduce
> > the number of blocks we have to read/write in order to complete an
> > operation in this way. Using extents for example could be considered a
> > form of metadata compression. The main problem is that our "cache line"
> > if you like in GFS(2) is one disk block, so that sharing between nodes
> > is a problem (hence the one inode per block rule we have at the moment).
> > We'd need to address the metadata migration issue first.
> 
> I'm not sure I understand what the problem is here. How is caching a 
> problem any more than it would otherwise be - considering we have multiple 
> nodes doing r/w ops on the same FS?
> 
Because the data structures are carefully designed to minimimse the
times for which two nodes will want to access the same block. Of course
that still happens in some cases, but provided the nodes are not all
working in the same directory (or if they are, then the workload is
mostly readonly) then largely there is little contention.

As soon as you start (for example) having multiple inodes in the same
block, then the probability of sharing two items of data which are
required by different nodes at the same time goes up.

> > Neither of the above is likely to happen soon though as they both
> > require on-disk format changes.
> 
> Compatibility is already broken between GFS1 and GFS2. I don't see this as 
> an issue. The FS will get mounted with whatever parameters it was created 
> - and a new FS can be created with compression enabled.
> 
Mostly the data structures between GFS1 and GFS2 are the same. There are
a few differences, but thats mainly down to the addition of the metadata
file system (which has identical on-disk format to the main GFS2) and
one or two fields in the inode (the common fields are at the same
offsets). The format for journalled files has changed, but only to be
the same as that for non-journalled files, so its not a huge change
really.

> >> 2) Shadowing/Copy-On-Write File Versioning
> >> Backups have 2 purposes - retrieving a file that was lost or corrupted
> >> through user error, and files lost or corrupted through disk failure. High
> >> levels of RAID alleviate the need for backup for the latter reason, but
> >> they do nothing to alleviate user-error caused damage. At the same time
> >> SANs can get big - I don't see hundreds of TB to be an inconcievable size.
> >> At this size, backups become an issue. Thus, a feature to provide file
> >> versioning is important.
> >>
> >> In turn, 2) increases the volume of data, which increases the need for 1).
> >>
> >> Are either of these two features planned for GFS in the near future?
> 
> > This also requires on-disk format changes,
> 
> I don't remember implying that it wouldn't. But at the same time, why 
> would this be a problem? It's not like it means that people won't be able 
> to mount their GFS2 FS as they can now. And it's not like GFS2 works at 
> the moment, anyway (not with the latest packaged releases on any of the 
> spawns of RH (Fedore/CentOS, etc.)! :-p
> 
For the moment we are trying to not make changes to the on-disk format,
and in fact there haven't been any for a long time now. We have made
some major steps forward in stability recently and those are due to roll
into the distros fairly shortly now, so the last thing we want to do is
to change things at this stage.

Thats not to say that we won't come back and revisit the ideas later on
though, but its not top of our list right now.

> > but I agree that it would be
> > a nice thing to do. Its very much in my mind though as to what a
> > suitable scheme would be. We do have an ever increasing patent minefield
> > to walk through here too I suspect.
> 
> I very much doubt it. There are several OSS non-cluster FSs that provide 
> copy-on-write file versioning, and this has been used since the days of 
> VMS - which was now long enough ago that patents would have long since 
> expired.
> 
> > Potentially it would be possible to address both of the above
> > suggestions (minus the metadata compression) by using a stacking
> > filesystem. That would be potentially more flexible by introducing the
> > features on all filesystems not just GFS(2),
> 
> Can you explain what you mean by stackable? I would have thought that 
> having a stacked file system on top of GFS would break GFS' ability to 
> function correctly in a clustered environment (not to mention introduce 
> unnecessary overheads).
> 
> Gordan

I'm thinking of filesystems like (for example) unionfs which pass
certain operations through to the filesystem(s) underneath it. Depending
on how this is implemented, it need not be particularly inefficient. It
wouldn't affect how GFS2 works any more than it would affect any other
filesystem, although if locking were required (for example) in the
clustered case, then the higher level filesystem would be just as able
to use the DLM (for example) as any other kernel module or userland
application, so that shouldn't be a barrier,

Steve.


From gordan at bobich.net  Mon Oct  8 17:45:06 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Mon, 8 Oct 2007 18:45:06 +0100 (BST)
Subject: [Linux-cluster] GFS Feature Question
In-Reply-To: <1191851899.1068.335.camel@quoit>
References: <fc088dc30710050331i2aeb7537va761462e2abec9a9@mail.gmail.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C34DA@cumulonimbus.RG.local>
	<Pine.LNX.4.64.0710051639460.7176@skynet.shatteredsilicon.net>
	<1191600229.1068.283.camel@quoit>
	<Pine.LNX.4.64.0710051740220.7849@skynet.shatteredsilicon.net>
	<1191851899.1068.335.camel@quoit>
Message-ID: <Pine.LNX.4.64.0710081814080.31848@skynet.shatteredsilicon.net>

On Mon, 8 Oct 2007, Steven Whitehouse wrote:

>>>> I stumbled upon an old document from back in 2000 (before RedHat acquired
>>>> Sistina), and they were talking about a number of features for the "next
>>>> version", including shadowing/copy-on-write.
>>>>
>>>> The two features I am particularly interested in are:
>>>>
>>>> 1) Compression
>>>> I consider this to be important both for performance reasons and the fact
>>>> that no matter how cheap, disks will always be more expensive.
>>>> Performance-wise, at some point I/O becomes the bottleneck. Not
>>>> necessarily the disk I/O but network I/O of the SAN, especially when all
>>>> the nodes in the cluster are sharing the same SAN bandwidth. At that
>>>> point, reducing the data volume through compression becomes a performance
>>>> win. This point isn't all that difficult to reach even on a small cluster
>>>> on gigabit ethernet.
>>>>
>>> There are really two issues here rather than one:
>>> 1. Compression of data
>>> Has, as a prerequisite, "allocate on flush" as we would really need
>>> "compress on flush" in order to make this a viable option. Also we'd
>>> need hints as to what kind of data we are looking at in order to make it
>>> worthwhile. We'd also have to look at crypto too since you can't
>>> compress encrypted data, the compression must come first if its
>>> required.
>>
>> Sure, but this is hardly a difficult problem. It could be based on any of:
>
> I would argue that it is a difficult problem for a number of reasons...

Not really. It's only CPU time. ;-)

>> 1) file extension, perhaps listed somewhere in the /etc directory, and
>> only read on boot-up, or even provided as a comma-separated list to the
>> module/kernel at load-time (a file would be nicer, though).
>
> Which requires not only loading the list into the kernel, but what
> happens if someone creates a link to foo.png called foo.txt or foo.ps?

If the chattr flag says to compress, we compress. The computer should not 
be overriding the operator. If it gets bigger through compression (i.e. 
already compressed or encrypted), then you just save the block 
uncompressed and flag the block as such.

>> 2) Completely transparently based on a similar heuristic to what Reiser4
>> uses. For each file, try to compress the first 64KB. If it yields a
>> reasonable result, compress the rest, otherwise, flag as uncompressed
>> and don't bother. The user could override this by the appropriate chattr
>> command.
>
> That assumes that you always have the "first 64k" available (in cache)
> and that its not a hole in the file for example.

I'm actually very much in favour of just leaving it up to the user. So 
what if 1% of the files end up being uncompressible? If we are concerned 
about compression, we are clearly not concerned about every last CPU cycle 
that might end up getting wasted on the odd block of uncompressible data. 
If the user put the compress flag on the file (or directory), the the FS 
should try to compress, and if it doesn't manage to reduce the size, the 
save uncompressed.

I don't really see a problem with that.

>> 3) Leave it entirely up to the user - just inherit compression flag from
>> the parent directory. If the user says to compress, then don't question
>> it.
>>
> Ok, but that still leaves a number of problems to resolve: firstly we
> need to be able to ensure that the compression doesn't result in
> expansion of the data.

Inode flag to say whether the inode contains compressed or uncompressed 
data?

> Whatever system we use we'd have to be able to
> turn off compression in that case, otherwise block allocation would
> become almost impossible as we'd not be able to put a reasonable max
> bound on the number of blocks used.

Agreed. We'd need meta-data to tell which inodes are compressed and which 
aren't.

> Also the compression isn't likely to be very good if it can't tune
> itself to the particular file content.

It's a heuristic. If it does the job for most people most of the time, 
it's good enough. Say I have a big email system with 1 TB of maildirs from 
1M users on it. It's email. Most email is text. So we set the compression 
flags on everything to on, and just get on with it. Sure, some of it will 
be encrypted, and thus uncompressible, but so what if we still save 50% of 
the space?

>> 3) would be the simplest, and probably most useful. The only time
>> when a block should be left uncompressed is when compressing it makes it
>> get bigger.
>>
> The other thing that would have to be decided is the size of a "block"
> in this case. Too large and random access will be slow and cumbersome,
> too small and the benefit from compression will be less.

Not really. Just go with whatever the default block size is. Default on 
most file systems on Linux seens to be 4K (probably something to do with 
the page size on x86 ;-) ). This seems to yield reasonable results in 
e2compr (I think Reiser4 is a bit different, not sure), and NTFS uses 4K 
blocks by default, and the compression it achieves is certainly good 
enough to be useful.

> So although all of those problems are solvable, given time, I would
> still say that that it is not an easy thing to do.

It's a problem that has been solved by varioud other FS-es "well enough". 
There is no such thing as perfect compression. But a simple approach is 
good enough to be extremely useful for most people.

I am actually toying with the idea of home brewing an iSCSI san that 
exports a sparse file as a partition, and has that file sit on top of a 
Reiser4 compressed file system. This would actually solve the need to have 
GFS compressed, as the underlying FS would be compressed. It's not 
particularly neat, but it solves the problem. And the caching the SAN (or 
NAS, principle is the same) does would help overcome the compression speed 
overheads.

>>> 2. Compression of metadata
>>> This might well be worth looking into. There is a considerable amount
>>> of redundancy in typical fs metadata, and we ought to be able to reduce
>>> the number of blocks we have to read/write in order to complete an
>>> operation in this way. Using extents for example could be considered a
>>> form of metadata compression. The main problem is that our "cache line"
>>> if you like in GFS(2) is one disk block, so that sharing between nodes
>>> is a problem (hence the one inode per block rule we have at the moment).
>>> We'd need to address the metadata migration issue first.
>>
>> I'm not sure I understand what the problem is here. How is caching a
>> problem any more than it would otherwise be - considering we have multiple
>> nodes doing r/w ops on the same FS?
>>
> Because the data structures are carefully designed to minimimse the
> times for which two nodes will want to access the same block. Of course
> that still happens in some cases, but provided the nodes are not all
> working in the same directory (or if they are, then the workload is
> mostly readonly) then largely there is little contention.
>
> As soon as you start (for example) having multiple inodes in the same
> block, then the probability of sharing two items of data which are
> required by different nodes at the same time goes up.

Sure - so, you'd want to compress just before writing to disk. Although 
there could be performance benefit to having the cache compressed, too, if 
memory is short, especially with a fast decompression algorithm like LZO.

>>> Neither of the above is likely to happen soon though as they both
>>> require on-disk format changes.
>>
>> Compatibility is already broken between GFS1 and GFS2. I don't see this as
>> an issue. The FS will get mounted with whatever parameters it was created
>> - and a new FS can be created with compression enabled.
>>
> Mostly the data structures between GFS1 and GFS2 are the same. There are
> a few differences, but thats mainly down to the addition of the metadata
> file system (which has identical on-disk format to the main GFS2) and
> one or two fields in the inode (the common fields are at the same
> offsets). The format for journalled files has changed, but only to be
> the same as that for non-journalled files, so its not a huge change
> really.

Maybe not - but slightly incompatible or completely incompatible is still 
incompatible. It's re-format the file system time. :-)

>>>> 2) Shadowing/Copy-On-Write File Versioning
>>>> Backups have 2 purposes - retrieving a file that was lost or corrupted
>>>> through user error, and files lost or corrupted through disk failure. High
>>>> levels of RAID alleviate the need for backup for the latter reason, but
>>>> they do nothing to alleviate user-error caused damage. At the same time
>>>> SANs can get big - I don't see hundreds of TB to be an inconcievable size.
>>>> At this size, backups become an issue. Thus, a feature to provide file
>>>> versioning is important.
>>>>
>>>> In turn, 2) increases the volume of data, which increases the need for 1).
>>>>
>>>> Are either of these two features planned for GFS in the near future?
>>
>>> This also requires on-disk format changes,
>>
>> I don't remember implying that it wouldn't. But at the same time, why
>> would this be a problem? It's not like it means that people won't be able
>> to mount their GFS2 FS as they can now. And it's not like GFS2 works at
>> the moment, anyway (not with the latest packaged releases on any of the
>> spawns of RH (Fedore/CentOS, etc.)! :-p
>>
> For the moment we are trying to not make changes to the on-disk format,
> and in fact there haven't been any for a long time now. We have made
> some major steps forward in stability recently and those are due to roll
> into the distros fairly shortly now, so the last thing we want to do is
> to change things at this stage.

Sure, I understand that. I was suggesting this should be in the next 
stable version of GFS2. But it would mighty nice to have versioning and 
compression in GFS3. :-)

> Thats not to say that we won't come back and revisit the ideas later on
> though, but its not top of our list right now.
>
>>> but I agree that it would be
>>> a nice thing to do. Its very much in my mind though as to what a
>>> suitable scheme would be. We do have an ever increasing patent minefield
>>> to walk through here too I suspect.
>>
>> I very much doubt it. There are several OSS non-cluster FSs that provide
>> copy-on-write file versioning, and this has been used since the days of
>> VMS - which was now long enough ago that patents would have long since
>> expired.
>>
>>> Potentially it would be possible to address both of the above
>>> suggestions (minus the metadata compression) by using a stacking
>>> filesystem. That would be potentially more flexible by introducing the
>>> features on all filesystems not just GFS(2),
>>
>> Can you explain what you mean by stackable? I would have thought that
>> having a stacked file system on top of GFS would break GFS' ability to
>> function correctly in a clustered environment (not to mention introduce
>> unnecessary overheads).
>>
>
> I'm thinking of filesystems like (for example) unionfs which pass
> certain operations through to the filesystem(s) underneath it. Depending
> on how this is implemented, it need not be particularly inefficient. It
> wouldn't affect how GFS2 works any more than it would affect any other
> filesystem, although if locking were required (for example) in the
> clustered case, then the higher level filesystem would be just as able
> to use the DLM (for example) as any other kernel module or userland
> application, so that shouldn't be a barrier,

Interesting idea. In the meantime, I think I'll just try compressing the 
raw storage medium as mentioned above. But the big feature would really be 
copy-on-write journalling. Removing the need for backups on a big file 
system would be extremely useful. Sadly, though, that wouldn't stack as 
nicely as the compression does.

Gordan


From fs at lowpingbastards.de  Mon Oct  8 19:33:44 2007
From: fs at lowpingbastards.de (Frederik Schueler)
Date: Mon, 8 Oct 2007 21:33:44 +0200
Subject: [Linux-cluster] GFS 1.04: fatal: assertion "x <= length" failed
Message-ID: <20071008193343.GB29195@mail.lowpingbastards.de>

Hello,

I just got a crash on a gfs share:

GFS: fsid=beta:helium.1: fatal: assertion "x <= length" failed
GFS: fsid=beta:helium.1:   function = blkalloc_internal
GFS: fsid=beta:helium.1:   file = /usr/src/modules/redhat-cluster/gfs/gfs/rgrp.c, line = 1458
GFS: fsid=beta:helium.1:   time = 1191842568
GFS: fsid=beta:helium.1: about to withdraw from the cluster
GFS: fsid=beta:helium.1: waiting for outstanding I/O
GFS: fsid=beta:helium.1: telling LM to withdraw
lock_dlm: withdraw abandoned memory
GFS: fsid=beta:helium.1: withdrawn


the system is running gfs 1.04 with linux 2.6.21.

after the crash, I rebooted the concerned node and run an fsck on
another node to check the filesystem in question, and now it has a dozen
of lost files in l+f.

How can I debug the issue? 

Best regards
Frederik Sch?ler

-- 
ENOSIG
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071008/a71440a6/attachment.sig>

From rpeterso at redhat.com  Mon Oct  8 20:26:56 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Mon, 08 Oct 2007 15:26:56 -0500
Subject: [Linux-cluster] GFS 1.04: fatal: assertion "x <= length" failed
In-Reply-To: <20071008193343.GB29195@mail.lowpingbastards.de>
References: <20071008193343.GB29195@mail.lowpingbastards.de>
Message-ID: <1191875216.8820.24.camel@technetium.msp.redhat.com>

On Mon, 2007-10-08 at 21:33 +0200, Frederik Schueler wrote:
> Hello,
> 
> I just got a crash on a gfs share:
> 
> GFS: fsid=beta:helium.1: fatal: assertion "x <= length" failed
> GFS: fsid=beta:helium.1:   function = blkalloc_internal
> GFS: fsid=beta:helium.1:   file = /usr/src/modules/redhat-cluster/gfs/gfs/rgrp.c, line = 1458
> GFS: fsid=beta:helium.1:   time = 1191842568
> GFS: fsid=beta:helium.1: about to withdraw from the cluster
> GFS: fsid=beta:helium.1: waiting for outstanding I/O
> GFS: fsid=beta:helium.1: telling LM to withdraw
> lock_dlm: withdraw abandoned memory
> GFS: fsid=beta:helium.1: withdrawn
> 
> 
> the system is running gfs 1.04 with linux 2.6.21.
> 
> after the crash, I rebooted the concerned node and run an fsck on
> another node to check the filesystem in question, and now it has a dozen
> of lost files in l+f.
> 
> How can I debug the issue? 
> 
> Best regards
> Frederik Sch?ler

Hi Frederik,

This is odd.  What it means is this:  GFS was searching for a free block
to allocate.  The resource group ("RG"--not to be confused with
rgmanager's resource groups) indicated at least one free block for that
section of the file system, but there were no free blocks to be found in
the bitmap for that section (a direct contradiction).  Therefore, the
file system was determined to be corrupt.

It's nearly impossible to say how this could have happened.  Here are a
few possibilities:

(1) It's possible that some rogue kernel module overwrote the bitmap memory.
(2) This can also happen if gfs_fsck is run on a file system that is already
mounted from another node.  (3) Another possibility is a hardware problem
with your media--the hard drives, FC switch, HBAs, etc.  This could
happen, for example, if GFS read the bitmap(s) from disk and the disk
returned the wrong information.  We've seen a lot of that, and the best
thing to do is test the media (but it's a tedious and sometimes
destructive task).  For more information on that, see:

http://sources.redhat.com/cluster/faq.html#gfs_corruption

(4) It's also possible--although unlikely--it could be a GFS bug, although
as far as I know, you're the only person to report such a thing.

If it is really a GFS bug, the best way to solve it (and sometimes the
only way to solve it) is to give us a way to recreate the corruption
using a clean file system and a recreation program.
If that's not possible, you could describe what was happening to the
file system at the time of failure, in as much detail as possible, and
we can do some experiments here.  For example: were there lots of
file renames going on?  directory renames?  file creates?  What kind of
IO was happening to the file system at the time?  But doing these
experiments is often just a waste of time.

If you had not run gfs_fsck, we might have been able to tell a little
bit more about what happened from the contents of the journals.
For example, in RHEL5 and equivalent, you can use gfs2_edit to save
off the file system metadata and send it in for analysis.
(gfs2_edit can operate on gfs1 file systems as well as gfs2).  However,
Since gfs_fsck clears the journals, that information is now long gone.

I hope this helps.

Regards,

Bob Peterson
Red Hat Cluster Suite


From johnvge at yahoo.com  Mon Oct  8 20:42:16 2007
From: johnvge at yahoo.com (John Vijoe George)
Date: Mon, 8 Oct 2007 13:42:16 -0700 (PDT)
Subject: [Linux-cluster] mounting problem on RHEL5
Message-ID: <912218.242.qm@web54405.mail.yahoo.com>

I have a four node cluster with four storage nodes as well. I created 4 volume groups vg1,vg2,vg3,vg4 from the storage nodes and used mkfs.gfs2 to make a gfs2 filesystem.  

# mkfs.gfs2 -t c1:gfs1 -p lock_dlm -j 4 -r 2048 -J 128 /dev/vg1/lv0

I repeated the above for each volume group. I then mounted vg1 in to /mnt1 of one of the cluster nodes as follows:
# mount -t gfs2 -o quota=off /dev/vg1/lv0 /mnt1

It succeeded in mounting. But, when I try to mount the next volume group, I get an error as follows:

#mount -t gfs2 -o quota=off /dev/vg4/lv0 /mnt4
/sbin/mount.gfs2: lock_dlm_join: gfs_controld join error: -22
/sbin/mount.gfs2: error mounting lockproto lock_dlm


What could be the reason behind it? Am I messing up something related to lock_dlm? Are my options to gfs2.mkfs right? Any help is appreciated.

The following gfs modules were loaded:

# lsmod | grep gfs
gfs2                  519988  2 lock_dlm
configfs               62236  2 dlm


Regards,
John G


____________________________________________________________________________________
Need a vacation? Get great deals
to amazing places on Yahoo! Travel.
http://travel.yahoo.com/


From teigland at redhat.com  Mon Oct  8 20:37:28 2007
From: teigland at redhat.com (David Teigland)
Date: Mon, 8 Oct 2007 15:37:28 -0500
Subject: [Linux-cluster] mounting problem on RHEL5
In-Reply-To: <912218.242.qm@web54405.mail.yahoo.com>
References: <912218.242.qm@web54405.mail.yahoo.com>
Message-ID: <20071008203728.GA1060@redhat.com>

On Mon, Oct 08, 2007 at 01:42:16PM -0700, John Vijoe George wrote:
> I have a four node cluster with four storage nodes as well. I created 4
> volume groups vg1,vg2,vg3,vg4 from the storage nodes and used mkfs.gfs2
> to make a gfs2 filesystem.  
> 
> # mkfs.gfs2 -t c1:gfs1 -p lock_dlm -j 4 -r 2048 -J 128 /dev/vg1/lv0
> 
> I repeated the above for each volume group. I then mounted vg1 in to
> /mnt1 of one of the cluster nodes as follows:

Did you give each fs a different name?  e.g. gfs1,gfs2,gfs3,gfs4

Dave


From darryl.dixon at winterhouseconsulting.com  Mon Oct  8 22:28:14 2007
From: darryl.dixon at winterhouseconsulting.com (Darryl Dixon - Winterhouse Consulting)
Date: Tue, 9 Oct 2007 11:28:14 +1300 (NZDT)
Subject: [Linux-cluster] LVM2 cluster safety
Message-ID: <33088.203.97.62.217.1191882494.squirrel@services.directender.co.nz>

Hi All,

I'm trying to understand from the sourcecode and from previous discussions
on this list exactly which parts of a 'standard' LVM2 setup are 'unsafe'
in an active/active GFS or OCFS cluster scenario.

In other words, if I have a single LV, in a single VG, on a single PV, on
a single LUN seen by two hosts via an FC SAN, with GFS or OCFS on top of
it, and both hosts writing data, and no changes to the VG metadata at all,
then where are the points of risk?

>From what I can understand of the CLVM daemon, it is entirely concerned
with serialising ~metadata~ updates, and writes to the LV are assumed to
be safe as long as they are going through a cluster-aware filesystem on
top?


many regards,
Darryl Dixon
Winterhouse Consulting Ltd
http://www.winterhouseconsulting.com


From isplist at logicore.net  Mon Oct  8 23:19:42 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Mon, 8 Oct 2007 18:19:42 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
Message-ID: <2007108181942.391281@leena>

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


From diegows at gmail.com  Tue Oct  9 00:36:14 2007
From: diegows at gmail.com (Diego Woitasen)
Date: Mon, 8 Oct 2007 21:36:14 -0300
Subject: [Linux-cluster] CLVM with two nodes?
Message-ID: <f5526f350710081736l259e270et47d98fdff968c90@mail.gmail.com>

The question is simple, exists a limitation to use CLVM (Redhat 5)
with only two nodes? Some people say that is not recommeded but I
wan't some experienced opinions.

thanks!

-- 
-------------------
Diego Woitasen
-------------------


From mital at tinyprints.com  Tue Oct  9 02:05:37 2007
From: mital at tinyprints.com (Mital Patel)
Date: Mon, 08 Oct 2007 19:05:37 -0700
Subject: [Linux-cluster] gfs perfomance tuning
Message-ID: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>

Hi,

I am looking for suggestions on how to increase the speed of our GFS
configuration.  We are using GFS on a cluster of 4 web servers with a
EMC AX150i iscsi server backend.  Cluster/GFS is setup and working
properly, however it seems to have horrible performance when multiple
nodes attempt to create files in the same shared directory.  Our
application require us to create many small files on the shared drive,
which according to this site,
http://kbase.redhat.com/faq/FAQ_78_3152.shtm is not the best environment
to use GFS.  I am exploring alternatives, but if we can get GFS to a
point where it's usable, we want to stick with it and hope GFS2
increases the performance even more.

I have read through some previous threads and it seems like I can eek
some performance by increasing the gfs_scand interval, is that correct?
Are there any other settings that would help with performance?

System Info:
Centos 4.5
kernel 2.6.9-55.0.6.ELsmp
GFS-kernel-smp-2.6.9-72.2.0.7
GFS-6.1.14-0
dlm-1.0.3-1
dlm-kernel-smp-2.6.9-46.16.0.8


gfs_tool gettune:
ilimit1 = 100
ilimit1_tries = 3
ilimit1_min = 1
ilimit2 = 500
ilimit2_tries = 10
ilimit2_min = 3
demote_secs = 300
incore_log_blocks = 1024
jindex_refresh_secs = 60
depend_secs = 60
scand_secs = 5
recoverd_secs = 60
logd_secs = 1
quotad_secs = 5
inoded_secs = 15
glock_purge = 0
quota_simul_sync = 64
quota_warn_period = 10
atime_quantum = 3600
quota_quantum = 60
quota_scale = 1.0000   (1, 1)
quota_enforce = 0
quota_account = 0
new_files_jdata = 0
new_files_directio = 0
max_atomic_write = 4194304
max_readahead = 262144
lockdump_size = 131072
stall_secs = 600
complain_secs = 10
reclaim_limit = 5000
entries_per_readdir = 32
prefetch_secs = 10
statfs_slots = 64
max_mhc = 10000
greedy_default = 100
greedy_quantum = 25
greedy_max = 250
rgrp_try_threshold = 100
statfs_fast = 0


Mital


From mital at tinyprints.com  Tue Oct  9 02:22:05 2007
From: mital at tinyprints.com (Mital Patel)
Date: Mon, 08 Oct 2007 19:22:05 -0700
Subject: [Linux-cluster] gfs perfomance tuning
In-Reply-To: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
Message-ID: <1191896525.6319.54.camel@pc-mpatel.tinyprints.com>

Some extra info:

GFS is mounted with noquota,noatime options

/proc/cluster/lock_dlm/drop_count = 200000

Mital


On Mon, 2007-10-08 at 19:05 -0700, Mital Patel wrote:
> Hi,
> 
> I am looking for suggestions on how to increase the speed of our GFS
> configuration.  We are using GFS on a cluster of 4 web servers with a
> EMC AX150i iscsi server backend.  Cluster/GFS is setup and working
> properly, however it seems to have horrible performance when multiple
> nodes attempt to create files in the same shared directory.  Our
> application require us to create many small files on the shared drive,
> which according to this site,
> http://kbase.redhat.com/faq/FAQ_78_3152.shtm is not the best environment
> to use GFS.  I am exploring alternatives, but if we can get GFS to a
> point where it's usable, we want to stick with it and hope GFS2
> increases the performance even more.
> 
> I have read through some previous threads and it seems like I can eek
> some performance by increasing the gfs_scand interval, is that correct?
> Are there any other settings that would help with performance?
> 
> System Info:
> Centos 4.5
> kernel 2.6.9-55.0.6.ELsmp
> GFS-kernel-smp-2.6.9-72.2.0.7
> GFS-6.1.14-0
> dlm-1.0.3-1
> dlm-kernel-smp-2.6.9-46.16.0.8
> 
> 
> gfs_tool gettune:
> ilimit1 = 100
> ilimit1_tries = 3
> ilimit1_min = 1
> ilimit2 = 500
> ilimit2_tries = 10
> ilimit2_min = 3
> demote_secs = 300
> incore_log_blocks = 1024
> jindex_refresh_secs = 60
> depend_secs = 60
> scand_secs = 5
> recoverd_secs = 60
> logd_secs = 1
> quotad_secs = 5
> inoded_secs = 15
> glock_purge = 0
> quota_simul_sync = 64
> quota_warn_period = 10
> atime_quantum = 3600
> quota_quantum = 60
> quota_scale = 1.0000   (1, 1)
> quota_enforce = 0
> quota_account = 0
> new_files_jdata = 0
> new_files_directio = 0
> max_atomic_write = 4194304
> max_readahead = 262144
> lockdump_size = 131072
> stall_secs = 600
> complain_secs = 10
> reclaim_limit = 5000
> entries_per_readdir = 32
> prefetch_secs = 10
> statfs_slots = 64
> max_mhc = 10000
> greedy_default = 100
> greedy_quantum = 25
> greedy_max = 250
> rgrp_try_threshold = 100
> statfs_fast = 0
> 
> 
> 
> 
> 
> Mital
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-- 
Mital Patel
Systems Administrator
Tiny Prints Inc


From grimme at atix.de  Tue Oct  9 06:16:59 2007
From: grimme at atix.de (Marc Grimme)
Date: Tue, 9 Oct 2007 08:16:59 +0200
Subject: [Linux-cluster] gfs perfomance tuning
In-Reply-To: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
Message-ID: <200710090817.01139.grimme@atix.de>

Hi Mital,
enable glock_purging and have a look at here:
http://www.opensharedroot.org/Members/marc/blog/blog-on-gfs
http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm
try to adjust the rsbtbl_size and lkbtbl_size. That will help when having many 
files.
Regards Marc.
On Tuesday 09 October 2007 04:05:37 Mital Patel wrote:
> Hi,
>
> I am looking for suggestions on how to increase the speed of our GFS
> configuration.  We are using GFS on a cluster of 4 web servers with a
> EMC AX150i iscsi server backend.  Cluster/GFS is setup and working
> properly, however it seems to have horrible performance when multiple
> nodes attempt to create files in the same shared directory.  Our
> application require us to create many small files on the shared drive,
> which according to this site,
> http://kbase.redhat.com/faq/FAQ_78_3152.shtm is not the best environment
> to use GFS.  I am exploring alternatives, but if we can get GFS to a
> point where it's usable, we want to stick with it and hope GFS2
> increases the performance even more.
>
> I have read through some previous threads and it seems like I can eek
> some performance by increasing the gfs_scand interval, is that correct?
> Are there any other settings that would help with performance?
>
> System Info:
> Centos 4.5
> kernel 2.6.9-55.0.6.ELsmp
> GFS-kernel-smp-2.6.9-72.2.0.7
> GFS-6.1.14-0
> dlm-1.0.3-1
> dlm-kernel-smp-2.6.9-46.16.0.8
>
>
> gfs_tool gettune:
> ilimit1 = 100
> ilimit1_tries = 3
> ilimit1_min = 1
> ilimit2 = 500
> ilimit2_tries = 10
> ilimit2_min = 3
> demote_secs = 300
> incore_log_blocks = 1024
> jindex_refresh_secs = 60
> depend_secs = 60
> scand_secs = 5
> recoverd_secs = 60
> logd_secs = 1
> quotad_secs = 5
> inoded_secs = 15
> glock_purge = 0
> quota_simul_sync = 64
> quota_warn_period = 10
> atime_quantum = 3600
> quota_quantum = 60
> quota_scale = 1.0000   (1, 1)
> quota_enforce = 0
> quota_account = 0
> new_files_jdata = 0
> new_files_directio = 0
> max_atomic_write = 4194304
> max_readahead = 262144
> lockdump_size = 131072
> stall_secs = 600
> complain_secs = 10
> reclaim_limit = 5000
> entries_per_readdir = 32
> prefetch_secs = 10
> statfs_slots = 64
> max_mhc = 10000
> greedy_default = 100
> greedy_quantum = 25
> greedy_max = 250
> rgrp_try_threshold = 100
> statfs_fast = 0
>
>
>
>
>
> Mital
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 452 3538-14
http://www.atix.de/               http://www.open-sharedroot.org/

**

Visit us at LinuxWorld Conference & Expo 
31.10. - 01.11.2007 in Jaarbeurs Utrecht - The Netherlands
ATIX stand: Hall 9 / B 005

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax:   +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand: 
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss


From gordan at bobich.net  Tue Oct  9 07:59:28 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 9 Oct 2007 08:59:28 +0100 (BST)
Subject: [Linux-cluster] mounting problem on RHEL5
In-Reply-To: <912218.242.qm@web54405.mail.yahoo.com>
References: <912218.242.qm@web54405.mail.yahoo.com>
Message-ID: <Pine.LNX.4.64.0710090856090.8118@skynet.shatteredsilicon.net>

Not sure, but you'll find that GFS2 is still unstable. The required 
bug-fixes haven't made it to the full release stage yet. RHEL5 stuff is 
not for production use. Use GFS1 instead.

With the latest GFS2 packages on FC7, I can consistently achieve a 
permanent (as in time to reboot the whole cluster) dead-lock within about 
10 seconds of fairly light use.

Gordan

On Mon, 8 Oct 2007, John Vijoe George wrote:

> I have a four node cluster with four storage nodes as well. I created 4 volume groups vg1,vg2,vg3,vg4 from the storage nodes and used mkfs.gfs2 to make a gfs2 filesystem.
>
> # mkfs.gfs2 -t c1:gfs1 -p lock_dlm -j 4 -r 2048 -J 128 /dev/vg1/lv0
>
> I repeated the above for each volume group. I then mounted vg1 in to /mnt1 of one of the cluster nodes as follows:
> # mount -t gfs2 -o quota=off /dev/vg1/lv0 /mnt1
>
> It succeeded in mounting. But, when I try to mount the next volume group, I get an error as follows:
>
> #mount -t gfs2 -o quota=off /dev/vg4/lv0 /mnt4
> /sbin/mount.gfs2: lock_dlm_join: gfs_controld join error: -22
> /sbin/mount.gfs2: error mounting lockproto lock_dlm
>
>
> What could be the reason behind it? Am I messing up something related to lock_dlm? Are my options to gfs2.mkfs right? Any help is appreciated.
>
> The following gfs modules were loaded:
>
> # lsmod | grep gfs
> gfs2                  519988  2 lock_dlm
> configfs               62236  2 dlm
>
>
> Regards,
> John G
>
>
>
>
>
> ____________________________________________________________________________________
> Need a vacation? Get great deals
> to amazing places on Yahoo! Travel.
> http://travel.yahoo.com/
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From gordan at bobich.net  Tue Oct  9 08:57:25 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 9 Oct 2007 09:57:25 +0100 (BST)
Subject: [Linux-cluster] LVM2 cluster safety
In-Reply-To: <33088.203.97.62.217.1191882494.squirrel@services.directender.co.nz>
References: <33088.203.97.62.217.1191882494.squirrel@services.directender.co.nz>
Message-ID: <Pine.LNX.4.64.0710090956060.8118@skynet.shatteredsilicon.net>

On Tue, 9 Oct 2007, Darryl Dixon - Winterhouse Consulting wrote:

> I'm trying to understand from the sourcecode and from previous discussions
> on this list exactly which parts of a 'standard' LVM2 setup are 'unsafe'
> in an active/active GFS or OCFS cluster scenario.
>
> In other words, if I have a single LV, in a single VG, on a single PV, on
> a single LUN seen by two hosts via an FC SAN, with GFS or OCFS on top of
> it, and both hosts writing data, and no changes to the VG metadata at all,
> then where are the points of risk?
>
>> From what I can understand of the CLVM daemon, it is entirely concerned
> with serialising ~metadata~ updates, and writes to the LV are assumed to
> be safe as long as they are going through a cluster-aware filesystem on
> top?

My understanding that LVM is not required for clustering, unless you plan 
to grow the FS across multiple volumes. I am not aware of it having a 
functional effect on the functioning of GFS (other than the fact that it 
may need special features to make sure that it is GFS safe).

Gordan


From gordan at bobich.net  Tue Oct  9 09:06:19 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 9 Oct 2007 10:06:19 +0100 (BST)
Subject: [Linux-cluster] gfs perfomance tuning
In-Reply-To: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
Message-ID: <Pine.LNX.4.64.0710091005320.8118@skynet.shatteredsilicon.net>

You may find that mounting the gfs file system with "noatime" option helps 
a bit.

On Mon, 8 Oct 2007, Mital Patel wrote:

> Hi,
>
> I am looking for suggestions on how to increase the speed of our GFS
> configuration.  We are using GFS on a cluster of 4 web servers with a
> EMC AX150i iscsi server backend.  Cluster/GFS is setup and working
> properly, however it seems to have horrible performance when multiple
> nodes attempt to create files in the same shared directory.  Our
> application require us to create many small files on the shared drive,
> which according to this site,
> http://kbase.redhat.com/faq/FAQ_78_3152.shtm is not the best environment
> to use GFS.  I am exploring alternatives, but if we can get GFS to a
> point where it's usable, we want to stick with it and hope GFS2
> increases the performance even more.
>
> I have read through some previous threads and it seems like I can eek
> some performance by increasing the gfs_scand interval, is that correct?
> Are there any other settings that would help with performance?
>
> System Info:
> Centos 4.5
> kernel 2.6.9-55.0.6.ELsmp
> GFS-kernel-smp-2.6.9-72.2.0.7
> GFS-6.1.14-0
> dlm-1.0.3-1
> dlm-kernel-smp-2.6.9-46.16.0.8
>
>
> gfs_tool gettune:
> ilimit1 = 100
> ilimit1_tries = 3
> ilimit1_min = 1
> ilimit2 = 500
> ilimit2_tries = 10
> ilimit2_min = 3
> demote_secs = 300
> incore_log_blocks = 1024
> jindex_refresh_secs = 60
> depend_secs = 60
> scand_secs = 5
> recoverd_secs = 60
> logd_secs = 1
> quotad_secs = 5
> inoded_secs = 15
> glock_purge = 0
> quota_simul_sync = 64
> quota_warn_period = 10
> atime_quantum = 3600
> quota_quantum = 60
> quota_scale = 1.0000   (1, 1)
> quota_enforce = 0
> quota_account = 0
> new_files_jdata = 0
> new_files_directio = 0
> max_atomic_write = 4194304
> max_readahead = 262144
> lockdump_size = 131072
> stall_secs = 600
> complain_secs = 10
> reclaim_limit = 5000
> entries_per_readdir = 32
> prefetch_secs = 10
> statfs_slots = 64
> max_mhc = 10000
> greedy_default = 100
> greedy_quantum = 25
> greedy_max = 250
> rgrp_try_threshold = 100
> statfs_fast = 0
>
>
>
>
>
> Mital
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From pcaulfie at redhat.com  Tue Oct  9 09:07:21 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 09 Oct 2007 10:07:21 +0100
Subject: [Linux-cluster] LVM2 cluster safety
In-Reply-To: <33088.203.97.62.217.1191882494.squirrel@services.directender.co.nz>
References: <33088.203.97.62.217.1191882494.squirrel@services.directender.co.nz>
Message-ID: <470B44C9.7050508@redhat.com>

Darryl Dixon - Winterhouse Consulting wrote:
> Hi All,
> 
> I'm trying to understand from the sourcecode and from previous discussions
> on this list exactly which parts of a 'standard' LVM2 setup are 'unsafe'
> in an active/active GFS or OCFS cluster scenario.
> 
> In other words, if I have a single LV, in a single VG, on a single PV, on
> a single LUN seen by two hosts via an FC SAN, with GFS or OCFS on top of
> it, and both hosts writing data, and no changes to the VG metadata at all,
> then where are the points of risk?
> 
>>From what I can understand of the CLVM daemon, it is entirely concerned
> with serialising ~metadata~ updates, and writes to the LV are assumed to
> be safe as long as they are going through a cluster-aware filesystem on
> top?

That's correct. If you are never going to change the LVM metadata then you don't
need clvm. lvm has no impact on data sharing at all, that's the job of the
filesystem (GFS/OCFS etc).

-- 
Patrick


From rnevins at workflowbydesign.com  Tue Oct  9 12:08:27 2007
From: rnevins at workflowbydesign.com (Rob Nevins)
Date: Tue, 09 Oct 2007 08:08:27 -0400
Subject: [Linux-cluster] gfs perfomance tuning
In-Reply-To: <Pine.LNX.4.64.0710091005320.8118@skynet.shatteredsilicon.net>
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
	<Pine.LNX.4.64.0710091005320.8118@skynet.shatteredsilicon.net>
Message-ID: <470B6F3B.4080205@workflowbydesign.com>

Gordan Bobic wrote:
> You may find that mounting the gfs file system with "noatime" option 
> helps a bit.
I like the idea of adding in noatime to our gfs share, but with three 
nodes running, what would be the proper way to add in "noatime" to 
fstab, unmount and remount.  Can it be one node at a time?  Or, should I 
unmount the gfs from all nodes, update and remount?

-Rob


From Alain.Moulle at bull.net  Tue Oct  9 12:43:50 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 09 Oct 2007 14:43:50 +0200
Subject: [Linux-cluster] CS4 U4/ Question about heuristic
Message-ID: <470B7786.1090005@bull.net>

Hi

I wonder if we can use heuristic fonctions without a real quorum disk working ?
Even if the record quorumd in cluster.conf is mandatory, perhaps using vote=0
for quorumd, just to have the benefit of heuristics ...

The goal is to monitor 3 networks in addition to the Heart-beat network
and to failover if one of this 3 networks does not give response to ping.

???
Thanks
Alain Moull?


From Alexandre.Racine at mhicc.org  Tue Oct  9 12:52:30 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Tue, 9 Oct 2007 08:52:30 -0400
Subject: [Linux-cluster] gfs perfomance tuning
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
	<200710090817.01139.grimme@atix.de>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C34E2@cumulonimbus.RG.local>


http://sourceware.org/cluster/faq.html#gfs_tuning


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Marc Grimme
Sent: Tue 2007-10-09 02:16
To: linux-cluster at redhat.com; mital at tinyprints.com
Subject: Re: [Linux-cluster] gfs perfomance tuning
 
Hi Mital,
enable glock_purging and have a look at here:
http://www.opensharedroot.org/Members/marc/blog/blog-on-gfs
http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm
try to adjust the rsbtbl_size and lkbtbl_size. That will help when having many 
files.
Regards Marc.
On Tuesday 09 October 2007 04:05:37 Mital Patel wrote:
> Hi,
>
> I am looking for suggestions on how to increase the speed of our GFS
> configuration.  We are using GFS on a cluster of 4 web servers with a
> EMC AX150i iscsi server backend.  Cluster/GFS is setup and working
> properly, however it seems to have horrible performance when multiple
> nodes attempt to create files in the same shared directory.  Our
> application require us to create many small files on the shared drive,
> which according to this site,
> http://kbase.redhat.com/faq/FAQ_78_3152.shtm is not the best environment
> to use GFS.  I am exploring alternatives, but if we can get GFS to a
> point where it's usable, we want to stick with it and hope GFS2
> increases the performance even more.
>
> I have read through some previous threads and it seems like I can eek
> some performance by increasing the gfs_scand interval, is that correct?
> Are there any other settings that would help with performance?
>
> System Info:
> Centos 4.5
> kernel 2.6.9-55.0.6.ELsmp
> GFS-kernel-smp-2.6.9-72.2.0.7
> GFS-6.1.14-0
> dlm-1.0.3-1
> dlm-kernel-smp-2.6.9-46.16.0.8
>
>
> gfs_tool gettune:
> ilimit1 = 100
> ilimit1_tries = 3
> ilimit1_min = 1
> ilimit2 = 500
> ilimit2_tries = 10
> ilimit2_min = 3
> demote_secs = 300
> incore_log_blocks = 1024
> jindex_refresh_secs = 60
> depend_secs = 60
> scand_secs = 5
> recoverd_secs = 60
> logd_secs = 1
> quotad_secs = 5
> inoded_secs = 15
> glock_purge = 0
> quota_simul_sync = 64
> quota_warn_period = 10
> atime_quantum = 3600
> quota_quantum = 60
> quota_scale = 1.0000   (1, 1)
> quota_enforce = 0
> quota_account = 0
> new_files_jdata = 0
> new_files_directio = 0
> max_atomic_write = 4194304
> max_readahead = 262144
> lockdump_size = 131072
> stall_secs = 600
> complain_secs = 10
> reclaim_limit = 5000
> entries_per_readdir = 32
> prefetch_secs = 10
> statfs_slots = 64
> max_mhc = 10000
> greedy_default = 100
> greedy_quantum = 25
> greedy_max = 250
> rgrp_try_threshold = 100
> statfs_fast = 0
>
>
>
>
>
> Mital
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 452 3538-14
http://www.atix.de/               http://www.open-sharedroot.org/

**

Visit us at LinuxWorld Conference & Expo 
31.10. - 01.11.2007 in Jaarbeurs Utrecht - The Netherlands
ATIX stand: Hall 9 / B 005

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax:   +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand: 
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 4706 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071009/e4dae887/attachment.bin>

From Alain.Moulle at bull.net  Tue Oct  9 14:35:27 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 09 Oct 2007 16:35:27 +0200
Subject: [Linux-cluster] CS4 U5 / Question about status
Message-ID: <470B91AF.4060206@bull.net>

Hi

When CS4 performs its status on service, is there a timeout value
so that if the status does not return any response, the CS4 decides
that the status is KO ?

Thanks
Alain Moull?


From fs at lowpingbastards.de  Tue Oct  9 15:26:19 2007
From: fs at lowpingbastards.de (Frederik Schueler)
Date: Tue, 9 Oct 2007 17:26:19 +0200
Subject: [Linux-cluster] GFS 1.04: fatal: assertion "x <= length" failed
In-Reply-To: <1191875216.8820.24.camel@technetium.msp.redhat.com>
References: <20071008193343.GB29195@mail.lowpingbastards.de>
	<1191875216.8820.24.camel@technetium.msp.redhat.com>
Message-ID: <20071009152619.GA32032@mail.lowpingbastards.de>

Hello Bob,

First of all, thanks for the detailed answer.

On Mon, Oct 08, 2007 at 03:26:56PM -0500, Bob Peterson wrote:
> This is odd.  What it means is this:  GFS was searching for a free block
> to allocate.  The resource group ("RG"--not to be confused with
> rgmanager's resource groups) indicated at least one free block for that
> section of the file system, but there were no free blocks to be found in
> the bitmap for that section (a direct contradiction).  Therefore, the
> file system was determined to be corrupt.

What explains the files in l+f. 
The files where created  a day before the crash, they disappeared some time 
later and where recreated, but I was not informed about this before the 
crash happened. 

In this particular setup, the "storage" is a 2 nodes heartbeat(2.1.2) 
failover cluster exporting a drbd 8.2.0 device via iscsi (iscsitarget
0.4.15). 
The systems are rhel5 with stock 2.6.18-8.1.14.el5 kernel, drbd and 
heartbeat selfcompiled.

The iscsi target is exported without data and header digest, I will
switch it to crc32c now, to rule out the network.

 
>  (3) Another possibility is a hardware problem
> with your media--the hard drives, FC switch, HBAs, etc.  This could
> happen, for example, if GFS read the bitmap(s) from disk and the disk
> returned the wrong information.  We've seen a lot of that, and the best
> thing to do is test the media (but it's a tedious and sometimes
> destructive task).

I wonder if a drbd failover might cause something like this, manually
switching the resources to the backup node: the drbd share is made
secondary, iscsi-target stopped, the service IP removed, and on the
other node the IP activated, the drbd device made primary and
iscsitarget started. The clients issue a 

 connection0:0: iscsi: detected conn error (1011)

and everything continues after the switch, which takes 4-5 seconds.

OTOH, there was no failover since before the gfs filesystem was formatted and 
populated, only some scheduled node reboots.

I wonder if broken memory in one of the nodes or the iscsi servers could
be to blame, but I did not see segfaults or MCEs anywhere, GFS does
direct-I/O and we had iscsitarget set to blockio, which is uncached 
direct-I/O too.

> If you had not run gfs_fsck, we might have been able to tell a little
> bit more about what happened from the contents of the journals.
> For example, in RHEL5 and equivalent, you can use gfs2_edit to save
> off the file system metadata and send it in for analysis.
> (gfs2_edit can operate on gfs1 file systems as well as gfs2).  However,
> Since gfs_fsck clears the journals, that information is now long gone.
 
Oh, thanks for the hint, I'll do this in case this happens again.


Best regards
Frederik Sch?ler

-- 
ENOSIG
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071009/704f999d/attachment.sig>

From cosmih at freeland.ro  Tue Oct  9 15:38:40 2007
From: cosmih at freeland.ro (cosmih)
Date: Tue, 09 Oct 2007 18:38:40 +0300
Subject: [Linux-cluster] Trouble finding openais include dir in RHEL5
	compile
Message-ID: <1191944320.2798.17.camel@mihaico.gecadco.local>

>> Thanks Steve. That gets me to the dlm dir, which fails with the errors
>> below. This is cvs RHEL5 against the stock RHEL5 kernel 2.6.18-8.el5 
>> 
>> Any thoughts on this? I googled this failure, but nothing relevant comes
>> up.
>
>The RHEL50 cvs branch is used for the current RHEL 5.0 release, which it
>sounds like you have.
>
>The RHEL5 cvs branch is in development, to be used for the RHEL 5.1
>release.  The RHEL 5.1 kernel includes the new dlm user/kernel interface
>which is in the upstream kernel.  If you want to use the RHEL5 cvs branch,
>you'd need the kernel for RHEL 5.1 (which is still in development and not
>accessible AFAIK.)
>
>The cvs HEAD should be used with upstream 2.6.22-rc kernels.  Doing this
>requires that you also copy the new dlm kernel headers into the system
>include directory:
>cp /usr/src/linux-2.6.22-rcX/include/linux/dlm*  /usr/include/linux/
>and then remove the __user tags from some them (or do as Patrick suggested).
>

hi,

i tryied to install cluster-2.01.00 on gentoo and archlinux linux distribution ... but i have
   the same errors
i have linux-2.6.22 headers and gcc 4.1.2 on gentoo machines and gcc 4.2.1 on archlinux machines
i successfully installed openais-0.80.3 (make && make install DESTDIR=/)

here is the configure command:
./configure --prefix=/usr/local/cluster --libdir=/usr/local/cluster/lib \
--libexecdir=/usr/local/cluster/libexec --mandir=/usr/local/cluster/man \
--sharedir=/usr/local/cluster/share --incdir=/usr/local/cluster/include  \
--openaisincdir=/usr/include --openaislibdir=/usr/lib/openais \
--ncursesincdir=/usr/include --ncurseslibdir=/usr/lib \
--readlineincdir=/usr/include/readline --readlinelibdir=/usr/lib \
--nssincdir=/usr/include/nss --nsslibdir=/usr/lib/nss  \
--nsprincdir=/usr/include/nspr --nsprlibdir=/usr/lib/nspr \
--cflags="-O2 -march=i686 -mtune=i686 -fomit-frame-pointer"


here is the errors
------------------------
make[1]: Entering directory `/root/cluster-2.01.00/dlm'
set -e && \
        for i in lib tool; do \
                make -C $i all; \
        done
make[2]: Entering directory `/root/cluster-2.01.00/dlm/lib'
gcc -O2 -march=i686 -mtune=i686 -fomit-frame-pointer -I/root/cluster-2.01.00/config -g -O2  -I. -I/usr/local/cluster/include -D_REENTRANT -c -o libdlm.o libdlm.c
libdlm.c: In function 'set_version_v5':
libdlm.c:324: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:325: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:326: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'set_version_v6':
libdlm.c:335: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:336: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:337: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'detect_kernel_version':
libdlm.c:443: error: storage size of 'v' isn't known
libdlm.c:446: error: invalid application of 'sizeof' to incomplete type 'struct dlm_device_version' 
libdlm.c:448: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:449: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:450: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:452: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:453: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:454: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'do_dlm_dispatch':
libdlm.c:590: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'ls_lock_v6':
libdlm.c:835: error: 'struct dlm_lock_params' has no member named 'xid'
libdlm.c:837: error: 'struct dlm_lock_params' has no member named 'timeout'
libdlm.c: In function 'ls_lock':
libdlm.c:892: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'dlm_ls_lockx':
libdlm.c:916: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'dlm_ls_unlock':
libdlm.c:1067: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'dlm_ls_deadlock_cancel':
libdlm.c:1099: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:1115: error: 'DLM_USER_DEADLOCK' undeclared (first use in this function)
libdlm.c:1115: error: (Each undeclared identifier is reported only once
libdlm.c:1115: error: for each function it appears in.)
libdlm.c: In function 'dlm_ls_purge':
libdlm.c:1134: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'create_lockspace':
libdlm.c:1311: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'release_lockspace':
libdlm.c:1417: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c: In function 'dlm_kernel_version':
libdlm.c:1503: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:1504: error: invalid use of undefined type 'struct dlm_device_version'
libdlm.c:1505: error: invalid use of undefined type 'struct dlm_device_version'
make[2]: *** [libdlm.o] Error 1
make[2]: Leaving directory `/root/cluster-2.01.00/dlm/lib'
make[1]: *** [all] Error 2
make[1]: Leaving directory `/root/cluster-2.01.00/dlm'
make: *** [dlm] Error 2
-------------------------------

can someone give me some hints ?

regards,
cosmih
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071009/bd662b22/attachment.htm>

From fs at lowpingbastards.de  Tue Oct  9 15:55:12 2007
From: fs at lowpingbastards.de (Frederik Schueler)
Date: Tue, 9 Oct 2007 17:55:12 +0200
Subject: [Linux-cluster] gfs perfomance tuning
In-Reply-To: <470B6F3B.4080205@workflowbydesign.com>
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
	<Pine.LNX.4.64.0710091005320.8118@skynet.shatteredsilicon.net>
	<470B6F3B.4080205@workflowbydesign.com>
Message-ID: <20071009155512.GB32032@mail.lowpingbastards.de>

Hello,

On Tue, Oct 09, 2007 at 08:08:27AM -0400, Rob Nevins wrote:
> I like the idea of adding in noatime to our gfs share, but with three 
> nodes running, what would be the proper way to add in "noatime" to 
> fstab, unmount and remount.  Can it be one node at a time?  Or, should I 
> unmount the gfs from all nodes, update and remount?

mount -o remount,noatime /gfs/mount

should do the trick for the running nodes. And add it to
fstab.

Best regards
Frederik Sch?ler

-- 
ENOSIG


From JFillman at cucbc.com  Tue Oct  9 23:52:01 2007
From: JFillman at cucbc.com (James Fillman)
Date: Tue, 9 Oct 2007 16:52:01 -0700
Subject: [Linux-cluster] GFS problems!!!
Message-ID: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>

Ok. I'm trying to implement GFS on two different clusters: 9 nodes, 17
nodes.

I'm having nothing but troubles. The gfs volumes are freezing and
throwing the cluster into a bad state. Currently, this is the state of
my cluster:

[root at plxp01md-new log]# cman_tool services
type             level name     id       state
fence            0     default  00010004 none
[1 2 3 4 5 6 7 8 9]
dlm              1     clvmd    00010003 none
[1 2 3 4 5 6 7 8 9]
dlm              1     mdi_log  00020001 FAIL_START_WAIT
[1 2 3 4 6 7 8 9]
dlm              1     deploy   00040001 FAIL_START_WAIT
[1 4 6 7 8 9]
gfs              2     mdi_log  00010001 FAIL_START_WAIT
[1 2 3 4 6 7 8 9]
gfs              2     deploy   00030001 FAIL_START_WAIT

I have no idea what happened. I've got users who are writing to a gfs
volume and just came and reported to me that the volumes not responding.
/var/log/messages has been outputting the following message, about 50
times a second,  since Friday:

Oct  9 13:54:35 plxp01deploy kernel: dlm: recover_master_copy -53 401ce

Can someone tell me what FAIL_START_WAIT means and how to recover from
it? Also, does anyone know what the log message above means?

All my servers in the cluster are showing the same service states.

I'm running RHEL5-64 bit. 

please help. I'm almost ready to give up on GFS. It seems way too
unstable.

James Fillman


From sdake at redhat.com  Wed Oct 10 02:07:47 2007
From: sdake at redhat.com (Steven Dake)
Date: Tue, 09 Oct 2007 19:07:47 -0700
Subject: [Linux-cluster] GFS problems!!!
In-Reply-To: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>
References: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>
Message-ID: <1191982068.3552.12.camel@balance>

please include /var/log/messages from one system as well as group_tool
dump on one of the crashed nodes.  What brand/model of switch are you
using?

Regards
-steve

Could you use On Tue, 2007-10-09 at 16:52 -0700, James Fillman wrote:
> Ok. I'm trying to implement GFS on two different clusters: 9 nodes, 17
> nodes.
> 
> I'm having nothing but troubles. The gfs volumes are freezing and
> throwing the cluster into a bad state. Currently, this is the state of
> my cluster:
> 
> [root at plxp01md-new log]# cman_tool services
> type             level name     id       state
> fence            0     default  00010004 none
> [1 2 3 4 5 6 7 8 9]
> dlm              1     clvmd    00010003 none
> [1 2 3 4 5 6 7 8 9]
> dlm              1     mdi_log  00020001 FAIL_START_WAIT
> [1 2 3 4 6 7 8 9]
> dlm              1     deploy   00040001 FAIL_START_WAIT
> [1 4 6 7 8 9]
> gfs              2     mdi_log  00010001 FAIL_START_WAIT
> [1 2 3 4 6 7 8 9]
> gfs              2     deploy   00030001 FAIL_START_WAIT
> 
> I have no idea what happened. I've got users who are writing to a gfs
> volume and just came and reported to me that the volumes not responding.
> /var/log/messages has been outputting the following message, about 50
> times a second,  since Friday:
> 
> Oct  9 13:54:35 plxp01deploy kernel: dlm: recover_master_copy -53 401ce
> 
> Can someone tell me what FAIL_START_WAIT means and how to recover from
> it? Also, does anyone know what the log message above means?
> 
> All my servers in the cluster are showing the same service states.
> 
> I'm running RHEL5-64 bit. 
> 
> please help. I'm almost ready to give up on GFS. It seems way too
> unstable.
> 
> James Fillman
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From kadlec at sunserv.kfki.hu  Wed Oct 10 10:28:16 2007
From: kadlec at sunserv.kfki.hu (Kadlecsik Jozsi)
Date: Wed, 10 Oct 2007 12:28:16 +0200 (MEST)
Subject: [Linux-cluster] IPv6 enabled: out of memory!
Message-ID: <Pine.GSO.4.64.0710101214580.25388@sunserv.kfki.hu>

Hi,

Something really strange happened here: adding one global IPv6 address to 
an ("external") interface of one node caused "out of memory" on *all* of 
our nodes.

All nodes have got two interfaces, one dedicated to GFS and the other is 
for servicing the users. We configured IPv4 addresses for GFS, otherwise 
all interfaces have got link local IPv6 addresses. Now we wanted to 
introduce services over IPv6 as well. Adding an IPv6 address to one 
interface and rebooting the machine for a clean start caused the disaster.

Is it an unsupported configuration? Should we convert to IPv6 exclusively 
for GFS? (As we had trouble with IPv4 multicast for GFS, that'd be a major 
modification.)

What do you suggest, how could we enable IPv6 for services but keeping 
IPv4 for GFS?

We run cluster-2.01.00 over 2.6.23-rc4 kernel. 

Best regards,
Jozsef
--
E-mail : kadlec at sunserv.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From gordan at bobich.net  Wed Oct 10 10:54:04 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 10 Oct 2007 06:54:04 -0400 (EDT)
Subject: [Linux-cluster] gfs perfomance tuning
In-Reply-To: <470B6F3B.4080205@workflowbydesign.com>
References: <1191895537.6319.51.camel@pc-mpatel.tinyprints.com>
	<Pine.LNX.4.64.0710091005320.8118@skynet.shatteredsilicon.net>
	<470B6F3B.4080205@workflowbydesign.com>
Message-ID: <Pine.LNX.4.64.0710100652480.5996@outpost.shatteredsilicon.net>

On Tue, 9 Oct 2007, Rob Nevins wrote:

> Gordan Bobic wrote:
>>  You may find that mounting the gfs file system with "noatime" option helps
>>  a bit.
>
> I like the idea of adding in noatime to our gfs share, but with three nodes 
> running, what would be the proper way to add in "noatime" to fstab, unmount 
> and remount.  Can it be one node at a time?  Or, should I unmount the gfs 
> from all nodes, update and remount?

I can't see why it couldn't be done at run-time.

mount -o noatime,remount /my/gfs/path

on each node in turn. Worst that'll happen is that it'll tell you it won't 
do it.

Failing that, changing defaults to defaults,noatime in /etc/fstab will do 
it when you reboot the cluster.

Gordan


From cosmih at freeland.ro  Wed Oct 10 11:04:14 2007
From: cosmih at freeland.ro (cosmih)
Date: Wed, 10 Oct 2007 14:04:14 +0300
Subject: [Linux-cluster] cluster-2.01.00 compilation problem
In-Reply-To: <1191944320.2798.17.camel@mihaico.gecadco.local>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
Message-ID: <1192014254.17610.2.camel@mihaico.gecadco.local>


> hi,
> 
> i tryied to install cluster-2.01.00 on gentoo and archlinux linux distribution ... but i have
>    the some errors
> i have linux-2.6.22 headers and gcc 4.1.2 on gentoo machines and gcc 4.2.1 on archlinux machines
> i successfully installed openais-0.80.3 (make && make install DESTDIR=/)
> 
> here is the configure command for cluster-2.01.00:
> ./configure --prefix=/usr/local/cluster --libdir=/usr/local/cluster/lib \
> --libexecdir=/usr/local/cluster/libexec --mandir=/usr/local/cluster/man \
> --sharedir=/usr/local/cluster/share --incdir=/usr/local/cluster/include  \
> --openaisincdir=/usr/include --openaislibdir=/usr/lib/openais \
> --ncursesincdir=/usr/include --ncurseslibdir=/usr/lib \
> --readlineincdir=/usr/include/readline --readlinelibdir=/usr/lib \
> --nssincdir=/usr/include/nss --nsslibdir=/usr/lib/nss  \
> --nsprincdir=/usr/include/nspr --nsprlibdir=/usr/lib/nspr \
> --cflags="-O2 -march=i686 -mtune=i686 -fomit-frame-pointer"
> 
> 
> here is the errors from make fence command:
> ------------------------
> make[1]: Entering directory `/root/cluster-2.01.00/dlm'
> set -e && \
>         for i in lib tool; do \
>                 make -C $i all; \
>         done
> make[2]: Entering directory `/root/cluster-2.01.00/dlm/lib'
> gcc -O2 -march=i686 -mtune=i686 -fomit-frame-pointer -I/root/cluster-2.01.00/config -g -O2  -I. -I/usr/local/cluster/include -D_REENTRANT -c -o libdlm.o libdlm.c
> libdlm.c: In function 'set_version_v5':
> libdlm.c:324: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:325: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:326: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'set_version_v6':
> libdlm.c:335: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:336: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:337: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'detect_kernel_version':
> libdlm.c:443: error: storage size of 'v' isn't known
> libdlm.c:446: error: invalid application of 'sizeof' to incomplete type 'struct dlm_device_version' 
> libdlm.c:448: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:449: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:450: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:452: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:453: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:454: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'do_dlm_dispatch':
> libdlm.c:590: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'ls_lock_v6':
> libdlm.c:835: error: 'struct dlm_lock_params' has no member named 'xid'
> libdlm.c:837: error: 'struct dlm_lock_params' has no member named 'timeout'
> libdlm.c: In function 'ls_lock':
> libdlm.c:892: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'dlm_ls_lockx':
> libdlm.c:916: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'dlm_ls_unlock':
> libdlm.c:1067: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'dlm_ls_deadlock_cancel':
> libdlm.c:1099: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:1115: error: 'DLM_USER_DEADLOCK' undeclared (first use in this function)
> libdlm.c:1115: error: (Each undeclared identifier is reported only once
> libdlm.c:1115: error: for each function it appears in.)
> libdlm.c: In function 'dlm_ls_purge':
> libdlm.c:1134: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'create_lockspace':
> libdlm.c:1311: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'release_lockspace':
> libdlm.c:1417: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c: In function 'dlm_kernel_version':
> libdlm.c:1503: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:1504: error: invalid use of undefined type 'struct dlm_device_version'
> libdlm.c:1505: error: invalid use of undefined type 'struct dlm_device_version'
> make[2]: *** [libdlm.o] Error 1
> make[2]: Leaving directory `/root/cluster-2.01.00/dlm/lib'
> make[1]: *** [all] Error 2
> make[1]: Leaving directory `/root/cluster-2.01.00/dlm'
> make: *** [dlm] Error 2
> -------------------------------
> 
> can someone give me some hints ?
> 
> regards,
> cosmih
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071010/006e7909/attachment.htm>

From gordan at bobich.net  Wed Oct 10 12:23:24 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 10 Oct 2007 08:23:24 -0400 (EDT)
Subject: [Linux-cluster] GFS problems!!!
In-Reply-To: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>
References: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>
Message-ID: <Pine.LNX.4.64.0710100822460.5996@outpost.shatteredsilicon.net>

Are you using GFS or GFS2? GFS2 doesn't work yet, at least not in the 
released RPMs.

Gordan

On Tue, 9 Oct 2007, James Fillman wrote:

> Ok. I'm trying to implement GFS on two different clusters: 9 nodes, 17
> nodes.
>
> I'm having nothing but troubles. The gfs volumes are freezing and
> throwing the cluster into a bad state. Currently, this is the state of
> my cluster:
>
> [root at plxp01md-new log]# cman_tool services
> type             level name     id       state
> fence            0     default  00010004 none
> [1 2 3 4 5 6 7 8 9]
> dlm              1     clvmd    00010003 none
> [1 2 3 4 5 6 7 8 9]
> dlm              1     mdi_log  00020001 FAIL_START_WAIT
> [1 2 3 4 6 7 8 9]
> dlm              1     deploy   00040001 FAIL_START_WAIT
> [1 4 6 7 8 9]
> gfs              2     mdi_log  00010001 FAIL_START_WAIT
> [1 2 3 4 6 7 8 9]
> gfs              2     deploy   00030001 FAIL_START_WAIT
>
> I have no idea what happened. I've got users who are writing to a gfs
> volume and just came and reported to me that the volumes not responding.
> /var/log/messages has been outputting the following message, about 50
> times a second,  since Friday:
>
> Oct  9 13:54:35 plxp01deploy kernel: dlm: recover_master_copy -53 401ce
>
> Can someone tell me what FAIL_START_WAIT means and how to recover from
> it? Also, does anyone know what the log message above means?
>
> All my servers in the cluster are showing the same service states.
>
> I'm running RHEL5-64 bit.
>
> please help. I'm almost ready to give up on GFS. It seems way too
> unstable.
>
> James Fillman
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From teigland at redhat.com  Wed Oct 10 13:53:03 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 10 Oct 2007 08:53:03 -0500
Subject: [Linux-cluster] GFS problems!!!
In-Reply-To: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>
References: <66F461DD7EDEEF4AA928FCC80B425B520468BC@c2kp01mail.cucbc.com>
Message-ID: <20071010135303.GA24010@redhat.com>

On Tue, Oct 09, 2007 at 04:52:01PM -0700, James Fillman wrote:
> Ok. I'm trying to implement GFS on two different clusters: 9 nodes, 17
> nodes.
> 
> I'm having nothing but troubles. The gfs volumes are freezing and
> throwing the cluster into a bad state. Currently, this is the state of
> my cluster:
> 
> [root at plxp01md-new log]# cman_tool services
> type             level name     id       state
> fence            0     default  00010004 none
> [1 2 3 4 5 6 7 8 9]
> dlm              1     clvmd    00010003 none
> [1 2 3 4 5 6 7 8 9]
> dlm              1     mdi_log  00020001 FAIL_START_WAIT
> [1 2 3 4 6 7 8 9]
> dlm              1     deploy   00040001 FAIL_START_WAIT
> [1 4 6 7 8 9]
> gfs              2     mdi_log  00010001 FAIL_START_WAIT
> [1 2 3 4 6 7 8 9]
> gfs              2     deploy   00030001 FAIL_START_WAIT

You probably have nodes going down; if you can keep nodes from failing
things will run much better.  'cman_tool nodes' and /var/log/messages may
give us some idea about the source of node failures, whether they are
spurious, if your largish number of nodes are contributing to the
problems.

> I have no idea what happened. I've got users who are writing to a gfs
> volume and just came and reported to me that the volumes not responding.
> /var/log/messages has been outputting the following message, about 50
> times a second,  since Friday:
> 
> Oct  9 13:54:35 plxp01deploy kernel: dlm: recover_master_copy -53 401ce

Ignore those, they are debug messages that were mistakenly directed to
/var/log/messages.

Dave


From teigland at redhat.com  Wed Oct 10 13:58:15 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 10 Oct 2007 08:58:15 -0500
Subject: [Linux-cluster] Trouble finding openais include dir in RHEL5
	compile
In-Reply-To: <1191944320.2798.17.camel@mihaico.gecadco.local>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
Message-ID: <20071010135815.GB24010@redhat.com>

On Tue, Oct 09, 2007 at 06:38:40PM +0300, cosmih wrote:
> i tryied to install cluster-2.01.00 on gentoo and archlinux linux
> distribution ... but i have the same errors i have linux-2.6.22 headers
> and gcc 4.1.2 on gentoo machines and gcc 4.2.1 on archlinux machines i
> successfully installed openais-0.80.3 (make && make install DESTDIR=/)

See the following; cluster-2.01.00 required 2.6.23-rc, which means it
should now work with 2.6.23.

https://www.redhat.com/archives/linux-cluster/2007-July/msg00268.html

Dave


From sys.mailing at gmail.com  Wed Oct 10 14:58:08 2007
From: sys.mailing at gmail.com (Bjorn Oglefjorn)
Date: Wed, 10 Oct 2007 10:58:08 -0400
Subject: [Linux-cluster] nofailback for failover domains?
In-Reply-To: <20070817161446.GR5853@redhat.com>
References: <926ab61b0708060631h5aa3ef0fu2fe183e80df752f6@mail.gmail.com>
	<20070817161446.GR5853@redhat.com>
Message-ID: <926ab61b0710100758n37cc316co5c7433896b6321a@mail.gmail.com>

Thanks Lon.  That fits my use-case just fine.

On 8/17/07, Lon Hohberger <lhh at redhat.com> wrote:
>
> On Mon, Aug 06, 2007 at 09:31:16AM -0400, Bjorn Oglefjorn wrote:
> > I found that a 'nofailback' option was added for the <failoverdomains>
> > section of the conf.  I can't find any reference to 'nofailback' in any
> RHCS
> > doc I can find.  I'm guessing it should look like this:
> >
> >     <failoverdomain name="test_failover_domain" ordered="1"
> restricted="1"
> > nofailback="1">
> >         ...
> >     </failoverdomain>
> >
> > Can someone confirm?  I will attempt to confirm this myself and will
> report
> > back when I know for sure, but it seems to behave as I would expect.
>
> Josef Bacik wrote it; IIRC it works only in ordered failover domains,
> and doesn't prevent relocation to a node in the domain if it's running
> on a node outside the domain.
>
> --
> Lon Hohberger - Software Engineer - Red Hat, Inc.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071010/0857c734/attachment.htm>

From cosmih at freeland.ro  Wed Oct 10 15:11:34 2007
From: cosmih at freeland.ro (cosmih)
Date: Wed, 10 Oct 2007 18:11:34 +0300
Subject: [Linux-cluster] Trouble finding openais include dir in RHEL5
	compile
In-Reply-To: <20071010135815.GB24010@redhat.com>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
	<20071010135815.GB24010@redhat.com>
Message-ID: <1192029094.14854.13.camel@mihaico.gecadco.local>


thanks ... it seems like i don't search enough on mailinglist :(

so .. iti is safe to copy only dlm header from 2.6.23, build and install
cluster-2.01.00 and use 2.6.22 kernel ?

regards,
cosmih

On Wed, 2007-10-10 at 08:58 -0500, David Teigland wrote:

> On Tue, Oct 09, 2007 at 06:38:40PM +0300, cosmih wrote:
> > i tryied to install cluster-2.01.00 on gentoo and archlinux linux
> > distribution ... but i have the same errors i have linux-2.6.22 headers
> > and gcc 4.1.2 on gentoo machines and gcc 4.2.1 on archlinux machines i
> > successfully installed openais-0.80.3 (make && make install DESTDIR=/)
> 
> See the following; cluster-2.01.00 required 2.6.23-rc, which means it
> should now work with 2.6.23.
> 
> https://www.redhat.com/archives/linux-cluster/2007-July/msg00268.html
> 
> Dave
> 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071010/e7d0f867/attachment.htm>

From teigland at redhat.com  Wed Oct 10 15:13:16 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 10 Oct 2007 10:13:16 -0500
Subject: [Linux-cluster] Trouble finding openais include dir in RHEL5
	compile
In-Reply-To: <1192029094.14854.13.camel@mihaico.gecadco.local>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
	<20071010135815.GB24010@redhat.com>
	<1192029094.14854.13.camel@mihaico.gecadco.local>
Message-ID: <20071010151316.GC24010@redhat.com>

On Wed, Oct 10, 2007 at 06:11:34PM +0300, cosmih wrote:
> 
> thanks ... it seems like i don't search enough on mailinglist :(
> 
> so .. iti is safe to copy only dlm header from 2.6.23, build and install
> cluster-2.01.00 and use 2.6.22 kernel ?

No, 2.6.23 is required.

Dave


From pruebas at agro.uba.ar  Wed Oct 10 15:46:06 2007
From: pruebas at agro.uba.ar (Gustavo Marcello)
Date: Wed, 10 Oct 2007 12:46:06 -0300
Subject: [Linux-cluster] Distributed /var/spool/mail
Message-ID: <470CF3BE.6050505@agro.uba.ar>

Hello

My name is Gustavo. I'm from Argentina

I write to you because I need help with following...
I need to distribute a file system between several computers. The idea 
is to create a species of RAID 0 by TCP, in where soon space to this 
unit can be added adding new computers when cluster. You have idea if it 
is possible to do this?

Thanks

Gustavo


From gordan at bobich.net  Wed Oct 10 16:04:54 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 10 Oct 2007 12:04:54 -0400 (EDT)
Subject: [Linux-cluster] Distributed /var/spool/mail
In-Reply-To: <470CF3BE.6050505@agro.uba.ar>
References: <470CF3BE.6050505@agro.uba.ar>
Message-ID: <Pine.LNX.4.64.0710101203590.5996@outpost.shatteredsilicon.net>

On Wed, 10 Oct 2007, Gustavo Marcello wrote:

> I write to you because I need help with following...
> I need to distribute a file system between several computers. The idea is to 
> create a species of RAID 0 by TCP, in where soon space to this unit can be 
> added adding new computers when cluster. You have idea if it is possible to 
> do this?

If you're after non-redundant storage, you probably want to look into JBOD 
RAID stuff for Linux.

Gordan


From cosmih at freeland.ro  Wed Oct 10 16:27:55 2007
From: cosmih at freeland.ro (cosmih)
Date: Wed, 10 Oct 2007 19:27:55 +0300
Subject: [Linux-cluster] Trouble finding openais include dir in RHEL5
	compile
In-Reply-To: <20071010151316.GC24010@redhat.com>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
	<20071010135815.GB24010@redhat.com>
	<1192029094.14854.13.camel@mihaico.gecadco.local>
	<20071010151316.GC24010@redhat.com>
Message-ID: <1192033675.7351.3.camel@mihaico.gecadco.local>


On Wed, 2007-10-10 at 10:13 -0500, David Teigland wrote:

> On Wed, Oct 10, 2007 at 06:11:34PM +0300, cosmih wrote:
> > 
> > thanks ... it seems like i don't search enough on mailinglist :(
> > 
> > so .. iti is safe to copy only dlm header from 2.6.23, build and install
> > cluster-2.01.00 and use 2.6.22 kernel ?
> 
> No, 2.6.23 is required.
> 
> Dave
> 


i still have errors !!!

here is what i was do

1) compiled the new 2.6.23 
2) booted the system with the kernel from above
3) copied /usr/src/linux-2.6.23/include/linux/dlm*
into /usr/include/linux/
4) run the following configure command into cluster-2.01.00 directory

./configure --prefix=/usr/local/cluster --libdir=/usr/local/cluster/lib
\

--libexecdir=/usr/local/cluster/libexec --mandir=/usr/local/cluster/man \
--sharedir=/usr/local/cluster/share --incdir=/usr/local/cluster/include  \
--openaisincdir=/usr/include --openaislibdir=/usr/lib/openais \
--ncursesincdir=/usr/include --ncurseslibdir=/usr/lib \
--readlineincdir=/usr/include/readline --readlinelibdir=/usr/lib \
--nssincdir=/usr/include/nss --nsslibdir=/usr/lib/nss  \
--nsprincdir=/usr/include/nspr --nsprlibdir=/usr/lib/nspr \
--cflags="-O2 -march=i686 -mtune=i686 -fomit-frame-pointer"

5) run "make fence" command and i have the following errors

make[1]: Entering directory `/root/cluster-2.01.00/dlm'
set -e && \
        for i in lib tool; do \
                make -C $i all; \
        done
make[2]: Entering directory `/root/cluster-2.01.00/dlm/lib'
gcc -O2 -march=i686 -mtune=i686 -fomit-frame-pointer -I/root/cluster-2.01.00/config -g -O2  -I. -I/usr/local/cluster/include -D_REENTRANT -c -o libdlm.o libdlm.c
In file included from libdlm.c:48:
/usr/include/linux/dlm_device.h:35: error: expected ?:?, ?,?, ?;?, ?}? or ?__attribute__? before ?*? token
/usr/include/linux/dlm_device.h:78: error: expected ?:?, ?,?, ?;?, ?}? or ?__attribute__? before ?*? token
libdlm.c: In function ?do_dlm_dispatch_v6?:
libdlm.c:570: error: ?struct dlm_lock_result? has no member named ?user_lksb?
libdlm.c:570: error: ?struct dlm_lock_result? has no member named ?lksb?
libdlm.c:574: error: ?struct dlm_lock_result? has no member named ?lvb_offset?
libdlm.c:575: error: ?struct dlm_lock_result? has no member named ?user_lksb?
libdlm.c:576: error: ?struct dlm_lock_result? has no member named ?lvb_offset?
libdlm.c:578: error: ?struct dlm_lock_result? has no member named ?user_lksb?
libdlm.c:578: error: ?struct dlm_lock_result? has no member named ?user_lksb?
libdlm.c:580: error: ?struct dlm_lock_result? has no member named ?user_astaddr?
libdlm.c:581: error: ?struct dlm_lock_result? has no member named ?user_astaddr?
libdlm.c:582: error: ?struct dlm_lock_result? has no member named ?user_astparam?
libdlm.c: In function ?sync_write_v6?:
libdlm.c:652: error: ?struct dlm_lock_params? has no member named ?castaddr?
libdlm.c:653: error: ?struct dlm_lock_params? has no member named ?castparam?
libdlm.c:659: error: ?struct dlm_lock_params? has no member named ?lksb?
libdlm.c:667: error: ?struct dlm_lock_params? has no member named ?castaddr?
libdlm.c:668: error: ?struct dlm_lock_params? has no member named ?castparam?
libdlm.c: In function ?ls_lock_v6?:
libdlm.c:828: error: ?struct dlm_lock_params? has no member named ?lksb?
libdlm.c:829: error: ?struct dlm_lock_params? has no member named ?castaddr?
libdlm.c:830: error: ?struct dlm_lock_params? has no member named ?bastaddr?
libdlm.c:831: error: ?struct dlm_lock_params? has no member named ?castparam?
libdlm.c:832: error: ?struct dlm_lock_params? has no member named ?bastparam?
libdlm.c:847: error: ?struct dlm_lock_params? has no member named ?name?
libdlm.c:851: error: ?struct dlm_lock_params? has no member named ?lvb?
libdlm.c: In function ?ls_unlock_v6?:
libdlm.c:1039: error: ?struct dlm_lock_params? has no member named ?lksb?
libdlm.c:1040: error: ?struct dlm_lock_params? has no member named ?castparam?
libdlm.c:1042: error: ?struct dlm_lock_params? has no member named ?castaddr?
make[2]: *** [libdlm.o] Error 1
make[2]: Leaving directory `/root/cluster-2.01.00/dlm/lib'
make[1]: *** [all] Error 2
make[1]: Leaving directory `/root/cluster-2.01.00/dlm'
make: *** [dlm] Error 2


regards,
cosmih


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071010/d8cc6b83/attachment.htm>

From pruebas at agro.uba.ar  Wed Oct 10 16:35:03 2007
From: pruebas at agro.uba.ar (Gustavo Marcello)
Date: Wed, 10 Oct 2007 13:35:03 -0300
Subject: [Linux-cluster] Distributed /var/spool/mail
In-Reply-To: <Pine.LNX.4.64.0710101203590.5996@outpost.shatteredsilicon.net>
References: <470CF3BE.6050505@agro.uba.ar>
	<Pine.LNX.4.64.0710101203590.5996@outpost.shatteredsilicon.net>
Message-ID: <470CFF37.10000@agro.uba.ar>

Gordan Bobic wrote:
> On Wed, 10 Oct 2007, Gustavo Marcello wrote:
>
>> I write to you because I need help with following...
>> I need to distribute a file system between several computers. The 
>> idea is to create a species of RAID 0 by TCP, in where soon space to 
>> this unit can be added adding new computers when cluster. You have 
>> idea if it is possible to do this?
>
> If you're after non-redundant storage, you probably want to look into 
> JBOD RAID stuff for Linux.
>
> Gordan
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
mmmmm...........according to I have understood, JBOD RAID is for 
concatenating discs in a same server. What I am trying to do is to do a 
great disc joining many servers. The idea is to be able to distribute 
/var/spool/mail in many computers so that to distribute the work of hard 
disk. The pop/imap consulted to that great disc, reading of the server 
who corresponds.

Thank you very much !!!


From teigland at redhat.com  Wed Oct 10 16:30:05 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 10 Oct 2007 11:30:05 -0500
Subject: [Linux-cluster] Trouble finding openais include dir in RHEL5
	compile
In-Reply-To: <1192033675.7351.3.camel@mihaico.gecadco.local>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
	<20071010135815.GB24010@redhat.com>
	<1192029094.14854.13.camel@mihaico.gecadco.local>
	<20071010151316.GC24010@redhat.com>
	<1192033675.7351.3.camel@mihaico.gecadco.local>
Message-ID: <20071010163005.GD24010@redhat.com>

On Wed, Oct 10, 2007 at 07:27:55PM +0300, cosmih wrote:
> here is what i was do
> 
> 1) compiled the new 2.6.23 
> 2) booted the system with the kernel from above
> 3) copied /usr/src/linux-2.6.23/include/linux/dlm*
> into /usr/include/linux/

You have to use sanitized headers, so either munge them manually, or see
the 'make headers_install' steps in the email I referenced.

Dave


From jos at xos.nl  Wed Oct 10 18:56:27 2007
From: jos at xos.nl (Jos Vos)
Date: Wed, 10 Oct 2007 20:56:27 +0200
Subject: [Linux-cluster] Quorum disk votes not used when starting node?
Message-ID: <200710101856.l9AIuRWA008113@jasmine.xos.nl>

Hi,

On (RHEL4 U5) 3-node test cluster I have defined a quorum disk with a
test heuristic as follows:

    <cman expected_votes="6"/>
    ....
    <quorumd interval="1" label="/dev/sda1" min_score="1" tko="10" votes="3">
        <heuristic interval="2" program="test -f /tmp/qdisk" score="1"/>
    </quorumd>

The idea is (when replacing the heuristic with one or more "real"
heuristics) that when the heuristic score is not high enough, the
node considers itself not sane enough to join the cluster.

Now, this all works fine, cman_tool shows what I expected and when I
remove the file /tmp/qdisk on a node, that node reboots instantaneously.

However, after the reboot, while the file tested in the heuristic does
still not exist, the node is joining the cluster again and starts some
cluster services!

Even more strange, the "cman_tool status" output on that node includes:

    Cluster Member: Yes
    Membership state: Cluster-Member
    Nodes: 3
    Expected_votes: 6
    Total_votes: 3
    Quorum: 4

Why is the node a cluster member with only 3 votes out of 6?
The output of "cman_tool nodes" looks like:

    Node  Votes Exp Sts  Name
       0    3    0   X   /dev/sda1
       1    1    6   M   node1
       2    1    6   M   node2
       3    1    6   M   node3
    Member Status: Quorate

While booting, the logging includes this:

Oct 10 15:36:59 node1 ccsd[4125]: Initial status:: Inquorate
Oct 10 15:37:01 node1 kernel: CMAN: sending membership request
Oct 10 15:37:02 node1 kernel: CMAN: got node node2
Oct 10 15:37:02 node1 kernel: CMAN: got node node3
Oct 10 15:37:02 node1 kernel: CMAN: quorum regained, resuming activity
Oct 10 15:37:02 node1 ccsd[4125]: Cluster is quorate.  Allowing connections.
Oct 10 15:37:02 node1 cman: startup succeeded
Oct 10 15:37:02 node1 qdiskd[4232]: <info> Quorum Daemon Initializing
Oct 10 15:37:02 node1 qdiskd: Starting the Quorum Disk Daemon: succeeded
Oct 10 15:37:04 node1 fenced: startup succeeded
Oct 10 15:37:06 node1 clvmd: Cluster LVM daemon started - connected to CMAN
Oct 10 15:37:06 node1 clvmd: clvmd startup succeeded
Oct 10 15:37:13 node1 qdiskd[4232]: <info> Initial score 0/1
Oct 10 15:37:13 node1 qdiskd[4232]: <info> Initialization complete

I would expect that the node would not join the cluster, because the
heuristic score is 0 and thus the 3 votes are not given.

Can someone explain what's wrong?  My config, the behavior shown above,
or something else?

Thanks for any suggestions.

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From isplist at logicore.net  Wed Oct 10 20:59:55 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 10 Oct 2007 15:59:55 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
Message-ID: <20071010155955.970248@leena>

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


From joseparrella at gmail.com  Wed Oct 10 21:23:11 2007
From: joseparrella at gmail.com (=?ISO-8859-1?Q?Jos=E9_Miguel_Parrella_Romero?=)
Date: Wed, 10 Oct 2007 17:23:11 -0400
Subject: [Linux-cluster] CLVM with two nodes?
In-Reply-To: <f5526f350710081736l259e270et47d98fdff968c90@mail.gmail.com>
References: <f5526f350710081736l259e270et47d98fdff968c90@mail.gmail.com>
Message-ID: <470D42BF.6000405@gmail.com>

El 08/10/07 20:36, Diego Woitasen escribi?:
> The question is simple, exists a limitation to use CLVM (Redhat 5)
> with only two nodes? Some people say that is not recommeded but I
> wan't some experienced opinions.

I'm using CLVM on a two-node cluster based on Itanium2 machines running 
Debian 4.0. I'm not quite sure about what's CLVM function, though. Of 
course I expect that if I modify a VG in any of the nodes, the other 
node will see the changes automagically, but at least if I lvchange 
-a[y|n] a VG, it doesn't happen.

Otherwise, it works fine.

HTH,
Jose


From jos at xos.nl  Wed Oct 10 21:49:25 2007
From: jos at xos.nl (Jos Vos)
Date: Wed, 10 Oct 2007 23:49:25 +0200
Subject: [Linux-cluster] CLVM with two nodes?
In-Reply-To: <470D42BF.6000405@gmail.com>
References: <f5526f350710081736l259e270et47d98fdff968c90@mail.gmail.com>
	<470D42BF.6000405@gmail.com>
Message-ID: <20071010214925.GB8893@jasmine.xos.nl>

On Wed, Oct 10, 2007 at 05:23:11PM -0400, Jos? Miguel Parrella Romero wrote:

> I'm using CLVM on a two-node cluster based on Itanium2 machines running 
> Debian 4.0. I'm not quite sure about what's CLVM function, though. Of 
> course I expect that if I modify a VG in any of the nodes, the other 
> node will see the changes automagically, but at least if I lvchange 
> -a[y|n] a VG, it doesn't happen.

Did you set the locking_type in lvm.conf to 3?

Did you enable the clustered mode of the VG (vgchange -c ... or
set with vgcreate)?

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From mpartio at gmail.com  Thu Oct 11 04:39:31 2007
From: mpartio at gmail.com (Mikko Partio)
Date: Thu, 11 Oct 2007 07:39:31 +0300
Subject: [Linux-cluster] Two-node cluster disconnecting
Message-ID: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>

Hello list

I have a problem with a two-node cluster going split-brain. When I first
boot the other node, it correctly starts all the services and informs that
cluster is quorate. Then when I boot the other node, on the boot phase when
it starts the cluster software it does not find the node already running and
starts the same services already running on node 1! When the boot is
complete I can see that the nodes have found each other for a small period
of time but then immediately disconnect from each other. The cluster is
created with Conga with shared disk support though no shared disks are
created yet. This is on CentOS 5.

cluster.conf:

<?xml version="1.0"?>
<cluster alias="testcluster" config_version="11" name="testcluster">
        <fence_daemon clean_start="0" post_fail_delay="5"
post_join_delay="1200"/>
        <clusternodes>
                <clusternode name="hume" nodeid="1" votes="1">
                        <fence>
                                <method name="2">
                                        <device name="ilohume"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="kant" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="ilokant"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_ilo" hostname="x.x.x.x" login="*"
name="ilohume" passwd="*"/>
                <fencedevice agent="fence_ilo" hostname="x.x.x.x" login="*"
name="ilokant" passwd="*"/>
        </fencedevices>
        <rm>
                <failoverdomains/>
                <resources/>
                <service autostart="1" exclusive="0" name="test"
recovery="relocate">
                        <script file="/etc/init.d/pgtest" name="pg"/>
                </service>
                <service autostart="1" exclusive="0" name="test2">
                        <script file="/etc/init.d/pgtest2" name="pg2"/>
                </service>
        </rm>
</cluster>

clustat & cman_tool status & cman_tool nodes on node already running:

$ sudo clustat
Member Status: Quorate

Member Name                        ID   Status
------ ----                        ---- ------
hume                           1 Online, Local, rgmanager
kant                           2 Offline

Service Name         Owner (Last)                   State
------- ----         ----- ------                   -----
service:test         hume                    started
service:test2        hume                    started


$ sudo cman_tool status
Version: 6.0.1
Config Version: 11
Cluster Name: testcluster
Cluster Id: 31540
Cluster Member: Yes
Cluster Generation: 32
Membership state: Cluster-Member
Nodes: 1
Expected votes: 1
Total votes: 1
Quorum: 1
Active subsystems: 8
Flags: 2node
Ports Bound: 0 11 177
Node name: hume
Node ID: 1
Multicast addresses: 239.192.123.175
Node addresses: 193.166.192.100


$ sudo cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M      4   2007-10-10 14:58:53  hume
   2   X     28                        kant


Here's what gets logged in /var/log/messages

Oct 11 07:20:15 hume openais[2410]: [TOTEM] entering GATHER state from 9.
Oct 11 07:20:15 hume openais[2410]: [TOTEM] Creating commit token because I
am the rep.
Oct 11 07:20:15 hume openais[2410]: [TOTEM] Saving state aru 13bd2 high seq
received 13bd2
Oct 11 07:20:15 hume openais[2410]: [TOTEM] entering COMMIT state.
Oct 11 07:20:15 hume openais[2410]: [TOTEM] entering RECOVERY state.
Oct 11 07:20:15 hume openais[2410]: [TOTEM] position [0] member
193.166.192.100:
Oct 11 07:20:15 hume openais[2410]: [TOTEM] previous ring seq 24 rep
193.166.192.100
Oct 11 07:20:15 hume openais[2410]: [TOTEM] aru 13bd2 high delivered 13bd2
received flag 0
Oct 11 07:20:15 hume openais[2410]: [TOTEM] position [1] member
193.166.192.101:
Oct 11 07:20:15 hume openais[2410]: [TOTEM] previous ring seq 4 rep
193.166.192.101
Oct 11 07:20:15 hume openais[2410]: [TOTEM] aru 27 high delivered 27
received flag 0
Oct 11 07:20:15 hume openais[2410]: [TOTEM] Did not need to originate any
messages in recovery.
Oct 11 07:20:15 hume openais[2410]: [TOTEM] Storing new sequence id for ring
1c
Oct 11 07:20:15 hume kernel: dlm: connecting to 2
Oct 11 07:20:15 hume openais[2410]: [TOTEM] Sending initial ORF token
Oct 11 07:20:15 hume kernel: dlm: got connection from 2
Oct 11 07:20:15 hume openais[2410]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] New Configuration:
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] Members Left:
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] Members Joined:
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] New Configuration:
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] Members Left:
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] Members Joined:
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [TOTEM] entering OPERATIONAL state.
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] got nodejoin message
193.166.192.100
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CLM  ] got nodejoin message
193.166.192.101
Oct 11 07:20:15 hume openais[2410]: [CPG  ] got joinlist message from node 1
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume openais[2410]: [CPG  ] got joinlist message from node 2
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:15 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:15 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: device eth0 entered promiscuous mode
Oct 11 07:20:16 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:16 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:17 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:17 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:17 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:17 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:17 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:17 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:18 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:18 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:18 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:18 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:18 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:18 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:19 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:19 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:19 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:19 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:20 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:20 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:20 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:20 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:21 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:21 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:21 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:21 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:22 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:22 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:22 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:22 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:23 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:23 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:23 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:23 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:24 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:24 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:25 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:25 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:25 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:25 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:25 hume kernel: device eth0 left promiscuous mode
Oct 11 07:20:26 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:26 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:27 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:27 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:27 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:27 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:28 hume kernel: device eth0 entered promiscuous mode
Oct 11 07:20:28 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:28 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:29 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:29 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:29 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:29 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:30 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:30 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:31 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:31 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:32 hume openais[2410]: [TOTEM] The token was lost in the
OPERATIONAL state.
Oct 11 07:20:32 hume openais[2410]: [TOTEM] Receive multicast socket recv
buffer size (262142 bytes).
Oct 11 07:20:32 hume openais[2410]: [TOTEM] Transmit multicast socket send
buffer size (262142 bytes).
Oct 11 07:20:32 hume openais[2410]: [TOTEM] entering GATHER state from 2.
Oct 11 07:20:32 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:32 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:32 hume kernel: device eth0 left promiscuous mode
Oct 11 07:20:33 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:33 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:34 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:34 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:34 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:34 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:35 hume kernel: dlm: lockspace 30002 from 2 type 1 not found
Oct 11 07:20:35 hume kernel: dlm: lockspace 20002 from 2 type 1 not found
Oct 11 07:20:36 hume kernel: dlm: connecting to 2
Oct 11 07:20:36 hume openais[2410]: [TOTEM] entering GATHER state from 0.
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Creating commit token because I
am the rep.
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Saving state aru 1d high seq
received 20
Oct 11 07:20:36 hume openais[2410]: [TOTEM] entering COMMIT state.
Oct 11 07:20:36 hume openais[2410]: [TOTEM] entering RECOVERY state.
Oct 11 07:20:36 hume openais[2410]: [TOTEM] position [0] member
193.166.192.100:
Oct 11 07:20:36 hume openais[2410]: [TOTEM] previous ring seq 28 rep
193.166.192.100
Oct 11 07:20:36 hume openais[2410]: [TOTEM] aru 1d high delivered 1d
received flag 0
Oct 11 07:20:36 hume openais[2410]: [TOTEM] copying all old ring messages
from 1e-20.
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Originated 0 messages in
RECOVERY.
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Originated for recovery:
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Not Originated for recovery: 1e
1f 20
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Storing new sequence id for ring
20
Oct 11 07:20:36 hume kernel: dlm: closing connection to node 2
Oct 11 07:20:36 hume openais[2410]: [TOTEM] Sending initial ORF token
Oct 11 07:20:37 hume openais[2410]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:37 hume openais[2410]: [CLM  ] New Configuration:
Oct 11 07:20:37 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:37 hume openais[2410]: [CLM  ] Members Left:
Oct 11 07:20:37 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:37 hume openais[2410]: [CLM  ] Members Joined:
Oct 11 07:20:37 hume openais[2410]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:37 hume openais[2410]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:37 hume openais[2410]: [CLM  ] New Configuration:
Oct 11 07:20:37 hume openais[2410]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:37 hume openais[2410]: [CLM  ] Members Left:
Oct 11 07:20:37 hume openais[2410]: [CLM  ] Members Joined:
Oct 11 07:20:37 hume openais[2410]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:37 hume openais[2410]: [TOTEM] entering OPERATIONAL state.
Oct 11 07:20:37 hume openais[2410]: [CLM  ] got nodejoin message
193.166.192.100
Oct 11 07:20:37 hume openais[2410]: [CPG  ] got joinlist message from node 1


and on the other node:


Oct 11 07:20:16 kant openais[2411]: [TOTEM] entering GATHER state from 11.
Oct 11 07:20:16 kant openais[2411]: [TOTEM] Saving state aru 27 high seq
received 27
Oct 11 07:20:16 kant openais[2411]: [TOTEM] entering COMMIT state.
Oct 11 07:20:16 kant openais[2411]: [TOTEM] entering RECOVERY state.
Oct 11 07:20:16 kant openais[2411]: [TOTEM] position [0] member
193.166.192.100:
Oct 11 07:20:16 kant openais[2411]: [TOTEM] previous ring seq 24 rep
193.166.192.100
Oct 11 07:20:16 kant openais[2411]: [TOTEM] aru 13bd2 high delivered 13bd2
received flag 0
Oct 11 07:20:16 kant openais[2411]: [TOTEM] position [1] member
193.166.192.101:
Oct 11 07:20:16 kant openais[2411]: [TOTEM] previous ring seq 4 rep
193.166.192.101
Oct 11 07:20:16 kant openais[2411]: [TOTEM] aru 27 high delivered 27
received flag 0
Oct 11 07:20:16 kant openais[2411]: [TOTEM] Did not need to originate any
messages in recovery.
Oct 11 07:20:16 kant openais[2411]: [TOTEM] Storing new sequence id for ring
1c
Oct 11 07:20:16 kant openais[2411]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:16 kant kernel: dlm: connecting to 1
Oct 11 07:20:16 kant openais[2411]: [CLM  ] New Configuration:
Oct 11 07:20:16 kant kernel: dlm: got connection from 1
Oct 11 07:20:16 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] Members Left:
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] Members Joined:
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] New Configuration:
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] Members Left:
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] Members Joined:
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [TOTEM] entering OPERATIONAL state.
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] got nodejoin message
193.166.192.100
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CLM  ] got nodejoin message
193.166.192.101
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CPG  ] got joinlist message from node 1
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant openais[2411]: [CPG  ] got joinlist message from node 2
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:16 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:16 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:17 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:17 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:17 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:17 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:17 kant openais[2411]: [TOTEM] Retransmit List: 1e
Oct 11 07:20:17 kant openais[2411]: [TOTEM] Retransmit List: 1e
Oct 11 07:20:17 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f
Oct 11 07:20:17 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f
Oct 11 07:20:17 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:17 kant last message repeated 29 times
Oct 11 07:20:17 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:17 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:17 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:17 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:17 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:18 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:18 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:18 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:18 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:18 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:18 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:18 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:18 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:18 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:18 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:18 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:19 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:19 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:19 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:19 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:19 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:19 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:20 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:20 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:20 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:20 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:20 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:20 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:20 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:20 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:20 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:21 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:21 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:21 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:21 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:21 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:21 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:22 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:22 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:22 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:22 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:22 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:22 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:22 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:23 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:23 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:23 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:23 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:23 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:23 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:23 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:23 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:23 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:23 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:23 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:24 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:24 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:24 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:24 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:24 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:24 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:24 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:24 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:24 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:24 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:25 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:25 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:25 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:25 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:25 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:25 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:25 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:25 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:26 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:26 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:26 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:26 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:26 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:26 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:26 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:26 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:26 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:26 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:27 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:27 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:27 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:27 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:27 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:27 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:27 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:27 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:28 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:28 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:28 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:28 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:28 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:28 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:28 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:28 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:28 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:28 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:29 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:29 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:29 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:29 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:29 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:29 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:29 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:29 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:30 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:30 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:30 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:30 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:30 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:30 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:30 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:30 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:31 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:31 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:31 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:31 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:31 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:31 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:31 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:31 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:31 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:31 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:32 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:32 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:32 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:32 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:32 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:32 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:32 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:32 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:33 kant openais[2411]: [TOTEM] Retransmit List: 1e 1f 20
Oct 11 07:20:33 kant openais[2411]: [TOTEM] FAILED TO RECEIVE
Oct 11 07:20:33 kant openais[2411]: [TOTEM] entering GATHER state from 6.
Oct 11 07:20:33 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:33 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:34 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:34 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:35 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:35 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:36 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:36 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:37 kant kernel: dlm: lockspace 20001 from 1 type 1 not found
Oct 11 07:20:37 kant kernel: dlm: lockspace 30001 from 1 type 1 not found
Oct 11 07:20:37 kant openais[2411]: [TOTEM] entering GATHER state from 0.
Oct 11 07:20:37 kant openais[2411]: [TOTEM] Creating commit token because I
am the rep.
Oct 11 07:20:37 kant openais[2411]: [TOTEM] Saving state aru 20 high seq
received 20
Oct 11 07:20:37 kant openais[2411]: [TOTEM] entering COMMIT state.
Oct 11 07:20:37 kant openais[2411]: [TOTEM] entering RECOVERY state.
Oct 11 07:20:37 kant openais[2411]: [TOTEM] position [0] member
193.166.192.101:
Oct 11 07:20:37 kant openais[2411]: [TOTEM] previous ring seq 28 rep
193.166.192.100
Oct 11 07:20:37 kant openais[2411]: [TOTEM] aru 20 high delivered 20
received flag 0
Oct 11 07:20:37 kant openais[2411]: [TOTEM] Did not need to originate any
messages in recovery.
Oct 11 07:20:37 kant openais[2411]: [TOTEM] Storing new sequence id for ring
20
Oct 11 07:20:37 kant openais[2411]: [TOTEM] Sending initial ORF token
Oct 11 07:20:37 kant openais[2411]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:37 kant openais[2411]: [CLM  ] New Configuration:
Oct 11 07:20:37 kant kernel: dlm: closing connection to node 1
Oct 11 07:20:37 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:37 kant kernel: dlm: connect from non cluster node
Oct 11 07:20:37 kant openais[2411]: [CLM  ] Members Left:
Oct 11 07:20:37 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:20:38 kant openais[2411]: [CLM  ] Members Joined:
Oct 11 07:20:38 kant openais[2411]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:38 kant openais[2411]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:20:38 kant openais[2411]: [CLM  ] New Configuration:
Oct 11 07:20:38 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:20:38 kant openais[2411]: [CLM  ] Members Left:
Oct 11 07:20:38 kant openais[2411]: [CLM  ] Members Joined:
Oct 11 07:20:38 kant openais[2411]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:20:38 kant openais[2411]: [TOTEM] entering OPERATIONAL state.
Oct 11 07:20:38 kant openais[2411]: [CLM  ] got nodejoin message
193.166.192.101
Oct 11 07:20:38 kant openais[2411]: [CPG  ] got joinlist message from node 2
Oct 11 07:21:31 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:55646
Oct 11 07:21:31 kant snmpd[2664]: Received SNMP packet(s) from UDP: [
193.166.218.61]:55646
Oct 11 07:21:31 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:55646
Oct 11 07:21:31 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:55647
Oct 11 07:21:31 kant snmpd[2664]: Received SNMP packet(s) from UDP: [
193.166.218.61]:55647
Oct 11 07:21:31 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:55647
Oct 11 07:21:31 kant last message repeated 2 times
Oct 11 07:21:31 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:55646
Oct 11 07:21:41 kant ntpd[2696]: synchronized to LOCAL(0), stratum 10
Oct 11 07:21:41 kant ntpd[2696]: kernel time sync enabled 0001
Oct 11 07:22:45 kant ntpd[2696]: synchronized to 193.166.211.70, stratum 2
Oct 11 07:26:35 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:56021
Oct 11 07:26:35 kant snmpd[2664]: Received SNMP packet(s) from UDP: [
193.166.218.61]:56021
Oct 11 07:26:35 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:56021
Oct 11 07:26:35 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:56022
Oct 11 07:26:35 kant snmpd[2664]: Received SNMP packet(s) from UDP: [
193.166.218.61]:56022
Oct 11 07:26:35 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:56022
Oct 11 07:26:35 kant last message repeated 2 times
Oct 11 07:26:35 kant snmpd[2664]: Connection from UDP: [193.166.218.61
]:56021
Oct 11 07:30:20 kant openais[2411]: [TOTEM] entering GATHER state from 11.
Oct 11 07:30:20 kant openais[2411]: [TOTEM] Saving state aru 14 high seq
received 14
Oct 11 07:30:20 kant openais[2411]: [TOTEM] entering COMMIT state.
Oct 11 07:30:20 kant openais[2411]: [TOTEM] entering RECOVERY state.
Oct 11 07:30:20 kant openais[2411]: [TOTEM] position [0] member
193.166.192.100:
Oct 11 07:30:20 kant openais[2411]: [TOTEM] previous ring seq 32 rep
193.166.192.100
Oct 11 07:30:20 kant openais[2411]: [TOTEM] aru 15 high delivered 15
received flag 0
Oct 11 07:30:20 kant openais[2411]: [TOTEM] position [1] member
193.166.192.101:
Oct 11 07:30:20 kant openais[2411]: [TOTEM] previous ring seq 32 rep
193.166.192.101
Oct 11 07:30:20 kant openais[2411]: [TOTEM] aru 14 high delivered 14
received flag 0
Oct 11 07:30:20 kant openais[2411]: [TOTEM] Did not need to originate any
messages in recovery.
Oct 11 07:30:20 kant openais[2411]: [TOTEM] Storing new sequence id for ring
24
Oct 11 07:30:20 kant openais[2411]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:30:20 kant openais[2411]: [CLM  ] New Configuration:
Oct 11 07:30:20 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:30:20 kant openais[2411]: [CLM  ] Members Left:
Oct 11 07:30:20 kant openais[2411]: [CLM  ] Members Joined:
Oct 11 07:30:20 kant openais[2411]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:30:20 kant openais[2411]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 11 07:30:20 kant openais[2411]: [CLM  ] New Configuration:
Oct 11 07:30:20 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:30:20 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.101)
Oct 11 07:30:20 kant openais[2411]: [CLM  ] Members Left:
Oct 11 07:30:20 kant openais[2411]: [CLM  ] Members Joined:
Oct 11 07:30:20 kant openais[2411]: [CLM  ]     r(0) ip(193.166.192.100)
Oct 11 07:30:20 kant openais[2411]: [SYNC ] This node is within the primary
component and will provide service.
Oct 11 07:30:20 kant openais[2411]: [TOTEM] entering OPERATIONAL state.
Oct 11 07:30:20 kant openais[2411]: [MAIN ] Killing node hume because it has
rejoined the cluster without cman_tool join
Oct 11 07:30:20 kant openais[2411]: [CMAN ] cman killed by node 1 for reason
3
Oct 11 07:30:20 kant dlm_controld[2433]: cluster is down, exiting
Oct 11 07:30:20 kant kernel: dlm: closing connection to node 2
Oct 11 07:30:20 kant gfs_controld[2439]: groupd_dispatch error -1 errno 11
Oct 11 07:30:20 kant fenced[2427]: cluster is down, exiting
Oct 11 07:30:20 kant gfs_controld[2439]: groupd connection died
Oct 11 07:30:20 kant gfs_controld[2439]: cluster is down, exiting
Oct 11 07:30:47 kant ccsd[2403]: Unable to connect to cluster infrastructure
after 30 seconds.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071011/9aad3b7f/attachment.htm>

From isplist at logicore.net  Thu Oct 11 04:48:09 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 10 Oct 2007 23:48:09 -0500
Subject: [Linux-cluster] Multiple clusters in one cluster.conf
Message-ID: <2007101023489.508265@leena>

I need to create multiple clusters rather than one large one. Where can I find 
an example of how to do this? 

Do I add the various servers into the one cluster.conf with multiple names or 
do I create multiple cluster.conf files?

PS: Anyone have experience zoning on 2800 Brocades?

Mike


From leemars at gmail.com  Thu Oct 11 11:32:34 2007
From: leemars at gmail.com (Lee MaRS)
Date: Thu, 11 Oct 2007 19:32:34 +0800
Subject: [Linux-cluster] cluster-2.01.00 compilation problem
In-Reply-To: <1192014254.17610.2.camel@mihaico.gecadco.local>
References: <1191944320.2798.17.camel@mihaico.gecadco.local>
	<1192014254.17610.2.camel@mihaico.gecadco.local>
Message-ID: <dea0b920710110432j486c65f0kba72c7367b9c3c1d@mail.gmail.com>

You should specify the --kernel_src and --incdir argument correctly.

I use Archlinux and the kernel src was placed in /usr/src/linux-2.6.23-ARCH/.

I use this command:

./configure --kernel_src=/usr/src/linux-2.6.23-ARCH/
--incdir=/usr/src/linux-2.6.23-ARCH/include

And I can compile the cluster 2.01.00 without such errors.

but I'm shocked that you can compile it in Linux 2.6.22. I just tried
it and failed.

btw: at last, I still failed to complete compilation. I got this:
cc -Wall -I/home/leemars/Desktop/cluster-2.01.00/config -D_GNU_SOURCE
-DSHAREDIR=\"/usr/share/cluster\" -g -Werror -Wstrict-prototypes
-Wshadow -fPIC -I/home/leemars/Desktop/cluster-2.01.00/ccs/lib
-I/home/leemars/Desktop/cluster-2.01.00/cman/lib
-I/home/leemars/Desktop/cluster-2.01.00/dlm/lib `xml2-config --cflags`
-I/usr/src/linux-2.6.23-ARCH/include/ -I../../include
-I/usr/src/linux-2.6.23-ARCH/include/ -c -o rg_thread.o rg_thread.c
cc1: warnings being treated as errors
rg_thread.c: In function 'dump_threads':
rg_thread.c:71: warning: the address of 'resthread_list' will always
evaluate as 'true'
make[3]: *** [rg_thread.o] Error 1

On 10/10/07, cosmih <cosmih at freeland.ro> wrote:
>
>
>
>  hi,
>
> i tryied to install cluster-2.01.00 on gentoo and archlinux linux
> distribution ... but i have
>    the some errors
> i have linux-2.6.22 headers and gcc 4.1.2 on gentoo machines and gcc 4.2.1
> on archlinux machines
> i successfully installed openais-0.80.3 (make && make install DESTDIR=/)
>
> here is the configure command for cluster-2.01.00:
> ./configure --prefix=/usr/local/cluster --libdir=/usr/local/cluster/lib \
> --libexecdir=/usr/local/cluster/libexec
> --mandir=/usr/local/cluster/man \
> --sharedir=/usr/local/cluster/share
> --incdir=/usr/local/cluster/include  \
> --openaisincdir=/usr/include
> --openaislibdir=/usr/lib/openais \
> --ncursesincdir=/usr/include --ncurseslibdir=/usr/lib \
> --readlineincdir=/usr/include/readline
> --readlinelibdir=/usr/lib \
> --nssincdir=/usr/include/nss --nsslibdir=/usr/lib/nss  \
> --nsprincdir=/usr/include/nspr --nsprlibdir=/usr/lib/nspr \
> --cflags="-O2 -march=i686 -mtune=i686 -fomit-frame-pointer"
>
>
> here is the errors from make fence command:
> ------------------------
> make[1]: Entering directory `/root/cluster-2.01.00/dlm'
> set -e && \
>         for i in lib tool; do \
>                 make -C $i all; \
>         done
> make[2]: Entering directory `/root/cluster-2.01.00/dlm/lib'
> gcc -O2 -march=i686 -mtune=i686 -fomit-frame-pointer
> -I/root/cluster-2.01.00/config -g -O2  -I. -I/usr/local/cluster/include
> -D_REENTRANT -c -o libdlm.o libdlm.c
> libdlm.c: In function 'set_version_v5':
> libdlm.c:324: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:325: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:326: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'set_version_v6':
> libdlm.c:335: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:336: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:337: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'detect_kernel_version':
> libdlm.c:443: error: storage size of 'v' isn't known
> libdlm.c:446: error: invalid application of 'sizeof' to incomplete type
> 'struct dlm_device_version'
> libdlm.c:448: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:449: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:450: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:452: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:453: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:454: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'do_dlm_dispatch':
> libdlm.c:590: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'ls_lock_v6':
> libdlm.c:835: error: 'struct dlm_lock_params' has no member named 'xid'
> libdlm.c:837: error: 'struct dlm_lock_params' has no member named 'timeout'
> libdlm.c: In function 'ls_lock':
> libdlm.c:892: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'dlm_ls_lockx':
> libdlm.c:916: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'dlm_ls_unlock':
> libdlm.c:1067: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'dlm_ls_deadlock_cancel':
> libdlm.c:1099: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:1115: error: 'DLM_USER_DEADLOCK' undeclared (first use in this
> function)
> libdlm.c:1115: error: (Each undeclared identifier is reported only once
> libdlm.c:1115: error: for each function it appears in.)
> libdlm.c: In function 'dlm_ls_purge':
> libdlm.c:1134: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'create_lockspace':
> libdlm.c:1311: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'release_lockspace':
> libdlm.c:1417: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c: In function 'dlm_kernel_version':
> libdlm.c:1503: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:1504: error: invalid use of undefined type 'struct
> dlm_device_version'
> libdlm.c:1505: error: invalid use of undefined type 'struct
> dlm_device_version'
> make[2]: *** [libdlm.o] Error 1
> make[2]: Leaving directory `/root/cluster-2.01.00/dlm/lib'
> make[1]: *** [all] Error 2
> make[1]: Leaving directory `/root/cluster-2.01.00/dlm'
> make: *** [dlm] Error 2
> -------------------------------
>
> can someone give me some hints ?
>
> regards,
> cosmih
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From christian at zmcconsulting.com  Thu Oct 11 14:40:50 2007
From: christian at zmcconsulting.com (Christian Ullman)
Date: Thu, 11 Oct 2007 08:40:50 -0600
Subject: [Linux-cluster] CentOS5.x86_64 HA Cluster will not fail over to
	remaining node
Message-ID: <008801c80c14$bc736ee0$0902a8c0@corp.verio.net>

Hi-

 
I have a 2 node cluster on CentOS5.x86_64.

 
nodes: ccweb1 & ccweb2, which is an httpd cluster only

NOTE: no shared storages in employed as the content is static and the all
the dynamic data is stored on another database cluster. 

 
When the service "ca1-ccweb", which consists of a floating IP and
/etc/init.d/httpd script (see config file below), running on a ccweb1 node
and is fenced via:

 
fence_ipmilan -P -v -i 10.1.2.13 -l ADMIN -p password -A password -o off

 
"clustat" reports the service is stopped but does not restart it on the
remaining node:

 
<begin>

[root at ccweb2 ~]# clustat

Member Status: Quorate

 
  Member Name                        ID   Status

  ------ ----                        ---- ------

  ccweb1                                1 Offline

  ccweb2                                2 Online, Local, rgmanager

 
  Service Name         Owner (Last)                   State

  ------- ----         ----- ------                   -----

  service:ca1-ccweb    ccweb1                         stopping

<end>

 
After running:

 
clusvcadm -d ca1-ccweb -m ca1-ccweb1.daz3d.com

 
clustat reports the same as above.  In order to clear this out, I have to
restart rgmanager.

 
Any thoughts?

 
Christian

 
<begin cluster.conf>

<?xml version="1.0"?>

<cluster config_version="12" name="ccweb-clu">

        <fence_daemon post_fail_delay="0" post_join_delay="3"/>

        <clusternodes>

                <clusternode name="ccweb1 " nodeid="1" votes="1">

                        <fence>

                                <method name="1">

                                        <device lanplus="1"
name="ccweb1-fence"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="ccweb2 " nodeid="2" votes="1">

                        <fence>

                                <method name="1">

                                        <device lanplus="1"
name="ccweb2-fence"/>

                                </method>

                        </fence>

                </clusternode>

        </clusternodes>

        <cman expected_votes="1" two_node="1"/>

        <fencedevices>

                <fencedevice agent="fence_ipmilan" auth="password"
ipaddr="10.1.2.13" login="ADMIN" name="ccweb1-fence" passwd="password"/>

                <fencedevice agent="fence_ipmilan" auth="password"
ipaddr="10.1.2.14" login="ADMIN" name="ccweb2-fence" passwd="password"/>

        </fencedevices>

        <rm>

                <failoverdomains>

                        <failoverdomain name="ca1-ccweb-fail-dom"
ordered="0" restricted="0">

                                <failoverdomainnode name="ccweb2"
priority="1"/>

                                <failoverdomainnode name="ccweb1"
priority="1"/>

                        </failoverdomain>

                </failoverdomains>

                <resources/>

                <service autostart="1" domain="ca1-ccweb-fail-dom"
name="ca1-ccweb">

                        <ip address="38.103.62.30" monitor_link="1">

                                <script file="/etc/init.d/httpd"
name="httpd-init"/>

                        </ip>

                </service>

        </rm>

</cluster>

<end cluster.conf>

 
"Those who would give up security for shorts, deserve neither" - Benament
Frankenhoffer

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071011/adf9e756/attachment.htm>

From hlawatschek at atix.de  Thu Oct 11 15:25:58 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Thu, 11 Oct 2007 17:25:58 +0200
Subject: [Linux-cluster] RHEL5 sharedroot clusters
Message-ID: <200710111725.59227.hlawatschek@atix.de>

Hi all!

We have just released the preview channel for the open-sharedroot software for 
RHEL5.
With the open-sharedroot software package, you can build up a NFS or GFS based 
diskless sharedroot cluster with RHEL4 and RHEL5. It also contains a toolset  
to clone or backup entire diskless sharedroot clusters.

Open Sharedroot homepage:
http://open-sharedroot.org/

Description of the software channel:
http://open-sharedroot.org/faq/can-i-use-yum-or-up2date-to-install-the-software

Mini HowTo for RHEL5 sharedroot clusters:
http://open-sharedroot.org/documentation/rhel5-gfs-shared-root-mini-howto/view

Feel free to download the software and have a lot of fun installing your 
sharedroot clusters !!

Mark

-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From kadlec at sunserv.kfki.hu  Thu Oct 11 16:40:06 2007
From: kadlec at sunserv.kfki.hu (Kadlecsik Jozsi)
Date: Thu, 11 Oct 2007 18:40:06 +0200 (MEST)
Subject: [Linux-cluster] IPv6 enabled: out of memory!
In-Reply-To: <Pine.GSO.4.64.0710101214580.25388@sunserv.kfki.hu>
References: <Pine.GSO.4.64.0710101214580.25388@sunserv.kfki.hu>
Message-ID: <Pine.GSO.4.64.0710111837150.12106@sunserv.kfki.hu>

On Wed, 10 Oct 2007, Kadlecsik Jozsi wrote:

> Something really strange happened here: adding one global IPv6 address to 
> an ("external") interface of one node caused "out of memory" on *all* of 
> our nodes.
> 
> All nodes have got two interfaces, one dedicated to GFS and the other is 
> for servicing the users. We configured IPv4 addresses for GFS, otherwise 
> all interfaces have got link local IPv6 addresses. Now we wanted to 
> introduce services over IPv6 as well. Adding an IPv6 address to one 
> interface and rebooting the machine for a clean start caused the disaster.

Forcing ccsd to use IPv4 solved it: now all of the nodes have got IPv6
addresses and GFS is still alive, using IPv4.

Best regards,
Jozsef
--
E-mail : kadlec at sunserv.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From gordan at bobich.net  Thu Oct 11 17:51:10 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 11 Oct 2007 13:51:10 -0400 (EDT)
Subject: [Linux-cluster] GFS with SSI (shared root)
In-Reply-To: <Pine.GSO.4.64.0710111837150.12106@sunserv.kfki.hu>
References: <Pine.GSO.4.64.0710101214580.25388@sunserv.kfki.hu>
	<Pine.GSO.4.64.0710111837150.12106@sunserv.kfki.hu>
Message-ID: <Pine.LNX.4.64.0710111348060.28997@outpost.shatteredsilicon.net>

Hi,

I managed to get this to work (only got the first node up and running), 
but I'm seeing errors like these come up:

GFS: fsid=mailstore:root.0: warning: assertion 
"gfs_glock_is_locked_by_me(ip->i_gl)" failed
GFS: fsid=mailstore:root.0:   function = gfs_readpage
GFS: fsid=mailstore:root.0:   file = 
/builddir/build/BUILD/gfs-kmod-0.1.16/_kmod_build_/src/gfs/ops_address.c, 
line = 279
GFS: fsid=mailstore:root.0:   time = 1192122289

GFS: fsid=mailstore:root.0: warning: assertion 
"gfs_glock_is_locked_by_me(ip->i_gl)" failed
GFS: fsid=mailstore:root.0:   function = gfs_readpage
GFS: fsid=mailstore:root.0:   file = 
/builddir/build/BUILD/gfs-kmod-0.1.16/_kmod_build_/src/gfs/ops_address.c, 
line = 279
GFS: fsid=mailstore:root.0:   time = 1192122304

What does this mean, and is it dangerous?

I also noticed that my /proc/mounts contains the GFS mounted volume twice:
/dev/sdb2 / gfs rw,hostdata=jid=0:id=196611:first=1 0 0
/dev/sdb2 /cdsl.local gfs rw,hostdata=jid=0:id=196611:first=1 0 0
Is this a problem?

Are these two issues related?

I am using GFS1 (rather than GFS2) on RHEL5.

Gordan


From hlawatschek at atix.de  Thu Oct 11 18:07:04 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Thu, 11 Oct 2007 20:07:04 +0200
Subject: [Linux-cluster] GFS with SSI (shared root)
In-Reply-To: <Pine.LNX.4.64.0710111348060.28997@outpost.shatteredsilicon.net>
References: <Pine.GSO.4.64.0710101214580.25388@sunserv.kfki.hu>
	<Pine.GSO.4.64.0710111837150.12106@sunserv.kfki.hu>
	<Pine.LNX.4.64.0710111348060.28997@outpost.shatteredsilicon.net>
Message-ID: <200710112007.05003.hlawatschek@atix.de>

> I managed to get this to work (only got the first node up and running),
> but I'm seeing errors like these come up:
>
> GFS: fsid=mailstore:root.0: warning: assertion
> "gfs_glock_is_locked_by_me(ip->i_gl)" failed
> GFS: fsid=mailstore:root.0:   function = gfs_readpage
> GFS: fsid=mailstore:root.0:   file =
> /builddir/build/BUILD/gfs-kmod-0.1.16/_kmod_build_/src/gfs/ops_address.c,
> line = 279
> GFS: fsid=mailstore:root.0:   time = 1192122289
>
> GFS: fsid=mailstore:root.0: warning: assertion
> "gfs_glock_is_locked_by_me(ip->i_gl)" failed
> GFS: fsid=mailstore:root.0:   function = gfs_readpage
> GFS: fsid=mailstore:root.0:   file =
> /builddir/build/BUILD/gfs-kmod-0.1.16/_kmod_build_/src/gfs/ops_address.c,
> line = 279
> GFS: fsid=mailstore:root.0:   time = 1192122304
>
> What does this mean, and is it dangerous?
I can't say anything about the GFS warnings
>
> I also noticed that my /proc/mounts contains the GFS mounted volume twice:
> /dev/sdb2 / gfs rw,hostdata=jid=0:id=196611:first=1 0 0
> /dev/sdb2 /cdsl.local gfs rw,hostdata=jid=0:id=196611:first=1 0 0
> Is this a problem?
No.
This is because /cdsl.local is --bind mounted from /cluster/cdsl/<nodeid> to 
represent the hostdependent files. 

Mark

-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From jgray at nicusa.com  Thu Oct 11 18:13:56 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 11 Oct 2007 12:13:56 -0600
Subject: [Linux-cluster] Options with managed NFS service
Message-ID: <C333C404.10BF76%jgray@nicusa.com>

Reading over the NFS Cookbook it describes Managed NFS service as not being
as customizable.    That said, is there any way to force NFS v3 (or disable
NFS v4)?    

Also, is it possible to specify more than one IP or IP block in the nfs
client target statement?  I can't seem to get the syntax correct if so.

Works fine:
<nfsclient name="export_home" options="no_root_squash,rw"
path="/export/home" target="10.0.13.0/24"/>

But I'd like to be able to specify more like this instead of adding
additional lines:

<nfsclient name="export_home" options="no_root_squash,rw"
path="/export/home" target="10.0.13.0/24, 10.0.14.0/24, 10.0.15.0/24"/>


-- 
Josh Gray
Systems Administrator
Email: jgray at nicusa.com


"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From gordan at bobich.net  Thu Oct 11 18:23:31 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 11 Oct 2007 14:23:31 -0400 (EDT)
Subject: [Linux-cluster] GFS with SSI (shared root)
In-Reply-To: <200710112007.05003.hlawatschek@atix.de>
References: <Pine.GSO.4.64.0710101214580.25388@sunserv.kfki.hu>
	<Pine.GSO.4.64.0710111837150.12106@sunserv.kfki.hu>
	<Pine.LNX.4.64.0710111348060.28997@outpost.shatteredsilicon.net>
	<200710112007.05003.hlawatschek@atix.de>
Message-ID: <Pine.LNX.4.64.0710111422540.28997@outpost.shatteredsilicon.net>

On Thu, 11 Oct 2007, Mark Hlawatschek wrote:

>> I also noticed that my /proc/mounts contains the GFS mounted volume twice:
>> /dev/sdb2 / gfs rw,hostdata=jid=0:id=196611:first=1 0 0
>> /dev/sdb2 /cdsl.local gfs rw,hostdata=jid=0:id=196611:first=1 0 0
>> Is this a problem?
>
> No.
> This is because /cdsl.local is --bind mounted from /cluster/cdsl/<nodeid> to
> represent the hostdependent files.

Ah, OK. I'll stop worrying about that bit, then. :-)

Gordan


From gordan at bobich.net  Thu Oct 11 18:31:12 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 11 Oct 2007 14:31:12 -0400 (EDT)
Subject: [Linux-cluster] Options with managed NFS service
In-Reply-To: <C333C404.10BF76%jgray@nicusa.com>
References: <C333C404.10BF76%jgray@nicusa.com>
Message-ID: <Pine.LNX.4.64.0710111426590.28997@outpost.shatteredsilicon.net>

On Thu, 11 Oct 2007, Josh Gray wrote:

> Reading over the NFS Cookbook it describes Managed NFS service as not being
> as customizable.    That said, is there any way to force NFS v3 (or disable
> NFS v4)?

Check /etc/sysconfig/nfs

Also, you can mount with option nfsvers=3

> Also, is it possible to specify more than one IP or IP block in the nfs
> client target statement?  I can't seem to get the syntax correct if so.

I don't think so. You'd need something like Linux HA IPVS with floating 
IPs to handle failover. At least that's how I've always done it.

> Works fine:
> <nfsclient name="export_home" options="no_root_squash,rw"
> path="/export/home" target="10.0.13.0/24"/>
>
> But I'd like to be able to specify more like this instead of adding
> additional lines:
>
> <nfsclient name="export_home" options="no_root_squash,rw"
> path="/export/home" target="10.0.13.0/24, 10.0.14.0/24, 10.0.15.0/24"/>

I don't think NFS can handle that.

Gordan


From jgray at nicusa.com  Thu Oct 11 18:41:25 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 11 Oct 2007 12:41:25 -0600
Subject: [Linux-cluster] Options with managed NFS service
In-Reply-To: <Pine.LNX.4.64.0710111426590.28997@outpost.shatteredsilicon.net>
Message-ID: <C333CA75.10BF8A%jgray@nicusa.com>

Thanks for the reply, Gordon.   Problem is I have so many servers with NFS
mounted directories that'll be quite the pain to update each fstab or
vfstab.

That file was empty on my systems.   Googling helped show me the following
is what one site suggested but I don't think the nfsd that started up in the
cluster read this, does only init.d/nfs read it?

RPCNFSDARGS='--no-nfs-version 4'

On the second question - statements like that work fine in the exports file,
but I'm just trying to figure out how to do the same within the cluster
service.


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From gordan at bobich.net  Thu Oct 11 18:44:59 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Thu, 11 Oct 2007 14:44:59 -0400 (EDT)
Subject: [Linux-cluster] Options with managed NFS service
In-Reply-To: <C333CA75.10BF8A%jgray@nicusa.com>
References: <C333CA75.10BF8A%jgray@nicusa.com>
Message-ID: <Pine.LNX.4.64.0710111443290.28997@outpost.shatteredsilicon.net>

On Thu, 11 Oct 2007, Josh Gray wrote:

> Thanks for the reply, Gordon.   Problem is I have so many servers with NFS
> mounted directories that'll be quite the pain to update each fstab or
> vfstab.
>
> That file was empty on my systems.   Googling helped show me the following
> is what one site suggested but I don't think the nfsd that started up in the
> cluster read this, does only init.d/nfs read it?

Yes, I think that's a server option. So if you explicitly switch off the 
NFS1,2,4 stuff, it will HAVE TO use NFS3 - there'll be nothing else 
running. :-)

> RPCNFSDARGS='--no-nfs-version 4'
>
> On the second question - statements like that work fine in the exports file,
> but I'm just trying to figure out how to do the same within the cluster
> service.

That has mount options as well. You used some in your example. So just add 
nfsvers=3 to the options in the cluster file.

Gordan


From isplist at logicore.net  Thu Oct 11 21:52:53 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 11 Oct 2007 16:52:53 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
Message-ID: <20071011165253.156878@leena>

Am I still on this list or what? Can anyone see my posting???

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


From christian at zmcconsulting.com  Thu Oct 11 22:02:42 2007
From: christian at zmcconsulting.com (Christian Ullman)
Date: Thu, 11 Oct 2007 16:02:42 -0600
Subject: [Linux-cluster] What comes first cman or qdiskd
Message-ID: <00de01c80c52$7561bb90$0902a8c0@corp.verio.net>

Hi

 
Planning on implementing a quorum partition and would like to know which
should start first cman or qdiskd?

 
Thanks

 
Christian

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071011/5ef8ef44/attachment.htm>

From jos at xos.nl  Thu Oct 11 22:09:09 2007
From: jos at xos.nl (Jos Vos)
Date: Fri, 12 Oct 2007 00:09:09 +0200
Subject: [Linux-cluster] What comes first cman or qdiskd
In-Reply-To: <00de01c80c52$7561bb90$0902a8c0@corp.verio.net>
References: <00de01c80c52$7561bb90$0902a8c0@corp.verio.net>
Message-ID: <20071011220909.GD19722@jasmine.xos.nl>

On Thu, Oct 11, 2007 at 04:02:42PM -0600, Christian Ullman wrote:

> Planning on implementing a quorum partition and would like to know which
> should start first cman or qdiskd?

cman

/etc/rc.d/init.d/cman:# chkconfig: - 21 79
/etc/rc.d/init.d/qdiskd:# chkconfig: - 22 78

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From christopher.barry at qlogic.com  Fri Oct 12 01:50:33 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Thu, 11 Oct 2007 20:50:33 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
References: <20071011165253.156878@leena>
Message-ID: <D158540CCC0AB54C8FD4818F823CCB241906B4@EPEXCH1.qlogic.org>

-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of isplist at logicore.net
Sent: Thu 10/11/2007 5:52 PM
To: linux-cluster
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
 
Am I still on this list or what? Can anyone see my posting???

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.
________________________________________________________________________

Of course we do. In fact, I thought this was some kind of weird email looping anomly. 
Apparently it's not, because the top has changed now... 

I think the problem everyone is having with your question is that: 

1.) everyone here knows that this - is not / cannot be - a linux-cluster issue.
2.) you are providing absolutely no clues to entice us to suspend our disbelief.

You may want to RTFineM as well as understand essential posting etiquitte.

Might I suggest in that vein you peruse:
http://www.reedmedia.net/misc/mail/using-mailing-list.html

It's a page turner.

-C


All that said, what do you think the problem could be?

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071011/ab873d1b/attachment.htm>

From isplist at logicore.net  Fri Oct 12 01:58:26 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 11 Oct 2007 20:58:26 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
In-Reply-To: <D158540CCC0AB54C8FD4818F823CCB241906B4@EPEXCH1.qlogic.org>
Message-ID: <20071011205826.241421@leena>

Gee, "yes, we're getting your messages" would have been just as welcome a 
reply :). 

Glad to see I'm still alive here, guess it's changed a lot, I don't see the 
old names anymore.

Mike


> Here's a very weird one. I have a cluster of web servers outgoing over a T1.
> When the T1 went down this morning, the cluster, which is all internal, non
> routable IP's, would not come back. All of the machines locked up around the
> loading DLM section on bootup.
> 
> Once the T1 came back, they all booted just fine and went into cluster mode.
> 
> What in the world would cause that? There aren't any external services
> required to fire up my local cluster, never were, it's always been fine
> before.
> ________________________________________________________________________
> 
> Of course we do. In fact, I thought this was some kind of weird email
> looping anomly.
> Apparently it's not, because the top has changed now...
> 
> I think the problem everyone is having with your question is that:
> 
> 1.) everyone here knows that this - is not / cannot be - a linux-cluster
> issue.
> 2.) you are providing absolutely no clues to entice us to suspend our
> disbelief.
> 
> You may want to RTFineM as well as understand essential posting etiquitte.
> 
> Might I suggest in that vein you peruse:
> http://www.reedmedia.net/misc/mail/using-mailing-list.html
> 
> It's a page turner.
> 
> -C
> 
> 
> All that said, what do you think the problem could be?


From christopher.barry at qlogic.com  Fri Oct 12 02:06:16 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Thu, 11 Oct 2007 21:06:16 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
References: <20071011205826.241421@leena>
Message-ID: <D158540CCC0AB54C8FD4818F823CCB241906B5@EPEXCH1.qlogic.org>

-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of isplist at logicore.net
Sent: Thu 10/11/2007 9:58 PM
To: linux-cluster
Subject: RE: [Linux-cluster] Cluster won't come up when T1 is down???
 
Gee, "yes, we're getting your messages" would have been just as welcome a 
reply :). 

Glad to see I'm still alive here, guess it's changed a lot, I don't see the 
old names anymore.

Mike


> Here's a very weird one. I have a cluster of web servers outgoing over a T1.
> When the T1 went down this morning, the cluster, which is all internal, non
> routable IP's, would not come back. All of the machines locked up around the
> loading DLM section on bootup.
> 
> Once the T1 came back, they all booted just fine and went into cluster mode.
> 
> What in the world would cause that? There aren't any external services
> required to fire up my local cluster, never were, it's always been fine
> before.
> ________________________________________________________________________
> 
> Of course we do. In fact, I thought this was some kind of weird email
> looping anomly.
> Apparently it's not, because the top has changed now...
> 
> I think the problem everyone is having with your question is that:
> 
> 1.) everyone here knows that this - is not / cannot be - a linux-cluster
> issue.
> 2.) you are providing absolutely no clues to entice us to suspend our
> disbelief.
> 
> You may want to RTFineM as well as understand essential posting etiquitte.
> 
> Might I suggest in that vein you peruse:
> http://www.reedmedia.net/misc/mail/using-mailing-list.html
> 
> It's a page turner.
> 
> -C
> 
> 
> All that said, what do you think the problem could be?


...and I can see you did not avail yourself to the link I sent, or you would not be top posting.

It's important to understand that no one here - or on mailing lists in general - owes you anything. You must do the bulk of the work.

I know nothing about your setup, but I can make guesses immediately; It's quite possible you're using a hosted dns, and it's likely a name resolution problem.


-C


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071011/dcd662c3/attachment.htm>

From isplist at logicore.net  Fri Oct 12 02:12:16 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 11 Oct 2007 21:12:16 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
In-Reply-To: <D158540CCC0AB54C8FD4818F823CCB241906B5@EPEXCH1.qlogic.org>
Message-ID: <20071011211216.580154@leena>

Great, list cop. Can't you just stop replying then? I've been on this list and 
others for MANY years. There's also such a thing as relaxing and not being so 
picky. 

You're creating a thread that no one cares about. If you don't like my 
posting, just don't read them.

 From qlogic no less.


On Thu, 11 Oct 2007 21:06:16 -0500, Christopher Barry wrote:
> -----Original Message-----
> 
> From: linux-cluster-bounces at redhat.com on behalf of isplist at logicore.net
> Sent: Thu 10/11/2007 9:58 PM
> To: linux-cluster
> Subject: RE: [Linux-cluster] Cluster won't come up when T1 is down???
> 
> Gee, "yes, we're getting your messages" would have been just as welcome a
> reply :).
> 
> Glad to see I'm still alive here, guess it's changed a lot, I don't see the
> old names anymore.
> 
> Mike
> 
> 
>> Here's a very weird one. I have a cluster of web servers outgoing over a
>> T1.
>> When the T1 went down this morning, the cluster, which is all internal,
>> non
>> routable IP's, would not come back. All of the machines locked up around
>> the
>> loading DLM section on bootup.
>> 
>> Once the T1 came back, they all booted just fine and went into cluster
>> mode.
>> 
>> What in the world would cause that? There aren't any external services
>> required to fire up my local cluster, never were, it's always been fine
>> before.
>> ________________________________________________________________________
>> 
>> Of course we do. In fact, I thought this was some kind of weird email
>> looping anomly.
>> Apparently it's not, because the top has changed now...
>> 
>> I think the problem everyone is having with your question is that:
>> 
>> 1.) everyone here knows that this - is not / cannot be - a linux-cluster
>> issue.
>> 2.) you are providing absolutely no clues to entice us to suspend our
>> disbelief.
>> 
>> You may want to RTFineM as well as understand essential posting etiquitte.
>> 
>> Might I suggest in that vein you peruse:
>> http://www.reedmedia.net/misc/mail/using-mailing-list.html
>> 
>> It's a page turner.
>> 
>> -C
>> 
>> 
>> All that said, what do you think the problem could be?
> 
> 
> ...and I can see you did not avail yourself to the link I sent, or you
> would not be top posting.
> 
> It's important to understand that no one here - or on mailing lists in
> general - owes you anything. You must do the bulk of the work.
> 
> I know nothing about your setup, but I can make guesses immediately; It's
> quite possible you're using a hosted dns, and it's likely a name resolution
> problem.
> 
> 
> -C


From changerv at gmail.com  Fri Oct 12 02:18:50 2007
From: changerv at gmail.com (Changer Van)
Date: Fri, 12 Oct 2007 10:18:50 +0800
Subject: [Linux-cluster] problem with GNBD device
Message-ID: <9fa3c2e50710111918v345d6fat1d2b0648a733997@mail.gmail.com>

Hi all,

I set up a http HA cluster consist of 3 nodes.
Node 1 is set to gnbd server for fencing.
Node 2 and node 3 are set to http HA.
In case the http service is running on node 3.
Once the network cable of node 3 was unplug,
the service would shift to node 2 properly,
but cman service on node 3 was killed after the catble was plugged in,
and cman's pid file was still there.
The worse thing is cman service can not be started again,
and node 3 can not be shutdown.

OS: RHEL 5 (2.6.18-8.el5)
rpms related:
Cluster_Administration-en-US-5.0.0-5.noarch.rpm
cluster-cim-0.8-27.el5.i386.rpm
cluster-snmp-0.8-27.el5.i386.rpm
modcluster-0.8-27.el5.i386.rpm
rgmanager-2.0.23-1.i386.rpm
system-config-cluster-1.0.50-1.0.noarch.rpm
gnbd-1.1.5-1.el5.i386.rpm
kmod-gnbd-0.1.3-4.2.6.18_8.el5.i686.rpm
kmod-gnbd-PAE-0.1.3-4.2.6.18_8.el5.i686.rpm
kmod-gnbd-xen-0.1.3-4.2.6.18_8.el5.i686.rpm

partial log messages on node 3:
openais[6621]: [CPG  ] got joinlist message from node 1
openais[6621]: [CPG  ] got joinlist message from node 2
openais[6621]: [CMAN ] cman killed by node 3 for reason 2
gnbd_import: ERROR [../../utils/gnbd_utils.c:78] cman_init failed :
Connection refused
gfs_controld[6648]: cman_start_notification error -1 104
dlm_controld[6641]: cluster is down, exiting
fenced[6635]: cluster is down, exiting
fence_node[6645]: agent "fence_gnbd" reports: gnbd_import: ERROR cannot get
node name : Connection refused gnbd_import: ERROR If you are not planning to
use a cluster manager, use -n failed: fence_gnbd, node03
kernel: dlm: closing connection to node 3
fence_node[6645]: Fence of "node03" was unsuccessful
kernel: dlm: closing connection to node 2
kernel: dlm: closing connection to node 1
ccsd[6615]: Unable to connect to cluster infrastructure after 30 seconds.
ccsd[6615]: Unable to connect to cluster infrastructure after 60 seconds.

Any help would be greatly appreciated.

-- 
Regards,
Changer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071012/5e746da5/attachment.htm>

From christopher.barry at qlogic.com  Fri Oct 12 03:04:12 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Thu, 11 Oct 2007 22:04:12 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
References: <20071011211216.580154@leena>
Message-ID: <D158540CCC0AB54C8FD4818F823CCB241906B7@EPEXCH1.qlogic.org>


> From: linux-cluster-bounces at redhat.com on behalf of isplist at logicore.net
> Sent: Thu 10/11/2007 10:12 PM
> To: linux-cluster
> Subject: RE: [Linux-cluster] Cluster won't come up when T1 is down???
 
> Great, list cop. Can't you just stop replying then? I've been on this list and 


WooO00ooooOOOOooo; Pull Over! ;)

isplist, I'm know you're a great guy, and no doubt a tad stressed with your site down, but you have posted your message the exact same way at least 4 times. No one has answered you. I'm at least replying. Isn't the definition of insanity doing the same thing over and over while expecting different results? I'm simply attempting to engage you to help you - selfishly, I'll admit, because to be honest, I'm tired of this email hitting my inbox. However, you are acting as if I - in attempting to help you understand the traditional Internet, *time honored* posting techniques (for which there are very good reasons),  am somehow the bad guy here.

Please, let's please dispense with the vitriol, and solve your problem, ok? Great, then let's get on with it. You can thank me later.

Now, on to your computer problem:

>> Here's a very weird one. I have a cluster of web servers outgoing over a
>> T1.
>> When the T1 went down this morning, the cluster, which is all internal,
>> non
>> routable IP's, would not come back. All of the machines locked up around
>> the
>> loading DLM section on bootup.
>>
>> Once the T1 came back, they all booted just fine and went into cluster
>> mode.
>>
>> What in the world would cause that? There aren't any external services
>> required to fire up my local cluster, never were, it's always been fine
>> before.

Obviously, the external link is the dependency we need to examine. We need to ask what could be the dependency here? My gut feeling is that there is a name resolution problem happening. Where are the names you use for your nodes to find each other stored? I'm guessing in a dns that you cannot access when the T1 drops.

Insure that each node has an /etc/host file that has all node names and IP addresses in it, if you have not done so already. This will ensure that names will be resolved correctly - even if dns is not available. Understanding your network topology would also be helpful.

But keep in mind I am simply guessing here - as you have provided me with few details.

Can you please provide:

* a description of your topology
* config data for all interfaces
* contents of /etc/hosts
* IP address of dns server


Thanks,
-C

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071011/7c388324/attachment.htm>

From isplist at logicore.net  Fri Oct 12 03:26:06 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 11 Oct 2007 22:26:06 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
In-Reply-To: <D158540CCC0AB54C8FD4818F823CCB241906B7@EPEXCH1.qlogic.org>
Message-ID: <2007101122266.879879@leena>

Apologies to list for multiple posts. Just an email server burp. Building a 
new server and the same message ended up queued a few times cept for the last 
one asking if I could be seen.

No, my sites aren't down, no I have no additional information, this was a one 
time weirdness. I thought I'd pick some minds, never know, sometimes you ask 
your associates when something weird happens. Generates new questions and 
sometimes, answers.

So...

> Please, let's please dispense with the vitriol, and solve your problem, ok?
> Great, then let's get on with it. You can thank me later.

Agreed. On the problem, there is no solution because it's been too long. I was 
asking a general question, to see if anyone might have come across anything 
similar, not exactly the same perhaps but somehow related. It's not like this 
weirdness created any logs or anything that I could search out to get anything 
relevant to post so asked in general.
 
> name resolution problem happening. Where are the names you use for your
> nodes to find each other stored? I'm guessing in a dns that you cannot
> access when the T1 drops.

The thing is, it's all internal, behind firewalling. There aren't any 
dependencies on anything external for the cluster to come together. The T1 
being down should have had no effect what so ever yet it prevented the cluster 
from coming back together. 
 
> Insure that each node has an /etc/host file that has all node names and IP
> addresses in it, if you have not done so already. This will ensure that
> names will be resolved correctly - even if dns is not available.
> Understanding your network topology would also be helpful.

Each node does have a hosts file with all of the other nodes on it as well. I 
have internal DNS servers as well. The most frustrating aspect is that there 
isn't anything to look for since it works just fine other than that one time. 
Again, that's why my question was so general. Every now and then, something 
weird happens in a network and you just ask your associates for thoughts, 
ideas, maybe they will think of other questions to ask you which will get you 
thinking. That's what my intent was here too. 

> But keep in mind I am simply guessing here - as you have provided me with
> few details. Can you please provide:

That's because there really isn't much more to it. The cluster works just 
fine, always, except for that one time which didn't make any sense. 
 
> * a description of your topology

All servers are NETed behing firewalling. LVS front end to the services. 
Brocade switches for fencing and fibre channel networking. 

> * config data for all interfaces

Where do I start? 

> * contents of /etc/hosts

Check, each node knows about the next, all work just fine, just that one time.

> * IP address of dns server

Hosts files and internal DNS's.

I know this isn't something we'll find an answer to by breaking down my 
network. It's a one time thing that I thought I'd ask about, to see if anyone 
else had seen anything like that. Everything works perfectly, other than that 
one time. 


---

> results? I'm simply attempting to engage you to help you - selfishly, I'll
> admit, because to be honest, I'm tired of this email hitting my inbox.

It's just email. I think we as humans have to stop finding reasons to be 
pissed off, upset, worked up over such small things. I'm sure you get a lot 
more spam than anything else. That should frustrate you a lot more than my 
email burp.

Hope we're done with the list issues :).

Mike


From Christopher.Barry at qlogic.com  Fri Oct 12 03:55:50 2007
From: Christopher.Barry at qlogic.com (Christopher Barry)
Date: Thu, 11 Oct 2007 23:55:50 -0400
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
In-Reply-To: <2007101122266.879879@leena>
References: <2007101122266.879879@leena>
Message-ID: <1192161350.16362.19.camel@localhost>

On Thu, 2007-10-11 at 22:26 -0500, isplist at logicore.net wrote:
> Apologies to list for multiple posts. Just an email server burp. Building a 
> new server and the same message ended up queued a few times cept for the last 
> one asking if I could be seen.

Once I searched my list archive, I discovered it had been answered way
back when, and I had missed it, and the poster immediately came to the
same conclusion.

http://www.redhat.com/archives/linux-cluster/2007-September/msg00038.html

> 
> No, my sites aren't down, no I have no additional information, this was a one 

Well for me anyway, 

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.0.12) Gecko/20070731
Ubuntu/dapper-security Firefox/1.5.0.12

your site?: http://www.logicore.net

has a large empty white space that I just assumed should have *some*
content in it. I just figured the space was to be filled by a backend
system, that might be inop. Maybe the site looks great in IE, but is not
Linux Firefox friendly?

> time weirdness. I thought I'd pick some minds, never know, sometimes you ask
> your associates when something weird happens. Generates new questions and 
> sometimes, answers.
> 

What makes it weird, is that you do not understand how it works. Once
you understand something, it's not weird anymore. But hey, if you are
happy not knowing, now that it's back working, then cool. Good luck.


-C


From isplist at logicore.net  Fri Oct 12 04:31:49 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 11 Oct 2007 23:31:49 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
In-Reply-To: <1192161350.16362.19.camel@localhost>
Message-ID: <20071011233149.213925@leena>

> back when, and I had missed it, and the poster immediately came to the
> same conclusion.
> http://www.redhat.com/archives/linux-cluster/2007-September/msg00038.html

I missed that one and probably a lot of other things due to a failed server 
this list was being sent to. Got busy and wasn't keeping track.

> your site?: http://www.logicore.net
> 
> has a large empty white space that I just assumed should have *some*
> content in it. I just figured the space was to be filled by a backend

We used to do business under that name and moved on to other things. Just an 
old domain we need to update some day.

> What makes it weird, is that you do not understand how it works. Once
> you understand something, it's not weird anymore. 

What??? I never said I don't care, I said there's little to go by. I was 
looking for thoughts, not a solution to something that can't be fixed. Man, 
how long have you been in development work? Have you never seen stuff that's 
just plain weird, you know isn't something that is likely to happen again, but 
could, so just pick minds to get some thoughts to learn from.

> But hey, if you are
> happy not knowing, now that it's back working, then cool. Good luck.

So... we're back to this again? You get all over me for multiple postings, 
then you tell me it's just me overreacting and now you come back with this 
once we're talking nicely? What are you, 15? At my freaking age, I just don't 
care for this nonsense. Stop writing to me if you're just going to act like 
this. 


From Abdel.Sadek at lsi.com  Fri Oct 12 15:27:03 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Fri, 12 Oct 2007 09:27:03 -0600
Subject: [Linux-cluster] Conga invalid certificate
Message-ID: <C776378855970A4DADE4A476447F6391F1515B@NAMAIL3.ad.lsil.com>

I am getting an invalid certificate error message when trying to open up
a conga web interface from a windows host. How do I get a valid
certificate?

 
Thanks.

Abdel...

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071012/d68e30e2/attachment.htm>

From stuarta at squashedfrog.net  Fri Oct 12 15:52:29 2007
From: stuarta at squashedfrog.net (Stuart Auchterlonie)
Date: Fri, 12 Oct 2007 16:52:29 +0100
Subject: [Linux-cluster] adding virtual servers through piranha not working
In-Reply-To: <393259.48218.qm@web36808.mail.mud.yahoo.com>
References: <393259.48218.qm@web36808.mail.mud.yahoo.com>
Message-ID: <470F983D.80400@squashedfrog.net>


AnuSri Padige wrote:
> Trying to add some vitural servers through piranha is not working. When 
> trying to click on ADD/EDIT/DELETE button, nothing happens. Tried 
> restarting piranha-gui, didn't help. Tried to restart pulse. It shutdown 
> but, not coming back. Got an error:
>  
> Starting pulse: pulse: cannot create heartbeat socket. running as root?
>                                                            [FAILED]
> Pls. advise.
>  

I see this when I attempt to do a restart on pulse.
The problem is that when pulse shuts down it doesn't always clean
up the nanny processess properly. The nanny processess are hanging
onto something that pulse wants, and because it can't get it, it
fails to start with that error. So check for old nanny processess.

Stuart


From jgray at nicusa.com  Fri Oct 12 15:54:46 2007
From: jgray at nicusa.com (Josh Gray)
Date: Fri, 12 Oct 2007 09:54:46 -0600
Subject: [Linux-cluster] Adding quorum to running cluster
Message-ID: <C334F4E6.10C054%jgray@nicusa.com>

I know this has been beat to death on this list but I have not found very
clear steps in the archives or even all the doc's and FAQ linked to.

Many times I've seen people say with "good" heuristics and voting settings
but have not found examples of what numbers you should use, and why.   The
two node example in the FAQ is somewhat helpful, but what about a 3+ node?

This test cluster is 3 nodes,  RHEL 5, 200gb gfs shared disk, 100mb quorum
disk

So you should start out with:
mkqdisk -c /dev/sdc -l nfsquorum
chkconfig qdiskd on

Add this to cluster.conf - is this what you should do for heuristics? What
about the votes/scores?
<quorumd interval="1" tko="10" votes="2"  label="nfsquorum">
 <heuristic program="ping 10.0.0.167 -c1 -t1" score="2" interval="2"/>
 <heuristic program="ping 10.0.0.168 -c1 -t1" score="2" interval="2"/>
 <heuristic program="ping 10.0.0.170 -c1 -t1" score="2" interval="2"/>
</quorumd>


Modify the cman statement,  but what should expected votes be?
<cman two_node="0" expected_votes="5"/>

Then reboot one node at a time?  Does qdiskd need to be started at a certain
run level, I think it's trying to start before cman on this cluster I'm
testing.

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From vipinnight at rediffmail.com  Sun Oct 14 06:09:28 2007
From: vipinnight at rediffmail.com (vipin kumar singh)
Date: 14 Oct 2007 06:09:28 -0000
Subject: [Linux-cluster] How to setup cluster
Message-ID: <20071014060928.29532.qmail@webmail64.rediffmail.com>

Hi,I need to setup cluster for a web application .I have two linux hp servers.Can you plz tell me the way how to setup cluster&nbsp;.how many public and private ip\'s are required.Also the minimum hardware requirement.Thanks and Regardsvipin singh+919873629209
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071014/e2b52cce/attachment.htm>

From bsd_daemon at msn.com  Sun Oct 14 18:36:22 2007
From: bsd_daemon at msn.com (mehmet celik)
Date: Sun, 14 Oct 2007 18:36:22 +0000
Subject: [Linux-cluster] How to setup cluster
In-Reply-To: <20071014060928.29532.qmail@webmail64.rediffmail.com>
References: <20071014060928.29532.qmail@webmail64.rediffmail.com>
Message-ID: <BLU115-W41EE80CA35E3E44EA6558BE3A20@phx.gbl>


Hi, you should read and learn :) Because, this job is complex, too.. I purpose you follow below link.
 
http://www.centos.org/docs/5/html/Cluster_Suite_Overview/
 
Sorry, my english isn't fine.. have a nice day..
 
 
Date: Sun, 14 Oct 2007 06:09:28 +0000To: Linux-cluster at redhat.comFrom: vipinnight at rediffmail.comCC: Subject: [Linux-cluster] How to setup clusterHi,I need to setup cluster for a web application .I have two linux hp servers.Can you plz tell me the way how to setup cluster .how many public and private ip's are required.Also the minimum hardware requirement.Thanks and Regardsvipin singh+919873629209


_________________________________________________________________
Windows Live Hotmail and Microsoft Office Outlook ? together at last. ?Get it now.
http://office.microsoft.com/en-us/outlook/HA102225181033.aspx?pid=CL100626971033
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071014/617b8a38/attachment.htm>

From vipinnight at rediffmail.com  Mon Oct 15 05:20:29 2007
From: vipinnight at rediffmail.com (vipin kumar singh)
Date: 15 Oct 2007 05:20:29 -0000
Subject: [Linux-cluster] How to setup cluster
Message-ID: <1192386922.S.6146.13757.webmail96.rediffmail.com.old.1192425629.23381@webmail.rediffmail.com>

Thanks&nbsp; for the document link. I think it can help me a lot.Regardsvipin singhOn Sun, 14 Oct 2007 18:36:22 +0000 linux clustering wrote.hmmessage P{margin:0px;padding:0px}body.hmmessage{FONT-SIZE: 10pt;FONT-FAMILY:Tahoma}&nbsp;Hi, you should read and learn :) Because, this job is complex, too.. I purpose you follow below link.&nbsp;http://www.centos.org/docs/5/html/Cluster_Suite_Overview/&nbsp;Sorry, my english isn\'t fine.. have a nice day..&nbsp;&nbsp;&nbsp;Date: Sun, 14 Oct 2007 06:09:28 +0000To: Linux-cluster at redhat.comFrom: vipinnight at rediffmail.comCC: Subject: [Linux-cluster] How to setup clusterHi,I need to setup cluster for a web application .I have two linux hp servers.Can you plz tell me the way how to setup cluster&nbsp;.how many public and private ip\'s are required.Also the minimum hardware requirement.Thanks and Regardsvipin singh+919873629209Windows Live Hotmail and Microsoft Office Outlook ? together at last. Get it now!

vipin kumar singh
+919911266103
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071015/47e169c2/attachment.htm>

From jgray at nicusa.com  Mon Oct 15 15:17:04 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 15 Oct 2007 09:17:04 -0600
Subject: [Linux-cluster] FW: Adding quorum to running cluster
In-Reply-To: <C334F4E6.10C054%jgray@nicusa.com>
Message-ID: <C338E090.10C331%jgray@nicusa.com>

Sorry to bump my own post but still having some trouble with this.


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


------ Forwarded Message
From: Josh Gray <jgray at nicusa.com>
Date: Fri, 12 Oct 2007 09:54:46 -0600
To: <linux-cluster at redhat.com>
Conversation: Adding quorum to running cluster
Subject: Adding quorum to running cluster

I know this has been beat to death on this list but I have not found very
clear steps in the archives or even all the doc's and FAQ linked to.

Many times I've seen people say with "good" heuristics and voting settings
but have not found examples of what numbers you should use, and why.   The
two node example in the FAQ is somewhat helpful, but what about a 3+ node?

This test cluster is 3 nodes,  RHEL 5, 200gb gfs shared disk, 100mb quorum
disk

So you should start out with:
mkqdisk -c /dev/sdc -l nfsquorum
chkconfig qdiskd on

Add this to cluster.conf - is this what you should do for heuristics? What
about the votes/scores?
<quorumd interval="1" tko="10" votes="2"  label="nfsquorum">
 <heuristic program="ping 10.0.0.167 -c1 -t1" score="2" interval="2"/>
 <heuristic program="ping 10.0.0.168 -c1 -t1" score="2" interval="2"/>
 <heuristic program="ping 10.0.0.170 -c1 -t1" score="2" interval="2"/>
</quorumd>


Modify the cman statement,  but what should expected votes be?
<cman two_node="0" expected_votes="5"/>

Then reboot one node at a time?  Does qdiskd need to be started at a certain
run level, I think it's trying to start before cman on this cluster I'm
testing.

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


------ End of Forwarded Message


From maciej.bogucki at artegence.com  Mon Oct 15 15:29:44 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Mon, 15 Oct 2007 17:29:44 +0200
Subject: [Linux-cluster] SCSI fencing on RHEL 4.5
In-Reply-To: <C776378855970A4DADE4A476447F6391EB612C@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391EB612C@NAMAIL3.ad.lsil.com>
Message-ID: <47138768.3060408@artegence.com>

Sadek, Abdel napisa?(a):
> I have a 2-node RHEL 4.5 cluster. I am using SCSI fencing which uses
> scsi-3 reservation. The 2 nodes are sharing devices on a FC storage
> array. I am using a single path; there is no multipath driver involved.
> I am also using GFS on top of LVM volumes.
> 
> When I shutdown one of the nodes, the cluster does not send any commands
> to the storage array to clear out the persistent reservations for that
> node. This causes the resources not to fail correctly to the other node
> since it cannot break the reservation done by the dead node. My
> questions are:
> 
> 1. Is that a known issue?
> 
> 2. Is there a way to turn some kind of debugging on the cluster that
> will allow me to watch if any scsi PRIN/PROUT commands are sent to the
> storage devices?

Hello,

1. Check in Your logs if cman run /sbin/fence_scsi
2. If Yes, check why this scritp failed fe. white wrapper script
3. If no, check why cman doesn't do fencing

Best Regards
Maciej Bogucki


From lhh at redhat.com  Mon Oct 15 15:35:03 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:35:03 -0400
Subject: [Linux-cluster] CS4 U4/ Question about heuristic
In-Reply-To: <470B7786.1090005@bull.net>
References: <470B7786.1090005@bull.net>
Message-ID: <1192462503.27135.13.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-10-09 at 14:43 +0200, Alain Moulle wrote:
> Hi
> 
> I wonder if we can use heuristic fonctions without a real quorum disk working ?
> Even if the record quorumd in cluster.conf is mandatory, perhaps using vote=0
> for quorumd, just to have the benefit of heuristics ...
> 
> The goal is to monitor 3 networks in addition to the Heart-beat network
> and to failover if one of this 3 networks does not give response to ping.


Not currently.  You can file a bugzilla and request this, though in 5.1,
the watchdog daemon will be included which would provide the same
functionality.


-- Lon


From lhh at redhat.com  Mon Oct 15 15:37:32 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:37:32 -0400
Subject: [Linux-cluster] Quorum disk votes not used when starting node?
In-Reply-To: <200710101856.l9AIuRWA008113@jasmine.xos.nl>
References: <200710101856.l9AIuRWA008113@jasmine.xos.nl>
Message-ID: <1192462652.27135.16.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-10 at 20:56 +0200, Jos Vos wrote:
> Hi,
> 
> On (RHEL4 U5) 3-node test cluster I have defined a quorum disk with a
> test heuristic as follows:
> 
>     <cman expected_votes="6"/>
>     ....
>     <quorumd interval="1" label="/dev/sda1" min_score="1" tko="10" votes="3">
>         <heuristic interval="2" program="test -f /tmp/qdisk" score="1"/>
>     </quorumd>
> 
> The idea is (when replacing the heuristic with one or more "real"
> heuristics) that when the heuristic score is not high enough, the
> node considers itself not sane enough to join the cluster.
> 
> Now, this all works fine, cman_tool shows what I expected and when I
> remove the file /tmp/qdisk on a node, that node reboots instantaneously.
> 
> However, after the reboot, while the file tested in the heuristic does
> still not exist, the node is joining the cluster again and starts some
> cluster services!

Yup.

Add stop_cman="1" to <quorumd>

-- Lon


From lhh at redhat.com  Mon Oct 15 15:38:19 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:38:19 -0400
Subject: [Linux-cluster] RHEL5 sharedroot clusters
In-Reply-To: <200710111725.59227.hlawatschek@atix.de>
References: <200710111725.59227.hlawatschek@atix.de>
Message-ID: <1192462699.27135.18.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-11 at 17:25 +0200, Mark Hlawatschek wrote:
> Hi all!
> 
> We have just released the preview channel for the open-sharedroot software for 
> RHEL5.
> With the open-sharedroot software package, you can build up a NFS or GFS based 
> diskless sharedroot cluster with RHEL4 and RHEL5. It also contains a toolset  
> to clone or backup entire diskless sharedroot clusters.

Excellent news, Mark!

-- Lon


From lhh at redhat.com  Mon Oct 15 15:39:19 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:39:19 -0400
Subject: [Linux-cluster] What comes first cman or qdiskd
In-Reply-To: <00de01c80c52$7561bb90$0902a8c0@corp.verio.net>
References: <00de01c80c52$7561bb90$0902a8c0@corp.verio.net>
Message-ID: <1192462759.27135.20.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-11 at 16:02 -0600, Christian Ullman wrote:
> Hi
> 
>  
> 
> Planning on implementing a quorum partition and would like to know
> which should start first cman or qdiskd?

With 4.6 or 5.1, it doesn't matter.  Previous versions need cman to be
started first.


-- Lon


From lhh at redhat.com  Mon Oct 15 15:42:13 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:42:13 -0400
Subject: [Linux-cluster] problem with GNBD device
In-Reply-To: <9fa3c2e50710111918v345d6fat1d2b0648a733997@mail.gmail.com>
References: <9fa3c2e50710111918v345d6fat1d2b0648a733997@mail.gmail.com>
Message-ID: <1192462933.27135.24.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-12 at 10:18 +0800, Changer Van wrote:
> Hi all,
> I set up a http HA cluster consist of 3 nodes.
> Node 1 is set to gnbd server for fencing.
> Node 2 and node 3 are set to http HA.
> In case the http service is running on node 3.
> Once the network cable of node 3 was unplug, 
> the service would shift to node 2 properly,
> but cman service on node 3 was killed after the catble was plugged in,
> and cman's pid file was still there. 

After a node is evicted from the cluster, you generally need to reset it
before it can rejoin the cluster.


> partial log messages on node 3:
> openais[6621]: [CPG  ] got joinlist message from node 1 
> openais[6621]: [CPG  ] got joinlist message from node 2 
> openais[6621]: [CMAN ] cman killed by node 3 for reason 2 
> gnbd_import: ERROR [../../utils/gnbd_utils.c:78] cman_init failed :
> Connection refused 
> gfs_controld[6648]: cman_start_notification error -1 104
> dlm_controld[6641]: cluster is down, exiting
> fenced[6635]: cluster is down, exiting 
> fence_node[6645]: agent "fence_gnbd" reports: gnbd_import: ERROR
> cannot get node name : Connection refused gnbd_import: ERROR If you
> are not planning to use a cluster manager, use -n failed: fence_gnbd,
> node03 

This is weird...  CMAN killed itself on node 3?

-- Lon


From lhh at redhat.com  Mon Oct 15 15:49:34 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:49:34 -0400
Subject: [Linux-cluster] CentOS5.x86_64 HA Cluster will not fail over
	to remaining node
In-Reply-To: <008801c80c14$bc736ee0$0902a8c0@corp.verio.net>
References: <008801c80c14$bc736ee0$0902a8c0@corp.verio.net>
Message-ID: <1192463374.27135.30.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-11 at 08:40 -0600, Christian Ullman wrote:

> nodes: ccweb1 & ccweb2, which is an httpd cluster only
> 
> NOTE: no shared storages in employed as the content is static and the
> all the dynamic data is stored on another database cluster. 
> 
>  
> 
> When the service "ca1-ccweb", which consists of a floating IP
> and /etc/init.d/httpd script (see config file below), running on a
> ccweb1 node and is fenced via:
> 
>  
> 
> fence_ipmilan -P -v -i 10.1.2.13 -l ADMIN -p password -A password -o
> off


> "clustat" reports the service is stopped but does not restart it on
> the remaining node:

It's in the 'stopping' state.  It looks like you hit this, but it's been
fixed for some time; I don't see why you'd hit that on any RHEL5
package:

https://bugzilla.redhat.com/show_bug.cgi?id=193255

If the service is in the stopping state, failover should work.  It's
almost like rgmanager's waiting for the node to be fenced (which should
have already happened as you said).

What rgmanager package do you have (rpm -q rgmanager)?

-- Lon


From lhh at redhat.com  Mon Oct 15 15:50:08 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 15 Oct 2007 11:50:08 -0400
Subject: [Linux-cluster] Multiple clusters in one cluster.conf
In-Reply-To: <2007101023489.508265@leena>
References: <2007101023489.508265@leena>
Message-ID: <1192463408.27135.32.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-10 at 23:48 -0500, isplist at logicore.net wrote:
> I need to create multiple clusters rather than one large one. Where can I find 
> an example of how to do this? 
> 
> Do I add the various servers into the one cluster.conf with multiple names or 
> do I create multiple cluster.conf files?

Multiple cluster.confs.  Be sure to make the cluster names different on
each cluster, too. :)

-- Lon


From rmccabe at redhat.com  Mon Oct 15 17:14:28 2007
From: rmccabe at redhat.com (Ryan McCabe)
Date: Mon, 15 Oct 2007 13:14:28 -0400
Subject: [Linux-cluster] Conga invalid certificate
In-Reply-To: <C776378855970A4DADE4A476447F6391F1515B@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391F1515B@NAMAIL3.ad.lsil.com>
Message-ID: <20071015171428.GA1192066@redhat.com>

On Fri, Oct 12, 2007 at 09:27:03AM -0600, Sadek, Abdel wrote:
> I am getting an invalid certificate error message when trying to open up
> a conga web interface from a windows host. How do I get a valid
> certificate?

What's the exact text of the error message you're getting? If it's not
the "Unable to verify the identity of..." dialog, there's nothing we can
do about that presently; you need to use a certificate signed by a trusted
CA to get rid of that one. If the browser is saying the cert is flat-out
invalid and is refusing to proceed, you may have an old certificate saved
(this can happen if you installed luci, deleted it, then installed it
again). You can fix this by deleting the certificate from the browser's
saved certs list. In firefox, you can find the saved certificates at
Preferences -> Advanced ->  Encryption -> View Certificates -> Web Sites.
It is probably listed as being issued by "Common Name."
 

Ryan


From christian at zmcconsulting.com  Mon Oct 15 17:14:35 2007
From: christian at zmcconsulting.com (Christian Ullman)
Date: Mon, 15 Oct 2007 11:14:35 -0600
Subject: [Linux-cluster] CentOS5.x86_64 HA Cluster will not fail over to
	remaining node
In-Reply-To: <20071015160007.DAECD73321@hormel.redhat.com>
References: <20071015160007.DAECD73321@hormel.redhat.com>
Message-ID: <004801c80f4e$dc8664c0$811ba8c0@corp.verio.net>


-----Original Message-----
Message: 10
Date: Mon, 15 Oct 2007 11:49:34 -0400
From: Lon Hohberger <lhh at redhat.com>
Subject: Re: [Linux-cluster] CentOS5.x86_64 HA Cluster will not fail
	over	to remaining node
To: linux clustering <linux-cluster at redhat.com>
Message-ID:
	<1192463374.27135.30.camel at ayanami.boston.devel.redhat.com>
Content-Type: text/plain

On Thu, 2007-10-11 at 08:40 -0600, Christian Ullman wrote:

> nodes: ccweb1 & ccweb2, which is an httpd cluster only
> 
> NOTE: no shared storages in employed as the content is static and the
> all the dynamic data is stored on another database cluster. 
> 
>  
> 
> When the service "ca1-ccweb", which consists of a floating IP
> and /etc/init.d/httpd script (see config file below), running on a
> ccweb1 node and is fenced via:
> 
>  
> 
> fence_ipmilan -P -v -i 10.1.2.13 -l ADMIN -p password -A password -o
> off


> "clustat" reports the service is stopped but does not restart it on
> the remaining node:

It's in the 'stopping' state.  It looks like you hit this, but it's been
fixed for some time; I don't see why you'd hit that on any RHEL5
package:

https://bugzilla.redhat.com/show_bug.cgi?id=193255

If the service is in the stopping state, failover should work.  It's
almost like rgmanager's waiting for the node to be fenced (which should
have already happened as you said).

What rgmanager package do you have (rpm -q rgmanager)?

-- Lon

Thanks for the response.

[root at ca1-ccweb1 ~]# rpm -q rgmanager
rgmanager-2.0.24-1.el5.centos

further:

[root at ca1-ccweb1 ~]# rpm -q cman
cman-2.0.64-1.0.1.el5

Your help is greatly appreciated.

Christian


From Abdel.Sadek at lsi.com  Mon Oct 15 17:30:35 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Mon, 15 Oct 2007 11:30:35 -0600
Subject: [Linux-cluster] Conga invalid certificate
In-Reply-To: <20071015171428.GA1192066@redhat.com>
Message-ID: <C776378855970A4DADE4A476447F6391F15393@NAMAIL3.ad.lsil.com>

Ryan;
Clearing up the firefox old common name certificate solved the issue.
I think that happened because I was trying to manage the cluster through
two different instances of Luci from different servers.

Thanks.
Abdel... 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ryan McCabe
Sent: Monday, October 15, 2007 12:14 PM
To: linux clustering
Subject: Re: [Linux-cluster] Conga invalid certificate

On Fri, Oct 12, 2007 at 09:27:03AM -0600, Sadek, Abdel wrote:
> I am getting an invalid certificate error message when trying to open
up
> a conga web interface from a windows host. How do I get a valid
> certificate?

What's the exact text of the error message you're getting? If it's not
the "Unable to verify the identity of..." dialog, there's nothing we can
do about that presently; you need to use a certificate signed by a
trusted
CA to get rid of that one. If the browser is saying the cert is flat-out
invalid and is refusing to proceed, you may have an old certificate
saved
(this can happen if you installed luci, deleted it, then installed it
again). You can fix this by deleting the certificate from the browser's
saved certs list. In firefox, you can find the saved certificates at
Preferences -> Advanced ->  Encryption -> View Certificates -> Web
Sites.
It is probably listed as being issued by "Common Name."
 

Ryan

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From gordan at bobich.net  Mon Oct 15 17:32:19 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Mon, 15 Oct 2007 13:32:19 -0400 (EDT)
Subject: [Linux-cluster] How to change GFS node timeout?
In-Reply-To: <004801c80f4e$dc8664c0$811ba8c0@corp.verio.net>
References: <20071015160007.DAECD73321@hormel.redhat.com>
	<004801c80f4e$dc8664c0$811ba8c0@corp.verio.net>
Message-ID: <Pine.LNX.4.64.0710151329390.32053@outpost.shatteredsilicon.net>

Hi,

How can I change the timeout after which a node will be deemed to 
have failed? My nodes are connected via a dedicated gigabit ethernet, and 
the chances are that if a node doesn't respond withing a few seconds, or 
10 at the most, that it needs to be dropped so that it doesn't lock out 
the remaining nodes.

TIA.

Gordan


From teigland at redhat.com  Mon Oct 15 17:48:36 2007
From: teigland at redhat.com (David Teigland)
Date: Mon, 15 Oct 2007 12:48:36 -0500
Subject: [Linux-cluster] How to change GFS node timeout?
In-Reply-To: <Pine.LNX.4.64.0710151329390.32053@outpost.shatteredsilicon.net>
References: <20071015160007.DAECD73321@hormel.redhat.com>
	<004801c80f4e$dc8664c0$811ba8c0@corp.verio.net>
	<Pine.LNX.4.64.0710151329390.32053@outpost.shatteredsilicon.net>
Message-ID: <20071015174836.GC10944@redhat.com>

On Mon, Oct 15, 2007 at 01:32:19PM -0400, Gordan Bobic wrote:
> Hi,
> 
> How can I change the timeout after which a node will be deemed to 
> have failed? My nodes are connected via a dedicated gigabit ethernet, and 
> the chances are that if a node doesn't respond withing a few seconds, or 
> 10 at the most, that it needs to be dropped so that it doesn't lock out 
> the remaining nodes.

If you're using cman with openais, then for 5 sec

<totem token="5000"/>

It was recently added to the man page cman(5).

If you're using cman-kernel, then for 5 sec

<cman deadnode_timeout="5"/>

Dave


From Abdel.Sadek at lsi.com  Mon Oct 15 20:38:09 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Mon, 15 Oct 2007 14:38:09 -0600
Subject: [Linux-cluster] SCSI 2 vs SCSI 3 reservation
Message-ID: <C776378855970A4DADE4A476447F6391F15439@NAMAIL3.ad.lsil.com>

Is there a document that clearly explains the difference between SCSI 2
reserve/release and SCSI 3 persistent reservation?

My understanding of the SCSI reservation in a Cluster environment is
that one node Only can reserve a LUN for write access.

In my 2 node RHEL 4.5 cluster I've noticed that both nodes are
registered with my LUNs from my SAN storage array. Only one of them
holds the reservation but both can access it for Read/Writes.

 
Thanks.

Abdel...

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071015/cb626a4c/attachment.htm>

From jgray at nicusa.com  Mon Oct 15 20:42:02 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 15 Oct 2007 14:42:02 -0600
Subject: [Linux-cluster] Vmware ESX Fencing script
Message-ID: <C3392CBA.10C38C%jgray@nicusa.com>

Could someone show me how to pull the fencing script for ESX servers out of
CVS - or just send it along if you have it handy?

(yea I know - not for production,  I'm just testing.. Etc. etc.. :)

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From changerv at gmail.com  Tue Oct 16 02:16:43 2007
From: changerv at gmail.com (Changer Van)
Date: Tue, 16 Oct 2007 10:16:43 +0800
Subject: [Linux-cluster] problem with GNBD device
Message-ID: <9fa3c2e50710151916v2d44b1dwa4e164e19a75dc6e@mail.gmail.com>

Hi,

On 10/16/07, linux-cluster-request at redhat.com <
linux-cluster-request at redhat.com> wrote:
>
> ------------------------------
>
> Message: 9
> Date: Mon, 15 Oct 2007 11:42:13 -0400
> From: Lon Hohberger <lhh at redhat.com>
> Subject: Re: [Linux-cluster] problem with GNBD device
> To: linux clustering <linux-cluster at redhat.com>
> Message-ID:
>        <1192462933.27135.24.camel at ayanami.boston.devel.redhat.com>
> Content-Type: text/plain
>
> On Fri, 2007-10-12 at 10:18 +0800, Changer Van wrote:
> > Hi all,
> > I set up a http HA cluster consist of 3 nodes.
> > Node 1 is set to gnbd server for fencing.
> > Node 2 and node 3 are set to http HA.
> > In case the http service is running on node 3.
> > Once the network cable of node 3 was unplug,
> > the service would shift to node 2 properly,
> > but cman service on node 3 was killed after the catble was plugged in,
> > and cman's pid file was still there.
>
> After a node is evicted from the cluster, you generally need to reset it
> before it can rejoin the cluster.


But I can not restart cman hung by fencing and I have to reboot the system
forcedly.
After the system is rebooted, cman is running and node 3 rejoins the
cluster.

> partial log messages on node 3:
> > openais[6621]: [CPG  ] got joinlist message from node 1
> > openais[6621]: [CPG  ] got joinlist message from node 2
> > openais[6621]: [CMAN ] cman killed by node 3 for reason 2
> > gnbd_import: ERROR [../../utils/gnbd_utils.c:78] cman_init failed :
> > Connection refused
> > gfs_controld[6648]: cman_start_notification error -1 104
> > dlm_controld[6641]: cluster is down, exiting
> > fenced[6635]: cluster is down, exiting
> > fence_node[6645]: agent "fence_gnbd" reports: gnbd_import: ERROR
> > cannot get node name : Connection refused gnbd_import: ERROR If you
> > are not planning to use a cluster manager, use -n failed: fence_gnbd,
> > node03
>
> This is weird...  CMAN killed itself on node 3?


Yes, I think so, cman killed itself for reason 2, What is the reason 2?


-- 
Regards,
Changer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071016/86397360/attachment.htm>

From prisenhoover at daxsolutions.com  Tue Oct 16 04:52:04 2007
From: prisenhoover at daxsolutions.com (Paul Risenhoover)
Date: Mon, 15 Oct 2007 21:52:04 -0700
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,
	clvmd (lock problem)
Message-ID: <47144374.6070806@daxsolutions.com>

Hi All,

I am a noob to this maillist, but I've got some kind of locking problem 
with Linux and clusters, and iSCSI that plagues me.  It's a pretty 
serious issue because every time I reboot my server, it fails to mount 
my primary iSCSI device out of the box, and in order to get it working, 
I have to perform some pretty manual operations to get it operational again.

Here is some configuration information:

Linux flax.xxx.com 2.6.9-55.0.9.ELsmp #1 SMP Thu Sep 27 18:27:41 EDT 
2007 i686 i686 i386 GNU/Linux

[root at flax ~]# clvmd -V
Cluster LVM daemon version: 2.02.21-RHEL4 (2007-04-17)
Protocol version:           0.2.1

dmesg (excerpted)
iscsi-sfnet: Loading iscsi_sfnet version 4:0.1.11-3
iscsi-sfnet: Control device major number 254
iscsi-sfnet:host3: Session established
scsi3 : SFNet iSCSI driver
  Vendor: Promise   Model: VTrak M500i       Rev: 2211
  Type:   Direct-Access                      ANSI SCSI revision: 04
sdh : very big device. try to use READ CAPACITY(16).
SCSI device sdh: 5859373056 512-byte hdwr sectors (2999999 MB)
SCSI device sdh: drive cache: write back
sdh : very big device. try to use READ CAPACITY(16).
SCSI device sdh: 5859373056 512-byte hdwr sectors (2999999 MB)
SCSI device sdh: drive cache: write back
 sdh: unknown partition table

[root at flax ~]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  flax                                     Online, Local, rgmanager

YES, THIS IS A ONE-NODE CLUSTER (Which, I suspect, might be the problem)

SYMPTOM:

When the server comes up, the clustered logical volume that is on the 
iSCSI device is labeled "inactive" when I do an "lvscan:"
[root at flax ~]# lvscan
  inactive            '/dev/nasvg_00/lvol0' [5.46 TB] inherit
  ACTIVE            '/dev/lgevg_00/lvol0' [3.55 TB] inherit
  ACTIVE            '/dev/noraidvg_01/lvol0' [546.92 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol00' [134.47 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol01' [1.94 GB] inherit
 
The thing that's interesting is the lgevg_00 and the noraidvg_01 volumes 
are also clustered, but they are direct-attached SCSI (ie, not ISCSI).

The volume group that the logical volume is a member of shows clean:
[root at flax ~]# vgscan
  Reading all physical volumes.  This may take a while...
  Found volume group "nasvg_00" using metadata type lvm2
  Found volume group "lgevg_00" using metadata type lvm2
  Found volume group "noraidvg_01" using metadata type lvm2

So, in order to fix this, I execute the following:

[root at flax ~]# lvchange -a y /dev/nasvg_00/lvol0
Error locking on node flax: Volume group for uuid not found: 
oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS

This also shows up in my syslog, as such:
Oct 13 11:27:40 flax vgchange:   Error locking on node flax: Volume 
group for uuid not found: 
oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS

RESOLUTION:

It took me a very long time to figure this out, but since it happens to 
me every time I reboot my server, somebody's bound to run into this 
again sometime soon (and it will probably be me).

Here's how I resolved it:

I edited the /etc/lvm/lvm.conf file as such:

was:
    # Type of locking to use. Defaults to local file-based locking (1).
    # Turn locking off by setting to 0 (dangerous: risks metadata corruption
    # if LVM2 commands get run concurrently).
    # Type 2 uses the external shared library locking_library.
    # Type 3 uses built-in clustered locking.
    #locking_type = 1
    locking_type = 3

changed to:

(snip)
    # Type 3 uses built-in clustered locking.
    #locking_type = 1
    locking_type = 2

Then, restart clvmd as such:
[root at flax ~]# service clvmd restart

Then:
[root at flax ~]# lvchange -a y /dev/nasvg_00/lvol0
[root at flax ~]#

(see, no error!)
[root at flax ~]# lvscan
  ACTIVE            '/dev/nasvg_00/lvol0' [5.46 TB] inherit
  ACTIVE            '/dev/lgevg_00/lvol0' [3.55 TB] inherit
  ACTIVE            '/dev/noraidvg_01/lvol0' [546.92 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol00' [134.47 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol01' [1.94 GB] inherit

(it's active!)

Then, go back and modify /etc/lvm/lvm.conf to restore the original 
locking_type to 3
Then, restart clvmd.

THOUGHTS:

I admit I don't know much about clustering, but from the evidence I see, 
the problem appears to be isolated to clvmd and iSCSI, if only for the 
fact that my direct-attached clustered volumes don't exhibit the symptoms.

I'll make another leap here and guess that it's probably isolated to 
single-node clusters, since I'd imagine that most people who are using 
clustering are probably using clustering as it was intended to be used 
(ie, multiple machines).


From mpartio at gmail.com  Tue Oct 16 07:13:00 2007
From: mpartio at gmail.com (Mikko Partio)
Date: Tue, 16 Oct 2007 10:13:00 +0300
Subject: [Linux-cluster] Re: Two-node cluster disconnecting
In-Reply-To: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
Message-ID: <2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>

On 10/11/07, Mikko Partio <mpartio at gmail.com> wrote:
>
> Hello list
>
> I have a problem with a two-node cluster going split-brain. When I first
> boot the other node, it correctly starts all the services and informs that
> cluster is quorate. Then when I boot the other node, on the boot phase when
> it starts the cluster software it does not find the node already running and
> starts the same services already running on node 1! When the boot is
> complete I can see that the nodes have found each other for a small period
> of time but then immediately disconnect from each other. The cluster is
> created with Conga with shared disk support though no shared disks are
> created yet. This is on CentOS 5.
>
> cluster.conf:


Anybody got a clue? Also, should I be configuring ais.conf also? There is no
mention about it in the docs but if cman is using openais, I'd guess it
should be configured with the correct addresses right?

Regards

M
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071016/b0ed0325/attachment.htm>

From gordan at bobich.net  Tue Oct 16 07:28:13 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 16 Oct 2007 03:28:13 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle but high
	load
In-Reply-To: <2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
Message-ID: <Pine.LNX.4.64.0710160320160.13586@outpost.shatteredsilicon.net>

Hi,

I have a cluster (3 nodes at the moment, may grow up to 16) for handling a 
lot of small files (Maildir). When I test the system by sending around 3-5 
messages/second I see the load on the cluster nodes go up to about 20-30, 
even though the CPUs on the cluster are about 90% idle at all times.

I am guessing that this is due to the clustered machines waiting for DLM 
locks to be established, which causes a lot of processes to be fighting to 
run, but since they don't get to run very soon, they back up and cause the 
load averages to go up.

Assuming the DLM runs over the interface specified by IP and MAC in 
cluster.conf, it is running over gigabit ethernet.

Are there any configuration changes or tuning parameters I can apply to 
DLM to alleviate this condition? The machine I'm running the test from 
(the one sending messages) is about 1/4 of the spec of each of the cluster 
nodes, and it's running a load average of about 0.4. It seems crazy that a 
single low-spec node should be able to completely overwhelm a cluster 12x 
it's spec several times over.

Thanks.

Gordan


From Alain.Moulle at bull.net  Tue Oct 16 09:13:21 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 16 Oct 2007 11:13:21 +0200
Subject: [Linux-cluster] CS4 U4 versus CS4 U5 / cluster_id in cluster.conf
Message-ID: <471480B1.4040100@bull.net>

Hi

I don't remember if the modification to include directly
the cluster_id value in cluster.conf is available on CS4 U4
or only on CS4 U5 ?

Thanks
Alain Moull?


From maciej.bogucki at artegence.com  Tue Oct 16 09:23:56 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 16 Oct 2007 11:23:56 +0200
Subject: [Linux-cluster] SCSI 2 vs SCSI 3 reservation
In-Reply-To: <C776378855970A4DADE4A476447F6391F15439@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391F15439@NAMAIL3.ad.lsil.com>
Message-ID: <4714832C.40208@artegence.com>

Sadek, Abdel napisa?(a):
> Is there a document that clearly explains the difference between SCSI 2
> reserve/release and SCSI 3 persistent reservation?
> 
Google->"SCSI-2  SCSI-3 persistent reservation" fe. [1]

> My understanding of the SCSI reservation in a Cluster environment is
> that one node Only can reserve a LUN for write access.

No. For SCSI-3 PR only one node can reserve device but alle registrant
could have access to write - it depends also which mode for PROUT type
You set(fe. write exclusive - registrants only) [2]
> 
> In my 2 node RHEL 4.5 cluster I?ve noticed that both nodes are
> registered with my LUNs from my SAN storage array. Only one of them
> holds the reservation but both can access it for Read/Writes.

It is ok. For SCSI-3 PR all nodes could do registration, but only one
could have reservation.


[1] - http://blogs.sun.com/kristien/entry/scsi_reservations_in_sun_cluster
[2] - http://t10.org/ftp/t10/document.03/03-230r1.pdf

Best Regards
Maciej Bogucki


From pcaulfie at redhat.com  Tue Oct 16 09:28:12 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 16 Oct 2007 10:28:12 +0100
Subject: [Linux-cluster] CS4 U4 versus CS4 U5 / cluster_id in cluster.conf
In-Reply-To: <471480B1.4040100@bull.net>
References: <471480B1.4040100@bull.net>
Message-ID: <4714842C.9050504@redhat.com>

Alain Moulle wrote:
> Hi
> 
> I don't remember if the modification to include directly
> the cluster_id value in cluster.conf is available on CS4 U4
> or only on CS4 U5 ?

It was introduced in U5. There was an errata for U4 that included it though.

-- 
Patrick


From berthiaume_wayne at emc.com  Tue Oct 16 12:11:57 2007
From: berthiaume_wayne at emc.com (berthiaume_wayne at emc.com)
Date: Tue, 16 Oct 2007 08:11:57 -0400
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,
	clvmd (lock problem)
In-Reply-To: <47144374.6070806@daxsolutions.com>
References: <47144374.6070806@daxsolutions.com>
Message-ID: <D364D39DAD21D444BAE2C70919B62809079D4BD2@CORPUSMX40A.corp.emc.com>

I think it's because clvmd is trying to acquire the iSCSI LUNs and the
iSCSI driver has not come up fully yet. The network layer has to come
up, then iSCSI, then there iss a separate mount with a separate
liesystem tag _netdev that tells mount to wait for these. I'm not sue if
the same capabilities are in LVM to accommodate when an iSCSI device
comes up. This may by the reason they are missed by LVM.

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Paul Risenhoover
Sent: Tuesday, October 16, 2007 12:52 AM
To: Linux-cluster at redhat.com
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,clvmd
(lock problem)

Hi All,

I am a noob to this maillist, but I've got some kind of locking problem 
with Linux and clusters, and iSCSI that plagues me.  It's a pretty 
serious issue because every time I reboot my server, it fails to mount 
my primary iSCSI device out of the box, and in order to get it working, 
I have to perform some pretty manual operations to get it operational
again.

Here is some configuration information:

Linux flax.xxx.com 2.6.9-55.0.9.ELsmp #1 SMP Thu Sep 27 18:27:41 EDT 
2007 i686 i686 i386 GNU/Linux

[root at flax ~]# clvmd -V
Cluster LVM daemon version: 2.02.21-RHEL4 (2007-04-17)
Protocol version:           0.2.1

dmesg (excerpted)
iscsi-sfnet: Loading iscsi_sfnet version 4:0.1.11-3
iscsi-sfnet: Control device major number 254
iscsi-sfnet:host3: Session established
scsi3 : SFNet iSCSI driver
  Vendor: Promise   Model: VTrak M500i       Rev: 2211
  Type:   Direct-Access                      ANSI SCSI revision: 04
sdh : very big device. try to use READ CAPACITY(16).
SCSI device sdh: 5859373056 512-byte hdwr sectors (2999999 MB)
SCSI device sdh: drive cache: write back
sdh : very big device. try to use READ CAPACITY(16).
SCSI device sdh: 5859373056 512-byte hdwr sectors (2999999 MB)
SCSI device sdh: drive cache: write back
 sdh: unknown partition table

[root at flax ~]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  flax                                     Online, Local, rgmanager

YES, THIS IS A ONE-NODE CLUSTER (Which, I suspect, might be the problem)

SYMPTOM:

When the server comes up, the clustered logical volume that is on the 
iSCSI device is labeled "inactive" when I do an "lvscan:"
[root at flax ~]# lvscan
  inactive            '/dev/nasvg_00/lvol0' [5.46 TB] inherit
  ACTIVE            '/dev/lgevg_00/lvol0' [3.55 TB] inherit
  ACTIVE            '/dev/noraidvg_01/lvol0' [546.92 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol00' [134.47 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol01' [1.94 GB] inherit
 
The thing that's interesting is the lgevg_00 and the noraidvg_01 volumes

are also clustered, but they are direct-attached SCSI (ie, not ISCSI).

The volume group that the logical volume is a member of shows clean:
[root at flax ~]# vgscan
  Reading all physical volumes.  This may take a while...
  Found volume group "nasvg_00" using metadata type lvm2
  Found volume group "lgevg_00" using metadata type lvm2
  Found volume group "noraidvg_01" using metadata type lvm2

So, in order to fix this, I execute the following:

[root at flax ~]# lvchange -a y /dev/nasvg_00/lvol0
Error locking on node flax: Volume group for uuid not found: 
oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS

This also shows up in my syslog, as such:
Oct 13 11:27:40 flax vgchange:   Error locking on node flax: Volume 
group for uuid not found: 
oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS

RESOLUTION:

It took me a very long time to figure this out, but since it happens to 
me every time I reboot my server, somebody's bound to run into this 
again sometime soon (and it will probably be me).

Here's how I resolved it:

I edited the /etc/lvm/lvm.conf file as such:

was:
    # Type of locking to use. Defaults to local file-based locking (1).
    # Turn locking off by setting to 0 (dangerous: risks metadata
corruption
    # if LVM2 commands get run concurrently).
    # Type 2 uses the external shared library locking_library.
    # Type 3 uses built-in clustered locking.
    #locking_type = 1
    locking_type = 3

changed to:

(snip)
    # Type 3 uses built-in clustered locking.
    #locking_type = 1
    locking_type = 2

Then, restart clvmd as such:
[root at flax ~]# service clvmd restart

Then:
[root at flax ~]# lvchange -a y /dev/nasvg_00/lvol0
[root at flax ~]#

(see, no error!)
[root at flax ~]# lvscan
  ACTIVE            '/dev/nasvg_00/lvol0' [5.46 TB] inherit
  ACTIVE            '/dev/lgevg_00/lvol0' [3.55 TB] inherit
  ACTIVE            '/dev/noraidvg_01/lvol0' [546.92 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol00' [134.47 GB] inherit
  ACTIVE            '/dev/VolGroup00/LogVol01' [1.94 GB] inherit

(it's active!)

Then, go back and modify /etc/lvm/lvm.conf to restore the original 
locking_type to 3
Then, restart clvmd.

THOUGHTS:

I admit I don't know much about clustering, but from the evidence I see,

the problem appears to be isolated to clvmd and iSCSI, if only for the 
fact that my direct-attached clustered volumes don't exhibit the
symptoms.

I'll make another leap here and guess that it's probably isolated to 
single-node clusters, since I'd imagine that most people who are using 
clustering are probably using clustering as it was intended to be used 
(ie, multiple machines).

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From jos at xos.nl  Tue Oct 16 12:17:36 2007
From: jos at xos.nl (Jos Vos)
Date: Tue, 16 Oct 2007 14:17:36 +0200
Subject: [Linux-cluster] Quorum disk votes not used when starting node?
In-Reply-To: <1192462652.27135.16.camel@ayanami.boston.devel.redhat.com>
References: <200710101856.l9AIuRWA008113@jasmine.xos.nl>
	<1192462652.27135.16.camel@ayanami.boston.devel.redhat.com>
Message-ID: <20071016121736.GF3462@jasmine.xos.nl>

On Mon, Oct 15, 2007 at 11:37:32AM -0400, Lon Hohberger wrote:

> > However, after the reboot, while the file tested in the heuristic does
> > still not exist, the node is joining the cluster again and starts some
> > cluster services!
> 
> Yup.
> 
> Add stop_cman="1" to <quorumd>

I already tried so, without success... :-(

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From wcheng at redhat.com  Tue Oct 16 13:36:17 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Tue, 16 Oct 2007 09:36:17 -0400
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <47144374.6070806@daxsolutions.com>
References: <47144374.6070806@daxsolutions.com>
Message-ID: <4714BE51.5020907@redhat.com>

Paul Risenhoover wrote:

> THOUGHTS:
>
> I admit I don't know much about clustering, but from the evidence I 
> see, the problem appears to be isolated to clvmd and iSCSI, if only 
> for the fact that my direct-attached clustered volumes don't exhibit 
> the symptoms.


Will let LVM folks comment on rest of the issues. However, if you intend 
to use this as single node case, are you aware that both GFS and GFS2 
support a "lock_nolock" protocol that doesn't require CLVMD ? It can be 
run on a plain storage device (say /dev/sda1)  and doesn't have any 
locking overhead. Do a "man gfs_mkfs" and search for "LockProtoName". A 
sample mkfs-mount command looks like the following:

shell> gfs_mkfs -t my_cluster:gfs1 -p lock_nolock -j 1 /dev/sda1
shell> mount -t gfs /dev/sda1 /mnt/gfs1

-- Wendy


From gordan at bobich.net  Tue Oct 16 13:26:46 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 16 Oct 2007 09:26:46 -0400 (EDT)
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <4714BE51.5020907@redhat.com>
References: <47144374.6070806@daxsolutions.com> <4714BE51.5020907@redhat.com>
Message-ID: <Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>

On Tue, 16 Oct 2007, Wendy Cheng wrote:

>>  THOUGHTS:
>>
>>  I admit I don't know much about clustering, but from the evidence I see,
>>  the problem appears to be isolated to clvmd and iSCSI, if only for the
>>  fact that my direct-attached clustered volumes don't exhibit the symptoms.
>
> Will let LVM folks comment on rest of the issues. However, if you intend to 
> use this as single node case, are you aware that both GFS and GFS2 support a 
> "lock_nolock" protocol that doesn't require CLVMD ? It can be run on a plain 
> storage device (say /dev/sda1)  and doesn't have any locking overhead. Do a 
> "man gfs_mkfs" and search for "LockProtoName". A sample mkfs-mount command 
> looks like the following:

Err - possibly a misunderstanding, but GFS/GFS2 doesn't require LVM/CLVM. 
You can run on a raw device without volume management.

Gordan


From isplist at logicore.net  Tue Oct 16 15:18:13 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 16 Oct 2007 10:18:13 -0500
Subject: [Linux-cluster] Multiple clusters in one cluster.conf
In-Reply-To: <1192463408.27135.32.camel@ayanami.boston.devel.redhat.com>
Message-ID: <20071016101813.770207@leena>

Thanks for the help.

> Multiple cluster.confs.  Be sure to make the cluster names different on
> each cluster, too. :)

Just to confirm here, name my cluster.conf file to say cluster2.conf, placed 
in the same directory /etc/cluster and edited to suit my needs for the next 
cluster/s?

Mike


From jgray at nicusa.com  Tue Oct 16 17:21:12 2007
From: jgray at nicusa.com (Josh Gray)
Date: Tue, 16 Oct 2007 11:21:12 -0600
Subject: [Linux-cluster] Virtual ips 'drifting' from one node to another
Message-ID: <C33A4F28.10C644%jgray@nicusa.com>

I've read reports of people having trouble with virtual ip's that are
controlled by the cluster suite coming up when they shouldn't on nodes and
even to the point where multiple hosts will have the ip on them (looking at
ip addr list) and the cluster doesn't realize it.

Has anyone seen this?  Any fixes you know of or ways to fight it?

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From jos at xos.nl  Tue Oct 16 17:26:21 2007
From: jos at xos.nl (Jos Vos)
Date: Tue, 16 Oct 2007 19:26:21 +0200
Subject: [Linux-cluster] Virtual ips 'drifting' from one node to another
In-Reply-To: <C33A4F28.10C644%jgray@nicusa.com>
References: <C33A4F28.10C644%jgray@nicusa.com>
Message-ID: <20071016172621.GN3462@jasmine.xos.nl>

On Tue, Oct 16, 2007 at 11:21:12AM -0600, Josh Gray wrote:

> I've read reports of people having trouble with virtual ip's that are
> controlled by the cluster suite coming up when they shouldn't on nodes and
> even to the point where multiple hosts will have the ip on them (looking at
> ip addr list) and the cluster doesn't realize it.
> 
> Has anyone seen this?  Any fixes you know of or ways to fight it?

No,  Can't this be caused by a wrong config (i.e. multiple services
defining the same IP address, which might nog be checked by the
software)?

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jgray at nicusa.com  Tue Oct 16 17:41:23 2007
From: jgray at nicusa.com (Josh Gray)
Date: Tue, 16 Oct 2007 11:41:23 -0600
Subject: [Linux-cluster] Virtual ips 'drifting' from one node to
 another
In-Reply-To: <20071016172621.GN3462@jasmine.xos.nl>
Message-ID: <C33A53E3.10C65C%jgray@nicusa.com>

Sorry I'm quite the question asker on the list this week, trying to digest
the cluster docs pretty quick!

One more Q - Reading the FAQ section on power fencing.  What is the best
design for a two (or three) node cluster with dual power supplies?  I would
assume best with regards to redundancy would be two power switches - one for
each power supply.  But in the case of fencing shoot outs, should they all
be on the same single switch?  Am I just over thinking this?

" same network path as the path used by CMAN for cluster communication"
Does that mean literally the same ethernet switch or logical vlan, or what?


#  What is the best two-node network & fencing configuration?

In a two node cluster (where you are using two_node="1" in the cluster
configuration, and w/o QDisk), there are several considerations you need to
be aware of:

    * If you are using per-node power management of any sort where the
device is not shared between cluster nodes, you MUST have all fence devices
on the same network path as the path used by CMAN for cluster communication.
Failure to do so can result in both nodes simultaneously fencing each other,
leaving the entire cluster dead, or end up in a fence loop. Typically, this
includes all integrated power management solutions (iLO, IPMI, RSA, ERA, IBM
Blade Center, Egenera Blade Frame, Dell DRAC, etc.), but also includes
remote power switches (APC, WTI) if the devices are not shared between the
two nodes.
    * It is best to use power-type fencing. SAN or SCSI-reservation fencing
might work, as long as it meets the above requirements. If it does not, you
should consider using a quorum disk or partition

If you can not meet the above requirements, you can use quorum disk or
partition.


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From prisenhoover at sampledigital.com  Tue Oct 16 18:15:30 2007
From: prisenhoover at sampledigital.com (Paul Risenhoover)
Date: Tue, 16 Oct 2007 11:15:30 -0700
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <D364D39DAD21D444BAE2C70919B62809079D4BD2@CORPUSMX40A.corp.emc.com>
References: <47144374.6070806@daxsolutions.com>
	<D364D39DAD21D444BAE2C70919B62809079D4BD2@CORPUSMX40A.corp.emc.com>
Message-ID: <4714FFC2.8050500@sampledigital.com>


berthiaume_wayne at emc.com wrote:
> I think it's because clvmd is trying to acquire the iSCSI LUNs and the
> iSCSI driver has not come up fully yet. The network layer has to come
> up, then iSCSI, then there iss a separate mount with a separate
> liesystem tag _netdev that tells mount to wait for these. I'm not sue if
> the same capabilities are in LVM to accommodate when an iSCSI device
> comes up. This may by the reason they are missed by LVM.
>   
Yes it is true that when the system initially boots, iSCSI is not 
available.  But even after it boots, and the iSCSI device is available, 
I cannot manually make the device active.

And furthermore, as of this morning, I've got the same problem.  I just 
added a new 3TB physical device.

Performed the following
pvcreate /dev/sdj
vgextend nasvg_00 /dev/sjd
then...

[root at flax ~]# lvextend -l 2145765 /dev/nasvg_00/lvol0
  Extending logical volume lvol0 to 8.19 TB
  Error locking on node flax: Volume group for uuid not found: 
oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS
  Failed to suspend lvol0

I tried modifying lvm.conf to set the locking_type back to 2, but since 
the file system is in use it won't make the change.

Any thoughts?
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Paul Risenhoover
> Sent: Tuesday, October 16, 2007 12:52 AM
> To: Linux-cluster at redhat.com
> Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,clvmd
> (lock problem)
>
> Hi All,
>
> I am a noob to this maillist, but I've got some kind of locking problem 
> with Linux and clusters, and iSCSI that plagues me.  It's a pretty 
> serious issue because every time I reboot my server, it fails to mount 
> my primary iSCSI device out of the box, and in order to get it working, 
> I have to perform some pretty manual operations to get it operational
> again.
>
> Here is some configuration information:
>
> Linux flax.xxx.com 2.6.9-55.0.9.ELsmp #1 SMP Thu Sep 27 18:27:41 EDT 
> 2007 i686 i686 i386 GNU/Linux
>
> [root at flax ~]# clvmd -V
> Cluster LVM daemon version: 2.02.21-RHEL4 (2007-04-17)
> Protocol version:           0.2.1
>
> dmesg (excerpted)
> iscsi-sfnet: Loading iscsi_sfnet version 4:0.1.11-3
> iscsi-sfnet: Control device major number 254
> iscsi-sfnet:host3: Session established
> scsi3 : SFNet iSCSI driver
>   Vendor: Promise   Model: VTrak M500i       Rev: 2211
>   Type:   Direct-Access                      ANSI SCSI revision: 04
> sdh : very big device. try to use READ CAPACITY(16).
> SCSI device sdh: 5859373056 512-byte hdwr sectors (2999999 MB)
> SCSI device sdh: drive cache: write back
> sdh : very big device. try to use READ CAPACITY(16).
> SCSI device sdh: 5859373056 512-byte hdwr sectors (2999999 MB)
> SCSI device sdh: drive cache: write back
>  sdh: unknown partition table
>
> [root at flax ~]# clustat
> Member Status: Quorate
>
>   Member Name                              Status
>   ------ ----                              ------
>   flax                                     Online, Local, rgmanager
>
> YES, THIS IS A ONE-NODE CLUSTER (Which, I suspect, might be the problem)
>
> SYMPTOM:
>
> When the server comes up, the clustered logical volume that is on the 
> iSCSI device is labeled "inactive" when I do an "lvscan:"
> [root at flax ~]# lvscan
>   inactive            '/dev/nasvg_00/lvol0' [5.46 TB] inherit
>   ACTIVE            '/dev/lgevg_00/lvol0' [3.55 TB] inherit
>   ACTIVE            '/dev/noraidvg_01/lvol0' [546.92 GB] inherit
>   ACTIVE            '/dev/VolGroup00/LogVol00' [134.47 GB] inherit
>   ACTIVE            '/dev/VolGroup00/LogVol01' [1.94 GB] inherit
>  
> The thing that's interesting is the lgevg_00 and the noraidvg_01 volumes
>
> are also clustered, but they are direct-attached SCSI (ie, not ISCSI).
>
> The volume group that the logical volume is a member of shows clean:
> [root at flax ~]# vgscan
>   Reading all physical volumes.  This may take a while...
>   Found volume group "nasvg_00" using metadata type lvm2
>   Found volume group "lgevg_00" using metadata type lvm2
>   Found volume group "noraidvg_01" using metadata type lvm2
>
> So, in order to fix this, I execute the following:
>
> [root at flax ~]# lvchange -a y /dev/nasvg_00/lvol0
> Error locking on node flax: Volume group for uuid not found: 
> oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS
>
> This also shows up in my syslog, as such:
> Oct 13 11:27:40 flax vgchange:   Error locking on node flax: Volume 
> group for uuid not found: 
> oNhRO1WqNJp3BZxxrlMT16dwpwcRiIQPejnrEUbQ3HMJ6BjHef1hKAsoA6Sl9ISS
>
> RESOLUTION:
>
> It took me a very long time to figure this out, but since it happens to 
> me every time I reboot my server, somebody's bound to run into this 
> again sometime soon (and it will probably be me).
>
> Here's how I resolved it:
>
> I edited the /etc/lvm/lvm.conf file as such:
>
> was:
>     # Type of locking to use. Defaults to local file-based locking (1).
>     # Turn locking off by setting to 0 (dangerous: risks metadata
> corruption
>     # if LVM2 commands get run concurrently).
>     # Type 2 uses the external shared library locking_library.
>     # Type 3 uses built-in clustered locking.
>     #locking_type = 1
>     locking_type = 3
>
> changed to:
>
> (snip)
>     # Type 3 uses built-in clustered locking.
>     #locking_type = 1
>     locking_type = 2
>
> Then, restart clvmd as such:
> [root at flax ~]# service clvmd restart
>
> Then:
> [root at flax ~]# lvchange -a y /dev/nasvg_00/lvol0
> [root at flax ~]#
>
> (see, no error!)
> [root at flax ~]# lvscan
>   ACTIVE            '/dev/nasvg_00/lvol0' [5.46 TB] inherit
>   ACTIVE            '/dev/lgevg_00/lvol0' [3.55 TB] inherit
>   ACTIVE            '/dev/noraidvg_01/lvol0' [546.92 GB] inherit
>   ACTIVE            '/dev/VolGroup00/LogVol00' [134.47 GB] inherit
>   ACTIVE            '/dev/VolGroup00/LogVol01' [1.94 GB] inherit
>
> (it's active!)
>
> Then, go back and modify /etc/lvm/lvm.conf to restore the original 
> locking_type to 3
> Then, restart clvmd.
>
> THOUGHTS:
>
> I admit I don't know much about clustering, but from the evidence I see,
>
> the problem appears to be isolated to clvmd and iSCSI, if only for the 
> fact that my direct-attached clustered volumes don't exhibit the
> symptoms.
>
> I'll make another leap here and guess that it's probably isolated to 
> single-node clusters, since I'd imagine that most people who are using 
> clustering are probably using clustering as it was intended to be used 
> (ie, multiple machines).
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   


From prisenhoover at sampledigital.com  Tue Oct 16 18:33:05 2007
From: prisenhoover at sampledigital.com (Paul Risenhoover)
Date: Tue, 16 Oct 2007 11:33:05 -0700
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>
References: <47144374.6070806@daxsolutions.com> <4714BE51.5020907@redhat.com>
	<Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>
Message-ID: <471503E1.4020902@sampledigital.com>

Gordan Bobic wrote:
> On Tue, 16 Oct 2007, Wendy Cheng wrote:
>
>>>  THOUGHTS:
>>>
>>>  I admit I don't know much about clustering, but from the evidence I 
>>> see,
>>>  the problem appears to be isolated to clvmd and iSCSI, if only for the
>>>  fact that my direct-attached clustered volumes don't exhibit the 
>>> symptoms.
>>
>> Will let LVM folks comment on rest of the issues. However, if you 
>> intend to use this as single node case, are you aware that both GFS 
>> and GFS2 support a "lock_nolock" protocol that doesn't require CLVMD 
>> ? It can be run on a plain storage device (say /dev/sda1)  and 
>> doesn't have any locking overhead. Do a "man gfs_mkfs" and search for 
>> "LockProtoName". A sample mkfs-mount command looks like the following:
>
> Err - possibly a misunderstanding, but GFS/GFS2 doesn't require 
> LVM/CLVM. You can run on a raw device without volume management.
Ouch.  Good to know.  If I use raw devices can I grow and shrink 
volumes?  The specific need is to be able to take a physical devices out 
of service (ie, one of my iSCSI devices) so that I can restripe it or 
replace it.

Here's another scenario: I've got two existing physical devices of ~3TB 
each, both are members of the nasvg_00 volume group (using clvmd), plus 
a third physical device that I'm trying to bring online.  Is there a 
migration path that allows me to format the new physical device with 
gfs/raw, join it to the exiting gfs file system, and then migrate the 
other physical devices (one by one) to a gfs/raw format? 

Oh and can anybody lend me a  tape backup system? ;)

>
> Gordan
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From rstevens at internap.com  Tue Oct 16 18:41:24 2007
From: rstevens at internap.com (Rick Stevens)
Date: Tue, 16 Oct 2007 11:41:24 -0700
Subject: [Linux-cluster] Virtual ips 'drifting' from one node toanother
In-Reply-To: <C33A53E3.10C65C%jgray@nicusa.com>
References: <C33A53E3.10C65C%jgray@nicusa.com>
Message-ID: <1192560084.28353.11.camel@prophead.corp.publichost.com>

On Tue, 2007-10-16 at 11:41 -0600, Josh Gray wrote:
> Sorry I'm quite the question asker on the list this week, trying to digest
> the cluster docs pretty quick!
> 
> One more Q - Reading the FAQ section on power fencing.  What is the best
> design for a two (or three) node cluster with dual power supplies?  I would
> assume best with regards to redundancy would be two power switches - one for
> each power supply.  But in the case of fencing shoot outs, should they all
> be on the same single switch?  Am I just over thinking this?

In our configs, we use 8-port APC power strips.  On any given machine,
each power supply is on a separate switch.  Also note that the switches
are on separate circuits as well--the idea being that losing a power
circuit doesn't take a server off-line.

Ordinarily, a machine that uses 2 amps of power will draw 1 amp from
each supply.  However, if you lose a circuit, ALL of the power that
machine is using will come from the remaining supply (2 amps) and will
thus load the power strip twice as heavily.  Make sure you compute the
load of each server and only plug in as many machines as the strip can
handle.  In the above scenario, you obviously can't plug 8 machines into
the strip because if all 8 machines end up drawing 2 amps, that'll
exceed the 15 amp rating of the strip (and possibly the circuit it's
plugged into.

> " same network path as the path used by CMAN for cluster communication"
> Does that mean literally the same ethernet switch or logical vlan, or what?

Technically, the VLAN, but if possible, I'd use the same switch.  One
less thing to poop out.

----------------------------------------------------------------------
- Rick Stevens, Principal Engineer             rstevens at internap.com -
- CDN Systems, Internap, Inc.                http://www.internap.com -
-                                                                    -
-        Hard work has a future payoff. Laziness pays off now.       -
----------------------------------------------------------------------


From chris at cmiware.com  Tue Oct 16 18:49:19 2007
From: chris at cmiware.com (Chris Harms)
Date: Tue, 16 Oct 2007 13:49:19 -0500
Subject: [Linux-cluster] Virtual ips 'drifting' from one node to another
In-Reply-To: <C33A53E3.10C65C%jgray@nicusa.com>
References: <C33A53E3.10C65C%jgray@nicusa.com>
Message-ID: <471507AF.3050509@cmiware.com>

We went with 2 APC remote power switches.  The ports can be grouped 
together so you can turn off multiple ports at the same time to combat 
redundant power supplies keeping the machine on.  Just remember to have 
the cluster use the same IP address to connect to the APC device for 
each node so they can't both be fenced simultaneously.  The APC software 
only allows one session at a time so the second node will be blocked 
from accessing it on the same IP.

Hope that helps.

Chris

Josh Gray wrote:
> Sorry I'm quite the question asker on the list this week, trying to digest
> the cluster docs pretty quick!
>
> One more Q - Reading the FAQ section on power fencing.  What is the best
> design for a two (or three) node cluster with dual power supplies?  I would
> assume best with regards to redundancy would be two power switches - one for
> each power supply.  But in the case of fencing shoot outs, should they all
> be on the same single switch?  Am I just over thinking this?
>
> " same network path as the path used by CMAN for cluster communication"
> Does that mean literally the same ethernet switch or logical vlan, or what?
>
>
>
>
> #  What is the best two-node network & fencing configuration?
>
> In a two node cluster (where you are using two_node="1" in the cluster
> configuration, and w/o QDisk), there are several considerations you need to
> be aware of:
>
>     * If you are using per-node power management of any sort where the
> device is not shared between cluster nodes, you MUST have all fence devices
> on the same network path as the path used by CMAN for cluster communication.
> Failure to do so can result in both nodes simultaneously fencing each other,
> leaving the entire cluster dead, or end up in a fence loop. Typically, this
> includes all integrated power management solutions (iLO, IPMI, RSA, ERA, IBM
> Blade Center, Egenera Blade Frame, Dell DRAC, etc.), but also includes
> remote power switches (APC, WTI) if the devices are not shared between the
> two nodes.
>     * It is best to use power-type fencing. SAN or SCSI-reservation fencing
> might work, as long as it meets the above requirements. If it does not, you
> should consider using a quorum disk or partition
>
> If you can not meet the above requirements, you can use quorum disk or
> partition.
>
>
>
>
>
>   


From eric at bootseg.com  Tue Oct 16 19:59:51 2007
From: eric at bootseg.com (Eric Kerin)
Date: Tue, 16 Oct 2007 15:59:51 -0400
Subject: [Linux-cluster] Virtual ips 'drifting' from one node to another
In-Reply-To: <471507AF.3050509@cmiware.com>
References: <C33A53E3.10C65C%jgray@nicusa.com> <471507AF.3050509@cmiware.com>
Message-ID: <47151837.10408@bootseg.com>

Chris Harms wrote:
> We went with 2 APC remote power switches.  The ports can be grouped 
> together so you can turn off multiple ports at the same time to combat 
> redundant power supplies keeping the machine on.  Just remember to 
> have the cluster use the same IP address to connect to the APC device 
> for each node so they can't both be fenced simultaneously.  The APC 
> software only allows one session at a time so the second node will be 
> blocked from accessing it on the same IP.
>
Be careful with port groups.  In the quick bit of testing I did with my 
7920s, if I unplugged the network cable on the 2nd PDU (the one that my 
nodes were not accessing for control)  it said that it turned off the 
port, but the 2nd PDU still had both ports on (for obvious reasons)  
Which would quickly lead to a corrupted filesystem...

I personally use this config for my nodes (this allows for the cluster 
to know if fencing failed on either controller, and not continue in that 
case):
<device name="left" port="2" option="off"/>
<device name="right" port="2" option="off"/>
<device name="left" port="2" option="on"/>
<device name="right" port="2" option="on"/>

Thanks,
Eric Kerin
eric at bootseg.com


From wcheng at redhat.com  Tue Oct 16 20:36:08 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Tue, 16 Oct 2007 16:36:08 -0400
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <471503E1.4020902@sampledigital.com>
References: <47144374.6070806@daxsolutions.com>
	<4714BE51.5020907@redhat.com>	<Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>
	<471503E1.4020902@sampledigital.com>
Message-ID: <471520B8.2030708@redhat.com>

Paul Risenhoover wrote:

>
> Ouch.  Good to know.  If I use raw devices can I grow and shrink 
> volumes?  The specific need is to be able to take a physical devices 
> out of service (ie, one of my iSCSI devices) so that I can restripe it 
> or replace it.

But you can also run lock_nolock on top of LVM partition, *any* LVM 
partition, cluster or not... Why going thru so much troubles for a local 
filesystem ?

-- Wendy


From nlam87346 at library.usyd.edu.au  Tue Oct 16 20:50:21 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Wed, 17 Oct 2007 06:50:21 +1000
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle but
	high load
In-Reply-To: <Pine.LNX.4.64.0710160320160.13586@outpost.shatteredsilicon.net>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
	<Pine.LNX.4.64.0710160320160.13586@outpost.shatteredsilicon.net>
Message-ID: <1192567821.3272.5.camel@lits19.library.usyd.edu.au>

On Tue, 2007-10-16 at 03:28 -0400, Gordan Bobic wrote:
> Hi,
> 
> I have a cluster (3 nodes at the moment, may grow up to 16) for handling a 
> lot of small files (Maildir). When I test the system by sending around 3-5 
> messages/second I see the load on the cluster nodes go up to about 20-30, 
> even though the CPUs on the cluster are about 90% idle at all times.
> 
> I am guessing that this is due to the clustered machines waiting for DLM 
> locks to be established, which causes a lot of processes to be fighting to 
> run, but since they don't get to run very soon, they back up and cause the 
> load averages to go up.
> 
> Assuming the DLM runs over the interface specified by IP and MAC in 
> cluster.conf, it is running over gigabit ethernet.
> 
> Are there any configuration changes or tuning parameters I can apply to 
> DLM to alleviate this condition? The machine I'm running the test from 
> (the one sending messages) is about 1/4 of the spec of each of the cluster 
> nodes, and it's running a load average of about 0.4. It seems crazy that a 
> single low-spec node should be able to completely overwhelm a cluster 12x 
> it's spec several times over.
> 
> Thanks.
> 
> Gordan


Hi Gordan,

I don't know alot about GFS but since no one else has replied yet, my
understanding is that it's not suitable for an applications like what
you describe (many small files being opened frequently). I think GFS2,
which is still a tech preview, has been redesigned to improve this
situation.

Regards,

Nik


From prisenhoover at sampledigital.com  Tue Oct 16 21:40:14 2007
From: prisenhoover at sampledigital.com (Paul Risenhoover)
Date: Tue, 16 Oct 2007 14:40:14 -0700
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <471520B8.2030708@redhat.com>
References: <47144374.6070806@daxsolutions.com>	<4714BE51.5020907@redhat.com>	<Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>	<471503E1.4020902@sampledigital.com>
	<471520B8.2030708@redhat.com>
Message-ID: <47152FBE.9030709@sampledigital.com>

Wendy Cheng wrote:
> Paul Risenhoover wrote:
>
>>
>> Ouch.  Good to know.  If I use raw devices can I grow and shrink 
>> volumes?  The specific need is to be able to take a physical devices 
>> out of service (ie, one of my iSCSI devices) so that I can restripe 
>> it or replace it.
>
> But you can also run lock_nolock on top of LVM partition, *any* LVM 
> partition, cluster or not... Why going thru so much troubles for a 
> local filesystem ?
I'm planning on adding nodes as necessary.  I'm future-proofing!
>
> -- Wendy
>
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From orkcu at yahoo.com  Wed Oct 17 00:10:43 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Tue, 16 Oct 2007 17:10:43 -0700 (PDT)
Subject: [Linux-cluster] unregistering shared IP resource when stoping one
	service
Message-ID: <241847.48293.qm@web50611.mail.re2.yahoo.com>

Hi

I am getting a similar behaivor reported in:
 https://bugzilla.redhat.com/show_bug.cgi?id=254111
but this time with a IP shared resource

right now we have a cluster with 4 services which
share the same IP as a IP shared resource in one node,
If I stop just one of this services the IP is
unregistered so the others services lose their service
IP...

I know that an IP shared resource is very wierd, not
to say a wrong disign, but if that is the case why not
avoid to provide the option to configure that resource
as a shared resource?

In the other hand I can see that in an scenario with
an IP shared resource will be imposible to relocate
the services to another node because two nodes can't
have the same IP but... what if I do not want that any
of those service to be relocated to another node ?
(active-active cluster with a "one node" failover
domain)

any subjestion?

thanks
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


      ____________________________________________________________________________________
Tonight's top picks. What will you watch tonight? Preview the hottest shows on Yahoo! TV.
http://tv.yahoo.com/ 


From gordan at bobich.net  Wed Oct 17 07:17:59 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 03:17:59 -0400 (EDT)
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <471503E1.4020902@sampledigital.com>
References: <47144374.6070806@daxsolutions.com> <4714BE51.5020907@redhat.com>
	<Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>
	<471503E1.4020902@sampledigital.com>
Message-ID: <Pine.LNX.4.64.0710170302250.31052@outpost.shatteredsilicon.net>

On Tue, 16 Oct 2007, Paul Risenhoover wrote:

>> > >   THOUGHTS:
>> > > 
>> > >  I admit I don't know much about clustering, but from the evidence I 
>> > >  see,
>> > >   the problem appears to be isolated to clvmd and iSCSI, if only for 
>> > >   the
>> > >   fact that my direct-attached clustered volumes don't exhibit the 
>> > >  symptoms.
>> > 
>> >  Will let LVM folks comment on rest of the issues. However, if you intend 
>> >  to use this as single node case, are you aware that both GFS and GFS2 
>> >  support a "lock_nolock" protocol that doesn't require CLVMD 
>>> ? It can be run on a plain storage device (say /dev/sda1)  and 
>> >  doesn't have any locking overhead. Do a "man gfs_mkfs" and search for 
>> >  "LockProtoName". A sample mkfs-mount command looks like the following:
>>
>>  Err - possibly a misunderstanding, but GFS/GFS2 doesn't require LVM/CLVM.
>>  You can run on a raw device without volume management.
> Ouch.  Good to know.  If I use raw devices can I grow and shrink volumes?

Sure - assuming your underlying volume provision supports it (e.g. if your 
iSCSI SAN allows you to grow volumes - it's a pretty lame SAN if it 
doesn't support something that is so trivial to implement).

The procedure would be to grow the block device on the SAN, and then use 
the file system growing utility (as you would do if you enlarged the 
volume using LVM) to make the FS expand onto the new, enlarged block 
device.

LVM is, IMNSHO, a solution for a non-problem now that we have online RAID 
growing capability and SANs can grow volumes at the press of a button.

> The specific need is to be able to take a physical devices out of service 
> (ie, one of my iSCSI devices) so that I can restripe it or replace it.

LVM won't help you with that.

What you really need to do is have two mirrored fail-over SANs, so 
you can down one, do whatever you need to do, re-mirror it, then repeat on 
the other one.

> Here's another scenario: I've got two existing physical devices of ~3TB each, 
> both are members of the nasvg_00 volume group (using clvmd), plus a third 
> physical device that I'm trying to bring online.  Is there a migration path 
> that allows me to format the new physical device with gfs/raw, join it to the 
> exiting gfs file system, and then migrate the other physical devices (one by 
> one) to a gfs/raw format?

You cannot "merge" two existing partitions into one. The file system won't 
be a single file system that spans both. You'd still have to put your data 
somewhere else, merge them, grow the one file system to the additional 
device(s) and the grow the FS.

In practice, however, a decent SAN solution (one you could build with COTS 
hardware and OSS Linux tools) will let you grow a SAN pretty much 
indefinitely, in terms of space. Bandwidth capacity of your ethernet will 
become an issue before the space becomes an issue.

Note that on RHEL4, there is a partitioning issue - for some reason, fdisk 
won't let you make partitions bigger than about 1TB. This may have been 
fixed on RHEL5, I don't know, I haven't tried it. But this isn't 
necessarily a problem because you can just use raw block devices, and most 
file systems can cope nowdays.

Gordan


From gordan at bobich.net  Wed Oct 17 07:21:18 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 03:21:18 -0400 (EDT)
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <471520B8.2030708@redhat.com>
References: <47144374.6070806@daxsolutions.com> <4714BE51.5020907@redhat.com>
	<Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>
	<471503E1.4020902@sampledigital.com> <471520B8.2030708@redhat.com>
Message-ID: <Pine.LNX.4.64.0710170319520.31052@outpost.shatteredsilicon.net>

On Tue, 16 Oct 2007, Wendy Cheng wrote:

>>  Ouch.  Good to know.  If I use raw devices can I grow and shrink volumes?
>>  The specific need is to be able to take a physical devices out of service
>>  (ie, one of my iSCSI devices) so that I can restripe it or replace it.
>
> But you can also run lock_nolock on top of LVM partition, *any* LVM 
> partition, cluster or not... Why going thru so much troubles for a local 
> filesystem ?

Yeah, I've been wondering that myself...

Gordan


From gordan at bobich.net  Wed Oct 17 07:24:09 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 03:24:09 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle but	high
	load
In-Reply-To: <1192567821.3272.5.camel@lits19.library.usyd.edu.au>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
	<Pine.LNX.4.64.0710160320160.13586@outpost.shatteredsilicon.net>
	<1192567821.3272.5.camel@lits19.library.usyd.edu.au>
Message-ID: <Pine.LNX.4.64.0710170322240.31052@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Nikolas Lam wrote:

>> I have a cluster (3 nodes at the moment, may grow up to 16) for handling a
>> lot of small files (Maildir). When I test the system by sending around 3-5
>> messages/second I see the load on the cluster nodes go up to about 20-30,
>> even though the CPUs on the cluster are about 90% idle at all times.
>>
>> I am guessing that this is due to the clustered machines waiting for DLM
>> locks to be established, which causes a lot of processes to be fighting to
>> run, but since they don't get to run very soon, they back up and cause the
>> load averages to go up.
>>
>> Assuming the DLM runs over the interface specified by IP and MAC in
>> cluster.conf, it is running over gigabit ethernet.
>>
>> Are there any configuration changes or tuning parameters I can apply to
>> DLM to alleviate this condition? The machine I'm running the test from
>> (the one sending messages) is about 1/4 of the spec of each of the cluster
>> nodes, and it's running a load average of about 0.4. It seems crazy that a
>> single low-spec node should be able to completely overwhelm a cluster 12x
>> it's spec several times over.
>
> I don't know alot about GFS but since no one else has replied yet, my
> understanding is that it's not suitable for an applications like what
> you describe (many small files being opened frequently). I think GFS2,
> which is still a tech preview, has been redesigned to improve this
> situation.

Indeed, I am aware that GFS2 is still broken, but I seem to be getting no 
worse a performance out of GFS than I get out of NFS. The only penalty is 
the high load, but the throughput is actually similar. The advantage that 
makes GFS win is that I don't need an arbitrating server to handle the NFS 
exports, which makes the clustering and redundancy a bit tidier.

Gordan


From gordan at bobich.net  Wed Oct 17 07:25:58 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 03:25:58 -0400 (EDT)
Subject: [Linux-cluster] Linux clustering (one-node), GFS, iSCSI,	clvmd
	(lock problem)
In-Reply-To: <47152FBE.9030709@sampledigital.com>
References: <47144374.6070806@daxsolutions.com> <4714BE51.5020907@redhat.com>
	<Pine.LNX.4.64.0710160923310.19844@outpost.shatteredsilicon.net>
	<471503E1.4020902@sampledigital.com> <471520B8.2030708@redhat.com>
	<47152FBE.9030709@sampledigital.com>
Message-ID: <Pine.LNX.4.64.0710170324230.31052@outpost.shatteredsilicon.net>

On Tue, 16 Oct 2007, Paul Risenhoover wrote:

>> >  Ouch.  Good to know.  If I use raw devices can I grow and shrink 
>> >  volumes?  The specific need is to be able to take a physical devices out 
>> >  of service (ie, one of my iSCSI devices) so that I can restripe it or 
>> >  replace it.
>>
>>  But you can also run lock_nolock on top of LVM partition, *any* LVM
>>  partition, cluster or not... Why going thru so much troubles for a local
>>  filesystem ?
>
> I'm planning on adding nodes as necessary.  I'm future-proofing!

You would be better off using a decently architected SAN solution to cover 
that aspect. You'd get better performance, lower RAID disk overheads, and 
better scalability.

Gordan


From grimme at atix.de  Wed Oct 17 07:39:04 2007
From: grimme at atix.de (Marc Grimme)
Date: Wed, 17 Oct 2007 09:39:04 +0200
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <Pine.LNX.4.64.0710170322240.31052@outpost.shatteredsilicon.net>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<1192567821.3272.5.camel@lits19.library.usyd.edu.au>
	<Pine.LNX.4.64.0710170322240.31052@outpost.shatteredsilicon.net>
Message-ID: <200710170939.04626.grimme@atix.de>

On Wednesday 17 October 2007 09:24:09 Gordan Bobic wrote:
> On Wed, 17 Oct 2007, Nikolas Lam wrote:
> >> I have a cluster (3 nodes at the moment, may grow up to 16) for handling
> >> a lot of small files (Maildir). When I test the system by sending around
> >> 3-5 messages/second I see the load on the cluster nodes go up to about
> >> 20-30, even though the CPUs on the cluster are about 90% idle at all
> >> times.
> >>
> >> I am guessing that this is due to the clustered machines waiting for DLM
> >> locks to be established, which causes a lot of processes to be fighting
> >> to run, but since they don't get to run very soon, they back up and
> >> cause the load averages to go up.
> >>
> >> Assuming the DLM runs over the interface specified by IP and MAC in
> >> cluster.conf, it is running over gigabit ethernet.
> >>
> >> Are there any configuration changes or tuning parameters I can apply to
> >> DLM to alleviate this condition? The machine I'm running the test from
> >> (the one sending messages) is about 1/4 of the spec of each of the
> >> cluster nodes, and it's running a load average of about 0.4. It seems
> >> crazy that a single low-spec node should be able to completely overwhelm
> >> a cluster 12x it's spec several times over.
> >
> > I don't know alot about GFS but since no one else has replied yet, my
> > understanding is that it's not suitable for an applications like what
> > you describe (many small files being opened frequently). I think GFS2,
> > which is still a tech preview, has been redesigned to improve this
> > situation.
>
> Indeed, I am aware that GFS2 is still broken, but I seem to be getting no
> worse a performance out of GFS than I get out of NFS. The only penalty is
> the high load, but the throughput is actually similar. The advantage that
> makes GFS win is that I don't need an arbitrating server to handle the NFS
> exports, which makes the clustering and redundancy a bit tidier.
Gordan,
with your testing did you also try to adapt the size of the 
rsbtbl_size/lkbtbl_size? I would be quite interested if this increases your 
performance or not. Do you have lot of small files?


-- 
Gruss / Regards,

Marc Grimme
http://www.atix.de/               http://www.open-sharedroot.org/


From gordan at bobich.net  Wed Oct 17 07:52:20 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 03:52:20 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <200710170939.04626.grimme@atix.de>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<1192567821.3272.5.camel@lits19.library.usyd.edu.au>
	<Pine.LNX.4.64.0710170322240.31052@outpost.shatteredsilicon.net>
	<200710170939.04626.grimme@atix.de>
Message-ID: <Pine.LNX.4.64.0710170340320.31052@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Marc Grimme wrote:

>>>> I have a cluster (3 nodes at the moment, may grow up to 16) for handling
>>>> a lot of small files (Maildir). When I test the system by sending around
>>>> 3-5 messages/second I see the load on the cluster nodes go up to about
>>>> 20-30, even though the CPUs on the cluster are about 90% idle at all
>>>> times.
>>>>
>>>> I am guessing that this is due to the clustered machines waiting for DLM
>>>> locks to be established, which causes a lot of processes to be fighting
>>>> to run, but since they don't get to run very soon, they back up and
>>>> cause the load averages to go up.
>>>>
>>>> Assuming the DLM runs over the interface specified by IP and MAC in
>>>> cluster.conf, it is running over gigabit ethernet.
>>>>
>>>> Are there any configuration changes or tuning parameters I can apply to
>>>> DLM to alleviate this condition? The machine I'm running the test from
>>>> (the one sending messages) is about 1/4 of the spec of each of the
>>>> cluster nodes, and it's running a load average of about 0.4. It seems
>>>> crazy that a single low-spec node should be able to completely overwhelm
>>>> a cluster 12x it's spec several times over.
>>>
>>> I don't know alot about GFS but since no one else has replied yet, my
>>> understanding is that it's not suitable for an applications like what
>>> you describe (many small files being opened frequently). I think GFS2,
>>> which is still a tech preview, has been redesigned to improve this
>>> situation.
>>
>> Indeed, I am aware that GFS2 is still broken, but I seem to be getting no
>> worse a performance out of GFS than I get out of NFS. The only penalty is
>> the high load, but the throughput is actually similar. The advantage that
>> makes GFS win is that I don't need an arbitrating server to handle the NFS
>> exports, which makes the clustering and redundancy a bit tidier.
>
> with your testing did you also try to adapt the size of the
> rsbtbl_size/lkbtbl_size? I would be quite interested if this increases your
> performance or not.

I cannot find these files in /proc (that's where they are implied to be in 
the docs). Can you please point me in the right direction?

> Do you have lot of small files?

Yes. The problem doesn't seem to be so bad when files are in different 
directories, but when lots of files are being written to the same 
directory, the load goes up quite badly.

Gordan


From breeves at redhat.com  Wed Oct 17 07:56:30 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Wed, 17 Oct 2007 08:56:30 +0100
Subject: [Linux-cluster] Virtual ips 'drifting' from one node to another
In-Reply-To: <C33A4F28.10C644%jgray@nicusa.com>
References: <C33A4F28.10C644%jgray@nicusa.com>
Message-ID: <4715C02E.7020603@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Josh Gray wrote:
> I've read reports of people having trouble with virtual ip's that are
> controlled by the cluster suite coming up when they shouldn't on nodes and
> even to the point where multiple hosts will have the ip on them (looking at
> ip addr list) and the cluster doesn't realize it.
> 
> Has anyone seen this?  Any fixes you know of or ways to fight it?
> 

What sort of virtual IPs/cluster?

There are some problems with LVS routers & piranha (load-balancing
clusters) in the current released RHEL4 packages where VIPs are not
correctly deactivated when a service or an LVS router is shut down.

Versions prior to piranha-0.7.10-2 are not affected.

See these bugzillas:

https://bugzilla.redhat.com/show_bug.cgi?id=228530
https://bugzilla.redhat.com/show_bug.cgi?id=123342

There's a patch in comment #1 of the first bug that fixes this behavior.

Kind regards,
Bryn.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHFcAu6YSQoMYUY94RAukpAKCtoMLOkdPCMD0M7dC86Ev5d6V9WgCcCOGT
+QHIgZpl6nfMn3pKqB9ISKU=
=h0lW
-----END PGP SIGNATURE-----


From grimme at atix.de  Wed Oct 17 08:16:30 2007
From: grimme at atix.de (Marc Grimme)
Date: Wed, 17 Oct 2007 10:16:30 +0200
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <Pine.LNX.4.64.0710170340320.31052@outpost.shatteredsilicon.net>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710170939.04626.grimme@atix.de>
	<Pine.LNX.4.64.0710170340320.31052@outpost.shatteredsilicon.net>
Message-ID: <200710171016.31354.grimme@atix.de>

On Wednesday 17 October 2007 09:52:20 Gordan Bobic wrote:
> On Wed, 17 Oct 2007, Marc Grimme wrote:
> >>>> I have a cluster (3 nodes at the moment, may grow up to 16) for
> >>>> handling a lot of small files (Maildir). When I test the system by
> >>>> sending around 3-5 messages/second I see the load on the cluster nodes
> >>>> go up to about 20-30, even though the CPUs on the cluster are about
> >>>> 90% idle at all times.
> >>>>
> >>>> I am guessing that this is due to the clustered machines waiting for
> >>>> DLM locks to be established, which causes a lot of processes to be
> >>>> fighting to run, but since they don't get to run very soon, they back
> >>>> up and cause the load averages to go up.
> >>>>
> >>>> Assuming the DLM runs over the interface specified by IP and MAC in
> >>>> cluster.conf, it is running over gigabit ethernet.
> >>>>
> >>>> Are there any configuration changes or tuning parameters I can apply
> >>>> to DLM to alleviate this condition? The machine I'm running the test
> >>>> from (the one sending messages) is about 1/4 of the spec of each of
> >>>> the cluster nodes, and it's running a load average of about 0.4. It
> >>>> seems crazy that a single low-spec node should be able to completely
> >>>> overwhelm a cluster 12x it's spec several times over.
> >>>
> >>> I don't know alot about GFS but since no one else has replied yet, my
> >>> understanding is that it's not suitable for an applications like what
> >>> you describe (many small files being opened frequently). I think GFS2,
> >>> which is still a tech preview, has been redesigned to improve this
> >>> situation.
> >>
> >> Indeed, I am aware that GFS2 is still broken, but I seem to be getting
> >> no worse a performance out of GFS than I get out of NFS. The only
> >> penalty is the high load, but the throughput is actually similar. The
> >> advantage that makes GFS win is that I don't need an arbitrating server
> >> to handle the NFS exports, which makes the clustering and redundancy a
> >> bit tidier.
> >
> > with your testing did you also try to adapt the size of the
> > rsbtbl_size/lkbtbl_size? I would be quite interested if this increases
> > your performance or not.
>
> I cannot find these files in /proc (that's where they are implied to be in
> the docs). Can you please point me in the right direction?
Sorry I new I forgot something ;-)
http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/influence-of-locktable-sizes-rsbtbl_size-lkbtbl_size
>
> > Do you have lot of small files?
>
> Yes. The problem doesn't seem to be so bad when files are in different
> directories, but when lots of files are being written to the same
> directory, the load goes up quite badly.
Then this should help. Also enable lock_purging if not already done.
>
> Gordan
Marc.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Marc Grimme
http://www.atix.de/               http://www.open-sharedroot.org/


From gordan at bobich.net  Wed Oct 17 08:18:31 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 04:18:31 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <200710170939.04626.grimme@atix.de>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<1192567821.3272.5.camel@lits19.library.usyd.edu.au>
	<Pine.LNX.4.64.0710170322240.31052@outpost.shatteredsilicon.net>
	<200710170939.04626.grimme@atix.de>
Message-ID: <Pine.LNX.4.64.0710170417180.31052@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Marc Grimme wrote:

> with your testing did you also try to adapt the size of the
> rsbtbl_size/lkbtbl_size?

If I'm reading the docs correctly, these should be under /proc/cluster. 
My nodes don't appear to have /proc/cluster. (!!)

Gordan


From gordan at bobich.net  Wed Oct 17 08:27:21 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 04:27:21 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <200710171016.31354.grimme@atix.de>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710170939.04626.grimme@atix.de>
	<Pine.LNX.4.64.0710170340320.31052@outpost.shatteredsilicon.net>
	<200710171016.31354.grimme@atix.de>
Message-ID: <Pine.LNX.4.64.0710170419280.31052@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Marc Grimme wrote:

>> I cannot find these files in /proc (that's where they are implied to be in
>> the docs). Can you please point me in the right direction?

> Sorry I new I forgot something ;-)
> http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/influence-of-locktable-sizes-rsbtbl_size-lkbtbl_size

I found that page when I went looking for docs on the subject, but as 
per my other email, I don't appear to have /proc/cluster. :-O

>>> Do you have lot of small files?
>>
>> Yes. The problem doesn't seem to be so bad when files are in different
>> directories, but when lots of files are being written to the same
>> directory, the load goes up quite badly.
>
> Then this should help. Also enable lock_purging if not already done.

That doesn't seem to work for me. :-(

# gfs_tool settune / glock_purge 50
gfs_tool: can't change tunable parameter glock_purge: Invalid argument

Gordan


From andreezer at gmail.com  Wed Oct 17 08:47:43 2007
From: andreezer at gmail.com (=?ISO-8859-1?Q?Andr=E9_Fernandes?=)
Date: Wed, 17 Oct 2007 09:47:43 +0100
Subject: [Linux-cluster] fence_vmware and ESX server
Message-ID: <a434d05d0710170147k16613ebdt3c1f4af24935fc75@mail.gmail.com>

Greetings.

I've been testing VM fencing with the fence_vmware perl script that is
available from
http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/fence/agents/vmware/fence_vmware.pl?cvsroot=clusterand
I got it to work with VMware server
1.0.4.

However, I intended to use it on an ESX 3.0.1 installation (which is part of
a Virtual Infrastructure) but for some reason I cannot authenticate. The
exact same user and password work fine if I log in directly to the ESX host
with VMware's Virtual Infrastructure Client. I believe that there are some
of you out there that use(d) this script with ESX.

I'm running it like this: fence_vmware.pl -a 192.168.254.119:902 -l
theusername -p thepassword -L

trying to list the VMs on the host but I always get a "-3" error (wrong
login or password).

Is the VMware Perl API compatible with ESX 3.0.1? Is the 902 port the
correct one? What can I do to debug this?

I appreciate any help. Thanks in advance.

Andr? Fernandes
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071017/282e1c7d/attachment.htm>

From hlawatschek at atix.de  Wed Oct 17 08:52:44 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Wed, 17 Oct 2007 10:52:44 +0200
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <Pine.LNX.4.64.0710170417180.31052@outpost.shatteredsilicon.net>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710170939.04626.grimme@atix.de>
	<Pine.LNX.4.64.0710170417180.31052@outpost.shatteredsilicon.net>
Message-ID: <200710171052.45137.hlawatschek@atix.de>

On Wednesday 17 October 2007 10:18:31 Gordan Bobic wrote:
> On Wed, 17 Oct 2007, Marc Grimme wrote:
> > with your testing did you also try to adapt the size of the
> > rsbtbl_size/lkbtbl_size?
>
> If I'm reading the docs correctly, these should be under /proc/cluster.
> My nodes don't appear to have /proc/cluster. (!!)
>
As you are using RHEL5, I thinks the settings have to be done 
in /sys/kernel/config/dlm/cluster/
In your case it might be /var/comoonics/chroot/sys/kernel/config/dlm/cluster/
You could also easily mount the configfs to /sys/kernel/config with a 
#mount -t configfs none /sys/kernel/config/
Please note, that the settings have to be done before mounting your data 
filesystem.
  
Mark

-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From grimme at atix.de  Wed Oct 17 09:41:02 2007
From: grimme at atix.de (Marc Grimme)
Date: Wed, 17 Oct 2007 11:41:02 +0200
Subject: [Linux-cluster] XEN and Cluster Questions
Message-ID: <200710171141.03724.grimme@atix.de>

Hello,
we are currently discussing XEN with clustering support. 
There came some questions we are not sure what the answer is. Perhaps you can 
help ;-) .

Background is: We are discussing a group of XEN Dom0 Hosts sharing all devices 
and files via GFS. They themselves again host a couple of virtually 
redhat-clustered DomU Hosts with or without gfs.

1. Live Migration of cluster DomU nodes:
When I live migrate a virtual DomU clusternode to another DOM0 XEN Host the 
migration works ;-) , but the virtual clusternode is thrown out of the 
cluster. Is this a "works as designed"? I think the problem are the 
heartbeats not coming in proper time.
Does that lead to the conclusion that one cannot live migrate cluster nodes?

2. Fencing:
How about fencing of the virtual Dom-U Clusternodes. You are never sure on 
which Dom-0 Node runs our Dom-U Clusternode. Is the fencing via fence_xvm[d] 
supported on such an environment? That means how does a virtual DomU 
clusternode X running on Dom0 Xen Host x know that if virtual DomU 
clusternode Y running on Dom0 Xen Host y is running there when it is getting 
the fence request to fence Host y where it is running?

Thanks Marc.
-- 
Gruss / Regards,

Marc Grimme
http://www.atix.de/               http://www.open-sharedroot.org/


From gordan at bobich.net  Wed Oct 17 11:07:47 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 07:07:47 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <200710171052.45137.hlawatschek@atix.de>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710170939.04626.grimme@atix.de>
	<Pine.LNX.4.64.0710170417180.31052@outpost.shatteredsilicon.net>
	<200710171052.45137.hlawatschek@atix.de>
Message-ID: <Pine.LNX.4.64.0710170705400.598@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Mark Hlawatschek wrote:

>>> with your testing did you also try to adapt the size of the
>>> rsbtbl_size/lkbtbl_size?
>>
>> If I'm reading the docs correctly, these should be under /proc/cluster.
>> My nodes don't appear to have /proc/cluster. (!!)
>>
> As you are using RHEL5, I thinks the settings have to be done
> in /sys/kernel/config/dlm/cluster/
> In your case it might be /var/comoonics/chroot/sys/kernel/config/dlm/cluster/

Yup - that looks better. But I still cannot see any files called 
rsbtbl_size or lkbtbl_size.

> You could also easily mount the configfs to /sys/kernel/config with a
> #mount -t configfs none /sys/kernel/config/
> Please note, that the settings have to be done before mounting your data
> filesystem.

How does one change these settings in the init root, before the GFS root 
gets mounted? Where should I make the change for my initrd?

Gordan


From grimme at atix.de  Wed Oct 17 11:40:25 2007
From: grimme at atix.de (Marc Grimme)
Date: Wed, 17 Oct 2007 13:40:25 +0200
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <Pine.LNX.4.64.0710170705400.598@outpost.shatteredsilicon.net>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710171052.45137.hlawatschek@atix.de>
	<Pine.LNX.4.64.0710170705400.598@outpost.shatteredsilicon.net>
Message-ID: <200710171340.25602.grimme@atix.de>

On Wednesday 17 October 2007 13:07:47 Gordan Bobic wrote:
> On Wed, 17 Oct 2007, Mark Hlawatschek wrote:
> >>> with your testing did you also try to adapt the size of the
> >>> rsbtbl_size/lkbtbl_size?
> >>
> >> If I'm reading the docs correctly, these should be under /proc/cluster.
> >> My nodes don't appear to have /proc/cluster. (!!)
> >
> > As you are using RHEL5, I thinks the settings have to be done
> > in /sys/kernel/config/dlm/cluster/
> > In your case it might be
> > /var/comoonics/chroot/sys/kernel/config/dlm/cluster/
>
> Yup - that looks better. But I still cannot see any files called
> rsbtbl_size or lkbtbl_size.
I see it
[root at axqa01_1 ~]# uname -a
Linux axqa01_1 2.6.18-52.el5xen #1 SMP Wed Sep 26 16:01:46 EDT 2007 i686 i686 
i386 GNU/Linux
[root at axqa01_1 ~]# 
cat /cdsl.local/var/comoonics/chroot/sys/kernel/config/dlm/cluster/lkbtbl_size
1024
[root at axqa01_1 ~]# 
cat /cdsl.local/var/comoonics/chroot/sys/kernel/config/dlm/cluster/rsbtbl_size
256
Hope that helps.
Marc.
>
> > You could also easily mount the configfs to /sys/kernel/config with a
> > #mount -t configfs none /sys/kernel/config/
> > Please note, that the settings have to be done before mounting your data
> > filesystem.
>
> How does one change these settings in the init root, before the GFS root
> gets mounted? Where should I make the change for my initrd?
>
> Gordan
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 452 3538-14
http://www.atix.de/               http://www.open-sharedroot.org/

**

Visit us at LinuxWorld Conference & Expo 
31.10. - 01.11.2007 in Jaarbeurs Utrecht - The Netherlands
ATIX stand: Hall 9 / B 005

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax:   +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand: 
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss


From gordan at bobich.net  Wed Oct 17 12:55:56 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 08:55:56 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle
	=?iso-8859-1?q?but=09high?= load
In-Reply-To: <200710171340.25602.grimme@atix.de>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710171052.45137.hlawatschek@atix.de>
	<Pine.LNX.4.64.0710170705400.598@outpost.shatteredsilicon.net>
	<200710171340.25602.grimme@atix.de>
Message-ID: <Pine.LNX.4.64.0710170853510.2801@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Marc Grimme wrote:

>>>>> with your testing did you also try to adapt the size of the
>>>>> rsbtbl_size/lkbtbl_size?
>>>>
>>>> If I'm reading the docs correctly, these should be under /proc/cluster.
>>>> My nodes don't appear to have /proc/cluster. (!!)
>>>
>>> As you are using RHEL5, I thinks the settings have to be done
>>> in /sys/kernel/config/dlm/cluster/
>>> In your case it might be
>>> /var/comoonics/chroot/sys/kernel/config/dlm/cluster/
>>
>> Yup - that looks better. But I still cannot see any files called
>> rsbtbl_size or lkbtbl_size.
> I see it
> [root at axqa01_1 ~]# uname -a
> Linux axqa01_1 2.6.18-52.el5xen #1 SMP Wed Sep 26 16:01:46 EDT 2007 i686 i686
> i386 GNU/Linux
> [root at axqa01_1 ~]#
> cat /cdsl.local/var/comoonics/chroot/sys/kernel/config/dlm/cluster/lkbtbl_size
> 1024
> [root at axqa01_1 ~]#
> cat /cdsl.local/var/comoonics/chroot/sys/kernel/config/dlm/cluster/rsbtbl_size
> 256
> Hope that helps.
> Marc.

I don't have /cdsl.local/var/comoonics. I do have /var/comoonics, though:
# ls /var/comoonics/chroot/sys/kernel/config/dlm/cluster/
comms  spaces

Only comms and spaces directories there. :-(

Gordan


From wcheng at redhat.com  Wed Oct 17 15:08:45 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Wed, 17 Oct 2007 11:08:45 -0400
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle but high
	load
In-Reply-To: <Pine.LNX.4.64.0710170419280.31052@outpost.shatteredsilicon.net>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>	<200710170939.04626.grimme@atix.de>	<Pine.LNX.4.64.0710170340320.31052@outpost.shatteredsilicon.net>	<200710171016.31354.grimme@atix.de>
	<Pine.LNX.4.64.0710170419280.31052@outpost.shatteredsilicon.net>
Message-ID: <4716257D.5020903@redhat.com>

Gordan Bobic wrote:

>>>
>>> Yes. The problem doesn't seem to be so bad when files are in different
>>> directories, but when lots of files are being written to the same
>>> directory, the load goes up quite badly.
>>
>>
>> Then this should help. Also enable lock_purging if not already done.
>
>
> That doesn't seem to work for me. :-(
>
> # gfs_tool settune / glock_purge 50
> gfs_tool: can't change tunable parameter glock_purge: Invalid argument
>

I didn't follow this thread of discussion very well. I have to assume 
you're on earlier versions of RHEL 5 ? Newer versions of gfs_kmod should 
have this tunable (can't recall the exact gfs_kmod version it got 
included though).

In general, GFS doesn't perform well when you're writing (and/or 
deleting) large amount of files within one single directory from 
different nodes. This is due to directory lock contention (and directory 
lock needs to get ping-pong between different nodes). We always 
encourage users to avoid this type of setup.

-- Wendy


From gordan at bobich.net  Wed Oct 17 15:02:06 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 17 Oct 2007 11:02:06 -0400 (EDT)
Subject: [Linux-cluster] GFS cluster / DLM locking - Mostly idle but high
	load
In-Reply-To: <4716257D.5020903@redhat.com>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<200710170939.04626.grimme@atix.de>
	<Pine.LNX.4.64.0710170340320.31052@outpost.shatteredsilicon.net>
	<200710171016.31354.grimme@atix.de>
	<Pine.LNX.4.64.0710170419280.31052@outpost.shatteredsilicon.net>
	<4716257D.5020903@redhat.com>
Message-ID: <Pine.LNX.4.64.0710171058530.2801@outpost.shatteredsilicon.net>

On Wed, 17 Oct 2007, Wendy Cheng wrote:

>> > >  Yes. The problem doesn't seem to be so bad when files are in different
>> > >  directories, but when lots of files are being written to the same
>> > >  directory, the load goes up quite badly.
>> > 
>> > 
>> >  Then this should help. Also enable lock_purging if not already done.
>> 
>>
>>  That doesn't seem to work for me. :-(
>>
>>  # gfs_tool settune / glock_purge 50
>>  gfs_tool: can't change tunable parameter glock_purge: Invalid argument
>> 
>
> I didn't follow this thread of discussion very well. I have to assume you're 
> on earlier versions of RHEL 5 ? Newer versions of gfs_kmod should have this 
> tunable (can't recall the exact gfs_kmod version it got included though).

kernel-2.6.18-8.1.14.el5
kmod-gfs-0.1.16-5.2.6.18_8.1.14.el5

Gordan


From qqlka at nask.pl  Thu Oct 18 09:38:14 2007
From: qqlka at nask.pl (=?iso-8859-2?Q?Agnieszka_Kuka=B3owicz?=)
Date: Thu, 18 Oct 2007 11:38:14 +0200
Subject: [Linux-cluster] Problem with GFS mounting
Message-ID: <000401c8116a$9aefee00$0776b5c2@gda07ak>

Hi,

I have problem mounting GFS filesystem.
The filesystem was created like this:
gfs_mkfs -t cluster_name:name -j 12 -p lock_dlm /dev/storage_gfs/name

After that I tried to mount the filesystem:

mount.gfs /dev/storage_gfs/name /mnt

But I get:

Oct 18 09:10:46 d1 kernel: Trying to join cluster "lock_dlm",
"new_cluster:zoom_data"
Oct 18 09:10:46 d1 kernel: Joined cluster. Now mounting FS...
Oct 18 09:10:46 d1 kernel: GFS: fsid=cluster_name:name.4294967295: can't
mount journal #4294967295
Oct 18 09:10:46 d1 kernel: GFS: fsid=cluster_name:name.4294967295: there
are only 12 journals (0 - 11)

I don't know why it happens. Do you have any suggestions?

Thanks,
Cheers

Agnieszka Kuka?owicz
http://www.polska.pl/


From tam_annie at alice.it  Thu Oct 18 11:33:41 2007
From: tam_annie at alice.it (tam_annie at alice.it)
Date: Thu, 18 Oct 2007 13:33:41 +0200
Subject: [Linux-cluster] RHEL5 clvmd hangs only after a node crashes...
Message-ID: <FBCMMO02fxUG0DpWhkR000282f5@FBCMMO02.fbc.local>

Hi everybody,

	I have successfully installed and (almost successfully) configured RHEL5 cluster suite on a two node cluster, which will soon become a three node cluster, hopefully - my boss' euros willing-: that's why I configured it with qdisk (on a raw partition) too.
	Two GFS (v. 1) filesystems are shared by both nodes.
	
	Well, everything really works like a breeze (HP iLO power fencing obviously included), even when I reboot _NORMALLY_ any node.
	
	Problems arise only after a node _CRASHES_ (You see, I love performing cold reboots in test environments), when that's exactly what happens:
	
	1) Node 2 crashes;
	2) Node 1 successfully fences node 2: I can go on working on GFS file systems after a freeze lasting less than one second;
	3) While booting up, node 2 startup sequence runs fine: cman services start successfully (I even get 'Starting fencing... [OK]' !!!), but when it comes to clvmd, music definitely changes: dlm connections are successfully established, but then the whole node hangs on 'Starting clvmd... '. Debugging clvmd init script, I've found that the problem is due to the vgscan command, which hangs indefinitely on something like 'Locking vg_flash_1... '. I can't really find any particular error about that in my logs.
	4) Activity on GFS on node 1 goes on normally.
	5) The one and only way I've found to recover my whole cluster from this situation is to restart both nodes toghether, but this is a nonsense in a cluster architecture...
	
	Hoping I simply made some trivial mistake,
	I would greatly appreciate any help or pointers from you, gurus, to solve my problem.
	I thank anyone in advance, who can make me -and my boss too- happy again with my cluster, ...indeed!
	
	Tyzan
	
	
______________________________________________________________________________________________________________
Linux xxxxxxxxxxxxxxxx 2.6.18-8.1.8.el5 #1 SMP Tue Jul 10 06:39:17 EDT 2007 x86_64 x86_64 x86_64 GNU/Linux

lvm2-cluster-2.02.16-3.el5
kmod-gfs-0.1.16-5.2.6.18_8.1.8.el5
gfs2-utils-0.1.25-1.el5
gfs-utils-0.1.11-3.el5
cman-2.0.64-1.0.1.el5
rgmanager-2.0.24-1.el5

# cat /etc/lvm/lvm.conf

# This is an example configuration file for the LVM2 system.
# It contains the default settings that would be used if there was no
# /etc/lvm/lvm.conf file.
#
# Refer to 'man lvm.conf' for further information including the file layout.
#
# To put this file in a different directory and override /etc/lvm set
# the environment variable LVM_SYSTEM_DIR before running the tools.


# This section allows you to configure which block devices should
# be used by the LVM system.
devices {

    # Where do you want your volume groups to appear ?
    dir = "/dev"

    # An array of directories that contain the device nodes you wish
    # to use with LVM2.
    scan = [ "/dev" ]

    # A filter that tells LVM2 to only use a restricted set of devices.
    # The filter consists of an array of regular expressions.  These
    # expressions can be delimited by a character of your choice, and
    # prefixed with either an 'a' (for accept) or 'r' (for reject).
    # The first expression found to match a device name determines if
    # the device will be accepted or rejected (ignored).  Devices that
    # don't match any patterns are accepted.

    # Be careful if there there are symbolic links or multiple filesystem 
    # entries for the same device as each name is checked separately against
    # the list of patterns.  The effect is that if any name matches any 'a'
    # pattern, the device is accepted; otherwise if any name matches any 'r'
    # pattern it is rejected; otherwise it is accepted.

    # Don't have more than one filter line active at once: only one gets used.

    # Run vgscan after you change this parameter to ensure that
    # the cache file gets regenerated (see below).
    # If it doesn't do what you expect, check the output of 'vgscan -vvvv'.


    # By default we accept every block device:
    filter = [ "a/.*/" ]

    # Exclude the cdrom drive
    # filter = [ "r|/dev/cdrom|" ]

    # When testing I like to work with just loopback devices:
    # filter = [ "a/loop/", "r/.*/" ]

    # Or maybe all loops and ide drives except hdc:
    # filter =[ "a|loop|", "r|/dev/hdc|", "a|/dev/ide|", "r|.*|" ]

    # Use anchors if you want to be really specific
    # filter = [ "a|^/dev/hda8$|", "r/.*/" ]

    # The results of the filtering are cached on disk to avoid
    # rescanning dud devices (which can take a very long time).  By
    # default this cache file is hidden in the /etc/lvm directory.
    # It is safe to delete this file: the tools regenerate it.
    cache = "/etc/lvm/.cache"

    # You can turn off writing this cache file by setting this to 0.
    write_cache_state = 1

    # Advanced settings.

    # List of pairs of additional acceptable block device types found 
    # in /proc/devices with maximum (non-zero) number of partitions.
    # types = [ "fd", 16 ]

    # If sysfs is mounted (2.6 kernels) restrict device scanning to 
    # the block devices it believes are valid.
    # 1 enables; 0 disables.
    sysfs_scan = 1	

    # By default, LVM2 will ignore devices used as components of
    # software RAID (md) devices by looking for md superblocks.
    # 1 enables; 0 disables.
    md_component_detection = 1
}

# This section that allows you to configure the nature of the
# information that LVM2 reports.
log {

    # Controls the messages sent to stdout or stderr.
    # There are three levels of verbosity, 3 being the most verbose.
    verbose = 0

    # Should we send log messages through syslog?
    # 1 is yes; 0 is no.
    syslog = 1

    # Should we log error and debug messages to a file?
    # By default there is no log file.
    #file = "/var/log/lvm2.log"

    # Should we overwrite the log file each time the program is run?
    # By default we append.
    overwrite = 0

    # What level of log messages should we send to the log file and/or syslog?
    # There are 6 syslog-like log levels currently in use - 2 to 7 inclusive.
    # 7 is the most verbose (LOG_DEBUG).
    level = 0
    
    # Format of output messages
    # Whether or not (1 or 0) to indent messages according to their severity
    indent = 1

    # Whether or not (1 or 0) to display the command name on each line output
    command_names = 0

    # A prefix to use before the message text (but after the command name,
    # if selected).  Default is two spaces, so you can see/grep the severity
    # of each message.
    prefix = "  "

    # To make the messages look similar to the original LVM tools use:
    #   indent = 0
    #   command_names = 1
    #   prefix = " -- "

    # Set this if you want log messages during activation.
    # Don't use this in low memory situations (can deadlock).
    # activation = 0
}

# Configuration of metadata backups and archiving.  In LVM2 when we
# talk about a 'backup' we mean making a copy of the metadata for the
# *current* system.  The 'archive' contains old metadata configurations.
# Backups are stored in a human readeable text format.
backup {

    # Should we maintain a backup of the current metadata configuration ?
    # Use 1 for Yes; 0 for No.
    # Think very hard before turning this off!
    backup = 1

    # Where shall we keep it ?
    # Remember to back up this directory regularly!
    backup_dir = "/etc/lvm/backup"

    # Should we maintain an archive of old metadata configurations.
    # Use 1 for Yes; 0 for No.
    # On by default.  Think very hard before turning this off.
    archive = 1

    # Where should archived files go ?
    # Remember to back up this directory regularly!
    archive_dir = "/etc/lvm/archive"
    
    # What is the minimum number of archive files you wish to keep ?
    retain_min = 10

    # What is the minimum time you wish to keep an archive file for ?
    retain_days = 30
}

# Settings for the running LVM2 in shell (readline) mode.
shell {

    # Number of lines of history to store in ~/.lvm_history
    history_size = 100
}


# Miscellaneous global LVM2 settings
global {
    library_dir = "/usr/lib64"
    
    # The file creation mask for any files and directories created.
    # Interpreted as octal if the first digit is zero.
    umask = 077

    # Allow other users to read the files
    #umask = 022

    # Enabling test mode means that no changes to the on disk metadata
    # will be made.  Equivalent to having the -t option on every
    # command.  Defaults to off.
    test = 0

    # Whether or not to communicate with the kernel device-mapper.
    # Set to 0 if you want to use the tools to manipulate LVM metadata 
    # without activating any logical volumes.
    # If the device-mapper kernel driver is not present in your kernel
    # setting this to 0 should suppress the error messages.
    activation = 1

    # If we can't communicate with device-mapper, should we try running 
    # the LVM1 tools?
    # This option only applies to 2.4 kernels and is provided to help you
    # switch between device-mapper kernels and LVM1 kernels.
    # The LVM1 tools need to be installed with .lvm1 suffices
    # e.g. vgscan.lvm1 and they will stop working after you start using
    # the new lvm2 on-disk metadata format.
    # The default value is set when the tools are built.
    # fallback_to_lvm1 = 0

    # The default metadata format that commands should use - "lvm1" or "lvm2".
    # The command line override is -M1 or -M2.
    # Defaults to "lvm1" if compiled in, else "lvm2".
    # format = "lvm1"

    # Location of proc filesystem
    proc = "/proc"

    # Type of locking to use. Defaults to local file-based locking (1).
    # Turn locking off by setting to 0 (dangerous: risks metadata corruption
    # if LVM2 commands get run concurrently).
    # Type 2 uses the external shared library locking_library.
    # Type 3 uses built-in clustered locking.
    locking_type = 3

    # If using external locking (type 2) and initialisation fails,
    # with this set to 1 an attempt will be made to use the built-in
    # clustered locking.
    # If you are using a customised locking_library you should set this to 0.
    fallback_to_clustered_locking = 1

    # If an attempt to initialise type 2 or type 3 locking failed, perhaps
    # because cluster components such as clvmd are not running, with this set
    # to 1 an attempt will be made to use local file-based locking (type 1).
    # If this succeeds, only commands against local volume groups will proceed.
    # Volume Groups marked as clustered will be ignored.
    fallback_to_local_locking = 1

    # Local non-LV directory that holds file-based locks while commands are
    # in progress.  A directory like /tmp that may get wiped on reboot is OK.
    locking_dir = "/var/lock/lvm"

    # Other entries can go here to allow you to load shared libraries
    # e.g. if support for LVM1 metadata was compiled as a shared library use
    #   format_libraries = "liblvm2format1.so" 
    # Full pathnames can be given.

    # Search this directory first for shared libraries.
    #   library_dir = "/lib"

    # The external locking library to load if locking_type is set to 2.
    #   locking_library = "liblvm2clusterlock.so"
}

activation {
    # Device used in place of missing stripes if activating incomplete volume.
    # For now, you need to set this up yourself first (e.g. with 'dmsetup')
    # For example, you could make it return I/O errors using the 'error' 
    # target or make it return zeros.
    missing_stripe_filler = "/dev/ioerror"

    # How much stack (in KB) to reserve for use while devices suspended
    reserved_stack = 256

    # How much memory (in KB) to reserve for use while devices suspended
    reserved_memory = 8192

    # Nice value used while devices suspended
    process_priority = -18

    # If volume_list is defined, each LV is only activated if there is a
    # match against the list.
    #   "vgname" and "vgname/lvname" are matched exactly.
    #   "@tag" matches any tag set in the LV or VG.
    #   "@*" matches if any tag defined on the host is also set in the LV or VG
    #
    # volume_list = [ "vg1", "vg2/lvol1", "@tag1", "@*" ]

    # Size (in KB) of each copy operation when mirroring
    mirror_region_size = 512

    # 'mirror_image_fault_policy' and 'mirror_log_fault_policy' define
    # how a device failure affecting a mirror is handled.
    # A mirror is composed of mirror images (copies) and a log.
    # A disk log ensures that a mirror does not need to be re-synced
    # (all copies made the same) every time a machine reboots or crashes.
    #
    # In the event of a failure, the specified policy will be used to
    # determine what happens:
    #
    # "remove" - Simply remove the faulty device and run without it.  If
    #            the log device fails, the mirror would convert to using
    #            an in-memory log.  This means the mirror will not
    #            remember its sync status across crashes/reboots and
    #            the entire mirror will be re-synced.  If a
    #            mirror image fails, the mirror will convert to a
    #            non-mirrored device if there is only one remaining good
    #            copy.
    #
    # "allocate" - Remove the faulty device and try to allocate space on
    #            a new device to be a replacement for the failed device.
    #            Using this policy for the log is fast and maintains the
    #            ability to remember sync state through crashes/reboots.
    #            Using this policy for a mirror device is slow, as it
    #            requires the mirror to resynchronize the devices, but it
    #            will preserve the mirror characteristic of the device.
    #            This policy acts like "remove" if no suitable device and
    #            space can be allocated for the replacement.
    #            Currently this is not implemented properly and behaves
    #            similarly to:
    #
    # "allocate_anywhere" - Operates like "allocate", but it does not
    #            require that the new space being allocated be on a
    #            device is not part of the mirror.  For a log device
    #            failure, this could mean that the log is allocated on
    #            the same device as a mirror device.  For a mirror
    #            device, this could mean that the mirror device is
    #            allocated on the same device as another mirror device.
    #            This policy would not be wise for mirror devices
    #            because it would break the redundant nature of the
    #            mirror.  This policy acts like "remove" if no suitable
    #            device and space can be allocated for the replacement.

    mirror_log_fault_policy = "allocate"
    mirror_device_fault_policy = "remove"
}


####################
# Advanced section #
####################

# Metadata settings
#
# metadata {
    # Default number of copies of metadata to hold on each PV.  0, 1 or 2.
    # You might want to override it from the command line with 0 
    # when running pvcreate on new PVs which are to be added to large VGs.

    # pvmetadatacopies = 1

    # Approximate default size of on-disk metadata areas in sectors.
    # You should increase this if you have large volume groups or
    # you want to retain a large on-disk history of your metadata changes.

    # pvmetadatasize = 255

    # List of directories holding live copies of text format metadata.
    # These directories must not be on logical volumes!
    # It's possible to use LVM2 with a couple of directories here,
    # preferably on different (non-LV) filesystems, and with no other 
    # on-disk metadata (pvmetadatacopies = 0). Or this can be in
    # addition to on-disk metadata areas.
    # The feature was originally added to simplify testing and is not
    # supported under low memory situations - the machine could lock up.
    #
    # Never edit any files in these directories by hand unless you
    # you are absolutely sure you know what you are doing! Use
    # the supplied toolset to make changes (e.g. vgcfgrestore).

    # dirs = [ "/etc/lvm/metadata", "/mnt/disk2/lvm/metadata2" ]
#}

# Event daemon
#
# dmeventd {
    # mirror_library is the library used when monitoring a mirror device.
    #
    # "libdevmapper-event-lvm2mirror.so" attempts to recover from failures.
    # It removes failed devices from a volume group and reconfigures a
    # mirror as necessary.
    #
    # mirror_library = "libdevmapper-event-lvm2mirror.so"
#}


#pvdisplay

  --- Physical volume ---
  PV Name               /dev/sdd1
  VG Name               vg_flash
  PV Size               200.00 GB / not usable 1.34 MB
  Allocatable           yes (but full)
  PE Size (KByte)       4096
  Total PE              51199
  Free PE               0
  Allocated PE          51199
  PV UUID               EMy26w-jUZE-JaGV-EOW5-8vGo-Y3Zx-Ifv3ov

  --- Physical volume ---
  PV Name               /dev/sdc1
  VG Name               vg_share
  PV Size               200.00 GB / not usable 1.34 MB
  Allocatable           yes (but full)
  PE Size (KByte)       4096
  Total PE              51199
  Free PE               0
  Allocated PE          51199
  PV UUID               J1kTxh-qWD5-v9B6-Am1v-P3mq-AkSq-lGzrlN

#vgdisplay

  --- Volume group ---
  VG Name               vg_flash
  System ID
  Format                lvm2
  Metadata Areas        1
  Metadata Sequence No  2
  VG Access             read/write
  VG Status             resizable
  Clustered             yes
  Shared                no
  MAX LV                0
  Cur LV                1
  Open LV               1
  Max PV                0
  Cur PV                1
  Act PV                1
  VG Size               200.00 GB
  PE Size               4.00 MB
  Total PE              51199
  Alloc PE / Size       51199 / 200.00 GB
  Free  PE / Size       0 / 0
  VG UUID               g79A0p-0g5V-sXyY-kXsY-rWzt-6G7D-BEktoM

  --- Volume group ---
  VG Name               vg_share
  System ID
  Format                lvm2
  Metadata Areas        1
  Metadata Sequence No  2
  VG Access             read/write
  VG Status             resizable
  Clustered             yes
  Shared                no
  MAX LV                0
  Cur LV                1
  Open LV               1
  Max PV                0
  Cur PV                1
  Act PV                1
  VG Size               200.00 GB
  PE Size               4.00 MB
  Total PE              51199
  Alloc PE / Size       51199 / 200.00 GB
  Free  PE / Size       0 / 0
  VG UUID               g9fIcp-seMM-30O0-sz1r-ttMr-NkeF-KVRF5z

# lvdisplay

  --- Logical volume ---
  LV Name                /dev/vg_flash/lv_flash_1
  VG Name                vg_flash
  LV UUID                rYCXFB-7Dxw-qWCi-G8fc-uF1t-coP1-cD1VLp
  LV Write Access        read/write
  LV Status              available
  # open                 1
  LV Size                200.00 GB
  Current LE             51199
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:0

  --- Logical volume ---
  LV Name                /dev/vg_share/lv_share_1
  VG Name                vg_share
  LV UUID                LvEZ3t-aYda-VU2i-rlYV-oXj6-nQWq-c5Fq8M
  LV Write Access        read/write
  LV Status              available
  # open                 1
  LV Size                200.00 GB
  Current LE             51199
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:1

# cat /etc/fstab

LABEL=/                 /                       ext3    defaults        1 1
LABEL=/boot             /boot                   ext3    defaults        1 2
devpts                  /dev/pts                devpts  gid=5,mode=620  0 0
tmpfs                   /dev/shm                tmpfs   defaults        0 0
proc                    /proc                   proc    defaults        0 0
sysfs                   /sys                    sysfs   defaults        0 0
LABEL=SW-cciss/c0d0p2   swap                    swap    defaults        0 0
/dev/vg_share/lv_share_1        /share          gfs     defaults        1 2
/dev/vg_flash/lv_flash_1        /u02            gfs     defaults        1 2

# df -h

Filesystem            Size  Used Avail Use% Mounted on
/dev/cciss/c0d0p3      29G   13G   16G  45% /
/dev/cciss/c0d0p1      99M   15M   80M  16% /boot
tmpfs                 2.0G     0  2.0G   0% /dev/shm
/dev/vg_share/lv_share_1
                      199G  100K  199G   1% /share
/dev/vg_flash/lv_flash_1
                      199G  8.1G  191G   5% /u02

# cat /etc/cluster/cluster.conf

<cluster alias="orarac" config_version="25" name="orarac">
        <fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="20"/>
        <clusternodes>
                <clusternode name="racnode1" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="iLO01"/>
                                </method>
                                <method name="2">
                                        <device name="Operator1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="racnode2" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="iLO09"/>
                                </method>
                                <method name="2">
                                        <device name="Operator2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="3"/>
        <fencedevices>
                <fencedevice agent="fence_ilo" hostname="192.168.14.11" login="Administrator" name="iLO01" passwd="********"/>
                <fencedevice agent="fence_ilo" hostname="192.168.14.19" login="Administrator" name="iLO09" passwd="********"/>
                <fencedevice agent="fence_manual" name="Operator1"/>
                <fencedevice agent="fence_manual" name="Operator2"/>
        </fencedevices>
        <quorumd device="/dev/sda1" interval="1" min_score="1" tko="10" votes="1">
                <heuristic interval="10" program="ping -t1 -c1 192.168.14.10" score="1"/>
        </quorumd>
</cluster>


# clustat

Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  racnode1                              1 Online, Local
  racnode2                              2 Online
  /dev/sda1                             0 Online, Quorum Disk

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/d005bf13/attachment.htm>

From nlam87346 at library.usyd.edu.au  Thu Oct 18 12:01:04 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Thu, 18 Oct 2007 22:01:04 +1000
Subject: [Linux-cluster] XEN and Cluster Questions
In-Reply-To: <200710171141.03724.grimme@atix.de>
References: <200710171141.03724.grimme@atix.de>
Message-ID: <1192708864.4536.15.camel@lits19.library.usyd.edu.au>

On Wed, 2007-10-17 at 11:41 +0200, Marc Grimme wrote:
> Hello,
> we are currently discussing XEN with clustering support. 
> There came some questions we are not sure what the answer is. Perhaps you can 
> help ;-) .
> 
> Background is: We are discussing a group of XEN Dom0 Hosts sharing all devices 
> and files via GFS. They themselves again host a couple of virtually 
> redhat-clustered DomU Hosts with or without gfs.
> 
> 1. Live Migration of cluster DomU nodes:
> When I live migrate a virtual DomU clusternode to another DOM0 XEN Host the 
> migration works ;-) , but the virtual clusternode is thrown out of the 
> cluster. Is this a "works as designed"? I think the problem are the 
> heartbeats not coming in proper time.
> Does that lead to the conclusion that one cannot live migrate cluster nodes?
> 
> 2. Fencing:
> How about fencing of the virtual Dom-U Clusternodes. You are never sure on 
> which Dom-0 Node runs our Dom-U Clusternode. Is the fencing via fence_xvm[d] 
> supported on such an environment? That means how does a virtual DomU 
> clusternode X running on Dom0 Xen Host x know that if virtual DomU 
> clusternode Y running on Dom0 Xen Host y is running there when it is getting 
> the fence request to fence Host y where it is running?
> 
> Thanks Marc.

Hi Marc,

I've not investigated it myself, but if I've understood some Red Hat
Professional Services people correctly, cluster suite has been designed
with this type of application in mind. It's one of the things that
Xen-based virtualisation has over VMWare at the moment.

I think it's the fence_xvm system that informs both the dom0 cluster
nodes and the domU cluster nodes of which dom0 nodes the various domU
nodes are running. (Sorry for that ugly sentence!)

Nik


From teigland at redhat.com  Thu Oct 18 14:04:26 2007
From: teigland at redhat.com (David Teigland)
Date: Thu, 18 Oct 2007 09:04:26 -0500
Subject: [Linux-cluster] Problem with GFS mounting
In-Reply-To: <000401c8116a$9aefee00$0776b5c2@gda07ak>
References: <000401c8116a$9aefee00$0776b5c2@gda07ak>
Message-ID: <20071018140425.GB1863@redhat.com>

On Thu, Oct 18, 2007 at 11:38:14AM +0200, Agnieszka Kuka?owicz wrote:
> Hi,
> 
> I have problem mounting GFS filesystem.
> The filesystem was created like this:
> gfs_mkfs -t cluster_name:name -j 12 -p lock_dlm /dev/storage_gfs/name
> 
> After that I tried to mount the filesystem:
> 
> mount.gfs /dev/storage_gfs/name /mnt
> 
> But I get:
> 
> Oct 18 09:10:46 d1 kernel: Trying to join cluster "lock_dlm",
> "new_cluster:zoom_data"
> Oct 18 09:10:46 d1 kernel: Joined cluster. Now mounting FS...
> Oct 18 09:10:46 d1 kernel: GFS: fsid=cluster_name:name.4294967295: can't
> mount journal #4294967295
> Oct 18 09:10:46 d1 kernel: GFS: fsid=cluster_name:name.4294967295: there
> are only 12 journals (0 - 11)
> 
> I don't know why it happens. Do you have any suggestions?

This happens because mount(8) can't find mount.gfs, which is generally at
/sbin/mount.gfs.

Dave


From teigland at redhat.com  Thu Oct 18 14:12:00 2007
From: teigland at redhat.com (David Teigland)
Date: Thu, 18 Oct 2007 09:12:00 -0500
Subject: [Linux-cluster] RHEL5 clvmd hangs only after a node crashes...
In-Reply-To: <FBCMMO02fxUG0DpWhkR000282f5@FBCMMO02.fbc.local>
References: <FBCMMO02fxUG0DpWhkR000282f5@FBCMMO02.fbc.local>
Message-ID: <20071018141200.GC1863@redhat.com>

On Thu, Oct 18, 2007 at 01:33:41PM +0200, tam_annie at alice.it wrote:
> Hi everybody,
> 
> 	I have successfully installed and (almost successfully) configured RHEL5 cluster suite on a two node cluster, which will soon become a three node cluster, hopefully - my boss' euros willing-: that's why I configured it with qdisk (on a raw partition) too.
> 	Two GFS (v. 1) filesystems are shared by both nodes.
> 	
> 	Well, everything really works like a breeze (HP iLO power fencing obviously included), even when I reboot _NORMALLY_ any node.
> 	
> 	Problems arise only after a node _CRASHES_ (You see, I love performing cold reboots in test environments), when that's exactly what happens:
> 	
> 	1) Node 2 crashes;
> 	2) Node 1 successfully fences node 2: I can go on working on GFS
> 	file systems after a freeze lasting less than one second;
> 	3) While booting up, node 2 startup sequence runs fine: cman
> 	services start successfully (I even get 'Starting fencing... [OK]'
> 	!!!), but when it comes to clvmd, music definitely changes: dlm
> 	connections are successfully established, but then the whole node
> 	hangs on 'Starting clvmd... '. Debugging clvmd init script, I've
> 	found that the problem is due to the vgscan command, which hangs
> 	indefinitely on something like 'Locking vg_flash_1... '. I can't
> 	really find any particular error about that in my logs.

Please remove qdisk from your configuration and see if anything changes.
If it still doesn't work, please send any dlm information from
/var/log/messages, cman_tool nodes output from all nodes, group_tool -v
output from all nodes.

Dave


From eschneid at uccs.edu  Thu Oct 18 15:12:34 2007
From: eschneid at uccs.edu (Eric Schneider)
Date: Thu, 18 Oct 2007 09:12:34 -0600
Subject: [Linux-cluster] crossover cables
Message-ID: <002f01c81199$5015e5e0$1b03c680@uccs.edu>

Does anyone have a good how-to on using a crossover cable with a cluster?  I
have looked a little bit and I can only find mention of the capability.  

I already have the nodes plugged into our normal network.

Eric


From andreezer at gmail.com  Thu Oct 18 15:19:33 2007
From: andreezer at gmail.com (=?ISO-8859-1?Q?Andr=E9_Fernandes?=)
Date: Thu, 18 Oct 2007 16:19:33 +0100
Subject: [Linux-cluster] crossover cables
In-Reply-To: <002f01c81199$5015e5e0$1b03c680@uccs.edu>
References: <002f01c81199$5015e5e0$1b03c680@uccs.edu>
Message-ID: <a434d05d0710180819u18014dffxce5c60ce2ffb4391@mail.gmail.com>

Well, it should be a simple setup.

Let's assume that your nodes have 2 ethernet interfaces and 1 management
interface (iLO for example). you will need 2 crossover cables for this.
You plug in eth0 on each node to your network.
Then, you plug eth1 on each machine to the other machine's management
interface via crossover cable.
Finally, you assign static IPs to the eth1 and the management interfaces
(something like eth1 = 10.0.0.1 and iLO = 10.0.0.2).
Then configure fencing as desired.

However, this setup will make it impossible to use the management interfaces
for other nifty purposes...

On 10/18/07, Eric Schneider <eschneid at uccs.edu> wrote:
>
> Does anyone have a good how-to on using a crossover cable with a
> cluster?  I
> have looked a little bit and I can only find mention of the capability.
>
> I already have the nodes plugged into our normal network.
>
> Eric
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/00fe917a/attachment.htm>

From Abdel.Sadek at lsi.com  Thu Oct 18 15:33:39 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Thu, 18 Oct 2007 09:33:39 -0600
Subject: [Linux-cluster] CLVM and SCSI-3 Reservations
Message-ID: <C776378855970A4DADE4A476447F6391F159ED@NAMAIL3.ad.lsil.com>

Is CLVM a requirement for SCSI-3 Persistent Reservations to work on a
RHEL 5.0 native cluster?

I have a 2-node cluster. If I build my file system directly on the
devices /dev/sdb /dev/sdd etc.., there are no Persistent Reservations
being established on my storage array. If use LVM2 to create logical
volumes and build my FS on top of them, I see the persistent
reservations being established.

 
Thanks.

Abdel...

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/4cc4704c/attachment.htm>

From jgray at nicusa.com  Thu Oct 18 15:27:16 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 11:27:16 -0400
Subject: [Linux-cluster] crossover cables
In-Reply-To: <a434d05d0710180819u18014dffxce5c60ce2ffb4391@mail.gmail.com>
Message-ID: <C33CF394.10C888%jgray@nicusa.com>

Does the cluster then automatically use the xover interface for cluster
communication?  Do you have to specify it somewhere?

Josh


On 10/18/07 11:19 AM, "Andr? Fernandes" <andreezer at gmail.com> wrote:

> Well, it should be a simple setup.
> 
> Let's assume that your nodes have 2 ethernet interfaces and 1 management
> interface (iLO for example). you will need 2 crossover cables for this.
> You plug in eth0 on each node to your network.
> Then, you plug eth1 on each machine to the other machine's management
> interface via crossover cable.
> Finally, you assign static IPs to the eth1 and the management interfaces
> (something like eth1 = 10.0.0.1 <http://10.0.0.1>  and iLO = 10.0.0.2
> <http://10.0.0.2> ).
> Then configure fencing as desired.
> 
> However, this setup will make it impossible to use the management interfaces
> for other nifty purposes...
> 
> On 10/18/07, Eric Schneider <eschneid at uccs.edu> wrote:
>> Does anyone have a good how-to on using a crossover cable with a cluster?  I
>> have looked a little bit and I can only find mention of the capability.
>> 
>> I already have the nodes plugged into our normal network.
>> 
>> Eric
>> 
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> <https://www.redhat.com/mailman/listinfo/linux-cluster>
>> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/099a0585/attachment.htm>

From tam_annie at alice.it  Thu Oct 18 18:05:02 2007
From: tam_annie at alice.it (tam_annie at alice.it)
Date: Thu, 18 Oct 2007 20:05:02 +0200
Subject: R: [Linux-cluster] RHEL5 clvmd hangs only after a node crashes...
References: <FBCMMO02fxUG0DpWhkR000282f5@FBCMMO02.fbc.local>
	<20071018141200.GC1863@redhat.com>
Message-ID: <A8ED7CBAB5254E4EA503653480601CF101291A29@FBCMST09V03.fbc.local>

Hi Dave,
 
 nice to meet you.
 Thank you very much, you are the greatest one! 
 The problem was totally solved by simply disabling the quorum disk as follows:
# chkconfig --level 2345 qdiskd off
# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="orarac" config_version="25" name="orarac">
        <fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="20"/>
        <clusternodes>
                <clusternode name="racnode1" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="iLO01"/>
                                </method>
                                <method name="2">
                                        <device name="Operator1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="racnode2" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="iLO09"/>
                                </method>
                                <method name="2">
                                        <device name="Operator2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_ilo" hostname="192.168.14.11" login="Administrator" name="iLO01" passwd="********"/>
                <fencedevice agent="fence_ilo" hostname="192.168.14.19" login="Administrator" name="iLO09" passwd="********"/>
                <fencedevice agent="fence_manual" name="Operator1"/>
                <fencedevice agent="fence_manual" name="Operator2"/>
        </fencedevices>
        <!--quorumd device="/dev/sda1" interval="1" min_score="1" tko="10" votes="1">
                <heuristic interval="10" program="ping -t1 -c1 192.168.14.10" score="1"/>
        </quorumd-->
</cluster>
So, without qdisk my cluster just behaves exactly as I ever wanted! So is qdisk evil?
However, please, be so kind to explain this strange behavor to me, that's really a _MYSTERY_: 
Isn't the qdisk a tie-breaker useful especially in two node clusters and if one needs to decide cluster membership also on the basis of exogenous heuristics (like network connectivity as in my case, see RedHat Cluster FAQ)?
Shouldn't the qdisk allow one to build a more robust cluster against split-brain conditions?
Why does my cluster behave good only if I avoid using qdisk?
Have I really no more chance to use a quorum disk in my cluster architecture?
If it can help, I'd like to tell you that when I start my cluster with qdisk enabled, both nodes wait for each other on "Starting fencing..." before going on in the boot sequence: no node can boot alone while the other one is down.
That doesn't happen when I don't use qdisk, as you told me.
Again, thank you very very much indeed!
Tyzan
 
P.S.: Below I attach the outputs and logs you requested which I recorded when qdisk was _ENABLED_ on my cluster:
A) Node 1 OK, Node 2 hanging on reboot after a crash
Node 1:
 
Node  Sts   Inc   Joined               Name
   0   M      0   2007-10-18 18:21:29  /dev/sda1
   1   M      8   2007-10-18 18:21:15  racnode1
   2   M     16   2007-10-18 18:29:50  racnode2
   
type             level name         id       state node id local_done
fence            0     default      00010002 none
[1 2]
dlm              1     clvmd        00020002 none
[1 2]
dlm              1     gfs_share_1  00040002 none
[1]
dlm              1     gfs_flash_1  00060002 none
[1]
dlm              1     rgmanager    00010001 none
[1]
gfs              2     gfs_share_1  00030002 none
[1]
gfs              2     gfs_flash_1  00050002 none
[1]
# cat /var/log/messages
Oct 18 18:27:08 orarac1 qdiskd[2861]: <notice> Node 2 evicted
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] entering GATHER state from 0.
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] Creating commit token because I am the rep.
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] Saving state aru 73 high seq received 73
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] entering COMMIT state.
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] entering RECOVERY state.
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] position [0] member 192.168.20.101:
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] previous ring seq 8 rep 192.168.20.101
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] aru 73 high delivered 73 received flag 0
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] Did not need to originate any messages in recovery.
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] Storing new sequence id for ringc
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] Sending initial ORF token
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] New Configuration:
Oct 18 18:27:12 orarac1 kernel: dlm: closing connection to node 2
Oct 18 18:27:12 orarac1 fenced[2808]: racnode2 not a cluster member after 0 sec post_fail_delay
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.101)
Oct 18 18:27:12 orarac1 fenced[2808]: fencing node "racnode2"
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] Members Left:
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.109)
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] Members Joined:
Oct 18 18:27:12 orarac1 openais[2792]: [SYNC ] This node is within the primary component and will provide service.
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] New Configuration:
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.101)
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] Members Left:
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] Members Joined:
Oct 18 18:27:12 orarac1 openais[2792]: [SYNC ] This node is within the primary component and will provide service.
Oct 18 18:27:12 orarac1 openais[2792]: [TOTEM] entering OPERATIONAL state.
Oct 18 18:27:12 orarac1 openais[2792]: [CLM  ] got nodejoin message 192.168.20.101
Oct 18 18:27:12 orarac1 openais[2792]: [CPG  ] got joinlist message from node 1
Oct 18 18:27:34 orarac1 kernel: bnx2: eth1 NIC Link is Down
Oct 18 18:27:37 orarac1 kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON
Oct 18 18:27:51 orarac1 fenced[2808]: fence "racnode2" success
Oct 18 18:27:56 orarac1 ccsd[2785]: Attempt to close an unopened CCS descriptor (3150).
Oct 18 18:27:56 orarac1 ccsd[2785]: Error while processing disconnect: Invalid request descriptor
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Trying to acquire journal lock...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Trying to acquire journal lock...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Looking at journal...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Looking at journal...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Acquiring the transaction lock...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Replaying journal...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Replayed 0 of 2 blocks
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: replays = 0, skips = 0, sames = 2
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Journal replayed in 1s
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_flash_1.1: jid=0: Done
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Acquiring the transaction lock...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Replaying journal...
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Replayed 0 of 0 blocks
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: replays = 0, skips = 0, sames = 0
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Journal replayed in 1s
Oct 18 18:27:56 orarac1 kernel: GFS: fsid=orarac:gfs_share_1.1: jid=0: Done
Oct 18 18:29:40 orarac1 kernel: bnx2: eth1 NIC Link is Down
Oct 18 18:29:42 orarac1 kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] entering GATHER state from 11.
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] Creating commit token because I am the rep.
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] Saving state aru 1a high seq received 1a
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] entering COMMIT state.
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] entering RECOVERY state.
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] position [0] member 192.168.20.101:
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] previous ring seq 12 rep 192.168.20.101
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] aru 1a high delivered 1a received flag 0
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] position [1] member 192.168.20.109:
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] previous ring seq 4 rep 192.168.20.109
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] aru 9 high delivered 9 received flag 0
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] Did not need to originate any messages in recovery.
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] Storing new sequence id for ring 10
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] Sending initial ORF token
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] New Configuration:
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.101)
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] Members Left:
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] Members Joined:
Oct 18 18:29:50 orarac1 openais[2792]: [SYNC ] This node is within the primary component and will provide service.
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] New Configuration:
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.101)
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.109)
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] Members Left:
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] Members Joined:
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ]  r(0) ip(192.168.20.109)
Oct 18 18:29:50 orarac1 openais[2792]: [SYNC ] This node is within the primary component and will provide service.
Oct 18 18:29:50 orarac1 openais[2792]: [TOTEM] entering OPERATIONAL state.
Oct 18 18:29:50 orarac1 openais[2792]: [CLM  ] got nodejoin message 192.168.20.101
Oct 18 18:29:51 orarac1 openais[2792]: [CLM  ] got nodejoin message 192.168.20.109
Oct 18 18:29:51 orarac1 openais[2792]: [CPG  ] got joinlist message from node 1
Oct 18 18:29:56 orarac1 kernel: dlm: connecting to 2
Oct 18 18:29:56 orarac1 kernel: dlm: got connection from 0

Node 2:
#cat /var/log/messages
 18:27:07 orarac2 shutdown[10779]: shutting down for system halt
Oct 18 18:29:58 orarac2 syslogd 1.4.1: restart.
Oct 18 18:29:58 orarac2 kernel: klogd 1.4.1, log source = /proc/kmsg started.
Oct 18 18:29:58 orarac2 kernel: Linux version 2.6.18-8.1.8.el5 (mockbuild at builder6.centos.org) (gcc version 4.1.1 20070105 (Red Hat 4.1.1-52)) #1 SMP Tue Jul 10 06:39:17 EDT 2007
Oct 18 18:29:58 orarac2 kernel: Command line: ro root=LABEL=/ rhgb quiet
Oct 18 18:29:58 orarac2 kernel: BIOS-provided physical RAM map:
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 0000000000000000 - 000000000009f400 (usable)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 000000000009f400 - 00000000000a0000 (reserved)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 0000000000100000 - 00000000cfe50000 (usable)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 00000000cfe50000 - 00000000cfe58000 (ACPI data)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 00000000cfe58000 - 00000000d0000000 (reserved)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 00000000fec00000 - 00000000fed00000 (reserved)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 00000000fee00000 - 00000000fee10000 (reserved)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 00000000ffc00000 - 0000000100000000 (reserved)
Oct 18 18:29:58 orarac2 kernel:  BIOS-e820: 0000000100000000 - 000000012ffff000 (usable)
Oct 18 18:29:58 orarac2 kernel: DMI 2.3 present.
Oct 18 18:29:59 orarac2 kernel: SRAT: PXM 0 -> APIC 0 -> Node 0
Oct 18 18:29:59 orarac2 kernel: SRAT: PXM 0 -> APIC 1 -> Node 0
Oct 18 18:29:59 orarac2 kernel: SRAT: PXM 1 -> APIC 2 -> Node 1
Oct 18 18:29:59 orarac2 kernel: SRAT: PXM 1 -> APIC 3 -> Node 1
Oct 18 18:29:59 orarac2 kernel: SRAT: Node 0 PXM 0 0-a0000
Oct 18 18:29:59 orarac2 kernel: SRAT: Node 0 PXM 0 0-80000000
Oct 18 18:29:59 orarac2 kernel: SRAT: Node 1 PXM 1 80000000-d0000000
Oct 18 18:29:59 orarac2 kernel: SRAT: Node 1 PXM 1 80000000-130000000
Oct 18 18:29:59 orarac2 kernel: Bootmem setup node 0 0000000000000000-0000000080000000
Oct 18 18:29:59 orarac2 kernel: Bootmem setup node 1 0000000080000000-000000012ffff000
Oct 18 18:29:59 orarac2 rpc.statd[2648]: Version 1.0.9 Starting
Oct 18 18:29:59 orarac2 kernel: ACPI: PM-Timer IO Port: 0x920
Oct 18 18:29:59 orarac2 rpc.statd[2648]: statd running as root. chown /var/lib/nfs/statd/sm to choose different user 
Oct 18 18:29:59 orarac2 kernel: ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
Oct 18 18:29:59 orarac2 kernel: Processor #0 15:1 APIC version 16
Oct 18 18:29:59 orarac2 kernel: ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled)
Oct 18 18:29:59 orarac2 kernel: Processor #2 15:1 APIC version 16
Oct 18 18:29:59 orarac2 kernel: ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled)
Oct 18 18:29:59 orarac2 kernel: Processor #1 15:1 APIC version 16
Oct 18 18:29:59 orarac2 kernel: ACPI: LAPIC (acpi_id[0x03] lapic_id[0x03] enabled)
Oct 18 18:29:59 orarac2 kernel: Processor #3 15:1 APIC version 16
Oct 18 18:29:59 orarac2 kernel: ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl lint[0x1])
Oct 18 18:29:59 orarac2 kernel: ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
Oct 18 18:29:59 orarac2 kernel: IOAPIC[0]: apic_id 8, version 17, address 0xfec00000, GSI 0-15
Oct 18 18:29:59 orarac2 kernel: ACPI: IOAPIC (id[0x09] address[0xfec01000] gsi_base[16])
Oct 18 18:29:59 orarac2 kernel: IOAPIC[1]: apic_id 9, version 17, address 0xfec01000, GSI 16-31
Oct 18 18:29:59 orarac2 kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 high edge)
Oct 18 18:29:59 orarac2 kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level)
Oct 18 18:29:59 orarac2 kernel: Setting APIC routing to physical flat
Oct 18 18:29:59 orarac2 kernel: ACPI: HPET id: 0x1166a201 base: 0xfed00000
Oct 18 18:30:00 orarac2 kernel: Using ACPI (MADT) for SMP configuration information
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 000000000009f000 - 00000000000a0000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000000a0000 - 00000000000f0000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000000f0000 - 0000000000100000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000cfe50000 - 00000000cfe58000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000cfe58000 - 00000000d0000000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000d0000000 - 00000000fec00000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000fec00000 - 00000000fed00000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000fed00000 - 00000000fee00000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000fee00000 - 00000000fee10000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000fee10000 - 00000000ffc00000
Oct 18 18:30:00 orarac2 kernel: Nosave address range: 00000000ffc00000 - 0000000100000000
Oct 18 18:30:00 orarac2 kernel: Allocating PCI resources starting at d4000000 (gap: d0000000:2ec00000)
Oct 18 18:30:00 orarac2 kernel: SMP: Allowing 4 CPUs, 0 hotplug CPUs
Oct 18 18:30:00 orarac2 kernel: Built 2 zonelists.  Total pages: 1029812
Oct 18 18:30:00 orarac2 kernel: Kernel command line: ro root=LABEL=/ rhgb quiet
Oct 18 18:30:00 orarac2 kernel: Initializing CPU#0
Oct 18 18:30:00 orarac2 kernel: PID hash table entries: 4096 (order: 12, 32768 bytes)
Oct 18 18:30:00 orarac2 kernel: Console: colour VGA+ 80x25
Oct 18 18:30:00 orarac2 kernel: Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes)
Oct 18 18:30:00 orarac2 kernel: Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes)
Oct 18 18:30:00 orarac2 kernel: Checking aperture...
Oct 18 18:30:00 orarac2 kernel: CPU 0: aperture @ ee38000000 size 32 MB
Oct 18 18:30:00 orarac2 kernel: Aperture too small (32 MB)
Oct 18 18:30:00 orarac2 kernel: No AGP bridge found
Oct 18 18:30:00 orarac2 kernel: Your BIOS doesn't leave a aperture memory hole
Oct 18 18:30:00 orarac2 kernel: Please enable the IOMMU option in the BIOS setup
Oct 18 18:30:00 orarac2 kernel: This costs you 64 MB of RAM
Oct 18 18:30:00 orarac2 kernel: Mapping aperture over 65536 KB of RAM @ 4000000
Oct 18 18:30:00 orarac2 kernel: Memory: 4045060k/4980732k available (2398k kernel code, 147124k reserved, 1222k data, 196k init)
Oct 18 18:30:01 orarac2 kernel: Calibrating delay using timer specific routine.. 5204.06 BogoMIPS (lpj=2602030)
Oct 18 18:30:01 orarac2 kernel: Security Framework v1.0.0 initialized
Oct 18 18:30:01 orarac2 kernel: SELinux:  Initializing.
Oct 18 18:30:01 orarac2 kernel: SELinux:  Starting in permissive mode
Oct 18 18:30:01 orarac2 kernel: selinux_register_security:  Registering secondary module capability
Oct 18 18:30:01 orarac2 kernel: Capability LSM initialized as secondary
Oct 18 18:30:01 orarac2 kernel: Mount-cache hash table entries: 256
Oct 18 18:30:01 orarac2 kernel: CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
Oct 18 18:30:01 orarac2 kernel: CPU: L2 Cache: 1024K (64 bytes/line)
Oct 18 18:30:01 orarac2 kernel: CPU 0/0 -> Node 0
Oct 18 18:30:01 orarac2 kernel: CPU: Physical Processor ID: 0
Oct 18 18:30:01 orarac2 ccsd[2833]: Starting ccsd 2.0.64: 
Oct 18 18:30:01 orarac2 kernel: CPU: Processor Core ID: 0
Oct 18 18:30:01 orarac2 ccsd[2833]:  Built: Jun 28 2007 07:24:49 
Oct 18 18:30:01 orarac2 kernel: SMP alternatives: switching to UP code
Oct 18 18:30:01 orarac2 ccsd[2833]:  Copyright (C) Red Hat, Inc.  2004  All rights reserved. 
Oct 18 18:30:01 orarac2 kernel: ACPI: Core revision 20060707
Oct 18 18:30:01 orarac2 kernel: Using local APIC timer interrupts.
Oct 18 18:30:01 orarac2 ccsd[2833]: cluster.conf (cluster name = orarac, version = 25) found. 
Oct 18 18:30:01 orarac2 kernel: result 12501958
Oct 18 18:30:01 orarac2 ccsd[2833]: Remote copy of cluster.conf is from quorate node. 
Oct 18 18:30:01 orarac2 kernel: Detected 12.501 MHz APIC timer.
Oct 18 18:30:01 orarac2 ccsd[2833]:  Local version # : 25 
Oct 18 18:30:02 orarac2 kernel: SMP alternatives: switching to SMP code
Oct 18 18:30:02 orarac2 ccsd[2833]:  Remote version #: 25 
Oct 18 18:30:02 orarac2 kernel: Booting processor 1/4 APIC 0x2
Oct 18 18:30:02 orarac2 kernel: Initializing CPU#1
Oct 18 18:30:02 orarac2 kernel: Calibrating delay using timer specific routine.. 5200.24 BogoMIPS (lpj=2600120)
Oct 18 18:30:02 orarac2 kernel: CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
Oct 18 18:30:02 orarac2 kernel: CPU: L2 Cache: 1024K (64 bytes/line)
Oct 18 18:30:02 orarac2 kernel: CPU 1/2 -> Node 1
Oct 18 18:30:02 orarac2 kernel: CPU: Physical Processor ID: 1
Oct 18 18:30:02 orarac2 kernel: CPU: Processor Core ID: 0
Oct 18 18:30:02 orarac2 kernel: Dual-Core AMD Opteron(tm) Processor 2218 stepping 03
Oct 18 18:30:02 orarac2 kernel: CPU 1: Syncing TSC to CPU 0.
Oct 18 18:30:02 orarac2 ccsd[2833]: Remote copy of cluster.conf is from quorate node. 
Oct 18 18:30:02 orarac2 kernel: CPU 1: synchronized TSC with CPU 0 (last diff -8 cycles, maxerr 1265 cycles)
Oct 18 18:30:02 orarac2 ccsd[2833]:  Local version # : 25 
Oct 18 18:30:02 orarac2 kernel: SMP alternatives: switching to SMP code
Oct 18 18:30:02 orarac2 ccsd[2833]:  Remote version #: 25 
Oct 18 18:30:02 orarac2 kernel: Booting processor 2/4 APIC 0x1
Oct 18 18:30:03 orarac2 ccsd[2833]: Remote copy of cluster.conf is from quorate node. 
Oct 18 18:30:03 orarac2 kernel: Initializing CPU#2
Oct 18 18:30:03 orarac2 ccsd[2833]:  Local version # : 25 
Oct 18 18:30:03 orarac2 kernel: Calibrating delay using timer specific routine.. 5212.91 BogoMIPS (lpj=2606459)
Oct 18 18:30:03 orarac2 ccsd[2833]:  Remote version #: 25 
Oct 18 18:30:03 orarac2 kernel: CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
Oct 18 18:30:03 orarac2 ccsd[2833]: Remote copy of cluster.conf is from quorate node. 
Oct 18 18:30:03 orarac2 kernel: CPU: L2 Cache: 1024K (64 bytes/line)
Oct 18 18:30:03 orarac2 ccsd[2833]:  Local version # : 25 
Oct 18 18:30:03 orarac2 kernel: CPU 2/1 -> Node 0
Oct 18 18:30:03 orarac2 ccsd[2833]:  Remote version #: 25 
Oct 18 18:30:03 orarac2 kernel: CPU: Physical Processor ID: 0
Oct 18 18:30:03 orarac2 kernel: CPU: Processor Core ID: 1
Oct 18 18:30:03 orarac2 kernel: Dual-Core AMD Opteron(tm) Processor 2218 stepping 03
Oct 18 18:30:03 orarac2 kernel: CPU 2: Syncing TSC to CPU 0.
Oct 18 18:30:03 orarac2 kernel: CPU 2: synchronized TSC with CPU 0 (last diff -1 cycles, maxerr 670 cycles)
Oct 18 18:30:03 orarac2 kernel: SMP alternatives: switching to SMP code
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] AIS Executive Service RELEASE 'subrev 1324 version 0.80.2' 
Oct 18 18:30:03 orarac2 kernel: Booting processor 3/4 APIC 0x3
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors. 
Oct 18 18:30:03 orarac2 kernel: Initializing CPU#3
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] Copyright (C) 2006 Red Hat, Inc. 
Oct 18 18:30:03 orarac2 kernel: Calibrating delay using timer specific routine.. 5200.22 BogoMIPS (lpj=2600111)
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] AIS Executive Service: started and ready to provide service. 
Oct 18 18:30:03 orarac2 kernel: CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] Using override node name racnode2 
Oct 18 18:30:03 orarac2 kernel: CPU: L2 Cache: 1024K (64 bytes/line)
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] Using default multicast address of 239.192.26.16 
Oct 18 18:30:03 orarac2 kernel: CPU 3/3 -> Node 1
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] openais component openais_cpg loaded. 
Oct 18 18:30:03 orarac2 kernel: CPU: Physical Processor ID: 1
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais cluster closed process group service v1.01' 
Oct 18 18:30:03 orarac2 kernel: CPU: Processor Core ID: 1
Oct 18 18:30:03 orarac2 openais[2840]: [MAIN ] openais component openais_cfg loaded. 
Oct 18 18:30:04 orarac2 kernel: Dual-Core AMD Opteron(tm) Processor 2218 stepping 03
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais configuration service' 
Oct 18 18:30:04 orarac2 kernel: CPU 3: Syncing TSC to CPU 0.
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_msg loaded. 
Oct 18 18:30:04 orarac2 kernel: CPU 3: synchronized TSC with CPU 0 (last diff -95 cycles, maxerr 1065 cycles)
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais message service B.01.01' 
Oct 18 18:30:04 orarac2 kernel: Brought up 4 CPUs
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_lck loaded. 
Oct 18 18:30:04 orarac2 kernel: testing NMI watchdog ... OK.
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais distributed locking service B.01.01' 
Oct 18 18:30:04 orarac2 kernel: time.c: Using 14.318180 MHz WALL HPET GTOD HPET timer.
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_evt loaded. 
Oct 18 18:30:04 orarac2 kernel: time.c: Detected 2600.404 MHz processor.
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais event service B.01.01' 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_ckpt loaded. 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais checkpoint service B.01.01' 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_amf loaded. 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais availability management framework B.01.01' 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_clm loaded. 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais cluster membership service B.01.01' 
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_evs loaded. 
Oct 18 18:30:04 orarac2 kernel: migration_cost=435,471
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais extended virtual synchrony service' 
Oct 18 18:30:04 orarac2 kernel: checking if image is initramfs... it is
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] openais component openais_cman loaded. 
Oct 18 18:30:04 orarac2 kernel: Freeing initrd memory: 1821k freed
Oct 18 18:30:04 orarac2 openais[2840]: [MAIN ] Registering service handler 'openais CMAN membership service 2.01' 
Oct 18 18:30:04 orarac2 kernel: NET: Registered protocol family 16
Oct 18 18:30:04 orarac2 kernel: ACPI: bus type pci registered
Oct 18 18:30:04 orarac2 openais[2840]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms) 
Oct 18 18:30:04 orarac2 kernel: PCI: BIOS Bug: MCFG area at d0000000 is not E820-reserved
Oct 18 18:30:04 orarac2 openais[2840]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans) 
Oct 18 18:30:04 orarac2 kernel: PCI: Not using MMCONFIG.
Oct 18 18:30:04 orarac2 openais[2840]: [TOTEM] join (60 ms) send_join (0 ms) consensus (4800 ms) merge (200 ms) 
Oct 18 18:30:04 orarac2 kernel: PCI: Using configuration type 1
Oct 18 18:30:04 orarac2 openais[2840]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs) 
Oct 18 18:30:05 orarac2 kernel: ACPI (exconfig-0455): Dynamic SSDT Load - OemId [HP    ] OemTableId [PNOWSSDT] [20060707]
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1500 
Oct 18 18:30:05 orarac2 kernel: ACPI: Interpreter enabled
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages) 
Oct 18 18:30:05 orarac2 kernel: ACPI: Using IOAPIC for interrupt routing
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] send threads (0 threads) 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Root Bridge [PCI0] (0000:00)
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] RRP token expired timeout (495 ms) 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] RRP token problem counter (2000 ms) 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] RRP threshold (10 problem count) 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] RRP mode set to none. 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] heartbeat_failures_allowed (0) 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Root Bridge [PCI1] (0000:04)
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] max_network_delay (50 ms) 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes). 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes). 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] The network interface [192.168.20.109] is now up. 
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] Created or loaded sequence id 0.192.168.20.109 for this ring. 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IUSB] (IRQs *5)
Oct 18 18:30:05 orarac2 openais[2840]: [TOTEM] entering GATHER state from 15. 
Oct 18 18:30:05 orarac2 ccsd[2833]: Initial status:: Quorate 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN00] (IRQs 7 10 *11)
Oct 18 18:30:05 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais extended virtual synchrony service' 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN01] (IRQs 7 10 *11)
Oct 18 18:30:05 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais cluster membership service B.01.01' 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN02] (IRQs 7 10 *11)
Oct 18 18:30:05 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais availability management framework B.01.01' 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN03] (IRQs 7 10 *11)
Oct 18 18:30:05 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais checkpoint service B.01.01' 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN04] (IRQs *7 10 11)
Oct 18 18:30:05 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais event service B.01.01' 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN05] (IRQs 7 *10 11)
Oct 18 18:30:05 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais distributed locking service B.01.01' 
Oct 18 18:30:05 orarac2 kernel: ACPI: PCI Interrupt Link [IN06] (IRQs 7 *10 11)
Oct 18 18:30:06 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais message service B.01.01' 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN07] (IRQs *7 10 11)
Oct 18 18:30:06 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais configuration service' 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN08] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais cluster closed process group service v1.01' 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN09] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [SERV ] Initialising service handler 'openais CMAN membership service 2.01' 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN10] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [CMAN ] CMAN 2.0.64 (built Jun 28 2007 07:24:54) started 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN11] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [SYNC ] Not using a virtual synchrony filter. 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN12] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] Creating commit token because I am the rep. 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN13] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] Saving state aru 0 high seq received 0 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN14] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] entering COMMIT state. 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN15] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] entering RECOVERY state. 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN16] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] position [0] member 192.168.20.109: 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN17] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] previous ring seq 0 rep 192.168.20.109 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN18] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] aru 0 high delivered 0 received flag 0 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN19] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] Did not need to originate any messages in recovery. 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN20] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] Storing new sequence id for ring 4 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN21] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [TOTEM] Sending initial ORF token 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN22] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:06 orarac2 openais[2840]: [CLM  ] CLM CONFIGURATION CHANGE 
Oct 18 18:30:06 orarac2 kernel: ACPI: PCI Interrupt Link [IN23] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] New Configuration: 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN24] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] Members Left: 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN25] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] Members Joined: 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN26] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [SYNC ] This node is within the primary component and will provide service. 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN27] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] CLM CONFIGURATION CHANGE 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN28] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] New Configuration: 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN29] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ]  r(0) ip(192.168.20.109)  
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN30] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] Members Left: 
Oct 18 18:30:07 orarac2 kernel: ACPI: PCI Interrupt Link [IN31] (IRQs 7 10 11) *0, disabled.
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] Members Joined: 
Oct 18 18:30:07 orarac2 kernel: Linux Plug and Play Support v0.97 (c) Adam Belay
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ]  r(0) ip(192.168.20.109)  
Oct 18 18:30:07 orarac2 kernel: pnp: PnP ACPI init
Oct 18 18:30:07 orarac2 openais[2840]: [SYNC ] This node is within the primary component and will provide service. 
Oct 18 18:30:07 orarac2 kernel: pnp: PnP ACPI: found 11 devices
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] entering OPERATIONAL state. 
Oct 18 18:30:07 orarac2 kernel: usbcore: registered new driver usbfs
Oct 18 18:30:07 orarac2 openais[2840]: [CLM  ] got nodejoin message 192.168.20.109 
Oct 18 18:30:07 orarac2 kernel: usbcore: registered new driver hub
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] entering GATHER state from 11. 
Oct 18 18:30:07 orarac2 kernel: PCI: Using ACPI for IRQ routing
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] Saving state aru 9 high seq received 9 
Oct 18 18:30:07 orarac2 kernel: PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] entering COMMIT state. 
Oct 18 18:30:07 orarac2 kernel: NetLabel: Initializing
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] entering RECOVERY state. 
Oct 18 18:30:07 orarac2 kernel: NetLabel:  domain hash size = 128
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] position [0] member 192.168.20.101: 
Oct 18 18:30:07 orarac2 kernel: NetLabel:  protocols = UNLABELED CIPSOv4
Oct 18 18:30:07 orarac2 openais[2840]: [TOTEM] previous ring seq 12 rep 192.168.20.101 
Oct 18 18:30:07 orarac2 kernel: NetLabel:  unlabeled traffic allowed by default
Oct 18 18:30:08 orarac2 openais[2840]: [TOTEM] aru 1a high delivered 1a received flag 0 
Oct 18 18:30:08 orarac2 kernel: hpet0: at MMIO 0xfed00000 (virtual 0xffffffffff5fe000), IRQs 2, 8, 0
Oct 18 18:30:08 orarac2 openais[2840]: [TOTEM] position [1] member 192.168.20.109: 
Oct 18 18:30:08 orarac2 kernel: hpet0: 3 64-bit timers, 14318180 Hz
Oct 18 18:30:08 orarac2 openais[2840]: [TOTEM] previous ring seq 4 rep 192.168.20.109 
Oct 18 18:30:08 orarac2 kernel: PCI-DMA: Disabling AGP.
Oct 18 18:30:08 orarac2 openais[2840]: [TOTEM] aru 9 high delivered 9 received flag 0 
Oct 18 18:30:08 orarac2 kernel: PCI-DMA: aperture base @ 4000000 size 65536 KB
Oct 18 18:30:08 orarac2 openais[2840]: [TOTEM] Did not need to originate any messages in recovery. 
Oct 18 18:30:08 orarac2 kernel: PCI-DMA: using GART IOMMU.
Oct 18 18:30:08 orarac2 openais[2840]: [TOTEM] Storing new sequence id for ring 10 
Oct 18 18:30:08 orarac2 kernel: PCI-DMA: Reserving 64MB of IOMMU area in the AGP aperture
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ] CLM CONFIGURATION CHANGE 
Oct 18 18:30:08 orarac2 kernel: pnp: 00:01: ioport range 0x408-0x40f has been reserved
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ] New Configuration: 
Oct 18 18:30:08 orarac2 kernel: pnp: 00:01: ioport range 0x4d0-0x4d1 has been reserved
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ]  r(0) ip(192.168.20.109)  
Oct 18 18:30:08 orarac2 kernel: pnp: 00:01: ioport range 0x4d6-0x4d6 has been reserved
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ] Members Left: 
Oct 18 18:30:08 orarac2 kernel: PCI: Bridge: 0000:01:0d.0
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ] Members Joined: 
Oct 18 18:30:08 orarac2 kernel:   IO window: disabled.
Oct 18 18:30:08 orarac2 openais[2840]: [SYNC ] This node is within the primary component and will provide service. 
Oct 18 18:30:08 orarac2 kernel:   MEM window: f8000000-fbffffff
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ] CLM CONFIGURATION CHANGE 
Oct 18 18:30:08 orarac2 kernel:   PREFETCH window: d4000000-d40fffff
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ] New Configuration: 
Oct 18 18:30:08 orarac2 kernel: PCI: Bridge: 0000:00:05.0
Oct 18 18:30:08 orarac2 openais[2840]: [CLM  ]  r(0) ip(192.168.20.101)  
Oct 18 18:30:08 orarac2 kernel:   IO window: disabled.
Oct 18 18:30:09 orarac2 openais[2840]: [CLM  ]  r(0) ip(192.168.20.109)  
Oct 18 18:30:09 orarac2 kernel:   MEM window: f8000000-fbffffff
Oct 18 18:30:09 orarac2 openais[2840]: [CLM  ] Members Left: 
Oct 18 18:30:09 orarac2 kernel:   PREFETCH window: d4000000-d40fffff
Oct 18 18:30:09 orarac2 openais[2840]: [CLM  ] Members Joined: 
Oct 18 18:30:09 orarac2 kernel: PCI: Bridge: 0000:04:0f.0
Oct 18 18:30:09 orarac2 qdiskd[2909]: <info> Quorum Daemon Initializing 
Oct 18 18:30:09 orarac2 openais[2840]: [CLM  ]  r(0) ip(192.168.20.101)  
Oct 18 18:30:09 orarac2 kernel:   IO window: disabled.
Oct 18 18:30:09 orarac2 openais[2840]: [SYNC ] This node is within the primary component and will provide service. 
Oct 18 18:30:09 orarac2 kernel:   MEM window: disabled.
Oct 18 18:30:09 orarac2 openais[2840]: [TOTEM] entering OPERATIONAL state. 
Oct 18 18:30:09 orarac2 kernel:   PREFETCH window: disabled.
Oct 18 18:30:09 orarac2 openais[2840]: [CMAN ] quorum regained, resuming activity 
Oct 18 18:30:09 orarac2 kernel: PCI: Bridge: 0000:04:10.0
Oct 18 18:30:09 orarac2 openais[2840]: [CLM  ] got nodejoin message 192.168.20.101 
Oct 18 18:30:09 orarac2 kernel:   IO window: 4000-4fff
Oct 18 18:30:09 orarac2 openais[2840]: [CLM  ] got nodejoin message 192.168.20.109 
Oct 18 18:30:09 orarac2 kernel:   MEM window: fde00000-fdefffff
Oct 18 18:30:09 orarac2 openais[2840]: [CPG  ] got joinlist message from node 1 
Oct 18 18:30:09 orarac2 kernel:   PREFETCH window: d4200000-d42fffff
Oct 18 18:30:09 orarac2 kernel: PCI: Bridge: 0000:14:04.0
Oct 18 18:30:10 orarac2 kernel:   IO window: disabled.
Oct 18 18:30:10 orarac2 kernel:   MEM window: disabled.
Oct 18 18:30:10 orarac2 kernel:   PREFETCH window: disabled.
Oct 18 18:30:10 orarac2 kernel: PCI: Bridge: 0000:13:00.0
Oct 18 18:30:10 orarac2 kernel:   IO window: 5000-5fff
Oct 18 18:30:10 orarac2 kernel:   MEM window: fdf00000-fdffffff
Oct 18 18:30:10 orarac2 kernel:   PREFETCH window: d4300000-d43fffff
Oct 18 18:30:10 orarac2 kernel: PCI: Bridge: 0000:04:11.0
Oct 18 18:30:10 orarac2 kernel:   IO window: 5000-5fff
Oct 18 18:30:10 orarac2 kernel:   MEM window: fdf00000-fdffffff
Oct 18 18:30:10 orarac2 kernel:   PREFETCH window: d4300000-d43fffff
Oct 18 18:30:10 orarac2 kernel: PCI: Bridge: 0000:04:12.0
Oct 18 18:30:10 orarac2 kernel:   IO window: disabled.
Oct 18 18:30:10 orarac2 kernel:   MEM window: disabled.
Oct 18 18:30:10 orarac2 kernel:   PREFETCH window: disabled.
Oct 18 18:30:10 orarac2 kernel: PCI: Bridge: 0000:04:13.0
Oct 18 18:30:10 orarac2 kernel:   IO window: disabled.
Oct 18 18:30:10 orarac2 kernel:   MEM window: disabled.
Oct 18 18:30:10 orarac2 kernel:   PREFETCH window: disabled.
Oct 18 18:30:10 orarac2 kernel: GSI 16 sharing vector 0xA9 and IRQ 16
Oct 18 18:30:10 orarac2 kernel: ACPI: PCI Interrupt 0000:04:0f.0[A] -> GSI 16 (level, low) -> IRQ 169
Oct 18 18:30:10 orarac2 kernel: GSI 17 sharing vector 0xB1 and IRQ 17
Oct 18 18:30:10 orarac2 kernel: ACPI: PCI Interrupt 0000:04:10.0[A] -> GSI 20 (level, low) -> IRQ 177
Oct 18 18:30:10 orarac2 kernel: GSI 18 sharing vector 0xB9 and IRQ 18
Oct 18 18:30:10 orarac2 kernel: ACPI: PCI Interrupt 0000:04:11.0[A] -> GSI 19 (level, low) -> IRQ 185
Oct 18 18:30:10 orarac2 kernel: GSI 19 sharing vector 0xC1 and IRQ 19
Oct 18 18:30:10 orarac2 kernel: ACPI: PCI Interrupt 0000:04:12.0[A] -> GSI 18 (level, low) -> IRQ 193
Oct 18 18:30:10 orarac2 kernel: GSI 20 sharing vector 0xC9 and IRQ 20
Oct 18 18:30:10 orarac2 kernel: ACPI: PCI Interrupt 0000:04:13.0[A] -> GSI 17 (level, low) -> IRQ 201
Oct 18 18:30:10 orarac2 kernel: NET: Registered protocol family 2
Oct 18 18:30:10 orarac2 kernel: IP route cache hash table entries: 131072 (order: 8, 1048576 bytes)
Oct 18 18:30:10 orarac2 kernel: TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
Oct 18 18:30:10 orarac2 kernel: TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
Oct 18 18:30:10 orarac2 kernel: TCP: Hash tables configured (established 262144 bind 65536)
Oct 18 18:30:10 orarac2 kernel: TCP reno registered
Oct 18 18:30:10 orarac2 kernel: audit: initializing netlink socket (disabled)
Oct 18 18:30:10 orarac2 kernel: audit(1192732155.813:1): initialized
Oct 18 18:30:11 orarac2 kernel: Total HugeTLB memory allocated, 0
Oct 18 18:30:11 orarac2 kernel: VFS: Disk quotas dquot_6.5.1
Oct 18 18:30:11 orarac2 kernel: Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
Oct 18 18:30:11 orarac2 kernel: SELinux:  Registering netfilter hooks
Oct 18 18:30:11 orarac2 kernel: Initializing Cryptographic API
Oct 18 18:30:11 orarac2 kernel: ksign: Installing public key data
Oct 18 18:30:11 orarac2 kernel: Loading keyring
Oct 18 18:30:11 orarac2 kernel: - Added public key B91DCCA02A005251
Oct 18 18:30:11 orarac2 kernel: - User ID: CentOS (Kernel Module GPG key)
Oct 18 18:30:11 orarac2 kernel: io scheduler noop registered
Oct 18 18:30:11 orarac2 kernel: io scheduler anticipatory registered
Oct 18 18:30:11 orarac2 kernel: io scheduler deadline registered
Oct 18 18:30:11 orarac2 kernel: io scheduler cfq registered (default)
Oct 18 18:30:11 orarac2 kernel: pci 0000:00:04.4: HCRESET not completed yet!
Oct 18 18:30:11 orarac2 kernel: assign_interrupt_mode Found MSI capability
Oct 18 18:30:11 orarac2 last message repeated 4 times
Oct 18 18:30:11 orarac2 kernel: pci_hotplug: PCI Hot Plug PCI Core version: 0.5
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0x4
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0x5
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0x6
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0x7
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0x8
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0x9
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0xa
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0xb
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0xc
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0xd
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0xe
Oct 18 18:30:11 orarac2 kernel: ACPI Exception (acpi_processor-0681): AE_NOT_FOUND, Processor Device is not present [20060707]
Oct 18 18:30:11 orarac2 kernel: ACPI: Getting cpuindex for acpiid 0xf
Oct 18 18:30:12 orarac2 kernel: Real Time Clock Driver v1.12ac
Oct 18 18:30:12 orarac2 kernel: Non-volatile memory driver v1.2
Oct 18 18:30:12 orarac2 kernel: Linux agpgart interface v0.101 (c) Dave Jones
Oct 18 18:30:12 orarac2 kernel: Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
Oct 18 18:30:12 orarac2 kernel: serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
Oct 18 18:30:12 orarac2 kernel: serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
Oct 18 18:30:12 orarac2 kernel: 00:09: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
Oct 18 18:30:12 orarac2 kernel: RAMDISK driver initialized: 16 RAM disks of 16384K size 4096 blocksize
Oct 18 18:30:12 orarac2 kernel: Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
Oct 18 18:30:12 orarac2 kernel: ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
Oct 18 18:30:12 orarac2 kernel: ide-floppy driver 0.99.newide
Oct 18 18:30:12 orarac2 kernel: usbcore: registered new driver hiddev
Oct 18 18:30:12 orarac2 kernel: usbcore: registered new driver usbhid
Oct 18 18:30:12 orarac2 kernel: drivers/usb/input/hid-core.c: v2.6:USB HID core driver
Oct 18 18:30:12 orarac2 kernel: PNP: PS/2 Controller [PNP0303:KBD,PNP0f0e:PS2M] at 0x60,0x64 irq 1,12
Oct 18 18:30:12 orarac2 kernel: serio: i8042 AUX port at 0x60,0x64 irq 12
Oct 18 18:30:12 orarac2 kernel: serio: i8042 KBD port at 0x60,0x64 irq 1
Oct 18 18:30:12 orarac2 kernel: mice: PS/2 mouse device common for all mice
Oct 18 18:30:12 orarac2 kernel: md: md driver 0.90.3 MAX_MD_DEVS=256, MD_SB_DISKS=27
Oct 18 18:30:12 orarac2 kernel: md: bitmap version 4.39
Oct 18 18:30:12 orarac2 kernel: TCP bic registered
Oct 18 18:30:12 orarac2 kernel: Initializing IPsec netlink socket
Oct 18 18:30:12 orarac2 kernel: NET: Registered protocol family 1
Oct 18 18:30:12 orarac2 kernel: NET: Registered protocol family 17
Oct 18 18:30:12 orarac2 kernel: powernow-k8: Found 4 Dual-Core AMD Opteron(tm) Processor 2218 processors (version 2.10.00)
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    0 : fid 0x12 (2600 MHz), vid 0x8
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    1 : fid 0x10 (2400 MHz), vid 0xa
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    2 : fid 0xe (2200 MHz), vid 0xc
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    3 : fid 0xc (2000 MHz), vid 0xe
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    4 : fid 0xa (1800 MHz), vid 0x10
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    5 : fid 0x2 (1000 MHz), vid 0x12
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    0 : fid 0x12 (2600 MHz), vid 0xa
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    1 : fid 0x10 (2400 MHz), vid 0xc
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    2 : fid 0xe (2200 MHz), vid 0xe
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    3 : fid 0xc (2000 MHz), vid 0x10
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    4 : fid 0xa (1800 MHz), vid 0x10
Oct 18 18:30:12 orarac2 kernel: powernow-k8:    5 : fid 0x2 (1000 MHz), vid 0x12
Oct 18 18:30:12 orarac2 kernel: powernow-k8: ph2 null fid transition 0x12
Oct 18 18:30:12 orarac2 kernel: ACPI: (supports S0 S4 S5)
Oct 18 18:30:12 orarac2 kernel: Freeing unused kernel memory: 196k freed
Oct 18 18:30:13 orarac2 kernel: Write protecting the kernel read-only data: 460k
Oct 18 18:30:13 orarac2 kernel: qla2xxx_conf: no version for "inter_module_unregister" found: kernel tainted.
Oct 18 18:30:13 orarac2 kernel: SCSI subsystem initialized
Oct 18 18:30:13 orarac2 kernel: QLogic Fibre Channel HBA Driver
Oct 18 18:30:13 orarac2 kernel: USB Universal Host Controller Interface driver v3.0
Oct 18 18:30:13 orarac2 kernel: GSI 21 sharing vector 0x42 and IRQ 21
Oct 18 18:30:13 orarac2 kernel: ACPI: PCI Interrupt 0000:00:04.4[B] -> GSI 21 (level, low) -> IRQ 66
Oct 18 18:30:13 orarac2 kernel: uhci_hcd 0000:00:04.4: UHCI Host Controller
Oct 18 18:30:13 orarac2 kernel: uhci_hcd 0000:00:04.4: new USB bus registered, assigned bus number 1
Oct 18 18:30:13 orarac2 kernel: uhci_hcd 0000:00:04.4: port count misdetected? forcing to 2 ports
Oct 18 18:30:13 orarac2 kernel: uhci_hcd 0000:00:04.4: HCRESET not completed yet!
Oct 18 18:30:13 orarac2 kernel: uhci_hcd 0000:00:04.4: irq 66, io base 0x00001800
Oct 18 18:30:13 orarac2 kernel: usb usb1: configuration #1 chosen from 1 choice
Oct 18 18:30:13 orarac2 kernel: hub 1-0:1.0: USB hub found
Oct 18 18:30:13 orarac2 kernel: hub 1-0:1.0: 2 ports detected
Oct 18 18:30:13 orarac2 kernel: ACPI: PCI Interrupt Link [IUSB] enabled at IRQ 5
Oct 18 18:30:13 orarac2 kernel: ACPI: PCI Interrupt 0000:00:07.0[A] -> Link [IUSB] -> GSI 5 (level, low) -> IRQ 5
Oct 18 18:30:13 orarac2 kernel: ohci_hcd 0000:00:07.0: OHCI Host Controller
Oct 18 18:30:13 orarac2 kernel: ohci_hcd 0000:00:07.0: new USB bus registered, assigned bus number 2
Oct 18 18:30:13 orarac2 kernel: ohci_hcd 0000:00:07.0: irq 5, io mem 0xf7ee0000
Oct 18 18:30:13 orarac2 kernel: usb usb2: configuration #1 chosen from 1 choice
Oct 18 18:30:13 orarac2 kernel: hub 2-0:1.0: USB hub found
Oct 18 18:30:13 orarac2 kernel: hub 2-0:1.0: 2 ports detected
Oct 18 18:30:13 orarac2 kernel: usb 1-1: new full speed USB device using uhci_hcd and address 2
Oct 18 18:30:13 orarac2 kernel: ACPI: PCI Interrupt 0000:00:07.1[A] -> Link [IUSB] -> GSI 5 (level, low) -> IRQ 5
Oct 18 18:30:13 orarac2 kernel: ohci_hcd 0000:00:07.1: OHCI Host Controller
Oct 18 18:30:13 orarac2 kernel: ohci_hcd 0000:00:07.1: new USB bus registered, assigned bus number 3
Oct 18 18:30:13 orarac2 kernel: ohci_hcd 0000:00:07.1: irq 5, io mem 0xf7ed0000
Oct 18 18:30:13 orarac2 kernel: usb usb3: configuration #1 chosen from 1 choice
Oct 18 18:30:13 orarac2 kernel: hub 3-0:1.0: USB hub found
Oct 18 18:30:13 orarac2 kernel: hub 3-0:1.0: 2 ports detected
Oct 18 18:30:13 orarac2 kernel: usb 1-1: configuration #1 chosen from 1 choice
Oct 18 18:30:13 orarac2 kernel: input: HP Virtual Keyboard as /class/input/input0
Oct 18 18:30:13 orarac2 kernel: input: USB HID v1.01 Keyboard [HP Virtual Keyboard] on usb-0000:00:04.4-1
Oct 18 18:30:13 orarac2 kernel: input: HP Virtual Keyboard as /class/input/input1
Oct 18 18:30:13 orarac2 kernel: input: USB HID v1.01 Mouse [HP Virtual Keyboard] on usb-0000:00:04.4-1
Oct 18 18:30:13 orarac2 kernel: ACPI: PCI Interrupt 0000:00:07.2[A] -> Link [IUSB] -> GSI 5 (level, low) -> IRQ 5
Oct 18 18:30:13 orarac2 kernel: ehci_hcd 0000:00:07.2: EHCI Host Controller
Oct 18 18:30:14 orarac2 kernel: ehci_hcd 0000:00:07.2: new USB bus registered, assigned bus number 4
Oct 18 18:30:14 orarac2 kernel: ehci_hcd 0000:00:07.2: irq 5, io mem 0xf7ec0000
Oct 18 18:30:14 orarac2 kernel: ehci_hcd 0000:00:07.2: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
Oct 18 18:30:14 orarac2 kernel: usb usb4: configuration #1 chosen from 1 choice
Oct 18 18:30:14 orarac2 kernel: hub 4-0:1.0: USB hub found
Oct 18 18:30:14 orarac2 kernel: hub 4-0:1.0: 4 ports detected
Oct 18 18:30:14 orarac2 kernel: usb 1-2: new full speed USB device using uhci_hcd and address 3
Oct 18 18:30:14 orarac2 kernel: HP CISS Driver (v 3.6.14-RH1)
Oct 18 18:30:14 orarac2 kernel: ACPI: PCI Interrupt 0000:14:08.0[A] -> GSI 19 (level, low) -> IRQ 185
Oct 18 18:30:14 orarac2 kernel: cciss0: <0x3238> at PCI 0000:14:08.0 IRQ 74 using DAC
Oct 18 18:30:14 orarac2 kernel:       blocks= 71065440 block_size= 512
Oct 18 18:30:14 orarac2 kernel:       heads= 255, sectors= 32, cylinders= 8709
Oct 18 18:30:14 orarac2 kernel: 
Oct 18 18:30:14 orarac2 kernel:       blocks= 71065440 block_size= 512
Oct 18 18:30:14 orarac2 kernel:       heads= 255, sectors= 32, cylinders= 8709
Oct 18 18:30:14 orarac2 kernel: 
Oct 18 18:30:14 orarac2 kernel:  cciss/c0d0: p1 p2 p3
Oct 18 18:30:14 orarac2 kernel: Initializing USB Mass Storage driver...
Oct 18 18:30:14 orarac2 kernel: usb 1-2: configuration #1 chosen from 1 choice
Oct 18 18:30:14 orarac2 kernel: hub 1-2:1.0: USB hub found
Oct 18 18:30:14 orarac2 kernel: hub 1-2:1.0: 7 ports detected
Oct 18 18:30:14 orarac2 kernel: usbcore: registered new driver usb-storage
Oct 18 18:30:14 orarac2 kernel: USB Mass Storage support registered.
Oct 18 18:30:14 orarac2 kernel: ACPI: PCI Interrupt 0000:0c:00.0[A] -> GSI 20 (level, low) -> IRQ 177
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Found an ISP2432, irq 177, iobase 0xffffc20000020000
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Configuring PCI space...
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Configure NVRAM parameters...
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Verifying loaded RISC code...
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Allocated (64 KB) for EFT...
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Allocated (1413 KB) for firmware dump...
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Waiting for LIP to complete...
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: LIP reset occured (f700).
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: LIP occured (f700).
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: LIP reset occured (f7f7).
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: LOOP UP detected (4 Gbps).
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: Topology - (F_Port), Host Loop address 0x0
Oct 18 18:30:14 orarac2 kernel: scsi0 : qla2xxx
Oct 18 18:30:14 orarac2 kernel: qla2400 0000:0c:00.0: 
Oct 18 18:30:14 orarac2 kernel:  QLogic Fibre Channel HBA Driver: 8.01.07.16-fo
Oct 18 18:30:14 orarac2 kernel:   QLogic QMH2462 - 
Oct 18 18:30:14 orarac2 kernel:   ISP2432: PCIe (2.5Gb/s x4) @ 0000:0c:00.0 hdma+, host#=0, fw=4.00.90 [IP] 
Oct 18 18:30:15 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:15 orarac2 kernel:   Type:   RAID                               ANSI SCSI revision: 05
Oct 18 18:30:15 orarac2 kernel: qla2400 0000:0c:00.0: scsi(0:0:0:0): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:15 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:15 orarac2 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Oct 18 18:30:15 orarac2 kernel: qla2400 0000:0c:00.0: scsi(0:0:0:1): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:15 orarac2 kernel: SCSI device sda: 2097152 512-byte hdwr sectors (1074 MB)
Oct 18 18:30:15 orarac2 kernel: sda: Write Protect is off
Oct 18 18:30:15 orarac2 kernel: SCSI device sda: drive cache: write through w/ FUA
Oct 18 18:30:15 orarac2 kernel: SCSI device sda: 2097152 512-byte hdwr sectors (1074 MB)
Oct 18 18:30:15 orarac2 kernel: sda: Write Protect is off
Oct 18 18:30:15 orarac2 qdiskd[2909]: <info> Node 1 is the master 
Oct 18 18:30:15 orarac2 kernel: SCSI device sda: drive cache: write through w/ FUA
Oct 18 18:30:15 orarac2 kernel:  sda: sda1 sda2 sda3
Oct 18 18:30:15 orarac2 kernel: sd 0:0:0:1: Attached scsi disk sda
Oct 18 18:30:15 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:15 orarac2 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Oct 18 18:30:15 orarac2 kernel: qla2400 0000:0c:00.0: scsi(0:0:0:2): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:15 orarac2 kernel: SCSI device sdb: 734003200 512-byte hdwr sectors (375810 MB)
Oct 18 18:30:15 orarac2 kernel: sdb: Write Protect is off
Oct 18 18:30:15 orarac2 kernel: SCSI device sdb: drive cache: write through w/ FUA
Oct 18 18:30:15 orarac2 kernel: SCSI device sdb: 734003200 512-byte hdwr sectors (375810 MB)
Oct 18 18:30:15 orarac2 kernel: sdb: Write Protect is off
Oct 18 18:30:15 orarac2 kernel: SCSI device sdb: drive cache: write through w/ FUA
Oct 18 18:30:15 orarac2 kernel:  sdb: sdb1
Oct 18 18:30:15 orarac2 kernel: sd 0:0:0:2: Attached scsi disk sdb
Oct 18 18:30:15 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:15 orarac2 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Oct 18 18:30:15 orarac2 kernel: qla2400 0000:0c:00.0: scsi(0:0:0:3): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:15 orarac2 kernel: SCSI device sdc: 419430400 512-byte hdwr sectors (214748 MB)
Oct 18 18:30:15 orarac2 kernel: sdc: Write Protect is off
Oct 18 18:30:15 orarac2 kernel: SCSI device sdc: drive cache: write through w/ FUA
Oct 18 18:30:15 orarac2 kernel: SCSI device sdc: 419430400 512-byte hdwr sectors (214748 MB)
Oct 18 18:30:15 orarac2 kernel: sdc: Write Protect is off
Oct 18 18:30:15 orarac2 kernel: SCSI device sdc: drive cache: write through w/ FUA
Oct 18 18:30:15 orarac2 kernel:  sdc: sdc1
Oct 18 18:30:15 orarac2 kernel: sd 0:0:0:3: Attached scsi disk sdc
Oct 18 18:30:15 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:15 orarac2 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.0: scsi(0:0:0:5): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:16 orarac2 kernel: SCSI device sdd: 419430400 512-byte hdwr sectors (214748 MB)
Oct 18 18:30:16 orarac2 kernel: sdd: Write Protect is off
Oct 18 18:30:16 orarac2 kernel: SCSI device sdd: drive cache: write through w/ FUA
Oct 18 18:30:16 orarac2 kernel: SCSI device sdd: 419430400 512-byte hdwr sectors (214748 MB)
Oct 18 18:30:16 orarac2 kernel: sdd: Write Protect is off
Oct 18 18:30:16 orarac2 kernel: SCSI device sdd: drive cache: write through w/ FUA
Oct 18 18:30:16 orarac2 kernel:  sdd: sdd1
Oct 18 18:30:16 orarac2 kernel: sd 0:0:0:5: Attached scsi disk sdd
Oct 18 18:30:16 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:16 orarac2 kernel:   Type:   RAID                               ANSI SCSI revision: 05
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.0: scsi(0:0:1:0): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:16 orarac2 kernel: scsi 0:0:1:0: Unexpected response from lun 1 while scanning, scan aborted
Oct 18 18:30:16 orarac2 kernel: ACPI: PCI Interrupt 0000:0c:00.1[B] -> GSI 16 (level, low) -> IRQ 169
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Found an ISP2432, irq 169, iobase 0xffffc20000186000
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Configuring PCI space...
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Configure NVRAM parameters...
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Verifying loaded RISC code...
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Allocated (64 KB) for EFT...
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Allocated (1413 KB) for firmware dump...
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Waiting for LIP to complete...
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: LIP reset occured (f700).
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: LOOP UP detected (4 Gbps).
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: Topology - (F_Port), Host Loop address 0x0
Oct 18 18:30:16 orarac2 kernel: scsi1 : qla2xxx
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: 
Oct 18 18:30:16 orarac2 kernel:  QLogic Fibre Channel HBA Driver: 8.01.07.16-fo
Oct 18 18:30:16 orarac2 kernel:   QLogic QMH2462 - 
Oct 18 18:30:16 orarac2 kernel:   ISP2432: PCIe (2.5Gb/s x4) @ 0000:0c:00.1 hdma+, host#=1, fw=4.00.90 [IP] 
Oct 18 18:30:16 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:16 orarac2 kernel:   Type:   RAID                               ANSI SCSI revision: 05
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: scsi(1:0:0:0): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:16 orarac2 kernel: scsi 1:0:0:0: Unexpected response from lun 1 while scanning, scan aborted
Oct 18 18:30:16 orarac2 kernel:   Vendor: HP        Model: HSV200            Rev: 6000
Oct 18 18:30:16 orarac2 kernel:   Type:   RAID                               ANSI SCSI revision: 05
Oct 18 18:30:16 orarac2 kernel: qla2400 0000:0c:00.1: scsi(1:0:1:0): Enabled tagged queuing, queue depth 16.
Oct 18 18:30:16 orarac2 kernel: scsi 1:0:1:0: Unexpected response from lun 1 while scanning, scan aborted
Oct 18 18:30:16 orarac2 kernel: EXT3-fs: INFO: recovery required on readonly filesystem.
Oct 18 18:30:16 orarac2 kernel: EXT3-fs: write access will be enabled during recovery.
Oct 18 18:30:16 orarac2 kernel: kjournald starting.  Commit interval 5 seconds
Oct 18 18:30:16 orarac2 kernel: EXT3-fs: cciss/c0d0p3: orphan cleanup on readonly fs
Oct 18 18:30:17 orarac2 kernel: EXT3-fs: cciss/c0d0p3: 2 orphan inodes deleted
Oct 18 18:30:17 orarac2 kernel: EXT3-fs: recovery complete.
Oct 18 18:30:17 orarac2 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 18 18:30:17 orarac2 kernel: SELinux:  Disabled at runtime.
Oct 18 18:30:17 orarac2 kernel: SELinux:  Unregistering netfilter hooks
Oct 18 18:30:17 orarac2 kernel: audit(1192732178.168:2): selinux=0 auid=4294967295
Oct 18 18:30:17 orarac2 kernel: scsi 0:0:0:0: Attached scsi generic sg0 type 12
Oct 18 18:30:17 orarac2 kernel: sd 0:0:0:1: Attached scsi generic sg1 type 0
Oct 18 18:30:17 orarac2 kernel: sd 0:0:0:2: Attached scsi generic sg2 type 0
Oct 18 18:30:17 orarac2 kernel: sd 0:0:0:3: Attached scsi generic sg3 type 0
Oct 18 18:30:17 orarac2 kernel: sd 0:0:0:5: Attached scsi generic sg4 type 0
Oct 18 18:30:17 orarac2 kernel: scsi 0:0:1:0: Attached scsi generic sg5 type 12
Oct 18 18:30:17 orarac2 kernel: scsi 1:0:0:0: Attached scsi generic sg6 type 12
Oct 18 18:30:17 orarac2 kernel: scsi 1:0:1:0: Attached scsi generic sg7 type 12
Oct 18 18:30:17 orarac2 kernel: Broadcom NetXtreme II Gigabit Ethernet Driver bnx2 v1.4.44-1 (August 10, 2006)
Oct 18 18:30:17 orarac2 kernel: GSI 22 sharing vector 0x52 and IRQ 22
Oct 18 18:30:17 orarac2 kernel: ACPI: PCI Interrupt 0000:02:03.0[A] -> GSI 22 (level, low) -> IRQ 82
Oct 18 18:30:17 orarac2 kernel: eth0: Broadcom NetXtreme II BCM5706 1000Base-SX (A2) PCI-X 64-bit 100MHz found at mem fa000000, IRQ 82, node addr 001b78e45478
Oct 18 18:30:17 orarac2 kernel: GSI 23 sharing vector 0x5A and IRQ 23
Oct 18 18:30:17 orarac2 kernel: ACPI: PCI Interrupt 0000:02:04.0[A] -> GSI 23 (level, low) -> IRQ 90
Oct 18 18:30:17 orarac2 kernel: eth1: Broadcom NetXtreme II BCM5706 1000Base-SX (A2) PCI-X 64-bit 100MHz found at mem f8000000, IRQ 90, node addr 001b78e4547e
Oct 18 18:30:17 orarac2 kernel: input: PC Speaker as /class/input/input2
Oct 18 18:30:17 orarac2 kernel: EDAC MC: Ver: 2.0.1 Jul 10 2007
Oct 18 18:30:17 orarac2 kernel: EDAC MC0: Giving out device to k8_edac Athlon64/Opteron: DEV 0000:00:18.2
Oct 18 18:30:17 orarac2 kernel: EDAC MC1: Giving out device to k8_edac Athlon64/Opteron: DEV 0000:00:19.2
Oct 18 18:30:17 orarac2 kernel: shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
Oct 18 18:30:17 orarac2 kernel: Floppy drive(s): fd0 is 1.44M
Oct 18 18:30:17 orarac2 kernel: floppy0: no floppy controllers found
Oct 18 18:30:17 orarac2 kernel: Floppy drive(s): fd0 is 1.44M
Oct 18 18:30:17 orarac2 kernel: floppy0: no floppy controllers found
Oct 18 18:30:17 orarac2 kernel: lp: driver loaded but no devices found
Oct 18 18:30:17 orarac2 kernel: ACPI: Power Button (FF) [PWRF]
Oct 18 18:30:17 orarac2 kernel: ibm_acpi: ec object not found
Oct 18 18:30:17 orarac2 kernel: md: Autodetecting RAID arrays.
Oct 18 18:30:17 orarac2 kernel: md: autorun ...
Oct 18 18:30:17 orarac2 kernel: md: ... autorun DONE.
Oct 18 18:30:17 orarac2 kernel: device-mapper: ioctl: 4.11.0-ioctl (2006-09-14) initialised: dm-devel at redhat.com
Oct 18 18:30:17 orarac2 kernel: EXT3 FS on cciss/c0d0p3, internal journal
Oct 18 18:30:17 orarac2 kernel: kjournald starting.  Commit interval 5 seconds
Oct 18 18:30:17 orarac2 kernel: EXT3 FS on cciss/c0d0p1, internal journal
Oct 18 18:30:17 orarac2 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 18 18:30:18 orarac2 kernel: Adding 4192956k swap on /dev/cciss/c0d0p2.  Priority:-1 extents:1 across:4192956k
Oct 18 18:30:18 orarac2 kernel: NET: Registered protocol family 10
Oct 18 18:30:18 orarac2 kernel: lo: Disabled Privacy Extensions
Oct 18 18:30:18 orarac2 kernel: IPv6 over IPv4 tunneling driver
Oct 18 18:30:18 orarac2 kernel: process `sysctl' is using deprecated sysctl (syscall) net.ipv6.neigh.lo.retrans_time; Use net.ipv6.neigh.lo.retrans_time_ms instead.
Oct 18 18:30:18 orarac2 kernel: bnx2: eth0: using MSI
Oct 18 18:30:18 orarac2 kernel: ADDRCONF(NETDEV_UP): eth0: link is not ready
Oct 18 18:30:18 orarac2 kernel: bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex
Oct 18 18:30:18 orarac2 kernel: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Oct 18 18:30:18 orarac2 kernel: bnx2: eth1: using MSI
Oct 18 18:30:18 orarac2 kernel: ADDRCONF(NETDEV_UP): eth1: link is not ready
Oct 18 18:30:18 orarac2 kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON
Oct 18 18:30:18 orarac2 kernel: ADDRCONF(NETDEV_CHANGE): eth1: link becomes ready
Oct 18 18:30:18 orarac2 kernel: vmmon: module license 'unspecified' taints kernel.
Oct 18 18:30:18 orarac2 kernel: bridge-eth0: already up
Oct 18 18:30:18 orarac2 kernel: DLM (built Jul 10 2007 06:39:52) installed
Oct 18 18:30:18 orarac2 kernel: GFS2 (built Jul 10 2007 06:40:14) installed
Oct 18 18:30:18 orarac2 kernel: Lock_DLM (built Jul 10 2007 06:40:21) installed
Oct 18 18:30:18 orarac2 kernel: dlm: connecting to 1
Oct 18 18:30:18 orarac2 kernel: dlm: got connection from 1
Oct 18 18:30:19 orarac2 qdiskd[2909]: <info> Initial score 1/1 
Oct 18 18:30:19 orarac2 qdiskd[2909]: <info> Initialization complete 
Oct 18 18:30:19 orarac2 openais[2840]: [CMAN ] quorum device registered 
Oct 18 18:30:19 orarac2 clvmd: Cluster LVM daemon started - connected to CMAN
Oct 18 18:44:40 orarac2 syslogd 1.4.1: restart.

B) Both nodes up: cluster fully functional
 Node 1:
 
 Node  Sts   Inc   Joined               Name
    0   M      0   2007-10-18 18:47:29  /dev/sda1
    1   M      8   2007-10-18 18:47:14  racnode1
    2   M      8   2007-10-18 18:47:14  racnode2
    
 type             level name         id       state node id local_done
 fence            0     default      00010002 none
 [1 2]
 dlm              1     clvmd        00020002 none
 [1 2]
 dlm              1     gfs_share_1  00040002 none
 [1 2]
 dlm              1     gfs_flash_1  00060002 none
 [1 2]
 dlm              1     rgmanager    00070002 none
 [1 2]
 gfs              2     gfs_share_1  00030002 none
 [1 2]
 gfs              2     gfs_flash_1  00050002 none
 [1 2]

 Node 2:
 
 Node  Sts   Inc   Joined               Name
    0   M      0   2007-10-18 18:47:42  /dev/sda1
    1   M      8   2007-10-18 18:47:29  racnode1
    2   M      4   2007-10-18 18:44:48  racnode2
 
 type             level name         id       state node id local_done
 fence            0     default      00010002 none
 [1 2]
 dlm              1     clvmd        00020002 none
 [1 2]
 dlm              1     gfs_share_1  00040002 none
 [1 2]
 dlm              1     gfs_flash_1  00060002 none
 [1 2]
 dlm              1     rgmanager    00070002 none
 [1 2]
 gfs              2     gfs_share_1  00030002 none
 [1 2]
 gfs              2     gfs_flash_1  00050002 none
 [1 2]


________________________________

Da: David Teigland [mailto:teigland at redhat.com]
Inviato: gio 18/10/2007 16.12
A: tam_annie at alice.it
Cc: linux-cluster at redhat.com
Oggetto: Re: [Linux-cluster] RHEL5 clvmd hangs only after a node crashes...


On Thu, Oct 18, 2007 at 01:33:41PM +0200, tam_annie at alice.it wrote:
> Hi everybody,
>
>       I have successfully installed and (almost successfully) configured RHEL5 cluster suite on a two node cluster, which will soon become a three node cluster, hopefully - my boss' euros willing-: that's why I configured it with qdisk (on a raw partition) too.
>       Two GFS (v. 1) filesystems are shared by both nodes.
>      
>       Well, everything really works like a breeze (HP iLO power fencing obviously included), even when I reboot _NORMALLY_ any node.
>      
>       Problems arise only after a node _CRASHES_ (You see, I love performing cold reboots in test environments), when that's exactly what happens:
>      
>       1) Node 2 crashes;
>       2) Node 1 successfully fences node 2: I can go on working on GFS
>       file systems after a freeze lasting less than one second;
>       3) While booting up, node 2 startup sequence runs fine: cman
>       services start successfully (I even get 'Starting fencing... [OK]'
>       !!!), but when it comes to clvmd, music definitely changes: dlm
>       connections are successfully established, but then the whole node
>       hangs on 'Starting clvmd... '. Debugging clvmd init script, I've
>       found that the problem is due to the vgscan command, which hangs
>       indefinitely on something like 'Locking vg_flash_1... '. I can't
>       really find any particular error about that in my logs.

Please remove qdisk from your configuration and see if anything changes.
If it still doesn't work, please send any dlm information from
/var/log/messages, cman_tool nodes output from all nodes, group_tool -v
output from all nodes.

Dave


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/b47d58cc/attachment.htm>

From lhh at redhat.com  Thu Oct 18 20:00:33 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 18 Oct 2007 16:00:33 -0400
Subject: [Linux-cluster] XEN and Cluster Questions
In-Reply-To: <200710171141.03724.grimme@atix.de>
References: <200710171141.03724.grimme@atix.de>
Message-ID: <1192737634.10256.7.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-17 at 11:41 +0200, Marc Grimme wrote:
> Hello,
> we are currently discussing XEN with clustering support. 
> There came some questions we are not sure what the answer is. Perhaps you can 
> help ;-) .
> 
> Background is: We are discussing a group of XEN Dom0 Hosts sharing all devices 
> and files via GFS. They themselves again host a couple of virtually 
> redhat-clustered DomU Hosts with or without gfs.
> 
> 1. Live Migration of cluster DomU nodes:
> When I live migrate a virtual DomU clusternode to another DOM0 XEN Host the 
> migration works ;-) , but the virtual clusternode is thrown out of the 
> cluster. Is this a "works as designed"? I think the problem are the 
> heartbeats not coming in proper time.
> Does that lead to the conclusion that one cannot live migrate cluster nodes?

Depends.  If you're using rgmanager to do migration, the migration is
actually not live.  In order to do live migration,
change /usr/share/cluster/vm.sh...

  - where it says 'xm migrate ...'
  - change it to 'xm migrate -l ...'

That should enable live migration.

> 2. Fencing:
> How about fencing of the virtual Dom-U Clusternodes. You are never sure on 
> which Dom-0 Node runs our Dom-U Clusternode. Is the fencing via fence_xvm[d] 
> supported on such an environment? That means how does a virtual DomU 
> clusternode X running on Dom0 Xen Host x know that if virtual DomU 
> clusternode Y running on Dom0 Xen Host y is running there when it is getting 
> the fence request to fence Host y where it is running?

Yes.  Fence_xvmd is designed (specifically) to handle the case where the
dom0 hosting a particular domU is not known.  Note that this only works
on RHEL5 with openais and such; fence_xvmd uses AIS checkpoints to store
virtual machine locations.

Notes:
* the parent dom0 cluster still needs fencing, too :)
* do not mix domU and dom0 in the same cluster,
* all domUs within a dom0 cluster must have different domain names,
* do *not* reuse /etc/xen/fence_xvm.key between multiple dom0 clusters

-- Lon


From rpeterso at redhat.com  Thu Oct 18 20:00:45 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Thu, 18 Oct 2007 15:00:45 -0500
Subject: [Linux-cluster] crossover cables
In-Reply-To: <C33CF394.10C888%jgray@nicusa.com>
References: <C33CF394.10C888%jgray@nicusa.com>
Message-ID: <1192737645.3073.24.camel@technetium.msp.redhat.com>

On Thu, 2007-10-18 at 11:27 -0400, Josh Gray wrote:
> Does the cluster then automatically use the xover interface for
> cluster communication?  Do you have to specify it somewhere?
> 
> Josh

Hi Josh and everyone,

I just updated the faq on this topic this morning.
It pertains not only to crossover cables, but to using a
desired NIC in general, which may or may not be a crossover cable.

http://sources.redhat.com/cluster/faq.html#cman_heartbeat_nic

Regards,

Bob Peterson
Red Hat Cluster Suite


From jgray at nicusa.com  Thu Oct 18 20:07:06 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 16:07:06 -0400
Subject: [Linux-cluster] crossover cables
In-Reply-To: <1192737645.3073.24.camel@technetium.msp.redhat.com>
Message-ID: <C33D352A.10C908%jgray@nicusa.com>

Awesome!  I've been wondering this for a little while now actually.

Do you guys have a mailing list or any way of notifications when the FAQ is
updated?

JG

On 10/18/07 4:00 PM, "Bob Peterson" <rpeterso at redhat.com> wrote:

> On Thu, 2007-10-18 at 11:27 -0400, Josh Gray wrote:
>> Does the cluster then automatically use the xover interface for
>> cluster communication?  Do you have to specify it somewhere?
>> 
>> Josh
> 
> Hi Josh and everyone,
> 
> I just updated the faq on this topic this morning.
> It pertains not only to crossover cables, but to using a
> desired NIC in general, which may or may not be a crossover cable.
> 
> http://sources.redhat.com/cluster/faq.html#cman_heartbeat_nic
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From pruebas at agro.uba.ar  Thu Oct 18 20:22:03 2007
From: pruebas at agro.uba.ar (Gustavo Marcello)
Date: Thu, 18 Oct 2007 17:22:03 -0300
Subject: [Linux-cluster] GFS in debian
Message-ID: <4717C06B.9080309@agro.uba.ar>

Hello.
One question: It is possible to implement GFS in debian? Some of you 
made do? You have some tutorial to recommend? Thank you for the help

Gustavo


From christopher.barry at qlogic.com  Thu Oct 18 20:27:09 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Thu, 18 Oct 2007 15:27:09 -0500
Subject: [Linux-cluster] crossover cables
References: <C33CF394.10C888%jgray@nicusa.com>
	<1192737645.3073.24.camel@technetium.msp.redhat.com>
Message-ID: <D158540CCC0AB54C8FD4818F823CCB241906D2@EPEXCH1.qlogic.org>

>-----Original Message-----
>From: linux-cluster-bounces at redhat.com on behalf of Bob Peterson
>Sent: Thu 10/18/2007 4:00 PM
>To: linux clustering
>Subject: Re: [Linux-cluster] crossover cables
> 
>On Thu, 2007-10-18 at 11:27 -0400, Josh Gray wrote:
>> Does the cluster then automatically use the xover interface for
>> cluster communication?  Do you have to specify it somewhere?
>> 
>> Josh
>
>Hi Josh and everyone,
>
>I just updated the faq on this topic this morning.
>It pertains not only to crossover cables, but to using a
>desired NIC in general, which may or may not be a crossover cable.
>
>http://sources.redhat.com/cluster/faq.html#cman_heartbeat_nic
>
>Regards,
>
>Bob Peterson
>Red Hat Cluster Suite

Hi Bob,

Based on the FAQ entry, am I right in assuming the nodes in cluster.conf are then named

node-01-p
node-02-p
...
node-0n-p

Correct?

Regards,
-C


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2953 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/7b61f06b/attachment.bin>

From jgray at nicusa.com  Thu Oct 18 21:14:17 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 17:14:17 -0400
Subject: [Linux-cluster] Fencing /w APC 7900
Message-ID: <C33D44E9.10C919%jgray@nicusa.com>

I'm getting a menu not recognized error when I try fencing with the switches
I just picked up.  Is a certain OS or Firmware required?


# /sbin/fence_apc -a 10.0.0.xxx -l admin -n 2 -o off -p xxx -v
failed: unrecognised menu response

# cat /tmp/apclog 

User Name : admin
Password  : **********


>From the gui:
      
Hardware Factory
       
Model Number:     AP7900
Hardware Revision:     B2
Manufacture Date:     08/17/2007
Management Uptime:     0 Days 3 Hours 8 Minutes
     
             
Application Module
       
Name:     rpdu
Version:     v3.3.3
Date:     01/05/2007
Time:     14:56:33
     
             
APC OS (AOS)
       
Name:     aos
Version:     v3.3.4
Date:     01/05/2007
Time:     


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From rpeterso at redhat.com  Thu Oct 18 21:19:04 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Thu, 18 Oct 2007 16:19:04 -0500
Subject: [Linux-cluster] crossover cables
In-Reply-To: <D158540CCC0AB54C8FD4818F823CCB241906D2@EPEXCH1.qlogic.org>
References: <C33CF394.10C888%jgray@nicusa.com>
	<1192737645.3073.24.camel@technetium.msp.redhat.com>
	<D158540CCC0AB54C8FD4818F823CCB241906D2@EPEXCH1.qlogic.org>
Message-ID: <1192742344.3073.25.camel@technetium.msp.redhat.com>

On Thu, 2007-10-18 at 15:27 -0500, Christopher Barry wrote:
> Hi Bob,
> 
> Based on the FAQ entry, am I right in assuming the nodes in cluster.conf are then named
> 
> node-01-p
> node-02-p
> ...
> node-0n-p
> 
> Correct?
> 
> Regards,
> -C

Hi,

Yes, that's the general idea.

Bob Peterson
Red Hat Cluster Suite


From jparsons at redhat.com  Thu Oct 18 21:21:51 2007
From: jparsons at redhat.com (jim parsons)
Date: Thu, 18 Oct 2007 17:21:51 -0400
Subject: [Linux-cluster] Fencing /w APC 7900
In-Reply-To: <C33D44E9.10C919%jgray@nicusa.com>
References: <C33D44E9.10C919%jgray@nicusa.com>
Message-ID: <1192742511.3294.1.camel@localhost.localdomain>

Try this one. It is slated for the next update. Just drop in to /sbin on
every node.

-j


On Thu, 2007-10-18 at 17:14 -0400, Josh Gray wrote:
> I'm getting a menu not recognized error when I try fencing with the switches
> I just picked up.  Is a certain OS or Firmware required?
> 
> 
> # /sbin/fence_apc -a 10.0.0.xxx -l admin -n 2 -o off -p xxx -v
> failed: unrecognised menu response
> 
> # cat /tmp/apclog 
> 
> User Name : admin
> Password  : **********
> 
> 
> >From the gui:
>       
> Hardware Factory
>        
> Model Number:     AP7900
> Hardware Revision:     B2
> Manufacture Date:     08/17/2007
> Management Uptime:     0 Days 3 Hours 8 Minutes
>      
>              
>       
> Application Module
>        
> Name:     rpdu
> Version:     v3.3.3
> Date:     01/05/2007
> Time:     14:56:33
>      
>              
>       
> APC OS (AOS)
>        
> Name:     aos
> Version:     v3.3.4
> Date:     01/05/2007
> Time:     
> 
> 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: fence_apc
Type: text/x-python
Size: 25643 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/c814e8f7/attachment.py>

From jgray at nicusa.com  Thu Oct 18 21:30:04 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 17:30:04 -0400
Subject: [Linux-cluster] Fencing /w APC 7900
In-Reply-To: <1192742511.3294.1.camel@localhost.localdomain>
Message-ID: <C33D489C.10C923%jgray@nicusa.com>

Dang you guys reply fast..  Worked like a charm.  Many thanks from me (and
my deadlines) for all the questions answered the last week or so!

JG


On 10/18/07 5:21 PM, "jim parsons" <jparsons at redhat.com> wrote:

> Try this one. It is slated for the next update. Just drop in to /sbin on
> every node.
> 
> -j

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From rpeterso at redhat.com  Thu Oct 18 21:28:48 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Thu, 18 Oct 2007 16:28:48 -0500
Subject: [Linux-cluster] crossover cables
In-Reply-To: <C33D352A.10C908%jgray@nicusa.com>
References: <C33D352A.10C908%jgray@nicusa.com>
Message-ID: <1192742928.3073.36.camel@technetium.msp.redhat.com>

On Thu, 2007-10-18 at 16:07 -0400, Josh Gray wrote:
> Awesome!  I've been wondering this for a little while now actually.
> 
> Do you guys have a mailing list or any way of notifications when the FAQ is
> updated?
> 
> JG

Hi Josh,

Not really.  It doesn't seem to show up on cluster-devel like the other
cluster code commits, probably because it's not in the cluster tree.
I'll ask around and see if we can get that to happen automatically.

Anyway, I need to sit down and do a bunch of updating of the faq
sometime soon.  I used to update it about every week, but I've been so
busy I haven't had time to work on it for quite a while.
But people have been asking some really good questions and I've been
meaning to add them for a long time now.  Sigh.

Regards,

Bob Peterson
Red Hat Cluster Suite


From jgray at nicusa.com  Thu Oct 18 21:34:44 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 17:34:44 -0400
Subject: [Linux-cluster] crossover cables
In-Reply-To: <1192742928.3073.36.camel@technetium.msp.redhat.com>
Message-ID: <C33D49B4.10C92B%jgray@nicusa.com>

I have to admit the close contact RH has with the end users on this list is
impressive.  A welcome change from the cold shoulder other companies seem to
give.  

Back to the FAQ - Maybe just some hooks into an RSS feed that is updated
either automatically, or manually whenever you make changes/additions,
would be perfect for me.

Josh


On 10/18/07 5:28 PM, "Bob Peterson" <rpeterso at redhat.com> wrote:

> On Thu, 2007-10-18 at 16:07 -0400, Josh Gray wrote:
>> Awesome!  I've been wondering this for a little while now actually.
>> 
>> Do you guys have a mailing list or any way of notifications when the FAQ is
>> updated?
>> 
>> JG
> 
> Hi Josh,
> 
> Not really.  It doesn't seem to show up on cluster-devel like the other
> cluster code commits, probably because it's not in the cluster tree.
> I'll ask around and see if we can get that to happen automatically.
> 
> Anyway, I need to sit down and do a bunch of updating of the faq
> sometime soon.  I used to update it about every week, but I've been so
> busy I haven't had time to work on it for quite a while.
> But people have been asking some really good questions and I've been
> meaning to add them for a long time now.  Sigh.
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From jgray at nicusa.com  Thu Oct 18 21:37:54 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 17:37:54 -0400
Subject: [Linux-cluster] Fencing /w APC 7900
In-Reply-To: <1192742511.3294.1.camel@localhost.localdomain>
Message-ID: <C33D4A72.10C92D%jgray@nicusa.com>

Hmmm  might just be a fluke or a bug in my unit - but on one (of the two)
apc's I have,   it rebooted both times I ran a command.  I first did off
then on a while later..   I'll play with it more and see if it keeps
happening.  Thought you might want to know.


JG


On 10/18/07 5:21 PM, "jim parsons" <jparsons at redhat.com> wrote:

> Try this one. It is slated for the next update. Just drop in to /sbin on
> every node.
> 
> -j

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From jgray at nicusa.com  Thu Oct 18 22:05:02 2007
From: jgray at nicusa.com (Josh Gray)
Date: Thu, 18 Oct 2007 18:05:02 -0400
Subject: [Linux-cluster] Fencing /w APC 7900
In-Reply-To: <1192742511.3294.1.camel@localhost.localdomain>
Message-ID: <C33D50CE.10C943%jgray@nicusa.com>

Jim could you send that script as an attachment?   When I run it from luci
I'm getting the following errors in /var/log/messages    Seems to work when
I do it from the command line though


Oct 18 18:03:33 nfs-7 fence_node[11985]: agent "fence_apc" reports:
Traceback (most recent call last):   File "/sbin/fence_apc", line 832, in ?
main()   File "/sbin/fence_apc", line 303, in main     do_power_off(sock)
File "/sbin/fence_apc", line 816, in do_power_off     x =
do_power_switch(sock, "off")   File "/sbi
Oct 18 18:03:33 nfs-7 fence_node[11985]: agent "fence_apc" reports:
n/fence_apc", line 611, in do_power_switch     result_code, response =
power_off(txt + ndbuf)   File "/sbin/fence_apc", line 820, in power_off
x = power_switch(buffer, False, "2", "3");   File "/sbin/fence_apc", line
813, in power_switch     raise "un
Oct 18 18:03:33 nfs-7 fence_node[11985]: agent "fence_apc" reports: known
screen encountered in \n" + str(lines) + "\n" unknown screen encountered in
['', '> 1', '', '', '------- Phase Management
------------------------------------------------------', '', '        Phase
Load :  3.9', '        Phase State: Normal Load ',
Oct 18 18:03:33 nfs-7 fence_node[11985]: agent "fence_apc" reports:  '', '
1- Overload Alarm Threshold(amps)       : 12', '     2- Near Overload
Warning Threshold(amps): 8', '     3- Low Load Warning Threshold(amps)     :
0', '     4- Accept Changes                       : ', '', '     ?- Help,
<ESC>- Back, <ENTER>- R
Oct 18 18:03:33 nfs-7 fence_node[11985]: agent "fence_apc" reports: efresh,
<CTRL-L>- Event Log']
Oct 18 18:03:33 nfs-7 fence_node[11985]: Fence of "nfs-6.cdc.nicusa.com" was
unsuccessful 


On 10/18/07 5:21 PM, "jim parsons" <jparsons at redhat.com> wrote:

> Try this one. It is slated for the next update. Just drop in to /sbin on
> every node.
> 
> -j
> 
> 
> On Thu, 2007-10-18 at 17:14 -0400, Josh Gray wrote:
>> I'm getting a menu not recognized error when I try fencing with the switches
>> I just picked up.  Is a certain OS or Firmware required?
>> 
>> 
>> # /sbin/fence_apc -a 10.0.0.xxx -l admin -n 2 -o off -p xxx -v
>> failed: unrecognised menu response
>> 
>> # cat /tmp/apclog
>> 
>> User Name : admin
>> Password  : **********
>> 
>> 
>>> From the gui:
>>       
>> Hardware Factory
>>        
>> Model Number:     AP7900
>> Hardware Revision:     B2
>> Manufacture Date:     08/17/2007
>> Management Uptime:     0 Days 3 Hours 8 Minutes
>>      
>>              
>>       
>> Application Module
>>        
>> Name:     rpdu
>> Version:     v3.3.3
>> Date:     01/05/2007
>> Time:     14:56:33
>>      
>>              
>>       
>> APC OS (AOS)
>>        
>> Name:     aos
>> Version:     v3.3.4
>> Date:     01/05/2007
>> Time:     
>> 
>> 
> #!/usr/bin/python
> 
> 
##############################################################################>
#
> 
##############################################################################>
#
> ##
> ##  Copyright (C) 2006 Red Hat, Inc.  All rights reserved.
> ##
> ##  This copyrighted material is made available to anyone wishing to use,
> ##  modify, copy, or redistribute it subject to the terms and conditions
> ##  of the GNU General Public License v.2.
> ##
> 
##############################################################################>
#
> 
##############################################################################>
#
> 
> import getopt, sys
> import os
> import socket
> import time
> 
> from telnetlib import Telnet
> 
> TELNET_TIMEOUT=5  #How long to wait for a response from a telnet try
> 
> # WARNING!! Do not add code bewteen "#BEGIN_VERSION_GENERATION" and
> # "#END_VERSION_GENERATION"  It is generated by the Makefile
> 
> #BEGIN_VERSION_GENERATION
> FENCE_RELEASE_NAME="New APC Agent - test release"
> REDHAT_COPYRIGHT=""
> BUILD_DATE="September 21, 2006"
> #END_VERSION_GENERATION
> 
> POWER_OFF = 0
> POWER_ON = 1
> POWER_STATUS = 2
> POWER_REBOOT = 3
> 
> COMPLETE = 0
> NOT_COMPLETE = 1
> 
> ON = "ON"
> OFF = "OFF"
> 
> SUCCESS = "success"
> FAIL = "fail"
> 
> address = ""
> login = ""
> passwd = ""
> passwd_script = ""
> port = ""
> switchnum = ""
> action = POWER_REBOOT   #default action
> verbose = False
> 
> logfile = None
> 
> #set up regex list
> CONTROL_CONSOLE = "Control Console -----"
> DEVICE_MANAGER = "Device Manager -----"
> OUTLET_CONTROL = "- Outlet Control/Configuration -----"
> OUTLET_MANAGE = "- Outlet Management -----"
> CONTROL_OUTLET = "- Control Outlet -----"
> CONTROL_OUTLET_2 = "- Outlet Control "
> COMMAND_SUCCESS = "Command successfully issued."
> COMMAND_SUCCESS_2 = " Success"
> CONFIRM = "Enter 'YES' to continue or <ENTER> to cancel :"
> CONTINUE = "Press <ENTER> to continue..."
> SCREEN_END = "<ESC>- Main Menu, <ENTER>- Refresh, <CTRL-L>- Event Log"
> SCREEN_END_2 = "<ESC>- Back, <ENTER>- Refresh, <CTRL-L>- Event Log"
> USERNAME = "User Name :"
> PASSWORD = "Password  :"
> MASTER = "------- MasterSwitch"
> FIRMWARE_STR = "Rack PDU APP"
> 
> CONTINUE_INDEX = 0
> 
> FIRMWARE_REV = 2
> 
> regex_list = list()
> regex_list.append(CONTINUE)
> regex_list.append(SCREEN_END)
> regex_list.append(SCREEN_END_2)
> regex_list.append(USERNAME)
> regex_list.append(PASSWORD)
> 
> def usage():
>   print "Usage:\n"
>   print "fence_apc [options]"
>   print "Options:"
>   print "   -a <ipaddress>           ip or hostname of APC switch"
>   print "   -h                       print out help"
>   print "   -l [login]               login name"
>   print "   -n [port]                switch port"
>   print "   -p [password]            password"
>   print "   -S [path]                script to run to retrieve password"
>   print "   -o [action]              Reboot (default), Off, On, or Status"
>   print "   -v Verbose               Verbose mode - writes file to
> /tmp/apclog"
>   print "   -V                       Print Version, then exit"
> 
>   sys.exit (0)
> 
> def version():
>   print "fence_apc %s  %s\n" % (FENCE_RELEASE_NAME, BUILD_DATE)
>   print "%s\n" % REDHAT_COPYRIGHT
>   sys.exit(0)
> 
> def main():
> 
>   global address, login, passwd, passwd_script, port, action, verbose,
> logfile, switchnum
> 
>   if len(sys.argv) > 1:
>     try:
>       opts, args = getopt.getopt(sys.argv[1:], "a:hl:o:n:p:S:vV", ["help",
> "output="])
>     except getopt.GetoptError:
>       #print help info and quit
>       usage()
>       sys.exit(2)
>       
>     for o, a in opts:
>       if o == "-v":
>         verbose = True
>       if o == "-V":
>         version()
>       if o in ("-h", "--help"):
>         usage()
>         sys.exit()
>       if o == "-l":
>         login = a
>       if o == "-p":
>         passwd = a
>       if o == "-S":
>         passwd_script = a
>       if o == "-n":
>         dex = a.find(":")
>         if dex == (-1):
>           port = a
>         else:
>           switchnum = a[:dex]
>           port = a[(dex+1):]
>       if o  == "-o":
>         if a == "Off" or a == "OFF" or a == "off":
>           action = POWER_OFF
>         elif a == "On" or a == "ON" or a == "on":
>           action = POWER_ON
>         elif a == "Status" or a == "STATUS" or a == "status":
>           action = POWER_STATUS
>         elif a == "Reboot" or a == "REBOOT" or a == "reboot":
>           action = POWER_REBOOT
>         else:
>           usage()
>           sys.exit()
>       if o == "-a":
>         address = a
>     if address == "" or login == "" or (passwd == "" and passwd_script == "")
> or port == "":
>       usage()
>       sys.exit()
>     
>   else: #Take args from stdin...
>     params = {}
>     #place params in dict
>     for line in sys.stdin:
>       val = line.split("=")
>       if len(val) == 2:
>         params[val[0].strip()] = val[1].strip()
> 
>     try:
>       address = params["ipaddr"]
>     except KeyError, e:
>       sys.stderr.write("FENCE: Missing ipaddr param for fence_apc...exiting")
>       sys.exit(1)
>     try:
>       login = params["login"]
>     except KeyError, e:
>       sys.stderr.write("FENCE: Missing login param for fence_apc...exiting")
>       sys.exit(1)
>     try:
>       if 'passwd' in params:
>         passwd = params["passwd"]
>       if 'passwd_script' in params:
>         passwd_script = params['passwd_script']
>       if passwd == "" and passwd_script == "":
>         raise "missing password"
>     except:
>       sys.stderr.write("FENCE: Missing passwd for fence_apc...exiting")
>       sys.exit(1)
>     try:
>       port = params["port"]
>     except KeyError, e:
>       sys.stderr.write("FENCE: Missing port param for fence_apc...exiting")
>       sys.exit(1)
>     try:
>       switchnum = params["switch"]
>     except KeyError, e:
>       pass
>     try:
>       verbose = params["verbose"]
>       verbose = (verbose == 'True' or verbose == 'true' or verbose == 'TRUE')
>     except KeyError, e:
>       pass
>     
>     try:
>       a = params["option"]
>       if a == "Off" or a == "OFF" or a == "off":
>         action = POWER_OFF
>       elif a == "On" or a == "ON" or a == "on":
>         action = POWER_ON
>       elif a == "Reboot" or a == "REBOOT" or a == "reboot":
>         action = POWER_REBOOT
>     except KeyError, e:
>       action = POWER_REBOOT
>     
>     #### End of stdin section
>   
>   
>   # retrieve passwd from passwd_script (if specified)
>   passwd_scr = ''
>   if len(passwd_script):
>     try:
>       if not os.access(passwd_script, os.X_OK):
>         raise 'script not executable'
>       p = os.popen(passwd_script, 'r', 1024)
>       passwd_scr = p.readline().strip()
>       if p.close() != None:
>         raise 'script failed'
>     except:
>       sys.stderr.write('password-script "%s" failed\n' % passwd_script)
>       passwd_scr = ''
>     
>   if passwd == "" and passwd_scr == "":
>     sys.stderr.write('password not available, exiting...')
>     sys.exit(1)
>   elif passwd == passwd_scr:
>     pass
>   elif passwd and passwd_scr:
>     # execute self, with password_scr as passwd,
>     # if that fails, continue with "passwd" argument as password
>     if len(sys.argv) > 1:
>       comm = sys.argv[0]
>       skip_next = False
>       for w in sys.argv[1:]:
>         if skip_next:
>           skip_next = False
>         elif w in ['-p', '-S']:
>           skip_next = True
>         else:
>           comm += ' ' + w
>       comm += ' -p ' + passwd_scr
>       ret = os.system(comm)
>       if ret != -1 and os.WIFEXITED(ret) and os.WEXITSTATUS(ret) == 0:
>         # success
>         sys.exit(0)
>       else:
>         sys.stderr.write('Use of password from "passwd_script" failed, trying
> "passwd" argument\n')
>     else: # use stdin
>       p = os.popen(sys.argv[0], 'w', 1024)
>       for par in params:
>         if par not in ['passwd', 'passwd_script']:
>           p.write(par + '=' + params[par] + '\n')
>       p.write('passwd=' + passwd_scr + '\n')
>       p.flush()
>       if p.close() == None:
>         # success
>         sys.exit(0)
>       else:
>         sys.stderr.write('Use of password from "passwd_script" failed, trying
> "passwd" argument\n')
>   elif passwd_scr:
>     passwd = passwd_scr
>   # passwd all set
>   
>   
>   
>   ### Order of events
>   # 0) If verbose, prepare log file handle
>   # 1) Open socket
>   # 2) Log in
>   # 3) Evaluate task. Task will be one of:
>   # 3a - Check status and print to stdout (or log file if verbose)
>   # 3b - Turn a port off, then confirm
>   # 3c - Turn a port on, then confirm
>   # 3d - Reboot by turning a port off, then on, and confirming each step.
> 
>   if verbose:
>     setup_logging()
> 
>   sock = setup_socket()
> 
>   # Ok, now lets log in...
>   do_login(sock)
> 
>   # Now we should be at the outside Control screen
>     
>   if action == POWER_STATUS:
>     # We should be at the Control screen, so we need to write a '1'
>     # to kick things off
>     sock.write("1\r")
>     statusval = do_status_check(sock)
>     backout(sock)
>     sock.write("4\r")  # Logs out
>     
>   elif action == POWER_OFF:
>     sock.write("1\r")
>     do_power_off(sock)
>     backout(sock) # Return to control screen
>     statusval = do_status_check(sock)
>     if statusval == OFF:
>       if verbose:
>         logit("Power Off successful\n")
>       print "Power Off successful"
>       backout(sock)
>       sock.write("4\r")  # Logs out
>       sock.close()
>       sys.exit(0)
>     else:
>       if verbose:
>         logit("Power Off unsuccessful\n")
>         logit("Undetermined error\n")
>       sys.stderr.write("Power Off unsuccessful")
>       backout(sock)
>       sock.write("4\r")  # Logs out
>       sock.close()
>       sys.exit(1)
>     
>   elif action == POWER_ON:
>     sock.write("1\r")
>     do_power_on(sock)
>     backout(sock) # Return to control screen
>     statusval = do_status_check(sock)
>     if statusval == ON:
>       if verbose:
>         logit("Power On successful\n")
>       print "Power On successful"
>       backout(sock)
>       sock.write("4\r")  # logs out
>       sock.close()
>       sys.exit(0)
>     else:
>       if verbose:
>         logit("Power On unsuccessful\n")
>         logit("Undetermined error\n")
>       sys.stderr.write("Power On unsuccessful")
>       backout(sock)
>       sock.write("4\r")  # Logs out
>       sock.close()
>       sys.exit(1)
> 
>   elif action == POWER_REBOOT:
>     sock.write("1\r")
>     do_power_off(sock)
>     backout(sock) # Return to control screen
>     statusval = do_status_check(sock)
>     if statusval == OFF:
>       if verbose:
>         logit("Power Off successful\n")
>       print "Power Off successful"
>       backout(sock)
>     else:
>       if verbose:
>         logit("Power Off unsuccessful\n")
>         logit("Undetermined error\n")
>       sys.stderr.write("Power Off unsuccessful")
>       backout(sock)
>       sock.write("4\r")  # Logs out
>       sock.close()
>       sys.exit(1)
>     do_power_on(sock)
>     backout(sock) # Return to control screen
>     statusval = do_status_check(sock)
>     if statusval == ON:
>       if verbose:
>         logit("Power Reboot successful\n")
>       print "Power Reboot successful"
>       backout(sock)
>       sock.write("4\r")  # Logs out
>       sock.close()
>       sys.exit(0)
>     else:
>       if verbose:
>         logit("Power Reboot unsuccessful\n")
>         logit("Undetermined error\n")
>       sys.stderr.write("Power Reboot unsuccessful")
>       backout(sock)
>       sock.write("4\r")  # Logs out
>       sock.close()
>       sys.exit(1)
>     
>   sock.close()
> 
> def backout(sock):
>   sock.write(chr(27))
>   
>   while (1):
>     i, mo, txt = sock.expect(regex_list, TELNET_TIMEOUT)
> 
>     if regex_list[i] == SCREEN_END:
>       break
>     elif regex_list[i] == SCREEN_END_2:
>       sock.write(chr(27))
> 
> def setup_socket():
>   ## Time to open telnet session and log in.
>   try:
>     sock = Telnet(address.strip())
>   except socket.error, (errno, msg):
>     my_msg = "FENCE: A problem was encountered opening a telnet session with "
> + address
>     if verbose:
>       logit(my_msg)
>       logit("FENCE: Error number: %d -- Message: %s\n" % (errno, msg))
>       logit("Firewall issue? Correct address?\n")
> 
>     sys.stderr.write(my_msg)
>     sys.stderr.write(("FENCE: Error number: %d -- Message: %s\n" % (errno,
> msg)))
>     sys.stderr.write("Firewall issue? Correct address?\n")
>     sys.exit(1)
> 
>   if verbose:
>     logit("\nsocket open to %s\n" % address)
> 
>   return sock
> 
> def setup_logging( log_location="/tmp/apclog"):
>   global logfile
>   try:
>     logfile = open(log_location, 'a')
>     logfile.write("###############################################\n")
>     logfile.write("Telnetting to apc switch %s\n" % address)
>     now = time.localtime(time.time())
>     logfile.write(time.asctime(now))
>   except IOError, e:
>     sys.stderr.write("Failed to open log file %s" % log_location)
>     logfile = None
> 
> def logit(instr):
>   if logfile != None:
>     logfile.write(instr)
> 
> def do_login(sock):
>   result_code = 1
> 
>   ## This loop tries to assemble complete telnet screens and passes
>   ## them to helper methods to handle responses accordingly.
>   while result_code:
>     try:
>       i, mo, txt = sock.expect(regex_list, TELNET_TIMEOUT)
>     except socket.error, (errno, msg):
>       my_msg = "FENCE: A problem was encountered opening a telnet session with
> " + address + "\n"
>       if verbose:
>         logit(my_msg)
>         logit("FENCE: Error number: %d -- Message: %s\n" % (errno, msg))
> 
>       sys.stderr.write(my_msg)
>       sys.stderr.write(("FENCE: Error number: %d -- Message: %s\n" % (errno,
> msg)))
>       sys.exit(1)
> 
>     if i == CONTINUE_INDEX: # Capture the rest of the screen...
>       sock.write("\r")
>       ii,moo,txtt = sock.expect(regex_list, TELNET_TIMEOUT)
>       txt = txt + txtt
> 
>     ndbuf = sock.read_eager() # Scoop up remainder
>     if verbose:
>       logit(txt + ndbuf)
>     result_code,response = log_in(txt + ndbuf)
>     if result_code:
>       try:
>         sock.write(response)
>       except socket.error, (errno, msg):
>         if verbose:
>           logit("Error #%s" % errno)
>           logit(msg)
>         sys.stderr.write("Error #%s:  %s" % (errno,msg))
>         sys.exit(1)
> 
> def log_in(buffer):
>   global FIRMWARE_REV
>   lines = buffer.splitlines()
> 
>   for i in lines:
>     if i.find(USERNAME) != (-1):
>       if verbose:
>         logit("Sending login: %s\n" % login)
>       return (NOT_COMPLETE, login + "\r")
>     elif i.find(PASSWORD) != (-1):
>       if verbose:
>         logit("Sending password: %s\n" % passwd)
>       return (NOT_COMPLETE, passwd + "\r")
>     elif i.find(CONTROL_CONSOLE) != (-1):
>       #while we are here, grab the firmware revision
>       rev_search_lines = buffer.splitlines()
>       for rev_search_line in rev_search_lines: #search screen again
>         rev_dex = rev_search_line.find(FIRMWARE_STR)
>         if rev_dex != (-1): #found revision line
>           scratch_rev = rev_search_line[rev_dex:]
>           v_dex = scratch_rev.find("v")
>           if v_dex != (-1):
>             if scratch_rev[v_dex + 1] == "3": #format is v3.3.4
>               FIRMWARE_REV = 3
>               break
>       return (COMPLETE, "1\r")
> 
> def do_status_check(sock):
>   result_code = 1
>   while result_code:
>     i, mo, txt = sock.expect(regex_list, TELNET_TIMEOUT)
>     if i == CONTINUE_INDEX: # Capture the rest of the screen...
>       sock.write("\r")
>       ii,moo,txtt = sock.expect(regex_list, TELNET_TIMEOUT)
>       txt = txt + txtt
> 
>     ndbuf = sock.read_eager() # Scoop up remainder
>     if verbose:
>       logit(txt + ndbuf)
>     (result_code,response,statusval) = return_status(txt + ndbuf)
>     if result_code:
>       try:
>         sock.write(response)
>       except socket.error, (errno, msg):
>         if verbose:
>           logit("Status check failed.")
>           logit("Error #%s" % errno)
>           logit(msg)
>         sys.stderr.write("Status check failed.")
>         sys.stderr.write("Error #%s:  %s" % (errno,msg))
>         sys.exit(1)
>   # Back from status check - value should be in status var
>   if response == SUCCESS:
>     if switchnum == "":
>       if verbose:
>         logit("Status check successful. Port %s is %s" % (port,statusval))
>       print "Status check successful. Port %s is %s" % (port,statusval)
>     else:
>       if verbose:
>         logit("Status check successful. Port %s:%s is %s" % (switchnum, port,
> statusval))
>       print "Status check successful. Port %s:%s is %s" % (switchnum, port,
> statusval)
> 
>     return statusval
>   else:
>     if verbose:
>       logit("Status check failed, unknown reason.")
>     sys.stderr.write("Status check failed, unknown reason.\n")
>     sock.close()
>     sys.exit(1) 
> 
> def return_status(buffer):
>   global switchnum, port
> 
>   lines = buffer.splitlines()
> 
>   for i in lines:
>     if i.find(CONTROL_CONSOLE) != (-1):
>       return (NOT_COMPLETE, "1\r", "Status Unknown")
>     elif i.find(DEVICE_MANAGER) != (-1):
>       if switchnum != "":
>         res = switchnum + "\r"
>       else:
>         if FIRMWARE_REV == 2:
>           res = "3\r"
>         elif FIRMWARE_REV == 3:
>           res = "2\r1\r"
>         else: #placeholder for future revisions
>           res = "3\r"
>       return (NOT_COMPLETE, res, "Status Unknown")
>     elif i.find(OUTLET_CONTROL) != (-1):
>       ls = buffer.splitlines()
>       portval = port.strip()
>       portval = " " + portval + " "
>       portval2 = " " + port.strip() + "- "
>       found_portval = False
>       for l in ls:
>         if l.find(portval) != (-1) or l.find(portval2) != (-1):
>           found_portval = True
>           linesplit = l.split()
>           linelen = len(linesplit)
>           return (COMPLETE,SUCCESS,linesplit[linelen - 1])
>     elif i.find(MASTER) != (-1):
>       try:
>         e = int(port.strip())
>         portval = port.strip()
>         switchval = switchnum.strip()
>         portval = switchval + ":" + portval
>       except ValueError, e:
>         portval = port.strip()
>       ls = buffer.splitlines()
>       found_portval = False
>       for l in ls:
>         words = l.split()
>         if len(words) > 3:
>           if words[2] == portval or words[3] == portval:
>             found_portval = True
>             linesplit = l.split()
>             linelen = len(linesplit)
>             return (COMPLETE, SUCCESS, linesplit[linelen - 3])
>       return (COMPLETE, FAIL, "Incorrect port number")
>   return (NOT_COMPLETE, chr(27), "Status Unknown")
> 
> def do_power_switch(sock, status):
>   result_code = 1
> 
>   while result_code:
>     i, mo, txt = sock.expect(regex_list, TELNET_TIMEOUT)
>     if i == CONTINUE_INDEX: # Capture the rest of the screen...
>       sock.write("\r")
>       ii,moo,txtt = sock.expect(regex_list, TELNET_TIMEOUT)
>       txt = txt + txtt
> 
>     ndbuf = sock.read_eager() # Scoop up remainder
>     if verbose:
>       logit(txt + ndbuf)
> 
>     if status == "off":
>       result_code, response = power_off(txt + ndbuf)
>     elif status == "on":
>       result_code, response = power_on(txt + ndbuf)
>     else:
>       if verbose:
>         logit("Invalid status in do_power_switch() function")
>       sys.stderr.write("Invalid status in do_power_switch() function")
>       sys.exit(1)
> 
>     if result_code:
>       try:
>         sock.write(response)
>       except socket.error, (errno, msg):
>         if verbose:
>           logit("Error #%s" % errno)
>           logit(msg)
>         sys.stderr.write("Error #%s:  %s" % (errno,msg))
>         sys.exit(1)
>       # FIXME: always returns COMPLETE (0)
>     else:
>       try:
>         sock.write(response)
>       except socket.error, (errno, msg):
>         if verbose:
>           logit("Error #%s" % errno)
>           logit(msg)
>         sys.stderr.write("Error #%s:  %s" % (errno,msg))
>         sys.exit(1)
>       return COMPLETE
> 
> 
> def power_switch(buffer, escape, control_outlet, control_outlet2):
>   # If port is not aliased, then outlet control screen will have the word
>   # 'Outlet' in the header. If the name is aliased, it will only have the
>   # alias in the header.
> 
>   outlet_search_str1 = "Outlet " + port.strip() + " ------------"
>   outlet_search_str2 = port.strip() + " ------------"
>   outlet_search_str3 = "Outlet " + switchnum.strip() + ":" + port.strip() + "
> ------"
>   outlet_search_str4 = "        Outlet       : " + port.strip()
>   outlet_search_str5 = "        Outlet Name : " + port.strip()
>   master_search_str1 = "-------- Master"
>   lines = buffer.splitlines()
> 
>   for i in lines:
>     if i.find(CONTROL_CONSOLE) != (-1):
>       return (NOT_COMPLETE,"1\r")
> 
>     elif i.find(DEVICE_MANAGER) != (-1):
>       if switchnum != "":
>         res = switchnum + "\r"
>       else:
>         if FIRMWARE_REV == 2:
>           res = "3\r"
>         elif FIRMWARE_REV == 3:
>           #Changed for bz299191
>           #res = "2\r1\r"
>           res = "2\r"
>         else: #placeholder for future revisions - sheesh
>           res = "3\r"
>       return (NOT_COMPLETE, res)
>       
>     elif (i.find(master_search_str1) != (-1)):
>       return (NOT_COMPLETE, port.strip() + "\r")
>       
>     elif i.find(outlet_search_str1) != (-1) and (switchnum == ""):
>       return (NOT_COMPLETE,"1\r")
> 
>     elif i.find(outlet_search_str2) != (-1) and (switchnum == ""):
>       return (NOT_COMPLETE,"1\r")
>     
>     elif i.find(outlet_search_str3) != (-1):
>       return (NOT_COMPLETE, "1\r")
>     
>     elif i == outlet_search_str4:
>       return (NOT_COMPLETE, "1\r")
>     
>     elif i == outlet_search_str5:
>       return (NOT_COMPLETE, "1\r")
>     
>     elif i.find(OUTLET_MANAGE) != (-1):
>       #Changed for bz299191
>       #return (NOT_COMPLETE, "\r")
>       return (NOT_COMPLETE, "1\r")
> 
>     #elif i.find(OUTLET_CONTROL) != (-1) or i.find(OUTLET_MANAGE) != (-1):
>     elif i.find(OUTLET_CONTROL) != (-1):
>       ls = buffer.splitlines()
>       portval = port.strip()
>       portval = " " + portval + " "
>       found_portval = False
>       i = 0
>       # look for aliased name
>       for l in ls:
>         i = i + 1
>         if l.find(portval) != (-1):
>           found_portval = True
>           linesplit = l.split()
>           outlet_str = linesplit[0]
>           dex = outlet_str.find("-")
>           if dex <= (0):
>             if verbose:
>               logit("Problem identifying outlet\n")
>               logit("Looking for %s in string %s\n" % (portval,outlet_str))
>             sys.stderr.write("Problem identifying outlet\n")
>             sys.stderr.write("Looking for %s in string %s\n" %
> (portval,outlet_str))
>             sys.exit(1)
>           normalized_outlet_str = outlet_str[:dex]
>           return (NOT_COMPLETE, normalized_outlet_str + "\r")
>       # look for portnum
>       portval = " " + port.strip() + "-"
>       i = 0
>       for l in ls:
>         i = i + 1
>         if l.find(portval) != (-1):
>           found_portval = True
>           linesplit = l.split()
>           outlet_str = linesplit[0]
>           dex = outlet_str.find("-")
>           if dex <= (0):
>             if verbose:
>               logit("Problem identifying outlet\n")
>               logit("Looking for %s in string %s\n" % (portval,outlet_str))
>             sys.stderr.write("Problem identifying outlet\n")
>             sys.stderr.write("Looking for %s in string %s\n" %
> (portval,outlet_str))
>             sys.exit(1)
>           normalized_outlet_str = outlet_str[:dex]
>           return (NOT_COMPLETE, normalized_outlet_str + "\r")
>       if found_portval == False:
>         if verbose:
>           logit("Problem identifying outlet\n")
>           logit("Looking for '%s' in string '%s'\n" % (portval, ls))
>         sys.stderr.write("Problem identifying outlet\n")
>         sys.stderr.write("Looking for '%s' in string '%s'\n" % (portval, ls))
>         sys.exit(1)
> 
>     elif i.find(MASTER) != (-1):
>       ls = buffer.splitlines()
>       found_portval = False
>       # look for aliased name
>       portval = port.strip()
>       for l in ls:
>         words = l.strip().split()
>         if len(words) > 3:
>           if '----' not in words[0] and words[3].strip() == portval:
>             outlet_str = words[0]
>             dex = outlet_str.find("-")
>             if dex <= (0):
>               if verbose:
>                 logit("Problem identifying outlet\n")
>                 logit("Looking for %s in string %s\n" % (portval, outlet_str))
>               sys.stderr.write("Problem identifying outlet\n")
>               sys.stderr.write("Looking for %s in string %s\n" % (portval,
> outlet_str))
>               sys.exit(1)
>             normalized_outlet_str = outlet_str[:dex]
>             return (NOT_COMPLETE, (normalized_outlet_str + "\r"))
>       # look for portnum
>       portval = port.strip()
>       portval = switchnum.strip() + ":" + portval + " "
>       i = 0
>       for l in ls:
>         i = i + 1
>         if l.find(portval) != (-1):
>           found_portval = True
>           linesplit = l.split()
>           outlet_str = linesplit[0]
>           dex = outlet_str.find("-")
>           if dex <= (0):
>             if verbose:
>               logit("Problem identifying outlet\n")
>               logit("Looking for %s in string %s\n" % (portval,outlet_str))
>             sys.stderr.write("Problem identifying outlet\n")
>             sys.stderr.write("Looking for %s in string %s\n" %
> (portval,outlet_str))
>             sys.exit(1)
>           normalized_outlet_str = outlet_str[:dex]
>           return (NOT_COMPLETE, (normalized_outlet_str + "\r"))
>       if found_portval == False:
>         if verbose:
>           logit("Problem identifying outlet\n")
>           logit("Looking for '%s' in string '%s'\n" % (portval, ls))
>         sys.stderr.write("Problem identifying outlet\n")
>         sys.stderr.write("Looking for '%s' in string '%s'\n" % (portval, ls))
>         sys.exit(1)
> 
>     elif i.find(CONFIRM) != (-1):
>       return (NOT_COMPLETE,"YES\r")
> 
>     elif i.find(COMMAND_SUCCESS) != (-1):
>       return (COMPLETE,"\r")
> 
>     elif i.find(COMMAND_SUCCESS_2) != (-1):
>       return (COMPLETE,"\r")
> 
>     elif i.find(CONTROL_OUTLET) != (-1):
>       return (NOT_COMPLETE, control_outlet + "\r")
> 
>     elif i.find(CONTROL_OUTLET_2) != (-1):
>       return (NOT_COMPLETE, control_outlet2 + "\r")
>   
>   if (escape == True):
>     return (NOT_COMPLETE, chr(27))
>   else:
>     raise "unknown screen encountered in \n" + str(lines) + "\n"
> 
> def do_power_off(sock):
>   x = do_power_switch(sock, "off")
>   return x
> 
> def power_off(buffer):
>   x = power_switch(buffer, False, "2", "3");
>   return x
> 
> def do_power_on(sock):
>   x = do_power_switch(sock, "on")
>   return x
> 
> def power_on(buffer):
>   x = power_switch(buffer, True, "1", "1");
>   return x
> 
> if __name__ == "__main__":
>   main()
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From kanderso at redhat.com  Thu Oct 18 22:24:02 2007
From: kanderso at redhat.com (Kevin Anderson)
Date: Thu, 18 Oct 2007 17:24:02 -0500
Subject: [Linux-cluster] RHEL5 sharedroot clusters
In-Reply-To: <200710111725.59227.hlawatschek@atix.de>
References: <200710111725.59227.hlawatschek@atix.de>
Message-ID: <1192746242.2849.58.camel@localhost.localdomain>

Mark,

When I run the mkinitrd command, I get an error that comoonics-cs rpm
not found.  When I try to yum install it, I get dependency errors on
perl(XML::DOM), perl(XML::XQL) and perl(XML::XQL::DOM)  are needed by
the package.  Which rpm are those commands in?

BTW - am trying to set this up in a xen guest.  Will see how far I get.

Thanks
Kevin

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/4e720ada/attachment.htm>

From david.costakos at gmail.com  Thu Oct 18 23:23:04 2007
From: david.costakos at gmail.com (Dave Costakos)
Date: Thu, 18 Oct 2007 16:23:04 -0700
Subject: [Linux-cluster] RHEL5 sharedroot clusters
In-Reply-To: <1192746242.2849.58.camel@localhost.localdomain>
References: <200710111725.59227.hlawatschek@atix.de>
	<1192746242.2849.58.camel@localhost.localdomain>
Message-ID: <6b6836c60710181623m31e2aa76gbe305d85cb003a9a@mail.gmail.com>

If you want RPMs for RHEL5 prebuild, you can get them from EPEL:

http://fedoraproject.org/wiki/EPEL/

On 10/18/07, Kevin Anderson <kanderso at redhat.com> wrote:
>
>  Mark,
>
> When I run the mkinitrd command, I get an error that comoonics-cs rpm not
> found.  When I try to yum install it, I get dependency errors on
> perl(XML::DOM), perl(XML::XQL) and perl(XML::XQL::DOM)  are needed by the
> package.  Which rpm are those commands in?
>
> BTW - am trying to set this up in a xen guest.  Will see how far I get.
>
> Thanks
> Kevin
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Dave Costakos
mailto:david.costakos at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071018/f155a429/attachment.htm>

From joseparrella at gmail.com  Thu Oct 18 23:42:14 2007
From: joseparrella at gmail.com (Jose Miguel Parrella Romero)
Date: Thu, 18 Oct 2007 19:42:14 -0400
Subject: [Linux-cluster] GFS in debian
In-Reply-To: <4717C06B.9080309@agro.uba.ar>
References: <4717C06B.9080309@agro.uba.ar>
Message-ID: <4717EF56.6080303@gmail.com>

Gustavo Marcello escribi?:
> Hello.
> One question: It is possible to implement GFS in debian? Some of you
> made do? You have some tutorial to recommend? Thank you for the help

Yes. We are actually running two HA/LB/A-A clusters, using Heartbeat,
LVS, Red Hat Cluster Suite and GFS over a FC-based SAN. The whole setup
uses Debian Etch on IA64 machines. You can find everything related to
the Cluster Suite on Debian [1]

Use aptitude to install the packages you need and then follow the RHCS
docs -- you will be working on your cluster really soon!

[1] http://packages.debian.org/source/etch/redhat-cluster


From grimme at atix.de  Fri Oct 19 06:28:27 2007
From: grimme at atix.de (Marc Grimme)
Date: Fri, 19 Oct 2007 08:28:27 +0200
Subject: [Linux-cluster] RHEL5 sharedroot clusters
In-Reply-To: <1192746242.2849.58.camel@localhost.localdomain>
References: <200710111725.59227.hlawatschek@atix.de>
	<1192746242.2849.58.camel@localhost.localdomain>
Message-ID: <200710190828.28902.grimme@atix.de>

On Friday 19 October 2007 00:24:02 Kevin Anderson wrote:
> Mark,
>
> When I run the mkinitrd command, I get an error that comoonics-cs rpm
> not found.  When I try to yum install it, I get dependency errors on
> perl(XML::DOM), perl(XML::XQL) and perl(XML::XQL::DOM)  are needed by
> the package.  Which rpm are those commands in?
These are old messages. That only came up with installing comoonics-cs. Which 
is not needed any more. That should do.

Try to use the rpms from the preview channel and also those for rhel5 if using 
rhel5.
--------------------------------------------------------------------------------
[comoonics-preview]
name=Packages for the comoonics shared root cluster
baseurl=http://download.atix.de/yum/comoonics/redhat-el5/preview/noarch/
enabled=1
gpgcheck=1
gpgkey=http://download.atix.de/yum//comoonics/comoonics-RPM-GPG.key

[comoonics-preview-i386]
name=Packages for the comoonics shared root cluster
baseurl=http://download.atix.de/yum/comoonics/redhat-el5/preview/$arch
enabled=1
gpgcheck=1
gpgkey=http://download.atix.de/yum/comoonics/comoonics-RPM-GPG.key

[comoonics-addons]
name=Addon Packages for utilities
baseurl=http://download.atix.de/yum/comoonics/redhat-el5/preview/noarch/
enabled=1
gpgcheck=1
gpgkey=http://download.atix.de/yum/comoonics/comoonics-RPM-GPG.key


>
> BTW - am trying to set this up in a xen guest.  Will see how far I get.
Should not be a problem. Xen support was added recently.
You will also want to install comoonics-bootimage-extras-xen. That'll help.

Let us know about you success.

Regards Marc.
>
> Thanks
> Kevin


-- 
Gruss / Regards,

Marc Grimme
http://www.atix.de/               http://www.open-sharedroot.org/


From grimme at atix.de  Fri Oct 19 06:42:02 2007
From: grimme at atix.de (Marc Grimme)
Date: Fri, 19 Oct 2007 08:42:02 +0200
Subject: [Linux-cluster] XEN and Cluster Questions
In-Reply-To: <1192737634.10256.7.camel@ayanami.boston.devel.redhat.com>
References: <200710171141.03724.grimme@atix.de>
	<1192737634.10256.7.camel@ayanami.boston.devel.redhat.com>
Message-ID: <200710190842.03026.grimme@atix.de>

On Thursday 18 October 2007 22:00:33 Lon Hohberger wrote:
> On Wed, 2007-10-17 at 11:41 +0200, Marc Grimme wrote:
> > Hello,
> > we are currently discussing XEN with clustering support.
> > There came some questions we are not sure what the answer is. Perhaps you
> > can help ;-) .
> >
> > Background is: We are discussing a group of XEN Dom0 Hosts sharing all
> > devices and files via GFS. They themselves again host a couple of
> > virtually redhat-clustered DomU Hosts with or without gfs.
> >
> > 1. Live Migration of cluster DomU nodes:
> > When I live migrate a virtual DomU clusternode to another DOM0 XEN Host
> > the migration works ;-) , but the virtual clusternode is thrown out of
> > the cluster. Is this a "works as designed"? I think the problem are the
> > heartbeats not coming in proper time.
> > Does that lead to the conclusion that one cannot live migrate cluster
> > nodes?
>
> Depends.  If you're using rgmanager to do migration, the migration is
> actually not live.  In order to do live migration,
> change /usr/share/cluster/vm.sh...
>
>   - where it says 'xm migrate ...'
>   - change it to 'xm migrate -l ...'
Ok got it.
Still did you try to live migrate a cluster node?
>
> That should enable live migration.
>
> > 2. Fencing:
> > How about fencing of the virtual Dom-U Clusternodes. You are never sure
> > on which Dom-0 Node runs our Dom-U Clusternode. Is the fencing via
> > fence_xvm[d] supported on such an environment? That means how does a
> > virtual DomU clusternode X running on Dom0 Xen Host x know that if
> > virtual DomU clusternode Y running on Dom0 Xen Host y is running there
> > when it is getting the fence request to fence Host y where it is running?
>
> Yes.  Fence_xvmd is designed (specifically) to handle the case where the
> dom0 hosting a particular domU is not known.  Note that this only works
> on RHEL5 with openais and such; fence_xvmd uses AIS checkpoints to store
> virtual machine locations.
>
> Notes:
> * the parent dom0 cluster still needs fencing, too :)
Yes. Thats in place. Check.
> * do not mix domU and dom0 in the same cluster,
I didn't. Check.
> * all domUs within a dom0 cluster must have different domain names,
Ups. hostname -d on dom0 and hostname -d on domu need to be different? 
What if they are empty?
Or do you mean some other domainname?
Dom0: 
[root at axqa01_2 ~]# hostname -d
[root at axqa01_2 ~]#
DomU:
[root at axqa03_1 ~]# hostname -d
cc.atix

> * do *not* reuse /etc/xen/fence_xvm.key between multiple dom0 clusters
I just did not use it.
Dom0:
[root at axqa01_2 ~]# ps ax | grep [f]ence_xvmd
 1932 pts/1    S+     0:00 fence_xvmd -ddddd -f -c none -C none
So on axqa01_2 runs axqa03_2 and on axqa01_1 runs axqa03_1
Then when I do a 
./fence_xvm -ddddd -C none -c none -H axqa03_2 on axqa03_1 I get the 
following:
Waiting for response
Received 264 bytes
Adding IP 127.0.0.1 to list (family 2)
Adding IP 10.1.2.1 to list (family 2)
Adding IP 192.168.10.40 to list (family 2)
Adding IP 192.168.122.1 to list (family 2)
Closing Netlink connection
ipv4_listen: Setting up ipv4 listen socket
ipv4_listen: Success; fd = 3
Setting up ipv4 multicast send (225.0.0.12:1229)
Joining IP Multicast group (pass 1)
Joining IP Multicast group (pass 2)
Setting TTL to 2 for fd4
ipv4_send_sk: success, fd = 4
sign_request: no-op (HASH_NONE)
Sending to 225.0.0.12 via 127.0.0.1
Setting up ipv4 multicast send (225.0.0.12:1229)
Joining IP Multicast group (pass 1)
Joining IP Multicast group (pass 2)
Setting TTL to 2 for fd4
ipv4_send_sk: success, fd = 4
sign_request: no-op (HASH_NONE)
Sending to 225.0.0.12 via 10.1.2.1
Setting up ipv4 multicast send (225.0.0.12:1229)
Joining IP Multicast group (pass 1)
Joining IP Multicast group (pass 2)
Setting TTL to 2 for fd4
ipv4_send_sk: success, fd = 4
sign_request: no-op (HASH_NONE)
Sending to 225.0.0.12 via 192.168.10.40
Setting up ipv4 multicast send (225.0.0.12:1229)
Joining IP Multicast group (pass 1)
Joining IP Multicast group (pass 2)
Setting TTL to 2 for fd4
ipv4_send_sk: success, fd = 4
sign_request: no-op (HASH_NONE)
Sending to 225.0.0.12 via 192.168.122.1
Waiting for connection from XVM host daemon.
Issuing TCP challenge
tcp_challenge: no-op (AUTH_NONE)
Responding to TCP challenge
tcp_response: no-op (AUTH_NONE)
TCP Exchange + Authentication done...
Waiting for return value from XVM host
Remote: Operation failed

on axqa01_2:
------                   ----                                 ----- -----
axqa03_2                 cb165cce-1798-daf9-1252-12a2347a9fc7 00002 00002
Domain-0                 00000000-0000-0000-0000-000000000000 00002 00001
Storing axqa03_2
libvir: Xen Daemon error : GET operation failed:
Domain                   UUID                                 Owner State
------                   ----                                 ----- -----
axqa03_2                 cb165cce-1798-daf9-1252-12a2347a9fc7 00002 00002
Domain-0                 00000000-0000-0000-0000-000000000000 00002 00001
Storing axqa03_2
Request to fence: axqa03_2
axqa03_2 is running locally
Plain TCP request
libvir: Xen Daemon error : GET operation failed:
libvir: error : invalid argument in __virGetDomain
libvir: Xen Store error : out of memory
tcp_response: no-op (AUTH_NONE)
tcp_challenge: no-op (AUTH_NONE)
Rebooting domain axqa03_2...
[[ XML Domain Info ]]
<domain type='xen'>
  <name>axqa03_2</name>
  <uuid>1732aae45a110676113df9e7da458b61</uuid>
  <os>
    <type>linux</type>
    <kernel>/var/lib/xen/boot/vmlinuz-2.6.18-52.el5xen</kernel>
    <initrd>/var/lib/xen/boot/initrd_sr-2.6.18-52.el5xen.img</initrd>
  </os>
  <currentMemory>366592</currentMemory>
  <memory>366592</memory>
  <vcpu>2</vcpu>
  <on_poweroff>destroy</on_poweroff>
  <on_reboot>restart</on_reboot>
  <on_crash>restart</on_crash>
  <devices>
    <disk type='block' device='disk'>
      <driver name='phy'/>
      <source dev='sds'/>
      <target dev='sds'/>
    </disk>
    <disk type='file' device='disk'>
      <driver name='file'/>
      <source file='/var/lib/xen/images/axqa03_2.localdisk.dd'/>
      <target dev='sda'/>
    </disk>
    <interface type='bridge'>
      <mac address='aa:00:00:00:00:12'/>
      <source bridge='xenbr0'/>
    </interface>
    <interface type='bridge'>
      <mac address='00:16:3e:43:90:d2'/>
      <source bridge='xenbr1'/>
    </interface>
    <console/>
  </devices>
</domain>

[[ XML END ]]
Virtual machine is Linux
Unlinkiking os block
[[ XML Domain Info (modified) ]]
<?xml version="1.0"?>
<domain type="xen">
  <name>axqa03_2</name>
  <uuid>1732aae45a110676113df9e7da458b61</uuid>
  <currentMemory>366592</currentMemory>
  <memory>366592</memory>
  <vcpu>2</vcpu>
  <on_poweroff>destroy</on_poweroff>
  <on_reboot>restart</on_reboot>
  <on_crash>restart</on_crash>
  <devices>
    <disk type="block" device="disk">
      <driver name="phy"/>
      <source dev="sds"/>
      <target dev="sds"/>
    </disk>
    <disk type="file" device="disk">
      <driver name="file"/>
      <source file="/var/lib/xen/images/axqa03_2.localdisk.dd"/>
      <target dev="sda"/>
    </disk>
    <interface type="bridge">
      <mac address="aa:00:00:00:00:12"/>
      <source bridge="xenbr0"/>
    </interface>
    <interface type="bridge">
      <mac address="00:16:3e:43:90:d2"/>
      <source bridge="xenbr1"/>
    </interface>
    <console/>
  </devices>
</domain>

[[ XML END ]]
[REBOOT] Calling virDomainDestroy
virDomainDestroy() failed: -1
Sending response to caller...

libvir: Xen Daemon error : GET operation failed:
Domain                   UUID                                 Owner State
------                   ----                                 ----- -----
axqa03_2                 cb165cce-1798-daf9-1252-12a2347a9fc7 00002 00002
Domain-0                 00000000-0000-0000-0000-000000000000 00002 00001
Storing axqa03_2

on axqa01_1:

Domain                   UUID                                 Owner State
------                   ----                                 ----- -----
axqa03_1                 8f89affa-4330-d281-9622-98665e4816c2 00001 00002
Domain-0                 00000000-0000-0000-0000-000000000000 00001 00001
Storing axqa03_1
Domain                   UUID                                 Owner State
------                   ----                                 ----- -----
axqa03_1                 8f89affa-4330-d281-9622-98665e4816c2 00001 00002
Domain-0                 00000000-0000-0000-0000-000000000000 00001 00001
Storing axqa03_1
Request to fence: axqa03_2
Evaluating Domain: axqa03_2   Last Owner: 2   State 2
Domain                   UUID                                 Owner State
------                   ----                                 ----- -----
axqa03_1                 8f89affa-4330-d281-9622-98665e4816c2 00001 00002
Domain-0                 00000000-0000-0000-0000-000000000000 00001 00001
Storing axqa03_1
Domain                   UUID                                 Owner State

Any ideas?

Marc.

>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-- 
Gruss / Regards,

Marc Grimme
http://www.atix.de/               http://www.open-sharedroot.org/


From gordan at bobich.net  Fri Oct 19 07:26:53 2007
From: gordan at bobich.net (Gordan Bobic)
Date: Fri, 19 Oct 2007 03:26:53 -0400 (EDT)
Subject: [Linux-cluster] RHEL5 sharedroot clusters
In-Reply-To: <1192746242.2849.58.camel@localhost.localdomain>
References: <200710111725.59227.hlawatschek@atix.de>
	<1192746242.2849.58.camel@localhost.localdomain>
Message-ID: <Pine.LNX.4.64.0710190325400.27608@outpost.shatteredsilicon.net>

On Thu, 18 Oct 2007, Kevin Anderson wrote:

> Mark,
>
> When I run the mkinitrd command, I get an error that comoonics-cs rpm
> not found.  When I try to yum install it, I get dependency errors on
> perl(XML::DOM), perl(XML::XQL) and perl(XML::XQL::DOM)  are needed by
> the package.  Which rpm are those commands in?
>
> BTW - am trying to set this up in a xen guest.  Will see how far I get.

yum is your friend. Use yum to install it, it'll resolve all the 
dependencies for you.

But if I had to hazard a guess, you'd want perl-XML-DOM, perl-XML-ZQL and 
perl-XML-XQL-DOM packages.

Gordan


From grimme at atix.de  Fri Oct 19 12:09:59 2007
From: grimme at atix.de (Marc Grimme)
Date: Fri, 19 Oct 2007 14:09:59 +0200
Subject: [Linux-cluster] XEN and Cluster Questions
In-Reply-To: <200710190842.03026.grimme@atix.de>
References: <200710171141.03724.grimme@atix.de>
	<1192737634.10256.7.camel@ayanami.boston.devel.redhat.com>
	<200710190842.03026.grimme@atix.de>
Message-ID: <200710191410.01042.grimme@atix.de>

Hello,
positive short update on this topic.

I figured out that tdbs are not a good idea to be shared and also not a good 
source to reside on GFS. When I make /var/lib/xenstored hostdependent and 
also mount a local filesystem (ext3) underneath.
Everything works.
1. Now I can *LIVE* migrate a sharedroot clusternode DomU form one Dom0 to the 
other
2. Now I can fence from DomU another DomU running on another Dom0.
That rocks!!

BTW concerning TDBs:
I had to do the same when I configured a samba cluster (some time ago) also 
using tdbs. There the tdbs where only cache files. So I didn't care. But now 
the tdbs seem to be somehow important.
Does anybody now why tdbs are not to be hosted on GFS?
It works but is very very slow.

Marc.
On Friday 19 October 2007 08:42:02 Marc Grimme wrote:
> On Thursday 18 October 2007 22:00:33 Lon Hohberger wrote:
> > On Wed, 2007-10-17 at 11:41 +0200, Marc Grimme wrote:
> > > Hello,
> > > we are currently discussing XEN with clustering support.
> > > There came some questions we are not sure what the answer is. Perhaps
> > > you can help ;-) .
> > >
> > > Background is: We are discussing a group of XEN Dom0 Hosts sharing all
> > > devices and files via GFS. They themselves again host a couple of
> > > virtually redhat-clustered DomU Hosts with or without gfs.
> > >
> > > 1. Live Migration of cluster DomU nodes:
> > > When I live migrate a virtual DomU clusternode to another DOM0 XEN Host
> > > the migration works ;-) , but the virtual clusternode is thrown out of
> > > the cluster. Is this a "works as designed"? I think the problem are the
> > > heartbeats not coming in proper time.
> > > Does that lead to the conclusion that one cannot live migrate cluster
> > > nodes?
> >
> > Depends.  If you're using rgmanager to do migration, the migration is
> > actually not live.  In order to do live migration,
> > change /usr/share/cluster/vm.sh...
> >
> >   - where it says 'xm migrate ...'
> >   - change it to 'xm migrate -l ...'
>
> Ok got it.
> Still did you try to live migrate a cluster node?
>
> > That should enable live migration.
> >
> > > 2. Fencing:
> > > How about fencing of the virtual Dom-U Clusternodes. You are never sure
> > > on which Dom-0 Node runs our Dom-U Clusternode. Is the fencing via
> > > fence_xvm[d] supported on such an environment? That means how does a
> > > virtual DomU clusternode X running on Dom0 Xen Host x know that if
> > > virtual DomU clusternode Y running on Dom0 Xen Host y is running there
> > > when it is getting the fence request to fence Host y where it is
> > > running?
> >
> > Yes.  Fence_xvmd is designed (specifically) to handle the case where the
> > dom0 hosting a particular domU is not known.  Note that this only works
> > on RHEL5 with openais and such; fence_xvmd uses AIS checkpoints to store
> > virtual machine locations.
> >
> > Notes:
> > * the parent dom0 cluster still needs fencing, too :)
>
> Yes. Thats in place. Check.
>
> > * do not mix domU and dom0 in the same cluster,
>
> I didn't. Check.
>
> > * all domUs within a dom0 cluster must have different domain names,
>
> Ups. hostname -d on dom0 and hostname -d on domu need to be different?
> What if they are empty?
> Or do you mean some other domainname?
> Dom0:
> [root at axqa01_2 ~]# hostname -d
> [root at axqa01_2 ~]#
> DomU:
> [root at axqa03_1 ~]# hostname -d
> cc.atix
>
> > * do *not* reuse /etc/xen/fence_xvm.key between multiple dom0 clusters
>
> I just did not use it.
> Dom0:
> [root at axqa01_2 ~]# ps ax | grep [f]ence_xvmd
>  1932 pts/1    S+     0:00 fence_xvmd -ddddd -f -c none -C none
> So on axqa01_2 runs axqa03_2 and on axqa01_1 runs axqa03_1
> Then when I do a
> ./fence_xvm -ddddd -C none -c none -H axqa03_2 on axqa03_1 I get the
> following:
> Waiting for response
> Received 264 bytes
> Adding IP 127.0.0.1 to list (family 2)
> Adding IP 10.1.2.1 to list (family 2)
> Adding IP 192.168.10.40 to list (family 2)
> Adding IP 192.168.122.1 to list (family 2)
> Closing Netlink connection
> ipv4_listen: Setting up ipv4 listen socket
> ipv4_listen: Success; fd = 3
> Setting up ipv4 multicast send (225.0.0.12:1229)
> Joining IP Multicast group (pass 1)
> Joining IP Multicast group (pass 2)
> Setting TTL to 2 for fd4
> ipv4_send_sk: success, fd = 4
> sign_request: no-op (HASH_NONE)
> Sending to 225.0.0.12 via 127.0.0.1
> Setting up ipv4 multicast send (225.0.0.12:1229)
> Joining IP Multicast group (pass 1)
> Joining IP Multicast group (pass 2)
> Setting TTL to 2 for fd4
> ipv4_send_sk: success, fd = 4
> sign_request: no-op (HASH_NONE)
> Sending to 225.0.0.12 via 10.1.2.1
> Setting up ipv4 multicast send (225.0.0.12:1229)
> Joining IP Multicast group (pass 1)
> Joining IP Multicast group (pass 2)
> Setting TTL to 2 for fd4
> ipv4_send_sk: success, fd = 4
> sign_request: no-op (HASH_NONE)
> Sending to 225.0.0.12 via 192.168.10.40
> Setting up ipv4 multicast send (225.0.0.12:1229)
> Joining IP Multicast group (pass 1)
> Joining IP Multicast group (pass 2)
> Setting TTL to 2 for fd4
> ipv4_send_sk: success, fd = 4
> sign_request: no-op (HASH_NONE)
> Sending to 225.0.0.12 via 192.168.122.1
> Waiting for connection from XVM host daemon.
> Issuing TCP challenge
> tcp_challenge: no-op (AUTH_NONE)
> Responding to TCP challenge
> tcp_response: no-op (AUTH_NONE)
> TCP Exchange + Authentication done...
> Waiting for return value from XVM host
> Remote: Operation failed
>
> on axqa01_2:
> ------                   ----                                 ----- -----
> axqa03_2                 cb165cce-1798-daf9-1252-12a2347a9fc7 00002 00002
> Domain-0                 00000000-0000-0000-0000-000000000000 00002 00001
> Storing axqa03_2
> libvir: Xen Daemon error : GET operation failed:
> Domain                   UUID                                 Owner State
> ------                   ----                                 ----- -----
> axqa03_2                 cb165cce-1798-daf9-1252-12a2347a9fc7 00002 00002
> Domain-0                 00000000-0000-0000-0000-000000000000 00002 00001
> Storing axqa03_2
> Request to fence: axqa03_2
> axqa03_2 is running locally
> Plain TCP request
> libvir: Xen Daemon error : GET operation failed:
> libvir: error : invalid argument in __virGetDomain
> libvir: Xen Store error : out of memory
> tcp_response: no-op (AUTH_NONE)
> tcp_challenge: no-op (AUTH_NONE)
> Rebooting domain axqa03_2...
> [[ XML Domain Info ]]
> <domain type='xen'>
>   <name>axqa03_2</name>
>   <uuid>1732aae45a110676113df9e7da458b61</uuid>
>   <os>
>     <type>linux</type>
>     <kernel>/var/lib/xen/boot/vmlinuz-2.6.18-52.el5xen</kernel>
>     <initrd>/var/lib/xen/boot/initrd_sr-2.6.18-52.el5xen.img</initrd>
>   </os>
>   <currentMemory>366592</currentMemory>
>   <memory>366592</memory>
>   <vcpu>2</vcpu>
>   <on_poweroff>destroy</on_poweroff>
>   <on_reboot>restart</on_reboot>
>   <on_crash>restart</on_crash>
>   <devices>
>     <disk type='block' device='disk'>
>       <driver name='phy'/>
>       <source dev='sds'/>
>       <target dev='sds'/>
>     </disk>
>     <disk type='file' device='disk'>
>       <driver name='file'/>
>       <source file='/var/lib/xen/images/axqa03_2.localdisk.dd'/>
>       <target dev='sda'/>
>     </disk>
>     <interface type='bridge'>
>       <mac address='aa:00:00:00:00:12'/>
>       <source bridge='xenbr0'/>
>     </interface>
>     <interface type='bridge'>
>       <mac address='00:16:3e:43:90:d2'/>
>       <source bridge='xenbr1'/>
>     </interface>
>     <console/>
>   </devices>
> </domain>
>
> [[ XML END ]]
> Virtual machine is Linux
> Unlinkiking os block
> [[ XML Domain Info (modified) ]]
> <?xml version="1.0"?>
> <domain type="xen">
>   <name>axqa03_2</name>
>   <uuid>1732aae45a110676113df9e7da458b61</uuid>
>   <currentMemory>366592</currentMemory>
>   <memory>366592</memory>
>   <vcpu>2</vcpu>
>   <on_poweroff>destroy</on_poweroff>
>   <on_reboot>restart</on_reboot>
>   <on_crash>restart</on_crash>
>   <devices>
>     <disk type="block" device="disk">
>       <driver name="phy"/>
>       <source dev="sds"/>
>       <target dev="sds"/>
>     </disk>
>     <disk type="file" device="disk">
>       <driver name="file"/>
>       <source file="/var/lib/xen/images/axqa03_2.localdisk.dd"/>
>       <target dev="sda"/>
>     </disk>
>     <interface type="bridge">
>       <mac address="aa:00:00:00:00:12"/>
>       <source bridge="xenbr0"/>
>     </interface>
>     <interface type="bridge">
>       <mac address="00:16:3e:43:90:d2"/>
>       <source bridge="xenbr1"/>
>     </interface>
>     <console/>
>   </devices>
> </domain>
>
> [[ XML END ]]
> [REBOOT] Calling virDomainDestroy
> virDomainDestroy() failed: -1
> Sending response to caller...
>
> libvir: Xen Daemon error : GET operation failed:
> Domain                   UUID                                 Owner State
> ------                   ----                                 ----- -----
> axqa03_2                 cb165cce-1798-daf9-1252-12a2347a9fc7 00002 00002
> Domain-0                 00000000-0000-0000-0000-000000000000 00002 00001
> Storing axqa03_2
>
> on axqa01_1:
>
> Domain                   UUID                                 Owner State
> ------                   ----                                 ----- -----
> axqa03_1                 8f89affa-4330-d281-9622-98665e4816c2 00001 00002
> Domain-0                 00000000-0000-0000-0000-000000000000 00001 00001
> Storing axqa03_1
> Domain                   UUID                                 Owner State
> ------                   ----                                 ----- -----
> axqa03_1                 8f89affa-4330-d281-9622-98665e4816c2 00001 00002
> Domain-0                 00000000-0000-0000-0000-000000000000 00001 00001
> Storing axqa03_1
> Request to fence: axqa03_2
> Evaluating Domain: axqa03_2   Last Owner: 2   State 2
> Domain                   UUID                                 Owner State
> ------                   ----                                 ----- -----
> axqa03_1                 8f89affa-4330-d281-9622-98665e4816c2 00001 00002
> Domain-0                 00000000-0000-0000-0000-000000000000 00001 00001
> Storing axqa03_1
> Domain                   UUID                                 Owner State
>
> Any ideas?
>
> Marc.
>
> > -- Lon
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Gruss / Regards,
>
> Marc Grimme
> http://www.atix.de/               http://www.open-sharedroot.org/
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Marc Grimme
http://www.atix.de/               http://www.open-sharedroot.org/


From teigland at redhat.com  Fri Oct 19 14:33:28 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 19 Oct 2007 09:33:28 -0500
Subject: R: [Linux-cluster] RHEL5 clvmd hangs only after a node crashes...
In-Reply-To: <A8ED7CBAB5254E4EA503653480601CF101291A29@FBCMST09V03.fbc.local>
References: <FBCMMO02fxUG0DpWhkR000282f5@FBCMMO02.fbc.local>
	<20071018141200.GC1863@redhat.com>
	<A8ED7CBAB5254E4EA503653480601CF101291A29@FBCMST09V03.fbc.local>
Message-ID: <20071019143328.GA29574@redhat.com>

On Thu, Oct 18, 2007 at 08:05:02PM +0200, tam_annie at alice.it wrote:

> So, without qdisk my cluster just behaves exactly as I ever wanted! So
> is qdisk evil?  However, please, be so kind to explain this strange
> behavor to me, that's really a _MYSTERY_: Isn't the qdisk a tie-breaker
> useful especially in two node clusters and if one needs to decide
> cluster membership also on the basis of exogenous heuristics (like
> network connectivity as in my case, see RedHat Cluster FAQ)?  Shouldn't
> the qdisk allow one to build a more robust cluster against split-brain
> conditions?  Why does my cluster behave good only if I avoid using
> qdisk?  Have I really no more chance to use a quorum disk in my cluster
> architecture?  If it can help, I'd like to tell you that when I start my
> cluster with qdisk enabled, both nodes wait for each other on "Starting
> fencing..." before going on in the boot sequence: no node can boot alone
> while the other one is down.  That doesn't happen when I don't use
> qdisk, as you told me.  Again, thank you very very much indeed!

qdisk doesn't work well in RHEL5.0; I'm told it will work in 5.1

Dave


From mpartio at gmail.com  Fri Oct 19 14:38:44 2007
From: mpartio at gmail.com (Mikko Partio)
Date: Fri, 19 Oct 2007 17:38:44 +0300
Subject: R: [Linux-cluster] RHEL5 clvmd hangs only after a node crashes...
In-Reply-To: <20071019143328.GA29574@redhat.com>
References: <FBCMMO02fxUG0DpWhkR000282f5@FBCMMO02.fbc.local>
	<20071018141200.GC1863@redhat.com>
	<A8ED7CBAB5254E4EA503653480601CF101291A29@FBCMST09V03.fbc.local>
	<20071019143328.GA29574@redhat.com>
Message-ID: <2ca799770710190738p3b6652deu7620e074a10c0695@mail.gmail.com>

On 10/19/07, David Teigland <teigland at redhat.com> wrote:
>
> On Thu, Oct 18, 2007 at 08:05:02PM +0200, tam_annie at alice.it wrote:
>
> > So, without qdisk my cluster just behaves exactly as I ever wanted! So
> > is qdisk evil?  However, please, be so kind to explain this strange
> > behavor to me, that's really a _MYSTERY_: Isn't the qdisk a tie-breaker
> > useful especially in two node clusters and if one needs to decide
> > cluster membership also on the basis of exogenous heuristics (like
> > network connectivity as in my case, see RedHat Cluster FAQ)?  Shouldn't
> > the qdisk allow one to build a more robust cluster against split-brain
> > conditions?  Why does my cluster behave good only if I avoid using
> > qdisk?  Have I really no more chance to use a quorum disk in my cluster
> > architecture?  If it can help, I'd like to tell you that when I start my
> > cluster with qdisk enabled, both nodes wait for each other on "Starting
> > fencing..." before going on in the boot sequence: no node can boot alone
> > while the other one is down.  That doesn't happen when I don't use
> > qdisk, as you told me.  Again, thank you very very much indeed!
>
> qdisk doesn't work well in RHEL5.0; I'm told it will work in 5.1
>
> Dave


Is there a date set for the release?

M
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071019/983cdfbd/attachment.htm>

From gwaycock at novolink.net  Fri Oct 19 14:53:54 2007
From: gwaycock at novolink.net (Glenn Aycock)
Date: Fri, 19 Oct 2007 14:53:54 -0000
Subject: [Linux-cluster] Active-Active configuration of arbitrary services
Message-ID: <E55DBA9DAB6FB04EAE75D6BAAF3309580190AF12@hou-exch01.novolink.com>

We are running RHCS on RHEL 4.5 and have a basic 2-node HA cluster
configuration for a critical application in place and functional. The
config looks like this:

 
<?xml version="1.0"?>
<cluster config_version="16" name="routing_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="10"/>
        <clusternodes>
                <clusternode name="host1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="manual"
nodename="host1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="host2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="manual"
nodename="host2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman dead_node_timeout="10" expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_manual" name="manual"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="routing_servers"
ordered="1" restricted="1">
                                <failoverdomainnode name="host1"
priority="1"/>
                                <failoverdomainnode name="host2"
priority="2"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <script file="/etc/init.d/rsd" name="rsd"/>
                        <ip address="123.456.78.9" monitor_link="1"/>
                </resources>
                <service autostart="1" domain="routing_servers"
name="routing_daemon" recovery="relocate">
                        <ip ref="123.456.78.9"/>
                        <script ref="rsd"/>
                </service>
        </rm>
</cluster>

 
The cluster takes about 15-20 seconds to notice that the daemon is down
and migrate it to the other node. However, due to slow migration and
startup time, we now require the daemon on the secondary to be active
and only transfer the VIP in case it aborts on the primary. 

 
I have found the cluster suite documentation to be lacking, particularly
in its reliance on GUI configuration tools without reference or
explanation to the underlying command-line tools. I managed to figure
out most of it by myself (we don't run GUIs on servers), but I'm not
sure how to keep both services active, using the init script for status,
while only moving the VIP. Is this possible? Am I overlooking the
obvious? 

 
Thanks in advance for suggestions/help!

 
Glenn

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071019/99b0753f/attachment.htm>

From rohara at redhat.com  Fri Oct 19 15:59:02 2007
From: rohara at redhat.com (Ryan O'Hara)
Date: Fri, 19 Oct 2007 10:59:02 -0500
Subject: [Linux-cluster] CLVM and SCSI-3 Reservations
In-Reply-To: <C776378855970A4DADE4A476447F6391F159ED@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391F159ED@NAMAIL3.ad.lsil.com>
Message-ID: <4718D446.108@redhat.com>

Sadek, Abdel wrote:
> Is CLVM a requirement for SCSI-3 Persistent Reservations to work on a 
> RHEL 5.0 native cluster?

Yes.

> I have a 2-node cluster. If I build my file system directly on the 
> devices /dev/sdb /dev/sdd etc.., there are no Persistent Reservations 
> being established on my storage array. If use LVM2 to create logical 
> volumes and build my FS on top of them, I see the persistent 
> reservations being established.

The scsi_reserve script looks for clvm volumes, then discovers the 
devices in those volumes. If you do not have a clvm volume, then you 
won't find any devices and thus you won't see any reservations.

Ryan


From maciej.bogucki at artegence.com  Fri Oct 19 16:21:35 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Fri, 19 Oct 2007 18:21:35 +0200
Subject: [Linux-cluster] CLVM and SCSI-3 Reservations
In-Reply-To: <4718D446.108@redhat.com>
References: <C776378855970A4DADE4A476447F6391F159ED@NAMAIL3.ad.lsil.com>
	<4718D446.108@redhat.com>
Message-ID: <4718D98F.7060503@artegence.com>

Ryan O'Hara napisa?(a):
> Sadek, Abdel wrote:
>> Is CLVM a requirement for SCSI-3 Persistent Reservations to work on a
>> RHEL 5.0 native cluster?
> 
> Yes.

Yes because CLVM is needed for GFS filesystem, but no if You want to use
it without clustert filesystem.

Best Regards
Maciej Bogucki


From beres.laszlo at sys-admin.hu  Fri Oct 19 16:38:20 2007
From: beres.laszlo at sys-admin.hu (BERES Laszlo)
Date: Fri, 19 Oct 2007 18:38:20 +0200
Subject: [Linux-cluster] CLVM and SCSI-3 Reservations
In-Reply-To: <4718D98F.7060503@artegence.com>
References: <C776378855970A4DADE4A476447F6391F159ED@NAMAIL3.ad.lsil.com>	<4718D446.108@redhat.com>
	<4718D98F.7060503@artegence.com>
Message-ID: <4718DD7C.3040606@sys-admin.hu>

Maciej Bogucki wrote:

> Yes because CLVM is needed for GFS filesystem, but no if You want to use
> it without clustert filesystem.

By the way: is it supported to do scsi reservation with cciss driver?
Because I have the issue below:

[root at devel ~]# sg_persist -d /dev/cciss/c1d0 -o -G -S 7F000001
inquiry: pass through os error: Inappropriate ioctl for device
sg_persist: /dev/cciss/c1d0 doesn't respond to a SCSI INQUIRY

-- 
B?RES L?szl? RHCE, RHCX
senior IT engineer, trainer


From lhh at redhat.com  Fri Oct 19 19:16:01 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 19 Oct 2007 15:16:01 -0400
Subject: [Linux-cluster] Active-Active configuration of arbitrary services
In-Reply-To: <E55DBA9DAB6FB04EAE75D6BAAF3309580190AF12@hou-exch01.novolink.com>
References: <E55DBA9DAB6FB04EAE75D6BAAF3309580190AF12@hou-exch01.novolink.com>
Message-ID: <1192821361.10256.52.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-19 at 14:53 +0000, Glenn Aycock wrote:
> We are running RHCS on RHEL 4.5 and have a basic 2-node HA cluster
> configuration for a critical application in place and functional. The
> config looks like this:

> <?xml version="1.0"?>
> <cluster config_version="16" name="routing_cluster">
>         <fence_daemon post_fail_delay="0" post_join_delay="10"/>
>         <clusternodes>
>                 <clusternode name="host1" votes="1">
>                         <fence>
>                                 <method name="1">
>                                         <device name="manual" nodename="host1"/>
>                                 </method>
>                         </fence>
>                 </clusternode>
>                 <clusternode name="host2" votes="1">
>                         <fence>
>                                 <method name="1">
>                                         <device name="manual" nodename="host2"/>
>                                 </method>
>                         </fence>
>                 </clusternode>
>         </clusternodes>
>         <cman dead_node_timeout="10" expected_votes="1" two_node="1"/>
>         <fencedevices>
>                 <fencedevice agent="fence_manual" name="manual"/>
>         </fencedevices>
>         <rm>
>                 <failoverdomains>
>                         <failoverdomain name="routing_servers" ordered="1" restricted="1">
>                                 <failoverdomainnode name="host1" priority="1"/>
>                                 <failoverdomainnode name="host2" priority="2"/>
>                         </failoverdomain>
>                 </failoverdomains>
>                 <resources>
>                         <script file="/etc/init.d/rsd" name="rsd"/>
>                         <ip address="123.456.78.9" monitor_link="1"/>
>                 </resources>
>                 <service autostart="1" domain="routing_servers" name="routing_daemon" recovery="relocate">
>                         <ip ref="123.456.78.9"/>
>                         <script ref="rsd"/>
>                 </service>
>         </rm>
> </cluster>

> The cluster takes about 15-20 seconds to notice that the daemon is
> down and migrate it to the other node. However, due to slow migration
> and startup time, we now require the daemon on the secondary to be
> active and only transfer the VIP in case it aborts on the primary. 

You could start by decreasing the 'status check' time by
tweaking /usr/share/cluster/script.sh "status" interval:

        <action name="status" interval="30s" timeout="0"/>
        <action name="monitor" interval="30s" timeout="0"/>

Change to:
       
        <action name="status" interval="10s" timeout="0"/>
        <action name="monitor" interval="10s" timeout="0"/>

(as an example...)

You can also make a wrapper script which doesn't do the stop phase of
your rsd script unless it's already in a non-working state (to prevent
stop-before-start that rgmanager normally does):

#!/bin/bash

SCR=/etc/init.d/rsd

case $1 in
start)
    # Should be a no-op if already running
    $SCR start 
    exit $?
    ;;
stop)
    # Don't actually stop it if it's running; just
    # clean it up if it's broken.  This app is 
    # safe to run on multiple nodes
    $SCR status
    if [ $? -ne 0 ]; then
        $SCR stop
        exit $?
    fi
    exit 0
    ;;
status)
    $SCR status
    exit $?
    ;;
esac

exit 0

(Note: rsd will have to be enabled on boot for this to work).

-- Lon


From jgray at nicusa.com  Sat Oct 20 19:59:42 2007
From: jgray at nicusa.com (Josh Gray)
Date: Sat, 20 Oct 2007 13:59:42 -0600
Subject: [Linux-cluster] Email alerts on cluster events?
Message-ID: <C33FBA4E.10CB85%jgray@nicusa.com>

Does anyone have any scripts or other method of sending an alert on various
events in the cluster like in/out of quorate, services
stopping/starting/relocating etc..   I guess you could do this with a syslog
monitoring tool, but something more elegant would be better for my needs.


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From jgray at nicusa.com  Sat Oct 20 22:51:21 2007
From: jgray at nicusa.com (Josh Gray)
Date: Sat, 20 Oct 2007 16:51:21 -0600
Subject: [Linux-cluster] clurgmgrd doesn't work with quorum disk
Message-ID: <C33FE289.10CBA2%jgray@nicusa.com>

Borrowing the subject line on this from a thread last month.

I've encountered this issue as well with RHEL 5.0 - adding a quorum disk and
clurgmgrd not working right.    Everything I read says it's fixed in the
beta release.   Are there any other options to get this to work while trying
to stay with stable releases?


-- 
Josh Gray
Systems Administrator
NIC, Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"Talent is cheaper than table salt.
 What separates the talented individual
 from the successful one is a lot of hard work."
- Stephen King


From rottmann at atix.de  Mon Oct 22 07:15:14 2007
From: rottmann at atix.de (Reiner Rottmann)
Date: Mon, 22 Oct 2007 09:15:14 +0200
Subject: [Linux-cluster] Quorum Disk: mkqdisk shows "Host: (none)". What
	does that mean?
Message-ID: <200710220915.18932.rottmann@atix.de>

Hello,

I configured quorum disk for a two node cluster and get "Host: (none)" as 
output from mkqdisk. What does that mean?

Details:

cman_tool shows that the quorum disk is active and contributes its vote:

# cman_tool nodes
Node  Votes Exp Sts  Name
   0    1    0   M   /dev/sdh
   1    1    3   M   axqa02rc_1
   2    1    3   M   axqa02rc_2

When I  Display information on all accessible cluster quorum disks (-L option 
for mkqdisk), it shows "Host: (none):

# mkqdisk -L
mkqdisk v0.5.1
/dev/sdh:
        Magic:   eb7a62c2
        Label:   quorum
        Created: Fri Oct 19 15:57:19 2007
        Host:    (none)

Now I wonder what this means exactly. I expected that this entry should 
display the host that wrote to the quorum disk last time. Or maybe the 
designated master node?

I read the accompanying documentation for mkqdisk, qdisk in general and cman 
but did not find any clues regarding this entry.

Could someone enlighten me?

Best Regards,
Reiner Rottmann

-- 
Gruss / Regards,

Dipl.-Ing. (FH) Reiner Rottmann

Phone: +49-89 452 3538-12

http://www.atix.de/
http://open-sharedroot.org/

https://www.xing.com/profile/Reiner_Rottmann

PGP Key ID: 0xCA67C5A6
PGP Key Fingerprint = BF59FF006360B6E8D48F26B10D9F5A84CA67C5A6

Visit us at LinuxWorld Conference & Expo 
31.10. - 01.11.2007 in Jaarbeurs Utrecht - The Netherlands
ATIX stand: Hall 9 / B 005

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax: ? +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand: 
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part.
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071022/9003c023/attachment.sig>

From mpartio at gmail.com  Mon Oct 22 13:39:25 2007
From: mpartio at gmail.com (Mikko Partio)
Date: Mon, 22 Oct 2007 16:39:25 +0300
Subject: [Linux-cluster] cmirror and RHEL 5
Message-ID: <2ca799770710220639pad02b0au53a671d3c3782714@mail.gmail.com>

Hello list

does RHEL 5 support cmirror (creating lvm mirrored volumes in a cluster
environment)? I can't find any rpm packages for it. My goal is to increase
redundancy by mirroring data between two sans.

Regards

M
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071022/2ee75425/attachment.htm>

From mpartio at gmail.com  Mon Oct 22 13:41:58 2007
From: mpartio at gmail.com (Mikko Partio)
Date: Mon, 22 Oct 2007 16:41:58 +0300
Subject: [Linux-cluster] Re: Two-node cluster disconnecting
In-Reply-To: <2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com>
	<2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
Message-ID: <2ca799770710220641w52c923a4qe7f9777699fb0722@mail.gmail.com>

On 10/16/07, Mikko Partio <mpartio at gmail.com> wrote:
>
>
>
> On 10/11/07, Mikko Partio <mpartio at gmail.com> wrote:
> >
> > Hello list
> >
> > I have a problem with a two-node cluster going split-brain. When I first
> > boot the other node, it correctly starts all the services and informs that
> > cluster is quorate. Then when I boot the other node, on the boot phase
> > when it starts the cluster software it does not find the node already
> > running and starts the same services already running on node 1! When the
> > boot is complete I can see that the nodes have found each other for a small
> > period of time but then immediately disconnect from each other. The
> > cluster is created with Conga with shared disk support though no shared
> > disks are created yet. This is on CentOS 5.
> >
> > cluster.conf:
>
>
> Anybody got a clue? Also, should I be configuring ais.conf also? There is
> no mention about it in the docs but if cman is using openais, I'd guess it
> should be configured with the correct addresses right?
>
> Regards
>
> M
>

To finish up this monologue; one should not configure openais
(/etc/ais/openais.conf) nor start it at boot time. Also make sure your
multicast packets are coming through (pinging 224.0.0.1 is not enough).

Regards

M
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071022/1b2aebc2/attachment.htm>

From christopher.barry at qlogic.com  Mon Oct 22 14:22:34 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Mon, 22 Oct 2007 09:22:34 -0500
Subject: [Linux-cluster] Re: Two-node cluster disconnecting
References: <2ca799770710102139u38aacd61lb0590e8f2658d1fc@mail.gmail.com><2ca799770710160013p409005ecie76a6a59c37be0d@mail.gmail.com>
	<2ca799770710220641w52c923a4qe7f9777699fb0722@mail.gmail.com>
Message-ID: <D158540CCC0AB54C8FD4818F823CCB2453AC30@EPEXCH1.qlogic.org>

-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Mikko Partio
Sent: Mon 10/22/2007 9:41 AM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Re: Two-node cluster disconnecting
 
On 10/16/07, Mikko Partio <mpartio at gmail.com> wrote:
>
>
>
> On 10/11/07, Mikko Partio <mpartio at gmail.com> wrote:
> >
> > Hello list
> >
> > I have a problem with a two-node cluster going split-brain. When I first
> > boot the other node, it correctly starts all the services and informs that
> > cluster is quorate. Then when I boot the other node, on the boot phase
> > when it starts the cluster software it does not find the node already
> > running and starts the same services already running on node 1! When the
> > boot is complete I can see that the nodes have found each other for a small
> > period of time but then immediately disconnect from each other. The
> > cluster is created with Conga with shared disk support though no shared
> > disks are created yet. This is on CentOS 5.
> >
> > cluster.conf:
>
>
> Anybody got a clue? Also, should I be configuring ais.conf also? There is
> no mention about it in the docs but if cman is using openais, I'd guess it
> should be configured with the correct addresses right?
>
> Regards
>
> M
>

To finish up this monologue; one should not configure openais
(/etc/ais/openais.conf) nor start it at boot time. Also make sure your
multicast packets are coming through (pinging 224.0.0.1 is not enough).

Regards

M


Make sure you have correct /etc/hosts on both nodes, with all interfaces/names on your nodes defined.


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3511 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071022/daa93ca8/attachment.bin>

From lhh at redhat.com  Mon Oct 22 16:41:30 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 22 Oct 2007 12:41:30 -0400
Subject: [Linux-cluster] Quorum Disk: mkqdisk shows "Host: (none)".
	What does that mean?
In-Reply-To: <200710220915.18932.rottmann@atix.de>
References: <200710220915.18932.rottmann@atix.de>
Message-ID: <1193071290.7753.3.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-22 at 09:15 +0200, Reiner Rottmann wrote:
> Hello,
> 
> I configured quorum disk for a two node cluster and get "Host: (none)" as 
> output from mkqdisk. What does that mean?

It didn't write the hostname to the qdisk header block for some
reason.  

Nothing to worry about; the hostname in the header block is mostly for
debugging setups; it gives you one more point of comparison when
checking for the quorum partition on multiple nodes.

(There's also the timestamp + label which are far more important.)

-- Lon


From jgray at nicusa.com  Mon Oct 22 16:58:59 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 22 Oct 2007 10:58:59 -0600
Subject: [Linux-cluster] Re: clurgmgrd doesn't work with quorum disk
In-Reply-To: <C33FE289.10CBA2%jgray@nicusa.com>
Message-ID: <C34232F3.10CD8C%jgray@nicusa.com>

Sent this over the weekend,  curious if anyone has any insight into it.


On 10/20/07 4:51 PM, "Josh Gray" <jgray at nicusa.com> wrote:

> Borrowing the subject line on this from a thread last month.
> 
> I've encountered this issue as well with RHEL 5.0 - adding a quorum disk and
> clurgmgrd not working right.    Everything I read says it's fixed in the beta
> release.   Are there any other options to get this to work while trying to
> stay with stable releases?
> 
> 
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From lhh at redhat.com  Mon Oct 22 17:30:21 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 22 Oct 2007 13:30:21 -0400
Subject: [Linux-cluster] Re: clurgmgrd doesn't work with quorum disk
In-Reply-To: <C34232F3.10CD8C%jgray@nicusa.com>
References: <C34232F3.10CD8C%jgray@nicusa.com>
Message-ID: <1193074221.7753.33.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-22 at 10:58 -0600, Josh Gray wrote:
> Sent this over the weekend,  curious if anyone has any insight into it.
> 
> 
> On 10/20/07 4:51 PM, "Josh Gray" <jgray at nicusa.com> wrote:
> 
> > Borrowing the subject line on this from a thread last month.
> > 
> > I've encountered this issue as well with RHEL 5.0 - adding a quorum disk and
> > clurgmgrd not working right.    Everything I read says it's fixed in the beta
> > release.   Are there any other options to get this to work while trying to
> > stay with stable releases?

You'd probably be better off running the beta cluster bits than the
current alternative (e.g. rolling your own with patches).

I would just run without qdisk for now until we get 5.1 out, which
should not be too long.

-- Lon


From lhh at redhat.com  Mon Oct 22 17:38:45 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 22 Oct 2007 13:38:45 -0400
Subject: [Linux-cluster] Email alerts on cluster events?
In-Reply-To: <C33FBA4E.10CB85%jgray@nicusa.com>
References: <C33FBA4E.10CB85%jgray@nicusa.com>
Message-ID: <1193074725.7753.42.camel@ayanami.boston.devel.redhat.com>

On Sat, 2007-10-20 at 13:59 -0600, Josh Gray wrote:
> Does anyone have any scripts or other method of sending an alert on various
> events in the cluster like in/out of quorate, services
> stopping/starting/relocating etc..   I guess you could do this with a syslog
> monitoring tool, but something more elegant would be better for my needs.

I've actually got stuff in the cooker for this sort of thing, but I
hadn't added email to it:

https://bugzilla.redhat.com/show_bug.cgi?id=247772

(The bug name is misleading.)

The patch allows user-defined edge-triggered scripts for service and
node transitions.  Transition from quorate->inquorate however would be
more difficult (the transition master requires a DLM lock, which
requires quorum).

Additionally, it externalizes service/node event transition handling
into something that users can alter on the fly.  I'd be happy to work
with you on this to add a binding to the script language which would
then fire off emails.

-- Lon


From Alexandre.Racine at mhicc.org  Mon Oct 22 17:39:37 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Mon, 22 Oct 2007 13:39:37 -0400
Subject: [Linux-cluster] watchdog with GFS?
References: <D158540CCC0AB54C8FD4818F823CCB2453AC30@EPEXCH1.qlogic.org>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C353B@cumulonimbus.RG.local>

Hi all, 

I would need to have some good and bad points to use watchdog (the software Linux app) has a fencing device for GFS.

Does some of you have some experience with it? Let's just say that I want to use DRAC's witch are very nice, but some people needs to be over-convince sometimes.

Shoot everything your got :)


Thanks.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2507 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071022/6ed5a708/attachment.bin>

From jos at xos.nl  Mon Oct 22 17:43:31 2007
From: jos at xos.nl (Jos Vos)
Date: Mon, 22 Oct 2007 19:43:31 +0200
Subject: [Linux-cluster] Quorum disk votes not used when starting node?
In-Reply-To: <1192462652.27135.16.camel@ayanami.boston.devel.redhat.com>
References: <200710101856.l9AIuRWA008113@jasmine.xos.nl>
	<1192462652.27135.16.camel@ayanami.boston.devel.redhat.com>
Message-ID: <20071022174331.GB14244@jasmine.xos.nl>

On Mon, Oct 15, 2007 at 11:37:32AM -0400, Lon Hohberger wrote:

> On Wed, 2007-10-10 at 20:56 +0200, Jos Vos wrote:

> > Now, this all works fine, cman_tool shows what I expected and when I
> > remove the file /tmp/qdisk on a node, that node reboots instantaneously.
> > 
> > However, after the reboot, while the file tested in the heuristic does
> > still not exist, the node is joining the cluster again and starts some
> > cluster services!
> 
> Add stop_cman="1" to <quorumd>

Any other suggestions?  As I already replied, this didn't help (I'd
already tried that before asking the question).  Note that this is
on RHEL4 (with all updates), if that matters.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From randy.brown at noaa.gov  Mon Oct 22 18:30:56 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Mon, 22 Oct 2007 14:30:56 -0400
Subject: [Linux-cluster] problems with iscsi storage and boot order
Message-ID: <471CEC60.1040308@noaa.gov>

I have a two node cluster configured which is going to be used as a NAS 
head for our iscsi based storage.  I think I have it working, for the 
most part (still some fencing issues, but that will be in another 
post).  I am using Centos 5 and it's associated clustering software.

If I boot both nodes with none of the clustering components (cman, 
clvmd, gfs, or rgmanager) starting at boot, I can restart the iscsi 
service then start cman, clvmd, gfs, and rgmanger and everything works.  
The problem stems from the iscsi not making a successful connection to 
the storage when the iscsi service starts so clvmd doesn't see the 
volume group so gfs can't mount the filesystem.

I have tried moving things around in the boot order using chkconfig, 
i.e., moved iscsi to S68, cman to S70, clvmd to S72, and gfs to S74 at 
runlevels 2, 3, 4, and 5, but that didn't make any difference.

I was wondering if anyone else has experienced this or has any 
suggestions as to what I can do.  It's very frustrating and I don't want 
to have to manually start the clustering on each machine after a reboot.

I will gladly provide further information as required.

Thanks in advance,

Randy


From gordan at bobich.net  Mon Oct 22 18:35:36 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Mon, 22 Oct 2007 19:35:36 +0100 (BST)
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471CEC60.1040308@noaa.gov>
References: <471CEC60.1040308@noaa.gov>
Message-ID: <Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>

On Mon, 22 Oct 2007, Randy Brown wrote:

> If I boot both nodes with none of the clustering components (cman, clvmd, 
> gfs, or rgmanager) starting at boot, I can restart the iscsi service then 
> start cman, clvmd, gfs, and rgmanger and everything works.  The problem stems 
> from the iscsi not making a successful connection to the storage when the 
> iscsi service starts so clvmd doesn't see the volume group so gfs can't mount 
> the filesystem.

Up the number of connection retries in iscsi.conf.

> I have tried moving things around in the boot order using chkconfig, i.e., 
> moved iscsi to S68, cman to S70, clvmd to S72, and gfs to S74 at runlevels 2, 
> 3, 4, and 5, but that didn't make any difference.

Bad idea. I _strongly_ suggest you put the priorities back as whey were.

Gordan


From randy.brown at noaa.gov  Mon Oct 22 18:51:27 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Mon, 22 Oct 2007 14:51:27 -0400
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
Message-ID: <471CF12F.70107@noaa.gov>

Thanks, I'll try upping the retries.  I am assuming this is the same 
thing as increasing the time value here:

# To specify the time to wait for login to complete, edit the line.
# The value is in seconds and the default is 15 seconds.
node.conn[0].timeo.login_timeout = 25

I have put the priorities back too.   That was just for testing purposes.

Randy

gordan at bobich.net wrote:
> On Mon, 22 Oct 2007, Randy Brown wrote:
>
>> If I boot both nodes with none of the clustering components (cman, 
>> clvmd, gfs, or rgmanager) starting at boot, I can restart the iscsi 
>> service then start cman, clvmd, gfs, and rgmanger and everything 
>> works.  The problem stems from the iscsi not making a successful 
>> connection to the storage when the iscsi service starts so clvmd 
>> doesn't see the volume group so gfs can't mount the filesystem.
>
> Up the number of connection retries in iscsi.conf.
>
>> I have tried moving things around in the boot order using chkconfig, 
>> i.e., moved iscsi to S68, cman to S70, clvmd to S72, and gfs to S74 
>> at runlevels 2, 3, 4, and 5, but that didn't make any difference.
>
> Bad idea. I _strongly_ suggest you put the priorities back as whey were.
>
> Gordan
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From gordan at bobich.net  Mon Oct 22 18:53:33 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Mon, 22 Oct 2007 19:53:33 +0100 (BST)
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471CF12F.70107@noaa.gov>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
	<471CF12F.70107@noaa.gov>
Message-ID: <Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>

On Mon, 22 Oct 2007, Randy Brown wrote:

> Thanks, I'll try upping the retries.  I am assuming this is the same thing as 
> increasing the time value here:

No. Timeouts and retries are separate settings. The problem is usually 
that the iSCSI subsystems tries to access the SAN before the network has 
fully come up.

> #  To specify the time to wait for login to complete, edit the line.
> #  The value is in seconds and the default is 15 seconds.
> node.conn[0].timeo.login_timeout = 25

This is probably the setting you want to change:
node.session.initial_login_retry_max = 8

Gordan


From jgray at nicusa.com  Mon Oct 22 20:05:16 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 22 Oct 2007 14:05:16 -0600
Subject: [Linux-cluster] Cman tool leave remove, device or resource busy
Message-ID: <C3425E9C.10CDD8%jgray@nicusa.com>

I see in the FAQ the proper way to leave a cluster is with 'cman_tool leave
remove'  I am getting a  'device or resource busy' error when I do that even
with GFS mounts removed and stopped.   What am I missing?  The only service
I see up is a "fence" type called "default"  do I have to stop one more
thing?

Basically doing this:

service nfs stop
umount /export
service gfs stop
service rgmanager stop
service clvmd stop
cman_tool leave remove

Tells me this:
cman_tool: Error leaving cluster: Device or resource busy


Full snip from the server:

[root at nfs-6 ~]# cman_tool services
type             level name         id       state
fence            0     default      00010003 none
[1 2 3]
dlm              1     clvmd        00020003 none
[1 2 3]
dlm              1     rgmanager    00050003 none
[1 2 3]
dlm              1     nfs_data_vg  00040003 none
[1 2 3]
gfs              2     nfs_data_vg  00030003 none
[1 2 3]
[root at nfs-6 ~]# service nfs stop
Shutting down NFS mountd:                                  [  OK  ]
Shutting down NFS daemon:                                  [  OK  ]
Shutting down NFS quotas:                                  [  OK  ]
Shutting down NFS services:                                [  OK  ]
[root at nfs-6 ~]# umount /export
[root at nfs-6 ~]# service gfs stop
[root at nfs-6 ~]# service rgmanager stop
Shutting down Cluster Service Manager...
Waiting for services to stop:                              [  OK  ]
Cluster Service Manager is stopped.
[root at nfs-6 ~]# service clvmd stop
Deactivating VG nfs_data_vg:   0 logical volume(s) in volume group
"nfs_data_vg" now active
                                                           [  OK  ]
Stopping clvm:                                             [  OK  ]
[root at nfs-6 ~]# cman_tool leave remove
cman_tool: Error leaving cluster: Device or resource busy
[root at nfs-6 ~]# cman_tool services
type             level name     id       state
fence            0     default  00010003 none
[1 2 3]
[root at nfs-6 ~]# 


-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From randy.brown at noaa.gov  Mon Oct 22 20:25:44 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Mon, 22 Oct 2007 16:25:44 -0400
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
	<471CF12F.70107@noaa.gov>
	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
Message-ID: <471D0748.7040007@noaa.gov>

There isn't a line like that in my iscsi.conf file.  Can I simply add it?

Randy

gordan at bobich.net wrote:
> On Mon, 22 Oct 2007, Randy Brown wrote:
>
>> Thanks, I'll try upping the retries.  I am assuming this is the same 
>> thing as increasing the time value here:
>
> No. Timeouts and retries are separate settings. The problem is usually 
> that the iSCSI subsystems tries to access the SAN before the network 
> has fully come up.
>
>> #  To specify the time to wait for login to complete, edit the line.
>> #  The value is in seconds and the default is 15 seconds.
>> node.conn[0].timeo.login_timeout = 25
>
> This is probably the setting you want to change:
> node.session.initial_login_retry_max = 8
>
> Gordan
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From michaelc at cs.wisc.edu  Mon Oct 22 21:55:16 2007
From: michaelc at cs.wisc.edu (Mike Christie)
Date: Mon, 22 Oct 2007 16:55:16 -0500
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471D0748.7040007@noaa.gov>
References: <471CEC60.1040308@noaa.gov>	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>	<471CF12F.70107@noaa.gov>	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
	<471D0748.7040007@noaa.gov>
Message-ID: <471D1C44.2040202@cs.wisc.edu>

Randy Brown wrote:
> There isn't a line like that in my iscsi.conf file.  Can I simply add it?
> 

If you are using the iscsi tools that came with Centos 5.0, the feature 
is not present. You will need the tools that come with Centos 5.1 when 
it comes out or you need the upstream tools from open-iscsi.org.

> Randy
> 
> gordan at bobich.net wrote:
>> On Mon, 22 Oct 2007, Randy Brown wrote:
>>
>>> Thanks, I'll try upping the retries.  I am assuming this is the same 
>>> thing as increasing the time value here:
>>
>> No. Timeouts and retries are separate settings. The problem is usually 
>> that the iSCSI subsystems tries to access the SAN before the network 
>> has fully come up.
>>
>>> #  To specify the time to wait for login to complete, edit the line.
>>> #  The value is in seconds and the default is 15 seconds.
>>> node.conn[0].timeo.login_timeout = 25
>>
>> This is probably the setting you want to change:
>> node.session.initial_login_retry_max = 8
>>
>> Gordan
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From nlam87346 at library.usyd.edu.au  Mon Oct 22 22:16:59 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Tue, 23 Oct 2007 08:16:59 +1000
Subject: [Linux-cluster] cmirror and RHEL 5
In-Reply-To: <2ca799770710220639pad02b0au53a671d3c3782714@mail.gmail.com>
References: <2ca799770710220639pad02b0au53a671d3c3782714@mail.gmail.com>
Message-ID: <1193091419.5820.17.camel@lits19.library.usyd.edu.au>

On Mon, 2007-10-22 at 16:39 +0300, Mikko Partio wrote:
> Hello list
> 
> does RHEL 5 support cmirror (creating lvm mirrored volumes in a
> cluster environment)? I can't find any rpm packages for it. My goal is
> to increase redundancy by mirroring data between two sans. 
> 
> Regards
> 
> M

Hi Mikko,

Unfortunately the answer is no at the moment. This and LVM snapshots are
two things that I know can't be done with RHEL5's CLVM yet. I've been
told by Red Hat consultants that update 2 is the current target for
these features.

We wanted to use this solution to automate SAN failover between two HP
EVAs in different data centres but in the end have ended up having to
use HP's Continuous Access system to handle mirroring behind the scenes.
Fortunately we were already needed this mirroring elsewhere in the
university because I'm told the license costs six figures, which is a
lot more expensive than host based mirroring (0 dollars ;).

In some ways this is better because it removes a potential split brain
scenario that could happen if host-based mirroring was used. But it does
require manual intervention by a SAN administrator in the event of EVA
failure, which is not going to happen quickly if it happens, say at 3am.

Regards,

Nik

 
From christopher.barry at qlogic.com  Tue Oct 23 05:36:46 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Tue, 23 Oct 2007 00:36:46 -0500
Subject: [Linux-cluster] new fence agent for vmware esx server
Message-ID: <D158540CCC0AB54C8FD4818F823CCB2453AC35@EPEXCH1.qlogic.org>


Greetings,

I needed a fence agent to poweroff or reset my virtual cluster nodes using the viperltoolkit. 
I could not find one that did, (fence_esx didn't do it for me) so I hacked together this one.

I'm not much of a perl guy, so it could probably use some love, but it does work. 
It drops a vm in ~2 seconds.

Have a look, make corrections/additions, and let me know what you think.


Cheers,
-C
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/c883f126/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: fence_vi3
Type: application/octet-stream
Size: 6329 bytes
Desc: fence_vi3
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/c883f126/attachment.obj>

From christopher.barry at qlogic.com  Tue Oct 23 06:46:02 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Tue, 23 Oct 2007 01:46:02 -0500
Subject: [Linux-cluster] RE: new fence agent for vmware esx server
References: <D158540CCC0AB54C8FD4818F823CCB2453AC35@EPEXCH1.qlogic.org>
Message-ID: <D158540CCC0AB54C8FD4818F823CCB2453AC38@EPEXCH1.qlogic.org>

-----Original Message-----
From: Christopher Barry
Sent: Tue 10/23/2007 1:36 AM
To: Linux-Cluster at Redhat. Com (linux-cluster at redhat.com)
Subject: new fence agent for vmware esx server
 

Greetings,

I needed a fence agent to poweroff or reset my virtual cluster nodes using the viperltoolkit. 
I could not find one that did, (fence_esx didn't do it for me) so I hacked together this one.

I'm not much of a perl guy, so it could probably use some love, but it does work. 
It drops a vm in ~2 seconds.

Have a look, make corrections/additions, and let me know what you think.


Cheers,
-C


I noticed that my company's spam filter never delivered this email to me, and thought that may be the case for many of you. You can find the fence_vi3 agent on MARC at:
http://marc.info/?l=redhat-linux-cluster&m=119311812110841&w=3


-C
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/82861f4b/attachment.htm>

From gordan at bobich.net  Tue Oct 23 07:36:52 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Tue, 23 Oct 2007 08:36:52 +0100 (BST)
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471D0748.7040007@noaa.gov>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
	<471CF12F.70107@noaa.gov>
	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
	<471D0748.7040007@noaa.gov>
Message-ID: <Pine.LNX.4.64.0710230836270.23622@skynet.shatteredsilicon.net>

Yes.

On Mon, 22 Oct 2007, Randy Brown wrote:

> There isn't a line like that in my iscsi.conf file.  Can I simply add it?
>
> Randy
>
> gordan at bobich.net wrote:
>> > #   To specify the time to wait for login to complete, edit the line.
>> > #   The value is in seconds and the default is 15 seconds.
>> >  node.conn[0].timeo.login_timeout = 25
>>
>>  This is probably the setting you want to change:
>>  node.session.initial_login_retry_max = 8


From gordan at bobich.net  Tue Oct 23 07:37:47 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Tue, 23 Oct 2007 08:37:47 +0100 (BST)
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471D1C44.2040202@cs.wisc.edu>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
	<471CF12F.70107@noaa.gov>
	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
	<471D0748.7040007@noaa.gov> <471D1C44.2040202@cs.wisc.edu>
Message-ID: <Pine.LNX.4.64.0710230837160.23622@skynet.shatteredsilicon.net>

Are you sure about that? It cured the problem for me.

On Mon, 22 Oct 2007, Mike Christie wrote:

> If you are using the iscsi tools that came with Centos 5.0, the feature is 
> not present. You will need the tools that come with Centos 5.1 when it comes 
> out or you need the upstream tools from open-iscsi.org.
>
>> >  This is probably the setting you want to change:
>> >  node.session.initial_login_retry_max = 8


From breeves at redhat.com  Tue Oct 23 08:05:20 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Tue, 23 Oct 2007 09:05:20 +0100
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
References: <471CEC60.1040308@noaa.gov>	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>	<471CF12F.70107@noaa.gov>
	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
Message-ID: <471DAB40.6020003@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

gordan at bobich.net wrote:
> On Mon, 22 Oct 2007, Randy Brown wrote:
> 
>> Thanks, I'll try upping the retries.  I am assuming this is the same
>> thing as increasing the time value here:
> 
> No. Timeouts and retries are separate settings. The problem is usually
> that the iSCSI subsystems tries to access the SAN before the network has
> fully come up.
> 

You can also tweak the network delay setting in /etc/sysconfig/network.
Set the variable NETWORKDELAY to a value (in seconds) and the network
scripts will sleep for this period of time after bringing all interfaces
up. This should allow time for your network to be fully up before the
iscsi service starts.

Regards,
Bryn.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHHatA6YSQoMYUY94RAiGAAJ41fsuuLLvypgti/C+Ik07RCBUkrgCfUL9W
tYdoAqbjAdoRr02KJKssdJQ=
=nciQ
-----END PGP SIGNATURE-----


From Alain.Moulle at bull.net  Tue Oct 23 09:29:39 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 23 Oct 2007 11:29:39 +0200
Subject: [Linux-cluster] deadnode_timer="value" to the cman section in
	cluster.conf
Message-ID: <471DBF03.1010507@bull.net>

Hi

Is the "deadnode_timer" field managed in CS4 U4 ? CS4 U5 ? all versions ?

Because I don't find this field described in the cluster.conf doc :
http://sources.redhat.com/cluster/doc/cluster_schema.html

Thanks
Regards
Alain Moull?


From maciej.bogucki at artegence.com  Tue Oct 23 10:10:29 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 23 Oct 2007 12:10:29 +0200
Subject: [Linux-cluster] cmirror and RHEL 5
In-Reply-To: <1193091419.5820.17.camel@lits19.library.usyd.edu.au>
References: <2ca799770710220639pad02b0au53a671d3c3782714@mail.gmail.com>
	<1193091419.5820.17.camel@lits19.library.usyd.edu.au>
Message-ID: <471DC895.4050603@artegence.com>

Nikolas Lam napisa?(a):
> On Mon, 2007-10-22 at 16:39 +0300, Mikko Partio wrote:
>> Hello list
>>
>> does RHEL 5 support cmirror (creating lvm mirrored volumes in a
>> cluster environment)? I can't find any rpm packages for it. My goal is
>> to increase redundancy by mirroring data between two sans. 
>>
>> Regards
>>
>> M
> 
> Hi Mikko,
> 
> Unfortunately the answer is no at the moment. This and LVM snapshots are
> two things that I know can't be done with RHEL5's CLVM yet. I've been
> told by Red Hat consultants that update 2 is the current target for
> these features.
> 
> We wanted to use this solution to automate SAN failover between two HP
> EVAs in different data centres but in the end have ended up having to
> use HP's Continuous Access system to handle mirroring behind the scenes.
> Fortunately we were already needed this mirroring elsewhere in the
> university because I'm told the license costs six figures, which is a
> lot more expensive than host based mirroring (0 dollars ;).
> 
> In some ways this is better because it removes a potential split brain
> scenario that could happen if host-based mirroring was used. But it does
> require manual intervention by a SAN administrator in the event of EVA
> failure, which is not going to happen quickly if it happens, say at 3am.

You can use LVM on the top of DRBD [1]. It is very simple to implement
with bash scripts(manual failover) or with Heartbeat(automatic failover)
[2]. In my opinion DRBD is better solution than cmirror, because you
don't need third device.

[1] - http://www.drbd.org/
[2] - http://www.linux-ha.org/

Best Regards
Maciej Bogucki


From maciej.bogucki at artegence.com  Tue Oct 23 10:12:41 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 23 Oct 2007 12:12:41 +0200
Subject: [Linux-cluster] watchdog with GFS?
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C353B@cumulonimbus.RG.local>
References: <D158540CCC0AB54C8FD4818F823CCB2453AC30@EPEXCH1.qlogic.org>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C353B@cumulonimbus.RG.local>
Message-ID: <471DC919.3080604@artegence.com>

Alexandre Racine napisa?(a):
> Hi all, 
> 
> I would need to have some good and bad points to use watchdog (the software Linux app) has a fencing device for GFS.
> 
> Does some of you have some experience with it? Let's just say that I want to use DRAC's witch are very nice, but some people needs to be over-convince sometimes.
> 
> Shoot everything your got :)

linux-cluster Archives is You friend [1]

[1] - https://www.redhat.com/archives/linux-cluster/

Best Regards
Maciej Bogucki


From breeves at redhat.com  Tue Oct 23 10:23:49 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Tue, 23 Oct 2007 11:23:49 +0100
Subject: [Linux-cluster] watchdog with GFS?
In-Reply-To: <471DC919.3080604@artegence.com>
References: <D158540CCC0AB54C8FD4818F823CCB2453AC30@EPEXCH1.qlogic.org>	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C353B@cumulonimbus.RG.local>
	<471DC919.3080604@artegence.com>
Message-ID: <471DCBB5.5090803@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Maciej Bogucki wrote:
> Alexandre Racine napisa?(a):
>> Hi all, 
>>
>> I would need to have some good and bad points to use watchdog (the software Linux app) has a fencing device for GFS.
>>
>> Does some of you have some experience with it? Let's just say that I want to use DRAC's witch are very nice, but some people needs to be over-convince sometimes.
>>
>> Shoot everything your got :)

Using watchdogs / manual fencing is such a bad idea it even gets its own
FAQ entry:

http://sources.redhat.com/cluster/faq.html#fence_manual

As a backup method or for testing/development/debugging it has some
uses, but as your primary fencing method in a production environment it
is just not suitable.

Regards,
Bryn.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHHcu16YSQoMYUY94RAhAbAJ9zOP3r52bUDXCGteXE8ghsr85svQCgzoT9
x1vQX4vyYHiRzym+SJEOXEs=
=puXN
-----END PGP SIGNATURE-----


From andreezer at gmail.com  Tue Oct 23 10:52:08 2007
From: andreezer at gmail.com (=?ISO-8859-1?Q?Andr=E9_Fernandes?=)
Date: Tue, 23 Oct 2007 11:52:08 +0100
Subject: [Linux-cluster] self_fence in <fs> resource tag?
Message-ID: <a434d05d0710230352nce96280r5d9e4ea907cfc179@mail.gmail.com>

Hi.

I was configuring a HA cluster with a shared ext3 filesystem and came across
this option in the <fs> resource tag.
What does the self_fence option do? I could not find it detailed anywhere.

Here's the sample from my cluster.conf:
...
<resources>
            ...
            <fs device="/dev/scalixdata/scalixmail" force_fsck="1"
force_unmount="1" fsid="8848" fstype="ext3" mountpoint="/var/opt/scalix"
name="scalix_mail_data" options="" self_fence="1"/>
            ...

thanks in advance.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/d3731834/attachment.htm>

From rhurst at bidmc.harvard.edu  Tue Oct 23 12:33:30 2007
From: rhurst at bidmc.harvard.edu (rhurst at bidmc.harvard.edu)
Date: Tue, 23 Oct 2007 08:33:30 -0400
Subject: [Linux-cluster] Cman tool leave remove, device or resource
 busy
In-Reply-To: <C3425E9C.10CDD8%jgray@nicusa.com>
References: <C3425E9C.10CDD8%jgray@nicusa.com>
Message-ID: <1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>

Unfortunately, we have run into the same issue... but we have found that
a small pause before shutting cman down works every time (script snippet
below -- sleep 1 was dicey, sleep 2 always worked, sleep3 just because):

service rgmanager stop
[ $? -eq 0 ] || exit

service gfs stop
[ $? -eq 0 ] || exit

service clvmd stop
[ $? -eq 0 ] || exit

service fenced stop
[ $? -eq 0 ] || exit

# invoking cman too early sometimes fails ... -t option too small?
sync
sleep 3

[ "$ACTION" = "leave" ] && service cman stop leave || service cman stop
[ $? -eq 0 ] || exit

service ccsd stop
[ $? -eq 0 ] || exit


On Mon, 2007-10-22 at 14:05 -0600, Josh Gray wrote:

> I see in the FAQ the proper way to leave a cluster is with 'cman_tool leave
> remove'  I am getting a  'device or resource busy' error when I do that even
> with GFS mounts removed and stopped.   What am I missing?  The only service
> I see up is a "fence" type called "default"  do I have to stop one more
> thing?
> 
> Basically doing this:
> 
> service nfs stop
> umount /export
> service gfs stop
> service rgmanager stop
> service clvmd stop
> cman_tool leave remove
> 
> Tells me this:
> cman_tool: Error leaving cluster: Device or resource busy
> 
> 
> Full snip from the server:
> 
> [root at nfs-6 ~]# cman_tool services
> type             level name         id       state
> fence            0     default      00010003 none
> [1 2 3]
> dlm              1     clvmd        00020003 none
> [1 2 3]
> dlm              1     rgmanager    00050003 none
> [1 2 3]
> dlm              1     nfs_data_vg  00040003 none
> [1 2 3]
> gfs              2     nfs_data_vg  00030003 none
> [1 2 3]
> [root at nfs-6 ~]# service nfs stop
> Shutting down NFS mountd:                                  [  OK  ]
> Shutting down NFS daemon:                                  [  OK  ]
> Shutting down NFS quotas:                                  [  OK  ]
> Shutting down NFS services:                                [  OK  ]
> [root at nfs-6 ~]# umount /export
> [root at nfs-6 ~]# service gfs stop
> [root at nfs-6 ~]# service rgmanager stop
> Shutting down Cluster Service Manager...
> Waiting for services to stop:                              [  OK  ]
> Cluster Service Manager is stopped.
> [root at nfs-6 ~]# service clvmd stop
> Deactivating VG nfs_data_vg:   0 logical volume(s) in volume group
> "nfs_data_vg" now active
>                                                            [  OK  ]
> Stopping clvm:                                             [  OK  ]
> [root at nfs-6 ~]# cman_tool leave remove
> cman_tool: Error leaving cluster: Device or resource busy
> [root at nfs-6 ~]# cman_tool services
> type             level name     id       state
> fence            0     default  00010003 none
> [1 2 3]
> [root at nfs-6 ~]# 
> 
> 
> 
> 


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/61e07631/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 2178 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/61e07631/attachment.p7s>

From gordan at bobich.net  Tue Oct 23 15:24:59 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Tue, 23 Oct 2007 16:24:59 +0100 (BST)
Subject: [Linux-cluster] Downing secondary interface if primary fails?
In-Reply-To: <1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>
References: <C3425E9C.10CDD8%jgray@nicusa.com>
	<1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>
Message-ID: <Pine.LNX.4.64.0710231622250.25898@skynet.shatteredsilicon.net>

Hi,

Say a machine is a part of a cluster. It has eth0 for WAN accessm and eth1 
for SAN/cluster access. Say eth1 fails - is there a way to get eth0 to 
shut down as well, so the machine doesn't appear to be alive and thus get 
traffic routed to it?

Thanks.

Gordan


From Timothy.Ward at itt.com  Tue Oct 23 15:29:44 2007
From: Timothy.Ward at itt.com (Ward, Timothy - SSD)
Date: Tue, 23 Oct 2007 11:29:44 -0400
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <1191870569.2804.12.camel@dhcp83-206.boston.redhat.com>
Message-ID: <77E700AE7021314DB6CDF6D6E8F661320396FC7C@ACDFWMAIL1.acd.de.ittind.com>

Kevin,

Thanks for the tip on CentOS.  My working cluster is now:
3node 64bit AMD opteron 2GHz
CentOS 5 (good)
GFS 1 0.1.11(good)
Apache 2.2.3 (good)
Samba 3.0.23c-2 (good)
NFS 1.0.9-16 (BAD)

So I am avoiding Fedora and GFS2 until you've had more time to "bake"
it.

I am reading up on Wendy Chang's NFS linux-cluster postings to
understand why NFS fail over is not consistent.
- Has anyone successfully setup a failover NFS cluster?
- Does nfsv4 work better?  It appears by default fedora6 uses nfsv3
(tcp) client and is flaky.

Tim 

________________________________

From: Kevin Anderson [mailto:kanderso at redhat.com] 
Sent: Monday, October 08, 2007 3:09 PM
To: Ward, Timothy - SSD
Subject: Re: [Linux-cluster] Cluster NFS causes kernel bug


On Mon, 2007-10-08 at 09:52 -0400, Ward, Timothy - SSD wrote: 

	Kevin,
	
	I completely reinstalled my cluster and gotten back to Apache
and Samba
	working and NFS using a GFS2 filesystem making a Kernel bug and
crashing
	the whole cluster.
	

Yeah, we have posted a large number of patches to fix most of the major
problems with gfs2.  Those should be percolating through upstream and
then will get pulled into a fedora kernel at some point. 

	
	How are the GFS1/FC7 rpms coming? Where should I look to
download them?
	

Not as well as we had hoped.  We have been swamped with multiple RHEL
releases/updates occurring pretty much at the same time.  We will likely
need to have our own version of the kernel since gfs1 requires access to
three symbols in the gfs2 code in order to access the lock managers.  

	
	If I can get a stable setup of GFS (1 or 2) working to show
management I
	will be able to get a testbed for testing 64bit versions for
you. So
	basically I need some flavor of FC and GFS that are stable so I
can put
	it into production and show it off.
	

Have you considered using the CENTOS instead of Fedora?  Since CentOS
essentially replicates the RHEL rpms, everything is already built and
included?

Kevin 
*****************************************************************
This e-mail and any files transmitted with it may be proprietary 
and are intended solely for the use of the individual or entity to 
whom they are addressed. If you have received this e-mail in 
error please notify the sender. Please note that any views or
opinions presented in this e-mail are solely those of the author 
and do not necessarily represent those of ITT Corporation. The 
recipient should check this e-mail and any attachments for the 
presence of viruses. ITT accepts no liability for any damage 
caused by any virus transmitted by this e-mail.
*******************************************************************


From randy.brown at noaa.gov  Tue Oct 23 15:39:46 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Tue, 23 Oct 2007 11:39:46 -0400
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471DAB40.6020003@redhat.com>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
	<471CF12F.70107@noaa.gov>
	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
	<471DAB40.6020003@redhat.com>
Message-ID: <471E15C2.5010004@noaa.gov>

I have tried both suggestions.  I added the line:
node.session.initial_login_retry_max = 8
I even made the value as high as 20 with no change during boot up.  Then 
I added NETWORKDELAY=20 to /etc/sysconfig/network and I still see:
iscsiadm: Could not login session (err 4)
iscsiadm: initiator reported error (4 - encountered connection failure)

when I boot the machine.  After the machine as booted, I restart iscsi, 
clvmd, and gfs and everything works.  Sometimes I hate computers.

Thanks,
Randy

Bryn M. Reeves wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> gordan at bobich.net wrote:
>   
>> On Mon, 22 Oct 2007, Randy Brown wrote:
>>
>>     
>>> Thanks, I'll try upping the retries.  I am assuming this is the same
>>> thing as increasing the time value here:
>>>       
>> No. Timeouts and retries are separate settings. The problem is usually
>> that the iSCSI subsystems tries to access the SAN before the network has
>> fully come up.
>>
>>     
>
> You can also tweak the network delay setting in /etc/sysconfig/network.
> Set the variable NETWORKDELAY to a value (in seconds) and the network
> scripts will sleep for this period of time after bringing all interfaces
> up. This should allow time for your network to be fully up before the
> iscsi service starts.
>
> Regards,
> Bryn.
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.7 (GNU/Linux)
> Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
>
> iD8DBQFHHatA6YSQoMYUY94RAiGAAJ41fsuuLLvypgti/C+Ik07RCBUkrgCfUL9W
> tYdoAqbjAdoRr02KJKssdJQ=
> =nciQ
> -----END PGP SIGNATURE-----
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/da3e6da7/attachment.htm>

From gordan at bobich.net  Tue Oct 23 15:42:29 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Tue, 23 Oct 2007 16:42:29 +0100 (BST)
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC7C@ACDFWMAIL1.acd.de.ittind.com>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC7C@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <Pine.LNX.4.64.0710231637080.25898@skynet.shatteredsilicon.net>

On Tue, 23 Oct 2007, Ward, Timothy - SSD wrote:

> NFS 1.0.9-16 (BAD)
>
> So I am avoiding Fedora and GFS2 until you've had more time to "bake"
> it.
>
> I am reading up on Wendy Chang's NFS linux-cluster postings to
> understand why NFS fail over is not consistent.

What do you mean by "consistent"?

> - Has anyone successfully setup a failover NFS cluster?

You'll have to reveal more about your setup. Are you using a floating IP 
and HA to fail over the NFS service, with GNDB for the mirrored block 
device? A SAN with a shared GFS file system with multiple nodes exporting 
the NFS file system? Or something entirely different?

> - Does nfsv4 work better?  It appears by default fedora6 uses nfsv3
> (tcp) client and is flaky.

NFS over TCP has always been flaky, and from what I've seen NFSv4 is not 
really any better. I'm still using NFSv3 over UDP for everything, as it 
seems most reliable.

Gordan


From berthiaume_wayne at emc.com  Tue Oct 23 16:04:31 2007
From: berthiaume_wayne at emc.com (berthiaume_wayne at emc.com)
Date: Tue, 23 Oct 2007 12:04:31 -0400
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471E15C2.5010004@noaa.gov>
References: <471CEC60.1040308@noaa.gov><Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net><471CF12F.70107@noaa.gov><Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net><471DAB40.6020003@redhat.com>
	<471E15C2.5010004@noaa.gov>
Message-ID: <D364D39DAD21D444BAE2C70919B6280907C0278B@CORPUSMX40A.corp.emc.com>

Try inserting a delay into the iscsi init script in the beginning of the
start clause. Depending on the delays in your netwrok and the NIC
drivers being used I've found values of between 30 and 60 seconds worked
for me. 
 
Regards,
Wayne.

________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Randy Brown
Sent: Tuesday, October 23, 2007 11:40 AM
To: linux clustering
Subject: Re: [Linux-cluster] problems with iscsi storage and boot order


I have tried both suggestions.  I added the line:
node.session.initial_login_retry_max = 8
I even made the value as high as 20 with no change during boot up.  Then
I added NETWORKDELAY=20 to /etc/sysconfig/network and I still see:
iscsiadm: Could not login session (err 4)
iscsiadm: initiator reported error (4 - encountered connection failure)

when I boot the machine.  After the machine as booted, I restart iscsi,
clvmd, and gfs and everything works.  Sometimes I hate computers.

Thanks,
Randy

Bryn M. Reeves wrote: 

	-----BEGIN PGP SIGNED MESSAGE-----
	Hash: SHA1
	
	gordan at bobich.net wrote:
	  

		On Mon, 22 Oct 2007, Randy Brown wrote:
		
		    
			Thanks, I'll try upping the retries.  I am
assuming this is the same
			thing as increasing the time value here:
			      

		No. Timeouts and retries are separate settings. The
problem is usually
		that the iSCSI subsystems tries to access the SAN before
the network has
		fully come up.
		
		    
	You can also tweak the network delay setting in
/etc/sysconfig/network.
	Set the variable NETWORKDELAY to a value (in seconds) and the
network
	scripts will sleep for this period of time after bringing all
interfaces
	up. This should allow time for your network to be fully up
before the
	iscsi service starts.
	
	Regards,
	Bryn.
	
	-----BEGIN PGP SIGNATURE-----
	Version: GnuPG v1.4.7 (GNU/Linux)
	Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
	
	iD8DBQFHHatA6YSQoMYUY94RAiGAAJ41fsuuLLvypgti/C+Ik07RCBUkrgCfUL9W
	tYdoAqbjAdoRr02KJKssdJQ=
	=nciQ
	-----END PGP SIGNATURE-----
	
	--
	Linux-cluster mailing list
	Linux-cluster at redhat.com
	https://www.redhat.com/mailman/listinfo/linux-cluster
	  

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/a2e1c232/attachment.htm>

From jgray at nicusa.com  Tue Oct 23 16:08:51 2007
From: jgray at nicusa.com (Josh Gray)
Date: Tue, 23 Oct 2007 10:08:51 -0600
Subject: [Linux-cluster] deadnode_timer="value" to the cman section in
	cluster.conf
In-Reply-To: <471DBF03.1010507@bull.net>
Message-ID: <C34378B3.10D000%jgray@nicusa.com>

Here you go, think this is what you are after

http://sources.redhat.com/cluster/faq.html#cman_deadnode_timer

I think older versions had it part of /proc  but that's before I starting
using it


On 10/23/07 3:29 AM, "Alain Moulle" <Alain.Moulle at bull.net> wrote:

> Hi
> 
> Is the "deadnode_timer" field managed in CS4 U4 ? CS4 U5 ? all versions ?
> 
> Because I don't find this field described in the cluster.conf doc :
> http://sources.redhat.com/cluster/doc/cluster_schema.html
> 
> Thanks
> Regards
> Alain Moull?
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From breeves at redhat.com  Tue Oct 23 16:10:37 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Tue, 23 Oct 2007 17:10:37 +0100
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <D364D39DAD21D444BAE2C70919B6280907C0278B@CORPUSMX40A.corp.emc.com>
References: <471CEC60.1040308@noaa.gov><Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net><471CF12F.70107@noaa.gov><Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net><471DAB40.6020003@redhat.com>	<471E15C2.5010004@noaa.gov>
	<D364D39DAD21D444BAE2C70919B6280907C0278B@CORPUSMX40A.corp.emc.com>
Message-ID: <471E1CFD.9060006@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

berthiaume_wayne at emc.com wrote:
> Try inserting a delay into the iscsi init script in the beginning of the
> start clause. Depending on the delays in your netwrok and the NIC
> drivers being used I've found values of between 30 and 60 seconds worked
> for me.
>  
> Regards,
> Wayne.

Why hack around with the iscsi script when the network scripts already
have a generic mechanism to allow this?

Bryn.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHHhz96YSQoMYUY94RAhsbAJ95YtbUlXih8ZAvR0VqrZwXIzzGVgCfS3ZP
+Lg1IemXlttXCtChpJ3YW+c=
=FD2I
-----END PGP SIGNATURE-----


From breeves at redhat.com  Tue Oct 23 16:13:17 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Tue, 23 Oct 2007 17:13:17 +0100
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471E15C2.5010004@noaa.gov>
References: <471CEC60.1040308@noaa.gov>	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>	<471CF12F.70107@noaa.gov>	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>	<471DAB40.6020003@redhat.com>
	<471E15C2.5010004@noaa.gov>
Message-ID: <471E1D9D.40709@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Randy Brown wrote:
> I have tried both suggestions.  I added the line:
> node.session.initial_login_retry_max = 8
> I even made the value as high as 20 with no change during boot up.  Then
> I added NETWORKDELAY=20 to /etc/sysconfig/network and I still see:
> iscsiadm: Could not login session (err 4)
> iscsiadm: initiator reported error (4 - encountered connection failure)
> 
> when I boot the machine.  After the machine as booted, I restart iscsi,
> clvmd, and gfs and everything works.  Sometimes I hate computers.

How is your network configured? Are you using DHCP or static
configuration and have you restored the initscripts priorities after the
earlier changes?

Bryn.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHHh2d6YSQoMYUY94RAsJmAKCqU3DrqBI0VFTVFN/Hp6SBpd/EEACfUMyj
zci9TOXt6+aI7NH4uq9cBEE=
=SnFs
-----END PGP SIGNATURE-----


From zepp157 at yahoo.com  Tue Oct 23 17:47:31 2007
From: zepp157 at yahoo.com (J.M.)
Date: Tue, 23 Oct 2007 10:47:31 -0700 (PDT)
Subject: [Linux-cluster] CLVM question
Message-ID: <836609.96330.qm@web63504.mail.re1.yahoo.com>

I think I missed something along the way...


I have several test clusters with three types of logical volumes:


System ext3 (local disk)

 Cluser-managed ext3 (SAN disk)

 Cluster-managed gfs (SAN disk)


I'm currently able to mount my Cluster-managed ext3 logical volumes on multiple nodes concurrently.  This is obviously unsafe.


How should the cluster suite differentiate VGs/LVs with cluster filesystems and VGs/LVs with filesystems that are single-node only?


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From randy.brown at noaa.gov  Tue Oct 23 18:23:43 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Tue, 23 Oct 2007 14:23:43 -0400
Subject: [Linux-cluster] problems with iscsi storage and boot order
In-Reply-To: <471E1D9D.40709@redhat.com>
References: <471CEC60.1040308@noaa.gov>
	<Pine.LNX.4.64.0710221934310.20323@skynet.shatteredsilicon.net>
	<471CF12F.70107@noaa.gov>
	<Pine.LNX.4.64.0710221951001.20439@skynet.shatteredsilicon.net>
	<471DAB40.6020003@redhat.com> <471E15C2.5010004@noaa.gov>
	<471E1D9D.40709@redhat.com>
Message-ID: <471E3C2F.20500@noaa.gov>

Yes, init scripts have all been restored to their correct priorities.  I 
am using static IPs on both eth0 and eth1.  Eth0 connects the host 
machine to our office network and eth1 is configure for a 192.168 
network which is for the iscsi.

Randy

Bryn M. Reeves wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Randy Brown wrote:
>   
>> I have tried both suggestions.  I added the line:
>> node.session.initial_login_retry_max = 8
>> I even made the value as high as 20 with no change during boot up.  Then
>> I added NETWORKDELAY=20 to /etc/sysconfig/network and I still see:
>> iscsiadm: Could not login session (err 4)
>> iscsiadm: initiator reported error (4 - encountered connection failure)
>>
>> when I boot the machine.  After the machine as booted, I restart iscsi,
>> clvmd, and gfs and everything works.  Sometimes I hate computers.
>>     
>
> How is your network configured? Are you using DHCP or static
> configuration and have you restored the initscripts priorities after the
> earlier changes?
>
> Bryn.
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.7 (GNU/Linux)
> Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
>
> iD8DBQFHHh2d6YSQoMYUY94RAsJmAKCqU3DrqBI0VFTVFN/Hp6SBpd/EEACfUMyj
> zci9TOXt6+aI7NH4uq9cBEE=
> =SnFs
> -----END PGP SIGNATURE-----
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/8c7beaba/attachment.htm>

From Timothy.Ward at itt.com  Tue Oct 23 18:57:15 2007
From: Timothy.Ward at itt.com (Ward, Timothy - SSD)
Date: Tue, 23 Oct 2007 14:57:15 -0400
Subject: [Linux-cluster] Cluster NFS causes kernel bug
Message-ID: <77E700AE7021314DB6CDF6D6E8F661320396FC7D@ACDFWMAIL1.acd.de.ittind.com>

On Tue, 23 Oct 2007, Gordon wrote:

>> NFS 1.0.9-16 (BAD)
>> 
>> So I am avoiding Fedora and GFS2 until you've had more time to "bake"
it.
>> 
>> I am reading up on Wendy Chang's NFS linux-cluster postings to
>> understand why NFS fail over is not consistent.
> 
> What do you mean by "consistent"?
> 
>> - Has anyone successfully setup a failover NFS cluster?
> 
> You'll have to reveal more about your setup. Are you using a floating
IP and HA to fail over the NFS service, with GNDB for the mirrored block
device? A > SAN with a shared GFS file system with multiple nodes
exporting the NFS file system? Or something entirely different? 
> 
>> - Does nfsv4 work better?  It appears by default fedora6 uses nfsv3
(tcp) client and
>>   is flaky.
> 
> NFS over TCP has always been flaky, and from what I've seen NFSv4 is
not really any better. I'm still using NFSv3 over UDP for everything, as
it seems > > most reliable. 
> 
> Gordan

Gordon,

More detail:
- SAN FibreChannel array connected to a SAN switch with 2 of the 3 nodes
connected to it as well
- GFS1 fs mounted on the 2 nodes (So both SMB and NFS can access the
same files)
- A floating IP for the NFS service.  The failover domain only allows it
to go to one of the 2 SAN connected nodes
- A Samba service setup on the 2nd node but able to fail to the first
- NFS running on both nodes at all times, but only being cluster served
via the floating IP by one node

1) By "not consistent" I mean that when I failover from 1 node to
another node the NFS service fails just fine.  Its the client connection
that is painful.  Sometimes the client (FC6 in this case, building a
CentOS 5 as we speak) reconnects to the server "immediately", sometimes
within minutes, or an hour or never.

2) Thanks for the report on NFSv3/UDP.  From my reading that sounded
like something to avoid, but maybe I need to try it anyway.  How
reliable has it been?  Do the clients reconnect most times?

Thanks for the reply :)
Tim
*****************************************************************
This e-mail and any files transmitted with it may be proprietary 
and are intended solely for the use of the individual or entity to 
whom they are addressed. If you have received this e-mail in 
error please notify the sender. Please note that any views or
opinions presented in this e-mail are solely those of the author 
and do not necessarily represent those of ITT Corporation. The 
recipient should check this e-mail and any attachments for the 
presence of viruses. ITT accepts no liability for any damage 
caused by any virus transmitted by this e-mail.
*******************************************************************


From lhh at redhat.com  Tue Oct 23 20:49:22 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 23 Oct 2007 16:49:22 -0400
Subject: [Linux-cluster] Downing secondary interface if primary fails?
In-Reply-To: <Pine.LNX.4.64.0710231622250.25898@skynet.shatteredsilicon.net>
References: <C3425E9C.10CDD8%jgray@nicusa.com>
	<1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>
	<Pine.LNX.4.64.0710231622250.25898@skynet.shatteredsilicon.net>
Message-ID: <1193172562.4831.0.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-10-23 at 16:24 +0100, gordan at bobich.net wrote:
> Hi,
> 
> Say a machine is a part of a cluster. It has eth0 for WAN accessm and eth1 
> for SAN/cluster access. Say eth1 fails - is there a way to get eth0 to 
> shut down as well, so the machine doesn't appear to be alive and thus get 
> traffic routed to it?

Use fencing to power the node off?

-- Lon


From lhh at redhat.com  Tue Oct 23 20:49:48 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 23 Oct 2007 16:49:48 -0400
Subject: [Linux-cluster] self_fence in <fs> resource tag?
In-Reply-To: <a434d05d0710230352nce96280r5d9e4ea907cfc179@mail.gmail.com>
References: <a434d05d0710230352nce96280r5d9e4ea907cfc179@mail.gmail.com>
Message-ID: <1193172588.4831.2.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-10-23 at 11:52 +0100, Andr? Fernandes wrote:
> Hi.
> 
> I was configuring a HA cluster with a shared ext3 filesystem and came
> across this option in the <fs> resource tag.
> What does the self_fence option do? I could not find it detailed
> anywhere.
> 
> Here's the sample from my cluster.conf:
> ...
> <resources>
>             ...
>             <fs device="/dev/scalixdata/scalixmail" force_fsck="1"
> force_unmount="1" fsid="8848" fstype="ext3"
> mountpoint="/var/opt/scalix" name="scalix_mail_data" options=""
> self_fence="1"/>

Reboots the node if unmount fails.

-- Lon


From lhh at redhat.com  Tue Oct 23 21:14:58 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 23 Oct 2007 17:14:58 -0400
Subject: [Linux-cluster] Quorum disk votes not used when starting node?
In-Reply-To: <20071022174331.GB14244@jasmine.xos.nl>
References: <200710101856.l9AIuRWA008113@jasmine.xos.nl>
	<1192462652.27135.16.camel@ayanami.boston.devel.redhat.com>
	<20071022174331.GB14244@jasmine.xos.nl>
Message-ID: <1193174098.4831.12.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-22 at 19:43 +0200, Jos Vos wrote:
> On Mon, Oct 15, 2007 at 11:37:32AM -0400, Lon Hohberger wrote:
> 
> > On Wed, 2007-10-10 at 20:56 +0200, Jos Vos wrote:
> 
> > > Now, this all works fine, cman_tool shows what I expected and when I
> > > remove the file /tmp/qdisk on a node, that node reboots instantaneously.
> > > 
> > > However, after the reboot, while the file tested in the heuristic does
> > > still not exist, the node is joining the cluster again and starts some
> > > cluster services!
> > 
> > Add stop_cman="1" to <quorumd>
> 
> Any other suggestions?  As I already replied, this didn't help (I'd
> already tried that before asking the question).  Note that this is
> on RHEL4 (with all updates), if that matters.
> 

I see it - it looks like stop_cman only applies if qdiskd can't reach
the disk, not if the heuristics are bad.

This should make it kill CMAN if heuristics are bad too if stop_cman is
set.

-- Lon


-------------- next part --------------
A non-text attachment was scrubbed...
Name: qdisk-no-heuristic-stop-cman.patch
Type: text/x-patch
Size: 527 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/79f4d19a/attachment.bin>

From jgray at nicusa.com  Tue Oct 23 21:23:20 2007
From: jgray at nicusa.com (Josh Gray)
Date: Tue, 23 Oct 2007 15:23:20 -0600
Subject: [Linux-cluster] Cman tool leave remove, device or resource
 busy
In-Reply-To: <1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>
Message-ID: <C343C268.10D6E4%jgray@nicusa.com>

Thanks for the reply..     Is this 4.x version you are running?   I see you
are doing `service cman stop leave`   instead of cman_tool,      I?m still
getting the same results putting a pause in there.

Femced isn?t a standalone process in 5.0 as far as I can tell.  If it is,
then is that the one process stopping me from leaving the cluster?
 

On 10/23/07 6:33 AM, "rhurst at bidmc.harvard.edu" <rhurst at bidmc.harvard.edu>
wrote:

> Unfortunately, we have run into the same issue... but we have found that a
> small pause before shutting cman down works every time (script snippet below
> -- sleep 1 was dicey, sleep 2 always worked, sleep3 just because):
> 
> service rgmanager stop
> [ $? -eq 0 ] || exit
> 
> service gfs stop
> [ $? -eq 0 ] || exit
> 
> service clvmd stop
> [ $? -eq 0 ] || exit
> 
> service fenced stop
> [ $? -eq 0 ] || exit
> 
> # invoking cman too early sometimes fails ... -t option too small?
> sync
> sleep 3
> 
> [ "$ACTION" = "leave" ] && service cman stop leave || service cman stop
> [ $? -eq 0 ] || exit
> 
> service ccsd stop
> [ $? -eq 0 ] || exit
> 
> 
> On Mon, 2007-10-22 at 14:05 -0600, Josh Gray wrote:
>> 
>> I see in the FAQ the proper way to leave a cluster is with 'cman_tool leave
>> remove'  I am getting a  'device or resource busy' error when I do that even
>> with GFS mounts removed and stopped.   What am I missing?  The only service
>> I see up is a "fence" type called "default"  do I have to stop one more
>> thing?
>> 
>> Basically doing this:
>> 
>> service nfs stop
>> umount /export
>> service gfs stop
>> service rgmanager stop
>> service clvmd stop
>> cman_tool leave remove
>> 
>> Tells me this:
>> cman_tool: Error leaving cluster: Device or resource busy
>> 
>> 
>> Full snip from the server:
>> 
>> [root at nfs-6 ~]# cman_tool services
>> type             level name         id       state
>> fence            0     default      00010003 none
>> [1 2 3]
>> dlm              1     clvmd        00020003 none
>> [1 2 3]
>> dlm              1     rgmanager    00050003 none
>> [1 2 3]
>> dlm              1     nfs_data_vg  00040003 none
>> [1 2 3]
>> gfs              2     nfs_data_vg  00030003 none
>> [1 2 3]
>> [root at nfs-6 ~]# service nfs stop
>> Shutting down NFS mountd:                                  [  OK  ]
>> Shutting down NFS daemon:                                  [  OK  ]
>> Shutting down NFS quotas:                                  [  OK  ]
>> Shutting down NFS services:                                [  OK  ]
>> [root at nfs-6 ~]# umount /export
>> [root at nfs-6 ~]# service gfs stop
>> [root at nfs-6 ~]# service rgmanager stop
>> Shutting down Cluster Service Manager...
>> Waiting for services to stop:                              [  OK  ]
>> Cluster Service Manager is stopped.
>> [root at nfs-6 ~]# service clvmd stop
>> Deactivating VG nfs_data_vg:   0 logical volume(s) in volume group
>> "nfs_data_vg" now active
>>                                                            [  OK  ]
>> Stopping clvm:                                             [  OK  ]
>> [root at nfs-6 ~]# cman_tool leave remove
>> cman_tool: Error leaving cluster: Device or resource busy
>> [root at nfs-6 ~]# cman_tool services
>> type             level name     id       state
>> fence            0     default  00010003 none
>> [1 2 3]
>> [root at nfs-6 ~]# 
>> 
>> 
>> 
>> 
> 
> Robert Hurst, Sr. Cach? Administrator
> Beth Israel Deaconess Medical Center
> 1135 Tremont Street, REN-7
> Boston, Massachusetts   02120-2140
> 617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
> Any technology distinguishable from magic is insufficiently advanced.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/e513fade/attachment.htm>

From andreezer at gmail.com  Tue Oct 23 21:33:36 2007
From: andreezer at gmail.com (=?ISO-8859-1?Q?Andr=E9_Fernandes?=)
Date: Tue, 23 Oct 2007 22:33:36 +0100
Subject: [Linux-cluster] self_fence in <fs> resource tag?
In-Reply-To: <1193172588.4831.2.camel@ayanami.boston.devel.redhat.com>
References: <a434d05d0710230352nce96280r5d9e4ea907cfc179@mail.gmail.com>
	<1193172588.4831.2.camel@ayanami.boston.devel.redhat.com>
Message-ID: <a434d05d0710231433i57e80fa2l2a70f4bd9c7321dd@mail.gmail.com>

Hi,
thank you for the reply.

What does it do to reboot the node? fences itself? does a clean shutdown?

On 10/23/07, Lon Hohberger <lhh at redhat.com> wrote:
>
> On Tue, 2007-10-23 at 11:52 +0100, Andr? Fernandes wrote:
> > Hi.
> >
> > I was configuring a HA cluster with a shared ext3 filesystem and came
> > across this option in the <fs> resource tag.
> > What does the self_fence option do? I could not find it detailed
> > anywhere.
> >
> > Here's the sample from my cluster.conf:
> > ...
> > <resources>
> >             ...
> >             <fs device="/dev/scalixdata/scalixmail" force_fsck="1"
> > force_unmount="1" fsid="8848" fstype="ext3"
> > mountpoint="/var/opt/scalix" name="scalix_mail_data" options=""
> > self_fence="1"/>
>
> Reboots the node if unmount fails.
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071023/903244b8/attachment.htm>

From lhh at redhat.com  Tue Oct 23 22:31:40 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 23 Oct 2007 18:31:40 -0400
Subject: [Linux-cluster] self_fence in <fs> resource tag?
In-Reply-To: <a434d05d0710231433i57e80fa2l2a70f4bd9c7321dd@mail.gmail.com>
References: <a434d05d0710230352nce96280r5d9e4ea907cfc179@mail.gmail.com>
	<1193172588.4831.2.camel@ayanami.boston.devel.redhat.com>
	<a434d05d0710231433i57e80fa2l2a70f4bd9c7321dd@mail.gmail.com>
Message-ID: <1193178700.4831.24.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-10-23 at 22:33 +0100, Andr? Fernandes wrote:
> Hi,
> thank you for the reply.
> 
> What does it do to reboot the node? fences itself? does a clean
> shutdown?

Immediate reboot, a la 'reboot -fn', or the syscall:

   reboot(RB_AUTOBOOT);

So, only specify self_fence if recovery of the service is more important
than keeping the node up.

-- Lon


From jgray at nicusa.com  Wed Oct 24 01:48:36 2007
From: jgray at nicusa.com (Josh Gray)
Date: Tue, 23 Oct 2007 19:48:36 -0600
Subject: [Linux-cluster] Disable GFS quotas
Message-ID: <C3440094.11200B%jgray@nicusa.com>

Can someone tell me the best way to disable quotas on GFS for good?   I see
in the docs you are suppose to run this command but it doesn't work across
reboots or umounts.

gfs_tool settune /gfs quota_account 0


As a bonus question, does this listserv have a searchable archive anywhere?


-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From erickson.jon at gmail.com  Wed Oct 24 02:05:13 2007
From: erickson.jon at gmail.com (Jon Erickson)
Date: Tue, 23 Oct 2007 22:05:13 -0400
Subject: [Linux-cluster] Disable GFS quotas
In-Reply-To: <C3440094.11200B%jgray@nicusa.com>
References: <C3440094.11200B%jgray@nicusa.com>
Message-ID: <6a90e4da0710231905v6c934e38w64640277895928ff@mail.gmail.com>

I wrote a simple chkconfig script that gets executed after my GFS
volumes are mounted....I can't remember exactly what I did but I think
I took the output of 'mount' and parsed out the GFS volumes to pass to
gfs_tool settune....if you need to see what it looks like it will have
to wait till the morning....

Not sure on the bonus question. :)

On 10/23/07, Josh Gray <jgray at nicusa.com> wrote:
> Can someone tell me the best way to disable quotas on GFS for good?   I see
> in the docs you are suppose to run this command but it doesn't work across
> reboots or umounts.
>
> gfs_tool settune /gfs quota_account 0
>
>
> As a bonus question, does this listserv have a searchable archive anywhere?
>
>
>
> --
> Josh Gray
> Systems Administrator
> NIC Inc
>
> Email: jgray at nicusa.com
> Desk/Mobile: 913-221-1520
>
> "It is not the mountain we conquer, but ourselves."
> - Sir Edmund Hillary
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Jon


From gordan at bobich.net  Wed Oct 24 07:22:28 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Wed, 24 Oct 2007 08:22:28 +0100 (BST)
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC7D@ACDFWMAIL1.acd.de.ittind.com>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC7D@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <Pine.LNX.4.64.0710240819400.32717@skynet.shatteredsilicon.net>

On Tue, 23 Oct 2007, Ward, Timothy - SSD wrote:

> 2) Thanks for the report on NFSv3/UDP.  From my reading that sounded
> like something to avoid, but maybe I need to try it anyway.  How
> reliable has it been?  Do the clients reconnect most times?

In your case, NFS over TCP is likely to have been the major cause of your 
problems. UDP can fail over much more transparently, because there is no 
state to it to expire.

You could also try tweaking your timeout, retry, and hard vs. soft failure 
modes on NFS.

Gordan


From gordan at bobich.net  Wed Oct 24 07:24:24 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Wed, 24 Oct 2007 08:24:24 +0100 (BST)
Subject: [Linux-cluster] Downing secondary interface if primary fails?
In-Reply-To: <1193172562.4831.0.camel@ayanami.boston.devel.redhat.com>
References: <C3425E9C.10CDD8%jgray@nicusa.com>
	<1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>
	<Pine.LNX.4.64.0710231622250.25898@skynet.shatteredsilicon.net>
	<1193172562.4831.0.camel@ayanami.boston.devel.redhat.com>
Message-ID: <Pine.LNX.4.64.0710240822410.32717@skynet.shatteredsilicon.net>

On Tue, 23 Oct 2007, Lon Hohberger wrote:

>> Say a machine is a part of a cluster. It has eth0 for WAN accessm and eth1
>> for SAN/cluster access. Say eth1 fails - is there a way to get eth0 to
>> shut down as well, so the machine doesn't appear to be alive and thus get
>> traffic routed to it?
>
> Use fencing to power the node off?

Can't power it off in software. :-(

Gordan


From mhm.saif at gmail.com  Wed Oct 24 08:28:56 2007
From: mhm.saif at gmail.com (mohammed saif)
Date: Wed, 24 Oct 2007 11:28:56 +0300
Subject: [Linux-cluster] Linux Login Cluster with AFS
Message-ID: <b221cb580710240128qbc8d0dem290c161f21eb90e9@mail.gmail.com>

Hi list,

How are you doing?

I have to do a project related to LLC (Linux Login Cluster) and AFS
(Andrew File System). Please if you know some manuals to read about
these ideas and how to implement them together, PLEASE inform me. I'm
googling since 2 days but there is nothing appropriate till now except
some principles.

I tried to test the AFS by downloading and installing the packages
from www.openafs.org but most documentations concentrate on using the
IBM AFS CD-ROM (To have some modules that I couldn't find through the
net). If its necessary I can recommend to buy the software. but I do
need some extensive manuals for both together to get the big picture.

Please send me any ideas.

Best Regards
Mohammad


From pariviere at ippon.fr  Wed Oct 24 10:09:04 2007
From: pariviere at ippon.fr (Pierre-Alain RIVIERE)
Date: Wed, 24 Oct 2007 12:09:04 +0200
Subject: [Linux-cluster] CLVM, iSCSI and naming rules
Message-ID: <471F19C0.2030203@ippon.fr>

Hello,

I'm working on a Xen solution where Xen's DomU are stored on iSCSI 
volumes (a LV on Dom0 is a partition on DomU) to allow live migration 
between hosts and to make DomU's partitions resizing easier.

iSCSI volumes -> LVM on Dom0 -> LV on Dom0 -> partition on DomU

As far as I know, I need CLVM in order to propagate automatically LVM 
changes between host. So I've installed Ubuntu Gutsy on my server and 
configured CMAN, CLVM and Open-ISCSI. Things seems to work fine.
I've also added some udev custom rules to have the same device name on 
each box

----------------- udev rule
KERNEL=="sd*", BUS=="scsi", PROGRAM="iscsidev 
%b",SYMLINK+="iscsi/%c{1}/lun%c{2}/part%n"
------------------

------------------ iscsidev
#!/bin/sh

BUS=${1}
HOST=${BUS%%:*}

# This was the first thing I thought of, I'm sure it can be done better:
LUN=`echo ${BUS} | cut -d":" -f4`

[ -e /sys/class/iscsi_host ] || exit 1

file="/sys/class/iscsi_host/host${HOST}/device/session*/iscsi_session*/targetname"

#target_name=$(cat ${file})
target_name=`cut -d":" -f2 ${file}`

# This is not an open-scsi drive
if [ -z "${target_name}" ]; then
    exit 1
fi

echo "${target_name} ${LUN}"
--------------------------------------


Using these scripts my iSCSI devices have a symlink under /dev/iscsi/, 
/dev/iscsi/IpponData.xen01/lun0/part is an example.

Next step, LVM initialization.

      # pvcreate /dev/iscsi/IpponData.xen02/lun0/part
      # pvdisplay /dev/iscsi/IpponData.xen02/lun0/part

        --- NEW Physical volume ---
        PV Name               /dev/sdj
        VG Name             
        PV Size               10.00 GB
        Allocatable           NO
        PE Size (KByte)       0
        Total PE              0
        Free PE               0
        Allocated PE          0
        PV UUID               ecfUIs-4mnP-sKhq-mK8s-mICm-rUvt-2o8E8g


As you can see pvdisplay doesn't care about 
/dev/iscsi/IpponData.xen02/lun0/part symlink and instead use the target 
device.

This is a normal behavior? And when I will deploy CMAN, CLVM and iSCSI 
on another box what will be LVM behaviours? I can't guarantee that 
/dev/sdj will be accessible on the other box, only the symlink will be 
the same.


From gwood at dragonhold.org  Wed Oct 24 10:24:50 2007
From: gwood at dragonhold.org (Graham Wood)
Date: Wed, 24 Oct 2007 11:24:50 +0100
Subject: [Linux-cluster] CLVM, iSCSI and naming rules
In-Reply-To: <471F19C0.2030203@ippon.fr>
References: <471F19C0.2030203@ippon.fr>
Message-ID: <20071024102450.GA22543@dragonhold.org>

On Wed, Oct 24, 2007 at 12:09:04PM +0200, Pierre-Alain RIVIERE wrote:
> I've also added some udev custom rules to have the same device name on 
> each box
This isn't actually necessary.  The same name makes things easier from the user/admin 
perspective, but the LVM side of things uses the ID of the pv, rather than the device name.

> This is a normal behavior? And when I will deploy CMAN, CLVM and iSCSI 
> on another box what will be LVM behaviours? I can't guarantee that 
> /dev/sdj will be accessible on the other box, only the symlink will be 
> the same.
LVM is using /etc/lvm/lvm.conf (on this RH machine anyway) to choose what devices to scan.  
The default on this RHEL5 machine is:

devices {
    dir = "/dev"
    scan = [ "/dev" ]
    filter = [ "a/.*/" ]
    cache = "/etc/lvm/.cache"
    write_cache_state = 1
    sysfs_scan = 1      
    md_component_detection = 1
}

which means that it'll detect anything in /dev and use that.  This has bit me in the past 
(using drbd) but shouldn't be a problem here.  If it is (and you're not using LVM for the 
OS) you could change this to only scan your links instead.

The reason it caused a problem with DRBD (it's a way of doing mirroring between 2 nodes, if 
you've never seen it) is that the PV is visible as the local device as well as the mirrored 
one, and writing changes to the local copy doesn't work (they don't get replicated to the 
other node).

So as long as the /dev/sd is a valid device to use, don't worry about it.

Indeed - if your iSCSI stuff is as straight forward as it sounds, I'd be tempted to stop 
doing the symlinks and just let the LVM stuff do its "thing".

Graham


From rhurst at bidmc.harvard.edu  Wed Oct 24 11:44:53 2007
From: rhurst at bidmc.harvard.edu (rhurst at bidmc.harvard.edu)
Date: Wed, 24 Oct 2007 07:44:53 -0400
Subject: [Linux-cluster] Cman tool leave remove, device or resource
 busy
In-Reply-To: <C343C268.10D6E4%jgray@nicusa.com>
References: <C343C268.10D6E4%jgray@nicusa.com>
Message-ID: <1193226293.8005.3.camel@xw9300.bidmc.harvard.edu>

Yes, to both questions.  I am not familiar enough yet with the 5.0
version, but after reading the bottom of your message, there still is a
fence service registered in its table space.  Is there a separate fence
daemon running? Perhaps a `killall fenced` if there is no separate
service script?


                [root at nfs-6 ~]# cman_tool services
                type             level name     id       state
                fence            0     default  00010003 none
                [1 2 3] 
                

On Tue, 2007-10-23 at 15:23 -0600, Josh Gray wrote:

> Thanks for the reply..     Is this 4.x version you are running?   I
> see you are doing `service cman stop leave`   instead of cman_tool,
>      I?m still getting the same results putting a pause in there.
> 
> Femced isn?t a standalone process in 5.0 as far as I can tell.  If it
> is, then is that the one process stopping me from leaving the cluster?


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071024/2c49bef2/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 2178 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071024/2c49bef2/attachment.p7s>

From pariviere at ippon.fr  Wed Oct 24 11:57:25 2007
From: pariviere at ippon.fr (Pierre-Alain RIVIERE)
Date: Wed, 24 Oct 2007 13:57:25 +0200
Subject: [Linux-cluster] CLVM, iSCSI and naming rules
In-Reply-To: <20071024102450.GA22543@dragonhold.org>
References: <471F19C0.2030203@ippon.fr> <20071024102450.GA22543@dragonhold.org>
Message-ID: <471F3325.1070007@ippon.fr>

After reading your response I've proceed some tests (disconnecting and 
reconnecting iSCSI volumes in a random order) and you're right LVM 
doesn't care that much about which device a iSCSI volume is really 
connected to.

But I will still keep my symlinks for admin purpose, much more simple 
for me.


And by the way thanks for your answer.


Graham Wood wrote:
> On Wed, Oct 24, 2007 at 12:09:04PM +0200, Pierre-Alain RIVIERE wrote:
>   
>> I've also added some udev custom rules to have the same device name on 
>> each box
>>     
> This isn't actually necessary.  The same name makes things easier from the user/admin 
> perspective, but the LVM side of things uses the ID of the pv, rather than the device name.
>
>   
>> This is a normal behavior? And when I will deploy CMAN, CLVM and iSCSI 
>> on another box what will be LVM behaviours? I can't guarantee that 
>> /dev/sdj will be accessible on the other box, only the symlink will be 
>> the same.
>>     
> LVM is using /etc/lvm/lvm.conf (on this RH machine anyway) to choose what devices to scan.  
> The default on this RHEL5 machine is:
>
> devices {
>     dir = "/dev"
>     scan = [ "/dev" ]
>     filter = [ "a/.*/" ]
>     cache = "/etc/lvm/.cache"
>     write_cache_state = 1
>     sysfs_scan = 1      
>     md_component_detection = 1
> }
>
> which means that it'll detect anything in /dev and use that.  This has bit me in the past 
> (using drbd) but shouldn't be a problem here.  If it is (and you're not using LVM for the 
> OS) you could change this to only scan your links instead.
>
> The reason it caused a problem with DRBD (it's a way of doing mirroring between 2 nodes, if 
> you've never seen it) is that the PV is visible as the local device as well as the mirrored 
> one, and writing changes to the local copy doesn't work (they don't get replicated to the 
> other node).
>
> So as long as the /dev/sd is a valid device to use, don't worry about it.
>
> Indeed - if your iSCSI stuff is as straight forward as it sounds, I'd be tempted to stop 
> doing the symlinks and just let the LVM stuff do its "thing".
>
> Graham
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   


From vseagal at liquidnet.com  Wed Oct 24 16:00:01 2007
From: vseagal at liquidnet.com (Vlad Seagal)
Date: Wed, 24 Oct 2007 12:00:01 -0400
Subject: [Linux-cluster] How to configure resource dependency in service
Message-ID: <C344E441.1E4AD%vseagal@liquidnet.com>

Hi All,
I?m trying to configure cluster service that would include the following
resources:
GFS, IP, script (webserver).
<service autostart="1" domain="node1-fail" exclusive="0" name="webserver"
recovery="restart">
                        <clusterfs ref="GFS"/>
                        <ip ref="10.17.123.226"/>
                        <script ref="webserver"/>
                </service>

I put them into the service without any dependency and it works, but I need
to have a control of which resource starts after which.
I need to have the following dependency: GFS -->IP-->webserver
I tried to add shared resource ?IP? to the ?GFS? and then added shared
resource ?webserver? to the ?IP? resource.
So it became like that: GFS
                                    |-->IP
                                           |-->Webserver
                   
The cluster starts GFS, then starts IP but doesn?t start webserver.

I tried to run the following scenario: GFS
                                                        |-->IP
                                                        |-->Webserver

Where ?IP? and ?Webserver? are children of the ?GFS?, but that doesn?t work
as well.

Am I doing something wrong or is it a limitation of the cluster?
How to setup the dependency?


Thanks,

Vlad.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071024/5329300a/attachment.htm>

From jprats at cesca.es  Wed Oct 24 17:29:28 2007
From: jprats at cesca.es (Jordi Prats)
Date: Wed, 24 Oct 2007 19:29:28 +0200
Subject: [Linux-cluster] How to configure resource dependency in service
In-Reply-To: <C344E441.1E4AD%vseagal@liquidnet.com>
References: <C344E441.1E4AD%vseagal@liquidnet.com>
Message-ID: <471F80F8.6070508@cesca.es>

Hi,
You should just put tags inside others:

  <service autostart="1" domain="node1-fail" exclusive="0"
  name="webserver" recovery="restart">
                          <clusterfs ref="GFS">
	                         <ip ref="10.17.123.226">
		                         <script ref="webserver"/>
				</ip>
			</clusterfs>
                  </service>

regards,
Jordi

Vlad Seagal wrote:
> Hi All,
> I?m trying to configure cluster service that would include the following 
> resources:
> GFS, IP, script (webserver).
> <service autostart="1" domain="node1-fail" exclusive="0" 
> name="webserver" recovery="restart">
>                         <clusterfs ref="GFS"/>
>                         <ip ref="10.17.123.226"/>
>                         <script ref="webserver"/>
>                 </service>
> 
> I put them into the service without any dependency and it works, but I 
> need to have a control of which resource starts after which.
> I need to have the following dependency: GFS -->IP-->webserver
> I tried to add shared resource ?IP? to the ?GFS? and then added shared 
> resource ?webserver? to the ?IP? resource.
> So it became like that: GFS
>                                     |-->IP
>                                            |-->Webserver
>                                      
> The cluster starts GFS, then starts IP but doesn?t start webserver.
> 
> I tried to run the following scenario: GFS
>                                                         |-->IP
>                                                         |-->Webserver
> 
> Where ?IP? and ?Webserver? are children of the ?GFS?, but that doesn?t 
> work as well.
> 
> Am I doing something wrong or is it a limitation of the cluster?
> How to setup the dependency?
> 
> 
> Thanks,
> 
> Vlad.
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From teigland at redhat.com  Wed Oct 24 18:50:06 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 24 Oct 2007 13:50:06 -0500
Subject: [Linux-cluster] Cman tool leave remove, device or resource busy
In-Reply-To: <C3425E9C.10CDD8%jgray@nicusa.com>
References: <C3425E9C.10CDD8%jgray@nicusa.com>
Message-ID: <20071024185005.GA15071@redhat.com>

On Mon, Oct 22, 2007 at 02:05:16PM -0600, Josh Gray wrote:
> I see in the FAQ the proper way to leave a cluster is with 'cman_tool leave
> remove'  I am getting a  'device or resource busy' error when I do that even
> with GFS mounts removed and stopped.   What am I missing?  The only service
> I see up is a "fence" type called "default"  do I have to stop one more
> thing?
> 
> Basically doing this:
> 
> service nfs stop
> umount /export
> service gfs stop
> service rgmanager stop
> service clvmd stop
> cman_tool leave remove
> 
> Tells me this:
> cman_tool: Error leaving cluster: Device or resource busy


> [root at nfs-6 ~]# cman_tool services
> type             level name     id       state
> fence            0     default  00010003 none
> [1 2 3]

You need to run 'fence_tool leave' before trying to leave the cluster, or
use 'service cman stop' which does that prior to cman_tool leave.

Dave


From teigland at redhat.com  Wed Oct 24 18:57:11 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 24 Oct 2007 13:57:11 -0500
Subject: [Linux-cluster] Disable GFS quotas
In-Reply-To: <C3440094.11200B%jgray@nicusa.com>
References: <C3440094.11200B%jgray@nicusa.com>
Message-ID: <20071024185711.GB15071@redhat.com>

On Tue, Oct 23, 2007 at 07:48:36PM -0600, Josh Gray wrote:
> Can someone tell me the best way to disable quotas on GFS for good?   I see
> in the docs you are suppose to run this command but it doesn't work across
> reboots or umounts.
> 
> gfs_tool settune /gfs quota_account 0

A "noquota" mount option was added to do this since it was such a common
request.

> As a bonus question, does this listserv have a searchable archive anywhere?

http://marc.info/?l=redhat-linux-cluster

Dave


From vseagal at liquidnet.com  Wed Oct 24 19:21:09 2007
From: vseagal at liquidnet.com (Vlad Seagal)
Date: Wed, 24 Oct 2007 15:21:09 -0400
Subject: [Linux-cluster] How to configure resource dependency in
 service
In-Reply-To: <471F80F8.6070508@cesca.es>
Message-ID: <C3451365.1E57F%vseagal@liquidnet.com>

Thanks Jordi,
I tried that be it doesn't work. IP resource starts but webserver doesn't.
Here is the snap from /var/log/messages:
Oct 24 15:16:32 va0pmlt02 clurgmgrd[29854]: <notice> Starting disabled
service webserver 
Oct 24 15:16:32 va0pmlt02 clurgmgrd: [29854]: <info> Adding IPv4 address
10.17.123.226 to bond0
Oct 24 15:16:33 va0pmlt02 clurgmgrd[29854]: <notice> Service webserver
started 

As you see the resource webserver doesn't get started at all.

Here is the snap from the /etc/cluster/cluster.conf:

 <service autostart="1" domain="node1-fail" exclusive="0" name="webserver"
recovery="restart">
                        <clusterfs ref="GFS">
                                <ip ref="10.17.123.226">
                                        <script ref="webserver"/>
                                </ip>
                        </clusterfs>
                </service>


The strange thing is it starts the IP but not the webserver, it doesn't even
try to start webserver.

I'm not sure what is the reason.

Vlad.
On 10/24/07 1:29 PM, "Jordi Prats" <jprats at cesca.es> wrote:

> Hi,
> You should just put tags inside others:
> 
>   <service autostart="1" domain="node1-fail" exclusive="0"
>   name="webserver" recovery="restart">
>                           <clusterfs ref="GFS">
>                         <ip ref="10.17.123.226">
>                         <script ref="webserver"/>
> </ip>
> </clusterfs>
>                   </service>
> 
> regards,
> Jordi
> 
> Vlad Seagal wrote:
>> Hi All,
>> I?m trying to configure cluster service that would include the following
>> resources:
>> GFS, IP, script (webserver).
>> <service autostart="1" domain="node1-fail" exclusive="0"
>> name="webserver" recovery="restart">
>>                         <clusterfs ref="GFS"/>
>>                         <ip ref="10.17.123.226"/>
>>                         <script ref="webserver"/>
>>                 </service>
>> 
>> I put them into the service without any dependency and it works, but I
>> need to have a control of which resource starts after which.
>> I need to have the following dependency: GFS -->IP-->webserver
>> I tried to add shared resource ?IP? to the ?GFS? and then added shared
>> resource ?webserver? to the ?IP? resource.
>> So it became like that: GFS
>>                                     |-->IP
>>                                            |-->Webserver
>>                 
>> The cluster starts GFS, then starts IP but doesn?t start webserver.
>> 
>> I tried to run the following scenario: GFS
>>                                                         |-->IP
>>                                                         |-->Webserver
>> 
>> Where ?IP? and ?Webserver? are children of the ?GFS?, but that doesn?t
>> work as well.
>> 
>> Am I doing something wrong or is it a limitation of the cluster?
>> How to setup the dependency?
>> 
>> 
>> Thanks,
>> 
>> Vlad.
>> 
>> 
>> ------------------------------------------------------------------------
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Vlad Seagal
Unix System Administrator
Liquidnet Holdings, Inc.
vseagal at liquidnet.com
T 646.674.2268


From Abdel.Sadek at lsi.com  Wed Oct 24 20:08:46 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Wed, 24 Oct 2007 14:08:46 -0600
Subject: [Linux-cluster] Uninstalling RHEL 4.5 and RHEL 5.0 cluster
Message-ID: <C776378855970A4DADE4A476447F6391F79C18@NAMAIL3.ad.lsil.com>

What's the best way to uninstall the RHEL 4.5 and RHEL 5.0 cluster?

 
Thanks.

Abdel...

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071024/ca6c65ef/attachment.htm>

From lhh at redhat.com  Thu Oct 25 14:43:47 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 25 Oct 2007 10:43:47 -0400
Subject: [Linux-cluster] Downing secondary interface if primary fails?
In-Reply-To: <Pine.LNX.4.64.0710240822410.32717@skynet.shatteredsilicon.net>
References: <C3425E9C.10CDD8%jgray@nicusa.com>
	<1193142810.6034.3.camel@xw9300.bidmc.harvard.edu>
	<Pine.LNX.4.64.0710231622250.25898@skynet.shatteredsilicon.net>
	<1193172562.4831.0.camel@ayanami.boston.devel.redhat.com>
	<Pine.LNX.4.64.0710240822410.32717@skynet.shatteredsilicon.net>
Message-ID: <1193323427.4831.40.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-24 at 08:24 +0100, gordan at bobich.net wrote:
> On Tue, 23 Oct 2007, Lon Hohberger wrote:
> 
> >> Say a machine is a part of a cluster. It has eth0 for WAN accessm and eth1
> >> for SAN/cluster access. Say eth1 fails - is there a way to get eth0 to
> >> shut down as well, so the machine doesn't appear to be alive and thus get
> >> traffic routed to it?
> >
> > Use fencing to power the node off?
> 
> Can't power it off in software. :-(

No, it requires a remote power switch.  However, the remaining nodes in
the cluster will need to fence the dead node anyway (being cut off from
the cluster will make the node need to be fenced)...

Internally (i.e. local to the node that's been cut off), you could use:

http://mon.wiki.kernel.org/index.php/Main_Page

-- Lon


From rnevins at workflowbydesign.com  Thu Oct 25 17:27:52 2007
From: rnevins at workflowbydesign.com (Rob Nevins)
Date: Thu, 25 Oct 2007 13:27:52 -0400
Subject: [Linux-cluster] Mount GFS volume question
Message-ID: <4720D218.8050309@workflowbydesign.com>

Hello,

I have a xen environment in RHEL5, 3 web servers using gfs for shared 
data.  I'd like to install our EMC backup client on the hypervisor to 
back log files, however, I am not sure how to temporarily mount the gfs 
volume to do this.  Obviously I do not want to break connectivity in the 
existing cluster.  Any ideas?

I would consider this, but wnat to be sure:

mount -t gfs /dev/VolGroup00/LogVol01 /mount/data -o lockproto=lock_nolock

Thanks!

-Rob


From lhh at redhat.com  Thu Oct 25 17:44:56 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 25 Oct 2007 13:44:56 -0400
Subject: [Linux-cluster] How to configure resource dependency in service
In-Reply-To: <C344E441.1E4AD%vseagal@liquidnet.com>
References: <C344E441.1E4AD%vseagal@liquidnet.com>
Message-ID: <1193334296.20482.6.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-24 at 12:00 -0400, Vlad Seagal wrote:
> Hi All,
> I?m trying to configure cluster service that would include the
> following resources:
> GFS, IP, script (webserver).
> <service autostart="1" domain="node1-fail" exclusive="0"
> name="webserver" recovery="restart">
>                         <clusterfs ref="GFS"/>
>                         <ip ref="10.17.123.226"/>
>                         <script ref="webserver"/>
>                 </service>
> 
> I put them into the service without any dependency and it works, but I
> need to have a control of which resource starts after which. 

Actually, since this is the most common dependency used by
customers/users, no matter what order you put those 3 resources in
cluster.conf as a child of the "service"...

> I need to have the following dependency: GFS -->IP-->webserver

... they will start in this order:
  
  (1) start fs/clusterfs
  (2) start ip
  (3) start script

... and stop in this order:

  (1) stop script
  (2) stop IP
  (3) stop fs/clusterfs

See:

  /usr/share/cluster/service.sh

The "child" start/stop levels control what classes of resources are
started before other classes of resources.  This specification (in
service.sh) only affects children of the "service" quasi-resource; other
resources may provide different start/stop orders, but usually they
don't.

-- Lon


From lhh at redhat.com  Thu Oct 25 17:47:36 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 25 Oct 2007 13:47:36 -0400
Subject: [Linux-cluster] How to configure resource dependency in service
In-Reply-To: <C3451365.1E57F%vseagal@liquidnet.com>
References: <C3451365.1E57F%vseagal@liquidnet.com>
Message-ID: <1193334456.20482.10.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-24 at 15:21 -0400, Vlad Seagal wrote:
> As you see the resource webserver doesn't get started at all.
> 
> Here is the snap from the /etc/cluster/cluster.conf:
> 
>  <service autostart="1" domain="node1-fail" exclusive="0" name="webserver"
> recovery="restart">
>                         <clusterfs ref="GFS">
>                                 <ip ref="10.17.123.226">
>                                         <script ref="webserver"/>
>                                 </ip>
>                         </clusterfs>
>                 </service>

That should work as well.  If it doesn't, try running:

clusvcadm -d webserver
rg_test test /etc/cluster/cluster.conf start service webserver


(before re-enabling, run:)
rg_test test /etc/cluster/cluster.conf stop service webserver

-- Lon


From lhh at redhat.com  Thu Oct 25 18:02:07 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 25 Oct 2007 14:02:07 -0400
Subject: [Linux-cluster] Uninstalling RHEL 4.5 and RHEL 5.0 cluster
In-Reply-To: <C776378855970A4DADE4A476447F6391F79C18@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391F79C18@NAMAIL3.ad.lsil.com>
Message-ID: <1193335327.20482.20.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-24 at 14:08 -0600, Sadek, Abdel wrote:
> What?s the best way to uninstall the RHEL 4.5 and RHEL 5.0 cluster?

rpm -e ?

I think this query should return all the packages you'd want to erase on
either RHEL4 or RHEL5:

rpm -qa | grep "^\(cman\|dlm\|rgmanager\|gnbd\|gfs\|magma\|ccs\|GFS\|
lvm2-cluster\|ricci\|luci\|openais\|system-config-cluster\|fence\|iddev
\|gulm\)"

-- Lon


From vseagal at liquidnet.com  Thu Oct 25 18:35:06 2007
From: vseagal at liquidnet.com (Vlad Seagal)
Date: Thu, 25 Oct 2007 14:35:06 -0400
Subject: [Linux-cluster] How to configure resource dependency in
 service
In-Reply-To: <1193334456.20482.10.camel@ayanami.boston.devel.redhat.com>
Message-ID: <C3465A1A.1EB3F%vseagal@liquidnet.com>

Lon,
Thanks for your reply. I did the rg_test and that worked. I was able to
start the service:
[root at va0pmlt01 ~]# rg_test test /etc/cluster/cluster.conf start service
webserver
Running in test mode.
Starting webserver...
<debug>  Link for bond0: Detected
<info>   Adding IPv4 address 10.17.123.226 to bond0
<debug>  Sending gratuitous ARP: 10.17.123.226 00:1c:c4:41:a4:1c brd
ff:ff:ff:ff:ff:ff
<info>   Executing /etc/init.d/httpd start
Starting httpd:                                            [  OK  ]
Start of webserver complete
[root at va0pmlt01 ~]# ps -ef|grep http
root     28637     1  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28638 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28639 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28641 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28642 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28644 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28645 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28646 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
apache   28647 28637  0 14:23 ?        00:00:00 /usr/sbin/httpd
root     28661  8355  0 14:23 pts/0    00:00:00 grep http
[root at va0pmlt01 ~]#
[root at va0pmlt01 ~]# rg_test test /etc/cluster/cluster.conf stop service
webserver
Running in test mode.
Stopping webserver...
<info>   Executing /etc/init.d/httpd stop
Stopping httpd:                                            [  OK  ]
<info>   Removing IPv4 address 10.17.123.226 from bond0
Stop of webserver complete
[root at va0pmlt01 ~]#
[root at va0pmlt01 ~]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  va0pmlt01                                Online, Local, rgmanager
  va0pmlt02                                Online, rgmanager
  va0pmlt03                                Online, rgmanager
  va0pmlt04                                Online, rgmanager
  va0pmlt05                                Online, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  webserver            (va0pmlt01)                    disabled

[root at va0pmlt01 ~]# clusvcadm -e webserver -m va0pmlt01
Member va0pmlt01 trying to enable webserver...success
Service webserver is now running on va0pmlt01
[root at va0pmlt01 ~]#
[root at va0pmlt01 ~]# ps -ef|grep http
root     30227  8355  0 14:32 pts/0    00:00:00 grep http
[root at va0pmlt01 ~]#
[root at va0pmlt01 ~]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  va0pmlt01                                Online, Local, rgmanager
  va0pmlt02                                Online, rgmanager
  va0pmlt03                                Online, rgmanager
  va0pmlt04                                Online, rgmanager
  va0pmlt05                                Online, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  webserver            va0pmlt01                      started


As you can see the service doesn't start properly if it's started via
clusvcadm command. Http is not running. Do you know what might be the
problem.

Thanks,
Vlad.


On 10/25/07 1:47 PM, "Lon Hohberger" <lhh at redhat.com> wrote:

> On Wed, 2007-10-24 at 15:21 -0400, Vlad Seagal wrote:
>> As you see the resource webserver doesn't get started at all.
>> 
>> Here is the snap from the /etc/cluster/cluster.conf:
>> 
>>  <service autostart="1" domain="node1-fail" exclusive="0" name="webserver"
>> recovery="restart">
>>                         <clusterfs ref="GFS">
>>                                 <ip ref="10.17.123.226">
>>                                         <script ref="webserver"/>
>>                                 </ip>
>>                         </clusterfs>
>>                 </service>
> 
> That should work as well.  If it doesn't, try running:
> 
> clusvcadm -d webserver
> rg_test test /etc/cluster/cluster.conf start service webserver
> 
> 
> (before re-enabling, run:)
> rg_test test /etc/cluster/cluster.conf stop service webserver
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Vlad Seagal
Unix System Administrator
Liquidnet Holdings, Inc.
vseagal at liquidnet.com
T 646.674.2268


From jliu at liquidnet.com  Thu Oct 25 19:13:53 2007
From: jliu at liquidnet.com (Jeff Liu)
Date: Thu, 25 Oct 2007 15:13:53 -0400
Subject: [Linux-cluster] strange modprobe drops VIP & fails service
In-Reply-To: <C3465A1A.1EB3F%vseagal@liquidnet.com>
Message-ID: <AAD74D664EBCB042BEB418AD2368C7F901F6FB2E@ny0ech01.LNHOLDINGS.COM>


I'm experiencing something similar.  I was wondering if there was some
additional information available regarding the "e1000 + multiple NIC
problem".

Thanks in advance,
Jeff


On Fri, 2006-02-10 at 07:16 -0500, danwest wrote:
> Has anyone every seen something like this?  A modprobe with either a
> "module 0" or a blank "module " is always followed by the interface
> loosing its VIP and thus initiating a service failure.  All non
> essential system services were turned off.  Not sure what is
> calling/causing the strange modprobe.  Logs are clean, no other events
> leading up to this.  I am now wondering if it could be the cluster
> itself?
> 
> Feb  9 13:44:32 node2 modprobe: modprobe: Can't locate module 0
> Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: IP
> address 147.107.188.81 missing
> Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: 0:
error
> fetching interface information: Device not found
> Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: Check
> status failed on IP addresses for mservice1

Sounds like the e1000 + multiple NIC problem.

-- Lon


From lhh at redhat.com  Thu Oct 25 23:04:14 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 25 Oct 2007 19:04:14 -0400
Subject: [Linux-cluster] How to configure resource dependency in service
In-Reply-To: <C3465A1A.1EB3F%vseagal@liquidnet.com>
References: <C3465A1A.1EB3F%vseagal@liquidnet.com>
Message-ID: <1193353454.20482.82.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-25 at 14:35 -0400, Vlad Seagal wrote:

> [root at va0pmlt01 ~]# clusvcadm -e webserver -m va0pmlt01
> Member va0pmlt01 trying to enable webserver...success
> Service webserver is now running on va0pmlt01
> [root at va0pmlt01 ~]#
> [root at va0pmlt01 ~]# ps -ef|grep http
> root     30227  8355  0 14:32 pts/0    00:00:00 grep http
> [root at va0pmlt01 ~]#
> [root at va0pmlt01 ~]# clustat
> Member Status: Quorate
> 
>   Member Name                              Status
>   ------ ----                              ------
>   va0pmlt01                                Online, Local, rgmanager
>   va0pmlt02                                Online, rgmanager
>   va0pmlt03                                Online, rgmanager
>   va0pmlt04                                Online, rgmanager
>   va0pmlt05                                Online, rgmanager
> 
>   Service Name         Owner (Last)                   State
>   ------- ----         ----- ------                   -----
>   webserver            va0pmlt01                      started
> 
> 
> As you can see the service doesn't start properly if it's started via
> clusvcadm command. Http is not running. Do you know what might be the
> problem.

That's very strange.  It's quite literally the same code that runs the
tree in rg_test and rgmanager - it's almost like maybe rgmanager  didn't
get a configuration update or something (missing ccs_tool update?).  

What code base do you have currently?

The IP resource only says "nfsexport" and "nfsclient" are illegal as
children, so <script> should work.

-- Lon


From orkcu at yahoo.com  Fri Oct 26 01:55:15 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Thu, 25 Oct 2007 18:55:15 -0700 (PDT)
Subject: [Linux-cluster] How to configure resource dependency in service
In-Reply-To: <C3465A1A.1EB3F%vseagal@liquidnet.com>
Message-ID: <203255.46175.qm@web50604.mail.re2.yahoo.com>


--- Vlad Seagal <vseagal at liquidnet.com> wrote:

> Lon,
> Thanks for your reply. I did the rg_test and that
> worked. I was able to
> start the service:

> <info>   Executing /etc/init.d/httpd start
> Starting httpd:                                     
>       [  OK  ]
> Start of webserver complete
> [root at va0pmlt01 ~]# ps -ef|grep http
> root     28637     1  0 14:23 ?        00:00:00
> /usr/sbin/httpd
> [root at va0pmlt01 ~]#


> [root at va0pmlt01 ~]# clusvcadm -e webserver -m
> va0pmlt01
> Member va0pmlt01 trying to enable
> webserver...success
> Service webserver is now running on va0pmlt01
> [root at va0pmlt01 ~]#
> [root at va0pmlt01 ~]#
> [root at va0pmlt01 ~]# clustat
> Member Status: Quorate
>   ------- ----         ----- ------                 
>   webserver            va0pmlt01                    
>  started
> 
> 
> As you can see the service doesn't start properly if
> it's started via
> clusvcadm command. Http is not running. Do you know
> what might be the
> problem.

and nothing came into httpd error message file ?
if it is something about timing that make httpd unable
to start then the error should be printed in error.log

if it something in cluster infrastructure
(rgmanager)...
never had happened to me, maybe in that case it is
some  old cluster.conf still active.

the "active" cluster.conf number is the same that it
is in the cluster.conf file?
manual change of that number and a forced propagation
of the new conf to the all cluster should fix that

cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From isplist at logicore.net  Fri Oct 26 14:29:01 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 26 Oct 2007 09:29:01 -0500
Subject: [Linux-cluster] Howto? Remove volumes
Message-ID: <200710269291.787386@leena>

After using LUN masking and zoning to make sure that hosts didn't access other 
hosts data space, I find myself with lost volumes on some of the hosts. Since 
I didn't remove them before making the changes, I am now stuck with the 
following;

# lvscan
  /dev/sda: read failed after 0 of 4096 at 0: Input/output error
  ACTIVE            '/dev/VolGroup03/web' [475.62 GB] inherit

# vgscan
  Reading all physical volumes.  This may take a while...
  /dev/sda: read failed after 0 of 4096 at 0: Input/output error
  /dev/sda: read failed after 0 of 4096 at 1073676288: Input/output error
  /dev/sda: read failed after 0 of 4096 at 0: Input/output error
  Found volume group "VolGroup03" using metadata type lvm2

# pvscan
  /dev/sda: read failed after 0 of 4096 at 0: Input/output error
  /dev/sda: read failed after 0 of 4096 at 1073676288: Input/output error
  /dev/sda: read failed after 0 of 4096 at 0: Input/output error
  PV /dev/sdb   VG VolGroup03   lvm2 [475.62 GB / 0    free]
  Total: 1 [475.62 GB] / in use: 1 [475.62 GB] / in no VG: 0 [0   ]

With the changes, /dev/sda and /dev/sdc (which doesn't show anymore) no longer 
exists. How can I safely remove the sda references?

Mike


From srigler at marathonoil.com  Fri Oct 26 14:42:31 2007
From: srigler at marathonoil.com (Steve Rigler)
Date: Fri, 26 Oct 2007 09:42:31 -0500
Subject: [Linux-cluster] Howto? Remove volumes
In-Reply-To: <200710269291.787386@leena>
References: <200710269291.787386@leena>
Message-ID: <1193409751.6097.30.camel@houuc8>

On Fri, 2007-10-26 at 09:29 -0500, isplist at logicore.net wrote:
> After using LUN masking and zoning to make sure that hosts didn't access other 
> hosts data space, I find myself with lost volumes on some of the hosts. Since 
> I didn't remove them before making the changes, I am now stuck with the 
> following;
> 
> # lvscan
>   /dev/sda: read failed after 0 of 4096 at 0: Input/output error
>   ACTIVE            '/dev/VolGroup03/web' [475.62 GB] inherit
> 
> # vgscan
>   Reading all physical volumes.  This may take a while...
>   /dev/sda: read failed after 0 of 4096 at 0: Input/output error
>   /dev/sda: read failed after 0 of 4096 at 1073676288: Input/output error
>   /dev/sda: read failed after 0 of 4096 at 0: Input/output error
>   Found volume group "VolGroup03" using metadata type lvm2
> 
> # pvscan
>   /dev/sda: read failed after 0 of 4096 at 0: Input/output error
>   /dev/sda: read failed after 0 of 4096 at 1073676288: Input/output error
>   /dev/sda: read failed after 0 of 4096 at 0: Input/output error
>   PV /dev/sdb   VG VolGroup03   lvm2 [475.62 GB / 0    free]
>   Total: 1 [475.62 GB] / in use: 1 [475.62 GB] / in no VG: 0 [0   ]
> 
> With the changes, /dev/sda and /dev/sdc (which doesn't show anymore) no longer 
> exists. How can I safely remove the sda references?
> 
> Mike
> 

Usually you'd want to do this prior to zoning away a device, but "echo 1
> /sys/block/sda/device/delete" should get rid of it.

-Steve


From isplist at logicore.net  Fri Oct 26 15:02:38 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 26 Oct 2007 10:02:38 -0500
Subject: [Linux-cluster] Howto? Remove volumes
In-Reply-To: <1193409751.6097.30.camel@houuc8>
Message-ID: <2007102610238.529681@leena>

Hi Steve,

> Usually you'd want to do this prior to zoning away a device,

Ya, I know :). I should have done it before but I was testing stuff and it 
just happened to work out as I needed it so left it that way.

>but "echo 1 /sys/block/sda/device/delete" should get rid of it.

There is no delete file in that directory. There is a delete file in a 
subdirectory of this called device however.

I was wondering if I could just delete the references in /etc/lvm perhaps?


From jliu at liquidnet.com  Fri Oct 26 15:09:19 2007
From: jliu at liquidnet.com (Jeff Liu)
Date: Fri, 26 Oct 2007 11:09:19 -0400
Subject: [Linux-cluster] strange modprobe drops VIP & fails service
In-Reply-To: <AAD74D664EBCB042BEB418AD2368C7F901F6FB2E@ny0ech01.LNHOLDINGS.COM>
Message-ID: <AAD74D664EBCB042BEB418AD2368C7F901F6FDC0@ny0ech01.LNHOLDINGS.COM>

I've haven't received any replies, so just to bump the question again.

*bump*

Thanks in advance.


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jeff Liu
Sent: Thursday, October 25, 2007 3:14 PM
To: linux clustering
Subject: [Linux-cluster] strange modprobe drops VIP & fails service


I'm experiencing something similar.  I was wondering if there was some
additional information available regarding the "e1000 + multiple NIC
problem".

Thanks in advance,
Jeff


On Fri, 2006-02-10 at 07:16 -0500, danwest wrote:
> Has anyone every seen something like this?  A modprobe with either a
> "module 0" or a blank "module " is always followed by the interface
> loosing its VIP and thus initiating a service failure.  All non
> essential system services were turned off.  Not sure what is
> calling/causing the strange modprobe.  Logs are clean, no other events
> leading up to this.  I am now wondering if it could be the cluster
> itself?
> 
> Feb  9 13:44:32 node2 modprobe: modprobe: Can't locate module 0
> Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: IP
> address 147.107.188.81 missing
> Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: 0:
error
> fetching interface information: Device not found
> Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: Check
> status failed on IP addresses for mservice1

Sounds like the e1000 + multiple NIC problem.

-- Lon


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From breeves at redhat.com  Fri Oct 26 15:07:39 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Fri, 26 Oct 2007 16:07:39 +0100
Subject: [Linux-cluster] Howto? Remove volumes
In-Reply-To: <2007102610238.529681@leena>
References: <2007102610238.529681@leena>
Message-ID: <472202BB.3010603@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

isplist at logicore.net wrote:
> Ya, I know :). I should have done it before but I was testing stuff and it 
> just happened to work out as I needed it so left it that way.
> 
>> but "echo 1 /sys/block/sda/device/delete" should get rid of it.
> 
> There is no delete file in that directory. There is a delete file in a 
> subdirectory of this called device however.

That's where it should be - /sys/block/<name>/device/delete.

> I was wondering if I could just delete the references in /etc/lvm perhaps?

No - that's just a cache reference. As long as the /dev/sd* nodes exist
(and the corresponding /sys/block/sd* entries), the next lvm scan will
pick the devices up and re-add them to the cache.

You could delete the device nodes and/or filter them via lvm.conf, but
since you want them gone (and have masked them from the SAN), you may as
well just delete them.

Regards,
Bryn.


-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHIgK76YSQoMYUY94RAg7TAJ9SYfQzNOSLMmhpbSrp7WPMwFC5+ACfRtPO
fDg7T0KupxmGPxMh6fihBLo=
=/0Io
-----END PGP SIGNATURE-----


From scottb at bxwa.com  Fri Oct 26 15:21:40 2007
From: scottb at bxwa.com (Scott Becker)
Date: Fri, 26 Oct 2007 08:21:40 -0700
Subject: [Linux-cluster] Fencing Race Question
Message-ID: <47220604.5070504@bxwa.com>

In the FAQ, Fencing Questions
13. What's the right way to configure fencing when I have redundant 
power supplies?

I'm going to setup the second example.
My concern is about a race condition with the two devices:

|<device name="pwr01" option="off" switch="1" port="1"/>
<device name="pwr02" option="off" switch="1" port="1"/>|

If I had only one power switch, the first node to login successfully 
turns the other off (APC with one IP address). My concern is with two 
APCs, that whoever loses the race to login to APC #1 may win the race to 
login to APC #2 and then each will turn off only one of the other's 
power supplies.

Is the fencing execution script designed to 1) perform all necessary 
fencing device logins successfully then and only then 2) issue the 
poweroff commands? Thereby averting a potential race because the loser 
of the race for APC #1 will give up and hand over device #2.

    thanks
    newby
    scottb

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071026/fb5b7b5e/attachment.htm>

From addi at hugsmidjan.is  Fri Oct 26 15:29:49 2007
From: addi at hugsmidjan.is (=?ISO-8859-1?Q?S=E6valdur?= Arnar Gunnarsson)
Date: Fri, 26 Oct 2007 15:29:49 +0000
Subject: [Linux-cluster] How to take down a CS/GFS setup with minimum
	downtime
Message-ID: <1193412589.5963.2.camel@addi.hugsmidjan.is>

I've got five RHEL4 systems with CS and about 800 GB of data on a shared
GFS filesystem.
I've been tasked to take down the cluster and divide the content of the
shared GFS filesystem to the local disks on each system with minimum
downtime.

I've removed two nodes from the cluster already and am somewhat scared
of a quorum problem if I remove another node.

From what I've been able to gather I should use cman_tool leave remove
on a node once it is ready to leave the cluster and thus be able to
remove four nodes from a five node cluster without disolving the quorum
or risking losing access to the GFS data on the last remaining node.

Is that correct ?

With best regards,
-- 
Saevaldur Arnar Gunnarsson
System Administrator | RHCE

Hugsmidjan ehf.
Snorrabraut 56 | 105 Reykjavik
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/x-pkcs7-signature
Size: 4581 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071026/753f9886/attachment.bin>

From chris at cmiware.com  Fri Oct 26 15:36:38 2007
From: chris at cmiware.com (Chris Harms)
Date: Fri, 26 Oct 2007 10:36:38 -0500
Subject: [Linux-cluster] Fencing Race Question
In-Reply-To: <47220604.5070504@bxwa.com>
References: <47220604.5070504@bxwa.com>
Message-ID: <47220986.801@cmiware.com>

We have our APC power strips group the 2 ports together. The fence agent 
should login to only one IP and turn off the grouped ports instead of 
logging in to each unit separately to turn off power.


Scott Becker wrote:
> In the FAQ, Fencing Questions
> 13. What's the right way to configure fencing when I have redundant 
> power supplies?
>
> I'm going to setup the second example.
> My concern is about a race condition with the two devices:
>
> |<device name="pwr01" option="off" switch="1" port="1"/>
> <device name="pwr02" option="off" switch="1" port="1"/>|
>
> If I had only one power switch, the first node to login successfully 
> turns the other off (APC with one IP address). My concern is with two 
> APCs, that whoever loses the race to login to APC #1 may win the race 
> to login to APC #2 and then each will turn off only one of the other's 
> power supplies.
>
> Is the fencing execution script designed to 1) perform all necessary 
> fencing device logins successfully then and only then 2) issue the 
> poweroff commands? Thereby averting a potential race because the loser 
> of the race for APC #1 will give up and hand over device #2.
>
>     thanks
>     newby
>     scottb
>
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Fri Oct 26 15:43:40 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 26 Oct 2007 10:43:40 -0500
Subject: [Linux-cluster] Howto? Remove volumes
In-Reply-To: <472208D3.20000@redhat.com>
Message-ID: <20071026104340.606075@leena>

Reference removed by going to the real path;

Simlink path was;
/sys/block/sda/~device/delete
Real path was;
device -> ../../devices/pci0000:00/0000:00:11.0/host0/target0:0:0/0:0:0:0

 From within real path;
echo 1 > /sys/block/sda/device/delete

> OK - that should be the correct file. Does it still say "No such file or
> directory" if you echo it into the target of the symlink directly?
> 
> I just tried this out on a 42.EL kernel here & it's working OK for me
> (local storage though).

That worked. I thought I tried that earlier but perhaps I didn't. 
Thanks much for the help.

Mike


From breeves at redhat.com  Fri Oct 26 15:44:59 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Fri, 26 Oct 2007 16:44:59 +0100
Subject: [Linux-cluster] Howto? Remove volumes
In-Reply-To: <20071026104340.606075@leena>
References: <20071026104340.606075@leena>
Message-ID: <47220B7B.9020901@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

isplist at logicore.net wrote:
> Reference removed by going to the real path;
> 
> Simlink path was;
> /sys/block/sda/~device/delete
> Real path was;
> device -> ../../devices/pci0000:00/0000:00:11.0/host0/target0:0:0/0:0:0:0
> 
>  From within real path;
> echo 1 > /sys/block/sda/device/delete

Really weird! Do you actually have the "~" character in the path name
that appears in /sys/block/sda?

I'm wondering if this is the cause of the problem since the shell might
be expanding that to be the path to the home directory of a user named
"device" (just a hunch - could be total nonsense!).

> That worked. I thought I tried that earlier but perhaps I didn't. 
> Thanks much for the help.

No problem, glad it's working now!

Cheers,
Bryn.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHIgt76YSQoMYUY94RAjHDAKDTGUQSxrh9UO3F2OToOHp+Y9Nc2wCgjm7s
5WIVPbGjUD7nnfz6TVCcRtw=
=gHVm
-----END PGP SIGNATURE-----


From scottb at bxwa.com  Fri Oct 26 15:47:44 2007
From: scottb at bxwa.com (Scott Becker)
Date: Fri, 26 Oct 2007 08:47:44 -0700
Subject: [Linux-cluster] Fencing Race Question
In-Reply-To: <47220986.801@cmiware.com>
References: <47220604.5070504@bxwa.com> <47220986.801@cmiware.com>
Message-ID: <47220C20.3070203@bxwa.com>

To clarify, example #2 in the faq: redundant power feeds, redundant APCs 
- no single point of failure. Provides High availability but complicates 
fencing and race prevention.

This necessitates two power switch logins to successfully fence. I'll 
test by manually logging in to one device and tell luci to fence a node. 
With my manual login getting in the way, the fence operation should fail 
and turn off no ports and not prevent manual login to other apc, no 
matter which switch I manually log into. I haven't got a chance to test 
this yet and I wanted to tap into your wisdom to test my logic.

    thanks
    scottb


Chris Harms wrote:
> We have our APC power strips group the 2 ports together. The fence 
> agent should login to only one IP and turn off the grouped ports 
> instead of logging in to each unit separately to turn off power.
>
>
> Scott Becker wrote:
>> In the FAQ, Fencing Questions
>> 13. What's the right way to configure fencing when I have redundant 
>> power supplies?
>>
>> I'm going to setup the second example.
>> My concern is about a race condition with the two devices:
>>
>> |<device name="pwr01" option="off" switch="1" port="1"/>
>> <device name="pwr02" option="off" switch="1" port="1"/>|
>>
>> If I had only one power switch, the first node to login successfully 
>> turns the other off (APC with one IP address). My concern is with two 
>> APCs, that whoever loses the race to login to APC #1 may win the race 
>> to login to APC #2 and then each will turn off only one of the 
>> other's power supplies.
>>
>> Is the fencing execution script designed to 1) perform all necessary 
>> fencing device logins successfully then and only then 2) issue the 
>> poweroff commands? Thereby averting a potential race because the 
>> loser of the race for APC #1 will give up and hand over device #2.
>>
>>     thanks
>>     newby
>>     scottb
>>
>> ------------------------------------------------------------------------
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From james at cloud9.co.uk  Fri Oct 26 16:12:20 2007
From: james at cloud9.co.uk (James Fidell)
Date: Fri, 26 Oct 2007 17:12:20 +0100
Subject: [Linux-cluster] updates to fence_apc
Message-ID: <472211E4.9050905@cloud9.co.uk>

Although the fence_apc script in cman-2.0.64-1.0.1.el5 has 79xx support,
it wouldn't work with a 7920 I'm using.  I have patches to fix that and
also allow the port to be named rather than using the default "Outlet N"
numbering scheme.

Has this already been fixed in the development stream?  And if not, how
do I submit my patches for consideration for inclusion?

James


From james at cloud9.co.uk  Fri Oct 26 18:37:49 2007
From: james at cloud9.co.uk (James Fidell)
Date: Fri, 26 Oct 2007 19:37:49 +0100
Subject: [Linux-cluster] Problem with fence_ilo
Message-ID: <472233FD.5030202@cloud9.co.uk>

Again in cman-2.0.64-1.0.1.el5, fence_ilo reports an error from an iLO2
session saying power cannot be turned back on without TOGGLE = "Yes"
being set.  The unit reports that it uses RIBCL 2.22, which appears to
be supported in the code.

Can anyone confirm that their fence_ilo script works without
modification in such a configuration?  I've changed the script to
send

  <HOLD_PWR_BTN TOGGLE = "Yes"/>

and that does work for me, but I don't know if it's valid for all
v2.22 iLO2 units.

James


From lhh at redhat.com  Fri Oct 26 18:58:01 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 26 Oct 2007 14:58:01 -0400
Subject: [Linux-cluster] How to configure resource dependency in service
In-Reply-To: <203255.46175.qm@web50604.mail.re2.yahoo.com>
References: <203255.46175.qm@web50604.mail.re2.yahoo.com>
Message-ID: <1193425082.9223.19.camel@ayanami.boston.devel.redhat.com>

On Thu, 2007-10-25 at 18:55 -0700, Roger Pe?a wrote:
> --- Vlad Seagal <vseagal at liquidnet.com> wrote:

> the "active" cluster.conf number is the same that it
> is in the cluster.conf file?
> manual change of that number and a forced propagation
> of the new conf to the all cluster should fix that

Possibly; to be sure, try incrementing the configuration version # in
cluster.conf and running:

   ccs_tool update /etc/cluster/cluster.conf

-- Lon


From lhh at redhat.com  Fri Oct 26 19:10:39 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 26 Oct 2007 15:10:39 -0400
Subject: [Linux-cluster] Fencing Race Question
In-Reply-To: <47220604.5070504@bxwa.com>
References: <47220604.5070504@bxwa.com>
Message-ID: <1193425839.9223.33.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-26 at 08:21 -0700, Scott Becker wrote:
> In the FAQ, Fencing Questions
> 13. What's the right way to configure fencing when I have redundant
> power supplies?
> 
> I'm going to setup the second example.
> My concern is about a race condition with the two devices:
> 
> <device name="pwr01" option="off" switch="1" port="1"/>
> <device name="pwr02" option="off" switch="1" port="1"/>

> If I had only one power switch, the first node to login successfully
> turns the other off (APC with one IP address). My concern is with two
> APCs, that whoever loses the race to login to APC #1 may win the race
> to login to APC #2 and then each will turn off only one of the other's
> power supplies.

If it fails to log in to power supply #1, fencing has failed, and it
doesn't bother trying fencing option #2.  It then pauses for several
seconds before retrying.

It's not an "either-or" situation.  Both have to complete - in sequence
& successfully - or fencing has failed and the entire operation must be
retried.


> Is the fencing execution script designed to 1) perform all necessary
> fencing device logins successfully then and only then 2) issue the
> poweroff commands? Thereby averting a potential race because the loser
> of the race for APC #1 will give up and hand over device #2.

In the absolute worst case (which I think is very, very unlikely):

node1 powers off node2-PS1    node2 fails to log in to SW1
.                             node2 waits a bit
.                             node2 powers off node1-PS1
node1 powers off node2-PS1

Also, note that the "fence race" case typically applies in a partition
of the cluster network where fencing is still accessible.  This
configuration is not recommended for two node clusters; it's recommended
that the cluster communicates and fences over the same network:

http://sources.redhat.com/cluster/faq.html#two_node_correct

Basically, the theory goes like this:

If you pull the ethernet cable on a node and the fence device
controlling the other node is on the same path, it will have some
difficulty accessing the fence device, and will usually lose the "fence
race".

-- Lon


From lhh at redhat.com  Fri Oct 26 19:12:04 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 26 Oct 2007 15:12:04 -0400
Subject: [Linux-cluster] Fencing Race Question
In-Reply-To: <47220C20.3070203@bxwa.com>
References: <47220604.5070504@bxwa.com> <47220986.801@cmiware.com>
	<47220C20.3070203@bxwa.com>
Message-ID: <1193425924.9223.35.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-26 at 08:47 -0700, Scott Becker wrote:
> To clarify, example #2 in the faq: redundant power feeds, redundant APCs 
> - no single point of failure. Provides High availability but complicates 
> fencing and race prevention.
> 
> This necessitates two power switch logins to successfully fence. I'll 
> test by manually logging in to one device and tell luci to fence a node. 
> With my manual login getting in the way, the fence operation should fail 
> and turn off no ports and not prevent manual login to other apc, no 
> matter which switch I manually log into. I haven't got a chance to test 
> this yet and I wanted to tap into your wisdom to test my logic.

If you log in to switch #2, the cluster will be able to turn off ports
on switch #1, but the fencing operation will fail.

-- Lon


From lhh at redhat.com  Fri Oct 26 19:14:24 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 26 Oct 2007 15:14:24 -0400
Subject: [Linux-cluster] strange modprobe drops VIP & fails service
In-Reply-To: <AAD74D664EBCB042BEB418AD2368C7F901F6FDC0@ny0ech01.LNHOLDINGS.COM>
References: <AAD74D664EBCB042BEB418AD2368C7F901F6FDC0@ny0ech01.LNHOLDINGS.COM>
Message-ID: <1193426064.9223.38.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-26 at 11:09 -0400, Jeff Liu wrote:
> I've haven't received any replies, so just to bump the question again.
> 
> *bump*
> 
> Thanks in advance.
> 
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jeff Liu
> Sent: Thursday, October 25, 2007 3:14 PM
> To: linux clustering
> Subject: [Linux-cluster] strange modprobe drops VIP & fails service
> 
> 
> I'm experiencing something similar.  I was wondering if there was some
> additional information available regarding the "e1000 + multiple NIC
> problem".
> 
> Thanks in advance,
> Jeff
> 
> 
> On Fri, 2006-02-10 at 07:16 -0500, danwest wrote:
> > Has anyone every seen something like this?  A modprobe with either a
> > "module 0" or a blank "module " is always followed by the interface
> > loosing its VIP and thus initiating a service failure.  All non
> > essential system services were turned off.  Not sure what is
> > calling/causing the strange modprobe.  Logs are clean, no other events
> > leading up to this.  I am now wondering if it could be the cluster
> > itself?
> > 
> > Feb  9 13:44:32 node2 modprobe: modprobe: Can't locate module 0
> > Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: IP
> > address 147.107.188.81 missing
> > Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: 0:
> error
> > fetching interface information: Device not found
> > Feb  9 13:44:32 node2 clusvcmgrd: [18150]: <err> service error: Check
> > status failed on IP addresses for mservice1
> 
> Sounds like the e1000 + multiple NIC problem.
> 
> 

https://bugzilla.redhat.com/show_bug.cgi?id=163636
https://rhn.redhat.com/errata/RHBA-2005-676.html

-- Lon


From rmccabe at redhat.com  Fri Oct 26 19:33:26 2007
From: rmccabe at redhat.com (Ryan McCabe)
Date: Fri, 26 Oct 2007 15:33:26 -0400
Subject: [Linux-cluster] Problem with fence_ilo
In-Reply-To: <472233FD.5030202@cloud9.co.uk>
References: <472233FD.5030202@cloud9.co.uk>
Message-ID: <20071026193326.GA159798@redhat.com>

On Fri, Oct 26, 2007 at 07:37:49PM +0100, James Fidell wrote:
> Can anyone confirm that their fence_ilo script works without
> modification in such a configuration?  I've changed the script to
> send
> 
>   <HOLD_PWR_BTN TOGGLE = "Yes"/>
> 
> and that does work for me, but I don't know if it's valid for all
> v2.22 iLO2 units.

There's a fix in the latest cman package (or fence if you're on RHEL4)
for this. The TOGGLE attribute must be set to yes if the firmware
revision is greater than 1.29. I think setting this attribute on older
firmware revisions will cause the script to break.


Ryan


From scottb at bxwa.com  Fri Oct 26 21:58:08 2007
From: scottb at bxwa.com (Scott Becker)
Date: Fri, 26 Oct 2007 14:58:08 -0700
Subject: [Linux-cluster] Fencing Race Question
In-Reply-To: <1193425839.9223.33.camel@ayanami.boston.devel.redhat.com>
References: <47220604.5070504@bxwa.com>
	<1193425839.9223.33.camel@ayanami.boston.devel.redhat.com>
Message-ID: <472262F0.7050409@bxwa.com>


I think I understand how it works. It's good to know that the loser of 
the first race doesn't immediately try fence device 2. If it's really a 
race then the delay in node 2's retry attempt is necessary for it to be 
killed before it retries. The ssh handshaking when logging into the APC 
does take a few seconds. If I set the delay specifically for the purpose 
of spanning the necessary logins then that should take care of it.

If the logging into all fence devices before any are turned off can't 
easily be done, then the other approach to make it safe would be to 
delay all the log offs until the end of the process.

Thanks for you help, I need to make sure the boss is getting her money's 
worth from this effort.

    scottb


From jos at xos.nl  Fri Oct 26 22:16:51 2007
From: jos at xos.nl (Jos Vos)
Date: Sat, 27 Oct 2007 00:16:51 +0200
Subject: [Linux-cluster] GFS RG size (and tuning)
Message-ID: <200710262216.l9QMGpf5032362@jasmine.xos.nl>

Hi,

The gfs_mkfs manual page (RHEL 5.0) says:

  If not  specified,  gfs_mkfs  will  choose the RG size based on the size
  of the file system: average size file systems will have 256 MB  RGs,
  and bigger file systems will have bigger RGs for better performance.

My 3 TB filesystems still seem to have 256 MB RG's (I don't know how to
see the RG size, but there are 11173 of them, so that seems to indicate
a size of 256 MB).  Is 3 TB considered to be "average size"? ;-)

Anyway, it is recommended trying to rebuild the fs's with "-r 2048" for
3 TB filesystems, each with between 1 and 2 million files on it?
Especially gfs_scand uses *huge* amounts of CPU time and doing df takes
a *very* long time....

Regards,

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From wcheng at redhat.com  Fri Oct 26 23:57:18 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Fri, 26 Oct 2007 19:57:18 -0400
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
Message-ID: <47227EDE.4010901@redhat.com>

Jos Vos wrote:

>Hi,
>
>The gfs_mkfs manual page (RHEL 5.0) says:
>
>  If not  specified,  gfs_mkfs  will  choose the RG size based on the size
>  of the file system: average size file systems will have 256 MB  RGs,
>  and bigger file systems will have bigger RGs for better performance.
>
>My 3 TB filesystems still seem to have 256 MB RG's (I don't know how to
>see the RG size, but there are 11173 of them, so that seems to indicate
>a size of 256 MB).  Is 3 TB considered to be "average size"? ;-)
>
>Anyway, it is recommended trying to rebuild the fs's with "-r 2048" for
>3 TB filesystems, each with between 1 and 2 million files on it?
>Especially gfs_scand uses *huge* amounts of CPU time and doing df takes
>a *very* long time....
>
>
>  
>
1. 3TB is not "average size". Smaller RG can help with "df" command - 
but if your system is congested, it won't help much.
2. The gfs_scand issue is more to do with the number of glock count. One 
way to tune this is via purge_glock tunable. There is an old write-up in:
http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 
. It is for RHEL4 but should work the same way for RHEL5.
3. If you don't need to know the exact disk usage and/or can tolerate 
some delays in disk usage update, there is another tunable 
"statfs_fast". The old write-up (RHEL4) is in: 
http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_fast_statfs.R4 
(and should work the same way as in RHEL 5).

-- Wendy


From jhahm at yahoo.com  Sat Oct 27 00:53:40 2007
From: jhahm at yahoo.com (Jiho Hahm)
Date: Fri, 26 Oct 2007 17:53:40 -0700 (PDT)
Subject: [Linux-cluster] Maximum number of nodes in RHCS 4/5
Message-ID: <778583.12069.qm@web32607.mail.mud.yahoo.com>

Hi, the cluster suite product page on redhat site states the maximum nodes supported is 128 for RHEL5 and 16 on RHEL4.  Is this accurate for a cman/rgmanager cluster?  Thanks,

-Jiho


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From isplist at logicore.net  Sat Oct 27 02:08:37 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 26 Oct 2007 21:08:37 -0500
Subject: [Linux-cluster] Howto? Remove volumes
In-Reply-To: <47220B7B.9020901@redhat.com>
Message-ID: <2007102621837.073552@leena>

Anyone have any additional thoughts on this? It's a left over volume after I 
zoned everything off.

Mike

---
SCSI subsystem initialized
QLogic Fibre Channel HBA Driver
qla2200 0000:00:11.0: Found an ISP2200, irq 11, iobase 0xe0816000
qla2200 0000:00:11.0: Configuring PCI space...
qla2200 0000:00:11.0: Configure NVRAM parameters...
qla2200 0000:00:11.0: Verifying loaded RISC code...
qla2200 0000:00:11.0: LIP reset occured (0).
qla2200 0000:00:11.0: Waiting for LIP to complete...
qla2200 0000:00:11.0: LOOP UP detected (1 Gbps).
qla2200 0000:00:11.0: Topology - (F_Port), Host Loop address 0xffff
scsi0 : qla2xxx
qla2200 0000:00:11.0:
 QLogic Fibre Channel HBA Driver: 8.01.04-d7
  QLogic QLA22xx -
  ISP2200: PCI (33 MHz) @ 0000:00:11.0 hdma-, host#=0, fw=2.02.08 TP
  Vendor: MYLEX     Model: DACARMRB PSEUDO   Rev: 7775
  Type:   Direct-Access                      ANSI SCSI revision: 02
qla2200 0000:00:11.0: scsi(0:0:0:0): Enabled tagged queuing, queue depth 16.
sda: Unit Not Ready, sense:
Current : sense key Illegal Request
Additional sense: Logical unit not supported
sda : READ CAPACITY failed.
sda : status=1, message=00, host=0, driver=08
Current sd: sense key Illegal Request
Additional sense: Logical unit not supported
sda: asking for cache data failed
sda: assuming drive cache: write through
sda: Unit Not Ready, sense:
Current : sense key Illegal Request
Additional sense: Logical unit not supported
sda : READ CAPACITY failed.
sda : status=1, message=00, host=0, driver=08
Current sd: sense key Illegal Request
Additional sense: Logical unit not supported
sda: asking for cache data failed
sda: assuming drive cache: write through
 sda:end_request: I/O error, dev sda, sector 0
Buffer I/O error on device sda, logical block 0
end_request: I/O error, dev sda, sector 0
Buffer I/O error on device sda, logical block 0
end_request: I/O error, dev sda, sector 0
Buffer I/O error on device sda, logical block 0
 unable to read partition table
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
  Vendor: MYLEX     Model: DACARMRB          Rev: 7775
  Type:   Direct-Access                      ANSI SCSI revision: 02
qla2200 0000:00:11.0: scsi(0:0:0:2): Enabled tagged queuing, queue depth 16.
SCSI device sdb: 997449728 512-byte hdwr sectors (510694 MB)
SCSI device sdb: drive cache: write back
SCSI device sdb: 997449728 512-byte hdwr sectors (510694 MB)
SCSI device sdb: drive cache: write back
 sdb:
Attached scsi disk sdb at scsi0, channel 0, id 0, lun 2


From jos at xos.nl  Sat Oct 27 12:57:26 2007
From: jos at xos.nl (Jos Vos)
Date: Sat, 27 Oct 2007 14:57:26 +0200
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <47227EDE.4010901@redhat.com>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>
Message-ID: <20071027125726.GA5193@jasmine.xos.nl>

On Fri, Oct 26, 2007 at 07:57:18PM -0400, Wendy Cheng wrote:

> 1. 3TB is not "average size". Smaller RG can help with "df" command - 
> but if your system is congested, it won't help much.

The df also takes ages on an almost idle system.  Also, the system often
needs to do rsyncs on large trees and this takes a very long time too.

In <http://sourceware.org/cluster/faq.html#gfs_tuning> it is suggested
that you should then make the RG larger (i.e. less RGs).  As this requires
shuffling aroung with TB's of data before recreating a GFS fs, I want to
have some idea of what my chances are that this is usefull.

> 2. The gfs_scand issue is more to do with the number of glock count. One 
> way to tune this is via purge_glock tunable. There is an old write-up in:
> http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 
> . It is for RHEL4 but should work the same way for RHEL5.

I'll try.  I assume I can do this per system (so that I don't have to
bring the whole cluster down, only stop the cluster services and unmount
the GFS volumes per node)?

Any chance this patch will make it into the standard RHEL-package?
I want to avoid to maintain my own patched packages, although as long
as gfs.ko is in the separate kmod-gfs package that's doable.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From josh at jtri.com  Sat Oct 27 18:15:33 2007
From: josh at jtri.com (Josh Gray)
Date: Sat, 27 Oct 2007 14:15:33 -0400
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <20071027125726.GA5193@jasmine.xos.nl>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>
	<20071027125726.GA5193@jasmine.xos.nl>
Message-ID: <835D9E0B-5FDB-4512-B08E-DAAB068448C6@jtri.com>

I had the same problems this week on GFS only about 800gig of mostly  
small files.  My application did not require it to be mounted on the  
whole cluster concurrently so i went ahead and switched to EXT3 with  
much better performance.

Support said i would probably expect significant improvement with  
more RG's but we went with the other file format before we tried that.

Josh


On Oct 27, 2007, at 8:57 AM, Jos Vos wrote:

> On Fri, Oct 26, 2007 at 07:57:18PM -0400, Wendy Cheng wrote:
>
>> 1. 3TB is not "average size". Smaller RG can help with "df" command -
>> but if your system is congested, it won't help much.
>
> The df also takes ages on an almost idle system.  Also, the system  
> often
> needs to do rsyncs on large trees and this takes a very long time too.
>
> In <http://sourceware.org/cluster/faq.html#gfs_tuning> it is suggested
> that you should then make the RG larger (i.e. less RGs).  As this  
> requires
> shuffling aroung with TB's of data before recreating a GFS fs, I  
> want to
> have some idea of what my chances are that this is usefull.
>
>> 2. The gfs_scand issue is more to do with the number of glock  
>> count. One
>> way to tune this is via purge_glock tunable. There is an old write- 
>> up in:
>> http://people.redhat.com/wcheng/Patches/GFS/ 
>> readme.gfs_glock_trimming.R4
>> . It is for RHEL4 but should work the same way for RHEL5.
>
> I'll try.  I assume I can do this per system (so that I don't have to
> bring the whole cluster down, only stop the cluster services and  
> unmount
> the GFS volumes per node)?
>
> Any chance this patch will make it into the standard RHEL-package?
> I want to avoid to maintain my own patched packages, although as long
> as gfs.ko is in the separate kmod-gfs package that's doable.
>
> -- 
> --    Jos Vos <jos at xos.nl>
> --    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
> --    Amsterdam, The Netherlands        |     Fax: +31 20 6948204
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From jos at xos.nl  Sat Oct 27 18:21:57 2007
From: jos at xos.nl (Jos Vos)
Date: Sat, 27 Oct 2007 20:21:57 +0200
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <835D9E0B-5FDB-4512-B08E-DAAB068448C6@jtri.com>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>
	<20071027125726.GA5193@jasmine.xos.nl>
	<835D9E0B-5FDB-4512-B08E-DAAB068448C6@jtri.com>
Message-ID: <20071027182157.GA7270@jasmine.xos.nl>

On Sat, Oct 27, 2007 at 02:15:33PM -0400, Josh Gray wrote:

> Support said i would probably expect significant improvement with  
> more RG's but we went with the other file format before we tried that.

You mean *less* (and larger) RGs I presume?

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From hlawatschek at atix.de  Sat Oct 27 21:10:54 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Sat, 27 Oct 2007 23:10:54 +0200
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <20071027125726.GA5193@jasmine.xos.nl>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>
	<20071027125726.GA5193@jasmine.xos.nl>
Message-ID: <200710272310.54727.hlawatschek@atix.de>

We did some system profiling and measured the influence of the lkbtbl_size and  
rsbtbl_size values (dlm config parameters) to the cpu usage. We noticed, that 
with large number of dlm locks, these values have a measurable impact to the 
system performance. 
You might want to have a look at 
http://www.open-sharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/profile-data-with-diffrent-table-sizes

Mark

On Saturday 27 October 2007 14:57:26 Jos Vos wrote:
> On Fri, Oct 26, 2007 at 07:57:18PM -0400, Wendy Cheng wrote:
> > 1. 3TB is not "average size". Smaller RG can help with "df" command -
> > but if your system is congested, it won't help much.
>
> The df also takes ages on an almost idle system.  Also, the system often
> needs to do rsyncs on large trees and this takes a very long time too.
>
> In <http://sourceware.org/cluster/faq.html#gfs_tuning> it is suggested
> that you should then make the RG larger (i.e. less RGs).  As this requires
> shuffling aroung with TB's of data before recreating a GFS fs, I want to
> have some idea of what my chances are that this is usefull.
>
> > 2. The gfs_scand issue is more to do with the number of glock count. One
> > way to tune this is via purge_glock tunable. There is an old write-up in:
> > http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4
> > . It is for RHEL4 but should work the same way for RHEL5.
>
> I'll try.  I assume I can do this per system (so that I don't have to
> bring the whole cluster down, only stop the cluster services and unmount
> the GFS volumes per node)?
>
> Any chance this patch will make it into the standard RHEL-package?
> I want to avoid to maintain my own patched packages, although as long
> as gfs.ko is in the separate kmod-gfs package that's doable.
>
> --
> --    Jos Vos <jos at xos.nl>
> --    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
> --    Amsterdam, The Netherlands        |     Fax: +31 20 6948204
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From jgray at nicusa.com  Sat Oct 27 21:13:59 2007
From: jgray at nicusa.com (Josh Gray)
Date: Sat, 27 Oct 2007 17:13:59 -0400
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <20071027182157.GA7270@jasmine.xos.nl>
Message-ID: <C3492257.112429%jgray@nicusa.com>

Yea larger - sorry larger I meant

<snip>
Yes, it is definitely possible that using a bigger resource group may help
with performance.  Since this is not yet in production, I would urge you to
do some testing now while you can and compare results from the default size
of 256 versus a setting of around 2048.
</snip>

On 10/27/07 2:21 PM, "Jos Vos" <jos at xos.nl> wrote:

> On Sat, Oct 27, 2007 at 02:15:33PM -0400, Josh Gray wrote:
> 
>> Support said i would probably expect significant improvement with
>> more RG's but we went with the other file format before we tried that.
> 
> You mean *less* (and larger) RGs I presume?

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From mk at adminlife.net  Sun Oct 28 08:30:40 2007
From: mk at adminlife.net (Matthias Kellermann)
Date: Sun, 28 Oct 2007 09:30:40 +0100
Subject: [Linux-cluster] Limit for subdirectories?
Message-ID: <472448B0.1080800@adminlife.net>

Hi list,

is there a limit on GFS/GFS2 for the number of subdirectories stored
under another directory? For example I know there is a limit on
ext2/ext3 filesystems where you can only store about 32.000
subdirectories in the root of a directory.

Matthias

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 252 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071028/24697aea/attachment.sig>

From andremailinglist at gmail.com  Mon Oct 29 00:20:32 2007
From: andremailinglist at gmail.com (Andrew Hole)
Date: Mon, 29 Oct 2007 00:20:32 +0000
Subject: [Linux-cluster] Two node cluster - howto
Message-ID: <640358900710281720j5c036bb5kf185b19f04e07192@mail.gmail.com>

Hello!

II want to create a very basic cluster of two nodes to configure
failover of a service. This
service is very simple, without shared storage. Basically if the
service fails in a node, I want
to ensure availability on the other node.
I would like to clarify some doubts:
- I need to use quorum disk?
- And about fince device? I need to setup a fince device?
- Exists some example of using Cluster to configure a simple two nodes cluster?

could you help me to define steps to configure this cluster?

Thanks a lot
Andrew


From huangxiong at uit.com.cn  Mon Oct 29 04:03:02 2007
From: huangxiong at uit.com.cn (Huang Xiong)
Date: Mon, 29 Oct 2007 12:03:02 +0800
Subject: [Linux-cluster] dlm_sendd and dlm_sendd running on 100% CPU
Message-ID: <200710291203.02123.huangxiong@uit.com.cn>

Hi Dave,

Have you released this new cluster tarball?
If so, can you please tell me where to download and how to use.


Regards,
Phillip
	

On Tue, Jul 24, 2007 at 09:37:03AM +0200, Sebastian Reitenbach wrote:
> Hi,
> 
> after getting the cluster up and running, and mounting the gfs2 partitions. 
> With one mounted gfs2 partition, the dlm_sendd and dlm_recvd both consume 
> 100% CPU. When I umount the partition, the CPU usage drops to zero.
> 
> While at 100%, there are no message popping up in /var/log/messages.
> Any idea what could be the reason for the CPU usage?
> 
> I use kernel 2.6.20.15 on x86_64 openSUSE 10.2, openais-0.80.1, and 
> cluster-2.00.00.

This is an old dlm bug that was fixed a long time ago.  I'll be releasing
a new cluster tarball shortly for 2.6.23-rc.

Dave


From s.c.graham at gmail.com  Mon Oct 29 10:51:56 2007
From: s.c.graham at gmail.com (s.c.graham at gmail.com)
Date: Mon, 29 Oct 2007 10:51:56 +0000
Subject: [Linux-cluster] clvmd hangs when third node tries to connect to
	cluster
Message-ID: <cdd145f50710290351k30221b3eh6afff88e4e8e9d47@mail.gmail.com>

Hi there,

I have a cluster with three nodes (all clone HL DL380 G4s) attached to
a Fibre SAN (HP MSA1000) and serving a number of GFS filesystems.  My
OS is Ubuntu Dapper (6.06) and my kernel is 2.6.15-29-amd64-server.
These machines have been working nicely for a long time.

On the weekend I "apt-get updated" to the latest version of the Dapper
redhat-cluster-suite package (1.20060222-0ubuntu6.1).  Now, when the
cluster boots the first two nodes to come up are able to see the GFS
filesystem. However, the third node to come up hangs at the point of
starting the clvm service.  Concomitantly, I see the following message
in /var/log/syslog of one of the other machines in the cluster:

Oct 28 14:42:18 machinea kernel: [ 1681.325152] CMAN: node machinec rejoining
Oct 28 14:42:20 machinea kernel: [ 1683.528299] Extra connection from
node 2 attempted

It does not seem to matter which order the nodes come up in - it is
always the third node to boot that will hang when starting clvmd.  I
have included my cluster.conf file below for reference - I can include
any additional diagnostics as required.

Any help would be most appreciated!

Stephen

<?xml version="1.0"?>
<cluster config_version="14" name="alpha_cluster">
       <fence_daemon post_fail_delay="0" post_join_delay="3"/>
       <clusternodes>
               <clusternode name="machineaint" votes="1">
                       <fence>
                               <method name="1">
                                       <device name="machinea_ILO"/>
                               </method>
                       </fence>
               </clusternode>
               <clusternode name="machinebint" votes="1">
                       <fence>
                               <method name="1">
                                       <device name="machineb_ILO"/>
                               </method>
                       </fence>
               </clusternode>
               <clusternode name="machinecint" votes="1">
                       <fence>
                               <method name="1">
                                       <device name="machinec_ILO"/>
                               </method>
                       </fence>
               </clusternode>
       </clusternodes>
       <cman/>
       <fencedevices>
               <fencedevice agent="fence_ilo"
hostname="192.168.81.200" login="Login" name="machinea_ILO"
passwd="Passwd"/>
               <fencedevice agent="fence_ilo"
hostname="192.168.81.199" login="Login" name="machineb_ILO"
passwd="Passwd"/>
               <fencedevice agent="fence_ilo"
hostname="192.168.81.197" login="Login" name="machinec_ILO"
passwd="Passwd"/>
       </fencedevices>
       <rm>
               <failoverdomains>
                       <failoverdomain name="fileservers" ordered="0"
restricted="0">
                               <failoverdomainnode name="machineaint"
priority="1"/>
                               <failoverdomainnode name="machinebint"
priority="1"/>
                               <failoverdomainnode name="machinecint"
priority="1"/>
                       </failoverdomain>
                       <failoverdomain name="backupers" ordered="0"
restricted="1">
                               <failoverdomainnode name="machineaint"
priority="1"/>
                               <failoverdomainnode name="machinebint"
priority="1"/>
                       </failoverdomain>
               </failoverdomains>
               <resources>
                       <ip address="192.168.81.98" monitor_link="1"/>
               </resources>
               <service autostart="1" domain="fileservers"
exclusive="1" name="fileserver_ip">
                       <ip ref="192.168.81.98"/>
               </service>
               <service autostart="1" domain="backupers" name="backups">
                       <script file="/etc/init.d/dsmcad-init"
name="TSM backup script"/>
               </service>
       </rm>
</cluster>


From teigland at redhat.com  Mon Oct 29 14:04:01 2007
From: teigland at redhat.com (David Teigland)
Date: Mon, 29 Oct 2007 09:04:01 -0500
Subject: [Linux-cluster] dlm_sendd and dlm_sendd running on 100% CPU
In-Reply-To: <200710291203.02123.huangxiong@uit.com.cn>
References: <200710291203.02123.huangxiong@uit.com.cn>
Message-ID: <20071029140401.GA22472@redhat.com>

On Mon, Oct 29, 2007 at 12:03:02PM +0800, Huang Xiong wrote:
> Hi Dave,
> 
> Have you released this new cluster tarball?
> If so, can you please tell me where to download and how to use.

https://www.redhat.com/archives/linux-cluster/2007-July/msg00268.html

Dave


From lhh at redhat.com  Mon Oct 29 14:46:12 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 29 Oct 2007 10:46:12 -0400
Subject: [Linux-cluster] Two node cluster - howto
In-Reply-To: <640358900710281720j5c036bb5kf185b19f04e07192@mail.gmail.com>
References: <640358900710281720j5c036bb5kf185b19f04e07192@mail.gmail.com>
Message-ID: <1193669172.9223.55.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-29 at 00:20 +0000, Andrew Hole wrote:
> Hello!
> 
> II want to create a very basic cluster of two nodes to configure
> failover of a service. This
> service is very simple, without shared storage. Basically if the
> service fails in a node, I want
> to ensure availability on the other node.
> I would like to clarify some doubts:
> - I need to use quorum disk?

If you have no shared storage, then a quorum disk is rather
impossible ;)  (You don't need one.)

> - And about fince device? I need to setup a fince device?

Under most circumstances, you do not need a fence device if you do not
have any shared storage, but it can be helpful.  Fencing exists to
prevent data corruption on shared media.

ex: node A goes to sleep with a write lock held...
    node B thinks node A is dead, takes over, and acquires the same lock
a-> node A wakes up still thinking its lock is valid
b-> node A continues writing
    (doom)

Power fencing prevents (a) (and therefore (b))
SAN-level fencing prevents (b)

> - Exists some example of using Cluster to configure a simple two nodes cluster?

http://sources.redhat.com/cluster/faq.html 

http://sources.redhat.com/cluster/doc/nfscookbook.pdf

-- Lon


From lhh at redhat.com  Mon Oct 29 14:56:50 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 29 Oct 2007 10:56:50 -0400
Subject: [Linux-cluster] How to take down a CS/GFS setup with minimum
	downtime
In-Reply-To: <1193412589.5963.2.camel@addi.hugsmidjan.is>
References: <1193412589.5963.2.camel@addi.hugsmidjan.is>
Message-ID: <1193669810.9223.63.camel@ayanami.boston.devel.redhat.com>

On Fri, 2007-10-26 at 15:29 +0000, S?valdur Arnar Gunnarsson wrote:
> I've got five RHEL4 systems with CS and about 800 GB of data on a shared
> GFS filesystem.
> I've been tasked to take down the cluster and divide the content of the
> shared GFS filesystem to the local disks on each system with minimum
> downtime.
> 
> I've removed two nodes from the cluster already and am somewhat scared
> of a quorum problem if I remove another node.
> 
> From what I've been able to gather I should use cman_tool leave remove
> on a node once it is ready to leave the cluster and thus be able to
> remove four nodes from a five node cluster without disolving the quorum
> or risking losing access to the GFS data on the last remaining node.
> 
> Is that correct ?

That should do it, yes.  Leave remove is supposed to decrement the
quorum count, meaning you can go from 5..1 nodes if done correctly.  You
can verify that the expected votes count decreases with each removal
using 'cman_tool status'.


If for some reason the above doesn't work, the alternative looks
something like this:
  * unmount the GFS volume + stop cluster on all nodes
  * use gfs_tool to alter the lock proto to nolock
  * mount on node 1.  copy out data.  unmount!
  * mount on node 2.  copy out data.  unmount!
  * ...
  * mount on node 5.  copy out data.  unmount!

-- Lon


From jgray at nicusa.com  Mon Oct 29 15:02:57 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 29 Oct 2007 11:02:57 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <1193669810.9223.63.camel@ayanami.boston.devel.redhat.com>
Message-ID: <C34B6E61.11348C%jgray@nicusa.com>

Lon and the other gurus - have you guys ever seen an ext3 volume get mounted
on multiple cluster nodes at the same time WITHOUT a split brain? No
fencing, no errors logged, no network issues, etc..   I even ran clustat and
both nodes (let's say B and C ) even said that the single node I thought was
up was in control (B).

I saw this with virtual ip's in my testing but then encountered this during
deployment that then quickly corrupted the volume and brought the project to
a halt..

No worries if not - I have support involved already,  just curious your
input as you guys seem to know this stuff inside and out!


-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From gordan at bobich.net  Mon Oct 29 15:04:21 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Mon, 29 Oct 2007 15:04:21 +0000 (GMT)
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <C34B6E61.11348C%jgray@nicusa.com>
References: <C34B6E61.11348C%jgray@nicusa.com>
Message-ID: <Pine.LNX.4.64.0710291502440.6486@skynet.shatteredsilicon.net>

On Mon, 29 Oct 2007, Josh Gray wrote:

> Lon and the other gurus - have you guys ever seen an ext3 volume get mounted
> on multiple cluster nodes at the same time WITHOUT a split brain? No
> fencing, no errors logged, no network issues, etc..   I even ran clustat and
> both nodes (let's say B and C ) even said that the single node I thought was
> up was in control (B).
>
> I saw this with virtual ip's in my testing but then encountered this during
> deployment that then quickly corrupted the volume and brought the project to
> a halt..
>
> No worries if not - I have support involved already,  just curious your
> input as you guys seem to know this stuff inside and out!

Sure, that works. The only problem is that if both nodes write to the same 
files at the same time (including meta data), you'll end up with a 
corrupted file system. But if both machines are mounting the FS read-only 
(in which case you might as well use ext2), then there's no problem with 
that.

Gordan


From breeves at redhat.com  Mon Oct 29 15:20:15 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Mon, 29 Oct 2007 15:20:15 +0000
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <Pine.LNX.4.64.0710291502440.6486@skynet.shatteredsilicon.net>
References: <C34B6E61.11348C%jgray@nicusa.com>
	<Pine.LNX.4.64.0710291502440.6486@skynet.shatteredsilicon.net>
Message-ID: <4725FA2F.2070303@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

gordan at bobich.net wrote:
> Sure, that works. The only problem is that if both nodes write to the
> same files at the same time (including meta data), you'll end up with a
> corrupted file system. But if both machines are mounting the FS
> read-only (in which case you might as well use ext2), then there's no
> problem with that.

Are you sure about that? I've never known it do anything useful beyond
spectacularly breaking the file systems involved.

This is especially true for ext3 due to its journaling capability - as
soon as the two hosts start squabbling over the same journal block (i.e.
they detect an inconsistency caused by the other host overwriting
something) they're both going to abort the journal via an ext3_abort()
call and take the file system readonly.

In my experience, this tends to happen pretty quickly - the moment the
second host begins replaying the incomplete journal entries the first
host has outstanding.

Also, if mounting an ext3 volume read only you might want to consider
making the underlying block device read only too - this prevents any
problems with a host accidentally attempting journal recovery.

Regards,
Bryn.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHJfov6YSQoMYUY94RAlTAAKDHOjaWg81rd6ZkoBL4rCUGS5xZPACdHWlo
Q0kzUfTQYQ7wIgIlskITv14=
=s2TT
-----END PGP SIGNATURE-----


From gordan at bobich.net  Mon Oct 29 15:25:03 2007
From: gordan at bobich.net (gordan at bobich.net)
Date: Mon, 29 Oct 2007 15:25:03 +0000 (GMT)
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <4725FA2F.2070303@redhat.com>
References: <C34B6E61.11348C%jgray@nicusa.com>
	<Pine.LNX.4.64.0710291502440.6486@skynet.shatteredsilicon.net>
	<4725FA2F.2070303@redhat.com>
Message-ID: <Pine.LNX.4.64.0710291521340.6486@skynet.shatteredsilicon.net>

On Mon, 29 Oct 2007, Bryn M. Reeves wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> gordan at bobich.net wrote:
>> Sure, that works. The only problem is that if both nodes write to the
>> same files at the same time (including meta data), you'll end up with a
>> corrupted file system. But if both machines are mounting the FS
>> read-only (in which case you might as well use ext2), then there's no
>> problem with that.
>
> Are you sure about that? I've never known it do anything useful beyond
> spectacularly breaking the file systems involved.

I can't see why it would. If the fs is mounted ro, no harm will come to 
it. Any writing, though, and you're likely to trash it faster than you can 
type mount -o ro,remount. :-)

> This is especially true for ext3 due to its journaling capability - as
> soon as the two hosts start squabbling over the same journal block (i.e.
> they detect an inconsistency caused by the other host overwriting
> something) they're both going to abort the journal via an ext3_abort()
> call and take the file system readonly.

As I said, any writes (including meta-data - which includes journals), and 
the fs will be destroyed pretty quickly.

> In my experience, this tends to happen pretty quickly - the moment the
> second host begins replaying the incomplete journal entries the first
> host has outstanding.

Yup, that sounds about right. :-)

> Also, if mounting an ext3 volume read only you might want to consider
> making the underlying block device read only too - this prevents any
> problems with a host accidentally attempting journal recovery.

Sure - you're probably better of mounting it as ext2, which I also 
mentioned.

Gordan


From jgray at nicusa.com  Mon Oct 29 15:31:47 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 29 Oct 2007 11:31:47 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <4725FA2F.2070303@redhat.com>
Message-ID: <C34B7523.1134B4%jgray@nicusa.com>

I can tell you for a fact that is what happened unfortunately.    Two nodes
of 3 became active and were quite content to serve my NFS traffic on
multiple VIPs and write to the same EXT3 volume for about 20 or so minutes
until it went read only.

Most frustrating part of the whole deal was restarting the cluster would
just hang on 'starting'  with NO error message logged about what it was
trying to do.  In hindsight it was probably fscking as that's what I had to
do manually to get it back up after a ton of trial and error.  I even raised
the verbosity of the log output to no avail.

Josh


On 10/29/07 11:20 AM, "Bryn M. Reeves" <breeves at redhat.com> wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> gordan at bobich.net wrote:
>> Sure, that works. The only problem is that if both nodes write to the
>> same files at the same time (including meta data), you'll end up with a
>> corrupted file system. But if both machines are mounting the FS
>> read-only (in which case you might as well use ext2), then there's no
>> problem with that.
> 
> Are you sure about that? I've never known it do anything useful beyond
> spectacularly breaking the file systems involved.
> 
> This is especially true for ext3 due to its journaling capability - as
> soon as the two hosts start squabbling over the same journal block (i.e.
> they detect an inconsistency caused by the other host overwriting
> something) they're both going to abort the journal via an ext3_abort()
> call and take the file system readonly.
> 
> In my experience, this tends to happen pretty quickly - the moment the
> second host begins replaying the incomplete journal entries the first
> host has outstanding.
> 
> Also, if mounting an ext3 volume read only you might want to consider
> making the underlying block device read only too - this prevents any
> problems with a host accidentally attempting journal recovery.
> 
> Regards,
> Bryn.
> 
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.7 (GNU/Linux)
> Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
> 
> iD8DBQFHJfov6YSQoMYUY94RAlTAAKDHOjaWg81rd6ZkoBL4rCUGS5xZPACdHWlo
> Q0kzUfTQYQ7wIgIlskITv14=
> =s2TT
> -----END PGP SIGNATURE-----
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From breeves at redhat.com  Mon Oct 29 15:47:04 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Mon, 29 Oct 2007 15:47:04 +0000
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <Pine.LNX.4.64.0710291521340.6486@skynet.shatteredsilicon.net>
References: <C34B6E61.11348C%jgray@nicusa.com>	<Pine.LNX.4.64.0710291502440.6486@skynet.shatteredsilicon.net>	<4725FA2F.2070303@redhat.com>
	<Pine.LNX.4.64.0710291521340.6486@skynet.shatteredsilicon.net>
Message-ID: <47260078.7020203@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

gordan at bobich.net wrote:
> On Mon, 29 Oct 2007, Bryn M. Reeves wrote:
> 
>> -----BEGIN PGP SIGNED MESSAGE-----
>> Hash: SHA1
>>
>> gordan at bobich.net wrote:
>>> Sure, that works. The only problem is that if both nodes write to the
>>> same files at the same time (including meta data), you'll end up with a
>>> corrupted file system. But if both machines are mounting the FS
>>> read-only (in which case you might as well use ext2), then there's no
>>> problem with that.
>>
>> Are you sure about that? I've never known it do anything useful beyond
>> spectacularly breaking the file systems involved.
> 
> I can't see why it would. If the fs is mounted ro, no harm will come to
> it. Any writing, though, and you're likely to trash it faster than you
> can type mount -o ro,remount. :-)

The "are you sure about that?" was at the idea that you could have the
fs mounted writably and not run into problems unless you tried to write
to the same files (or their metadata) on the two hosts simultaneously.
Your original message seemed to imply that this would work.

Regards,
Bryn.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHJgB46YSQoMYUY94RAvNPAJ9wpEn96/WWanaKAECAehr2DcLmbACeOdEM
6bmhkBRPLvWbK5Vz97AnSs0=
=73As
-----END PGP SIGNATURE-----


From jgray at nicusa.com  Mon Oct 29 15:54:38 2007
From: jgray at nicusa.com (Josh Gray)
Date: Mon, 29 Oct 2007 11:54:38 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <47260078.7020203@redhat.com>
Message-ID: <C34B7A7E.1134D3%jgray@nicusa.com>

Heh oh yea I understand all of that about EXT3..  It's in the cluster
configuration, not mounting thru fstab.

Josh


On 10/29/07 11:47 AM, "Bryn M. Reeves" <breeves at redhat.com> wrote:

>>> Are you sure about that? I've never known it do anything useful beyond
>>> spectacularly breaking the file systems involved.
>> 
>> I can't see why it would. If the fs is mounted ro, no harm will come to
>> it. Any writing, though, and you're likely to trash it faster than you
>> can type mount -o ro,remount. :-)
> 
> The "are you sure about that?" was at the idea that you could have the
> fs mounted writably and not run into problems unless you tried to write
> to the same files (or their metadata) on the two hosts simultaneously.
> Your original message seemed to imply that this would work.
> 
> Regards,
> Bryn.
> 
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.7 (GNU/Linux)
> Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
> 
> iD8DBQFHJgB46YSQoMYUY94RAvNPAJ9wpEn96/WWanaKAECAehr2DcLmbACeOdEM
> 6bmhkBRPLvWbK5Vz97AnSs0=
> =73As
> -----END PGP SIGNATURE-----
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From scottb at bxwa.com  Mon Oct 29 16:48:31 2007
From: scottb at bxwa.com (Scott Becker)
Date: Mon, 29 Oct 2007 09:48:31 -0700
Subject: [Linux-cluster] Two node question
Message-ID: <47260EDF.8010301@bxwa.com>

I'm building a 3-node cluster.

One public subnet.

No shared filesystem

Starting with just one service (www) on one movable IP address.


I will be routinely bring down one node at a time for maintenance so I 
must configure the cluster for one node operation. I can't use the 
two_node="1" setting because I would have to bring the cluster down to 
set and unset it.

I want to use an IP tie-breaker to verify that the node can reach the 
main router. Is this, along with a self test of the web service, fs 
writable, etc. adequate for one, two and three node operation? Or would 
I also need a Qdisk?


    thanks
    scottb


From josh at jtri.com  Mon Oct 29 17:27:52 2007
From: josh at jtri.com (Josh Gray)
Date: Mon, 29 Oct 2007 13:27:52 -0400
Subject: [Linux-cluster] Two node question
In-Reply-To: <47260EDF.8010301@bxwa.com>
References: <47260EDF.8010301@bxwa.com>
Message-ID: <2C976BF2-EC5A-4424-A9D1-A80D00F2DDE3@jtri.com>

 From what i've found,  the safest to do would be modify the number  
of votes for the node you want to stay up to be a really high  
number.  I thought about making that my "maintenance mode" situation.

Josh


On Oct 29, 2007, at 12:48 PM, Scott Becker wrote:

> I'm building a 3-node cluster.
>
> One public subnet.
>
> No shared filesystem
>
> Starting with just one service (www) on one movable IP address.
>
>
> I will be routinely bring down one node at a time for maintenance  
> so I must configure the cluster for one node operation. I can't use  
> the two_node="1" setting because I would have to bring the cluster  
> down to set and unset it.
>
> I want to use an IP tie-breaker to verify that the node can reach  
> the main router. Is this, along with a self test of the web  
> service, fs writable, etc. adequate for one, two and three node  
> operation? Or would I also need a Qdisk?
>
>
>    thanks
>    scottb
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From scottb at bxwa.com  Mon Oct 29 17:59:45 2007
From: scottb at bxwa.com (Scott Becker)
Date: Mon, 29 Oct 2007 10:59:45 -0700
Subject: [Linux-cluster] Two node question
In-Reply-To: <2C976BF2-EC5A-4424-A9D1-A80D00F2DDE3@jtri.com>
References: <47260EDF.8010301@bxwa.com>
	<2C976BF2-EC5A-4424-A9D1-A80D00F2DDE3@jtri.com>
Message-ID: <47261F91.4030609@bxwa.com>

I don't have a favorite node, I just want to have failover protection 
when it's running on two nodes because I've removed the third for 
updates and fs checks.

I need the group's wisdom to make sure that's it's likely that the 
cluster will continue to operate down to one functioning node without 
the two_node="1" switch.

If the IP tie-breaker counts for one vote, will the good node maintain 
quorum? Will luci adjust the votes needed properly for this when I make 
the third node leave? If one node fails and a second node fails before I 
fix the first, will quorum be maintained in the last node standing 
(without two_node="1")? I'm not sure what all two_node changes about voting.

    thanks
    scottb


Josh Gray wrote:
> From what i've found,  the safest to do would be modify the number of 
> votes for the node you want to stay up to be a really high number.  I 
> thought about making that my "maintenance mode" situation.
>
> Josh
>
>
> On Oct 29, 2007, at 12:48 PM, Scott Becker wrote:
>
>> I'm building a 3-node cluster.
>>
>> One public subnet.
>>
>> No shared filesystem
>>
>> Starting with just one service (www) on one movable IP address.
>>
>>
>> I will be routinely bring down one node at a time for maintenance so 
>> I must configure the cluster for one node operation. I can't use the 
>> two_node="1" setting because I would have to bring the cluster down 
>> to set and unset it.
>>
>> I want to use an IP tie-breaker to verify that the node can reach the 
>> main router. Is this, along with a self test of the web service, fs 
>> writable, etc. adequate for one, two and three node operation? Or 
>> would I also need a Qdisk?
>>
>>
>>    thanks
>>    scottb
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From garromo at us.ibm.com  Mon Oct 29 19:00:17 2007
From: garromo at us.ibm.com (Gary Romo)
Date: Mon, 29 Oct 2007 13:00:17 -0600
Subject: [Linux-cluster] How to take down a CS/GFS setup with
	minimum	downtime
In-Reply-To: <1193412589.5963.2.camel@addi.hugsmidjan.is>
Message-ID: <OF78482FBA.45F11445-ON87257383.0066D56A-87257383.00684C45@us.ibm.com>

If exactly half or more of the expected votes disappear, a quorum no 
longer exists (except in the two-node special case).

The voting values cam be seen with a 

# cman_tool status

If there is quorum, and exit code of 0 (zero) should be returned to the 
shell wht the clustat -Q command (which produces no output) is run;

# clustat -Q
# echo $?

Gary Romo
IBM Global Technology Services
303.458.4415
Email: garromo at us.ibm.com
Pager:1.877.552.9264
Text message: gromo at skytel.com


S?valdur Arnar Gunnarsson <addi at hugsmidjan.is> 
Sent by: linux-cluster-bounces at redhat.com
10/26/2007 09:29 AM
Please respond to
linux clustering <linux-cluster at redhat.com>


To
linux-cluster at redhat.com
cc

Subject
[Linux-cluster] How to take down a CS/GFS setup with minimum    downtime


I've got five RHEL4 systems with CS and about 800 GB of data on a shared
GFS filesystem.
I've been tasked to take down the cluster and divide the content of the
shared GFS filesystem to the local disks on each system with minimum
downtime.

I've removed two nodes from the cluster already and am somewhat scared
of a quorum problem if I remove another node.

>From what I've been able to gather I should use cman_tool leave remove
on a node once it is ready to leave the cluster and thus be able to
remove four nodes from a five node cluster without disolving the quorum
or risking losing access to the GFS data on the last remaining node.

Is that correct ?

With best regards,
-- 
Saevaldur Arnar Gunnarsson
System Administrator | RHCE

Hugsmidjan ehf.
Snorrabraut 56 | 105 Reykjavik
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071029/b286f8dc/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/octet-stream
Size: 4581 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071029/b286f8dc/attachment.obj>

From DRand at amnesty.org  Mon Oct 29 19:26:44 2007
From: DRand at amnesty.org (DRand at amnesty.org)
Date: Mon, 29 Oct 2007 19:26:44 +0000
Subject: [Linux-cluster] GFS cluster freezing after a few hours..
Message-ID: <OF6CA5A4C5.5B1AB45E-ON80257383.0069AA14-80257383.006AC6DC@amnesty.org>

Hi,

We've just setup a three node GFS cluster on Debian Etch using qlogic HBA 
against a SAN.

gfs_tool 1.03.00 (built Mar  8 2007 23:38:09)
Copyright (C) Red Hat, Inc.  2004-2005  All rights reserved.

Linux cms2 2.6.18-5-amd64 #1 SMP Tue Oct 2 20:37:02 UTC 2007 x86_64 
GNU/Linux

We start the cluster and it works fine for a while..

/sbin/lock_gulmd -n aicluster -s cms1,cms2,cmsqa
sleep 1
/bin/mount -t gfs -o acl /dev/sda /san

But eventually after hours or a day something freezes/hangs and we can't 
issue any commands like df/ls/du etc..

There is no evidence that anything is wrong though.. This command seems to 
show a working cluster right?

cmsqa:/home/alfresco# gulm_tool nodelist cms1
 Name: cms2
  ip    = ::ffff:192.168.1.139
  state = Logged in
  last state = Logged out
  mode = Slave
  missed beats = 0
  last beat = 1193685839882270
  delay avg = 10003803
  max delay = 755383848
 
 Name: cmsqa
  ip    = ::ffff:128.1.32.134
  state = Logged in
  last state = Logged out
  mode = Slave
  missed beats = 0
  last beat = 1193685841974801
  delay avg = 10003928
  max delay = 138560844
 
 Name: cms1
  ip    = ::ffff:192.168.1.137
  state = Logged in
  last state = Was Logged in
  mode = Master
  missed beats = 0
  last beat = 1193685842490217
  delay avg = 10003231
  max delay = 10007256


Any ideas? We need to reboot the boxes to get the cluster back.

Damon.
Working to protect human rights worldwide

DISCLAIMER
Internet communications are not secure and therefore Amnesty International Ltd does not accept legal responsibility for the contents of this message. If you are not the intended recipient you must not disclose or rely on the information in this e-mail. Any views or opinions presented are solely those of the author and do not necessarily represent those of Amnesty International Ltd unless specifically stated. Electronic communications including email might be monitored by Amnesty International Ltd. for operational or business reasons.

This message has been scanned for viruses by Postini.
www.postini.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071029/246763e3/attachment.htm>

From doseyg at r-networks.net  Tue Oct 30 00:53:10 2007
From: doseyg at r-networks.net (Glen Dosey)
Date: Mon, 29 Oct 2007 20:53:10 -0400
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <20071027125726.GA5193@jasmine.xos.nl>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>  <20071027125726.GA5193@jasmine.xos.nl>
Message-ID: <1193705590.14247.59.camel@eclipse.office.r-networks.net>

We have several 3.3TB and larger filesystems on GFS1. I found that using
the larger RG helps with performance. Increasing the statfs_slots seemed
to help with df time and similar tasks as well. We have a filesystem
with smaller RG's and I don't think the difference is worth moving the
data around. If there is some other reason to make changes then I will
increase the RG size. The statfs_slots can be increased on the fly and
you should hopefully see an improvement. 

Going from RHEL4 to RHEL5 provided measurable benefits for us. You may
also want to look at the option on your disk to see if you can increase
the read ahead and similar underlying performance tunables.

My understanding, and someone correct me if I am wrong, is that GFS2
should resolve the df type issues as well as provide better performance.
I tested it briefly on a 4TB volume and it clearly performs better.
We're just waiting for it to be 'ready'.


On Sat, 2007-10-27 at 14:57 +0200, Jos Vos wrote:
> On Fri, Oct 26, 2007 at 07:57:18PM -0400, Wendy Cheng wrote:
> 
> > 1. 3TB is not "average size". Smaller RG can help with "df" command - 
> > but if your system is congested, it won't help much.
> 
> The df also takes ages on an almost idle system.  Also, the system often
> needs to do rsyncs on large trees and this takes a very long time too.
> 
> In <http://sourceware.org/cluster/faq.html#gfs_tuning> it is suggested
> that you should then make the RG larger (i.e. less RGs).  As this requires
> shuffling aroung with TB's of data before recreating a GFS fs, I want to
> have some idea of what my chances are that this is usefull.
> 
> > 2. The gfs_scand issue is more to do with the number of glock count. One 
> > way to tune this is via purge_glock tunable. There is an old write-up in:
> > http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 
> > . It is for RHEL4 but should work the same way for RHEL5.
> 
> I'll try.  I assume I can do this per system (so that I don't have to
> bring the whole cluster down, only stop the cluster services and unmount
> the GFS volumes per node)?
> 
> Any chance this patch will make it into the standard RHEL-package?
> I want to avoid to maintain my own patched packages, although as long
> as gfs.ko is in the separate kmod-gfs package that's doable.
> 


From doseyg at r-networks.net  Tue Oct 30 00:53:10 2007
From: doseyg at r-networks.net (Glen Dosey)
Date: Mon, 29 Oct 2007 20:53:10 -0400
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <20071027125726.GA5193@jasmine.xos.nl>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>  <20071027125726.GA5193@jasmine.xos.nl>
Message-ID: <1193705590.14247.59.camel@eclipse.office.r-networks.net>

We have several 3.3TB and larger filesystems on GFS1. I found that using
the larger RG helps with performance. Increasing the statfs_slots seemed
to help with df time and similar tasks as well. We have a filesystem
with smaller RG's and I don't think the difference is worth moving the
data around. If there is some other reason to make changes then I will
increase the RG size. The statfs_slots can be increased on the fly and
you should hopefully see an improvement. 

Going from RHEL4 to RHEL5 provided measurable benefits for us. You may
also want to look at the option on your disk to see if you can increase
the read ahead and similar underlying performance tunables.

My understanding, and someone correct me if I am wrong, is that GFS2
should resolve the df type issues as well as provide better performance.
I tested it briefly on a 4TB volume and it clearly performs better.
We're just waiting for it to be 'ready'.


On Sat, 2007-10-27 at 14:57 +0200, Jos Vos wrote:
> On Fri, Oct 26, 2007 at 07:57:18PM -0400, Wendy Cheng wrote:
> 
> > 1. 3TB is not "average size". Smaller RG can help with "df" command - 
> > but if your system is congested, it won't help much.
> 
> The df also takes ages on an almost idle system.  Also, the system often
> needs to do rsyncs on large trees and this takes a very long time too.
> 
> In <http://sourceware.org/cluster/faq.html#gfs_tuning> it is suggested
> that you should then make the RG larger (i.e. less RGs).  As this requires
> shuffling aroung with TB's of data before recreating a GFS fs, I want to
> have some idea of what my chances are that this is usefull.
> 
> > 2. The gfs_scand issue is more to do with the number of glock count. One 
> > way to tune this is via purge_glock tunable. There is an old write-up in:
> > http://people.redhat.com/wcheng/Patches/GFS/readme.gfs_glock_trimming.R4 
> > . It is for RHEL4 but should work the same way for RHEL5.
> 
> I'll try.  I assume I can do this per system (so that I don't have to
> bring the whole cluster down, only stop the cluster services and unmount
> the GFS volumes per node)?
> 
> Any chance this patch will make it into the standard RHEL-package?
> I want to avoid to maintain my own patched packages, although as long
> as gfs.ko is in the separate kmod-gfs package that's doable.
> 


From jos at xos.nl  Tue Oct 30 07:27:34 2007
From: jos at xos.nl (Jos Vos)
Date: Tue, 30 Oct 2007 08:27:34 +0100
Subject: [Linux-cluster] GFS RG size (and tuning)
In-Reply-To: <1193705590.14247.59.camel@eclipse.office.r-networks.net>
References: <200710262216.l9QMGpf5032362@jasmine.xos.nl>
	<47227EDE.4010901@redhat.com>
	<20071027125726.GA5193@jasmine.xos.nl>
	<1193705590.14247.59.camel@eclipse.office.r-networks.net>
Message-ID: <20071030072734.GB7395@jasmine.xos.nl>

On Mon, Oct 29, 2007 at 08:53:10PM -0400, Glen Dosey wrote:

> We have several 3.3TB and larger filesystems on GFS1. I found that using
> the larger RG helps with performance. Increasing the statfs_slots seemed
> to help with df time and similar tasks as well. We have a filesystem
> with smaller RG's and I don't think the difference is worth moving the
> data around. If there is some other reason to make changes then I will
> increase the RG size. The statfs_slots can be increased on the fly and
> you should hopefully see an improvement. 

Unfortunately, no.  We see dramatic performace, df's may take 10+ minutes,
and above approx. 100 Mbit/sec total network traffic (mostly read from
GFS disks) the performance goes up and down, sometiems back to 5 Mbit/sec etc.
I have statfs_slots now set to 128, or should that be bigger?

We created now one fs with larger RG's and at least df works better, but
I fear we also have to do the other patches.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From carlopmart at gmail.com  Tue Oct 30 08:00:55 2007
From: carlopmart at gmail.com (carlopmart)
Date: Tue, 30 Oct 2007 09:00:55 +0100
Subject: [Linux-cluster] fence gnbd doesn't works as expected
Message-ID: <4726E4B7.9000700@gmail.com>

Hi all,

  I have already installed two nodes cluster using gnbd as a fence device. When 
tow nodes comes up at the same time all works ok, but when only I need to start 
only one node, GFS doesn't mounts because fence device doesn't works. Error is:

Mounting GFS filesystems:  /sbin/mount.gfs: lock_dlm_join: gfs_controld join 
error: -22
/sbin/mount.gfs: error mounting lockproto lock_dlm.

I am using a third server as GNBD server wihout serving disks. Why this doesn't 
works?? Perhaps do I need quorum disk??

My cluster.conf:

<?xml version="1.0"?>
<cluster alias="XenDomUcluster" config_version="3" name="XenDomUcluster">
         <fence_daemon post_fail_delay="0" post_join_delay="3"/>
         <clusternodes>
                 <clusternode name="node01.hpulabs.org" nodeid="1" votes="1">
                         <fence>
                                 <method name="1">
                                         <device name="gnbd-fence" 
nodename="node01.hpulabs.org"/>
                                 </method>
                         </fence>
                         <multicast addr="239.192.75.55" interface="eth0"/>
                 </clusternode>
                 <clusternode name="node02.hpulabs.org" nodeid="2" votes="1">
                         <fence>
                                 <method name="1">
                                         <device name="gnbd-fence" 
nodename="node02.hpulabs.org"/>
                                 </method>
                         </fence>
                         <multicast addr="239.192.75.55" interface="eth0"/>
                 </clusternode>
         </clusternodes>
         <cman expected_votes="1" two_node="1">
                 <multicast addr="239.192.75.55"/>
         </cman>
         <fencedevices>
                 <fencedevice agent="fence_gnbd" name="gnbd-fence" 
servers="gnbdserv.hpulabs.org"/>
         </fencedevices>
         <rm log_facility="local4" log_level="7">
                 <failoverdomains>
                         <failoverdomain name="PriCluster" ordered="1" 
restricted="1">
                                 <failoverdomainnode name="node01.hpulabs.org" 
priority="1"/>
                                 <failoverdomainnode name="node02.hpulabs.org" 
priority="2"/>
                         </failoverdomain>
                         <failoverdomain name="SecCluster" ordered="1" 
restricted="1">
                                 <failoverdomainnode name="node02.hpulabs.org" 
priority="1"/>
                                 <failoverdomainnode name="node01.hpulabs.org" 
priority="2"/>
                         </failoverdomain>
                 </failoverdomains>
                 <resources>
                         <ip address="172.25.50.11" monitor_link="1"/>
                         <ip address="172.25.50.12" monitor_link="1"/>
                         <ip address="172.25.50.13" monitor_link="1"/>
                         <ip address="172.25.50.14" monitor_link="1"/>
                         <ip address="172.25.50.15" monitor_link="1"/>
                         <ip address="172.25.50.16" monitor_link="1"/>
                         <ip address="172.25.50.17" monitor_link="1"/>
                         <ip address="172.25.50.18" monitor_link="1"/>
			<ip address="172.25.50.19" monitor_link="1"/>
                         <ip address="172.25.50.20" monitor_link="1"/>
                 </resources>
                 <service autostart="1" domain="PriCluster" name="rsync-svc" 
recovery="relocate">
                         <ip ref="172.25.50.11">
                                 <script 
file="/data/cfgcluster/etc/init.d/rsyncd" name="rsyncd"/>
                         </ip>
                 </service>
                 <service autostart="1" domain="SecCluster" name="wwwsoft-svc" 
recovery="relocate">
                         <ip ref="172.25.50.12">
                                 <script 
file="/data/cfgcluster/etc/init.d/httpd-mirror" name="httpd-mirror"/>
                         </ip>
                 </service>
                 <service autostart="1" domain="PriCluster" name="proxy-svc" 
recovery="relocate">
                         <ip ref="172.25.50.13">
                                 <script 
file="/data/cfgcluster/etc/init.d/squid" name="squid"/>
                         </ip>
                 </service>
                 <service autostart="1" domain="SecCluster" name="mail-svc" 
recovery="relocate">
                         <ip ref="172.25.50.14">
                                 <script 
file="/data/cfgcluster/etc/init.d/postfix-cluster" name="postfix-cluster"/>
                         </ip>
                 </service>
         </rm>
</cluster>
-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From pcaulfie at redhat.com  Tue Oct 30 11:02:22 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 30 Oct 2007 11:02:22 +0000
Subject: [Linux-cluster] clvmd hangs when third node tries to connect
	to	cluster
In-Reply-To: <cdd145f50710290351k30221b3eh6afff88e4e8e9d47@mail.gmail.com>
References: <cdd145f50710290351k30221b3eh6afff88e4e8e9d47@mail.gmail.com>
Message-ID: <47270F3E.2030607@redhat.com>

s.c.graham at gmail.com wrote:
> Hi there,
> 
> I have a cluster with three nodes (all clone HL DL380 G4s) attached to
> a Fibre SAN (HP MSA1000) and serving a number of GFS filesystems.  My
> OS is Ubuntu Dapper (6.06) and my kernel is 2.6.15-29-amd64-server.
> These machines have been working nicely for a long time.
> 
> On the weekend I "apt-get updated" to the latest version of the Dapper
> redhat-cluster-suite package (1.20060222-0ubuntu6.1).  Now, when the
> cluster boots the first two nodes to come up are able to see the GFS
> filesystem. However, the third node to come up hangs at the point of
> starting the clvm service.  Concomitantly, I see the following message
> in /var/log/syslog of one of the other machines in the cluster:
> 
> Oct 28 14:42:18 machinea kernel: [ 1681.325152] CMAN: node machinec rejoining
> Oct 28 14:42:20 machinea kernel: [ 1683.528299] Extra connection from
> node 2 attempted
> 
> It does not seem to matter which order the nodes come up in - it is
> always the third node to boot that will hang when starting clvmd.  I
> have included my cluster.conf file below for reference - I can include
> any additional diagnostics as required.
> 
> Any help would be most appreciated!

That sounds like a bug that has already been fixed. I don't have the reference
to hand as I've just returned from holiday, sorry.

Patrick

Registered Address: Red Hat UK Ltd, Amberley Place, 107-111 Peascod Street,
Windsor, Berkshire, SL4 ITE, UK.
Registered in England and Wales under Company Registration No. 3798903


From joseparrella at gmail.com  Tue Oct 30 12:54:44 2007
From: joseparrella at gmail.com (=?ISO-8859-1?Q?Jos=E9_Miguel_Parrella_Romero?=)
Date: Tue, 30 Oct 2007 08:54:44 -0400
Subject: [Linux-cluster] Two node cluster - howto
In-Reply-To: <640358900710281720j5c036bb5kf185b19f04e07192@mail.gmail.com>
References: <640358900710281720j5c036bb5kf185b19f04e07192@mail.gmail.com>
Message-ID: <47272994.3000306@gmail.com>

El 28/10/07 20:20, Andrew Hole escribi?:
> II want to create a very basic cluster of two nodes to configure
> failover of a service. This

OT: You're probably better off with Heartbeat than with Red Hat's 
Cluster suite. You might even try the Ultramonkey strategy for building 
HA two-node clusters [1]. Basically it involves installing Heartbeat on 
each node, establishing a dedicated communication media between both 
nodes (such as crossover Ethernet cable, or whatever) and declaring 
which services you want to be handled by Heartbeat.

If you want fencing, there's STONITH for Heartbeat, FWIW.

HTH,
Jose

[1] http://www.ultramonkey.org/3/topologies/ha-overview.html


From teigland at redhat.com  Tue Oct 30 13:51:57 2007
From: teigland at redhat.com (David Teigland)
Date: Tue, 30 Oct 2007 08:51:57 -0500
Subject: [Linux-cluster] fence gnbd doesn't works as expected
In-Reply-To: <4726E4B7.9000700@gmail.com>
References: <4726E4B7.9000700@gmail.com>
Message-ID: <20071030135157.GA3706@redhat.com>

On Tue, Oct 30, 2007 at 09:00:55AM +0100, carlopmart wrote:
> Hi all,
> 
>  I have already installed two nodes cluster using gnbd as a fence device. 
>  When tow nodes comes up at the same time all works ok, but when only I need 
> to start only one node, GFS doesn't mounts because fence device doesn't 
> works. Error is:
> 
> Mounting GFS filesystems:  /sbin/mount.gfs: lock_dlm_join: gfs_controld 
> join error: -22
> /sbin/mount.gfs: error mounting lockproto lock_dlm.

Is there anything in /var/log/messages?  What does 'group_tool -v' show?

Dave


From carlopmart at gmail.com  Tue Oct 30 14:06:53 2007
From: carlopmart at gmail.com (carlopmart)
Date: Tue, 30 Oct 2007 15:06:53 +0100
Subject: [Linux-cluster] fence gnbd doesn't works as expected (I think
	it's solved)
In-Reply-To: <20071030135157.GA3706@redhat.com>
References: <4726E4B7.9000700@gmail.com> <20071030135157.GA3706@redhat.com>
Message-ID: <47273A7D.2040600@gmail.com>

David Teigland wrote:
> On Tue, Oct 30, 2007 at 09:00:55AM +0100, carlopmart wrote:
>> Hi all,
>>
>>  I have already installed two nodes cluster using gnbd as a fence device. 
>>  When tow nodes comes up at the same time all works ok, but when only I need 
>> to start only one node, GFS doesn't mounts because fence device doesn't 
>> works. Error is:
>>
>> Mounting GFS filesystems:  /sbin/mount.gfs: lock_dlm_join: gfs_controld 
>> join error: -22
>> /sbin/mount.gfs: error mounting lockproto lock_dlm.
> 
> Is there anything in /var/log/messages?  What does 'group_tool -v' show?
> 
> Dave
> 
> 
Hi Dave and all,

  I think that I solved the problem. I have included gnbd server on cluster, but 
I need to modify cman startup script to start gnbd_serv after fence. Is this 
problem solved on rhel 5.1 beta??


-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From lhh at redhat.com  Tue Oct 30 18:57:15 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 30 Oct 2007 14:57:15 -0400
Subject: [Linux-cluster] Two node question
In-Reply-To: <47260EDF.8010301@bxwa.com>
References: <47260EDF.8010301@bxwa.com>
Message-ID: <1193770636.9223.118.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-29 at 09:48 -0700, Scott Becker wrote:
> I'm building a 3-node cluster.
> 
> One public subnet.
> 
> No shared filesystem
> 
> Starting with just one service (www) on one movable IP address.

So, static content copied manually (or rsync'd periodically) between
multiple nodes...


> I will be routinely bring down one node at a time for maintenance so I 
> must configure the cluster for one node operation. I can't use the 
> two_node="1" setting because I would have to bring the cluster down to 
> set and unset it.

Not only that, but you need to use power fencing w/ two_node="1" for it
to work.


> I want to use an IP tie-breaker to verify that the node can reach the 
> main router. Is this, along with a self test of the web service, fs 
> writable, etc. adequate for one, two and three node operation? Or would 
> I also need a Qdisk?

Currently, you can't use qdisk to do that - because qdisk needs to talk
to shared storage (and therefore, fencing) as well.


What about just getting another pair of nodes that you can just let run
and running piranha+LVS to direct traffic to your frequently-maintained
servers since there's no shared data?

It kind of side-steps the quorum problem.

-- Lon


From lhh at redhat.com  Tue Oct 30 18:58:14 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 30 Oct 2007 14:58:14 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <C34B6E61.11348C%jgray@nicusa.com>
References: <C34B6E61.11348C%jgray@nicusa.com>
Message-ID: <1193770694.9223.121.camel@ayanami.boston.devel.redhat.com>

On Mon, 2007-10-29 at 11:02 -0400, Josh Gray wrote:
> Lon and the other gurus - have you guys ever seen an ext3 volume get mounted
> on multiple cluster nodes at the same time WITHOUT a split brain? No
> fencing, no errors logged, no network issues, etc..   I even ran clustat and
> both nodes (let's say B and C ) even said that the single node I thought was
> up was in control (B).

Yes - actually, what release do you have?

-- Lon


From jgray at nicusa.com  Tue Oct 30 19:07:28 2007
From: jgray at nicusa.com (Josh Gray)
Date: Tue, 30 Oct 2007 15:07:28 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <1193770694.9223.121.camel@ayanami.boston.devel.redhat.com>
Message-ID: <C34CF930.113B33%jgray@nicusa.com>

You asked me that a while back, pardon my inexperience with Linux thus far
not positive how to get the version numbers you're looking for.

Here's a few - 

# uname -a
Linux nfs-5.cdc.nicusa.com 2.6.18-8.1.15.el5 #1 SMP Thu Oct 4 04:06:39 EDT
2007 x86_64 x86_64 x86_64 GNU/Linux

# cman_tool version
6.0.1 config 59

# clusvcadm -v
2.0.24


On 10/30/07 2:58 PM, "Lon Hohberger" <lhh at redhat.com> wrote:

> On Mon, 2007-10-29 at 11:02 -0400, Josh Gray wrote:
>> Lon and the other gurus - have you guys ever seen an ext3 volume get mounted
>> on multiple cluster nodes at the same time WITHOUT a split brain? No
>> fencing, no errors logged, no network issues, etc..   I even ran clustat and
>> both nodes (let's say B and C ) even said that the single node I thought was
>> up was in control (B).
> 
> Yes - actually, what release do you have?
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From Christopher.Barry at qlogic.com  Wed Oct 31 15:27:58 2007
From: Christopher.Barry at qlogic.com (Christopher Barry)
Date: Wed, 31 Oct 2007 11:27:58 -0400
Subject: [Linux-cluster] scsi reservation issue
Message-ID: <1193844478.5162.53.camel@localhost>

Greetings all,

I have 2 vmware esx servers, each hitting a NetApp over FS, and each
with 3 RHCS cluster nodes trying to mount a gfs volume.

All of the nodes (1,2,& 3) on esx-01 can mount the volume fine, but none
of the nodes in the second esx box can mount the gfs volume at all, and
I get the following error in dmesg:

Lock_Harness 2.6.9-72.2 (built Apr 24 2007 12:45:38) installed
GFS 2.6.9-72.2 (built Apr 24 2007 12:45:54) installed
GFS: Trying to join cluster "lock_dlm", "kop-sds:gfs_home"
Lock_DLM (built Apr 24 2007 12:45:40) installed
GFS: fsid=kop-sds:gfs_home.2: Joined cluster. Now mounting FS...
GFS: fsid=kop-sds:gfs_home.2: jid=2: Trying to acquire journal lock...
GFS: fsid=kop-sds:gfs_home.2: jid=2: Looking at journal...
GFS: fsid=kop-sds:gfs_home.2: jid=2: Done
scsi2 (0,0,0) : reservation conflict
SCSI error : <2 0 0 0> return code = 0x18
end_request: I/O error, dev sdc, sector 523720263
scsi2 (0,0,0) : reservation conflict
SCSI error : <2 0 0 0> return code = 0x18
end_request: I/O error, dev sdc, sector 523720271
scsi2 (0,0,0) : reservation conflict
SCSI error : <2 0 0 0> return code = 0x18
end_request: I/O error, dev sdc, sector 523720279
GFS: fsid=kop-sds:gfs_home.2: fatal: I/O error
GFS: fsid=kop-sds:gfs_home.2:   block = 65464979
GFS: fsid=kop-sds:gfs_home.2:   function = gfs_logbh_wait
GFS: fsid=kop-sds:gfs_home.2:   file
= /builddir/build/BUILD/gfs-kernel-2.6.9-72/smp/src/gfs/dio.c, line =
923
GFS: fsid=kop-sds:gfs_home.2:   time = 1193838678
GFS: fsid=kop-sds:gfs_home.2: about to withdraw from the cluster
GFS: fsid=kop-sds:gfs_home.2: waiting for outstanding I/O
GFS: fsid=kop-sds:gfs_home.2: telling LM to withdraw
lock_dlm: withdraw abandoned memory
GFS: fsid=kop-sds:gfs_home.2: withdrawn
GFS: fsid=kop-sds:gfs_home.2: can't get resource index inode: -5


Does anyone have a clue as to where I should start looking?


Thanks,
-C


From rohara at redhat.com  Wed Oct 31 15:44:09 2007
From: rohara at redhat.com (Ryan O'Hara)
Date: Wed, 31 Oct 2007 10:44:09 -0500
Subject: [Linux-cluster] scsi reservation issue
In-Reply-To: <1193844478.5162.53.camel@localhost>
References: <1193844478.5162.53.camel@localhost>
Message-ID: <4728A2C9.3000501@redhat.com>

Christopher Barry wrote:
> Greetings all,
> 
> I have 2 vmware esx servers, each hitting a NetApp over FS, and each
> with 3 RHCS cluster nodes trying to mount a gfs volume.
> 
> All of the nodes (1,2,& 3) on esx-01 can mount the volume fine, but none
> of the nodes in the second esx box can mount the gfs volume at all, and
> I get the following error in dmesg:

Are you intentionally trying to use scsi reservations as a fence method? 
It sounds like the nodes on esx-01 are creating reservations, but the 
nodes on the second esx box are not registering with the device and 
therefore are unable to mount the filesystem. Creation of reservations 
and registrations is handled by the scsi_reserve init script, which 
should be run at startup on all nodes in the cluster. You can check to 
see what devices a node is registered for before you mount the 
filesystem by doing /etc/init.d/scsi_reservce status. If your nodes are 
not registered with the device and a reservation exists then you won't 
be able to mount.

> Lock_Harness 2.6.9-72.2 (built Apr 24 2007 12:45:38) installed
> GFS 2.6.9-72.2 (built Apr 24 2007 12:45:54) installed
> GFS: Trying to join cluster "lock_dlm", "kop-sds:gfs_home"
> Lock_DLM (built Apr 24 2007 12:45:40) installed
> GFS: fsid=kop-sds:gfs_home.2: Joined cluster. Now mounting FS...
> GFS: fsid=kop-sds:gfs_home.2: jid=2: Trying to acquire journal lock...
> GFS: fsid=kop-sds:gfs_home.2: jid=2: Looking at journal...
> GFS: fsid=kop-sds:gfs_home.2: jid=2: Done
> scsi2 (0,0,0) : reservation conflict
> SCSI error : <2 0 0 0> return code = 0x18
> end_request: I/O error, dev sdc, sector 523720263
> scsi2 (0,0,0) : reservation conflict
> SCSI error : <2 0 0 0> return code = 0x18
> end_request: I/O error, dev sdc, sector 523720271
> scsi2 (0,0,0) : reservation conflict
> SCSI error : <2 0 0 0> return code = 0x18
> end_request: I/O error, dev sdc, sector 523720279
> GFS: fsid=kop-sds:gfs_home.2: fatal: I/O error
> GFS: fsid=kop-sds:gfs_home.2:   block = 65464979
> GFS: fsid=kop-sds:gfs_home.2:   function = gfs_logbh_wait
> GFS: fsid=kop-sds:gfs_home.2:   file
> = /builddir/build/BUILD/gfs-kernel-2.6.9-72/smp/src/gfs/dio.c, line =
> 923
> GFS: fsid=kop-sds:gfs_home.2:   time = 1193838678
> GFS: fsid=kop-sds:gfs_home.2: about to withdraw from the cluster
> GFS: fsid=kop-sds:gfs_home.2: waiting for outstanding I/O
> GFS: fsid=kop-sds:gfs_home.2: telling LM to withdraw
> lock_dlm: withdraw abandoned memory
> GFS: fsid=kop-sds:gfs_home.2: withdrawn
> GFS: fsid=kop-sds:gfs_home.2: can't get resource index inode: -5
> 
> 
> Does anyone have a clue as to where I should start looking?
> 
> 
> Thanks,
> -C
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Wed Oct 31 16:16:58 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 31 Oct 2007 12:16:58 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <C34CF930.113B33%jgray@nicusa.com>
References: <C34CF930.113B33%jgray@nicusa.com>
Message-ID: <1193847418.9223.171.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-10-30 at 15:07 -0400, Josh Gray wrote:
> You asked me that a while back, pardon my inexperience with Linux thus far
> not positive how to get the version numbers you're looking for.
> 
> Here's a few - 
> 
> # uname -a
> Linux nfs-5.cdc.nicusa.com 2.6.18-8.1.15.el5 #1 SMP Thu Oct 4 04:06:39 EDT
> 2007 x86_64 x86_64 x86_64 GNU/Linux
> 
> # cman_tool version
> 6.0.1 config 59
> 
> # clusvcadm -v
> 2.0.24

That's the ticket.

It sounds like you're hitting this, but I thought it was isolated to
post-RHEL5.0:

https://bugzilla.redhat.com/show_bug.cgi?id=249758

-- Lon


From james at cloud9.co.uk  Wed Oct 31 16:17:02 2007
From: james at cloud9.co.uk (James Fidell)
Date: Wed, 31 Oct 2007 16:17:02 +0000
Subject: [Linux-cluster] NFS failover question
Message-ID: <4728AA7E.2010602@cloud9.co.uk>

I have a cluster doing NFS failover that I'm testing atm.  I have three
nodes configured as per the cookbook example with one node having
higher priority than the other two and can mount NFS partitions from
that node using the floating IP address.  When I power off that node,
one of the remaining servers takes over and everything looks rosy.

When I bring the original server back into the cluster, should the
service stay where it is, or flip back to the highest priority server?

Without grubbing through the logs, is there any way to tell which server
is now providing the NFS service?

And if the service doesn't move back to the highest priority server, is
there any way to make it do so?

(It's entirely possible that the service hasn't moved back because the
node third node reports:

  <err> #52: Failed changing RG status
  <err> #57: Failed changing RG status

I've not got to the bottom of that yet.)

James


From raindoctor at gmail.com  Wed Oct 31 17:24:32 2007
From: raindoctor at gmail.com (Pedro Espinoza)
Date: Wed, 31 Oct 2007 13:24:32 -0400
Subject: [Linux-cluster] Why virtual ip?
Message-ID: <b86c876f0710311024h2530bcc0ra1903fb58da6ab6f@mail.gmail.com>

Gurus, I have a naive question?

Why cluster nodes need to have virtual ip?


From breeves at redhat.com  Wed Oct 31 17:23:09 2007
From: breeves at redhat.com (Bryn M. Reeves)
Date: Wed, 31 Oct 2007 17:23:09 +0000
Subject: [Linux-cluster] Why virtual ip?
In-Reply-To: <b86c876f0710311024h2530bcc0ra1903fb58da6ab6f@mail.gmail.com>
References: <b86c876f0710311024h2530bcc0ra1903fb58da6ab6f@mail.gmail.com>
Message-ID: <4728B9FD.2080605@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Pedro Espinoza wrote:
> Gurus, I have a naive question?
> 
> Why cluster nodes need to have virtual ip?

So that a service can be relocated transparently without the client's
awareness.

The IP's are sometimes referred to as "floating" addresses since they
are moved between nodes when the service they belong to moves.

Regards,
Bryn.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFHKLn96YSQoMYUY94RAjARAKCWQGpvOoanhvsos+CMKhHXsfYYKgCgqk2n
Xl8JJc3NKE2XCQhVaEJ2flU=
=vVEO
-----END PGP SIGNATURE-----


From jgray at nicusa.com  Wed Oct 31 17:54:56 2007
From: jgray at nicusa.com (Josh Gray)
Date: Wed, 31 Oct 2007 11:54:56 -0600
Subject: [Linux-cluster] scsi reservation issue
In-Reply-To: <1193844478.5162.53.camel@localhost>
Message-ID: <C34E1D90.113C9E%jgray@nicusa.com>

Chris - How many other VM's do you have too?   Are the RH servers local disk
or shared storage,  what about all the others?

Josh


On 10/31/07 9:27 AM, "Christopher Barry" <Christopher.Barry at qlogic.com>
wrote:

> Greetings all,
> 
> I have 2 vmware esx servers, each hitting a NetApp over FS, and each
> with 3 RHCS cluster nodes trying to mount a gfs volume.
> 
> All of the nodes (1,2,& 3) on esx-01 can mount the volume fine, but none
> of the nodes in the second esx box can mount the gfs volume at all, and
> I get the following error in dmesg:
> 
> Lock_Harness 2.6.9-72.2 (built Apr 24 2007 12:45:38) installed
> GFS 2.6.9-72.2 (built Apr 24 2007 12:45:54) installed
> GFS: Trying to join cluster "lock_dlm", "kop-sds:gfs_home"
> Lock_DLM (built Apr 24 2007 12:45:40) installed
> GFS: fsid=kop-sds:gfs_home.2: Joined cluster. Now mounting FS...
> GFS: fsid=kop-sds:gfs_home.2: jid=2: Trying to acquire journal lock...
> GFS: fsid=kop-sds:gfs_home.2: jid=2: Looking at journal...
> GFS: fsid=kop-sds:gfs_home.2: jid=2: Done
> scsi2 (0,0,0) : reservation conflict
> SCSI error : <2 0 0 0> return code = 0x18
> end_request: I/O error, dev sdc, sector 523720263
> scsi2 (0,0,0) : reservation conflict
> SCSI error : <2 0 0 0> return code = 0x18
> end_request: I/O error, dev sdc, sector 523720271
> scsi2 (0,0,0) : reservation conflict
> SCSI error : <2 0 0 0> return code = 0x18
> end_request: I/O error, dev sdc, sector 523720279
> GFS: fsid=kop-sds:gfs_home.2: fatal: I/O error
> GFS: fsid=kop-sds:gfs_home.2:   block = 65464979
> GFS: fsid=kop-sds:gfs_home.2:   function = gfs_logbh_wait
> GFS: fsid=kop-sds:gfs_home.2:   file
> = /builddir/build/BUILD/gfs-kernel-2.6.9-72/smp/src/gfs/dio.c, line =
> 923
> GFS: fsid=kop-sds:gfs_home.2:   time = 1193838678
> GFS: fsid=kop-sds:gfs_home.2: about to withdraw from the cluster
> GFS: fsid=kop-sds:gfs_home.2: waiting for outstanding I/O
> GFS: fsid=kop-sds:gfs_home.2: telling LM to withdraw
> lock_dlm: withdraw abandoned memory
> GFS: fsid=kop-sds:gfs_home.2: withdrawn
> GFS: fsid=kop-sds:gfs_home.2: can't get resource index inode: -5
> 
> 
> Does anyone have a clue as to where I should start looking?
> 
> 
> Thanks,
> -C
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From jgray at nicusa.com  Wed Oct 31 17:56:38 2007
From: jgray at nicusa.com (Josh Gray)
Date: Wed, 31 Oct 2007 11:56:38 -0600
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <1193847418.9223.171.camel@ayanami.boston.devel.redhat.com>
Message-ID: <C34E1DF6.113C9F%jgray@nicusa.com>

Hmmm I don't see to have access to that bug after creating an account.  Can
you summarize or cut/paste some info?

Josh


On 10/31/07 10:16 AM, "Lon Hohberger" <lhh at redhat.com> wrote:

> On Tue, 2007-10-30 at 15:07 -0400, Josh Gray wrote:
>> You asked me that a while back, pardon my inexperience with Linux thus far
>> not positive how to get the version numbers you're looking for.
>> 
>> Here's a few - 
>> 
>> # uname -a
>> Linux nfs-5.cdc.nicusa.com 2.6.18-8.1.15.el5 #1 SMP Thu Oct 4 04:06:39 EDT
>> 2007 x86_64 x86_64 x86_64 GNU/Linux
>> 
>> # cman_tool version
>> 6.0.1 config 59
>> 
>> # clusvcadm -v
>> 2.0.24
> 
> That's the ticket.
> 
> It sounds like you're hitting this, but I thought it was isolated to
> post-RHEL5.0:
> 
> https://bugzilla.redhat.com/show_bug.cgi?id=249758
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From lhh at redhat.com  Wed Oct 31 18:55:54 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 31 Oct 2007 14:55:54 -0400
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <C34E1DF6.113C9F%jgray@nicusa.com>
References: <C34E1DF6.113C9F%jgray@nicusa.com>
Message-ID: <1193856955.10269.0.camel@ayanami.boston.devel.redhat.com>

On Wed, 2007-10-31 at 11:56 -0600, Josh Gray wrote:
> Hmmm I don't see to have access to that bug after creating an account.  Can
> you summarize or cut/paste some info?
> 
> Josh

Hmm, try now, it had a misplaced checkbox checked.

-- Lon


From jgray at nicusa.com  Wed Oct 31 19:01:59 2007
From: jgray at nicusa.com (Josh Gray)
Date: Wed, 31 Oct 2007 13:01:59 -0600
Subject: [Linux-cluster] EXT3 service mounted on two nodes
In-Reply-To: <1193856955.10269.0.camel@ayanami.boston.devel.redhat.com>
Message-ID: <C34E2D47.113CB6%jgray@nicusa.com>

Interesting, thanks.   I'll mention this to support.  I'm sure they can find
other ways to improve my increasingly ugly config though too!

Josh


On 10/31/07 12:55 PM, "Lon Hohberger" <lhh at redhat.com> wrote:

> On Wed, 2007-10-31 at 11:56 -0600, Josh Gray wrote:
>> Hmmm I don't see to have access to that bug after creating an account.  Can
>> you summarize or cut/paste some info?
>> 
>> Josh
> 
> Hmm, try now, it had a misplaced checkbox checked.
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
Josh Gray
Systems Administrator
NIC Inc

Email: jgray at nicusa.com
Desk/Mobile: 913-221-1520

"It is not the mountain we conquer, but ourselves."
- Sir Edmund Hillary


From Christopher.Barry at qlogic.com  Wed Oct 31 19:16:59 2007
From: Christopher.Barry at qlogic.com (Christopher Barry)
Date: Wed, 31 Oct 2007 15:16:59 -0400
Subject: [Linux-cluster] scsi reservation issue
In-Reply-To: <C34E1D90.113C9E%jgray@nicusa.com>
References: <C34E1D90.113C9E%jgray@nicusa.com>
Message-ID: <1193858220.5162.67.camel@localhost>

On Wed, 2007-10-31 at 11:54 -0600, Josh Gray wrote:
> Chris - How many other VM's do you have too?   Are the RH servers local disk
> or shared storage,  what about all the others?
> 
> Josh
> 
> 
> On 10/31/07 9:27 AM, "Christopher Barry" <Christopher.Barry at qlogic.com>
> wrote:
> 
> > Greetings all,
> > 
> > I have 2 vmware esx servers, each hitting a NetApp over FS, and each
> > with 3 RHCS cluster nodes trying to mount a gfs volume.
> > 
> > All of the nodes (1,2,& 3) on esx-01 can mount the volume fine, but none
> > of the nodes in the second esx box can mount the gfs volume at all, and
> > I get the following error in dmesg:
> > 
> > Lock_Harness 2.6.9-72.2 (built Apr 24 2007 12:45:38) installed
> > GFS 2.6.9-72.2 (built Apr 24 2007 12:45:54) installed
> > GFS: Trying to join cluster "lock_dlm", "kop-sds:gfs_home"
> > Lock_DLM (built Apr 24 2007 12:45:40) installed
> > GFS: fsid=kop-sds:gfs_home.2: Joined cluster. Now mounting FS...
> > GFS: fsid=kop-sds:gfs_home.2: jid=2: Trying to acquire journal lock...
> > GFS: fsid=kop-sds:gfs_home.2: jid=2: Looking at journal...
> > GFS: fsid=kop-sds:gfs_home.2: jid=2: Done
> > scsi2 (0,0,0) : reservation conflict
> > SCSI error : <2 0 0 0> return code = 0x18
> > end_request: I/O error, dev sdc, sector 523720263
> > scsi2 (0,0,0) : reservation conflict
> > SCSI error : <2 0 0 0> return code = 0x18
> > end_request: I/O error, dev sdc, sector 523720271
> > scsi2 (0,0,0) : reservation conflict
> > SCSI error : <2 0 0 0> return code = 0x18
> > end_request: I/O error, dev sdc, sector 523720279
> > GFS: fsid=kop-sds:gfs_home.2: fatal: I/O error
> > GFS: fsid=kop-sds:gfs_home.2:   block = 65464979
> > GFS: fsid=kop-sds:gfs_home.2:   function = gfs_logbh_wait
> > GFS: fsid=kop-sds:gfs_home.2:   file
> > = /builddir/build/BUILD/gfs-kernel-2.6.9-72/smp/src/gfs/dio.c, line =
> > 923
> > GFS: fsid=kop-sds:gfs_home.2:   time = 1193838678
> > GFS: fsid=kop-sds:gfs_home.2: about to withdraw from the cluster
> > GFS: fsid=kop-sds:gfs_home.2: waiting for outstanding I/O
> > GFS: fsid=kop-sds:gfs_home.2: telling LM to withdraw
> > lock_dlm: withdraw abandoned memory
> > GFS: fsid=kop-sds:gfs_home.2: withdrawn
> > GFS: fsid=kop-sds:gfs_home.2: can't get resource index inode: -5
> > 
> > 
> > Does anyone have a clue as to where I should start looking?
> > 
> > 
> > Thanks,
> > -C
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> > 
> 


Hi Josh,

My cluster nodes are on the local mirrored disks, I have several other
VMs that do not participate in the cluster, and they are located in a
VMFS on the shared storage.

Each cluster node has three disks:
sda* : the operating sys and it is local to the host
sdb1 : a 20MB quorum disk on netapp (which mounts fine from all nodes)
sdc1 : a 600GB gfs volume on netapp (only mounts on 3 nodes from first
host)

both the quorum and the gfs use RDMs (one each per host)
the rdms were created on nodes 1 and 4, and then nodes 2 & 3 mount the
existing rdms from node 1, and 5 & 6 mount the existing rdms from node
4.

Each disk has it's own virtual scsi controller.
sda has scsi controller set to none.
sdb and sdc have scsi controller set to physical.


The error I get when attempting to mount from nodes 4,5, & 6 is:

mount: /dev/sdc1: can't read superblock


Any help at all would be great.


Thanks,
-C


From s.c.graham at gmail.com  Wed Oct 31 19:26:12 2007
From: s.c.graham at gmail.com (s.c.graham at gmail.com)
Date: Wed, 31 Oct 2007 19:26:12 +0000
Subject: [Linux-cluster] clvmd hangs when third node tries to connect to
	cluster
In-Reply-To: <47270F3E.2030607@redhat.com>
References: <cdd145f50710290351k30221b3eh6afff88e4e8e9d47@mail.gmail.com>
	<47270F3E.2030607@redhat.com>
Message-ID: <cdd145f50710311226l4228af5frb73252571e65a591@mail.gmail.com>

> > I have a cluster with three nodes (all clone HL DL380 G4s) attached to
> > a Fibre SAN (HP MSA1000) and serving a number of GFS filesystems.  My
> > OS is Ubuntu Dapper (6.06) and my kernel is 2.6.15-29-amd64-server.
> > These machines have been working nicely for a long time.
> >
> > On the weekend I "apt-get updated" to the latest version of the Dapper
> > redhat-cluster-suite package (1.20060222-0ubuntu6.1).  Now, when the
> > cluster boots the first two nodes to come up are able to see the GFS
> > filesystem. However, the third node to come up hangs at the point of
> > starting the clvm service.  Concomitantly, I see the following message
> > in /var/log/syslog of one of the other machines in the cluster:
> >
> > Oct 28 14:42:18 machinea kernel: [ 1681.325152] CMAN: node machinec rejoining
> > Oct 28 14:42:20 machinea kernel: [ 1683.528299] Extra connection from
> > node 2 attempted
> >
> > It does not seem to matter which order the nodes come up in - it is
> > always the third node to boot that will hang when starting clvmd.  I
> > have included my cluster.conf file below for reference - I can include
> > any additional diagnostics as required.
> >
> > Any help would be most appreciated!
>
> That sounds like a bug that has already been fixed. I don't have the reference
> to hand as I've just returned from holiday, sorry.

Does anyone else remember this bug (or could someone please point me
in the direction of the correct bugzilla so I can try and track it
down myself)?

Thanks,

Stephen


From james at cloud9.co.uk  Wed Oct 31 23:03:58 2007
From: james at cloud9.co.uk (James Fidell)
Date: Wed, 31 Oct 2007 23:03:58 +0000
Subject: [Linux-cluster] NFS failover question
In-Reply-To: <4728AA7E.2010602@cloud9.co.uk>
References: <4728AA7E.2010602@cloud9.co.uk>
Message-ID: <472909DE.40209@cloud9.co.uk>

James Fidell wrote:

> Without grubbing through the logs, is there any way to tell which server
> is now providing the NFS service?
> 
> And if the service doesn't move back to the highest priority server, is
> there any way to make it do so?

To answer my own questions, which are in the FAQ, but I'd somehow
managed not to find them first time around and only saw them when I was
looking for something else:

1) Yes.  clustat

2) Yes.  clusvcadm -r <service> -m <node>

James