From raziebe at gmail.com Thu Dec 1 11:05:57 2005 From: raziebe at gmail.com (Raz Ben-Jehuda(caro)) Date: Thu, 1 Dec 2005 13:05:57 +0200 Subject: [Linux-cluster] redundancy in redhat clusters In-Reply-To: <7c1e2e6e97d94de9571ee838d2d9a677@redhat.com> References: <5d96567b0511290719u3b88bf01w7a275b704f4eb813@mail.gmail.com> <7c1e2e6e97d94de9571ee838d2d9a677@redhat.com> Message-ID: <5d96567b0512010305t17f35c56la306d97b84de9925@mail.gmail.com> got it. good point. why do you think raid5 would give poor performance ? as long as it is not in degredation mode the performance scales to n-1 disks. thanks raz. On 11/30/05, Jonathan E Brassow wrote: > > > On Nov 29, 2005, at 9:19 AM, Raz Ben-Jehuda(caro) wrote: > > > Question: > > I need to add to a clsutered environment redundancy. > > > > Since the native linux raid 5 is not clustered awared, > > what would make it aware to the cluster ? > > What does it lack ? > > > > Clustered file systems and applications will ensure that they are not > doing simultaneous writes to the same [meta-]data. However, they have > no way to tell that a write to one area will conflict with the write to > another because of the stripe width and parity calculation of the RAID > device. This will lead to parity block corruption. > > To solve this problem, the RAID 5 implementation must be cluster aware > and take out single-writer/multiple-reader locks on the stripes - > ensuring that multiple machines are not writing to the same stripe at > the same time. > > The performance of a cluster-aware software RAID 5 is likely to be > abysmal, and will probably not rank very high on anyone's priority > list. > > A mirroring solution is in the works, and later, dd-raid may become a > reality. > > brassow > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- Raz -------------- next part -------------- An HTML attachment was scrubbed... URL: From danwest at comcast.net Thu Dec 1 15:17:45 2005 From: danwest at comcast.net (danwest at comcast.net) Date: Thu, 01 Dec 2005 15:17:45 +0000 Subject: [Linux-cluster] recovery= options implemented? Message-ID: <120120051517.12633.438F141900033D5E0000315922007354469B9C0A99020E0B@comcast.net> Does anyone know if the below options are actually implemented/working? Thanks, Dan This currently has three possible options: "restart" tries to restart failed parts of this resource group locally before attempting to relocate (default); "relocate" does not bother trying to restart the service locally; "disable" disables the resource group if any component fails. Note that any resource with a valid "recover" operation which can be recovered without a restart will be. Failure recovery policy From lhh at redhat.com Thu Dec 1 23:22:20 2005 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 01 Dec 2005 18:22:20 -0500 Subject: [Linux-cluster] recovery= options implemented? In-Reply-To: <120120051517.12633.438F141900033D5E0000315922007354469B9C0A99020E0B@comcast.net> References: <120120051517.12633.438F141900033D5E0000315922007354469B9C0A99020E0B@comcast.net> Message-ID: <1133479340.11030.152.camel@ayanami.boston.redhat.com> On Thu, 2005-12-01 at 15:17 +0000, danwest at comcast.net wrote: > Does anyone know if the below options are actually implemented/working? 
> > Thanks, > Dan > > > > This currently has three possible options: "restart" tries > to restart failed parts of this resource group locally before > attempting to relocate (default); "relocate" does not bother > trying to restart the service locally; "disable" disables > the resource group if any component fails. Note that > any resource with a valid "recover" operation which can be > recovered without a restart will be. > > > Failure recovery policy > > > They should be, but they're not in the GUI currently... -- Lon From bmarzins at redhat.com Fri Dec 2 22:51:08 2005 From: bmarzins at redhat.com (Benjamin Marzinski) Date: Fri, 2 Dec 2005 16:51:08 -0600 Subject: [Linux-cluster] help In-Reply-To: <5d96567b0511282201y3a616e11t332b16039b0ce2bb@mail.gmail.com> References: <5d96567b0511220738l7fb3d5c9u9df6dc12d2fd3bef@mail.gmail.com> <20051128161723.GC27662@redhat.com> <5d96567b0511281007j60a7dca1v9bf3d252a920a66e@mail.gmail.com> <20051128181357.GG27662@redhat.com> <5d96567b0511282146k4d1f839dq76b42a11e566c0d9@mail.gmail.com> <5d96567b0511282201y3a616e11t332b16039b0ce2bb@mail.gmail.com> Message-ID: <20051202225107.GB14768@phlogiston.msp.redhat.com> On Tue, Nov 29, 2005 at 08:01:20AM +0200, Raz Ben-Jehuda(caro) wrote: > sorry, i maanage to make it work only when cache enabled. > Is it possible to do it with no cache ? The only difference in letting you export between cached and uncached, is that uncached requires the server to be a member of a quorate cluster. Could you start up gnbd_serv with the -v option, try to export an uncached device, and mail me what the messages you get back, both from the command and from the logs. -Ben > On 11/29/05, Raz Ben-Jehuda(caro) wrote: > > been there. > > if i would load gnbd_serv with no cluster i would failed to export > > any devices with gnbd_export. > > if i join with: "cman_tool -X -e 2 join -c gamma -m 224.0.0.1 -i eth1" > > and then gnbd_export hangs and dmeg reports > > CMAN: Waiting to join or form a Linux-cluster > > CMAN: forming a new cluster > > CMANsendmsg failed: -22. > > sometimes gnbd_export just says that ERROR create request failed : > > Operation not supported: > > So again i am stuck. > > > > On 11/28/05, David Teigland wrote: > > > On Mon, Nov 28, 2005 at 08:07:47PM +0200, Raz Ben-Jehuda(caro) wrote: > > > > tried it. > > > > According to the min-gfs.txt at the GNBD server the only thing i have to do > > > > is simly run gnbd_serv. but looking at the code i learned that i need > > > > to load cman. > > > > yet this is not enough. gnbd_serv fails to load with : > > > > > > > > gnbd_serv: ERROR cannot get node name : No such process > > > > gnbd_serv: ERROR No cluster manager is running > > > > gnbd_serv: ERROR If you are not planning to use a cluster manager, use -n > > > > > > > > does gnbd_serv depends in a cluster manager? > > > > What is my mistake ? this is not part of the cluster. > > > > > > min-gfs.txt is wrong, I'll fix it. You need to use gnbd_serv -n. > > > Then gnbd_serv will ignore all clustering stuff which is what you want. > > > > > > Dave > > > > > > > > > > > > -- > > Raz > > > > > -- > Raz > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From bmarzins at redhat.com Fri Dec 2 23:16:45 2005 From: bmarzins at redhat.com (Benjamin Marzinski) Date: Fri, 2 Dec 2005 17:16:45 -0600 Subject: [Linux-cluster] Attn Patrick. 
In-Reply-To: <013d01c5f42d$c49a0120$6401a8c0@fileserver> References: <013d01c5f42d$c49a0120$6401a8c0@fileserver> Message-ID: <20051202231645.GC14768@phlogiston.msp.redhat.com> On Mon, Nov 28, 2005 at 11:09:45PM +0800, James Davis wrote: > Hi Patrick, > I'm hoping you can explain to me, and if I have the right idea > about GFS/GNBD > > I'm referring to this document http://gfs.wikidev.net/GNBD_installation for > the purposes of config > > What I'm trying to do is setup GNBD on 2 machines.. I'm wondering if its > possible for the two mahines to cluster the data on the local hdd's rather > than using an external storage array.. No. Not without cluster aware mirror software, which isn't available just yet. Otherwise, you run into the problem where If a node crashes, it might have written to one device and not the other. So your mirror can be out of sync, and the other machine will never know. -Ben > i.e machine raid0. basically replicated volumes... > > Also if this IS possible, am I right in assuming if one machine goes down > the second one will take over the primaries role... > > On the client machine how do you set it to connect to the cluster? > > The documentation seems to be very lacking and I'm somewhat stressed out > from work wondering if what I'm trying to do is possible? > > If you need more clarification on what I'm trying to do please ask. > > Sorry if this comes across as newbish > > Regards > James > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From elmar at pruesse.net Sat Dec 3 22:49:15 2005 From: elmar at pruesse.net (Elmar Pruesse) Date: Sat, 03 Dec 2005 23:49:15 +0100 Subject: [Linux-cluster] New (small) cluster; What filesystem? GFS? Message-ID: <439220EB.80901@pruesse.net> hi! We're getting a new cluster for Christmas this year, and I am wondering whether there are better options than the nfs-server setup used on our old one. Unfortunately we don't have any people who have had the chance to try things out, and we won't have much time either. I'd really appreciate some comments on what would make sense for us and what would not. (I did read up some, but everyone claims to be best...) We've got: Five nodes 16GB/dual 275 opteron, one 32GB DB-host and one 8GB fileserver. The latter two have external SCSI-Raids. All are interconnected via Infiniband. We will expand to 12 or 16 nodes with the next round of money. The old cluster (as provided by our IBM vendor) uses a NFS-server connected to a fibre-channel raid. Filesystem performance is a [expletive deleted] major problem. We will use the cluster for serveral different bioinformatics tools, some of which I've been told produce directories with many thousands of files. Does iSCSI+GFS make sense? And more so than NFS? Would you route it via the Infiniband network or via GbE? How about Lustre, PVFS2, OCFS? regards, Elmar ps: Please do tell me the combination of hardware makes at least a little sense. The infiniband was something of an afterthought, since it was unexpectetly cheap. 
From raziebe at gmail.com Sun Dec 4 11:07:58 2005 From: raziebe at gmail.com (Raz Ben-Jehuda(caro)) Date: Sun, 4 Dec 2005 13:07:58 +0200 Subject: [Linux-cluster] help In-Reply-To: <20051202225107.GB14768@phlogiston.msp.redhat.com> References: <5d96567b0511220738l7fb3d5c9u9df6dc12d2fd3bef@mail.gmail.com> <20051128161723.GC27662@redhat.com> <5d96567b0511281007j60a7dca1v9bf3d252a920a66e@mail.gmail.com> <20051128181357.GG27662@redhat.com> <5d96567b0511282146k4d1f839dq76b42a11e566c0d9@mail.gmail.com> <5d96567b0511282201y3a616e11t332b16039b0ce2bb@mail.gmail.com> <20051202225107.GB14768@phlogiston.msp.redhat.com> Message-ID: <5d96567b0512040307v52782e16teb34800b9f6e0cef@mail.gmail.com> gnbd_serv cannot load without -n flag. They fixed it in min-gfs.txt document. So, I have a little problem with it. On 12/3/05, Benjamin Marzinski wrote: > > On Tue, Nov 29, 2005 at 08:01:20AM +0200, Raz Ben-Jehuda(caro) wrote: > > sorry, i maanage to make it work only when cache enabled. > > Is it possible to do it with no cache ? > > The only difference in letting you export between cached and uncached, is > that > uncached requires the server to be a member of a quorate cluster. Could > you > start up gnbd_serv with the -v option, try to export an uncached device, > and > mail me what the messages you get back, both from the command and from the > logs. > > -Ben > > > On 11/29/05, Raz Ben-Jehuda(caro) wrote: > > > been there. > > > if i would load gnbd_serv with no cluster i would failed to export > > > any devices with gnbd_export. > > > if i join with: "cman_tool -X -e 2 join -c gamma -m 224.0.0.1 -i eth1" > > > and then gnbd_export hangs and dmeg reports > > > CMAN: Waiting to join or form a Linux-cluster > > > CMAN: forming a new cluster > > > CMANsendmsg failed: -22. > > > sometimes gnbd_export just says that ERROR create request failed : > > > Operation not supported: > > > So again i am stuck. > > > > > > On 11/28/05, David Teigland wrote: > > > > On Mon, Nov 28, 2005 at 08:07:47PM +0200, Raz Ben-Jehuda(caro) > wrote: > > > > > tried it. > > > > > According to the min-gfs.txt at the GNBD server the only thing i > have to do > > > > > is simly run gnbd_serv. but looking at the code i learned that i > need > > > > > to load cman. > > > > > yet this is not enough. gnbd_serv fails to load with : > > > > > > > > > > gnbd_serv: ERROR cannot get node name : No such process > > > > > gnbd_serv: ERROR No cluster manager is running > > > > > gnbd_serv: ERROR If you are not planning to use a cluster manager, > use -n > > > > > > > > > > does gnbd_serv depends in a cluster manager? > > > > > What is my mistake ? this is not part of the cluster. > > > > > > > > min-gfs.txt is wrong, I'll fix it. You need to use gnbd_serv -n. > > > > Then gnbd_serv will ignore all clustering stuff which is what you > want. > > > > > > > > Dave > > > > > > > > > > > > > > > > > -- > > > Raz > > > > > > > > > -- > > Raz > > > > -- > > Linux-cluster mailing list > > Linux-cluster at redhat.com > > https://www.redhat.com/mailman/listinfo/linux-cluster > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- Raz -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From raziebe at gmail.com Mon Dec 5 13:04:05 2005 From: raziebe at gmail.com (Raz Ben-Jehuda(caro)) Date: Mon, 5 Dec 2005 05:04:05 -0800 Subject: [Linux-cluster] question : the flow of adding a GNBD with new storage Message-ID: <5d96567b0512050504p2f478965x75a49eaa6ceb6a82@mail.gmail.com> when i am adding a new GNBD to the cluster with an additional storage. obvioulsy i must add it both to volume and the file system. question : does clustered linux migrate data to balance cluster ? If so , how ? -- thank you Raz -------------- next part -------------- An HTML attachment was scrubbed... URL: From gforte at leopard.us.udel.edu Tue Dec 6 00:40:16 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Mon, 05 Dec 2005 19:40:16 -0500 Subject: [Linux-cluster] two fencing problems Message-ID: <4394DDF0.4080603@leopard.us.udel.edu> two (probably related) questions concerning fencing and APC AP7900 units: 1) fence_apc doesn't appear to be compatible with these units - when I run: sudo /sbin/fence_apc -a -l -p -n1 -T -v it comes back with: failed: unrecognised menu response The output file shows that it's getting as far as the "Outlet Control/Configuration" menu, but never selects the specified port. This is on RHEL ES4 update 2 with fence-1.32.6-0 installed. Does anyone have this working with AP7900s, and if so did you have to hack the fence_apc script or is there just something I'm missing? 2) in the cluster configuration tool (GUI), there's no place to specify the port to cycle for an "APC Power Device". I tried adding "port=#" to the tags in the cluster.conf file, but the cluster configuration tool didn't like that. And of course, I was unable to test if this actually works anyway because of problem #1 :-( Anyway, assuming I get fence_apc to work, how do I specify ports in the cluster configuration tool? or is this not supported? In which case can I add the port option in the cluster.conf like I'm trying to do and have it work? I have system-config-cluster-1.0.16-1.0 installed. -g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From busyadmin at gmail.com Tue Dec 6 02:04:38 2005 From: busyadmin at gmail.com (busy admin) Date: Mon, 5 Dec 2005 19:04:38 -0700 Subject: [Linux-cluster] manual fencing not working in RHEL4 branch In-Reply-To: <20051130164839.GB23663@redhat.com> References: <1c0e77670511281307i75bc26a4pc5bbcd3d152a8c8e@mail.gmail.com> <20051128212731.GK27662@redhat.com> <1c0e77670511291853i2603f61ayf2eae51903032ebd@mail.gmail.com> <20051130164839.GB23663@redhat.com> Message-ID: <1c0e77670512051804l5cc38edfy1d2b87e8fd71cf2e@mail.gmail.com> David, I have tried the same init scripts with both ipmi and drac fencing, no problems. When I try manual fencing (it seems) that fence_manual introduces some strangeness such that I run into my problem. What is the problem: When running manual fencing and doing failover testing, my secondary node takes over the service without waiting for a fence_ack_manual. This all works perfectly with automatic fencing (ipmi, drac). I have the same problem (most of the time) when I run this whole thing by hand: 1. nodeA: ccsd 2. nodeB: ccsd 3. nodeA: cman_tool join -w 4. nodeB: cman_tool join -w 5. nodeA: fence_tool join -w 6. nodeB: fence_tool join -w When I start to see the problem, on the next reboot of both the systems I can replace steps 5 & 6 with 'fenced -D'. Now if I try to failover a machine then manual fencing works perfectly (meaning forces me to do a fence_ack_manual before a service fails over). 
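For reference, the by-hand sequence being described here boils down to something like the following sketch; the ordering notes are the important part, and the placeholder node name is illustrative rather than taken from this setup:

    # run on each node, keeping the two machines roughly in step
    ccsd
    cman_tool join -w          # wait until the node is a cluster member
    cat /proc/cluster/status   # confirm "Cluster-Member" and quorum
    fence_tool join -w         # only then join the fence domain

    # after a real failure, manual fencing blocks recovery until an
    # operator power-cycles the dead node and acknowledges it:
    fence_ack_manual -n <failed-node>

Joining the fence domain only after both nodes are members is the same point Dave makes in the reply quoted below: if membership is already complete when the fence domain forms, nothing should get fenced at startup.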
Next, I can go in and change 'fenced -D' back to 'fence_tool join -w' and things still work (forces me to run fence_ack_manual). Next, if I replace the manual steps above with the init scripts then manual fencing breaks all over again until I repeat the above steps. Sounds like a timing issue around fence_manual? Let me know if you want me to try anything different. Thanks for all your help. On 11/30/05, David Teigland wrote: > On Tue, Nov 29, 2005 at 07:53:09PM -0700, busy admin wrote: > > Here's a quick summary of what I've done and the results... to > > simplify the config I've just been running ccsd and cman via init > > scripts during boot and then manual executing 'fenced' or 'fence_tool' > > or the fenced init script. The results I see are random success's and > > failures! > > > > Initial test - reboot both systems and then, on both, executed 'fenced > > -D' both systems joined the cluster and it was quorate. Rebooted one > > node and to my surprise manual fencing worked, meaning > > /tmp/fence_manual.fifo was created and I had to run 'fence_ack_manual' > > on the other node. Tried again when the first node came back up and > > again everything worked as expected. > > > > Additional testing - reboot both system and then, on both, executed > > 'fence_tool join -w', both systems joined the cluster and it was > > quorate. Rebooted one node and no fencing was done (nothing logged in > > /var/log/messages). > > > > rebooted both systems again and this time executed 'fenced -D' on both > > nodes... rebooted a node and fencing worked, was logged in > > /var/log/messages and I had to manual run 'fence_ack_manual -n x64-5'. > > when that node came back up again I again manually executed 'fenced > > -D' on it and the cluster was quorate. I then rebooted the other node > > and again fencing worked! > > > > so again I rebooted both nodes and executed 'fence_tool join -w' on > > each... I again rebooted a node and fencing worked this time. fenced > > msgs were logged to /var/log/messages, /tmp/fence_manual.fifo was > > created and I had to execute 'fence_ack_manual -n x64-4' to recover. > > > > ... more testing w/mixed results ... > > > > modified fenced init script to execute 'fenced -D &' instead of > > 'fence_tool join -w' and used chkconfig to turn it on on both systems > > and rebooted them. both system restarted and joined the cluster. once > > again I rebooted one node (x64-4) and fencing didn't work... nothing > > was logged in /var/log/messages from fenced. see corresponding > > /var/log/messages, fenced -D output and cluster.conf below. > > It's not clear what you're trying to test or what you expect to happen. > Here's the optimal way to start up a cluster from a newly rebooted state: > > 1. nodeA: ccsd > 2. nodeB: ccsd > 3. nodeA: cman_tool join -w > 4. nodeB: cman_tool join -w > 5. nodeA: fence_tool join > 6. nodeB: fence_tool join > > It's best if steps 5 & 6 only happen after both nodes are members of > the cluster (see 'cman_tool nodes'). If this is the case, then no > nodes should be fenced when starting up. > > If you use the init scripts you may loose a little control and certainty > about what happens when, so I'd suggest using the commands directly until > you know that things are running correctly, then try the init scripts. > > If, from the state above, nodeB fails, then nodeA should always fence > nodeB. With manual fencing, this means that a message should appear in > nodeA's /var/log/messages telling you to reboot nodeB and run > fence_ack_manual. 
If, by chance, nodeB reboots and rejoins the cluster > before you get to running fence_ack_manual, the fencing system on nodeA > will just complete the fencing operation itself and you don't need to run > fence_ack_manual (and if you try, the fence_ack_manual command will report > an error.) > > Dave > > From bmarzins at redhat.com Tue Dec 6 02:49:20 2005 From: bmarzins at redhat.com (Benjamin Marzinski) Date: Mon, 5 Dec 2005 20:49:20 -0600 Subject: [Linux-cluster] help In-Reply-To: <5d96567b0512040307v52782e16teb34800b9f6e0cef@mail.gmail.com> References: <5d96567b0511220738l7fb3d5c9u9df6dc12d2fd3bef@mail.gmail.com> <20051128161723.GC27662@redhat.com> <5d96567b0511281007j60a7dca1v9bf3d252a920a66e@mail.gmail.com> <20051128181357.GG27662@redhat.com> <5d96567b0511282146k4d1f839dq76b42a11e566c0d9@mail.gmail.com> <5d96567b0511282201y3a616e11t332b16039b0ce2bb@mail.gmail.com> <20051202225107.GB14768@phlogiston.msp.redhat.com> <5d96567b0512040307v52782e16teb34800b9f6e0cef@mail.gmail.com> Message-ID: <20051206024920.GC30722@phlogiston.msp.redhat.com> On Sun, Dec 04, 2005 at 01:07:58PM +0200, Raz Ben-Jehuda(caro) wrote: > gnbd_serv cannot load without -n flag. > They fixed it in min-gfs.txt document. > So, I have a little problem with it. I have had no problem starting gnbd_serv without the -n option. If you try to start gnbd_serv without a cluster manager running on the node, you will receive a message like this on the command line gnbd_serv: ERROR cannot get node name : No such process gnbd_serv: ERROR No cluster manager is running gnbd_serv: ERROR If you are not planning to use a cluster manager, use -n and like this in syslog ERROR [gnbd_serv.c:389] If you are not planning to use a cluster manager, use -n This is not a gnbd error. This means that the cluster has not been properly started. Not only must there be a working cluster, but the gnbd server node must be a cluster member. In the min-gfs.txt document, the gnbd server node is not a cluster member, so you will not be able to export uncached gnbds with this setup. If you believe that the gnbd server node is a member of a quorate cluster, you can do this check. run # ccsd # cman_tool join on all cluster nodes. Then run # cat /proc/cluster/status For "Membership state:" you should see "Cluster-Member" For "Nodes:" you should see the number of nodes you expect there to be. If the cluster has not started up, you will not see a "Nodes:" section, and "Membership state:" will say something like "Starting" or "Not-in-Cluster". If you are a cluster member, and you still cannot start up gnbd_serv, please run it with the -v option, and send me the command and log output, along with the output from # cat /proc/cluster/status. Thanks, Ben > On 12/3/05, Benjamin Marzinski wrote: > > On Tue, Nov 29, 2005 at 08:01:20AM +0200, Raz Ben-Jehuda(caro) wrote: > > sorry, i maanage to make it work only when cache enabled. > > Is it possible to do it with no cache ? > > The only difference in letting you export between cached and uncached, > is that > uncached requires the server to be a member of a quorate cluster. Could > you > start up gnbd_serv with the -v option, try to export an uncached device, > and > mail me what the messages you get back, both from the command and from > the logs. > > -Ben > > > On 11/29/05, Raz Ben-Jehuda(caro) wrote: > > > been there. > > > if i would load gnbd_serv with no cluster i would failed to export > > > any devices with gnbd_export. 
> > > if i join with: "cman_tool -X -e 2 join -c gamma -m 224.0.0.1 -i > eth1" > > > and then gnbd_export hangs and dmeg reports > > > CMAN: Waiting to join or form a Linux-cluster > > > CMAN: forming a new cluster > > > CMANsendmsg failed: -22. > > > sometimes gnbd_export just says that ERROR create request failed : > > > Operation not supported: > > > So again i am stuck. > > > > > > On 11/28/05, David Teigland < teigland at redhat.com> wrote: > > > > On Mon, Nov 28, 2005 at 08:07:47PM +0200, Raz Ben-Jehuda(caro) > wrote: > > > > > tried it. > > > > > According to the min-gfs.txt at the GNBD server the only thing > i have to do > > > > > is simly run gnbd_serv. but looking at the code i learned that i > need > > > > > to load cman. > > > > > yet this is not enough. gnbd_serv fails to load with : > > > > > > > > > > gnbd_serv: ERROR cannot get node name : No such process > > > > > gnbd_serv: ERROR No cluster manager is running > > > > > gnbd_serv: ERROR If you are not planning to use a cluster > manager, use -n > > > > > > > > > > does gnbd_serv depends in a cluster manager? > > > > > What is my mistake ? this is not part of the cluster. > > > > > > > > min-gfs.txt is wrong, I'll fix it. You need to use gnbd_serv -n. > > > > Then gnbd_serv will ignore all clustering stuff which is what you > want. > > > > > > > > Dave > > > > > > > > > > > > > > > > > -- > > > Raz > > > > > > > > > -- > > Raz > > > > -- > > Linux-cluster mailing list > > Linux-cluster at redhat.com > > https://www.redhat.com/mailman/listinfo/linux-cluster > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > -- > Raz From raziebe at gmail.com Tue Dec 6 10:21:09 2005 From: raziebe at gmail.com (Raz Ben-Jehuda(caro)) Date: Tue, 6 Dec 2005 02:21:09 -0800 Subject: [Linux-cluster] no a storage question Message-ID: <5d96567b0512060221u8a92e5al44fdaa7a401a1466@mail.gmail.com> i know this not the place, but since they so many kernel developers here... i have a different question regarding locking in the kernel. In the last issue of linux magazine there was an article about locking. it presented the follwing scenario. spin_lock(lock) ... spin_unlock(lock). In uni processot mahcine with preemption enabled. spin_lock saves flags and cli(). spin_unlock() push flags out ( for nesting interrupts) they said that this code : Does not protect preemption. no protection from preemption ? How ? How can process B get some cpu while process A had disabled interrupts, no scheduling (unwillingly ) can be made.Prior to the preemption the kernel scehdular must run and set Process B as the one to run , and to the best my knowledge, a schedular runs only in timer interrupt. Or is it possible that the schduler timer routine isn't running in interrupt context ? thank you -- Raz -------------- next part -------------- An HTML attachment was scrubbed... URL: From adingman at cookgroup.com Tue Dec 6 15:29:23 2005 From: adingman at cookgroup.com (Andrew C. Dingman) Date: Tue, 06 Dec 2005 10:29:23 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <4394DDF0.4080603@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> Message-ID: <1133882963.7571.43.camel@adingman.cin.cook> At least in RHEL3, fence_apc only works with a very particular configuration on the power strip. 
In particular, the numbers for menu items change depending on whether the connecting user has permission to do various things, so revoking permission to use outlets that aren't part of your cluster will break the fencing agent. I ended up writing my own fencing agent that was able to deal with at least a few more configurations, though it's still not as flexible as I'd like. My fencing agent is attached. I wrote it for RHEL3 and APC AP7901 power strips, configured the way I wanted them, so it may or may not work for you. Read the code. Test it somewhere it can't do any significant damage. Don't come crying to me or my employer if it breaks. I disclaim any responsibility for anything it might do, however heinous. Read and understand the code before you use it. It's at least a start. It's not as general as I'd like it to be, but since it works in the clusters I wrote it for, I haven't been motivated to change it. It's derived from the fencing agents Red Hat distributes, and therefore also under the GPL. Once you've got that working for GFS, you can then set cluster suite to use the Stonith bridge to fence through GFS, so you don't need to explicitly configure the fencing device in system-config-cluster. Hope that helps. -Andrew C. Dingman Unix Administrator Cook Incorporated On Mon, 2005-12-05 at 19:40 -0500, Greg Forte wrote: > two (probably related) questions concerning fencing and APC AP7900 units: > > 1) fence_apc doesn't appear to be compatible with these units - when I run: > > sudo /sbin/fence_apc -a -l -p -n1 -T -v > > it comes back with: > > failed: unrecognised menu response > > The output file shows that it's getting as far as the "Outlet > Control/Configuration" menu, but never selects the specified port. > > This is on RHEL ES4 update 2 with fence-1.32.6-0 installed. > > Does anyone have this working with AP7900s, and if so did you have to > hack the fence_apc script or is there just something I'm missing? > > 2) in the cluster configuration tool (GUI), there's no place to > specify the port to cycle for an "APC Power Device". I tried adding > "port=#" to the tags in the cluster.conf file, but the > cluster configuration tool didn't like that. And of course, I was > unable to test if this actually works anyway because of problem #1 :-( > > Anyway, assuming I get fence_apc to work, how do I specify ports in the > cluster configuration tool? or is this not supported? In which case > can I add the port option in the cluster.conf like I'm trying to do and > have it work? I have system-config-cluster-1.0.16-1.0 installed. > > -g > > Greg Forte > gforte at udel.edu > IT - User Services > University of Delaware > 302-831-1982 > Newark, DE > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From gforte at leopard.us.udel.edu Tue Dec 6 16:26:34 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Tue, 06 Dec 2005 11:26:34 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <20051206034238.GA3226@rover.pcbi.upenn.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> Message-ID: <4395BBBA.2090404@leopard.us.udel.edu> Bryan Cardillo wrote: > I'm in the process of testing the attached patch, basically > just had to remove a portion of the match for the `Control > Outlet' option. Interesting ... 
I see you were getting hung on the menu after where I was - looks like my problem was that the author didn't expect anyone to rename their outlets to something more useful than "Outlet 1", "Outlet 2", etc. The same problem plagues the next menu, because it was looking to match the "----- Outlet # -------" banner, but the assigned name shows up there instead. The following patch (against the "original") seemingly fixes both of these problems generally (incorporating Bryan's fix as well). --- /sbin/fence_apc 2005-08-01 19:01:17.000000000 -0400 +++ fence_apc 2005-12-06 09:09:55.000000000 -0500 @@ -244,10 +244,10 @@ /--\s*device manager.*(\d+)\s*-\s*Outlet Control/is || # "Device Manager", "1- Cluster Node 0 ON" - /--\s*Outlet Control.*(\d+)\s*-\s+Outlet\s+$opt_n\D[^\n]*\s(?-i:ON|OFF)\*?\s/ism || + /--\s*Outlet Control.*($opt_n)\s*-[^\n]+\s(?-i:ON|OFF)\*?\s/ism || # Administrator Outlet Control menu - /--\s*Outlet $opt_n\D.*(\d+)\s*-\s*control outlet\s+$opt_n\D/ism + /Outlet\s+:\s*$opt_n\D.*(\d+)\s*-\s*control outlet/ism ) { $t->print($1); next; > here is the clusternode elem I'm using, with the port > specified, and seems to work so far. as far as I know, this > must be specified in the cluster.conf manually. > > > > > > > > Ah, I see I was confusing with - it looks like it is configurable in the configuration tool afterall, under "manage fencing for this node". Here's what I got after setting it up with my two cross-wired PDUs (the nodes have redundant power, so node 1 is plugged into outlet 1 on each pdu, and node 2 to outlet 2 on each pdu): Except then when I stopped the configurator and started it again it complained about the "switch=" options that it put there itself! removing them by hand seems to have fixed it. *sigh* And it still doesn't appear to work ... I can turn the outlets on and off from the command line, but if I down the interface on a node, the other node reports that it's removing the "failed" node from the cluster, and that it's fencing the "failed" node, but the "failed" node never gets shut down. Does this get logged somewhere besides /var/log/messages, or is there a way to force it to be more verbose? If I could see what command fenced is actually invoking that might help ... -g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From jeff at jettis.com Tue Dec 6 17:12:34 2005 From: jeff at jettis.com (Jeff Dinisco) Date: Tue, 6 Dec 2005 09:12:34 -0800 Subject: [Linux-cluster] custom fence agent Message-ID: Matt, This script is great. I just finished hacking on it for my own purposes and it's working well from the command line. Could you pass along your fencing section from cluster.conf as well? Thanks a million. - Jeff DiNisco _____ From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Matt Brookover Sent: Wednesday, November 30, 2005 5:34 PM To: linux clustering Subject: Re: [Linux-cluster] custom fence agent I took the fence_apc and hacked it to do what I needed. The fence agents are perl scripts and can easily be modified to fit most any SAN. Matt On Wed, 2005-11-30 at 13:58, Jeff Dinisco wrote: Could someone outline the rules for creating your own fencing agent and how they're applied in cluster.conf? Or just point me to a doc? Thanks - Jeff _____ -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From mbrookov at mines.edu Tue Dec 6 18:00:12 2005 From: mbrookov at mines.edu (Matt Brookover) Date: Tue, 06 Dec 2005 11:00:12 -0700 Subject: [Linux-cluster] custom fence agent In-Reply-To: References: Message-ID: <1133892012.4169.5.camel@merlin.Mines.EDU> I am currently using GFS 6.0, so there is no cluster.conf. I have included all three files, cluster.css, fence.css and nodes.css. [root at imagine CSM_ACN]# more * :::::::::::::: cluster.ccs :::::::::::::: cluster { name = "CSM_ACN" lock_gulm { servers = ["imagine.Mines.EDU","illuminate.Mines.EDU","illusion.Mines.EDU"] heartbeat_rate = 3.0 allowed_misses = 5 } } :::::::::::::: fence.ccs :::::::::::::: fence_devices { CSMACN_fence { agent = "fence_cisco" } } :::::::::::::: nodes.ccs :::::::::::::: nodes { imagine.Mines.EDU { ip_interfaces { eth0 = "138.67.130.1" } fence { snmpfence { CSMACN_fence { port="imagine" } } } } illuminate.Mines.EDU { ip_interfaces { eth0 = "138.67.130.2" } fence { snmpfence { CSMACN_fence { port="illuminate" } } } } illusion.Mines.EDU { ip_interfaces { eth0 = "138.67.130.3" } fence { snmpfence { CSMACN_fence { port="illusion" } } } } inspire.Mines.EDU { ip_interfaces { eth0 = "138.67.130.5" } fence { snmpfence { CSMACN_fence { port="inspire" } } } } inception.Mines.EDU { ip_interfaces { eth0 = "138.67.130.4" } fence { snmpfence { CSMACN_fence { port="inception" } } } } incantation.Mines.EDU { ip_interfaces { eth0 = "138.67.130.6" } fence { snmpfence { CSMACN_fence { port="incantation" } } } } } [root at imagine CSM_ACN]# On Tue, 2005-12-06 at 10:12, Jeff Dinisco wrote: > Matt, > > This script is great. I just finished hacking on it for my own > purposes and it's working well from the command line. Could you pass > along your fencing section from cluster.conf as well? Thanks a > million. > > - Jeff DiNisco > > ______________________________________________________________________ > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Matt Brookover > Sent: Wednesday, November 30, 2005 5:34 PM > To: linux clustering > Subject: Re: [Linux-cluster] custom fence agent > > > I took the fence_apc and hacked it to do what I needed. The fence > agents are perl scripts and can easily be modified to fit most any > SAN. > > Matt > > On Wed, 2005-11-30 at 13:58, Jeff Dinisco wrote: > > > Could someone outline the rules for creating your own fencing agent > > and how they're applied in cluster.conf? Or just point me to a > > doc? Thanks > > > > - Jeff > > > > > > > > ____________________________________________________________________ > > > > -- > > Linux-cluster mailing list > > Linux-cluster at redhat.com > > https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From canfield at uindy.edu Tue Dec 6 21:07:58 2005 From: canfield at uindy.edu (D Canfield) Date: Tue, 06 Dec 2005 16:07:58 -0500 Subject: [Linux-cluster] CLVM & Partition Mounting Message-ID: <4395FDAE.6020900@uindy.edu> I'm trying to build my first GFS cluster (2-node on a SAN) on RHEL4, and I can get things up and running manually, but I'm having some trouble getting the process to automate smoothly. 
The first issue is that after I install the lvm2-cluster RPM, I can no longer boot the machine cleanly because my /var/log partition is on a separate LVM VolumeGroup (It's still a standard ext3 partition, I just keep all my logs on a RAID10 array in a different area of the SAN for performance) and the presence of clvm library seems to prevent vgchange from running at boot time since clvmd isn't yet running. This part I'm assuming I'm just missing something obvious, but I have no idea what. The second issue is that GFS doesn't seem to allow an automatic way to actually mount the GFS partitions once clvmd is started. This is a bit of an issue since the partition I am going to want to mount in most cases is /home, and even if I put a mount line in /etc/rc.local, that means services like imap (this cluster) or samba (on the next one) will be up and trying to serve items out of the home directories before the directories exist. Sorry if I'm being brain dead on this, the fact that I couldn't any reference to it anywhere else suggests I probably am. Can anyone offer any hints? Thanks DC From pcaulfie at redhat.com Wed Dec 7 08:40:24 2005 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Wed, 07 Dec 2005 08:40:24 +0000 Subject: [Linux-cluster] CLVM & Partition Mounting In-Reply-To: <4395FDAE.6020900@uindy.edu> References: <4395FDAE.6020900@uindy.edu> Message-ID: <43969FF8.2000507@redhat.com> D Canfield wrote: > I'm trying to build my first GFS cluster (2-node on a SAN) on RHEL4, and > I can get things up and running manually, but I'm having some trouble > getting the process to automate smoothly. > > The first issue is that after I install the lvm2-cluster RPM, I can no > longer boot the machine cleanly because my /var/log partition is on a > separate LVM VolumeGroup (It's still a standard ext3 partition, I just > keep all my logs on a RAID10 array in a different area of the SAN for > performance) and the presence of clvm library seems to prevent vgchange > from running at boot time since clvmd isn't yet running. This part I'm > assuming I'm just missing something obvious, but I have no idea what. You need to mark cluster VGs as clustered (vgchange -cy) and non-clustered VGs as non-clustered (vgchange -cn). You can't have non-clustered LVs in a clustered VG (though it doesn't look like you're doing that). The activation for local VGs should then have the --ignorelockingfailure flag passed to the LVM commands, which should also only be activating the local VG) so it will carry on even if the cluster locking attempt fails. > The second issue is that GFS doesn't seem to allow an automatic way to > actually mount the GFS partitions once clvmd is started. This is a bit > of an issue since the partition I am going to want to mount in most > cases is /home, and even if I put a mount line in /etc/rc.local, that > means services like imap (this cluster) or samba (on the next one) will > be up and trying to serve items out of the home directories before the > directories exist. > > Sorry if I'm being brain dead on this, the fact that I couldn't any > reference to it anywhere else suggests I probably am. Can anyone offer > any hints? 
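Coming back to the first question, the clustered/non-clustered split described above comes down to something like this sketch; the volume group names are placeholders, not taken from this setup:

    # the VG carrying the shared GFS logical volumes needs clvmd
    vgchange -cy <shared_vg>
    # a VG used by only one node (e.g. the one holding /var/log)
    # should not be marked clustered
    vgchange -cn <local_vg>
    # boot-time activation of the local VG can then carry on even
    # though clvmd is not running yet
    vgchange -a y --ignorelockingfailure <local_vg>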
> -- patrick From dillo+cluster at seas.upenn.edu Tue Dec 6 03:42:38 2005 From: dillo+cluster at seas.upenn.edu (Bryan Cardillo) Date: Mon, 5 Dec 2005 22:42:38 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <4394DDF0.4080603@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> Message-ID: <20051206034238.GA3226@rover.pcbi.upenn.edu> On Mon, Dec 05, 2005 at 07:40:16PM -0500, Greg Forte wrote: > two (probably related) questions concerning fencing and APC AP7900 units: > > 1) fence_apc doesn't appear to be compatible with these units - when I run: > > sudo /sbin/fence_apc -a -l -p -n1 -T -v > > it comes back with: > > failed: unrecognised menu response > > The output file shows that it's getting as far as the "Outlet > Control/Configuration" menu, but never selects the specified port. > > This is on RHEL ES4 update 2 with fence-1.32.6-0 installed. > > Does anyone have this working with AP7900s, and if so did you have to > hack the fence_apc script or is there just something I'm missing? I'm in the process of testing the attached patch, basically just had to remove a portion of the match for the `Control Outlet' option. > 2) in the cluster configuration tool (GUI), there's no place to > specify the port to cycle for an "APC Power Device". I tried adding > "port=#" to the tags in the cluster.conf file, but the > cluster configuration tool didn't like that. And of course, I was > unable to test if this actually works anyway because of problem #1 :-( > > Anyway, assuming I get fence_apc to work, how do I specify ports in the > cluster configuration tool? or is this not supported? In which case > can I add the port option in the cluster.conf like I'm trying to do and > have it work? I have system-config-cluster-1.0.16-1.0 installed. here is the clusternode elem I'm using, with the port specified, and seems to work so far. as far as I know, this must be specified in the cluster.conf manually. hope this helps. Cheers, Bryan Cardillo Penn Bioinformatics Core University of Pennsylvania -------------- next part -------------- --- /sbin/fence_apc 2005-10-27 16:12:19.000000000 -0400 +++ fence_apc 2005-12-05 22:33:04.000000000 -0500 @@ -247,7 +247,7 @@ /--\s*Outlet Control.*(\d+)\s*-\s+Outlet\s+$opt_n\D[^\n]*\s(?-i:ON|OFF)\*?\s/ism || # Administrator Outlet Control menu - /--\s*Outlet $opt_n\D.*(\d+)\s*-\s*control outlet\s+$opt_n\D/ism + /--\s*Outlet $opt_n\D.*(\d+)\s*-\s*control outlet/ism ) { $t->print($1); next; From canfield at uindy.edu Wed Dec 7 15:01:05 2005 From: canfield at uindy.edu (D Canfield) Date: Wed, 07 Dec 2005 10:01:05 -0500 Subject: [Linux-cluster] CLVM & Partition Mounting In-Reply-To: <43969FF8.2000507@redhat.com> References: <4395FDAE.6020900@uindy.edu> <43969FF8.2000507@redhat.com> Message-ID: <4396F931.3000802@uindy.edu> Patrick Caulfield wrote: >D Canfield wrote: > > >>I'm trying to build my first GFS cluster (2-node on a SAN) on RHEL4, and >>I can get things up and running manually, but I'm having some trouble >>getting the process to automate smoothly. >> >>The first issue is that after I install the lvm2-cluster RPM, I can no >>longer boot the machine cleanly because my /var/log partition is on a >>separate LVM VolumeGroup (It's still a standard ext3 partition, I just >>keep all my logs on a RAID10 array in a different area of the SAN for >>performance) and the presence of clvm library seems to prevent vgchange >>from running at boot time since clvmd isn't yet running. 
This part I'm >>assuming I'm just missing something obvious, but I have no idea what. >> >> > >You need to mark cluster VGs as clustered (vgchange -cy) and non-clustered VGs >as non-clustered (vgchange -cn). You can't have non-clustered LVs in a >clustered VG (though it doesn't look like you're doing that). > >The activation for local VGs should then have the --ignorelockingfailure flag >passed to the LVM commands, which should also only be activating the local VG) >so it will carry on even if the cluster locking attempt fails. > > > I see that the ignorelockingfailure flag was already in the initscripts of RHEL4, and a bit more testing got me some different information. If I have lvm2-cluster installed, the process will error out to the maintenance shell when it tries to fsck my /var/log partition. If I look in /dev/mapper VolGroup01 has not been activated (though if I look higher up in the boot log, vgscan did see it). But from the maintenance shell, I can go ahead and run vgchange -a y --ignorelockingfailure (just like the rc.sysinit does 2-3 times by the time it gets to the fsck), and the VolGroup01 is activated just fine. If I remove the lvm2-cluster RPM, the machine boots up fine. Also, if I leave the lvm2-cluster RPM installed but change the mount options from "defaults 0 2" ro "defaults 0 0", it will skip the fsck, and by the time the machine is booted, the /var/log partition has indeed been mounted (I think it gets mounted after clvmd starts). I've checked that -c n is set on this local volumegroup, but that doesn't seem to make a difference. I've listed a few outputs below. Any other thoughts? Thanks much. # vgdisplay --- Volume group --- VG Name VolGroupMailGFS System ID Format lvm2 Metadata Areas 1 Metadata Sequence No 3 VG Access read/write VG Status resizable Clustered yes Shared no MAX LV 0 Cur LV 1 Open LV 0 Max PV 0 Cur PV 1 Act PV 1 VG Size 341.62 GB PE Size 16.00 MB Total PE 21864 Alloc PE / Size 21864 / 341.62 GB Free PE / Size 0 / 0 VG UUID ehOhtR-cYE8-xjls-Qle0-eT71-DmZO-p5ur6v --- Volume group --- VG Name VolGroup01 System ID Format lvm2 Metadata Areas 1 Metadata Sequence No 2 VG Access read/write VG Status resizable MAX LV 0 Cur LV 1 Open LV 1 Max PV 0 Cur PV 1 Act PV 1 VG Size 4.98 GB PE Size 16.00 MB Total PE 319 Alloc PE / Size 318 / 4.97 GB Free PE / Size 1 / 16.00 MB VG UUID 3Xuzas-tiX2-DgPG-71JH-dB2O-U1qH-SCdgGD --- Volume group --- VG Name VolGroup00 System ID Format lvm2 Metadata Areas 1 Metadata Sequence No 3 VG Access read/write VG Status resizable MAX LV 0 Cur LV 2 Open LV 2 Max PV 0 Cur PV 1 Act PV 1 VG Size 7.89 GB PE Size 16.00 MB Total PE 505 Alloc PE / Size 504 / 7.88 GB Free PE / Size 1 / 16.00 MB VG UUID cYiUzS-QlnZ-PF50-0kAO-kYL0-V3Yw-dXwBIe # pvdisplay --- Physical volume --- PV Name /dev/sdc VG Name VolGroupMailGFS PV Size 341.62 GB / not usable 0 Allocatable yes (but full) PE Size (KByte) 16384 Total PE 21864 Free PE 0 Allocated PE 21864 PV UUID NYWZVb-yKBl-o7dR-Xq9s-0z3A-VFS0-wxzwc1 --- Physical volume --- PV Name /dev/sdb1 VG Name VolGroup01 PV Size 4.98 GB / not usable 0 Allocatable yes PE Size (KByte) 16384 Total PE 319 Free PE 1 Allocated PE 318 PV UUID EFIqWw-SvP6-OWGV-u350-mwyx-5lJQ-29ksqz --- Physical volume --- PV Name /dev/sda2 VG Name VolGroup00 PV Size 7.89 GB / not usable 0 Allocatable yes PE Size (KByte) 16384 Total PE 505 Free PE 1 Allocated PE 504 PV UUID qR2QxR-KuPF-Wsvc-w0yv-d7rK-3NlY-wLRREb # lvdisplay --- Logical volume --- LV Name /dev/VolGroupMailGFS/LogVolHome VG Name VolGroupMailGFS LV UUID 
7bE2Zt-27A2-OHga-qFDI-QnNc-m21r-LUaXEm LV Write Access read/write LV Status NOT available LV Size 341.62 GB Current LE 21864 Segments 1 Allocation inherit Read ahead sectors 0 --- Logical volume --- LV Name /dev/VolGroup01/LogVolLogs VG Name VolGroup01 LV UUID 01rj7U-809c-jHmg-n6y7-md6Z-yYlF-NYMxCi LV Write Access read/write LV Status available # open 1 LV Size 4.97 GB Current LE 318 Segments 1 Allocation inherit Read ahead sectors 0 Block device 253:2 --- Logical volume --- LV Name /dev/VolGroup00/LogVolRoot VG Name VolGroup00 LV UUID YFunW2-SKSz-T6pZ-7Agf-AFvO-W411-bfX3Q1 LV Write Access read/write LV Status available # open 1 LV Size 6.88 GB Current LE 440 Segments 1 Allocation inherit Read ahead sectors 0 Block device 253:0 --- Logical volume --- LV Name /dev/VolGroup00/LogVolSwap VG Name VolGroup00 LV UUID uvuww5-PzDY-79pc-hxtk-33Rl-L2tI-Kp9IDb LV Write Access read/write LV Status available # open 1 LV Size 1.00 GB Current LE 64 Segments 1 Allocation inherit Read ahead sectors 0 Block device 253:1 From gforte at leopard.us.udel.edu Wed Dec 7 15:08:20 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Wed, 07 Dec 2005 10:08:20 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <4395BBBA.2090404@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> Message-ID: <4396FAE4.80006@leopard.us.udel.edu> Greg Forte wrote: > And it still doesn't appear to work ... I can turn the outlets on and > off from the command line, but if I down the interface on a node, the > other node reports that it's removing the "failed" node from the > cluster, and that it's fencing the "failed" node, but the "failed" node > never gets shut down. Does this get logged somewhere besides > /var/log/messages, or is there a way to force it to be more verbose? If > I could see what command fenced is actually invoking that might help ... Well, in case anyone is interested, I got fed up with having no decent logging from any of these components, so I finally used tcpdump to monitor the telnet connection between the non-failed node and the PDUs as it tried to fence them ... and it turns out that fence_apc was trying to turn each port ON twice, instead of OFF and then ON like it's supposed to according to my configuration. The fault apparently lies somewhere in ccsd or fenced, because the fence_apc script definitely responds properly to the on|off|reboot options, both on the command line and in the stdin like fenced uses. I changed my cluster.conf so that it uses 'reboot' instead of 'off' and 'on' (e.g. the old conf looked like this: and the new one looks like this: and increased the reboot wait time on the PDUs to make sure it'd wait long enough, and that SEEMS to work (once I remembered to turn off ccsd before updating my cluster.conf by hand so that it didn't end up replacing it with the old one immediately ;-) Of course, I can't bring up any of the per-node fencing configuration items in system-config-cluster anymore, but I think I mentioned that previously - when I set them up through the gui it put "switch=" options in each tag, and then when I shut down and restarted the gui it complained that the file was formatted improperly. I removed those options by hand, and then the gui worked again, but ever since the fencing info hasn't been available ... Any developers care to comment on any of this? I'm finding it really tough to believe that this is a supported RedHat "product". 
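For anyone trying to reproduce the observation above, the agent can be driven by hand over stdin the way fenced drives it, while watching the telnet session from a second shell. This is only a sketch with placeholder values, and it assumes the agent accepts the same attribute names that appear in cluster.conf:

    # hand the agent the name=value pairs fenced would pass it
    fence_apc <<EOF
    ipaddr=<pdu-address>
    login=<user>
    passwd=<password>
    port=1
    option=off
    EOF

    # in a second shell, watch what actually goes over the wire
    tcpdump -A -i eth0 host <pdu-address> and tcp port 23

Watching the wire during a real fence event is how the double "on" showed up here.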
-g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From eric at bootseg.com Wed Dec 7 16:16:29 2005 From: eric at bootseg.com (Eric Kerin) Date: Wed, 07 Dec 2005 11:16:29 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <4396FAE4.80006@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> Message-ID: <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> Greg, I'm using the fence_apc agent on my cluster with APC 7900s, and fencing is working perfectly for me, and has for more than 6 months now. Only thing I had to do was modify the fence_apc agent to allow for the renamed ports I setup (got rid of the outlet X names, and put in descriptive server names) and add in the port groups feature I'm using. One of these days I'll get a few spare minutes to whip up a correct patch to the agent that can be submitted to the tree. One that will work in both the "Outlet X" naming method, and the descriptive port method. My device entry inside of fence for a node looks like this: You can test that the cluster is configured correctly to fence a node by running "fence_node " This will use the cluster's config file to fence the node, ensuring that all config settings are correct. > once I remembered to turn off ccsd > before updating my cluster.conf by hand so that it didn't end up > replacing it with the old one immediately ;-) > When updating the cluster.conf file by hand, you are updating the config_version attribute of the cluster node, right? I do updates to my cluster.conf file by hand pretty much exclusively, while the cluster is running, and with no problems whatsoever. Changes propagate as expected after running "ccs_tool update "and "cman_tool version -r " Thanks, Eric Kerin eric at bootseg.com From gforte at leopard.us.udel.edu Wed Dec 7 19:32:37 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Wed, 07 Dec 2005 14:32:37 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> Message-ID: <439738D5.5000808@leopard.us.udel.edu> Eric Kerin wrote: > Greg, > > I'm using the fence_apc agent on my cluster with APC 7900s, and fencing > is working perfectly for me, and has for more than 6 months now. Thanks, Eric, but the fence_apc script is definitely not the issue - I had to make a couple of minor changes to fence_apc's regexps, and it now works both with command-line options and passing arguments through stdin. This doesn't explain why the cluster conf doesn't work when it has "off" and then "on" as set up by system-config-cluster (and it did that itself, all I did was configure the ip address and login for the fence devices, and tell it which ports to use), but it does work when I make the change to 'reboot' as described in my previous message (this is the default option, anyway, which I assume is why yours works with no "option=" option). > You can test that the cluster is configured correctly to fence a node by > running "fence_node " This will use the cluster's config file > to fence the node, ensuring that all config settings are correct. 
Actually, that doesn't seem to work for me - no matter what nodename I specify, and regardless of whether I run it on the node I'm trying to fence or the other node (it's a two-node cluster), it comes back with "Fence of 'hostname' was unsuccessful." I suspect this is because it's a two-node cluster so fenced doesn't want to let me kick out a node that's still active ... or maybe it's a just host name problem. Regardless, it _does_ work correctly if I simulate a real failure, after I made the aforementioned cluster.conf change, so I'm confident that I've got it configured correctly. My gripe is that (a) the gui tool can't seem to generate even the most simple conf correctly, and (b) there's apparently a bug in fenced where it passes an "option=on" to the fence_apc agent, when it clearly should be "option = off". Or else ccsd is misparsing the cluster.conf file. I don't see how else to explain that the conf file said "off", then "on", but the daemon did "on", "on". > When updating the cluster.conf file by hand, you are updating the > config_version attribute of the cluster node, right? I do updates to my > cluster.conf file by hand pretty much exclusively, while the cluster is > running, and with no problems whatsoever. Changes propagate as expected > after running "ccs_tool update "and "cman_tool > version -r " Hmmm ... nope, but I will do so in the future. ;-) Thanks. -g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From eric at bootseg.com Wed Dec 7 19:48:26 2005 From: eric at bootseg.com (Eric Kerin) Date: Wed, 07 Dec 2005 14:48:26 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <439738D5.5000808@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> Message-ID: <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> On Wed, 2005-12-07 at 14:32 -0500, Greg Forte wrote: > I suspect this is because it's > a two-node cluster so fenced doesn't want to let me kick out a node > that's still active ... or maybe it's a just host name problem. > Regardless, it _does_ work correctly if I simulate a real failure, after > I made the aforementioned cluster.conf change, so I'm confident that > I've got it configured correctly. It's most likely a host name problem, because I run a two node cluster, and I used fence_node while testing everything. If you post the relevant sections of your cluster.conf file (the clusternodes, and fencedevices sections are the important ones) We might be able to help you figure out why it's not working right though. But mainly, check that the names you use in the clusternode name attributes are resolvable on both nodes, and they resolve to the same IP address on both nodes. Thanks, Eric Kerin eric at bootseg.com > My gripe is that (a) the gui tool > can't seem to generate even the most simple conf correctly, and (b) > there's apparently a bug in fenced where it passes an "option=on" to the > fence_apc agent, when it clearly should be "option = off". Or else ccsd > is misparsing the cluster.conf file. I don't see how else to explain > that the conf file said "off", then "on", but the daemon did "on", "on". Hmm, I'll see if I can replicate this on my testing cluster. Although I don't think it's designed to work the way you're expecting it to from your config. 
Of course, I haven't played with multiple fence devices in my configs before, so I could be mistaken. Eric From teigland at redhat.com Wed Dec 7 19:54:26 2005 From: teigland at redhat.com (David Teigland) Date: Wed, 7 Dec 2005 13:54:26 -0600 Subject: [Linux-cluster] two fencing problems In-Reply-To: <439738D5.5000808@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> Message-ID: <20051207195426.GA29230@redhat.com> On Wed, Dec 07, 2005 at 02:32:37PM -0500, Greg Forte wrote: > there's apparently a bug in fenced where it passes an "option=on" to the > fence_apc agent, when it clearly should be "option = off". Or else ccsd > is misparsing the cluster.conf file. I don't see how else to explain > that the conf file said "off", then "on", but the daemon did "on", "on". This may be the bug https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=172401 Dave From gforte at leopard.us.udel.edu Wed Dec 7 20:15:25 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Wed, 07 Dec 2005 15:15:25 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <20051207195426.GA29230@redhat.com> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> <20051207195426.GA29230@redhat.com> Message-ID: <439742DD.4020403@leopard.us.udel.edu> David Teigland wrote: > On Wed, Dec 07, 2005 at 02:32:37PM -0500, Greg Forte wrote: > >>there's apparently a bug in fenced where it passes an "option=on" to the >>fence_apc agent, when it clearly should be "option = off". Or else ccsd >>is misparsing the cluster.conf file. I don't see how else to explain >>that the conf file said "off", then "on", but the daemon did "on", "on". > > > This may be the bug > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=172401 That certainly appears to be it! Thanks. Now I don't suppose there's one for system-config-cluster not being able to read the configuration file it just wrote after adding a fence method to a node ... I'm not finding one, but apparently my luck/skill with bugzilla is pretty poor. ;-) -g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From gforte at leopard.us.udel.edu Wed Dec 7 20:34:22 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Wed, 07 Dec 2005 15:34:22 -0500 Subject: [Linux-cluster] two fencing problems In-Reply-To: <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> Message-ID: <4397474E.7000705@leopard.us.udel.edu> Eric Kerin wrote: > But mainly, check that the names you use in the clusternode name > attributes are resolvable on both nodes, and they resolve to the same IP > address on both nodes. They do resolve, and to the same IP address. 
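For reference, the clusternodes and fencedevices sections being asked for above generally take the following shape when an APC power switch does the fencing. This is only an illustrative sketch - every node name, device name, address, login and outlet number below is a placeholder, not a value from either of the clusters in this thread:

   <clusternodes>
     <clusternode name="node1.example.com" votes="1">
       <fence>
         <method name="1">
           <!-- "apc" must match a fencedevice name below; port is the outlet feeding this node -->
           <!-- no option attribute: the agent's default action (reboot) is used, as discussed above -->
           <device name="apc" port="1"/>
         </method>
       </fence>
     </clusternode>
     <clusternode name="node2.example.com" votes="1">
       <fence>
         <method name="1">
           <device name="apc" port="2"/>
         </method>
       </fence>
     </clusternode>
   </clusternodes>
   <fencedevices>
     <!-- placeholder management address and credentials for the power switch -->
     <fencedevice name="apc" agent="fence_apc" ipaddr="10.0.0.50" login="apc" passwd="apc"/>
   </fencedevices>

The clusternode name attributes here are the same names that need to resolve to the same address on every node, as noted above.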
Interestingly, if I stop fenced on the "good" node and run it manually as 'fenced -D' to monitor the debugging output, and then run 'fence_node hostname', no activity shows up - but if I do my simulated failure on the "bad" node (drop the interface), then it starts spewing debugging output (though the fencing fails for some other unknown reason ... but killing that and restarting fenced properly fixes it). I kind of give up at this point - fencing now works, and I can always force a node by dropping its interface (or yanking the network cable) - it's dirty, but it works. -g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From gforte at leopard.us.udel.edu Wed Dec 7 20:42:17 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Wed, 07 Dec 2005 15:42:17 -0500 Subject: [Linux-cluster] failover domain ip address hidden? In-Reply-To: <4397474E.7000705@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> <4397474E.7000705@leopard.us.udel.edu> Message-ID: <43974929.8090106@leopard.us.udel.edu> Can anyone explain to me how failover ip addresses are bound to interfaces in the kernel, or why they don't seem to show up in 'ifconfig' output? I've got one configured and it worked like a charm first try (unlike my fencing setup, heh), I'm just confused as to why it doesn't appear in ifconfig. -g Greg Forte gforte at udel.edu IT - User Services University of Delaware 302-831-1982 Newark, DE From fajar at telkom.co.id Thu Dec 8 07:03:04 2005 From: fajar at telkom.co.id (Fajar A. Nugraha) Date: Thu, 08 Dec 2005 14:03:04 +0700 Subject: [Linux-cluster] failover domain ip address hidden? In-Reply-To: <43974929.8090106@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> <4397474E.7000705@leopard.us.udel.edu> <43974929.8090106@leopard.us.udel.edu> Message-ID: <4397DAA8.1010103@telkom.co.id> Greg Forte wrote: > Can anyone explain to me how failover ip addresses are bound to > interfaces in the kernel, or why they don't seem to show up in > 'ifconfig' output? I've got one configured and it worked like a charm > first try (unlike my fencing setup, heh), I'm just confused as to why > it doesn't appear in ifconfig. > try running "ip addr list" -- Fajar From lhh at redhat.com Thu Dec 8 15:18:06 2005 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 08 Dec 2005 10:18:06 -0500 Subject: [Linux-cluster] Fencing problems In-Reply-To: References: Message-ID: <1134055086.28864.8.camel@ayanami.boston.redhat.com> On Wed, 2005-11-30 at 15:22 +0200, Jari J. Taskinen wrote: > Hi there! > > > I'm running RHEL v3 and GFS-6.0.2.20-2 with it and having problems with manual > fencing. I'm planning to use other fencing methods, but will they work if > manual doesn't? By trying to fence a node (fence_node test3) I only get this > in /etc/log/messages: As long as you do not intend to run manual fencing in production, see below. Otherwise, disregard this email... 
> nodes.ccs > > nodes { > test1 { > ip_interfaces { > eth0 = "10.0.0.1" > } > fence { > human { > t1 { > ipaddr = "10.0.0.1" Should be 'nodename="test1"', not ipaddr=xxx I think. -- Lon From gforte at leopard.us.udel.edu Thu Dec 8 15:22:02 2005 From: gforte at leopard.us.udel.edu (Greg Forte) Date: Thu, 08 Dec 2005 10:22:02 -0500 Subject: [Linux-cluster] failover domain ip address hidden? In-Reply-To: <4397DAA8.1010103@telkom.co.id> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> <4397474E.7000705@leopard.us.udel.edu> <43974929.8090106@leopard.us.udel.edu> <4397DAA8.1010103@telkom.co.id> Message-ID: <43984F9A.10605@leopard.us.udel.edu> Interesting, thanks. I didn't know you _could_ set multiple addresses for an interface without using a separate label for each. I don't suppose there's a way to configure cman so that it _does_ use labels? It would seem a tad more convenient to have this show up in ifconfig, I can never remember the ip syntax. -g Fajar A. Nugraha wrote: > Greg Forte wrote: > >> Can anyone explain to me how failover ip addresses are bound to >> interfaces in the kernel, or why they don't seem to show up in >> 'ifconfig' output? I've got one configured and it worked like a charm >> first try (unlike my fencing setup, heh), I'm just confused as to why >> it doesn't appear in ifconfig. >> > try running "ip addr list" > From pcaulfie at redhat.com Thu Dec 8 15:36:39 2005 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Thu, 08 Dec 2005 15:36:39 +0000 Subject: [Linux-cluster] failover domain ip address hidden? In-Reply-To: <43984F9A.10605@leopard.us.udel.edu> References: <4394DDF0.4080603@leopard.us.udel.edu> <20051206034238.GA3226@rover.pcbi.upenn.edu> <4395BBBA.2090404@leopard.us.udel.edu> <4396FAE4.80006@leopard.us.udel.edu> <1133972189.3454.25.camel@auh5-0479.corp.jabil.org> <439738D5.5000808@leopard.us.udel.edu> <1133984906.5344.14.camel@auh5-0479.corp.jabil.org> <4397474E.7000705@leopard.us.udel.edu> <43974929.8090106@leopard.us.udel.edu> <4397DAA8.1010103@telkom.co.id> <43984F9A.10605@leopard.us.udel.edu> Message-ID: <43985307.4060803@redhat.com> Greg Forte wrote: > I don't > suppose there's a way to configure cman so that it _does_ use labels? No, it uses node names/addresses and it's not likely to change in a hurry either, Sorry. -- patrick From teigland at redhat.com Thu Dec 8 17:33:15 2005 From: teigland at redhat.com (David Teigland) Date: Thu, 8 Dec 2005 11:33:15 -0600 Subject: [Linux-cluster] manual fencing not working in RHEL4 branch In-Reply-To: <1c0e77670512051804l5cc38edfy1d2b87e8fd71cf2e@mail.gmail.com> References: <1c0e77670511281307i75bc26a4pc5bbcd3d152a8c8e@mail.gmail.com> <20051128212731.GK27662@redhat.com> <1c0e77670511291853i2603f61ayf2eae51903032ebd@mail.gmail.com> <20051130164839.GB23663@redhat.com> <1c0e77670512051804l5cc38edfy1d2b87e8fd71cf2e@mail.gmail.com> Message-ID: <20051208173315.GD10340@redhat.com> On Mon, Dec 05, 2005 at 07:04:38PM -0700, busy admin wrote: > What is the problem: > When running manual fencing and doing failover testing, my secondary > node takes over the service without waiting for a fence_ack_manual. > This all works perfectly with automatic fencing (ipmi, drac). We're going to try this here. Just to be clear, we expect: 1. 
A and B in cluster, in fence domain and running rgmanager 2. kill B 3. A should start fence_manual and print message in /var/log/messages 4. admin should run fence_ack_manual on A 5. services from B should fail over to A The problem is you're seeing 5 happen before 4. What version of the code are you using (cluster-1.01.00 ?) Thanks Dave From busyadmin at gmail.com Thu Dec 8 18:19:38 2005 From: busyadmin at gmail.com (busy admin) Date: Thu, 8 Dec 2005 11:19:38 -0700 Subject: [Linux-cluster] manual fencing not working in RHEL4 branch In-Reply-To: <20051208173315.GD10340@redhat.com> References: <1c0e77670511281307i75bc26a4pc5bbcd3d152a8c8e@mail.gmail.com> <20051128212731.GK27662@redhat.com> <1c0e77670511291853i2603f61ayf2eae51903032ebd@mail.gmail.com> <20051130164839.GB23663@redhat.com> <1c0e77670512051804l5cc38edfy1d2b87e8fd71cf2e@mail.gmail.com> <20051208173315.GD10340@redhat.com> Message-ID: <1c0e77670512081019g16971027l9ae856866ccbcc05@mail.gmail.com> David, We are using cluster-1.00 code. I didn't see modifications under 1.01 that would have an impact, but maybe I missed something. You are right, I see step 5 happen before step 4 (but remember, sometimes it works fine specially after I run with 'fenced -D'). And I have never seen any of these problems when I use IPMI or DRAC. BTW, for simplicity sake, I wasn't even running rgmanager. Just ccsd, cman and fenced. Thanks, Ken On 12/8/05, David Teigland wrote: > On Mon, Dec 05, 2005 at 07:04:38PM -0700, busy admin wrote: > > What is the problem: > > When running manual fencing and doing failover testing, my secondary > > node takes over the service without waiting for a fence_ack_manual. > > This all works perfectly with automatic fencing (ipmi, drac). > > We're going to try this here. Just to be clear, we expect: > > 1. A and B in cluster, in fence domain and running rgmanager > 2. kill B > 3. A should start fence_manual and print message in /var/log/messages > 4. admin should run fence_ack_manual on A > 5. services from B should fail over to A > > The problem is you're seeing 5 happen before 4. What version of the > code are you using (cluster-1.01.00 ?) > > Thanks > Dave > > From teigland at redhat.com Thu Dec 8 19:08:16 2005 From: teigland at redhat.com (David Teigland) Date: Thu, 8 Dec 2005 13:08:16 -0600 Subject: [Linux-cluster] manual fencing not working in RHEL4 branch In-Reply-To: <1c0e77670512081019g16971027l9ae856866ccbcc05@mail.gmail.com> References: <1c0e77670511281307i75bc26a4pc5bbcd3d152a8c8e@mail.gmail.com> <20051128212731.GK27662@redhat.com> <1c0e77670511291853i2603f61ayf2eae51903032ebd@mail.gmail.com> <20051130164839.GB23663@redhat.com> <1c0e77670512051804l5cc38edfy1d2b87e8fd71cf2e@mail.gmail.com> <20051208173315.GD10340@redhat.com> <1c0e77670512081019g16971027l9ae856866ccbcc05@mail.gmail.com> Message-ID: <20051208190816.GE10340@redhat.com> On Thu, Dec 08, 2005 at 11:19:38AM -0700, busy admin wrote: > David, > > We are using cluster-1.00 code. I didn't see modifications under 1.01 > that would have an impact, but maybe I missed something. > > You are right, I see step 5 happen before step 4 (but remember, > sometimes it works fine specially after I run with 'fenced -D'). And > I have never seen any of these problems when I use IPMI or DRAC. > > BTW, for simplicity sake, I wasn't even running rgmanager. Just ccsd, > cman and fenced. Then I'm confused; I thought we defined the problem as step 5 (services starting on A) happening before step 4 (admin running fence_ack_manual). With no step 5, what's the problem? 
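For context, the manual fencing being exercised in steps 3 and 4 is normally wired up in cluster.conf roughly as follows; the node and device names here are illustrative placeholders, not the poster's actual configuration:

   <clusternode name="nodeB" votes="1">
     <fence>
       <method name="1">
         <!-- fence_manual waits for nodeB to rejoin or for "fence_ack_manual -n nodeB" on the surviving node -->
         <device name="human" nodename="nodeB"/>
       </method>
     </fence>
   </clusternode>
   ...
   <fencedevices>
     <fencedevice name="human" agent="fence_manual"/>
   </fencedevices>

With an entry like this, recovery (step 5) is not expected to start until the acknowledgement in step 4 has been given, which is exactly the ordering being tested here.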
> On 12/8/05, David Teigland wrote: > > On Mon, Dec 05, 2005 at 07:04:38PM -0700, busy admin wrote: > > > What is the problem: > > > When running manual fencing and doing failover testing, my secondary > > > node takes over the service without waiting for a fence_ack_manual. > > > This all works perfectly with automatic fencing (ipmi, drac). > > > > We're going to try this here. Just to be clear, we expect: > > > > 1. A and B in cluster, in fence domain and running rgmanager > > 2. kill B > > 3. A should start fence_manual and print message in /var/log/messages > > 4. admin should run fence_ack_manual on A > > 5. services from B should fail over to A > > > > The problem is you're seeing 5 happen before 4. What version of the > > code are you using (cluster-1.01.00 ?) > > > > Thanks > > Dave > > > > From busyadmin at gmail.com Thu Dec 8 19:18:01 2005 From: busyadmin at gmail.com (busy admin) Date: Thu, 8 Dec 2005 12:18:01 -0700 Subject: [Linux-cluster] manual fencing not working in RHEL4 branch In-Reply-To: <20051208190816.GE10340@redhat.com> References: <1c0e77670511281307i75bc26a4pc5bbcd3d152a8c8e@mail.gmail.com> <20051128212731.GK27662@redhat.com> <1c0e77670511291853i2603f61ayf2eae51903032ebd@mail.gmail.com> <20051130164839.GB23663@redhat.com> <1c0e77670512051804l5cc38edfy1d2b87e8fd71cf2e@mail.gmail.com> <20051208173315.GD10340@redhat.com> <1c0e77670512081019g16971027l9ae856866ccbcc05@mail.gmail.com> <20051208190816.GE10340@redhat.com> Message-ID: <1c0e77670512081118o67e2b030mb877bbce4fa9a97c@mail.gmail.com> You are exactly right, step 5 happens before step 4. On 12/8/05, David Teigland wrote: > On Thu, Dec 08, 2005 at 11:19:38AM -0700, busy admin wrote: > > David, > > > > We are using cluster-1.00 code. I didn't see modifications under 1.01 > > that would have an impact, but maybe I missed something. > > > > You are right, I see step 5 happen before step 4 (but remember, > > sometimes it works fine specially after I run with 'fenced -D'). And > > I have never seen any of these problems when I use IPMI or DRAC. > > > > BTW, for simplicity sake, I wasn't even running rgmanager. Just ccsd, > > cman and fenced. > > Then I'm confused; I thought we defined the problem as step 5 (services > starting on A) happening before step 4 (admin running fence_ack_manual). > With no step 5, what's the problem? > > > > On 12/8/05, David Teigland wrote: > > > On Mon, Dec 05, 2005 at 07:04:38PM -0700, busy admin wrote: > > > > What is the problem: > > > > When running manual fencing and doing failover testing, my secondary > > > > node takes over the service without waiting for a fence_ack_manual. > > > > This all works perfectly with automatic fencing (ipmi, drac). > > > > > > We're going to try this here. Just to be clear, we expect: > > > > > > 1. A and B in cluster, in fence domain and running rgmanager > > > 2. kill B > > > 3. A should start fence_manual and print message in /var/log/messages > > > 4. admin should run fence_ack_manual on A > > > 5. services from B should fail over to A > > > > > > The problem is you're seeing 5 happen before 4. What version of the > > > code are you using (cluster-1.01.00 ?) > > > > > > Thanks > > > Dave > > > > > > > From jeff at jettis.com Thu Dec 8 22:01:50 2005 From: jeff at jettis.com (Jeff Dinisco) Date: Thu, 8 Dec 2005 14:01:50 -0800 Subject: [Linux-cluster] corrupted gfs filesystem Message-ID: I'm testing gfs 6.1 (lock dlm) in a 2 node cluster on FC4. I took both nodes out of the cluster manually, then added node01 back in. As expected, it fenced node02. 
Fencing was done by shutting down a network port on a switch so iscsi could not access the storage devices. However, the device files still existed. Just to see how the cluster would react, I started up ccsd, cman, and fenced on node02. It joined the cluster w/ out issue. Even though I knew iscsi was unable to get to the storage devices, I started the gfs init script which attempted to mount the filesystem. Looks like it trashed it. Output from gfs_fsck... # gfs_fsck /dev/iscsi/laxrifa01/lun0 Initializing fsck Buffer #150609096 (1 of 5) is neither GFS_METATYPE_RB nor GFS_METATYPE_RG. Resource group is corrupted. Unable to read in rgrp descriptor. Unable to fill in resource group information. Is this expected behavior or is it possible that I'm missing something in my configuration that allowed this to happen? Thanks. - Jeff -------------- next part -------------- An HTML attachment was scrubbed... URL: From elmar at pruesse.net Fri Dec 9 12:03:05 2005 From: elmar at pruesse.net (Elmar Pruesse) Date: Fri, 09 Dec 2005 13:03:05 +0100 Subject: [Linux-cluster] New (small) cluster; What filesystem? GFS? In-Reply-To: <439220EB.80901@pruesse.net> References: <439220EB.80901@pruesse.net> Message-ID: <43997279.40809@pruesse.net> Since I got no response from you guys, I guess I'm off topic. I apologize for that. Can you point me anywhere to ask my question? We have no one with experience in this area and I'm having a really hard time finding material to base a decision on. If I had the time, I'd just try them all, but as usual, I don't... regards, Elmar From jeff at jettis.com Fri Dec 9 14:33:57 2005 From: jeff at jettis.com (Jeff Dinisco) Date: Fri, 9 Dec 2005 06:33:57 -0800 Subject: [Linux-cluster] corrupted gfs filesystem Message-ID: Also, does the output from gfs_fsck indicate that the filesystem is beyond repair? If not, what steps could I take to fix it? _____ From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jeff Dinisco Sent: Thursday, December 08, 2005 5:02 PM To: linux-cluster at redhat.com Subject: [Linux-cluster] corrupted gfs filesystem I'm testing gfs 6.1 (lock dlm) in a 2 node cluster on FC4. I took both nodes out of the cluster manually, then added node01 back in. As expected, it fenced node02. Fencing was done by shutting down a network port on a switch so iscsi could not access the storage devices. However, the device files still existed. Just to see how the cluster would react, I started up ccsd, cman, and fenced on node02. It joined the cluster w/ out issue. Even though I knew iscsi was unable to get to the storage devices, I started the gfs init script which attempted to mount the filesystem. Looks like it trashed it. Output from gfs_fsck... # gfs_fsck /dev/iscsi/laxrifa01/lun0 Initializing fsck Buffer #150609096 (1 of 5) is neither GFS_METATYPE_RB nor GFS_METATYPE_RG. Resource group is corrupted. Unable to read in rgrp descriptor. Unable to fill in resource group information. Is this expected behavior or is it possible that I'm missing something in my configuration that allowed this to happen? Thanks. - Jeff -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.yarwood at juno.co.uk Fri Dec 9 15:44:45 2005 From: ben.yarwood at juno.co.uk (Ben Yarwood) Date: Fri, 9 Dec 2005 15:44:45 -0000 Subject: [Linux-cluster] Question about fencing using wti power switch Message-ID: <03af01c5fcd7$7adbf1d0$3964a8c0@WS076> I am running FC4, with gfs and clustering and am trying to test fencing. 
My cluster.conf file is shown at the bottom. Whenever I disable the network port for one of the cluster boxes I would expect the device to be fenced using the wti power switch; however, I just get the following messages in the log saying the device needs to be fenced manually. Dec 9 15:36:01 jrmedia-a fenced[2066]: fencing node "jrmedia-c" Dec 9 15:36:01 jrmedia-a fence_manual: Node jrmedia-c needs to be reset before recovery can procede. Waiting for jrmedia-c to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n jrmedia-c) I have tested that the device can be fenced by using fence_wti directly and it works correctly, power cycling the plug. Can someone tell me if I have made a mistake in the cluster.conf file, or how I can get more debugging information? Thanks Ben Ben Yarwood Technical Director Juno Records t - 020 7424 2804 m - 07930 922 333 e - ben.yarwood at juno.co.uk From teigland at redhat.com Fri Dec 9 16:07:55 2005 From: teigland at redhat.com (David Teigland) Date: Fri, 9 Dec 2005 10:07:55 -0600 Subject: [Linux-cluster] corrupted gfs filesystem In-Reply-To: References: Message-ID: <20051209160755.GA30517@redhat.com> On Thu, Dec 08, 2005 at 02:01:50PM -0800, Jeff Dinisco wrote: > I'm testing gfs 6.1 (lock dlm) in a 2 node cluster on FC4. I took both > nodes out of the cluster manually, then added node01 back in. As > expected, it fenced node02. Fencing was done by shutting down a network > port on a switch so iscsi could not access the storage devices. > However, the device files still existed. > > Just to see how the cluster would react, I started up ccsd, cman, and > fenced on node02. It joined the cluster w/ out issue. Even though I > knew iscsi was unable to get to the storage devices, I started the gfs > init script which attempted to mount the filesystem. Looks like it > trashed it. But node02 couldn't reach the storage, how could it trash it? If node02 _could_ reach the storage, it would have just mounted the fs normally. > Output from gfs_fsck... When and where did you run fsck? Not while either node had the fs mounted, I trust. Dave > > # gfs_fsck /dev/iscsi/laxrifa01/lun0 > Initializing fsck > Buffer #150609096 (1 of 5) is neither GFS_METATYPE_RB nor > GFS_METATYPE_RG. > Resource group is corrupted. > Unable to read in rgrp descriptor. > Unable to fill in resource group information. > > Is this expected behavior or is it possible that I'm missing something > in my configuration that allowed this to happen? Thanks. From jeff at jettis.com Fri Dec 9 16:20:52 2005 From: jeff at jettis.com (Jeff Dinisco) Date: Fri, 9 Dec 2005 08:20:52 -0800 Subject: [Linux-cluster] corrupted gfs filesystem Message-ID: nope, the fs was unmounted on both nodes. I ran it from node01 after I was unable to mount it and had to reboot the node because the mount command hung the system. The latest output from gfs_fsck... Initializing fsck fs_compute_bitstructs: # of blks in rgrp do not equal # of blks represented in bitmap. bi_start = 134230407 bi_len = 17 GFS_NBBY = 4 ri_data = 8 Unable to fill in resource group information. The only thing that has changed is I tried to mount it a 2nd time and again couldn't kill mount and was forced to reboot.
-----Original Message----- From: David Teigland [mailto:teigland at redhat.com] Sent: Friday, December 09, 2005 11:08 AM To: Jeff Dinisco Cc: linux-cluster at redhat.com Subject: Re: [Linux-cluster] corrupted gfs filesystem On Thu, Dec 08, 2005 at 02:01:50PM -0800, Jeff Dinisco wrote: > I'm testing gfs 6.1 (lock dlm) in a 2 node cluster on FC4. I took both > nodes out of the cluster manually, then added node01 back in. As > expected, it fenced node02. Fencing was done by shutting down a network > port on a switch so iscsi could not access the storage devices. > However, the device files still existed. > > Just to see how the cluster would react, I started up ccsd, cman, and > fenced on node02. It joined the cluster w/ out issue. Even though I > knew iscsi was unable to get to the storage devices, I started the gfs > init script which attempted to mount the filesystem. Looks like it > trashed it. But node02 couldn't reach the storage, how could it trash it? If node02 _could_ reach the storage, it would have just mounted the fs normally. > Output from gfs_fsck... When and where did you run fsck? Not while either node had the fs mounted I trust. Dave > > # gfs_fsck /dev/iscsi/laxrifa01/lun0 > Initializing fsck > Buffer #150609096 (1 of 5) is neither GFS_METATYPE_RB nor > GFS_METATYPE_RG. > Resource group is corrupted. > Unable to read in rgrp descriptor. > Unable to fill in resource group information. > > Is this expected behavior or is it possible that I'm missing something > in my configuration that allowed this to happen? Thanks. From thomsonr at ucalgary.ca Fri Dec 9 23:38:06 2005 From: thomsonr at ucalgary.ca (Ryan Thomson) Date: Fri, 9 Dec 2005 16:38:06 -0700 (MST) Subject: [Linux-cluster] rgmanager causing hard lock ups Message-ID: <50712.136.159.234.21.1134171486.squirrel@136.159.234.21> Hi List, I have an RHCS cluster with four nodes on RHEL4U2 using the RHN RPMs and GFS CVS (RHEL4) and LVM2 (clvmd) from source tarball (2.2.01.09). I'm seeing some rather disturbing behavior from my cluster. I can get all the nodes to join, fence each other properly, etc. I also have some services setup, mainly GFS mounts and NFS exports. However, now if I bring up the cluster and start rgmanager, the node that tries to start one or more of the services (I can't tell which service but I suspect the NFS export service) will hard lock with the caps lock and scroll lock lights blinking and the rest of the cluster is useless: services don't start and rgmanager won't stop or reload or do anything... on all the nodes. Also, I have all but one of my services set to NOT autostart, yet when I start rgmanager, they begin starting anyways... Here is my cluster.conf file, I suspect the problem is with my NFS export service as that is the only one I've changed since I started seeing this behavior:
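Purely for illustration, a GFS-mount-plus-NFS-export service of the kind described above is usually structured along these lines; every name, path, address and option below is a generic placeholder rather than the actual configuration in question:

   <rm>
     <failoverdomains>
       <failoverdomain name="nfs-domain" ordered="0" restricted="0">
         <failoverdomainnode name="node1" priority="1"/>
         <failoverdomainnode name="node2" priority="1"/>
       </failoverdomain>
     </failoverdomains>
     <resources>
       <!-- the shared GFS filesystem exported by the service -->
       <clusterfs name="gfsdata" fstype="gfs" device="/dev/vg00/gfslv" mountpoint="/export/data" force_unmount="0"/>
       <ip address="192.168.0.100" monitor_link="1"/>
     </resources>
     <service name="nfs-export" domain="nfs-domain" autostart="0">
       <clusterfs ref="gfsdata">
         <nfsexport name="exports">
           <nfsclient name="clients" target="*" options="rw"/>
         </nfsexport>
       </clusterfs>
       <ip ref="192.168.0.100"/>
     </service>
   </rm>

In a layout like this the nfsexport/nfsclient pair sits underneath the filesystem it exports, and autostart is set per service; checking both of those against the intended behavior is a reasonable first pass before chasing the hard lock itself.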