From lipson12 at yahoo.com  Sun Dec  4 11:38:01 2011
From: lipson12 at yahoo.com (Kaisar Ahmed Khan)
Date: Sun, 4 Dec 2011 03:38:01 -0800 (PST)
Subject: [Linux-cluster] fencing problem
Message-ID: <1322998681.56234.YahooMailClassic@web36502.mail.mud.yahoo.com>


Dear all,
?
fence_xvm -H station2.example.com when i am trying to fence the node it's showing 
?
request timeout.
[root at station1 ~]# fence_xvm -H station2.example.com -ddd -o null
Debugging threshold is now 3
-- args @ 0xbfa22738 --
? args->addr = 225.0.0.12
? args->domain = station2.example.com
? args->key_file = /etc/cluster/fence_xvm.key
? args->op = 0
? args->hash = 2
? args->auth = 2
? args->port = 1229
? args->ifindex = 0
? args->family = 2
? args->timeout = 30
? args->retr_time = 20
? args->flags = 0
? args->debug = 3
-- end args --
Reading in key file /etc/cluster/fence_xvm.key into 0xbfa216ec (4096 max size)
Actual key length = 4096 bytesSending to 225.0.0.12 via 127.0.0.1

Waiting for connection from XVM host daemon.
Timed out waiting for response
can anybody help me
?
Thanks
?

Md.Kaisar Ahmed Khan
?????
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111204/f002561c/attachment.htm>

From xubinbin2004 at gmail.com  Mon Dec  5 00:18:45 2011
From: xubinbin2004 at gmail.com (Bin)
Date: Sun, 4 Dec 2011 17:18:45 -0700
Subject: [Linux-cluster] Can I build a computer cluster based on RedHat
	Desktop edition?
Message-ID: <CAF8REyCHhenBm4rSTiXUL=kVGQy+qiwPpF_G-KmX056nu65EUA@mail.gmail.com>

I am a beginner -:) and want to build a PC clusters based on Redhat linux
for running my parallel codes. please help...

Thanks

-- 
Best regards,

Bin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111204/af7ab15d/attachment.htm>

From linux at alteeve.com  Mon Dec  5 01:30:49 2011
From: linux at alteeve.com (Digimer)
Date: Sun, 04 Dec 2011 20:30:49 -0500
Subject: [Linux-cluster] Can I build a computer cluster based on RedHat
 Desktop edition?
In-Reply-To: <CAF8REyCHhenBm4rSTiXUL=kVGQy+qiwPpF_G-KmX056nu65EUA@mail.gmail.com>
References: <CAF8REyCHhenBm4rSTiXUL=kVGQy+qiwPpF_G-KmX056nu65EUA@mail.gmail.com>
Message-ID: <4EDC1EC9.7020003@alteeve.com>

On 12/04/2011 07:18 PM, Bin wrote:
> I am a beginner -:) and want to build a PC clusters based on Redhat
> linux for running my parallel codes. please help...
> 
> Thanks

Performance clustering most often a per-application question, not one
that can be generalized too well. These tend to be pretty
distro-agnostic and generally rely on specialized tools running on nodes.

A classic example is a video render farm where a master node cuts up a
series of frames, hands them off to a node in the farm to render,
repeats for the various other parts of the movie, collects the finished
frame and stiches them together into a single movie. Similar concepts
can be applied to decryption, compilation and so on.

So, tell us what you are trying to do, specifically.

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From linux at alteeve.com  Mon Dec  5 01:36:16 2011
From: linux at alteeve.com (Digimer)
Date: Sun, 04 Dec 2011 20:36:16 -0500
Subject: [Linux-cluster] Can I build a computer cluster based on RedHat
 Desktop edition?
In-Reply-To: <1855549946-1323048837-cardhu_decombobulator_blackberry.rim.net-1003587105-@b11.c21.bise6.blackberry>
References: <1855549946-1323048837-cardhu_decombobulator_blackberry.rim.net-1003587105-@b11.c21.bise6.blackberry>
Message-ID: <4EDC2010.3020508@alteeve.com>

On 12/04/2011 08:33 PM, bin xu wrote:
> Thanks. I just want to run some MPI based computing codes.
>  Thanks
> 
> Bin
> ------Original Message------
> From: Digimer
> To: linux clustering
> Cc: Bin
> Subject: Re: [Linux-cluster] Can I build a computer cluster based on RedHat Desktop edition?
> Sent: Dec 4, 2011 6:30 PM
> 
> On 12/04/2011 07:18 PM, Bin wrote:
>> I am a beginner -:) and want to build a PC clusters based on Redhat
>> linux for running my parallel codes. please help...
>>
>> Thanks
> 
> Performance clustering most often a per-application question, not one
> that can be generalized too well. These tend to be pretty
> distro-agnostic and generally rely on specialized tools running on nodes.
> 
> A classic example is a video render farm where a master node cuts up a
> series of frames, hands them off to a node in the farm to render,
> repeats for the various other parts of the movie, collects the finished
> frame and stiches them together into a single movie. Similar concepts
> can be applied to decryption, compilation and so on.
> 
> So, tell us what you are trying to do, specifically.
> 

Please reply to the mailing list. Discussions like this can help other
people later when they're archived and searchable.

You will want to take a look at the OpenMPI project. I've not used it,
but it should give you what you need to get started.

http://www.open-mpi.org/

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From mgrac at redhat.com  Mon Dec  5 10:36:41 2011
From: mgrac at redhat.com (Marek Grac)
Date: Mon, 05 Dec 2011 11:36:41 +0100
Subject: [Linux-cluster] Fence_vmware_soap
In-Reply-To: <A64E914D72373B4287DF2222856FFF3512E66E67@ILBENEVS001.int.usc.local>
References: <A64E914D72373B4287DF2222856FFF3512E66E67@ILBENEVS001.int.usc.local>
Message-ID: <4EDC9EB9.6010309@redhat.com>

Hi,

On 11/28/2011 10:59 PM, Geovanis, Nicholas wrote:
> RH Cluster Services on RHEL 5.7, all nodes running in Vmware VMs:
> The only doc I can find on the fence_vmware_soap fencing agent is the
> script itself and the man page for it. There is no background info in
> either and no examples. I can get my Vcenter server to respond to a
> "list" subcommand but anything else receives "Failed: Unable to obtain
> correct plug status or plug is not available". Sadly, the "successful"
> list only retrieves exactly 100 entries from one of the 8 (Vmware)
> clusters running (same cluster every time, same 100 VMs and templates
> every time).

'list' option was not tested with that much entries but it's
functionality is not used anywhere yet. So it does not impact proper
function of fence agent.

'Plug is not available/...' in which format do you enter it. Proper one
is explained in manual page (/datacenter/vm/Discovered virtual
machine/myMachine) where myMachine is yours name. Alternatively you can
use option -U / uuid.

m,


From davegu1 at hotmail.com  Mon Dec  5 18:37:24 2011
From: davegu1 at hotmail.com (David F. Gutierrez)
Date: Mon, 5 Dec 2011 12:37:24 -0600
Subject: [Linux-cluster] Can I build a computer cluster based on RedHat
	Desktop edition?
In-Reply-To: <4EDC2010.3020508@alteeve.com>
References: <1855549946-1323048837-cardhu_decombobulator_blackberry.rim.net-1003587105-@b11.c21.bise6.blackberry>
	<4EDC2010.3020508@alteeve.com>
Message-ID: <SNT123-DS101EC750E1941DEAE7F46FAB50@phx.gbl>

to run mpi code it can be done and could costly.
Read these articles and do other searches in google for the same topic.

http://na-inet.jp/na/pccluster/fc5_x8664-en.html
http://www.webstreet.com/super_computer.htm
http://www.divms.uiowa.edu/~jni/HowTo/HowToBuildAClusterG.pdf
http://blizzard.rwic.und.edu/~nordlie/deuce/

Good luck

David


-----Original Message----- 
From: Digimer
Sent: Sunday, December 04, 2011 7:36 PM
To: linux clustering
Subject: Re: [Linux-cluster] Can I build a computer cluster based on RedHat 
Desktop edition?

On 12/04/2011 08:33 PM, bin xu wrote:
> Thanks. I just want to run some MPI based computing codes.
>  Thanks
>
> Bin
> ------Original Message------
> From: Digimer
> To: linux clustering
> Cc: Bin
> Subject: Re: [Linux-cluster] Can I build a computer cluster based on 
> RedHat Desktop edition?
> Sent: Dec 4, 2011 6:30 PM
>
> On 12/04/2011 07:18 PM, Bin wrote:
>> I am a beginner -:) and want to build a PC clusters based on Redhat
>> linux for running my parallel codes. please help...
>>
>> Thanks
>
> Performance clustering most often a per-application question, not one
> that can be generalized too well. These tend to be pretty
> distro-agnostic and generally rely on specialized tools running on nodes.
>
> A classic example is a video render farm where a master node cuts up a
> series of frames, hands them off to a node in the farm to render,
> repeats for the various other parts of the movie, collects the finished
> frame and stiches them together into a single movie. Similar concepts
> can be applied to decryption, compilation and so on.
>
> So, tell us what you are trying to do, specifically.
>

Please reply to the mailing list. Discussions like this can help other
people later when they're archived and searchable.

You will want to take a look at the OpenMPI project. I've not used it,
but it should give you what you need to get started.

http://www.open-mpi.org/

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster 


From rossnick-lists at cybercat.ca  Tue Dec  6 18:39:39 2011
From: rossnick-lists at cybercat.ca (Nicolas Ross)
Date: Tue, 6 Dec 2011 13:39:39 -0500
Subject: [Linux-cluster] Design question about VG / LV in a clustered
	environement
Message-ID: <6C60C76401934E22A74971C46A813064@versa>

Hi !

Since the last couple of months, we had a few problems with the maner we 
designed our clustered filesystem and we are planing to do a re-design of 
the filesystems and how they are used.

Our cluster is composed of 8 nodes, connected via fibre channel, to a raid 
enclosure where we have 6 pair of 1-tb drives in mirror, so 6 1tb physical 
volumes.

First of all, our services that are run from the cluster are running inside 
of directories. For exemple, a webserver for a given application is run from 
/CyberCat/WebServer/(...) That directory contains all executable (apache, 
php for exemple) and the related data, except for the databases. /CyberCat 
being a single GFS partition containing several other services.

This filesystem and another one like this containing services for some other 
clients occupy a single VG composed of 2 PV (total 2tb). The remaining (4) 
other PV are used in one 1tb VG each, and those VG contains only one LV that 
is used for databases servers.

For availibility reasons, we are planing of spliting the /CyberCat (and the 
other one like it) FS into several smaller filesystems, one for each 
service.

The reason being that in the event that we need to make a filesystem check, 
or any other unplaned reason, on any filesystem it won't affect other 
services.

So, now comes the question I have :

1. First of all, is this a bad idea ?

2. Is there any disadvantages of doing a single volume group composed of 
many physical volumes, enabling us to move the extents of a logical volume 
from one physical volume to another one, so that load is more balanced in 
the event we need it.

Thanks for the input. 


From jeff.sturm at eprize.com  Wed Dec  7 03:33:35 2011
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Wed, 7 Dec 2011 03:33:35 +0000
Subject: [Linux-cluster] Design question about VG / LV in a
	clustered	environement
In-Reply-To: <6C60C76401934E22A74971C46A813064@versa>
References: <6C60C76401934E22A74971C46A813064@versa>
Message-ID: <B1B9801C5CBC954680D0374CC4EEABA51178D668@MailNode2.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Nicolas Ross
> Sent: Tuesday, December 06, 2011 1:40 PM
>
> For availibility reasons, we are planing of spliting the /CyberCat (and the other one like
> it) FS into several smaller filesystems, one for each service.

[snip]

> 1. First of all, is this a bad idea ?

Right or wrong, that's how we do it.

Apart from availability, you can tune the fs appropriately depending on how you use it.  GFS2 dropped some tunables, I think, but you can still mount with "noatime" (assuming your application doesn't rely on atime) and tune some things like block size.  Some of our GFS filesystems are also read-only on certain nodes, so we take advantage of spectator mounts for those.

> 2. Is there any disadvantages of doing a single volume group composed of many
> physical volumes, enabling us to move the extents of a logical volume from one
> physical volume to another one, so that load is more balanced in the event we need it.

Can't say, really.  We ditched CLVM but kept GFS.  It felt like CLVM had too many limitations to make it worthwhile.  It was straightforward to just export a LUN from our SAN for each file system, and that allows us to take advantage of the SAN's native snapshot facility.

-Jeff


From linux at alteeve.com  Wed Dec  7 04:45:32 2011
From: linux at alteeve.com (Digimer)
Date: Tue, 06 Dec 2011 23:45:32 -0500
Subject: [Linux-cluster] cluster 3.1.8 released
Message-ID: <4EDEEF6C.2040002@alteeve.com>

Welcome to the cluster 3.1.8 release.

This release addresses several bugs and includes a patch to improve RRP
configuration handling. DLM+SCTP (kernel counterpart of RRP) is still
under testing, feedback is always appreciated.

The new source tarball can be downloaded here:

https://fedorahosted.org/releases/c/l/cluster/cluster-3.1.8.tar.xz

ChangeLog:

https://fedorahosted.org/releases/c/l/cluster/Changelog-3.1.8

To report bugs or issues:

   https://bugzilla.redhat.com/

Would you like to meet the cluster team or members of its community?

   Join us on IRC (irc.freenode.net #linux-cluster) and share your
   experience  with other sysadministrators or power users.

Thanks/congratulations to all people that contributed to achieve this
great milestone.

Happy clustering,
Digimer


From Nicholas.Geovanis at uscellular.com  Wed Dec  7 17:22:54 2011
From: Nicholas.Geovanis at uscellular.com (Geovanis, Nicholas)
Date: Wed, 7 Dec 2011 11:22:54 -0600
Subject: [Linux-cluster] Design question about VG / LV in a clustered
	environment
In-Reply-To: <mailman.45.1323277204.7959.linux-cluster@redhat.com>
References: <mailman.45.1323277204.7959.linux-cluster@redhat.com>
Message-ID: <A64E914D72373B4287DF2222856FFF351309A6C8@ILBENEVS001.int.usc.local>


Jeff Sturme wrote:

>> We ditched CLVM but kept GFS.  It felt like CLVM had too many limitations to make it worthwhile. 

Would you elaborate on this for me please? I understand the "damn, forgot to start clvmd on that node...." type of annoyance, but what were your burning issues? I'm not convinced that there's a performance drawback which is specifically clvmd-related, but maybe I'm na?ve. Thanks....Nick G

Nick Geovanis
US Cellular/Kforce Inc
e. Nicholas.Geovanis at uscellular.com

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of linux-cluster-request at redhat.com
Sent: Wednesday, December 07, 2011 11:00 AM
To: linux-cluster at redhat.com
Subject: Linux-cluster Digest, Vol 92, Issue 4

Send Linux-cluster mailing list submissions to
	linux-cluster at redhat.com

To subscribe or unsubscribe via the World Wide Web, visit
	https://www.redhat.com/mailman/listinfo/linux-cluster
or, via email, send a message with subject or body 'help' to
	linux-cluster-request at redhat.com

You can reach the person managing the list at
	linux-cluster-owner at redhat.com

When replying, please edit your Subject line so it is more specific than "Re: Contents of Linux-cluster digest..."


Today's Topics:

   1. Design question about VG / LV in a clustered	environement
      (Nicolas Ross)
   2. Re: Design question about VG / LV in a	clustered	environement
      (Jeff Sturm)
   3. cluster 3.1.8 released (Digimer)


----------------------------------------------------------------------

Message: 1
Date: Tue, 6 Dec 2011 13:39:39 -0500
From: "Nicolas Ross" <rossnick-lists at cybercat.ca>
To: "linux clustering" <linux-cluster at redhat.com>
Subject: [Linux-cluster] Design question about VG / LV in a clustered
	environement
Message-ID: <6C60C76401934E22A74971C46A813064 at versa>
Content-Type: text/plain; format=flowed; charset="iso-8859-1";
	reply-type=original

Hi !

Since the last couple of months, we had a few problems with the maner we designed our clustered filesystem and we are planing to do a re-design of the filesystems and how they are used.

Our cluster is composed of 8 nodes, connected via fibre channel, to a raid enclosure where we have 6 pair of 1-tb drives in mirror, so 6 1tb physical volumes.

First of all, our services that are run from the cluster are running inside of directories. For exemple, a webserver for a given application is run from
/CyberCat/WebServer/(...) That directory contains all executable (apache, php for exemple) and the related data, except for the databases. /CyberCat being a single GFS partition containing several other services.

This filesystem and another one like this containing services for some other clients occupy a single VG composed of 2 PV (total 2tb). The remaining (4) other PV are used in one 1tb VG each, and those VG contains only one LV that is used for databases servers.

For availibility reasons, we are planing of spliting the /CyberCat (and the other one like it) FS into several smaller filesystems, one for each service.

The reason being that in the event that we need to make a filesystem check, or any other unplaned reason, on any filesystem it won't affect other services.

So, now comes the question I have :

1. First of all, is this a bad idea ?

2. Is there any disadvantages of doing a single volume group composed of many physical volumes, enabling us to move the extents of a logical volume from one physical volume to another one, so that load is more balanced in the event we need it.

Thanks for the input. 


------------------------------

Message: 2
Date: Wed, 7 Dec 2011 03:33:35 +0000
From: Jeff Sturm <jeff.sturm at eprize.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] Design question about VG / LV in a
	clustered	environement
Message-ID:
	<B1B9801C5CBC954680D0374CC4EEABA51178D668 at MailNode2.eprize.local>
Content-Type: text/plain; charset="us-ascii"

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Nicolas Ross
> Sent: Tuesday, December 06, 2011 1:40 PM
>
> For availibility reasons, we are planing of spliting the /CyberCat 
> (and the other one like
> it) FS into several smaller filesystems, one for each service.

[snip]

> 1. First of all, is this a bad idea ?

Right or wrong, that's how we do it.

Apart from availability, you can tune the fs appropriately depending on how you use it.  GFS2 dropped some tunables, I think, but you can still mount with "noatime" (assuming your application doesn't rely on atime) and tune some things like block size.  Some of our GFS filesystems are also read-only on certain nodes, so we take advantage of spectator mounts for those.

> 2. Is there any disadvantages of doing a single volume group composed 
> of many physical volumes, enabling us to move the extents of a logical 
> volume from one physical volume to another one, so that load is more balanced in the event we need it.

Can't say, really.  We ditched CLVM but kept GFS.  It felt like CLVM had too many limitations to make it worthwhile.  It was straightforward to just export a LUN from our SAN for each file system, and that allows us to take advantage of the SAN's native snapshot facility.

-Jeff


------------------------------

Message: 3
Date: Tue, 06 Dec 2011 23:45:32 -0500
From: Digimer <linux at alteeve.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: [Linux-cluster] cluster 3.1.8 released
Message-ID: <4EDEEF6C.2040002 at alteeve.com>
Content-Type: text/plain; charset=ISO-8859-1

Welcome to the cluster 3.1.8 release.

This release addresses several bugs and includes a patch to improve RRP configuration handling. DLM+SCTP (kernel counterpart of RRP) is still under testing, feedback is always appreciated.

The new source tarball can be downloaded here:

https://fedorahosted.org/releases/c/l/cluster/cluster-3.1.8.tar.xz

ChangeLog:

https://fedorahosted.org/releases/c/l/cluster/Changelog-3.1.8

To report bugs or issues:

   https://bugzilla.redhat.com/

Would you like to meet the cluster team or members of its community?

   Join us on IRC (irc.freenode.net #linux-cluster) and share your
   experience  with other sysadministrators or power users.

Thanks/congratulations to all people that contributed to achieve this great milestone.

Happy clustering,
Digimer


------------------------------

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

End of Linux-cluster Digest, Vol 92, Issue 4
********************************************


From jeff.sturm at eprize.com  Thu Dec  8 19:48:27 2011
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Thu, 8 Dec 2011 19:48:27 +0000
Subject: [Linux-cluster] Design question about VG / LV in a
	clustered	environment
In-Reply-To: <A64E914D72373B4287DF2222856FFF351309A6C8@ILBENEVS001.int.usc.local>
References: <mailman.45.1323277204.7959.linux-cluster@redhat.com>
	<A64E914D72373B4287DF2222856FFF351309A6C8@ILBENEVS001.int.usc.local>
Message-ID: <B1B9801C5CBC954680D0374CC4EEABA51179387B@MailNode2.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Geovanis, Nicholas
> Sent: Wednesday, December 07, 2011 12:23 PM
> >> We ditched CLVM but kept GFS.  It felt like CLVM had too many limitations to make
> it worthwhile.
> 
> Would you elaborate on this for me please? I understand the "damn, forgot to start
> clvmd on that node...." type of annoyance, but what were your burning issues? I'm not
> convinced that there's a performance drawback which is specifically clvmd-related, but
> maybe I'm na?ve. Thanks....Nick G

At the time there was no snapshot support.  That was the big missing feature for us.

We also tried using pvmove, and had problems with it.  It was very slow, and eventually stopped altogether complaining about a lock.  I tried activating LV's exclusively and it didn't help.  Later I found that that in drastic situations, I could remove the "clustered" bit temporarily, make my changes, then revert to a clustered volume (recognizing this is dangerous when the volume is shared).

At times, running simple commands like "lvs" became very slow, or stopped completely.  Restarting the node would clear it up.  When this occurred, the cluster would otherwise appear normal.

To be fair, it was a few years ago when we were evaluating this, on 5.2 I think.  It's likely some of the bugs have been worked out.   We didn't have a lot of motivation to work through them as long as we could fall back on the SAN for the functionality we needed.

Red Hat has a tendency I think to release features just a little before they are ready (sometimes with caveats, like the GFS2 preview release).  This is good for users who are evaluating the technology.  For production however, we need stability above all else.  Since about 5.3, GFS has worked very well for us.  Based on our experience with early 5.x releases, I'm not in any hurry to move to 6.x.

-Jeff


From matthew.painter at kusiri.com  Sat Dec 10 20:32:05 2011
From: matthew.painter at kusiri.com (Matthew Painter)
Date: Sat, 10 Dec 2011 20:32:05 +0000
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
Message-ID: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>

Hi all,

We are trying to get to the bottom of some odd intermittent behavior on a
cluster. We are intermittently seeing nodes leave and rejoin clusters,
without being fenced. Further the gap between leaving on re-joining is 8
minutes. We are monitoring the latency between boxes, and it is acceptable
(<5ms).

How can nodes exhibit this behavior? There seem to be no impact on the
services running on the box, just this leaving and re-joining. The SNMP
messages are below.

All help decoding this gratefully received! :)

Thanks,

Matt


Sat Dec 10 15:22:00 GMT 2011: cluster3.localdomain
DISMAN-EVENT-MIB::sysUpTimeInstance
= 3:2:52:23.35, SNMPv2-MIB::snmpTrapOID.0 =
COROSYNC-MIB::corosyncNoticesNodeStatus,
COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain",
COROSYNC-MIB::corosyncObjectsNodeID.0 = 1,
COROSYNC-MIB::corosyncObjectsNodeAddress.0
= "10.79.202.1", COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "left"

Sat Dec 10 15:30:25 GMT 2011: cluster3.localdomain
DISMAN-EVENT-MIB::sysUpTimeInstance
= 3:3:00:48.75, SNMPv2-MIB::snmpTrapOID.0 =
COROSYNC-MIB::corosyncNoticesNodeStatus,
COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain",
COROSYNC-MIB::corosyncObjectsNodeID.0 = 1,
COROSYNC-MIB::corosyncObjectsNodeAddress.0
= "10.79.202.1", COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "joined"
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111210/578c078a/attachment.htm>

From linux at alteeve.com  Sat Dec 10 20:55:38 2011
From: linux at alteeve.com (Digimer)
Date: Sat, 10 Dec 2011 15:55:38 -0500
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
In-Reply-To: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
References: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
Message-ID: <4EE3C74A.3000206@alteeve.com>

On 12/10/2011 03:32 PM, Matthew Painter wrote:
> Hi all,
> 
> We are trying to get to the bottom of some odd intermittent behavior on
> a cluster. We are intermittently seeing nodes leave and rejoin clusters,
> without being fenced. Further the gap between leaving on re-joining is 8
> minutes. We are monitoring the latency between boxes, and it is
> acceptable (<5ms).
> 
> How can nodes exhibit this behavior? There seem to be no impact on the
> services running on the box, just this leaving and re-joining. The SNMP
> messages are below.
> 
> All help decoding this gratefully received! :)
> 
> Thanks,
> 
> Matt
> 
> 
> Sat Dec 10 15:22:00 GMT 2011: cluster3.localdomain
> DISMAN-EVENT-MIB::sysUpTimeInstance = 3:2:52:23.35,
> SNMPv2-MIB::snmpTrapOID.0 = COROSYNC-MIB::corosyncNoticesNodeStatus,
> COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain",
> COROSYNC-MIB::corosyncObjectsNodeID.0 = 1,
> COROSYNC-MIB::corosyncObjectsNodeAddress.0 = "10.79.202.1",
> COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "left"
> 
> Sat Dec 10 15:30:25 GMT 2011: cluster3.localdomain
> DISMAN-EVENT-MIB::sysUpTimeInstance = 3:3:00:48.75,
> SNMPv2-MIB::snmpTrapOID.0 = COROSYNC-MIB::corosyncNoticesNodeStatus,
> COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain",
> COROSYNC-MIB::corosyncObjectsNodeID.0 = 1,
> COROSYNC-MIB::corosyncObjectsNodeAddress.0 = "10.79.202.1",
> COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "joined"

My first instinct is to point to multicast issues in your switch, but
then, I'd expect the node to get fenced. That said, any unexpected
disconnect should fire a fence, so it would seem like the node is
cleanly stopping/restarting corosync.

Can you share your configuration and, ideally, anything in syslog from
all involved nodes starting from just before the disconnect and
continuing through to after the node rejoins?

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From matthew.painter at kusiri.com  Sat Dec 10 22:00:12 2011
From: matthew.painter at kusiri.com (Matthew Painter)
Date: Sat, 10 Dec 2011 22:00:12 +0000
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
In-Reply-To: <4EE3C74A.3000206@alteeve.com>
References: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
	<4EE3C74A.3000206@alteeve.com>
Message-ID: <CALj8VczAysjYLke3WSictTGNGAHRUy_=a6nWawgRJyKO-0ajbQ@mail.gmail.com>

The switch was our first thought, but that has been swapped, and while we
are not having nodes fenced anymore (we were daily), this anomoly remains.

I will ask for those logs and conf on Monday.

I think it might be worth reinstalling corosync on this box anyway? Can't
be healthy if it is exiting unclearly. I have has reports of the rgmanager
dying on this box. (pid file but not running) Could that be related?

Thanks :)

On Saturday, December 10, 2011, Digimer <linux at alteeve.com> wrote:
> On 12/10/2011 03:32 PM, Matthew Painter wrote:
>> Hi all,
>>
>> We are trying to get to the bottom of some odd intermittent behavior on
>> a cluster. We are intermittently seeing nodes leave and rejoin clusters,
>> without being fenced. Further the gap between leaving on re-joining is 8
>> minutes. We are monitoring the latency between boxes, and it is
>> acceptable (<5ms).
>>
>> How can nodes exhibit this behavior? There seem to be no impact on the
>> services running on the box, just this leaving and re-joining. The SNMP
>> messages are below.
>>
>> All help decoding this gratefully received! :)
>>
>> Thanks,
>>
>> Matt
>>
>>
>> Sat Dec 10 15:22:00 GMT 2011: cluster3.localdomain
>> DISMAN-EVENT-MIB::sysUpTimeInstance = 3:2:52:23.35,
>> SNMPv2-MIB::snmpTrapOID.0 = COROSYNC-MIB::corosyncNoticesNodeStatus,
>> COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain",
>> COROSYNC-MIB::corosyncObjectsNodeID.0 = 1,
>> COROSYNC-MIB::corosyncObjectsNodeAddress.0 = "10.79.202.1",
>> COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "left"
>>
>> Sat Dec 10 15:30:25 GMT 2011: cluster3.localdomain
>> DISMAN-EVENT-MIB::sysUpTimeInstance = 3:3:00:48.75,
>> SNMPv2-MIB::snmpTrapOID.0 = COROSYNC-MIB::corosyncNoticesNodeStatus,
>> COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain",
>> COROSYNC-MIB::corosyncObjectsNodeID.0 = 1,
>> COROSYNC-MIB::corosyncObjectsNodeAddress.0 = "10.79.202.1",
>> COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "joined"
>
> My first instinct is to point to multicast issues in your switch, but
> then, I'd expect the node to get fenced. That said, any unexpected
> disconnect should fire a fence, so it would seem like the node is
> cleanly stopping/restarting corosync.
>
> Can you share your configuration and, ideally, anything in syslog from
> all involved nodes starting from just before the disconnect and
> continuing through to after the node rejoins?
>
> --
> Digimer
> E-Mail:              digimer at alteeve.com
> Freenode handle:     digimer
> Papers and Projects: http://alteeve.com
> Node Assassin:       http://nodeassassin.org
> "omg my singularity battery is dead again.
> stupid hawking radiation." - epitron
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111210/13776416/attachment.htm>

From linux at alteeve.com  Sat Dec 10 22:22:55 2011
From: linux at alteeve.com (Digimer)
Date: Sat, 10 Dec 2011 17:22:55 -0500
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
In-Reply-To: <CALj8VczAysjYLke3WSictTGNGAHRUy_=a6nWawgRJyKO-0ajbQ@mail.gmail.com>
References: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
	<4EE3C74A.3000206@alteeve.com>
	<CALj8VczAysjYLke3WSictTGNGAHRUy_=a6nWawgRJyKO-0ajbQ@mail.gmail.com>
Message-ID: <4EE3DBBF.9080006@alteeve.com>

On 12/10/2011 05:00 PM, Matthew Painter wrote:
> The switch was our first thought, but that has been swapped, and while
> we are not having nodes fenced anymore (we were daily), this anomoly
> remains.
> 
> I will ask for those logs and conf on Monday.
> 
> I think it might be worth reinstalling corosync on this box anyway?
> Can't be healthy if it is exiting unclearly. I have has reports of the
> rgmanager dying on this box. (pid file but not running) Could that be
> related?
> 
> Thanks :)

It's impossible to say without knowing your configuration. Please share
the cluster.conf (only obfuscate passwords, please) along with the log
files. The more detail, the better. Versions, distros, network config, etc.

Uninstalling corosync is not likely help. RGManager is something fairly
high up in the stack, so it's not likely the cause either.

Did you configure the timeouts to be very high, by chance? I'm finding
it difficult to fathom how the node can withdraw without being fenced,
short of cleanly stopping the cluster stack. I suspect there is
something important not being said, which the configuration information,
versions and logs will hopefully expose.

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From Mdukhan at nds.com  Sun Dec 11 07:16:49 2011
From: Mdukhan at nds.com (Dukhan, Meir)
Date: Sun, 11 Dec 2011 09:16:49 +0200
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
In-Reply-To: <4EE3DBBF.9080006@alteeve.com>
References: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
	<4EE3C74A.3000206@alteeve.com>
	<CALj8VczAysjYLke3WSictTGNGAHRUy_=a6nWawgRJyKO-0ajbQ@mail.gmail.com>
	<4EE3DBBF.9080006@alteeve.com>
Message-ID: <6DAE69EA69F39E4B9DA073B8C848A27C60E82DE826@ILMA1.IL.NDS.COM>


Are your nodes time synced and how?

We ran into problems of nodes being fenced because NTP problem.

The solution (AFAIR, from the Redhat knowledge base) was to start ntpd _before_ cman.
I'm not sure but there could be an update of openais or ntpd re this issue.

For those of you who have RedHat account, see the RedHat KB article:

        Does cman need to have the time of nodes in sync?
        https://access.redhat.com/kb/docs/DOC-42471

Hope this help,

Regards,
-- Meir R. Dukhan

|-----Original Message-----
|From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-
|bounces at redhat.com] On Behalf Of Digimer
|Sent: Sunday, December 11, 2011 0:23 AM
|To: Matthew Painter
|Cc: linux clustering
|Subject: Re: [Linux-cluster] Nodes leaving and re-joining intermittently
|
|On 12/10/2011 05:00 PM, Matthew Painter wrote:
|> The switch was our first thought, but that has been swapped, and while
|> we are not having nodes fenced anymore (we were daily), this anomoly
|> remains.
|>
|> I will ask for those logs and conf on Monday.
|>
|> I think it might be worth reinstalling corosync on this box anyway?
|> Can't be healthy if it is exiting unclearly. I have has reports of the
|> rgmanager dying on this box. (pid file but not running) Could that be
|> related?
|>
|> Thanks :)
|
|It's impossible to say without knowing your configuration. Please share the
|cluster.conf (only obfuscate passwords, please) along with the log files.
|The more detail, the better. Versions, distros, network config, etc.
|
|Uninstalling corosync is not likely help. RGManager is something fairly
|high up in the stack, so it's not likely the cause either.
|
|Did you configure the timeouts to be very high, by chance? I'm finding it
|difficult to fathom how the node can withdraw without being fenced, short
|of cleanly stopping the cluster stack. I suspect there is something
|important not being said, which the configuration information, versions and
|logs will hopefully expose.
|
|--
|Digimer
|E-Mail:              digimer at alteeve.com
|Freenode handle:     digimer
|Papers and Projects: http://alteeve.com
|Node Assassin:       http://nodeassassin.org
|"omg my singularity battery is dead again.
|stupid hawking radiation." - epitron
|
|--
|Linux-cluster mailing list
|Linux-cluster at redhat.com
|https://www.redhat.com/mailman/listinfo/linux-cluster

This message is confidential and intended only for the addressee. If you have received this message in error, please immediately notify the postmaster at nds.com and delete it from your system as well as any copies. The content of e-mails as well as traffic data may be monitored by NDS for employment and security purposes.
To protect the environment please do not print this e-mail unless necessary.

An NDS Group Limited company. www.nds.com


From matthew.painter at kusiri.com  Sun Dec 11 11:12:51 2011
From: matthew.painter at kusiri.com (Matthew Painter)
Date: Sun, 11 Dec 2011 11:12:51 +0000
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
In-Reply-To: <6DAE69EA69F39E4B9DA073B8C848A27C60E82DE826@ILMA1.IL.NDS.COM>
References: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
	<4EE3C74A.3000206@alteeve.com>
	<CALj8VczAysjYLke3WSictTGNGAHRUy_=a6nWawgRJyKO-0ajbQ@mail.gmail.com>
	<4EE3DBBF.9080006@alteeve.com>
	<6DAE69EA69F39E4B9DA073B8C848A27C60E82DE826@ILMA1.IL.NDS.COM>
Message-ID: <CALj8VczAUdXbkaqhe5h3nv_rWFBtmBK9LqYhQcQnOrx2ux-GnQ@mail.gmail.com>

Thank you for your input :)

The nodes are syncd using NTP. Although I am unsure about the respective
run levels.

I will look into this, thank you.

On Sun, Dec 11, 2011 at 7:16 AM, Dukhan, Meir <Mdukhan at nds.com> wrote:

>
> Are your nodes time synced and how?
>
> We ran into problems of nodes being fenced because NTP problem.
>
> The solution (AFAIR, from the Redhat knowledge base) was to start ntpd
> _before_ cman.
> I'm not sure but there could be an update of openais or ntpd re this issue.
>
> For those of you who have RedHat account, see the RedHat KB article:
>
>        Does cman need to have the time of nodes in sync?
>        https://access.redhat.com/kb/docs/DOC-42471
>
> Hope this help,
>
> Regards,
> -- Meir R. Dukhan
>
> |-----Original Message-----
> |From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-
> |bounces at redhat.com] On Behalf Of Digimer
> |Sent: Sunday, December 11, 2011 0:23 AM
> |To: Matthew Painter
> |Cc: linux clustering
> |Subject: Re: [Linux-cluster] Nodes leaving and re-joining intermittently
> |
> |On 12/10/2011 05:00 PM, Matthew Painter wrote:
> |> The switch was our first thought, but that has been swapped, and while
> |> we are not having nodes fenced anymore (we were daily), this anomoly
> |> remains.
> |>
> |> I will ask for those logs and conf on Monday.
> |>
> |> I think it might be worth reinstalling corosync on this box anyway?
> |> Can't be healthy if it is exiting unclearly. I have has reports of the
> |> rgmanager dying on this box. (pid file but not running) Could that be
> |> related?
> |>
> |> Thanks :)
> |
> |It's impossible to say without knowing your configuration. Please share
> the
> |cluster.conf (only obfuscate passwords, please) along with the log files.
> |The more detail, the better. Versions, distros, network config, etc.
> |
> |Uninstalling corosync is not likely help. RGManager is something fairly
> |high up in the stack, so it's not likely the cause either.
> |
> |Did you configure the timeouts to be very high, by chance? I'm finding it
> |difficult to fathom how the node can withdraw without being fenced, short
> |of cleanly stopping the cluster stack. I suspect there is something
> |important not being said, which the configuration information, versions
> and
> |logs will hopefully expose.
> |
> |--
> |Digimer
> |E-Mail:              digimer at alteeve.com
> |Freenode handle:     digimer
> |Papers and Projects: http://alteeve.com
> |Node Assassin:       http://nodeassassin.org
> |"omg my singularity battery is dead again.
> |stupid hawking radiation." - epitron
> |
> |--
> |Linux-cluster mailing list
> |Linux-cluster at redhat.com
> |https://www.redhat.com/mailman/listinfo/linux-cluster
>
> This message is confidential and intended only for the addressee. If you
> have received this message in error, please immediately notify the
> postmaster at nds.com and delete it from your system as well as any copies.
> The content of e-mails as well as traffic data may be monitored by NDS for
> employment and security purposes.
> To protect the environment please do not print this e-mail unless
> necessary.
>
> An NDS Group Limited company. www.nds.com
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111211/7166942d/attachment.htm>

From chris.alexander at kusiri.com  Sun Dec 11 15:26:23 2011
From: chris.alexander at kusiri.com (Chris Alexander)
Date: Sun, 11 Dec 2011 15:26:23 +0000
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
In-Reply-To: <CALj8VczAUdXbkaqhe5h3nv_rWFBtmBK9LqYhQcQnOrx2ux-GnQ@mail.gmail.com>
References: <CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA@mail.gmail.com>
	<4EE3C74A.3000206@alteeve.com>
	<CALj8VczAysjYLke3WSictTGNGAHRUy_=a6nWawgRJyKO-0ajbQ@mail.gmail.com>
	<4EE3DBBF.9080006@alteeve.com>
	<6DAE69EA69F39E4B9DA073B8C848A27C60E82DE826@ILMA1.IL.NDS.COM>
	<CALj8VczAUdXbkaqhe5h3nv_rWFBtmBK9LqYhQcQnOrx2ux-GnQ@mail.gmail.com>
Message-ID: <CAOJZNMcMj1Q73KwsxZ2EJ_jPc0T5MJO9GOs05yYhLjfn9_csQw@mail.gmail.com>

Please find below the cluster.conf Matt mentioned.

Regarding logs, I have verified the 2 SNMP trap notifications that Matt
posted in his first message are the only ones that were processed by our
script anywhere near this event window (days until the previous one, none
since). I will have a look in the on-disk logging tomorrow and see if
there's anything of any worth over that time period on any of the cluster
nodes.

Thanks,

Chris

<?xml version="1.0"?>
<cluster config_version="30" name="camra">

<fence_daemon clean_start="1" post_fail_delay="30" post_join_delay="30"
override_time="30"/>

        <clusternodes>
                <clusternode name="xxx.xxx.xxx.1" nodeid="1">
                        <fence>
                                <method name="ilo">
                                        <device name="ilo1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="xxx.xxx.xxx.2" nodeid="2">
                        <fence>
                                <method name="ilo">
                                        <device name="ilo2"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="xxx.xxx.xxx.3" nodeid="3">
                        <fence>
                                <method name="ilo">
                                        <device name="ilo3"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>

<rm log_facility="local4" log_level="7">
                <failoverdomains>
                        <failoverdomain name="mysql" nofailback="0"
ordered="1" restricted="1">
                                <failoverdomainnode name="xxx.xxx.xxx.1"
priority="1"/>
                                <failoverdomainnode name="xxx.xxx.xxx.2"
priority="2"/>
                        </failoverdomain>
                        <failoverdomain name="solr" nofailback="0"
ordered="1" restricted="1">
                                <failoverdomainnode name="xxx.xxx.xxx.2"
priority="2"/>
                                <failoverdomainnode name="xxx.xxx.xxx.1"
priority="1"/>
                        </failoverdomain>
                        <failoverdomain name="cluster1" nofailback="1"
ordered="0" restricted="1">
                                <failoverdomainnode name="xxx.xxx.xxx.1"/>
                        </failoverdomain>
                        <failoverdomain name="cluster2" nofailback="1"
ordered="0" restricted="1">
                                <failoverdomainnode name="xxx.xxx.xxx.2"/>
                        </failoverdomain>
                        <failoverdomain name="cluster3" nofailback="1"
ordered="0" restricted="1">
                                <failoverdomainnode name="xxx.xxx.xxx.3"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <script file="/etc/init.d/fstest" name="fstest"/>
                        <script file="/etc/init.d/postfixstatus"
name="postfixstatus"/>
                        <script file="/etc/init.d/snmpdstatus"
name="snmpdstatus"/>
                        <script file="/etc/init.d/snmptrapdstatus"
name="snmptrapdstatus"/>
                        <script file="/etc/init.d/foghornstatus"
name="foghornstatus"/>
                </resources>
                <service domain="cluster1" max_restarts="50"
name="snmptrap1" recovery="restart-disable" restart_expire_time="900">
                        <script ref="fstest"/>
                        <script ref="foghornstatus"/>
                        <script ref="snmpdstatus"/>
                        <script ref="postfixstatus"/>
                        <script ref="snmptrapdstatus"/>
                </service>
                <service domain="cluster2" max_restarts="50"
name="snmptrap2" recovery="restart-disable" restart_expire_time="900">
                        <script ref="fstest"/>
                        <script ref="foghornstatus"/>
                        <script ref="snmpdstatus"/>
                        <script ref="postfixstatus"/>
                        <script ref="snmptrapdstatus"/>
                </service>
                <service domain="cluster3" max_restarts="50"
name="snmptrap3" recovery="restart-disable" restart_expire_time="900">
                        <script ref="fstest"/>
                        <script ref="foghornstatus"/>
                        <script ref="snmpdstatus"/>
                        <script ref="postfixstatus"/>
                        <script ref="snmptrapdstatus"/>
                </service>
        </rm>
        <fencedevices>
                <fencedevice agent="fence_ipmilan" ipaddr="xxx.xxx.xxx.101"
login="x" name="ilo1" passwd="x"/>
                <fencedevice agent="fence_ipmilan" ipaddr="xxx.xxx.xxx.102"
login="x" name="ilo2" passwd="x"/>
                <fencedevice agent="fence_ipmilan" ipaddr="xxx.xxx.xxx.103"
login="x" name="ilo3" passwd="x"/>
        </fencedevices>
</cluster>

On 11 December 2011 11:12, Matthew Painter <matthew.painter at kusiri.com>wrote:

> Thank you for your input :)
>
> The nodes are syncd using NTP. Although I am unsure about the respective
> run levels.
>
> I will look into this, thank you.
>
>
> On Sun, Dec 11, 2011 at 7:16 AM, Dukhan, Meir <Mdukhan at nds.com> wrote:
>
>>
>> Are your nodes time synced and how?
>>
>> We ran into problems of nodes being fenced because NTP problem.
>>
>> The solution (AFAIR, from the Redhat knowledge base) was to start ntpd
>> _before_ cman.
>> I'm not sure but there could be an update of openais or ntpd re this
>> issue.
>>
>> For those of you who have RedHat account, see the RedHat KB article:
>>
>>        Does cman need to have the time of nodes in sync?
>>        https://access.redhat.com/kb/docs/DOC-42471
>>
>> Hope this help,
>>
>> Regards,
>> -- Meir R. Dukhan
>>
>> |-----Original Message-----
>> |From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-
>> |bounces at redhat.com] On Behalf Of Digimer
>> |Sent: Sunday, December 11, 2011 0:23 AM
>> |To: Matthew Painter
>> |Cc: linux clustering
>> |Subject: Re: [Linux-cluster] Nodes leaving and re-joining intermittently
>> |
>> |On 12/10/2011 05:00 PM, Matthew Painter wrote:
>> |> The switch was our first thought, but that has been swapped, and while
>> |> we are not having nodes fenced anymore (we were daily), this anomoly
>> |> remains.
>> |>
>> |> I will ask for those logs and conf on Monday.
>> |>
>> |> I think it might be worth reinstalling corosync on this box anyway?
>> |> Can't be healthy if it is exiting unclearly. I have has reports of the
>> |> rgmanager dying on this box. (pid file but not running) Could that be
>> |> related?
>> |>
>> |> Thanks :)
>> |
>> |It's impossible to say without knowing your configuration. Please share
>> the
>> |cluster.conf (only obfuscate passwords, please) along with the log files.
>> |The more detail, the better. Versions, distros, network config, etc.
>> |
>> |Uninstalling corosync is not likely help. RGManager is something fairly
>> |high up in the stack, so it's not likely the cause either.
>> |
>> |Did you configure the timeouts to be very high, by chance? I'm finding it
>> |difficult to fathom how the node can withdraw without being fenced, short
>> |of cleanly stopping the cluster stack. I suspect there is something
>> |important not being said, which the configuration information, versions
>> and
>> |logs will hopefully expose.
>> |
>> |--
>> |Digimer
>> |E-Mail:              digimer at alteeve.com
>> |Freenode handle:     digimer
>> |Papers and Projects: http://alteeve.com
>> |Node Assassin:       http://nodeassassin.org
>> |"omg my singularity battery is dead again.
>> |stupid hawking radiation." - epitron
>> |
>> |--
>> |Linux-cluster mailing list
>> |Linux-cluster at redhat.com
>> |https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>> This message is confidential and intended only for the addressee. If you
>> have received this message in error, please immediately notify the
>> postmaster at nds.com and delete it from your system as well as any copies.
>> The content of e-mails as well as traffic data may be monitored by NDS for
>> employment and security purposes.
>> To protect the environment please do not print this e-mail unless
>> necessary.
>>
>> An NDS Group Limited company. www.nds.com
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111211/51ed606a/attachment.htm>

From Nicholas.Geovanis at uscellular.com  Sun Dec 11 17:24:08 2011
From: Nicholas.Geovanis at uscellular.com (Geovanis, Nicholas)
Date: Sun, 11 Dec 2011 11:24:08 -0600
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
	(Matthew Painter)
In-Reply-To: <mailman.39088.1323617194.23108.linux-cluster@redhat.com>
References: <mailman.39088.1323617194.23108.linux-cluster@redhat.com>
Message-ID: <A64E914D72373B4287DF2222856FFF351318DE72@ILBENEVS001.int.usc.local>

>> We are trying to get to the bottom of some odd intermittent behavior
on a cluster. We are intermittently 
>> seeing nodes leave and rejoin clusters, without being fenced. Further
the gap between leaving on re-joining is 
>> 8 minutes. We are monitoring the latency between boxes, and it is
acceptable (<5ms).

>From my recent experience, the first thing I would check is the
multicast config and behavior. I've deployed a couple dozen 2-3-node
clusters (with GFS2) in three different data-centers with three
seriously different network configurations. Multicast is always an
issue. RH Knowledgebase article
https://access.redhat.com/kb/docs/DOC-39175 has a python script
multicast.py which exercises it from client and server ends. It has come
in very handy. It sounds like it may be an intermittent problem, in
which case I might alter the script to reduce traffic a little but run
it longer-term as a diagnostic. If you're at RHEL 6.1 there is an
"omping" package in the channel/distro which serves the same purpose,
there's some info in the article on its use too. HTH......Nick G


Nick Geovanis
US Cellular/Kforce Inc
e. Nicholas.Geovanis at uscellular.com


Message: 1
Date: Sat, 10 Dec 2011 20:32:05 +0000
From: Matthew Painter <matthew.painter at kusiri.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: [Linux-cluster] Nodes leaving and re-joining intermittently
Message-ID:
	
<CALj8VcxxvOV_PTT9QZKJYnPuvhjBgoxNETBWxB4uCWCRhkzhSA at mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi all,

We are trying to get to the bottom of some odd intermittent behavior on
a cluster. We are intermittently seeing nodes leave and rejoin clusters,
without being fenced. Further the gap between leaving on re-joining is 8
minutes. We are monitoring the latency between boxes, and it is
acceptable (<5ms).

How can nodes exhibit this behavior? There seem to be no impact on the
services running on the box, just this leaving and re-joining. The SNMP
messages are below.

All help decoding this gratefully received! :)

Thanks,

Matt


From songyu555 at gmail.com  Mon Dec 12 03:29:39 2011
From: songyu555 at gmail.com (yu song)
Date: Mon, 12 Dec 2011 14:29:39 +1100
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
Message-ID: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>

Hi GFS2 gurus,

I am planning to setup 2 x 2 nodes clusters in our environment with using
emc storage to build cluster shared filesytems (GFS2).

PROD:  2 nodes ( cluster 1)

DR: 2 nodes ( cluster 2).

as below shows:

   *PROD*
*Cluster 1 share LUNs for PROD (node1, node2)***
?         1 x 100G = Tier 1 (R1)
?         1 x 200G = Tier 2 (R1)
?         1 x 200G = Tier 3 (R1)
**
 *DR*
*Cluster 2 share LUNs for DR (node 1,node 2)***
?         1 x 100G = Tier 1 (R2)
?         1 x 200G = Tier 2 (R2)
?         1 x 200G = Tier 3 (R2)
* *


My question is that GFS2 supports SRDF ??  looking at KB in redhat site, it
only says that GFS2 does not support using asynchronous or active/passive
array based replication. but it seems like does not apply for SRDF.

if anyone has done this before, appreciate you can give some ideas.

cheers!

Yu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111212/5306a633/attachment.htm>

From list at fajar.net  Mon Dec 12 04:03:10 2011
From: list at fajar.net (Fajar A. Nugraha)
Date: Mon, 12 Dec 2011 11:03:10 +0700
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
Message-ID: <CAG1y0sdsPReZ48K_5oRZNHVAMD2Hic9KVJyws3mnkDvpm9eDxw@mail.gmail.com>

On Mon, Dec 12, 2011 at 10:29 AM, yu song <songyu555 at gmail.com> wrote:
> My question is that GFS2 supports SRDF ??? looking at KB in redhat site, it only says that GFS2 does not support using asynchronous or active/passive array based replication. but it seems like does not apply for SRDF.

And why would you conclude it does not apply for SRDF? AFAIK it's just
another array replication. Nothing really special about it.

Also, "not supported" does not necessarily mean it won't work. It
might mean "it works, but performance will be horrible", or it might
also mean "it might work, but if anything goes wrong don't ask help
from us".

-- 
Fajar


From Chris.Jankowski at hp.com  Mon Dec 12 04:37:02 2011
From: Chris.Jankowski at hp.com (Jankowski, Chris)
Date: Mon, 12 Dec 2011 04:37:02 +0000
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
Message-ID: <036B68E61A28CA49AC2767596576CD596F9CC7A10B@GVW1113EXC.americas.hpqcorp.net>

Yu,

GFS2 or any other filesystems being replicated are not aware at all of the block replication taking place in the storage layer.  This is entirely transparent to the OS and filesystems clustered or not.   Replication happens entirely in the array/SAN layer and the servers are not involved at all.

So, there is nothing for Red Hat to support or not support - they just do not see it.  Nor do they have any ability to see it even if they wanted.  Very often the array ports for replication are on separate ports and in separate FC zones.

Storage replication may have some performance impact, but this just looks like slower disks.  GFS2 does not have any specific numerical requirements for IO rate, bandwidth and latency.

Could you quote the Red Hat KB - what exactly does it say and in what context?

Regards,

Chris Jankowski


From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of yu song
Sent: Monday, 12 December 2011 14:30
To: linux clustering
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??

Hi GFS2 gurus,

I am planning to setup 2 x 2 nodes clusters in our environment with using emc storage to build cluster shared filesytems (GFS2).

PROD:  2 nodes ( cluster 1)

DR: 2 nodes ( cluster 2).

as below shows:

PROD
Cluster 1 share LUNs for PROD (node1, node2)
*         1 x 100G = Tier 1 (R1)
*         1 x 200G = Tier 2 (R1)
*         1 x 200G = Tier 3 (R1)


DR
Cluster 2 share LUNs for DR (node 1,node 2)
*         1 x 100G = Tier 1 (R2)
*         1 x 200G = Tier 2 (R2)
*         1 x 200G = Tier 3 (R2)


My question is that GFS2 supports SRDF ??  looking at KB in redhat site, it only says that GFS2 does not support using asynchronous or active/passive array based replication. but it seems like does not apply for SRDF.

if anyone has done this before, appreciate you can give some ideas.

cheers!

Yu


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111212/ad03c859/attachment.htm>

From mgrac at redhat.com  Mon Dec 12 12:24:45 2011
From: mgrac at redhat.com (Marek Grac)
Date: Mon, 12 Dec 2011 13:24:45 +0100
Subject: [Linux-cluster] fence-agents-3.1.7 stable release
Message-ID: <4EE5F28D.4020502@redhat.com>

Welcome to the fence-agents 3.1.7 release.

This release includes these updates:
* manual pages are generated for most of the fence agents
* fence_vmware_soap accepts alias names as visible in Web UI (e.g. server1)
* fence agent for RSB was rewritten to fencing library, main benefit is
ssh support

The new source tarball can be downloaded here:

https://fedorahosted.org/releases/f/e/fence-agents/fence-agents-3.1.7.tar.xz

To report bugs or issues:

   https://bugzilla.redhat.com/

Would you like to meet the cluster team or members of its community?

   Join us on IRC (irc.freenode.net #linux-cluster) and share your
   experience  with other sysadministrators or power users.

Thanks/congratulations to all people that contributed to achieve this
great milestone.

m,


From swhiteho at redhat.com  Mon Dec 12 17:03:01 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Mon, 12 Dec 2011 17:03:01 +0000
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <036B68E61A28CA49AC2767596576CD596F9CC7A10B@GVW1113EXC.americas.hpqcorp.net>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<036B68E61A28CA49AC2767596576CD596F9CC7A10B@GVW1113EXC.americas.hpqcorp.net>
Message-ID: <1323709381.2729.22.camel@menhir>

Hi,

On Mon, 2011-12-12 at 04:37 +0000, Jankowski, Chris wrote:
> Yu,
> 
>  
> 
> GFS2 or any other filesystems being replicated are not aware at all of
> the block replication taking place in the storage layer.  This is
> entirely transparent to the OS and filesystems clustered or not.
>   Replication happens entirely in the array/SAN layer and the servers
> are not involved at all.  
> 
That is true if the cluster is contained within the same physical
location. If, for example, the plan was a split site implementation,
then the inter-site interconnect would be added into the equation too.
I'm not sure which case is being proposed in this particular case.

>  
> 
> So, there is nothing for Red Hat to support or not support ? they just
> do not see it.  Nor do they have any ability to see it even if they
> wanted.  Very often the array ports for replication are on separate
> ports and in separate FC zones.
> 
It is probably just a case of not supporting what we do not test. I'm
wondering what the use case would be if both arrays were on the same
site mirroring the same filesystem?

If they are on different sites, then knowing which end is active becomes
a problem. If both ends become active, even for a short time then there
is no way to merge the two filesytems together again later on.

>  
> 
> Storage replication may have some performance impact, but this just
> looks like slower disks.  GFS2 does not have any specific numerical
> requirements for IO rate, bandwidth and latency.
> 
True to a certain extent, but if things get too slow then obviously it
is not going to meet a reasonable expectation of performance. The disk
latency has a big effect on how quickly cached data can be migrated
between nodes.

Also, a guarantee of reasonable network bandwidth and latency is a
requirement for corosync and thus all the services running over it, such
as fencing. So there are some issues which need to be addressed in order
to ensure that everything works as intended,

Steve.


>  
> 
> Could you quote the Red Hat KB ? what exactly does it say and in what
> context?
> 
>  
> 
> Regards,
> 
>  
> 
> Chris Jankowski
> 
>  
> 
>  
> 
>  
> 
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of yu song
> Sent: Monday, 12 December 2011 14:30
> To: linux clustering
> Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
> 
> 
>  
> 
> Hi GFS2 gurus,
> 
> 
>  
> 
> 
> I am planning to setup 2 x 2 nodes clusters in our environment with
> using emc storage to build cluster shared filesytems (GFS2).
> 
> 
>  
> 
> 
> PROD:  2 nodes ( cluster 1)
> 
> 
>  
> 
> 
> DR: 2 nodes ( cluster 2).
> 
> 
>  
> 
> 
> as below shows: 
> 
> 
>  
> 
> 
> PROD
> 
> 
> Cluster 1 share LUNs for PROD
> (node1, node2)
> 
> 
> ?         1 x 100G = Tier 1 (R1)
> 
> 
> ?         1 x 200G = Tier 2 (R1)
> 
> 
> ?         1 x 200G = Tier 3 (R1)
> 
> 
>  
> 
> 
> DR
> 
> 
> Cluster 2 share LUNs for DR (node
> 1,node 2)
> 
> 
> ?         1 x 100G = Tier 1 (R2)
> 
> 
> ?         1 x 200G = Tier 2 (R2)
> 
> 
> ?         1 x 200G = Tier 3 (R2)
> 
> 
>  
> 
> 
>  
> 
> 
>  
> 
> 
> My question is that GFS2 supports SRDF ??  looking at KB in redhat
> site, it only says that GFS2 does not support using asynchronous or
> active/passive array based replication. but it seems like does not
> apply for SRDF.
> 
> 
>  
> 
> 
> if anyone has done this before, appreciate you can give some ideas.
> 
> 
>  
> 
> 
> cheers!
> 
> 
>  
> 
> 
> Yu
> 
> 
>  
> 
> 
>  
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From bmr at redhat.com  Tue Dec 13 14:26:25 2011
From: bmr at redhat.com (Bryn M. Reeves)
Date: Tue, 13 Dec 2011 14:26:25 +0000
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <036B68E61A28CA49AC2767596576CD596F9CC7A10B@GVW1113EXC.americas.hpqcorp.net>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<036B68E61A28CA49AC2767596576CD596F9CC7A10B@GVW1113EXC.americas.hpqcorp.net>
Message-ID: <4EE76091.4070609@redhat.com>

On 12/12/2011 04:37 AM, Jankowski, Chris wrote:
> GFS2 or any other filesystems being replicated are not aware at all
> of the block replication taking place in the storage layer.  This is
> entirely transparent to the OS and filesystems clustered or not.
> Replication happens entirely in the array/SAN layer and the servers
> are not involved at all.

That's not strictly true. While it is the case that normal I/O issued on 
the active (R1 in SRDF-speak) side of the replicated volume needs no 
special handling things get a bit more complex when the SRDF pair state 
changes (e.g. splits/failovers/partitions).

SRDF pair states govern which side of the replicated volume is writable 
and a change in state may mean a device abruptly becomes write-disabled.

There's currently no mechanism for handling these changes automatically 
on the host side so there is a need for manual intervention (or clever 
scripting) whenever such a change occurs.

> So, there is nothing for Red Hat to support or not support - they
> just do not see it.  Nor do they have any ability to see it even if
> they wanted.  Very often the array ports for replication are on
> separate ports and in separate FC zones.

SRDF is normally used between two or more Symmetrix arrays (often but 
not always in physically separate locations). Replication traffic 
travels over a dedicated inter-array link via a remote link director (a 
specialised symm channel adapter) rather than the regular front end 
director ports of the array.

In a typical deployment the R2 (remote/slave) side of the replicated 
volume is not reachable from the same fabric as the R1 (master).

This means additional considerations are required in a disaster recovery 
scenario since any backup hosts on the R2 side will need to be 
configured to cope with the fact that LUNs and ports will have different 
WWIDs than on the R1 site (this may cause problems for some multipath 
boot implementations for e.g.).

Regards,
Bryn.


From bmr at redhat.com  Tue Dec 13 14:43:10 2011
From: bmr at redhat.com (Bryn M. Reeves)
Date: Tue, 13 Dec 2011 14:43:10 +0000
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
Message-ID: <4EE7647E.5040005@redhat.com>

On 12/12/2011 03:29 AM, yu song wrote:
> My question is that GFS2 supports SRDF ??  looking at KB in redhat site, it
> only says that GFS2 does not support using asynchronous or active/passive
> array based replication. but it seems like does not apply for SRDF.

SRDF offers both synchronous and asynchronous replication but is 
active/passive. I.e. the administrator can configure whether the primary 
(R1) site waits for write acknowledgement from the remote (R2) site or 
not but at any one time it is only possible to write to either the R1 or 
the R2 device.

Synchronous replication guarantees write order fidelity for the R2 copy 
and ensures the remote copy is crash consistent at all times.

Asynchronous replication allows SRDF to support longer distances (or 
lower bandwidth / higher latency inter site links) by packaging multiple 
writes into delta sets to be sent to the remote site.

More complex modes and combinations exist that allow consistency to be 
maintained among a group of devices, for example a database's data store 
and redo logs, or that relax some of the synchronous replication 
guarantees to improve efficiency (semi-synchronous operation).

Active/passive in the context of storage replication usually refers to 
the states of the devices on the two sites. In active/active replication 
both sides are fully active at all times and writes may be issued on 
either side of the replication (a bit like multi-master application 
layer replication). An active/passive design only allows one side to be 
active for writes at a time.

Most array based implementations are active/passive and offer 
asynchronous, synchronous or semi-synchronous operation.

Regards,
Bryn.


From anprice at redhat.com  Tue Dec 13 17:07:21 2011
From: anprice at redhat.com (Andrew Price)
Date: Tue, 13 Dec 2011 17:07:21 +0000
Subject: [Linux-cluster] gfs2-utils 3.1.3 Released
Message-ID: <4EE78649.5020303@redhat.com>

Hi,

gfs2-utils 3.1.3 has been released. This version features a large number 
of fsck.gfs2 improvements and bug fixes, more i18n-related improvements, 
and various bug fixes, refactoring work and code clean-ups. See below 
for a full list of changes.

The source tarball is available from:

    https://fedorahosted.org/released/gfs2-utils/gfs2-utils-3.1.3.tar.gz

To report bugs or issues, please use:

    https://bugzilla.redhat.com/

Regards,

Andy Price
Red Hat File Systems


Changes since 3.1.2:

Abhijith Das (1):
       mount.gfs2 - gfs2 mounts doubled up in mtab

Andrew Price (16):
       mkfs: Handle gfs2 creation on regular files
       gfs2-utils: Update .gitignore for i18n files
       tunegfs2: Add some malloc error checking
       gfs2_edit: Initialize metafds fully in savemetaopen
       libgfs2: clean up some dead code in gfs2_writei
       libgfs2: Remove dead code from gfs2_get_leaf
       gfs_controld: Remove dead code from loop()
       gfs2_edit: Fix segfault in find by resource group
       libgfs2: Fix null pointer dereference in linked_leaf_search
       gfs2_edit: Fix zeroing the memory for indirect
       gfs_controld: Fix if-statement in shutdown_callback
       libgfs2: Fix pointer arithmetic in gfs2_quota_change_*
       libgfs2: Don't count sentinel dirent as an entry
       mkfs.gfs2: Some minor cleanups
       libgfs2: Improve rgblocks2bitblocks()
       gfs2-utils: Add gfs2-cluster systemd unit

Bob Peterson (68):
       fsck.gfs2: Make functions use sdp rather than sbp
       fsck.gfs2: Change "if(" to "if ("
       libgfs1: Add gfs1 variable to superblock structure
       libgfs2: Make check_sb and read_sb operate on gfs1
       libgfs2: move gfs1 structures to libgfs2
       fsck.gfs2: Check for blocks wrongly inside resource groups
       fsck.gfs2: Rename check_leaf to check_ealeaf_block
       fsck.gfs2: fsck.gfs2: Delete vestigial buffer_head in check_leaf
       fsck.gfs2: fsck.gfs2: Rename nlink functions to be intuitive
       fsck.gfs2: fsck.gfs2: Sync di_nlink adding links for lost+found
       fsck.gfs2: fsck.gfs2: Make dir entry count 32 bits
       fsck.gfs2: get rid of triple negative logic
       dirent_repair needs to mark the buffer as modified
       fsck.gfs2: fsck.gfs2: Ask to reclaim unlinked meta per-rgrp only
       fsck.gfs2: fsck.gfs2: Refactor add_dotdot function in lost+found
       libgfs2: libgfs2: Use __FUNCTION__ rather than __FILE__
       fsck.gfs2: fsck.gfs2: Don't stop invalidating blocks on invalid
       fsck.gfs2: fsck.gfs2: Find and clear duplicate leaf blocks refs
       fsck.gfs2: fsck.gfs2: Move check_num_ptrs from metawalk to pass1
       fsck.gfs2: fsck.gfs2: Duplicate ref processing for leaf blocks
       fsck.gfs2: fsck.gfs2: split check_leaf_blks to be more readable
       fsck.gfs2: Shorten output
       fsck.gfs2: Make output messages more sensible
       fsck.gfs pass2: Refactor function set_dotdor_dir
       fsck.gfs2 pass2: Delete extended attributes with inode
       fsck.gfs2 pass2: Don't delete invalid inode metadata
       fsck.gfs2 pass3: Refactor mark_and_return_parent
       fsck.gfs2: misc cosmetic changes
       fsck.gfs2: Don't use old_leaf if it was a duplicate
       fsck.gfs2: Add find_remove_dup, free_block_if_notdup
       fsck.gfs2: don't free prev rgrp list repairing rgrps
       libgfs2: eliminate gfs1_readi in favor of gfs2_readi
       libgfs2: Mark buffer modified adding a new GFS1 block
       libgfs2: Use dinode buffer to map gfs1 dinode blocks
       libgfs2: move block_map functions to fsck.gfs2
       libgfs2: eliminate gfs1_rindex_read
       libgfs2: combine ri_update and gfs1_ri_update
       libgfs2: combine gfs_inode_read and gfs_inode_get
       libgfs2: move gfs1 functions from edit to libgfs2
       gfs2_edit savemeta: save_inode_data backward for gfs1
       libgfs2: expand capabilities to operate on gfs1
       fsck.gfs2: Combine block and char device inode types
       fsck.gfs2: four-step duplicate elimination process
       fsck.gfs2: Add ability to check gfs1 file systems
       fsck.gfs2: Remove bad inodes from duplicate tree
       fsck.gfs2: Handle duplicate reference to dinode blocks
       fsck.gfs2: Bad extended attributes not deleted
       libgfs2: Make rebuild functions not re-read ip
       fsck.gfs2: Shorten debug output
       fsck.gfs2: Increment link count reporting wrong dinode
       fsck.gfs2: system dinodes take priority over user dinodes
       fsck.gfs2: Recognize partially gfs2-converted dinodes
       fsck.gfs2: Print step 2 duplicate debug msg first
       fsck.gfs2: pass1c counts percentage backward
       fsck.gfs2: Speed up rangecheck functions
       libgfs2: Make in-core rgrps use rbtree
       fsck.gfs2: Fix memory leaks
       gfs2_edit: Fix memory leaks
       Change man pages and gfs2_convert messages to include GFS
       fsck.gfs2: Journals not properly checked
       fsck.gfs2: Rearrange block types to group all inode types
       fsck.gfs2: Fix initialization error return codes
       fsck.gfs2: Don't use strerror for libgfs2 errors
       fsck.gfs2: Fix memory leak in initialize.c
       fsck.gfs2: Add return code checks and initializations
       gfs2_edit: Fix segfault jumping within rindex
       libgfs2: Fix off-by-one error in rgrp searching
       gfs2-utils: gfs2_grow fails to grow a filesystem with less than 3 RGs

Carlos Maiolino (10):
       libgfs2: Add new RGSIZE macros
       mkfs: Change hardcoded numbers by new macro definitions
       i18n: strings review
       mkfs: Use rpmatch() to yes/no questions
       mkfs: Remove unneeded open/close fd test from are_you_sure()
       mkfs: remove duplicated code to ask yes/no question
       mkfs: fix error handling
       gfs2_grow: fix error handling, i18n strings
       gfs2_jadd: Fix error handlers
       fsck: Merge strings

David Teigland (2):
       gfs_controld: track membership changes
       gfs_controld: don't ignore dlmc_fs_register error

Steven Whitehouse (12):
       tunegfs2: Fix usage and ensure we don't try to open a null device
       tunegfs2: Fix label/locktable setting code
       libgfs2: Move generic_interrupt() into utils
       libgfs2: Move gfs2_getch into utils
       libgfs2: Prepare to remove log_xxx() macros from library
       libgfs2: Move some debug messages out into mkfs/fsck
       libgfs2: Clean up sb read/check functions
       libgfs2: Remove some more log_xxx calls
       libgfs2: Clean up device geometry code
       libgfs2: More unused bits (re)moved
       libgfs2: Remove more unused macros/functions
       gfs2_edit: Fix signal handling and window resize


From rossnick-lists at cybercat.ca  Tue Dec 13 20:16:30 2011
From: rossnick-lists at cybercat.ca (Nicolas Ross)
Date: Tue, 13 Dec 2011 15:16:30 -0500
Subject: [Linux-cluster] Question about GFS2 journal size
Message-ID: <7ABBAA25C87B46949332F3C027E961AB@versa>

The default journal size for gfs2 is 128 megs. We have a 8 node cluster, so 
I create 10, just in case. So that yields to roughly 1.3 gig the size of an 
empty filesystem.

For some applications, I wish to create smaller file systems, like bellow 1 
gig. So to have 1.3 gig of journal for a 500 megs FS is like overkill. 
What's the use of the journal and can I lower down to a more proprotionate 
size for my application, like 16 or 32 meg each ? 


From rpeterso at redhat.com  Tue Dec 13 20:58:38 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Tue, 13 Dec 2011 15:58:38 -0500 (EST)
Subject: [Linux-cluster] Question about GFS2 journal size
In-Reply-To: <7ABBAA25C87B46949332F3C027E961AB@versa>
Message-ID: <e3848cbf-8aa2-4f3d-b5a1-d25c1541d3a9@zmail16.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| The default journal size for gfs2 is 128 megs. We have a 8 node
| cluster, so
| I create 10, just in case. So that yields to roughly 1.3 gig the size
| of an
| empty filesystem.
| 
| For some applications, I wish to create smaller file systems, like
| bellow 1
| gig. So to have 1.3 gig of journal for a 500 megs FS is like
| overkill.
| What's the use of the journal and can I lower down to a more
| proprotionate
| size for my application, like 16 or 32 meg each ?

Hi,

For GFS2 journals, you don't need to create extras like GFS1.
You only need one journal _per_mounting_node.  So your cluster
can be 32 nodes, but if only two of those nodes are going to
mount the file system, you only need 2 journals. So the question
is how many of your 8 nodes are going to mount?
If all 8 need to mount it simultaneously, then create 8 journals.
With GFS2, you can always easily add more journals with gfs2_jadd
(unlike GFS1).

Now as to journal size: The biggest factor to consider here is
performance. The reason you want big journals is so that the
processes writing metadata to the file system don't have to pause
and wait for journal flushing very often, which forces the journal
to be synced to the media, which is slow. IIRC, the minimum journal
size is 32 (MB), so 16 isn't an option. You probably wouldn't want
16MB anyway, because it would impact performance, as explained above.

So it depends on how you're planning to use the file system.

Regards,

Bob Peterson
Red Hat File Systems


From rossnick-lists at cybercat.ca  Tue Dec 13 21:18:04 2011
From: rossnick-lists at cybercat.ca (Nicolas Ross)
Date: Tue, 13 Dec 2011 16:18:04 -0500
Subject: [Linux-cluster] Question about GFS2 journal size
In-Reply-To: <e3848cbf-8aa2-4f3d-b5a1-d25c1541d3a9@zmail16.collab.prod.int.phx2.redhat.com>
References: <e3848cbf-8aa2-4f3d-b5a1-d25c1541d3a9@zmail16.collab.prod.int.phx2.redhat.com>
Message-ID: <4EE7C10C.3040503@cybercat.ca>


> Hi,
>
> For GFS2 journals, you don't need to create extras like GFS1.
> You only need one journal _per_mounting_node.  So your cluster
> can be 32 nodes, but if only two of those nodes are going to
> mount the file system, you only need 2 journals. So the question
> is how many of your 8 nodes are going to mount?
> If all 8 need to mount it simultaneously, then create 8 journals.
> With GFS2, you can always easily add more journals with gfs2_jadd
> (unlike GFS1).

Thanks, I'll then only make 8 journals...

> Now as to journal size: The biggest factor to consider here is
> performance. The reason you want big journals is so that the
> processes writing metadata to the file system don't have to pause
> and wait for journal flushing very often, which forces the journal
> to be synced to the media, which is slow. IIRC, the minimum journal
> size is 32 (MB), so 16 isn't an option. You probably wouldn't want
> 16MB anyway, because it would impact performance, as explained above.
>
> So it depends on how you're planning to use the file system.

For the particular instance I am building, it's a "test service" that 
doesn't require almost no writing at all to the FS. And as for the 
minimum size, from the man page, it's 8 megs. So for this one, i'll go 
with 16 megs, that'll be enough...

Thanks,

Nicolas


From rossnick-lists at cybercat.ca  Tue Dec 13 21:18:19 2011
From: rossnick-lists at cybercat.ca (Nicolas Ross)
Date: Tue, 13 Dec 2011 16:18:19 -0500
Subject: [Linux-cluster] Question about GFS2 journal size
In-Reply-To: <e3848cbf-8aa2-4f3d-b5a1-d25c1541d3a9@zmail16.collab.prod.int.phx2.redhat.com>
References: <e3848cbf-8aa2-4f3d-b5a1-d25c1541d3a9@zmail16.collab.prod.int.phx2.redhat.com>
Message-ID: <4EE7C11B.6060201@cybercat.ca>


> Hi,
>
> For GFS2 journals, you don't need to create extras like GFS1.
> You only need one journal _per_mounting_node.  So your cluster
> can be 32 nodes, but if only two of those nodes are going to
> mount the file system, you only need 2 journals. So the question
> is how many of your 8 nodes are going to mount?
> If all 8 need to mount it simultaneously, then create 8 journals.
> With GFS2, you can always easily add more journals with gfs2_jadd
> (unlike GFS1).

Thanks, I'll then only make 8 journals...

> Now as to journal size: The biggest factor to consider here is
> performance. The reason you want big journals is so that the
> processes writing metadata to the file system don't have to pause
> and wait for journal flushing very often, which forces the journal
> to be synced to the media, which is slow. IIRC, the minimum journal
> size is 32 (MB), so 16 isn't an option. You probably wouldn't want
> 16MB anyway, because it would impact performance, as explained above.
>
> So it depends on how you're planning to use the file system.

For the particular instance I am building, it's a "test service" that 
doesn't require almost no writing at all to the FS. And as for the 
minimum size, from the man page, it's 8 megs. So for this one, i'll go 
with 16 megs, that'll be enough...

Thanks,

Nicolas


From Chris.Jankowski at hp.com  Wed Dec 14 01:03:11 2011
From: Chris.Jankowski at hp.com (Jankowski, Chris)
Date: Wed, 14 Dec 2011 01:03:11 +0000
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <4EE7647E.5040005@redhat.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<4EE7647E.5040005@redhat.com>
Message-ID: <036B68E61A28CA49AC2767596576CD596F9CC7A9A1@GVW1113EXC.americas.hpqcorp.net>

*Unidirectional* replication is probably a better phrase to describe what EMC SRDF and all other typical block mode storage arrays do for replication.

Typically this is used for manual or semi-automated DR systems and works very well for this purpose. This approach splits the HA and DR domains.

It can also be used with a HA stretched cluster configuration for failing over services from one site to the other.  You need to integrate into the service scripts unmounting of the filesystems for the service on one site, changing the direction of the replication and mounting the filesystem on the other site. This is quite complex and fiddly to say the least. I have yet to see an implementation where the users will be really happy with the robustness of the integrated solution.

To implement a HA cluster that uses a cluster filesystem such as GFS2 across geographical area you need a different type of storage - a geographically distributed storage to have a chance of the cluster surviving the inter-site link failure or site failure. Standard unidirectional replication won't do for this. I know of only one such storage - Left Hand Networks iSCSI arrays (now owned by HP - the P4300, P4500 and P4800 storage arrays). Again, implementation of such cluster is very complex. IMHO it is easier to have local HA clusters on both sites and a good DR process based on replication.

You could also try to implement the stretched cluster purely in software using separate LUNs on storage arrays in two sites and mirroring them.  Personally, I believe that this will not yield a robust solution with the current versions of software.

Regards,

Chris Jankowski

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bryn M. Reeves
Sent: Wednesday, 14 December 2011 01:43
To: linux clustering
Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??

On 12/12/2011 03:29 AM, yu song wrote:
> My question is that GFS2 supports SRDF ??  looking at KB in redhat site, it
> only says that GFS2 does not support using asynchronous or active/passive
> array based replication. but it seems like does not apply for SRDF.

SRDF offers both synchronous and asynchronous replication but is 
active/passive. I.e. the administrator can configure whether the primary 
(R1) site waits for write acknowledgement from the remote (R2) site or 
not but at any one time it is only possible to write to either the R1 or 
the R2 device.

Synchronous replication guarantees write order fidelity for the R2 copy 
and ensures the remote copy is crash consistent at all times.

Asynchronous replication allows SRDF to support longer distances (or 
lower bandwidth / higher latency inter site links) by packaging multiple 
writes into delta sets to be sent to the remote site.

More complex modes and combinations exist that allow consistency to be 
maintained among a group of devices, for example a database's data store 
and redo logs, or that relax some of the synchronous replication 
guarantees to improve efficiency (semi-synchronous operation).

Active/passive in the context of storage replication usually refers to 
the states of the devices on the two sites. In active/active replication 
both sides are fully active at all times and writes may be issued on 
either side of the replication (a bit like multi-master application 
layer replication). An active/passive design only allows one side to be 
active for writes at a time.

Most array based implementations are active/passive and offer 
asynchronous, synchronous or semi-synchronous operation.

Regards,
Bryn.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From kolapallisatya531 at gmail.com  Wed Dec 14 08:22:28 2011
From: kolapallisatya531 at gmail.com (satya suresh kolapalli)
Date: Wed, 14 Dec 2011 13:52:28 +0530
Subject: [Linux-cluster] Document on gnbd
Message-ID: <CAJsiSVrcFfP6LswRAiVGsmPaG0Pi818wbLf0QeyXXfyv6QkbKg@mail.gmail.com>

Sir,

I need a document how to link gfs to gnbd

-- 
Regards,
SatyaSuresh Kolapalli
Mob: 7702430892


From list at fajar.net  Wed Dec 14 08:30:12 2011
From: list at fajar.net (Fajar A. Nugraha)
Date: Wed, 14 Dec 2011 15:30:12 +0700
Subject: [Linux-cluster] Document on gnbd
In-Reply-To: <CAJsiSVrcFfP6LswRAiVGsmPaG0Pi818wbLf0QeyXXfyv6QkbKg@mail.gmail.com>
References: <CAJsiSVrcFfP6LswRAiVGsmPaG0Pi818wbLf0QeyXXfyv6QkbKg@mail.gmail.com>
Message-ID: <CAG1y0scpChOZDK-iK4N_3m_U3mTVKSMeENBxXq0DNNVUeoZ0bg@mail.gmail.com>

On Wed, Dec 14, 2011 at 3:22 PM, satya suresh kolapalli
<kolapallisatya531 at gmail.com> wrote:
> Sir,
>
> I need a document how to link gfs to gnbd

Did you try Google?
http://sourceware.org/cluster/gnbd/gnbd_usage.txt

IIRC gnbd is not maintained anymore though. You'd probably have better
luck using a SAN/NAS (or even just a properly-configured server
running your favorite linux distro) exporting iscsi LUNs, then use
gfs/gfs2 on top of that.

-- 
Fajar


From michael.trachtman at emc.com  Wed Dec 14 21:56:01 2011
From: michael.trachtman at emc.com (michael.trachtman at emc.com)
Date: Wed, 14 Dec 2011 16:56:01 -0500
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <036B68E61A28CA49AC2767596576CD596F9CC7A9A1@GVW1113EXC.americas.hpqcorp.net>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<4EE7647E.5040005@redhat.com>
	<036B68E61A28CA49AC2767596576CD596F9CC7A9A1@GVW1113EXC.americas.hpqcorp.net>
Message-ID: <D8B05DA4519DFB418920B12022F6D5BF0676D010@MX01A.corp.emc.com>

Actually, another product that implements "geographically distributed storage" is VPlex (from EMC).  VPlex is a product for geographically distributed storage.  It works quite nicely.   And, yes you can put a HA file system on top of that.   I haven't tried it with GSF2 yet; but I have tried it with OCFS2, and there is no reason why GFS2 would be any different.  I.e. there is every reason why it should work.

..m


>>>To implement a HA cluster that uses a cluster filesystem such as GFS2 across geographical area you need a 
>>> different type of storage - a geographically distributed storage to have a chance of the cluster surviving the 
>>> inter-site link failure or site failure. Standard unidirectional replication won't do for this. I know of only 
>>>one such storage - Left Hand Networks iSCSI arrays (now owned by HP - the P4300, P4500 and P4800 storage 
>>>arrays). Again, implementation of such cluster is very complex. IMHO it is easier to have local HA clusters on 
>>>both sites and a good DR process based on replication.

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jankowski, Chris
Sent: Wednesday, December 14, 2011 3:03 AM
To: linux clustering
Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??

*Unidirectional* replication is probably a better phrase to describe what EMC SRDF and all other typical block mode storage arrays do for replication.

Typically this is used for manual or semi-automated DR systems and works very well for this purpose. This approach splits the HA and DR domains.

It can also be used with a HA stretched cluster configuration for failing over services from one site to the other.  You need to integrate into the service scripts unmounting of the filesystems for the service on one site, changing the direction of the replication and mounting the filesystem on the other site. This is quite complex and fiddly to say the least. I have yet to see an implementation where the users will be really happy with the robustness of the integrated solution.

To implement a HA cluster that uses a cluster filesystem such as GFS2 across geographical area you need a different type of storage - a geographically distributed storage to have a chance of the cluster surviving the inter-site link failure or site failure. Standard unidirectional replication won't do for this. I know of only one such storage - Left Hand Networks iSCSI arrays (now owned by HP - the P4300, P4500 and P4800 storage arrays). Again, implementation of such cluster is very complex. IMHO it is easier to have local HA clusters on both sites and a good DR process based on replication.

You could also try to implement the stretched cluster purely in software using separate LUNs on storage arrays in two sites and mirroring them.  Personally, I believe that this will not yield a robust solution with the current versions of software.

Regards,

Chris Jankowski

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bryn M. Reeves
Sent: Wednesday, 14 December 2011 01:43
To: linux clustering
Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??

On 12/12/2011 03:29 AM, yu song wrote:
> My question is that GFS2 supports SRDF ??  looking at KB in redhat site, it
> only says that GFS2 does not support using asynchronous or active/passive
> array based replication. but it seems like does not apply for SRDF.

SRDF offers both synchronous and asynchronous replication but is 
active/passive. I.e. the administrator can configure whether the primary 
(R1) site waits for write acknowledgement from the remote (R2) site or 
not but at any one time it is only possible to write to either the R1 or 
the R2 device.

Synchronous replication guarantees write order fidelity for the R2 copy 
and ensures the remote copy is crash consistent at all times.

Asynchronous replication allows SRDF to support longer distances (or 
lower bandwidth / higher latency inter site links) by packaging multiple 
writes into delta sets to be sent to the remote site.

More complex modes and combinations exist that allow consistency to be 
maintained among a group of devices, for example a database's data store 
and redo logs, or that relax some of the synchronous replication 
guarantees to improve efficiency (semi-synchronous operation).

Active/passive in the context of storage replication usually refers to 
the states of the devices on the two sites. In active/active replication 
both sides are fully active at all times and writes may be issued on 
either side of the replication (a bit like multi-master application 
layer replication). An active/passive design only allows one side to be 
active for writes at a time.

Most array based implementations are active/passive and offer 
asynchronous, synchronous or semi-synchronous operation.

Regards,
Bryn.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From erik.redding at txstate.edu  Wed Dec 14 22:53:24 2011
From: erik.redding at txstate.edu (Redding, Erik)
Date: Wed, 14 Dec 2011 16:53:24 -0600
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <D8B05DA4519DFB418920B12022F6D5BF0676D010@MX01A.corp.emc.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<4EE7647E.5040005@redhat.com>
	<036B68E61A28CA49AC2767596576CD596F9CC7A9A1@GVW1113EXC.americas.hpqcorp.net>
	<D8B05DA4519DFB418920B12022F6D5BF0676D010@MX01A.corp.emc.com>
Message-ID: <85669649-6BA0-46A9-9D2A-2D8405784408@txstate.edu>

HP had a product called SVSP - they're phasing it out now (they sold the IP or lost the lease), but it beat the *pants* off of VPlex (which requires 24 front-side ports on your MDS - laughable to the 4 needed for HP SVSP).  We've got 96Tb SVSP configuration and use it for our stretch VMWare cluster and it's doing a great job.  I'm waiting for a firmware upgrade so I can use it with RHCS.  There's an alua issue with the current version of the firmware and it causes issues with multipathd working correctly.


Erik Redding
Systems Programmer, RHCE
Core Systems
Texas State University-San Marcos


On Dec 14, 2011, at 3:56 PM, <michael.trachtman at emc.com> <michael.trachtman at emc.com> wrote:

> Actually, another product that implements "geographically distributed storage" is VPlex (from EMC).  VPlex is a product for geographically distributed storage.  It works quite nicely.   And, yes you can put a HA file system on top of that.   I haven't tried it with GSF2 yet; but I have tried it with OCFS2, and there is no reason why GFS2 would be any different.  I.e. there is every reason why it should work.
> 
> ..m
> 
> 
>>>> To implement a HA cluster that uses a cluster filesystem such as GFS2 across geographical area you need a 
>>>> different type of storage - a geographically distributed storage to have a chance of the cluster surviving the 
>>>> inter-site link failure or site failure. Standard unidirectional replication won't do for this. I know of only 
>>>> one such storage - Left Hand Networks iSCSI arrays (now owned by HP - the P4300, P4500 and P4800 storage 
>>>> arrays). Again, implementation of such cluster is very complex. IMHO it is easier to have local HA clusters on 
>>>> both sites and a good DR process based on replication.
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jankowski, Chris
> Sent: Wednesday, December 14, 2011 3:03 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??
> 
> *Unidirectional* replication is probably a better phrase to describe what EMC SRDF and all other typical block mode storage arrays do for replication.
> 
> Typically this is used for manual or semi-automated DR systems and works very well for this purpose. This approach splits the HA and DR domains.
> 
> It can also be used with a HA stretched cluster configuration for failing over services from one site to the other.  You need to integrate into the service scripts unmounting of the filesystems for the service on one site, changing the direction of the replication and mounting the filesystem on the other site. This is quite complex and fiddly to say the least. I have yet to see an implementation where the users will be really happy with the robustness of the integrated solution.
> 
> To implement a HA cluster that uses a cluster filesystem such as GFS2 across geographical area you need a different type of storage - a geographically distributed storage to have a chance of the cluster surviving the inter-site link failure or site failure. Standard unidirectional replication won't do for this. I know of only one such storage - Left Hand Networks iSCSI arrays (now owned by HP - the P4300, P4500 and P4800 storage arrays). Again, implementation of such cluster is very complex. IMHO it is easier to have local HA clusters on both sites and a good DR process based on replication.
> 
> You could also try to implement the stretched cluster purely in software using separate LUNs on storage arrays in two sites and mirroring them.  Personally, I believe that this will not yield a robust solution with the current versions of software.
> 
> Regards,
> 
> Chris Jankowski
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bryn M. Reeves
> Sent: Wednesday, 14 December 2011 01:43
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??
> 
> On 12/12/2011 03:29 AM, yu song wrote:
>> My question is that GFS2 supports SRDF ??  looking at KB in redhat site, it
>> only says that GFS2 does not support using asynchronous or active/passive
>> array based replication. but it seems like does not apply for SRDF.
> 
> SRDF offers both synchronous and asynchronous replication but is 
> active/passive. I.e. the administrator can configure whether the primary 
> (R1) site waits for write acknowledgement from the remote (R2) site or 
> not but at any one time it is only possible to write to either the R1 or 
> the R2 device.
> 
> Synchronous replication guarantees write order fidelity for the R2 copy 
> and ensures the remote copy is crash consistent at all times.
> 
> Asynchronous replication allows SRDF to support longer distances (or 
> lower bandwidth / higher latency inter site links) by packaging multiple 
> writes into delta sets to be sent to the remote site.
> 
> More complex modes and combinations exist that allow consistency to be 
> maintained among a group of devices, for example a database's data store 
> and redo logs, or that relax some of the synchronous replication 
> guarantees to improve efficiency (semi-synchronous operation).
> 
> Active/passive in the context of storage replication usually refers to 
> the states of the devices on the two sites. In active/active replication 
> both sides are fully active at all times and writes may be issued on 
> either side of the replication (a bit like multi-master application 
> layer replication). An active/passive design only allows one side to be 
> active for writes at a time.
> 
> Most array based implementations are active/passive and offer 
> asynchronous, synchronous or semi-synchronous operation.
> 
> Regards,
> Bryn.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 4897 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111214/adab2de0/attachment.p7s>

From songyu555 at gmail.com  Thu Dec 15 02:06:20 2011
From: songyu555 at gmail.com (yu song)
Date: Thu, 15 Dec 2011 13:06:20 +1100
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <D8B05DA4519DFB418920B12022F6D5BF0676D010@MX01A.corp.emc.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<4EE7647E.5040005@redhat.com>
	<036B68E61A28CA49AC2767596576CD596F9CC7A9A1@GVW1113EXC.americas.hpqcorp.net>
	<D8B05DA4519DFB418920B12022F6D5BF0676D010@MX01A.corp.emc.com>
Message-ID: <CADJUD+bt9y8RP3DsfTypA8R-kGdLVo4njY1ndM_ACy2mH6KQEw@mail.gmail.com>

Gents,

beauty!! it is great to see your ideas.

I found a doc from redhat kb, which has the following statement

"
*Multi-Site Disaster Recovery Clusters*

A multi-site cluster established for disaster recovery comprises two
completely different clusters. These clusters typically have the same
configuration, with one active and the other passive (and sometimes powered
off). If the primary site fails, the secondary site is manually activated
and takes over all services.


Multi-site clusters are supported since implementation involves two
separate clusters with the same configuration/architecture at two physical
locations. Shared storage must be replicated from the primary to the
back-up site using array-based replication. During a site failover, the
cluster administrator must first toggle the directionality of the storage
replication so that the back-up site becomes the primary and then start up
the back-up cluster. These steps cannot be automated since using heuristics
like site-to-site link failure might result in primary/back-up toggling
when there are intermittent network failures.
"

It does give me an answer what I am after.

have a great Christmas!!


On Thu, Dec 15, 2011 at 8:56 AM, <michael.trachtman at emc.com> wrote:

> Actually, another product that implements "geographically distributed
> storage" is VPlex (from EMC).  VPlex is a product for geographically
> distributed storage.  It works quite nicely.   And, yes you can put a HA
> file system on top of that.   I haven't tried it with GSF2 yet; but I have
> tried it with OCFS2, and there is no reason why GFS2 would be any
> different.  I.e. there is every reason why it should work.
>
> ..m
>
>
> >>>To implement a HA cluster that uses a cluster filesystem such as GFS2
> across geographical area you need a
> >>> different type of storage - a geographically distributed storage to
> have a chance of the cluster surviving the
> >>> inter-site link failure or site failure. Standard unidirectional
> replication won't do for this. I know of only
> >>>one such storage - Left Hand Networks iSCSI arrays (now owned by HP -
> the P4300, P4500 and P4800 storage
> >>>arrays). Again, implementation of such cluster is very complex. IMHO it
> is easier to have local HA clusters on
> >>>both sites and a good DR process based on replication.
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat.com] On Behalf Of Jankowski, Chris
> Sent: Wednesday, December 14, 2011 3:03 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??
>
> *Unidirectional* replication is probably a better phrase to describe what
> EMC SRDF and all other typical block mode storage arrays do for replication.
>
> Typically this is used for manual or semi-automated DR systems and works
> very well for this purpose. This approach splits the HA and DR domains.
>
> It can also be used with a HA stretched cluster configuration for failing
> over services from one site to the other.  You need to integrate into the
> service scripts unmounting of the filesystems for the service on one site,
> changing the direction of the replication and mounting the filesystem on
> the other site. This is quite complex and fiddly to say the least. I have
> yet to see an implementation where the users will be really happy with the
> robustness of the integrated solution.
>
> To implement a HA cluster that uses a cluster filesystem such as GFS2
> across geographical area you need a different type of storage - a
> geographically distributed storage to have a chance of the cluster
> surviving the inter-site link failure or site failure. Standard
> unidirectional replication won't do for this. I know of only one such
> storage - Left Hand Networks iSCSI arrays (now owned by HP - the P4300,
> P4500 and P4800 storage arrays). Again, implementation of such cluster is
> very complex. IMHO it is easier to have local HA clusters on both sites and
> a good DR process based on replication.
>
> You could also try to implement the stretched cluster purely in software
> using separate LUNs on storage arrays in two sites and mirroring them.
>  Personally, I believe that this will not yield a robust solution with the
> current versions of software.
>
> Regards,
>
> Chris Jankowski
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat.com] On Behalf Of Bryn M. Reeves
> Sent: Wednesday, 14 December 2011 01:43
> To: linux clustering
> Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??
>
> On 12/12/2011 03:29 AM, yu song wrote:
> > My question is that GFS2 supports SRDF ??  looking at KB in redhat site,
> it
> > only says that GFS2 does not support using asynchronous or active/passive
> > array based replication. but it seems like does not apply for SRDF.
>
> SRDF offers both synchronous and asynchronous replication but is
> active/passive. I.e. the administrator can configure whether the primary
> (R1) site waits for write acknowledgement from the remote (R2) site or
> not but at any one time it is only possible to write to either the R1 or
> the R2 device.
>
> Synchronous replication guarantees write order fidelity for the R2 copy
> and ensures the remote copy is crash consistent at all times.
>
> Asynchronous replication allows SRDF to support longer distances (or
> lower bandwidth / higher latency inter site links) by packaging multiple
> writes into delta sets to be sent to the remote site.
>
> More complex modes and combinations exist that allow consistency to be
> maintained among a group of devices, for example a database's data store
> and redo logs, or that relax some of the synchronous replication
> guarantees to improve efficiency (semi-synchronous operation).
>
> Active/passive in the context of storage replication usually refers to
> the states of the devices on the two sites. In active/active replication
> both sides are fully active at all times and writes may be issued on
> either side of the replication (a bit like multi-master application
> layer replication). An active/passive design only allows one side to be
> active for writes at a time.
>
> Most array based implementations are active/passive and offer
> asynchronous, synchronous or semi-synchronous operation.
>
> Regards,
> Bryn.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111215/1346f11c/attachment.htm>

From swhiteho at redhat.com  Thu Dec 15 10:06:11 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 15 Dec 2011 10:06:11 +0000
Subject: [Linux-cluster] GFS2 support EMC storage SRDF??
In-Reply-To: <CADJUD+bt9y8RP3DsfTypA8R-kGdLVo4njY1ndM_ACy2mH6KQEw@mail.gmail.com>
References: <CADJUD+a26mNpF5h063fRKPp6AgWBpr5R7jbHzqR76xDACLib+Q@mail.gmail.com>
	<4EE7647E.5040005@redhat.com>
	<036B68E61A28CA49AC2767596576CD596F9CC7A9A1@GVW1113EXC.americas.hpqcorp.net>
	<D8B05DA4519DFB418920B12022F6D5BF0676D010@MX01A.corp.emc.com>
	<CADJUD+bt9y8RP3DsfTypA8R-kGdLVo4njY1ndM_ACy2mH6KQEw@mail.gmail.com>
Message-ID: <1323943571.2754.1.camel@menhir>

Hi,

On Thu, 2011-12-15 at 13:06 +1100, yu song wrote:
> Gents, 
> 
> beauty!! it is great to see your ideas.
> 
> I found a doc from redhat kb, which has the following statement
> 
> "
> 
> Multi-Site Disaster Recovery Clusters
> A multi-site cluster established for disaster recovery comprises two
> completely different clusters. These clusters typically have the same
> configuration, with one active and the other passive (and sometimes
> powered off). If the primary site fails, the secondary site is
> manually activated and takes over all services.
> 
>  
> 
> Multi-site clusters are supported since implementation involves two
> separate clusters with the same configuration/architecture at two
> physical locations. Shared storage must be replicated from the primary
> to the back-up site using array-based replication. During a site
> failover, the cluster administrator must first toggle the
> directionality of the storage replication so that the back-up site
> becomes the primary and then start up the back-up cluster. These steps
> cannot be automated since using heuristics like site-to-site link
> failure might result in primary/back-up toggling when there are
> intermittent network failures.
> 
> "
> 
> It does give me an answer what I am after.
> 
> have a great Christmas!!
> 

Also, just to clarify, these multi-site clusters are not supported when
combined with GFS2,

Steve.

> 
> On Thu, Dec 15, 2011 at 8:56 AM, <michael.trachtman at emc.com> wrote:
>         Actually, another product that implements "geographically
>         distributed storage" is VPlex (from EMC).  VPlex is a product
>         for geographically distributed storage.  It works quite
>         nicely.   And, yes you can put a HA file system on top of
>         that.   I haven't tried it with GSF2 yet; but I have tried it
>         with OCFS2, and there is no reason why GFS2 would be any
>         different.  I.e. there is every reason why it should work.
>         
>         ..m
>         
>         
>         >>>To implement a HA cluster that uses a cluster filesystem
>         such as GFS2 across geographical area you need a
>         >>> different type of storage - a geographically distributed
>         storage to have a chance of the cluster surviving the
>         >>> inter-site link failure or site failure. Standard
>         unidirectional replication won't do for this. I know of only
>         >>>one such storage - Left Hand Networks iSCSI arrays (now
>         owned by HP - the P4300, P4500 and P4800 storage
>         >>>arrays). Again, implementation of such cluster is very
>         complex. IMHO it is easier to have local HA clusters on
>         >>>both sites and a good DR process based on replication.
>         
>         -----Original Message-----
>         From: linux-cluster-bounces at redhat.com
>         [mailto:linux-cluster-bounces at redhat.com] On Behalf Of
>         Jankowski, Chris
>         Sent: Wednesday, December 14, 2011 3:03 AM
>         To: linux clustering
>         Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??
>         
>         *Unidirectional* replication is probably a better phrase to
>         describe what EMC SRDF and all other typical block mode
>         storage arrays do for replication.
>         
>         Typically this is used for manual or semi-automated DR systems
>         and works very well for this purpose. This approach splits the
>         HA and DR domains.
>         
>         It can also be used with a HA stretched cluster configuration
>         for failing over services from one site to the other.  You
>         need to integrate into the service scripts unmounting of the
>         filesystems for the service on one site, changing the
>         direction of the replication and mounting the filesystem on
>         the other site. This is quite complex and fiddly to say the
>         least. I have yet to see an implementation where the users
>         will be really happy with the robustness of the integrated
>         solution.
>         
>         To implement a HA cluster that uses a cluster filesystem such
>         as GFS2 across geographical area you need a different type of
>         storage - a geographically distributed storage to have a
>         chance of the cluster surviving the inter-site link failure or
>         site failure. Standard unidirectional replication won't do for
>         this. I know of only one such storage - Left Hand Networks
>         iSCSI arrays (now owned by HP - the P4300, P4500 and P4800
>         storage arrays). Again, implementation of such cluster is very
>         complex. IMHO it is easier to have local HA clusters on both
>         sites and a good DR process based on replication.
>         
>         You could also try to implement the stretched cluster purely
>         in software using separate LUNs on storage arrays in two sites
>         and mirroring them.  Personally, I believe that this will not
>         yield a robust solution with the current versions of software.
>         
>         Regards,
>         
>         Chris Jankowski
>         
>         -----Original Message-----
>         From: linux-cluster-bounces at redhat.com
>         [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bryn M.
>         Reeves
>         Sent: Wednesday, 14 December 2011 01:43
>         To: linux clustering
>         Subject: Re: [Linux-cluster] GFS2 support EMC storage SRDF??
>         
>         On 12/12/2011 03:29 AM, yu song wrote:
>         > My question is that GFS2 supports SRDF ??  looking at KB in
>         redhat site, it
>         > only says that GFS2 does not support using asynchronous or
>         active/passive
>         > array based replication. but it seems like does not apply
>         for SRDF.
>         
>         
>         SRDF offers both synchronous and asynchronous replication but
>         is
>         active/passive. I.e. the administrator can configure whether
>         the primary
>         (R1) site waits for write acknowledgement from the remote (R2)
>         site or
>         not but at any one time it is only possible to write to either
>         the R1 or
>         the R2 device.
>         
>         Synchronous replication guarantees write order fidelity for
>         the R2 copy
>         and ensures the remote copy is crash consistent at all times.
>         
>         Asynchronous replication allows SRDF to support longer
>         distances (or
>         lower bandwidth / higher latency inter site links) by
>         packaging multiple
>         writes into delta sets to be sent to the remote site.
>         
>         More complex modes and combinations exist that allow
>         consistency to be
>         maintained among a group of devices, for example a database's
>         data store
>         and redo logs, or that relax some of the synchronous
>         replication
>         guarantees to improve efficiency (semi-synchronous operation).
>         
>         Active/passive in the context of storage replication usually
>         refers to
>         the states of the devices on the two sites. In active/active
>         replication
>         both sides are fully active at all times and writes may be
>         issued on
>         either side of the replication (a bit like multi-master
>         application
>         layer replication). An active/passive design only allows one
>         side to be
>         active for writes at a time.
>         
>         Most array based implementations are active/passive and offer
>         asynchronous, synchronous or semi-synchronous operation.
>         
>         Regards,
>         Bryn.
>         
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com
>         https://www.redhat.com/mailman/listinfo/linux-cluster
>         
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com
>         https://www.redhat.com/mailman/listinfo/linux-cluster
>         
>         
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com
>         https://www.redhat.com/mailman/listinfo/linux-cluster
>         
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From raju.rajsand at gmail.com  Thu Dec 15 13:58:32 2011
From: raju.rajsand at gmail.com (Rajagopal Swaminathan)
Date: Thu, 15 Dec 2011 19:28:32 +0530
Subject: [Linux-cluster] Document on gnbd
In-Reply-To: <CAG1y0scpChOZDK-iK4N_3m_U3mTVKSMeENBxXq0DNNVUeoZ0bg@mail.gmail.com>
References: <CAJsiSVrcFfP6LswRAiVGsmPaG0Pi818wbLf0QeyXXfyv6QkbKg@mail.gmail.com>
	<CAG1y0scpChOZDK-iK4N_3m_U3mTVKSMeENBxXq0DNNVUeoZ0bg@mail.gmail.com>
Message-ID: <CA+YdgarmjG5CV5CrwnF8uV53BTe+BcZd6=u2HrBSXjyKYfeZwg@mail.gmail.com>

Greetings,

On Wed, Dec 14, 2011 at 2:00 PM, Fajar A. Nugraha <list at fajar.net> wrote:
> On Wed, Dec 14, 2011 at 3:22 PM, satya suresh kolapalli
> <kolapallisatya531 at gmail.com> wrote:
>> Sir,
>>
>> I need a document how to link gfs to gnbd
>
> Did you try Google?
> http://sourceware.org/cluster/gnbd/gnbd_usage.txt
>
> IIRC gnbd is not maintained anymore though. You'd probably have better
> luck using a SAN/NAS (or even just a properly-configured server
> running your favorite linux distro) exporting iscsi LUNs, then use
> gfs/gfs2 on top of that.
>

Openfiler, perhaps?

I know it cannot mount existing NTFS partitions.

But _is_ good.

-- 
Regards,

Rajagopal


From Jan.Huijsmans at interaccess.nl  Thu Dec 15 15:53:41 2011
From: Jan.Huijsmans at interaccess.nl (Jan Huijsmans)
Date: Thu, 15 Dec 2011 16:53:41 +0100
Subject: [Linux-cluster] Spontanous eviction of node cluster
Message-ID: <B5DEF1D8DA6DA74A9D4074584C7AA0BA0C6C350E25@NTHVSEXCHMAIL01>

Hello,

I'm at this moment investigating problems on a site with qdisk time-outs. The SAN is slow, additional hardware is ordered, which leads to the time-outs. At this moment almost all clusters are managing to stay on-line, mostly due to huge time-out values (150-300 seconds). We've seen disk time-outs of more then 100 seconds, but they've now dropped to max. 15 seconds.

However, there is one cluster that just keeps on killing it's primary node and I'm unable to find the reason why. All that's being logged on this cluster are the lines below:

Dec 15 00:15:10 node2 last message repeated 2 times
Dec 15 00:16:59 node2 qdiskd[3073]: <notice> Writing eviction notice for node 1
Dec 15 00:17:02 node2 qdiskd[3073]: <notice> Node 1 evicted
Dec 15 00:17:54 node2 openais[3040]: [TOTEM] The token was lost in the OPERATIONAL state.

Dec 15 00:15:10 node1 last message repeated 2 times
Dec 15 00:16:59 node1 openais[3297]: [CMAN ] cman killed by node 2 because we were killed by cman_tool or other application
Dec 15 00:17:00 node1 openais[3297]: [SERV ] Unloading all openais components

The quorumd has an interval of 3, tko of 50 (to get 150 seconds, but keep receiving warnings for the > 3 seconds time delays.)
The quorum_dev_poll is set to 300000, just as the totem token value.

I can't find any differences with the other clusters, which are now working fine, except for one big differences. There are no resources configured. The set-up is configured only to get gfs working. Both nodes have their own ip, no resources are shared (other then the gfs file systems, which sre concurrently available) and there are no check, other then qdisk availability and totem tokens.

Is this behaviour that can be expected when running a cluster without resources configured?

Greetings,

Jan Huijsmans


From kolapallisatya531 at gmail.com  Thu Dec 15 16:38:58 2011
From: kolapallisatya531 at gmail.com (satya suresh kolapalli)
Date: Thu, 15 Dec 2011 22:08:58 +0530
Subject: [Linux-cluster] Document on gnbd
In-Reply-To: <CA+YdgarmjG5CV5CrwnF8uV53BTe+BcZd6=u2HrBSXjyKYfeZwg@mail.gmail.com>
References: <CAJsiSVrcFfP6LswRAiVGsmPaG0Pi818wbLf0QeyXXfyv6QkbKg@mail.gmail.com>
	<CAG1y0scpChOZDK-iK4N_3m_U3mTVKSMeENBxXq0DNNVUeoZ0bg@mail.gmail.com>
	<CA+YdgarmjG5CV5CrwnF8uV53BTe+BcZd6=u2HrBSXjyKYfeZwg@mail.gmail.com>
Message-ID: <CAJsiSVpicvqCeX2jMOskxMN3F1BAF13yZybZVrSxujTQGYejuw@mail.gmail.com>

Sir,

In my office,there are 3 servers for postgressql database.

Out of three, two are running for postgressql database software and
one server for storage.

Those are in Red Hat Cluster.

I mean to say that how the two servers are connecting to that
storage.I want that document.Can u please provide that......!

And in the third node, I found there is a lvm that is shared on gnbd.
Can any one help how the lvm is linking up with gnbd.I want that
document also............!

Please do the needful................................!


Thanks and Regards,
Satyasuresh K


From list at fajar.net  Thu Dec 15 21:39:11 2011
From: list at fajar.net (Fajar A. Nugraha)
Date: Fri, 16 Dec 2011 04:39:11 +0700
Subject: [Linux-cluster] Document on gnbd
In-Reply-To: <CAJsiSVpicvqCeX2jMOskxMN3F1BAF13yZybZVrSxujTQGYejuw@mail.gmail.com>
References: <CAJsiSVrcFfP6LswRAiVGsmPaG0Pi818wbLf0QeyXXfyv6QkbKg@mail.gmail.com>
	<CAG1y0scpChOZDK-iK4N_3m_U3mTVKSMeENBxXq0DNNVUeoZ0bg@mail.gmail.com>
	<CA+YdgarmjG5CV5CrwnF8uV53BTe+BcZd6=u2HrBSXjyKYfeZwg@mail.gmail.com>
	<CAJsiSVpicvqCeX2jMOskxMN3F1BAF13yZybZVrSxujTQGYejuw@mail.gmail.com>
Message-ID: <CAG1y0sfRMyNbBDZoEtnK3b=GKmLdLQBWxJJRLf4rhtHjyO5Ldw@mail.gmail.com>

On Thu, Dec 15, 2011 at 11:38 PM, satya suresh kolapalli
<kolapallisatya531 at gmail.com> wrote:
> Sir,
>
> In my office,there are 3 servers for postgressql database.
>
> Out of three, two are running for postgressql database software and
> one server for storage.
>
> Those are in Red Hat Cluster.
>
> I mean to say that how the two servers are connecting to that
> storage.I want that document.Can u please provide that......!

Ask whoever installed it.

>
> And in the third node, I found there is a lvm that is shared on gnbd.
> Can any one help how the lvm is linking up with gnbd.I want that
> document also............!

Did you read the link I sent?
Did you read redhat's cluster documentation?
http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/4/html/GFS_6.1_Administration_Guide/ch-gnbd.html

-- 
Fajar


From sathyanarayanan.varadharajan at precisionit.co.in  Fri Dec 16 13:01:42 2011
From: sathyanarayanan.varadharajan at precisionit.co.in (SATHYA - IT)
Date: Fri, 16 Dec 2011 18:31:42 +0530
Subject: [Linux-cluster] GFS2 + ACL + SGID...
Message-ID: <006a01ccbbf2$dd66bfc0$98343f40$@precisionit.co.in>

Hi,

 
We had configured a clustered file server with (samba + ctdb + gfs2). GFS2
partition is mounted with ACL option. When we create a folder in GFS2
partition, irrespective of user being provided with full permission for the
folder via ACL, the user is not able to delete or rename the folder created
by him.

 
Please find the example:

 
Folder path: /test/abc/

Permission for /test is rwxrwx---+ 

Permission for /test/abc is rwxrws---+

For abc user been provided with full permission.. Where we were able to find
the permission thro' getfacl command.

 
Now the user is able to created folder inside the abc folder. But not able
to rename / delete the folder which he created. Can anyone kindly assist on
this.

 
Thanks

 
Sathya Narayanan V

Solution Architect    

M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
 <http://www.precisionit.co.in/> www.precisionit.co.in

 
This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111216/4d27acb6/attachment.htm>

From swhiteho at redhat.com  Mon Dec 19 15:07:25 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Mon, 19 Dec 2011 15:07:25 +0000
Subject: [Linux-cluster] GFS2 + ACL + SGID...
In-Reply-To: <006a01ccbbf2$dd66bfc0$98343f40$@precisionit.co.in>
References: <006a01ccbbf2$dd66bfc0$98343f40$@precisionit.co.in>
Message-ID: <1324307245.2723.52.camel@menhir>

Hi,

On Fri, 2011-12-16 at 18:31 +0530, SATHYA - IT wrote:
> Hi,
> 
>  
> 
> We had configured a clustered file server with (samba + ctdb + gfs2).
> GFS2 partition is mounted with ACL option. When we create a folder in
> GFS2 partition, irrespective of user being provided with full
> permission for the folder via ACL, the user is not able to delete or
> rename the folder created by him.
> 
>  
> 
> Please find the example:
> 
>  
> 
> Folder path: /test/abc/
> 
> Permission for /test is rwxrwx---+ 
> 
> Permission for /test/abc is rwxrws---+
> 
> For abc user been provided with full permission?. Where we were able
> to find the permission thro? getfacl command.
> 
>  
> 
> Now the user is able to created folder inside the abc folder. But not
> able to rename / delete the folder which he created. Can anyone kindly
> assist on this.
> 
I'm not sure I understand the problem here... can you post the complete
command sequence and show the permissions at each stage and what you
think should be different?

Steve.

>  
> 
> Thanks
> 
>  
> 
> Sathya Narayanan V
> 
> Solution Architect    
> 
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
> SERVICE - In PRECISION IT is a PASSION
> ---------------------------------------------------------------------------------------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> www.precisionit.co.in
> 
>  
> 
> 
> 
> This communication may contain confidential information. If you are
> not the intended recipient it may be unlawful for you to read, copy,
> distribute, disclose or otherwise use the information contained within
> this communication.. Errors and Omissions may occur in the contents of
> this Email arising out of or in connection with data transmission,
> network malfunction or failure, machine or software error,
> malfunction, or operator errors by the person who is sending the
> email. Precision Group accepts no responsibility for any such errors
> or omissions. The information, views and comments within this
> communication are those of the individual and not necessarily those of
> Precision Group. All email that is sent from/to Precision Group is
> scanned for the presence of computer viruses, security issues and
> inappropriate content. However, it is the recipient's responsibility
> to check any attachments for viruses before use. 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From nagius at astek.fr  Mon Dec 19 16:06:13 2011
From: nagius at astek.fr (nagius at astek.fr)
Date: Mon, 19 Dec 2011 17:06:13 +0100
Subject: [Linux-cluster] Freeze with cmirror and cLVM
Message-ID: <20111219170613.6wazswzvu8sw4cwc@webmail.astek.fr>

Hello all,

We've got a problem using a mirrored and clustered LV within a 5 nodes  
cluster.

Everything is fine, except when a node leave the cluster, with  
cman_tool leave, /etc/init/d/cman stop, electric poweroff or whatever.
In this case the storage just freeze on the others nodes, during 20  
seconds, until openais take into account the configuration change.

We can see this during the freeze in /var/log/messages :

?openais[3136]: [TOTEM] The token was lost in the OPERATIONAL state.
?[...]
?kernel: device-mapper: dm-log-clustered: [1lXq0Alc] Request timed  
out: [DM_CLOG_MARK_REGION/405288] - retrying
?[...]
?openais[3136]: [TOTEM] entering OPERATIONAL state.

I'm running lastest 5.7, with :
?- kernel-xen-2.6.18-274.12.1,
?- cman-2.0.115-85,
?- lvm2-cluster-2.02.84-6
?- cmirror-1.1.39-10
?- kmod-cmirror-xen-0.1.22-3

Is this a normal behavior ?

Thanks for help
Nicolas AGIUS
?

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111219/0d4c9823/attachment.htm>

From chris.alexander at kusiri.com  Mon Dec 19 19:01:05 2011
From: chris.alexander at kusiri.com (Chris Alexander)
Date: Mon, 19 Dec 2011 19:01:05 +0000
Subject: [Linux-cluster] Corosync memory problem
Message-ID: <CAOJZNMdz4R7qs2e9WCfhcN7=NnrOFkWWXAr8q+mL2rscgRSGdw@mail.gmail.com>

Hi all,

You may remember our recent issue, I believe this is being worsened if not
caused by another problem we have encountered.

Every few days our nodes are (non-simultaneously) being fenced due to
corosync taking up vast amounts of memory (i.e. 100% of the box). Please
see a sample log message, we have several just like this, [1] which occurs
when this happens. Note that it is not always corosync being killed - but
it is clearly corosync eating all the memory (see top output from three
servers at various times since their last reboot, [2] [3] [4]).

The corosync version is 1.2.3:
[g at cluster1 ~]$ corosync -v
Corosync Cluster Engine, version '1.2.3'
Copyright (c) 2006-2009 Red Hat, Inc.

We had a bit of a dig around and there are a significant number of bugfix
updates which address various segfaults, crashes, memory leaks etc. in this
minor as well as subsequent minor versions. [5] [6]

We're trialling the Fedora 14 (fc14) RPMs for corosync and corosynclib
(v1.4.2) to see if it fixes the particular issue we are seeing (i.e.
whether or not the memory keeps spiralling way out of control).

Has anyone else seen an issue like this, and is there any known way to
debug or fix it? If we can assist debugging by providing further
information, please specify what this is (and, if non-obvious, how to get
it).

Thanks again for your help

Chris

[1] http://pastebin.com/CbyERaRT
[2] http://pastebin.com/uk9ZGL7H
[3] http://pastebin.com/H4w5Zg46
[4] http://pastebin.com/KPZxL6UB
[5] http://rhn.redhat.com/errata/RHBA-2011-1361.html
[6] http://rhn.redhat.com/errata/RHBA-2011-1515.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111219/dbe7e386/attachment.htm>

From sathyanarayanan.varadharajan at precisionit.co.in  Tue Dec 20 04:14:54 2011
From: sathyanarayanan.varadharajan at precisionit.co.in (SATHYA - IT)
Date: Tue, 20 Dec 2011 09:44:54 +0530
Subject: [Linux-cluster] Linux-cluster Digest, Vol 92, Issue 14
In-Reply-To: <mailman.35.1324314005.17482.linux-cluster@redhat.com>
References: <mailman.35.1324314005.17482.linux-cluster@redhat.com>
Message-ID: <007201ccbecd$ee331500$ca993f00$@precisionit.co.in>

Hi Nicolos,

Can you post your cluster.conf file. Do you have the unfence option enabled
in your cluster configuration. At times, system hangs out without allowing
the access to files during this scenario.

Thanks

Sathya Narayanan V
Solution Architect	
M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
www.precisionit.co.in

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of
linux-cluster-request at redhat.com
Sent: Monday, December 19, 2011 10:30 PM
To: linux-cluster at redhat.com
Subject: Linux-cluster Digest, Vol 92, Issue 14

Send Linux-cluster mailing list submissions to
	linux-cluster at redhat.com

To subscribe or unsubscribe via the World Wide Web, visit
	https://www.redhat.com/mailman/listinfo/linux-cluster
or, via email, send a message with subject or body 'help' to
	linux-cluster-request at redhat.com

You can reach the person managing the list at
	linux-cluster-owner at redhat.com

When replying, please edit your Subject line so it is more specific than
"Re: Contents of Linux-cluster digest..."


Today's Topics:

   1. Re: GFS2 + ACL + SGID... (Steven Whitehouse)
   2. Freeze with cmirror and cLVM (nagius at astek.fr)


----------------------------------------------------------------------

Message: 1
Date: Mon, 19 Dec 2011 15:07:25 +0000
From: Steven Whitehouse <swhiteho at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] GFS2 + ACL + SGID...
Message-ID: <1324307245.2723.52.camel at menhir>
Content-Type: text/plain; charset="UTF-8"

Hi,

On Fri, 2011-12-16 at 18:31 +0530, SATHYA - IT wrote:
> Hi,
> 
>  
> 
> We had configured a clustered file server with (samba + ctdb + gfs2).
> GFS2 partition is mounted with ACL option. When we create a folder in
> GFS2 partition, irrespective of user being provided with full 
> permission for the folder via ACL, the user is not able to delete or 
> rename the folder created by him.
> 
>  
> 
> Please find the example:
> 
>  
> 
> Folder path: /test/abc/
> 
> Permission for /test is rwxrwx---+
> 
> Permission for /test/abc is rwxrws---+
> 
> For abc user been provided with full permission?. Where we were able 
> to find the permission thro? getfacl command.
> 
>  
> 
> Now the user is able to created folder inside the abc folder. But not 
> able to rename / delete the folder which he created. Can anyone kindly 
> assist on this.
> 
I'm not sure I understand the problem here... can you post the complete
command sequence and show the permissions at each stage and what you think
should be different?

Steve.

>  
> 
> Thanks
> 
>  
> 
> Sathya Narayanan V
> 
> Solution Architect    
> 
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521 
> SERVICE - In PRECISION IT is a PASSION
> ----------------------------------------------------------------------
> -----------------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> www.precisionit.co.in
> 
>  
> 
> 
> 
> This communication may contain confidential information. If you are 
> not the intended recipient it may be unlawful for you to read, copy, 
> distribute, disclose or otherwise use the information contained within 
> this communication.. Errors and Omissions may occur in the contents of 
> this Email arising out of or in connection with data transmission, 
> network malfunction or failure, machine or software error, 
> malfunction, or operator errors by the person who is sending the 
> email. Precision Group accepts no responsibility for any such errors 
> or omissions. The information, views and comments within this 
> communication are those of the individual and not necessarily those of 
> Precision Group. All email that is sent from/to Precision Group is 
> scanned for the presence of computer viruses, security issues and 
> inappropriate content. However, it is the recipient's responsibility 
> to check any attachments for viruses before use.
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


------------------------------

Message: 2
Date: Mon, 19 Dec 2011 17:06:13 +0100
From: nagius at astek.fr
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Freeze with cmirror and cLVM
Message-ID: <20111219170613.6wazswzvu8sw4cwc at webmail.astek.fr>
Content-Type: text/plain; charset="iso-8859-1"; Format="flowed";
	DelSp="Yes"

Hello all,

We've got a problem using a mirrored and clustered LV within a 5 nodes
cluster.

Everything is fine, except when a node leave the cluster, with cman_tool
leave, /etc/init/d/cman stop, electric poweroff or whatever.
In this case the storage just freeze on the others nodes, during 20 seconds,
until openais take into account the configuration change.

We can see this during the freeze in /var/log/messages :

?openais[3136]: [TOTEM] The token was lost in the OPERATIONAL state.
?[...]
?kernel: device-mapper: dm-log-clustered: [1lXq0Alc] Request timed
out: [DM_CLOG_MARK_REGION/405288] - retrying ?[...]
?openais[3136]: [TOTEM] entering OPERATIONAL state.

I'm running lastest 5.7, with :
?- kernel-xen-2.6.18-274.12.1,
?- cman-2.0.115-85,
?- lvm2-cluster-2.02.84-6
?- cmirror-1.1.39-10
?- kmod-cmirror-xen-0.1.22-3

Is this a normal behavior ?

Thanks for help
Nicolas AGIUS
?

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://www.redhat.com/archives/linux-cluster/attachments/20111219/0d4c9823
/attachment.html>

------------------------------

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

End of Linux-cluster Digest, Vol 92, Issue 14
*********************************************

This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.


From nagius at astek.fr  Tue Dec 20 12:31:35 2011
From: nagius at astek.fr (nagius)
Date: Tue, 20 Dec 2011 13:31:35 +0100
Subject: [Linux-cluster] Freeze with cmirror and cLVM
In-Reply-To: <007201ccbecd$ee331500$ca993f00$@precisionit.co.in>
References: <mailman.35.1324314005.17482.linux-cluster@redhat.com>
	<007201ccbecd$ee331500$ca993f00$@precisionit.co.in>
Message-ID: <000501ccbf13$519fc210$f4df4630$@fr>

Hi,

Here is my cluster.conf :

<cluster alias="dev_s2" config_version="3" name="dev_s2">
        <clusternodes>
                <clusternode name="xen0dev40" nodeid="1"
votes="1"></clusternode>
                <clusternode name="xen0dev41" nodeid="2"
votes="1"></clusternode>
                <clusternode name="xen0dev42" nodeid="3"
votes="1"></clusternode>
                <clusternode name="xen0dev43" nodeid="4"
votes="1"></clusternode>
                <clusternode name="xen0dev44" nodeid="5"
votes="1"></clusternode>
        </clusternodes>
</cluster>

Fencing has been disabled a boot time in /etc/sysconfig/cman :
 FENCE_JOIN="no"

Thanks,
Nicolas AGIUS


-----Message d'origine-----
De?: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] De la part de SATHYA - IT
Envoy??: mardi 20 d?cembre 2011 05:15
??: linux-cluster at redhat.com
Objet?: Re: [Linux-cluster] Linux-cluster Digest, Vol 92, Issue 14

Hi Nicolos,

Can you post your cluster.conf file. Do you have the unfence option enabled
in your cluster configuration. At times, system hangs out without allowing
the access to files during this scenario.

Thanks

Sathya Narayanan V
Solution Architect	
M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
www.precisionit.co.in

-----Original Message-----
Date: Mon, 19 Dec 2011 17:06:13 +0100
From: nagius at astek.fr
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Freeze with cmirror and cLVM
Message-ID: <20111219170613.6wazswzvu8sw4cwc at webmail.astek.fr>
Content-Type: text/plain; charset="iso-8859-1"; Format="flowed";
	DelSp="Yes"

Hello all,

We've got a problem using a mirrored and clustered LV within a 5 nodes
cluster.

Everything is fine, except when a node leave the cluster, with cman_tool
leave, /etc/init/d/cman stop, electric poweroff or whatever.
In this case the storage just freeze on the others nodes, during 20 seconds,
until openais take into account the configuration change.

We can see this during the freeze in /var/log/messages :

?openais[3136]: [TOTEM] The token was lost in the OPERATIONAL state.
?[...]
?kernel: device-mapper: dm-log-clustered: [1lXq0Alc] Request timed
out: [DM_CLOG_MARK_REGION/405288] - retrying ?[...]
?openais[3136]: [TOTEM] entering OPERATIONAL state.

I'm running lastest 5.7, with :
?- kernel-xen-2.6.18-274.12.1,
?- cman-2.0.115-85,
?- lvm2-cluster-2.02.84-6
?- cmirror-1.1.39-10
?- kmod-cmirror-xen-0.1.22-3

Is this a normal behavior ?

Thanks for help
Nicolas AGIUS
?

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From sathyanarayanan.varadharajan at precisionit.co.in  Wed Dec 21 10:41:40 2011
From: sathyanarayanan.varadharajan at precisionit.co.in (SATHYA - IT)
Date: Wed, 21 Dec 2011 16:11:40 +0530
Subject: [Linux-cluster] GFS2 Consistency...
Message-ID: <00e701ccbfcd$20d1c100$62754300$@precisionit.co.in>

Hi,

 
We are having an cluster environment running on GFS2 + CTDB + Samba. Due to
some unavoidable circumstances we were forced to hard reboot the server 2 to
3 times. After the 3rd time restart, everything worked fine without any
issues. But after 4 to 5 hours online, we got a trigger stating File System
consistency error in one of the GFS2 partition. Hard reboot of 2 to 3 times
a server, whether it affects the GFS2 file system. Is that the file system
is that much sensitive. Whereas we won't have any issues in ext3/ext4 file
system earlier in related scenarios. Can anyone revert on the GFS2
consistency and its recommendation to run in production environment.

 
Thanks

 
Sathya Narayanan V

Solution Architect    

M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
 <http://www.precisionit.co.in/> www.precisionit.co.in

 
This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111221/09f80f92/attachment.htm>

From adrew at redhat.com  Wed Dec 21 15:09:46 2011
From: adrew at redhat.com (Adam Drew)
Date: Wed, 21 Dec 2011 10:09:46 -0500
Subject: [Linux-cluster] GFS2 Consistency...
In-Reply-To: <00e701ccbfcd$20d1c100$62754300$@precisionit.co.in>
References: <00e701ccbfcd$20d1c100$62754300$@precisionit.co.in>
Message-ID: <C052F150-E6BA-41C5-B4C8-EC719105B73B@redhat.com>


> Hi,
>  
> We are having an cluster environment running on GFS2 + CTDB + Samba. Due to some unavoidable circumstances we were forced to hard reboot the server 2 to 3 times. After the 3rd time restart, everything worked fine without any issues. But after 4 to 5 hours online, we got a trigger stating File System consistency error in one of the GFS2 partition. Hard reboot of 2 to 3 times a server, whether it affects the GFS2 file system. Is that the file system is that much sensitive. Whereas we won?t have any issues in ext3/ext4 file system earlier in related scenarios. Can anyone revert on the GFS2 consistency and its recommendation to run in production environment.
>  
>  
> Thanks
>  
> Sathya Narayanan V
> Solution Architect   
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
> SERVICE - In PRECISION IT is a PASSION
> ---------------------------------------------------------------------------------------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> www.precisionit.co.in
>  
> 
> This communication may contain confidential information. If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use. 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

Hello Sathya,

If you are experiencing GFS2 withdraws you may be running into a bug , filesystem corruption, or both. If you have a Red Hat support contract I suggest opening a support case with Red Hat as soon as possible. When you open the support case you'll want to attach sosreports from all nodes (run the sosreport command on every node in the cluster and attach the resultant tarballs to the support case.) If you've hit a withdraw you are likely to keep hitting them and data loss or corruption is a tangible possibility; Red Hat support can help identify the source of the issue and provide relief.

If you don't have a Red Hat support contract then please reply to the thread with the kernel versions you are running on all nodes and the full withdraw message and call traces from the messages logs on the affected cluster. You'll be able to identify the withdraw easily in the logs. We'll want the withdraw messages which will include a pointer to the position in code where the error occurred and the nature of the withdraw. We'll also need the stack trace that follows the withdraw as it will allow us to understand the code path involved.

Thanks,
Adam

--
Adam Drew
Software Maintenance Engineer
Support Engineering Group
Red Hat, Inc.
Desk: (919) 754-4126
Cell: (919) 389-5334


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111221/b7c316f2/attachment.htm>

From sathyanarayanan.varadharajan at precisionit.co.in  Wed Dec 21 17:34:05 2011
From: sathyanarayanan.varadharajan at precisionit.co.in (SATHYA - IT)
Date: Wed, 21 Dec 2011 23:04:05 +0530
Subject: [Linux-cluster] Linux-cluster Digest, Vol 92, Issue 16
In-Reply-To: <mailman.39.1324486806.21757.linux-cluster@redhat.com>
References: <mailman.39.1324486806.21757.linux-cluster@redhat.com>
Message-ID: <013401ccc006$bdf134a0$39d39de0$@precisionit.co.in>

Hi Adam,

Thanks for your response. We are not currently having any redhat support for
HA and RS. We have the support only for the Server OS. 2 Nodes are running
with RHEL 6.2 in the cluster environment. The withdrawn message from the log
file are as follows:

Dec 21 10:32:43 filesrv2 avahi-daemon[9585]: Registering new address record
for 192.168.129.15 on bond0.IPv4.
Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: fatal:
filesystem consistency error
Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1:   RG =
160469200
Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1:   function =
gfs2_setbit, file = fs/gfs2/rgrp.c, line = 95
Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: about to
withdraw this file system
Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: telling LM to
unmount
Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: withdrawn
Dec 21 10:33:10 filesrv2 kernel: Pid: 26976, comm: smbd Not tainted
2.6.32-220.el6.x86_64 #1
Dec 21 10:33:10 filesrv2 kernel: Call Trace:
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa05508e2>] ?
gfs2_lm_withdraw+0x102/0x130 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81090bdf>] ?
wake_up_bit+0x2f/0x40
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0550a8a>] ?
gfs2_consist_rgrpd_i+0x4a/0x50 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa054b5d0>] ?
rgblk_free+0x1f0/0x200 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa054b992>] ?
gfs2_free_data+0x42/0x130 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0524f80>] ? do_strip+0x450/0x470
[gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa05251bf>] ?
recursive_scan.clone.0+0xbf/0x280 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81111aa7>] ?
find_lock_page+0x37/0x80
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff8115efb5>] ?
kmem_cache_alloc_notrace+0x115/0x130
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa052548d>] ?
trunc_dealloc+0x10d/0x130 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0537be1>] ?
gfs2_log_commit+0x1c1/0x300 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0526df3>] ?
gfs2_truncatei+0x4b3/0x820 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0543569>] ?
gfs2_setattr+0x119/0x3d0 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0543496>] ?
gfs2_setattr+0x46/0x3d0 [gfs2]
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81192698>] ?
notify_change+0x168/0x340
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81174de4>] ?
do_truncate+0x64/0xa0
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff811750b0>] ?
sys_ftruncate+0xf0/0x100
Dec 21 10:33:10 filesrv2 kernel: [<ffffffff8100b308>] ? tracesys+0xd9/0xde
Dec 21 10:33:16 filesrv2 avahi-daemon[9585]: Withdrawing address record for
192.168.129.15 on bond0.
Dec 21 10:36:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
than 120 seconds.
Dec 21 10:36:20 filesrv2 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Dec 21 10:36:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
9769      2 0x00000000
Dec 21 10:36:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
0000000000000000 000000004db7b07d
Dec 21 10:36:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
ffff88087fe51d70 ffffffff811a81be
Dec 21 10:36:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
000000000000f4e8 ffff880888437af8
Dec 21 10:36:20 filesrv2 kernel: Call Trace:
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff811a81be>] ?
submit_bh+0x10e/0x150
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa05389ca>]
gfs2_log_flush+0x46a/0x6e0 [gfs2]
Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa053736f>] ?
gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
autoremove_wake_function+0x0/0x40
Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
[gfs2]
Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
[gfs2]
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
Dec 21 10:36:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
Dec 21 10:38:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
than 120 seconds.
Dec 21 10:38:20 filesrv2 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Dec 21 10:38:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
9769      2 0x00000000
Dec 21 10:38:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
0000000000000000 000000004db7b07d
Dec 21 10:38:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
ffff88087fe51d70 ffffffff811a81be
Dec 21 10:38:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
000000000000f4e8 ffff880888437af8
Dec 21 10:38:20 filesrv2 kernel: Call Trace:
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff811a81be>] ?
submit_bh+0x10e/0x150
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa05389ca>]
gfs2_log_flush+0x46a/0x6e0 [gfs2]
Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa053736f>] ?
gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
autoremove_wake_function+0x0/0x40
Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
[gfs2]
Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
[gfs2]
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
Dec 21 10:38:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
Dec 21 10:40:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
than 120 seconds.
Dec 21 10:40:20 filesrv2 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Dec 21 10:40:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
9769      2 0x00000000
Dec 21 10:40:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
0000000000000000 000000004db7b07d
Dec 21 10:40:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
ffff88087fe51d70 ffffffff811a81be
Dec 21 10:40:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
000000000000f4e8 ffff880888437af8
Dec 21 10:40:20 filesrv2 kernel: Call Trace:
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff811a81be>] ?
submit_bh+0x10e/0x150
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa05389ca>]
gfs2_log_flush+0x46a/0x6e0 [gfs2]
Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa053736f>] ?
gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
autoremove_wake_function+0x0/0x40
Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
[gfs2]
Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
[gfs2]
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
Dec 21 10:40:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
Dec 21 10:42:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
than 120 seconds.
Dec 21 10:42:20 filesrv2 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Dec 21 10:42:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
9769      2 0x00000000
Dec 21 10:42:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
0000000000000000 000000004db7b07d
Dec 21 10:42:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
ffff88087fe51d70 ffffffff811a81be
Dec 21 10:42:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
000000000000f4e8 ffff880888437af8
Dec 21 10:42:20 filesrv2 kernel: Call Trace:
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff811a81be>] ?
submit_bh+0x10e/0x150
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa05389ca>]
gfs2_log_flush+0x46a/0x6e0 [gfs2]
Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa053736f>] ?
gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
autoremove_wake_function+0x0/0x40
Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
[gfs2]
Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
[gfs2]
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
Dec 21 10:42:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: fatal: invalid
metadata block
Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1:   bh = 51194408
(magic number)
Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1:   function =
gfs2_meta_indirect_buffer, file = fs/gfs2/meta_io.c, line = 401
Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: about to withdraw
this file system
Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: telling LM to
unmount
Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: withdrawn
Dec 21 10:43:42 filesrv2 kernel: Pid: 9710, comm: glock_workqueue Not
tainted 2.6.32-220.el6.x86_64 #1
Dec 21 10:43:42 filesrv2 kernel: Call Trace:
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa05508e2>] ?
gfs2_lm_withdraw+0x102/0x130 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff81090c30>] ?
wake_bit_function+0x0/0x50
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0550a35>] ?
gfs2_meta_check_ii+0x45/0x50 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa053b4a5>] ?
gfs2_meta_indirect_buffer+0x185/0x190 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0535e49>] ?
gfs2_inode_refresh+0x29/0x340 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff810ea694>] ?
rb_reserve_next_event+0xb4/0x370
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0535488>] ?
inode_go_lock+0x88/0xf0 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0533c07>] ?
do_promote+0x1c7/0x340 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0533ef8>] ?
finish_xmote+0x178/0x410 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0534d03>] ?
glock_work_func+0x133/0x1b0 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0534bd0>] ?
glock_work_func+0x0/0x1b0 [gfs2]
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8108b2b0>] ?
worker_thread+0x170/0x2a0
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff81090bf0>] ?
autoremove_wake_function+0x0/0x40
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8108b140>] ?
worker_thread+0x0/0x2a0
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff81090886>] ? kthread+0x96/0xa0
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8100c14a>] ? child_rip+0xa/0x20
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20

Thanks

Sathya Narayanan V
Solution Architect	
M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
www.precisionit.co.in


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of
linux-cluster-request at redhat.com
Sent: Wednesday, December 21, 2011 10:30 PM
To: linux-cluster at redhat.com
Subject: Linux-cluster Digest, Vol 92, Issue 16

Send Linux-cluster mailing list submissions to
	linux-cluster at redhat.com

To subscribe or unsubscribe via the World Wide Web, visit
	https://www.redhat.com/mailman/listinfo/linux-cluster
or, via email, send a message with subject or body 'help' to
	linux-cluster-request at redhat.com

You can reach the person managing the list at
	linux-cluster-owner at redhat.com

When replying, please edit your Subject line so it is more specific than
"Re: Contents of Linux-cluster digest..."


Today's Topics:

   1. GFS2 Consistency... (SATHYA - IT)
   2. Re: GFS2 Consistency... (Adam Drew)


----------------------------------------------------------------------

Message: 1
Date: Wed, 21 Dec 2011 16:11:40 +0530
From: "SATHYA - IT" <sathyanarayanan.varadharajan at precisionit.co.in>
To: <linux-cluster at redhat.com>
Subject: [Linux-cluster] GFS2 Consistency...
Message-ID: <00e701ccbfcd$20d1c100$62754300$@precisionit.co.in>
Content-Type: text/plain; charset="us-ascii"

Hi,

 
We are having an cluster environment running on GFS2 + CTDB + Samba. Due to
some unavoidable circumstances we were forced to hard reboot the server 2 to
3 times. After the 3rd time restart, everything worked fine without any
issues. But after 4 to 5 hours online, we got a trigger stating File System
consistency error in one of the GFS2 partition. Hard reboot of 2 to 3 times
a server, whether it affects the GFS2 file system. Is that the file system
is that much sensitive. Whereas we won't have any issues in ext3/ext4 file
system earlier in related scenarios. Can anyone revert on the GFS2
consistency and its recommendation to run in production environment.

 
Thanks

 
Sathya Narayanan V

Solution Architect    

M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521 SERVICE
- In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
 <http://www.precisionit.co.in/> www.precisionit.co.in

 
This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read,
copy, distribute, disclose or otherwise use the information contained within
this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of
or in connection with data transmission, network malfunction or failure,
machine or software error, malfunction, or operator errors by the person who
is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions.
The information, views and comments within this communication are those of
the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence
of computer viruses, security issues and inappropriate content. However, it
is the recipient's responsibility to check any attachments for viruses
before use.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://www.redhat.com/archives/linux-cluster/attachments/20111221/09f80f92
/attachment.html>

------------------------------

Message: 2
Date: Wed, 21 Dec 2011 10:09:46 -0500
From: Adam Drew <adrew at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] GFS2 Consistency...
Message-ID: <C052F150-E6BA-41C5-B4C8-EC719105B73B at redhat.com>
Content-Type: text/plain; charset="windows-1252"


> Hi,
>  
> We are having an cluster environment running on GFS2 + CTDB + Samba. Due
to some unavoidable circumstances we were forced to hard reboot the server 2
to 3 times. After the 3rd time restart, everything worked fine without any
issues. But after 4 to 5 hours online, we got a trigger stating File System
consistency error in one of the GFS2 partition. Hard reboot of 2 to 3 times
a server, whether it affects the GFS2 file system. Is that the file system
is that much sensitive. Whereas we won?t have any issues in ext3/ext4 file
system earlier in related scenarios. Can anyone revert on the GFS2
consistency and its recommendation to run in production environment.
>  
>  
> Thanks
>  
> Sathya Narayanan V
> Solution Architect   
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521 
> SERVICE - In PRECISION IT is a PASSION
> ----------------------------------------------------------------------
> -----------------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> www.precisionit.co.in
>  
> 
> This communication may contain confidential information. If you are not
the intended recipient it may be unlawful for you to read, copy, distribute,
disclose or otherwise use the information contained within this
communication.. Errors and Omissions may occur in the contents of this Email
arising out of or in connection with data transmission, network malfunction
or failure, machine or software error, malfunction, or operator errors by
the person who is sending the email. Precision Group accepts no
responsibility for any such errors or omissions. The information, views and
comments within this communication are those of the individual and not
necessarily those of Precision Group. All email that is sent from/to
Precision Group is scanned for the presence of computer viruses, security
issues and inappropriate content. However, it is the recipient's
responsibility to check any attachments for viruses before use. 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

Hello Sathya,

If you are experiencing GFS2 withdraws you may be running into a bug ,
filesystem corruption, or both. If you have a Red Hat support contract I
suggest opening a support case with Red Hat as soon as possible. When you
open the support case you'll want to attach sosreports from all nodes (run
the sosreport command on every node in the cluster and attach the resultant
tarballs to the support case.) If you've hit a withdraw you are likely to
keep hitting them and data loss or corruption is a tangible possibility; Red
Hat support can help identify the source of the issue and provide relief.

If you don't have a Red Hat support contract then please reply to the thread
with the kernel versions you are running on all nodes and the full withdraw
message and call traces from the messages logs on the affected cluster.
You'll be able to identify the withdraw easily in the logs. We'll want the
withdraw messages which will include a pointer to the position in code where
the error occurred and the nature of the withdraw. We'll also need the stack
trace that follows the withdraw as it will allow us to understand the code
path involved.

Thanks,
Adam

--
Adam Drew
Software Maintenance Engineer
Support Engineering Group
Red Hat, Inc.
Desk: (919) 754-4126
Cell: (919) 389-5334


-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<https://www.redhat.com/archives/linux-cluster/attachments/20111221/b7c316f2
/attachment.html>

------------------------------

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

End of Linux-cluster Digest, Vol 92, Issue 16
*********************************************

This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.


From chris.alexander at kusiri.com  Wed Dec 21 18:04:55 2011
From: chris.alexander at kusiri.com (Chris Alexander)
Date: Wed, 21 Dec 2011 18:04:55 +0000
Subject: [Linux-cluster] Corosync memory problem
In-Reply-To: <CAOJZNMdz4R7qs2e9WCfhcN7=NnrOFkWWXAr8q+mL2rscgRSGdw@mail.gmail.com>
References: <CAOJZNMdz4R7qs2e9WCfhcN7=NnrOFkWWXAr8q+mL2rscgRSGdw@mail.gmail.com>
Message-ID: <CAOJZNMeBP=-jQrh-F7cTtK4tNy+HGHsLHvuf5ouWbBg-UdkfPw@mail.gmail.com>

An update in case anyone ever runs into something like this - we had
corosync-notify running on the servers and once we removed that and
restarted the cluster stack, corosync seemed to return to normal.

Additionally, according to the corosync mailing list, the cluster 1.2.3
version is basically very similar to (if not the same as) the 1.4 that they
currently have released, someone's been backporting.

Cheers

Chris

On 19 December 2011 19:01, Chris Alexander <chris.alexander at kusiri.com>wrote:

> Hi all,
>
> You may remember our recent issue, I believe this is being worsened if not
> caused by another problem we have encountered.
>
> Every few days our nodes are (non-simultaneously) being fenced due to
> corosync taking up vast amounts of memory (i.e. 100% of the box). Please
> see a sample log message, we have several just like this, [1] which occurs
> when this happens. Note that it is not always corosync being killed - but
> it is clearly corosync eating all the memory (see top output from three
> servers at various times since their last reboot, [2] [3] [4]).
>
> The corosync version is 1.2.3:
> [g at cluster1 ~]$ corosync -v
> Corosync Cluster Engine, version '1.2.3'
> Copyright (c) 2006-2009 Red Hat, Inc.
>
> We had a bit of a dig around and there are a significant number of bugfix
> updates which address various segfaults, crashes, memory leaks etc. in this
> minor as well as subsequent minor versions. [5] [6]
>
> We're trialling the Fedora 14 (fc14) RPMs for corosync and corosynclib
> (v1.4.2) to see if it fixes the particular issue we are seeing (i.e.
> whether or not the memory keeps spiralling way out of control).
>
> Has anyone else seen an issue like this, and is there any known way to
> debug or fix it? If we can assist debugging by providing further
> information, please specify what this is (and, if non-obvious, how to get
> it).
>
> Thanks again for your help
>
> Chris
>
> [1] http://pastebin.com/CbyERaRT
> [2] http://pastebin.com/uk9ZGL7H
> [3] http://pastebin.com/H4w5Zg46
> [4] http://pastebin.com/KPZxL6UB
> [5] http://rhn.redhat.com/errata/RHBA-2011-1361.html
> [6] http://rhn.redhat.com/errata/RHBA-2011-1515.html
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111221/e7ac3799/attachment.htm>

From adrew at redhat.com  Wed Dec 21 19:45:00 2011
From: adrew at redhat.com (Adam Drew)
Date: Wed, 21 Dec 2011 14:45:00 -0500
Subject: [Linux-cluster] Linux-cluster Digest, Vol 92, Issue 16
In-Reply-To: <013401ccc006$bdf134a0$39d39de0$@precisionit.co.in>
References: <mailman.39.1324486806.21757.linux-cluster@redhat.com>
	<013401ccc006$bdf134a0$39d39de0$@precisionit.co.in>
Message-ID: <09D9D545-B4E9-4497-9793-1D1CEB430602@redhat.com>


On Dec 21, 2011, at 12:34 PM, SATHYA - IT wrote:

> Hi Adam,
> 
> Thanks for your response. We are not currently having any redhat support for
> HA and RS. We have the support only for the Server OS. 2 Nodes are running
> with RHEL 6.2 in the cluster environment. The withdrawn message from the log
> file are as follows:
> 
> Dec 21 10:32:43 filesrv2 avahi-daemon[9585]: Registering new address record
> for 192.168.129.15 on bond0.IPv4.
> Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: fatal:
> filesystem consistency error
> Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1:   RG =
> 160469200
> Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1:   function =
> gfs2_setbit, file = fs/gfs2/rgrp.c, line = 95
> Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: about to
> withdraw this file system
> Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: telling LM to
> unmount
> Dec 21 10:33:10 filesrv2 kernel: GFS2: fsid=samba:hadata01.1: withdrawn
> Dec 21 10:33:10 filesrv2 kernel: Pid: 26976, comm: smbd Not tainted
> 2.6.32-220.el6.x86_64 #1
> Dec 21 10:33:10 filesrv2 kernel: Call Trace:
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa05508e2>] ?
> gfs2_lm_withdraw+0x102/0x130 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81090bdf>] ?
> wake_up_bit+0x2f/0x40
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0550a8a>] ?
> gfs2_consist_rgrpd_i+0x4a/0x50 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa054b5d0>] ?
> rgblk_free+0x1f0/0x200 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa054b992>] ?
> gfs2_free_data+0x42/0x130 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0524f80>] ? do_strip+0x450/0x470
> [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa05251bf>] ?
> recursive_scan.clone.0+0xbf/0x280 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81111aa7>] ?
> find_lock_page+0x37/0x80
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff8115efb5>] ?
> kmem_cache_alloc_notrace+0x115/0x130
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa052548d>] ?
> trunc_dealloc+0x10d/0x130 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0537be1>] ?
> gfs2_log_commit+0x1c1/0x300 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0526df3>] ?
> gfs2_truncatei+0x4b3/0x820 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0543569>] ?
> gfs2_setattr+0x119/0x3d0 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffffa0543496>] ?
> gfs2_setattr+0x46/0x3d0 [gfs2]
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81192698>] ?
> notify_change+0x168/0x340
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff81174de4>] ?
> do_truncate+0x64/0xa0
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff811750b0>] ?
> sys_ftruncate+0xf0/0x100
> Dec 21 10:33:10 filesrv2 kernel: [<ffffffff8100b308>] ? tracesys+0xd9/0xde
> Dec 21 10:33:16 filesrv2 avahi-daemon[9585]: Withdrawing address record for
> 192.168.129.15 on bond0.
> Dec 21 10:36:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
> than 120 seconds.
> Dec 21 10:36:20 filesrv2 kernel: "echo 0 >
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Dec 21 10:36:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
> 9769      2 0x00000000
> Dec 21 10:36:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
> 0000000000000000 000000004db7b07d
> Dec 21 10:36:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
> ffff88087fe51d70 ffffffff811a81be
> Dec 21 10:36:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
> 000000000000f4e8 ffff880888437af8
> Dec 21 10:36:20 filesrv2 kernel: Call Trace:
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff811a81be>] ?
> submit_bh+0x10e/0x150
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa05389ca>]
> gfs2_log_flush+0x46a/0x6e0 [gfs2]
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa053736f>] ?
> gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
> autoremove_wake_function+0x0/0x40
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
> [gfs2]
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
> [gfs2]
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
> Dec 21 10:36:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
> Dec 21 10:38:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
> than 120 seconds.
> Dec 21 10:38:20 filesrv2 kernel: "echo 0 >
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Dec 21 10:38:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
> 9769      2 0x00000000
> Dec 21 10:38:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
> 0000000000000000 000000004db7b07d
> Dec 21 10:38:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
> ffff88087fe51d70 ffffffff811a81be
> Dec 21 10:38:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
> 000000000000f4e8 ffff880888437af8
> Dec 21 10:38:20 filesrv2 kernel: Call Trace:
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff811a81be>] ?
> submit_bh+0x10e/0x150
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa05389ca>]
> gfs2_log_flush+0x46a/0x6e0 [gfs2]
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa053736f>] ?
> gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
> autoremove_wake_function+0x0/0x40
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
> [gfs2]
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
> [gfs2]
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
> Dec 21 10:38:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
> Dec 21 10:40:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
> than 120 seconds.
> Dec 21 10:40:20 filesrv2 kernel: "echo 0 >
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Dec 21 10:40:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
> 9769      2 0x00000000
> Dec 21 10:40:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
> 0000000000000000 000000004db7b07d
> Dec 21 10:40:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
> ffff88087fe51d70 ffffffff811a81be
> Dec 21 10:40:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
> 000000000000f4e8 ffff880888437af8
> Dec 21 10:40:20 filesrv2 kernel: Call Trace:
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff811a81be>] ?
> submit_bh+0x10e/0x150
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa05389ca>]
> gfs2_log_flush+0x46a/0x6e0 [gfs2]
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa053736f>] ?
> gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
> autoremove_wake_function+0x0/0x40
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
> [gfs2]
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
> [gfs2]
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
> Dec 21 10:40:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
> Dec 21 10:42:20 filesrv2 kernel: INFO: task gfs2_logd:9769 blocked for more
> than 120 seconds.
> Dec 21 10:42:20 filesrv2 kernel: "echo 0 >
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Dec 21 10:42:20 filesrv2 kernel: gfs2_logd     D ffff8808a7824100     0
> 9769      2 0x00000000
> Dec 21 10:42:20 filesrv2 kernel: ffff88087fe51dd0 0000000000000046
> 0000000000000000 000000004db7b07d
> Dec 21 10:42:20 filesrv2 kernel: ffff88084d820cf8 0000000000000441
> ffff88087fe51d70 ffffffff811a81be
> Dec 21 10:42:20 filesrv2 kernel: ffff880888437af8 ffff88087fe51fd8
> 000000000000f4e8 ffff880888437af8
> Dec 21 10:42:20 filesrv2 kernel: Call Trace:
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff811a81be>] ?
> submit_bh+0x10e/0x150
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff814ed1e3>] io_schedule+0x73/0xc0
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa05389ca>]
> gfs2_log_flush+0x46a/0x6e0 [gfs2]
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa053736f>] ?
> gfs2_ail1_empty+0x2f/0x1b0 [gfs2]
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff81090bf0>] ?
> autoremove_wake_function+0x0/0x40
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa0538d17>] gfs2_logd+0xd7/0x140
> [gfs2]
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffffa0538c40>] ? gfs2_logd+0x0/0x140
> [gfs2]
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff81090886>] kthread+0x96/0xa0
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff8100c14a>] child_rip+0xa/0x20
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
> Dec 21 10:42:20 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
> Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: fatal: invalid
> metadata block
> Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1:   bh = 51194408
> (magic number)
> Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1:   function =
> gfs2_meta_indirect_buffer, file = fs/gfs2/meta_io.c, line = 401
> Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: about to withdraw
> this file system
> Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: telling LM to
> unmount
> Dec 21 10:43:42 filesrv2 kernel: GFS2: fsid=samba:gen01.1: withdrawn
> Dec 21 10:43:42 filesrv2 kernel: Pid: 9710, comm: glock_workqueue Not
> tainted 2.6.32-220.el6.x86_64 #1
> Dec 21 10:43:42 filesrv2 kernel: Call Trace:
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa05508e2>] ?
> gfs2_lm_withdraw+0x102/0x130 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff81090c30>] ?
> wake_bit_function+0x0/0x50
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0550a35>] ?
> gfs2_meta_check_ii+0x45/0x50 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa053b4a5>] ?
> gfs2_meta_indirect_buffer+0x185/0x190 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0535e49>] ?
> gfs2_inode_refresh+0x29/0x340 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff810ea694>] ?
> rb_reserve_next_event+0xb4/0x370
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0535488>] ?
> inode_go_lock+0x88/0xf0 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0533c07>] ?
> do_promote+0x1c7/0x340 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0533ef8>] ?
> finish_xmote+0x178/0x410 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0534d03>] ?
> glock_work_func+0x133/0x1b0 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffffa0534bd0>] ?
> glock_work_func+0x0/0x1b0 [gfs2]
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8108b2b0>] ?
> worker_thread+0x170/0x2a0
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff81090bf0>] ?
> autoremove_wake_function+0x0/0x40
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8108b140>] ?
> worker_thread+0x0/0x2a0
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff81090886>] ? kthread+0x96/0xa0
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8100c14a>] ? child_rip+0xa/0x20
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff810907f0>] ? kthread+0x0/0xa0
> Dec 21 10:43:42 filesrv2 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20
> 
> Thanks
> 
> Sathya Narayanan V
> Solution Architect	
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
> SERVICE - In PRECISION IT is a PASSION
> ----------------------------------------------------------------------------
> -----------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> www.precisionit.co.in
> 
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of
> linux-cluster-request at redhat.com
> Sent: Wednesday, December 21, 2011 10:30 PM
> To: linux-cluster at redhat.com
> Subject: Linux-cluster Digest, Vol 92, Issue 16
> 
> Send Linux-cluster mailing list submissions to
> 	linux-cluster at redhat.com
> 
> To subscribe or unsubscribe via the World Wide Web, visit
> 	https://www.redhat.com/mailman/listinfo/linux-cluster
> or, via email, send a message with subject or body 'help' to
> 	linux-cluster-request at redhat.com
> 
> You can reach the person managing the list at
> 	linux-cluster-owner at redhat.com
> 
> When replying, please edit your Subject line so it is more specific than
> "Re: Contents of Linux-cluster digest..."
> 
> 
> Today's Topics:
> 
>   1. GFS2 Consistency... (SATHYA - IT)
>   2. Re: GFS2 Consistency... (Adam Drew)
> 
> 
> ----------------------------------------------------------------------
> 
> Message: 1
> Date: Wed, 21 Dec 2011 16:11:40 +0530
> From: "SATHYA - IT" <sathyanarayanan.varadharajan at precisionit.co.in>
> To: <linux-cluster at redhat.com>
> Subject: [Linux-cluster] GFS2 Consistency...
> Message-ID: <00e701ccbfcd$20d1c100$62754300$@precisionit.co.in>
> Content-Type: text/plain; charset="us-ascii"
> 
> Hi,
> 
> 
> 
> We are having an cluster environment running on GFS2 + CTDB + Samba. Due to
> some unavoidable circumstances we were forced to hard reboot the server 2 to
> 3 times. After the 3rd time restart, everything worked fine without any
> issues. But after 4 to 5 hours online, we got a trigger stating File System
> consistency error in one of the GFS2 partition. Hard reboot of 2 to 3 times
> a server, whether it affects the GFS2 file system. Is that the file system
> is that much sensitive. Whereas we won't have any issues in ext3/ext4 file
> system earlier in related scenarios. Can anyone revert on the GFS2
> consistency and its recommendation to run in production environment.
> 
> 
> 
> 
> 
> Thanks
> 
> 
> 
> Sathya Narayanan V
> 
> Solution Architect    
> 
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521 SERVICE
> - In PRECISION IT is a PASSION
> ----------------------------------------------------------------------------
> -----------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> <http://www.precisionit.co.in/> www.precisionit.co.in
> 
> 
> 
> 
> This communication may contain confidential information. 
> If you are not the intended recipient it may be unlawful for you to read,
> copy, distribute, disclose or otherwise use the information contained within
> this communication.. 
> Errors and Omissions may occur in the contents of this Email arising out of
> or in connection with data transmission, network malfunction or failure,
> machine or software error, malfunction, or operator errors by the person who
> is sending the email. 
> Precision Group accepts no responsibility for any such errors or omissions.
> The information, views and comments within this communication are those of
> the individual and not necessarily those of Precision Group. 
> All email that is sent from/to Precision Group is scanned for the presence
> of computer viruses, security issues and inappropriate content. However, it
> is the recipient's responsibility to check any attachments for viruses
> before use.
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL:
> <https://www.redhat.com/archives/linux-cluster/attachments/20111221/09f80f92
> /attachment.html>
> 
> ------------------------------
> 
> Message: 2
> Date: Wed, 21 Dec 2011 10:09:46 -0500
> From: Adam Drew <adrew at redhat.com>
> To: linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] GFS2 Consistency...
> Message-ID: <C052F150-E6BA-41C5-B4C8-EC719105B73B at redhat.com>
> Content-Type: text/plain; charset="windows-1252"
> 
> 
>> Hi,
>> 
>> We are having an cluster environment running on GFS2 + CTDB + Samba. Due
> to some unavoidable circumstances we were forced to hard reboot the server 2
> to 3 times. After the 3rd time restart, everything worked fine without any
> issues. But after 4 to 5 hours online, we got a trigger stating File System
> consistency error in one of the GFS2 partition. Hard reboot of 2 to 3 times
> a server, whether it affects the GFS2 file system. Is that the file system
> is that much sensitive. Whereas we won?t have any issues in ext3/ext4 file
> system earlier in related scenarios. Can anyone revert on the GFS2
> consistency and its recommendation to run in production environment.
>> 
>> 
>> Thanks
>> 
>> Sathya Narayanan V
>> Solution Architect   
>> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521 
>> SERVICE - In PRECISION IT is a PASSION
>> ----------------------------------------------------------------------
>> -----------------------------------
>> Precision Infomatic (M) Pvt Ltd
>> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
>> www.precisionit.co.in
>> 
>> 
>> This communication may contain confidential information. If you are not
> the intended recipient it may be unlawful for you to read, copy, distribute,
> disclose or otherwise use the information contained within this
> communication.. Errors and Omissions may occur in the contents of this Email
> arising out of or in connection with data transmission, network malfunction
> or failure, machine or software error, malfunction, or operator errors by
> the person who is sending the email. Precision Group accepts no
> responsibility for any such errors or omissions. The information, views and
> comments within this communication are those of the individual and not
> necessarily those of Precision Group. All email that is sent from/to
> Precision Group is scanned for the presence of computer viruses, security
> issues and inappropriate content. However, it is the recipient's
> responsibility to check any attachments for viruses before use. 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> Hello Sathya,
> 
> If you are experiencing GFS2 withdraws you may be running into a bug ,
> filesystem corruption, or both. If you have a Red Hat support contract I
> suggest opening a support case with Red Hat as soon as possible. When you
> open the support case you'll want to attach sosreports from all nodes (run
> the sosreport command on every node in the cluster and attach the resultant
> tarballs to the support case.) If you've hit a withdraw you are likely to
> keep hitting them and data loss or corruption is a tangible possibility; Red
> Hat support can help identify the source of the issue and provide relief.
> 
> If you don't have a Red Hat support contract then please reply to the thread
> with the kernel versions you are running on all nodes and the full withdraw
> message and call traces from the messages logs on the affected cluster.
> You'll be able to identify the withdraw easily in the logs. We'll want the
> withdraw messages which will include a pointer to the position in code where
> the error occurred and the nature of the withdraw. We'll also need the stack
> trace that follows the withdraw as it will allow us to understand the code
> path involved.
> 
> Thanks,
> Adam
> 
> --
> Adam Drew
> Software Maintenance Engineer
> Support Engineering Group
> Red Hat, Inc.
> Desk: (919) 754-4126
> Cell: (919) 389-5334
> 
> 
> 
> 
> 
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL:
> <https://www.redhat.com/archives/linux-cluster/attachments/20111221/b7c316f2
> /attachment.html>
> 
> ------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> End of Linux-cluster Digest, Vol 92, Issue 16
> *********************************************
> 
> This communication may contain confidential information. 
> If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
> Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
> Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
> All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

We're withdrawing in gfs2_meta_indirect_buffer which is a function that loads metadata into a buffer for use.  Where we are specifically failing is in a call to gfs2_metatype_check which does of the work of ensuring that the metadata we're loading into a buffer is of the type we expect. There's a macro and another function we pass through but ultimately we end up in gfs2_metatype_check_i which compares the expected metadata type with the type found in the buffer loaded from disk. If they don't match we withdraw.

So what does this all mean? It means that either the data on disk is corrupt (a section of what we expect should be metadata is not, or is the wrong kind of metadata) or it is some kind of memory corruption where the data in memory is being corrupted such that when we examine the buffer it appears to be the wrong type. From what I have here to analyze I cannot say which it is.

Your first action should be to unmount the filesystem in question from all nodes, update gfs2-utils, and run a gfs2_fsck on the filesystem. After the filesystem check is completed you can mount the filesystem back up and return to production. If the issue goes away then it was some anomalous sort of on-disk corruption. If the issue comes back then it is quite likely to either be a bug in GFS2 or something very wrong with the environment or workload (such as mounted without locking, or something doing block-level writes to metadata areas on disk, or something of that nature.)

If you find that you encounter further difficulties with the filesystem post-fsck I would advise, if you can, purchasing support for the Resilient Storage add-on entitlement and engaging support so that my group and I can assist you further. If you are unable to do so then you can create a bug report at bugzilla.redhat.com; but note that there are no production SLAs on bugzilla.

Good luck. I hope this helps in some capacity.

Thanks,
Adam

--
Adam Drew
Software Maintenance Engineer
Support Engineering Group
Red Hat, Inc.
Desk: (919) 754-4126
Cell: (919) 389-5334


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111221/bd54cd8b/attachment.htm>

From sherlock_zhang at uit.com.cn  Thu Dec 22 07:30:54 2011
From: sherlock_zhang at uit.com.cn (Sherlock Zhang)
Date: Thu, 22 Dec 2011 15:30:54 +0800
Subject: [Linux-cluster] error messages while use fsck.gfs2
In-Reply-To: <e711795e-92d0-45f0-96f8-160e8da07e7c@zmail16.collab.prod.int.phx2.redhat.com>
References: <CAHefBgR_0keH6A96V+9zP=2QzV6psgQgxy8V6AHJfWJhFojy7Q@mail.gmail.com>
	<e711795e-92d0-45f0-96f8-160e8da07e7c@zmail16.collab.prod.int.phx2.redhat.com>
Message-ID: <CAHefBgSKHEjAnXq5pv1OmZ9td0qP_9E+_X510rvFpU0zYNa+Bw@mail.gmail.com>

Hi all
    There are 4 storage with RAID5 and last time the dg02 was gone and the
repair are still progress.
but today I was been told there is another dg has gone :(
this time I didn't get any info from the disk, did any one could tell me
why would this happen.

the disk dump info can download via this URL
"http://www.waalker.com/disk.dg03.dump <http://www.waalker.com/disk.dump>"
and the file's md5sum is
ec92b4d5f84e52a91e9128a789da4579  disk.dg03.dump


On Mon, Nov 14, 2011 at 11:00 PM, Bob Peterson <rpeterso at redhat.com> wrote:

> ----- Original Message -----
> | Hi Bob
> |     You can download the disk.dump file via this URL
> | "http://www.waalker.com/disk.dump"
> | and the file's md5sum is
> | 1e43719ca086220478a9aee9c09c727b  disk.dump
> | if is there anything other can help ascertain the problem please let
> | me
> | know.
> |
> | Thank you
>
> Hi Sherlock,
>
> I've taken a close look at the image file you created.
> This appears to be a normal, everyday GFS2 file system
> except there is a section of 16 blocks (or 0x10 in hex)
> that are completely destroyed near the beginning of the
> file system, right after the root directory. Unfortunately,
> there are critical system files like the master directory
> that were overwritten.
>
> Blocks 35 through 50 are overwritten by unrecognisable
> binary data.  There's no way to tell how this happened.
>
> You might be able to recover the file system if you find
> a copy of the image from before the corruption and copy
> the corrupted 16 blocks from that.  For example, you could
> use a command like this:
>
> dd if=/dev/backup.image of=/dev/your/device bs=4K skip=35 seek=35 count=16
> conv=notrunc
>
> You would also need to fix up the locking protocol with:
> gfs2_tool sb /dev/your/device proto "lock_dlm"
>
> Without those 16 destroyed blocks, you may not be able to
> recover the file system.
>
> As a last-ditch effort, you could try running the experimental
> fsck.gfs2 for RHEL6 located on my people page to see if it
> can recreate any of the data, but it's a long shot, and unlikely
> to work:
>
> http://people.redhat.com/rpeterso/Experimental/RHEL6.x/fsck.gfs2
>
> I hope this helps.
>
> Regards,
>
> Bob Peterson
> Red Hat File Systems
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
 Thank you

Best regards
*************************************************************************************************
Sherlock Zhang  ? ??
Technical Supporter East China Technical Support Department

Address: Room 23E No. 728 West Yanan Road Changning DC. Shanghai China
Post code: 20050

Office: +86 021 62253300 ext. 803
Mobile: +86 133 8600 6305
www.uit.com.cn


*************************************************************************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111222/0810e42a/attachment.htm>

From sherlock_zhang at uit.com.cn  Thu Dec 22 07:34:31 2011
From: sherlock_zhang at uit.com.cn (Sherlock Zhang)
Date: Thu, 22 Dec 2011 15:34:31 +0800
Subject: [Linux-cluster] error messages while use fsck.gfs2
In-Reply-To: <CAHefBgSKHEjAnXq5pv1OmZ9td0qP_9E+_X510rvFpU0zYNa+Bw@mail.gmail.com>
References: <CAHefBgR_0keH6A96V+9zP=2QzV6psgQgxy8V6AHJfWJhFojy7Q@mail.gmail.com>
	<e711795e-92d0-45f0-96f8-160e8da07e7c@zmail16.collab.prod.int.phx2.redhat.com>
	<CAHefBgSKHEjAnXq5pv1OmZ9td0qP_9E+_X510rvFpU0zYNa+Bw@mail.gmail.com>
Message-ID: <CAHefBgTNwtdJJ18LNns5VHChx3er-kk938jq6VoZJTivhoMEVw@mail.gmail.com>

BTW:
The stdout info
[root at Toureg ~]# fsck.gfs2 /dev/sdc
Initializing fsck
Either the super block is corrupted, or this is not a GFS2 filesystem
Gathering information to repair the gfs2 superblock.  This may take some
time.
Block size determined to be: 4096
Found system jindex file at: 0x24
Found system per_node directory at: 0x2816f
>From per_node's '..' I backtracked the master directory to: 0x23
Found system statfs file at: 0x28171
Found system quota file at: 0x28172
Found system inum file at: 0x2867f
Found system rindex file at: 0x28681
Lock protocol determined to be: lock_nolock
Stand-alone file system: No need for a lock table.
Okay to fix the GFS2 superblock? (y/n)y
bad seek: Invalid argument from inode_read:58: block 18446744073709551615
(0xffffffffffffffff)

On Thu, Dec 22, 2011 at 3:30 PM, Sherlock Zhang
<sherlock_zhang at uit.com.cn>wrote:

> Hi all
>     There are 4 storage with RAID5 and last time the dg02 was gone and the
> repair are still progress.
> but today I was been told there is another dg has gone :(
> this time I didn't get any info from the disk, did any one could tell me
> why would this happen.
>
> the disk dump info can download via this URL
> "http://www.waalker.com/disk.dg03.dump <http://www.waalker.com/disk.dump>"
> and the file's md5sum is
> ec92b4d5f84e52a91e9128a789da4579  disk.dg03.dump
>
>
> On Mon, Nov 14, 2011 at 11:00 PM, Bob Peterson <rpeterso at redhat.com>wrote:
>
>> ----- Original Message -----
>> | Hi Bob
>> |     You can download the disk.dump file via this URL
>> | "http://www.waalker.com/disk.dump"
>> | and the file's md5sum is
>> | 1e43719ca086220478a9aee9c09c727b  disk.dump
>> | if is there anything other can help ascertain the problem please let
>> | me
>> | know.
>> |
>> | Thank you
>>
>> Hi Sherlock,
>>
>> I've taken a close look at the image file you created.
>> This appears to be a normal, everyday GFS2 file system
>> except there is a section of 16 blocks (or 0x10 in hex)
>> that are completely destroyed near the beginning of the
>> file system, right after the root directory. Unfortunately,
>> there are critical system files like the master directory
>> that were overwritten.
>>
>> Blocks 35 through 50 are overwritten by unrecognisable
>> binary data.  There's no way to tell how this happened.
>>
>> You might be able to recover the file system if you find
>> a copy of the image from before the corruption and copy
>> the corrupted 16 blocks from that.  For example, you could
>> use a command like this:
>>
>> dd if=/dev/backup.image of=/dev/your/device bs=4K skip=35 seek=35
>> count=16 conv=notrunc
>>
>> You would also need to fix up the locking protocol with:
>> gfs2_tool sb /dev/your/device proto "lock_dlm"
>>
>> Without those 16 destroyed blocks, you may not be able to
>> recover the file system.
>>
>> As a last-ditch effort, you could try running the experimental
>> fsck.gfs2 for RHEL6 located on my people page to see if it
>> can recreate any of the data, but it's a long shot, and unlikely
>> to work:
>>
>> http://people.redhat.com/rpeterso/Experimental/RHEL6.x/fsck.gfs2
>>
>> I hope this helps.
>>
>> Regards,
>>
>> Bob Peterson
>> Red Hat File Systems
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
>
> --
>  Thank you
>
> Best regards
>
> *************************************************************************************************
> Sherlock Zhang  ? ??
> Technical Supporter East China Technical Support Department
>
> Address: Room 23E No. 728 West Yanan Road Changning DC. Shanghai China
> Post code: 20050
>
> Office: +86 021 62253300 ext. 803
> Mobile: +86 133 8600 6305
> www.uit.com.cn
>
>
>
> *************************************************************************************************
>


-- 
 Thank you

Best regards
*************************************************************************************************
Sherlock Zhang  ? ??
Technical Supporter East China Technical Support Department

Address: Room 23E No. 728 West Yanan Road Changning DC. Shanghai China
Post code: 20050

Office: +86 021 62253300 ext. 803
Mobile: +86 133 8600 6305
www.uit.com.cn


*************************************************************************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111222/2489b1fb/attachment.htm>

From zheka at uvt.cz  Mon Dec 26 22:52:29 2011
From: zheka at uvt.cz (Yevheniy Demchenko)
Date: Mon, 26 Dec 2011 23:52:29 +0100
Subject: [Linux-cluster] [PATCH] dlm: faster dlm recovery
Message-ID: <4EF8FAAD.50504@uvt.cz>

Avoid running find_rsb_root by storing last recovered rsb address for
each node.
Makes dlm recovery much faster for FS with large number of files.

Signed-off-by: Yevheniy Demchenko <zheka at uvt.cz>
---
Current dlm recovery uses small (4096 bytes) buffer to communicate between
dlm_copy_master_names and dlm_directory_recovery. This leads to running
find_rsb_root
N*32/4096 times, where N - number of locks to recover and 32 -
DLM_RESNAME_MAXLEN+1.
find_rsb_root itself takes N*c to complete, where c is some constant.
Eventually, dlm recovery
time is proportional to N*N. For an ocfs2 fs with one directory
consisting of 300000 small files
every mount on other node takes more than 2.5 minutes and umount more
than 5 minutes on a
fairly modern HW with 10Gb interconnect. During dlm recovery FS is not
available on any node.
 This patch makes mounts and umounts on non-locking-master nodes to take
less than a 2 seconds.
It is not limited to ocfs2 and might make dlm recovery faster in general
(i.e. for gfs2).

Test case:
2 node RHCS cluster, OCFS2 with cman cluster stack.
/sys/kernel/config/dlm/cluster/{lkbtbl_size,dirtbl_size,rsbtbl_size} =
16384 on both nodes

On node 1:
#mkfs.ocfs2
--fs-features=backup-super,sparse,inline-data,extended-slotmap,indexed-dirs,refcount,xattr,usrquota,grpquota,unwritten
/dev/vg1/test1
#mount /dev/vg1/test1 /mnt/temp -o noatime,nodiratime
#mkdir /mnt/temp/test1
#for i in $(seq 1 300000) ; do dd if=/dev/urandom bs=4096 count=1
of=/mnt/temp/test1/$i ; done
#umount /mnt/temp  #-----leave dlm and destroy locks
#mount /dev/vg1/test1 /mnt/temp -o noatime,nodiratime
#time (ls -l /mnt/temp/test1 | wc -l )    #-------create 300000 RR locks
on node 1

On node 2:
#mount /dev/vg1/test1 /mnt/temp -o noatime,nodiratime #--- dlm recovery
starts and takes a looooong time if dlm is not patched
#umount /mnt/temp #----- even looooooonger, FS is not available on any
node while recovery is running
After patching, both operations on node2 take less than a 2 seconds.

For now, patch tries to detect inconsistences and reverts to the
previous behaviour if there are any.
These tests can be dropped together with find_rsb_root and some
excessive code in the future.


diff -uNr vanilla/fs/dlm/dir.c v1.0/fs/dlm/dir.c
--- vanilla/fs/dlm/dir.c        2011-09-29 15:29:00.000000000 +0200
+++ v1.0/fs/dlm/dir.c   2011-12-26 22:00:21.068403493 +0100
@@ -196,6 +196,16 @@
        }
 }
 
+static int nodeid2index (struct dlm_ls *ls, int nodeid) {
+       int i;
+       for (i = 0; i < ls->ls_num_nodes ; i++) {
+               if (ls->ls_node_array[i] == nodeid)
+                       return (i);
+       }
+       log_debug(ls, "index not found for nodeid %d", nodeid);
+       return (-1);
+}
+
 int dlm_recover_directory(struct dlm_ls *ls)
 {
        struct dlm_member *memb;
@@ -375,11 +385,28 @@
        struct dlm_rsb *r;
        int offset = 0, dir_nodeid;
        __be16 be_namelen;
+       int index;
 
        down_read(&ls->ls_root_sem);
 
+       index = nodeid2index(ls, nodeid);
+
        if (inlen > 1) {
-               r = find_rsb_root(ls, inbuf, inlen);
+               if ((index > -1) && (ls->ls_recover_last_rsb[index])) {
+                       if (inlen ==
ls->ls_recover_last_rsb[index]->res_length &&
+                           !memcmp(inbuf,
ls->ls_recover_last_rsb[index]->res_name, inlen)) {
+                               r = ls->ls_recover_last_rsb[index];
+                       } else {
+                               /* This should never happen! */
+                               log_error(ls, "copy_master_names: rsb
cache failed 1: node %d: cached rsb %1.31s, needed rsb %1.31s;", nodeid,
+                                        
ls->ls_recover_last_rsb[index]->res_name, inbuf);
+                               r = find_rsb_root(ls, inbuf, inlen);
+                       }
+               } else {
+                       /* Left for safety reasons, we should never get
here */
+                       r = find_rsb_root(ls, inbuf, inlen);
+                       log_error(ls, "copy_master_names: rsb cache
failed 2: ,searching for %1.31s, node %d", inbuf, nodeid);
+               }
                if (!r) {
                        inbuf[inlen - 1] = '\0';
                        log_error(ls, "copy_master_names from %d start
%d %s",
@@ -421,6 +448,7 @@
                offset += sizeof(__be16);
                memcpy(outbuf + offset, r->res_name, r->res_length);
                offset += r->res_length;
+               ls->ls_recover_last_rsb[index] = r;
        }
 
        /*
diff -uNr vanilla/fs/dlm/dlm_internal.h v1.0/fs/dlm/dlm_internal.h
--- vanilla/fs/dlm/dlm_internal.h       2011-09-29 15:32:00.000000000 +0200
+++ v1.0/fs/dlm/dlm_internal.h  2011-12-22 23:51:00.000000000 +0100
@@ -526,6 +526,7 @@
        int                     ls_recover_list_count;
        wait_queue_head_t       ls_wait_general;
        struct mutex            ls_clear_proc_locks;
+       struct dlm_rsb          **ls_recover_last_rsb;
 
        struct list_head        ls_root_list;   /* root resources */
        struct rw_semaphore     ls_root_sem;    /* protect root_list */
diff -uNr vanilla/fs/dlm/member.c v1.0/fs/dlm/member.c
--- vanilla/fs/dlm/member.c     2011-09-29 15:29:00.000000000 +0200
+++ v1.0/fs/dlm/member.c        2011-12-23 19:55:00.000000000 +0100
@@ -128,6 +128,9 @@
 
        kfree(ls->ls_node_array);
        ls->ls_node_array = NULL;
+
+       kfree(ls->ls_recover_last_rsb);
+       ls->ls_recover_last_rsb = NULL;
 
        list_for_each_entry(memb, &ls->ls_nodes, list) {
                if (memb->weight)
@@ -146,6 +149,11 @@
        array = kmalloc(sizeof(int) * total, GFP_NOFS);
        if (!array)
                return;
+
+       ls->ls_recover_last_rsb = kcalloc(ls->ls_num_nodes+1,
sizeof(struct dlm_rsb *), GFP_NOFS);
+
+       if (!ls->ls_recover_last_rsb)
+               return;
 
        list_for_each_entry(memb, &ls->ls_nodes, list) {
                if (!all_zero && !memb->weight)

-- 
Ing. Yevheniy Demchenko
Senior Linux Administrator
UVT s.r.o. 


From sdake at redhat.com  Tue Dec 27 16:00:25 2011
From: sdake at redhat.com (Steven Dake)
Date: Tue, 27 Dec 2011 09:00:25 -0700
Subject: [Linux-cluster] Corosync memory problem
In-Reply-To: <CAOJZNMeBP=-jQrh-F7cTtK4tNy+HGHsLHvuf5ouWbBg-UdkfPw@mail.gmail.com>
References: <CAOJZNMdz4R7qs2e9WCfhcN7=NnrOFkWWXAr8q+mL2rscgRSGdw@mail.gmail.com>
	<CAOJZNMeBP=-jQrh-F7cTtK4tNy+HGHsLHvuf5ouWbBg-UdkfPw@mail.gmail.com>
Message-ID: <4EF9EB99.1050001@redhat.com>

On 12/21/2011 11:04 AM, Chris Alexander wrote:
> An update in case anyone ever runs into something like this - we had
> corosync-notify running on the servers and once we removed that and
> restarted the cluster stack, corosync seemed to return to normal.
> 
> Additionally, according to the corosync mailing list, the cluster 1.2.3
> version is basically very similar to (if not the same as) the 1.4 that
> they currently have released, someone's been backporting.
> 

The upstream 1.2.3 version hasn't had any backports applied to it.  Only
the RHEL 1.2.3-z versions have been backported.

Regards
-steve

> Cheers
> 
> Chris
> 
> On 19 December 2011 19:01, Chris Alexander <chris.alexander at kusiri.com
> <mailto:chris.alexander at kusiri.com>> wrote:
> 
>     Hi all,
> 
>     You may remember our recent issue, I believe this is being worsened
>     if not caused by another problem we have encountered.
> 
>     Every few days our nodes are (non-simultaneously) being fenced due
>     to corosync taking up vast amounts of memory (i.e. 100% of the box).
>     Please see a sample log message, we have several just like this, [1]
>     which occurs when this happens. Note that it is not always corosync
>     being killed - but it is clearly corosync eating all the memory (see
>     top output from three servers at various times since their last
>     reboot, [2] [3] [4]).
> 
>     The corosync version is 1.2.3:
>     [g at cluster1 ~]$ corosync -v
>     Corosync Cluster Engine, version '1.2.3'
>     Copyright (c) 2006-2009 Red Hat, Inc.
> 
>     We had a bit of a dig around and there are a significant number of
>     bugfix updates which address various segfaults, crashes, memory
>     leaks etc. in this minor as well as subsequent minor versions. [5] [6]
> 
>     We're trialling the Fedora 14 (fc14) RPMs for corosync and
>     corosynclib (v1.4.2) to see if it fixes the particular issue we are
>     seeing (i.e. whether or not the memory keeps spiralling way out of
>     control).
> 
>     Has anyone else seen an issue like this, and is there any known way
>     to debug or fix it? If we can assist debugging by providing further
>     information, please specify what this is (and, if non-obvious, how
>     to get it).
> 
>     Thanks again for your help
> 
>     Chris
> 
>     [1] http://pastebin.com/CbyERaRT
>     [2] http://pastebin.com/uk9ZGL7H
>     [3] http://pastebin.com/H4w5Zg46
>     [4] http://pastebin.com/KPZxL6UB
>     [5] http://rhn.redhat.com/errata/RHBA-2011-1361.html
>     [6] http://rhn.redhat.com/errata/RHBA-2011-1515.html
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From mkreder at gmail.com  Tue Dec 27 22:20:34 2011
From: mkreder at gmail.com (Matias Kreder)
Date: Tue, 27 Dec 2011 19:20:34 -0300
Subject: [Linux-cluster] problems with fence_vmware_soap and vCenter server
	4 / ESX 4
Message-ID: <CAH1v0TqDEOhE0OBpjvnWPpP8vX95FvJ_SUoHXMfr=uFoexURUw@mail.gmail.com>

hi all,

I am having troubles while using fence_vmware_soap to get the VM list
and any other operation.

fence_vmware_soap -l user  -p pass -a 192.168.1.11 -o list -z

My user has full admin privileges. 192.168.1.11 is the vCenter IP
address running on Windows 2008. vCenter is controlling only 1 ESX 4
server.

Doing the same thing against the ESX shows the same issue:

fence_vmware_soap -l root -p pass -a  192.168.1.12 -o list -z
Failed: Unable to obtain correct plug status or plug is not available

Looks like it is connecting and authenticating properly because if I
use a fake password it fails with "Unable to connect/login to fencing
device"

Please advice
Regards
Matias Kreder


From sathyanarayanan.varadharajan at precisionit.co.in  Wed Dec 28 03:42:58 2011
From: sathyanarayanan.varadharajan at precisionit.co.in (SATHYA - IT)
Date: Wed, 28 Dec 2011 09:12:58 +0530
Subject: [Linux-cluster] Linux-cluster Digest, Vol 92, Issue 19
In-Reply-To: <mailman.17.1325005204.26554.linux-cluster@redhat.com>
References: <mailman.17.1325005204.26554.linux-cluster@redhat.com>
Message-ID: <001701ccc512$cb1a8da0$614fa8e0$@precisionit.co.in>

Hi  Yevheniy,

I am interested in applying this patch for my 2 node clustered configured in
RHEL 6.2 with CLVMD + GFS2 + CTDB + CMAN. Can you please guide me on how to
download and apply this patch for the said environment. 

Thanks

Sathya Narayanan V
Solution Architect	
M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
www.precisionit.co.in

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of
linux-cluster-request at redhat.com
Sent: Tuesday, December 27, 2011 10:30 PM
To: linux-cluster at redhat.com
Subject: Linux-cluster Digest, Vol 92, Issue 19

Send Linux-cluster mailing list submissions to
	linux-cluster at redhat.com

To subscribe or unsubscribe via the World Wide Web, visit
	https://www.redhat.com/mailman/listinfo/linux-cluster
or, via email, send a message with subject or body 'help' to
	linux-cluster-request at redhat.com

You can reach the person managing the list at
	linux-cluster-owner at redhat.com

When replying, please edit your Subject line so it is more specific than
"Re: Contents of Linux-cluster digest..."


Today's Topics:

   1. [PATCH] dlm: faster dlm recovery (Yevheniy Demchenko)
   2. Re: Corosync memory problem (Steven Dake)


----------------------------------------------------------------------

Message: 1
Date: Mon, 26 Dec 2011 23:52:29 +0100
From: Yevheniy Demchenko <zheka at uvt.cz>
To: linux-cluster at redhat.com
Subject: [Linux-cluster] [PATCH] dlm: faster dlm recovery
Message-ID: <4EF8FAAD.50504 at uvt.cz>
Content-Type: text/plain; charset=ISO-8859-1

Avoid running find_rsb_root by storing last recovered rsb address for each
node.
Makes dlm recovery much faster for FS with large number of files.

Signed-off-by: Yevheniy Demchenko <zheka at uvt.cz>
---
Current dlm recovery uses small (4096 bytes) buffer to communicate between
dlm_copy_master_names and dlm_directory_recovery. This leads to running
find_rsb_root
N*32/4096 times, where N - number of locks to recover and 32 -
DLM_RESNAME_MAXLEN+1.
find_rsb_root itself takes N*c to complete, where c is some constant.
Eventually, dlm recovery
time is proportional to N*N. For an ocfs2 fs with one directory consisting
of 300000 small files every mount on other node takes more than 2.5 minutes
and umount more than 5 minutes on a fairly modern HW with 10Gb interconnect.
During dlm recovery FS is not available on any node.
 This patch makes mounts and umounts on non-locking-master nodes to take
less than a 2 seconds.
It is not limited to ocfs2 and might make dlm recovery faster in general
(i.e. for gfs2).

Test case:
2 node RHCS cluster, OCFS2 with cman cluster stack.
/sys/kernel/config/dlm/cluster/{lkbtbl_size,dirtbl_size,rsbtbl_size} =
16384 on both nodes

On node 1:
#mkfs.ocfs2
--fs-features=backup-super,sparse,inline-data,extended-slotmap,indexed-dirs,
refcount,xattr,usrquota,grpquota,unwritten
/dev/vg1/test1
#mount /dev/vg1/test1 /mnt/temp -o noatime,nodiratime #mkdir /mnt/temp/test1
#for i in $(seq 1 300000) ; do dd if=/dev/urandom bs=4096 count=1
of=/mnt/temp/test1/$i ; done #umount /mnt/temp  #-----leave dlm and destroy
locks #mount /dev/vg1/test1 /mnt/temp -o noatime,nodiratime
#time (ls -l /mnt/temp/test1 | wc -l )    #-------create 300000 RR locks
on node 1

On node 2:
#mount /dev/vg1/test1 /mnt/temp -o noatime,nodiratime #--- dlm recovery
starts and takes a looooong time if dlm is not patched #umount /mnt/temp
#----- even looooooonger, FS is not available on any node while recovery is
running After patching, both operations on node2 take less than a 2 seconds.

For now, patch tries to detect inconsistences and reverts to the previous
behaviour if there are any.
These tests can be dropped together with find_rsb_root and some excessive
code in the future.


diff -uNr vanilla/fs/dlm/dir.c v1.0/fs/dlm/dir.c
--- vanilla/fs/dlm/dir.c        2011-09-29 15:29:00.000000000 +0200
+++ v1.0/fs/dlm/dir.c   2011-12-26 22:00:21.068403493 +0100
@@ -196,6 +196,16 @@
        }
 }
 
+static int nodeid2index (struct dlm_ls *ls, int nodeid) {
+       int i;
+       for (i = 0; i < ls->ls_num_nodes ; i++) {
+               if (ls->ls_node_array[i] == nodeid)
+                       return (i);
+       }
+       log_debug(ls, "index not found for nodeid %d", nodeid);
+       return (-1);
+}
+
 int dlm_recover_directory(struct dlm_ls *ls)  {
        struct dlm_member *memb;
@@ -375,11 +385,28 @@
        struct dlm_rsb *r;
        int offset = 0, dir_nodeid;
        __be16 be_namelen;
+       int index;
 
        down_read(&ls->ls_root_sem);
 
+       index = nodeid2index(ls, nodeid);
+
        if (inlen > 1) {
-               r = find_rsb_root(ls, inbuf, inlen);
+               if ((index > -1) && (ls->ls_recover_last_rsb[index])) {
+                       if (inlen ==
ls->ls_recover_last_rsb[index]->res_length &&
+                           !memcmp(inbuf,
ls->ls_recover_last_rsb[index]->res_name, inlen)) {
+                               r = ls->ls_recover_last_rsb[index];
+                       } else {
+                               /* This should never happen! */
+                               log_error(ls, "copy_master_names: rsb
cache failed 1: node %d: cached rsb %1.31s, needed rsb %1.31s;", nodeid,
+                                        
ls->ls_recover_last_rsb[index]->res_name, inbuf);
+                               r = find_rsb_root(ls, inbuf, inlen);
+                       }
+               } else {
+                       /* Left for safety reasons, we should never get
here */
+                       r = find_rsb_root(ls, inbuf, inlen);
+                       log_error(ls, "copy_master_names: rsb cache
failed 2: ,searching for %1.31s, node %d", inbuf, nodeid);
+               }
                if (!r) {
                        inbuf[inlen - 1] = '\0';
                        log_error(ls, "copy_master_names from %d start %d
%s", @@ -421,6 +448,7 @@
                offset += sizeof(__be16);
                memcpy(outbuf + offset, r->res_name, r->res_length);
                offset += r->res_length;
+               ls->ls_recover_last_rsb[index] = r;
        }
 
        /*
diff -uNr vanilla/fs/dlm/dlm_internal.h v1.0/fs/dlm/dlm_internal.h
--- vanilla/fs/dlm/dlm_internal.h       2011-09-29 15:32:00.000000000 +0200
+++ v1.0/fs/dlm/dlm_internal.h  2011-12-22 23:51:00.000000000 +0100
@@ -526,6 +526,7 @@
        int                     ls_recover_list_count;
        wait_queue_head_t       ls_wait_general;
        struct mutex            ls_clear_proc_locks;
+       struct dlm_rsb          **ls_recover_last_rsb;
 
        struct list_head        ls_root_list;   /* root resources */
        struct rw_semaphore     ls_root_sem;    /* protect root_list */
diff -uNr vanilla/fs/dlm/member.c v1.0/fs/dlm/member.c
--- vanilla/fs/dlm/member.c     2011-09-29 15:29:00.000000000 +0200
+++ v1.0/fs/dlm/member.c        2011-12-23 19:55:00.000000000 +0100
@@ -128,6 +128,9 @@
 
        kfree(ls->ls_node_array);
        ls->ls_node_array = NULL;
+
+       kfree(ls->ls_recover_last_rsb);
+       ls->ls_recover_last_rsb = NULL;
 
        list_for_each_entry(memb, &ls->ls_nodes, list) {
                if (memb->weight)
@@ -146,6 +149,11 @@
        array = kmalloc(sizeof(int) * total, GFP_NOFS);
        if (!array)
                return;
+
+       ls->ls_recover_last_rsb = kcalloc(ls->ls_num_nodes+1,
sizeof(struct dlm_rsb *), GFP_NOFS);
+
+       if (!ls->ls_recover_last_rsb)
+               return;
 
        list_for_each_entry(memb, &ls->ls_nodes, list) {
                if (!all_zero && !memb->weight)

--
Ing. Yevheniy Demchenko
Senior Linux Administrator
UVT s.r.o. 


------------------------------

Message: 2
Date: Tue, 27 Dec 2011 09:00:25 -0700
From: Steven Dake <sdake at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] Corosync memory problem
Message-ID: <4EF9EB99.1050001 at redhat.com>
Content-Type: text/plain; charset=ISO-8859-1

On 12/21/2011 11:04 AM, Chris Alexander wrote:
> An update in case anyone ever runs into something like this - we had
> corosync-notify running on the servers and once we removed that and
> restarted the cluster stack, corosync seemed to return to normal.
> 
> Additionally, according to the corosync mailing list, the cluster 1.2.3
> version is basically very similar to (if not the same as) the 1.4 that
> they currently have released, someone's been backporting.
> 

The upstream 1.2.3 version hasn't had any backports applied to it.  Only
the RHEL 1.2.3-z versions have been backported.

Regards
-steve

> Cheers
> 
> Chris
> 
> On 19 December 2011 19:01, Chris Alexander <chris.alexander at kusiri.com
> <mailto:chris.alexander at kusiri.com>> wrote:
> 
>     Hi all,
> 
>     You may remember our recent issue, I believe this is being worsened
>     if not caused by another problem we have encountered.
> 
>     Every few days our nodes are (non-simultaneously) being fenced due
>     to corosync taking up vast amounts of memory (i.e. 100% of the box).
>     Please see a sample log message, we have several just like this, [1]
>     which occurs when this happens. Note that it is not always corosync
>     being killed - but it is clearly corosync eating all the memory (see
>     top output from three servers at various times since their last
>     reboot, [2] [3] [4]).
> 
>     The corosync version is 1.2.3:
>     [g at cluster1 ~]$ corosync -v
>     Corosync Cluster Engine, version '1.2.3'
>     Copyright (c) 2006-2009 Red Hat, Inc.
> 
>     We had a bit of a dig around and there are a significant number of
>     bugfix updates which address various segfaults, crashes, memory
>     leaks etc. in this minor as well as subsequent minor versions. [5] [6]
> 
>     We're trialling the Fedora 14 (fc14) RPMs for corosync and
>     corosynclib (v1.4.2) to see if it fixes the particular issue we are
>     seeing (i.e. whether or not the memory keeps spiralling way out of
>     control).
> 
>     Has anyone else seen an issue like this, and is there any known way
>     to debug or fix it? If we can assist debugging by providing further
>     information, please specify what this is (and, if non-obvious, how
>     to get it).
> 
>     Thanks again for your help
> 
>     Chris
> 
>     [1] http://pastebin.com/CbyERaRT
>     [2] http://pastebin.com/uk9ZGL7H
>     [3] http://pastebin.com/H4w5Zg46
>     [4] http://pastebin.com/KPZxL6UB
>     [5] http://rhn.redhat.com/errata/RHBA-2011-1361.html
>     [6] http://rhn.redhat.com/errata/RHBA-2011-1515.html
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


------------------------------

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

End of Linux-cluster Digest, Vol 92, Issue 19
*********************************************

This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.


From zheka at uvt.cz  Wed Dec 28 14:38:01 2011
From: zheka at uvt.cz (Yevheniy Demchenko)
Date: Wed, 28 Dec 2011 15:38:01 +0100
Subject: [Linux-cluster] Linux-cluster Digest, Vol 92, Issue 19
In-Reply-To: <001701ccc512$cb1a8da0$614fa8e0$@precisionit.co.in>
References: <mailman.17.1325005204.26554.linux-cluster@redhat.com>
	<001701ccc512$cb1a8da0$614fa8e0$@precisionit.co.in>
Message-ID: <4EFB29C9.6040100@uvt.cz>

Hi Sathya!
Please download
http://www.bosson.eu/temp/dlm-kmod-1.0-1.el6.src.rpm
This package contains dlm from RH kernel 2.6.32-131.21.1 and my patch.
Then as usual $rpmbuild --rebuild dlm-kmod-1.0-1.el6.src.rpm
If you want to build for different kernel use
 $rpmbuild --rebuild  dlm-kmod-1.0-1.el6.src.rpm --define "kversion
<your kver>"

Module will be installed in /lib/modules/<kver>/extra/dlm/dlm.ko and
will be used by default. If you want to get rid of it, just deinstall
the package. Please let me know how it works for you.
Thanks.

Ing. Yevheniy Demchenko
Senior Linux Administrator
UVT s.r.o. 


On 12/28/2011 04:42 AM, SATHYA - IT wrote:
> Hi  Yevheniy,
>
> I am interested in applying this patch for my 2 node clustered configured in
> RHEL 6.2 with CLVMD + GFS2 + CTDB + CMAN. Can you please guide me on how to
> download and apply this patch for the said environment. 
>
> Thanks
>
> Sathya Narayanan V
> Solution Architect	
> M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
> SERVICE - In PRECISION IT is a PASSION
> ----------------------------------------------------------------------------
> -----------------------------
> Precision Infomatic (M) Pvt Ltd
> 22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
> www.precisionit.co.in
>
>


From achintmehta at gmail.com  Wed Dec 28 20:56:51 2011
From: achintmehta at gmail.com (Achint Mehta)
Date: Wed, 28 Dec 2011 15:56:51 -0500
Subject: [Linux-cluster] Passing command line arguments to script resources
Message-ID: <CAGV5edEXy0HzVeC9ReqyrG51EzGbFCVZWLfHyBnSMd2aq335Kg@mail.gmail.com>

Hi All,

I am using RHCS on RHEL 6.2.
Is there way to pass command line arguments while adding/starting a script
resource ?

Thanks
Achint
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111228/ae53c93c/attachment.htm>

From achintmehta at gmail.com  Wed Dec 28 21:03:32 2011
From: achintmehta at gmail.com (Achint Mehta)
Date: Wed, 28 Dec 2011 16:03:32 -0500
Subject: [Linux-cluster] Multiple script resources with same name
Message-ID: <CAGV5edH+kQM31qkC2zS79vBgMx2Rq1JOxz6YXJ3EuCSf72mogw@mail.gmail.com>

Hi All,

I am using RHCS on RHEL 6.2.
I was trying add multiple script resources that point to the same
script/resource/executable.
Is seems that every script resource that has been added should have a
unique script name.

Is there a way in RHCS by which I can spawn and manage multiple instances
of same script/executable?

Thanks
Achint
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111228/6c874738/attachment.htm>

From John.Anderson4 at weyerhaeuser.com  Wed Dec 28 21:41:20 2011
From: John.Anderson4 at weyerhaeuser.com (Anderson, John(Patni))
Date: Wed, 28 Dec 2011 13:41:20 -0800
Subject: [Linux-cluster] Multiple script resources with same name
In-Reply-To: <CAGV5edH+kQM31qkC2zS79vBgMx2Rq1JOxz6YXJ3EuCSf72mogw@mail.gmail.com>
References: <CAGV5edH+kQM31qkC2zS79vBgMx2Rq1JOxz6YXJ3EuCSf72mogw@mail.gmail.com>
Message-ID: <735D247930DCA240B50EB36F4586D25A27DC5F4805@WAFEDIXMCMS10.corp.weyer.pri>

One way might be to create the script and save it to different names.  IE vi the script and save it then cat it to a different name.

TEST #> Vi testfile1
After completing the task then:
TEST #> cat testfile1 > testfile2
TEST #> cat testfile2 > testfile3
Et Cetra ; etc.


John
253-924-7657

From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Achint Mehta
Sent: Wednesday, December 28, 2011 1:04 PM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Multiple script resources with same name

Hi All,

I am using RHCS on RHEL 6.2.
I was trying add multiple script resources that point to the same script/resource/executable.
Is seems that every script resource that has been added should have a unique script name.

Is there a way in RHCS by which I can spawn and manage multiple instances of same script/executable?

Thanks
Achint


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111228/c568fb7d/attachment.htm>

From achintmehta at gmail.com  Wed Dec 28 21:54:54 2011
From: achintmehta at gmail.com (Achint Mehta)
Date: Wed, 28 Dec 2011 16:54:54 -0500
Subject: [Linux-cluster] Multiple script resources with same name
In-Reply-To: <735D247930DCA240B50EB36F4586D25A27DC5F4805@WAFEDIXMCMS10.corp.weyer.pri>
References: <CAGV5edH+kQM31qkC2zS79vBgMx2Rq1JOxz6YXJ3EuCSf72mogw@mail.gmail.com>
	<735D247930DCA240B50EB36F4586D25A27DC5F4805@WAFEDIXMCMS10.corp.weyer.pri>
Message-ID: <CAGV5edFhjCp13nw5tvxLVhNS0ZUdE7o-73pDEKTrLQ1vvDHWEw@mail.gmail.com>

Thanks John.

I have done something similar as a workaround.
Instead of creating copies I have created softlink with different names to
the same script file.
(I am editing the script a lot so softlinks saves me some effort)

But is it correct to say that there is no such provision of managing
multiple instances of services (script service) in the RHCS framework ?

Thanks
Achint

On Wed, Dec 28, 2011 at 4:41 PM, Anderson, John(Patni) <
John.Anderson4 at weyerhaeuser.com> wrote:

> One way might be to create the script and save it to different names.  IE
> vi the script and save it then cat it to a different name.****
>
> ** **
>
> TEST #> Vi testfile1****
>
> After completing the task then:****
>
> TEST #> cat testfile1 > testfile2****
>
> TEST #> cat testfile2 > testfile3  ****
>
> Et Cetra ; etc.****
>
> ** **
>
> ** **
>
> John ****
>
> 253-924-7657****
>
> ** **
>
> *From:* linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat.com] *On Behalf Of *Achint Mehta
> *Sent:* Wednesday, December 28, 2011 1:04 PM
> *To:* linux-cluster at redhat.com
> *Subject:* [Linux-cluster] Multiple script resources with same name****
>
> ** **
>
> Hi All,****
>
> ** **
>
> I am using RHCS on RHEL 6.2.****
>
> I was trying add multiple script resources that point to the same
> script/resource/executable.****
>
> Is seems that every script resource that has been added should have a
> unique script name. ****
>
> ** **
>
> Is there a way in RHCS by which I can spawn and manage multiple instances
> of same script/executable?****
>
> ** **
>
> Thanks****
>
> Achint****
>
> ** **
>
> ** **
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111228/ee4f05f1/attachment.htm>

From John.Anderson4 at weyerhaeuser.com  Wed Dec 28 22:10:35 2011
From: John.Anderson4 at weyerhaeuser.com (Anderson, John(Patni))
Date: Wed, 28 Dec 2011 14:10:35 -0800
Subject: [Linux-cluster] Multiple script resources with same name
In-Reply-To: <CAGV5edFhjCp13nw5tvxLVhNS0ZUdE7o-73pDEKTrLQ1vvDHWEw@mail.gmail.com>
References: <CAGV5edH+kQM31qkC2zS79vBgMx2Rq1JOxz6YXJ3EuCSf72mogw@mail.gmail.com>
	<735D247930DCA240B50EB36F4586D25A27DC5F4805@WAFEDIXMCMS10.corp.weyer.pri>
	<CAGV5edFhjCp13nw5tvxLVhNS0ZUdE7o-73pDEKTrLQ1vvDHWEw@mail.gmail.com>
Message-ID: <735D247930DCA240B50EB36F4586D25A27DC5F480B@WAFEDIXMCMS10.corp.weyer.pri>

Your welcome Achint,

As far as I know.  In the past I did not change the name of a script I called out in another script.   I just commented out a section (of the line) and stated why I was calling it again, to avoid confusion.  That enabled me to keep resources at a lower level.  (Memory and storage space were measured in kilobytes and megabytes back then.)

John
253-924-7657

From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Achint Mehta
Sent: Wednesday, December 28, 2011 1:55 PM
To: linux clustering
Subject: Re: [Linux-cluster] Multiple script resources with same name

Thanks John.

I have done something similar as a workaround.
Instead of creating copies I have created softlink with different names to the same script file.
(I am editing the script a lot so softlinks saves me some effort)

But is it correct to say that there is no such provision of managing multiple instances of services (script service) in the RHCS framework ?

Thanks
Achint

On Wed, Dec 28, 2011 at 4:41 PM, Anderson, John(Patni) <John.Anderson4 at weyerhaeuser.com<mailto:John.Anderson4 at weyerhaeuser.com>> wrote:
One way might be to create the script and save it to different names.  IE vi the script and save it then cat it to a different name.

TEST #> Vi testfile1
After completing the task then:
TEST #> cat testfile1 > testfile2
TEST #> cat testfile2 > testfile3
Et Cetra ; etc.


John
253-924-7657<tel:253-924-7657>

From: linux-cluster-bounces at redhat.com<mailto:linux-cluster-bounces at redhat.com> [mailto:linux-cluster-bounces at redhat.com<mailto:linux-cluster-bounces at redhat.com>] On Behalf Of Achint Mehta
Sent: Wednesday, December 28, 2011 1:04 PM
To: linux-cluster at redhat.com<mailto:linux-cluster at redhat.com>
Subject: [Linux-cluster] Multiple script resources with same name

Hi All,

I am using RHCS on RHEL 6.2.
I was trying add multiple script resources that point to the same script/resource/executable.
Is seems that every script resource that has been added should have a unique script name.

Is there a way in RHCS by which I can spawn and manage multiple instances of same script/executable?

Thanks
Achint


--
Linux-cluster mailing list
Linux-cluster at redhat.com<mailto:Linux-cluster at redhat.com>
https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111228/6da7730e/attachment.htm>

From andreas at hastexo.com  Wed Dec 28 23:03:42 2011
From: andreas at hastexo.com (Andreas Kurz)
Date: Thu, 29 Dec 2011 00:03:42 +0100
Subject: [Linux-cluster] problems with fence_vmware_soap and vCenter
 server 4 / ESX 4
In-Reply-To: <CAH1v0TqDEOhE0OBpjvnWPpP8vX95FvJ_SUoHXMfr=uFoexURUw@mail.gmail.com>
References: <CAH1v0TqDEOhE0OBpjvnWPpP8vX95FvJ_SUoHXMfr=uFoexURUw@mail.gmail.com>
Message-ID: <4EFBA04E.1080904@hastexo.com>

Hello,

On 12/27/2011 11:20 PM, Matias Kreder wrote:
> hi all,
> 
> I am having troubles while using fence_vmware_soap to get the VM list
> and any other operation.
> 
> fence_vmware_soap -l user  -p pass -a 192.168.1.11 -o list -z
> 
> My user has full admin privileges. 192.168.1.11 is the vCenter IP
> address running on Windows 2008. vCenter is controlling only 1 ESX 4
> server.

from the man page ...

fence_vmware_soap  is an I/O Fencing agent which can be used with the
virtual machines managed by VMWare products that have SOAP API v4.1+.

... v4.1+ ... you have "plain" 4? Don't even know if the version is
directly related to ESX version, but looks "suspicious" to me ;-)

hth - regards,
Andreas

-- 
Need help with Pacemaker?
http://www.hastexo.com/now

> 
> Doing the same thing against the ESX shows the same issue:
> 
> fence_vmware_soap -l root -p pass -a  192.168.1.12 -o list -z
> Failed: Unable to obtain correct plug status or plug is not available
> 
> Looks like it is connecting and authenticating properly because if I
> use a fake password it fails with "Unable to connect/login to fencing
> device"
> 
> Please advice
> Regards
> Matias Kreder
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 286 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111229/8d413ab9/attachment.sig>

From tfrumbacher at gmail.com  Thu Dec 29 18:50:41 2011
From: tfrumbacher at gmail.com (Aaron Benner)
Date: Thu, 29 Dec 2011 11:50:41 -0700
Subject: [Linux-cluster] Cluster managed KVM guest failure in rgmanager
Message-ID: <24B617B4-2323-4179-BA34-CDE8038F6D8A@gmail.com>

All,

I'm not at all sure what is going on here.  I have a large number of KVM guests being managed by a 5 node RHEL5.6 cluster and recently whenever I modify the cluster config, or reload / restart libvirtd (to add / remove guests) rgmanager goes berserk.  When this happens rgmanager lists the guests as "failed" services and this is the result it the log:

Dec 29 10:44:17 plieadies1 clurgmgrd[6770]: <debug> 5 events processed 
Dec 29 10:49:56 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:49:59 plieadies1 last message repeated 3 times
Dec 29 10:49:59 plieadies1 clurgmgrd[6770]: <notice> status on vm "Demeter" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> status on vm "IoA" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> status on vm "IoF" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> status on vm "Pluto" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> status on vm "Venus" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <debug> No other nodes have seen vm:Demeter 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Stopping service vm:Demeter 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <debug> No other nodes have seen vm:IoA 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Stopping service vm:IoA 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> stop on vm "Demeter" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <debug> No other nodes have seen vm:IoF 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> stop on vm "IoA" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <crit> #12: RG vm:Demeter failed to stop; intervention required 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Service vm:Demeter is failed 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Stopping service vm:IoF 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <debug> No other nodes have seen vm:Pluto 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <crit> #12: RG vm:IoA failed to stop; intervention required 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Service vm:IoA is failed 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <debug> No other nodes have seen vm:Venus 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Stopping service vm:Pluto 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> stop on vm "IoF" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Stopping service vm:Venus 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <crit> #12: RG vm:IoF failed to stop; intervention required 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Service vm:IoF is failed 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> stop on vm "Venus" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <crit> #12: RG vm:Venus failed to stop; intervention required 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Service vm:Venus is failed 
Dec 29 10:50:00 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> stop on vm "Pluto" returned 2 (invalid argument(s)) 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <crit> #12: RG vm:Pluto failed to stop; intervention required 
Dec 29 10:50:00 plieadies1 clurgmgrd[6770]: <notice> Service vm:Pluto is failed 
Dec 29 10:50:02 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:12 plieadies1 last message repeated 4 times
Dec 29 10:50:19 plieadies1 clurgmgrd[6770]: <debug> 13 events processed 
Dec 29 10:50:20 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:20 plieadies1 clurgmgrd[6770]: <notice> status on vm "saturn" returned 2 (invalid argument(s)) 
Dec 29 10:50:20 plieadies1 clurgmgrd[6770]: <debug> No other nodes have seen vm:saturn 
Dec 29 10:50:20 plieadies1 clurgmgrd[6770]: <notice> Stopping service vm:saturn 
Dec 29 10:50:20 plieadies1 clurgmgrd: [6770]: <err> Could not determine Hypervisor 
Dec 29 10:50:20 plieadies1 clurgmgrd[6770]: <notice> stop on vm "saturn" returned 2 (invalid argument(s)) 
Dec 29 10:50:20 plieadies1 clurgmgrd[6770]: <crit> #12: RG vm:saturn failed to stop; intervention required 
Dec 29 10:50:20 plieadies1 clurgmgrd[6770]: <notice> Service vm:saturn is failed 
Dec 29 10:50:31 plieadies1 clurgmgrd[6770]: <debug> 1 events processed 
Dec 29 10:59:30 plieadies1 clurgmgrd[6770]: <debug> 1 events processed 

The "Could not determine Hypervisor" message is coming from the following block of code in vm.sh:

	# If someone selects a hypervisor, honor it.
	# Otherwise, ask virsh what the hypervisor is.
	#
	if [ -z "$OCF_RESKEY_hypervisor" ] ||
	   [ "$OCF_RESKEY_hypervisor" = "auto" ]; then
		export OCF_RESKEY_hypervisor="`virsh version | grep \"Running hypervisor:\" | awk '{print $3}' | tr A-Z a-z`"
		if [ -z "$OCF_RESKEY_hypervisor" ]; then
			ocf_log err "Could not determine Hypervisor"
			return $OCF_ERR_ARGS
		fi
		echo Hypervisor: $OCF_RESKEY_hypervisor 
	fi

What's really twisting my shorts is that the command being run to determine the hypervisor works fine at the command prompt:

[root at plieadies1 ~]# virsh version | grep "Running hypervisor:" | awk '{print $3}' | tr A-Z a-z
qemu

I can migrate the still running guest to another node, use clusvcadm to disable it in rgmanager, and then use a wrapper on virsh which returns '0' when attempting to start an already running guest to return the still running vm to cluster control so I can work around this, however, I'm hugely concerned that I'm going to end up with a host failure and a heap of trouble at some point.

Anyone seen something similar or have thoughts on this?  Guesses as to why rgmanager / vm.sh are failing to detect the running hypervisor?

--AB


From dan131riley at gmail.com  Thu Dec 29 21:15:11 2011
From: dan131riley at gmail.com (Dan Riley)
Date: Thu, 29 Dec 2011 16:15:11 -0500
Subject: [Linux-cluster] [PATCH] dlm: faster dlm recovery
In-Reply-To: <4EF8FAAD.50504@uvt.cz>
References: <4EF8FAAD.50504@uvt.cz>
Message-ID: <7AF2C612-5496-466E-815E-BB4039BA441B@gmail.com>

On Dec 26, 2011, at 5:52 PM, Yevheniy Demchenko wrote:

> Avoid running find_rsb_root by storing last recovered rsb address for
> each node.
> Makes dlm recovery much faster for FS with large number of files.
[?]
> It is not limited to ocfs2 and might make dlm recovery faster in general
> (i.e. for gfs2).

>From my tests, this does appear to be an issue for gfs2 as well, and is
very likely the cause of problems we've been seeing with mounting a gfs2
file system on a busy cluster with many file locks.  Thanks!

-dan


From achintmehta at gmail.com  Thu Dec 29 21:49:17 2011
From: achintmehta at gmail.com (Achint Mehta)
Date: Thu, 29 Dec 2011 16:49:17 -0500
Subject: [Linux-cluster] Fencing required for node failover
Message-ID: <CAGV5edFzmfyCRiru4KtKzcVX84BrjV2GKOq8p=hkLH_d6VV8-Q@mail.gmail.com>

Hi All,

I am using RHCS in RHEL 6.2.

I am trying to perform a failover for a node in the cluster.
All the services have fail-over configured on them with recovery method set
to relocate.
When the node foes down the services are not relocated to to another nodes.

Though the node failure is detected by rgmanager:
------
Dec 29 16:20:57 rgmanager State change: pcs_linuxha_1 DOWN
Dec 29 16:28:25 rgmanager Status Child Max set to 7
------
and fenced has the following logs:
------
Dec 29 16:21:04 fenced fencing node pcs_linuxha_1
Dec 29 16:21:04 fenced fence pcs_linuxha_1 dev 0.0 agent none result: error
no method
Dec 29 16:21:04 fenced fence pcs_linuxha_1 failed
------

1. Do I require fencing to be enabled to make node failover work
2. If yes, what kind of failover device should I add. (all the nodes are
simple servers.)


Thanks!
Achint
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111229/5a94951e/attachment.htm>

From linux at alteeve.com  Thu Dec 29 22:06:37 2011
From: linux at alteeve.com (Digimer)
Date: Thu, 29 Dec 2011 17:06:37 -0500
Subject: [Linux-cluster] Fencing required for node failover
In-Reply-To: <CAGV5edFzmfyCRiru4KtKzcVX84BrjV2GKOq8p=hkLH_d6VV8-Q@mail.gmail.com>
References: <CAGV5edFzmfyCRiru4KtKzcVX84BrjV2GKOq8p=hkLH_d6VV8-Q@mail.gmail.com>
Message-ID: <4EFCE46D.1000106@alteeve.com>

On 12/29/2011 04:49 PM, Achint Mehta wrote:
> Hi All, 
> 
> I am using RHCS in RHEL 6.2.
> 
> I am trying to perform a failover for a node in the cluster.
> All the services have fail-over configured on them with recovery method
> set to relocate.
> When the node foes down the services are not relocated to to another nodes.
> 
> Though the node failure is detected by rgmanager:
> ------
> Dec 29 16:20:57 rgmanager State change: pcs_linuxha_1 DOWN
> Dec 29 16:28:25 rgmanager Status Child Max set to 7
> ------
> and fenced has the following logs:
> ------
> Dec 29 16:21:04 fenced fencing node pcs_linuxha_1
> Dec 29 16:21:04 fenced fence pcs_linuxha_1 dev 0.0 agent none result:
> error no method
> Dec 29 16:21:04 fenced fence pcs_linuxha_1 failed
> ------
> 
> 1. Do I require fencing to be enabled to make node failover work
> 2. If yes, what kind of failover device should I add. (all the nodes are
> simple servers.)
> 
> 
> Thanks!
> Achint

Yes, you absolutely needs fencing.

As soon as a node is lost, fenced informs dlm which then stops providing
locks. Only when the fence succeeds is dlm informed and will again issue
locks. In turn, rgmanager uses dlm, so with dlm not providing locks,
rgmanager can't recover services.

See this for a more specific explanation;

https://alteeve.com/w/2-Node_Red_Hat_KVM_Cluster_Tutorial#Concept.3B_Fencing

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From achintmehta at gmail.com  Thu Dec 29 22:21:37 2011
From: achintmehta at gmail.com (Achint Mehta)
Date: Thu, 29 Dec 2011 17:21:37 -0500
Subject: [Linux-cluster] Fencing required for node failover
In-Reply-To: <4EFCE46D.1000106@alteeve.com>
References: <CAGV5edFzmfyCRiru4KtKzcVX84BrjV2GKOq8p=hkLH_d6VV8-Q@mail.gmail.com>
	<4EFCE46D.1000106@alteeve.com>
Message-ID: <CAGV5edFbx3BPgUHcFrAZmE5ZMZgWbhPs=54xK17MzxaOPhvbAw@mail.gmail.com>

On Thu, Dec 29, 2011 at 5:06 PM, Digimer <linux at alteeve.com> wrote:

> On 12/29/2011 04:49 PM, Achint Mehta wrote:
> > Hi All,
> >
> > I am using RHCS in RHEL 6.2.
> >
> > I am trying to perform a failover for a node in the cluster.
> > All the services have fail-over configured on them with recovery method
> > set to relocate.
> > When the node foes down the services are not relocated to to another
> nodes.
> >
> > Though the node failure is detected by rgmanager:
> > ------
> > Dec 29 16:20:57 rgmanager State change: pcs_linuxha_1 DOWN
> > Dec 29 16:28:25 rgmanager Status Child Max set to 7
> > ------
> > and fenced has the following logs:
> > ------
> > Dec 29 16:21:04 fenced fencing node pcs_linuxha_1
> > Dec 29 16:21:04 fenced fence pcs_linuxha_1 dev 0.0 agent none result:
> > error no method
> > Dec 29 16:21:04 fenced fence pcs_linuxha_1 failed
> > ------
> >
> > 1. Do I require fencing to be enabled to make node failover work
> > 2. If yes, what kind of failover device should I add. (all the nodes are
> > simple servers.)
> >
> >
> > Thanks!
> > Achint
>
> Yes, you absolutely needs fencing.
>
> As soon as a node is lost, fenced informs dlm which then stops providing
> locks. Only when the fence succeeds is dlm informed and will again issue
> locks. In turn, rgmanager uses dlm, so with dlm not providing locks,
> rgmanager can't recover services.
>
> See this for a more specific explanation;
>
>
> https://alteeve.com/w/2-Node_Red_Hat_KVM_Cluster_Tutorial#Concept.3B_Fencing
>
> --
> Digimer
> E-Mail:              digimer at alteeve.com
> Freenode handle:     digimer
> Papers and Projects: http://alteeve.com
> Node Assassin:       http://nodeassassin.org
> "omg my singularity battery is dead again.
> stupid hawking radiation." - epitron
>

Thanks for the explanation.

It appears that until fence daemon performs the fencing activity the
recovery will not be made.
So I would have to setup fencing.

1. I am not sure what kind of fencing device to add for simple servers.
2. Is this fencing device supposed to be a dedicated machine outside of the
cluster.
3. Also, would fencing be successful if the node  to be fenced is not
reachable on the network.

--Achint
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111229/a458d602/attachment.htm>

From linux at alteeve.com  Thu Dec 29 22:35:41 2011
From: linux at alteeve.com (Digimer)
Date: Thu, 29 Dec 2011 17:35:41 -0500
Subject: [Linux-cluster] Fencing required for node failover
In-Reply-To: <CAGV5edFbx3BPgUHcFrAZmE5ZMZgWbhPs=54xK17MzxaOPhvbAw@mail.gmail.com>
References: <CAGV5edFzmfyCRiru4KtKzcVX84BrjV2GKOq8p=hkLH_d6VV8-Q@mail.gmail.com>
	<4EFCE46D.1000106@alteeve.com>
	<CAGV5edFbx3BPgUHcFrAZmE5ZMZgWbhPs=54xK17MzxaOPhvbAw@mail.gmail.com>
Message-ID: <4EFCEB3D.3010500@alteeve.com>

On 12/29/2011 05:21 PM, Achint Mehta wrote:
> Thanks for the explanation.
> 
> It appears that until fence daemon performs the fencing activity the
> recovery will not be made.
> So I would have to setup fencing.
> 
> 1. I am not sure what kind of fencing device to add for simple servers.

IPMI is popular, if your server supports it. Big vendors have their own
IPMI-like interfaces you can also use for fencing; HP use iLO, IBM uses
RSA, Dell uses DRAC, etc.

These all rely on the host server having power (though are otherwise
totally independent and can be used even when the server is shut off).
If the host node's power supply (or power feed) fails, then so do these.

Alternatively, or ideally, as a backup would be to use switched PDUs.
These are power bars that have network connections. You can turn off and
on individual outlets on these devices to cut the power to nodes.

Personally, I use IPMI for my primary fence device and a PDU as a backup.

> 2. Is this fencing device supposed to be a dedicated machine outside of
> the cluster.

Either/or. Devices which can confirm a node's state are best, like IPMI,
but as I mentioned, they suffer from sharing a power source. Switched
PDUs are totally independent, but they can't confirm a node is truly off.

> 3. Also, would fencing be successful if the node  to be fenced is not
> reachable on the network.

Provided the fence device is reachable, then the state of the node does
not matter.

I cover exactly how to implement fence devices on RHEL 6.2 here;

https://alteeve.com/w/2-Node_Red_Hat_KVM_Cluster_Tutorial#Defining_Fence_Devices

It includes examples using IPMI, APC switched PDUs and HP's iLO.

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From sathyanarayanan.varadharajan at precisionit.co.in  Fri Dec 30 06:24:38 2011
From: sathyanarayanan.varadharajan at precisionit.co.in (SATHYA - IT)
Date: Fri, 30 Dec 2011 11:54:38 +0530
Subject: [Linux-cluster] Fencing node automatically...
Message-ID: <005f01ccc6bb$b50cf110$1f26d330$@precisionit.co.in>

Hi,

 
On my two node cluster which got configured inRHEL 6.2 (with ctdb + gfs2 +
rgmanager + cman), primary node fences the secondary node automatically ( in
random). During this fence the message appears in the primary server is as
follows:

 
Dec 30 05:23:49 filesrv1 corosync[9065]:   [TOTEM ] A processor failed,
forming new configuration.

Dec 30 05:23:51 filesrv1 corosync[9065]:   [QUORUM] Members[1]: 1

Dec 30 05:23:51 filesrv1 corosync[9065]:   [TOTEM ] A processor joined or
left the membership and a new membership was formed.

Dec 30 05:23:51 filesrv1 kernel: dlm: closing connection to node 2

Dec 30 05:23:51 filesrv1 corosync[9065]:   [CPG   ] chosen downlist: sender
r(0) ip(10.0.0.10) ; members(old:2 left:1)

Dec 30 05:23:51 filesrv1 corosync[9065]:   [MAIN  ] Completed service
synchronization, ready to provide service.

Dec 30 05:23:51 filesrv1 rgmanager[12491]: State change: clustsrv2 DOWN

Dec 30 05:23:51 filesrv1 fenced[9122]: fencing node clustsrv2

Dec 30 05:23:51 filesrv1 kernel: GFS2: fsid=samba:ctdb.1: jid=0: Trying to
acquire journal lock...

Dec 30 05:23:51 filesrv1 kernel: GFS2: fsid=samba:gen01.1: jid=0: Trying to
acquire journal lock...

Dec 30 05:23:57 filesrv1 kernel: bnx2 0000:04:00.0: eth4: NIC Copper Link is
Down

Dec 30 05:23:57 filesrv1 kernel: bnx2 0000:03:00.1: eth3: NIC Copper Link is
Down

Dec 30 05:23:57 filesrv1 kernel: bonding: bond1: link status definitely down
for interface eth3, disabling it

Dec 30 05:23:57 filesrv1 kernel: bonding: bond1: now running without any
active interface !

Dec 30 05:23:57 filesrv1 kernel: bonding: bond1: link status definitely down
for interface eth4, disabling it

Dec 30 05:23:58 filesrv1 kernel: bnx2 0000:03:00.1: eth3: NIC Copper Link is
Up, 100 Mbps full duplex, receive & transmit flow control ON

Dec 30 05:23:58 filesrv1 kernel: bnx2 0000:04:00.0: eth4: NIC Copper Link is
Up, 100 Mbps full duplex, receive & transmit flow control ON

Dec 30 05:23:58 filesrv1 kernel: bond1: link status definitely up for
interface eth3, 100 Mbps full duplex.

Dec 30 05:23:58 filesrv1 kernel: bonding: bond1: making interface eth3 the
new active one.

Dec 30 05:23:58 filesrv1 kernel: bonding: bond1: first active interface up!

Dec 30 05:23:58 filesrv1 kernel: bond1: link status definitely up for
interface eth4, 100 Mbps full duplex.

Dec 30 05:23:59 filesrv1 kernel: bnx2 0000:04:00.0: eth4: NIC Copper Link is
Down

Dec 30 05:23:59 filesrv1 kernel: bnx2 0000:03:00.1: eth3: NIC Copper Link is
Down

Dec 30 05:23:59 filesrv1 kernel: bonding: bond1: link status definitely down
for interface eth3, disabling it

Dec 30 05:23:59 filesrv1 kernel: bonding: bond1: now running without any
active interface !

Dec 30 05:23:59 filesrv1 kernel: bonding: bond1: link status definitely down
for interface eth4, disabling it

Dec 30 05:24:00 filesrv1 fenced[9122]: fence clustsrv2 success

 
Can anyone please help why this is happening.

 
Thanks

 
Sathya Narayanan V

Solution Architect    

M +91 9940680173 |T +91 44 42199500  | Service Desk +91 44 42199521
SERVICE - In PRECISION IT is a PASSION
----------------------------------------------------------------------------
-----------------------------
Precision Infomatic (M) Pvt Ltd
22, 1st Floor, Habibullah Road, T. Nagar, Chennai - 600 017. India.
 <http://www.precisionit.co.in/> www.precisionit.co.in

 
This communication may contain confidential information. 
If you are not the intended recipient it may be unlawful for you to read, copy, distribute, disclose or otherwise use the information contained within this communication.. 
Errors and Omissions may occur in the contents of this Email arising out of or in connection with data transmission, network malfunction or failure, machine or software error, malfunction, or operator errors by the person who is sending the email. 
Precision Group accepts no responsibility for any such errors or omissions. The information, views and comments within this communication are those of the individual and not necessarily those of Precision Group. 
All email that is sent from/to Precision Group is scanned for the presence of computer viruses, security issues and inappropriate content. However, it is the recipient's responsibility to check any attachments for viruses before use.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111230/d1ae51ae/attachment.htm>

From linux at alteeve.com  Fri Dec 30 06:37:32 2011
From: linux at alteeve.com (Digimer)
Date: Fri, 30 Dec 2011 01:37:32 -0500
Subject: [Linux-cluster] Fencing node automatically...
In-Reply-To: <005f01ccc6bb$b50cf110$1f26d330$@precisionit.co.in>
References: <005f01ccc6bb$b50cf110$1f26d330$@precisionit.co.in>
Message-ID: <4EFD5C2C.9060001@alteeve.com>

On 12/30/2011 01:24 AM, SATHYA - IT wrote:
> Hi,
> 
> On my two node cluster which got configured inRHEL 6.2 (with ctdb + gfs2
> + rgmanager + cman), primary node fences the secondary node
> automatically ( in random). During this fence the message appears in the
> primary server is as follows:

That syslog snippet should have had some important bits before it.
Please re-paste the log files from that time period from both the victim
and the surviving host, starting from the first log message in the same
time period as the fence event, clear through to the surviving node has
restored services.

Also, please past your full cluster.conf file, and please only obfuscate
passwords only. Likewise, please paste your full network configuration.

With that said; You're probably facing transient network issues. Are
there any logs in your switches around the same time period? Is
multicast working properly? Is STP enabled and perhaps causing a
temporary block in network traffic?

The cluster certainly doesn't break randomly, there has to be a cause. :)

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From sslavic at gmail.com  Fri Dec 30 13:39:44 2011
From: sslavic at gmail.com (=?UTF-8?Q?Stevo_Slavi=C4=87?=)
Date: Fri, 30 Dec 2011 14:39:44 +0100
Subject: [Linux-cluster] CLVM/GFS2 distributed locking
Message-ID: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>

Hello RedHat Linux cluster community,

I'm in process of configuring shared filesystem storage master/slave Apache
ActiveMQ setup. For it to work, it requires reliable distributed locking -
master is node that holds exclusive lock on a file on shared filesystem
storage.

On RHEL (5.4), using CLVM with GFS2 is one of the options that should work.
Third party configured the CLVM/GFS2. I'd like to make sure that
distributed locking works OK.
What are my options for verifying this?

Thanks in advance!

Regards,
Stevo.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111230/b6d4e32d/attachment.htm>

From linux at alteeve.com  Fri Dec 30 18:46:55 2011
From: linux at alteeve.com (Digimer)
Date: Fri, 30 Dec 2011 13:46:55 -0500
Subject: [Linux-cluster] CLVM/GFS2 distributed locking
In-Reply-To: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>
References: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>
Message-ID: <4EFE071F.8030406@alteeve.com>

On 12/30/2011 08:39 AM, Stevo Slavi? wrote:
> Hello RedHat Linux cluster community,
> 
> I'm in process of configuring shared filesystem storage master/slave
> Apache ActiveMQ setup. For it to work, it requires reliable distributed
> locking - master is node that holds exclusive lock on a file on shared
> filesystem storage.
> 
> On RHEL (5.4), using CLVM with GFS2 is one of the options that should work.
> Third party configured the CLVM/GFS2. I'd like to make sure that
> distributed locking works OK.
> What are my options for verifying this?
> 
> Thanks in advance!
> 
> Regards,
> Stevo.

I've not had experience with Apache HA specifically, so I can only give
generalized comments.

DLM is very reliable... Red Hat's flagship clustered filesystem, as well
as it's cluster resource manager, rgmanager, both make extensive use of
DLM. If it wasn't reliable, there would be very big problems for Red Hat. ;)

The main issue with DLM is that you have to have tested and working
fencing. When a node is lost, fenced blocks DLM so anything using DLM
will also block until the fence action completes. This is by design
because you don't want nodes making changes while the cluster is in an
unknown state.

The other issue is performance; Your locks are, by necessity, travelling
over a network. You need to ensure that you network latency is as
minimal as possible. You will want to allocate time to tune DLM and GFS2
to make sure you've got decent performance. This is the nature of
clustered locking, more than anything specific to DLM.

For GFS2, one of the easiest performance wins is to set
'noatime,nodiratime' in the mount options to avoid requiring locks to
update the access times on files when you only read them.

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From yvette at dbtgroup.com  Fri Dec 30 19:30:53 2011
From: yvette at dbtgroup.com (yvette hirth)
Date: Fri, 30 Dec 2011 19:30:53 +0000
Subject: [Linux-cluster] CLVM/GFS2 distributed locking
In-Reply-To: <4EFE071F.8030406@alteeve.com>
References: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>
	<4EFE071F.8030406@alteeve.com>
Message-ID: <4EFE116D.3080505@dbtgroup.com>

Digimer wrote:

> For GFS2, one of the easiest performance wins is to set
> 'noatime,nodiratime' in the mount options to avoid requiring locks to
> update the access times on files when you only read them.

i've found that "noatime" implies "nodiratime", so both are not needed - 
unless GFS/GFS2 behaves differently than other fs's wrt this attribute. 
  if so, that would be good to know for certain.

see here:  http://lwn.net/Articles/245002/

the article didn't specify the filesystem...

yvette


From sslavic at gmail.com  Fri Dec 30 20:08:37 2011
From: sslavic at gmail.com (=?UTF-8?Q?Stevo_Slavi=C4=87?=)
Date: Fri, 30 Dec 2011 21:08:37 +0100
Subject: [Linux-cluster] CLVM/GFS2 distributed locking
In-Reply-To: <4EFE116D.3080505@dbtgroup.com>
References: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>
	<4EFE071F.8030406@alteeve.com> <4EFE116D.3080505@dbtgroup.com>
Message-ID: <CAAUywg8WaxF0WiotOtQA9u_WoqkcsxGnkA1hVMDX71oYst530Q@mail.gmail.com>

Hi Digimer and Yvette,

Thanks for tips! I don't doubt reliability of the technology, just want to
make sure it is configured well.

After fencing a node that held a lock on a file on shared storage, lock
remains, and non-fenced node cannot take over the lock on that file.
Wondering how can one check which process (from which node if possible) is
holding a lock on a file on shared storage.
dlm should have taken care of releasing the lock once node got fenced,
right?

Regards,
Stevo.

On Fri, Dec 30, 2011 at 8:30 PM, yvette hirth <yvette at dbtgroup.com> wrote:

> Digimer wrote:
>
>  For GFS2, one of the easiest performance wins is to set
>> 'noatime,nodiratime' in the mount options to avoid requiring locks to
>> update the access times on files when you only read them.
>>
>
> i've found that "noatime" implies "nodiratime", so both are not needed -
> unless GFS/GFS2 behaves differently than other fs's wrt this attribute.  if
> so, that would be good to know for certain.
>
> see here:  http://lwn.net/Articles/**245002/<http://lwn.net/Articles/245002/>
>
> the article didn't specify the filesystem...
>
> yvette
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/**mailman/listinfo/linux-cluster<https://www.redhat.com/mailman/listinfo/linux-cluster>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111230/4cb71d59/attachment.htm>

From linux at alteeve.com  Fri Dec 30 20:23:03 2011
From: linux at alteeve.com (Digimer)
Date: Fri, 30 Dec 2011 15:23:03 -0500
Subject: [Linux-cluster] CLVM/GFS2 distributed locking
In-Reply-To: <CAAUywg8WaxF0WiotOtQA9u_WoqkcsxGnkA1hVMDX71oYst530Q@mail.gmail.com>
References: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>
	<4EFE071F.8030406@alteeve.com> <4EFE116D.3080505@dbtgroup.com>
	<CAAUywg8WaxF0WiotOtQA9u_WoqkcsxGnkA1hVMDX71oYst530Q@mail.gmail.com>
Message-ID: <4EFE1DA7.3020506@alteeve.com>

On 12/30/2011 03:08 PM, Stevo Slavi? wrote:
> Hi Digimer and Yvette,
> 
> Thanks for tips! I don't doubt reliability of the technology, just want
> to make sure it is configured well.
> 
> After fencing a node that held a lock on a file on shared storage, lock
> remains, and non-fenced node cannot take over the lock on that file.
> Wondering how can one check which process (from which node if possible)
> is holding a lock on a file on shared storage.
> dlm should have taken care of releasing the lock once node got fenced,
> right?
> 
> Regards,
> Stevo.

After a successful fence call, DLM will clean up any locks held by the
lost node. That's why it's so critical that the fence action succeeded
(ie: test-test-test). If a node doesn't actually die in a fence, but the
cluster thinks it did, and somehow the lost node returns, the lost node
will think it's locks are still valid and modify shared storage, leading
to near-certain data corruption.

It's all perfectly safe, provided you've tested your fencing properly. :)

Yvette,

  You might be right on the 'noatime' implying 'nodiratime'... I add
both out of habit.

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron


From sslavic at gmail.com  Fri Dec 30 20:37:00 2011
From: sslavic at gmail.com (=?UTF-8?Q?Stevo_Slavi=C4=87?=)
Date: Fri, 30 Dec 2011 21:37:00 +0100
Subject: [Linux-cluster] CLVM/GFS2 distributed locking
In-Reply-To: <4EFE1DA7.3020506@alteeve.com>
References: <CAAUywg8PbgJKY6+uBT-udBZYd3PQk-eyEqqBJWYdxhcxw27Vzg@mail.gmail.com>
	<4EFE071F.8030406@alteeve.com> <4EFE116D.3080505@dbtgroup.com>
	<CAAUywg8WaxF0WiotOtQA9u_WoqkcsxGnkA1hVMDX71oYst530Q@mail.gmail.com>
	<4EFE1DA7.3020506@alteeve.com>
Message-ID: <CAAUywg-OJ-BPbJKsEfRK6Z=YbsrcWBczdLGvirjJ-92Mk0Cuxw@mail.gmail.com>

Pulling the cables between shared storage and foo01, foo01 gets fenced.
Here is some info from foo02 about shared storage and dlm debug (lock file
seems to remain locked)

root at foo02:-//data/activemq_data#ls -li
total 276
 66467 -rw-r--r-- 1 root root 33030144 Dec 30 16:32 db-1.log
 66468 -rw-r--r-- 1 root root    73728 Dec 30 16:24 db.data
 66470 -rw-r--r-- 1 root root    53344 Dec 30 16:24 db.redo
128014 -rw-r--r-- 1 root root        0 Dec 30 19:49 dummy
 66466 -rw-r--r-- 1 root root        0 Dec 30 16:23 lock
root at foo02:-//data/activemq_data#grep -A 7 -i 103a2 /debug/dlm/activemq
Resource ffff81090faf96c0 Name (len=24) "       2           103a2"
Master Copy
Granted Queue
03d10002 PR Remote:   1 00c80001
00e00001 PR
Conversion Queue
Waiting Queue
--
Resource ffff81090faf97c0 Name (len=24) "       5           103a2"
Master Copy
Granted Queue
03c30003 PR Remote:   1 039a0001
03550001 PR
Conversion Queue
Waiting Queue


Are there some docs for interpreting this dlm debug output?


Regards,
Stevo.

On Fri, Dec 30, 2011 at 9:23 PM, Digimer <linux at alteeve.com> wrote:

> On 12/30/2011 03:08 PM, Stevo Slavi? wrote:
> > Hi Digimer and Yvette,
> >
> > Thanks for tips! I don't doubt reliability of the technology, just want
> > to make sure it is configured well.
> >
> > After fencing a node that held a lock on a file on shared storage, lock
> > remains, and non-fenced node cannot take over the lock on that file.
> > Wondering how can one check which process (from which node if possible)
> > is holding a lock on a file on shared storage.
> > dlm should have taken care of releasing the lock once node got fenced,
> > right?
> >
> > Regards,
> > Stevo.
>
> After a successful fence call, DLM will clean up any locks held by the
> lost node. That's why it's so critical that the fence action succeeded
> (ie: test-test-test). If a node doesn't actually die in a fence, but the
> cluster thinks it did, and somehow the lost node returns, the lost node
> will think it's locks are still valid and modify shared storage, leading
> to near-certain data corruption.
>
> It's all perfectly safe, provided you've tested your fencing properly. :)
>
> Yvette,
>
>  You might be right on the 'noatime' implying 'nodiratime'... I add
> both out of habit.
>
> --
> Digimer
> E-Mail:              digimer at alteeve.com
> Freenode handle:     digimer
> Papers and Projects: http://alteeve.com
> Node Assassin:       http://nodeassassin.org
> "omg my singularity battery is dead again.
> stupid hawking radiation." - epitron
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111230/89531f0f/attachment.htm>

From akaris at gmail.com  Fri Dec 30 21:16:46 2011
From: akaris at gmail.com (Michel Nadeau)
Date: Fri, 30 Dec 2011 16:16:46 -0500
Subject: [Linux-cluster] CMAN across different datacenters
Message-ID: <CA+i7LTdODWzsaE9ZN6DssfPM7Dz6BqpXfyWPG41rGCP94VvK1g@mail.gmail.com>

Hi,

We're trying to configure a CMAN cluster with 2 nodes located in 2
different datacenters.

The 2 nodes are running Debian 6 and they can access each other on the
private LAN (using the eth0 interface).

The problem is that the 2 nodes don't have the same subnet and the
multicast doesn't seem to work: is there any way to make this work?

Thanks,

- Mike
akaris at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111230/dc982ba3/attachment.htm>

From andreas at hastexo.com  Sat Dec 31 00:32:33 2011
From: andreas at hastexo.com (Andreas Kurz)
Date: Sat, 31 Dec 2011 01:32:33 +0100
Subject: [Linux-cluster] CMAN across different datacenters
In-Reply-To: <CA+i7LTdODWzsaE9ZN6DssfPM7Dz6BqpXfyWPG41rGCP94VvK1g@mail.gmail.com>
References: <CA+i7LTdODWzsaE9ZN6DssfPM7Dz6BqpXfyWPG41rGCP94VvK1g@mail.gmail.com>
Message-ID: <4EFE5821.1020602@hastexo.com>

Hello,

On 12/30/2011 10:16 PM, Michel Nadeau wrote:
> Hi,
> 
> We're trying to configure a CMAN cluster with 2 nodes located in 2
> different datacenters.

only one of various problems of split-site cluster: how do you plan to
implement reliable fencing?

> 
> The 2 nodes are running Debian 6 and they can access each other on the
> private LAN (using the eth0 interface).
> 
> The problem is that the 2 nodes don't have the same subnet and the
> multicast doesn't seem to work: is there any way to make this work?

since 1.3.0 corosync supports unicasts (UDPU) ... it ships with a nice
example configuration.

Regards,
Andreas

-- 
Need help with Corosync?
http://www.hastexo.com/now

> 
> Thanks,
> 
> - Mike
> akaris at gmail.com <mailto:akaris at gmail.com>
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 286 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111231/be5d6762/attachment.sig>

From akaris at gmail.com  Sat Dec 31 03:13:57 2011
From: akaris at gmail.com (Michel Nadeau)
Date: Sat, 31 Dec 2011 03:13:57 +0000
Subject: [Linux-cluster] CMAN across different datacenters
In-Reply-To: <4EFE5821.1020602@hastexo.com>
References: <CA+i7LTdODWzsaE9ZN6DssfPM7Dz6BqpXfyWPG41rGCP94VvK1g@mail.gmail.com>
	<4EFE5821.1020602@hastexo.com>
Message-ID: <857731098-1325301235-cardhu_decombobulator_blackberry.rim.net-412261665-@b15.c31.bise6.blackberry>

We usually use only cman, gfs2 and drbd - does corosync replace cman or it's an addon?
-----Original Message-----
From: Andreas Kurz <andreas at hastexo.com>
Sender: linux-cluster-bounces at redhat.com
Date: Sat, 31 Dec 2011 01:32:33 
To: <linux-cluster at redhat.com>
Reply-To: linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] CMAN across different datacenters

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From achintmehta at gmail.com  Sat Dec 31 03:41:26 2011
From: achintmehta at gmail.com (Achint Mehta)
Date: Fri, 30 Dec 2011 22:41:26 -0500
Subject: [Linux-cluster] CMAN across different datacenters
In-Reply-To: <857731098-1325301235-cardhu_decombobulator_blackberry.rim.net-412261665-@b15.c31.bise6.blackberry>
References: <CA+i7LTdODWzsaE9ZN6DssfPM7Dz6BqpXfyWPG41rGCP94VvK1g@mail.gmail.com>
	<4EFE5821.1020602@hastexo.com>
	<857731098-1325301235-cardhu_decombobulator_blackberry.rim.net-412261665-@b15.c31.bise6.blackberry>
Message-ID: <CAGV5edHijewhiOm50HKvQ5ACmjkReG4c=UCG__vLEMYjXHuHWA@mail.gmail.com>

On Fri, Dec 30, 2011 at 10:13 PM, Michel Nadeau <akaris at gmail.com> wrote:

> We usually use only cman, gfs2 and drbd - does corosync replace cman or
> it's an addon?
> -----Original Message-----
> From: Andreas Kurz <andreas at hastexo.com>
> Sender: linux-cluster-bounces at redhat.com
> Date: Sat, 31 Dec 2011 01:32:33
> To: <linux-cluster at redhat.com>
> Reply-To: linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] CMAN across different datacenters
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>

corosync is the messaging layer allowing cluster nodes to talk with each
other.
cman/rgmanager should sit on top of it.


try the following commands:
service corosync status
If the status is running the you are using it, otherwise not.

Also look into the /var/log/messages file for corosync.
If you are using corosync then there should be prints in there for corosync.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111230/48d2a477/attachment.htm>

From linux at alteeve.com  Sat Dec 31 04:39:03 2011
From: linux at alteeve.com (Digimer)
Date: Fri, 30 Dec 2011 23:39:03 -0500
Subject: [Linux-cluster] Fencing node automatically...
In-Reply-To: <003101ccc76d$e644e820$b2ceb860$@precisionit.co.in>
References: <003101ccc76d$e644e820$b2ceb860$@precisionit.co.in>
Message-ID: <4EFE91E7.9080607@alteeve.com>

On 12/30/2011 10:40 PM, SATHYA - IT wrote:
> Hi,
> 
> Herewith attaching the logs and configuration files for ref. Kindly assist.
> 
> Thanks

=================
Dec 25 11:11:26 filesrv1 corosync[9061]:   [TOTEM ] A processor failed,
forming new configuration.
Dec 25 11:11:26 filesrv1 kernel: bnx2 0000:03:00.1: eth3: NIC Copper
Link is Down
Dec 25 11:11:26 filesrv1 kernel: bonding: bond1: link status definitely
down for interface eth3, disabling it
Dec 25 11:11:26 filesrv1 kernel: bonding: bond1: making interface eth4
the new active one.
Dec 25 11:11:27 filesrv1 kernel: bnx2 0000:04:00.0: eth4: NIC Copper
Link is Down
Dec 25 11:11:27 filesrv1 kernel: bonding: bond1: link status definitely
down for interface eth4, disabling it
Dec 25 11:11:27 filesrv1 kernel: bonding: bond1: now running without any
active interface !
Dec 25 11:11:28 filesrv1 corosync[9061]:   [QUORUM] Members[1]: 1
Dec 25 11:11:28 filesrv1 corosync[9061]:   [TOTEM ] A processor joined
or left the membership and a new membership was formed.
Dec 25 11:11:28 filesrv1 rgmanager[12538]: State change: clustsrv2 DOWN
Dec 25 11:11:28 filesrv1 corosync[9061]:   [CPG   ] chosen downlist:
sender r(0) ip(10.0.0.10) ; members(old:2 left:1)
Dec 25 11:11:28 filesrv1 corosync[9061]:   [MAIN  ] Completed service
synchronization, ready to provide service.
Dec 25 11:11:28 filesrv1 kernel: dlm: closing connection to node 2
Dec 25 11:11:28 filesrv1 kernel: GFS2: fsid=samba:ctdb.1: jid=0: Trying
to acquire journal lock...
Dec 25 11:11:28 filesrv1 kernel: GFS2: fsid=samba:gen01.1: jid=0: Trying
to acquire journal lock...
Dec 25 11:11:28 filesrv1 fenced[9120]: fencing node clustsrv2
=================

Do you have the servers directly connected to one another? I don't see
the fence message until a full 2 seconds after the link dropped.

=================
Dec 25 03:30:06 filesrv2 kernel: imklog 4.6.2, log source = /proc/kmsg
started.
Dec 25 03:30:06 filesrv2 rsyslogd: [origin software="rsyslogd"
swVersion="4.6.2" x-pid="8660" x-info="http://www.rsyslog.com"] (re)start
Dec 25 11:14:56 filesrv2 kernel: imklog 4.6.2, log source = /proc/kmsg
started.
Dec 25 11:14:56 filesrv2 rsyslogd: [origin software="rsyslogd"
swVersion="4.6.2" x-pid="8811" x-info="http://www.rsyslog.com"] (re)start
Dec 25 11:14:56 filesrv2 kernel: Initializing cgroup subsys cpuset
Dec 25 11:14:56 filesrv2 kernel: Initializing cgroup subsys cpu
Dec 25 11:14:56 filesrv2 kernel: Linux version 2.6.32-220.el6.x86_64
(mockbuild at x86-004.build.bos.redhat.com) (gcc version 4.4.5 20110214
(Red Hat 4.4.5-6) (GCC) ) #1 SMP Wed Nov 9 08:03:13 EST 2011
Dec 25 11:14:56 filesrv2 kernel: Command line: ro
root=/dev/mapper/vg_filesrv2-LogVol01 rd_LVM_LV=vg_filesrv2/LogVol01
rd_LVM_LV=vg_filesrv2/LogVol00 rd_NO_LUKS rd_NO_MD rd_NO_DM
LANG=en_US.UTF-8 SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us
crashkernel=128M rhgb quiet acpi=off
Dec 25 11:14:56 filesrv2 kernel: KERNEL supported cpus:
=================

If this node failed, it failed hard as nothing got written to the logs.
Normally with network issues, you would expect to see "failed -> fence
-> network down" on the survivor and at least some portion of this on
the victim. That it just flat out died tells me that something else took
out the lost server, and what you saw was from the cluster is the
results of recovering from that loss.

I do see a crash in the second machine at 11:41:32 on Dec. 26, but there
doesn't seem to be any corresponding data on the first node. Are the
times in sync?

Lastly, I see the '[TOTEM ] Retransmit List:' list bug on the first
server, but not the second one. Are both nodes fully up to date? If they
are, and if you have a RHEL subscription, it might be worth talking to
your support contact.

In short, you seem to have multiple issues. Not entirely sure if they're
related or not, but possibly not which would make debugging tricky. Go
through both servers logs (that you attached here) and look closely at
these issues. Investigate them and see where that takes you.

Cheers

-- 
Digimer
E-Mail:              digimer at alteeve.com
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"omg my singularity battery is dead again.
stupid hawking radiation." - epitron