From ooolinux at 163.com Tue Mar 1 02:03:44 2011 From: ooolinux at 163.com (yue) Date: Tue, 1 Mar 2011 10:03:44 +0800 (CST) Subject: [Linux-cluster] if there is no cman.ko anymore In-Reply-To: <4D6BDD25.8040506@alteeve.com> References: <4D6BDD25.8040506@alteeve.com> <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com> Message-ID: <7976fa5c.2b36.12e6f2823bf.Coremail.ooolinux@163.com> 1.the document says there is a cman.ko ,rhel5. link is http://www.linuxtopia.org/online_books/rhel5/rhel5_clustering_guide/rhel5_cluster_s1-ha-components-CSO.html 2. i use fedara12. cman version is cman-3.0.13-1.x86_64.rpm so i want to know when redhat do not need cman.ko anymore? 3. i am going to test gfs2+clvm. would you give me any suggestion on optimization? thanks At 2011-03-01 01:36:37?Digimer wrote: >On 02/28/2011 12:16 PM, yue wrote: >> my kernel 2.6.32,fc12 >> cman 3.0.17 >> i install cman.rpm >> but i search no cman.ko, redhat cluster can work . >> if there is not cman.ko anymore? >> >> thanks > >I can't speak to this specific version, but I can confirm that CMAN is >going away (in fact, I think it is already gone from 3.1). > >-- >Digimer >E-Mail: digimer at alteeve.com >AN!Whitepapers: http://alteeve.com >Node Assassin: http://nodeassassin.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From kawasaki at redhat.com Tue Mar 1 04:20:50 2011 From: kawasaki at redhat.com (Tatsuo Kawasaki) Date: Tue, 01 Mar 2011 13:20:50 +0900 Subject: [Linux-cluster] if there is no cman.ko anymore In-Reply-To: <7976fa5c.2b36.12e6f2823bf.Coremail.ooolinux@163.com> References: <4D6BDD25.8040506@alteeve.com> <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com> <7976fa5c.2b36.12e6f2823bf.Coremail.ooolinux@163.com> Message-ID: <4D6C7422.6020203@redhat.com> Hi yue, I think this is an error in the document. cman.ko kernel module is required for RHEL4 based cluster suite. ex. RHEL4.8: cman-1.0.27-1.el4.x86_64.rpm cman-kernel-2.6.9-56.7.el4_8.9.x86_64.rpm rpm -ql cman-kernel /lib/modules/2.6.9-89.0.16.EL/kernel/cluster /lib/modules/2.6.9-89.0.16.EL/kernel/cluster/cman.ko /lib/modules/2.6.9-89.0.16.EL/kernel/cluster/cman.symvers Regards, -- Tatsuo Kawasaki On 03/01/2011 11:03 AM, yue wrote: > 1.the document says there is a cman.ko ,rhel5. link is > http://www.linuxtopia.org/online_books/rhel5/rhel5_clustering_guide/rhel5_cluster_s1-ha-components-CSO.html > > 2. i use fedara12. cman version is cman-3.0.13-1.x86_64.rpm > > so i want to know when redhat do not need cman.ko anymore? > > 3. i am going to test gfs2+clvm. > would you give me any suggestion on optimization? > > thanks > > > At 2011-03-01 01:36:37??Digimer wrote: > >>On 02/28/2011 12:16 PM, yue wrote: >>> my kernel 2.6.32,fc12 >>> cman 3.0.17 >>> i install cman.rpm >>> but i search no cman.ko, redhat cluster can work . >>> if there is not cman.ko anymore? >>> >>> thanks >> >>I can't speak to this specific version, but I can confirm that CMAN is >>going away (in fact, I think it is already gone from 3.1). From ccaulfie at redhat.com Tue Mar 1 09:25:15 2011 From: ccaulfie at redhat.com (Christine Caulfield) Date: Tue, 01 Mar 2011 09:25:15 +0000 Subject: [Linux-cluster] if there is no cman.ko anymore In-Reply-To: <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com> References: <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com> Message-ID: <4D6CBB7B.2020801@redhat.com> On 28/02/11 17:16, yue wrote: > my kernel 2.6.32,fc12 > cman 3.0.17 > i install cman.rpm > but i search no cman.ko, redhat cluster can work . > if there is not cman.ko anymore? 
> thanks > There is no cman kernel module in RHEL5 and above. It's a module of openais/corosync and runs in userspace. Chrissie From szhargrave at ybs.co.uk Tue Mar 1 11:10:45 2011 From: szhargrave at ybs.co.uk (Simon Hargrave) Date: Tue, 1 Mar 2011 11:10:45 +0000 Subject: [Linux-cluster] lvm2-cluster not syncing correctly? In-Reply-To: <20110225085519297.00000004632@H04405> References: <20110224161352225.00000004632@H04405> <1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> <20110224173814596.00000004632@H04405> <09f63f66e569f668e219115b5e26f55f@sjolshagen.net> <20110225085519297.00000004632@H04405> Message-ID: <20110301111045292.00000001752@H04405> Please read the warning at the end of this email ________________________________________________ I just noticed the following errata released which appears to describe my problem: - http://rhn.redhat.com/errata/RHBA-2011-0288.html Suggesting that reads of metadata were no always using O_DIRECT and doing buffered reads. However, having applied this update, the symptoms persist. I'll raise this as a support call. - Simon Hargrave szhargrave at ybs.co.uk Technical Services Team Leader x2831 Yorkshire Building Society 01274 472831 http://wwwtech/sysint/tsgcore.asp ________________________________________________ This email and any attachments are confidential and may contain privileged information. If you are not the person for whom they are intended please return the email and then delete all material from any computer. You must not use the email or attachments for any purpose, nor disclose its contents to anyone other than the intended recipient. Any statements made by an individual in this email do not necessarily reflect the views of the Yorkshire Building Society Group. ________________________________________________ Yorkshire Building Society, which is authorised and regulated by the Financial Services Authority, chooses to introduce its customers to Legal & General for the purposes of advising on and arranging life assurance and investment products bearing Legal & General?s name. We are entered in the FSA Register and our FSA registration number is 106085 http://www.fsa.gov.uk/register Head Office: Yorkshire Building Society, Yorkshire House, Yorkshire Drive, Bradford, BD5 8LJ Tel: 0845 1 200 100 Visit Our Website http://www.ybs.co.uk All communications with us may be monitored/recorded to improve the quality of our service and for your protection and security. ________________________________________________________________________ This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. For more information on a proactive anti-virus service working around the clock, around the globe, visit: http://www.star.net.uk ________________________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From parvez.h.shaikh at gmail.com Tue Mar 1 13:20:18 2011 From: parvez.h.shaikh at gmail.com (Parvez Shaikh) Date: Tue, 1 Mar 2011 18:50:18 +0530 Subject: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent In-Reply-To: <20110228161406.GA14120@redhat.com> References: <20110228161406.GA14120@redhat.com> Message-ID: Hi Ryan, Thank you for response. Does it mean there is no way to intimate administrator about failure of fencing as of now? Let me give more information about my cluster - I have set of nodes in cluster with only IP resource being protected. 
I have two levels of fencing: first BladeCenter fencing, and the second one is manual fencing. At times, if a machine is already down (either power failure or turned off abruptly), BladeCenter fencing times out and manual fencing happens. At this point the administrator is expected to run fence_ack_manual. Clearly this is not desirable, as the downtime of services lasts until the administrator runs fence_ack_manual.

What is the recommended method to deal with BladeCenter fencing failure in this situation? Do I have to add another level of fencing (between BladeCenter and manual) which can fence automatically (not requiring manual intervention)?

Thanks

On Mon, Feb 28, 2011 at 9:44 PM, Ryan O'Hara wrote:
> On Mon, Feb 28, 2011 at 12:43:10PM +0530, Parvez Shaikh wrote:
> > Hi all,
> >
> > I have a question related to fence agents and SNMP alarms.
> >
> > A fence agent can fail to fence the failed node for various reasons; e.g. with
> > my BladeCenter fencing agent, I sometimes get a message saying BladeCenter
> > fencing failed because of a timeout, or because the fence device IP address/user
> > credentials are incorrect.
> >
> > In such a situation is it possible to generate an SNMP trap?
>
> This feature will be in RHEL6.1. There is a new project called
> 'foghorn' that creates SNMPv2 traps from dbus signals.
>
> git://git.fedorahosted.org/foghorn.git
>
> In RHEL6.1 (and the latest upstream release), certain cluster
> components will emit dbus signals when certain events occur. This
> includes fencing. So when a node is fenced, a dbus signal is generated
> by fenced. The foghorn service catches this signal and generates an
> SNMPv2 trap.
>
> Note that foghorn runs as an AgentX subagent, so snmpd must be running
> as the master agentx.
>
> Ryan
>
> > My cluster config file looks like below and in my case, if BladeCenter
> > fencing fails, manual fencing kicks in and requires the user to run
> > fence_ack_manual; for this the user must at least be notified via SNMP (or any
> > other mechanism?) to intervene -
> >
> > ....
> > login="USERID" name="BladeCenterFencing" passwd="PASSW0RD"/>
> > ....
> >
> > Thanks,
> > Parvez

From mailtoaneeshvs at gmail.com Tue Mar 1 13:47:44 2011
From: mailtoaneeshvs at gmail.com (aneesh vs)
Date: Tue, 1 Mar 2011 19:17:44 +0530
Subject: [Linux-cluster] lvm2-cluster not syncing correctly?
In-Reply-To: <20110301111045292.00000001752@H04405>
References: <20110224161352225.00000004632@H04405> <1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> <20110224173814596.00000004632@H04405> <09f63f66e569f668e219115b5e26f55f@sjolshagen.net> <20110225085519297.00000004632@H04405> <20110301111045292.00000001752@H04405>
Message-ID:

Hello,

Does "clvmd -R" on all nodes make any difference?
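A minimal sketch of that check, assuming the shared volume group is the vgHBPOCSHARED shown later in this thread; "clvmd -R" asks every clvmd daemon in the cluster to refresh its device cache, so run it once and then compare what each node reports:

  # on any one cluster node: tell every clvmd in the cluster to refresh its cache
  clvmd -R
  # then on each node, compare the logical volumes that are actually visible
  lvs vgHBPOCSHARED
  vgdisplay -v vgHBPOCSHARED | grep -i "LV Name"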
On Tue, Mar 1, 2011 at 4:40 PM, Simon Hargrave wrote: > Please read the warning at the end of this email > ________________________________________________ > > > I just noticed the following errata released which appears to describe my > problem: - > > http://rhn.redhat.com/errata/RHBA-2011-0288.html > > Suggesting that reads of metadata were no always using O_DIRECT and doing > buffered reads. However, having applied this update, the symptoms persist. > > I'll raise this as a support call. > > - > Simon Hargrave szhargrave at ybs.co.uk > Technical Services Team Leader x2831 > Yorkshire Building Society 01274 472831 > http://wwwtech/sysint/tsgcore.asp > > ________________________________________________ > > This email and any attachments are confidential and may contain privileged > information. > > If you are not the person for whom they are intended please return the > email and then delete all material from any computer. You must not use the > email or attachments for any purpose, nor disclose its contents to anyone > other than the intended recipient. > > Any statements made by an individual in this email do not necessarily > reflect the views of the Yorkshire Building Society Group. > > ________________________________________________ > > Yorkshire Building Society, which is authorised and regulated by the > Financial Services Authority, chooses to introduce its customers to Legal & > General for the purposes of advising on and arranging life assurance and > investment products bearing Legal & General?s name. > > > We are entered in the FSA Register and our FSA registration number is > 106085 http://www.fsa.gov.uk/register > > Head Office: Yorkshire Building Society, Yorkshire House, Yorkshire Drive, > Bradford, BD5 8LJ > Tel: 0845 1 200 100 > > Visit Our Website > http://www.ybs.co.uk > > All communications with us may be monitored/recorded to improve the quality > of our service and for your protection and security. > > > > ________________________________________________________________________ > This e-mail has been scanned for all viruses by Star. The > service is powered by MessageLabs. For more information on a proactive > anti-virus service working around the clock, around the globe, visit: > http://www.star.net.uk > ________________________________________________________________________ > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ajb2 at mssl.ucl.ac.uk Tue Mar 1 13:50:38 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Tue, 01 Mar 2011 13:50:38 +0000 Subject: [Linux-cluster] How fast can rsync be on GFS2? In-Reply-To: <4D686679.40104@logik-internet.rs> References: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> <4D686679.40104@logik-internet.rs> Message-ID: <4D6CF9AE.8050004@mssl.ucl.ac.uk> Nikola Savic wrote: > Rsync is very slow in creating file > list, little faster than 100files/s. That's about what I see too. Ditto on reading. From szhargrave at ybs.co.uk Tue Mar 1 14:45:23 2011 From: szhargrave at ybs.co.uk (Simon Hargrave) Date: Tue, 1 Mar 2011 14:45:23 +0000 Subject: [Linux-cluster] lvm2-cluster not syncing correctly? 
In-Reply-To: References: <20110224161352225.00000004632@H04405> <1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> <20110224173814596.00000004632@H04405> <09f63f66e569f668e219115b5e26f55f@sjolshagen.net> <20110225085519297.00000004632@H04405> <20110301111045292.00000001752@H04405> Message-ID: <20110301144523300.00000004068@H04405> Please read the warning at the end of this email ________________________________________________ > Does "clvmd -R" on all nodes makes any difference? No it doesn't. Interesting that the lvs command in verbose mode does seem to see the test LV during scan, but not display it at the end: - [root at ybsxlx87 ~]# lvs -vv 2>&1 | tail -30 /dev/vgHBPOCSHARED/lvinv01: size is 2097152 sectors /dev/vgHBPOCSHARED/lvmanaged: size is 41943040 sectors /dev/vgHBPOCSHARED/lvmanaged: size is 41943040 sectors /dev/vgHBPOCSHARED/lvinstance: size is 41943040 sectors /dev/vgHBPOCSHARED/lvinstance: size is 41943040 sectors /dev/vgHBPOCSHARED/lvaserver: size is 41943040 sectors /dev/vgHBPOCSHARED/lvaserver: size is 41943040 sectors /dev/vgHBPOCSHARED/lvcluster: size is 41943040 sectors /dev/vgHBPOCSHARED/lvcluster: size is 41943040 sectors /dev/vgHBPOCSHARED/test: size is 2097152 sectors /dev/vgHBPOCSHARED/test: size is 2097152 sectors LV VG #Seg Attr LSize Maj Min KMaj KMin Origin Snap% Move Copy% Log Convert LV UUID esmlv vg00 1 -wi-ao 480.00M -1 -1 253 9 nMkgWo-FHiE-HcNR-Gejh-aGQu-5jdi-5WuzUl lvol1 vg00 1 -wi-ao 1.00G -1 -1 253 7 gGMvIQ-A1YZ-Rdfj-IuUI-r0pw-o2uE-0TO4t2 lvol2 vg00 1 -wi-ao 4.00G -1 -1 253 17 An7J2B-W2l6-tSFl-M8Ud-rf2o-1mX9-RqP2MX lvol3 vg00 1 -wi-ao 3.91G -1 -1 253 10 NuaTa1-L911-PAYp-x8DL-ccLI-rZvK-tJRNf2 lvol4 vg00 1 -wi-ao 1.00G -1 -1 253 8 dy6xdZ-DMYh-ykjw-FhvT-FeLa-SJSA-xn26qR lvol5 vg00 1 -wi-ao 1.00G -1 -1 253 11 jVBGid-k9ke-kxwt-TIHS-00jW-gHwU-MjyG3h lvol6 vg00 1 -wi-ao 256.00M -1 -1 253 14 36n8Dy-OtBd-QDs1-2L71-kmrx-5vLP-olvHcW netbackuplv vg00 1 -wi-ao 512.00M -1 -1 253 16 3XhLTa-KKOO-ldZd-bd2h-bQGQ-Nj2C-eiqLkK tivolilv vg00 1 -wi-ao 64.00M -1 -1 253 18 9wVbG3-AYzn-31VY-uS1v-nuFz-ZMnH-yk6obG u001lv vg00 1 -wi-ao 5.00G -1 -1 253 13 ETn1bt-GZlk-q67D-JjMJ-Tvse-RbUe-dZ82LJ u003lv vg00 1 -wi-ao 512.00M -1 -1 253 15 ecJ8f1-0FrY-YWYx-hLpe-kHyi-ghGo-qzO0UI ybslv vg00 1 -wi-ao 32.00M -1 -1 253 12 aPHyyp-kWCT-uuKc-qeH3-gFNw-mU29-l0472c fmw1 vgHBPOCSHARED 1 -wi-a- 20.00G -1 -1 253 19 1ychoA-hOjm-EiJ3-0yA1-6FmY-MPlZ-f4q8OB lvaserver vgHBPOCSHARED 1 -wi-ao 20.00G -1 -1 253 23 TOUmoW-xL3U-eozs-eHB8-jWvC-Pxwf-mJASYz lvcluster vgHBPOCSHARED 1 -wi-ao 20.00G -1 -1 253 24 AMDHIB-bXCu-18km-lGoL-Vzke-SIcD-KOEjVf lvinstance vgHBPOCSHARED 1 -wi-ao 20.00G -1 -1 253 22 IyXuAs-qIMS-xs8n-sGdZ-Sv7Z-9lUb-nweVvn lvinv01 vgHBPOCSHARED 1 -wi-a- 1.00G -1 -1 253 20 may1bW-gRcZ-sDnj-mbWH-um0B-hddu-HUY93C lvmanaged vgHBPOCSHARED 1 -wi-ao 20.00G -1 -1 253 21 R1gaUa-FDKx-1DEf-LT9d-vt86-o1Zz-l9LmuU I now have a case raised with RedHat. I'll update if we make any progress. Simon - Simon Hargrave szhargrave at ybs.co.uk Technical Services Team Leader x2831 Yorkshire Building Society 01274 472831 http://wwwtech/sysint/tsgcore.asp ________________________________________________ This email and any attachments are confidential and may contain privileged information. If you are not the person for whom they are intended please return the email and then delete all material from any computer. You must not use the email or attachments for any purpose, nor disclose its contents to anyone other than the intended recipient. 
Any statements made by an individual in this email do not necessarily reflect the views of the Yorkshire Building Society Group. ________________________________________________ Yorkshire Building Society, which is authorised and regulated by the Financial Services Authority, chooses to introduce its customers to Legal & General for the purposes of advising on and arranging life assurance and investment products bearing Legal & General?s name. We are entered in the FSA Register and our FSA registration number is 106085 http://www.fsa.gov.uk/register Head Office: Yorkshire Building Society, Yorkshire House, Yorkshire Drive, Bradford, BD5 8LJ Tel: 0845 1 200 100 Visit Our Website http://www.ybs.co.uk All communications with us may be monitored/recorded to improve the quality of our service and for your protection and security. ________________________________________________________________________ This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. For more information on a proactive anti-virus service working around the clock, around the globe, visit: http://www.star.net.uk ________________________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From fdinitto at redhat.com Tue Mar 1 14:57:23 2011 From: fdinitto at redhat.com (Fabio M. Di Nitto) Date: Tue, 01 Mar 2011 15:57:23 +0100 Subject: [Linux-cluster] resource-agents 3.1.1 stable release Message-ID: <4D6D0953.2010208@redhat.com> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA256 Welcome to the second resource-agents standalone release. this is a bug fix only release. The new source tarball can be downloaded here: https://fedorahosted.org/releases/r/e/resource-agents/resource-agents-3.1.1.tar.xz To report bugs or issues: https://bugzilla.redhat.com/ Would you like to meet the cluster team or members of its community? Join us on IRC (irc.freenode.net #linux-cluster) and share your experience with other sysadministrators or power users. Thanks/congratulations to all people that contributed to achieve this great milestone. Happy clustering, Fabio Under the hood (from 3.1.0): Fabio M. 
Di Nitto (1): fs-lib: fix do_monitor device mapping Lon Hohberger (3): resource-agents: Fix migrateuriopt setting resource-agents: Improve LD_LIBRARY_PATH handling by SAP* resource-agents: Use literal quotes for tr calls Marek 'marx' Grac (3): resource-agents: Add option disable_rdisc to ip.sh resource-agents: Apache resource with spaces in name fails to start resource-agents: Remove netmask from IP address when creating list of them rgmanager/src/resources/SAPDatabase | 19 ++++++++++--------- rgmanager/src/resources/SAPInstance | 5 +++-- rgmanager/src/resources/ip.sh | 21 +++++++++++++++++++-- rgmanager/src/resources/utils/config-utils.sh.in | 10 ++++++---- rgmanager/src/resources/utils/fs-lib.sh | 13 ++++++++++++- rgmanager/src/resources/vm.sh | 5 ++++- 6 files changed, 54 insertions(+), 19 deletions(-) -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (MingW32) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQIcBAEBCAAGBQJNbQlRAAoJEFA6oBJjVJ+OcbEP/389Dhidd3GMfaO8hn+RPEe0 y0W7CpWR73f2hFVmptLDT4e5bKj5+TRPh5rr/V9y4weDXfmv/YGbsCPmyBGZCtiW DLkBuP8xnnb8M2pJtWM0T6SZLgR/iXviYchIj4D8F6zE2OsXQp7YOcfDeN/Xwe9J znilVw9shBVlV2SWA/2avl6MXnmnO1IypUkSZ4VQt7IiJUYP/CRdxiwJbWGRM7Sk rPeQArcdJ8xqKyPtmXslBiFNawFdw2rywGbRCeXo+IaWhw//urYDCuwSq+wwvsFq BMWNRwqGuvUAmKPnustekfGLcVWwK1SaAgzeiQh5PHr5p7bFk+mRBl5JW63yJsjT wQbSOTX4A6c7QmCGSlfuqz9sUbtb83bHPS3G9lvPiPFY8TpBl4XlAaEyEO9ipwJN k8ktQwNWhDsza8lFEQDoD0p/DQRLkEZ8KXscP7qtyPCQ8MYkxaGFPmxWhkmVp0/l liIoYu8W2wdTvOOcu4qdiuxV5Z9uBmMU6CSZmd3rG/Zg8h9oNHZU9FK6xncgHuxx XvH6MhyVZSYh9K6UHZDWIFCjjL5+H9dMVmeFAs9XEu1RNacOixMBgLoTFy91PInu GXDSmu1X4biiI5WbdahPWvgxxEQG2hgHrhQIVyzp+Lw2DRU1/f4vcOLCoklB69D7 NhJFuXZC7GF/uR7Zw+vU =KZCV -----END PGP SIGNATURE----- From fdinitto at redhat.com Wed Mar 2 09:25:06 2011 From: fdinitto at redhat.com (Fabio M. Di Nitto) Date: Wed, 02 Mar 2011 10:25:06 +0100 Subject: [Linux-cluster] fence-agents 3.1.2 stable release Message-ID: <4D6E0CF2.6060402@redhat.com> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA256 Welcome to the fence-agents 3.1.2 release. This release contains a few bug fixes and a new watchdog/fence_scsi integration script (thanks to Ryan O?hara). The new source tarball can be downloaded here: https://fedorahosted.org/releases/f/e/fence-agents/fence-agents-3.1.2.tar.xz To report bugs or issues: https://bugzilla.redhat.com/ Would you like to meet the cluster team or members of its community? Join us on IRC (irc.freenode.net #linux-cluster) and share your experience with other sysadministrators or power users. Thanks/congratulations to all people that contributed to achieve this great milestone. Happy clustering, Fabio Under the hood (from 3.1.1): Fabio M. 
Di Nitto (8): build: cleanup configure.ac fence_eaton_snmp: fix port number handling build: make ready for watchdog integration script build: fence_ipmilan does not need PYTHONPATH to generate manpage build: fix build dependecy build: plug in fence_scsi_check.pl and ship it in sharedir/cluster/ build: update .gitignore Fix .gitignore a bit more Marek 'marx' Grac (3): fence_rsa: Better error handling fence_wti: Unable to parse output when splitted into several screens fence_wti: Unable to parse output when splitted into several screens (2/2) Ryan O'Hara (4): fence_scsi: move key file to /var/run/cluster fence_scsi: write devices to tmp file on unfence fence_scsi: create /var/run/cluster if necessary fence_scsi_check: watchdog script for fence_scsi [fabbione at daikengo fence-agents]$ git diff --stat v3.1.1..v3.1.2 .gitignore | 2 +- configure.ac | 6 +- fence/agents/alom/Makefile.am | 4 +- fence/agents/apc/Makefile.am | 4 +- fence/agents/apc_snmp/Makefile.am | 4 +- fence/agents/baytech/Makefile.am | 4 +- fence/agents/bladecenter/Makefile.am | 4 +- fence/agents/brocade/Makefile.am | 4 +- fence/agents/bullpap/Makefile.am | 4 +- fence/agents/cisco_mds/Makefile.am | 4 +- fence/agents/cisco_ucs/Makefile.am | 4 +- fence/agents/cpint/Makefile.am | 4 +- fence/agents/drac/Makefile.am | 4 +- fence/agents/drac5/Makefile.am | 4 +- fence/agents/eaton_snmp/Makefile.am | 4 +- fence/agents/eaton_snmp/fence_eaton_snmp.py | 5 + fence/agents/egenera/Makefile.am | 4 +- fence/agents/eps/Makefile.am | 4 +- fence/agents/ibmblade/Makefile.am | 4 +- fence/agents/ifmib/Makefile.am | 5 +- fence/agents/ilo/Makefile.am | 4 +- fence/agents/ilo_mp/Makefile.am | 4 +- fence/agents/intelmodular/Makefile.am | 4 +- fence/agents/ipmilan/Makefile.am | 1 - fence/agents/ldom/Makefile.am | 4 +- fence/agents/lib/Makefile.am | 4 +- fence/agents/lpar/Makefile.am | 4 +- fence/agents/mcdata/Makefile.am | 4 +- fence/agents/rhevm/Makefile.am | 4 +- fence/agents/rsa/Makefile.am | 4 +- fence/agents/rsa/fence_rsa.py | 7 +- fence/agents/rsb/Makefile.am | 4 +- fence/agents/sanbox2/Makefile.am | 4 +- fence/agents/scsi/Makefile.am | 9 ++- fence/agents/scsi/fence_scsi.pl | 47 +++++++- fence/agents/scsi/fence_scsi_check.pl | 170 +++++++++++++++++++++++++++ fence/agents/virsh/Makefile.am | 4 +- fence/agents/vixel/Makefile.am | 4 +- fence/agents/vmware/Makefile.am | 4 +- fence/agents/wti/Makefile.am | 4 +- fence/agents/wti/fence_wti.py | 27 ++++- fence/agents/xcat/Makefile.am | 4 +- fence/agents/zvm/Makefile.am | 4 +- make/fencebuild.mk | 2 +- 44 files changed, 365 insertions(+), 48 deletions(-) -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (MingW32) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQIcBAEBCAAGBQJNbgzsAAoJEFA6oBJjVJ+OIMgQAJV4QUJh77yGIWmI+XM9z0KU +BRikjVKeiBGAET/Nm25P46fLt+x4JTh24QaAzD2ZMPvZKDqd9vOtuA2dij/SxIn vMn5K4+A1XOeew9Ji+48dDKW5vk6hyEQckJQ7/uTty2hJxNoWu1R2+VD59Pet61C lN9uXlJMBIVb8ckxSCD7h5i7FfFTNiDX2opYdtL6i0s8ysP5tlKDcph5LENHyxWM /ux8SXEdPCSwCMsdmSPglKLJcRwWLVRaVJgy+K+Mau94S9AjUZBr0ts3+FNKToAU 5dVa833bedgLtHsM9pGifazgvo7qOYMTgpnULyQF7bg2od6jzbs1CD4k66eSN9yg Qo/fAXiPYl/dNFLic7n6aepDEeAgBGdj2Llp9ien/XWLmkA+mxFuIQJqtYtPRFTl fiYrmL0yfq3jbAQaApeZgDtK9aOz7J+us6y++6TrVUQXhagXI9xW3mQskiigXwWN S+oW4ujAYcLTBlvTNvbHKEzFaq2gFke7goOJRahW+DsqOgvC8otL7f68QoCp5GAh EHbt/5FHi16JS69VY0dTU1kdpCEvtSWXDKfsmHY2SzoLsSesy2bIs6BbQFItZTcf krB/1F/oKogRAUPb09Pl2g8+Z9ruGcjyqhU+RCNkHdIhHC/IL6n3mc4Bqb1Q1qtu qOmKOddGpY2hQFcL1cjl =cXkC -----END PGP SIGNATURE----- From fdinitto at redhat.com Wed Mar 2 10:39:27 2011 From: fdinitto at 
redhat.com (Fabio M. Di Nitto) Date: Wed, 02 Mar 2011 11:39:27 +0100 Subject: [Linux-cluster] new resource agents repository Message-ID: <4D6E1E5F.6040507@redhat.com> Hello, There is a new repository for Resource Agents which contains RA sets from both Linux HA and Red Hat projects: git://github.com/ClusterLabs/resource-agents.git The purpose of the common repository is to share maintenance load and try to consolidate resource agents. There were no conflicts with the rgmanager RA set and both source layouts remain the same. It is only that autoconf bits were merged. The only difference is that if you want to get Linux HA set of resource agents installed, configure should be run like this: configure --with-ras-set=rgmanager ... The new repository is git ane the existing history is preserved. The existing repository at git.fedorahosted.org will be retired soon. Many thanks to Dejan for writing the original "new resource agents repository" email to linux-ha-dev for me to copy/paste almost pristine ;) more seriously, thanks to all people for helping in all various aspects of the merge. There are for sure corners that we need to smooth due to the merge. Please report any issue you find and we will try to address it as soon as possible. Cheers, Fabio From mika68vaan at gmail.com Wed Mar 2 15:07:38 2011 From: mika68vaan at gmail.com (Mika i) Date: Wed, 2 Mar 2011 17:07:38 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device Message-ID: Hi Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 I have now in both clusters rhel 5.5 version with: kernel 2.6.18-194.el5 cman-2.0.115-68.el5_6.1 But in fence state i get allways message: Unable to connect/login to fencing device Any help - or must i update the cluster to rhel 5.6? -------------- next part -------------- An HTML attachment was scrubbed... URL: From sklemer at gmail.com Wed Mar 2 15:38:41 2011 From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=) Date: Wed, 2 Mar 2011 17:38:41 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: Hello. I think its not supported yet. You should use ifence_ipmilan. Regards On Wed, Mar 2, 2011 at 5:07 PM, Mika i wrote: > Hi > > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 > I have now in both clusters rhel 5.5 version with: > kernel 2.6.18-194.el5 > cman-2.0.115-68.el5_6.1 > > But in fence state i get allways message: Unable to connect/login to > fencing device > > Any help - or must i update the cluster to rhel 5.6? > > > > > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Jason_Henderson at Mitel.com Wed Mar 2 15:49:32 2011 From: Jason_Henderson at Mitel.com (Jason_Henderson at Mitel.com) Date: Wed, 2 Mar 2011 10:49:32 -0500 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: Message-ID: linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: > Hi > > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 > I have now in both clusters rhel 5.5 version with: > kernel 2.6.18-194.el5 > cman-2.0.115-68.el5_6.1 > > But in fence state i get allways message: Unable to connect/login to > fencing device > > Any help - or must i update the cluster to rhel 5.6? What fence agent are you using, fence_ilo? 
You will need to use the fence_ipmilan agent for iLO3. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Ning.Bao at statcan.gc.ca Wed Mar 2 20:23:51 2011 From: Ning.Bao at statcan.gc.ca (Ning.Bao at statcan.gc.ca) Date: Wed, 2 Mar 2011 15:23:51 -0500 Subject: [Linux-cluster] question about backup/restore of GFS2 Message-ID: Hi Does anyone have experience with using Netbackup to backup/restore GFS2 file system in production environment? I noticed that GFS2 is not on the list of file compatitbitly of netbackup 7. Is there any alternative backup tool for GFS2 in enterprise settings if Netbackup can not do it? Thanks! -Ning -------------- next part -------------- An HTML attachment was scrubbed... URL: From vmutu at pcbi.upenn.edu Wed Mar 2 21:50:50 2011 From: vmutu at pcbi.upenn.edu (Valeriu Mutu) Date: Wed, 2 Mar 2011 16:50:50 -0500 Subject: [Linux-cluster] clvmd hangs on startup Message-ID: <20110302215050.GD10674@bsdera.pcbi.upenn.edu> Hi, I have a 2-node cluster setup and trying to get GFS2 working on top of an iSCSI volume. Each node is a Xen virtual machine. I am currently unable to get clvmd working on the 2nd node. It starts fine on the 1st node: [root at vm1 ~]# service clvmd start Starting clvmd: [ OK ] Activating VGs: Logging initialised at Wed Mar 2 15:25:07 2011 Set umask to 0077 Finding all volume groups Finding volume group "PcbiHomesVG" Activated 1 logical volumes in volume group PcbiHomesVG 1 logical volume(s) in volume group "PcbiHomesVG" now active Finding volume group "VolGroup00" 2 logical volume(s) in volume group "VolGroup00" already active 2 existing logical volume(s) in volume group "VolGroup00" monitored Activated 2 logical volumes in volume group VolGroup00 2 logical volume(s) in volume group "VolGroup00" now active Wiping internal VG cache [root at vm1 ~]# vgs Logging initialised at Wed Mar 2 15:25:12 2011 Set umask to 0077 Finding all volume groups Finding volume group "PcbiHomesVG" Finding volume group "VolGroup00" VG #PV #LV #SN Attr VSize VFree PcbiHomesVG 1 1 0 wz--nc 1.17T 0 VolGroup00 1 2 0 wz--n- 4.66G 0 Wiping internal VG cache But when I try to start clvmd on the 2nd node, it hangs: [root at vm2 ~]# service clvmd start Starting clvmd: [ OK ] ...hangs... I see the following in vm2:/var/log/messages: Mar 2 15:59:02 vm2 clvmd[2283]: Cluster LVM daemon started - connected to CMAN Mar 2 16:01:36 vm2 kernel: INFO: task clvmd:2302 blocked for more than 120 seconds. Mar 2 16:01:36 vm2 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Mar 2 16:01:36 vm2 kernel: clvmd D 0022a86125f49a6a 0 2302 1 2299 (NOTLB) Mar 2 16:01:36 vm2 kernel: ffff880030cb7db8 0000000000000282 0000000000000000 0000000000000000 Mar 2 16:01:36 vm2 kernel: 0000000000000008 ffff880033e327e0 ffff880000033080 000000000001c2b2 Mar 2 16:01:36 vm2 kernel: ffff880033e329c8 ffffffff8029c48f Mar 2 16:01:36 vm2 kernel: Call Trace: Mar 2 16:01:36 vm2 kernel: [] autoremove_wake_function+0x0/0x2e Mar 2 16:01:36 vm2 kernel: [] __down_read+0x82/0x9a Mar 2 16:01:36 vm2 kernel: [] :dlm:dlm_user_request+0x2d/0x174 Mar 2 16:01:36 vm2 kernel: [] mntput_no_expire+0x19/0x89 Mar 2 16:01:36 vm2 kernel: [] sys_sendto+0x14a/0x164 Mar 2 16:01:36 vm2 kernel: [] :dlm:device_write+0x2f5/0x5e5 Mar 2 16:01:36 vm2 kernel: [] vfs_write+0xce/0x174 Mar 2 16:01:36 vm2 kernel: [] sys_write+0x45/0x6e Mar 2 16:01:36 vm2 kernel: [] tracesys+0xab/0xb6 [...] I also noticed that there's a waiting "vgscan" process that "clvmd" is waiting on: 1 1655 1655 1655 ? 
-1 Ss 0 0:00 /usr/sbin/sshd 1655 1801 1801 1801 ? -1 Ss 0 0:00 \_ sshd: root at pts/0 1801 1803 1803 1803 pts/0 2187 Ss 0 0:00 | \_ -bash 1803 2187 2187 1803 pts/0 2187 S+ 0 0:00 | \_ /bin/sh /sbin/service clvmd start 2187 2192 2187 1803 pts/0 2187 S+ 0 0:00 | \_ /bin/bash /etc/init.d/clvmd start 2192 2215 2187 1803 pts/0 2187 S+ 0 0:00 | \_ /usr/sbin/vgscan Before starting clvmd, cman is started and both nodes are cluster members: [root at vm1 ~]# cman_tool nodes Node Sts Inc Joined Name 1 M 544456 2011-03-02 15:24:31 172.16.50.32 2 M 544468 2011-03-02 15:52:29 172.16.50.33 Note that I'm using manual fencing in this configuration. Both nodes are running CentOS 5.5: # uname -a Linux vm2.pcbi.upenn.edu 2.6.18-194.32.1.el5xen #1 SMP Wed Jan 5 18:44:24 EST 2011 x86_64 x86_64 x86_64 GNU/Linux These package versions were installed on each node: cman-2.0.115-34.el5_5.4 cman-devel-2.0.115-34.el5_5.4 gfs2-utils-0.1.62-20.el5 lvm2-2.02.56-8.el5_5.6 lvm2-cluster-2.02.56-7.el5_5.4 rgmanager-2.0.52-6.el5.centos.8 system-config-cluster-1.0.57-3.el5_5.1 iptables is turned off on each node. Does anyone know why clvmd hangs on the 2nd node? Best, -- Valeriu Mutu From jeff.sturm at eprize.com Wed Mar 2 22:36:45 2011 From: jeff.sturm at eprize.com (Jeff Sturm) Date: Wed, 2 Mar 2011 17:36:45 -0500 Subject: [Linux-cluster] clvmd hangs on startup In-Reply-To: <20110302215050.GD10674@bsdera.pcbi.upenn.edu> References: <20110302215050.GD10674@bsdera.pcbi.upenn.edu> Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C290@hugo.eprize.local> > -----Original Message----- > From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] > On Behalf Of Valeriu Mutu > Sent: Wednesday, March 02, 2011 4:51 PM > > Does anyone know why clvmd hangs on the 2nd node? Double-check that the 2nd node can read and write the shared iSCSI storage. -Jeff From swap_project at yahoo.com Wed Mar 2 23:16:43 2011 From: swap_project at yahoo.com (Srija) Date: Wed, 2 Mar 2011 15:16:43 -0800 (PST) Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: Message-ID: <852499.20065.qm@web112812.mail.gq1.yahoo.com> Hi all, Here is the issue with the cluster describing below: The cluster is built with 16 nodes. All rhel5.5 86_64 bit OS. yesterday night two servers were rebooted and after that these two servers are not joining to the cluster. I was not the part of the team when it is built. and my knowledge regarding cluster is also little bit. Here is the scenario: - There is no quorum disks. But the person who has built the cluster he is telling he has executed the quorum from command line, [ i am not sure of that ] - The errors in the message log are showing as ccsd[24182]: Unable to connect to cluster infrastructure after 12060 seconds , it is a continuous error message in the log file The cluster.conf are as follows: ................... [ all the other nodes ]................... .............................[ all the fence devices for other nodes ]................ It seems it is a very basic configuration. But at this stage more important is, to attach the two servers in the cluster environment. If more information is needed , i will provide. Any advice is appreciated. Thanks in advance From lmb at novell.com Thu Mar 3 09:37:25 2011 From: lmb at novell.com (Lars Marowsky-Bree) Date: Thu, 3 Mar 2011 10:37:25 +0100 Subject: [Linux-cluster] Announcement: Linux Foundation HA working group mailing lists Message-ID: <20110303093725.GA32146@suse.de> Hi everyone, please excuse the long Cc list. 
Behind the scenes, some of the projects that make up the cluster stack on Linux have been working together to converge and integrate the various projects. We have been meeting on and off for the last decade, and made some amazing progress over the years. However, we believe we could make even better progress if we had a common umbrella that did not try to take away any independence from the projects, but acted as a vendor-neutral forum for coordination. Many projects have chosen to create their own foundations these days, but we did not want this overhead. The Linux Foundation is a well established organization, and its board has graciously agreed to host the working group for us, and also offered further support. One of the first steps here is the creation of mailing lists. https://lists.linux-foundation.org/mailman/listinfo/ha-wg https://lists.linux-foundation.org/mailman/listinfo/ha-wg-technical These mailing lists are not intended to supersede any of the existing project mailing lists, but act as a place for the coordination of cross-project issues - such as distribution adoption, convergence of components and projects (like the on-going resource agent merge between RHCS & Linux-HA), discussion of summits, and so on. You are all invited to join these mailing lists, but the focus is on project maintainers, contributors, distribution packagers. Our immediate roadmap (of the non-technical kind) is to prepare a summary statement, a brief charta, agree on what we consider to be part of the "core" stack, and explore the options that the LF can offer to us (we have already discussed some of this in a smaller group, but it would be too long for this announcement), and announce this working group to a larger audience at the Collab Summit in April. Also, we plan to hold this year's face to face meeting along the Linux Foundation conferences in October in Prague, CZ. I look forward to the dialogue! Regards, Lars -- Architect Storage/HA, OPS Engineering, Novell, Inc. SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG N?rnberg) "Experience is the name everyone gives to their mistakes." -- Oscar Wilde From swhiteho at redhat.com Thu Mar 3 10:21:44 2011 From: swhiteho at redhat.com (Steven Whitehouse) Date: Thu, 03 Mar 2011 10:21:44 +0000 Subject: [Linux-cluster] question about backup/restore of GFS2 In-Reply-To: References: Message-ID: <1299147704.2572.37.camel@dolmen> Hi, On Wed, 2011-03-02 at 15:23 -0500, Ning.Bao at statcan.gc.ca wrote: > Hi > > Does anyone have experience with using Netbackup to backup/restore > GFS2 file system in production environment? I noticed that GFS2 is > not on the list of file compatitbitly of netbackup 7. Is there any > alternative backup tool for GFS2 in enterprise settings if Netbackup > can not do it? > > Thanks! > > -Ning > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster The thing to watch out for with backup is how it affects the normal working set of the nodes in the cluster. Often you'll get much better performance with backup if you write a custom set of scripts which respects the working set on each node and which also allows the backup to proceed in parallel across the filesystem. What makes a good backup solution depends on the application and the I/O pattern, so it is tricky to make any generic suggestions, Steve. 
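A minimal sketch of the kind of split Steve describes, assuming one GFS2 filesystem mounted at /mnt/gfs2 on every node (the backup paths and directory names are made up for illustration): give each node a disjoint set of top-level directories so it mostly touches glocks it already holds, and run the copies in parallel.

  # on node1: back up one half of the tree
  tar -czf /backup/node1-$(date +%F).tar.gz -C /mnt/gfs2 homes_a homes_b
  # on node2, at the same time: back up the other half
  tar -czf /backup/node2-$(date +%F).tar.gz -C /mnt/gfs2 homes_c homes_d

Whether tar, rsync or a commercial agent does the copying matters less than keeping the per-node directory sets disjoint.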
From mailing.sr at gmail.com Thu Mar 3 10:20:37 2011 From: mailing.sr at gmail.com (Seb) Date: Thu, 3 Mar 2011 11:20:37 +0100 Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: <852499.20065.qm@web112812.mail.gq1.yahoo.com> References: <852499.20065.qm@web112812.mail.gq1.yahoo.com> Message-ID: 2011/3/3 Srija > Hi all, > > Here is the issue with the cluster describing below: > > The cluster is built with 16 nodes. All rhel5.5 86_64 bit OS. > yesterday night two servers were rebooted and after that these > two servers are not joining to the cluster. > > I was not the part of the team when it is built. and my knowledge > regarding cluster is also little bit. > > Here is the scenario: > > - There is no quorum disks. But the person > who has built the cluster he is telling he has executed the quorum > from command line, [ i am not sure of that ] > > - The errors in the message log are showing as > > ccsd[24182]: Unable to connect to cluster infrastructure after 12060 > seconds , it is a continuous error message in the log file > > The cluster.conf are as follows: > [snip]config[/snip] There is no section in your config file? Have you been able to identify a quorum disk on the nodes? The host-priv.domain.org is in your /etc/hosts? on all nodes? Why have they been rebooted? for maintenance/upgrade? Any iptable used? Could you please provide the logs showing the start of the cluster service? > It seems it is a very basic configuration. But at this stage more important > is, to attach the two servers in the cluster environment. > > If more information is needed , i will provide. > > Any advice is appreciated. > > Thanks in advance > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mika68vaan at gmail.com Thu Mar 3 10:21:33 2011 From: mika68vaan at gmail.com (Mika i) Date: Thu, 3 Mar 2011 12:21:33 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: hmm. Okey: have someone good installation instructions to get this fence_ipmilan to work. 1. active IPMI/DCMI over LAN in iLo3 2. what should i install in server to get fence_ipmilan to work Now if i test connection it shows like this. ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info Password: Get Auth Capabilities error Get Auth Capabilities error Error issuing Get Channel Authentication Capabilies request Error: Unable to establish IPMI v2 / RMCP+ session Get Device ID command failed Can someone help me! 2011/3/2 > > > linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: > > > Hi > > > > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 > > I have now in both clusters rhel 5.5 version with: > > kernel 2.6.18-194.el5 > > cman-2.0.115-68.el5_6.1 > > > > But in fence state i get allways message: Unable to connect/login to > > fencing device > > > > Any help - or must i update the cluster to rhel 5.6? > > What fence agent are you using, fence_ilo? > You will need to use the fence_ipmilan agent for iLO3. > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sklemer at gmail.com Thu Mar 3 13:00:43 2011 From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=) Date: Thu, 3 Mar 2011 15:00:43 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: Hi. for ilo3 testing you can use : # fence_ipmilan -a 17x.3x.7x.1xx -p "password" -o status fence_ipmilan -h usage: fence_ipmilan -A IPMI Lan Auth type (md5, password, or none) -a IPMI Lan IP to talk to -i IPMI Lan IP to talk to (deprecated, use -a) -p Password (if required) to control power on IPMI device -P Use Lanplus -S Script to retrieve password (if required) -l Username/Login (if required) to control power on IPMI device -o Operation to perform. Valid operations: on, off, reboot, status -t Timeout (sec) for IPMI operation (default 20) -C Ciphersuite to use (same as ipmitool -C parameter) -M Method to fence (onoff or cycle (default onoff) -V Print version and exit -v Verbose mode If no options are specified, the following options will be read from standard input (one per line): auth= Same as -A ipaddr=<#> Same as -a passwd= Same as -p passwd_script= Same as -S lanplus Same as -P login= Same as -u option= Same as -o operation= Same as -o action= Same as -o timeout= Same as -t cipher= Same as -C method= Same as -M verbose Same as -v On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: > hmm. > Okey: have someone good installation instructions to get this fence_ipmilan to > work. > 1. active IPMI/DCMI over LAN in iLo3 > 2. what should i install in server to get fence_ipmilan to work > Now if i test connection it shows like this. > > ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info > Password: > Get Auth Capabilities error > Get Auth Capabilities error > Error issuing Get Channel Authentication Capabilies request > Error: Unable to establish IPMI v2 / RMCP+ session > Get Device ID command failed > > Can someone help me! > 2011/3/2 > >> >> >> linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: >> >> > Hi >> > >> > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 >> > I have now in both clusters rhel 5.5 version with: >> > kernel 2.6.18-194.el5 >> > cman-2.0.115-68.el5_6.1 >> > >> > But in fence state i get allways message: Unable to connect/login to >> > fencing device >> > >> > Any help - or must i update the cluster to rhel 5.6? >> >> What fence agent are you using, fence_ilo? >> You will need to use the fence_ipmilan agent for iLO3. >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mika68vaan at gmail.com Thu Mar 3 13:53:32 2011 From: mika68vaan at gmail.com (Mika i) Date: Thu, 3 Mar 2011 15:53:32 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: this works: ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle Server is rebooted..... but not this: root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v Rebooting machine @ IPMI:xxixxxxxilo...Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxx41!' -v chassis power status'... Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxxx1!' -v chassis power cycle'... Failed cluster.conf .... 
...... 2011/3/3 ???? ???? > Hi. > for ilo3 testing you can use : > # fence_ipmilan -a 17x.3x.7x.1xx -p "password" -o status > > fence_ipmilan -h > usage: fence_ipmilan > -A IPMI Lan Auth type (md5, password, or none) > -a IPMI Lan IP to talk to > -i IPMI Lan IP to talk to (deprecated, use -a) > -p Password (if required) to control power on > IPMI device > -P Use Lanplus > -S Script to retrieve password (if required) > -l Username/Login (if required) to control power > on IPMI device > -o Operation to perform. > Valid operations: on, off, reboot, status > -t Timeout (sec) for IPMI operation (default 20) > -C Ciphersuite to use (same as ipmitool -C parameter) > -M Method to fence (onoff or cycle (default onoff) > -V Print version and exit > -v Verbose mode > > If no options are specified, the following options will be read > from standard input (one per line): > > auth= Same as -A > ipaddr=<#> Same as -a > passwd= Same as -p > passwd_script= Same as -S > lanplus Same as -P > login= Same as -u > option= Same as -o > operation= Same as -o > action= Same as -o > timeout= Same as -t > cipher= Same as -C > method= Same as -M > verbose Same as -v > > > On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: > >> hmm. >> Okey: have someone good installation instructions to get this fence_ipmilan to >> work. >> 1. active IPMI/DCMI over LAN in iLo3 >> 2. what should i install in server to get fence_ipmilan to work >> Now if i test connection it shows like this. >> >> ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info >> Password: >> Get Auth Capabilities error >> Get Auth Capabilities error >> Error issuing Get Channel Authentication Capabilies request >> Error: Unable to establish IPMI v2 / RMCP+ session >> Get Device ID command failed >> >> Can someone help me! >> 2011/3/2 >> >>> >>> >>> linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: >>> >>> > Hi >>> > >>> > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and >>> iLo3 >>> > I have now in both clusters rhel 5.5 version with: >>> > kernel 2.6.18-194.el5 >>> > cman-2.0.115-68.el5_6.1 >>> > >>> > But in fence state i get allways message: Unable to connect/login to >>> > fencing device >>> > >>> > Any help - or must i update the cluster to rhel 5.6? >>> >>> What fence agent are you using, fence_ilo? >>> You will need to use the fence_ipmilan agent for iLO3. >>> -- >>> Linux-cluster mailing list >>> Linux-cluster at redhat.com >>> https://www.redhat.com/mailman/listinfo/linux-cluster >>> >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From swap_project at yahoo.com Thu Mar 3 15:37:12 2011 From: swap_project at yahoo.com (Srija) Date: Thu, 3 Mar 2011 07:37:12 -0800 (PST) Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: Message-ID: <352869.58978.qm@web112815.mail.gq1.yahoo.com> Thanks for your reply. --- On Thu, 3/3/11, Seb wrote: > > There is no section in your config > file? No > Have you been able to identify a quorum disk on the > nodes? There is no quorum disk allocated for this configuration. As mentioned, only I know, quotum was alocated through command line etc. > > The host-priv.domain.org > is in your /etc/hosts? on all nodes? > Yes. > Why have they been rebooted? for > maintenance/upgrade? 
> For maintenance. But before the reboot, the cluster service on that node was not shutdown. > Any iptable used? > No. > Could you please provide the logs showing the start > of the cluster service? > I am mentioning here one of the server's log , when ccs started. _______________________________________________________________________________________________________ Mar 1 20:20:39 host ccsd[5287]: Starting ccsd 2.0.115: Mar 1 20:20:39 host ccsd[5287]: Built: May 25 2010 04:32:00 Mar 1 20:20:39 host ccsd[5287]: Copyright (C) Red Hat, Inc. 2004 All rights reserved. Mar 1 20:20:39 host ccsd[5287]: cluster.conf (cluster name = xxxxxxx, version = 21) found. Mar 1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service RELEASE 'subrev 1887 version 0.80.6' Mar 1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors. Mar 1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2006 Red Hat, Inc. Mar 1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service: started and ready to provide service. Mar 1 20:20:40 host openais[5302]: [MAIN ] Using default multicast address of xxx.xxx.xxx.xx Mar 1 20:20:40 host openais[5302]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms) Mar 1 20:20:40 host openais[5302]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans) Mar 1 20:20:40 host openais[5302]: [TOTEM] join (60 ms) send_join (0 ms) consensus (20000 ms) merge (200 ms) Mar 1 20:20:40 host openais[5302]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs) Mar 1 20:20:40 host openais[5302]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1402 Mar 1 20:20:40 host openais[5302]: [TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages) Mar 1 20:20:40 host openais[5302]: [TOTEM] send threads (0 threads) Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP token expired timeout (495 ms) Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP token problem counter (2000 ms) Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP threshold (10 problem count) Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP mode set to none. Mar 1 20:20:40 host openais[5302]: [TOTEM] heartbeat_failures_allowed (0) Mar 1 20:20:40 host openais[5302]: [TOTEM] max_network_delay (50 ms) Mar 1 20:20:40 host openais[5302]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0 Mar 1 20:20:40 host openais[5302]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes). Mar 1 20:20:40 host openais[5302]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes). Mar 1 20:20:40 host openais[5302]: [TOTEM] The network interface [192.168.xxx.x] is now up. Mar 1 20:20:40 host openais[5302]: [TOTEM] Created or loaded sequence id 6160.192.168.xxx.x for this ring. Mar 1 20:20:40 host openais[5302]: [TOTEM] entering GATHER state from 15. 
Mar 1 20:20:40 host openais[5302]: [CMAN ] CMAN 2.0.115 (built May 25 2010 04:32:02) started Mar 1 20:20:40 host openais[5302]: [MAIN ] Service initialized 'openais CMAN membership service 2.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais extended virtual synchrony service' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster membership service B.01.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais availability management framework B.01.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais checkpoint service B.01.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais event service B.01.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais distributed locking service B.01.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais message service B.01.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais configuration service' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster closed process group service v1.01' Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster config database access v1.01' Mar 1 20:20:40 host openais[5302]: [SYNC ] Not using a virtual synchrony filter. Mar 1 20:20:40 host openais[5302]: [TOTEM] Creating commit token because I am the rep. Mar 1 20:20:40 host openais[5302]: [TOTEM] Saving state aru 0 high seq received 0 Mar 1 20:20:40 host openais[5302]: [TOTEM] Storing new sequence id for ring 1814 Mar 1 20:20:40 host openais[5302]: [TOTEM] entering COMMIT state. Mar 1 20:20:40 host openais[5302]: [TOTEM] entering RECOVERY state. Mar 1 20:20:40 host openais[5302]: [TOTEM] position [0] member 192.168.xxx.x: Mar 1 20:20:40 host openais[5302]: [TOTEM] previous ring seq 6160 rep 192.168.xxx.x Mar 1 20:20:40 host openais[5302]: [TOTEM] aru 0 high delivered 0 received flag 1 Mar 1 20:20:40 host openais[5302]: [TOTEM] Did not need to originate any messages in recovery. Mar 1 20:20:40 host openais[5302]: [TOTEM] Sending initial ORF token Mar 1 20:20:40 host openais[5302]: [CLM ] CLM CONFIGURATION CHANGE Mar 1 20:20:40 host openais[5302]: [CLM ] New Configuration: Mar 1 20:20:40 host openais[5302]: [CLM ] Members Left: Mar 1 20:20:40 host openais[5302]: [CLM ] Members Joined: Mar 1 20:20:40 host openais[5302]: [CLM ] CLM CONFIGURATION CHANGE Mar 1 20:20:40 host openais[5302]: [CLM ] New Configuration: Mar 1 20:20:40 host openais[5302]: [CLM ] r(0) ip(192.168.xxx.x) Mar 1 20:20:40 host openais[5302]: [CLM ] Members Left: Mar 1 20:20:40 host openais[5302]: [CLM ] Members Joined: Mar 1 20:20:40 host openais[5302]: [CLM ] r(0) ip(192.168.xxx.x) Mar 1 20:20:40 host openais[5302]: [SYNC ] This node is within the primary component and will provide service. Mar 1 20:20:40 host openais[5302]: [TOTEM] entering OPERATIONAL state. Mar 1 20:20:40 host openais[5302]: [CLM ] got nodejoin message 192.168.xxx.x Mar 1 20:20:41 host ccsd[5287]: Initial status:: Inquorate Mar 1 20:20:41 host ccsd[5287]: Cluster is not quorate. Refusing connection. Mar 1 20:20:41 host ccsd[5287]: Error while processing connect: Connection refused Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection. Mar 1 20:20:42 host ccsd[5287]: Error while processing connect: Connection refused Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection. 
Mar 1 20:20:42 host ccsd[5287]: Error while processing connect: Connection refused _______________________________________________________________________________________________________ Thanks again From pradhanparas at gmail.com Thu Mar 3 16:15:43 2011 From: pradhanparas at gmail.com (Paras pradhan) Date: Thu, 3 Mar 2011 10:15:43 -0600 Subject: [Linux-cluster] question about backup/restore of GFS2 In-Reply-To: <1299147704.2572.37.camel@dolmen> References: <1299147704.2572.37.camel@dolmen> Message-ID: We had a whole cluster lockdown when we forgot to exclude one of the GFS partitions in netbackup when it was trying to lock the fs when it the backup started. Then I had to restart one of the nodes. I am still not sure why the lock was not released. Paras. On Thu, Mar 3, 2011 at 4:21 AM, Steven Whitehouse wrote: > Hi, > > On Wed, 2011-03-02 at 15:23 -0500, Ning.Bao at statcan.gc.ca wrote: >> Hi >> >> Does anyone have experience with using Netbackup to backup/restore >> GFS2 file system in production environment? ?I noticed that GFS2 is >> not on the list of file compatitbitly of netbackup 7. Is there any >> alternative backup tool for GFS2 in enterprise settings if Netbackup >> can not do it? >> >> Thanks! >> >> -Ning >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster > > The thing to watch out for with backup is how it affects the normal > working set of the nodes in the cluster. Often you'll get much better > performance with backup if you write a custom set of scripts which > respects the working set on each node and which also allows the backup > to proceed in parallel across the filesystem. > > What makes a good backup solution depends on the application and the I/O > pattern, so it is tricky to make any generic suggestions, > > Steve. > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From gianluca.cecchi at gmail.com Thu Mar 3 16:16:06 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Thu, 3 Mar 2011 17:16:06 +0100 Subject: [Linux-cluster] Info on vm definitions and options in stable3 Message-ID: Hello, in stable 3 I can have this kind of config for a KVM virtual machine to manage live migration: It works ok, but I would like to know the possible parameters I can set. At http://sources.redhat.com/cluster/wiki/VirtualMachineBehaviors I can see this piece "..Most of the behaviors are common with normal services.." with a reference to start, stop, status monitoring, relocation, recovery Where could I find a complete list? For example are failover domains usable inside the line? Or autostart option? I know how to manage autostart in a standalone virt-manager environment, but when in a cluster of hosts? Or dependency lines such as < vm name="vm1" ... > to power on vm2 only after power on of vm1? About "transient domain support": In the stable3 implementation of rhel6 (or in general in stable 3 if it applies generally) a line such as this: where /etc/libvirt/qemu/myvm.xm is not on a shared path, is it supposed that if I have myvm on node 1 and run clusvcadm -M vm:myvm -m node2 the file is deleted from node 1 and created in node 2 automatically or not? 
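One hedged way to get at the "complete list" part of the question, assuming the stable3 rgmanager agents sit in the usual /usr/share/cluster location: each agent, including vm.sh, prints the parameters it accepts as metadata, and rg_test can be pointed at a cluster.conf that uses them (the exact attribute names are whatever the metadata on your build reports):

  # list the attributes the vm resource agent itself advertises
  /usr/share/cluster/vm.sh meta-data | less
  # sanity-check the cluster.conf containing the vm definition
  rg_test test /etc/cluster/cluster.conf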
Thanks in advance, Gianluca From vmutu at pcbi.upenn.edu Thu Mar 3 16:50:57 2011 From: vmutu at pcbi.upenn.edu (Valeriu Mutu) Date: Thu, 3 Mar 2011 11:50:57 -0500 Subject: [Linux-cluster] clvmd hangs on startup In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C290@hugo.eprize.local> References: <20110302215050.GD10674@bsdera.pcbi.upenn.edu> <64D0546C5EBBD147B75DE133D798665F0855C290@hugo.eprize.local> Message-ID: <20110303165056.GF10674@bsdera.pcbi.upenn.edu> On Wed, Mar 02, 2011 at 05:36:45PM -0500, Jeff Sturm wrote: > Double-check that the 2nd node can read and write the shared iSCSI > storage. Reading/writing from/to the iSCSI storage device works as seen below. On the 1st node: [root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 3.39855 seconds, 3.0 MB/s [root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 0.331069 seconds, 30.9 MB/s On the 2nd node: [root at vm2 ~]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 3.2465 seconds, 3.2 MB/s [root at vm2 ~]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 0.223337 seconds, 45.8 MB/s -- Valeriu Mutu From sklemer at gmail.com Thu Mar 3 17:25:12 2011 From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=) Date: Thu, 3 Mar 2011 19:25:12 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: H. Maybe iLo3 dont support cycle. why not to use the default , which is "onoff" . Try it I think its good enough . -M method Method to fence (onoff or cycle). Default is onoff. Use cycle in case your management card will power off with default method so there will be no chance to power machine on by IPMI. On Thu, Mar 3, 2011 at 3:53 PM, Mika i wrote: > this works: > ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle > Server is rebooted..... > > but not this: > root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v > Rebooting machine @ IPMI:xxixxxxxilo...Spawning: '/usr/bin/ipmitool -I lan > -H 'xxxxxxx' -U 'admin' -P 'xxxxx41!' -v chassis power status'... > Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxxx1!' > -v chassis power cycle'... > Failed > > cluster.conf > .... > > name="xxxxxx_xxxxdev" timeout="20"/> > ...... > > login="admin" method="cycle" name="xxxxuxx32_fencedev" passwd="xxxxx!"/> > > > > > > > 2011/3/3 ???? ???? > > Hi. >> for ilo3 testing you can use : >> # fence_ipmilan -a 17x.3x.7x.1xx -p "password" -o status >> >> fence_ipmilan -h >> usage: fence_ipmilan >> -A IPMI Lan Auth type (md5, password, or none) >> -a IPMI Lan IP to talk to >> -i IPMI Lan IP to talk to (deprecated, use -a) >> -p Password (if required) to control power on >> IPMI device >> -P Use Lanplus >> -S Script to retrieve password (if required) >> -l Username/Login (if required) to control power >> on IPMI device >> -o Operation to perform. 
>> Valid operations: on, off, reboot, status >> -t Timeout (sec) for IPMI operation (default 20) >> -C Ciphersuite to use (same as ipmitool -C parameter) >> -M Method to fence (onoff or cycle (default onoff) >> -V Print version and exit >> -v Verbose mode >> >> If no options are specified, the following options will be read >> from standard input (one per line): >> >> auth= Same as -A >> ipaddr=<#> Same as -a >> passwd= Same as -p >> passwd_script= Same as -S >> lanplus Same as -P >> login= Same as -u >> option= Same as -o >> operation= Same as -o >> action= Same as -o >> timeout= Same as -t >> cipher= Same as -C >> method= Same as -M >> verbose Same as -v >> >> >> On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: >> >>> hmm. >>> Okey: have someone good installation instructions to get this fence_ipmilan to >>> work. >>> 1. active IPMI/DCMI over LAN in iLo3 >>> 2. what should i install in server to get fence_ipmilan to work >>> Now if i test connection it shows like this. >>> >>> ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info >>> Password: >>> Get Auth Capabilities error >>> Get Auth Capabilities error >>> Error issuing Get Channel Authentication Capabilies request >>> Error: Unable to establish IPMI v2 / RMCP+ session >>> Get Device ID command failed >>> >>> Can someone help me! >>> 2011/3/2 >>> >>>> >>>> >>>> linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: >>>> >>>> > Hi >>>> > >>>> > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and >>>> iLo3 >>>> > I have now in both clusters rhel 5.5 version with: >>>> > kernel 2.6.18-194.el5 >>>> > cman-2.0.115-68.el5_6.1 >>>> > >>>> > But in fence state i get allways message: Unable to connect/login to >>>> > fencing device >>>> > >>>> > Any help - or must i update the cluster to rhel 5.6? >>>> >>>> What fence agent are you using, fence_ilo? >>>> You will need to use the fence_ipmilan agent for iLO3. >>>> -- >>>> Linux-cluster mailing list >>>> Linux-cluster at redhat.com >>>> https://www.redhat.com/mailman/listinfo/linux-cluster >>>> >>> >>> >>> -- >>> Linux-cluster mailing list >>> Linux-cluster at redhat.com >>> https://www.redhat.com/mailman/listinfo/linux-cluster >>> >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From raju.rajsand at gmail.com Thu Mar 3 17:56:27 2011 From: raju.rajsand at gmail.com (Rajagopal Swaminathan) Date: Thu, 3 Mar 2011 23:26:27 +0530 Subject: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent In-Reply-To: References: <20110228161406.GA14120@redhat.com> Message-ID: Greetings, On 3/1/11, Parvez Shaikh wrote: > Hi Ryan, > > > What is recommended method to deal with blade center fencing failure in > this situation? Do I have to add another level of fencing(between blade > center and manual) which can fence automatically(not requiring manual > interference)? > IIRC, I had touched upon similar fencing post some time back. AFAIK, Manual fencing is not supported by Redhat. Having said that, manual fencing has no place in production. At best it is ok for PHP's salestalk POC. The short answer: Two levels of fencing is NOT possible in blades within the same enclosure. 
Two levels of fencing is possible in two blades housed in two different enclosures provided it does not bother other servers when you yank the power chords from the enclosure. Let me explain with another example. The solution you are trying to achieve, is possible in 2 individual physical servers in rack. One fencing level would be the management port which I would call (for the sake of this post) as "in-band" fencing device. Second would be Power fencing using power strips similar to: http://www.apc.com/products/family/index.cfm?id=70 (Disclaimer: I get zilch from any vendor for that matter so you can pick the most buttery vendor you are comfortable with) Now this I call Power fencing or "out-of-band" fencing. Now You have the the members of clluster across racks/continents (with a unbreakable redundant datalinks and power control links) and we can talk about two layer fencing. First layer or level would be the in-band fencing using IPMI/Bladecenter management port/DRAC/ILO/ALOM/RSA etc. Second would be from the power control network which would yank the power chord, as it were, off the server. So we are possibly talking about three vlans here: power control vlan, in-band vlan and data vlan. Hope it is clear now to you now. And Parvez, Yes, I do happen to know couple of HA fundas -- I have deployed and managed few RHCS clusters in the past. But then enclosure are SPOF if the member nodes are in the same enclosure anyway. phew!! Regards, Rajagopal From mika68vaan at gmail.com Thu Mar 3 19:44:18 2011 From: mika68vaan at gmail.com (Mika i) Date: Thu, 3 Mar 2011 21:44:18 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: When i added -P like down here: fence_ipmilan -P -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v Everything works, server reboots. But how do i get this "-P" option included in fence_ipmilan. How should my cluster.conf look like..that's the question... 2011/3/3 ???? ???? > H. > > Maybe iLo3 dont support cycle. why not to use the default , which is > "onoff" . Try it > > I think its good enough . > > -M method > Method to fence (onoff or cycle). Default is onoff. Use cycle > in > case your management card will power off with default method > so > there will be no chance to power machine on by IPMI. > > > On Thu, Mar 3, 2011 at 3:53 PM, Mika i wrote: > >> this works: >> ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle >> Server is rebooted..... >> >> but not this: >> root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v >> Rebooting machine @ IPMI:xxixxxxxilo...Spawning: '/usr/bin/ipmitool -I lan >> -H 'xxxxxxx' -U 'admin' -P 'xxxxx41!' -v chassis power status'... >> Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxxx1!' >> -v chassis power cycle'... >> Failed >> >> cluster.conf >> .... >> >> > name="xxxxxx_xxxxdev" timeout="20"/> >> ...... >> >> > login="admin" method="cycle" name="xxxxuxx32_fencedev" passwd="xxxxx!"/> >> >> >> >> >> >> >> 2011/3/3 ???? ???? >> >> Hi. 
>>> for ilo3 testing you can use : >>> # fence_ipmilan -a 17x.3x.7x.1xx -p "password" -o status >>> >>> fence_ipmilan -h >>> usage: fence_ipmilan >>> -A IPMI Lan Auth type (md5, password, or none) >>> -a IPMI Lan IP to talk to >>> -i IPMI Lan IP to talk to (deprecated, use -a) >>> -p Password (if required) to control power on >>> IPMI device >>> -P Use Lanplus >>> -S Script to retrieve password (if required) >>> -l Username/Login (if required) to control power >>> on IPMI device >>> -o Operation to perform. >>> Valid operations: on, off, reboot, status >>> -t Timeout (sec) for IPMI operation (default 20) >>> -C Ciphersuite to use (same as ipmitool -C parameter) >>> -M Method to fence (onoff or cycle (default onoff) >>> -V Print version and exit >>> -v Verbose mode >>> >>> If no options are specified, the following options will be read >>> from standard input (one per line): >>> >>> auth= Same as -A >>> ipaddr=<#> Same as -a >>> passwd= Same as -p >>> passwd_script= Same as -S >>> lanplus Same as -P >>> login= Same as -u >>> option= Same as -o >>> operation= Same as -o >>> action= Same as -o >>> timeout= Same as -t >>> cipher= Same as -C >>> method= Same as -M >>> verbose Same as -v >>> >>> >>> On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: >>> >>>> hmm. >>>> Okey: have someone good installation instructions to get this fence_ipmilan to >>>> work. >>>> 1. active IPMI/DCMI over LAN in iLo3 >>>> 2. what should i install in server to get fence_ipmilan to work >>>> Now if i test connection it shows like this. >>>> >>>> ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info >>>> Password: >>>> Get Auth Capabilities error >>>> Get Auth Capabilities error >>>> Error issuing Get Channel Authentication Capabilies request >>>> Error: Unable to establish IPMI v2 / RMCP+ session >>>> Get Device ID command failed >>>> >>>> Can someone help me! >>>> 2011/3/2 >>>> >>>>> >>>>> >>>>> linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: >>>>> >>>>> > Hi >>>>> > >>>>> > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and >>>>> iLo3 >>>>> > I have now in both clusters rhel 5.5 version with: >>>>> > kernel 2.6.18-194.el5 >>>>> > cman-2.0.115-68.el5_6.1 >>>>> > >>>>> > But in fence state i get allways message: Unable to connect/login to >>>>> > fencing device >>>>> > >>>>> > Any help - or must i update the cluster to rhel 5.6? >>>>> >>>>> What fence agent are you using, fence_ilo? >>>>> You will need to use the fence_ipmilan agent for iLO3. >>>>> -- >>>>> Linux-cluster mailing list >>>>> Linux-cluster at redhat.com >>>>> https://www.redhat.com/mailman/listinfo/linux-cluster >>>>> >>>> >>>> >>>> -- >>>> Linux-cluster mailing list >>>> Linux-cluster at redhat.com >>>> https://www.redhat.com/mailman/listinfo/linux-cluster >>>> >>> >>> >>> -- >>> Linux-cluster mailing list >>> Linux-cluster at redhat.com >>> https://www.redhat.com/mailman/listinfo/linux-cluster >>> >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Jason_Henderson at Mitel.com Thu Mar 3 20:18:29 2011 From: Jason_Henderson at Mitel.com (Jason_Henderson at Mitel.com) Date: Thu, 3 Mar 2011 15:18:29 -0500 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: Message-ID: linux-cluster-bounces at redhat.com wrote on 03/03/2011 02:44:18 PM: > When i added -P like down here: > fence_ipmilan -P -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v > Everything works, server reboots. But how do i get this "-P" option > included in fence_ipmilan. > How should my cluster.conf look like..that's the question... Here is an example with passwords removed: > 2011/3/3 ???? ???? > H. > > Maybe iLo3 dont support cycle. why not to use the default , which is > "onoff" . Try it > > I think its good enough . > > -M method > Method to fence (onoff or cycle). Default is onoff. Use cycle in > case your management card will power off with defaultmethod so > there will be no chance to power machine on by IPMI. > > On Thu, Mar 3, 2011 at 3:53 PM, Mika i wrote: > this works: > ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle > Server is rebooted..... > > but not this: > root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v > Rebooting machine @ IPMI:xxixxxxxilo...Spawning: '/usr/bin/ipmitool > -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxx41!' -v chassis power status'... > Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P > 'xxxxxx1!' -v chassis power cycle'... > Failed > > cluster.conf > .... > > name="xxxxxx_xxxxdev" timeout="20"/> > ...... > > login="admin" method="cycle" name="xxxxuxx32_fencedev" passwd="xxxxx!"/> > > > > > > > 2011/3/3 ???? ???? > > Hi. > for ilo3 testing you can use : > # fence_ipmilan -a 17x.3x.7x.1xx -p "password" -o status > > fence_ipmilan -h > usage: fence_ipmilan > -A IPMI Lan Auth type (md5, password, or none) > -a IPMI Lan IP to talk to > -i IPMI Lan IP to talk to (deprecated, use -a) > -p Password (if required) to control power on > IPMI device > -P Use Lanplus > -S Script to retrieve password (if required) > -l Username/Login (if required) to control power > on IPMI device > -o Operation to perform. > Valid operations: on, off, reboot, status > -t Timeout (sec) for IPMI operation (default 20) > -C Ciphersuite to use (same as ipmitool -C parameter) > -M Method to fence (onoff or cycle (default onoff) > -V Print version and exit > -v Verbose mode > > If no options are specified, the following options will be read > from standard input (one per line): > > auth= Same as -A > ipaddr=<#> Same as -a > passwd= Same as -p > passwd_script= Same as -S > lanplus Same as -P > login= Same as -u > option= Same as -o > operation= Same as -o > action= Same as -o > timeout= Same as -t > cipher= Same as -C > method= Same as -M > verbose Same as -v > > On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: > hmm. > Okey: have someone good installation instructions to get this > fence_ipmilan to work. > 1. active IPMI/DCMI over LAN in iLo3 > 2. what should i install in server to get fence_ipmilan to work > Now if i test connection it shows like this. > > ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info > Password: > Get Auth Capabilities error > Get Auth Capabilities error > Error issuing Get Channel Authentication Capabilies request > Error: Unable to establish IPMI v2 / RMCP+ session > Get Device ID command failed > > Can someone help me! 
> 2011/3/2 > > > linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: > > > Hi > > > > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 > > I have now in both clusters rhel 5.5 version with: > > kernel 2.6.18-194.el5 > > cman-2.0.115-68.el5_6.1 > > > > But in fence state i get allways message: Unable to connect/login to > > fencing device > > > > Any help - or must i update the cluster to rhel 5.6? > What fence agent are you using, fence_ilo? > You will need to use the fence_ipmilan agent for iLO3. > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From carlopmart at gmail.com Thu Mar 3 21:27:27 2011 From: carlopmart at gmail.com (carlopmart) Date: Thu, 03 Mar 2011 22:27:27 +0100 Subject: [Linux-cluster] How can I change status check for a script?? Message-ID: <4D7007BF.3000105@gmail.com> Hi all, How can I change status interval for a certain service?? I have tried to insert: under a service without luck. I am using rgmanager-3.0.12-10.el6.i686 and cman-3.0.12-23.el6_0.4.i686 under two RHEL6 hosts. Is it only possible to accomplish this changing /usr/share/cluster/script.sh directly?? Thanks. -- CL Martinez carlopmart {at} gmail {d0t} com From gregory.lee.bartholomew at gmail.com Thu Mar 3 21:29:22 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Thu, 03 Mar 2011 15:29:22 -0600 Subject: [Linux-cluster] Error: "ailed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not installed". Message-ID: <4D700832.9040305@gmail.com> Hi, I'm trying to follow the "clusters from scratch" guide and I'm running Fedora 14. When I try to add the DLM and GFS2 services, crm_mon keeps reporting "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not installed". Does anyone know what I'm missing? Thanks, gb From omerfsen at gmail.com Thu Mar 3 22:08:41 2011 From: omerfsen at gmail.com (Omer Faruk SEN) Date: Fri, 4 Mar 2011 00:08:41 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: See https://access.redhat.com/kb/docs/DOC-39336 2011/3/3 : > > > linux-cluster-bounces at redhat.com wrote on 03/03/2011 02:44:18 PM: > >> When i added -P like?down?here: >> fence_ipmilan -P -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v >> Everything works, server reboots. But how do i get this "-P" option >> included in fence_ipmilan. >> How should my cluster.conf look like..that's the question... > > Here is an example with passwords removed: > > > > ? > ? > ? > > ? > ? ? > ? ? > ? ? > ? > > ? > ? ? > ? ? ? > ? ? ? ? > ? ? ? ? ? login="user" ipaddr="10.39.170.233"/> > ? ? ? ? > ? ? ? > ? ? > ? ? > ? ? ? > ? ? ? ? > ? ? ? ? ? login="user" ipaddr="10.39.170.234"/> > ? ? ? ? > ? ? ? > ? ? > ? > > > > >> 2011/3/3 ???? ???? 
>> H. >> >> Maybe iLo3 dont support cycle. why not to use the default , which is >> "onoff" . Try it >> >> I think its good enough . >> >> -M method >> ?? ? ? ? ? ? ?Method to fence (onoff or cycle). Default is onoff. Use >> cycle in >> ?? ? ? ? ? ? ?case ?your management card will power off with defaultmethod >> so >> ?? ? ? ? ? ? ?there will be no chance to power machine on by IPMI. >> >> On Thu, Mar 3, 2011 at 3:53 PM, Mika i wrote: >> this works: >> ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle >> Server is rebooted..... >> >> but not this: >> root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v >> Rebooting machine @ IPMI:xxixxxxxilo...Spawning: '/usr/bin/ipmitool >> -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxx41!' -v chassis power status'... >> Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P >> 'xxxxxx1!' -v chassis power cycle'... >> Failed >> >> cluster.conf >> .... >> ??????????????????????????????? >> ??????????????????????????????????????? > name="xxxxxx_xxxxdev" timeout="20"/> >> ...... >> >> > login="admin" method="cycle" name="xxxxuxx32_fencedev" passwd="xxxxx!"/> >> >> >> >> >> >> >> 2011/3/3 ???? ???? >> >> Hi. >> for ilo3 testing you can use : >> #?fence_ipmilan -a?17x.3x.7x.1xx -p "password" -o status >> >> ?fence_ipmilan -h >> usage: fence_ipmilan >> ?? -A ?IPMI Lan Auth type (md5, password, or none) >> ?? -a ? ?IPMI Lan IP to talk to >> ?? -i ? ?IPMI Lan IP to talk to (deprecated, use -a) >> ?? -p ?Password (if required) to control power on >> ?? ? ? ? ? ? ? ? ?IPMI device >> ?? -P ? ? ? ? ? ? Use Lanplus >> ?? -S ? ? ?Script to retrieve password (if required) >> ?? -l ? ? Username/Login (if required) to control power >> ?? ? ? ? ? ? ? ? ?on IPMI device >> ?? -o ? ? ? ?Operation to perform. >> ?? ? ? ? ? ? ? ? ?Valid operations: on, off, reboot, status >> ?? -t ? Timeout (sec) for IPMI operation (default 20) >> ?? -C ? ?Ciphersuite to use (same as ipmitool -C parameter) >> ?? -M ? ?Method to fence (onoff or cycle (default onoff) >> ?? -V ? ? ? ? ? ? Print version and exit >> ?? -v ? ? ? ? ? ? Verbose mode >> >> If no options are specified, the following options will be read >> from standard input (one per line): >> >> ?? auth= ? ? ? ? ? Same as -A >> ?? ipaddr=<#> ? ? ? ? ? ?Same as -a >> ?? passwd= ? ? ? ? Same as -p >> ?? passwd_script= ?Same as -S >> ?? lanplus ? ? ? ? ? ? ? Same as -P >> ?? login= ? ? ? ? Same as -u >> ?? option= ? ? ? ? ? Same as -o >> ?? operation= ? ? ? ?Same as -o >> ?? action= ? ? ? ? ? Same as -o >> ?? timeout= ? ? Same as -t >> ?? cipher= ? ? ? Same as -C >> ?? method= ? ? ? Same as -M >> ?? verbose ? ? ? ? ? ? ? Same as -v >> >> On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: >> hmm. >> Okey: have someone good installation instructions to get this >> fence_ipmilan?to work. >> 1. active IPMI/DCMI over LAN in iLo3 >> 2. what should i install in server to get fence_ipmilan to work >> Now if i test connection it shows like this. >> >> ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info >> Password: >> Get Auth Capabilities error >> Get Auth Capabilities error >> Error issuing Get Channel Authentication Capabilies request >> Error: Unable to establish IPMI v2 / RMCP+ session >> Get Device ID command failed >> >> Can someone help me! 
>> 2011/3/2 >> >> >> linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: >> >> > Hi >> > >> > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and iLo3 >> > I have now in both clusters?rhel 5.5 version with: >> > kernel 2.6.18-194.el5 >> > cman-2.0.115-68.el5_6.1 >> > >> > But in fence state i get allways message: Unable to connect/login to >> > fencing device >> > >> > Any help - or?must i update the?cluster to rhel 5.6? > >> What fence agent are you using, fence_ilo? >> You will need to use the fence_ipmilan agent for iLO3. >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From mika68vaan at gmail.com Fri Mar 4 11:14:40 2011 From: mika68vaan at gmail.com (Mika i) Date: Fri, 4 Mar 2011 13:14:40 +0200 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: Hi and thanks to all, i get this worked... this needed to active in cluster.conf, then everything started to work. power_wait="15" -Mika 2011/3/4 Omer Faruk SEN > See https://access.redhat.com/kb/docs/DOC-39336 > > 2011/3/3 : > > > > > > linux-cluster-bounces at redhat.com wrote on 03/03/2011 02:44:18 PM: > > > >> When i added -P like down here: > >> fence_ipmilan -P -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v > >> Everything works, server reboots. But how do i get this "-P" option > >> included in fence_ipmilan. > >> How should my cluster.conf look like..that's the question... > > > > Here is an example with passwords removed: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > login="user" ipaddr="10.39.170.233"/> > > > > > > > > > > > > > > > login="user" ipaddr="10.39.170.234"/> > > > > > > > > > > > > > > > > > >> 2011/3/3 ???? ???? > >> H. > >> > >> Maybe iLo3 dont support cycle. why not to use the default , which is > >> "onoff" . Try it > >> > >> I think its good enough . > >> > >> -M method > >> Method to fence (onoff or cycle). Default is onoff. Use > >> cycle in > >> case your management card will power off with > defaultmethod > >> so > >> there will be no chance to power machine on by IPMI. > >> > >> On Thu, Mar 3, 2011 at 3:53 PM, Mika i wrote: > >> this works: > >> ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle > >> Server is rebooted..... > >> > >> but not this: > >> root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' > -v > >> Rebooting machine @ IPMI:xxixxxxxilo...Spawning: '/usr/bin/ipmitool > >> -I lan -H 'xxxxxxx' -U 'admin' -P 'xxxxx41!' -v chassis power status'... > >> Spawning: '/usr/bin/ipmitool -I lan -H 'xxxxxxx' -U 'admin' -P > >> 'xxxxxx1!' -v chassis power cycle'... > >> Failed > >> > >> cluster.conf > >> .... 
> >> > >> >> name="xxxxxx_xxxxdev" timeout="20"/> > >> ...... > >> > >> >> login="admin" method="cycle" name="xxxxuxx32_fencedev" passwd="xxxxx!"/> > >> > >> > >> > >> > >> > >> > >> 2011/3/3 ???? ???? > >> > >> Hi. > >> for ilo3 testing you can use : > >> # fence_ipmilan -a 17x.3x.7x.1xx -p "password" -o status > >> > >> fence_ipmilan -h > >> usage: fence_ipmilan > >> -A IPMI Lan Auth type (md5, password, or none) > >> -a IPMI Lan IP to talk to > >> -i IPMI Lan IP to talk to (deprecated, use -a) > >> -p Password (if required) to control power on > >> IPMI device > >> -P Use Lanplus > >> -S Script to retrieve password (if required) > >> -l Username/Login (if required) to control power > >> on IPMI device > >> -o Operation to perform. > >> Valid operations: on, off, reboot, status > >> -t Timeout (sec) for IPMI operation (default 20) > >> -C Ciphersuite to use (same as ipmitool -C parameter) > >> -M Method to fence (onoff or cycle (default onoff) > >> -V Print version and exit > >> -v Verbose mode > >> > >> If no options are specified, the following options will be read > >> from standard input (one per line): > >> > >> auth= Same as -A > >> ipaddr=<#> Same as -a > >> passwd= Same as -p > >> passwd_script= Same as -S > >> lanplus Same as -P > >> login= Same as -u > >> option= Same as -o > >> operation= Same as -o > >> action= Same as -o > >> timeout= Same as -t > >> cipher= Same as -C > >> method= Same as -M > >> verbose Same as -v > >> > >> On Thu, Mar 3, 2011 at 12:21 PM, Mika i wrote: > >> hmm. > >> Okey: have someone good installation instructions to get this > >> fence_ipmilan to work. > >> 1. active IPMI/DCMI over LAN in iLo3 > >> 2. what should i install in server to get fence_ipmilan to work > >> Now if i test connection it shows like this. > >> > >> ipmitool -v -H 17x.3x.7x.1xx -I lanplus -U admin mc info > >> Password: > >> Get Auth Capabilities error > >> Get Auth Capabilities error > >> Error issuing Get Channel Authentication Capabilies request > >> Error: Unable to establish IPMI v2 / RMCP+ session > >> Get Device ID command failed > >> > >> Can someone help me! > >> 2011/3/2 > >> > >> > >> linux-cluster-bounces at redhat.com wrote on 03/02/2011 10:07:38 AM: > >> > >> > Hi > >> > > >> > Is there a way to get cluster-suite Fence to work with Rhel 5.5 and > iLo3 > >> > I have now in both clusters rhel 5.5 version with: > >> > kernel 2.6.18-194.el5 > >> > cman-2.0.115-68.el5_6.1 > >> > > >> > But in fence state i get allways message: Unable to connect/login to > >> > fencing device > >> > > >> > Any help - or must i update the cluster to rhel 5.6? > > > >> What fence agent are you using, fence_ilo? > >> You will need to use the fence_ipmilan agent for iLO3. 
> >> -- > >> Linux-cluster mailing list > >> Linux-cluster at redhat.com > >> https://www.redhat.com/mailman/listinfo/linux-cluster > >> > >> > >> -- > >> Linux-cluster mailing list > >> Linux-cluster at redhat.com > >> https://www.redhat.com/mailman/listinfo/linux-cluster > >> > >> > >> -- > >> Linux-cluster mailing list > >> Linux-cluster at redhat.com > >> https://www.redhat.com/mailman/listinfo/linux-cluster > >> > >> > >> -- > >> Linux-cluster mailing list > >> Linux-cluster at redhat.com > >> https://www.redhat.com/mailman/listinfo/linux-cluster > >> > >> > >> -- > >> Linux-cluster mailing list > >> Linux-cluster at redhat.com > >> https://www.redhat.com/mailman/listinfo/linux-cluster > >> -- > >> Linux-cluster mailing list > >> Linux-cluster at redhat.com > >> https://www.redhat.com/mailman/listinfo/linux-cluster > > -- > > Linux-cluster mailing list > > Linux-cluster at redhat.com > > https://www.redhat.com/mailman/listinfo/linux-cluster > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lhh at redhat.com Fri Mar 4 17:06:48 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 12:06:48 -0500 Subject: [Linux-cluster] Service location (colocation) In-Reply-To: <4D6A7709.6060108@gmail.com> References: <4D6A7709.6060108@gmail.com> Message-ID: <20110304170647.GC14803@redhat.com> On Sun, Feb 27, 2011 at 06:08:41PM +0200, Budai Laszlo wrote: > Hi all, > > is there a way to define location dependencies among services? for > instance how can I define that Service A should run on the same node as > service B? Or the opposite: Service C should run on a different node > than service D? > rgmanager doesn't have this feature built-in; you can define 'collocated services' by simply creating one large service comprising all of the resources for both services. You could probably trivially extend central_processing mode to do "anti collocation" (i.e. run on another node). The 'follow_service.sl' script is an example of how to do part of 'anti-collocation'. The way it works, it starts service A on a different node from service B. If the node running service A fails, it is started on the same node as service B, then service B is moved away to another (empty, usually) node in the cluster. Alternatively, pacemaker supports this functionality. -- Lon Hohberger - Red Hat, Inc. From lhh at redhat.com Fri Mar 4 17:08:54 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 12:08:54 -0500 Subject: [Linux-cluster] GFS2 In-Reply-To: <4D6AED6F.9000203@gmail.com> References: <4D6AED6F.9000203@gmail.com> Message-ID: <20110304170853.GD14803@redhat.com> On Mon, Feb 28, 2011 at 02:33:51AM +0200, Budai Laszlo wrote: > Hi all, > > in which version of RHEL GFS2 is considered production ready? 5.3? > RHEL 5.3 and later, gfs2 moved in to 'full support'. However, 5.3 EUS (async errata, updates/fixes/etc) closed in January, so ideally, you should move to RHEL 5.6 - which has loads of fixes for gfs2 compared to RHEL 5.3. -- Lon Hohberger - Red Hat, Inc. 
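As a rough sketch of the "one large service" approach to collocation described a couple of messages back (the service name, IP address and script paths below are made-up placeholders, not taken from anyone's configuration):

    <service name="svcA-and-svcB" autostart="1">
        <ip address="10.0.0.50" monitor_link="1"/>
        <script name="serviceA" file="/etc/init.d/service-a"/>
        <script name="serviceB" file="/etc/init.d/service-b"/>
    </service>

Because both scripts live inside the same <service>, rgmanager starts, stops and relocates them together, so they always land on the same node.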
From lhh at redhat.com Fri Mar 4 17:15:45 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 12:15:45 -0500 Subject: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent In-Reply-To: References: <20110228161406.GA14120@redhat.com> Message-ID: <20110304171545.GE14803@redhat.com> On Tue, Mar 01, 2011 at 06:50:18PM +0530, Parvez Shaikh wrote: > Hi Ryan, > > Thank you for response. Does it mean there is no way to intimate > administrator about failure of fencing as of now? > > Let me give more information about my cluster - > > I have set of nodes in cluster with only IP resource being protected. I have > two levels of fencing, first bladecenter fencing and second one is manual > fencing. If the problem you have with fence_bladecenter is intermittent - for example, if it fails 1/2 the time, fence_manual is going to *detract* from your cluster's ability to recover automatically. Ordinarily, if a fencing action fails, fenced will automatically retry the operation. When you configure fence_manual as a backup, this retry will *never* occur, meaning your cluster hangs. > At times if machine is already down(either power failure or turned off > abrupty); blade center fencing timesout and manual fencing happens. At this > time, administrator is expected to run fence_ack_manual. > Clearly this is not something which is desirable, as downtime of services is > as long as administrator runs fence_ack_manual. > What is recommended method to deal with blade center fencing failure in > this situation? Do I have to add another level of fencing(between blade > center and manual) which can fence automatically(not requiring manual > interference)? Start with removing fence_manual. If fencing is failing (permanently), you can still run: fence_ack_manual -e -n > > > my bladecenter fencing agent, I sometimes get message saying bladecenter > > > fencing failed because of timeout or fence device IP address/user > > > credentials are incorrect. ^^ This is why I think fence_manual is, in your specific case, very likely hurting your availability. -- Lon Hohberger - Red Hat, Inc. From lhh at redhat.com Fri Mar 4 17:18:22 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 12:18:22 -0500 Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: References: <852499.20065.qm@web112812.mail.gq1.yahoo.com> Message-ID: <20110304171822.GF14803@redhat.com> On Thu, Mar 03, 2011 at 11:20:37AM +0100, Seb wrote: > [snip]config[/snip] > > There is no section in your config file? > Have you been able to identify a quorum disk on the nodes? Small nitpick - I'd really recommend against even trying to start qdiskd / use a quorum disk in a 16 node cluster. -- Lon Hohberger - Red Hat, Inc. From lhh at redhat.com Fri Mar 4 17:20:40 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 12:20:40 -0500 Subject: [Linux-cluster] iLo3 and RedHat 5.5 : Unable to connect/login to fencing device In-Reply-To: References: Message-ID: <20110304172040.GG14803@redhat.com> On Thu, Mar 03, 2011 at 03:53:32PM +0200, Mika i wrote: > this works: > ipmitool -H xxxxxilo -I lanplus -U admin -P xxxxxx chassis power cycle > Server is rebooted..... > > but not this: > root at fff fence_ipmilan -a xxxxxxxxlo -l admin -p xxxxxxxxx -M 'cycle' -v You forgot -P (for lanplus) -- Lon Hohberger - Red Hat, Inc. 
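Pulling the working pieces of this iLO3 thread together, a minimal sketch might look like the following — the address, login and password are placeholders, and the power_wait value is a site-specific setting to tune, not a recommendation:

    # command-line test, with lanplus enabled via -P:
    fence_ipmilan -P -a ilo3.example.com -l admin -p secret -o status -v

    # roughly equivalent cluster.conf fence device; lanplus="1" corresponds to -P:
    <fencedevice agent="fence_ipmilan" name="node1_ilo3" ipaddr="ilo3.example.com"
                 login="admin" passwd="secret" lanplus="1" power_wait="15"/>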
From parvez.h.shaikh at gmail.com Fri Mar 4 17:45:07 2011 From: parvez.h.shaikh at gmail.com (Parvez Shaikh) Date: Fri, 4 Mar 2011 23:15:07 +0530 Subject: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent In-Reply-To: <20110304171545.GE14803@redhat.com> References: <20110228161406.GA14120@redhat.com> <20110304171545.GE14803@redhat.com> Message-ID: Hi Lon, Thank you for reply. What I gathered from your response is to remove manual fencing at once. This will cause fence daemon to retry fence_bladecenter until the node is fenced. More likely the fenced will succeed in fencing the failed node(provided IP, user name and password for bladecenter management module are right); even if it times out for the first time. Am I right? I will try removing manual fencing and see how things go. >> If fencing is failing (permanently), you can still run: >> fence_ack_manual -e -n By the way as per my understanding fence_ack_manual -n can be executed to acknowledge only manually fenced node(and not bladecenter fenced node), correct me if this understanding is wrong. So God forbid, if fence_bladecenter fails for some reason; we still have option to run fence_manual and then fence_ack_manual, so cluster is back to working. Thanks again and have great weekend ahead Yours truly, Parvez On Fri, Mar 4, 2011 at 10:45 PM, Lon Hohberger wrote: > On Tue, Mar 01, 2011 at 06:50:18PM +0530, Parvez Shaikh wrote: > > Hi Ryan, > > > > Thank you for response. Does it mean there is no way to intimate > > administrator about failure of fencing as of now? > > > > Let me give more information about my cluster - > > > > I have set of nodes in cluster with only IP resource being protected. I > have > > two levels of fencing, first bladecenter fencing and second one is manual > > fencing. > > If the problem you have with fence_bladecenter is intermittent - for > example, if it fails 1/2 the time, fence_manual is going to *detract* > from your cluster's ability to recover automatically. > > Ordinarily, if a fencing action fails, fenced will automatically retry > the operation. > > When you configure fence_manual as a backup, this retry will *never* > occur, meaning your cluster hangs. > > > > At times if machine is already down(either power failure or turned off > > abrupty); blade center fencing timesout and manual fencing happens. At > this > > time, administrator is expected to run fence_ack_manual. > > > Clearly this is not something which is desirable, as downtime of services > is > > as long as administrator runs fence_ack_manual. > > > What is recommended method to deal with blade center fencing failure in > > this situation? Do I have to add another level of fencing(between blade > > center and manual) which can fence automatically(not requiring manual > > interference)? > > Start with removing fence_manual. > > If fencing is failing (permanently), you can still run: > > fence_ack_manual -e -n > > > > > my bladecenter fencing agent, I sometimes get message saying > bladecenter > > > > fencing failed because of timeout or fence device IP address/user > > > > credentials are incorrect. > > ^^ This is why I think fence_manual is, in your specific case, very > likely hurting your availability. > > -- > Lon Hohberger - Red Hat, Inc. > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From lhh at redhat.com Fri Mar 4 18:01:20 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 13:01:20 -0500 Subject: [Linux-cluster] Info on vm definitions and options in stable3 In-Reply-To: References: Message-ID: <20110304180120.GH14803@redhat.com> On Thu, Mar 03, 2011 at 05:16:06PM +0100, Gianluca Cecchi wrote: > Hello, > in stable 3 I can have this kind of config for a KVM virtual machine > to manage live migration: > > > > > > It works ok, but I would like to know the possible parameters I can set. > At http://sources.redhat.com/cluster/wiki/VirtualMachineBehaviors I > can see this piece > "..Most of the behaviors are common with normal services.." > with a reference to start, stop, status monitoring, relocation, recovery > > Where could I find a complete list? > For example are failover domains usable inside the line? > Or autostart option? I know how to manage autostart in a standalone > virt-manager environment, but when in a cluster of hosts? http://sources.redhat.com/cluster/wiki/ServiceOperationalBehaviors http://sources.redhat.com/cluster/wiki/ServicePolicies http://sources.redhat.com/cluster/wiki/FailoverDomains > Or dependency lines such as > > < vm name="vm1" ... > > > If you add a child of a VM, you can no longer live-migrate it. > to power on vm2 only after power on of vm1? you can use 'depend=' if you want, but rgmanager's handling of this sort of dependency is rudimentary at best: Will work, but if you stop vm2, vm1 will be stopped after vm2. > About "transient domain support": > In the stable3 implementation of rhel6 (or in general in stable 3 if > it applies generally) a line such as this: > > > > where /etc/libvirt/qemu/myvm.xm is not on a shared path, is it > supposed that if I have myvm on node 1 and run > clusvcadm -M vm:myvm -m node2 > > the file is deleted from node 1 and created in node 2 automatically or not? You need to have the description on each host in the cluster. -- Lon Hohberger - Red Hat, Inc. From lhh at redhat.com Fri Mar 4 18:04:55 2011 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 4 Mar 2011 13:04:55 -0500 Subject: [Linux-cluster] How can I change status check for a script?? In-Reply-To: <4D7007BF.3000105@gmail.com> References: <4D7007BF.3000105@gmail.com> Message-ID: <20110304180455.GI14803@redhat.com> On Thu, Mar 03, 2011 at 10:27:27PM +0100, carlopmart wrote: > Hi all, > > How can I change status interval for a certain service?? I have > tried to insert: > > > > under a service without luck. I am using > rgmanager-3.0.12-10.el6.i686 and cman-3.0.12-23.el6_0.4.i686 under > two RHEL6 hosts. Checks are per-resource; the "service" meta-resource is largely a no-op for "status"; you'd have to redefine it for each child of the service. For example: ... will effectively do nothing; you'd have to do: Additionally, you can't redefine actions in a "ref"; you must do it where the resource is defined: http://sources.redhat.com/cluster/wiki/ResourceActions -- Lon Hohberger - Red Hat, Inc. 
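The inline cluster.conf examples in the reply above appear to have been eaten by the list archiver; as a generic illustration of overriding the status action where the resource is defined (the service/script names and the 120-second interval are invented for the example):

    <service name="myservice" autostart="1">
        <script name="myscript" file="/etc/init.d/myscript">
            <!-- override the status check interval for this resource -->
            <action name="status" depth="*" interval="120"/>
        </script>
    </service>

Putting the <action> element directly under <service>, or under a <script ref="..."/>, appears to have no effect, which matches the behaviour described above.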
From iarlyy at gmail.com Fri Mar 4 18:17:51 2011 From: iarlyy at gmail.com (iarly selbir) Date: Fri, 4 Mar 2011 15:17:51 -0300 Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: References: <852499.20065.qm@web112812.mail.gq1.yahoo.com> Message-ID: Can you tell me where I can found more information about this secion ( quorumd ), I'm having a similar issue, some switches is failing, in this moment the cluster is unable to check status of the nodes, the cluster hangs and on /var/log/messages repeats this message ( unable to connect to cluster... ) . Thank you so much. - - iarlyy selbir :wq! On Thu, Mar 3, 2011 at 7:20 AM, Seb wrote: > 2011/3/3 Srija > > Hi all, >> >> Here is the issue with the cluster describing below: >> >> The cluster is built with 16 nodes. All rhel5.5 86_64 bit OS. >> yesterday night two servers were rebooted and after that these >> two servers are not joining to the cluster. >> >> I was not the part of the team when it is built. and my knowledge >> regarding cluster is also little bit. >> >> Here is the scenario: >> >> - There is no quorum disks. But the person >> who has built the cluster he is telling he has executed the quorum >> from command line, [ i am not sure of that ] >> >> - The errors in the message log are showing as >> >> ccsd[24182]: Unable to connect to cluster infrastructure after 12060 >> seconds , it is a continuous error message in the log file >> >> The cluster.conf are as follows: >> > [snip]config[/snip] > > There is no section in your config file? > Have you been able to identify a quorum disk on the nodes? > > The host-priv.domain.org is in your /etc/hosts? on all nodes? > > Why have they been rebooted? for maintenance/upgrade? > > Any iptable used? > > Could you please provide the logs showing the start of the cluster service? > > >> It seems it is a very basic configuration. But at this stage more >> important >> is, to attach the two servers in the cluster environment. >> >> If more information is needed , i will provide. >> >> Any advice is appreciated. >> >> Thanks in advance >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cos at aaaaa.org Fri Mar 4 19:49:23 2011 From: cos at aaaaa.org (Ofer Inbar) Date: Fri, 4 Mar 2011 14:49:23 -0500 Subject: [Linux-cluster] rg_test for testing other resource agent functions? Message-ID: <20110304194923.GX934@mip.aaaaa.org> I can do this: sudo rg_test test /etc/cluster/cluster.conf status service [servicename] To see what happens when I run a resource agent with the "status" command line argument, in exactly the same context as RHCS would run it - using the environment variables derived from cluster.conf, and potentially running multiple resource agents or the same one more than once with different variables, depending on what resources are defined for that service. It would be very useful to be able to use a similar framework to run an arbitrary script, or an arbitrary resource agent command line option, with the same automatic expansion from cluster.conf. Unfortunately, rg_test only supports a short hardcoded set of options: stop, start, and status. 
For example, I want to add a "verify" procedure to my resource agent, that I'd like to kick off from a monitoring script on my own schedule, but I want to make sure that it is run in the same context as the resource agent's status check is normally run. I could write some separate cluster.conf parser that simulates what I think rgmanager would do, but I might get it wrong. Or rgmanager might change in a future version and I wouldn't track the change. Is there anything like rg_test that might let me do this, or has anyone patched rg_test to allow it? Something as simple as: sudo rg_test test /etc/cluster/cluster.conf [foo] service [servicename] ... where it would simply call the resource agent the same way as it does for status/start/stop, but substitute whatever command line argument I give it. Or do I have to reverse-engineer my own cluster.conf parsing to set up the environment and run the script(s) myself (duplicating what rg_test already does for status/start/stop) ? -- Cos From swap_project at yahoo.com Fri Mar 4 21:23:46 2011 From: swap_project at yahoo.com (Srija) Date: Fri, 4 Mar 2011 13:23:46 -0800 (PST) Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: <20110304171822.GF14803@redhat.com> Message-ID: <19011.9401.qm@web112811.mail.gq1.yahoo.com> Hi, It will be really appreciated if you send the documentation of building cluster. I think max 16 nodes are permitable for cluster. If you think it is better to divide into two clusters that is also ok. But I need some running ( i mean without any issue) configuration to follow. There are many docs in the web, but it is difficult to follow those docs specially on cluster . Once I get a running cluster doc , after that on that basis , I can go further for enhancing the knowledge on cluster. Thanks again --- On Fri, 3/4/11, Lon Hohberger wrote: > From: Lon Hohberger > Subject: Re: [Linux-cluster] Nodes are not joining to the cluster > To: "linux clustering" > Date: Friday, March 4, 2011, 12:18 PM > On Thu, Mar 03, 2011 at 11:20:37AM > +0100, Seb wrote: > > [snip]config[/snip] > > > > There is no section in your config > file? > > Have you been able to identify a quorum disk on the > nodes? > > Small nitpick - > > I'd really recommend against even trying to start qdiskd / > use a quorum > disk in a 16 node cluster. > > -- > Lon Hohberger - Red Hat, Inc. > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From pradhanparas at gmail.com Fri Mar 4 22:40:36 2011 From: pradhanparas at gmail.com (Paras pradhan) Date: Fri, 4 Mar 2011 16:40:36 -0600 Subject: [Linux-cluster] GFS2 write Message-ID: Hi. I was trying to copy a 400 GB file to a gfs2 share. It was copying at 50MB/s approx. Suddenly after copying 80% ,the rate dropped to 30KB/s and stayed like that. I tried to kill the process but could't (which is normal) and after few minutes it was killed. Then I tried it again after few minutes and it was successfully copied at 50MB/s. But then after it looks like accessing the GFS share (even ls -l /gfsmount) takes 10-15 seconds to complete. Then I rebooted this node and everything is back normal. I am really confused what has gone wrong. GFS is running with all default parameters . Thanks! Paras. 
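Not a fix, but if the slowdown happens again it may help to capture GFS2's glock state while the copy is crawling; a minimal sketch, assuming a filesystem whose debugfs directory is named mycluster:mygfs (both names are placeholders):

    # mount debugfs if it is not already mounted
    mount -t debugfs none /sys/kernel/debug 2>/dev/null

    # snapshot the glock dump for the filesystem (directory is <clustername>:<fsname>)
    cat /sys/kernel/debug/gfs2/mycluster:mygfs/glocks > /tmp/glocks.$(date +%s)

Glocks that accumulate long waiter lists in that dump usually point at the inode or resource group the node is stuck on.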
From gianluca.cecchi at gmail.com Sat Mar 5 14:03:25 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Sat, 5 Mar 2011 15:03:25 +0100 Subject: [Linux-cluster] Info on vm definitions and options in stable3 In-Reply-To: References: Message-ID: On Fri, 4 Mar 2011 13:01:20 -0500 Lon Hohberger wrote: > http://sources.redhat.com/cluster/wiki/ServiceOperationalBehaviors > http://sources.redhat.com/cluster/wiki/ServicePolicies > http://sources.redhat.com/cluster/wiki/FailoverDomains Thanks for the links Some comments: 1) http://sources.redhat.com/cluster/wiki/ServicePolicies probably to correct near the end from The above service tolerance is 3 restarts in 10 minutes. to The above service tolerance is 3 restarts in 5 minutes. 3) http://sources.redhat.com/cluster/wiki/FailoverDomains It could be useful to add at the top a comment such as the one in 1) (Note: These policies also apply to virtual machine resources.) for example something like: Note: Failover Domains concepts also apply to virtual machine resources. 2) http://sources.redhat.com/cluster/wiki/ServiceOperationalBehaviors Here the application to virtual resources is implicit due to the various references inside the page itself Cheers, Gianluca From gianluca.cecchi at gmail.com Sat Mar 5 14:30:47 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Sat, 5 Mar 2011 15:30:47 +0100 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed Message-ID: I have two rh el 6 systems configured with rhcs and clvmd. General cluster services seems to be ok. As I'm not able to successfully migrate a vm through clusvcadm, I'm now downsizing the problem to direct virsh command that fails when called from clusvcadm. The guest's storage is composed by two disks that are clustered logical volumes vm definition is the xml file is the same at both hosts At first I verified correct startup on both nodes this way: - vm running on host2 with resource recovery policy set "relocate" - shutdown vm from inside its operating system - the cluster notices this and correctly restarts it on host1 - shutdown vm from inside its operating system - the cluster notices this and correctly restarts it on host1 I have also ssh equivalence in place (for the intracluster names) so that I can run from host2: [host2 ] # virsh -c qemu+ssh://intrarhev1/system list without need of password input. If I try the command used by the cluster itself (after stopping the vm from clusvcadm): # virsh migrate --live exorapr1 qemu+ssh://intrarhev1/system I receive: error: operation failed: Migration unexpectedly failed On host2: [host2 ] # virsh list Id Name State ---------------------------------- 3 exorapr1 running In messages: Mar 4 14:27:30 host2 libvirtd: 14:27:30.527: error : qemuDomainWaitForMigrationComplete:5394 : operation failed: Migration unexpectedly failed Setting this: [root at host2 libvirt]# export LIBVIRT_DEBUG=1 [root at host2 libvirt]# export LIBVIRT_LOG_OUTPUTS="1:file:/tmp/virsh.log" I get the file I'm going to attach (due to migration happening with intracluster network, the names are intrarhev1 and intrarhev2 on that LAN) It seems no more information in the file.... Any hints on further debugging? If there is not any big mistake at my side I could also open an official case, as these two systems are under subscription maintenance... Thanks in advance, Gianluca -------------- next part -------------- A non-text attachment was scrubbed... 
Name: virsh.log Type: application/octet-stream Size: 14718 bytes Desc: not available URL: From ra at ra.is Sat Mar 5 16:36:10 2011 From: ra at ra.is (Richard Allen) Date: Sat, 05 Mar 2011 11:36:10 -0500 Subject: [Linux-cluster] RHEL6 HA addon In-Reply-To: <4D66863E.3070304@alteeve.com> References: <4D667CE7.1050501@ra.is> <4D66863E.3070304@alteeve.com> Message-ID: <4D72667A.90803@ra.is> On 02/24/2011 11:24 AM, Digimer wrote: > On 02/24/2011 10:44 AM, Richard Allen wrote: >> Hi all >> >> I notice in the Release Notes for RHEL6 that many changes have been made >> to the Cluster Suite (HA Addon) but I am unable to find any mention of >> how the new suite does heartbeat. >> In previous versions the Cluster could only do heartbeats (node >> intercommunication) on one network link and for redundancy the only >> option was to use bonded network devices. >> There was a way to add a second heartbeat using altnode directives in >> the XML config file but that always felt a bit hackish and was only >> limited to only one altnode, giving two heartbeat paths. >> >> So I would like to ask how RHEL6 does this. If I have nodes with 4 10Gb >> NIC's, one connected to an admin network, another to a Database network >> and one to the Application network and the last one connected directly >> to the other node with a crossover cable, can the cluster now use all >> possible paths to communicate to the other nodes or will one of those >> paths become a single point of failure in the cluster? >> >> I'm used to using Clusters like HP's ServiceGuard where I can easily >> define which links to use as heartbeat. It can even use a serial >> connection (in a two node cluster) as a additional heartbeat and I have >> always felt this is quite a big limitation in Red Hat's cluster suite up >> to RHEL6 atleast. >> >> Thanks in advance >> Richard. > Hi Richard, > > Can I assume that you are talking about High Availability in general, > as opposed to Heartbeat specifically? If not, the rest won't be too > relevant. > > As you know, the 'altnode' parameter is how you assign a second link. > This is still the case (as is bonding to get more links, but that > requires common subnets which you don't have). > > Corosync is used as the cluster communication layer (as opposed to > openais from RHEL 5.x). It supports one or two interfaces for "totem" > communication. If the main fails, the second link will be used > automatically. However, when the main is restored, totem must be > manually moved back to the original link. > > So in short; as it was in 5, so it is in 6. That said, the 'altname' > is perfectly valid way of removing that SPF. :) > Thanks for the reply. I was reading up on this and I noticed something new in the cman(5) man page. Quote: Multi-home configuration It is quite common to use multiple ethernet adapters for cluster nodes, so they will toler- ate the failure of one link. A common way to do this is to use ethernet bonding. Alterna- tively you can get corosync to run in redundant ring mode by specifying an ?altname? for the node. This is an alternative name by which the node is known, that resolves to another IP address used on the other ethernet adapter(s). You can optionally specify a different port and/or multicast address for each altname in use. Up to 9 altnames (10 interfaces in total) can be used. Note that if you are using the DLM with cman/corosync then you MUST tell it to use SCTP as it?s communications protocol as TCP does not support multihoming. So I can use up to 9 altnames now? 
Is true, it would be fantastic :) -- Rikki. -- RHCE, RHCX, HP-UX Certified Administrator. -- Solaris 7 Certified Systems and Network Administrator. Bell Labs Unix -- Reach out and grep someone. Those who do not understand Unix are condemned to reinvent it, poorly. From swap_project at yahoo.com Sat Mar 5 22:53:51 2011 From: swap_project at yahoo.com (Srija) Date: Sat, 5 Mar 2011 14:53:51 -0800 (PST) Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: <20110304171822.GF14803@redhat.com> Message-ID: <90386.57864.qm@web112805.mail.gq1.yahoo.com> Hi, Just a query , I am not very much clear so asking, > I'd really recommend against even trying to start qdiskd / > use a quorum > disk in a 16 node cluster. Did you ask not to use qdiskd /quorum disk in the 16 nodes cluster ? Thanks --- On Fri, 3/4/11, Lon Hohberger wrote: > From: Lon Hohberger > Subject: Re: [Linux-cluster] Nodes are not joining to the cluster > To: "linux clustering" > Date: Friday, March 4, 2011, 12:18 PM > On Thu, Mar 03, 2011 at 11:20:37AM > +0100, Seb wrote: > > [snip]config[/snip] > > > > There is no section in your config > file? > > Have you been able to identify a quorum disk on the > nodes? > > Small nitpick - > > I'd really recommend against even trying to start qdiskd / > use a quorum > disk in a 16 node cluster. > > -- > Lon Hohberger - Red Hat, Inc. > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From ooolinux at 163.com Sun Mar 6 05:30:01 2011 From: ooolinux at 163.com (yue) Date: Sun, 6 Mar 2011 13:30:01 +0800 (CST) Subject: [Linux-cluster] is ocfs2 is limited 16T Message-ID: <56a50421.709d.12e89a4c7cb.Coremail.ooolinux@163.com> if there is a limit on ocfs2'volume? it must less 16T? thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: From wen.gang.wang at oracle.com Sun Mar 6 11:35:57 2011 From: wen.gang.wang at oracle.com (Wengang Wang) Date: Sun, 6 Mar 2011 19:35:57 +0800 Subject: [Linux-cluster] is ocfs2 is limited 16T In-Reply-To: <56a50421.709d.12e89a4c7cb.Coremail.ooolinux@163.com> References: <56a50421.709d.12e89a4c7cb.Coremail.ooolinux@163.com> Message-ID: <20110306113557.GC2756@laptop> Hi, For mainline kernel, there is no such limit. But for existing ocfs2 1.2/1.4/1.6, there is a 16TB limit. thanks, wengang. if there is a limit on ocfs2'volume? it must less 16T? thanks On 11-03-06 13:30, yue wrote: > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From jakov.sosic at srce.hr Sun Mar 6 12:38:55 2011 From: jakov.sosic at srce.hr (Jakov Sosic) Date: Sun, 06 Mar 2011 13:38:55 +0100 Subject: [Linux-cluster] is ocfs2 is limited 16T In-Reply-To: <56a50421.709d.12e89a4c7cb.Coremail.ooolinux@163.com> References: <56a50421.709d.12e89a4c7cb.Coremail.ooolinux@163.com> Message-ID: <4D73805F.8020308@srce.hr> On 03/06/2011 06:30 AM, yue wrote: > if there is a limit on ocfs2'volume? it must less 16T? For RHEL v5.x and derivateves yes. But you can hack it and rebuild kernel modules without limitation. You also need to patch kernel-sources and rebuild kernel too. 
-- Jakov Sosic www.srce.hr From balajisundar at midascomm.com Mon Mar 7 08:33:41 2011 From: balajisundar at midascomm.com (Balaji Sundar) Date: Mon, 7 Mar 2011 14:03:41 +0530 (IST) Subject: [Linux-cluster] rgmanager not running Message-ID: <38415.59.90.241.47.1299486821.squirrel@59.90.241.47> Dear All, I have using RHEL6 Linux and Kernel Version is 2.6.32-71.el6.i686 I have configured Cluster Suite with 2 servers Server 1 : 192.168.13.131 IP Address and hostname is primary Server 2 : 192.168.13.132 IP Address and hostname is secondary Floating : 192.168.13.133 IP Address (Assumed by currently active server) I have verified that service cman is running and cluster.conf is valid using ccs_config_validate command Finally i found that rgmanager is not running and services are not started [root at primary cluster]# service rgmanager status rgmanager dead but pid file exists [root at primary cluster]# [root at primary cluster]# cman_tool services [root at primary cluster]# [root at primary cluster]# cman_tool status Version: 6.2.0 Config Version: 1 Cluster Name: EMSCluster Cluster Id: 808 Cluster Member: Yes Cluster Generation: 96 Membership state: Cluster-Member Nodes: 1 Expected votes: 1 Total votes: 1 Node votes: 1 Quorum: 1 Active subsystems: 7 Flags: 2node Ports Bound: 0 Node name: primary Node ID: 1 Multicast addresses: 239.192.3.43 Node addresses: 192.168.13.131 [root at primary cluster]# Found some error messages in "/var/log/messages" file Mar 7 14:39:42 primary corosync[7155]: [CMAN ] quorum regained, resuming activity Mar 7 14:39:42 primary corosync[7155]: [QUORUM] This node is within the primary component and will provide service. Mar 7 14:39:42 primary corosync[7155]: [QUORUM] Members[1]: 1 Mar 7 14:39:42 primary corosync[7155]: [QUORUM] Members[1]: 1 Mar 7 14:39:42 primary corosync[7155]: [CPG ] downlist received left_list: 0 Mar 7 14:39:42 primary corosync[7155]: [CPG ] chosen downlist from node r(0) ip(192.168.13.131) Mar 7 14:39:42 primary corosync[7155]: [MAIN ] Completed service synchronization, ready to provide service. Mar 7 14:39:44 primary fenced[7210]: fenced 3.0.12 started Mar 7 14:39:45 primary dlm_controld[7224]: dlm_controld 3.0.12 started Mar 7 14:39:45 primary gfs_controld[7254]: gfs_controld 3.0.12 started Mar 7 14:39:45 primary kernel: dlm: Using TCP for communications Mar 7 14:39:45 primary dlm_controld[7224]: dlm_join_lockspace no fence domain Mar 7 14:39:45 primary dlm_controld[7224]: process_uevent online@ error -1 errno 2 Mar 7 14:39:45 primary kernel: dlm: rgmanager: group join failed -1 -1 Found some error messages in "/var/log/cluster/dlm_controld.log" file Mar 07 14:39:45 dlm_controld dlm_controld 3.0.12 started Mar 07 14:39:45 dlm_controld dlm_join_lockspace no fence domain Mar 07 14:39:45 dlm_controld process_uevent online@ error -1 errno 2 I don't know what is the problem and Can some one throw light on this peculiar problem Thanks in Advance --Regards S.Balaji From sdake at redhat.com Mon Mar 7 15:09:39 2011 From: sdake at redhat.com (Steven Dake) Date: Mon, 07 Mar 2011 08:09:39 -0700 Subject: [Linux-cluster] RHEL6 HA addon In-Reply-To: <4D72667A.90803@ra.is> References: <4D667CE7.1050501@ra.is> <4D66863E.3070304@alteeve.com> <4D72667A.90803@ra.is> Message-ID: <4D74F533.70907@redhat.com> > Note that if you are using the DLM with cman/corosync then you MUST tell it > to use SCTP as > it?s communications protocol as TCP does not support multihoming. > > > So I can use up to 9 altnames now? 
Is true, it would be fantastic :) > corosync supports a max of two interfaces. If we ever get around to supporting redundant ring well, we will add a larger number of redundant rings (ie: make it configurable in the packet data). Regards -steve From gregory.lee.bartholomew at gmail.com Mon Mar 7 16:15:29 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Mon, 07 Mar 2011 10:15:29 -0600 Subject: [Linux-cluster] Error: "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not installed". Message-ID: <4D7504A1.3090603@gmail.com> Hi All, I'm trying to follow the "clusters from scratch" guide and I'm running Fedora 14. When I try to add the DLM and GFS2 services, crm_mon keeps reporting "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not installed". Does anyone know what I'm missing? Thanks, gb From rpeterso at redhat.com Mon Mar 7 16:35:58 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Mon, 7 Mar 2011 11:35:58 -0500 (EST) Subject: [Linux-cluster] Error: "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not installed". In-Reply-To: <4D7504A1.3090603@gmail.com> Message-ID: <1229003616.324573.1299515758308.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | Hi All, | | I'm trying to follow the "clusters from scratch" guide and I'm | running Fedora 14. | | When I try to add the DLM and GFS2 services, crm_mon keeps reporting | "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not | installed". | | Does anyone know what I'm missing? | | Thanks, | gb Hm, it sounds like you don't have the debugfs mounted and some piece of software (likely crm_mon) is expecting it. Try adding something like this to /etc/fstab: debugfs /sys/kernel/debug debugfs defaults 0 0 and doing mount -a Regards, Bob Peterson Red Hat File Systems From rpeterso at redhat.com Mon Mar 7 18:43:34 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Mon, 7 Mar 2011 13:43:34 -0500 (EST) Subject: [Linux-cluster] GFS2 write In-Reply-To: Message-ID: <1152011131.327272.1299523414066.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | Hi. | | I was trying to copy a 400 GB file to a gfs2 share. It was copying at | 50MB/s approx. Suddenly after copying 80% ,the rate dropped to 30KB/s | and stayed like that. I tried to kill the process but could't (which | is normal) and after few minutes it was killed. Then I tried it again | after few minutes and it was successfully copied at 50MB/s. But then | after it looks like accessing the GFS share (even ls -l /gfsmount) | takes 10-15 seconds to complete. Then I rebooted this node and | everything is back normal. | | I am really confused what has gone wrong. GFS is running with all | default parameters . | | Thanks! | Paras. Hi Paras, I think I've recreated the problem and I'm investigating it now. I hope to have an answer soon (maybe today). Looks like a bug to me, and so I'll see if I can generate a patch to fix it. That may take a few days. Regards, Bob Peterson Red Hat File Systems From gregory.lee.bartholomew at gmail.com Mon Mar 7 19:26:13 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Mon, 07 Mar 2011 13:26:13 -0600 Subject: [Linux-cluster] Error: "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not installed". 
In-Reply-To: <1229003616.324573.1299515758308.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> References: <1229003616.324573.1299515758308.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: <4D753155.1090809@gmail.com> Thanks for offering me something to try Bob, but it still doesn't seem to work. Here is the exact output of crm_mon and "crm configure show": ============ Last updated: Mon Mar 7 13:20:47 2011 Stack: openais Current DC: eb2024-58.cs.siue.edu - partition with quorum Version: 1.1.4-ac608e3491c7dfc3b3e3c36d966ae9b016f77065 2 Nodes configured, 2 expected votes 4 Resources configured. ============ Online: [ eb2024-58.cs.siue.edu eb2024-59.cs.siue.edu ] ClusterIP (ocf::heartbeat:IPaddr2): Started eb2024-58.cs.siue.edu WebSite (ocf::heartbeat:apache): Started eb2024-58.cs.siue.edu Failed actions: dlm:0_monitor_0 (node=eb2024-59.cs.siue.edu, call=4, rc=5, status=complete): not installed gfs-control:0_monitor_0 (node=eb2024-59.cs.siue.edu, call=5, rc=5, status=complete): not installed dlm:1_monitor_0 (node=eb2024-58.cs.siue.edu, call=4, rc=5, status=complete): not installed gfs-control:1_monitor_0 (node=eb2024-58.cs.siue.edu, call=5, rc=5, status=complete): not installed [root at eb2024-58 ~]# crm configure show node eb2024-58.cs.siue.edu node eb2024-59.cs.siue.edu primitive ClusterIP ocf:heartbeat:IPaddr2 \ params ip="146.163.150.57" cidr_netmask="32" \ op monitor interval="30s" primitive WebSite ocf:heartbeat:apache \ params configfile="/etc/httpd/conf/httpd.conf" \ op start interval="0" timeout="40s" \ op stop interval="0" timeout="60s" \ op monitor interval="1min" primitive dlm ocf:pacemaker:controld \ op start interval="0" timeout="90s" \ op stop interval="0" timeout="100s" \ op monitor interval="120s" primitive gfs-control ocf:pacemaker:controld \ params daemon="gfs_controld.pcmk" args="-g 0" \ op start interval="0" timeout="90s" \ op stop interval="0" timeout="100s" \ op monitor interval="120s" clone dlm-clone dlm \ meta interleave="true" clone gfs-clone gfs-control \ meta interleave="true" location prefer-node1 WebSite 50: eb2024-58.cs.siue.edu colocation gfs-with-dlm inf: gfs-clone dlm-clone colocation website-with-ip inf: WebSite ClusterIP order apache-after-ip inf: ClusterIP WebSite order start-gfs-after-dlm inf: dlm-clone gfs-clone property $id="cib-bootstrap-options" \ dc-version="1.1.4-ac608e3491c7dfc3b3e3c36d966ae9b016f77065" \ cluster-infrastructure="openais" \ expected-quorum-votes="2" \ stonith-enabled="false" \ no-quorum-policy="ignore" Has anyone got this to work on Fedora 14? gb On 03/07/2011 10:35 AM, Bob Peterson wrote: > ----- Original Message ----- > | Hi All, > | > | I'm trying to follow the "clusters from scratch" guide and I'm > | running Fedora 14. > | > | When I try to add the DLM and GFS2 services, crm_mon keeps reporting > | "Failed actions: dlm:1_monitor_0/gfs-control:1_monitor_0 ... not > | installed". > | > | Does anyone know what I'm missing? > | > | Thanks, > | gb > > Hm, it sounds like you don't have the debugfs mounted > and some piece of software (likely crm_mon) is expecting it. 
> Try adding something like this to /etc/fstab: > > debugfs /sys/kernel/debug debugfs defaults 0 0 > > and doing mount -a > > Regards, > > Bob Peterson > Red Hat File Systems > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From pradhanparas at gmail.com Mon Mar 7 20:33:24 2011 From: pradhanparas at gmail.com (Paras pradhan) Date: Mon, 7 Mar 2011 14:33:24 -0600 Subject: [Linux-cluster] GFS2 write In-Reply-To: <1152011131.327272.1299523414066.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> References: <1152011131.327272.1299523414066.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: Thanks Bob. Please let me know if you need any other info. Paras. On Mon, Mar 7, 2011 at 12:43 PM, Bob Peterson wrote: > ----- Original Message ----- > | Hi. > | > | I was trying to copy a 400 GB file to a gfs2 share. It was copying at > | 50MB/s approx. Suddenly after copying 80% ,the rate dropped to 30KB/s > | and stayed like that. I tried to kill the process but could't (which > | is normal) and after few minutes it was killed. Then I tried it again > | after few minutes and it was successfully copied at 50MB/s. But then > | after it looks like accessing the GFS share (even ls -l /gfsmount) > | takes 10-15 seconds to complete. Then I rebooted this node and > | everything is back normal. > | > | I am really confuseTd what has gone wrong. GFS is running with all > | default parameters . > | > | Thanks! > | Paras. > > Hi Paras, > > I think I've recreated the problem and I'm investigating it now. > I hope to have an answer soon (maybe today). ?Looks like a bug to > me, and so I'll see if I can generate a patch to fix it. ?That > may take a few days. > > Regards, > > Bob Peterson > Red Hat File Systems > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From lhh at redhat.com Mon Mar 7 21:36:01 2011 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 7 Mar 2011 16:36:01 -0500 Subject: [Linux-cluster] Nodes are not joining to the cluster In-Reply-To: <90386.57864.qm@web112805.mail.gq1.yahoo.com> References: <20110304171822.GF14803@redhat.com> <90386.57864.qm@web112805.mail.gq1.yahoo.com> Message-ID: <20110307213601.GH17423@redhat.com> On Sat, Mar 05, 2011 at 02:53:51PM -0800, Srija wrote: > Hi, > > Just a query , I am not very much clear so asking, > > > I'd really recommend against even trying to start qdiskd / > > use a quorum > > disk in a 16 node cluster. > > Did you ask not to use qdiskd /quorum disk in the 16 nodes cluster ? Right - qdiskd was designed for 2- and 4-node clusters to expand the failure tolerances a little bit. It will *work* in a 16 node cluster, but is unlikely to provide any practical benefit. -- Lon Hohberger - Red Hat, Inc. 
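To put some numbers on that: with 16 nodes each holding one vote, quorum is already 9 of 16, so a one-vote quorum disk barely moves the math. In the two-node case qdiskd was built for, the extra vote is what lets a lone surviving node keep quorum (2 of 3). A minimal sketch of that two-node setup, with made-up label, heuristic and addresses, might look roughly like this in cluster.conf:

    <cman expected_votes="3"/>
    <quorumd interval="1" tko="10" votes="1" label="qdisk1">
        <heuristic program="ping -c1 -w1 192.168.1.254" score="1" interval="2" tko="3"/>
    </quorumd>

(Attribute names are the ones described in qdisk(5); check the man page on your release before copying anything.)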
From lhh at redhat.com Mon Mar 7 21:42:12 2011 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 7 Mar 2011 16:42:12 -0500 Subject: [Linux-cluster] Info on vm definitions and options in stable3 In-Reply-To: References: Message-ID: <20110307214212.GI17423@redhat.com> On Sat, Mar 05, 2011 at 03:03:25PM +0100, Gianluca Cecchi wrote: > On Fri, 4 Mar 2011 13:01:20 -0500 Lon Hohberger wrote: > > http://sources.redhat.com/cluster/wiki/ServiceOperationalBehaviors > > http://sources.redhat.com/cluster/wiki/ServicePolicies > > http://sources.redhat.com/cluster/wiki/FailoverDomains > > Thanks for the links > Some comments: > 1) http://sources.redhat.com/cluster/wiki/ServicePolicies > probably to correct near the end from > The above service tolerance is 3 restarts in 10 minutes. > to > The above service tolerance is 3 restarts in 5 minutes. Fixed. > 3) http://sources.redhat.com/cluster/wiki/FailoverDomains > It could be useful to add at the top a comment such as the one in 1) > (Note: These policies also apply to virtual machine resources.) > for example something like: > Note: Failover Domains concepts also apply to virtual machine resources. Done. > 2) http://sources.redhat.com/cluster/wiki/ServiceOperationalBehaviors > Here the application to virtual resources is implicit due to the > various references inside the page itself Added a note anyway. -- Lon Hohberger - Red Hat, Inc. From lhh at redhat.com Mon Mar 7 21:49:19 2011 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 7 Mar 2011 16:49:19 -0500 Subject: [Linux-cluster] rg_test for testing other resource agent functions? In-Reply-To: <20110304194923.GX934@mip.aaaaa.org> References: <20110304194923.GX934@mip.aaaaa.org> Message-ID: <20110307214919.GJ17423@redhat.com> On Fri, Mar 04, 2011 at 02:49:23PM -0500, Ofer Inbar wrote: > > For example, I want to add a "verify" procedure to my resource agent, > that I'd like to kick off from a monitoring script on my own schedule, > but I want to make sure that it is run in the same context as the > resource agent's status check is normally run. I could write some > separate cluster.conf parser that simulates what I think rgmanager > would do, but I might get it wrong. Or rgmanager might change in a > future version and I wouldn't track the change. rg_test exposes the operations rgmanager performs. rgmanager doesn't actually call 'validate-all' - it expects RAs to do this, or at least report when parameters are invalid if start/status/stop operations are called. > Is there anything like rg_test that might let me do this, or has > anyone patched rg_test to allow it? Something as simple as: > sudo rg_test test /etc/cluster/cluster.conf [foo] service [servicename] rgmanager does implicit start/status/stop ordering based on service tree structures, which is why those are the only operations that are currently done. > .. where it would simply call the resource agent the same way as it > does for status/start/stop, but substitute whatever command line > argument I give it. You could just do: OCF_RESKEY_x=y OCF_RESKEY_a=b /path/to/agent.sh > Or do I have to reverse-engineer my own cluster.conf parsing to set up > the environment and run the script(s) myself (duplicating what rg_test > already does for status/start/stop) ? Pacemaker has ocf-tester as well; maybe that would be useful? I have a tool that will flatten a cluster.conf for you, resolving rgmanager's entire resource tree structure and flattening the result. -- Lon Hohberger - Red Hat, Inc. 
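Tying those two suggestions together, a monitoring job could either drive the agent directly with the OCF environment set by hand, or lean on rg_test for the operations it does expose. A rough sketch, where the agent path, resource parameters and service name are all hypothetical:

    # call the agent directly, passing parameters the OCF way
    OCF_RESKEY_name=myapp OCF_RESKEY_config_file=/etc/myapp.conf \
        /usr/share/cluster/myapp.sh verify

    # or exercise status exactly as rgmanager would, via rg_test
    rg_test test /etc/cluster/cluster.conf status service myservice

The first form runs whatever extra actions the agent implements (here a custom "verify"), but it is only as faithful as the parameters you set by hand; the second parses cluster.conf for you, but is limited to start/status/stop as Lon notes.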
From lhh at redhat.com Mon Mar 7 21:52:00 2011 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 7 Mar 2011 16:52:00 -0500 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed In-Reply-To: References: Message-ID: <20110307215200.GK17423@redhat.com> On Sat, Mar 05, 2011 at 03:30:47PM +0100, Gianluca Cecchi wrote: > It seems no more information in the file.... > Any hints on further debugging? Check /var/log/audit/audit.log for an AVC denial around self:capability setpcap for xm_t? -- Lon Hohberger - Red Hat, Inc. From lhh at redhat.com Mon Mar 7 21:55:38 2011 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 7 Mar 2011 16:55:38 -0500 Subject: [Linux-cluster] rgmanager not running In-Reply-To: <38415.59.90.241.47.1299486821.squirrel@59.90.241.47> References: <38415.59.90.241.47.1299486821.squirrel@59.90.241.47> Message-ID: <20110307215538.GL17423@redhat.com> On Mon, Mar 07, 2011 at 02:03:41PM +0530, Balaji Sundar wrote: > > Found some error messages in "/var/log/messages" file > Mar 7 14:39:42 primary corosync[7155]: [CMAN ] quorum regained, > resuming activity How much time between: [DATE] corosync[7155]: [MAIN ] Corosync Cluster Engine ('1.2.3'): started and ready to provide service. and the above message? -- Lon Hohberger - Red Hat, Inc. From gianluca.cecchi at gmail.com Mon Mar 7 22:10:08 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Mon, 7 Mar 2011 23:10:08 +0100 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed Message-ID: On Mon, 7 Mar 2011 16:52:00 -0500 Lon Hohberger wrote: > Check /var/log/audit/audit.log for an AVC denial around self:capability > setpcap for xm_t? Uhm, SElinux is disabled on both nodes (I'll cross check tomorrow anyway) and auditd is chkconfig off too (even if I notice in rh el 6 many audit messages related to cron writing in /var/log/messages...) Could it be of any help an "strace -f" of the virsh command where I can see the ssh and netcat forked calls but am not able to identify the point where eventually there is something strange? Gianluca From rpeterso at redhat.com Mon Mar 7 22:14:55 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Mon, 7 Mar 2011 17:14:55 -0500 (EST) Subject: [Linux-cluster] GFS2 write In-Reply-To: Message-ID: <1326901784.331890.1299536095096.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | Thanks Bob. Please let me know if you need any other info. | | Paras. | > Hi Paras, | > | > I think I've recreated the problem and I'm investigating it now. | > I hope to have an answer soon (maybe today). Looks like a bug to | > me, and so I'll see if I can generate a patch to fix it. That | > may take a few days. | > | > Regards, | > | > Bob Peterson | > Red Hat File Systems Hi Paras, Your block allocation problem will probably be fixed by this upstream patch to GFS2: http://git.kernel.org/?p=linux/kernel/git/steve/gfs2-2.6-nmw.git;a=commitdiff;h=9cabcdbd4638cf884839ee4cd15780800c223b90 I tracked it down. I ported the patch to RHEL5 and now it doesn't happen. Unfortunately, my ported patch needs cleaning: I've got a bunch of instrumentation for other reasons in there. 
Regards, Bob Peterson Red Hat File Systems From pradhanparas at gmail.com Mon Mar 7 22:21:31 2011 From: pradhanparas at gmail.com (Paras pradhan) Date: Mon, 7 Mar 2011 16:21:31 -0600 Subject: [Linux-cluster] GFS2 write In-Reply-To: <1326901784.331890.1299536095096.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> References: <1326901784.331890.1299536095096.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: We are running Redhat 5. Do you think this patch has already been applied to the GFS that redhat ships? Paras. On Mon, Mar 7, 2011 at 4:14 PM, Bob Peterson wrote: > ----- Original Message ----- > | Thanks Bob. Please let me know if you need any other info. > | > | Paras. > | > Hi Paras, > | > > | > I think I've recreated the problem and I'm investigating it now. > | > I hope to have an answer soon (maybe today). Looks like a bug to > | > me, and so I'll see if I can generate a patch to fix it. That > | > may take a few days. > | > > | > Regards, > | > > | > Bob Peterson > | > Red Hat File Systems > > Hi Paras, > > Your block allocation problem will probably be fixed by this upstream > patch to GFS2: > > http://git.kernel.org/?p=linux/kernel/git/steve/gfs2-2.6-nmw.git;a=commitdiff;h=9cabcdbd4638cf884839ee4cd15780800c223b90 > > I tracked it down. ?I ported the patch to RHEL5 and now it doesn't > happen. ?Unfortunately, my ported ?patch needs cleaning: I've got > a bunch of instrumentation for other reasons in there. > > Regards, > > Bob Peterson > Red Hat File Systems > From rpeterso at redhat.com Mon Mar 7 22:42:22 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Mon, 7 Mar 2011 17:42:22 -0500 (EST) Subject: [Linux-cluster] GFS2 write In-Reply-To: Message-ID: <805163997.332464.1299537742195.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | We are running Redhat 5. Do you think this patch has already been | applied to the GFS that redhat ships? | | Paras. Hi Paras, If this is RHEL5, you should contact Red Hat support and open a ticket. After all, you're paying for support, so why not use it? That patch is in the upstream (kernel.org) kernel, not in RHEL5. I ported the patch to RHEL5 for testing purposes and I'm planning to put a test version on my people page for some of our customers to try out. I don't know what kernel you're running, but I can do the same for your kernel. If you open a support ticket, ask them to attach your case to bugzilla bug 681261, which is likely private because it contains confidential customer information. Regards, Bob Peterson Red Hat File Systems From pradhanparas at gmail.com Mon Mar 7 22:49:00 2011 From: pradhanparas at gmail.com (Paras pradhan) Date: Mon, 7 Mar 2011 16:49:00 -0600 Subject: [Linux-cluster] GFS2 write In-Reply-To: <805163997.332464.1299537742195.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> References: <805163997.332464.1299537742195.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: Thanks Bob. Will do. I don't know how quickly my issue gets resolved , I am using kernel 2.6.18-238.1.1.el5xen Thanks Paras. On Mon, Mar 7, 2011 at 4:42 PM, Bob Peterson wrote: > ----- Original Message ----- > | We are running Redhat 5. Do you think this patch has already been > | applied to the GFS that redhat ships? > | > | Paras. > > Hi Paras, > > If this is RHEL5, you should contact Red Hat support and open > a ticket. ?After all, you're paying for support, so why not use it? > That patch is in the upstream (kernel.org) kernel, not in RHEL5. 
> I ported the patch to RHEL5 for testing purposes and I'm > planning to put a test version on my people page for some of > our customers to try out. ?I don't know what kernel you're > running, but I can do the same for your kernel. > If you open a support ticket, ask them to attach your case > to bugzilla bug 681261, which is likely private because it contains > confidential customer information. > > Regards, > > Bob Peterson > Red Hat File Systems > From scooter at cgl.ucsf.edu Tue Mar 8 01:01:02 2011 From: scooter at cgl.ucsf.edu (Scooter Morris) Date: Mon, 07 Mar 2011 17:01:02 -0800 Subject: [Linux-cluster] GFS2 write In-Reply-To: <805163997.332464.1299537742195.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> References: <805163997.332464.1299537742195.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: <4D757FCE.7070305@cgl.ucsf.edu> Hi Bob, I think we're seeing this also, but bugzilla #681261 is currently private. Could we open that up (or add us to the cc list)? If it does look like our problem, I'll open up a ticket. Thanks! -- scooter On 03/07/2011 02:42 PM, Bob Peterson wrote: > ----- Original Message ----- > | We are running Redhat 5. Do you think this patch has already been > | applied to the GFS that redhat ships? > | > | Paras. > > Hi Paras, > > If this is RHEL5, you should contact Red Hat support and open > a ticket. After all, you're paying for support, so why not use it? > That patch is in the upstream (kernel.org) kernel, not in RHEL5. > I ported the patch to RHEL5 for testing purposes and I'm > planning to put a test version on my people page for some of > our customers to try out. I don't know what kernel you're > running, but I can do the same for your kernel. > If you open a support ticket, ask them to attach your case > to bugzilla bug 681261, which is likely private because it contains > confidential customer information. > > Regards, > > Bob Peterson > Red Hat File Systems > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From fdinitto at redhat.com Tue Mar 8 08:11:04 2011 From: fdinitto at redhat.com (Fabio M. Di Nitto) Date: Tue, 08 Mar 2011 09:11:04 +0100 Subject: [Linux-cluster] cluster 3.1.1 stable release Message-ID: <4D75E498.9000206@redhat.com> Welcome to the cluster 3.1.1 release. This release contains dozens of bug fixes and improvements, including dbus notifications of cluster events, that in conjunction with project Foghorn, they can be translated into SNMP events. The new source tarball can be downloaded here: https://fedorahosted.org/releases/c/l/cluster/cluster-3.1.1.tar.xz ChangeLog: https://fedorahosted.org/releases/c/l/cluster/Changelog-3.1.1 To report bugs or issues: https://bugzilla.redhat.com/ Would you like to meet the cluster team or members of its community? Join us on IRC (irc.freenode.net #linux-cluster) and share your experience with other sysadministrators or power users. Thanks/congratulations to all people that contributed to achieve this great milestone. Happy clustering, Fabio From krishnanand.linux at gmail.com Tue Mar 8 08:20:28 2011 From: krishnanand.linux at gmail.com (krishnanand gouri) Date: Tue, 8 Mar 2011 13:50:28 +0530 Subject: [Linux-cluster] samba-cluster Issue Message-ID: Hi, I have configured 2-Node cluster. Every thing is working fine even the fail over cases also workign fine but i am facing a issue when ever I stop CTDB service in server 1, the user are not able to acces samba share at all. 
even after the CTDB IP is switched over. But where as if at all i stop CTDB service in server2 the CTDB IP will switch over to other server and the users are able to access the samba share normally. Why is it so happening only for server 1. Public IP's - 192.168.129.10 / 192.168.129.10 Heart Beat Ip's : 10.0.0.10 / 10.0.0.20 CTDB IP's - 192.168.129.14 / 192.168.129.15 Please help in solving this issue.... Thanks & Regards Krishnanand G -------------- next part -------------- An HTML attachment was scrubbed... URL: From rpeterso at redhat.com Tue Mar 8 14:51:20 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Tue, 8 Mar 2011 09:51:20 -0500 (EST) Subject: [Linux-cluster] GFS2 write In-Reply-To: <4D757FCE.7070305@cgl.ucsf.edu> Message-ID: <192026736.348110.1299595880572.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | Hi Bob, | I think we're seeing this also, but bugzilla #681261 is currently | private. Could we open that up (or add us to the cc list)? If it does | look like our problem, I'll open up a ticket. | | Thanks! | | -- scooter Hi Scooter, It's almost impossible to determine if you're experiencing a particular problem without doing an in-depth analysis of your data. It's probably best if you open a support ticket and collect the info they request so we (as a team) can analyze it. Regards, Bob Peterson Red Hat File Systems From szekelyi at niif.hu Tue Mar 8 16:35:41 2011 From: szekelyi at niif.hu (=?iso-8859-1?q?Sz=E9kelyi_Szabolcs?=) Date: Tue, 8 Mar 2011 17:35:41 +0100 Subject: [Linux-cluster] wait states? Message-ID: <201103081735.41537.szekelyi@niif.hu> Hi all, I've setup a simple two node cluster, and did some testing on it, but I'm having problems interpreting the results. Unfortunately I couldn't find any documentation answering my questions, so I'll post them here. I don't want to do any complicated stuff, just run CLVM properly to serve logical volumes as iSCSI targets. The iSCSI target software should run on both nodes, independent of the cluster stack. The cluster is needed only because of CLVM. I tried to avoid using fencing as much as possible since I don't really see the need for it. My cluster.conf looks like this: To test things, I broke the connection between the nodes for a while, and then restored it. I expected the cluster to return to normal state, but it didn't. The main difference between the nodes is the "wait state", about what I could hardly find any documentation. On one node it's "messages", on the other it's "quorum". Could you explain what these mean and how to return the cluster into normal state? Thanks, -- cc From vmutu at pcbi.upenn.edu Tue Mar 8 17:11:53 2011 From: vmutu at pcbi.upenn.edu (Valeriu Mutu) Date: Tue, 8 Mar 2011 12:11:53 -0500 Subject: [Linux-cluster] clvmd hangs on startup In-Reply-To: <20110303165056.GF10674@bsdera.pcbi.upenn.edu> References: <20110302215050.GD10674@bsdera.pcbi.upenn.edu> <64D0546C5EBBD147B75DE133D798665F0855C290@hugo.eprize.local> <20110303165056.GF10674@bsdera.pcbi.upenn.edu> Message-ID: <20110308171153.GB272@bsdera.pcbi.upenn.edu> Hi, I think the problem is solved. I was using a 9000bytes MTU on the Xen virtual machines' iSCSI interface. Switching back to 1500bytes MTU caused the clvmd to start working. On Thu, Mar 03, 2011 at 11:50:57AM -0500, Valeriu Mutu wrote: > On Wed, Mar 02, 2011 at 05:36:45PM -0500, Jeff Sturm wrote: > > Double-check that the 2nd node can read and write the shared iSCSI > > storage. 
> > Reading/writing from/to the iSCSI storage device works as seen below. > > On the 1st node: > [root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes > 10000+0 records in > 10000+0 records out > 10240000 bytes (10 MB) copied, 3.39855 seconds, 3.0 MB/s > > [root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null > 10000+0 records in > 10000+0 records out > 10240000 bytes (10 MB) copied, 0.331069 seconds, 30.9 MB/s > > On the 2nd node: > [root at vm2 ~]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes > 10000+0 records in > 10000+0 records out > 10240000 bytes (10 MB) copied, 3.2465 seconds, 3.2 MB/s > > [root at vm2 ~]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null > 10000+0 records in > 10000+0 records out > 10240000 bytes (10 MB) copied, 0.223337 seconds, 45.8 MB/s -- Valeriu Mutu From jeff.sturm at eprize.com Tue Mar 8 19:02:35 2011 From: jeff.sturm at eprize.com (Jeff Sturm) Date: Tue, 8 Mar 2011 14:02:35 -0500 Subject: [Linux-cluster] clvmd hangs on startup In-Reply-To: <20110308171153.GB272@bsdera.pcbi.upenn.edu> References: <20110302215050.GD10674@bsdera.pcbi.upenn.edu><64D0546C5EBBD147B75DE133D798665F0855C290@hugo.eprize.local><20110303165056.GF10674@bsdera.pcbi.upenn.edu> <20110308171153.GB272@bsdera.pcbi.upenn.edu> Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C339@hugo.eprize.local> > -----Original Message----- > From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] > On Behalf Of Valeriu Mutu > Sent: Tuesday, March 08, 2011 12:12 PM > > I think the problem is solved. I was using a 9000bytes MTU on the Xen virtual > machines' iSCSI interface. Switching back to 1500bytes MTU caused the clvmd to start > working. That'll do it. Jumbo frames with Xen are a little tricky, so it's easiest to stick with MTU 1500, as you have done. (If you really want jumbo frames, change the MTU on the dom0 vif interfaces to match, and if you use bridging, ditto for the bridge device and real interfaces.) I suspect if your "dd" test had used a block size like 4096, rather than 1024, it would have similarly failed. -Jeff From gregory.lee.bartholomew at gmail.com Tue Mar 8 19:53:47 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Tue, 08 Mar 2011 13:53:47 -0600 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes Message-ID: <4D76894B.6010809@gmail.com> Hi Fabio M. Di Nitto, FYI, I was just trying to set up gfs2 under pacemaker on Fedora 14 X86_64 and although yum provides '*/gfs_controld.pcmk' showed that I needed the dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 packages, yum install dlm-pcmk gfs-pcmk would simply report "Nothing to do". rpm -q showed that I didn't have the packages installed. I tried installing the cman package but that didn't help. I finally got it working by downloading the packages with wget and installing them with rpm -ivh. FYI, the dlm-pcmk and gfs-pcmk packages seem to be broken in the Fedora 14 x86_64 database at the moment. gb From fdinitto at redhat.com Tue Mar 8 19:55:37 2011 From: fdinitto at redhat.com (Fabio M. Di Nitto) Date: Tue, 08 Mar 2011 20:55:37 +0100 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D76894B.6010809@gmail.com> References: <4D76894B.6010809@gmail.com> Message-ID: <4D7689B9.7070906@redhat.com> On 03/08/2011 08:53 PM, Gregory Bartholomew wrote: > Hi Fabio M. 
Di Nitto, > > FYI, I was just trying to set up gfs2 under pacemaker on Fedora 14 > X86_64 and although yum provides '*/gfs_controld.pcmk' showed that I > needed the dlm-pcmk-3.0.17-1.fc14.x86_64 and > gfs-pcmk-3.0.17-1.fc14.x86_64 packages, yum install dlm-pcmk gfs-pcmk > would simply report "Nothing to do". rpm -q showed that I didn't have > the packages installed. I tried installing the cman package but that > didn't help. I finally got it working by downloading the packages with > wget and installing them with rpm -ivh. > > FYI, the dlm-pcmk and gfs-pcmk packages seem to be broken in the Fedora > 14 x86_64 database at the moment. No, those packages have been removed intentionally since pacemaker now supports cman cluster manager and they become obsoleted. So very short summary: configure cman for clusternodes start cman (including dlm/gfs controld) tell pacemaker to use cman configure fencing and all services. Fabio From lhh at redhat.com Tue Mar 8 22:17:45 2011 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 8 Mar 2011 17:17:45 -0500 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed In-Reply-To: References: Message-ID: <20110308221744.GA6659@redhat.com> On Mon, Mar 07, 2011 at 11:10:08PM +0100, Gianluca Cecchi wrote: > On Mon, 7 Mar 2011 16:52:00 -0500 Lon Hohberger wrote: > > > Check /var/log/audit/audit.log for an AVC denial around self:capability > > setpcap for xm_t? > > Uhm, > SElinux is disabled on both nodes (I'll cross check tomorrow anyway) > and auditd is chkconfig off too (even if I notice in rh el 6 many > audit messages related to cron writing in /var/log/messages...) > Could it be of any help an "strace -f" of the virsh command where I > can see the ssh and netcat forked calls but am not able to identify > the point where eventually there is something strange? > Nothing comes to mind; in my RHEL6 development cluster, I have a custom SELinux policy: #==== cut module clusterlocal 1.0; require { type xm_t; type debugfs_t; type fenced_t; type mount_t; type telnetd_port_t; class capability setpcap; class tcp_socket name_connect; class dir mounton; } allow fenced_t telnetd_port_t:tcp_socket name_connect; allow mount_t debugfs_t:dir mounton; allow xm_t self:capability setpcap; #=== end cut And the following firewall rules: -A INPUT -p tcp -m state --state NEW -m multiport --dports 21064 -j ACCEPT -A INPUT -p tcp -m state --state NEW -m multiport --dports 11111 -j ACCEPT -A INPUT -p udp -m state --state NEW -m multiport --dports 5404,5405 -j ACCEPT I'm using bridging (as documented in the RHEL6 documentation) and everything pretty much just works. Are you seeing any other notable behaviors, besides the migration failing? -- Lon Hohberger - Red Hat, Inc. From Sunil_Gupta2 at Dell.com Wed Mar 9 07:02:51 2011 From: Sunil_Gupta2 at Dell.com (Sunil_Gupta2 at Dell.com) Date: Tue, 8 Mar 2011 23:02:51 -0800 Subject: [Linux-cluster] rgmanager not running In-Reply-To: <38415.59.90.241.47.1299486821.squirrel@59.90.241.47> References: <38415.59.90.241.47.1299486821.squirrel@59.90.241.47> Message-ID: <8EF1FE59C3C8694E94F558EB27E464B71D130C73DD@BLRX7MCDC201.AMER.DELL.COM> The rgmanager service is not necessary if the cluster has no resources to manage....further more info on cluster status is needed like #clustat If it says all the nodes are online then more debug logs will be needed to find out the problem. 
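For the "more debug logs" part, a reasonable first pass (commands from the cman/fence tool set; the logging syntax is the cluster.conf(5) one for cluster 3, so double-check it on the installed release) would be:

    clustat                 # are both nodes shown Online?
    cman_tool nodes         # membership as cman/corosync sees it
    fence_tool ls           # has this node actually joined the fence domain?

The "dlm_join_lockspace no fence domain" messages in the original post suggest fenced never joined a fence domain, which would also explain why the rgmanager lockspace join fails. To get more verbose daemon logs, something like the following in cluster.conf should raise the log level:

    <logging debug="on">
        <logging_daemon name="rgmanager" debug="on"/>
        <logging_daemon name="fenced" debug="on"/>
    </logging>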
--Sunil -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Balaji Sundar Sent: Monday, March 07, 2011 2:04 PM To: linux-cluster at redhat.com Subject: [Linux-cluster] rgmanager not running Dear All, I have using RHEL6 Linux and Kernel Version is 2.6.32-71.el6.i686 I have configured Cluster Suite with 2 servers Server 1 : 192.168.13.131 IP Address and hostname is primary Server 2 : 192.168.13.132 IP Address and hostname is secondary Floating : 192.168.13.133 IP Address (Assumed by currently active server) I have verified that service cman is running and cluster.conf is valid using ccs_config_validate command Finally i found that rgmanager is not running and services are not started [root at primary cluster]# service rgmanager status rgmanager dead but pid file exists [root at primary cluster]# [root at primary cluster]# cman_tool services [root at primary cluster]# [root at primary cluster]# cman_tool status Version: 6.2.0 Config Version: 1 Cluster Name: EMSCluster Cluster Id: 808 Cluster Member: Yes Cluster Generation: 96 Membership state: Cluster-Member Nodes: 1 Expected votes: 1 Total votes: 1 Node votes: 1 Quorum: 1 Active subsystems: 7 Flags: 2node Ports Bound: 0 Node name: primary Node ID: 1 Multicast addresses: 239.192.3.43 Node addresses: 192.168.13.131 [root at primary cluster]# Found some error messages in "/var/log/messages" file Mar 7 14:39:42 primary corosync[7155]: [CMAN ] quorum regained, resuming activity Mar 7 14:39:42 primary corosync[7155]: [QUORUM] This node is within the primary component and will provide service. Mar 7 14:39:42 primary corosync[7155]: [QUORUM] Members[1]: 1 Mar 7 14:39:42 primary corosync[7155]: [QUORUM] Members[1]: 1 Mar 7 14:39:42 primary corosync[7155]: [CPG ] downlist received left_list: 0 Mar 7 14:39:42 primary corosync[7155]: [CPG ] chosen downlist from node r(0) ip(192.168.13.131) Mar 7 14:39:42 primary corosync[7155]: [MAIN ] Completed service synchronization, ready to provide service. 
Mar 7 14:39:44 primary fenced[7210]: fenced 3.0.12 started Mar 7 14:39:45 primary dlm_controld[7224]: dlm_controld 3.0.12 started Mar 7 14:39:45 primary gfs_controld[7254]: gfs_controld 3.0.12 started Mar 7 14:39:45 primary kernel: dlm: Using TCP for communications Mar 7 14:39:45 primary dlm_controld[7224]: dlm_join_lockspace no fence domain Mar 7 14:39:45 primary dlm_controld[7224]: process_uevent online@ error -1 errno 2 Mar 7 14:39:45 primary kernel: dlm: rgmanager: group join failed -1 -1 Found some error messages in "/var/log/cluster/dlm_controld.log" file Mar 07 14:39:45 dlm_controld dlm_controld 3.0.12 started Mar 07 14:39:45 dlm_controld dlm_join_lockspace no fence domain Mar 07 14:39:45 dlm_controld process_uevent online@ error -1 errno 2 I don't know what is the problem and Can some one throw light on this peculiar problem Thanks in Advance --Regards S.Balaji -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster From gianluca.cecchi at gmail.com Wed Mar 9 08:47:09 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Wed, 9 Mar 2011 09:47:09 +0100 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed In-Reply-To: References: Message-ID: On Mon, Mar 7, 2011 at 11:10 PM, Gianluca Cecchi wrote: > Nothing comes to mind; in my RHEL6 development cluster, I have a > custom SELinux policy: I confirm that SElinux is disabled and [root at rhev1 ~]# chkconfig --list | grep audit auditd 0:off 1:off 2:off 3:off 4:off 5:off 6:off [root at rhev1 ~]# service auditd status auditd is stopped [root at rhev2 ~]# chkconfig --list | grep audit auditd 0:off 1:off 2:off 3:off 4:off 5:off 6:off [root at rhev2 ~]# service auditd status auditd is stopped No other problems apparently, apart the bug on bond+vlan+bridge: https://bugzilla.redhat.com/show_bug.cgi?id=623199 For which I have also a case open.. Cluster is sound and some test services worked ok. Strange thing is that at some point during my test I was able to live migrate this machine itself... apparently I did something that broke or one of my latest updates created a problem... Or something related with firewall perhaps. Can I stop firewall at all and have libvirtd working at the same time to test ...? I know libvirtd puts some iptables rules itself.. Gianluca From andrew at beekhof.net Wed Mar 9 08:48:03 2011 From: andrew at beekhof.net (Andrew Beekhof) Date: Wed, 9 Mar 2011 09:48:03 +0100 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D7689B9.7070906@redhat.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> Message-ID: On Tue, Mar 8, 2011 at 8:55 PM, Fabio M. Di Nitto wrote: > On 03/08/2011 08:53 PM, Gregory Bartholomew wrote: >> Hi Fabio M. Di Nitto, >> >> FYI, I was just trying to set up gfs2 under pacemaker on Fedora 14 >> X86_64 and although yum provides '*/gfs_controld.pcmk' showed that I >> needed the dlm-pcmk-3.0.17-1.fc14.x86_64 and >> gfs-pcmk-3.0.17-1.fc14.x86_64 packages, yum install dlm-pcmk gfs-pcmk >> would simply report "Nothing to do". ?rpm -q showed that I didn't have >> the packages installed. ?I tried installing the cman package but that >> didn't help. ?I finally got it working by downloading the packages with >> wget and installing them with rpm -ivh. >> >> FYI, the dlm-pcmk and gfs-pcmk packages seem to be broken in the Fedora >> 14 x86_64 database at the moment. 
> > No, those packages have been removed intentionally since pacemaker now > supports cman cluster manager and they become obsoleted. > > So very short summary: > > configure cman for clusternodes > start cman (including dlm/gfs controld) > tell pacemaker to use cman > configure fencing and all services. A week or so ago I added a big warning to the bottom of: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch/ch08s02.html and an appendix for configuring cman+pacemaker. Hopefully it will be of some help. From gianluca.cecchi at gmail.com Wed Mar 9 10:32:39 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Wed, 9 Mar 2011 11:32:39 +0100 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed In-Reply-To: References: Message-ID: Here is the output of the command strace -f virsh migrate --live exorapr1 qemu+ssh://intrarhev1/system Note that if I run the same with rhev1 (main host name and not intracluster) instead of intrarhev1, I'm asked for the ssh password (ok because I set ssh equivalence only for intracluster) but at the end I get the same error: operation failed: Migration unexpectedly failed Gianluca From gianluca.cecchi at gmail.com Wed Mar 9 10:33:18 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Wed, 9 Mar 2011 11:33:18 +0100 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed In-Reply-To: References: Message-ID: On Wed, Mar 9, 2011 at 11:32 AM, Gianluca Cecchi wrote: > Here is the output of the command > > strace -f virsh migrate --live exorapr1 qemu+ssh://intrarhev1/system > > Note that if I run the same with rhev1 (main host name and not > intracluster) instead of intrarhev1, I'm asked for the ssh password > (ok because I set ssh equivalence only for intracluster) but at the > end I get the same error: > operation failed: Migration unexpectedly failed > > Gianluca > I forgot the attachment... ;-( It is in zip format -------------- next part -------------- A non-text attachment was scrubbed... Name: strace.zip Type: application/zip Size: 20072 bytes Desc: not available URL: From balajisundar at midascomm.com Wed Mar 9 11:24:18 2011 From: balajisundar at midascomm.com (Balaji) Date: Wed, 09 Mar 2011 16:54:18 +0530 Subject: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 13 In-Reply-To: References: Message-ID: <4D776362.4020203@midascomm.com> Dear All, Please find attached log file for more analysis Please help me to solve this problem ASAP. * Clustat Command Output is below * [root at corviewprimary ~]# clustat Cluster Status for EMSCluster @ Wed Mar 9 17:00:03 2011 Member Status: Quorate Member Name ID Status ----------- ------- ---- ------ corviewprimary 1 Online, Local corviewsecondary 2 Offline [root at corviewprimary ~]# Regards, -S.Balaji linux-cluster-request at redhat.com wrote: >Send Linux-cluster mailaddr:115.249.107.179ing list submissions to > linux-cluster at redhat.com > >To subscribe or unsubscribe via the World Wide Web, visit > https://www.redhat.com/mailman/listinfo/linux-cluster >or, via email, send a message with subject or body 'help' to > linux-cluster-request at redhat.com > >You can reach the person managing the list at > linux-cluster-owner at redhat.com > >When replying, please edit your Subject line so it is more specific >than "Re: Contents of Linux-cluster digest..." > > >Today's Topics: > > 1. Re: clvmd hangs on startup (Valeriu Mutu) > 2. Re: clvmd hangs on startup (Jeff Sturm) > 3. 
dlm-pcmk-3.0.17-1.fc14.x86_64 and > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Gregory Bartholomew) > 4. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Fabio M. Di Nitto) > 5. Re: unable to live migrate a vm in rh el 6: Migration > unexpectedly failed (Lon Hohberger) > 6. Re: rgmanager not running (Sunil_Gupta2 at Dell.com) > 7. Re: unable to live migrate a vm in rh el 6: Migration > unexpectedly failed (Gianluca Cecchi) > 8. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Andrew Beekhof) > 9. Re: unable to live migrate a vm in rh el 6: Migration > unexpectedly failed (Gianluca Cecchi) > 10. Re: unable to live migrate a vm in rh el 6: Migration > unexpectedly failed (Gianluca Cecchi) > > >---------------------------------------------------------------------- > >Message: 1 >Date: Tue, 8 Mar 2011 12:11:53 -0500 >From: Valeriu Mutu >To: linux clustering >Subject: Re: [Linux-cluster] clvmd hangs on startup >Message-ID: <20110308171153.GB272 at bsdera.pcbi.upenn.edu> >Content-Type: text/plain; charset=us-ascii > >Hi, > >I think the problem is solved. I was using a 9000bytes MTU on the Xen virtual machines' iSCSI interface. Switching back to 1500bytes MTU caused the clvmd to start working. > >On Thu, Mar 03, 2011 at 11:50:57AM -0500, Valeriu Mutu wrote: > > >>On Wed, Mar 02, 2011 at 05:36:45PM -0500, Jeff Sturm wrote: >> >> >>>Double-check that the 2nd node can read and write the shared iSCSI >>>storage. >>> >>> >>Reading/writing from/to the iSCSI storage device works as seen below. >> >>On the 1st node: >>[root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes >>10000+0 records in >>10000+0 records out >>10240000 bytes (10 MB) copied, 3.39855 seconds, 3.0 MB/s >> >>[root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null >>10000+0 records in >>10000+0 records out >>10240000 bytes (10 MB) copied, 0.331069 seconds, 30.9 MB/s >> >>On the 2nd node: >>[root at vm2 ~]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes >>10000+0 records in >>10000+0 records out >>10240000 bytes (10 MB) copied, 3.2465 seconds, 3.2 MB/s >> >>[root at vm2 ~]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null >>10000+0 records in >>10000+0 records out >>10240000 bytes (10 MB) copied, 0.223337 seconds, 45.8 MB/s >> >> > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: corosync.log Type: text/x-log Size: 2424 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: dlm_controld.log Type: text/x-log Size: 190 bytes Desc: not available URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: messages URL: From Sunil_Gupta2 at Dell.com Wed Mar 9 12:14:17 2011 From: Sunil_Gupta2 at Dell.com (Sunil_Gupta2 at Dell.com) Date: Wed, 9 Mar 2011 17:44:17 +0530 Subject: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 13 In-Reply-To: <4D776362.4020203@midascomm.com> References: <4D776362.4020203@midascomm.com> Message-ID: <8EF1FE59C3C8694E94F558EB27E464B71D130C752D@BLRX7MCDC201.AMER.DELL.COM> One node is offline cluster is not formed....check if multicast traffic is working... 
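A concrete way to check the multicast path (omping ships as its own package on Fedora/RHEL; the node names below are the ones from the clustat output above) is to run it on both nodes at the same time:

    omping corviewprimary corviewsecondary

Each side should report both unicast and multicast responses from the other; multicast loss or silence usually points at IGMP snooping on the switch or at the host firewall. While at it, make sure the corosync ports are open on both nodes, along the lines of the rule posted earlier in this digest:

    iptables -I INPUT -p udp -m state --state NEW -m multiport --dports 5404,5405 -j ACCEPT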
--Sunil From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Balaji Sent: Wednesday, March 09, 2011 4:54 PM To: linux-cluster at redhat.com Subject: Re: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 13 Dear All, Please find attached log file for more analysis Please help me to solve this problem ASAP. Clustat Command Output is below [root at corviewprimary ~]# clustat Cluster Status for EMSCluster @ Wed Mar 9 17:00:03 2011 Member Status: Quorate Member Name ID Status ----------- ------- ---- ------ corviewprimary 1 Online, Local corviewsecondary 2 Offline [root at corviewprimary ~]# Regards, -S.Balaji linux-cluster-request at redhat.com wrote: Send Linux-cluster mailaddr:115.249.107.179ing list submissions to linux-cluster at redhat.com To subscribe or unsubscribe via the World Wide Web, visit https://www.redhat.com/mailman/listinfo/linux-cluster or, via email, send a message with subject or body 'help' to linux-cluster-request at redhat.com You can reach the person managing the list at linux-cluster-owner at redhat.com When replying, please edit your Subject line so it is more specific than "Re: Contents of Linux-cluster digest..." Today's Topics: 1. Re: clvmd hangs on startup (Valeriu Mutu) 2. Re: clvmd hangs on startup (Jeff Sturm) 3. dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Gregory Bartholomew) 4. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Fabio M. Di Nitto) 5. Re: unable to live migrate a vm in rh el 6: Migration unexpectedly failed (Lon Hohberger) 6. Re: rgmanager not running (Sunil_Gupta2 at Dell.com) 7. Re: unable to live migrate a vm in rh el 6: Migration unexpectedly failed (Gianluca Cecchi) 8. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Andrew Beekhof) 9. Re: unable to live migrate a vm in rh el 6: Migration unexpectedly failed (Gianluca Cecchi) 10. Re: unable to live migrate a vm in rh el 6: Migration unexpectedly failed (Gianluca Cecchi) ---------------------------------------------------------------------- Message: 1 Date: Tue, 8 Mar 2011 12:11:53 -0500 From: Valeriu Mutu To: linux clustering Subject: Re: [Linux-cluster] clvmd hangs on startup Message-ID: <20110308171153.GB272 at bsdera.pcbi.upenn.edu> Content-Type: text/plain; charset=us-ascii Hi, I think the problem is solved. I was using a 9000bytes MTU on the Xen virtual machines' iSCSI interface. Switching back to 1500bytes MTU caused the clvmd to start working. On Thu, Mar 03, 2011 at 11:50:57AM -0500, Valeriu Mutu wrote: On Wed, Mar 02, 2011 at 05:36:45PM -0500, Jeff Sturm wrote: Double-check that the 2nd node can read and write the shared iSCSI storage. Reading/writing from/to the iSCSI storage device works as seen below. 
On the 1st node: [root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 3.39855 seconds, 3.0 MB/s [root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 0.331069 seconds, 30.9 MB/s On the 2nd node: [root at vm2 ~]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 3.2465 seconds, 3.2 MB/s [root at vm2 ~]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null 10000+0 records in 10000+0 records out 10240000 bytes (10 MB) copied, 0.223337 seconds, 45.8 MB/s -------------- next part -------------- An HTML attachment was scrubbed... URL: From ooolinux at 163.com Wed Mar 9 14:13:35 2011 From: ooolinux at 163.com (yue) Date: Wed, 9 Mar 2011 22:13:35 +0800 (CST) Subject: [Linux-cluster] which is better gfs2 and ocfs2? Message-ID: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> which is better gfs2 and ocfs2? i want to share fc-san, do you know which is better? stablility,performmance? thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeff.sturm at eprize.com Wed Mar 9 14:48:03 2011 From: jeff.sturm at eprize.com (Jeff Sturm) Date: Wed, 9 Mar 2011 09:48:03 -0500 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C34D@hugo.eprize.local> Do you expect to get an objective answer to that from a Red Hat list? Most users on this forum are familiar with GFS2, some may have tried OCFS2 but there's bound to be a bias. GFS has been extremely stable for us (haven't migrated to GFS2 yet, went into production with GFS in 2008). Just last night in fact a single hardware node failed in one of our virtual test clusters, the fencing operations were successful and everything recovered nicely. The cluster never lost quorum and disruption was minimal. Performance is highly variable depending on the software application. We have developed our own application which gave us freedom to tailor it for GFS, improving performance and throughput significantly. Regardless of what you hear, why not give both a try? Your evaluation and feedback would be very useful to the cluster community. -Jeff From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of yue Sent: Wednesday, March 09, 2011 9:14 AM To: linux-cluster Subject: [Linux-cluster] which is better gfs2 and ocfs2? which is better gfs2 and ocfs2? i want to share fc-san, do you know which is better? stablility,performmance? thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: From michael.lackner at unileoben.ac.at Wed Mar 9 14:53:40 2011 From: michael.lackner at unileoben.ac.at (Michael Lackner) Date: Wed, 09 Mar 2011 15:53:40 +0100 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> Message-ID: <4D779474.6020509@unileoben.ac.at> I guess not all usage scenarios are comparable, but I once tried to use GFS2 as well as OCFS2 to share a FC SAN to three nodes using 8GBit FC and 1GBit Ethernet for the cluster communication. 
Additionally, i compared it to a trial version of Dataplows SAN File System (SFS). I was also supposed to compare it to Quantum StorNext, but there just wasn't enough time for that. OS was CentOS 5.3 at that time. So I tried a lot of performance tuning settings for all three, and it was like this: 1.) SFS was the fastest, but caused reproducible kernel panics. Those were fixed by Dataplow, but then SFS produced corrupted data when writing large files. Unusable in that state, so we gave up. SFS uses NFS for lock management. Noteworthy: Writing data on the machine with the NFS lock manager also crippled the I/O performance for all the other nodes in a VERY, VERY bad way.. 2.) GFS2 was the slowest, and despite all the tunings I tried, it never came close to anything that any local FS would provide in terms of speed (compared to EXT3 and XFS). The statfs() calls pretty much crippled the FS. Multiple I/O streams on multiple nodes: Not a good idea it seems.. Sometimes you have to wait for minutes for the FS to just give you any feedback, when you're hammering it with let's say 30 sequential write streams across 3 nodes, with the streams equally distributed among them. 3.) OCFS2 was slightly faster than GFS2, especially when it came to statfs(), like ls -l. It did not slow down that much. But overall, it was still just far too slow. Our solution: Hook up the SAN on one node only, and share via NFS over GBit Ethernet. Overall, we are getting better results even with the obvious network overhead, especially when doing a lot of I/O on multiple clients. Our original goal was to provide a high-speed centralized storage solution for multiple nodes without having to use ethernet. This failed completely unfortunately. Hope this helps, it's just my experience though. As usual, mileage may vary... yue wrote: > which is better gfs2 and ocfs2? > i want to share fc-san, do you know which is better? > stablility,performmance? -- Michael Lackner Lehrstuhl f?r Informationstechnologie, Montanuniversit?t Leoben IT Administration michael.lackner at mu-leoben.at | +43 (0)3842/402-1505 From rhurst at bidmc.harvard.edu Wed Mar 9 15:23:31 2011 From: rhurst at bidmc.harvard.edu (rhurst at bidmc.harvard.edu) Date: Wed, 9 Mar 2011 10:23:31 -0500 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> Message-ID: <50168EC934B8D64AA8D8DD37F840F3DE0568486C63@EVS2CCR.its.caregroup.org> Depends on the application's use of the filesystem and your processing usage patterns. You should evaluate both. We're using 8gbit FC and 1gb private networking between two pairs (test & production) of 4-node clusters on IBM BladeCenter. We used RHEL 4 GFS from 2007 - 2010 without issue. Upgraded to RHEL 5u5, then most of the GFS filesystems to GFS2 without issues (yet). We'd like to see a working cluster configuration using GFS2 on KVM guests. ________________________________ From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of yue Sent: Wednesday, March 09, 2011 9:14 AM To: linux-cluster Subject: [Linux-cluster] which is better gfs2 and ocfs2? which is better gfs2 and ocfs2? i want to share fc-san, do you know which is better? stablility,performmance? thanks -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From gregory.lee.bartholomew at gmail.com Wed Mar 9 15:33:53 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Wed, 09 Mar 2011 09:33:53 -0600 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> Message-ID: <4D779DE1.7040306@gmail.com> Arrr ... I was just starting to get all this figured out and you've gone and changed EVERYTHING!!! :-) Since I'm now using cman, should I favor the RA's that are listed by "crm ra list ocf redhat" (ocf:redhat:ip.sh instead of ocf:heartbeat:IPaddr2, ocf:redhat:apache.sh instead of ocf:heartbeat:apache, etc.)? gb On 03/09/2011 02:48 AM, Andrew Beekhof wrote: > On Tue, Mar 8, 2011 at 8:55 PM, Fabio M. Di Nitto wrote: >> On 03/08/2011 08:53 PM, Gregory Bartholomew wrote: >>> Hi Fabio M. Di Nitto, >>> >>> FYI, I was just trying to set up gfs2 under pacemaker on Fedora 14 >>> X86_64 and although yum provides '*/gfs_controld.pcmk' showed that I >>> needed the dlm-pcmk-3.0.17-1.fc14.x86_64 and >>> gfs-pcmk-3.0.17-1.fc14.x86_64 packages, yum install dlm-pcmk gfs-pcmk >>> would simply report "Nothing to do". rpm -q showed that I didn't have >>> the packages installed. I tried installing the cman package but that >>> didn't help. I finally got it working by downloading the packages with >>> wget and installing them with rpm -ivh. >>> >>> FYI, the dlm-pcmk and gfs-pcmk packages seem to be broken in the Fedora >>> 14 x86_64 database at the moment. >> >> No, those packages have been removed intentionally since pacemaker now >> supports cman cluster manager and they become obsoleted. >> >> So very short summary: >> >> configure cman for clusternodes >> start cman (including dlm/gfs controld) >> tell pacemaker to use cman >> configure fencing and all services. > > A week or so ago I added a big warning to the bottom of: > http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch/ch08s02.html > > and an appendix for configuring cman+pacemaker. > Hopefully it will be of some help. From thomas at sjolshagen.net Wed Mar 9 15:48:10 2011 From: thomas at sjolshagen.net (Thomas Sjolshagen) Date: Wed, 09 Mar 2011 10:48:10 -0500 Subject: [Linux-cluster] =?utf-8?q?which_is_better_gfs2_and_ocfs2=3F?= In-Reply-To: <50168EC934B8D64AA8D8DD37F840F3DE0568486C63@EVS2CCR.its.caregroup.org> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> <50168EC934B8D64AA8D8DD37F840F3DE0568486C63@EVS2CCR.its.caregroup.org> Message-ID: On Wed, 9 Mar 2011 10:23:31 -0500, rhurst at bidmc.harvard.edu wrote: > We'd like to see a working cluster configuration using GFS2 on KVM guests. Got one of those although it's a very small-scale setup using Fedora 14. Bare metal uses gfs2 for hosting KVM image files. vm's managed by RH cluster3 stack. VMs use gfs2 for sharing Maildir spool for a small postfix/dovecot setup with webmail frontend. All on shared iSCSI based storage. -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew at beekhof.net Wed Mar 9 15:54:42 2011 From: andrew at beekhof.net (Andrew Beekhof) Date: Wed, 9 Mar 2011 16:54:42 +0100 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D779DE1.7040306@gmail.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> <4D779DE1.7040306@gmail.com> Message-ID: On Wed, Mar 9, 2011 at 4:33 PM, Gregory Bartholomew wrote: > Arrr ... 
I was just starting to get all this figured out and you've gone and > changed EVERYTHING!!! :-) Very little is actually changed :-) These days cman is mostly just a small corosync plugin. I'm not sure if this was the case back when we ported Pacemaker to corosync, but it would have simplified a lot if we'd sucked in that little plugin instead of writing our own. > > Since I'm now using cman, should I favor the RA's that are listed by "crm ra > list ocf redhat" (ocf:redhat:ip.sh instead of ocf:heartbeat:IPaddr2, > ocf:redhat:apache.sh instead of ocf:heartbeat:apache, etc.)? No, we're only using cman for its quorum and membership information. And the only reason for doing that is so that everything is getting it from the same source (and the "native" pcmk variants aren't widely available). Everything else is unchanged. > > gb > > On 03/09/2011 02:48 AM, Andrew Beekhof wrote: >> >> On Tue, Mar 8, 2011 at 8:55 PM, Fabio M. Di Nitto >> ?wrote: >>> >>> On 03/08/2011 08:53 PM, Gregory Bartholomew wrote: >>>> >>>> Hi Fabio M. Di Nitto, >>>> >>>> FYI, I was just trying to set up gfs2 under pacemaker on Fedora 14 >>>> X86_64 and although yum provides '*/gfs_controld.pcmk' showed that I >>>> needed the dlm-pcmk-3.0.17-1.fc14.x86_64 and >>>> gfs-pcmk-3.0.17-1.fc14.x86_64 packages, yum install dlm-pcmk gfs-pcmk >>>> would simply report "Nothing to do". ?rpm -q showed that I didn't have >>>> the packages installed. ?I tried installing the cman package but that >>>> didn't help. ?I finally got it working by downloading the packages with >>>> wget and installing them with rpm -ivh. >>>> >>>> FYI, the dlm-pcmk and gfs-pcmk packages seem to be broken in the Fedora >>>> 14 x86_64 database at the moment. >>> >>> No, those packages have been removed intentionally since pacemaker now >>> supports cman cluster manager and they become obsoleted. >>> >>> So very short summary: >>> >>> configure cman for clusternodes >>> start cman (including dlm/gfs controld) >>> tell pacemaker to use cman >>> configure fencing and all services. >> >> A week or so ago I added a big warning to the bottom of: >> >> ?http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch/ch08s02.html >> >> and an appendix for configuring cman+pacemaker. >> Hopefully it will be of some help. > From gregory.lee.bartholomew at gmail.com Wed Mar 9 17:01:17 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Wed, 09 Mar 2011 11:01:17 -0600 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D7689B9.7070906@redhat.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> Message-ID: <4D77B25D.5010603@gmail.com> On 03/08/2011 01:55 PM, Fabio M. Di Nitto wrote: > On 03/08/2011 08:53 PM, Gregory Bartholomew wrote: >> Hi Fabio M. Di Nitto, >> >> FYI, I was just trying to set up gfs2 under pacemaker on Fedora 14 >> X86_64 and although yum provides '*/gfs_controld.pcmk' showed that I >> needed the dlm-pcmk-3.0.17-1.fc14.x86_64 and >> gfs-pcmk-3.0.17-1.fc14.x86_64 packages, yum install dlm-pcmk gfs-pcmk >> would simply report "Nothing to do". rpm -q showed that I didn't have >> the packages installed. I tried installing the cman package but that >> didn't help. I finally got it working by downloading the packages with >> wget and installing them with rpm -ivh. >> >> FYI, the dlm-pcmk and gfs-pcmk packages seem to be broken in the Fedora >> 14 x86_64 database at the moment. 
> > No, those packages have been removed intentionally since pacemaker now > supports cman cluster manager and they become obsoleted. > > So very short summary: > > configure cman for clusternodes > start cman (including dlm/gfs controld) > tell pacemaker to use cman > configure fencing and all services. > > Fabio Hello again Linux Clustering group, So I've switched to using cman and I have an IP resource shared between my two nodes, but now I'm having trouble with GFS2 again. When I try to mount my active/active iscsi partition, I get: [root at eb2024-58 ~]# mount /dev/sda1 /mnt gfs_controld join connect error: Connection refused error mounting lockproto lock_dlm I was able to create a partition on my iscsi device and format it with "mkfs.gfs2 -p lock_dlm -j 2 -t pcmk:iscsi /dev/sda1" and I can see the partition on both nodes with "fdisk -l", so I think everything iscsi is working. I was said earlier that I needed to "start cman (including dlm/gfs controld)". I see the cman service and it is started and running, but the only dlm/gfs service that I see is one called "gfs2" and when I try to start it I get: [root at eb2024-58 ~]# service gfs2 start GFS2: no entries found in /etc/fstab So why can't I mount my iscsi partition and where are these elusive dlm/gfs crontrold services? Thanks, gb From gregory.lee.bartholomew at gmail.com Wed Mar 9 18:03:09 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Wed, 09 Mar 2011 12:03:09 -0600 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D77B25D.5010603@gmail.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> <4D77B25D.5010603@gmail.com> Message-ID: <4D77C0DD.4070405@gmail.com> On 03/09/2011 11:01 AM, Gregory Bartholomew wrote: > Hello again Linux Clustering group, > > So I've switched to using cman and I have an IP resource shared between > my two nodes, but now I'm having trouble with GFS2 again. When I try to > mount my active/active iscsi partition, I get: > > [root at eb2024-58 ~]# mount /dev/sda1 /mnt > gfs_controld join connect error: Connection refused > error mounting lockproto lock_dlm > > I was able to create a partition on my iscsi device and format it with > "mkfs.gfs2 -p lock_dlm -j 2 -t pcmk:iscsi /dev/sda1" and I can see the > partition on both nodes with "fdisk -l", so I think everything iscsi is > working. > > I was said earlier that I needed to "start cman (including dlm/gfs > controld)". I see the cman service and it is started and running, but > the only dlm/gfs service that I see is one called "gfs2" and when I try > to start it I get: > > [root at eb2024-58 ~]# service gfs2 start > GFS2: no entries found in /etc/fstab > > So why can't I mount my iscsi partition and where are these elusive > dlm/gfs crontrold services? > > Thanks, > gb Never mind, I figured it out ... I needed to install the gfs2-cluster package and start its service and I also had a different name for my cluster in /etc/cluster/cluster.conf than what I was using in my mkfs.gfs2 command. It's all working now. Thanks to those who helped me get this going, gb From balajisundar at midascomm.com Thu Mar 10 04:42:31 2011 From: balajisundar at midascomm.com (Balaji) Date: Thu, 10 Mar 2011 10:12:31 +0530 Subject: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 15 In-Reply-To: References: Message-ID: <4D7856B7.4070105@midascomm.com> Dear All, Currently other node is shutdown. 
First of all we will check the cluster is up in simplex mode Regards -S.Balaji linux-cluster-request at redhat.com wrote: >Send Linux-cluster mailing list submissions to > linux-cluster at redhat.com > >To subscribe or unsubscribe via the World Wide Web, visit > https://www.redhat.com/mailman/listinfo/linux-cluster >or, via email, send a message with subject or body 'help' to > linux-cluster-request at redhat.com > >You can reach the person managing the list at > linux-cluster-owner at redhat.com > >When replying, please edit your Subject line so it is more specific >than "Re: Contents of Linux-cluster digest..." > > >Today's Topics: > > 1. Re: Linux-cluster Digest, Vol 83, Issue 13 (Sunil_Gupta2 at Dell.com) > 2. which is better gfs2 and ocfs2? (yue) > 3. Re: which is better gfs2 and ocfs2? (Jeff Sturm) > 4. Re: which is better gfs2 and ocfs2? (Michael Lackner) > 5. Re: which is better gfs2 and ocfs2? (rhurst at bidmc.harvard.edu) > 6. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Gregory Bartholomew) > 7. Re: which is better gfs2 and ocfs2? (Thomas Sjolshagen) > 8. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Andrew Beekhof) > > >---------------------------------------------------------------------- > >Message: 1 >Date: Wed, 9 Mar 2011 17:44:17 +0530 >From: >To: >Subject: Re: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 13 >Message-ID: > <8EF1FE59C3C8694E94F558EB27E464B71D130C752D at BLRX7MCDC201.AMER.DELL.COM> > >Content-Type: text/plain; charset="us-ascii" > >One node is offline cluster is not formed....check if multicast traffic is working... > >--Sunil > >From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Balaji >Sent: Wednesday, March 09, 2011 4:54 PM >To: linux-cluster at redhat.com >Subject: Re: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 13 > >Dear All, > > Please find attached log file for more analysis > Please help me to solve this problem ASAP. > > Clustat Command Output is below > [root at corviewprimary ~]# clustat > Cluster Status for EMSCluster @ Wed Mar 9 17:00:03 2011 > Member Status: Quorate > > Member Name ID Status > ----------- ------- ---- ------ > corviewprimary 1 Online, Local > corviewsecondary 2 Offline > > [root at corviewprimary ~]# > >Regards, >-S.Balaji > >linux-cluster-request at redhat.com wrote: > >Send Linux-cluster mailaddr:115.249.107.179ing list submissions to > > linux-cluster at redhat.com > > > >To subscribe or unsubscribe via the World Wide Web, visit > > https://www.redhat.com/mailman/listinfo/linux-cluster > >or, via email, send a message with subject or body 'help' to > > linux-cluster-request at redhat.com > > > >You can reach the person managing the list at > > linux-cluster-owner at redhat.com > > > >When replying, please edit your Subject line so it is more specific > >than "Re: Contents of Linux-cluster digest..." > > > > > >Today's Topics: > > > > 1. Re: clvmd hangs on startup (Valeriu Mutu) > > 2. Re: clvmd hangs on startup (Jeff Sturm) > > 3. dlm-pcmk-3.0.17-1.fc14.x86_64 and > > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Gregory Bartholomew) > > 4. Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and > > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Fabio M. Di Nitto) > > 5. Re: unable to live migrate a vm in rh el 6: Migration > > unexpectedly failed (Lon Hohberger) > > 6. Re: rgmanager not running (Sunil_Gupta2 at Dell.com) > > 7. Re: unable to live migrate a vm in rh el 6: Migration > > unexpectedly failed (Gianluca Cecchi) > > 8. 
Re: dlm-pcmk-3.0.17-1.fc14.x86_64 and > > gfs-pcmk-3.0.17-1.fc14.x86_64 woes (Andrew Beekhof) > > 9. Re: unable to live migrate a vm in rh el 6: Migration > > unexpectedly failed (Gianluca Cecchi) > > 10. Re: unable to live migrate a vm in rh el 6: Migration > > unexpectedly failed (Gianluca Cecchi) > > > > > >---------------------------------------------------------------------- > > > >Message: 1 > >Date: Tue, 8 Mar 2011 12:11:53 -0500 > >From: Valeriu Mutu > >To: linux clustering > >Subject: Re: [Linux-cluster] clvmd hangs on startup > >Message-ID: <20110308171153.GB272 at bsdera.pcbi.upenn.edu> > >Content-Type: text/plain; charset=us-ascii > > > >Hi, > > > >I think the problem is solved. I was using a 9000bytes MTU on the Xen virtual machines' iSCSI interface. Switching back to 1500bytes MTU caused the clvmd to start working. > > > >On Thu, Mar 03, 2011 at 11:50:57AM -0500, Valeriu Mutu wrote: > > > >On Wed, Mar 02, 2011 at 05:36:45PM -0500, Jeff Sturm wrote: > > > >Double-check that the 2nd node can read and write the shared iSCSI > >storage. > > > >Reading/writing from/to the iSCSI storage device works as seen below. > > > >On the 1st node: > >[root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes > >10000+0 records in > >10000+0 records out > >10240000 bytes (10 MB) copied, 3.39855 seconds, 3.0 MB/s > > > >[root at vm1 cluster]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null > >10000+0 records in > >10000+0 records out > >10240000 bytes (10 MB) copied, 0.331069 seconds, 30.9 MB/s > > > >On the 2nd node: > >[root at vm2 ~]# dd count=10000 bs=1024 if=/dev/urandom of=/dev/mapper/pcbi-homes > >10000+0 records in > >10000+0 records out > >10240000 bytes (10 MB) copied, 3.2465 seconds, 3.2 MB/s > > > >[root at vm2 ~]# dd count=10000 bs=1024 if=/dev/mapper/pcbi-homes of=/dev/null > >10000+0 records in > >10000+0 records out > >10240000 bytes (10 MB) copied, 0.223337 seconds, 45.8 MB/s > > > > > > > >-------------- next part -------------- >An HTML attachment was scrubbed... >URL: > >------------------------------ > >Message: 2 >Date: Wed, 9 Mar 2011 22:13:35 +0800 (CST) >From: yue >To: linux-cluster >Subject: [Linux-cluster] which is better gfs2 and ocfs2? >Message-ID: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux at 163.com> >Content-Type: text/plain; charset="gbk" > >which is better gfs2 and ocfs2? >i want to share fc-san, do you know which is better? >stablility,performmance? > > >thanks >-------------- next part -------------- >An HTML attachment was scrubbed... >URL: > >------------------------------ > >Message: 3 >Date: Wed, 9 Mar 2011 09:48:03 -0500 >From: Jeff Sturm >To: linux clustering >Subject: Re: [Linux-cluster] which is better gfs2 and ocfs2? >Message-ID: > <64D0546C5EBBD147B75DE133D798665F0855C34D at hugo.eprize.local> >Content-Type: text/plain; charset="us-ascii" > >Do you expect to get an objective answer to that from a Red Hat list? >Most users on this forum are familiar with GFS2, some may have tried >OCFS2 but there's bound to be a bias. > > > >GFS has been extremely stable for us (haven't migrated to GFS2 yet, went >into production with GFS in 2008). Just last night in fact a single >hardware node failed in one of our virtual test clusters, the fencing >operations were successful and everything recovered nicely. The cluster >never lost quorum and disruption was minimal. > > > >Performance is highly variable depending on the software application. 
>We have developed our own application which gave us freedom to tailor it >for GFS, improving performance and throughput significantly. > > > >Regardless of what you hear, why not give both a try? Your evaluation >and feedback would be very useful to the cluster community. > > > >-Jeff > > > >From: linux-cluster-bounces at redhat.com >[mailto:linux-cluster-bounces at redhat.com] On Behalf Of yue >Sent: Wednesday, March 09, 2011 9:14 AM >To: linux-cluster >Subject: [Linux-cluster] which is better gfs2 and ocfs2? > > > >which is better gfs2 and ocfs2? > >i want to share fc-san, do you know which is better? > >stablility,performmance? > > > > > >thanks > > > >-------------- next part -------------- >An HTML attachment was scrubbed... >URL: > >------------------------------ > >Message: 4 >Date: Wed, 09 Mar 2011 15:53:40 +0100 >From: Michael Lackner >To: linux clustering >Subject: Re: [Linux-cluster] which is better gfs2 and ocfs2? >Message-ID: <4D779474.6020509 at unileoben.ac.at> >Content-Type: text/plain; charset=UTF-8; format=flowed > >I guess not all usage scenarios are comparable, but I once >tried to use GFS2 as well as OCFS2 to share a FC SAN to three >nodes using 8GBit FC and 1GBit Ethernet for the cluster >communication. Additionally, i compared it to a trial version >of Dataplows SAN File System (SFS). I was also supposed to >compare it to Quantum StorNext, but there just wasn't enough >time for that. > >OS was CentOS 5.3 at that time. > >So I tried a lot of performance tuning settings for all three, >and it was like this: > >1.) SFS was the fastest, but caused reproducible kernel panics. >Those were fixed by Dataplow, but then SFS produced corrupted data >when writing large files. Unusable in that state, so we gave up. >SFS uses NFS for lock management. Noteworthy: Writing data on the >machine with the NFS lock manager also crippled the I/O performance >for all the other nodes in a VERY, VERY bad way.. > >2.) GFS2 was the slowest, and despite all the tunings I tried, it >never came close to anything that any local FS would provide in >terms of speed (compared to EXT3 and XFS). The statfs() calls >pretty much crippled the FS. Multiple I/O streams on multiple nodes: >Not a good idea it seems.. Sometimes you have to wait for minutes >for the FS to just give you any feedback, when you're hammering >it with let's say 30 sequential write streams across 3 nodes, with >the streams equally distributed among them. > >3.) OCFS2 was slightly faster than GFS2, especially when it came >to statfs(), like ls -l. It did not slow down that much. But overall, >it was still just far too slow. > >Our solution: Hook up the SAN on one node only, and share via NFS >over GBit Ethernet. Overall, we are getting better results even >with the obvious network overhead, especially when doing a lot of >I/O on multiple clients. > >Our original goal was to provide a high-speed centralized storage >solution for multiple nodes without having to use ethernet. This >failed completely unfortunately. > >Hope this helps, it's just my experience though. As usual, mileage >may vary... > >yue wrote: > > >>which is better gfs2 and ocfs2? >>i want to share fc-san, do you know which is better? >>stablility,performmance? >> >> > > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From andrew at beekhof.net Thu Mar 10 07:14:53 2011 From: andrew at beekhof.net (Andrew Beekhof) Date: Thu, 10 Mar 2011 08:14:53 +0100 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D77C0DD.4070405@gmail.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> <4D77B25D.5010603@gmail.com> <4D77C0DD.4070405@gmail.com> Message-ID: On Wed, Mar 9, 2011 at 7:03 PM, Gregory Bartholomew wrote: > Never mind, I figured it out ... I needed to install the gfs2-cluster > package and start its service and I also had a different name for my cluster > in /etc/cluster/cluster.conf than what I was using in my mkfs.gfs2 command. > > It's all working now. ?Thanks to those who helped me get this going, So you're still using Pacemaker to mount/unmount the filesystem and other services? If so, were there any discrepancies in the documentation describing how to configure this? From gianluca.cecchi at gmail.com Thu Mar 10 15:18:37 2011 From: gianluca.cecchi at gmail.com (Gianluca Cecchi) Date: Thu, 10 Mar 2011 16:18:37 +0100 Subject: [Linux-cluster] unable to live migrate a vm in rh el 6: Migration unexpectedly failed In-Reply-To: References: Message-ID: On Wed, Mar 9, 2011 at 9:47 AM, Gianluca Cecchi wrote: [snip] > Or something related with firewall perhaps. > Can I stop firewall at all and have libvirtd working at the same time > to test ...? > I know libvirtd puts some iptables rules itself.. > > Gianluca > OK. It was indeed a problem related to iptables rules. After adding at both ends this rule about intracluster network tcp ports (.31 for the other node) I get live migration working ok using clusvcadm command iptables -t filter -I INPUT 17 -s 192.168.16.32/32 -p tcp -m multiport --dports 49152:49215 -j ACCEPT I'm going to put it in /etc/sysconfig/iptables in the middle of these two: -I FORWARD -m physdev --physdev-is-bridged -j ACCEPT -A INPUT -j REJECT --reject-with icmp-host-prohibited I can also simulate the clusvcadm command with virsh (after freezing the resource) with virsh migrate --live exorapr1 qemu+ssh://intrarhev2/system tcp:intrarhev2 otherwise the ssh connection is tunneled through hostname in connection string, but data exchange happens anyway through the public lan (or what hostname resolves to, I suppose). BTW: I noticed that when you freeze a vm resource you don't get the [Z] notification at the right side of the corresponding line, as it happens with standard services... Is this intentional or could I post a bugzilla for it? For a service, when frozen: service:MYSRV intrarhev2 started [Z] [root at rhev2 ]# clusvcadm -Z vm:exorapr1 Local machine freezing vm:exorapr1...Success [root at rhev2 ]# clustat | grep orapr1 vm:exorapr1 intrarhev1 started Cheers, Gianluca From gregory.lee.bartholomew at gmail.com Thu Mar 10 15:52:26 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Thu, 10 Mar 2011 09:52:26 -0600 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> <4D77B25D.5010603@gmail.com> <4D77C0DD.4070405@gmail.com> Message-ID: <4D78F3BA.1000904@gmail.com> On 03/10/2011 01:14 AM, Andrew Beekhof wrote: > On Wed, Mar 9, 2011 at 7:03 PM, Gregory Bartholomew > wrote: >> Never mind, I figured it out ... 
I needed to install the gfs2-cluster >> package and start its service and I also had a different name for my cluster >> in /etc/cluster/cluster.conf than what I was using in my mkfs.gfs2 command. >> >> It's all working now. Thanks to those who helped me get this going, > > So you're still using Pacemaker to mount/unmount the filesystem and > other services? > If so, were there any discrepancies in the documentation describing > how to configure this? Good morning, This is what I did to get the file system going: ----- yum install -y httpd gfs2-cluster gfs2-utils chkconfig gfs2-cluster on service gfs2-cluster start mkfs.gfs2 -p lock_dlm -j 2 -t siue-cs:iscsi /dev/sda1 cat <<-END | crm configure primitive gfs ocf:heartbeat:Filesystem params device="/dev/sda1" directory="/var/www/html" fstype="gfs2" op start interval="0" timeout="60s" op stop interval="0" timeout="60s" configure clone dual-gfs gfs END ----- I think this sed command was also missing from the guide: sed -i '/^#/,/#<\/Location>/{s/^#//;s/Allow from .example.com/Allow from 127.0.0.1/}' /etc/httpd/conf/httpd.conf I've attached the full record of all the commands that I used to set up my nodes to this email. It has, at the end, the final result of "crm configure show". gb -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: node-config.txt URL: From jinzishuai at gmail.com Thu Mar 10 17:09:22 2011 From: jinzishuai at gmail.com (Shi Jin) Date: Thu, 10 Mar 2011 10:09:22 -0700 Subject: [Linux-cluster] What is the proper procedure to reboot a node in a cluster? Message-ID: Hi there, I've setup a two-node cluster with cman, clvmd and gfs2. I don't use qdisk but had I would like to know what is the proper procedure to reboot a node in the two-node cluster (maybe this applies for all size?) when both nodes are functioning fine but I just want to reboot one for some reason (for example, upgrade kernel). Is there a preferred/better way to reboot the machine rather than just running the "reboot" command as root. I have been doing the "reboot" command so far and it sometimes creates problems for us, including making the other node to fail. Thank you very much. Shi -- Shi Jin, Ph.D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From linux at alteeve.com Thu Mar 10 17:21:23 2011 From: linux at alteeve.com (Digimer) Date: Thu, 10 Mar 2011 12:21:23 -0500 Subject: [Linux-cluster] What is the proper procedure to reboot a node in a cluster? In-Reply-To: References: Message-ID: <4D790893.6000107@alteeve.com> On 03/10/2011 12:09 PM, Shi Jin wrote: > Hi there, > > I've setup a two-node cluster with cman, clvmd and gfs2. I don't use > qdisk but had > > > I would like to know what is the proper procedure to reboot a node in > the two-node cluster (maybe this applies for all size?) when both nodes > are functioning fine but I just want to reboot one for some reason (for > example, upgrade kernel). Is there a preferred/better way to reboot the > machine rather than just running the "reboot" command as root. I have > been doing the "reboot" command so far and it sometimes creates problems > for us, including making the other node to fail. > > Thank you very much. > Shi > -- > Shi Jin, Ph.D. What I do is migrate any services from the node to the other member, then stop rgmanager->gfs2->clvmd->cman (obviously adapt to what you are running). If you have DRBD, then stop it as well. 
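Spelled out as commands on the node being taken down, that sequence is roughly the following (a sketch only, assuming the stock init scripts and that rgmanager, gfs2, clvmd and cman are what is actually running; the service and node names below are placeholders):

# 1. Relocate anything this node is hosting to the surviving member, e.g.:
clusvcadm -r service:myservice -m othernode   # placeholder names

# 2. Stop the stack top-down, in the order above:
service rgmanager stop
service gfs2 stop        # unmounts the clustered GFS2 filesystems
service clvmd stop
service cman stop
# stop drbd here as well, if DRBD is in use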
At this point, the other node should be the only one in the cluster (confirm with
'cman_tool status'). If all is good, reboot. Once up, rejoin the cluster.

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin: http://nodeassassin.org

From ricks at nerd.com Thu Mar 10 17:40:09 2011
From: ricks at nerd.com (Rick Stevens)
Date: Thu, 10 Mar 2011 09:40:09 -0800
Subject: [Linux-cluster] Linux-cluster Digest, Vol 83, Issue 15
In-Reply-To: <4D7856B7.4070105@midascomm.com>
References: <4D7856B7.4070105@midascomm.com>
Message-ID: <4D790CF9.9070406@nerd.com>

On 03/09/2011 08:42 PM, Balaji wrote:
> Dear All,
>
> Currently other node is shutdown.
> First of all we will check the cluster is up in simplex mode

Please don't respond to message digests. Create a NEW message with an
appropriate subject line along with your question or comment.

-- 
----------------------------------------------------------------------
- Rick Stevens, Systems Engineer, C2 Hosting          ricks at nerd.com -
- AIM/Skype: therps2        ICQ: 22643734            Yahoo: origrps2 -
-                                                                    -
- Never put off 'til tommorrow what you can forget altogether!       -
----------------------------------------------------------------------

From lomazzog at dteenergy.com Thu Mar 10 19:05:02 2011
From: lomazzog at dteenergy.com (Gino Lomazzo)
Date: Thu, 10 Mar 2011 14:05:02 -0500
Subject: [Linux-cluster] Performing fsck on large gfs file-systems.
In-Reply-To: <4D790CF9.9070406@nerd.com>
Message-ID:

Good Afternoon;

We currently have a critical Oracle application running on a two node
Red Hat Cluster environment. (RHEL5u5)
Our /oracle/d01 ( gfs file system) is about 2TB, when the system reboots
it takes a few hours to perform a fsck.
Is a fsck required on a gfs file system?

Thank you!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From gregory.lee.bartholomew at gmail.com Thu Mar 10 19:42:57 2011 From: gregory.lee.bartholomew at gmail.com (Gregory Bartholomew) Date: Thu, 10 Mar 2011 13:42:57 -0600 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D78F3BA.1000904@gmail.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> <4D77B25D.5010603@gmail.com> <4D77C0DD.4070405@gmail.com> <4D78F3BA.1000904@gmail.com> Message-ID: <4D7929C1.3080209@gmail.com> FYI, per: > Cluster shutdown tips > --------------------- > > * Avoiding a partly shutdown cluster due to lost quorum. > > There is a practical timing issue with respect to the shutdown steps being run > on all nodes when shutting down an entire cluster (or most of it). When > shutting down the entire cluster (or shutting down a node for an extended > period) use "cman_tool leave remove". This automatically reduces the number > of votes needed for quorum as each node leaves and prevents the loss of quorum > which could keep the last nodes from cleanly completing shutdown. > > Using the "remove" leave option should not be used in general since it > introduces potential split-brain risks. > > If the "remove" leave option is not used, quorum will be lost after enough > nodes have left the cluster. Once the cluster is inquorate, remaining members > that have not yet completed "fence_tool leave" in the steps above will be > stuck. Operations such as umounting gfs or leaving the fence domain will > block while the cluster is inquorate. They can continue and complete only > when quorum is regained. > > If this happens, one option is to join the cluster ("cman_tool join") on some > of the nodes that have left so that the cluster regains quorum and the stuck > nodes can complete their shutdown. Another option is to forcibly reduce the > number of expected votes for the cluster which allows the cluster to become > quorate again ("cman_tool expected "). > > ... > > Two node clusters > ----------------- > > Ordinarily the loss of quorum after one node fails out of two will prevent the > remaining node from continuing (if both nodes have one vote.) Some special > configuration options can be set to allow the one remaining node to continue > operating if the other fails. To do this only two nodes with one vote each can > be defined in cluster.conf. The two_node and expected_votes values must then be > set to 1 in the cman config section as follows. > > > > In http://sourceware.org/cluster/doc/usage.txt, it looks like example C.1 in http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html-single/Clusters_from_Scratch/index.html#ap-cman should be changed to: gb On 03/10/2011 09:52 AM, Gregory Bartholomew wrote: > On 03/10/2011 01:14 AM, Andrew Beekhof wrote: >> On Wed, Mar 9, 2011 at 7:03 PM, Gregory Bartholomew >> wrote: >>> Never mind, I figured it out ... I needed to install the gfs2-cluster >>> package and start its service and I also had a different name for my >>> cluster >>> in /etc/cluster/cluster.conf than what I was using in my mkfs.gfs2 >>> command. >>> >>> It's all working now. Thanks to those who helped me get this going, >> >> So you're still using Pacemaker to mount/unmount the filesystem and >> other services? >> If so, were there any discrepancies in the documentation describing >> how to configure this? 
> > Good morning, > > This is what I did to get the file system going: > > ----- > > yum install -y httpd gfs2-cluster gfs2-utils > chkconfig gfs2-cluster on > service gfs2-cluster start > > mkfs.gfs2 -p lock_dlm -j 2 -t siue-cs:iscsi /dev/sda1 > > cat <<-END | crm > configure primitive gfs ocf:heartbeat:Filesystem params > device="/dev/sda1" directory="/var/www/html" fstype="gfs2" op start > interval="0" timeout="60s" op stop interval="0" timeout="60s" > configure clone dual-gfs gfs > END > > ----- > > I think this sed command was also missing from the guide: > > sed -i '/^#/,/#<\/Location>/{s/^#//;s/Allow > from .example.com/Allow from 127.0.0.1/}' /etc/httpd/conf/httpd.conf > > I've attached the full record of all the commands that I used to set up > my nodes to this email. It has, at the end, the final result of "crm > configure show". > > gb From jinzishuai at gmail.com Thu Mar 10 19:58:56 2011 From: jinzishuai at gmail.com (Shi Jin) Date: Thu, 10 Mar 2011 12:58:56 -0700 Subject: [Linux-cluster] What is the proper procedure to reboot a node in a cluster? In-Reply-To: <4D790893.6000107@alteeve.com> References: <4D790893.6000107@alteeve.com> Message-ID: > > > > What I do is migrate any services from the node to the other member, > then stop rgmanager->gfs2->clvmd->cman (obviously adapt to what you are > running). If you have DRBD, then stop it as well. At this point, the > other node should be the only one in the cluster (confirm with > 'cman_tool status'). If all is good, reboot. Once up, rejoin the cluster. > > Thank you. I think what you did makes perfect sense but on the other hand shouldn't the reboot process stop the services in the right order in the first place? Maybe there is a timeout issue or they don't necessarily follow the right order? Do you let the boot system to start the services for you in the right order or you actually have to do it manually? Thanks. Shi -- Shi Jin, Ph.D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From linux at alteeve.com Thu Mar 10 20:02:28 2011 From: linux at alteeve.com (Digimer) Date: Thu, 10 Mar 2011 15:02:28 -0500 Subject: [Linux-cluster] What is the proper procedure to reboot a node in a cluster? In-Reply-To: References: <4D790893.6000107@alteeve.com> Message-ID: <4D792E54.1030109@alteeve.com> On 03/10/2011 02:58 PM, Shi Jin wrote: > Thank you. > I think what you did makes perfect sense but on the other hand shouldn't > the reboot process stop the services in the right order in the first > place? Maybe there is a timeout issue or they don't necessarily follow > the right order? > > Do you let the boot system to start the services for you in the right > order or you actually have to do it manually? > > Thanks. > > Shi It should, assuming that the KXXfoo entries exist for the cluster services in the right order, and that they all shut down properly. Of course, I find this not always is the case. So for that reason, I like to stop things manually, when I have the luxury. As for starting, it depends. If it's a machine I have ready access to, I generally like to start things manually. Mainly because of how heavily I use DRBD and it's tendency to have issues on startup. Not frequent, but frequent enough. If you do automatically start everything, pay close attention to the start order and do plenty of testing. 
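For what it's worth, the ordering the init scripts will actually use at boot and shutdown can be checked ahead of time rather than discovered the hard way. A rough sketch, assuming cman, clvmd, gfs2 and rgmanager are the services involved:

for svc in cman clvmd gfs2 rgmanager; do
    chkconfig --list "$svc"    # which runlevels start it
done

# The S##/K## prefixes on the symlinks give the real order: a lower S##
# starts earlier at boot, a lower K## is stopped earlier at shutdown.
ls /etc/rc.d/rc3.d/ | grep -Ei 'cman|clvmd|gfs2|rgmanager'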
-- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From alvaro.fernandez at sivsa.com Thu Mar 10 20:41:17 2011 From: alvaro.fernandez at sivsa.com (Alvaro Jose Fernandez) Date: Thu, 10 Mar 2011 21:41:17 +0100 Subject: [Linux-cluster] What is the proper procedure to reboot a node in acluster? References: Message-ID: <607D6181D9919041BE792D70EF2AEC48017568F1@LIMENS.sivsa.int> Hi, Given fencing is properly configured, I think the default boot/sshutdown RHCS scripts should work. I too use two_node (but no clvmd) in RHEL5.5 with latest updates to cman and rgmanager, and a shutdown -r works well (and a shutdown -h too). The other node cluster daemon should log this as a node shutdown in /var/log/messages, and it should adjust quorum, and not trigger a fencing action over the other node. If one halts and poweroff via shutdown -h one of the two nodes, and then reboots (via shutdown -r) the surviving node, the surviving node will fence the other. We have power switch fencing, and it should simply suceed (making a power off then a power on on the other node's outlets). Once this fencing suceeds, the boot sequence continues and the node assumes quorum. If later the other node is powered on, it should join the cluster without problems. alvaro, Hi there, I've setup a two-node cluster with cman, clvmd and gfs2. I don't use qdisk but had I would like to know what is the proper procedure to reboot a node in the two-node cluster (maybe this applies for all size?) when both nodes are functioning fine but I just want to reboot one for some reason (for example, upgrade kernel). Is there a preferred/better way to reboot the machine rather than just running the "reboot" command as root. I have been doing the "reboot" command so far and it sometimes creates problems for us, including making the other node to fail. Thank you very much. Shi -- Shi Jin, Ph.D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From abhishekf2k1 at gmail.com Fri Mar 11 05:28:27 2011 From: abhishekf2k1 at gmail.com (abhishek .) Date: Thu, 10 Mar 2011 21:28:27 -0800 Subject: [Linux-cluster] disable me Message-ID: please dont send me more info about cluster remove my id from mailing list -- abhishek -------------- next part -------------- An HTML attachment was scrubbed... URL: From laszlo.budai at gmail.com Fri Mar 11 09:01:49 2011 From: laszlo.budai at gmail.com (Budai Laszlo) Date: Fri, 11 Mar 2011 11:01:49 +0200 Subject: [Linux-cluster] documentation needed Message-ID: <4D79E4FD.2060305@gmail.com> Hello, can you point me to some documentation of the new cluster architecture available in RHEL6? I'm interested to learn about the internals. 
I'm thinking about documents like the "Cluster2 architecture" (http://people.redhat.com/teigland/cluster2-arch.txt), or "Symmetric Cluster Architecture and Component Technical Specifications" (http://people.redhat.com/teigland/sca.pdf) Thank you, Laszlo From andrew at beekhof.net Fri Mar 11 10:06:22 2011 From: andrew at beekhof.net (Andrew Beekhof) Date: Fri, 11 Mar 2011 11:06:22 +0100 Subject: [Linux-cluster] dlm-pcmk-3.0.17-1.fc14.x86_64 and gfs-pcmk-3.0.17-1.fc14.x86_64 woes In-Reply-To: <4D78F3BA.1000904@gmail.com> References: <4D76894B.6010809@gmail.com> <4D7689B9.7070906@redhat.com> <4D77B25D.5010603@gmail.com> <4D77C0DD.4070405@gmail.com> <4D78F3BA.1000904@gmail.com> Message-ID: On Thu, Mar 10, 2011 at 4:52 PM, Gregory Bartholomew wrote: > On 03/10/2011 01:14 AM, Andrew Beekhof wrote: >> >> On Wed, Mar 9, 2011 at 7:03 PM, Gregory Bartholomew >> ?wrote: >>> >>> Never mind, I figured it out ... I needed to install the gfs2-cluster >>> package and start its service and I also had a different name for my >>> cluster >>> in /etc/cluster/cluster.conf than what I was using in my mkfs.gfs2 >>> command. >>> >>> It's all working now. ?Thanks to those who helped me get this going, >> >> So you're still using Pacemaker to mount/unmount the filesystem and >> other services? >> If so, were there any discrepancies in the documentation describing >> how to configure this? > > Good morning, > > This is what I did to get the file system going: Excellent, very pleased to hear you got it working. I'll try and incorporate your feedback into the doc. > ----- > > yum install -y httpd gfs2-cluster gfs2-utils > chkconfig gfs2-cluster on > service gfs2-cluster start > > mkfs.gfs2 -p lock_dlm -j 2 -t siue-cs:iscsi /dev/sda1 > > cat <<-END | crm > configure primitive gfs ocf:heartbeat:Filesystem params device="/dev/sda1" > directory="/var/www/html" fstype="gfs2" op start interval="0" timeout="60s" > op stop interval="0" timeout="60s" > configure clone dual-gfs gfs > END > > ----- > > I think this sed command was also missing from the guide: > > sed -i '/^#/,/#<\/Location>/{s/^#//;s/Allow from > .example.com/Allow from 127.0.0.1/}' /etc/httpd/conf/httpd.conf What on earth does that do? :-) > > I've attached the full record of all the commands that I used to set up my > nodes to this email. ?It has, at the end, the final result of "crm configure > show". > > gb > From jinzishuai at gmail.com Fri Mar 11 16:28:05 2011 From: jinzishuai at gmail.com (Shi Jin) Date: Fri, 11 Mar 2011 09:28:05 -0700 Subject: [Linux-cluster] What is the proper procedure to reboot a node in acluster? In-Reply-To: <607D6181D9919041BE792D70EF2AEC48017568F1@LIMENS.sivsa.int> References: <607D6181D9919041BE792D70EF2AEC48017568F1@LIMENS.sivsa.int> Message-ID: Thank you all. The problem I have is that I don't seem to be able to get out of the cluster gracefully, even if I stop the services manually in the right order. For example, I joined the cluster manually by starting cman, clvmd and gfs2 in the order and everything is working just fine. Then I wanted to reboot. This time, I want to do it manually so I went to stop the services in order. [root at test2 ~]# service gfs2 stop Unmounting GFS2 filesystem (/vrstorm): [ OK ] [root at test2 ~]# service clvmd stop Signaling clvmd to exit [ OK ] Waiting for clvmd to exit: [FAILED] clvmd failed to exit [FAILED] Somehow clvmd cannot be stopped. I still have the process running root 2646 0.0 0.5 194476 45016 ? SLsl 02:18 0:00 clvmd -T30 How do I stop clvmd gracefully? I am running RHEL-6. 
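For reference, one thing that can keep clvmd from exiting is a clustered volume group that still has logical volumes active on the node. A rough sketch of that check (the volume group name is a placeholder, and whether it applies depends on the setup):

vgs -o vg_name,vg_attr          # a 'c' in the 6th attribute character marks a clustered VG
lvs -o lv_name,vg_name,lv_attr

vgchange -aln vg_shared         # deactivate the clustered VG locally first
service clvmd stop
dlm_tool ls                     # the clvmd lockspace should be gone once it exits cleanly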
[root at test2 ~]# uname -a Linux test2 2.6.32-71.18.2.el6.x86_64 #1 SMP Wed Mar 2 14:17:40 EST 2011 x86_64 x86_64 x86_64 GNU/Linux [root at test2 ~]# cat /etc/redhat-release Red Hat Enterprise Linux Server release 6.0 (Santiago) Thank you very much. Shi On Thu, Mar 10, 2011 at 1:41 PM, Alvaro Jose Fernandez < alvaro.fernandez at sivsa.com> wrote: > Hi, > > > > Given fencing is properly configured, I think the default boot/sshutdown > RHCS scripts should work. I too use two_node (but no clvmd) in RHEL5.5 with > latest updates to cman and rgmanager, and a shutdown -r works well (and a > shutdown -h too). The other node cluster daemon should log this as a node > shutdown in /var/log/messages, and it should adjust quorum, and not trigger > a fencing action over the other node. > > > > If one halts and poweroff via shutdown -h one of the two nodes, and then > reboots (via shutdown -r) the surviving node, the surviving node will fence > the other. We have power switch fencing, and it should simply suceed (making > a power off then a power on on the other node's outlets). Once this fencing > suceeds, the boot sequence continues and the node assumes quorum. > > > > If later the other node is powered on, it should join the cluster without > problems. > > > > alvaro, > > > > Hi there, > > > > I've setup a two-node cluster with cman, clvmd and gfs2. I don't use qdisk > but had > > > > > > I would like to know what is the proper procedure to reboot a node in the > two-node cluster (maybe this applies for all size?) when both nodes are > functioning fine but I just want to reboot one for some reason (for example, > upgrade kernel). Is there a preferred/better way to reboot the machine > rather than just running the "reboot" command as root. I have been doing the > "reboot" command so far and it sometimes creates problems for us, including > making the other node to fail. > > > > Thank you very much. > Shi > -- > Shi Jin, Ph.D. > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- Shi Jin, Ph.D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jinzishuai at gmail.com Fri Mar 11 16:42:00 2011 From: jinzishuai at gmail.com (Shi Jin) Date: Fri, 11 Mar 2011 09:42:00 -0700 Subject: [Linux-cluster] What is the proper procedure to reboot a node in acluster? In-Reply-To: References: <607D6181D9919041BE792D70EF2AEC48017568F1@LIMENS.sivsa.int> Message-ID: To follow up, I couldn't manually leave by dlm_tool [root at test2 log]# dlm_tool leave clvmd Leaving lockspace "clvmd" dlm_open_lockspace clvmd error (nil) 2 [root at test2 log]# dlm_tool ls dlm lockspaces name clvmd id 0x4104eefa flags 0x00000002 leave change member 2 joined 1 remove 0 failed 0 seq 1,1 members 1 2 Thanks. Shi On Fri, Mar 11, 2011 at 9:28 AM, Shi Jin wrote: > Thank you all. > The problem I have is that I don't seem to be able to get out of the > cluster gracefully, even if I stop the services manually in the right order. > For example, I joined the cluster manually by starting cman, clvmd and gfs2 > in the order and everything is working just fine. > > Then I wanted to reboot. This time, I want to do it manually so I went to > stop the services in order. > [root at test2 ~]# service gfs2 stop > Unmounting GFS2 filesystem (/vrstorm): [ OK ] > [root at test2 ~]# service clvmd stop > Signaling clvmd to exit [ OK ] > Waiting for clvmd to exit: [FAILED] > clvmd failed to exit [FAILED] > > Somehow clvmd cannot be stopped. 
I still have the process running > root 2646 0.0 0.5 194476 45016 ? SLsl 02:18 0:00 clvmd -T30 > > How do I stop clvmd gracefully? I am running RHEL-6. > [root at test2 ~]# uname -a > Linux test2 2.6.32-71.18.2.el6.x86_64 #1 SMP Wed Mar 2 14:17:40 EST 2011 > x86_64 x86_64 x86_64 GNU/Linux > [root at test2 ~]# cat /etc/redhat-release > Red Hat Enterprise Linux Server release 6.0 (Santiago) > > > Thank you very much. > > Shi > > > > On Thu, Mar 10, 2011 at 1:41 PM, Alvaro Jose Fernandez < > alvaro.fernandez at sivsa.com> wrote: > >> Hi, >> >> >> >> Given fencing is properly configured, I think the default boot/sshutdown >> RHCS scripts should work. I too use two_node (but no clvmd) in RHEL5.5 with >> latest updates to cman and rgmanager, and a shutdown -r works well (and a >> shutdown -h too). The other node cluster daemon should log this as a node >> shutdown in /var/log/messages, and it should adjust quorum, and not trigger >> a fencing action over the other node. >> >> >> >> If one halts and poweroff via shutdown -h one of the two nodes, and then >> reboots (via shutdown -r) the surviving node, the surviving node will fence >> the other. We have power switch fencing, and it should simply suceed (making >> a power off then a power on on the other node's outlets). Once this fencing >> suceeds, the boot sequence continues and the node assumes quorum. >> >> >> >> If later the other node is powered on, it should join the cluster without >> problems. >> >> >> >> alvaro, >> >> >> >> Hi there, >> >> >> >> I've setup a two-node cluster with cman, clvmd and gfs2. I don't use qdisk >> but had >> >> >> >> >> >> I would like to know what is the proper procedure to reboot a node in the >> two-node cluster (maybe this applies for all size?) when both nodes are >> functioning fine but I just want to reboot one for some reason (for example, >> upgrade kernel). Is there a preferred/better way to reboot the machine >> rather than just running the "reboot" command as root. I have been doing the >> "reboot" command so far and it sometimes creates problems for us, including >> making the other node to fail. >> >> >> >> Thank you very much. >> Shi >> -- >> Shi Jin, Ph.D. >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > > -- > Shi Jin, Ph.D. > > -- Shi Jin, Ph.D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ajb2 at mssl.ucl.ac.uk Fri Mar 11 23:57:41 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Fri, 11 Mar 2011 23:57:41 +0000 Subject: [Linux-cluster] clvmd hangs on startup In-Reply-To: <20110308171153.GB272@bsdera.pcbi.upenn.edu> References: <20110302215050.GD10674@bsdera.pcbi.upenn.edu> <64D0546C5EBBD147B75DE133D798665F0855C290@hugo.eprize.local> <20110303165056.GF10674@bsdera.pcbi.upenn.edu> <20110308171153.GB272@bsdera.pcbi.upenn.edu> Message-ID: <4D7AB6F5.6070107@mssl.ucl.ac.uk> On 08/03/11 17:11, Valeriu Mutu wrote: > Hi, > > I think the problem is solved. I was using a 9000bytes MTU on the Xen virtual machines' iSCSI interface. Switching back to 1500bytes MTU caused the clvmd to start working. As long as everything on the network is 9000bytes then you should be ok. RH's linux implementation doesn't seem to allow path MTU discovery on local network. From ajb2 at mssl.ucl.ac.uk Sat Mar 12 00:06:47 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Sat, 12 Mar 2011 00:06:47 +0000 Subject: [Linux-cluster] which is better gfs2 and ocfs2? 
In-Reply-To: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> Message-ID: <4D7AB917.80103@mssl.ucl.ac.uk> On 09/03/11 14:13, yue wrote: > which is better gfs2 and ocfs2? > i want to share fc-san, do you know which is better? "that depends" - it is highly dependent on the type of disk activity you are performing. There are various reviews of both FSes circulating. Personal observation: GFS and GFS2 currently have utterly rotten performance for activities involving many small files, such as NFS exporting /home via NFS sync mounts. They also fails miserably if there are a lot of files in a single directory (more than 5-700, with things getting unusable beyond about 1500 files) I have not used OCFS2 in production environments, so I cannot comment on its performance in these scenarios. From ajb2 at mssl.ucl.ac.uk Sat Mar 12 00:15:16 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Sat, 12 Mar 2011 00:15:16 +0000 Subject: [Linux-cluster] What is the proper procedure to reboot a node in a cluster? In-Reply-To: References: Message-ID: <4D7ABB14.2090201@mssl.ucl.ac.uk> The only reliable way I have found (rhel4 and 5) is this: 1: Migrate all services off the node. 2: Unmount as many GFS disks as possible. 3: Power cycle the node. The other nodes will recover quickly. "cman leave (remove) (force)" sometimes works but often doesn't. From jeff.sturm at eprize.com Sat Mar 12 17:46:15 2011 From: jeff.sturm at eprize.com (Jeff Sturm) Date: Sat, 12 Mar 2011 12:46:15 -0500 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4D7AB917.80103@mssl.ucl.ac.uk> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> <4D7AB917.80103@mssl.ucl.ac.uk> Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C3A5@hugo.eprize.local> > -----Original Message----- > From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] > On Behalf Of Alan Brown > Sent: Friday, March 11, 2011 7:07 PM > > Personal observation: GFS and GFS2 currently have utterly rotten performance for > activities involving many small files, such as NFS exporting /home via NFS sync > mounts. They also fails miserably if there are a lot of files in a single directory (more > than 5-700, with things getting unusable beyond about 1500 files) While I certainly agree there are common scenarios in which GFS performs slowly (backup by rsync is one), your characterization of GFS performance within large directories isn't completely fair. Here's a test I just ran on a cluster node, immediately after rebooting, joining the cluster and mounting a GFS filesystem: [root at cluster1 76]# time ls 00076985.ts 28d80a9c.ts 52b778d2.ts 7f50762b.ts a9c5f908.ts d39d0032.ts 00917c3e.ts 28de643b.ts 532d3fd7.ts 7f5dea46.ts a9e0328b.ts d3bcc9fb.ts ... 289d2764.ts 527b6f37.ts 7f3e5c9a.ts a989df77.ts d36c57fc.ts 28c3aa38.ts 52ab865f.ts 7f3e9278.ts a9aa3dba.ts d392d793.ts real 0m0.034s user 0m0.008s sys 0m0.004s [root at cluster1 76]# ls | wc -l 1970 The key is that only a few locks are needed to list the directory: [root at cluster1 76]# gfs_tool counters /tb2 locks 32 locks held 25 Running "ls -l" on the same directory takes a bit longer (by a factor of about 20): [root at cluster1 76]# time ls -l total 1970 -rw-r----- 1 root root 42 Mar 2 12:01 00076985.ts -rw-r----- 1 root root 42 Mar 2 12:01 00917c3e.ts -rw-r----- 1 root root 42 Mar 2 12:01 00b60c66.ts ... 
-rw-r----- 1 root root 42 Mar 2 12:01 ffc02edd.ts -rw-r----- 1 root root 42 Mar 2 12:01 ffefd00a.ts -rw-r----- 1 root root 42 Mar 2 12:01 fff80ff6.ts real 0m0.641s user 0m0.032s sys 0m0.032s presumably because it has to acquire quite a few additional locks: [root at cluster1 76]# gfs_tool counters /tb2 locks 3972 locks held 3965 For better or worse, "ls -l" (or equivalently, the aliased "ls --color=tty" for Red Hat users) is a very common operation for interactive users, and such users often have an immediate negative reaction to using GFS as a consequence. In my personal opinion: - Decades of work on Linux have optimized local filesystem performance and system call performance to the point that system call overhead is often treated as negligible for most applications. Running "ls -l" within a large directory is a slow, expensive operations on any system, but if it "feels" fast enough (in terms of wall clock time, not compute cycles) there's little incentive to optimize it further. I find this is true of software applications as well. It's shocking to me how many unnecessary system calls our own applications make, often as a result of libraries such as glibc. - Cluster filesystems require a lot of network communication to maintain perfect consistency. The network protocols used by (e.g.) DLM to maintain this consistency are probably slower than the methods of maintaining memory cache consistency on a SMP system by several orders of magnitude. It follows that assumptions about stat() performance on a local filesystem do not necessarily hold on a clustered filesystem, and application performance can suffer as a result. - Overcoming this may involve significant changes to the Linux system call interface (assuming there won't be a hardware solution anytime soon). For example, relying on the traditional stat() interface for file metadata limits us to one file per system call. In the case of a clustered filesystem, stat() often triggers a synchronous network round-trip via the locking protocol. A theoretical stat() interface that supports looking up multiple files at once would be an improvement, but is relatively difficult to implement because it would entail changing the system kernel, libraries, and application software. - Ethernet is a terrible medium for a distributed locking protocol. Ethernet is well suited for applications needing high bandwidth that are not particularly sensitive to latency. DLM doesn't need lots of bandwidth, but is very sensitive to latency. There exists better hardware for this (e.g. http://www.dolphinics.com/products/pemb-sci-d352.html) than Ethernet, but alas Ethernet is ubiquitous and little work has been done in the cluster community to support alternative hardware as far as I am aware. As an example, while running a "du" command on my GFS mount point, I observed the Ethernet traffic peak: 12:20:33 PM IFACE rxpck/s txpck/s rxbyt/s txbyt/s rxcmp/s txcmp/s rxmcst/s 12:20:38 PM eth0 3517.60 3520.60 545194.80 631191.20 0.00 0.00 0.00 So a few thousand packets per second is the best this cluster node could muster. Average packet sizes are less than 200 bytes each way. I'm sure I could bring in my network experts and improve these results somewhat, maybe with hardware that supports TCP offloading, but you'd never improve this by more than perhaps an order of magnitude because you're hitting the limits of what Ethernet hardware can do. 
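A quick way to see the one-system-call-per-file behaviour described above (a sketch only; the directory path is a placeholder and the exact syscall name can vary by platform and coreutils version):

strace -c -e trace=lstat ls -l /path/to/large/gfs/dir > /dev/null

The call count in strace's summary roughly tracks the number of directory entries, and on GFS each of those stats can mean another cluster lock, which is where the synchronous DLM round-trips come from.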
In summary, the state of the art in Linux clustered filesystems is unlikely to change much until we change the way we write software applications to optimize system call usage, or redesign the system call interface to take better advantage of distributed locking protocols, or start using new hardware that provides for distributed shared memory much more efficiently than Ethernet is capable of. Until any of those things happen, many users are bound to be unimpressed with GFS and similar clustered filesystems, relegating these to remain a niche technology. -Jeff From ajb2 at mssl.ucl.ac.uk Sat Mar 12 21:21:45 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Sat, 12 Mar 2011 21:21:45 +0000 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C3A5@hugo.eprize.local> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> <4D7AB917.80103@mssl.ucl.ac.uk> <64D0546C5EBBD147B75DE133D798665F0855C3A5@hugo.eprize.local> Message-ID: <4D7BE3E9.2050101@mssl.ucl.ac.uk> On 12/03/11 17:46, Jeff Sturm wrote: > [root at cluster1 76]# ls | wc -l > 1970 > > The key is that only a few locks are needed to list the directory: > You assume NFS clients are simply using "ls" > Running "ls -l" on the same directory takes a bit longer (by a factor of > about 20): > Or more. Try it with 256, 512, 1024 and 4096 files in the directory Then try it with 16k files, 32k, 64k and 128k Yes, users do have directories this large. > For better or worse, "ls -l" (or equivalently, the aliased "ls > --color=tty" for Red Hat users) is a very common operation for > interactive users, and such users often have an immediate negative > reaction to using GFS as a consequence. Those users are paying for GFS installations. They have every right to criticize its shockingly poor performance for these operations, especially when it adversely impacts their ability to get work done. In addition the same problem appears every time a backup is run - even incrementals need to stat each file in order to find out what's changed. Having a 2million file filesystem take 28 hours to run an incremental vs 10 minutes for the same thing on ext3/4 doesn't go down at all well. What you've said is right, but also comes across to the average academic as condescending - which is a fast way of further alienating them. As far as most users are concerned, a computer is a black box. You put files in, you get files out. If it's shockingly slow it's _not_ their problem, it's the problem of whoever installed it - and it doesn't help that GFS has been sold as production-ready when it's only useful in a limited range of filesystem activities. AB From ajb2 at mssl.ucl.ac.uk Sat Mar 12 22:45:25 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Sat, 12 Mar 2011 22:45:25 +0000 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C3A5@hugo.eprize.local> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> <4D7AB917.80103@mssl.ucl.ac.uk> <64D0546C5EBBD147B75DE133D798665F0855C3A5@hugo.eprize.local> Message-ID: <4D7BF785.6070006@mssl.ucl.ac.uk> I missed somthing: On 12/03/11 17:46, Jeff Sturm wrote: > As an example, while running a "du" command on my GFS mount point, I > observed the Ethernet traffic peak: > > 12:20:33 PM IFACE rxpck/s txpck/s rxbyt/s txbyt/s > rxcmp/s txcmp/s rxmcst/s > 12:20:38 PM eth0 3517.60 3520.60 545194.80 631191.20 > 0.00 0.00 0.00 Mount the GFS filesystem on one node only, lock_dlm and repeat all the tests. 
Observe the network traffic. The latencies aren't in the Ethernet layer (at the moment) AB From yvette at dbtgroup.com Sat Mar 12 23:00:41 2011 From: yvette at dbtgroup.com (yvette hirth) Date: Sat, 12 Mar 2011 23:00:41 +0000 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4D7BE3E9.2050101@mssl.ucl.ac.uk> References: <4f996c7c.1356a.12e9af733aa.Coremail.ooolinux@163.com> <4D7AB917.80103@mssl.ucl.ac.uk> <64D0546C5EBBD147B75DE133D798665F0855C3A5@hugo.eprize.local> <4D7BE3E9.2050101@mssl.ucl.ac.uk> Message-ID: <4D7BFB19.7020301@dbtgroup.com> Alan Brown wrote: > Those users are paying for GFS installations. oh? i've got the full cluster suite running here, from CentOS. i don't remember receiving a bill... > In addition the same problem appears every time a backup is run - even > incrementals need to stat each file in order to find out what's changed. > Having a 2million file filesystem take 28 hours to run an incremental vs > 10 minutes for the same thing on ext3/4 doesn't go down at all well. if you have 2million files on one filesystem, methinks that GFS et al are doing the best that they can. perhaps GFS is not the real issue... we had issues with GFS; we flattened the big directories, and now things run much smoother. slower than extX, and much slower than XFS, but since we can backup two machines to the same filesystem concurrently, we're not complaining... > What you've said is right, but also comes across to the average academic > as condescending - which is a fast way of further alienating them. "There is no offense where none is taken." --old Vulcan sayinig > As far as most users are concerned, a computer is a black box. You put > files in, you get files out. If it's shockingly slow it's _not_ their > problem, it's the problem of whoever installed it - and it doesn't help > that GFS has been sold as production-ready when it's only useful in a > limited range of filesystem activities. while we have found that GFS is indeed production ready, one doesn't use a moving van to participate in the Indy 500. caveat emptor. yvette From rpeterso at redhat.com Sat Mar 12 23:13:05 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Sat, 12 Mar 2011 18:13:05 -0500 (EST) Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4D7BE3E9.2050101@mssl.ucl.ac.uk> Message-ID: <368243113.417601.1299971585048.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | Those users are paying for GFS installations. They have every right to | criticize its shockingly poor performance for these operations, | especially when it adversely impacts their ability to get work done. Hi, Agreed. We're abundantly aware of the performance problems, and we're not ignoring them. People, please bear in mind that Red Hat is also working diligently to improve all aspects of gfs2 performance, and we've made great strides. Cases in point: (1) We recently found and fixed a problem that caused the dlm to pass locking traffic much slower than possible. (2) We recently increased the speed and accuracy of fsck.gfs2 quite a bit. (3) We also recently developed a patch that improves GFS2's management of cluster locks by making hold times self-tuning. This makes gfs2 perform much faster in many situations. (4) We've recently developed another performance patch that sped up clustered deletes (unlinks) as much as 25%. (5) We recently identified and fixed a performance problem related to writing large files that sped things up considerably. 
These patches are in various stages of development, and most or all have already been posted to the public cluster-devel mailing list, of various records in bugzilla, which means they're making their way to a kernel (or user-space) near you. Our work continues; we're improving it every day and have more performance improvements planned. I don't know about ocfs2, but there's a whole team of people at Red Hat plus the open source community at large working to improve gfs2. Regards, Bob Peterson Red Hat File Systems From ajb2 at mssl.ucl.ac.uk Sun Mar 13 04:48:17 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Sun, 13 Mar 2011 04:48:17 +0000 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <368243113.417601.1299971585048.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> References: <368243113.417601.1299971585048.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: <4D7C4C91.801@mssl.ucl.ac.uk> On 12/03/11 23:13, Bob Peterson wrote: > Agreed. We're abundantly aware of the performance problems, > and we're not ignoring them. I know Bob, thanks. > (1) We recently found and fixed a problem that caused the > dlm to pass locking traffic much slower than possible. Is this rolled into 2.6.18-238.5.1.el5 ? > (2) We recently increased the speed and accuracy of fsck.gfs2 > quite a bit. Noted and appreciated. I had cause to use them a few days ago. > (3) We also recently developed a patch that improves GFS2's > management of cluster locks by making hold times self-tuning. > This makes gfs2 perform much faster in many situations. Great > (4) We've recently developed another performance patch that > sped up clustered deletes (unlinks) as much as 25%. Good. This has been a real cow but at least for this kind of thing users simply tend to go for lunch and let it run. > (5) We recently identified and fixed a performance problem > related to writing large files that sped things up considerably. See question 1 :) Can I get hotfixes if possible? (el5.6 x64) AB From parvez.h.shaikh at gmail.com Sun Mar 13 07:19:42 2011 From: parvez.h.shaikh at gmail.com (Parvez Shaikh) Date: Sun, 13 Mar 2011 12:49:42 +0530 Subject: [Linux-cluster] Two node cluster - a potential problem of node fencing each other? Message-ID: Hi all, I have a question pertaining to two node cluster, I have RHEL 5.5 and cluster along with it which at least should have two nodes. In a situation where both nodes of the cluster are up, and have reliable connection to fencing device (e.g. power switch OR any other power fencing device) and heartbeat link between two nodes goes down. Each node finds another node is down (because heartbeat IP becomes unreachable) and tries to fence each other. Is this situation possible? If so, can two nodes possibly fence (in short shutdown or reboot) each other? Is there anyway out of this situation? Thanks Parvez -------------- next part -------------- An HTML attachment was scrubbed... URL: From cthulhucalling at gmail.com Sun Mar 13 07:49:02 2011 From: cthulhucalling at gmail.com (Ian Hayes) Date: Sat, 12 Mar 2011 23:49:02 -0800 Subject: [Linux-cluster] Two node cluster - a potential problem of node fencing each other? In-Reply-To: References: Message-ID: On Sat, Mar 12, 2011 at 11:19 PM, Parvez Shaikh wrote: > Hi all, > > I have a question pertaining to two node cluster, I have RHEL 5.5 and > cluster along with it which at least should have two nodes. > > In a situation where both nodes of the cluster are up, and have reliable > connection to fencing device (e.g. 
power switch OR any other power fencing > device) and heartbeat link between two nodes goes down. > > Each node finds another node is down (because heartbeat IP becomes > unreachable) and tries to fence each other. > > Is this situation possible? If so, can two nodes possibly fence (in short > shutdown or reboot) each other? Is there anyway out of this situation? > This is a fairly common problem called "split brain". The two nodes will go into a shootout, fencing each other. There are a few ways to prevent this, such as redundant network links and the use of quorum disks. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ooolinux at 163.com Sun Mar 13 13:37:37 2011 From: ooolinux at 163.com (yue) Date: Sun, 13 Mar 2011 21:37:37 +0800 (CST) Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4D7C4C91.801@mssl.ucl.ac.uk> References: <4D7C4C91.801@mssl.ucl.ac.uk> <368243113.417601.1299971585048.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: 1.i need gfs2 or ocfs2 to store xen-disk image file(20G--100G),it is big file. the underlying storage is fc-san. both of them have cluster sence.so they fit for me. if gfs2 is ready for product? anyone use gfs2 in product? stability is the most important thing. 2.i have try gfs2 and ocfs2 , iozone shows , gfs2 has a good throughput when record>=512k and file size > 4G. 3.my kernel is 2.6.32 and latest. At 2011-03-13 12:48:17?"Alan Brown" wrote: >On 12/03/11 23:13, Bob Peterson wrote: >> Agreed. We're abundantly aware of the performance problems, >> and we're not ignoring them. > >I know Bob, thanks. > >> (1) We recently found and fixed a problem that caused the >> dlm to pass locking traffic much slower than possible. > >Is this rolled into 2.6.18-238.5.1.el5 ? > >> (2) We recently increased the speed and accuracy of fsck.gfs2 >> quite a bit. > >Noted and appreciated. I had cause to use them a few days ago. > >> (3) We also recently developed a patch that improves GFS2's >> management of cluster locks by making hold times self-tuning. >> This makes gfs2 perform much faster in many situations. > >Great > >> (4) We've recently developed another performance patch that >> sped up clustered deletes (unlinks) as much as 25%. > >Good. This has been a real cow but at least for this kind of thing users >simply tend to go for lunch and let it run. > >> (5) We recently identified and fixed a performance problem >> related to writing large files that sped things up considerably. > See question 1 :) > >Can I get hotfixes if possible? (el5.6 x64) > >AB > > > >-- >Linux-cluster mailing list >Linux-cluster at redhat.com >https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From parvez.h.shaikh at gmail.com Sun Mar 13 13:57:46 2011 From: parvez.h.shaikh at gmail.com (Parvez Shaikh) Date: Sun, 13 Mar 2011 19:27:46 +0530 Subject: [Linux-cluster] Two node cluster - a potential problem of node fencing each other? In-Reply-To: References: Message-ID: redundant network link - i trust you were referring to ethernet bonding. On Sun, Mar 13, 2011 at 1:19 PM, Ian Hayes wrote: > On Sat, Mar 12, 2011 at 11:19 PM, Parvez Shaikh > wrote: > >> Hi all, >> >> I have a question pertaining to two node cluster, I have RHEL 5.5 and >> cluster along with it which at least should have two nodes. >> >> In a situation where both nodes of the cluster are up, and have reliable >> connection to fencing device (e.g. 
power switch OR any other power fencing >> device) and heartbeat link between two nodes goes down. >> >> Each node finds another node is down (because heartbeat IP becomes >> unreachable) and tries to fence each other. >> >> Is this situation possible? If so, can two nodes possibly fence (in short >> shutdown or reboot) each other? Is there anyway out of this situation? >> > > This is a fairly common problem called "split brain". The two nodes will go > into a shootout, fencing each other. There are a few ways to prevent this, > such as redundant network links and the use of quorum disks. > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: From thomas at sjolshagen.net Sun Mar 13 16:21:40 2011 From: thomas at sjolshagen.net (Thomas Sjolshagen) Date: Sun, 13 Mar 2011 12:21:40 -0400 Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: References: <4D7C4C91.801@mssl.ucl.ac.uk> <368243113.417601.1299971585048.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: <257E2D73-391F-4FA0-84FE-8E5A97F8CD82@sjolshagen.net> I'm using gfs2 to host KVM vm image files for a pair of clustered hosts. Am using iscsi targets for the vm data devices however as they are hosting imap spools. No stability issues or performance problems I can readily or easily attribute to the gfs2 FS in my use case. // Thomas On Mar 13, 2011, at 9:37 AM, yue wrote: > 1.i need gfs2 or ocfs2 to store xen-disk image file(20G--100G),it is big file. the underlying storage is fc-san. both of them have cluster sence.so they fit for me. > if gfs2 is ready for product? anyone use gfs2 in product? stability is the most important thing. > 2.i have try gfs2 and ocfs2 , iozone shows , gfs2 has a good throughput when record>=512k and file size > 4G. > 3.my kernel is 2.6.32 and latest. > > > At 2011-03-13 12:48:17?"Alan Brown" wrote: > > >On 12/03/11 23:13, Bob Peterson wrote: > >> Agreed. We're abundantly aware of the performance problems, > >> and we're not ignoring them. > > > >I know Bob, thanks. > > > >> (1) We recently found and fixed a problem that caused the > >> dlm to pass locking traffic much slower than possible. > > > >Is this rolled into 2.6.18-238.5.1.el5 ? > > > >> (2) We recently increased the speed and accuracy of fsck.gfs2 > >> quite a bit. > > > >Noted and appreciated. I had cause to use them a few days ago. > > > >> (3) We also recently developed a patch that improves GFS2's > >> management of cluster locks by making hold times self-tuning. > >> This makes gfs2 perform much faster in many situations. > > > >Great > > > >> (4) We've recently developed another performance patch that > >> sped up clustered deletes (unlinks) as much as 25%. > > > >Good. This has been a real cow but at least for this kind of thing users > >simply tend to go for lunch and let it run. > > > >> (5) We recently identified and fixed a performance problem > >> related to writing large files that sped things up considerably. > > See question 1 :) > > > >Can I get hotfixes if possible? 
(el5.6 x64) > > > >AB > > > > > > > >-- > >Linux-cluster mailing list > >Linux-cluster at redhat.com > >https://www.redhat.com/mailman/listinfo/linux-cluster > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From ooolinux at 163.com Mon Mar 14 04:43:59 2011 From: ooolinux at 163.com (yue) Date: Mon, 14 Mar 2011 12:43:59 +0800 (CST) Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <257E2D73-391F-4FA0-84FE-8E5A97F8CD82@sjolshagen.net> References: <257E2D73-391F-4FA0-84FE-8E5A97F8CD82@sjolshagen.net> <4D7C4C91.801@mssl.ucl.ac.uk> <368243113.417601.1299971585048.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> Message-ID: <363e5b.9647.12eb2ad844f.Coremail.ooolinux@163.com> 1.thanks, i have 20-100 nodes. anyone knows how citrix does ? At 2011-03-14 00:21:40?"Thomas Sjolshagen" wrote: >I'm using gfs2 to host KVM vm image files for a pair of clustered hosts. Am using iscsi targets for the vm data devices however as they are hosting imap spools. No stability issues or performance problems I can readily or easily attribute to the gfs2 FS in my use case. > >// Thomas > >On Mar 13, 2011, at 9:37 AM, yue wrote: > >> 1.i need gfs2 or ocfs2 to store xen-disk image file(20G--100G),it is big file. the underlying storage is fc-san. both of them have cluster sence.so they fit for me. >> if gfs2 is ready for product? anyone use gfs2 in product? stability is the most important thing. >> 2.i have try gfs2 and ocfs2 , iozone shows , gfs2 has a good throughput when record>=512k and file size > 4G. >> 3.my kernel is 2.6.32 and latest. >> >> >> At 2011-03-13 12:48:17?"Alan Brown" wrote: >> >> >On 12/03/11 23:13, Bob Peterson wrote: >> >> Agreed. We're abundantly aware of the performance problems, >> >> and we're not ignoring them. >> > >> >I know Bob, thanks. >> > >> >> (1) We recently found and fixed a problem that caused the >> >> dlm to pass locking traffic much slower than possible. >> > >> >Is this rolled into 2.6.18-238.5.1.el5 ? >> > >> >> (2) We recently increased the speed and accuracy of fsck.gfs2 >> >> quite a bit. >> > >> >Noted and appreciated. I had cause to use them a few days ago. >> > >> >> (3) We also recently developed a patch that improves GFS2's >> >> management of cluster locks by making hold times self-tuning. >> >> This makes gfs2 perform much faster in many situations. >> > >> >Great >> > >> >> (4) We've recently developed another performance patch that >> >> sped up clustered deletes (unlinks) as much as 25%. >> > >> >Good. This has been a real cow but at least for this kind of thing users >> >simply tend to go for lunch and let it run. >> > >> >> (5) We recently identified and fixed a performance problem >> >> related to writing large files that sped things up considerably. >> > See question 1 :) >> > >> >Can I get hotfixes if possible? (el5.6 x64) >> > >> >AB >> > >> > >> > >> >-- >> >Linux-cluster mailing list >> >Linux-cluster at redhat.com >> >https://www.redhat.com/mailman/listinfo/linux-cluster >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster > >-- >Linux-cluster mailing list >Linux-cluster at redhat.com >https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From raju.rajsand at gmail.com Mon Mar 14 09:50:13 2011 From: raju.rajsand at gmail.com (Rajagopal Swaminathan) Date: Mon, 14 Mar 2011 15:20:13 +0530 Subject: [Linux-cluster] Two node cluster - a potential problem of node fencing each other? In-Reply-To: References: Message-ID: Greetings, On Sun, Mar 13, 2011 at 7:27 PM, Parvez Shaikh wrote: > redundant network link - i trust you were referring to ethernet bonding. > >> >> This is a fairly common problem called "split brain". The two nodes will >> go into a shootout, fencing each other. There are a few ways to prevent >> this, such as redundant network links and the use of quorum disks. >> >> No .it is not bonding It is another IP address accessible to each node of the cluster (perhaps the gateway?-- can anybody expand on this a bit) and Quorum disk is another LUN, say about 100mb per node, accessible to the cluster (IOW, external storage) Regards, Rajagopal From rpeterso at redhat.com Mon Mar 14 13:16:15 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Mon, 14 Mar 2011 09:16:15 -0400 (EDT) Subject: [Linux-cluster] which is better gfs2 and ocfs2? In-Reply-To: <4D7C4C91.801@mssl.ucl.ac.uk> Message-ID: <350344129.425432.1300108575392.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | > (1) We recently found and fixed a problem that caused the | > dlm to pass locking traffic much slower than possible. | | Is this rolled into 2.6.18-238.5.1.el5 ? Yes, it was added starting with 2.6.18-232 | > (5) We recently identified and fixed a performance problem | > related to writing large files that sped things up | > considerably. This one is still in patch form. Some of our customers are testing it in production now, but it hasn't made its way to any official kernels yet. | | Can I get hotfixes if possible? (el5.6 x64) | | AB If you're a Red Hat customer you should contact our support people. We don't have kernels built for other distros, but as I said, the patches are all posted in various places. The first place to look is the cluster-devel mailing list. The archives are here: https://www.redhat.com/archives/cluster-devel/ The clustered unlink patch is here: https://www.redhat.com/archives/cluster-devel/2011-February/msg00059.html The self-tuning glocks patch is here: https://www.redhat.com/archives/cluster-devel/2011-January/msg00079.html The "large file" slowdown patch only affects RHEL5, so the upstream code and RHEL6 don't have that problem. The rhel5 patch is attached to this bugzilla bug (not sure if it's public or private): https://bugzilla.redhat.com/show_bug.cgi?id=683155 And as for fsck.gfs2, I think the performance patches are planned for RHEL5.7. Regards, Bob Peterson Red Hat File Systems From iarlyy at gmail.com Mon Mar 14 13:50:29 2011 From: iarlyy at gmail.com (iarly selbir) Date: Mon, 14 Mar 2011 10:50:29 -0300 Subject: [Linux-cluster] time of left/join member Message-ID: I was checking my two node cluster, I noticed that one node is down, my question is how to find out when this node left the cluster? assuming that services already running on the other node I can't see why this node "maybe" was fenced and powered off. /var/log/messages was not clear enough to me, my only information I only got the time when machine was powered off in lastlog. Thanks in advance for any suggestion. - - iarlyy selbir :wq! -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From linux at alteeve.com Mon Mar 14 13:57:03 2011 From: linux at alteeve.com (Digimer) Date: Mon, 14 Mar 2011 09:57:03 -0400 Subject: [Linux-cluster] time of left/join member In-Reply-To: References: Message-ID: <4D7E1EAF.50205@alteeve.com> On 03/14/2011 09:50 AM, iarly selbir wrote: > I was checking my two node cluster, I noticed that one node is down, my > question is how to find out when this node left the cluster? assuming > that services already running on the other node I can't see why this > node "maybe" was fenced and powered off. > > /var/log/messages was not clear enough to me, my only information I only > got the time when machine was powered off in lastlog. > > Thanks in advance for any suggestion. The surviving node's /var/log/messages should contain mention of when it lost contact with the node and reformed the cluster. -- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From thiagoh at digirati.com.br Mon Mar 14 14:03:06 2011 From: thiagoh at digirati.com.br (Thiago Henrique) Date: Mon, 14 Mar 2011 11:03:06 -0300 Subject: [Linux-cluster] Two node cluster benchmark Message-ID: <1300111386.2148.35.camel@thiagohenrique06> Hello, I have a two node cluster configured like: Ubuntu 10.04 + CMAN + DRBD + GFS2 In a benchmark, I run simultaneously on both nodes, a script that make write operations in the filesystem until it fills. But when I run the benchmark, foo-node remains almost the whole time waiting for bar-node write to the file system. Is this normal? How could I optimize this? Cman config: ################################################################################ ################################################################################ GFS2 config: ################################################################################ mkfs.gfs2 -p lock_dlm -t MyCluster:MyFileSystem -j 2 /dev/drbd2 mount.gfs2 /dev/drbd2 /var/fs_tests/gfs2/ -o noatime ################################################################################ Thank you in advance Best regards -- Thiago Henrique From ajb2 at mssl.ucl.ac.uk Mon Mar 14 14:14:37 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Mon, 14 Mar 2011 14:14:37 +0000 Subject: [Linux-cluster] Resource groups Message-ID: <4D7E22CD.2060009@mssl.ucl.ac.uk> Bob: You say this in your best practice document: "Our performance testing lab has experimented with various resource group sizes and found a performance problem with anything bigger than 768MB. Until this is properly diagnosed, we recommend staying below 768MB." What are the details? Nearly all of our FSes are created with 2Gb RGs. From rpeterso at redhat.com Mon Mar 14 14:30:11 2011 From: rpeterso at redhat.com (Bob Peterson) Date: Mon, 14 Mar 2011 10:30:11 -0400 (EDT) Subject: [Linux-cluster] Resource groups In-Reply-To: <4D7E22CD.2060009@mssl.ucl.ac.uk> Message-ID: <412215982.427568.1300113011942.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com> ----- Original Message ----- | Bob: | | You say this in your best practice document: | | "Our performance testing lab has experimented with various resource | group sizes and found a performance problem with anything bigger than | 768MB. Until this is properly diagnosed, we recommend staying below | 768MB." | | What are the details? Nearly all of our FSes are created with 2Gb RGs. Hi, I'm afraid I don't have many more details. This was just a comment that one of our performance guys sent to me a while back. 
I haven't had a chance to investigate his claims or look into what's going on. I'll do some tests of my own, and if I can recreate a performance problem based on rgrp size, I'll open a bugzilla record and analyze what's going on. Regards, Bob Peterson Red Hat File Systems From jduston at ll.mit.edu Mon Mar 14 22:35:30 2011 From: jduston at ll.mit.edu (Jack Duston) Date: Mon, 14 Mar 2011 18:35:30 -0400 Subject: [Linux-cluster] GFS2 file system maintenance question. Message-ID: <4D7E9832.40000@ll.mit.edu> Hello folks, I am planning to create a 2 node cluster with a GFS2 CLVM SAN. The following Note in the RHEL6 GFS2 manual jumped out at me: Chapter 3. Managing GFS2 Note: Once you have created a GFS2 file system with the mkfs.gfs2 command, you cannot decrease the size of the file system. You can, however, increase the size of an existing file system with the gfs2_grow command, as described in Section 3.6, ?Growing a File System?. This seems to me to make a GFS2 LV un-maintainable. What concerns me is the issue of how to remove a LUN from the GFS2 LV. This will be a necessity *when* there are hardware problems with a storage unit, End of Life/obsolescence (a la XRaid), or upgrade (replace 1TB HDDS with 3 TB HDDs in the LUNs). Hardware does not last forever, and manufacturers do EOL products or go out of business. I had also hoped to upgrade the 1TB HDDs in our current LUNs with 3 TB HDDs next year. I planned to free up enough space on the GFS2 LV to migrate data off one LUN. I could then decrease the GFS2 file system size, remove the LUN from the LV, destroy the RAID LUN, replace 1TB HDDs with 3TB HDDs, rebuild the RAID LUN, add the new larger LUN to the LV, increase the GFS2 file system size, and repeat migrating data off the next LUN. If the above note is correct, it seems to only way to deal with a hardware issue, obsolescence/EOL, or upgrading components is to destroy the entire GFS2 file system, build a new GFS2 file system from scratch, and restore data from backups. This might not be too bad with a small SAN of 20TB, but our data will exceed 100TB and it would be good not to have to rebuild Rome in a day. Can anyone confirm that GFS2 file system cannot be decreased? If so, is there any plan to add this capability/fix this issue in a future release? Is there another/better way to remove a LUN from GFS2 than what I considered? Any info greatly appreciated. From ooolinux at 163.com Tue Mar 15 01:35:14 2011 From: ooolinux at 163.com (yue) Date: Tue, 15 Mar 2011 09:35:14 +0800 (CST) Subject: [Linux-cluster] GFS2 file system maintenance question. In-Reply-To: <4D7E9832.40000@ll.mit.edu> References: <4D7E9832.40000@ll.mit.edu> Message-ID: <21f7a2.18b2.12eb72713be.Coremail.ooolinux@163.com> 1. GFS2 is based on a 64-bit architecture, which can theoretically accommodate an 8 EB file system. However, the current supported maximum size of a GFS2 file system is 25 TB. If your system requires GFS2 file systems larger than 25 TB, contact your Red Hat service representative. At 2011-03-15 06:35:30?"Jack Duston" wrote: >Hello folks, > >I am planning to create a 2 node cluster with a GFS2 CLVM SAN. >The following Note in the RHEL6 GFS2 manual jumped out at me: > >Chapter 3. Managing GFS2 >Note: >Once you have created a GFS2 file system with the mkfs.gfs2 command, you >cannot decrease the size of the file system. You can, however, increase >the size of an existing file system with the gfs2_grow command, as >described in Section 3.6, ?Growing a File System?. 
> >This seems to me to make a GFS2 LV un-maintainable. > >What concerns me is the issue of how to remove a LUN from the GFS2 LV. >This will be a necessity *when* there are hardware problems with a >storage unit, End of Life/obsolescence (a la XRaid), or upgrade (replace >1TB HDDS with 3 TB HDDs in the LUNs). > >Hardware does not last forever, and manufacturers do EOL products or go >out of business. >I had also hoped to upgrade the 1TB HDDs in our current LUNs with 3 TB >HDDs next year. > >I planned to free up enough space on the GFS2 LV to migrate data off one >LUN. I could then decrease the GFS2 file system size, remove the LUN >from the LV, destroy the RAID LUN, replace 1TB HDDs with 3TB HDDs, >rebuild the RAID LUN, add the new larger LUN to the LV, increase the >GFS2 file system size, and repeat migrating data off the next LUN. > >If the above note is correct, it seems to only way to deal with a >hardware issue, obsolescence/EOL, or upgrading components is to destroy >the entire GFS2 file system, build a new GFS2 file system from scratch, >and restore data from backups. This might not be too bad with a small >SAN of 20TB, but our data will exceed 100TB and it would be good not to >have to rebuild Rome in a day. > >Can anyone confirm that GFS2 file system cannot be decreased? If so, is >there any plan to add this capability/fix this issue in a future >release? Is there another/better way to remove a LUN from GFS2 than what >I considered? > >Any info greatly appreciated. > >-- >Linux-cluster mailing list >Linux-cluster at redhat.com >https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From bergman at merctech.com Tue Mar 15 04:11:41 2011 From: bergman at merctech.com (bergman at merctech.com) Date: Tue, 15 Mar 2011 00:11:41 -0400 Subject: [Linux-cluster] quorum device not getting a vote causes 2-node cluster to be inquorate Message-ID: <25829.1300162301@mirchi> I have been using a 2-node cluster with a quorum disk successfully for about 2 years. Beginning today, the cluster will not boot correctly. The RHCS services start, but fencing fails with: dlm: no local IP address has been set dlm: cannot start dlm lowcomms -107 This seems to be a symtpom of the fact that the cluster votes do not include votes from the quorum device: # clustat Cluster Status for example-infra @ Tue Mar 15 00:02:35 2011 Member Status: Inquorate Member Name ID Status ------ ---- ---- ------ example-infr2-admin.domain.com 1 Online, Local example-infr1-admin.domain.com 2 Offline /dev/mpath/quorum 0 Offline [root at example-infr2 ~]# cman_tool status Version: 6.2.0 Config Version: 239 Cluster Name: example-infra Cluster Id: 42813 Cluster Member: Yes Cluster Generation: 676844 Membership state: Cluster-Member Nodes: 1 Expected votes: 2 Total votes: 1 Quorum: 2 Activity blocked Active subsystems: 7 Flags: Ports Bound: 0 Node name: example-infr2-admin.domain.com Node ID: 1 Multicast addresses: 239.192.167.228 Node addresses: 192.168.110.3 The shared-SAN-disk quorum device is readable from each node. Testing with "mkqdisk -L" and "dd if=/dev/mpath/quorum of=/dev/quorum.dump" both succeed from each node. 
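(For cross-checking, a short sketch of commands that show whether qdiskd has actually registered its vote with cman; all of these are standard parts of the RHEL 5 cluster packages, and the output will of course differ per cluster:

cman_tool nodes
cman_tool status | egrep 'Expected votes|Total votes|Quorum'
mkqdisk -L
service qdiskd status
)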
When run in the foreground, "qdisk -d -f" gives messages that seem to indicate that it is successful: # qdiskd -d -f [22568] debug: Loading configuration information [22568] debug: Heuristic: '/bin/ping -c3 -W1 -t2 192.168.110.10' score=1 interval=2 tko=9 [22568] debug: 1 heuristics loaded [22568] debug: Quorum Daemon: 1 heuristics, 3 interval, 15 tko, 1 votes [22568] debug: Run Flags: 00000035 [22568] info: Quorum Daemon Initializing [22568] debug: I/O Size: 512 Page Size: 4096 [22569] info: Heuristic: '/bin/ping -c3 -W1 -t2 192.168.110.10' UP [22568] debug: Node 3 is UP [22568] info: Node 3 is the master [22568] info: Initial score 1/1 [22568] info: Initialization complete [22568] notice: Score sufficient for master operation (1/1; required=1); upgrading Any suggestions? Thanks, Mark ------------Versions----------------- Linux example-infr2.domain.com 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 lvm2-cluster-2.02.56-7.el5_5.4 cman-2.0.115-34.el5_5.4 system-config-cluster-1.0.57-3.el5_5.1 rgmanager-2.0.52-6.el5.centos.8 ----------excerpt from cluster.conf---------------- -------------------------------------------------------------------------- From fdinitto at redhat.com Tue Mar 15 05:22:13 2011 From: fdinitto at redhat.com (Fabio M. Di Nitto) Date: Tue, 15 Mar 2011 06:22:13 +0100 Subject: [Linux-cluster] quorum device not getting a vote causes 2-node cluster to be inquorate In-Reply-To: <25829.1300162301@mirchi> References: <25829.1300162301@mirchi> Message-ID: <4D7EF785.8070708@redhat.com> On 03/15/2011 05:11 AM, bergman at merctech.com wrote: > I have been using a 2-node cluster with a quorum disk successfully for > about 2 years. Beginning today, the cluster will not boot correctly. > > The RHCS services start, but fencing fails with: > > dlm: no local IP address has been set > dlm: cannot start dlm lowcomms -107 > > This seems to be a symtpom of the fact that the cluster votes do not include votes from the quorum > device: > > # clustat > Cluster Status for example-infra @ Tue Mar 15 00:02:35 2011 > Member Status: Inquorate > > Member Name ID Status > ------ ---- ---- ------ > example-infr2-admin.domain.com 1 Online, Local > example-infr1-admin.domain.com 2 Offline > /dev/mpath/quorum 0 Offline > > [root at example-infr2 ~]# cman_tool status > Version: 6.2.0 > Config Version: 239 > Cluster Name: example-infra > Cluster Id: 42813 > Cluster Member: Yes > Cluster Generation: 676844 > Membership state: Cluster-Member > Nodes: 1 > Expected votes: 2 > Total votes: 1 > Quorum: 2 Activity blocked > Active subsystems: 7 > Flags: > Ports Bound: 0 > Node name: example-infr2-admin.domain.com > Node ID: 1 > Multicast addresses: 239.192.167.228 > Node addresses: 192.168.110.3 You should check the output from cman_tool nodes. It appears that the nodes are not seeing each other at all. The first things I would check are iptables, node names resolves to the correct ip addresses, selinux and eventually if the switch in between the nodes support multicast. Fabio From ooolinux at 163.com Tue Mar 15 05:37:20 2011 From: ooolinux at 163.com (yue) Date: Tue, 15 Mar 2011 13:37:20 +0800 (CST) Subject: [Linux-cluster] is ocfs2 is limited 16T In-Reply-To: <4D73805F.8020308@srce.hr> References: <4D73805F.8020308@srce.hr> <56a50421.709d.12e89a4c7cb.Coremail.ooolinux@163.com> Message-ID: <2baf4fa.6fc9.12eb804b8c4.Coremail.ooolinux@163.com> how to rebuild ocfs2.ko what is needed to changed? 
Thanks. At 2011-03-06 20:38:55, "Jakov Sosic" wrote:
>On 03/06/2011 06:30 AM, yue wrote:
>> if there is a limit on ocfs2's volume? must it be less than 16T?
>
>For RHEL v5.x and derivatives, yes. But you can hack it and rebuild
>kernel modules without the limitation. You also need to patch the kernel
>sources and rebuild the kernel too.
>
>
>--
>Jakov Sosic
>www.srce.hr
>
>
From rmitchel at redhat.com Tue Mar 15 07:09:17 2011
From: rmitchel at redhat.com (Ryan Mitchell)
Date: Tue, 15 Mar 2011 17:09:17 +1000
Subject: [Linux-cluster] GFS2 file system maintenance question.
In-Reply-To: <4D7E9832.40000@ll.mit.edu>
References: <4D7E9832.40000@ll.mit.edu>
Message-ID: <4D7F109D.4000404@redhat.com>

On 03/15/2011 08:35 AM, Jack Duston wrote:
>
> I planned to free up enough space on the GFS2 LV to migrate data off
> one LUN. I could then decrease the GFS2 file system size, remove the
> LUN from the LV, destroy the RAID LUN, replace 1TB HDDs with 3TB HDDs,
> rebuild the RAID LUN, add the new larger LUN to the LV, increase the
> GFS2 file system size, and repeat migrating data off the next LUN.
>
Hi,

No, you will not be able to use that procedure to swap LUNs. If you have the ability to present the new LUNs before removing the old LUNs from the volume group, it would be possible to:
1) vgextend the volume group using the new LUN
2) pvmove the extents from the old LUN to the new LUN
3) vgreduce the old LUN to remove it from the volume group

This could be done one LUN at a time. It doesn't even require you to grow the filesystem (unless the new LUNs are larger than the old ones). This is common and I've seen it done many times. You could even use a temporary staging LUN to shuffle the data around.

If you do not have the capacity to add additional LUNs before removing the original LUNs, then you will face a difficult migration, possibly using backup/restore as you mentioned. The feature to reduce the filesystem has not been implemented; there is no code as yet to manage it. It isn't commonly required.

Regards,

Ryan Mitchell

From fedorischev at bsu.edu.ru Tue Mar 15 08:07:21 2011
From: fedorischev at bsu.edu.ru (Fedorischev I.N.)
Date: Tue, 15 Mar 2011 11:07:21 +0300
Subject: [Linux-cluster] gfs2 volume constantly growing
Message-ID: <1300176441.4161.19.camel@ui-tcc02.bsu.edu.ru>

Hello, subscribers!

Please advise me on the following problem. Disk usage on the GFS2 volume on our cluster servers keeps growing, while the total size of the files on the partition stays small. Here it is:

# df -h
/dev/sdb5             4,7G  3,9G  853M  83% /var/log/httpd

But

# du -ch /var/log/httpd/
70M     /var/log/httpd/

I rebooted the cluster systems to kill any suspicious processes, but nothing changed. Then I unmounted the partition on both cluster nodes and ran fsck.gfs2 on it. It found many file system errors, and after that everything fell back into place. But after a week the same thing happened again: the usage is growing once more. Is this a bug in the GFS2 implementation, or something else? We are using CentOS release 5.5 x86_64 on the servers, with the dag and epel repositories for software updates.

Thanks to all.
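(One common cause of df climbing while du stays small, on GFS2 as on any filesystem, is space held by files that were deleted while some process still has them open; GFS2 in particular cannot free an unlinked inode until every node has closed it. A quick check, using the mount point from the report above; gfs2_tool ships with gfs2-utils:

lsof +L1 /var/log/httpd        # open files with link count 0, i.e. deleted but still held open
gfs2_tool df /var/log/httpd    # GFS2's own view of block usage
)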
From laszlo.budai at gmail.com Tue Mar 15 10:02:28 2011 From: laszlo.budai at gmail.com (Budai Laszlo) Date: Tue, 15 Mar 2011 12:02:28 +0200 Subject: [Linux-cluster] Service location (colocation) In-Reply-To: <20110304170647.GC14803@redhat.com> References: <4D6A7709.6060108@gmail.com> <20110304170647.GC14803@redhat.com> Message-ID: <4D7F3934.3040209@gmail.com> Is pacemaker a supported alternative for rgmanager? Starting with which version of Red Hat Enterprise Linux? Thank you, Laszlo On 03/04/2011 07:06 PM, Lon Hohberger wrote: > On Sun, Feb 27, 2011 at 06:08:41PM +0200, Budai Laszlo wrote: >> Hi all, >> >> is there a way to define location dependencies among services? for >> instance how can I define that Service A should run on the same node as >> service B? Or the opposite: Service C should run on a different node >> than service D? >> > rgmanager doesn't have this feature built-in; you can define > 'collocated services' by simply creating one large service comprising > all of the resources for both services. > > You could probably trivially extend central_processing mode to do "anti > collocation" (i.e. run on another node). > > The 'follow_service.sl' script is an example of how to do part of > 'anti-collocation'. The way it works, it starts service A on a > different node from service B. If the node running service A fails, it > is started on the same node as service B, then service B is moved away > to another (empty, usually) node in the cluster. > > Alternatively, pacemaker supports this functionality. > From andrew at beekhof.net Tue Mar 15 11:50:32 2011 From: andrew at beekhof.net (Andrew Beekhof) Date: Tue, 15 Mar 2011 12:50:32 +0100 Subject: [Linux-cluster] Service location (colocation) In-Reply-To: <4D7F3934.3040209@gmail.com> References: <4D6A7709.6060108@gmail.com> <20110304170647.GC14803@redhat.com> <4D7F3934.3040209@gmail.com> Message-ID: On Tue, Mar 15, 2011 at 11:02 AM, Budai Laszlo wrote: > Is pacemaker a supported alternative for rgmanager? Not yet, although it is available as of 6.0 > Starting with which > version of Red Hat Enterprise Linux? > > Thank you, > Laszlo > > > On 03/04/2011 07:06 PM, Lon Hohberger wrote: >> On Sun, Feb 27, 2011 at 06:08:41PM +0200, Budai Laszlo wrote: >>> Hi all, >>> >>> is there a way to define location dependencies among services? for >>> instance how can I define that Service A should run on the same node as >>> service B? Or the opposite: Service C should run on a different node >>> than service D? >>> >> rgmanager doesn't have this feature built-in; you can define >> 'collocated services' by simply creating one large service comprising >> all of the resources for both services. >> >> You could probably trivially extend central_processing mode to do "anti >> collocation" (i.e. run on another node). >> >> The 'follow_service.sl' script is an example of how to do part of >> 'anti-collocation'. ? The way it works, it starts service A on a >> different node from service B. ?If the node running service A fails, it >> is started on the same node as service B, then service B is moved away >> to another (empty, usually) node in the cluster. >> >> Alternatively, pacemaker supports this functionality. 
>> > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From jayesh.shinde at netcore.co.in Tue Mar 15 12:13:14 2011 From: jayesh.shinde at netcore.co.in (jayesh.shinde) Date: Tue, 15 Mar 2011 17:43:14 +0530 Subject: [Linux-cluster] Split-brain with DRBD active-active + RHCS Message-ID: <4D7F57DA.8060204@netcore.co.in> Hi All , I don't have SAN with me , so I want to build the 2 node DRBD active active for mysql & http resource ( i.e /dev/drbd2 & /dev/drbd3 in my case) with RHCS . I configured the require setup from http://sourceware.org/cluster/wiki/DRBD_Cookbook and from DRDB links. From last 1 week I am testing the same scenario in 2 XEN vms with kenel 2.6.18-128.el5xen , Every thing is working fine , like mysql and http services move from one server to other etc... But not working correctly when it get fence ( i.e when n/w fail on of the node). *I am facing the split-brain problem. * I search a lot in google and mailling list but don't found the proper correct solution and suggestion. For Fence testing I am doing following. ================================ 1) On node1 http service is running with /dev/drbd2 2) On node2 mysql service is runing with /dev/drbd3 At this movement the DRBD primary-primary status is working correctly. 3) Now On node2 When I stop n/w service manually by "service network stop" then within 3-5 sec. node2 get fence properly and mysql service get switch on node1 properly. 4) After fencing when node2 come up , then I am facing the DRBD split-brain issue with node1 and node2. My questions :-- ========== 1) Why DRBD Split brain is not coming when I reboot or shutdown or destroy the machine by xm command i.e "xm reboot " OR xm shutdown OR xm destroy 2) Why the DRBD split brain issue come at the time of fencing node only ? 3) Is the combination of DRBD active-active + RHCS is stable ? and workable solution Because one of the below mailling list I found it's workable solution http://www.gossamer-threads.com/lists/drbd/users/20467#20467 4) In fencing Is there any extra setting require for such combination ? 5) Do I need to use any custom fencing logic ? 6) Any one using such "*DRBD active-active + RHCS*" setup in Live without split brain issue ? Please guide and suggest on the same. Regards Jayesh Shinde -------------- next part -------------- An HTML attachment was scrubbed... URL: From jayesh.shinde at netcore.co.in Tue Mar 15 12:34:59 2011 From: jayesh.shinde at netcore.co.in (jayesh.shinde) Date: Tue, 15 Mar 2011 18:04:59 +0530 Subject: [Linux-cluster] Split-brain with DRBD active-active + RHCS In-Reply-To: <4D7F57DA.8060204@netcore.co.in> References: <4D7F57DA.8060204@netcore.co.in> Message-ID: <4D7F5CF3.4050805@netcore.co.in> Hi All , Just want mention one point. I am using Ext3 filesystem in below setup. Regards Jayesh Shinde On 03/15/2011 05:43 PM, jayesh.shinde wrote: > Hi All , > > I don't have SAN with me , so I want to build the 2 node DRBD active > active for mysql & http resource ( i.e /dev/drbd2 & /dev/drbd3 in my > case) with RHCS . > > I configured the require setup from > http://sourceware.org/cluster/wiki/DRBD_Cookbook and from DRDB links. > > From last 1 week I am testing the same scenario in 2 XEN vms with > kenel 2.6.18-128.el5xen , Every thing is working fine , like mysql > and http services move from one server to other etc... But not working > correctly when it get fence ( i.e when n/w fail on of the node). > > *I am facing the split-brain problem. 
* I search a lot in google and > mailling list but don't found the proper correct solution and suggestion. > > For Fence testing I am doing following. > ================================ > 1) On node1 http service is running with /dev/drbd2 > 2) On node2 mysql service is runing with /dev/drbd3 > At this movement the DRBD primary-primary status is working correctly. > 3) Now On node2 When I stop n/w service manually by "service network > stop" then within 3-5 sec. node2 get fence properly and mysql > service get switch on node1 properly. > 4) After fencing when node2 come up , then I am facing the DRBD > split-brain issue with node1 and node2. > > My questions :-- > ========== > 1) Why DRBD Split brain is not coming when I reboot or shutdown or > destroy the machine by xm command > i.e "xm reboot " OR xm shutdown > OR xm destroy > > 2) Why the DRBD split brain issue come at the time of fencing node only ? > > 3) Is the combination of DRBD active-active + RHCS is stable ? and > workable solution > Because one of the below mailling list I found it's workable solution > http://www.gossamer-threads.com/lists/drbd/users/20467#20467 > > 4) In fencing Is there any extra setting require for such combination ? > 5) Do I need to use any custom fencing logic ? > > 6) Any one using such "*DRBD active-active + RHCS*" setup in Live > without split brain issue ? > > Please guide and suggest on the same. > > Regards > Jayesh Shinde > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bazy84 at gmail.com Tue Mar 15 13:53:51 2011 From: bazy84 at gmail.com (Bazy) Date: Tue, 15 Mar 2011 15:53:51 +0200 Subject: [Linux-cluster] Split-brain with DRBD active-active + RHCS In-Reply-To: <4D7F5CF3.4050805@netcore.co.in> References: <4D7F57DA.8060204@netcore.co.in> <4D7F5CF3.4050805@netcore.co.in> Message-ID: Hello, Good question. I myself use the manual split brain recovery after one of the nodes fails. See http://www.drbd.org/users-guide/s-resolve-split-brain.html. If anyone can share how to resolve this without manual intervention it would be great. Cheers! On Tue, Mar 15, 2011 at 2:34 PM, jayesh.shinde wrote: > Hi All , > > Just want mention one point. I am using Ext3 filesystem in below setup. > > Regards > Jayesh Shinde > > > On 03/15/2011 05:43 PM, jayesh.shinde wrote: > > Hi All , > > I don't have SAN with me , so I want to build the 2 node DRBD active active > for mysql? & http? resource ( i.e /dev/drbd2 & /dev/drbd3 in my case) with > RHCS . > > I configured the require setup from > http://sourceware.org/cluster/wiki/DRBD_Cookbook and from DRDB links. > > From last 1 week I am testing the same scenario in 2 XEN vms with kenel > 2.6.18-128.el5xen , Every thing is working fine , like mysql and http > services move from one server to other etc...? But not working correctly > when it get fence ( i.e when n/w fail on of the node). > > I am facing the split-brain problem.? I search a lot in google and mailling > list but don't found the proper correct solution and suggestion. > > For Fence testing? I am doing following. > ================================ > 1) On node1 http service is running with /dev/drbd2 > 2) On node2 mysql service is runing with /dev/drbd3 > ??? At this movement the DRBD primary-primary status is working correctly. > 3) Now On node2 When I stop n/w service manually by "service network stop" > then within 3-5 sec.? node2 get fence properly and mysql service get switch > on node1 properly. 
> 4) After fencing when node2 come up , then I am facing the DRBD split-brain > issue with node1 and node2. > > My questions :-- > ========== > 1) Why DRBD Split brain is not coming when I reboot or shutdown or destroy > the machine by xm command > ??? i.e? "xm reboot "?? OR ??????? xm shutdown > OR????? xm destroy > > 2) Why the DRBD split brain issue come at the time of fencing node only ? > > 3) Is the combination of DRBD active-active + RHCS is stable ? and workable > solution > ??? Because one of the below mailling list I found it's workable solution > ??? http://www.gossamer-threads.com/lists/drbd/users/20467#20467 > > 4) In fencing Is there any extra setting require for such combination? ? > 5) Do I need to use any custom fencing logic ? > > 6) Any one using such "DRBD active-active + RHCS"? setup in Live without > split brain issue ? > > Please guide and suggest on the same. > > Regards > Jayesh Shinde > > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From bergman at merctech.com Tue Mar 15 15:42:09 2011 From: bergman at merctech.com (bergman at merctech.com) Date: Tue, 15 Mar 2011 11:42:09 -0400 Subject: [Linux-cluster] quorum device not getting a vote causes 2-node cluster to be inquorate In-Reply-To: <4D7EF785.8070708@redhat.com> References: <25829.1300162301@mirchi> <4D7EF785.8070708@redhat.com> Message-ID: <20110315114209.73b9f0f2@mirchi> The pithy ruminations from "Fabio M. Di Nitto" on "Re: [Linux-cluster] quorum device not getting a vote causes 2-node cluster to be inquorate" were: => On 03/15/2011 05:11 AM, bergman at merctech.com wrote: => > I have been using a 2-node cluster with a quorum disk successfully for => > about 2 years. Beginning today, the cluster will not boot correctly. => > => > The RHCS services start, but fencing fails with: => > => > dlm: no local IP address has been set => > dlm: cannot start dlm lowcomms -107 => > => > This seems to be a symtpom of the fact that the cluster votes do not include votes from the quorum => > device: => > => > # clustat => > Cluster Status for example-infra @ Tue Mar 15 00:02:35 2011 => > Member Status: Inquorate => > => > Member Name ID Status => > ------ ---- ---- ------ => > example-infr2-admin.domain.com 1 Online, Local => > example-infr1-admin.domain.com 2 Offline => > /dev/mpath/quorum 0 Offline => > => > [root at example-infr2 ~]# cman_tool status => > Version: 6.2.0 => > Config Version: 239 => > Cluster Name: example-infra => > Cluster Id: 42813 => > Cluster Member: Yes => > Cluster Generation: 676844 => > Membership state: Cluster-Member => > Nodes: 1 => > Expected votes: 2 => > Total votes: 1 => > Quorum: 2 Activity blocked => > Active subsystems: 7 => > Flags: => > Ports Bound: 0 => > Node name: example-infr2-admin.domain.com => > Node ID: 1 => > Multicast addresses: 239.192.167.228 => > Node addresses: 192.168.110.3 => => You should check the output from cman_tool nodes. It appears that the => nodes are not seeing each other at all. That's correct...at the time I ran cman_tool and clustat, one node was down (deliberately, in an attempt to troubleshoot the issue, but this would also be the case in the event of a hardware failure). As I see it, the problem is not with the inter-node communication, but with the quorum device. Note that there is only one vote registered--there are no votes from the quorum device. The quorum device should provide sufficient votes to make the "cluster" quorate if only one node is running. 
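For comparison, a two-node-plus-qdisk vote layout is normally set up along these lines (a sketch with hypothetical node names and heuristic, not the actual cluster.conf from this cluster):

  <cman expected_votes="3" two_node="0"/>
  <clusternodes>
      <clusternode name="infr1-admin.domain.com" nodeid="1" votes="1"/>
      <clusternode name="infr2-admin.domain.com" nodeid="2" votes="1"/>
  </clusternodes>
  <quorumd device="/dev/mpath/quorum" votes="1"
           interval="1" tko="10" min_score="1">
      <heuristic program="ping -c1 -w1 192.168.110.1" score="1" interval="2"/>
  </quorumd>

One vote per node plus one from qdiskd gives expected_votes=3 and quorum=2, which is what should let a lone node that can still write to (and score on) the quorum device remain quorate.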
If I understand it correctly, this should also let the "cluster" start with a single node (as long as that node can write to the quorum device). If my understanding is wrong, then how can a 2-node cluster start if one node is down? => => The first things I would check are iptables, node names resolves to the => correct ip addresses, selinux and eventually if the switch in between => the nodes support multicast. SElinux is disabled (as it has been for the 2 years this cluster has been operational). There have been no switch changes. Node names & IPs resolve correctly. IPtables permits all communication between the "admin" address on the servers. => => Fabio => => -- => Linux-cluster mailing list => Linux-cluster at redhat.com => https://www.redhat.com/mailman/listinfo/linux-cluster => Thanks, Mark From m.watts at eris.qinetiq.com Tue Mar 15 16:11:45 2011 From: m.watts at eris.qinetiq.com (Mark Watts) Date: Tue, 15 Mar 2011 16:11:45 +0000 Subject: [Linux-cluster] Split-brain with DRBD active-active + RHCS In-Reply-To: <4D7F5CF3.4050805@netcore.co.in> References: <4D7F57DA.8060204@netcore.co.in> <4D7F5CF3.4050805@netcore.co.in> Message-ID: <4D7F8FC1.9030505@eris.qinetiq.com> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 03/15/2011 12:34 PM, jayesh.shinde wrote: > Hi All , > > Just want mention one point. I am using Ext3 filesystem in below setup. Dual Primary DRBD (Active-Active) and EXT3 are mutually exclusive. You should be using GFS(2) (or OCFS2) on a Dual-Primary setup. - -- Mark Watts BSc RHCE Senior Systems Engineer, MSS Secure Managed Hosting www.QinetiQ.com QinetiQ - Delivering customer-focused solutions GPG Key: http://www.linux-corner.info/mwatts.gpg -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org/ iEYEARECAAYFAk1/j8AACgkQBn4EFUVUIO03DQCggILhV71sPjJ+VFXxtfjT+DPS 6n8An3loexkOIeaNg6IW/IhZ8wmI0saQ =F80k -----END PGP SIGNATURE----- From jduston at ll.mit.edu Tue Mar 15 17:14:49 2011 From: jduston at ll.mit.edu (Jack Duston) Date: Tue, 15 Mar 2011 13:14:49 -0400 Subject: [Linux-cluster] GFS2 file system maintenance question. In-Reply-To: <21f7a2.18b2.12eb72713be.Coremail.ooolinux@163.com> References: <4D7E9832.40000@ll.mit.edu> <21f7a2.18b2.12eb72713be.Coremail.ooolinux@163.com> Message-ID: <4D7F9E89.1020307@ll.mit.edu> Thanks Yue, but your information would seem dated if this site is correct: http://www.redhat.com/rhel/compare Even if 100TB is what's officially supported in RHEL6, it doesn't mean that larger file systems won't work. Most likely that is the largest amount of storage that Red Hat had available to test. Since this is a brand new setup, now is a great time to see if it works with the storage I have available. If it doesn't, then I haven't lost anything other than a little time, and I'll just chunk it up into 100TB Logical Volumes. However, since it would be better for our purposes, I would like to keep our data in one file system if possible. Regards, Jack On 03/14/2011 09:35 PM, yue wrote: > 1. > GFS2 is based on a 64-bit architecture, which can theoretically > accommodate an 8 EB file system. However, the current supported > maximum size of a GFS2 file system is 25 TB. If your system requires > GFS2 file systems larger than 25 TB, contact your Red Hat service > representative. > > > At 2011-03-15 06:35:30?"Jack Duston" wrote: > > >Hello folks, > > > >I am planning to create a 2 node cluster with a GFS2 CLVM SAN. 
> >The following Note in the RHEL6 GFS2 manual jumped out at me: > > > >Chapter 3. Managing GFS2 > >Note: > >Once you have created a GFS2 file system with the mkfs.gfs2 command, you > >cannot decrease the size of the file system. You can, however, increase > >the size of an existing file system with the gfs2_grow command, as > >described in Section 3.6, ?Growing a File System?. > > > >This seems to me to make a GFS2 LV un-maintainable. > > > >What concerns me is the issue of how to remove a LUN from the GFS2 LV. > >This will be a necessity *when* there are hardware problems with a > >storage unit, End of Life/obsolescence (a la XRaid), or upgrade (replace > >1TB HDDS with 3 TB HDDs in the LUNs). > > > >Hardware does not last forever, and manufacturers do EOL products or go > >out of business. > >I had also hoped to upgrade the 1TB HDDs in our current LUNs with 3 TB > >HDDs next year. > > > >I planned to free up enough space on the GFS2 LV to migrate data off one > >LUN. I could then decrease the GFS2 file system size, remove the LUN > >from the LV, destroy the RAID LUN, replace 1TB HDDs with 3TB HDDs, > >rebuild the RAID LUN, add the new larger LUN to the LV, increase the > >GFS2 file system size, and repeat migrating data off the next LUN. > > > >If the above note is correct, it seems to only way to deal with a > >hardware issue, obsolescence/EOL, or upgrading components is to destroy > >the entire GFS2 file system, build a new GFS2 file system from scratch, > >and restore data from backups. This might not be too bad with a small > >SAN of 20TB, but our data will exceed 100TB and it would be good not to > >have to rebuild Rome in a day. > > > >Can anyone confirm that GFS2 file system cannot be decreased? If so, is > >there any plan to add this capability/fix this issue in a future > >release? Is there another/better way to remove a LUN from GFS2 than what > >I considered? > > > >Any info greatly appreciated. > > > >-- > >Linux-cluster mailing list > >Linux-cluster at redhat.com > >https://www.redhat.com/mailman/listinfo/linux-cluster > > From bazy84 at gmail.com Tue Mar 15 17:58:45 2011 From: bazy84 at gmail.com (Bazy) Date: Tue, 15 Mar 2011 19:58:45 +0200 Subject: [Linux-cluster] Split-brain with DRBD active-active + RHCS In-Reply-To: <4D7F8FC1.9030505@eris.qinetiq.com> References: <4D7F57DA.8060204@netcore.co.in> <4D7F5CF3.4050805@netcore.co.in> <4D7F8FC1.9030505@eris.qinetiq.com> Message-ID: Hi Mark, Yes, clustered file system is mandatory. Even with gfs(2) DRBD will not recover by itself from a split brain. I think specific options are needed in drbd.conf "after-sb-0pri after-sb-1pri after-sb-2pri", but don't know what the exact ones are. Best regards! On Tue, Mar 15, 2011 at 6:11 PM, Mark Watts wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > On 03/15/2011 12:34 PM, jayesh.shinde wrote: >> Hi All , >> >> Just want mention one point. I am using Ext3 filesystem in below setup. > > Dual Primary DRBD (Active-Active) and EXT3 are mutually exclusive. > > You should be using GFS(2) (or OCFS2) on a Dual-Primary setup. 
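(For anyone following the thread: moving /dev/drbd2 from ext3 to GFS2 would look roughly like this. It is only a sketch -- the cluster name and mount point are made up, and mkfs destroys the existing ext3 data, so back it up first:

  mkfs.gfs2 -p lock_dlm -t mycluster:drbd2 -j 2 /dev/drbd2
  mount -t gfs2 /dev/drbd2 /data

The value after -t has to match the cluster name in cluster.conf, and -j needs one journal per node that will mount the filesystem.)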
> > - -- > Mark Watts BSc RHCE > Senior Systems Engineer, MSS Secure Managed Hosting > www.QinetiQ.com > QinetiQ - Delivering customer-focused solutions > GPG Key: http://www.linux-corner.info/mwatts.gpg > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.11 (GNU/Linux) > Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org/ > > iEYEARECAAYFAk1/j8AACgkQBn4EFUVUIO03DQCggILhV71sPjJ+VFXxtfjT+DPS > 6n8An3loexkOIeaNg6IW/IhZ8wmI0saQ > =F80k > -----END PGP SIGNATURE----- > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From jduston at ll.mit.edu Tue Mar 15 18:14:24 2011 From: jduston at ll.mit.edu (Jack Duston) Date: Tue, 15 Mar 2011 14:14:24 -0400 Subject: [Linux-cluster] GFS2 file system maintenance question. In-Reply-To: <4D7F109D.4000404@redhat.com> References: <4D7E9832.40000@ll.mit.edu> <4D7F109D.4000404@redhat.com> Message-ID: <4D7FAC80.9080804@ll.mit.edu> Thanks much Ryan, Its good to know this situation can be handled via the CLVM layer even though GFS2 doesn't provide a method directly. I just needed to know there was a way to deal with those situations. I will hang on to the least bad old RAID system, rather than surplussing it, to use as a temp LUN. I understand we are not a typical use case. We do re-purpose equipment for other experiments or tasks at times, and it would be great to be able without completely tearing down existing setups. Even though its not a common event, it sure would be nice to have the capability to move older data offline and reduce the file system in place if the situation arises. Other than just griping, I'd also like to say its greatly appreciated that you (Red Hat) created and have made GFS2 available. We have been using XSan2/StorNext, and its great to have such an alternative. Now that Apple has discontinued its Server support we are looking to move to Red Hat's GFS2 SAN solution. We looked into other cluster file systems like GlusterFS, but we need a real SAN and not a distributed file system for this use case. Thanks again, Jack On 03/15/2011 03:09 AM, Ryan Mitchell wrote: > On 03/15/2011 08:35 AM, Jack Duston wrote: >> I planned to free up enough space on the GFS2 LV to migrate data off >> one LUN. I could then decrease the GFS2 file system size, remove the >> LUN from the LV, destroy the RAID LUN, replace 1TB HDDs with 3TB HDDs, >> rebuild the RAID LUN, add the new larger LUN to the LV, increase the >> GFS2 file system size, and repeat migrating data off the next LUN. >> > Hi, > > No you will not be able to use that procedure to swap LUNs. If you have > the ability to present the new LUN's before removing the old LUN's from > the volume group, it would be possible to: > 1) vgextend the volume group using the new LUN > 2) pvmove the extents from the old LUN to the new LUN > 3) vgreduce the old LUN to remove it from the volume group > > This could be done 1 LUN at a time. It doesn't even require you to grow > the filesystem (unless the new LUN's are larger than the old ones). > This is common and I've seen it done many times. You could even use a > temporary staging LUN to shuffle the data around. > > If you do not have the capacity to add additional LUNs before removing > the original LUNs, then you will face a difficult migration, possibly > using backup/restore as you mentioned. > > The feature to reduce the filesystem has not been implemented; there is > no code as yet to manage it. It isn't commonly required. 
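(As a shell sketch of that LUN swap -- the VG and device names below are made up, and on a clustered (CLVM) volume group pvmove may need cmirrord or exclusive activation, so check your release notes and test on scratch storage first:

  pvcreate /dev/mapper/new_lun
  vgextend san_vg /dev/mapper/new_lun
  pvmove /dev/mapper/old_lun /dev/mapper/new_lun   # moves all extents; can take a long time
  vgreduce san_vg /dev/mapper/old_lun
  pvremove /dev/mapper/old_lun

Only if the replacement LUN is larger would you then follow up with lvextend and gfs2_grow to use the extra space.)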
> > Regards, > > Ryan Mitchell > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From ajb2 at mssl.ucl.ac.uk Tue Mar 15 18:26:41 2011 From: ajb2 at mssl.ucl.ac.uk (Alan Brown) Date: Tue, 15 Mar 2011 18:26:41 +0000 Subject: [Linux-cluster] GFS2 file system maintenance question. In-Reply-To: <4D7F9E89.1020307@ll.mit.edu> References: <4D7E9832.40000@ll.mit.edu> <21f7a2.18b2.12eb72713be.Coremail.ooolinux@163.com> <4D7F9E89.1020307@ll.mit.edu> Message-ID: <4D7FAF61.4050801@mssl.ucl.ac.uk> Jack Duston wrote: > Thanks Yue, but your information would seem dated if this site is correct: > > http://www.redhat.com/rhel/compare > > Even if 100TB is what's officially supported in RHEL6, it doesn't mean > that larger file systems won't work. Anyone considering such large filesystems should consider the following questions. 1: How long is it going to take to back it up.? 2: How long will it take to restore?? Even LTO5 takes the best part of 12 hours to restore 1Tb... From linux at alteeve.com Tue Mar 15 19:03:06 2011 From: linux at alteeve.com (Digimer) Date: Tue, 15 Mar 2011 15:03:06 -0400 Subject: [Linux-cluster] Split-brain with DRBD active-active + RHCS In-Reply-To: References: <4D7F57DA.8060204@netcore.co.in> <4D7F5CF3.4050805@netcore.co.in> <4D7F8FC1.9030505@eris.qinetiq.com> Message-ID: <4D7FB7EA.8090604@alteeve.com> On 03/15/2011 01:58 PM, Bazy wrote: > Hi Mark, > > Yes, clustered file system is mandatory. Even with gfs(2) DRBD will > not recover by itself from a split brain. I think specific options are > needed in drbd.conf "after-sb-0pri after-sb-1pri after-sb-2pri", but > don't know what the exact ones are. > > Best regards! For the recovery; resource r0 { device /dev/drbd0; net { after-sb-0pri discard-zero-changes; after-sb-1pri discard-secondary; after-sb-2pri disconnect; } } To tie it into fenced via cman (to fence instead of split-brain), also add: resource r0 { device /dev/drbd0; disk { fencing resource-and-stonith; } handlers { outdate-peer "/sbin/obliterate"; } } You can download 'obliterate' from here: http://people.redhat.com/lhh/obliterate (found here: http://gfs.wikidev.net/DRBD_Cookbook) See also: http://www.drbd.org/users-guide/ch-rhcs.html -- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From jduston at ll.mit.edu Tue Mar 15 20:55:27 2011 From: jduston at ll.mit.edu (Jack Duston) Date: Tue, 15 Mar 2011 16:55:27 -0400 Subject: [Linux-cluster] GFS2 file system maintenance question. In-Reply-To: <4D7FAF61.4050801@mssl.ucl.ac.uk> References: <4D7E9832.40000@ll.mit.edu> <21f7a2.18b2.12eb72713be.Coremail.ooolinux@163.com> <4D7F9E89.1020307@ll.mit.edu> <4D7FAF61.4050801@mssl.ucl.ac.uk> Message-ID: <4D7FD23F.1040200@ll.mit.edu> Hi Alan, These certainly are concerns, although straying a little off-topic. I am risk-averse and try to avoid or mitigate 'gotcha' issues. (Hence why I'm thinking about maintenance issues now). Unfortunately, having less data is not an option, so I need to try to implement the best solution available to handle that data. I'm not sure backing up, say 5 x 100TB filesystems would be much difference from backing up the same data on a single 1 x 500TB filesystem. Restoring a single 100TB filesystem would definitely be easier than a single 500TB filesystem, but the trade-off is that the data is split up across 5 filesystems. Tape backup has definite drawbacks when you start dealing with large data sets. 
Fortunately our data will not be changing often. We are backing up to external hard drives using trayless chassis, basically using 2TB HDDs as jumbo floppy drives. YMMV. We are presently running a 70TB XSan2/StorNext SAN. It has been rock-steady since created, about half a year now (XSan1, not so much). I hope creating a 100TB filesystem from one designed to scale to 8EB should really not be too much of a test for GFS2. That is only 1/80th its design capacity. I do not think the developers at Red Hat are any less capable than those at Quantum. (although it will certainly suck to uncover an edge case or bug triggered by >100TB filesystem!). Given the choice, I certainly would not be pushing the boundaries of what's officially supported. However, it does seem Red Hat has built a Monster Truck. Since I have cars that need crushing, lets see if it can crush some cars before just parking it in the drive. Cheers, Jack p.s. I'll start building more driveways if it can't, but lets at least try it first... On 03/15/2011 02:26 PM, Alan Brown wrote: > Jack Duston wrote: >> Thanks Yue, but your information would seem dated if this site is correct: >> >> http://www.redhat.com/rhel/compare >> >> Even if 100TB is what's officially supported in RHEL6, it doesn't mean >> that larger file systems won't work. > Anyone considering such large filesystems should consider the following > questions. > > 1: How long is it going to take to back it up.? > > 2: How long will it take to restore?? > > Even LTO5 takes the best part of 12 hours to restore 1Tb... > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From ooolinux at 163.com Wed Mar 16 01:51:57 2011 From: ooolinux at 163.com (yue) Date: Wed, 16 Mar 2011 09:51:57 +0800 (CST) Subject: [Linux-cluster] which is max gfs2 filesystem size,25T or 100T? Message-ID: <5ee37b80.144c7.12ebc5cbc8e.Coremail.ooolinux@163.com> 1.the link is rhel5 and rhel6. but the article confuse me. http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/5/html/Global_File_System_2/ch-overview-GFS2.html http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html/Global_File_System_2/ch-overview-GFS2.html 2.if it is to say the limitation is efficacious on fedora or centos? thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: From parvez.h.shaikh at gmail.com Wed Mar 16 05:07:55 2011 From: parvez.h.shaikh at gmail.com (Parvez Shaikh) Date: Wed, 16 Mar 2011 10:37:55 +0530 Subject: [Linux-cluster] Clustat exit code for service status Message-ID: Hi all, Command clustat -s gives status of service. If service is started (i.e. running on some node), exit code of this command is 0, if however service is not running, its exit code is non-zero (found it to be 119). Is this right and going to be continued in subsequent cluster versions as well? Reason I am asking this, is if I can use this command in shell script to give status of service - clustat -s if [ $? -eq 0 ]; then echo "service is up" else echo "service is not up" Thanks Parvez -------------- next part -------------- An HTML attachment was scrubbed... URL: From linux at alteeve.com Wed Mar 16 18:07:50 2011 From: linux at alteeve.com (Digimer) Date: Wed, 16 Mar 2011 14:07:50 -0400 Subject: [Linux-cluster] Tripp Lite switched PDU fence agent; exists? Message-ID: <4D80FC76.6070605@alteeve.com> Hi all, Does anyone know if the tripp lite (mn: PDUMH15ATNET, specifically) has an existing RHCS fence agent? 
Specifically for cluster 2 / EL5.5. If not, has anyone written one? Failing all that, I suppose I will write one. :) -- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From bergman at merctech.com Wed Mar 16 18:59:51 2011 From: bergman at merctech.com (bergman at merctech.com) Date: Wed, 16 Mar 2011 14:59:51 -0400 Subject: [Linux-cluster] Tripp Lite switched PDU fence agent; exists? In-Reply-To: <4D80FC76.6070605@alteeve.com> References: <4D80FC76.6070605@alteeve.com> Message-ID: <20110316145951.75cc0849@mirchi> The pithy ruminations from Digimer on "[Linux-cluster] Tripp Lite switched PDU fence agent; exists?" were: => Hi all, => => Does anyone know if the tripp lite (mn: PDUMH15ATNET, specifically) => has an existing RHCS fence agent? Specifically for cluster 2 / EL5.5. If Yes. => not, has anyone written one? Failing all that, I suppose I will write => one. :) => Yes. I wrote an agent for that piece of hardware and offered the agent to the RHCS community in Nov 2008...there was no response at the time.[1] In March, 2009, I sent a copy of the agent script to Jan Friesse , Marek Grac , who were identified as the maintainers of all the fence agents. Since it apparently hasn't made it into the RHCS distribution, let me know if you want a copy. Finally, I'd like to warn people away from using the TrippLite PDU model PDUMH15ATNET as a fencing device. While it seems to have nice features, it has a design choice that is a serious problem with fencing--when a command is given to power down an outlet, there is a "random" delay (observed to be about 17 to 35 seconds) before that command is executed. This has been acknowledged by TrippLite support as a design choice, with no option or setting to override this behavior. Mark [1] http://www.redhat.com/archives/linux-cluster/2008-November/msg00215.html From linux at alteeve.com Wed Mar 16 19:57:45 2011 From: linux at alteeve.com (Digimer) Date: Wed, 16 Mar 2011 15:57:45 -0400 Subject: [Linux-cluster] Tripp Lite switched PDU fence agent; exists? In-Reply-To: <20110316145951.75cc0849@mirchi> References: <4D80FC76.6070605@alteeve.com> <20110316145951.75cc0849@mirchi> Message-ID: <4D811639.1070704@alteeve.com> On 03/16/2011 02:59 PM, bergman at merctech.com wrote: > The pithy ruminations from Digimer on "[Linux-cluster] Tripp Lite switched PDU fence agent; exists?" were: > > => Hi all, > => > => Does anyone know if the tripp lite (mn: PDUMH15ATNET, specifically) > => has an existing RHCS fence agent? Specifically for cluster 2 / EL5.5. If > > Yes. > > > => not, has anyone written one? Failing all that, I suppose I will write > => one. :) > => > > Yes. > > I wrote an agent for that piece of hardware and offered the agent to the RHCS community in Nov 2008...there was no response at the time.[1] > > In March, 2009, I sent a copy of the agent script to Jan Friesse , Marek Grac , who were identified as the maintainers of all the fence agents. > > Since it apparently hasn't made it into the RHCS distribution, let me know if you want a copy. > > > Finally, I'd like to warn people away from using the TrippLite PDU model > PDUMH15ATNET as a fencing device. While it seems to have nice features, it has > a design choice that is a serious problem with fencing--when a command is > given to power down an outlet, there is a "random" delay (observed to be > about 17 to 35 seconds) before that command is executed. 
This has been > acknowledged by TrippLite support as a design choice, with no option or setting > to override this behavior. > > Mark > > > [1] http://www.redhat.com/archives/linux-cluster/2008-November/msg00215.html Hi Mark, I came across your post in the archives, actually. :) I would like a copy of your agent, if you don't mind. I already maintain another fence agent, and would be happy to maintain this one, shy of someone more experience stepping up. As for the delay, that sounds annoying, but not insurmountable. I've got one of the switches on order already, as I wanted to see how they worked. I can fairly easily put in a 5-sec poll that checks the state until the node is cut or a timeout is hit. From the cluster's point of view, this is safe outside of delaying recovery. In my case though, I'll be sure to use it as the secondary fence device. I'll include such a warning/suggestion in the agent's man page as well. Cheers -- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From fdinitto at redhat.com Wed Mar 16 20:24:38 2011 From: fdinitto at redhat.com (Fabio M. Di Nitto) Date: Wed, 16 Mar 2011 21:24:38 +0100 Subject: [Linux-cluster] Tripp Lite switched PDU fence agent; exists? In-Reply-To: <20110316145951.75cc0849@mirchi> References: <4D80FC76.6070605@alteeve.com> <20110316145951.75cc0849@mirchi> Message-ID: <4D811C86.70202@redhat.com> On 03/16/2011 07:59 PM, bergman at merctech.com wrote: > The pithy ruminations from Digimer on "[Linux-cluster] Tripp Lite switched PDU fence agent; exists?" were: > > => Hi all, > => > => Does anyone know if the tripp lite (mn: PDUMH15ATNET, specifically) > => has an existing RHCS fence agent? Specifically for cluster 2 / EL5.5. If > > Yes. > > > => not, has anyone written one? Failing all that, I suppose I will write > => one. :) > => > > Yes. > > I wrote an agent for that piece of hardware and offered the agent to the RHCS community in Nov 2008...there was no response at the time.[1] > > In March, 2009, I sent a copy of the agent script to Jan Friesse , Marek Grac , who were identified as the maintainers of all the fence agents. > > Since it apparently hasn't made it into the RHCS distribution, let me know if you want a copy. > Hmm ok, this is pretty bad... i am sorry that it got missed and I take responsibility for it. Can you please send it to me/digimer? Digimer, you have git commit access to fence-agents.git. If the agent is GPLv2+ compliant, and it looks sane, please add it. Fabio From linux at alteeve.com Wed Mar 16 20:52:13 2011 From: linux at alteeve.com (Digimer) Date: Wed, 16 Mar 2011 16:52:13 -0400 Subject: [Linux-cluster] Tripp Lite switched PDU fence agent; exists? In-Reply-To: <4D811C86.70202@redhat.com> References: <4D80FC76.6070605@alteeve.com> <20110316145951.75cc0849@mirchi> <4D811C86.70202@redhat.com> Message-ID: <4D8122FD.2050001@alteeve.com> On 03/16/2011 04:24 PM, Fabio M. Di Nitto wrote: > On 03/16/2011 07:59 PM, bergman at merctech.com wrote: >> The pithy ruminations from Digimer on "[Linux-cluster] Tripp Lite switched PDU fence agent; exists?" were: >> >> => Hi all, >> => >> => Does anyone know if the tripp lite (mn: PDUMH15ATNET, specifically) >> => has an existing RHCS fence agent? Specifically for cluster 2 / EL5.5. If >> >> Yes. >> >> >> => not, has anyone written one? Failing all that, I suppose I will write >> => one. :) >> => >> >> Yes. 
>> >> I wrote an agent for that piece of hardware and offered the agent to the RHCS community in Nov 2008...there was no response at the time.[1] >> >> In March, 2009, I sent a copy of the agent script to Jan Friesse , Marek Grac , who were identified as the maintainers of all the fence agents. >> >> Since it apparently hasn't made it into the RHCS distribution, let me know if you want a copy. >> > > Hmm ok, this is pretty bad... i am sorry that it got missed and I take > responsibility for it. > > Can you please send it to me/digimer? > > Digimer, you have git commit access to fence-agents.git. > > If the agent is GPLv2+ compliant, and it looks sane, please add it. > > Fabio I've got a copy now. Let me wait until I get my hardware to test/tweak/document it, then I will look to push it into git. Thanks again, Mark! -- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From parvez.h.shaikh at gmail.com Thu Mar 17 05:25:51 2011 From: parvez.h.shaikh at gmail.com (Parvez Shaikh) Date: Thu, 17 Mar 2011 10:55:51 +0530 Subject: [Linux-cluster] Node without fencing method, is it possible to failover from such a node? Message-ID: Hi all, I have a red hat cluster on IBM blade center with blades being my clusternodes and fence_bladecenter fencing agent. I have couple of resources - IP which activate or deactivate floating IP and script which start my server listening on this floating IP. This is a stateless server with no shared storage requirements or any shared resources which require me to use fancy fencing device. Everything was working fine, when I disable ethcard of heartbeat IP or of floating IP or pull powerplug or reboot/shutdown/halt one node, IP floats on another node and script start my server which happily listen on this IP. Life was good until I am now required to support cluster of nodes which are not hosted in bladecenter but any vanilla nodes. Now everything remains same but bladecenter fencing cant be used, and as per my understanding since I am using red hat cluster, it requires me to use some fence method, my first choice is to use power fencing and that only fencing suits my application needs. But is there any way (I know not the best and recommended but if I can live with it) to get away with fencing and let service failover in absence of fence devices configured for node? Thanks, Parvez -------------- next part -------------- An HTML attachment was scrubbed... URL: From ooolinux at 163.com Thu Mar 17 05:57:40 2011 From: ooolinux at 163.com (yue) Date: Thu, 17 Mar 2011 13:57:40 +0800 (CST) Subject: [Linux-cluster] do you have gfs2 code call-flow Message-ID: <424a4c1d.7604.12ec2640c9e.Coremail.ooolinux@163.com> do you have gfs2 code call-flow i wan to know how gfs2 is implemented ,on code level thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: From swhiteho at redhat.com Thu Mar 17 09:30:50 2011 From: swhiteho at redhat.com (Steven Whitehouse) Date: Thu, 17 Mar 2011 09:30:50 +0000 Subject: [Linux-cluster] do you have gfs2 code call-flow In-Reply-To: <424a4c1d.7604.12ec2640c9e.Coremail.ooolinux@163.com> References: <424a4c1d.7604.12ec2640c9e.Coremail.ooolinux@163.com> Message-ID: <1300354250.2596.7.camel@dolmen> Hi, On Thu, 2011-03-17 at 13:57 +0800, yue wrote: > do you have gfs2 code call-flow > > i wan to know how gfs2 is implemented ,on code level > > thanks > > There is some documentation in the kernel source Documentation/filesystems directory. 
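(For example, from the top of a kernel source tree -- exact filenames may vary between kernel versions:

  ls Documentation/filesystems/gfs2*.txt

which in current kernels should list gfs2.txt, gfs2-glocks.txt and gfs2-uevents.txt.)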
Also if you look at the GFS2 Wikipedia page, you'll see some links to some documents which I have written that explain a bit about the internals. There is no one single overall document though I'm afraid, Steve. From ccd.stoy.ml at gmail.com Thu Mar 17 15:17:53 2011 From: ccd.stoy.ml at gmail.com (C.D.) Date: Thu, 17 Mar 2011 17:17:53 +0200 Subject: [Linux-cluster] GFS2 locking in a VM based cluster (KVM) Message-ID: Hello, sorry guys to resurrect an old thread, but I have to say I can confirm that, too. I have a libvirt setup with multipathed FC SAN devices and KVM guests running on top of it. The physical machine is HP 465c G7 (2 x 12 Core Magny-Cours with 96GB RAM). The host OS is Fedora 14. The guests are Scientific Linux 6. With gfs2 10GB shared LUN I can manage ~600k plocks/sec while both machines mounted the LUN. I started: ping_pong some_file 3 on one of the VMs and got those 600k plocks. Then I started ping_pong the_same_file 3 on the second machines and got around 360 plocks/sec (that is 360, not 360 000). No matter what I tried I couldn't optimize it. If I stop the ping_pong on one of the VMs the plocks wen't up to around 500-550 plocks/sec (again 550 not 550k). Stopping the process. Waiting a while and starting again on a single machine still got me around 600k plocks. This I could reproduce both with tcp and sctp and tried bunch of different settings. Then I decided to give ocfs2 a change. Compiling the module on SL6, and I suppose on RHEL6, is not the most straight forward taks, buth half an hour later I got the module compiled from the sources of the EL kernel. Stripped all debug symbols. Copied the ocfs2 kernel module dir to both VM machines. Did depmod -a, I set up the oracle fs on top of the same LUN. Used ping_pong the_same_file_i_used_in_the_first_test 3 on just one machine, while both VMs have mounted the LUN. 1600k plocks/sec (as in ~1 600 000 ). Started ping_pong on the second host. The plocks did not move at all. Still 1600k plocks/sec. Tested with the real life app. It worked very well, unlike gfs2, which was painfully slow with just 2 users. I created the ocfs2 with -T mail, I didn't do any tuning on it, either. I'm not trying to bash gfs2, actually I would definitely prefer it over ocfs2 anytime, however it seems it doesn't work well with VM for some reason. I have used both mtu 1500 and 9000 also, it just didn't make any diffence, no matter what I have tried.I haven't tested the same setup on top of two physical nodes, but I have the feeling it will work just as good as ocfs2 on the VMs. I didn't test with hugepages for the VMs, but I somehow doubt that would make much of a difference. I think this should be investigates by someone at RH possibly because they are the driving force behind both KVM, libvirt, the cluster soft and gfs2. -------------- next part -------------- An HTML attachment was scrubbed... URL: From linux at alteeve.com Thu Mar 17 16:01:40 2011 From: linux at alteeve.com (Digimer) Date: Thu, 17 Mar 2011 12:01:40 -0400 Subject: [Linux-cluster] Node without fencing method, is it possible to failover from such a node? In-Reply-To: References: Message-ID: <4D823064.8040907@alteeve.com> On 03/17/2011 01:25 AM, Parvez Shaikh wrote: > Hi all, > > I have a red hat cluster on IBM blade center with blades being my > clusternodes and fence_bladecenter fencing agent. I have couple of > resources - IP which activate or deactivate floating IP and script which > start my server listening on this floating IP. 
This is a stateless > server with no shared storage requirements or any shared resources which > require me to use fancy fencing device. > > Everything was working fine, when I disable ethcard of heartbeat IP or > of floating IP or pull powerplug or reboot/shutdown/halt one node, IP > floats on another node and script start my server which happily listen > on this IP. Life was good until I am now required to support cluster of > nodes which are not hosted in bladecenter but any vanilla nodes. > > Now everything remains same but bladecenter fencing cant be used, and as > per my understanding since I am using red hat cluster, it requires me to > use some fence method, my first choice is to use power fencing and that > only fencing suits my application needs. > > But is there any way (I know not the best and recommended but if I can > live with it) to get away with fencing and let service failover in > absence of fence devices configured for node? > > Thanks, > Parvez Manual fencing is not supported: - http://sources.redhat.com/cluster/wiki/FAQ/Fencing#fence_manual2 Do you vanilla servers have IPMI (or equiv. like iLo, DRAC, etc)? If they do, than you can use fence_ipmilan. Failing that, then I'd strongly recommend investing is a switched PDU. All things considered, they are not that expensive. Plus, when a node is fenced/rebooted, there is a chance it will return to the cluster healthy. -- Digimer E-Mail: digimer at alteeve.com AN!Whitepapers: http://alteeve.com Node Assassin: http://nodeassassin.org From raju.rajsand at gmail.com Thu Mar 17 16:49:20 2011 From: raju.rajsand at gmail.com (Rajagopal Swaminathan) Date: Thu, 17 Mar 2011 22:19:20 +0530 Subject: [Linux-cluster] Node without fencing method, is it possible to failover from such a node? In-Reply-To: <4D823064.8040907@alteeve.com> References: <4D823064.8040907@alteeve.com> Message-ID: Greetings, On 3/17/11, Digimer wrote: > On 03/17/2011 01:25 AM, Parvez Shaikh wrote: >> Hi all, >> >> Life was good until I am now required to support cluster of >> nodes which are not hosted in bladecenter but any vanilla nodes. Suggestions from somebody who stupidly yapped "I will support manual fencing" and burnt his finger (Who? Oh! that was me): 1. Don't commit support for manual fencing 2. Don't support manual fencing. If you are in India, APC Fence PDU is available for around 30-35K INR (about a year back or so). If someone is ready to invest say 500K INR for HA hardware such as two servers etc., they might as well add 35k. OTOH, if those nodes are rack mounted servers (Unlike entry level server which does not have management port), the cost of the Powerfence strip will be a different issue when it comes to justifying, etc. within a corporate/Enterprise environment. Too much paperwork, I agree. But It will give a more robust infrastructure which will help us in using various tools like Zabbix, Spacewalk, snmp (I think fence strips have some SNMP - please check) etc. in the future. Life will be good then. With warm regards, Rajagopal From ra at ra.is Thu Mar 17 20:29:34 2011 From: ra at ra.is (Richard Allen) Date: Thu, 17 Mar 2011 16:29:34 -0400 Subject: [Linux-cluster] DLM problem Message-ID: <4D826F2E.9030600@ra.is> I have a simple test cluster up and running (RHEL 6 HA) on three vmware guests. Each vmware guest has 3 vnic's. 
After booting a node, I often get a dead rgmanager: [root at syseng1-vm ~]# service rgmanager status rgmanager dead but pid file exists Cluster is otherwise OK [root at syseng1-vm ~]# clustat Cluster Status for RHEL6Test @ Thu Mar 17 16:10:38 2011 Member Status: Quorate Member Name ID Status ------ ---- ---- ------ syseng1-vm 1 Online, Local syseng2-vm 2 Online syseng3-vm 3 Online There is a service running on node2 but clustat has no info on that. [root at syseng1-vm ~]# cman_tool status Version: 6.2.0 Config Version: 9 Cluster Name: RHEL6Test Cluster Id: 36258 Cluster Member: Yes Cluster Generation: 88 Membership state: Cluster-Member Nodes: 3 Expected votes: 3 Total votes: 3 Node votes: 1 Quorum: 2 Active subsystems: 1 Flags: Ports Bound: 0 Node name: syseng1-[CENSORED] Node ID: 1 Multicast addresses: 239.192.141.48 Node addresses: 10.10.16.11 The syslog has some info: Mar 17 15:47:55 syseng1-vm rgmanager[2463]: Quorum formed Mar 17 15:47:55 syseng1-vm kernel: dlm: no local IP address has been set Mar 17 15:47:55 syseng1-vm kernel: dlm: cannot start dlm lowcomms -107 The fix is always the same: [root at syseng1-vm ~]# service cman restart Stopping cluster: Leaving fence domain... [ OK ] Stopping gfs_controld... [ OK ] Stopping dlm_controld... [ OK ] Stopping fenced... [ OK ] Stopping cman... [ OK ] Waiting for corosync to shutdown: [ OK ] Unloading kernel modules... [ OK ] Unmounting configfs... [ OK ] Starting cluster: Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] [root at syseng1-vm ~]# service rgmanager restart Stopping Cluster Service Manager: [ OK ] Starting Cluster Service Manager: [ OK ] [root at syseng1-vm ~]# clustat Cluster Status for RHEL6Test @ Thu Mar 17 16:22:01 2011 Member Status: Quorate Member Name ID Status ------ ---- ---- ------ syseng1-vm 1 Online, Local, rgmanager syseng2-vm 2 Online, rgmanager syseng3-vm 3 Online Service Name Owner (Last) State ------- ---- ----- ------ ----- service:TestDB syseng2-vm started Sometimes restarting rgmanager hangs and the node needs to be rebooted. my cluster.conf: