From aneesh.kumar at gmail.com Mon Jan 1 15:57:25 2007
From: aneesh.kumar at gmail.com (Aneesh Kumar K.V)
Date: Mon, 01 Jan 2007 21:27:25 +0530
Subject: [Linux-cluster] Re: RedHat SSI cluster
In-Reply-To: <45950A9D.7040607@interstudio.homeunix.net>
References: <45950A9D.7040607@interstudio.homeunix.net>
Message-ID: <45992F65.206@gmail.com>

Bob Marcan wrote:
> Hi.
> Are there any plans to enhance RHCS to become
> full SSI (Single System Image) cluster?
> Will http://www.open-sharedroot.org/ become officialy included and
> supported?
> Isn't time to unite force with the http://www.openssi.org ?
>

If you look at the openssi.org code, you can consider it to contain multiple components:

a) ICS
b) VPROC
c) CFS
d) Clusterwide SYSVIPC
e) Clusterwide PID
f) Clusterwide remote file operations

I am right now done with getting ICS cleaned up for the 2.6.20-rc1 kernel. It provides a transport-independent cluster framework for writing kernel cluster services. You can find the code at http://git.openssi.org/~kvaneesh/gitweb.cgi?p=ci-to-linus.git;a=summary

So what could be done, which will help GFS and OCFS2, is to make sure they can work on top of ICS. That also brings in the advantage that GFS and OCFS2 can work using TCP/Infiniband/SCTP/TIPC, whatever the transport layer protocol is. Once that is done, the next step would be to get Clusterwide SYSVIPC from OpenSSI and merge it with the latest kernel. Clusterwide PID and clusterwide remote file operations are easy to get working. What is most difficult is VPROC, which brings in the clusterwide proc model.
Bruce Walker have a paper written on a generic framework at http://www.openssi.org/cgi-bin/view?page=proc-hooks.html -aneesh -aneesh From cbay at excellency.fr Tue Jan 2 10:47:38 2007 From: cbay at excellency.fr (Cyril) Date: Tue, 2 Jan 2007 11:47:38 +0100 Subject: [Linux-cluster] GFS2: kernel oops on mount with lock_nolock Message-ID: <37213c1e67b467e8a709429a465d89a1@mail.excellency.fr> Hello, First of all, happy new year to everyone :-) I compiled a 2.6.19.1 kernel with GFS2 and lock_nolock. When trying to mount the newly created GFS2 partition, I get 2 successive identical kernel oops (see [1], at the end of this mail). The second oops appears about 15 seconds after the first one. Nevertheless, the FS is mounted and I can make basic file operations on it. However, extracting kernel sources triggers another oops and tar exits with a segmentation fault: dbx5:/mnt# tar xjvf /root/linux-2.6.19.tar.bz2 [...] linux-2.6.19/include/asm-h8300/shmbuf.h Segmentation fault See the oops in [2]. Now the FS seems stuck, and trying to remove a file hangs forever. These errors happen on a freshly installed Debian stable with my custom kernel. I get the same oops with a 2.6.20-rc2 kernel. Steps to reproduce: - install Debian stable - install the kernel compiled with my .config (can be found on http://dev.excellency.fr/cbay/config) - install mkfs.gfs2 (from latest cluster CVS, compiled myself) and libvolume_id.so (from udev 0.94, compiled myself) - # mkfs -t gfs2 -p lock_nolock -t test:test /dev/sda3 - # mount -t gfs2 /dev/sda3 /mnt/ /dev/sda3 is 10GB. Any idea? Thanks! [1] : GFS2: fsid=: Trying to join cluster "lock_nolock", "test:test" GFS2: fsid=test:test.0: Joined cluster. Now mounting FS... GFS2: fsid=test:test.0: jid=0, already locked for use GFS2: fsid=test:test.0: jid=0: Looking at journal... GFS2: fsid=test:test.0: jid=0: Done ------------[ cut here ]------------ kernel BUG at fs/gfs2/glock.c:738! invalid opcode: 0000 [#1] Modules linked in: CPU: 0 EIP: 0060:[] Not tainted VLI EFLAGS: 00010286 (2.6.19.1 #1) EIP is at gfs2_glmutex_unlock+0x26/0x30 eax: f788dbbc ebx: f788dbec ecx: 00000001 edx: f788dbbc esi: f788db78 edi: f5022388 ebp: f5415f94 esp: f5415f48 ds: 007b es: 007b ss: 0068 Process gfs2_glockd (pid: 1892, ti=f5414000 task=f7e6e030 task.ti=f5414000) Stack: f788db78 c023ee95 f788db78 00000283 f5022000 f5415f88 c0234a28 f5022000 00000000 f7e6e030 c012d760 f5415f94 f5415f94 c052b7a0 00000000 00000000 00000000 f7e6e030 c012d760 f5415f94 f5415f94 f7014fe8 000000cc c1bc8550 Call Trace: [] gfs2_reclaim_glock+0x85/0xb0 [] gfs2_glockd+0xe8/0x110 [] autoremove_wake_function+0x0/0x60 [] autoremove_wake_function+0x0/0x60 [] gfs2_glockd+0x0/0x110 [] kthread+0xb7/0xc0 [] kthread+0x0/0xc0 [] kernel_thread_helper+0x7/0x10 ======================= Code: bf 00 00 00 00 83 ec 04 b8 01 00 00 00 8b 54 24 08 0f b3 42 08 c7 42 24 00 00 00 00 c7 42 28 00 00 00 00 89 14 24 e8 2a fe ff ff <0f> 0b e2 02 81 d4 3b c0 58 c3 55 57 56 31 f6 53 83 ec 10 8b 7c EIP: [] gfs2_glmutex_unlock+0x26/0x30 SS:ESP 0068:f5415f48 <0>------------[ cut here ]------------ kernel BUG at fs/gfs2/glock.c:738! 
invalid opcode: 0000 [#2] Modules linked in: CPU: 0 EIP: 0060:[] Not tainted VLI EFLAGS: 00010286 (2.6.19.1 #1) EIP is at gfs2_glmutex_unlock+0x26/0x30 eax: f5757e2c ebx: f5757de8 ecx: 00000001 edx: f5757e2c esi: f5757de8 edi: f5022000 ebp: 00000001 esp: f55b3f78 ds: 007b es: 007b ss: 0068 Process gfs2_scand (pid: 1891, ti=f55b2000 task=c19a8030 task.ti=f55b2000) Stack: f5757de8 c023ef23 f5757de8 0000090e f5022000 c0234900 fffffffc c023efc5 c023ef30 f5022000 0000090d f5022000 f5022000 c0234921 f5022000 f7813d7c c012d3a7 f5022000 f55b3fcc 00000000 00000001 ffffffff ffffffff c012d2f0 Call Trace: [] examine_bucket+0x63/0x70 [] gfs2_scand+0x0/0x40 [] gfs2_scand_internal+0x25/0x40 [] scan_glock+0x0/0x70 [] gfs2_scand+0x21/0x40 [] kthread+0xb7/0xc0 [] kthread+0x0/0xc0 [] kernel_thread_helper+0x7/0x10 ======================= Code: bf 00 00 00 00 83 ec 04 b8 01 00 00 00 8b 54 24 08 0f b3 42 08 c7 42 24 00 00 00 00 c7 42 28 00 00 00 00 89 14 24 e8 2a fe ff ff <0f> 0b e2 02 81 d4 3b c0 58 c3 55 57 56 31 f6 53 83 ec 10 8b 7c EIP: [] gfs2_glmutex_unlock+0x26/0x30 SS:ESP 0068:f55b3f78 [2] : <0>------------[ cut here ]------------ kernel BUG at fs/gfs2/log.c:74! invalid opcode: 0000 [#3] Modules linked in: CPU: 0 EIP: 0060:[] Not tainted VLI EFLAGS: 00010292 (2.6.19.1-alwaysdata #1) EIP is at gfs2_ail1_start_one+0xb/0x150 eax: f5532380 ebx: f5022000 ecx: f5022000 edx: 00000000 esi: f5532380 edi: 00000000 ebp: f502269c esp: f70cfcc4 ds: 007b es: 007b ss: 0068 Process tar (pid: 2032, ti=f70ce000 task=f7e6e030 task.ti=f70ce000) Stack: 000001f6 000001f7 f502263c 00000000 f70cfcd4 f70cfcd4 00000004 f5022000 00000000 00000000 f502269c c0243378 f5022000 f5532380 f5022000 00000000 f5532380 f502267c f5022000 00000002 00000125 f5022658 c0243732 f5022000 Call Trace: [] gfs2_ail1_start+0x68/0x120 [] gfs2_log_reserve+0x92/0x110 [] gfs2_glock_nq+0x4a/0xa0 [] gfs2_trans_begin+0xff/0x160 [] link_dinode+0xe1/0x230 [] gfs2_createi+0x268/0x300 [] gfs2_create+0x66/0x130 [] gfs2_createi+0x71/0x300 [] gfs2_glock_nq_num+0x78/0xa0 [] gfs2_create+0x0/0x130 [] vfs_create+0xa9/0x190 [] open_namei_create+0x60/0xb0 [] open_namei+0x64d/0x680 [] default_wake_function+0x0/0x20 [] do_filp_open+0x40/0x60 [] get_unused_fd+0x66/0xc0 [] do_sys_open+0x57/0xf0 [] sys_open+0x27/0x30 [] syscall_call+0x7/0xb ======================= Code: 89 c3 8d 44 08 ff f7 f3 8d 68 01 89 e8 8b 1c 24 8b 74 24 04 8b 7c 24 08 8b 6c 24 0c 83 c4 10 c3 55 57 56 53 83 ec 1c 8b 74 24 34 <0f> 0b 4a 00 29 d8 3b c0 8d 6e 0c 8d 76 00 8d bc 27 00 00 00 00 EIP: [] gfs2_ail1_start_one+0xb/0x150 SS:ESP 0068:f70cfcc4 -- Cyril B. excelleNCy From lhh at redhat.com Tue Jan 2 15:45:42 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 10:45:42 -0500 Subject: [Linux-cluster] Lazy umount - NFS HA In-Reply-To: <45897FCA.2080204@cesca.es> References: <45897FCA.2080204@cesca.es> Message-ID: <1167752742.26770.107.camel@rei.boston.devel.redhat.com> On Wed, 2006-12-20 at 19:24 +0100, Jordi Prats wrote: > Hi all, > It's normal that I must use a script to do a lazy umount (umount -l > /mountpoint) of a ext3 partition (not GFS) in a HA NFS cluster? > > Thanks, Don't do that. If umount is failing, there are things you can enable in the current code (RHCS4/5) to make it try to clean up the mountpoint harder. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 2 15:48:38 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 10:48:38 -0500 Subject: [Linux-cluster] Lazy umount - NFS HA In-Reply-To: <4589D02A.5010805@arts.usyd.edu.au> References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> Message-ID: <1167752918.26770.110.camel@rei.boston.devel.redhat.com> On Thu, 2006-12-21 at 11:07 +1100, Matthew Geier wrote: > Jordi Prats wrote: > > Hi all, > > It's normal that I must use a script to do a lazy umount (umount -l > > /mountpoint) of a ext3 partition (not GFS) in a HA NFS cluster? > > I'm having the same problem - the service won't shutdown cleanly as it > can't unmount the file systems - which it can't unmount due to some one > logging in with SSH and their home directory is on that volume. Try adding nfslock="1" to the tag. > I've also had an issue with the cluster manager not shutting down > nfsd, so a volume won't unmount 'cause an NFS client is still attached. > I think I might have nailed that one, but there is little I can do about > people with interactive logins. Not shutting down nfsd is expected behavior. RHCS can manage multiple NFS services at the same time - killing nfsd would kill all NFS services at the same time. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 2 15:49:07 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 10:49:07 -0500 Subject: [Linux-cluster] Lazy umount - NFS HA In-Reply-To: <4589D02A.5010805@arts.usyd.edu.au> References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> Message-ID: <1167752947.26770.112.camel@rei.boston.devel.redhat.com> On Thu, 2006-12-21 at 11:07 +1100, Matthew Geier wrote: > Jordi Prats wrote: > > Hi all, > > It's normal that I must use a script to do a lazy umount (umount -l > > /mountpoint) of a ext3 partition (not GFS) in a HA NFS cluster? > > I'm having the same problem - the service won't shutdown cleanly as it > can't unmount the file systems - which it can't unmount due to some one > logging in with SSH and their home directory is on that volume. Oh, and, don't forget to enable force-unmount of the file system. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 2 15:49:54 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 10:49:54 -0500 Subject: [Linux-cluster] Re: Lazy umount - NFS HA In-Reply-To: References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> Message-ID: <1167752994.26770.114.camel@rei.boston.devel.redhat.com> On Thu, 2006-12-21 at 09:44 -0800, Jonathan Biggar wrote: > > We got around this by writing a custom script that uses fuser to > identify and kill all processes that had open files on the filesystem. > The fs script does this if you enable force unmount. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 2 15:50:18 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 10:50:18 -0500 Subject: [Linux-cluster] Re: Lazy umount - NFS HA In-Reply-To: <458ACC39.8010005@cesca.es> References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> <458ACC39.8010005@cesca.es> Message-ID: <1167753019.26770.116.camel@rei.boston.devel.redhat.com> On Thu, 2006-12-21 at 19:02 +0100, Jordi Prats wrote: > Hi, > fuser (or lsof) does not show any process because we export the > filesystem with NFS (NFS is inside the kernel) It's probably a lock, then; try adding nfslock="1" to the tag. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 2 15:57:10 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 10:57:10 -0500 Subject: [Linux-cluster] Cluster issue In-Reply-To: <20061229155200.29844.qmail@web33203.mail.mud.yahoo.com> References: <20061229155200.29844.qmail@web33203.mail.mud.yahoo.com> Message-ID: <1167753430.26770.123.camel@rei.boston.devel.redhat.com> On Fri, 2006-12-29 at 07:52 -0800, Brian Pontz wrote: > > I ended up having to reboot both nodes. Any ideas on > what would cause this error? Are you up to date? I thought this was fixed in U4... -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From axehind007 at yahoo.com Tue Jan 2 16:14:29 2007 From: axehind007 at yahoo.com (Brian Pontz) Date: Tue, 2 Jan 2007 08:14:29 -0800 (PST) Subject: [Linux-cluster] Cluster issue In-Reply-To: <1167753430.26770.123.camel@rei.boston.devel.redhat.com> Message-ID: <960889.25622.qm@web33207.mail.mud.yahoo.com> > On Fri, 2006-12-29 at 07:52 -0800, Brian Pontz > wrote: > > > > I ended up having to reboot both nodes. Any ideas > on > > what would cause this error? > > Are you up to date? I thought this was fixed in > U4... At this point the 2 machines are running CentOS release 4.2. I guess we need to upgrade/update though. Do you have a bug id as to what this is related to so I can make sure it's been fixed in the latest release? If not, then no big deal... Thanks, Brian From jprats at cesca.es Tue Jan 2 16:31:14 2007 From: jprats at cesca.es (Jordi Prats) Date: Tue, 02 Jan 2007 17:31:14 +0100 Subject: [Linux-cluster] Re: Lazy umount - NFS HA In-Reply-To: <1167752994.26770.114.camel@rei.boston.devel.redhat.com> References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> <1167752994.26770.114.camel@rei.boston.devel.redhat.com> Message-ID: <459A88D2.7080204@cesca.es> Hi, I have this already enabled with the option force_unmount="1" on fs's tags, but it's still failing: There is any other option ? Thanks, Jordi -- ...................................................................... __ / / Jordi Prats C E / S / C A Dept. de Sistemes /_/ Centre de Supercomputaci? de Catalunya Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona T. 93 205 6464 ? F. 93 205 6979 ? jprats at cesca.es ...................................................................... 
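For reference, both suggestions in this thread map to attributes of the fs resource in cluster.conf. A minimal illustrative fragment (the resource name, device and mountpoint below are invented placeholders, not taken from the poster's configuration) that enables both would look like:

    <fs name="nfsdata" device="/dev/vg_cluster/lv_nfsdata"
        mountpoint="/export/data" fstype="ext3"
        force_unmount="1" nfslock="1"/>

force_unmount tells the fs agent to kill processes holding the mountpoint open before unmounting, while nfslock is aimed at the case described above, where the blocker is a kernel NFS lock that fuser/lsof cannot see.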
Lon Hohberger wrote: > On Thu, 2006-12-21 at 09:44 -0800, Jonathan Biggar wrote: > >> We got around this by writing a custom script that uses fuser to >> identify and kill all processes that had open files on the filesystem. >> >> > > The fs script does this if you enable force unmount. > > -- Lon > > ------------------------------------------------------------------------ > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -- ...................................................................... __ / / Jordi Prats C E / S / C A Dept. de Sistemes /_/ Centre de Supercomputaci? de Catalunya Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona T. 93 205 6464 ? F. 93 205 6979 ? jprats at cesca.es ...................................................................... From jprats at cesca.es Tue Jan 2 16:37:56 2007 From: jprats at cesca.es (Jordi Prats) Date: Tue, 02 Jan 2007 17:37:56 +0100 Subject: [Linux-cluster] Re: Lazy umount - NFS HA In-Reply-To: <1167753019.26770.116.camel@rei.boston.devel.redhat.com> References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> <458ACC39.8010005@cesca.es> <1167753019.26770.116.camel@rei.boston.devel.redhat.com> Message-ID: <459A8A64.8030507@cesca.es> Ok, I'll try this. Thank you! Jordi Lon Hohberger wrote: > On Thu, 2006-12-21 at 19:02 +0100, Jordi Prats wrote: > >> Hi, >> fuser (or lsof) does not show any process because we export the >> filesystem with NFS (NFS is inside the kernel) >> > > It's probably a lock, then; try adding nfslock="1" to the tag. > > -- Lon > > > ------------------------------------------------------------------------ > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -- ...................................................................... __ / / Jordi Prats C E / S / C A Dept. de Sistemes /_/ Centre de Supercomputaci? de Catalunya Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona T. 93 205 6464 ? F. 93 205 6979 ? jprats at cesca.es ...................................................................... From isplist at logicore.net Tue Jan 2 16:49:03 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 2 Jan 2007 10:49:03 -0600 Subject: [Linux-cluster] Logging with cluster In-Reply-To: <296568.98665.qm@web50601.mail.yahoo.com> Message-ID: <20071210493.264895@leena> Solution: Log Merging > to merge the log I used to use the 'mergerlog' command: > About: > mergelog is a small and fast C program, which merges > HTTP log files by date in 'Common Log Format' (Apache > default log format) from Web servers, behind > round-robin DNS. It has been designed to easily > process huge logs from highly stressed servers, and > can manage gzipped files. Thanks, this is one option. Mike From isplist at logicore.net Tue Jan 2 17:12:57 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 2 Jan 2007 11:12:57 -0600 Subject: [Linux-cluster] Logging with cluster In-Reply-To: <459567DD.7090205@dorm.org> Message-ID: <200712111257.852962@leena> Solution: preprocessing logs Great ideas, I'll look into some of these, thanks very much. > We have a cluster of 6 machines, some running Apache, some running MySQL. > We use shared logging successfully along with stats and post-processing > scripts. We also use plain-ol' logrotate with our shared logs. 
> > We use network-enabled syslog to capture logging on every node to a single, > master logging node (with fail-over, of course!) > > For Apache, we use custom ErrorLog, CustomLog, and RewriteLog directives > per vhost to pipe output to a custom script which greps a few undesirable > statements out prior to logging. > > Apache is sent to the local1 facility on the target syslog > machine that holds all of our logs, where it's configured > with something like: > > /etc/syslog.conf: > # Cluster Apache Logging > local1.err /var/log/shared-apache-err.log > local1.notice /var/log/shared-apache-access.log > local1.debug /var/log/shared-apache-rewrite.log > > > And, for example, all Apache nodes use the same config akin to: > > /path/to/http-vhost.conf: > > ErrorLog "|/path/to/logger.pl err some_string_ID" > CustomLog "|/path/to/logger.pl notice some_string_ID" > RewriteLog "|/path/to/logger.pl debug some_string_ID" > > > where logger.pl continually reads input, runs some filters > to determine if it should indeed log the particular message, > and then calls Sys::Syslog's "syslog()" function, and > "some_string_ID" is a tag to identify each message in > the shared log files. > > You could really use any line-by-line filtering program > here, but be aware that Apache executes the first argument > after the pipe symbol directly - it doesn't run a shell or > anything, so you don't have any expansion, piping of other > commands, etc. > > You can also use /usr/bin/logger (see "man logger") to > send output to various facilities (localN) and informational > levels (err, notice, debug, etc.). This does the same > thing as "logger.pl" above, but doesn't provide any > filtering. > > Also, we've seen syslog drop some messages under > heavy load (hence why we filter some Apache logging > prior to syslogging it). I don't know the exact > cause - maybe someone else can shed light on that for me! > > > Hope this helps - it's what we do and it seems to work > well enough for what we need. > > Regards, > -Brenton Rothchild From jon at levanta.com Tue Jan 2 17:12:48 2007 From: jon at levanta.com (Jonathan Biggar) Date: Tue, 02 Jan 2007 09:12:48 -0800 Subject: [Linux-cluster] Re: Lazy umount - NFS HA In-Reply-To: <1167752994.26770.114.camel@rei.boston.devel.redhat.com> References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> <1167752994.26770.114.camel@rei.boston.devel.redhat.com> Message-ID: Lon Hohberger wrote: > On Thu, 2006-12-21 at 09:44 -0800, Jonathan Biggar wrote: >> We got around this by writing a custom script that uses fuser to >> identify and kill all processes that had open files on the filesystem. >> > > The fs script does this if you enable force unmount. Thanks for the tip, but we don't use the fs service directly because our application dynamically mounts & unmounts filesystems, so we need finer control over the mounting & unmounting. 
-- Jonathan Biggar jon at levanta.com From lhh at redhat.com Tue Jan 2 21:46:01 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 16:46:01 -0500 Subject: [Linux-cluster] Re: Lazy umount - NFS HA In-Reply-To: References: <45897FCA.2080204@cesca.es> <4589D02A.5010805@arts.usyd.edu.au> <1167752994.26770.114.camel@rei.boston.devel.redhat.com> Message-ID: <1167774361.26770.146.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-02 at 09:12 -0800, Jonathan Biggar wrote: > Lon Hohberger wrote: > > On Thu, 2006-12-21 at 09:44 -0800, Jonathan Biggar wrote: > >> We got around this by writing a custom script that uses fuser to > >> identify and kill all processes that had open files on the filesystem. > >> > > > > The fs script does this if you enable force unmount. > > Thanks for the tip, but we don't use the fs service directly because our > application dynamically mounts & unmounts filesystems, so we need finer > control over the mounting & unmounting. > Good reason not to use it, then :) -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 2 21:48:46 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 02 Jan 2007 16:48:46 -0500 Subject: [Linux-cluster] Cluster issue In-Reply-To: <960889.25622.qm@web33207.mail.mud.yahoo.com> References: <960889.25622.qm@web33207.mail.mud.yahoo.com> Message-ID: <1167774526.26770.148.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-02 at 08:14 -0800, Brian Pontz wrote: > > On Fri, 2006-12-29 at 07:52 -0800, Brian Pontz > > wrote: > > > > > > I ended up having to reboot both nodes. Any ideas > > on > > > what would cause this error? > > > > Are you up to date? I thought this was fixed in > > U4... > > At this point the 2 machines are running CentOS > release 4.2. I guess we need to upgrade/update though. > Do you have a bug id as to what this is related to so > I can make sure it's been fixed in the latest release? > If not, then no big deal... I can find them if you want -- there's one fairly recent one which cropped up which isn't in any release (yet) but has been fixed in CVS. There are several lock-related fixes between 4.2 and 4.4; this could be one of several. Here's one of them: https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=193128 You'll need to update magma, magma-plugins, and rgmanager to the latest. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From otakurx at gmail.com Tue Jan 2 22:28:23 2007 From: otakurx at gmail.com (Michael Mitchell) Date: Tue, 2 Jan 2007 17:28:23 -0500 Subject: [Linux-cluster] Kernel issue when trying to mount Message-ID: <96fe693d0701021428r1682a993kc95d06454d4adb22@mail.gmail.com> When I go to mount the GFS drive I get the following Kernel error: Jan 2 16:51:51 perforce2 kernel: GFS: Trying to join cluster "lock_dlm", "media:perforce_gfs" Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: Joined cluster. Now mounting FS... Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: jid=0: Trying to acquire journal lock... Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: jid=0: Looking at journal... 
Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: jid=0: Done Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: jid=1: Trying to acquire journal lock... Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: jid=1: Looking at journal... Jan 2 16:51:53 perforce2 kernel: GFS: fsid=media:perforce_gfs.0: jid=1: Done Jan 2 16:51:53 perforce2 kernel: BUG: unable to handle kernel NULL pointer dereference at virtual address 00000024 Jan 2 16:51:53 perforce2 kernel: printing eip: Jan 2 16:51:53 perforce2 kernel: c0172db7 Jan 2 16:51:53 perforce2 kernel: *pde = 2e32b001 Jan 2 16:51:53 perforce2 kernel: Oops: 0000 [#1] Jan 2 16:51:53 perforce2 kernel: SMP Jan 2 16:51:53 perforce2 kernel: Modules linked in: iscsi_tcp libiscsi scsi_transport_iscsi crc32c libcrc32c lock_dlm dlm gfs lock_harness cman gnbd mptctl mptbase ipmi_devintf ipmi_si ipmi_msghandler dell_rbu parport_pc lp parport autofs4 i2c_dev i2c_core sunrpc dm_mirror dm_mod button battery asus_acpi ac ipv6 uhci_hcd ehci_hcd e1000 floppy sg megaraid_mbox megaraid_mm sd_mod Jan 2 16:51:53 perforce2 kernel: CPU: 2 Jan 2 16:51:53 perforce2 kernel: EIP: 0060:[] Tainted: GF VLI Jan 2 16:51:53 perforce2 kernel: EFLAGS: 00010293 (2.6.18.3.mitchell #2) Jan 2 16:51:53 perforce2 kernel: EIP is at do_add_mount+0x67/0xea Jan 2 16:51:53 perforce2 kernel: eax: 0000000c ebx: d87aba00 ecx: 00000000 edx: c676d440 Jan 2 16:51:53 perforce2 kernel: esi: d8697f38 edi: ffffffea ebp: 00000000 esp: d8697f08 Jan 2 16:51:53 perforce2 kernel: ds: 007b es: 007b ss: 0068 Jan 2 16:51:53 perforce2 kernel: Process mount (pid: 15304, ti=d8697000 task=f5390df0 task.ti=d8697000) Jan 2 16:51:53 perforce2 kernel: Stack: 00000000 00000000 00000000 d8697f38 00000000 c017334b 00000000 d3a04000 Jan 2 16:51:53 perforce2 kernel: 00000000 00000000 e92df000 d3a04000 d15933e8 c676d440 00000000 000200d0 Jan 2 16:51:53 perforce2 kernel: c0362780 00000001 00000001 00000000 c0142da5 00000044 00001000 f5390df0 Jan 2 16:51:53 perforce2 kernel: Call Trace: Jan 2 16:51:53 perforce2 kernel: [] do_mount+0x1af/0x1c7 Jan 2 16:51:53 perforce2 kernel: [] __alloc_pages+0x5e/0x284 Jan 2 16:51:53 perforce2 kernel: [] sys_mount+0x6f/0xa8 Jan 2 16:51:53 perforce2 kernel: [] sysenter_past_esp+0x56/0x79 Jan 2 16:51:53 perforce2 kernel: Code: f0 ff ff 8b 00 8b 80 58 04 00 00 39 42 64 75 7a 8b 43 14 66 bf f0 ff 39 42 14 75 07 8b 06 39 42 10 74 67 8b 43 10 bf ea ff ff ff <8b> 40 18 0f b7 40 28 25 00 f0 00 00 3d 00 a0 00 00 74 4c 8b 04 Jan 2 16:51:53 perforce2 kernel: EIP: [] do_add_mount+0x67/0xea SS:ESP 0068:d8697f08 [root at perforce2 ~]# Is there a patch for this? I am running kernel 2.6.18.3 -- Mike Mitchell otakurx at gmail.com (603) 706-0026 www.otaku-wired.net (offline) zatoichi.is-a-geek.org otakurx.blogspot.com (my Blog) -------------- next part -------------- An HTML attachment was scrubbed... URL: From marco.lusini at governo.it Wed Jan 3 11:35:57 2007 From: marco.lusini at governo.it (Marco Lusini) Date: Wed, 3 Jan 2007 12:35:57 +0100 Subject: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <96fe693d0701021428r1682a993kc95d06454d4adb22@mail.gmail.com> Message-ID: <016001c72f2b$57f3c080$8ec9100a@nicchio> Hi all, I have 3 2-node clusters, running just cluster suite, without gfs, each one updated with the latest packages released by RHN. In each cluster one of the two nodes has a steadily growing system CPU usage, which seems to be consumed by clurgmgrd and dlm_recvd. 
As an example here is the running time accumulated on one cluster since 20 december when oit was rebooted: [root at estestest ~]# ps axo pid,start,time,args PID STARTED TIME COMMAND ... 10221 Dec 20 10:37:05 clurgmgrd 11169 Dec 20 06:48:24 [dlm_recvd] ... [root at frascati ~]# ps axo pid,start,time,args PID STARTED TIME COMMAND ... 6226 Dec 20 00:04:17 clurgmgrd 8249 Dec 20 00:00:19 [dlm_recvd] ... I attach two graphs made with RRD which show that the system CPU usage is steadily growing: note how the trend changed after the reboot on 20 december. Of course as the system usage increases so does the system load and I am afraid of what will happen after 1-2 months of uptime... Does anybody else see this behaviour? Any hint on a solution? TIA, Marco Lusini _______________________________________________________ Messaggio analizzato e protetto da tecnologia antivirus Servizio erogato dal sistema informativo della Presidenza del Consiglio dei Ministri -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: estestest_monthly.jpg Type: image/jpeg Size: 35833 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: frascati_monthly.jpg Type: image/jpeg Size: 31823 bytes Desc: not available URL: From lhh at redhat.com Wed Jan 3 16:39:31 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 03 Jan 2007 11:39:31 -0500 Subject: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <016001c72f2b$57f3c080$8ec9100a@nicchio> References: <016001c72f2b$57f3c080$8ec9100a@nicchio> Message-ID: <1167842371.26770.162.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-03 at 12:35 +0100, Marco Lusini wrote: > Hi all, > > I have 3 2-node clusters, running just cluster suite, without gfs, > each one updated with the latest > packages released by RHN. > > In each cluster one of the two nodes has a steadily growing system CPU > usage, which seems > to be consumed by clurgmgrd and dlm_recvd. > As an example here is the running time accumulated on one cluster > since 20 december when > oit was rebooted: > > [root at estestest ~]# ps axo pid,start,time,args > PID STARTED TIME COMMAND > ... > 10221 Dec 20 10:37:05 clurgmgrd > 11169 Dec 20 06:48:24 [dlm_recvd] > ... > > [root at frascati ~]# ps axo pid,start,time,args > PID STARTED TIME COMMAND > ... > 6226 Dec 20 00:04:17 clurgmgrd > 8249 Dec 20 00:00:19 [dlm_recvd] > ... > > I attach two graphs made with RRD which show that the system CPU usage > is steadily growing: > note how the trend changed after the reboot on 20 december. > Of course as the system usage increases so does the system load and I > am afraid of what will > happen after 1-2 months of uptime... System load averages are the average of the number of processes on the run queue over the past 1, 5, and 15 minutes. It doesn't generally trend upwards over time; if that were the case, I'd be in trouble: ... 28204 15:11:11 01:04:19 /usr/lib/firefox-1.5.0.9/firefox-bin -UILocale en-US ... However, it is a little odd that you had 10 hours of runtime for clurgmgrd and over 6 for dlm_recvd. Just taking a wild guess, but it looks like the locks were all mastered on frascati. How many services are you running? Also, take a look at: https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634 The RPMs there might solve the problem with dlm_recvd. Rgmanager in some situations causes a strange leak of NL locks in the DLM. 
If dlm_recvd has to traverse lock lists and that list is ever-growing (total speculation here), it could explain the amount of consumed system time. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From danwest at comcast.net Thu Jan 4 04:12:28 2007 From: danwest at comcast.net (danwest) Date: Wed, 03 Jan 2007 23:12:28 -0500 Subject: [Linux-cluster] qdiskd eviction on missed writes Message-ID: <1167883949.6614.24.camel@belmont.site> It seems that we can get into situations where certain spike conditions will cause a node to evict another node based on missed writes to the qdisk. The problem is that during these spikes application access to the same storage back end does not seem to be impacted. The SAN in this case is a high end EMC DMX, multipathed, etc... Currently our clusters are set to interval="1" and tko="15" which should allow for at least 15 seconds (a very long time for this type of storage) In looking at ~/cluster/cman/qdisk/main.c it seems like the following is taking place: In quroum_loop {} 1) read everybody else's status (not sure if this includes yourself 2) check for node transitions (write eviction notice if number of heartbeats missed > tko) 3) check local heuristic (if we do not meet requirement remove from qdisk partition and possibly reboot) 4) Find master and/or determine new master, etc... 5) write out our status to qdisk 6) write out our local status (heuristics) 7) cycle ( sleep for defined interval). sleep() measured in seconds so complete cycle = interval + time for steps (1) through (6) Do you think that any delay in steps (1) through (4) could be the problem? From an architectural standpoint wouldn't it be better to have (6) and (7) as a separate thread or daemon? A kernel thread like cman_hbeat for example? Further in the check_transitions procedure case #2 it might be more helpful to clulog what actually caused this to trigger. The current logging is a bit generic. Am I totally off base or does this seem plausible? Thanks, Dan From simone.gotti at email.it Thu Jan 4 14:13:12 2007 From: simone.gotti at email.it (Simone Gotti) Date: Thu, 04 Jan 2007 15:13:12 +0100 Subject: [Linux-cluster] [PATCH] Fix cman_get_node_id in qdisk. Message-ID: <1167919992.11659.14.camel@localhost> Hi all, testing the qdiskd provided by the new openais cman-2.0.35-2 (on RHEL5 Beta 2) I found that it would no start with the following error: Could not determine local node ID; cannot start Looking at the code of the other programs that connects to cman I noticed that before the call libcman function: cman_get_node(cman_handle_t handle, int nodeid, cman_node_t *node), they are inizialing with all zeros the third argument, so I did the same with qdiskd and it worked. Looking at the cvs repository I didn't find a fix for it. A patch is attached. Thanks! Bye! -- Simone Gotti -- Email.it, the professional e-mail, gratis per te: http://www.email.it/f Sponsor: Refill srl il paradiso della tua stampante - cartucce e toner compatibili, inchiostri e accessori per la ricarica, carta speciale. Tutto a prezzi scontatissimi! Clicca qui: http://adv.email.it/cgi-bin/foclick.cgi?mid=5187&d=4-1 -------------- next part -------------- A non-text attachment was scrubbed... 
Name: cman-2.0.35-qdisk-cman_get_node-fix.patch Type: text/x-patch Size: 452 bytes Desc: not available URL: From pcaulfie at redhat.com Thu Jan 4 14:23:56 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Thu, 04 Jan 2007 14:23:56 +0000 Subject: [Linux-cluster] [PATCH] Fix cman_get_node_id in qdisk. In-Reply-To: <1167919992.11659.14.camel@localhost> References: <1167919992.11659.14.camel@localhost> Message-ID: <459D0DFC.1080609@redhat.com> Simone Gotti wrote: > Hi all, > > > testing the qdiskd provided by the new openais cman-2.0.35-2 (on RHEL5 > Beta 2) I found that it would no start with the following error: > > Could not determine local node ID; cannot start > > > Looking at the code of the other programs that connects to cman I > noticed that before the call libcman function: > cman_get_node(cman_handle_t handle, int nodeid, cman_node_t *node), > they are inizialing with all zeros the third argument, so I did the same > with qdiskd and it worked. > > Looking at the cvs repository I didn't find a fix for it. > A patch is attached. > Yes, that patch look correct to me. Thanks -- patrick From francisco_javier.pena at roche.com Thu Jan 4 14:42:05 2007 From: francisco_javier.pena at roche.com (Pena, Francisco Javier) Date: Thu, 4 Jan 2007 15:42:05 +0100 Subject: [Linux-cluster] Removing a node from a running cluster Message-ID: Hello, I am finding a strange cman behavior when removing a node from a running cluster. The starting point is: - 3 nodes running RHEL 4 U4, GFS 6.1 (1 vote per node) - Quorum disk (4 votes) I stop all cluster services on node 3, then modify the cluster.conf file to remove the node (and adjust the quorum disk votes to 3), and then "ccs_tool update" and "cman_tool version -r ". The cluster services keep running, however it looks like cman is not completely in sync with ccsd: # ccs_tool lsnode Cluster name: TestCluster, config_version: 9 Nodename Votes Nodeid Iface Fencetype gfsnode1 1 1 iLO_NODE1 gfsnode2 1 2 iLO_NODE2 # cman_tool nodes Node Votes Exp Sts Name 0 4 0 M /dev/emcpowera1 1 1 3 M gfsnode1 2 1 3 M gfsnode2 3 1 3 X gfsnode3 # cman_tool status Protocol version: 5.0.1 Config version: 9 Cluster name: TestCluster Cluster ID: 62260 Cluster Member: Yes Membership state: Cluster-Member Nodes: 2 Expected_votes: 3 Total_votes: 6 Quorum: 4 Active subsystems: 9 Node name: gfsnode1 Node ID: 1 Node addresses: A.B.C.D CMAN still thinks the third node is part of the cluster, but has just stopped working. In addition to that, it is not updating the number of votes for the quorum disk. If I completely restart the cluster services on all nodes, I get the right information: - Correct votes for the quorum disk - Third node dissappears - The Expected_votes value is now 2 I know from a previous post that two node clusters are a special case, even with quorum disk, but I am pretty sure the same problem will happen with higher node counts (I just do not have enough hardware to test it). So, is this considered as a bug or is it expected that the information from removed nodes is still there until the whole cluster is restarted? Thanks in advance, Javier Pe?a From Alain.Moulle at bull.net Thu Jan 4 14:48:43 2007 From: Alain.Moulle at bull.net (Alain Moulle) Date: Thu, 04 Jan 2007 15:48:43 +0100 Subject: [Linux-cluster] Re: CS4 U4 / problem when fencing (Marcos David) Message-ID: <459D13CB.3000909@bull.net> Hi Thanks for the patch. I've applied the patch, it seems that fencing is successful despite I always have the message Error with ccsd, is it normal ? 
Extract of syslog : ... Jan 4 16:17:32 s_sys at nova6 ccsd[16197]: Attempt to close an unopened CCS descriptor (870). Jan 4 16:17:32 s_sys at nova6 ccsd[16197]: Error while processing disconnect: Invalid request descriptor Jan 4 16:17:32 s_sys at nova6 fenced[16341]: fence "nova10" success Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: Magma Event: Membership Change Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: State change: nova10 DOWN Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: Taking over service testHA from down member (null) Jan 4 16:17:39 s_sys at nova6 clurgmgrd: [16355]: Executing /tmp/testHAmanage start Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: Service testHA started ... Alain Moull? > Hello, > I've had the same problem, the fence agent times out while trying to > fence a node. > There is a patch to solve this problem. > follow this link: > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=219633 > It explains the problem and has a fix. > Hope it helps. > Marcos David From jwhiter at redhat.com Thu Jan 4 14:59:07 2007 From: jwhiter at redhat.com (Josef Whiter) Date: Thu, 4 Jan 2007 09:59:07 -0500 Subject: [Linux-cluster] Re: CS4 U4 / problem when fencing (Marcos David) In-Reply-To: <459D13CB.3000909@bull.net> References: <459D13CB.3000909@bull.net> Message-ID: <20070104145906.GB5282@korben.rdu.redhat.com> Yes thats normal. What happens is the connection to magma is dropped because the fencing takes too long and then fenced goes to reuse the connection, and then that error pops up, and then fenced will re-create the connection and try again. So it is doing what its supposed to. Josef On Thu, Jan 04, 2007 at 03:48:43PM +0100, Alain Moulle wrote: > Hi > > Thanks for the patch. > I've applied the patch, it seems that fencing is successful despite > I always have the message Error with ccsd, is it normal ? > > Extract of syslog : > ... > Jan 4 16:17:32 s_sys at nova6 ccsd[16197]: Attempt to close an unopened CCS > descriptor (870). > Jan 4 16:17:32 s_sys at nova6 ccsd[16197]: Error while processing disconnect: > Invalid request descriptor > Jan 4 16:17:32 s_sys at nova6 fenced[16341]: fence "nova10" success > Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: Magma Event: Membership Change > Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: State change: nova10 DOWN > Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: Taking over service > testHA from down member (null) > Jan 4 16:17:39 s_sys at nova6 clurgmgrd: [16355]: Executing > /tmp/testHAmanage start > Jan 4 16:17:39 s_sys at nova6 clurgmgrd[16355]: Service testHA started > ... > > Alain Moull? > > > Hello, > > I've had the same problem, the fence agent times out while trying to > > fence a node. > > There is a patch to solve this problem. > > follow this link: > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=219633 > > It explains the problem and has a fix. > > Hope it helps. > > Marcos David > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From pcaulfie at redhat.com Thu Jan 4 15:00:43 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Thu, 04 Jan 2007 15:00:43 +0000 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: References: Message-ID: <459D169B.90201@redhat.com> Pena, Francisco Javier wrote: > Hello, > > I am finding a strange cman behavior when removing a node from a running cluster. 
The starting point is: > > - 3 nodes running RHEL 4 U4, GFS 6.1 (1 vote per node) > - Quorum disk (4 votes) > > I stop all cluster services on node 3, then modify the cluster.conf file to remove the node (and adjust the quorum disk votes to 3), and then "ccs_tool update" and "cman_tool version -r ". The cluster services keep running, however it looks like cman is not completely in sync with ccsd: > > # ccs_tool lsnode > > Cluster name: TestCluster, config_version: 9 > > Nodename Votes Nodeid Iface Fencetype > gfsnode1 1 1 iLO_NODE1 > gfsnode2 1 2 iLO_NODE2 > > > # cman_tool nodes > > Node Votes Exp Sts Name > 0 4 0 M /dev/emcpowera1 > 1 1 3 M gfsnode1 > 2 1 3 M gfsnode2 > 3 1 3 X gfsnode3 > > # cman_tool status > > Protocol version: 5.0.1 > Config version: 9 > Cluster name: TestCluster > Cluster ID: 62260 > Cluster Member: Yes > Membership state: Cluster-Member > Nodes: 2 > Expected_votes: 3 > Total_votes: 6 > Quorum: 4 > Active subsystems: 9 > Node name: gfsnode1 > Node ID: 1 > Node addresses: A.B.C.D > > CMAN still thinks the third node is part of the cluster, but has just stopped working. In addition to that, it is not updating the number of votes for the quorum disk. If I completely restart the cluster services on all nodes, I get the right information: > > - Correct votes for the quorum disk > - Third node dissappears > - The Expected_votes value is now 2 > I can't comment on the behaviour of the quorum disk, but cman is behaving as expected. A node is NEVER removed from the internal lists of cman while any node of the cluster is till active. It is completely harmless in that state, the node simply remains permanently dead and expected votes is adjusted accordingly. -- patrick From jparsons at redhat.com Thu Jan 4 15:55:14 2007 From: jparsons at redhat.com (Jim Parsons) Date: Thu, 04 Jan 2007 10:55:14 -0500 Subject: [Linux-cluster] Removing a node from a running cluster References: <459D169B.90201@redhat.com> Message-ID: <459D2362.40204@redhat.com> Patrick Caulfield wrote: >Pena, Francisco Javier wrote: > >>Hello, >> >>I am finding a strange cman behavior when removing a node from a running cluster. The starting point is: >> >>- 3 nodes running RHEL 4 U4, GFS 6.1 (1 vote per node) >>- Quorum disk (4 votes) >> >>I stop all cluster services on node 3, then modify the cluster.conf file to remove the node (and adjust the quorum disk votes to 3), and then "ccs_tool update" and "cman_tool version -r ". The cluster services keep running, however it looks like cman is not completely in sync with ccsd: >> >># ccs_tool lsnode >> >>Cluster name: TestCluster, config_version: 9 >> >>Nodename Votes Nodeid Iface Fencetype >>gfsnode1 1 1 iLO_NODE1 >>gfsnode2 1 2 iLO_NODE2 >> >> >># cman_tool nodes >> >>Node Votes Exp Sts Name >> 0 4 0 M /dev/emcpowera1 >> 1 1 3 M gfsnode1 >> 2 1 3 M gfsnode2 >> 3 1 3 X gfsnode3 >> >># cman_tool status >> >>Protocol version: 5.0.1 >>Config version: 9 >>Cluster name: TestCluster >>Cluster ID: 62260 >>Cluster Member: Yes >>Membership state: Cluster-Member >>Nodes: 2 >>Expected_votes: 3 >>Total_votes: 6 >>Quorum: 4 >>Active subsystems: 9 >>Node name: gfsnode1 >>Node ID: 1 >>Node addresses: A.B.C.D >> >>CMAN still thinks the third node is part of the cluster, but has just stopped working. In addition to that, it is not updating the number of votes for the quorum disk. 
If I completely restart the cluster services on all nodes, I get the right information: >> >>- Correct votes for the quorum disk >>- Third node dissappears >>- The Expected_votes value is now 2 >> > >I can't comment on the behaviour of the quorum disk, but cman is behaving as expected. A node is NEVER removed from the internal >lists of cman while any node of the cluster is till active. It is completely harmless in that state, the node simply remains >permanently dead and expected votes is adjusted accordingly. > > Patrick - isn't it also necessary to set a cman attribute for two-node='1' in the conf file? In order for cman to see this attribute, the entire cluster would need to be restarted. Regards, -Jim From pcaulfie at redhat.com Thu Jan 4 15:34:16 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Thu, 04 Jan 2007 15:34:16 +0000 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: <459D2362.40204@redhat.com> References: <459D169B.90201@redhat.com> <459D2362.40204@redhat.com> Message-ID: <459D1E78.6070602@redhat.com> Jim Parsons wrote: > Patrick Caulfield wrote: > >> Pena, Francisco Javier wrote: >> >>> Hello, >>> >>> I am finding a strange cman behavior when removing a node from a >>> running cluster. The starting point is: >>> >>> - 3 nodes running RHEL 4 U4, GFS 6.1 (1 vote per node) >>> - Quorum disk (4 votes) >>> >>> I stop all cluster services on node 3, then modify the cluster.conf >>> file to remove the node (and adjust the quorum disk votes to 3), and >>> then "ccs_tool update" and "cman_tool version -r ". The >>> cluster services keep running, however it looks like cman is not >>> completely in sync with ccsd: >>> >>> # ccs_tool lsnode >>> >>> Cluster name: TestCluster, config_version: 9 >>> >>> Nodename Votes Nodeid Iface Fencetype >>> gfsnode1 1 1 iLO_NODE1 >>> gfsnode2 1 2 iLO_NODE2 >>> >>> >>> # cman_tool nodes >>> >>> Node Votes Exp Sts Name >>> 0 4 0 M /dev/emcpowera1 >>> 1 1 3 M gfsnode1 >>> 2 1 3 M gfsnode2 >>> 3 1 3 X gfsnode3 >>> >>> # cman_tool status >>> >>> Protocol version: 5.0.1 >>> Config version: 9 >>> Cluster name: TestCluster >>> Cluster ID: 62260 >>> Cluster Member: Yes >>> Membership state: Cluster-Member >>> Nodes: 2 >>> Expected_votes: 3 >>> Total_votes: 6 >>> Quorum: 4 >>> Active subsystems: 9 >>> Node name: gfsnode1 >>> Node ID: 1 >>> Node addresses: A.B.C.D >>> >>> CMAN still thinks the third node is part of the cluster, but has just >>> stopped working. In addition to that, it is not updating the number >>> of votes for the quorum disk. If I completely restart the cluster >>> services on all nodes, I get the right information: >>> >>> - Correct votes for the quorum disk >>> - Third node dissappears >>> - The Expected_votes value is now 2 >>> >> >> I can't comment on the behaviour of the quorum disk, but cman is >> behaving as expected. A node is NEVER removed from the internal >> lists of cman while any node of the cluster is till active. It is >> completely harmless in that state, the node simply remains >> permanently dead and expected votes is adjusted accordingly. >> >> > Patrick - isn't it also necessary to set a cman attribute for > two-node='1' in the conf file? In order for cman to see this attribute, > the entire cluster would need to be restarted. > No, not if they're using a quorum disk. That flag is only needed for a two-node cluster where the quorum is set to one and the surviving node is determined by a fencing race. 
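For reference, the flag in question is the two_node attribute of the cman tag in cluster.conf. An illustrative sketch of the two configurations being contrasted (vote counts and device name are examples only, not a recommended setup):

    <!-- two-node cluster without a quorum disk: quorum drops to one vote,
         and the surviving node is decided by the fencing race -->
    <cman two_node="1" expected_votes="1"/>

    <!-- two-node cluster with a quorum disk: leave two_node unset and let
         the quorum disk vote break the tie -->
    <cman expected_votes="3"/>
    <quorumd interval="1" tko="15" votes="1" device="/dev/emcpowera1"/>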
-- patrick From jparsons at redhat.com Thu Jan 4 16:23:13 2007 From: jparsons at redhat.com (Jim Parsons) Date: Thu, 04 Jan 2007 11:23:13 -0500 Subject: [Linux-cluster] Removing a node from a running cluster References: <459D169B.90201@redhat.com> <459D2362.40204@redhat.com> <459D1E78.6070602@redhat.com> Message-ID: <459D29F1.2070205@redhat.com> Patrick Caulfield wrote: > >>> >>Patrick - isn't it also necessary to set a cman attribute for >>two-node='1' in the conf file? In order for cman to see this attribute, >>the entire cluster would need to be restarted. >> > >No, not if they're using a quorum disk. > >That flag is only needed for a two-node cluster where the quorum is set to one and the surviving node is determined by a fencing race. > Oh my - what are the implications of having that attr set when using a quorum disk? Nothing, I hope... -J From lhh at redhat.com Thu Jan 4 16:35:42 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 04 Jan 2007 11:35:42 -0500 Subject: [Linux-cluster] qdiskd eviction on missed writes In-Reply-To: <1167883949.6614.24.camel@belmont.site> References: <1167883949.6614.24.camel@belmont.site> Message-ID: <1167928542.26770.190.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-03 at 23:12 -0500, danwest wrote: > The SAN in this > case is a high end EMC DMX, multipathed, etc... Currently our clusters > are set to interval="1" and tko="15" which should allow for at least 15 > seconds (a very long time for this type of storage) "at max" 15 seconds. > In looking at ~/cluster/cman/qdisk/main.c it seems like the following is > taking place: > > In quroum_loop {} > > 1) read everybody else's status (not sure if this includes > yourself > 2) check for node transitions (write eviction notice if number > of heartbeats missed > tko) > 3) check local heuristic (if we do not meet requirement remove > from qdisk partition and possibly reboot) > 4) Find master and/or determine new master, etc... > 5) write out our status to qdisk > 6) write out our local status (heuristics) > 7) cycle ( sleep for defined interval). sleep() measured in > seconds so complete cycle = interval + time for steps (1) through (6) > > Do you think that any delay in steps (1) through (4) could be the > problem? From an architectural standpoint wouldn't it be better to have > (6) and (7) as a separate thread or daemon? A kernel thread like > cman_hbeat for example? The heuristics are checked in the background in a separate thread; the only thing that is checked is their states. Step 1 will take awhile (most of any part of qdiskd). However, steps 2-4 shouldn't. Making the read/write separate probably will (probably) not change much - it's all direct I/O. You basically said it yourself: on high end storage, this just shouldn't be a problem. We're doing a maddening 8k of reads and 0.5k of writes during a normal cycle every (in your case) 1 second. So, I suspect it's a scheduling problem. That is, it would probably be a whole lot more effective to just increase the priority of qdiskd so that it gets scheduled even during load spikes (E.g. use a realtime queue; SCHED_RR?). I don't think the I/O path is the bottleneck. > Further in the check_transitions procedure case #2 it might be more > helpful to clulog what actually caused this to trigger. The current > logging is a bit generic. You're totally right here; the logging isn't very great at the moment. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From pcaulfie at redhat.com Thu Jan 4 17:05:04 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Thu, 04 Jan 2007 17:05:04 +0000 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: <459D29F1.2070205@redhat.com> References: <459D169B.90201@redhat.com> <459D2362.40204@redhat.com> <459D1E78.6070602@redhat.com> <459D29F1.2070205@redhat.com> Message-ID: <459D33C0.9010504@redhat.com> Jim Parsons wrote: > Patrick Caulfield wrote: > >> >>>> >>> Patrick - isn't it also necessary to set a cman attribute for >>> two-node='1' in the conf file? In order for cman to see this attribute, >>> the entire cluster would need to be restarted. >>> >> >> No, not if they're using a quorum disk. >> >> That flag is only needed for a two-node cluster where the quorum is >> set to one and the surviving node is determined by a fencing race. >> > Oh my - what are the implications of having that attr set when using a > quorum disk? Nothing, I hope... Well, basically that flag allows a cluster to continue with a single vote. So it could be quite dangerous I suppose if the cluster splits and one node has the quorum disk and one doesn't. I'd need to check specific configurations but I wouldn't really recommend it... -- patrick From axehind007 at yahoo.com Thu Jan 4 19:46:54 2007 From: axehind007 at yahoo.com (Brian Pontz) Date: Thu, 4 Jan 2007 11:46:54 -0800 (PST) Subject: [Linux-cluster] Cluster issue In-Reply-To: <1167774526.26770.148.camel@rei.boston.devel.redhat.com> Message-ID: <20070104194654.36383.qmail@web33210.mail.mud.yahoo.com> So I tried upgrading a node in the cluster from CentOS 4.2. I did yum update yum sqlite python-sqlite yum upgrade reboot And now the node starts to come up and hangs after complaining about "lvm.static segfault" on line 504 in rc.sysinit Any ideas? Brian > I can find them if you want -- there's one fairly > recent one which > cropped up which isn't in any release (yet) but has > been fixed in CVS. > > There are several lock-related fixes between 4.2 and > 4.4; this could be > one of several. > > Here's one of them: > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=193128 > > You'll need to update magma, magma-plugins, and > rgmanager to the latest. > From lhh at redhat.com Thu Jan 4 22:19:21 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 04 Jan 2007 17:19:21 -0500 Subject: [Linux-cluster] qdiskd eviction on missed writes In-Reply-To: <1167928542.26770.190.camel@rei.boston.devel.redhat.com> References: <1167883949.6614.24.camel@belmont.site> <1167928542.26770.190.camel@rei.boston.devel.redhat.com> Message-ID: <1167949161.10215.5.camel@rei.boston.devel.redhat.com> On Thu, 2007-01-04 at 11:35 -0500, Lon Hohberger wrote: > > Making the read/write separate probably will (probably) not change much > - it's all direct I/O. You basically said it yourself: on high end > storage, this just shouldn't be a problem. We're doing a maddening 8k > of reads and 0.5k of writes during a normal cycle every (in your case) 1 > second. > > So, I suspect it's a scheduling problem. That is, it would probably be > a whole lot more effective to just increase the priority of qdiskd so > that it gets scheduled even during load spikes (E.g. use a realtime > queue; SCHED_RR?). I don't think the I/O path is the bottleneck. 
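A minimal sketch of the kind of change being suggested, not taken from the actual qdiskd sources: a daemon can put itself in the SCHED_RR real-time class (and lock its memory) so that ordinary load spikes do not keep it off the CPU long enough to miss its interval.

/* Illustrative only -- not the real qdiskd implementation. */
#include <sched.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>

/* Move the calling process into the SCHED_RR real-time class and pin its
 * pages in RAM, so neither CPU contention nor paging delays its I/O cycle.
 * Returns 0 on success, -1 on failure. */
static int make_realtime(int rr_priority)
{
	struct sched_param param;

	memset(&param, 0, sizeof(param));
	param.sched_priority = rr_priority;	/* 1..99 on Linux */

	if (sched_setscheduler(0, SCHED_RR, &param) < 0) {
		perror("sched_setscheduler");
		return -1;
	}
	if (mlockall(MCL_CURRENT | MCL_FUTURE) < 0) {
		perror("mlockall");
		return -1;
	}
	return 0;
}

int main(void)
{
	/* Needs root (or CAP_SYS_NICE and CAP_IPC_LOCK) to succeed. */
	if (make_realtime(1) == 0)
		printf("running with SCHED_RR priority 1\n");
	/* ... heartbeat loop would go here ... */
	return 0;
}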
I'll be working on a patch to allow you to turn on/off RT scheduling for qdiskd from the configuration file (as well as other qdisk-related bits) tomorrow and early next week -- would you like to test it when I get it ready? -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Thu Jan 4 22:25:11 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 04 Jan 2007 17:25:11 -0500 Subject: [Linux-cluster] Cluster issue In-Reply-To: <20070104194654.36383.qmail@web33210.mail.mud.yahoo.com> References: <20070104194654.36383.qmail@web33210.mail.mud.yahoo.com> Message-ID: <1167949511.10215.11.camel@rei.boston.devel.redhat.com> On Thu, 2007-01-04 at 11:46 -0800, Brian Pontz wrote: > So I tried upgrading a node in the cluster from CentOS > 4.2. > > I did > yum update yum sqlite python-sqlite > yum upgrade > reboot > > And now the node starts to come up and hangs after > complaining about "lvm.static segfault" on line 504 in > rc.sysinit > > Any ideas? Is it before or after it starts init? -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From axehind007 at yahoo.com Thu Jan 4 22:52:22 2007 From: axehind007 at yahoo.com (Brian Pontz) Date: Thu, 4 Jan 2007 14:52:22 -0800 (PST) Subject: [Linux-cluster] Cluster issue In-Reply-To: <1167949511.10215.11.camel@rei.boston.devel.redhat.com> Message-ID: <20070104225222.9881.qmail@web33206.mail.mud.yahoo.com> Nevermind about this. It finally passes this after hanging for a little bit. It's basically the same error as this person posted. http://lists.centos.org/pipermail/centos/2006-November/072155.html Brian --- Lon Hohberger wrote: > On Thu, 2007-01-04 at 11:46 -0800, Brian Pontz > wrote: > > So I tried upgrading a node in the cluster from > CentOS > > 4.2. > > > > I did > > yum update yum sqlite python-sqlite > > yum upgrade > > reboot > > > > And now the node starts to come up and hangs after > > complaining about "lvm.static segfault" on line > 504 in > > rc.sysinit > > > > Any ideas? > > Is it before or after it starts init? > > -- Lon > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From marco.lusini at governo.it Fri Jan 5 09:38:41 2007 From: marco.lusini at governo.it (Marco Lusini) Date: Fri, 5 Jan 2007 10:38:41 +0100 Subject: R: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <1167842371.26770.162.camel@rei.boston.devel.redhat.com> Message-ID: <002e01c730ad$4c6e4900$8ec9100a@nicchio> > > System load averages are the average of the number of > processes on the run queue over the past 1, 5, and 15 > minutes. It doesn't generally trend upwards over time; if > that were the case, I'd be in trouble: > I am in trouble, then :-) As I told in the first mail, as system (i.e. kernel) CPU usage grows so does the system load (1, 5, and 15 mins average). In order to better show what I see in my clusters, I am sending more graphs (on a yearly time base) that illustrate how system load trends upwards as kernel usage grows. Graphs were produced by CACTI probing the snmpd daemon running on the nodes. Again note how the trend swap from node to node on reboots. 
> > However, it is a little odd that you had 10 hours of runtime > for clurgmgrd and over 6 for dlm_recvd. Just taking a wild > guess, but it looks like the locks were all mastered on frascati. > How can I get more info on this? I checked /proc/cluster/dlm_locks on both nodes and it is empty. Here is the output of cat /proc/cluster/dlm_stats: [root at estestest ~]# cat /proc/cluster/dlm_stats DLM stats (HZ=1000) Lock operations: 1688738 Unlock operations: 838064 Convert operations: 0 Completion ASTs: 2526802 Blocking ASTs: 0 Lockqueue num waittime ave [root at frascati ~]# cat /proc/cluster/dlm_stats DLM stats (HZ=1000) Lock operations: 1122141 Unlock operations: 556623 Convert operations: 0 Completion ASTs: 1678764 Blocking ASTs: 0 Lockqueue num waittime ave WAIT_RSB 6 3 0 WAIT_GRANT 1122141 32507056 28 WAIT_UNLOCK 556623 316924 0 Total 1678770 32823983 19 > > How many services are you running? > At the moment I have 3 services on estestest (Sybase SQL server, a tomcat5 application and an apache web site) and 2 services on frascati (another tomcat5 application and Postgres SQL server). > Also, take a look at: > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634 > > The RPMs there might solve the problem with dlm_recvd. > Rgmanager in some situations causes a strange leak of NL > locks in the DLM. If dlm_recvd has to traverse lock lists > and that list is ever-growing (total speculation here), it > could explain the amount of consumed system time. > If I use those RPMs, will the patches be included in RHCS 4.5 (I think so, but just to be sure...)? Thanks, Marco _______________________________________________________ Messaggio analizzato e protetto da tecnologia antivirus Servizio erogato dal sistema informativo della Presidenza del Consiglio dei Ministri -------------- next part -------------- A non-text attachment was scrubbed... Name: frascati_yearly_CPU_Usage.jpg Type: image/jpeg Size: 29334 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: frascati_yearly_System_Load.jpg Type: image/jpeg Size: 27710 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: estestest_yearly_CPU_Usage.jpg Type: image/jpeg Size: 30518 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: estestest_yearly_System_Load.jpg Type: image/jpeg Size: 26103 bytes Desc: not available URL: From simone.gotti at email.it Fri Jan 5 10:11:27 2007 From: simone.gotti at email.it (Simone Gotti) Date: Fri, 05 Jan 2007 11:11:27 +0100 Subject: [Linux-cluster] [PATCH] wrong strings in quorum disk registration. Message-ID: <1167991887.3079.13.camel@localhost> Hi all, on the openais based cman-2.0.35-2.el5 the output of "cman_tool nodes" or "clustat" provides a wrong quorum device name: [root at nodo01 ~]# cman_tool nodes Node Sts Inc Joined Name 0 X 0 /dev/sdb1?????? 1 M 4 2007-01-05 13:03:18 nodo01 2 X 0 nodo02 [root at nodo01 ~]# clustat /dev/sdb1?????? not found realloc 924 Member Status: Quorate Member Name ID Status ------ ---- ---- ------ nodo01 1 Online, Local nodo02 2 Offline /dev/sdb1?????? 0 Online, Estranged Looking at the code look like the call to info_call in cman_register_quorum_device passed a too small by one "inlen" argument missing the ending \0 of the device name string. I attached a patch the should fix this, I hope it's correct. Thanks! Bye! 
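To make the off-by-one concrete (illustrative code only, with made-up names -- this is not the libcman source): whenever the receiving side is going to treat the buffer as a C string, the length handed to the transport has to include the terminating '\0', i.e. strlen()+1 rather than strlen().

#include <stdio.h>
#include <string.h>

/* stand-in for a "send inlen bytes of buf" style call */
static void send_payload(const char *buf, size_t inlen)
{
        printf("sending %zu bytes of \"%s\"\n", inlen, buf);
}

int main(void)
{
        const char *name = "/dev/sdb1";

        send_payload(name, strlen(name));      /* 9 bytes: terminator never sent, reader walks into garbage */
        send_payload(name, strlen(name) + 1);  /* 10 bytes: reader sees a proper C string                   */
        return 0;
}

That single missing byte is what shows up as the trailing garbage characters in the cman_tool and clustat output above.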
-- Simone Gotti -- Email.it, the professional e-mail, gratis per te: http://www.email.it/f Sponsor: Cassine di Pietra: una variet? completa di vini del Veneto, * in pi? un regalo per il primo ordine! Clicca subito qui * Clicca qui: http://adv.email.it/cgi-bin/foclick.cgi?mid=3925&d=5-1 -------------- next part -------------- A non-text attachment was scrubbed... Name: cman-2.0.35-libcman-cman_register_quorum_device-info_call.patch Type: text/x-patch Size: 600 bytes Desc: not available URL: From pcaulfie at redhat.com Fri Jan 5 10:13:16 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Fri, 05 Jan 2007 10:13:16 +0000 Subject: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <1167842371.26770.162.camel@rei.boston.devel.redhat.com> References: <016001c72f2b$57f3c080$8ec9100a@nicchio> <1167842371.26770.162.camel@rei.boston.devel.redhat.com> Message-ID: <459E24BC.7070301@redhat.com> Lon Hohberger wrote: > On Wed, 2007-01-03 at 12:35 +0100, Marco Lusini wrote: >> Hi all, >> >> I have 3 2-node clusters, running just cluster suite, without gfs, >> each one updated with the latest >> packages released by RHN. >> >> In each cluster one of the two nodes has a steadily growing system CPU >> usage, which seems >> to be consumed by clurgmgrd and dlm_recvd. >> As an example here is the running time accumulated on one cluster >> since 20 december when >> oit was rebooted: >> >> [root at estestest ~]# ps axo pid,start,time,args >> PID STARTED TIME COMMAND >> ... >> 10221 Dec 20 10:37:05 clurgmgrd >> 11169 Dec 20 06:48:24 [dlm_recvd] >> ... >> >> [root at frascati ~]# ps axo pid,start,time,args >> PID STARTED TIME COMMAND >> ... >> 6226 Dec 20 00:04:17 clurgmgrd >> 8249 Dec 20 00:00:19 [dlm_recvd] >> ... I suspect these two being at the top are related. If clurgmgrd is taking out locks then dlm_recvd will also be busy >> I attach two graphs made with RRD which show that the system CPU usage >> is steadily growing: >> note how the trend changed after the reboot on 20 december. > >> Of course as the system usage increases so does the system load and I >> am afraid of what will >> happen after 1-2 months of uptime... > > System load averages are the average of the number of processes on the > run queue over the past 1, 5, and 15 minutes. It doesn't generally > trend upwards over time; if that were the case, I'd be in trouble: > > ... > 28204 15:11:11 01:04:19 /usr/lib/firefox-1.5.0.9/firefox-bin -UILocale > en-US > ... > > However, it is a little odd that you had 10 hours of runtime for > clurgmgrd and over 6 for dlm_recvd. Just taking a wild guess, but it > looks like the locks were all mastered on frascati. > > How many services are you running? > > Also, take a look at: > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634 > > The RPMs there might solve the problem with dlm_recvd. Rgmanager in > some situations causes a strange leak of NL locks in the DLM. If > dlm_recvd has to traverse lock lists and that list is ever-growing > (total speculation here), it could explain the amount of consumed system > time. > Yes, DLM will do a lot of traversing lock lists if there are a lot of similar locks on one resource. VMS has an optimisation on this known as the group grant and concversion grant modes that we don't currently implement. > How can I get more info on this? I checked /proc/cluster/dlm_locks > on both nodes and it is empty. /proc/cluster/dlm_locks needs to be told which lockspace to use. Just catting that file after bootup will show nothing. 
What you need to do is to echo the lockspace name into that file, then look a it. You can get the lockspace names with the "cman_tool services" command so (eg) # cman_tool services Service Name GID LID State Code Fence Domain: "default" 1 2 run - [1 2] DLM Lock Space: "clvmd" 2 3 run - [1 2] # echo "clvmd" > /proc/cluster/dlm_locks # cat /proc/cluster/dlm_locks This shows locks held by clvmd. If you want to look at another lockspace just echo the other name into the /proc file. -- patrick From pcaulfie at redhat.com Fri Jan 5 10:31:55 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Fri, 05 Jan 2007 10:31:55 +0000 Subject: [Linux-cluster] [PATCH] wrong strings in quorum disk registration. In-Reply-To: <1167991887.3079.13.camel@localhost> References: <1167991887.3079.13.camel@localhost> Message-ID: <459E291B.5040101@redhat.com> Simone Gotti wrote: > Hi all, > > on the openais based cman-2.0.35-2.el5 the output of "cman_tool nodes" > or "clustat" provides a wrong quorum device name: > > [root at nodo01 ~]# cman_tool nodes > Node Sts Inc Joined Name > 0 X 0 /dev/sdb1?? > 1 M 4 2007-01-05 13:03:18 nodo01 > 2 X 0 nodo02 > > [root at nodo01 ~]# clustat > /dev/sdb1?? not found > realloc 924 > Member Status: Quorate > > Member Name ID Status > ------ ---- ---- ------ > nodo01 1 Online, Local > nodo02 2 Offline > /dev/sdb1?? 0 Online, Estranged > > > Looking at the code look like the call to info_call in > cman_register_quorum_device passed a too small by one "inlen" argument > missing the ending \0 of the device name string. > I attached a patch the should fix this, I hope it's correct. > Now committed to CVS Thank very much. -- patrick From pcaulfie at redhat.com Fri Jan 5 10:35:22 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Fri, 05 Jan 2007 10:35:22 +0000 Subject: [Linux-cluster] [PATCH] Fix cman_get_node_id in qdisk. In-Reply-To: <459D0DFC.1080609@redhat.com> References: <1167919992.11659.14.camel@localhost> <459D0DFC.1080609@redhat.com> Message-ID: <459E29EA.4060105@redhat.com> Patrick Caulfield wrote: > Simone Gotti wrote: >> Hi all, >> >> >> testing the qdiskd provided by the new openais cman-2.0.35-2 (on RHEL5 >> Beta 2) I found that it would no start with the following error: >> >> Could not determine local node ID; cannot start >> >> >> Looking at the code of the other programs that connects to cman I >> noticed that before the call libcman function: >> cman_get_node(cman_handle_t handle, int nodeid, cman_node_t *node), >> they are inizialing with all zeros the third argument, so I did the same >> with qdiskd and it worked. >> >> Looking at the cvs repository I didn't find a fix for it. >> A patch is attached. >> > > Yes, that patch look correct to me. > > Thanks Now in CVS. -- patrick From marco.lusini at governo.it Fri Jan 5 10:49:46 2007 From: marco.lusini at governo.it (Marco Lusini) Date: Fri, 5 Jan 2007 11:49:46 +0100 Subject: R: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <459E24BC.7070301@redhat.com> Message-ID: <004801c730b7$466cc3b0$8ec9100a@nicchio> Thanks Patrick, I have tried to get the locks for Magma on both nodes, and I get the same error of https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634: cat: /proc/cluster/dlm_locks: Cannot allocate memory I will try to install the RPMs from Lon if I can and see if it solve the problem... Marco > -----Messaggio originale----- > Da: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] Per conto di > Patrick Caulfield > Inviato: venerd? 
5 gennaio 2007 11.13 > A: linux clustering > Oggetto: Re: [Linux-cluster] High system CPU usage in one of > a two node cluster > > > Lon Hohberger wrote: > > On Wed, 2007-01-03 at 12:35 +0100, Marco Lusini wrote: > >> Hi all, > >> > >> I have 3 2-node clusters, running just cluster suite, without gfs, > >> each one updated with the latest packages released by RHN. > >> > >> In each cluster one of the two nodes has a steadily growing system > >> CPU usage, which seems to be consumed by clurgmgrd and dlm_recvd. > >> As an example here is the running time accumulated on one cluster > >> since 20 december when oit was rebooted: > >> > >> [root at estestest ~]# ps axo pid,start,time,args > >> PID STARTED TIME COMMAND > >> ... > >> 10221 Dec 20 10:37:05 clurgmgrd > >> 11169 Dec 20 06:48:24 [dlm_recvd] > >> ... > >> > >> [root at frascati ~]# ps axo pid,start,time,args > >> PID STARTED TIME COMMAND > >> ... > >> 6226 Dec 20 00:04:17 clurgmgrd > >> 8249 Dec 20 00:00:19 [dlm_recvd] > >> ... > > I suspect these two being at the top are related. If > clurgmgrd is taking out locks then dlm_recvd will also be busy > > >> I attach two graphs made with RRD which show that the system CPU > >> usage is steadily growing: > >> note how the trend changed after the reboot on 20 december. > > > >> Of course as the system usage increases so does the system > load and I > >> am afraid of what will happen after 1-2 months of uptime... > > > > System load averages are the average of the number of > processes on the > > run queue over the past 1, 5, and 15 minutes. It doesn't generally > > trend upwards over time; if that were the case, I'd be in trouble: > > > > ... > > 28204 15:11:11 01:04:19 > /usr/lib/firefox-1.5.0.9/firefox-bin -UILocale > > en-US ... > > > > However, it is a little odd that you had 10 hours of runtime for > > clurgmgrd and over 6 for dlm_recvd. Just taking a wild > guess, but it > > looks like the locks were all mastered on frascati. > > > > How many services are you running? > > > > Also, take a look at: > > > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634 > > > > The RPMs there might solve the problem with dlm_recvd. > Rgmanager in > > some situations causes a strange leak of NL locks in the DLM. If > > dlm_recvd has to traverse lock lists and that list is ever-growing > > (total speculation here), it could explain the amount of consumed > > system time. > > > > > Yes, DLM will do a lot of traversing lock lists if there are > a lot of similar locks on one resource. VMS has an > optimisation on this known as the group grant and concversion > grant modes that we don't currently implement. > > > > How can I get more info on this? I checked > /proc/cluster/dlm_locks on > > both nodes and it is empty. > > /proc/cluster/dlm_locks needs to be told which lockspace to > use. Just catting that file after bootup will show nothing. > What you need to do is to echo the lockspace name into that > file, then look a it. You can get the lockspace names with > the "cman_tool services" command so (eg) > > # cman_tool services > > Service Name GID LID > State Code > Fence Domain: "default" 1 2 run - > [1 2] > > DLM Lock Space: "clvmd" 2 3 run - > [1 2] > > # echo "clvmd" > /proc/cluster/dlm_locks # cat /proc/cluster/dlm_locks > > This shows locks held by clvmd. If you want to look at > another lockspace just echo the other name into the /proc file. 
> -- > > patrick > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > _______________________________________________________ > Messaggio analizzato e protetto da tecnologia antivirus > > Servizio erogato dal sistema informativo della Presidenza del > Consiglio dei Ministri From pcaulfie at redhat.com Fri Jan 5 10:54:18 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Fri, 05 Jan 2007 10:54:18 +0000 Subject: R: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <004801c730b7$466cc3b0$8ec9100a@nicchio> References: <004801c730b7$466cc3b0$8ec9100a@nicchio> Message-ID: <459E2E5A.1060304@redhat.com> Marco Lusini wrote: > Thanks Patrick, > > I have tried to get the locks for Magma on both nodes, > and I get the same error of > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634: > > cat: /proc/cluster/dlm_locks: Cannot allocate memory That shows that there is definitely a problem of too many locks there! > I will try to install the RPMs from Lon if I can and > see if it solve the problem... > I think it will, AFAIK Magma should not be allocating many locks, certainly not enough to cause a allocation overflow! -- patrick From marco.lusini at governo.it Fri Jan 5 11:04:31 2007 From: marco.lusini at governo.it (Marco Lusini) Date: Fri, 5 Jan 2007 12:04:31 +0100 Subject: R: R: [Linux-cluster] High system CPU usage in one of a two nodecluster In-Reply-To: <459E2E5A.1060304@redhat.com> Message-ID: <004901c730b9$5112d460$8ec9100a@nicchio> I was looking at Lon's RPMs, and they are (apparently) based on rgmanager 1.9.53-1, while the last released package is 1.9.54-1... Would it be possible to have fixed RPMs compiled wrt the last version? TIA, Marco > -----Messaggio originale----- > Da: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] Per conto di > Patrick Caulfield > Inviato: venerd? 5 gennaio 2007 11.54 > A: linux clustering > Oggetto: Re: R: [Linux-cluster] High system CPU usage in one > of a two nodecluster > > > Marco Lusini wrote: > > Thanks Patrick, > > > > I have tried to get the locks for Magma on both nodes, and > I get the > > same error of > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634: > > > > cat: /proc/cluster/dlm_locks: Cannot allocate memory > > That shows that there is definitely a problem of too many locks there! > > > > I will try to install the RPMs from Lon if I can and see if > it solve > > the problem... > > > > I think it will, AFAIK Magma should not be allocating many > locks, certainly not enough to cause a allocation overflow! > > -- > > patrick > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > _______________________________________________________ > Messaggio analizzato e protetto da tecnologia antivirus > > Servizio erogato dal sistema informativo della Presidenza del > Consiglio dei Ministri From pcaulfie at redhat.com Fri Jan 5 11:22:33 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Fri, 05 Jan 2007 11:22:33 +0000 Subject: R: R: [Linux-cluster] High system CPU usage in one of a two nodecluster In-Reply-To: <004901c730b9$5112d460$8ec9100a@nicchio> References: <004901c730b9$5112d460$8ec9100a@nicchio> Message-ID: <459E34F9.1060604@redhat.com> Marco Lusini wrote: > I was looking at Lon's RPMs, and they are (apparently) > based on rgmanager 1.9.53-1, while the last released > package is 1.9.54-1... 
> Would it be possible to have fixed RPMs compiled wrt the > last version? > I'm not up on rgmanager versions & RPMs so I'll leave that for Lon, but I reckon it's still worth your while trying that package or building from source with the patch if you can. -- patrick From marco.lusini at governo.it Fri Jan 5 11:39:47 2007 From: marco.lusini at governo.it (Marco Lusini) Date: Fri, 5 Jan 2007 12:39:47 +0100 Subject: R: R: R: [Linux-cluster] High system CPU usage in one of a two nodecluster In-Reply-To: <459E34F9.1060604@redhat.com> Message-ID: <004b01c730be$3d8da140$8ec9100a@nicchio> In the mean time I diff-ed rel 53 and rel 54, and the olny difference is a kill related to NFS locks (which I don't use), so I'll try to rebuild updated RPMs and will give them a try... I'll let you know of the results (it will take at least a week to be sure that the CPU kernel usage is not growing again...) Marco > -----Messaggio originale----- > Da: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] Per conto di > Patrick Caulfield > Inviato: venerd? 5 gennaio 2007 12.23 > A: linux clustering > Oggetto: Re: R: R: [Linux-cluster] High system CPU usage in > one of a two nodecluster > > > Marco Lusini wrote: > > I was looking at Lon's RPMs, and they are (apparently) based on > > rgmanager 1.9.53-1, while the last released package is 1.9.54-1... > > Would it be possible to have fixed RPMs compiled wrt the > last version? > > > > I'm not up on rgmanager versions & RPMs so I'll leave that > for Lon, but I reckon it's still worth your while trying that > package or building from source with the patch if you can. > > -- > > patrick > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > > _______________________________________________________ > Messaggio analizzato e protetto da tecnologia antivirus > > Servizio erogato dal sistema informativo della Presidenza del > Consiglio dei Ministri From simone.gotti at email.it Fri Jan 5 14:02:17 2007 From: simone.gotti at email.it (Simone Gotti) Date: Fri, 05 Jan 2007 15:02:17 +0100 Subject: [Linux-cluster] [PATCH] Fix fence_agent string not correctly sent over the cluster. Message-ID: <1168005737.6322.11.camel@localhost> Hi all, on the openais based cman-2.0.35-2.el5 I noticed that the output of "cman_tool nodes -f" provided a not correctly terminated fence agent name: [root at nodo01 ~]# cman_tool nodes -f Node Sts Inc Joined Name 1 M 4 2007-01-05 17:39:27 nodo01 2 X 0 nodo02 Last fenced: 2007-01-05 17:39:41 by fence-node02!? ^^ I think the problem is in the function do_cmd_update_fence_info in cman/daemon/commands.c that calculate the bytes needed by the message to send without counting the \0 terminating the fence_agent string. I found also another similar problem in another point of the file and I changed also it, but without testing. I made a little patch and I hope it's correct. Thanks! Bye! -- Simone Gotti -- Email.it, the professional e-mail, gratis per te: http://www.email.it/f Sponsor: Acquista i tuoi gioielli in tutta sicurezza ed a prezzi veramente imbattibili. Sfoglia il nostro catalogo on-line! Clicca qui: http://adv.email.it/cgi-bin/foclick.cgi?mid=5634&d=5-1 -------------- next part -------------- A non-text attachment was scrubbed... 
Name: cman-2.0.35-cman-do_cmd_update_fence_info-msg_size.patch Type: text/x-patch Size: 1038 bytes Desc: not available URL: From baesso at ksolutions.it Fri Jan 5 15:33:24 2007 From: baesso at ksolutions.it (Baesso Mirko) Date: Fri, 5 Jan 2007 16:33:24 +0100 Subject: [Linux-cluster] Kernel Bug Message-ID: <10DBC6018C67E94C961A7334501A2E6F4041B6@kmail.ksolutions.it> Hi, we received this error on our cluster node, could you please tell me how to debug? Thanks Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ Jan 5 14:35:02 clnfs2 kernel: kernel BUG at include/asm/spinlock.h:109! Jan 5 14:35:02 clnfs2 kernel: invalid operand: 0000 [#1] Jan 5 14:35:02 clnfs2 kernel: SMP Jan 5 14:35:02 clnfs2 kernel: Modules linked in: dlm(U) cman(U) nfsd exportfs lockd parport_pc lp parport autofs4 i2c_dev i2 c_core sunrpc dm_round_robin md5 ipv6 dm_multipath button battery ac uhci_hcd ehci_hcd hw_random shpchp e1000 tg3 bonding(U) sg dm_snapshot dm_zero dm_mirror ext3 jbd dm_mod ata_piix libata mptscsih mptbase sd_mod scsi_mod Jan 5 14:35:02 clnfs2 kernel: CPU: 1 Jan 5 14:35:02 clnfs2 kernel: EIP: 0060:[] Not tainted VLI Jan 5 14:35:02 clnfs2 kernel: EFLAGS: 00010002 (2.6.9-22.ELsmp) -------------- next part -------------- An HTML attachment was scrubbed... URL: From pcaulfie at redhat.com Fri Jan 5 15:44:23 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Fri, 05 Jan 2007 15:44:23 +0000 Subject: [Linux-cluster] Kernel Bug In-Reply-To: <10DBC6018C67E94C961A7334501A2E6F4041B6@kmail.ksolutions.it> References: <10DBC6018C67E94C961A7334501A2E6F4041B6@kmail.ksolutions.it> Message-ID: <459E7257.5090009@redhat.com> Baesso Mirko wrote: > Hi, > > we received this error on our cluster node, > > could you please tell me how to debug? > > Thanks > > > > Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ > > Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ > > Jan 5 14:35:02 clnfs2 kernel: kernel BUG at include/asm/spinlock.h:109! > > Jan 5 14:35:02 clnfs2 kernel: invalid operand: 0000 [#1] > > Jan 5 14:35:02 clnfs2 kernel: SMP > > Jan 5 14:35:02 clnfs2 kernel: Modules linked in: dlm(U) cman(U) nfsd > exportfs lockd parport_pc lp parport autofs4 i2c_dev i2 > > c_core sunrpc dm_round_robin md5 ipv6 dm_multipath button battery ac > uhci_hcd ehci_hcd hw_random shpchp e1000 tg3 bonding(U) > > sg dm_snapshot dm_zero dm_mirror ext3 jbd dm_mod ata_piix libata > mptscsih mptbase sd_mod scsi_mod > > Jan 5 14:35:02 clnfs2 kernel: CPU: 1 > > Jan 5 14:35:02 clnfs2 kernel: EIP: 0060:[] Not tainted VLI > > Jan 5 14:35:02 clnfs2 kernel: EFLAGS: 00010002 (2.6.9-22.ELsmp) > providing us with the whole of the kernel traceback would be a start ;-) -- patrick From baesso at ksolutions.it Fri Jan 5 15:56:08 2007 From: baesso at ksolutions.it (Baesso Mirko) Date: Fri, 5 Jan 2007 16:56:08 +0100 Subject: R: [Linux-cluster] Kernel Bug Message-ID: <10DBC6018C67E94C961A7334501A2E6F4041B7@kmail.ksolutions.it> Thanks for reply This is all kernel log I found on messages before server restarting ---------------------- Jan 5 14:35:02 clnfs2 kernel: Assertion failure in log_do_checkpoint() at fs/jbd/checkpoint.c:361: "drop_count != 0 || clean up_ret != 0" Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ Jan 5 14:35:02 clnfs2 kernel: kernel BUG at include/asm/spinlock.h:109! 
Jan 5 14:35:02 clnfs2 kernel: invalid operand: 0000 [#1] Jan 5 14:35:02 clnfs2 kernel: SMP Jan 5 14:35:02 clnfs2 kernel: Modules linked in: dlm(U) cman(U) nfsd exportfs lockd parport_pc lp parport autofs4 i2c_dev i2 c_core sunrpc dm_round_robin md5 ipv6 dm_multipath button battery ac uhci_hcd ehci_hcd hw_random shpchp e1000 tg3 bonding(U) sg dm_snapshot dm_zero dm_mirror ext3 jbd dm_mod ata_piix libata mptscsih mptbase sd_mod scsi_mod Jan 5 14:35:02 clnfs2 kernel: CPU: 1 Jan 5 14:35:02 clnfs2 kernel: EIP: 0060:[] Not tainted VLI Jan 5 14:35:02 clnfs2 kernel: EFLAGS: 00010002 (2.6.9-22.ELsmp) Jan 5 14:39:46 clnfs2 syslogd 1.4.1: restart. Jan 5 14:39:46 clnfs2 syslog: syslogd startup succeeded Jan 5 14:39:46 clnfs2 kernel: klogd 1.4.1, log source = /proc/kmsg started. Jan 5 14:39:46 clnfs2 kernel: Linux version 2.6.9-22.ELsmp (bhcompile at porky.build.redhat.com) (gcc version 3.4.4 20050721 (R ed Hat 3.4.4-2)) #1 SMP Mon Sep 19 18:32:14 EDT 2005 ------------------------------------- -----Messaggio originale----- Da: Patrick Caulfield [mailto:pcaulfie at redhat.com] Inviato: venerd? 5 gennaio 2007 16.44 A: linux clustering Oggetto: Re: [Linux-cluster] Kernel Bug Baesso Mirko wrote: > Hi, > > we received this error on our cluster node, > > could you please tell me how to debug? > > Thanks > > > > Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ > > Jan 5 14:35:02 clnfs2 kernel: ------------[ cut here ]------------ > > Jan 5 14:35:02 clnfs2 kernel: kernel BUG at include/asm/spinlock.h:109! > > Jan 5 14:35:02 clnfs2 kernel: invalid operand: 0000 [#1] > > Jan 5 14:35:02 clnfs2 kernel: SMP > > Jan 5 14:35:02 clnfs2 kernel: Modules linked in: dlm(U) cman(U) nfsd > exportfs lockd parport_pc lp parport autofs4 i2c_dev i2 > > c_core sunrpc dm_round_robin md5 ipv6 dm_multipath button battery ac > uhci_hcd ehci_hcd hw_random shpchp e1000 tg3 bonding(U) > > sg dm_snapshot dm_zero dm_mirror ext3 jbd dm_mod ata_piix libata > mptscsih mptbase sd_mod scsi_mod > > Jan 5 14:35:02 clnfs2 kernel: CPU: 1 > > Jan 5 14:35:02 clnfs2 kernel: EIP: 0060:[] Not tainted VLI > > Jan 5 14:35:02 clnfs2 kernel: EFLAGS: 00010002 (2.6.9-22.ELsmp) > providing us with the whole of the kernel traceback would be a start ;-) -- patrick -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster From Darrell.Frazier at crc.army.mil Fri Jan 5 16:13:59 2007 From: Darrell.Frazier at crc.army.mil (Frazier, Darrell USA CRC (Contractor)) Date: Fri, 5 Jan 2007 10:13:59 -0600 Subject: [Linux-cluster] How to turn off the cluster attribute of a local volume Message-ID: Hi, I have an issue I havent yet been able to find an answer to. I created a local volume on a cluster node to give it more swap space using command-line tools (pvcreate, vgcreate, lvcreate). Unbeknownst to me, at the time I created the volume, the clvmd subsystem was dead but locked. Anyway, now the system I have created the filesystem on thinks that the partition created is a clustered partition. I found this out using the vgs command I found in the cluster FAQ (you da man Bob Peterson) VG #PV #LV #SN Attr VSize VFree homevg 1 1 0 wz--n- 3.12G 1.12G optvg 1 1 0 wz--n- 7.84G 2.84G rootvg 1 1 0 wz--n- 3.12G 1.12G swapvg00 1 1 0 wz--n- 3.12G 1.12G swapvg01 1 1 0 wz--nc 9.32G 324.00M tmpvg 1 1 0 wz--n- 4.72G 1.69G u01vg 1 1 0 wz--n- 33.00G 12.00G u02vg 1 1 0 wz--nc 399.61G 0 usrvg 1 1 0 wz--n- 6.28G 2.28G varvg 1 1 0 wz--n- 6.28G 2.28G Though I would love to know how this happened. 
It is more important to me right now to know how to disable the clustering attribute on this partition. Thanx much in advance. Darrell J. Frazier Unix System Administrator US Army Combat Readiness Center CAUTION: This electronic transmission may contain information protected by deliberative process or other privilege, which is protected from disclosure under the Freedom of Information Act, 5 U.S.C. ? 552. The information is intended for the use of the individual or agency to which it was sent. If you are not the intended recipient, be aware that any disclosure, distribution or use of the contents of this information is prohibited. Do not release outside of DoD channels without prior authorization from the sender. The sender provides no assurance as to the integrity of the content of this electronic transmission after it has been sent and received by the intended email recipient. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rpeterso at redhat.com Fri Jan 5 16:45:36 2007 From: rpeterso at redhat.com (Robert Peterson) Date: Fri, 05 Jan 2007 10:45:36 -0600 Subject: [Linux-cluster] How to turn off the cluster attribute of a local volume In-Reply-To: References: Message-ID: <459E80B0.3030808@redhat.com> Frazier, Darrell USA CRC (Contractor) wrote: > > Hi, > > I have an issue I havent yet been able to find an answer to. I created > a local volume on a cluster node to give it more swap space using > command-line tools (pvcreate, vgcreate, lvcreate). Unbeknownst to me, > at the time I created the volume, the clvmd subsystem was dead but locked. > > Anyway, now the system I have created the filesystem on thinks that > the partition created is a clustered partition. I found this out using > the vgs command I found in the cluster FAQ (you da man Bob Peterson) > > VG #PV #LV #SN Attr VSize VFree > homevg 1 1 0 wz--n- 3.12G 1.12G > optvg 1 1 0 wz--n- 7.84G 2.84G > rootvg 1 1 0 wz--n- 3.12G 1.12G > swapvg00 1 1 0 wz--n- 3.12G 1.12G > / //swapvg01 1 1 0 wz--nc 9.32G 324.00M/ > tmpvg 1 1 0 wz--n- 4.72G 1.69G > u01vg 1 1 0 wz--n- 33.00G 12.00G > u02vg 1 1 0 wz--nc 399.61G 0 > usrvg 1 1 0 wz--n- 6.28G 2.28G > varvg 1 1 0 wz--n- 6.28G 2.28G > > Though I would love to know how this happened. It is more important to > me right now to know how to disable the clustering attribute on this > partition. Thanx much in advance. > > *Darrell J. Frazier* > Unix System Administrator > US Army Combat Readiness Center > *//* > Hi Darrell, Glad to be of service! What you want to disable the clustering bit is: vgchange -cn The answer isn't "exactly" in the faq, but you can find something close here: http://sources.redhat.com/cluster/faq.html#clvmd_clustered Regards, Bob Peterson Red Hat Cluster Suite From Darrell.Frazier at crc.army.mil Fri Jan 5 19:46:04 2007 From: Darrell.Frazier at crc.army.mil (Frazier, Darrell USA CRC (Contractor)) Date: Fri, 5 Jan 2007 13:46:04 -0600 Subject: [Linux-cluster] How to turn off the cluster attribute of a lo cal volume Message-ID: Thanks Bob, I will try that. I wonder how that bit got turned on using standard local lvm commands? Interesting. -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Robert Peterson Sent: Friday, January 05, 2007 10:46 AM To: linux clustering Subject: Re: [Linux-cluster] How to turn off the cluster attribute of a local volume Frazier, Darrell USA CRC (Contractor) wrote: > > Hi, > > I have an issue I havent yet been able to find an answer to. 
I created > a local volume on a cluster node to give it more swap space using > command-line tools (pvcreate, vgcreate, lvcreate). Unbeknownst to me, > at the time I created the volume, the clvmd subsystem was dead but locked. > > Anyway, now the system I have created the filesystem on thinks that > the partition created is a clustered partition. I found this out using > the vgs command I found in the cluster FAQ (you da man Bob Peterson) > > VG #PV #LV #SN Attr VSize VFree > homevg 1 1 0 wz--n- 3.12G 1.12G > optvg 1 1 0 wz--n- 7.84G 2.84G > rootvg 1 1 0 wz--n- 3.12G 1.12G > swapvg00 1 1 0 wz--n- 3.12G 1.12G > / //swapvg01 1 1 0 wz--nc 9.32G 324.00M/ > tmpvg 1 1 0 wz--n- 4.72G 1.69G > u01vg 1 1 0 wz--n- 33.00G 12.00G > u02vg 1 1 0 wz--nc 399.61G 0 > usrvg 1 1 0 wz--n- 6.28G 2.28G > varvg 1 1 0 wz--n- 6.28G 2.28G > > Though I would love to know how this happened. It is more important to > me right now to know how to disable the clustering attribute on this > partition. Thanx much in advance. > > *Darrell J. Frazier* > Unix System Administrator > US Army Combat Readiness Center > *//* > Hi Darrell, Glad to be of service! What you want to disable the clustering bit is: vgchange -cn The answer isn't "exactly" in the faq, but you can find something close here: http://sources.redhat.com/cluster/faq.html#clvmd_clustered Regards, Bob Peterson Red Hat Cluster Suite -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster -------------- next part -------------- An HTML attachment was scrubbed... URL: From Darrell.Frazier at crc.army.mil Fri Jan 5 20:16:20 2007 From: Darrell.Frazier at crc.army.mil (Frazier, Darrell USA CRC (Contractor)) Date: Fri, 5 Jan 2007 14:16:20 -0600 Subject: [Linux-cluster] FC Fabric fencing Message-ID: Hi, I am looking into adding another fence level to my two two-node clusters. Since our APC setup doesn't support powering off and on of specific outlets, I thought I would look to our Fiber channel fabric. Here are the details: 2 sets of 2 HP DL380 systems with dual port qlogic isp2422 FC cards 2 qlogic SANBox 5600 Fiber Channel switches 1 Compellent enclosure (12 TB) The way we currently have it set up is each FC card has one port going to one switch (We plan to do redundancy with the other port to the second switch) We are currently using HP ilo to fence the cluster. Since reading the interesting posts on this board regarding HP ilo, I thought I would add a fabric fence for even more redundancy. My questions are: Will the SANBox 2 fence device in RHCS support my switches? Also, how does fabric fencing work, since I have other systems besides the four systems connected to these switches, I am hoping the fence device can stop access to the SAN on a per-port basis. My thanx in advance for your replies... Darrell J. Frazier Unix System Administrator US Army Combat Readiness Center CAUTION: This electronic transmission may contain information protected by deliberative process or other privilege, which is protected from disclosure under the Freedom of Information Act, 5 U.S.C. ? 552. The information is intended for the use of the individual or agency to which it was sent. If you are not the intended recipient, be aware that any disclosure, distribution or use of the contents of this information is prohibited. Do not release outside of DoD channels without prior authorization from the sender. 
The sender provides no assurance as to the integrity of the content of this electronic transmission after it has been sent and received by the intended email recipient. -------------- next part -------------- An HTML attachment was scrubbed... URL: From lhh at redhat.com Fri Jan 5 22:03:23 2007 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 05 Jan 2007 17:03:23 -0500 Subject: [Linux-cluster] [PATCH] wrong strings in quorum disk registration. In-Reply-To: <1167991887.3079.13.camel@localhost> References: <1167991887.3079.13.camel@localhost> Message-ID: <1168034603.5634.0.camel@rei.boston.devel.redhat.com> On Fri, 2007-01-05 at 11:11 +0100, Simone Gotti wrote: > Member Status: Quorate > > Member Name ID Status > ------ ---- ---- ------ > nodo01 1 Online, Local > nodo02 2 Offline > /dev/sdb1?? 0 Online, Estranged > Nice! Heh, the fact that clustat reports /dev/sdb1 is ... a little weird, and actually a bug, but I'll leave it for now ;) -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Fri Jan 5 22:04:14 2007 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 05 Jan 2007 17:04:14 -0500 Subject: R: [Linux-cluster] High system CPU usage in one of a two node cluster In-Reply-To: <004801c730b7$466cc3b0$8ec9100a@nicchio> References: <004801c730b7$466cc3b0$8ec9100a@nicchio> Message-ID: <1168034654.5634.2.camel@rei.boston.devel.redhat.com> On Fri, 2007-01-05 at 11:49 +0100, Marco Lusini wrote: > Thanks Patrick, > > I have tried to get the locks for Magma on both nodes, > and I get the same error of > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=212634: > > cat: /proc/cluster/dlm_locks: Cannot allocate memory > > I will try to install the RPMs from Lon if I can and > see if it solve the problem... The RPMs in 212634 have solved that problem for several people :) -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Fri Jan 5 22:04:58 2007 From: lhh at redhat.com (Lon Hohberger) Date: Fri, 05 Jan 2007 17:04:58 -0500 Subject: R: R: [Linux-cluster] High system CPU usage in one of a two nodecluster In-Reply-To: <004901c730b9$5112d460$8ec9100a@nicchio> References: <004901c730b9$5112d460$8ec9100a@nicchio> Message-ID: <1168034698.5634.4.camel@rei.boston.devel.redhat.com> On Fri, 2007-01-05 at 12:04 +0100, Marco Lusini wrote: > I was looking at Lon's RPMs, and they are (apparently) > based on rgmanager 1.9.53-1, while the last released > package is 1.9.54-1... > Would it be possible to have fixed RPMs compiled wrt the > last version? use .53 for now; I'll build new ones on .54 on Monday. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From danwest at comcast.net Fri Jan 5 23:19:49 2007 From: danwest at comcast.net (danwest) Date: Fri, 05 Jan 2007 18:19:49 -0500 Subject: [Linux-cluster] qdiskd eviction on missed writes Message-ID: <1168039189.3646.2.camel@belmont.site> >On Thu, 2007-01-04 at 11:35 -0500, Lon Hohberger wrote: >I'll be working on a patch to allow you to turn on/off RT scheduling for >qdiskd from the configuration file (as well as other qdisk-related bits) >tomorrow and early next week -- would you like to test it when I get it >ready? >-- Lon sure, ready and willing to test as early as possible. Thanks, Dan From isplist at logicore.net Sun Jan 7 16:53:44 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Sun, 7 Jan 2007 10:53:44 -0600 Subject: [Linux-cluster] Multiple clusters Message-ID: <200717105344.765824@leena> In my case, I have separate GFS storage for web, mysql, mail and other parts of our network. All of the servers are in one single cluster but share their own areas. For example, web servers only share web storage areas, etc. My question is... is there value in spilling these things up? I have enough fencing hardware to split up each section and can see only one benefit in that if the cluster crashes, it does not take everything down, only that section. Yet, all sections function as one site so, not sure about the value in that. Any thoughts on this? Mike From isplist at logicore.net Sun Jan 7 17:36:30 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Sun, 7 Jan 2007 11:36:30 -0600 Subject: [Linux-cluster] Cluster.conf In-Reply-To: <1160576999.11134.45.camel@rei.boston.devel.redhat.com> Message-ID: <200717113630.820770@leena> >>> http://sources.redhat.com/cluster/doc/cluster_schema.html I've been looking at this and searching the net high and low and just can't seem to find enough information to build a proper cluster.conf file. I'm almost sure that it is the cause of some of the problems I am still suffering months into this cluster learning. For example, I've seen all sorts of uses for "method name" but have not found ONE single document showing/explaining all of the possible choices or why on each one. That goes for MANY other areas of this file. Is there not any documentation for building this file other than the above? It's just not enough for me at least. Mike From simone.gotti at email.it Sun Jan 7 19:29:54 2007 From: simone.gotti at email.it (Simone Gotti) Date: Sun, 07 Jan 2007 20:29:54 +0100 Subject: [Linux-cluster] qdiskd + cman: trying to fix the use of quorumdev_poll. Message-ID: <1168198194.4309.19.camel@localhost> Hi all, I'm using the openais based cman-2.0.35.el5 and I'm trying to understand how the quorum disk concept is implemented in rhcs, after various experiments I think that I found at least 2 problems: Problem 1) Little bug in the quorum disk polling mechanism: looking at the code in cman/daemon/commands.c the variable quorumdev_poll = 10000 is expressed in milliseconds and used to call "quorum_device_timer_fn" every quorumdev_poll interval to check if qdiskd is informing cman that the node can use the quorum votes. The same variable is then used in quorum_device_timer_fn, but here it's used as seconds: if (quorum_device->last_hello.tv_sec + quorumdev_poll < now.tv_sec) { so, when the qdisks dies, or the access to the quorum disk is lost it will take more than 2 hours to notify this and recalculate the quorum. 
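(For the record, the 2 hour figure falls straight out of the unit mix-up: a value meant to be 10000 ms ends up compared against seconds, so the timeout effectively becomes 10000 s = 10000/3600 h, roughly 2 hours 47 minutes before the lost hello is noticed.)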
After changing the line: ======================================================================== --- cman-2.0.35.orig/cman/daemon/commands.c 2007-01-07 21:01:30.000000000 +0100 +++ cman-2.0.35.patched/cman/daemon/commands.c 2007-01-05 18:12:33.000000000 +0100 @@ -1038,15 +1037,12 @@ static void ccsd_timer_fn(void *arg) static void quorum_device_timer_fn(void *arg) { struct timeval now; if (!quorum_device || quorum_device->state == NODESTATE_DEAD) return; gettimeofday(&now, NULL); - if (quorum_device->last_hello.tv_sec + quorumdev_poll < now.tv_sec) { + if (quorum_device->last_hello.tv_sec + quorumdev_poll/1000 < now.tv_sec) { quorum_device->state = NODESTATE_DEAD; log_msg(LOG_INFO, "lost contact with quorum device\n"); recalculate_quorum(0); ======================================================================== it worked. A more precise fix should be the use if tv_usec/1000 instead of tv_sec. Problem 2) After fixing Problem 1, if I set in the quorumd tag of cluster.conf an interval > quorumdev_poll/1000*2 the quorum is lost then regained over and over as the polling frequency of qdiskd is less than the polling one of cman. Probably the right thing to do is to calculate the value of quorumdev_poll from the ccs return value of "/cluster/quorumd/@interval" and quorumdev_poll=interval*1000*2 should be ok. What do you think about these problems? I'll be happy to fix them providing a full patch. Thanks. Bye! -- Simone Gotti -- Email.it, the professional e-mail, gratis per te: http://www.email.it/f Sponsor: Cerchi un gioiello per te o da regalare? Sfoglia il nostro catalogo on-line e non lasciarti sfuggire le numerose occasioni presenti! Clicca qui: http://adv.email.it/cgi-bin/foclick.cgi?mid=5631&d=7-1 From pcaulfie at redhat.com Mon Jan 8 10:17:11 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Mon, 08 Jan 2007 10:17:11 +0000 Subject: [Linux-cluster] [PATCH] Fix fence_agent string not correctly sent over the cluster. In-Reply-To: <1168005737.6322.11.camel@localhost> References: <1168005737.6322.11.camel@localhost> Message-ID: <45A21A27.50005@redhat.com> Simone Gotti wrote: > Hi all, > > on the openais based cman-2.0.35-2.el5 I noticed that the output of > "cman_tool nodes -f" provided a not correctly terminated fence agent > name: > > [root at nodo01 ~]# cman_tool nodes -f > Node Sts Inc Joined Name > 1 M 4 2007-01-05 17:39:27 nodo01 > 2 X 0 nodo02 > Last fenced: 2007-01-05 17:39:41 by fence-node02!? > ^^ > > I think the problem is in the function do_cmd_update_fence_info in > cman/daemon/commands.c that calculate the bytes needed by the message to > send without counting the \0 terminating the fence_agent string. > > I found also another similar problem in another point of the file and I > changed also it, but without testing. > > I made a little patch and I hope it's correct. > Another good patch, now in CVS :) Thank you very much. -- patrick From jparsons at redhat.com Mon Jan 8 13:31:57 2007 From: jparsons at redhat.com (James Parsons) Date: Mon, 08 Jan 2007 08:31:57 -0500 Subject: [Linux-cluster] Cluster.conf In-Reply-To: <200717113630.820770@leena> References: <200717113630.820770@leena> Message-ID: <45A247CD.4050708@redhat.com> isplist at logicore.net wrote: >>>>http://sources.redhat.com/cluster/doc/cluster_schema.html >>>> >>>> > >I've been looking at this and searching the net high and low and just can't >seem to find enough information to build a proper cluster.conf file. 
I'm >almost sure that it is the cause of some of the problems I am still suffering >months into this cluster learning. > >For example, I've seen all sorts of uses for "method name" but have not found >ONE single document showing/explaining all of the possible choices or why on >each one. > >That goes for MANY other areas of this file. Is there not any documentation >for building this file other than the above? It's just not enough for me at >least. > >Mike > > > Man cluster.conf. The method tag block denotes a fence group level. The only attribute that can be specified for a method tag block is a name attr. This just needs to be distinctive from all other method block names below a specific clusternode fence block. The name can be any string - it just does not matter. system-config-cluster generates unique interger values for each method block and converts them to strings in order to set method name attributes. A method block allows you to group one or more fence types at a specific level of fencing; for example, if you wish to employ power fencing as a first measure for a node, you would insert an initial method block within a clusternode's fence block, referencing power fence devices with instance specific attributes such as port numbers. Let's say that clusternode 'A' has dual power supplies...this is what the xml would look like: While the defauly action for every fence agent is to reboot, the 'option' atr is used above in the case of dual power supply nodes to insure both power supplies are off at the same time - making certain that the node is fenced. Now, let's say that you are paranoid, and that you do not trust your power switch 100%. You can add a second level of fencing (as additional insurance) like so: The above sets up a primary fence method in he first method block. If that block fails to fence the node, then the next block will be attempted. There is no limit that I know of for how many method blocks you wish to employ -- but 1 or 2 is the norm...anymore tends to suggest paranoid tendencies ;-) As additional information on this subject, the following is from the schema doc that is mentioned above: A Note On Fencing Fencing is specified within the cluster.conf file in two places. The first place is within the tag. Any device used for fencing a node must be defined here as a first. This applies to power switches (APC, WTI, etc.) with multiple ports that are able to fence multiple cluster nodes, as well as fabric switches and baseboard management fence strategies (iLO, RSA, IPMI, Drac, etc.) that are usually 1 to 1 in nature; that is, one specified fence device is able to fence only one node. After defining the fence devices to be used in the cluster, it is necessary to associate the fence device listings with specific cluster nodes. The second place that fencing is specified within cluster.conf is within the tag. Beneath the tag, is a tag. Beneath the tag is one or more tag sets. Within a tag set, is a tag set. This is where the actual association between and node takes place. A tag has a required "name" attribute that refers to the name of one of the 's specified in the section of cluster.conf. More about blocks: A method block is like a fence level. If a primary fence method is selected, yet the user wants to define a backup method in case the first fence method fails, this is done by defining two blocks for a cluster node, each with a unique name parameter. The fence daemon will call each fence method in the order they are specified under the tag set. 
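To put some flesh on that (the literal XML in the inline examples above did not survive the list archiver's scrubbing), here is an illustrative reconstruction of a two-level setup. It is a sketch, not a copy of the examples Jim posted: device names, addresses, logins and port numbers are made up and will differ for real hardware.

<fencedevices>
        <fencedevice name="apc1" agent="fence_apc" ipaddr="10.0.0.50" login="apc" passwd="apc"/>
        <fencedevice name="sanswitch1" agent="fence_sanbox2" ipaddr="10.0.0.60" login="admin" passwd="admin"/>
</fencedevices>

<clusternode name="nodeA" votes="1">
        <fence>
                <!-- level 1: power fencing, node has two power supplies on ports 1 and 2 -->
                <method name="1">
                        <device name="apc1" port="1" option="off"/>
                        <device name="apc1" port="2" option="off"/>
                        <device name="apc1" port="1" option="on"/>
                        <device name="apc1" port="2" option="on"/>
                </method>
                <!-- level 2: only tried if level 1 fails; cut the node's SAN port instead -->
                <method name="2">
                        <device name="sanswitch1" port="5"/>
                </method>
        </fence>
</clusternode>

fenced works through the <method> blocks in order; the dual-supply block lists both ports "off" before turning either back "on" so that the node is never left running on one live supply.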
Fence specification within cluster.conf offers one other feature for customizing fence action. Within a block, it is allowable to list more than one . This is useful when fencing a node with redundant power supplies, for example. The fence daemon will run the agent for each device listed within a block before determining success or failure. I hope this sets you up with all that you need. -J > >-- >Linux-cluster mailing list >Linux-cluster at redhat.com >https://www.redhat.com/mailman/listinfo/linux-cluster > > From leo.pleiman at raba.com Mon Jan 8 14:07:01 2007 From: leo.pleiman at raba.com (Leo J Pleiman) Date: Mon, 8 Jan 2007 09:07:01 -0500 Subject: R: R: [Linux-cluster] High system CPU usage in one of a two nodecluster In-Reply-To: <1168034698.5634.4.camel@rei.boston.devel.redhat.com> References: <004901c730b9$5112d460$8ec9100a@nicchio> <1168034698.5634.4.camel@rei.boston.devel.redhat.com> Message-ID: <20070108090701.xy6dau05og488ss4@www.raba.com> I have a 2 node cluster where the nodes seemed to pause for several second every couple minutes, paused in that an interactive session would freeze. I found that 2 of the 8 cpus on each node were 25-50% wait or system all the time. After applying this patch the problem appears to be gone. Thanks! I'm awaiting your 1.9.54 build. -- Leo J Pleiman, RHCE Principal Consultant RABA Technologies 301.763.3527 (office) 410.688.3873 (cell) Quoting Lon Hohberger : > On Fri, 2007-01-05 at 12:04 +0100, Marco Lusini wrote: >> I was looking at Lon's RPMs, and they are (apparently) >> based on rgmanager 1.9.53-1, while the last released >> package is 1.9.54-1... >> Would it be possible to have fixed RPMs compiled wrt the >> last version? > > use .53 for now; I'll build new ones on .54 on Monday. > > -- Lon > > From isplist at logicore.net Mon Jan 8 15:34:26 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Mon, 8 Jan 2007 09:34:26 -0600 Subject: [Linux-cluster] Cluster.conf In-Reply-To: <45A247CD.4050708@redhat.com> Message-ID: <20071893426.356753@leena> Not sure if thanks does it but this is very helpful and will now be part of my own documentation. Thank you! Mike >> I've been looking at this and searching the net high and low and just >> can't seem to find enough information to build a proper cluster.conf file. > Man cluster.conf. > > The method tag block denotes a fence group level. The only attribute > that can be specified for a method tag block is a name attr. This just > needs to be distinctive from all other method block names below a > specific clusternode fence block. The name can be any string - it just > does not matter. system-config-cluster generates unique interger values > for each method block and converts them to strings in order to set > method name attributes. > > A method block allows you to group one or more fence types at a specific > level of fencing; for example, if you wish to employ power fencing as a > first measure for a node, you would insert an initial method block > within a clusternode's fence block, referencing power fence devices with > instance specific attributes such as port numbers. Let's say that > clusternode 'A' has dual power supplies...this is what the xml would > look like: > > > > > > > > > > > > > While the defauly action for every fence agent is to reboot, the > 'option' atr is used above in the case of dual power supply nodes to > insure both power supplies are off at the same time - making certain > that the node is fenced. 
> > Now, let's say that you are paranoid, and that you do not trust your > power switch 100%. You can add a second level of fencing (as additional > insurance) like so: > > > > > > > > > > > > > > > > The above sets up a primary fence method in he first method block. If > that block fails to fence the node, then the next block will be > attempted. There is no limit that I know of for how many method blocks > you wish to employ -- but 1 or 2 is the norm...anymore tends to suggest > paranoid tendencies ;-) > > As additional information on this subject, the following is from the > schema doc that is mentioned above: > > > A Note On Fencing > > > Fencing is specified within the cluster.conf file in two places. The first > place is within the tag. Any device used for fencing a node > must be defined here as a first. This applies to power > switches (APC, WTI, etc.) with multiple ports that are able to fence > multiple > cluster nodes, as well as fabric switches and baseboard management fence > strategies (iLO, RSA, IPMI, Drac, etc.) that are usually 1 to 1 in nature; > that is, one specified fence device is able to fence only one node. > > After defining the fence devices to be used in the cluster, it is necessary > to > associate the fence device listings with specific cluster nodes. The second > place that fencing is specified within cluster.conf is within the > > tag. Beneath the tag, is a tag. Beneath the > tag is > one or more tag sets. Within a tag set, is a tag > set. > This is where the actual association between and node takes > place. > A tag has a required "name" attribute that refers to the name of > one > of the 's specified in the section of > cluster.conf. > > More about blocks: A method block is like a fence level. If a > primary fence method is selected, yet the user wants to define a backup > method > in case the first fence method fails, this is done by defining two > blocks for a cluster node, each with a unique name parameter. The fence > daemon > will call each fence method in the order they are specified under the > tag set. > > Fence specification within cluster.conf offers one other feature for > customizing fence action. Within a block, it is allowable to list > more than one . This is useful when fencing a node with redundant > power supplies, for example. The fence daemon will run the agent for each > device listed within a block before determining success or failure. > > I hope this sets you up with all that you need. > > -J > >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster From lhh at redhat.com Mon Jan 8 15:43:26 2007 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 08 Jan 2007 10:43:26 -0500 Subject: [Linux-cluster] qdiskd + cman: trying to fix the use of quorumdev_poll. In-Reply-To: <1168198194.4309.19.camel@localhost> References: <1168198194.4309.19.camel@localhost> Message-ID: <1168271006.15369.22.camel@rei.boston.devel.redhat.com> On Sun, 2007-01-07 at 20:29 +0100, Simone Gotti wrote: > Problem 2) > > After fixing Problem 1, if I set in the quorumd tag of cluster.conf an > interval > quorumdev_poll/1000*2 the quorum is lost then regained over > and over as the polling frequency of qdiskd is less than the polling one > of cman. > Probably the right thing to do is to calculate the value of > quorumdev_poll from the ccs return value of "/cluster/quorumd/@interval" > and quorumdev_poll=interval*1000*2 should be ok. 
I think the poll rate should be closer to (interval * tko * 1000) [10 seconds by default] - and not a function of just the quorum disk interval. This is because after (interval*tko*1000), the master node of the cluster will write an eviction message to a hung node - and that's when qdiskd will either reboot the node or tell CMAN that its votes are no longer valid. I do not think it will cause any problems per se, but dropping qdiskd's votes after ~2 seconds when the qdisk master won't write an eviction notice for another ~8 seconds seems a bit odd. Normal node failure delay should be >= 2*(i*t*1000). There's a parameter in the tag (which defaults to 5,000ms) - which should be 2 * interval * tko * 1000, but I don't recall what it is right now. qdiskd needs to time out before CMAN does. While it doesn't have to be "half or less", it's a good paranoia factor that's easy to remember, and it gives the node plenty of time. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From graeme.crawford at gmail.com Mon Jan 8 18:20:36 2007 From: graeme.crawford at gmail.com (Graeme Crawford) Date: Mon, 8 Jan 2007 20:20:36 +0200 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: References: Message-ID: <326f0a380701081020j16c6366r7e970cc1b76fca04@mail.gmail.com> Next time, run "cman_tool leave" it has a few pre-req's so check the man page. Then a "cman_tool expected vote_num" should sort out your quorum/votes. graeme. On 1/4/07, Pena, Francisco Javier wrote: > Hello, > > I am finding a strange cman behavior when removing a node from a running cluster. The starting point is: > > - 3 nodes running RHEL 4 U4, GFS 6.1 (1 vote per node) > - Quorum disk (4 votes) > > I stop all cluster services on node 3, then modify the cluster.conf file to remove the node (and adjust the quorum disk votes to 3), and then "ccs_tool update" and "cman_tool version -r ". The cluster services keep running, however it looks like cman is not completely in sync with ccsd: > > # ccs_tool lsnode > > Cluster name: TestCluster, config_version: 9 > > Nodename Votes Nodeid Iface Fencetype > gfsnode1 1 1 iLO_NODE1 > gfsnode2 1 2 iLO_NODE2 > > > # cman_tool nodes > > Node Votes Exp Sts Name > 0 4 0 M /dev/emcpowera1 > 1 1 3 M gfsnode1 > 2 1 3 M gfsnode2 > 3 1 3 X gfsnode3 > > # cman_tool status > > Protocol version: 5.0.1 > Config version: 9 > Cluster name: TestCluster > Cluster ID: 62260 > Cluster Member: Yes > Membership state: Cluster-Member > Nodes: 2 > Expected_votes: 3 > Total_votes: 6 > Quorum: 4 > Active subsystems: 9 > Node name: gfsnode1 > Node ID: 1 > Node addresses: A.B.C.D > > CMAN still thinks the third node is part of the cluster, but has just stopped working. In addition to that, it is not updating the number of votes for the quorum disk. If I completely restart the cluster services on all nodes, I get the right information: > > - Correct votes for the quorum disk > - Third node dissappears > - The Expected_votes value is now 2 > > I know from a previous post that two node clusters are a special case, even with quorum disk, but I am pretty sure the same problem will happen with higher node counts (I just do not have enough hardware to test it). > > So, is this considered as a bug or is it expected that the information from removed nodes is still there until the whole cluster is restarted? 
> > Thanks in advance, > > Javier Pe?a > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From isplist at logicore.net Mon Jan 8 18:31:01 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Mon, 8 Jan 2007 12:31:01 -0600 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: <326f0a380701081020j16c6366r7e970cc1b76fca04@mail.gmail.com> Message-ID: <20071812311.543597@leena> > Next time, run "cman_tool leave" it has a few pre-req's so check the man > page. I have these problems also, trying to shut down the cluster, I get; cman_tool leave; cman_tool: Can't leave cluster while there are 6 active subsystems Mike From lshen at cisco.com Mon Jan 8 18:39:07 2007 From: lshen at cisco.com (Lin Shen (lshen)) Date: Mon, 8 Jan 2007 10:39:07 -0800 Subject: [Linux-cluster] Remove the clusterness from GFS Message-ID: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> All we need is a cluster file system to aggregate local disks attached to different nodes into a shared storage pool. GFS+GNBD fits in our requirement nicely except the cluster suite that comes with it. We really don't need/want to turn our system into a cluster by using GFS since we're not very clear about what are the side effects that would bring in. Would it slow down the system more, take up more memory and affect the system bootup and shutdown sequencies etc? How easy is it to remove some or all of the clusterness from GFS such as fencing, cman and ccsd stuff? I understand that things like dlm must stay for GFS to work. Thanks lin From lhh at redhat.com Mon Jan 8 18:50:03 2007 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 08 Jan 2007 13:50:03 -0500 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> References: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> Message-ID: <1168282204.15369.43.camel@rei.boston.devel.redhat.com> On Mon, 2007-01-08 at 10:39 -0800, Lin Shen (lshen) wrote: > How easy is it to > remove some or all of the clusterness from GFS such as fencing, cman and > ccsd stuff? I understand that things like dlm must stay for GFS to work. I would think it is very difficult. You can use GFS on *one* node without a cluster. In order to use a clustered file system, you need a cluster. The cluster acts as the control mechanism for accessing the file system. Without it, each computer accessing GFS will have no knowledge of when it is safe to write to or read from the file system. This will lead to file system corruption very quickly. If you absolutely can not have a bit of "cluster software running", you'll probably need to use a client/server approach like NFS instead of a cluster file system like GFS. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From isplist at logicore.net Mon Jan 8 18:52:59 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Mon, 8 Jan 2007 12:52:59 -0600 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: <326f0a380701081020j16c6366r7e970cc1b76fca04@mail.gmail.com> Message-ID: <200718125259.606965@leena> Fixed my shut down problems so anyone else having issues, here's how it works. Man for cman_tool says; // leave; Tells CMAN to leave the cluster. 
You cannot do this if there are subsystems (eg DLM, GFS) active. You should dismount all GFS filesystems, shutdown CLVM, fenced and anything else using the cluster manager before using cman_tool leave. Look at cman_tool status|services to see how many (and which) services are running. \\ Answers all :). Mike From jparsons at redhat.com Mon Jan 8 19:04:45 2007 From: jparsons at redhat.com (James Parsons) Date: Mon, 08 Jan 2007 14:04:45 -0500 Subject: [Linux-cluster] Removing a node from a running cluster In-Reply-To: <200718125259.606965@leena> References: <200718125259.606965@leena> Message-ID: <45A295CD.5030009@redhat.com> isplist at logicore.net wrote: >Fixed my shut down problems so anyone else having issues, here's how it works. > >Man for cman_tool says; > >// >leave; > >Tells CMAN to leave the cluster. You cannot do this if there are subsystems >(eg DLM, GFS) active. >You should dismount all GFS filesystems, shutdown CLVM, fenced and anything >else using the cluster manager before using cman_tool leave. Look at >cman_tool status|services to see how many (and which) services are running. >\\ > >Answers all :). > WARNING: Shameless Promotion -- Conga does all of these things for you in a browser window...there is a dropdown menu on the node page that offers the user the option to have a node leave or join a cluster, completely delete a node, reboot a node, or use the fence subsystem to fence a node. With one mouse click and a confirmation dialog, all necessary services are checked and shut down for you and the node is removed/deleted/etc. When you add a new node, you enter the ipaddr/hostname for the new node, and then all necessary packages are yummed and installed, all necessary services started, and a new configuration file reflecting the new node is propagated. What if you add a node to a two-node cluster that does not use quorum disk, you ask? Conga removes the two_node=1 attr from the <cman> tag and reminds you that the cluster needs to be restarted...and provides a link to the appropriate cluster page where one mouse click and a confirmation dialog will restart the whole cluster. -J From lshen at cisco.com Mon Jan 8 19:26:11 2007 From: lshen at cisco.com (Lin Shen (lshen)) Date: Mon, 8 Jan 2007 11:26:11 -0800 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <1168282204.15369.43.camel@rei.boston.devel.redhat.com> Message-ID: <08A9A3213527A6428774900A80DBD8D80336076E@xmb-sjc-222.amer.cisco.com> > -----Original Message----- > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger > Sent: Monday, January 08, 2007 10:50 AM > To: linux clustering > Subject: Re: [Linux-cluster] Remove the clusterness from GFS > > On Mon, 2007-01-08 at 10:39 -0800, Lin Shen (lshen) wrote: > > How easy is it to > > remove some or all of the clusterness from GFS such as > fencing, cman > > and ccsd stuff? I understand that things like dlm must stay > for GFS to work. > > I would think it is very difficult. > > You can use GFS on *one* node without a cluster. > > In order to use a clustered file system, you need a cluster. > The cluster acts as the control mechanism for accessing the > file system. > Without it, each computer accessing GFS will have no > knowledge of when it is safe to write to or read from the > file system. This will lead to file system corruption very quickly. I thought that's the duty of DLM.
> > If you absolutely can not have a bit of "cluster software > running", you'll probably need to use a client/server > approach like NFS instead of a cluster file system like GFS. How about Luster? It's a cluster file system, but seems to me it doesn't require much extra cluster software. Thanks Lin > > -- Lon > From isplist at logicore.net Mon Jan 8 19:35:53 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Mon, 8 Jan 2007 13:35:53 -0600 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <326f0a380701081020j16c6366r7e970cc1b76fca04@mail.gmail.com> Message-ID: <200718133553.024528@leena> Ok, confusion again... why does this work on one node but not another. They are identical nodes in every way. # more stop_gfs /etc/init.d/httpd stop umount /var/www vgchange -aln /etc/init.d/clvmd stop fence_tool leave /etc/init.d/fenced stop cman_tool leave killall ccsd On some nodes, I'm still getting; cman_tool: Can't leave cluster while there are 1 active subsystems Mike >Fixed my shut down problems so anyone else having issues, here's how >it works. >Man for cman_tool says; >// >leave; >Tells CMAN to leave the cluster. You cannot do this if there are subsystems >(eg DLM, GFS) active. >You should dismount all GFS filesystems, shutdown CLVM, fenced and anything >else using the cluster manager before using cman_tool leave. Look at >cman_tool status|services to see how many (and which) services are >running. >\\ >Answers all :). >Mike From lhh at redhat.com Mon Jan 8 22:45:45 2007 From: lhh at redhat.com (Lon Hohberger) Date: Mon, 08 Jan 2007 17:45:45 -0500 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <08A9A3213527A6428774900A80DBD8D80336076E@xmb-sjc-222.amer.cisco.com> References: <08A9A3213527A6428774900A80DBD8D80336076E@xmb-sjc-222.amer.cisco.com> Message-ID: <1168296345.15369.144.camel@rei.boston.devel.redhat.com> On Mon, 2007-01-08 at 11:26 -0800, Lin Shen (lshen) wrote: > > > > If you absolutely can not have a bit of "cluster software > > running", you'll probably need to use a client/server > > approach like NFS instead of a cluster file system like GFS. > > How about Luster? It's a cluster file system, but seems to me it doesn't > require much extra cluster software. Lustre clients do not need to be cluster aware. (Neither do NFS clients.) If you are willing to sacrifice fault tolerance, you can run Lustre without a cluster stack. If you want fault tolerance, you have to go get a third-party cluster stack, like heartbeat (or linux-cluster; but no one's done it AFAIK), to provide the failover. OSS/OST locations are stored in a replicated LDAP database, which you must set up as well. As a side note, I think HP was working on building a (non-Free) metadata server cluster product for Lustre: http://h20311.www2.hp.com/HPC/cache/276636-0-0-0-121.html GFS has no concept of "client" and "server". If you mount a GFS volume, you need to be part of that file system's cluster. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From jvantuyl at engineyard.com Tue Jan 9 07:25:36 2007 From: jvantuyl at engineyard.com (Jayson Vantuyl) Date: Tue, 9 Jan 2007 01:25:36 -0600 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> References: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> Message-ID: <325D4B40-0CC0-42C8-B96D-D5EAE5BBBC8C@engineyard.com> On Jan 8, 2007, at 12:39 PM, Lin Shen (lshen) wrote: > All we need is a cluster file system to aggregate local disks attached > to different nodes into a shared storage pool. GFS+GNBD fits in our > requirement nicely except the cluster suite that comes with it. We > really don't need/want to turn our system into a cluster by using GFS > since we're not very clear about what are the side effects that would > bring in. Would it slow down the system more, take up more memory and > affect the system bootup and shutdown sequencies etc? How easy is > it to > remove some or all of the clusterness from GFS such as fencing, > cman and > ccsd stuff? I understand that things like dlm must stay for GFS to > work. Dlm must know the nodes in the cluster. It most know when they are there. That's CMAN. It also must have all of the configuration to support knowing that. That's CCSd. GFS must be able to handle a node failure of any kind. That's fencing. Asking to run GFS without CMAN, fencing, and CCSd is like asking to run PHPmyadmin without Apache, PHP, or MySQL. If you aren't sharing the data between two hosts simultaneously, you might try ReiserFS/XFS with CLVM. CLVM still requires the CMAN stack but it doesn't introduce some of the more exciting failure behavior that GFS can. -- Jayson Vantuyl Systems Architect Engine Yard jvantuyl at engineyard.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaap at sara.nl Tue Jan 9 08:42:40 2007 From: jaap at sara.nl (Jaap Dijkshoorn) Date: Tue, 9 Jan 2007 09:42:40 +0100 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <200718133553.024528@leena> Message-ID: <339554D0FE9DD94A8E5ACE4403676CEB01DA2541@douwes.ka.sara.nl> MIke, > Ok, confusion again... why does this work on one node but not > another. They > are identical nodes in every way. > > # more stop_gfs > > /etc/init.d/httpd stop > umount /var/www > vgchange -aln > /etc/init.d/clvmd stop > fence_tool leave > /etc/init.d/fenced stop > cman_tool leave > killall ccsd > > On some nodes, I'm still getting; > > cman_tool: Can't leave cluster while there are 1 active subsystems > > Mike You should check with cman_tool services(said below), which services are still running/updating/joining etc. It sometimes happen that a service cant be shutdown nicely. You can try to kill daemons by hand with a soft/hard kill. Met vriendelijke groet, Kind Regards, Jaap P. Dijkshoorn Group Leader Cluster Computing Systems Programmer mailto:jaap at sara.nl http://home.sara.nl/~jaapd SARA Computing & Networking Services Kruislaan 415 1098 SJ Amsterdam Tel: +31-(0)20-5923000 Fax: +31-(0)20-6683167 http://www.sara.nl > > > > >Fixed my shut down problems so anyone else having issues, here's how > >it works. > > >Man for cman_tool says; > > >// > >leave; > > >Tells CMAN to leave the cluster. You cannot do this if there > are subsystems > >(eg DLM, GFS) active. 
> >You should dismount all GFS filesystems, shutdown CLVM, > fenced and anything > >else using the cluster manager before using cman_tool leave. > Look at > >cman_tool status|services to see how many (and which) > services are > >running. > >\\ > > >Answers all :). > > >Mike > > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/x-pkcs7-signature Size: 3199 bytes Desc: not available URL: From ramon at vanalteren.nl Tue Jan 9 12:32:07 2007 From: ramon at vanalteren.nl (Ramon van Alteren) Date: Tue, 09 Jan 2007 13:32:07 +0100 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <200718133553.024528@leena> References: <200718133553.024528@leena> Message-ID: <45A38B47.1060407@vanalteren.nl> isplist at logicore.net wrote: > Ok, confusion again... why does this work on one node but not another. They > are identical nodes in every way. > > # more stop_gfs > > /etc/init.d/httpd stop > umount /var/www > vgchange -aln > /etc/init.d/clvmd stop > fence_tool leave > /etc/init.d/fenced stop > cman_tool leave > killall ccsd > > On some nodes, I'm still getting; > > cman_tool: Can't leave cluster while there are 1 active subsystems > > Check with cman_tool services One that bit me before is *thinking* I unmounted a gfs system but it failed because I was still running nfs which exported one of the gfs filesystems. Ramon From pcaulfie at redhat.com Tue Jan 9 13:34:47 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Tue, 09 Jan 2007 13:34:47 +0000 Subject: [Linux-cluster] qdiskd + cman: trying to fix the use of quorumdev_poll. In-Reply-To: <1168198194.4309.19.camel@localhost> References: <1168198194.4309.19.camel@localhost> Message-ID: <45A399F7.8070008@redhat.com> Simone Gotti wrote: > Hi all, > > I'm using the openais based cman-2.0.35.el5 and I'm trying to understand > how the quorum disk concept is implemented in rhcs, after various > experiments I think that I found at least 2 problems: > > Problem 1) > > Little bug in the quorum disk polling mechanism: > > looking at the code in cman/daemon/commands.c the variable > quorumdev_poll = 10000 is expressed in milliseconds and used to call > "quorum_device_timer_fn" every quorumdev_poll interval to check if > qdiskd is informing cman that the node can use the quorum votes. > > The same variable is then used in quorum_device_timer_fn, but here it's > used as seconds: > > if (quorum_device->last_hello.tv_sec + quorumdev_poll < now.tv_sec) { > > so, when the qdisks dies, or the access to the quorum disk is lost it > will take more than 2 hours to notify this and recalculate the quorum. 
> > After changing the line: > ======================================================================== > --- cman-2.0.35.orig/cman/daemon/commands.c 2007-01-07 > 21:01:30.000000000 +0100 > +++ cman-2.0.35.patched/cman/daemon/commands.c 2007-01-05 > 18:12:33.000000000 +0100 > @@ -1038,15 +1037,12 @@ static void ccsd_timer_fn(void *arg) > > static void quorum_device_timer_fn(void *arg) > { > struct timeval now; > if (!quorum_device || quorum_device->state == NODESTATE_DEAD) > return; > > gettimeofday(&now, NULL); > - if (quorum_device->last_hello.tv_sec + quorumdev_poll < > now.tv_sec) { > + if (quorum_device->last_hello.tv_sec + quorumdev_poll/1000 < > now.tv_sec) { > quorum_device->state = NODESTATE_DEAD; > log_msg(LOG_INFO, "lost contact with quorum device\n"); > recalculate_quorum(0); > ======================================================================== > Thanks. I've committed that version for now. > it worked. A more precise fix should be the use if tv_usec/1000 instead > of tv_sec. True, it needs to take both into account. For the sake of time I've left the granularity at seconds. -- patrick From pcaulfie at redhat.com Tue Jan 9 13:35:59 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Tue, 09 Jan 2007 13:35:59 +0000 Subject: [Linux-cluster] qdiskd + cman: trying to fix the use of quorumdev_poll. In-Reply-To: <1168271006.15369.22.camel@rei.boston.devel.redhat.com> References: <1168198194.4309.19.camel@localhost> <1168271006.15369.22.camel@rei.boston.devel.redhat.com> Message-ID: <45A39A3F.7060208@redhat.com> Lon Hohberger wrote: > On Sun, 2007-01-07 at 20:29 +0100, Simone Gotti wrote: >> Problem 2) >> >> After fixing Problem 1, if I set in the quorumd tag of cluster.conf an >> interval > quorumdev_poll/1000*2 the quorum is lost then regained over >> and over as the polling frequency of qdiskd is less than the polling one >> of cman. >> Probably the right thing to do is to calculate the value of >> quorumdev_poll from the ccs return value of "/cluster/quorumd/@interval" >> and quorumdev_poll=interval*1000*2 should be ok. > > I think the poll rate should be closer to (interval * tko * 1000) [10 > seconds by default] - and not a function of just the quorum disk > interval. > > This is because after (interval*tko*1000), the master node of the > cluster will write an eviction message to a hung node - and that's when > qdiskd will either reboot the node or tell CMAN that its votes are no > longer valid. > > I do not think it will cause any problems per se, but dropping qdiskd's > votes after ~2 seconds when the qdisk master won't write an eviction > notice for another ~8 seconds seems a bit odd. > > Normal node failure delay should be >= 2*(i*t*1000). There's a > parameter in the tag (which defaults to 5,000ms) - which should > be 2 * interval * tko * 1000, but I don't recall what it is right now. > > qdiskd needs to time out before CMAN does. While it doesn't have to be > "half or less", it's a good paranoia factor that's easy to remember, and > it gives the node plenty of time. lon: do you reckon we need a blocker bug for "problem 1)" ? -- patrick From isplist at logicore.net Tue Jan 9 15:45:43 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 9 Jan 2007 09:45:43 -0600 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <339554D0FE9DD94A8E5ACE4403676CEB01DA2541@douwes.ka.sara.nl> Message-ID: <20071994543.553454@leena> Hi there, This is my current shutdown script which works on some servers, not on others. 
/etc/init.d/httpd stop umount /var/www vgchange -aln /etc/init.d/clvmd stop fence_tool leave /etc/init.d/fenced stop cman_tool leave killall ccsd I run it... Deactivating VG VolGroup01: [ OK ] Deactivating VG VolGroup02: [ OK ] Deactivating VG VolGroup03: [ OK ] Deactivating VG VolGroup04: [ OK ] Stopping clvm: [ OK ] Stopping fence domain: [ OK ] cman_tool: Can't leave cluster while there are 4 active subsystems # cman_tool services Service Name GID LID State Code User: "usrm::manager" 13 6 run - [2 3 4 6 5 7 8] What's usrm::manager? I can't seem to find anything on the redhat site and online searches lead to endless 'stuff'. I'm guessing what ever this is, it's the problem? Mike > You should check with cman_tool services(said below), which services are > still running/updating/joining etc. It sometimes happen that a service > cant be shutdown nicely. You can try to kill daemons by hand with a > soft/hard kill. > > > Met vriendelijke groet, Kind Regards, > > Jaap P. Dijkshoorn > Group Leader Cluster Computing > Systems Programmer > mailto:jaap at sara.nl http://home.sara.nl/~jaapd > > SARA Computing & Networking Services > Kruislaan 415 1098 SJ Amsterdam > Tel: +31-(0)20-5923000 > Fax: +31-(0)20-6683167 > http://www.sara.nl From chawkins at bplinux.com Tue Jan 9 15:55:51 2007 From: chawkins at bplinux.com (Christopher Hawkins) Date: Tue, 9 Jan 2007 10:55:51 -0500 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <20071994543.553454@leena> Message-ID: <200701091527.l09FRFAA002785@mail2.ontariocreditcorp.com> Are you unmounting the GFS filesystem first? That should be the first thing in your script... > -----Original Message----- > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of > isplist at logicore.net > Sent: Tuesday, January 09, 2007 10:46 AM > To: linux clustering > Subject: RE: [Linux-cluster] Can't leave cluster > > Hi there, > > This is my current shutdown script which works on some > servers, not on others. > > /etc/init.d/httpd stop > umount /var/www > vgchange -aln > /etc/init.d/clvmd stop > fence_tool leave > /etc/init.d/fenced stop > cman_tool leave > killall ccsd > > I run it... > > Deactivating VG VolGroup01: [ OK ] > Deactivating VG VolGroup02: [ OK ] > Deactivating VG VolGroup03: [ OK ] > Deactivating VG VolGroup04: [ OK ] > Stopping clvm: [ OK ] > Stopping fence domain: [ OK ] > cman_tool: Can't leave cluster while there are 4 active subsystems > > # cman_tool services > Service Name GID LID > State Code > User: "usrm::manager" 13 6 run - > [2 3 4 6 5 7 8] > > What's usrm::manager? I can't seem to find anything on the > redhat site and online searches lead to endless 'stuff'. I'm > guessing what ever this is, it's the problem? > > Mike > > > You should check with cman_tool services(said below), which > services > > are still running/updating/joining etc. It sometimes happen that a > > service cant be shutdown nicely. You can try to kill > daemons by hand > > with a soft/hard kill. > > > > > > Met vriendelijke groet, Kind Regards, > > > > Jaap P. 
Dijkshoorn > > Group Leader Cluster Computing > > Systems Programmer > > mailto:jaap at sara.nl http://home.sara.nl/~jaapd > > > > SARA Computing & Networking Services > > Kruislaan 415 1098 SJ Amsterdam > > Tel: +31-(0)20-5923000 > > Fax: +31-(0)20-6683167 > > http://www.sara.nl > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From isplist at logicore.net Tue Jan 9 16:03:33 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 9 Jan 2007 10:03:33 -0600 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <200701091527.l09FRFAA002785@mail2.ontariocreditcorp.com> Message-ID: <20071910333.437232@leena> Yup, it's the second item in my script. On Tue, 9 Jan 2007 10:55:51 -0500, Christopher Hawkins wrote: > Are you unmounting the GFS filesystem first? That should be the first thing > > in your script... > >> -----Original Message----- >> From: linux-cluster-bounces at redhat.com >> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of >> isplist at logicore.net >> Sent: Tuesday, January 09, 2007 10:46 AM >> To: linux clustering >> Subject: RE: [Linux-cluster] Can't leave cluster >> >> Hi there, >> >> This is my current shutdown script which works on some >> servers, not on others. >> >> /etc/init.d/httpd stop >> umount /var/www >> vgchange -aln >> /etc/init.d/clvmd stop >> fence_tool leave >> /etc/init.d/fenced stop >> cman_tool leave >> killall ccsd >> >> I run it... >> >> Deactivating VG VolGroup01: [ OK ] >> Deactivating VG VolGroup02: [ OK ] >> Deactivating VG VolGroup03: [ OK ] >> Deactivating VG VolGroup04: [ OK ] >> Stopping clvm: [ OK ] >> Stopping fence domain: [ OK ] >> cman_tool: Can't leave cluster while there are 4 active subsystems >> >> # cman_tool services >> Service Name GID LID >> State Code >> User: "usrm::manager" 13 6 run - >> [2 3 4 6 5 7 8] >> >> What's usrm::manager? I can't seem to find anything on the >> redhat site and online searches lead to endless 'stuff'. I'm >> guessing what ever this is, it's the problem? >> >> Mike >> >>> You should check with cman_tool services(said below), which >> services >>> are still running/updating/joining etc. It sometimes happen that a >>> service cant be shutdown nicely. You can try to kill >> daemons by hand >>> with a soft/hard kill. >>> >>> >>> Met vriendelijke groet, Kind Regards, >>> >>> Jaap P. Dijkshoorn >>> Group Leader Cluster Computing >>> Systems Programmer >>> mailto:jaap at sara.nl http://home.sara.nl/~jaapd >>> >>> SARA Computing & Networking Services >>> Kruislaan 415 1098 SJ Amsterdam >>> Tel: +31-(0)20-5923000 >>> Fax: +31-(0)20-6683167 >>> http://www.sara.nl >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster From rpeterso at redhat.com Tue Jan 9 16:07:54 2007 From: rpeterso at redhat.com (Robert Peterson) Date: Tue, 09 Jan 2007 10:07:54 -0600 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <20071994543.553454@leena> References: <20071994543.553454@leena> Message-ID: <45A3BDDA.6070807@redhat.com> isplist at logicore.net wrote: > # cman_tool services > Service Name GID LID State Code > User: "usrm::manager" 13 6 run - > [2 3 4 6 5 7 8] > > What's usrm::manager? I can't seem to find anything on the redhat site and > online searches lead to endless 'stuff'. I'm guessing what ever this is, it's > the problem? > > Mike > Hi Mike, That's for rgmanager I think. 
Perhaps your script should also do: service rgmanager stop Regards, Bob Peterson Red Hat Cluster Suite From chawkins at bplinux.com Tue Jan 9 16:13:44 2007 From: chawkins at bplinux.com (Christopher Hawkins) Date: Tue, 9 Jan 2007 11:13:44 -0500 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <20071910333.437232@leena> Message-ID: <200701091545.l09Fj7AA003313@mail2.ontariocreditcorp.com> > -----Original Message----- > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of > isplist at logicore.net > Sent: Tuesday, January 09, 2007 11:04 AM > To: linux-cluster > Subject: RE: [Linux-cluster] Can't leave cluster > > Yup, it's the second item in my script. Wow, a serious blonde moment. I have had the same issue from time to time (with starting as well as stopping) if the scripts go too fast. I don't recall which component was being sensitive, but you might try adding a sleep 5 here and there or running the commands manually, but with a good pause between them, and see if that changes anything. From lhh at redhat.com Tue Jan 9 17:01:15 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 09 Jan 2007 12:01:15 -0500 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <325D4B40-0CC0-42C8-B96D-D5EAE5BBBC8C@engineyard.com> References: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> <325D4B40-0CC0-42C8-B96D-D5EAE5BBBC8C@engineyard.com> Message-ID: <1168362075.15369.190.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-09 at 01:25 -0600, Jayson Vantuyl wrote: > > If you aren't sharing the data between two hosts simultaneously, you > might try ReiserFS/XFS with CLVM. CLVM still requires the CMAN stack > but it doesn't introduce some of the more exciting failure behavior > that GFS can. If you mount a raw disk partition on one node at a time, you don't even need CLVM :) -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From pcaulfie at redhat.com Tue Jan 9 17:10:32 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Tue, 09 Jan 2007 17:10:32 +0000 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <1168362075.15369.190.camel@rei.boston.devel.redhat.com> References: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> <325D4B40-0CC0-42C8-B96D-D5EAE5BBBC8C@engineyard.com> <1168362075.15369.190.camel@rei.boston.devel.redhat.com> Message-ID: <45A3CC88.8030707@redhat.com> Lon Hohberger wrote: > On Tue, 2007-01-09 at 01:25 -0600, Jayson Vantuyl wrote: > >> If you aren't sharing the data between two hosts simultaneously, you >> might try ReiserFS/XFS with CLVM. CLVM still requires the CMAN stack >> but it doesn't introduce some of the more exciting failure behavior >> that GFS can. > > If you mount a raw disk partition on one node at a time, you don't even > need CLVM :) > ...but you do need a /lot/ of care ... -- patrick From Bowie_Bailey at BUC.com Tue Jan 9 17:17:04 2007 From: Bowie_Bailey at BUC.com (Bowie Bailey) Date: Tue, 9 Jan 2007 12:17:04 -0500 Subject: [Linux-cluster] Can't leave cluster Message-ID: <4766EEE585A6D311ADF500E018C154E302685071@bnifex.cis.buc.com> Christopher Hawkins wrote: > > Wow, a serious blonde moment. I have had the same issue from time to > time (with starting as well as stopping) if the scripts go too fast. 
> I don't recall which component was being sensitive, but you might try > adding a sleep 5 here and there or running the commands manually, but > with a good pause between them, and see if that changes anything. I had the same issue with CMAN failing to stop. I found that adding a "sleep 5" before the call to cman_tool in the init script fixed it. -- Bowie From kitgerrits at gmail.com Tue Jan 9 17:23:56 2007 From: kitgerrits at gmail.com (Kit Gerrits) Date: Tue, 9 Jan 2007 18:23:56 +0100 Subject: [Linux-cluster] Cluster software won't start at boot Message-ID: <005501c73412$f1b16920$4c4b3291@kagtqp> Ladies and Gentlemen, I have an interesting issue: I hafe a pair of RHEL 2.1 machines in a cluster. The cluster service is enabled for runlevels 2 - 5 Entering 'service cluster start' as root works fine Oddly enough, the cluster service will -not start- at bootup Any ideas? [root at nzcs1 root]# chkconfig --list |grep cluster cluster 0:off 1:off 2:on 3:on 4:on 5:on 6:off # after manually starting the cluster service: [root at nzcs1 root]# service cluster status cluhbd (pid 4997) is running. clusvcmgrd (pid 4993) is running. cluquorumd (pid 4989) is running. clupowerd (pid 4995) is running. clumibd (pid 4999) is running. cluscand (pid 5003) is running. clurmtabd (pid 5001) is running. [root at nzcs1 root]# grep cluster /var/log/messages Jan 9 12:11:27 nzcs1 cluster[6946]: Shutting down Cluster Manager services Jan 9 12:11:27 nzcs1 cluster[7007]: Completed shutdown of Cluster Manager Jan 9 12:31:26 nzcs1 bigbrother: ^I^IStarting external script bb-rh-cluster.sh Jan 9 12:36:15 nzcs1 cluster[4980]: Starting cluster manager services Any hetlp would be appreciated! Thanks, Kit From hosting at sylconia.nl Tue Jan 9 17:59:39 2007 From: hosting at sylconia.nl (Support @ Sylconia) Date: Tue, 09 Jan 2007 18:59:39 +0100 Subject: [Linux-cluster] performance on a 4 node cluster after 6/7 days Message-ID: <19869.1168365584@sylconia.nl> Dear reader, short version: we are experiencing performance problems after 6/7 days of running time on the non lock master(s). long version: We have the following setup: a 4 node RHCS cluster where node 4 (backend) exports 3 raid disks via gnbd to the other nodes (1-3) The other 3 nodes (frontend) import those exports via gnbd. We have created 4 LV's via clvmd on top of those imported disks. logical volumes /dev/mapper/vg0-tmp 9.7G 1.1M 9.7G 1% /phpsessions /dev/mapper/vg0-config 9.7G 152K 9.7G 1% /config /dev/mapper/vg0-logging 100G 126M 100G 1% /var/log/httpd /dev/mapper/vg0-www 500G 222M 500G 1% /www as lock manager we use lock_dlm with following rpm's installed dlm-kernel-2.6.9-44.3 dlm-1.0.1-1 gfs version gfs_tool -V gfs_tool 6.1.6 (built Aug 25 2006 15:17:50) gnbd version Copyright (C) Red Hat, Inc. 2004-2005 All rights reserved. gnbd_import 1.0.8. (built Nov 14 2006 02:18:52) Copyright (C) Red Hat, Inc. 2004 All rights reserved. cman_tool status Protocol version: 5.0.1 os version centos 4.4 on all nodes all rpm's are from the centos.org website. all nodes are connected via a seperate NIC (GB) and private gigabit VLAN no other network traffic is on this VLAN. Now this is all running fine till 6 or 7 days running time than the nodes which are not lock master are becoming very slow in for example a df command. 
while df runs the cpu load rises to 4 or 5 and the node is not very responsive (it seems the os hangs for a few seconds) Running the top command at the same time shows 18524 root 15 -10 0 0 0 R 97.4 0.0 1:08.99 dlm_sendd 12959 root 18 0 4184 592 528 R 1.9 0.1 0:00.17 df so i think the problem is in dlm but i do not know how to debug this can someone give me some pointers? I checked /proc/cluster/dlm* but honestly do not know what to look for. regards Constan Sylconia.nl ---- This message was sent via a demo version of - http://atmail.com/ From rhcluster at natecarlson.com Tue Jan 9 18:32:40 2007 From: rhcluster at natecarlson.com (Nate Carlson) Date: Tue, 9 Jan 2007 12:32:40 -0600 (CST) Subject: [Linux-cluster] Upgrading filesystem from gfs -> gfs2 Message-ID: Hello, Just curious - how hard is it to upgrade a filesystem from gfs to gfs2? I'm not finding a FAQ for this anywhere.. :( ------------------------------------------------------------------------ | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | | depriving some poor village of its idiot since 1981 | ------------------------------------------------------------------------ From jon at levanta.com Tue Jan 9 18:50:53 2007 From: jon at levanta.com (Jonathan Biggar) Date: Tue, 09 Jan 2007 10:50:53 -0800 Subject: [Linux-cluster] Power based fencing in cluster causes single point of failure that can take down a cluster Message-ID: If we set up a cluster and use network power switches for fencing, won't the failure of the power switch attached to a cluster member cause all services that were running on that node to fail to migrate to other cluster members? This seems to happen to us in practice, because fencing the offline member fails due to the power switch being unavailable, so rgmanager never migrates the failed service(s) to another member. Is there a general solution to this problem that I'm missing? -- Jon Biggar Levanta jon at levanta.com 650-403-7252 From jwhiter at redhat.com Tue Jan 9 19:00:58 2007 From: jwhiter at redhat.com (Josef Whiter) Date: Tue, 9 Jan 2007 14:00:58 -0500 Subject: [Linux-cluster] Power based fencing in cluster causes single point of failure that can take down a cluster In-Reply-To: References: Message-ID: <20070109190056.GG21486@korben.rdu.redhat.com> You can either have redundant fence devices, or look into qdisk. Josef On Tue, Jan 09, 2007 at 10:50:53AM -0800, Jonathan Biggar wrote: > If we set up a cluster and use network power switches for fencing, won't > the failure of the power switch attached to a cluster member cause all > services that were running on that node to fail to migrate to other > cluster members? > > This seems to happen to us in practice, because fencing the offline > member fails due to the power switch being unavailable, so rgmanager > never migrates the failed service(s) to another member. > > Is there a general solution to this problem that I'm missing? 
> > -- > Jon Biggar > Levanta > jon at levanta.com > 650-403-7252 > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From jon at levanta.com Tue Jan 9 19:22:10 2007 From: jon at levanta.com (Jonathan Biggar) Date: Tue, 09 Jan 2007 11:22:10 -0800 Subject: [Linux-cluster] Re: Power based fencing in cluster causes single point of failure that can take down a cluster In-Reply-To: <20070109190056.GG21486@korben.rdu.redhat.com> References: <20070109190056.GG21486@korben.rdu.redhat.com> Message-ID: Josef Whiter wrote: > You can either have redundant fence devices, or look into qdisk. Thanks for the reply. Can you explain how qdisk would solve the problem? It seems to me that the fencing device failing which simultaneously causes the cluster member to fail wouldn't be affected by qdisk. Does qdisk have some feedback mechanism that tells the cluster that it's ok to restart the failed services on another node without fencing being successful? I can't see how that can work reliably and still prevent split brain problems. > On Tue, Jan 09, 2007 at 10:50:53AM -0800, Jonathan Biggar wrote: >> If we set up a cluster and use network power switches for fencing, won't >> the failure of the power switch attached to a cluster member cause all >> services that were running on that node to fail to migrate to other >> cluster members? >> >> This seems to happen to us in practice, because fencing the offline >> member fails due to the power switch being unavailable, so rgmanager >> never migrates the failed service(s) to another member. >> >> Is there a general solution to this problem that I'm missing? -- Jon Biggar Levanta jon at levanta.com 650-403-7252 From rpeterso at redhat.com Tue Jan 9 19:23:20 2007 From: rpeterso at redhat.com (Robert Peterson) Date: Tue, 09 Jan 2007 13:23:20 -0600 Subject: [Linux-cluster] Upgrading filesystem from gfs -> gfs2 In-Reply-To: References: Message-ID: <45A3EBA8.9000907@redhat.com> Nate Carlson wrote: > Hello, > > Just curious - how hard is it to upgrade a filesystem from gfs to gfs2? > > I'm not finding a FAQ for this anywhere.. :( Hi Nate, I wrote a little tool called gfs2_convert whose job is to convert a file system from gfs1 to gfs2. You just do something like: gfs2_convert /your/file/system And after it gives you some warnings and asks you the all-important "are you sure" question, it converts it to gfs2. Pretty simple, really. But bear in mind that gfs2 is still being worked on, so you should not use it for a production box yet. And always--ALWAYS--back up your gfs1 file system before running the tool, because it's a brand new app and who knows; it might have bugs. I tested it, even under conditions where I would interrupt it during critical phases and restart it, etc., so hopefully it won't have problems. And if you do have problems, you know who to open the bugzilla up against. ;) I also recommend that you run gfs_fsck on your file system first, just in case there's some kind of weird fs corruption that might confuse the tool. 
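Putting those steps together, the offline conversion would look roughly like this (a sketch only -- the device path is a placeholder, and the file system must be unmounted on every node before you start):

   umount /mnt/gfs                    # on every node that has it mounted
   gfs_fsck /dev/myvg/mygfs           # check the gfs1 file system first
   gfs2_convert /dev/myvg/mygfs       # convert in place -- back it up beforehand

After the conversion the file system is mounted with -t gfs2 instead of -t gfs.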
Regards, Bob Peterson Red Hat Cluster Suite From natecars at natecarlson.com Tue Jan 9 19:54:04 2007 From: natecars at natecarlson.com (Nate Carlson) Date: Tue, 9 Jan 2007 13:54:04 -0600 (CST) Subject: [Linux-cluster] Upgrading filesystem from gfs -> gfs2 In-Reply-To: <45A3EBA8.9000907@redhat.com> References: <45A3EBA8.9000907@redhat.com> Message-ID: On Tue, 9 Jan 2007, Robert Peterson wrote: > I wrote a little tool called gfs2_convert whose job is to convert a file > system from gfs1 to gfs2. You just do something like: > > gfs2_convert /your/file/system > > And after it gives you some warnings and asks you the all-important "are > you sure" question, it converts it to gfs2. Pretty simple, really. > But bear in mind that gfs2 is still being worked on, so you should not > use it for a production box yet. Nifty! That's why I asked - I'm rolling out a new cluster, and wanted to go GFS1 since GFS2 is still "in the works", but wanted to make sure there was an easy upgrade path.. :) > And always--ALWAYS--back up your gfs1 file system before running the > tool, because it's a brand new app and who knows; it might have bugs. > I tested it, even under conditions where I would interrupt it during > critical phases and restart it, etc., so hopefully it won't have > problems. And if you do have problems, you know who to open the > bugzilla up against. ;) *grin* > I also recommend that you run gfs_fsck on your file system first, just > in case there's some kind of weird fs corruption that might confuse the > tool. So I guess it's fairly obvious that the FS needs to be offline? ------------------------------------------------------------------------ | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | | depriving some poor village of its idiot since 1981 | ------------------------------------------------------------------------ From rpeterso at redhat.com Tue Jan 9 20:21:34 2007 From: rpeterso at redhat.com (Robert Peterson) Date: Tue, 09 Jan 2007 14:21:34 -0600 Subject: [Linux-cluster] Upgrading filesystem from gfs -> gfs2 In-Reply-To: References: <45A3EBA8.9000907@redhat.com> Message-ID: <45A3F94E.2090100@redhat.com> Nate Carlson wrote: > Nifty! That's why I asked - I'm rolling out a new cluster, and wanted > to go GFS1 since GFS2 is still "in the works", but wanted to make sure > there was an easy upgrade path.. :) > > So I guess it's fairly obvious that the FS needs to be offline? Yes, the file system definitely needs to be offline and not mounted by any node. BTW, I should also mention that the gfs2_convert tool won't convert file systems unless they have the default (4K) block size. If you created your file system with different block size, then you can't convert it. This was done purposely because of known GFS2 block size issues. Regards, Bob Peterson Red Hat Cluster Suite From lhh at redhat.com Tue Jan 9 20:42:39 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 09 Jan 2007 15:42:39 -0500 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <45A3CC88.8030707@redhat.com> References: <08A9A3213527A6428774900A80DBD8D803360714@xmb-sjc-222.amer.cisco.com> <325D4B40-0CC0-42C8-B96D-D5EAE5BBBC8C@engineyard.com> <1168362075.15369.190.camel@rei.boston.devel.redhat.com> <45A3CC88.8030707@redhat.com> Message-ID: <1168375359.15369.194.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-09 at 17:10 +0000, Patrick Caulfield wrote: > ...but you do need a /lot/ of care ... That is absolutely correct. 
-- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 9 20:45:40 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 09 Jan 2007 15:45:40 -0500 Subject: [Linux-cluster] Cluster software won't start at boot In-Reply-To: <005501c73412$f1b16920$4c4b3291@kagtqp> References: <005501c73412$f1b16920$4c4b3291@kagtqp> Message-ID: <1168375540.15369.198.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-09 at 18:23 +0100, Kit Gerrits wrote: > Ladies and Gentlemen, I have an interesting issue: > > I hafe a pair of RHEL 2.1 machines in a cluster. > The cluster service is enabled for runlevels 2 - 5 > Entering 'service cluster start' as root works fine > Oddly enough, the cluster service will -not start- at bootup > > Any ideas? > > [root at nzcs1 root]# chkconfig --list |grep cluster > cluster 0:off 1:off 2:on 3:on 4:on 5:on 6:off > > # after manually starting the cluster service: > [root at nzcs1 root]# service cluster status > cluhbd (pid 4997) is running. > clusvcmgrd (pid 4993) is running. > cluquorumd (pid 4989) is running. > clupowerd (pid 4995) is running. > clumibd (pid 4999) is running. > cluscand (pid 5003) is running. > clurmtabd (pid 5001) is running. > > [root at nzcs1 root]# grep cluster /var/log/messages > Jan 9 12:11:27 nzcs1 cluster[6946]: Shutting down Cluster Manager > services > Jan 9 12:11:27 nzcs1 cluster[7007]: Completed shutdown of Cluster > Manager > Jan 9 12:31:26 nzcs1 bigbrother: ^I^IStarting external script > bb-rh-cluster.sh > Jan 9 12:36:15 nzcs1 cluster[4980]: Starting cluster manager > services I've not seen this before; usually that "just works". Do you have any more information that could help, i.e.: rpm -q clumanager rpm -qV clumanager -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Tue Jan 9 20:49:03 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 09 Jan 2007 15:49:03 -0500 Subject: [Linux-cluster] Power based fencing in cluster causes single point of failure that can take down a cluster In-Reply-To: <20070109190056.GG21486@korben.rdu.redhat.com> References: <20070109190056.GG21486@korben.rdu.redhat.com> Message-ID: <1168375743.15369.202.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-09 at 14:00 -0500, Josef Whiter wrote: > You can either have redundant fence devices, or look into qdisk. Qdisk doesn't obviate fencing confirmation, I'm afraid; in fact, it uses fencing to kill nodes :( I'd check out fence_scsi as a backup fencing device. You're certainly not the first to ask this question. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From srramasw at cisco.com Tue Jan 9 21:04:55 2007 From: srramasw at cisco.com (Sridharan Ramaswamy (srramasw)) Date: Tue, 9 Jan 2007 13:04:55 -0800 Subject: [Linux-cluster] GFS can be used as root filesystem? Message-ID: As anyone attempted to use GFS client on diskless Linux node to act as its root file system? Thinking about the dependencies of GFS, the likes of CMAN, clvmd, gnbd (if needed) should start before GFS during the boot up process. 
But the concern is CMAN would need to read /etc/cluster/cluster.conf file which won't be available. Other components might need something from filesystem too, like CLVM might look for lvm.conf. Sounds like a chicken & egg problem. As anyone got around these aspects and able to use GFS mount as a root filesystem? Appreciate any ideas on this. thanks, Sridhar -------------- next part -------------- An HTML attachment was scrubbed... URL: From grimme at atix.de Tue Jan 9 21:07:26 2007 From: grimme at atix.de (Marc Grimme) Date: Tue, 9 Jan 2007 22:07:26 +0100 Subject: [Linux-cluster] GFS can be used as root filesystem? In-Reply-To: References: Message-ID: <200701092207.26902.grimme@atix.de> On Tuesday 09 January 2007 22:04, Sridharan Ramaswamy (srramasw) wrote: > As anyone attempted to use GFS client on diskless Linux node to act as > its root file system? > > Thinking about the dependencies of GFS, the likes of CMAN, clvmd, gnbd > (if needed) should start before GFS during the boot up process. But the > concern is CMAN would need to read /etc/cluster/cluster.conf file which > won't be available. Other components might need something from > filesystem too, like CLVM might look for lvm.conf. Sounds like a chicken > & egg problem. > > As anyone got around these aspects and able to use GFS mount as a root > filesystem? > > Appreciate any ideas on this. have a look at www.open-sharedroot.org. Works like a charm. There should also be a HOWTO. Regards Marc. > > thanks, > Sridhar -- Gruss / Regards, Marc Grimme Phone: +49-89 452 3538-14 http://www.atix.de/ http://www.open-sharedroot.org/ ** ATIX - Ges. fuer Informationstechnologie und Consulting mbH Einsteinstr. 10 - 85716 Unterschleissheim - Germany From isplist at logicore.net Tue Jan 9 21:12:25 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 9 Jan 2007 15:12:25 -0600 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <45A3BDDA.6070807@redhat.com> Message-ID: <200719151225.638211@leena> Thanks Bob, Can't recall if I replied to this but have one other question. >What's usrm::manager? I can't seem to find anything on the redhat site and >online searches lead to endless 'stuff'. I'm guessing what ever this is, >it's the problem? > That's for rgmanager I think. Perhaps your script should also do: > service rgmanager stop That was indeed what it was. Here is my final shutdown script; service httpd stop umount /var/www vgchange -aln service clvmd stop fence_tool leave service fenced stop service rgmanager stop cman_tool leave killall ccsd Two questions; 1: I probably don't need the last line in there correct? 2: Can I create a new service so that I can run this script to shut things down cleanly when I want to reboot the node? If so, what is the process? Mike From lhh at redhat.com Tue Jan 9 21:15:15 2007 From: lhh at redhat.com (Lon Hohberger) Date: Tue, 09 Jan 2007 16:15:15 -0500 Subject: R: R: [Linux-cluster] High system CPU usage in one of a two nodecluster In-Reply-To: <1168034698.5634.4.camel@rei.boston.devel.redhat.com> References: <004901c730b9$5112d460$8ec9100a@nicchio> <1168034698.5634.4.camel@rei.boston.devel.redhat.com> Message-ID: <1168377315.15369.207.camel@rei.boston.devel.redhat.com> On Fri, 2007-01-05 at 17:04 -0500, Lon Hohberger wrote: > On Fri, 2007-01-05 at 12:04 +0100, Marco Lusini wrote: > > I was looking at Lon's RPMs, and they are (apparently) > > based on rgmanager 1.9.53-1, while the last released > > package is 1.9.54-1... 
> > Would it be possible to have fixed RPMs compiled wrt the > > last version? > > use .53 for now; I'll build new ones on .54 on Monday. Or.. Tuesday, for those of you who were keeping track. https://bugzilla.redhat.com/bugzilla/process_bug.cgi#c18 -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From isplist at logicore.net Tue Jan 9 21:39:50 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 9 Jan 2007 15:39:50 -0600 Subject: [Linux-cluster] Can't leave cluster In-Reply-To: <45A3BDDA.6070807@redhat.com> Message-ID: <200719153950.361989@leena> Same problem again on another node. stop-cluster-script service httpd stop umount /var/www vgchange -aln service clvmd stop fence_tool leave service fenced stop service rgmanager stop cman_tool leave killall ccsd I run it and; ]# ./stop_gfs Stopping httpd: [ OK ] Found duplicate PV y6nVM03KVVWs0v68yQVmiGruP5hOSv1z: using /dev/sdd not /dev/sda Found duplicate PV wv0qVlspVX11RBlVI5IKyXLAVoH0eiZ3: using /dev/sde not /dev/sdb Found duplicate PV t9Fwnx7n6vrPpCZ8d3XKyO6V6cIvqeWR: using /dev/sdf not /dev/sdc 0 logical volume(s) in volume group "VolGroup01" now active 0 logical volume(s) in volume group "VolGroup04" now active 0 logical volume(s) in volume group "VolGroup03" now active 0 logical volume(s) in volume group "VolGroup02" now active Found duplicate PV y6nVM03KVVWs0v68yQVmiGruP5hOSv1z: using /dev/sdd not /dev/sda Found duplicate PV wv0qVlspVX11RBlVI5IKyXLAVoH0eiZ3: using /dev/sde not /dev/sdb Found duplicate PV t9Fwnx7n6vrPpCZ8d3XKyO6V6cIvqeWR: using /dev/sdf not /dev/sdc Found duplicate PV y6nVM03KVVWs0v68yQVmiGruP5hOSv1z: using /dev/sdd not /dev/sda Found duplicate PV wv0qVlspVX11RBlVI5IKyXLAVoH0eiZ3: using /dev/sde not /dev/sdb Found duplicate PV t9Fwnx7n6vrPpCZ8d3XKyO6V6cIvqeWR: using /dev/sdf not /dev/sdc Deactivating VG VolGroup01: [ OK ] Deactivating VG VolGroup02: [ OK ] Deactivating VG VolGroup03: [ OK ] Deactivating VG VolGroup04: [ OK ] Stopping clvm: [ OK ] Stopping fence domain: [ OK ] Cluster Service Manager is stopped. cman_tool: Can't leave cluster while there are 2 active subsystems # cman_tool services Service Name GID LID State Code From bfilipek at crscold.com Tue Jan 9 23:11:19 2007 From: bfilipek at crscold.com (Brad Filipek) Date: Tue, 9 Jan 2007 17:11:19 -0600 Subject: [Linux-cluster] General 2-node cluster questions Message-ID: <9C01E18EF3BC2448A3B1A4812EB87D0232837D@SRVEDI.upark.crscold.com> I am in the process of setting up a 2-node cluster with a SAN for data storage. I have a few general questions as this is my first time using RHEL CS. I have two boxes with RHEL4U4 and one application. Should the app be installed locally on both nodes, and have the data on the SAN? Or should the app and the data both be on the SAN? This will be an active/passive config. Also, does the app and data both need to sit on a GFS? Thank you, Brad Filipek Confidentiality Notice: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is privileged, confidential and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient or the employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this communication is strictly prohibited. 
If you have received this communication in error, please notify us immediately by email reply or by telephone and immediately delete this message and any attachments. -------------- next part -------------- An HTML attachment was scrubbed... URL: From mparadis at logicore.net Tue Jan 9 23:40:49 2007 From: mparadis at logicore.net (Mike Paradis) Date: Tue, 9 Jan 2007 17:40:49 -0600 Subject: [Linux-cluster] Quick off topic question Message-ID: <200719174049.935516@leena> Can one remotely log bash_history files such as one does with syslog.conf and @192.168.x.x for example? I want to consolidate all of my bash history files onto one of the GFS servers. Mike From isplist at logicore.net Tue Jan 9 23:41:15 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Tue, 9 Jan 2007 17:41:15 -0600 Subject: [Linux-cluster] Quick off topic question Message-ID: <200719174115.718127@leena> Can one remotely log bash_history files such as one does with syslog.conf and @192.168.x.x for example? I want to consolidate all of my bash history files onto one of the GFS servers. Mike From riaan at obsidian.co.za Wed Jan 10 08:52:50 2007 From: riaan at obsidian.co.za (Riaan van Niekerk) Date: Wed, 10 Jan 2007 10:52:50 +0200 Subject: [Linux-cluster] General 2-node cluster questions In-Reply-To: <9C01E18EF3BC2448A3B1A4812EB87D0232837D@SRVEDI.upark.crscold.com> References: <9C01E18EF3BC2448A3B1A4812EB87D0232837D@SRVEDI.upark.crscold.com> Message-ID: <45A4A962.5080800@obsidian.co.za> Brad Filipek wrote: > I am in the process of setting up a 2-node cluster with a SAN for data > storage. I have a few general questions as this is my first time using > RHEL CS. > > > > I have two boxes with RHEL4U4 and one application. Should the app be > installed locally on both nodes, and have the data on the SAN? Or should > the app and the data both be on the SAN? This will be an active/passive > config. > It is up to you to decide if you want the app on the SAN or not. App locally installed If the App is simple and/or part of the OS FS hierarchy (e.g. apache, or not in /opt), you can install/configure it on node1 and copy the configuration across (keeping in mind that you need to manually keep the configs in sync) App installed to shared storage (for illustration purposes I will use Oracle as the clustered app) If the App is complex and goes into a distinct directory you can partition off (e.g. ORACLE_HOME somewhere in /opt/oracle), it might make more sense to have the whole /opt/oracle on the SAN aswell. Any files that belong to or are required by the application (e.g /etc/oratab, initscripts) would have to be copied across manually from one node to the other. If you don't have an easy way of determining which files are located outside of the shared partition, you might have to install the app twice (once on each side), but it might get confused on the second node since it may get confused by the install done on node1. > > > Also, does the app and data both need to sit on a GFS? > For active-passive: they don't need to, but they can be. If you are using an active-passive cluster and only one node at a time will have the application running (and writing to the partition with the data on), you can use ext3. Ext3 is a lot faster than GFS since it does not require the overhead/complexity of a clustered file system. 
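For illustration, a minimal active/passive service definition in the rm section of cluster.conf might look something like the sketch below; every name, device path, IP address and script location here is a placeholder, so check system-config-cluster or the cluster schema for the exact attributes your version expects:

   <rm>
     <failoverdomains>
       <failoverdomain name="appdomain" ordered="1" restricted="1">
         <failoverdomainnode name="node1" priority="1"/>
         <failoverdomainnode name="node2" priority="2"/>
       </failoverdomain>
     </failoverdomains>
     <resources>
       <fs name="appfs" device="/dev/sdb1" mountpoint="/opt/app" fstype="ext3" force_unmount="1"/>
       <ip address="192.168.1.10" monitor_link="1"/>
       <script name="appinit" file="/etc/init.d/myapp"/>
     </resources>
     <service name="appsvc" domain="appdomain" autostart="1">
       <fs ref="appfs"/>
       <ip ref="192.168.1.10"/>
       <script ref="appinit"/>
     </service>
   </rm>

rgmanager then mounts the ext3 file system, brings up the service IP and runs the init script on whichever node currently owns the service, which is the one-node-at-a-time behaviour described above.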
also - a tip - before you configure the app as a clustered service in rgmanager, make sure that it starts up flawlessly on both sides (after manually moving the VIP and filesystem resources from one node to the other). Otherwise, after you configure the app in rgmanager and things dont work, you may have to troubleshoot both the app startup and rgmanager. greetings Riaan -------------- next part -------------- A non-text attachment was scrubbed... Name: riaan.vcf Type: text/x-vcard Size: 310 bytes Desc: not available URL: From kitgerrits at gmail.com Wed Jan 10 11:44:06 2007 From: kitgerrits at gmail.com (Kit Gerrits) Date: Wed, 10 Jan 2007 12:44:06 +0100 Subject: [Linux-cluster] Cluster software won't start at boot Message-ID: <001801c734ac$a26229f0$4c4b3291@kagtqp> From: Lon Hohberger > I've not seen this before; usually that "just works". Do you have any more information that could help, i.e.: > > rpm -q clumanager > rpm -qV clumanager Well, they people that set that system up are a bit strange. They have runlevel 5 as initdefault, but the system does not show a graphical login at boot. (startx works, though) Fyi: [root at nzcs1 root]# rpm -qi clumanager Name : clumanager Relocations: (not relocateable) Version : 1.0.19 Vendor: Red Hat, Inc. Release : 2 Build Date: Mon 23 Dec 2002 10:08:02 PM MET Install date: Thu 03 Jun 2004 02:55:23 PM MET Build Host: bugs.devel.redhat.com ...yadda yadda yadda... We're carefully considering upgrading some parts of the system... The owners have (now) realised the O/S is actually too old to handle the LTO-1 drives correctly. (The fact that the 2.1 install CD wouldn't recognise the drive wasn't enough of a hint) From lhh at redhat.com Wed Jan 10 14:40:01 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 10 Jan 2007 09:40:01 -0500 Subject: [Linux-cluster] General 2-node cluster questions In-Reply-To: <9C01E18EF3BC2448A3B1A4812EB87D0232837D@SRVEDI.upark.crscold.com> References: <9C01E18EF3BC2448A3B1A4812EB87D0232837D@SRVEDI.upark.crscold.com> Message-ID: <1168440001.15369.220.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-09 at 17:11 -0600, Brad Filipek wrote: > I have two boxes with RHEL4U4 and one application. Should the app be > installed locally on both nodes, and have the data on the SAN? Or > should the app and the data both be on the SAN? This will be an > active/passive config. That's a matter of "what works better for you". Either one works. For Oracle 10g, I installed everything on to the SAN. For something like Apache, you could make a new httpd.conf that points the docroot to the SAN mount point (since httpd was already installed on both nodes anyway). > Also, does the app and data both need to sit on a GFS? Not at all. You can use a non-cluster FS if you want (e.g. ext3). Remember that the service only runs on one node at a time; the file system is only mounted in one place at a time, etc. The key is that the file system may only be mounted in one place at a time. RHCS is designed to manage this, but you have to manually mount all this stuff up and bring up the service IP address during configuration / installation yourself. Basically: (a) partition disks (b) set up clvm (if you're going to use LVM in the cluster) (c) mkfs -t ext3 /dev/foo1 (d) mkdir /cluster/service0 (e) mount -t ext3 /dev/foo1 /cluster/service0 (f) ip addr add 192.168.1.2 dev eth0 (g) Install app. Make it use 192.168.1.2 for all traffic, and /cluster/service0 for data. (h) Start app; give it a test from your clients. (g) Stop app. 
(h) ip addr del 192.168.1.2 dev eth0 (j) umount /cluster/service0 (k) [configure RHCS service] -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Wed Jan 10 14:43:46 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 10 Jan 2007 09:43:46 -0500 Subject: [Linux-cluster] Cluster software won't start at boot In-Reply-To: <001801c734ac$a26229f0$4c4b3291@kagtqp> References: <001801c734ac$a26229f0$4c4b3291@kagtqp> Message-ID: <1168440226.15369.224.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-10 at 12:44 +0100, Kit Gerrits wrote: > From: Lon Hohberger > > I've not seen this before; usually that "just works". Do you have any > more information that could help, i.e.: > > > > rpm -q clumanager > > rpm -qV clumanager > > Well, they people that set that system up are a bit strange. > They have runlevel 5 as initdefault, but the system does not show a > graphical login at boot. > (startx works, though) That's weird, but certainly not the problem. Although, it might be related somehow... clumanager doesn't start, and X doesn't start, but both *should*. > Fyi: > [root at nzcs1 root]# rpm -qi clumanager > Name : clumanager Relocations: (not relocateable) > Version : 1.0.19 Vendor: Red Hat, Inc. > Release : 2 Build Date: Mon 23 Dec 2002 > 10:08:02 PM MET > Install date: Thu 03 Jun 2004 02:55:23 PM MET Build Host: > bugs.devel.redhat.com :o That's kind of old; there's an updated 1.0.28 package on RHN. However, the init script didn't change much ... if at all. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Wed Jan 10 14:45:14 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 10 Jan 2007 09:45:14 -0500 Subject: [Linux-cluster] Quick off topic question In-Reply-To: <200719174115.718127@leena> References: <200719174115.718127@leena> Message-ID: <1168440314.15369.226.camel@rei.boston.devel.redhat.com> On Tue, 2007-01-09 at 17:41 -0600, isplist at logicore.net wrote: > Can one remotely log bash_history files such as one does with syslog.conf and > @192.168.x.x for example? > > I want to consolidate all of my bash history files onto one of the GFS > servers. > > Mike Nope - not that I'm aware of. The easiest way to do this is to just put home directories for each user NFS or GFS (the latter requires the client to be on the SAN and part of the cluster, of course). -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From fedele at fis.unical.it Wed Jan 10 15:13:06 2007 From: fedele at fis.unical.it (Fedele Stabile) Date: Wed, 10 Jan 2007 16:13:06 +0100 Subject: [Linux-cluster] Cluster for number-crunching purposes Message-ID: <45A50282.7060205@fis.unical.it> I have a new 35-nodes cluster with a SAN for data storage, my SAN is connected via SCSI with two nodes. OS is CentOS4 with ClusterSuite Cluster purpose is numer-crinching: SAN disks are GFS and exported via gnbd to the other 33 nodes in the cluster. Configuration file cluster.conf is below. This is my first cluster configured with the ClusterSuite, can anyone help me to understand if i made any mistake? 
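A rough sketch of the GNBD export/import flow described above, assuming (as placeholders) a SAN-attached server named server1, a clustered LV /dev/vg_san/lv_data and a GFS mount point /data:

# On a SAN-attached node (server1): serve the device over GNBD.
gnbd_serv                                    # start the GNBD server daemon
gnbd_export -d /dev/vg_san/lv_data -e data   # export the LV under the name "data"

# On each of the other 33 nodes: import it and mount the GFS on top.
modprobe gnbd
gnbd_import -i server1                       # imports all exports from server1
mount -t gfs /dev/gnbd/data /data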
Thank you Fedele STABILE /etc/cluster/cluster.conf ..... ..... ..... ..... From bfilipek at crscold.com Wed Jan 10 15:24:51 2007 From: bfilipek at crscold.com (Brad Filipek) Date: Wed, 10 Jan 2007 09:24:51 -0600 Subject: [Linux-cluster] General 2-node cluster questions Message-ID: <9C01E18EF3BC2448A3B1A4812EB87D0232840A@SRVEDI.upark.crscold.com> Hi Riaan and Lon, Thanks for your replies. The App we use is called PRO5 and uses SSH to run. The users SSH into the RHEL4 box, and their .bash_profile fires up the PRO5 app which is located at /basis/pro5/pro5. Their .bash_profile files look like this: ============================ # .bash_profile # Get the aliases and functions if [ -f ~/.bashrc ]; then . ~/.bashrc fi # User specific environment and startup programs PATH=$PATH:$HOME/bin export PATH unset USERNAME umask 0000 TERM=vt220;export TERM TERMCAP=/basis/pro5/termcap;export TERMCAP cd /basis/pro5 ./pro5 -tT001 /live/cf.src/PGMSYS9999 exit ============================ Since I will be using an active/passive config in this scenario, would I be able to install both PRO5 and it's data on an ext3 partition located on the SAN? Would I even need to have a GFS partition at all? Obviously SSH would run locally on each node. Thanks again, Brad Filipek -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Riaan van Niekerk Sent: Wednesday, January 10, 2007 2:53 AM To: linux clustering Subject: [] - Re: [Linux-cluster] General 2-node cluster questions - Email found in subject Brad Filipek wrote: > I am in the process of setting up a 2-node cluster with a SAN for data > storage. I have a few general questions as this is my first time using > RHEL CS. > > > > I have two boxes with RHEL4U4 and one application. Should the app be > installed locally on both nodes, and have the data on the SAN? Or should > the app and the data both be on the SAN? This will be an active/passive > config. > It is up to you to decide if you want the app on the SAN or not. App locally installed If the App is simple and/or part of the OS FS hierarchy (e.g. apache, or not in /opt), you can install/configure it on node1 and copy the configuration across (keeping in mind that you need to manually keep the configs in sync) App installed to shared storage (for illustration purposes I will use Oracle as the clustered app) If the App is complex and goes into a distinct directory you can partition off (e.g. ORACLE_HOME somewhere in /opt/oracle), it might make more sense to have the whole /opt/oracle on the SAN aswell. Any files that belong to or are required by the application (e.g /etc/oratab, initscripts) would have to be copied across manually from one node to the other. If you don't have an easy way of determining which files are located outside of the shared partition, you might have to install the app twice (once on each side), but it might get confused on the second node since it may get confused by the install done on node1. > > > Also, does the app and data both need to sit on a GFS? > For active-passive: they don't need to, but they can be. If you are using an active-passive cluster and only one node at a time will have the application running (and writing to the partition with the data on), you can use ext3. Ext3 is a lot faster than GFS since it does not require the overhead/complexity of a clustered file system. 
also - a tip - before you configure the app as a clustered service in rgmanager, make sure that it starts up flawlessly on both sides (after manually moving the VIP and filesystem resources from one node to the other). Otherwise, after you configure the app in rgmanager and things dont work, you may have to troubleshoot both the app startup and rgmanager. greetings Riaan Confidentiality Notice: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is privileged, confidential and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient or the employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by email reply or by telephone and immediately delete this message and any attachments. From breeves at redhat.com Wed Jan 10 15:48:45 2007 From: breeves at redhat.com (Bryn M. Reeves) Date: Wed, 10 Jan 2007 15:48:45 +0000 Subject: [Linux-cluster] Cluster software won't start at boot In-Reply-To: <1168440226.15369.224.camel@rei.boston.devel.redhat.com> References: <001801c734ac$a26229f0$4c4b3291@kagtqp> <1168440226.15369.224.camel@rei.boston.devel.redhat.com> Message-ID: <45A50ADD.5030008@redhat.com> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Lon Hohberger wrote: >> Well, they people that set that system up are a bit strange. >> They have runlevel 5 as initdefault, but the system does not show a >> graphical login at boot. >> (startx works, though) > > That's weird, but certainly not the problem. Although, it might be > related somehow... clumanager doesn't start, and X doesn't start, but > both *should*. Sounds like inittab weirdness - I saw symptoms like this a few times while teaching class when students would do stuff like: id:5:initdefault: # System initialization. si::sysinit:/etc/rc.d/rc.sysinit l0:0:wait:/etc/rc.d/rc 0 l1:1:wait:/etc/rc.d/rc 1 l2:2:wait:/etc/rc.d/rc 2 l3:3:wait:/etc/rc.d/rc 3 l4:4:wait:/etc/rc.d/rc 4 l5:5:wait:/etc/rc.d/rc 3 <------ l6:6:wait:/etc/rc.d/rc 6 ... When attempting to change their default runlevel. It makes life kinda exciting if things are disabled (K links) in rc3.d but enabled (S links) in rc5.d - runlevel/who -r etc. report one thing, but the services started are those belonging to the other runlevel. It's also worth checking grub.conf incase they've overridden initdefault from the kernel command line. Kind regards, Bryn. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org iD8DBQFFpQrd6YSQoMYUY94RAmX9AKCS1jCPfc6nGiawlmCbed0Uy/oFOwCePqYZ 0n1dGAgZcJZy4AdwGrG2Uuc= =gM4S -----END PGP SIGNATURE----- From isplist at logicore.net Wed Jan 10 15:55:07 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Wed, 10 Jan 2007 09:55:07 -0600 Subject: [Linux-cluster] Quick off topic question In-Reply-To: <1168440314.15369.226.camel@rei.boston.devel.redhat.com> Message-ID: <20071109557.814030@leena> > Nope - not that I'm aware of. The easiest way to do this is to just put > home directories for each user NFS or GFS (the latter requires the > client to be on the SAN and part of the cluster, of course). Unfortunately, that would defeat the purpose behind my wanting to remotely log the activity. 
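For reference, one commonly used approach to the question above is to have bash hand each command line to syslog and let syslogd forward it with the usual @192.168.x.x rule -- only a sketch, and (as the replies below point out) trivially bypassed by anyone who unsets it, so not a real audit trail:

# e.g. in /etc/profile.d/histlog.sh (placeholder path), for bash users:
export PROMPT_COMMAND='logger -p local6.notice -t bash_hist "$USER: $(history 1 | sed "s/^ *[0-9]* *//")"'

# and in /etc/syslog.conf on every node:
#   local6.*        @192.168.x.x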
Mike From axehind007 at yahoo.com Wed Jan 10 16:00:03 2007 From: axehind007 at yahoo.com (Brian Pontz) Date: Wed, 10 Jan 2007 08:00:03 -0800 (PST) Subject: [Linux-cluster] Quick off topic question In-Reply-To: <20071109557.814030@leena> Message-ID: <140844.91703.qm@web33215.mail.mud.yahoo.com> --- "isplist at logicore.net" wrote: > > Nope - not that I'm aware of. The easiest way to > do this is to just put > > home directories for each user NFS or GFS (the > latter requires the > > client to be on the SAN and part of the cluster, > of course). > > Unfortunately, that would defeat the purpose behind > my wanting to remotely log > the activity. You can do this through syslog but it would require you to modify the kernel code and recompile it. You would basically printk() all exec's in the kernel. Otherwise the honeynet project would probably be the best people to ask about this. Brian From maarten.boot at mbu.hr Wed Jan 10 16:03:17 2007 From: maarten.boot at mbu.hr (Maarten Boot) Date: Wed, 10 Jan 2007 17:03:17 +0100 Subject: [Linux-cluster] Quick off topic question Message-ID: Or recompiling bash to use syslog next to bash history on exec -----Original Message----- From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Brian Pontz Sent: Wednesday, January 10, 2007 5:00 PM To: linux-cluster at redhat.com Subject: Re: [Linux-cluster] Quick off topic question --- "isplist at logicore.net" wrote: > > Nope - not that I'm aware of. The easiest way to > do this is to just put > > home directories for each user NFS or GFS (the > latter requires the > > client to be on the SAN and part of the cluster, > of course). > > Unfortunately, that would defeat the purpose behind > my wanting to remotely log > the activity. You can do this through syslog but it would require you to modify the kernel code and recompile it. You would basically printk() all exec's in the kernel. Otherwise the honeynet project would probably be the best people to ask about this. Brian -- Linux-cluster mailing list Linux-cluster at redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster From isplist at logicore.net Wed Jan 10 16:06:44 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Wed, 10 Jan 2007 10:06:44 -0600 Subject: [Linux-cluster] Quick off topic question In-Reply-To: <140844.91703.qm@web33215.mail.mud.yahoo.com> Message-ID: <200711010644.372396@leena> Interesting, if a user can modify the history file, you can't see what's been done. I honestly thought this was something pretty basic for security and wanted to add this in all my nodes for a single history file. Mike > You can do this through syslog but it would require > you to modify the kernel code and recompile it. You > would basically printk() all exec's in the kernel. > Otherwise the honeynet project would probably be the > best people to ask about this. From simone.gotti at email.it Wed Jan 10 16:08:10 2007 From: simone.gotti at email.it (Simone Gotti) Date: Wed, 10 Jan 2007 17:08:10 +0100 Subject: [Linux-cluster] [PATCH] qdisk: fix crash or wrong behavior if "qdisk_read" returns an error. Message-ID: <1168445290.4361.24.camel@localhost> In qdisk/main.c:read_node_blocks if the call to "qdisk_read" returns an error, the cycle isn't interrupted and the call swab_status_block_t will make qdiskd crash or report bad node id, master status etc.... This will probably (not reproduced) cause strange behavior like this node trying to kill the others that are working correctly. 
I putted a "continue" to skip the cycle after the error. As if nothing about the node can be read it's better to not change the current informations. I hope the patch is correct. Thanks. Bye! ============================================================================= [2725] warning: Error reading node ID block 1 [2725] warning: Error reading node ID block 2 [2725] warning: Error reading node ID block 3 [2725] warning: Error reading node ID block 4 [2725] warning: Error reading node ID block 5 [2725] warning: Error reading node ID block 6 [2725] warning: Error reading node ID block 7 [2725] warning: Error reading node ID block 8 [2725] warning: Error reading node ID block 9 [2725] warning: Error reading node ID block 10 [2725] warning: Error reading node ID block 11 [2725] warning: Error reading node ID block 12 [2725] warning: Error reading node ID block 13 [2725] warning: Error reading node ID block 14 [2725] warning: Error reading node ID block 15 [2725] warning: Error reading node ID block 16 [2725] debug: Node 16777216 is UP [2725] crit: A master exists, but it's not me?! diskRawWriteShadow: Input/output error diskRawWriteShadow: aligned write returned -1, not 512 diskRawWriteShadow: Input/output error Error writing node ID block 1 [2725] err: Error writing to quorum disk Node ID: 1 Score (current / min req. / max allowed): 1 / 1 / 1 Current state: Master Current disk state: None Visible Set: { 16777216 } Master Node ID: 16777216 Quorate Set: { 16777216 33554432 50331648 67108864 83886080 100663296 117440512 134217728 150994944 167772160 184549376 201326592 218103808 234881024 251658240 268435456 } [2725] warning: Error reading node ID block 1 [2725] warning: Error reading node ID block 2 [2725] warning: Error reading node ID block 3 [2725] warning: Error reading node ID block 4 [2725] warning: Error reading node ID block 5 [2725] warning: Error reading node ID block 6 [2725] warning: Error reading node ID block 7 [2725] warning: Error reading node ID block 8 [2725] warning: Error reading node ID block 9 [2725] warning: Error reading node ID block 10 [2725] warning: Error reading node ID block 11 [2725] warning: Error reading node ID block 12 [2725] warning: Error reading node ID block 13 [2725] warning: Error reading node ID block 14 [2725] warning: Error reading node ID block 15 [2725] warning: Error reading node ID block 16 [2725] info: Node 1 is the master [2725] crit: Critical Error: More than one master found! diskRawWriteShadow: Input/output error diskRawWriteShadow: aligned write returned -1, not 512 diskRawWriteShadow: Input/output error Error writing node ID block 1 [2725] err: Error writing to quorum disk Node ID: 1 Score (current / min req. 
/ max allowed): 1 / 1 / 1 Current state: Master Current disk state: None Visible Set: { 1 } Master Node ID: 1 Quorate Set: { 1 } [2725] warning: Error reading node ID block 1 [2725] warning: Error reading node ID block 2 [2725] warning: Error reading node ID block 3 [2725] warning: Error reading node ID block 4 [2725] warning: Error reading node ID block 5 [2725] warning: Error reading node ID block 6 [2725] warning: Error reading node ID block 7 [2725] warning: Error reading node ID block 8 [2725] warning: Error reading node ID block 9 [2725] warning: Error reading node ID block 10 [2725] warning: Error reading node ID block 11 [2725] warning: Error reading node ID block 12 [2725] warning: Error reading node ID block 13 [2725] warning: Error reading node ID block 14 [2725] warning: Error reading node ID block 15 [2725] warning: Error reading node ID block 16 [2725] crit: A master exists, but it's not me?! diskRawWriteShadow: Input/output error diskRawWriteShadow: aligned write returned -1, not 512 diskRawWriteShadow: Input/output error Error writing node ID block 1 [2725] err: Error writing to quorum disk Node ID: 1 Score (current / min req. / max allowed): 1 / 1 / 1 Current state: Master Current disk state: None Visible Set: { 16777216 } Master Node ID: 16777216 Quorate Set: { 16777216 33554432 50331648 67108864 83886080 100663296 117440512 134217728 150994944 167772160 184549376 201326592 218103808 234881024 251658240 268435456 } [2725] warning: Error reading node ID block 1 [2725] warning: Error reading node ID block 2 [2725] warning: Error reading node ID block 3 [2725] warning: Error reading node ID block 4 [2725] warning: Error reading node ID block 5 [2725] warning: Error reading node ID block 6 [2725] warning: Error reading node ID block 7 [2725] warning: Error reading node ID block 8 [2725] warning: Error reading node ID block 9 [2725] warning: Error reading node ID block 10 [2725] warning: Error reading node ID block 11 [2725] warning: Error reading node ID block 12 [2725] warning: Error reading node ID block 13 [2725] warning: Error reading node ID block 14 [2725] warning: Error reading node ID block 15 [2725] warning: Error reading node ID block 16 [2725] crit: Critical Error: More than one master found! diskRawWriteShadow: Input/output error diskRawWriteShadow: aligned write returned -1, not 512 diskRawWriteShadow: Input/output error Error writing node ID block 1 [2725] err: Error writing to quorum disk Node ID: 1 Score (current / min req. / max allowed): 1 / 1 / 1 Current state: Master Current disk state: None Visible Set: { 1 } Master Node ID: 1 Quorate Set: { 1 } [2725] warning: Error reading node ID block 1 [2725] warning: Error reading node ID block 2 [2725] warning: Error reading node ID block 3 [2725] warning: Error reading node ID block 4 [2725] warning: Error reading node ID block 5 [2725] warning: Error reading node ID block 6 [2725] warning: Error reading node ID block 7 [2725] warning: Error reading node ID block 8 [2725] warning: Error reading node ID block 9 [2725] warning: Error reading node ID block 10 [2725] warning: Error reading node ID block 11 [2725] warning: Error reading node ID block 12 [2725] warning: Error reading node ID block 13 [2725] warning: Error reading node ID block 14 [2725] warning: Error reading node ID block 15 [2725] warning: Error reading node ID block 16 [2725] crit: A master exists, but it's not me?! 
diskRawWriteShadow: Input/output error diskRawWriteShadow: aligned write returned -1, not 512 diskRawWriteShadow: Input/output error Error writing node ID block 1 [2725] err: Error writing to quorum disk Node ID: 1 Score (current / min req. / max allowed): 1 / 1 / 1 Current state: Master Current disk state: None Visible Set: { 16777216 } Master Node ID: 16777216 Quorate Set: { 16777216 33554432 50331648 67108864 83886080 100663296 117440512 134217728 150994944 167772160 184549376 201326592 218103808 234881024 251658240 268435456 } -- Simone Gotti -- Email.it, the professional e-mail, gratis per te: http://www.email.it/f Sponsor: Refill s.r.l. - Prodotti per TUTTE le stampanti sul mercato a prezzi sempre convenienti. Dal 1993, leader nel compatibile di qualit? in Italia. Clicca qui: http://adv.email.it/cgi-bin/foclick.cgi?mid=5188&d=10-1 -------------- next part -------------- A non-text attachment was scrubbed... Name: cman-2.0.35-qdisk-read_node_blocks-continue-on-disk-error.patch Type: text/x-patch Size: 461 bytes Desc: not available URL: From breeves at redhat.com Wed Jan 10 16:34:15 2007 From: breeves at redhat.com (Bryn M. Reeves) Date: Wed, 10 Jan 2007 16:34:15 +0000 Subject: [Linux-cluster] Quick off topic question In-Reply-To: <140844.91703.qm@web33215.mail.mud.yahoo.com> References: <140844.91703.qm@web33215.mail.mud.yahoo.com> Message-ID: <45A51587.1020807@redhat.com> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Brian Pontz wrote: >> Unfortunately, that would defeat the purpose behind >> my wanting to remotely log >> the activity. > > You can do this through syslog but it would require > you to modify the kernel code and recompile it. You > would basically printk() all exec's in the kernel. > Otherwise the honeynet project would probably be the > best people to ask about this. Isn't that what the audit subsystem is designed for? No need for custom kernels - just set some audit rules to monitor execs and parse the auditd output. This still won't be a perfect replacement for bash_history though as it will loose some detail of the arguments. That said, if this is purely for security monitoring and not to have a list of commands and their arguments for re-play purposes (that's the goal of a shell history), I think audit would be the most straightforward solution. You need to set up two files to configure auditing, auditd.conf and audit.rules. The first governs the daemon itself, the second tells it what to audit. There's currently no direct support for syslog-style @host remote logging, but there is a "dispatcher" directive in auditd.conf that will run an external command when audit starts and pipe each message to that program's stdin - a simple wrapper would then be able to squirt the messages to a remote server if needed. Alternately, make /var/log/audit a separate filesystem on GFS and write the logs here. That will probably need some twiddling as I think auditd normally starts before GFS filesystems are mounted but shouldn't be impossible. A simple audit rule to get started could look like this: - -a exit,always -S exec You can be more specific about what to log, filter by uid, pid and other attributes - see the auditctl man page for the details as well as the sample rule files under /usr/share/doc/audit-*/. One word of warning - it's possible to DoS yourself in a couple of ways with audit. The default behavior when audit cannot create its logs is to panic - this is for high security environments where no service is better than insecure service. 
Disable it by setting "-f 0" or "-f 1" in the rules file (silent/printk on error respectively). Also, the volume of messages can be huge with a very broad ruleset - be sure to allow enough space for the logs and to configure rotation if needed. More info here: http://people.redhat.com/sgrubb/audit/ Cheers, Bryn. > Brian > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org iD8DBQFFpRWH6YSQoMYUY94RAtxrAKCfBPDO2dcLLx8lWy/7gQbagM5KDACfYfX/ WOWcQ/oJTQ/JA8z7Uitx8lA= =Dey9 -----END PGP SIGNATURE----- From kagato at souja.net Wed Jan 10 17:15:09 2007 From: kagato at souja.net (Jayson Vantuyl) Date: Wed, 10 Jan 2007 11:15:09 -0600 Subject: [Linux-cluster] Cluster for number-crunching purposes In-Reply-To: <45A50282.7060205@fis.unical.it> References: <45A50282.7060205@fis.unical.it> Message-ID: <83A2D7D7-125F-4586-A4AE-A0BB37F78ADD@souja.net> I think having 20 votes per node with cman expecting 1 vote could completely break quorum calculation (although it would appear to work just fine until you had a network failure). On Jan 10, 2007, at 9:13 AM, Fedele Stabile wrote: > I have a new 35-nodes cluster with a SAN for data storage, my SAN > is connected via SCSI with two nodes. > OS is CentOS4 with ClusterSuite > Cluster purpose is numer-crinching: > SAN disks are GFS and exported via gnbd to the other 33 nodes in > the cluster. > Configuration file cluster.conf is below. > > This is my first cluster configured with the ClusterSuite, can > anyone help me to understand if i made any mistake? > > Thank you > > Fedele STABILE > > > /etc/cluster/cluster.conf > > > > > > > > > > > > > > > > > > > > > > > nodename="pc0"/> > > > > ..... > ..... > > > > > servers="server1 server2"/> > > > > ..... > ..... > > > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From fedele at fis.unical.it Wed Jan 10 17:50:14 2007 From: fedele at fis.unical.it (Fedele Stabile) Date: Wed, 10 Jan 2007 18:50:14 +0100 Subject: [Linux-cluster] Cluster for number-crunching purposes In-Reply-To: <83A2D7D7-125F-4586-A4AE-A0BB37F78ADD@souja.net> References: <45A50282.7060205@fis.unical.it> <83A2D7D7-125F-4586-A4AE-A0BB37F78ADD@souja.net> Message-ID: <45A52756.8000302@fis.unical.it> I experienced that using vote=1 for all members gives the same quorum votes as result of command cman_tool status Instead i woulk create a quorum disk on SAN storage. Can you help me? Jayson Vantuyl wrote: > I think having 20 votes per node with cman expecting 1 vote could > completely break quorum calculation (although it would appear to work > just fine until you had a network failure). > > On Jan 10, 2007, at 9:13 AM, Fedele Stabile wrote: > >> I have a new 35-nodes cluster with a SAN for data storage, my SAN is >> connected via SCSI with two nodes. >> OS is CentOS4 with ClusterSuite >> Cluster purpose is numer-crinching: >> SAN disks are GFS and exported via gnbd to the other 33 nodes in the >> cluster. >> Configuration file cluster.conf is below. >> >> This is my first cluster configured with the ClusterSuite, can anyone >> help me to understand if i made any mistake? >> >> Thank you >> >> Fedele STABILE >> >> >> /etc/cluster/cluster.conf >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> > nodename="pc0"/> >> >> >> >> ..... >> ..... 
>> >> >> >> >> > servers="server1 server2"/> >> >> >> >> ..... >> ..... >> >> >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster at redhat.com >> https://www.redhat.com/mailman/listinfo/linux-cluster > > > From lhh at redhat.com Wed Jan 10 17:56:57 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 10 Jan 2007 12:56:57 -0500 Subject: [Linux-cluster] [PATCH] qdisk: fix crash or wrong behavior if "qdisk_read" returns an error. In-Reply-To: <1168445290.4361.24.camel@localhost> References: <1168445290.4361.24.camel@localhost> Message-ID: <1168451817.15369.228.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-10 at 17:08 +0100, Simone Gotti wrote: > In qdisk/main.c:read_node_blocks if the call to "qdisk_read" returns an > error, the cycle isn't interrupted and the call swab_status_block_t will > make qdiskd crash or report bad node id, master status etc.... This will > probably (not reproduced) cause strange behavior like this node trying > to kill the others that are working correctly. > > I putted a "continue" to skip the cycle after the error. As if nothing > about the node can be read it's better to not change the current > informations. > > I hope the patch is correct. It looks right. -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From Andre at hudat.com Wed Jan 10 18:12:00 2007 From: Andre at hudat.com (Andre Henry) Date: Wed, 10 Jan 2007 13:12:00 -0500 Subject: [Linux-cluster] ccsd problems Message-ID: I have a two node cluster that has been humming along without issues for over a year. Reboots crashes no problems. Restart and all is well. I had a SCSI error yesterday now node 2 will not even start ccsd. All seems ok with packages, nics, kernel, modules. The system has been rebooted in the past. No info other than "Unable to connect to cluster infrastructure" printed in logs. An strace seems to show its using IPv6 to connect to the other node. I have tried passing the -I and -4 option with no luck. -- Andre From kitgerrits at gmail.com Wed Jan 10 18:14:58 2007 From: kitgerrits at gmail.com (Kit Gerrits) Date: Wed, 10 Jan 2007 19:14:58 +0100 Subject: [Linux-cluster] Quick off topic question Message-ID: <005d01c734e3$3cf79eb0$4c4b3291@kagtqp> Keep in mind, that Bash does some interesting tricks with its bash_history. (like maintaining a single history per session and fusing them afterwards). It might be a good idea to mail&wipe the .bash_history file upon logout. If you want to use the .bash_history file for autiding: Some O/S'es / filesystems allow write-only access to files. This would make sure the user cannot 'edit' the file to remove any traces. (This is usually limited to /var/log, so I don't know if it can be applied to a single file) Regards, Kit Gerrits From kitgerrits at gmail.com Wed Jan 10 18:18:37 2007 From: kitgerrits at gmail.com (Kit Gerrits) Date: Wed, 10 Jan 2007 19:18:37 +0100 Subject: [Linux-cluster] Cluster software won't start at boot Message-ID: <005e01c734e3$bf7ba340$4c4b3291@kagtqp> Lon Hohberger wrote: >>> Well, they people that set that system up are a bit strange. >>> They have runlevel 5 as initdefault, but the system does not show a >>> graphical login at boot. >>> (startx works, though) >> >> That's weird, but certainly not the problem. Although, it might be >> related somehow... clumanager doesn't start, and X doesn't start, but >> both *should*. 
> >Sounds like inittab weirdness - I saw symptoms like this a few times while teaching class when students would do stuff like: > >id:5:initdefault: > ># System initialization. >si::sysinit:/etc/rc.d/rc.sysinit > >l0:0:wait:/etc/rc.d/rc 0 >l1:1:wait:/etc/rc.d/rc 1 >l2:2:wait:/etc/rc.d/rc 2 >l3:3:wait:/etc/rc.d/rc 3 >l4:4:wait:/etc/rc.d/rc 4 >l5:5:wait:/etc/rc.d/rc 3 <------ >l6:6:wait:/etc/rc.d/rc 6 >... > >When attempting to change their default runlevel. > That trick sounds horribly familiar... ( Sorry, NDA ;-) ) Checked, but this is not the case (It's much more fun to redirect runlevels to 6 or 0 :-D [root at nzcs1 etc]# grep :5 /etc/inittab id:5:initdefault: l5:5:wait:/etc/rc.d/rc 5 x:5:respawn:/etc/X11/prefdm -nodaemon >It makes life kinda exciting if things are disabled (K links) in rc3.d but enabled (S links) in rc5.d - runlevel/who -r etc. report >one thing, but the services started are those belonging to the other runlevel. That's cute, checked and passed: [root at nzcs1 etc]# find . -type l -ls |grep cluster 2796355 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 ./rc.d/rc0.d/K01cluster -> ../init.d/cluster 2976480 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 ./rc.d/rc1.d/K01cluster -> ../init.d/cluster 3009017 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 ./rc.d/rc2.d/S99cluster -> ../init.d/cluster 3026727 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 ./rc.d/rc3.d/S99cluster -> ../init.d/cluster 3042076 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 ./rc.d/rc4.d/S99cluster -> ../init.d/cluster 3058185 0 lrwxrwxrwx 1 root root 17 Jun 16 2005 ./rc.d/rc5.d/S99cluster -> ../init.d/cluster 3090980 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 ./rc.d/rc6.d/K01cluster -> ../init.d/cluster >It's also worth checking grub.conf incase they've overridden initdefault from the kernel command line. Foiled again! >From /boot/grub/grub.conf: default=13 fallback=10 # This entry (no. 10) added by Proliant HBA install script title HP-2.4.9-e.24enterprise-1 root (hd0,0) kernel /vmlinuz-2.4.9-e.24enterprise ro root=/dev/cciss/c0d0p5 hda=ide-scsi initrd /HP-initrd-2.4.9-e.24enterprise.img # This entry (no. 13) added by Proliant HBA install script title HP-2.4.9-e.24enterprise-2 root (hd0,0) kernel /vmlinuz-2.4.9-e.24enterprise ro root=/dev/cciss/c0d0p5 hda=ide-scsi initrd /HP-initrd-2.4.9-e.24enterprise.img-0 (I'm still trying to figure out WTF HP did with my grub.conf) I -DO- appreciate all the help I have received so far. This is an interesting little trick they must have pulled... Regards, Kit From lhh at redhat.com Wed Jan 10 18:41:16 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 10 Jan 2007 13:41:16 -0500 Subject: [Linux-cluster] ccsd problems In-Reply-To: References: Message-ID: <1168454476.15369.230.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-10 at 13:12 -0500, Andre Henry wrote: > I have a two node cluster that has been humming along without issues > for over a year. Reboots crashes no problems. Restart and all is well. > I had a SCSI error yesterday now node 2 will not even start ccsd. All > seems ok with packages, nics, kernel, modules. The system has been > rebooted in the past. > > No info other than "Unable to connect to cluster infrastructure" > printed in logs. An strace seems to show its using IPv6 to connect to > the other node. I have tried passing the -I and -4 option with no luck. Is it RHEL3 or RHEL4 ? -- Lon -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Wed Jan 10 19:00:10 2007 From: lhh at redhat.com (Lon Hohberger) Date: Wed, 10 Jan 2007 14:00:10 -0500 Subject: [Linux-cluster] Cluster software won't start at boot In-Reply-To: <005e01c734e3$bf7ba340$4c4b3291@kagtqp> References: <005e01c734e3$bf7ba340$4c4b3291@kagtqp> Message-ID: <1168455610.15369.232.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-10 at 19:18 +0100, Kit Gerrits wrote: > That's cute, checked and passed: > [root at nzcs1 etc]# find . -type l -ls |grep cluster > 2796355 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 > ./rc.d/rc0.d/K01cluster -> ../init.d/cluster > 2976480 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 > ./rc.d/rc1.d/K01cluster -> ../init.d/cluster > 3009017 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 > ./rc.d/rc2.d/S99cluster -> ../init.d/cluster > 3026727 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 > ./rc.d/rc3.d/S99cluster -> ../init.d/cluster > 3042076 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 > ./rc.d/rc4.d/S99cluster -> ../init.d/cluster > 3058185 0 lrwxrwxrwx 1 root root 17 Jun 16 2005 > ./rc.d/rc5.d/S99cluster -> ../init.d/cluster > 3090980 0 lrwxrwxrwx 1 root root 17 Jun 3 2004 > ./rc.d/rc6.d/K01cluster -> ../init.d/cluster Maybe /etc/init.d/cluster isn't mode 755 for some reason ? *shrug* -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From mvz+rhcluster at nimium.hr Wed Jan 10 19:47:09 2007 From: mvz+rhcluster at nimium.hr (Miroslav Zubcic) Date: Wed, 10 Jan 2007 20:47:09 +0100 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests - bug in fenced? Message-ID: <45A542BD.9070107@nimium.hr> Hi people. I had this problem in spring last year while configuring one RH cluster for local telco. RH tehnical support was not very useful. They told me this is not a bug and so on ... So I will like to ask this here on RH cluster list, in hope for better advice. When I have 2-node cluster with RSA II management cards (fence_rsa agent) configured to have 1 oracle database in failover together with VIP adress and 5 luns shared from EMC storage, how can I pass one simple test with pooling out main data ethernet cables from active node? Let's say that I have interface bond0 (data subnet/vlan) and bond1 (fence subnet/vlan) on each node. Our customers (and we also, it is logical) are expecting if we pull out all two data cables from bond0 that inactive node will kill/fence active node and take over it's services. Unfortunately, what we see almost every time on acceptance test is that two nodes are killing each other no matter if they have or does not have a link. Here is fragment from /var/adm/messages on the active node when I disable bond0 (by pooling out cables): --------------------------------------------------------------------- Jan 9 14:05:43 north clurgmgrd: [4593]: Link for bond0: Not detected Jan 9 14:05:43 north clurgmgrd: [4593]: No link on bond0... 
Jan 9 14:05:43 north clurgmgrd[4593]: status on ip "10.156.10.32/26" returned 1 (generic error) Jan 9 14:05:43 north clurgmgrd[4593]: Stopping service ora_PROD Jan 9 14:05:53 north kernel: CMAN: removing node south from the cluster : Missed too many heartbeats Jan 9 14:05:53 north fenced[4063]: north not a cluster member after 0 sec post_fail_delay Jan 9 14:05:53 north fenced[4063]: fencing node "south" Jan 9 14:05:55 north shutdown: shutting down for system halt Jan 9 14:05:55 north init: Switching to runlevel: 0 Jan 9 14:05:55 north login(pam_unix)[4599]: session closed for user root Jan 9 14:05:56 north rgmanager: [4270]: Shutting down Cluster Service Manager... Jan 9 14:05:56 north clurgmgrd[4593]: Shutting down Jan 9 14:05:56 north fenced[4063]: fence "south" success [...] Jan 9 14:11:19 north syslogd 1.4.1: restart. ---------------------------------------------------------- As we see here, clurgmgrd(8) on node "north" has DETECTED that there is no link, it began to stop service "ora_PROD", system goes in shutdown. So far, so good. But then, fenced(8) daemon decides to fence "south" node (healthy node which has data link and all presupositions to take over ora_PROD service (oracle + IP + 5 ext3 FS's from EMC storage)! Why? Of course, south also is fenceing north, and I then have tragicomic situation where both nodes are beeing rebooted by eacs other. How can I prevent this? This looks like a bug. I don't want fenced to fence other node south if it already "knows" that it is the one without link. What to do? We cannot pass acceptance tests with such cluster state. :-( Thanks for any advice ... -- Miroslav Zubcic, Nimium d.o.o., email: Tel: +385 01 4852 639, Fax: +385 01 4852 640, Mobile: +385 098 942 8672 Mrazoviceva 12, 10000 Zagreb, Hrvatska From Andre at hudat.com Wed Jan 10 19:49:40 2007 From: Andre at hudat.com (Andre Henry) Date: Wed, 10 Jan 2007 14:49:40 -0500 Subject: [Linux-cluster] ccsd problems In-Reply-To: <1168454476.15369.230.camel@rei.boston.devel.redhat.com> References: <1168454476.15369.230.camel@rei.boston.devel.redhat.com> Message-ID: <1b5633ce2dba2364638f2b786860162e@hudat.com> RHEL4 -- Thanks Andre On Jan 10, 2007, at 1:41 PM, Lon Hohberger wrote: > On Wed, 2007-01-10 at 13:12 -0500, Andre Henry wrote: >> I have a two node cluster that has been humming along without issues >> for over a year. Reboots crashes no problems. Restart and all is well. >> I had a SCSI error yesterday now node 2 will not even start ccsd. All >> seems ok with packages, nics, kernel, modules. The system has been >> rebooted in the past. >> >> No info other than "Unable to connect to cluster infrastructure" >> printed in logs. An strace seems to show its using IPv6 to connect to >> the other node. I have tried passing the -I and -4 option with no >> luck. > > Is it RHEL3 or RHEL4 ? > > -- Lon > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster From breeves at redhat.com Wed Jan 10 19:59:29 2007 From: breeves at redhat.com (Bryn M. Reeves) Date: Wed, 10 Jan 2007 19:59:29 +0000 Subject: [Linux-cluster] Quick off topic question In-Reply-To: <005d01c734e3$3cf79eb0$4c4b3291@kagtqp> References: <005d01c734e3$3cf79eb0$4c4b3291@kagtqp> Message-ID: <45A545A1.9090808@redhat.com> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Kit Gerrits wrote: > Keep in mind, that Bash does some interesting tricks with its bash_history. > (like maintaining a single history per session and fusing them afterwards). 
> > It might be a good idea to mail&wipe the .bash_history file upon logout. > > > If you want to use the .bash_history file for autiding: > Some O/S'es / filesystems allow write-only access to files. > This would make sure the user cannot 'edit' the file to remove any traces. > (This is usually limited to /var/log, so I don't know if it can be applied > to a single file) > Ext3 allows something close to this. Using its extended attributes you can mark a file as append only (chattr +a ). Only the root account can add/remove this attr. It doesn't seem to play to well when the history fills up though - if I set HISTFILESIZE and HISTSIZE both to 10, after 10 history items have accumulated it ceases to record anything. I don't think trying to use the shell history as a security audit is really going to fly. Kind regards, Bryn. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org iD8DBQFFpUWg6YSQoMYUY94RAodyAJwPqvhL6kjsuNtk+41fjCTTm42WCQCfePBG Ej02a3O1mY8reqbN/8KqRDM= =mSYq -----END PGP SIGNATURE----- From jwhiter at redhat.com Wed Jan 10 20:16:49 2007 From: jwhiter at redhat.com (Josef Whiter) Date: Wed, 10 Jan 2007 15:16:49 -0500 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests - bug in fenced? In-Reply-To: <45A542BD.9070107@nimium.hr> References: <45A542BD.9070107@nimium.hr> Message-ID: <20070110201648.GE24326@korben.rdu.redhat.com> On Wed, Jan 10, 2007 at 08:47:09PM +0100, Miroslav Zubcic wrote: > Hi people. > > I had this problem in spring last year while configuring one RH cluster > for local telco. RH tehnical support was not very useful. They told me > this is not a bug and so on ... So I will like to ask this here on RH > cluster list, in hope for better advice. > > When I have 2-node cluster with RSA II management cards (fence_rsa agent) > configured to have 1 oracle database in failover together with VIP adress > and 5 luns shared from EMC storage, how can I pass one simple test with > pooling out main data ethernet cables from active node? > > Let's say that I have interface bond0 (data subnet/vlan) and bond1 (fence > subnet/vlan) on each node. Our customers (and we also, it is logical) are > expecting if we pull out all two data cables from bond0 that inactive node > will kill/fence active node and take over it's services. > > Unfortunately, what we see almost every time on acceptance test is that > two nodes are killing each other no matter if they have or does not have a > link. > > Here is fragment from /var/adm/messages on the active node when I disable > bond0 (by pooling out cables): > > --------------------------------------------------------------------- > Jan 9 14:05:43 north clurgmgrd: [4593]: Link for bond0: Not > detected > Jan 9 14:05:43 north clurgmgrd: [4593]: No link on bond0... > > Jan 9 14:05:43 north clurgmgrd[4593]: status on ip > "10.156.10.32/26" returned 1 (generic error) > Jan 9 14:05:43 north clurgmgrd[4593]: Stopping service ora_PROD > Jan 9 14:05:53 north kernel: CMAN: removing node south from the cluster : > Missed too many heartbeats > Jan 9 14:05:53 north fenced[4063]: north not a cluster member after 0 sec > post_fail_delay > Jan 9 14:05:53 north fenced[4063]: fencing node "south" > Jan 9 14:05:55 north shutdown: shutting down for system halt > Jan 9 14:05:55 north init: Switching to runlevel: 0 > Jan 9 14:05:55 north login(pam_unix)[4599]: session closed for user root > Jan 9 14:05:56 north rgmanager: [4270]: Shutting down Cluster > Service Manager... 
> Jan 9 14:05:56 north clurgmgrd[4593]: Shutting down > Jan 9 14:05:56 north fenced[4063]: fence "south" success > > [...] > > Jan 9 14:11:19 north syslogd 1.4.1: restart. > ---------------------------------------------------------- > > As we see here, clurgmgrd(8) on node "north" has DETECTED that there is no > link, it began to stop service "ora_PROD", system goes in shutdown. So > far, so good. But then, fenced(8) daemon decides to fence "south" node > (healthy node which has data link and all presupositions to take over > ora_PROD service (oracle + IP + 5 ext3 FS's from EMC storage)! Why? > > Of course, south also is fenceing north, and I then have tragicomic > situation where both nodes are beeing rebooted by eacs other. > > How can I prevent this? This looks like a bug. I don't want fenced to > fence other node south if it already "knows" that it is the one without link. > > What to do? We cannot pass acceptance tests with such cluster state. :-( > > Thanks for any advice ... > This isn't a bug, its working as expected. What you need in qdisk, set it up with the proper hueristics and it will force the shutdown of the bad node before the bad node has a chance to fence off the working node. Josef From jvantuyl at engineyard.com Wed Jan 10 21:17:02 2007 From: jvantuyl at engineyard.com (Jayson Vantuyl) Date: Wed, 10 Jan 2007 15:17:02 -0600 Subject: [Linux-cluster] Quick off topic question In-Reply-To: <45A545A1.9090808@redhat.com> References: <005d01c734e3$3cf79eb0$4c4b3291@kagtqp> <45A545A1.9090808@redhat.com> Message-ID: In bash, shell history can be disabled with the command: unset HISTFILE It wasn't intended to be and isn't suitable for any form of security tracking. Not to mention that at any point the intruder could manually execute a non-interactive shell which wouldn't log either. I'd really recommend the auditing infrastructure. On Jan 10, 2007, at 1:59 PM, Bryn M. Reeves wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Kit Gerrits wrote: >> Keep in mind, that Bash does some interesting tricks with its >> bash_history. >> (like maintaining a single history per session and fusing them >> afterwards). >> >> It might be a good idea to mail&wipe the .bash_history file upon >> logout. >> >> >> If you want to use the .bash_history file for autiding: >> Some O/S'es / filesystems allow write-only access to files. >> This would make sure the user cannot 'edit' the file to remove any >> traces. >> (This is usually limited to /var/log, so I don't know if it can be >> applied >> to a single file) >> > > Ext3 allows something close to this. Using its extended attributes you > can mark a file as append only (chattr +a ). Only the root > account > can add/remove this attr. > > It doesn't seem to play to well when the history fills up though - > if I > set HISTFILESIZE and HISTSIZE both to 10, after 10 history items have > accumulated it ceases to record anything. > > I don't think trying to use the shell history as a security audit is > really going to fly. > > Kind regards, > > Bryn. 
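A rough sketch of the qdisk-plus-heuristics setup suggested above (and asked about earlier in the number-crunching thread); the shared device, label and gateway address are all placeholders:

# Initialise a small shared LUN as the quorum disk:
mkqdisk -c /dev/sdc1 -l myqdisk

# A heuristic is just a command whose exit status says "this node still
# has its data uplink", typically a ping of the default gateway:
ping -c1 -w1 10.156.10.1

# cluster.conf then gets a <quorumd label="myqdisk" ...> block containing
# <heuristic program="ping -c1 -w1 10.156.10.1" .../>, so a node that
# loses its data link loses its score and is evicted before it can fence
# the healthy node.  Finally start the daemon on every node:
service qdiskd start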
> > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.5 (GNU/Linux) > Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org > > iD8DBQFFpUWg6YSQoMYUY94RAodyAJwPqvhL6kjsuNtk+41fjCTTm42WCQCfePBG > Ej02a3O1mY8reqbN/8KqRDM= > =mSYq > -----END PGP SIGNATURE----- > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster -- Jayson Vantuyl Systems Architect Engine Yard jvantuyl at engineyard.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From jvantuyl at engineyard.com Wed Jan 10 21:22:35 2007 From: jvantuyl at engineyard.com (Jayson Vantuyl) Date: Wed, 10 Jan 2007 15:22:35 -0600 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests - bug in fenced? In-Reply-To: <45A542BD.9070107@nimium.hr> References: <45A542BD.9070107@nimium.hr> Message-ID: <79A61925-4F6E-4947-863B-DCA0BD4C3A74@engineyard.com> On Jan 10, 2007, at 1:47 PM, Miroslav Zubcic wrote: > How can I prevent this? This looks like a bug. I don't want fenced to > fence other node south if it already "knows" that it is the one > without link. It is very possible to write a wrapper script for your fencing agent that simply checks the link and refuses to fence when the link is down. However, this wouldn't really be good in any non-network related fencing situation. The recommendation to use a qdiskd would be a good one and you could even use a link detection script as an arbitrary heuristic in this case. -- Jayson Vantuyl Systems Architect Engine Yard jvantuyl at engineyard.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From lshen at cisco.com Wed Jan 10 23:00:54 2007 From: lshen at cisco.com (Lin Shen (lshen)) Date: Wed, 10 Jan 2007 15:00:54 -0800 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <1168282204.15369.43.camel@rei.boston.devel.redhat.com> Message-ID: <08A9A3213527A6428774900A80DBD8D8033E77FA@xmb-sjc-222.amer.cisco.com> > -----Original Message----- > From: linux-cluster-bounces at redhat.com > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger > Sent: Monday, January 08, 2007 10:50 AM > To: linux clustering > Subject: Re: [Linux-cluster] Remove the clusterness from GFS > > On Mon, 2007-01-08 at 10:39 -0800, Lin Shen (lshen) wrote: > > How easy is it to > > remove some or all of the clusterness from GFS such as > fencing, cman > > and ccsd stuff? I understand that things like dlm must stay > for GFS to work. > > I would think it is very difficult. > > You can use GFS on *one* node without a cluster. > > In order to use a clustered file system, you need a cluster. > The cluster acts as the control mechanism for accessing the > file system. > Without it, each computer accessing GFS will have no > knowledge of when it is safe to write to or read from the > file system. This will lead to file system corruption very quickly. > > If you absolutely can not have a bit of "cluster software > running", you'll probably need to use a client/server > approach like NFS instead of a cluster file system like GFS. > > It's not that we discriminate against cluster software :). We just have some worries about the potential impact the cluster suite could bring to the system. Extra CPU and memory cost is ok, we can consider that's part of running GFS. The part that gets us wonder is any potential behavioral changes and instability to the system. After all, the system is effectively tunrned into a cluster. 
I read some of the emails in the alias about cluster issues aside from GFS. For instance, we support hot removal/insertion of nodes in the system, I'm not clear how fencing will get in the way. We're not planning to add any fencing hardware, and most likely will set fencing mechanism as manual. Ideally, we'd like to disable fencing except the part that is needed for running GFS. lin From irwan at magnifix.com.my Thu Jan 11 04:10:05 2007 From: irwan at magnifix.com.my (Mohd Irwan Jamaluddin) Date: Thu, 11 Jan 2007 12:10:05 +0800 Subject: [Linux-cluster] Fencing Problem On APC 7950 Message-ID: <1168488605.26513.28.camel@kuli.magnifix.com.my> Good day, I'm running Red Hat Cluster Suite on RHEL 4 U3 with APC 7950 ( http://www.apc.com/resource/include/techspec_index.cfm?base_sku=AP7950 ) as the fencing device. The version of my fence package is 1.32.18-0. I try to execute some fence_apc commands but I've got errors. Below are the details: [root at orarac01 ~]# fence_apc -a 10.0.6.150 -l apc -p apc -n 02 \ > -o Reboot -T -v failed: unrecognised Reboot response Same error occured for On/Off option. Also, I found a weird problem if I put "2" instead of "02" for the Outlet Number option (-n option). Below are the error message: [root at orarac01 ~]# fence_apc -a 10.0.6.150 -l apc -p apc -n 2 \ > -o Reboot -T -v failed: unrecognised menu response H Anyone have ever faced similar problem with me? Here I attached the apclog for reference. Thanks in advanced for your response. -- Regards, +--------------------------------+ | Mohd Irwan Jamaluddin | | ## System Engineer, | | (o_ Magnifix Sdn. Bhd. | | //\ Tel: +60 3 42705073 | | V_/_ Fax: +60 3 42701960 | | http://www.magnifix.com/ | +--------------------------------+ | "Every successful side needs | | unsung heroes" - fcbayern.de | +--------------------------------+ -------------- next part -------------- User Name : apc Password : *** American Power Conversion Network Management Card AOS v2.2.7 (c) Copyright 2002 All Rights Reserved Rack PDU APP v2.2.0 ------------------------------------------------------------------------------- Name : APC for TMNet Date : 06/30/2002 Contact : Mohd Irwan Jamaluddin Time : 10:31:03 Location : Bilik Server Magnifix User : Administrator Up Time : 0 Days 15 Hours 22 Minutes Stat : P+ N+ A+ Switched Rack PDU: Communication Established ------- Control Console ------------------------------------------------------- 1- Device Manager 2- Network 3- System 4- Logout - Main Menu, - Refresh, - Event Log > ------- Control Console ------------------------------------------------------- 1- Device Manager 2- Network 3- System 4- Logout - Main Menu, - Refresh, - Event Log > American Power Conversion Network Management Card AOS v2.2.7 (c) Copyright 2002 All Rights Reserved Rack PDU APP v2.2.0 ------------------------------------------------------------------------------- Name : APC for TMNet Date : 06/30/2002 Contact : Mohd Irwan Jamaluddin Time : 10:31:03 Location : Bilik Server Magnifix User : Administrator Up Time : 0 Days 15 Hours 22 Minutes Stat : P+ N+ A+ Switched Rack PDU: Communication Established ------- Control Console ------------------------------------------------------- 1- Device Manager 2- Network 3- System 4- Logout - Main Menu, - Refresh, - Event Log > ------- Control Console ------------------------------------------------------- 1- Device Manager 2- Network 3- System 4- Logout - Main Menu, - Refresh, - Event Log > 1 ------- Device Manager -------------------------------------------------------- 1- Phase 
Monitor/Configuration 2- Outlet Restriction Configuration 3- Outlet Control/Configuration 4- Power Supply Status - Back, - Refresh, - Event Log > 3 ------- Outlet Control/Configuration ------------------------------------------ 1- Outlet 01: Outlet 1 ON 2- Outlet 02: 2 ON 3- Outlet 03: Outlet 3 ON 4- Outlet 04: Outlet 4 ON 5- Outlet 05: Outlet 5 ON 6- Outlet 06: Outlet 6 ON 7- Outlet 07: Outlet 7 ON 8- Outlet 08: Outlet 8 ON 9- Outlet 09: Outlet 9 ON 10- Outlet 10: Outlet 10 ON 11- Outlet 11: Outlet 11 ON 12- Outlet 12: Outlet 12 ON 13- Outlet 13: Outlet 13 ON 14- Outlet 14: Outlet 14 ON 15- Outlet 15: Outlet 15 ON 16- Outlet 16: Outlet 16 ON 17- Master Control/Configuration - Back, - Refresh, - Event Log > 2 ------- Outlet 02: 2 ---------------------------------------------------------- Name : 2 Outlet : 2 State : ON 1- Control Outlet 2 2- Configure Outlet 2 ?- Help, - Back, - Refresh, - Event Log > 1 ------- Control Outlet -------------------------------------------------------- Name : 2 Outlet : 2 State : ON 1- Immediate On 2- Immediate Off 3- Immediate Reboot 4- Delayed On 5- Delayed Off 6- Delayed Reboot 7- Cancel ?- Help, - Back, - Refresh, - Event Log > 3 ----------------------------------------------------------------------- Immediate Reboot This command will immediately shutdown outlet 2 named '2', delay for 5 seconds, and then restart. Enter 'YES' to continue or to cancel : Press to continue... ------- Control Outlet -------------------------------------------------------- Name : 2 Outlet : 2 State : ON 1- Immediate On 2- Immediate Off 3- Immediate Reboot 4- Delayed On 5- Delayed Off 6- Delayed Reboot 7- Cancel ?- Help, - Back, - Refresh, - Event Log > ------- Outlet 02: 2 ---------------------------------------------------------- Name : 2 Outlet : 2 State : ON 1- Control Outlet 2 2- Configure Outlet 2 ?- Help, - Back, - Refresh, - Event Log > ------- Outlet Control/Configuration ------------------------------------------ 1- Outlet 01: Outlet 1 ON 2- Outlet 02: 2 ON 3- Outlet 03: Outlet 3 ON 4- Outlet 04: Outlet 4 ON 5- Outlet 05: Outlet 5 ON 6- Outlet 06: Outlet 6 ON 7- Outlet 07: Outlet 7 ON 8- Outlet 08: Outlet 8 ON 9- Outlet 09: Outlet 9 ON 10- Outlet 10: Outlet 10 ON 11- Outlet 11: Outlet 11 ON 12- Outlet 12: Outlet 12 ON 13- Outlet 13: Outlet 13 ON 14- Outlet 14: Outlet 14 ON 15- Outlet 15: Outlet 15 ON 16- Outlet 16: Outlet 16 ON 17- Master Control/Configuration - Back, - Refresh, - Event Log > ------- Device Manager -------------------------------------------------------- 1- Phase Monitor/Configuration 2- Outlet Restriction Configuration 3- Outlet Control/Configuration 4- Power Supply Status - Back, - Refresh, - Event Log > ------- Control Console ------------------------------------------------------- 1- Device Manager 2- Network 3- System 4- Logout - Main Menu, - Refresh, - Event Log > From jvantuyl at engineyard.com Thu Jan 11 06:09:53 2007 From: jvantuyl at engineyard.com (Jayson Vantuyl) Date: Thu, 11 Jan 2007 00:09:53 -0600 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <08A9A3213527A6428774900A80DBD8D8033E77FA@xmb-sjc-222.amer.cisco.com> References: <08A9A3213527A6428774900A80DBD8D8033E77FA@xmb-sjc-222.amer.cisco.com> Message-ID: > It's not that we discriminate against cluster software :). We just > have > some worries about the potential impact the cluster suite could > bring to > the system. Extra CPU and memory cost is ok, we can consider that's > part > of running GFS. 
The part that gets us wonder is any potential > behavioral > changes and instability to the system. After all, the system is > effectively tunrned into a cluster. I read some of the emails in the > alias about cluster issues aside from GFS. Behavior aside, with full understanding of how it works, clustering is neither complex nor particularly troublesome. Understand that the instability you read about comes not from the clustering but rather is the nature of sharing these resources between multiple machines. I operate over 40 clusters with a total of well over 100 nodes and I can assure you that the day I implemented comprehensive fencing (i.e. removed fence_manual and wrote a fencing agent for our platform) was very likely the best day of my life. Fencing is what makes a GFS cluster reliable. > For instance, we support hot removal/insertion of nodes in the system, > I'm not clear how fencing will get in the way. We're not planning > to add > any fencing hardware, and most likely will set fencing mechanism as > manual. Ideally, we'd like to disable fencing except the part that is > needed for running GFS. There are issues. As long as you don't change to a two-node cluster at some point (going from 1 node to 2 nodes or 3 nodes to 2 nodes) you should be able to achieve this. In my personal opinion, I would avoid running GFS on less than 3 nodes anyways (again, 2-node clusters exhibit behavior that is easily avoidable with a third box, even if it doesn't use the GFS). In a controlled manner it is possible to unmount the FS, leave the cluster, then change the cluster composition from a still running node. Adding isn't too much trouble. In either case I suggest a quorum disk (qdisk). As for uncontrolled crashes fencing is absolutely necessary to recover state of the FS. Complete fencing is absolutely necessary for running GFS. Suggesting that you don't need (indeed want) fencing is an indication that you don't understand how GFS will share your data. Relying on manual fencing is a sign that you will likely lose a great deal of data someday. Redhat won't even support that configuration due to liability concerns. Fencing only makes sure that a machine that has lost contact with the cluster does not trash your data. Without fencing, a node that is out of control can (and will) trash your GFS. This will result in the downtime required to shut down the cluster, fsck the filesystem, and then bring it back up. It will also still likely trash some data. Make no mistake, when fencing occurs, the system is already behaving badly. It fixes it, albeit brutally. With fence_manual, when you have any sort of outage whatsoever, one node will be hosed and the entire cluster will halt. At this point you will do one of three things: 1. You may just restart the entire cluster. or 2. You may correctly make sure the dead machine is truly dead. YOU WILL NOT BE ABLE TO DO THIS REMOTELY WITHOUT HARDWARE SUITABLE FOR FENCING. At that point you will call fence_ack_manual (manually) to free up the cluster. or 3. You may, in your haste, run fence_ack_manual to free up the cluster. If at any point the other node is not completely dead, your data may be forfeit. Worse, it may not be visible immediately, only after it is widely corrupted. At that point you will probably get the downed node running without realizing what damage you may have done. In the meantime, everyone mounting your GFS will be hung. A single hardware failure can freeze your cluster. Totally. 
Note that to take the only path that saves your data (#2) you will have to have remote power switches or the like to reset a toasted node in all cases. So you will NOT save yourself any money and yet you WILL create trouble. Also, have you considered fencing at your network switch (for networked storage) or at your storage device itself? It is not always necessary to purchase remote power switches to fence your data. If you are not able to abide fencing, you probably should farm this out to someone who can. Fencing is the way to avoid the bad behavior you have read about. It is not the cause of trouble--it's the solution. GFS absolutely must have it in its entirety or no dice. If you would like a more official, professional explanation as to why this is absolutely, unequivocally necessary, contact me by e-mail. I'll call you. I could fly out. I can even give you a report with a letterhead and everything. However, removing fencing from GFS is not a possibility. It's not even really hard. -- Jayson Vantuyl Systems Architect Engine Yard jvantuyl at engineyard.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From krikler_samuel at diligent.com Thu Jan 11 06:34:48 2007 From: krikler_samuel at diligent.com (Krikler, Samuel) Date: Thu, 11 Jan 2007 08:34:48 +0200 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests -bug in fenced? Message-ID: <453D02254A9EBC45866DBF28FECEA46F0DFA88@ILEX01.corp.diligent.com> Hi, We got the same problem and tried to used qdisk without success. I managed to create the qdisk but didn't manage to get both nodes registered into it. Could someone please point me to the documentation / description of how to properly set up the qdisk for a 2 nodes-cluster? Thanks a lot, Samuel. ________________________________ From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Jayson Vantuyl Sent: Wednesday, January 10, 2007 11:23 PM To: linux clustering Subject: Re: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests -bug in fenced? On Jan 10, 2007, at 1:47 PM, Miroslav Zubcic wrote: How can I prevent this? This looks like a bug. I don't want fenced to fence other node south if it already "knows" that it is the one without link. It is very possible to write a wrapper script for your fencing agent that simply checks the link and refuses to fence when the link is down. However, this wouldn't really be good in any non-network related fencing situation. The recommendation to use a qdiskd would be a good one and you could even use a link detection script as an arbitrary heuristic in this case. -- Jayson Vantuyl Systems Architect Engine Yard jvantuyl at engineyard.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From jvantuyl at engineyard.com Thu Jan 11 06:54:06 2007 From: jvantuyl at engineyard.com (Jayson Vantuyl) Date: Thu, 11 Jan 2007 00:54:06 -0600 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests -bug in fenced? In-Reply-To: <453D02254A9EBC45866DBF28FECEA46F0DFA88@ILEX01.corp.diligent.com> References: <453D02254A9EBC45866DBF28FECEA46F0DFA88@ILEX01.corp.diligent.com> Message-ID: <58679925-D017-49E9-AB3F-2ECBFDF09939@engineyard.com> Samuel, On Jan 11, 2007, at 12:34 AM, Krikler, Samuel wrote: > Could someone please point me to the documentation / description of > how to properly set up the qdisk for a 2 nodes-cluster? Now that is an interesting question. I'm not so sure how the two- node setup handles a quorum node. 
That said, I think the solution is actually to set up the qdisk to have a vote *AND* not configure the system as a two-node cluster. Basically, take off the two-node flag for CMAN, set CMAN's expected_votes to 2, give each node 1 vote and the qdisk 1 vote. That way, two running nodes give you quorum, either node + qdisk gives you quorum, and either node - qdisk is inquorate. Can any of the cluster gods comment on this? I usually have 3 or more nodes. -- Jayson Vantuyl Systems Architect Engine Yard jvantuyl at engineyard.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From shailesh at verismonetworks.com Thu Jan 11 08:45:54 2007 From: shailesh at verismonetworks.com (Shailesh) Date: Thu, 11 Jan 2007 14:15:54 +0530 Subject: [Linux-cluster] Not able to build cluster tools 1.03.00 Message-ID: <1168505154.6593.46.camel@shailesh> Hi, I am attempting to build the tools in cluster-1.03.00.tar.gz on my Redhat workstation RHEL-V4 which is having the kernel 2.6.9-5. I see a lot compile errors in the build. Can anybody suggest me if I am missing something, Do I have to upgrade the kernel,if so which version will it be? Thanks & Regards Shailesh From raj4linux at gmail.com Thu Jan 11 08:52:27 2007 From: raj4linux at gmail.com (rajesh mishra) Date: Thu, 11 Jan 2007 14:22:27 +0530 Subject: [Linux-cluster] Not able to build cluster tools 1.03.00 In-Reply-To: <1168505154.6593.46.camel@shailesh> References: <1168505154.6593.46.camel@shailesh> Message-ID: <5a8d914c0701110052h24c04a0cva699226a35a346c8@mail.gmail.com> First did u take latest source code from the Red Hat repository..? U need to specify what kind of error u r getting while compilation. I strongly feel if u peep into the code u even can make out. There might be minor problems. With Regards Rajesh. On 1/11/07, Shailesh wrote: > Hi, > I am attempting to build the tools in cluster-1.03.00.tar.gz on my > Redhat workstation RHEL-V4 which is having the kernel 2.6.9-5. > I see a lot compile errors in the build. > > Can anybody suggest me if I am missing something, > > Do I have to upgrade the kernel,if so which version will it be? > > Thanks & Regards > Shailesh > > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From pcaulfie at redhat.com Thu Jan 11 09:04:30 2007 From: pcaulfie at redhat.com (Patrick Caulfield) Date: Thu, 11 Jan 2007 09:04:30 +0000 Subject: [Linux-cluster] Not able to build cluster tools 1.03.00 In-Reply-To: <1168505154.6593.46.camel@shailesh> References: <1168505154.6593.46.camel@shailesh> Message-ID: <45A5FD9E.6090706@redhat.com> Shailesh wrote: > Hi, > I am attempting to build the tools in cluster-1.03.00.tar.gz on my > Redhat workstation RHEL-V4 which is having the kernel 2.6.9-5. > I see a lot compile errors in the build. > > Can anybody suggest me if I am missing something, > > Do I have to upgrade the kernel,if so which version will it be? > If you want to use a Red Hat kernel then you should check out the RHEL4 branch from CVS or use the SRPMS. If you want to use cluster-1.03 then you'll need a recent (but not too recent) kernel.org kernel. (yes I know that's very vague, I can't remember which kernel it compiles against now, sorry!) 
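(In case it helps Shailesh, checking out the branch Patrick mentions goes roughly like this. The pserver path, the anonymous password and the build step below are from memory only, so verify them against the project pages before relying on them.)

cvs -d :pserver:cvs@sources.redhat.com:/cvs/cluster login      # anonymous password is usually "cvs"
cvs -d :pserver:cvs@sources.redhat.com:/cvs/cluster checkout -r RHEL4 cluster
# then build against the installed Red Hat kernel headers as described in the
# tree's usage notes, roughly:
#   cd cluster && ./configure --kernel_src=/lib/modules/`uname -r`/build && make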
-- patrick From fedele at fis.unical.it Thu Jan 11 09:08:26 2007 From: fedele at fis.unical.it (Fedele Stabile) Date: Thu, 11 Jan 2007 10:08:26 +0100 Subject: [Linux-cluster] Problems updating cluster.conf Message-ID: <45A5FE8A.5020403@fis.unical.it> Good day, I'm running Cluster Suite on CentOS4 on a 35 PC cluster and a SAN, when i try to update the cluster.conf on my cluster I have this message: [root at linuxlab1 cluster]# ccs_tool update /etc/cluster/cluster.conf Failed to receive COMM_UPDATE_NOTICE_ACK from pc10. Hint: Check the log on pc10 for reason. Failed to update config file. [root at linuxlab1 cluster]# but in pc10 log files I can't see anything, also if I try to run ccsd on pc10 with the command ccsd -n I can't see the error Versions of cman, ccsd amd rgmanager are: ccs-1.0.7-0 cman-1.0.11-0 rgmanager-1.9.53-0 I have 18 PC on a 1Gbit LAN and 17 PC on 100Mbit Fedele STABILE From fedele at fis.unical.it Thu Jan 11 09:38:31 2007 From: fedele at fis.unical.it (Fedele Stabile) Date: Thu, 11 Jan 2007 10:38:31 +0100 Subject: [Linux-cluster] Problems updating cluster.conf SOLVED but I have a question In-Reply-To: <45A5FE8A.5020403@fis.unical.it> References: <45A5FE8A.5020403@fis.unical.it> Message-ID: <45A60597.2070300@fis.unical.it> I solved my problem running ccs_tool update from a node on the slow network, can you help me to explain the reason of this behaviour? Fedele STABILE Fedele Stabile wrote: > Good day, > > I'm running Cluster Suite on CentOS4 on a 35 PC cluster and a SAN, > when i try to update the cluster.conf on my cluster I have this message: > > [root at linuxlab1 cluster]# ccs_tool update /etc/cluster/cluster.conf > Failed to receive COMM_UPDATE_NOTICE_ACK from pc10. > Hint: Check the log on pc10 for reason. > > Failed to update config file. > [root at linuxlab1 cluster]# > > but in pc10 log files I can't see anything, also if I try to run ccsd on > pc10 with the command > ccsd -n > I can't see the error > > Versions of cman, ccsd amd rgmanager are: > > ccs-1.0.7-0 > cman-1.0.11-0 > rgmanager-1.9.53-0 > > I have 18 PC on a 1Gbit LAN and 17 PC on 100Mbit > > Fedele STABILE > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > From kri_thi at yahoo.com Thu Jan 11 11:34:54 2007 From: kri_thi at yahoo.com (krishnamurthi G) Date: Thu, 11 Jan 2007 03:34:54 -0800 (PST) Subject: [Linux-cluster] cluster version identification:how? Message-ID: <20070111113454.28556.qmail@web90413.mail.mud.yahoo.com> Hi Frieds, Is there any ways to find cluster version (if it is !). Problem: The cluster specific commands/paths/command outputs have been changed/changing completely from RHEL 2.1 to 2.4 to 2.6. I am working on a project where I need support for all version/releases, some how I need to find cluster version if available so that I can parse accordingly. Temporary Work around: The cluster config file is unique for different RHEL releases. e.g RHEL 2.1 "/etc/cluster.xml" RHEL 2.4 "/etc/cluster.conf" RHEL 2.6 "/etc/cluster/cluster.conf" Check this config file and identify cluster type. I appreciate if any of you help me for efficient solution? Thanks in advance - Krishna ____________________________________________________________________________________ Want to start your own business? Learn how on Yahoo! Small Business. http://smallbusiness.yahoo.com/r-index -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From lhh at redhat.com Thu Jan 11 14:30:08 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 11 Jan 2007 09:30:08 -0500 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests - bug in fenced? In-Reply-To: <20070110201648.GE24326@korben.rdu.redhat.com> References: <45A542BD.9070107@nimium.hr> <20070110201648.GE24326@korben.rdu.redhat.com> Message-ID: <1168525808.15369.273.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-10 at 15:16 -0500, Josef Whiter wrote: > > Thanks for any advice ... > > > > This isn't a bug, its working as expected. What you need in qdisk, set it up > with the proper hueristics and it will force the shutdown of the bad node before > the bad node has a chance to fence off the working node. What he said. With qdisk, you can have the node declare itself unfit for cluster operation when bond0 or bond1 loses link; something like: You could use more complex link monitoring (like the stuff in /usr/share/cluster/ip.sh) if you wanted, but this gives you the basic idea. The idea here is that if bond0 *or* bond1 loses link, qdiskd declares the node unfit (min_score = 2, and each route is 1 point, so loss of either => fatal). A feature was added after the initial release of qdiskd to reboot the node on loss of required score (previously, it would cause the node to become inquorate and block activity). -- Lon -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From lhh at redhat.com Thu Jan 11 14:47:27 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 11 Jan 2007 09:47:27 -0500 Subject: [Linux-cluster] Remove the clusterness from GFS In-Reply-To: <08A9A3213527A6428774900A80DBD8D8033E77FA@xmb-sjc-222.amer.cisco.com> References: <08A9A3213527A6428774900A80DBD8D8033E77FA@xmb-sjc-222.amer.cisco.com> Message-ID: <1168526847.15369.288.camel@rei.boston.devel.redhat.com> On Wed, 2007-01-10 at 15:00 -0800, Lin Shen (lshen) wrote: > For instance, we support hot removal/insertion of nodes in the system, > I'm not clear how fencing will get in the way. We're not planning to add > any fencing hardware, and most likely will set fencing mechanism as > manual. Ideally, we'd like to disable fencing except the part that is > needed for running GFS. Hmm, well, GFS requires every node mounting a volume directly to have fencing. You can use NFS to export the same GFS volume from multiple servers. The idea here is that with more than one NFS server exporting the same file system, you can achieve very high data parallel data throughput - near the maximum the SAN allows - because the network bandwidth and server bottlenecks are, in theory, eliminated. This solution requires building a GFS cluster, say, 3 or 5 nodes + a SAN. Make one or more GFS volumes on the SAN, and mount on all nodes. Export from all nodes. Adding more clients is simple. Just mount the NFS export. Fencing is needed for the GFS cluster, but not the NFS clients. You could do the same thing with Lustre (sort of). Build a server cluster, and mount over the network. You'd only need fencing hardware for the metadata server (I *think*; never tried it). Adding a client is easy: set up Lustre on the client and mount the file system. There's some "waste" in the sense that to build either of these solutions, you need several machines that act as a "storage farm" for the best possible reliability. 
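(The cluster.conf fragment Lon introduces with "something like:" a little earlier appears to have been stripped by the list archiver. From his description -- two heuristics worth one point each and min_score = 2, so losing either path is fatal -- it would have looked roughly like the following. The gateway addresses, device path and timing values are placeholders, not his actual ones, and the ping commands could just as well be link-state checks along the lines of ip.sh.)

<quorumd interval="1" tko="10" votes="1" min_score="2" device="/dev/sdb1">
    <!-- one point per monitored path; losing either drops the score below min_score -->
    <heuristic program="ping -c1 -w1 192.168.0.1" score="1" interval="2"/>
    <heuristic program="ping -c1 -w1 192.168.1.1" score="1" interval="2"/>
</quorumd>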
-- Lon From lhh at redhat.com Thu Jan 11 14:58:03 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 11 Jan 2007 09:58:03 -0500 Subject: [Linux-cluster] RH Cluster doesn't pass basic acceptance tests -bug in fenced? In-Reply-To: <58679925-D017-49E9-AB3F-2ECBFDF09939@engineyard.com> References: <453D02254A9EBC45866DBF28FECEA46F0DFA88@ILEX01.corp.diligent.com> <58679925-D017-49E9-AB3F-2ECBFDF09939@engineyard.com> Message-ID: <1168527483.15369.299.camel@rei.boston.devel.redhat.com> On Thu, 2007-01-11 at 00:54 -0600, Jayson Vantuyl wrote: > That said, I think the solution is actually to set up the qdisk to > have a vote *AND* not configure the system as a two-node cluster. > Basically, take off the two-node flag for CMAN, set CMAN's > expected_votes to 2, give each node 1 vote and the qdisk 1 vote. That's correct, if you're using a single heuristic to implement a tiebreaker. > That way, two running nodes give you quorum, either node + qdisk gives > you quorum, and either node - qdisk is inquorate. With a multi-point qdisk setup, you want qdisk to be required (generally) - i.e., when monitoring multiple network paths. However, for a 2-node + tiebreaker setup, yours looks right. > Can any of the cluster gods comment on this? I usually have 3 or more > nodes. I hadn't considered the implications of doing 1 vote for 3+ node clusters, but I don't think there are any; it should work, but it wouldn't be particularly useful. The man pages talk about the general setup for making N->1 failure recovery work using qdisk, but it's missing the 2-node+tiebreaker case. I'll have to add that (since it's a *very* interesting use case). -- Lon From lhh at redhat.com Thu Jan 11 15:05:24 2007 From: lhh at redhat.com (Lon Hohberger) Date: Thu, 11 Jan 2007 10:05:24 -0500 Subject: [Linux-cluster] cluster version identification:how? In-Reply-To: <20070111113454.28556.qmail@web90413.mail.mud.yahoo.com> References: <20070111113454.28556.qmail@web90413.mail.mud.yahoo.com> Message-ID: <1168527924.15369.308.camel@rei.boston.devel.redhat.com> On Thu, 2007-01-11 at 03:34 -0800, krishnamurthi G wrote: > Hi Frieds, > > Is there any ways to find cluster version (if it is !). > Problem: The cluster specific commands/paths/command outputs have been > changed/changing completely from RHEL 2.1 to 2.4 to 2.6. > I am working on a project where I need support for all > version/releases, some how I need to find cluster version if available > so that I can parse accordingly. > Temporary Work around: The cluster config file is unique for different > RHEL releases. > e.g RHEL 2.1 "/etc/cluster.xml" > RHEL 2.4 "/etc/cluster.conf" > RHEL 2.6 "/etc/cluster/cluster.conf" > > Check this config file and identify cluster type. RHEL 2.1: /etc/cluster.conf RHEL3: /etc/cluster.xml RHEL4: /etc/cluster/cluster.conf RHEL5: /etc/cluster/cluster.conf Why not just do 'rpm -q redhat-release'? I'm curious; why does the cluster version matter: are you manipulating cluster.[xml|conf] directly? If so, you'll need to do a few extra things. 
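(Putting Lon's path mapping and the rpm check together, a detection snippet along these lines should cover Krishna's case; treat it as a sketch rather than anything official.)

#!/bin/sh
# Guess the cluster generation from the config file location, per Lon's list,
# and print the distribution release as a cross-check.
if [ -f /etc/cluster/cluster.conf ]; then
    echo "RHEL4/RHEL5-style cluster (/etc/cluster/cluster.conf)"
elif [ -f /etc/cluster.xml ]; then
    echo "RHEL3-style cluster (/etc/cluster.xml)"
elif [ -f /etc/cluster.conf ]; then
    echo "RHEL2.1-style cluster (/etc/cluster.conf)"
else
    echo "no cluster configuration found"
fi
rpm -q redhat-release 2>/dev/null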
-- Lon From ramon at vanalteren.nl Thu Jan 11 16:04:34 2007 From: ramon at vanalteren.nl (Ramon van Alteren) Date: Thu, 11 Jan 2007 17:04:34 +0100 Subject: [Linux-cluster] Fencing Problem On APC 7950 In-Reply-To: <1168488605.26513.28.camel@kuli.magnifix.com.my> References: <1168488605.26513.28.camel@kuli.magnifix.com.my> Message-ID: <45A66012.1090006@vanalteren.nl> Hi, Mohd Irwan Jamaluddin wrote: > I'm running Red Hat Cluster Suite on RHEL 4 U3 with APC 7950 > ( http://www.apc.com/resource/include/techspec_index.cfm?base_sku=AP7950 ) > as the fencing device. The version of my fence package is 1.32.18-0. > I try to execute some fence_apc commands but I've got errors. Below are the details: > > [root at orarac01 ~]# fence_apc -a 10.0.6.150 -l apc -p apc -n 02 \ > >> -o Reboot -T -v >> > failed: unrecognised Reboot response > > Same error occured for On/Off option. > > Also, I found a weird problem if I put "2" instead of "02" for the Outlet Number option (-n option). > Below are the error message: > [root at orarac01 ~]# fence_apc -a 10.0.6.150 -l apc -p apc -n 2 \ > >> -o Reboot -T -v >> > failed: unrecognised menu response > > > H > Anyone have ever faced similar problem with me? Here I attached the apclog for reference. > we're running fencing through an apc 7920 which I assume is similar. I've hacked up the fence_apc until it worked, it accepts outlet names and numbers if I remember correctly. I've attached ours to the mail, it's originally from cluster-1.02 but works for me on cluster-1.03 as well. No guarantees implied ;-) WFM, YMMV > Thanks in advanced for your response. > Hope it helps Ramon -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: fence_apc URL: From erickson.jon at gmail.com Thu Jan 11 16:08:31 2007 From: erickson.jon at gmail.com (Jon Erickson) Date: Thu, 11 Jan 2007 11:08:31 -0500 Subject: [Linux-cluster] multipathd/ lvm.static error Message-ID: <6a90e4da0701110808k281cef1ej260de4ba6a06496@mail.gmail.com> All, I'm receiving these two errors on all of the systems I have setup in my GFS cluster. Even though I receive these errors my systems appear to be functioning fine. The first error makes sense because my local SCSI disk is not multipath-ed, however, I do not know how to get rid of it. The second error is confusing to me especially because the system still comes up with all of my multipath-ed devices in working order. multipathd: error calling out /sbin/scsi_id -g -u -s /block/sda lvm.static[6938]: segfault at 0000000000000000 rip 0000000000000000 rsp 0000007fbfff9388 error 14 ------------------------------- uname -r = 2.6.9-42.0.3.ELsmp Packages Installed: system-config-lvm-1.0.16-1.0 lvm2-cluster-2.02.06-7.0.RHEL4 lvm2-2.02.06-6.0.RHEL4 device-mapper-multipath-0.4.5-16.1.RHEL4 GFS-6.1.6-1 GFS-kernel-smp-2.6.9-60.3 GFS-kernheaders-2.6.9-60.3 kernel-smp-2.6.9-42.0.3.EL Thanks, Jon From jamesm at xandros.com Thu Jan 11 16:19:25 2007 From: jamesm at xandros.com (James McOrmond) Date: Thu, 11 Jan 2007 11:19:25 -0500 Subject: [Linux-cluster] Fencing Problem On APC 79x0 In-Reply-To: <45A66012.1090006@vanalteren.nl> References: <1168488605.26513.28.camel@kuli.magnifix.com.my> Message-ID: <45A6638D.8020501@xandros.com> Ramon van Alteren wrote: >>Anyone have ever faced similar problem with me? Here I attached the apclog for reference. >> >> >> >we're running fencing through an apc 7920 which I assume is similar. >I've hacked up the fence_apc until it worked, it accepts outlet names >and numbers if I remember correctly. 
>I've attached ours to the mail, it's originally from cluster-1.02 but >works for me on cluster-1.03 as well. > Do you know if this is a general fence_apc issue? I've got a 7900 on order to do some work with and i'm wondering if I should expect to have to make these modifications? -- James A. McOrmond (jamesm at xandros.com) Hardware QA Lead & Network Administrator Xandros Corporation, Ottawa, Canada. Morpheus: ...after a century of war I remember that which matters most: *We are still HERE!* -------------- next part -------------- An HTML attachment was scrubbed... URL: From jparsons at redhat.com Thu Jan 11 16:50:18 2007 From: jparsons at redhat.com (Jim Parsons) Date: Thu, 11 Jan 2007 11:50:18 -0500 Subject: [Linux-cluster] Fencing Problem On APC 7950 References: <1168488605.26513.28.camel@kuli.magnifix.com.my> <45A66012.1090006@vanalteren.nl> Message-ID: <45A66ACA.6060607@redhat.com> Thanks Ramon and Mohd. Attached is our latest, heavily refactored version of this agent. outlet naming and grouping should all work now - on every 7900 series switch. Please try it. Just rename it fence_apc when you drop it into /sbin and make certain the executable bits are set. -J Ramon van Alteren wrote: >Hi, > >Mohd Irwan Jamaluddin wrote: > >>I'm running Red Hat Cluster Suite on RHEL 4 U3 with APC 7950 >>( http://www.apc.com/resource/include/techspec_index.cfm?base_sku=AP7950 ) >>as the fencing device. The version of my fence package is 1.32.18-0. >>I try to execute some fence_apc commands but I've got errors. Below are the details: >> >>[root at orarac01 ~]# fence_apc -a 10.0.6.150 -l apc -p apc -n 02 \ >> >> >>>-o Reboot -T -v >>> >>> >>failed: unrecognised Reboot response >> >>Same error occured for On/Off option. >> >>Also, I found a weird problem if I put "2" instead of "02" for the Outlet Number option (-n option). >>Below are the error message: >>[root at orarac01 ~]# fence_apc -a 10.0.6.150 -l apc -p apc -n 2 \ >> >> >>>-o Reboot -T -v >>> >>> >>failed: unrecognised menu response >> >> >>H >>Anyone have ever faced similar problem with me? Here I attached the apclog for reference. >> >> >we're running fencing through an apc 7920 which I assume is similar. >I've hacked up the fence_apc until it worked, it accepts outlet names >and numbers if I remember correctly. >I've attached ours to the mail, it's originally from cluster-1.02 but >works for me on cluster-1.03 as well. > >No guarantees implied ;-) WFM, YMMV > >>Thanks in advanced for your response. >> >> >Hope it helps > >Ramon > > >------------------------------------------------------------------------ > >#!/usr/bin/perl > >############################################################################### >############################################################################### >## >## Copyright (C) Sistina Software, Inc. 1997-2003 All rights reserved. >## Copyright (C) 2004 Red Hat, Inc. All rights reserved. >## >## This copyrighted material is made available to anyone wishing to use, >## modify, copy, or redistribute it subject to the terms and conditions >## of the GNU General Public License v.2. >## >############################################################################### >############################################################################### > >use Getopt::Std; >use Net::Telnet (); > ># Get the program name from $0 and strip directory names >$_=$0; >s/.*\///; >my $pname = $_; > ># Change these if the text returned by your equipment is different. 
># Test by running script with options -t -v and checking /tmp/apclog > >my $immediate = 'immediate'; # # Or 'delayed' - action string prefix on menu >my $masterswitch = 'masterswitch plus '; # 'Device Manager' option to choose >my $login_prompt = '/: /'; >my $cmd_prompt = '/> /'; > >my $max_open_tries = 3; # How many telnet attempts to make. Because the > # APC can fail repeated login attempts, this number > # should be more than 1 >my $open_wait = 5; # Seconds to wait between each telnet attempt >my $telnet_timeout = 2; # Seconds to wait for matching telent response >my $debuglog = '/tmp/apclog';# Location of debugging log when in verbose mode >$opt_o = 'reboot'; # Default fence action. > >my $logged_in = 0; > >my $t = new Net::Telnet; > > > ># WARNING!! Do not add code bewteen "#BEGIN_VERSION_GENERATION" and ># "#END_VERSION_GENERATION" It is generated by the Makefile > >#BEGIN_VERSION_GENERATION >$FENCE_RELEASE_NAME="1.02.00"; >$REDHAT_COPYRIGHT=("Copyright (C) Red Hat, Inc. 2004 All rights reserved."); >$BUILD_DATE="(built Wed Jun 28 13:17:53 CEST 2006)"; >#END_VERSION_GENERATION > >sub usage >{ > print "Usage:\n"; > print "\n"; > print "$pname [options]\n"; > print "\n"; > print "Options:\n"; > print " -a IP address or hostname of MasterSwitch\n"; > print " -h usage\n"; > print " -l Login name\n"; > print " -n Outlet number to change: [:] \n"; > print " -o Action: Reboot (default), Off or On\n"; > print " -p Login password\n"; > print " -q quiet mode\n"; > print " -T Test mode (cancels action)\n"; > print " -V version\n"; > print " -v Log to file /tmp/apclog\n"; > > exit 0; >} > >sub fail >{ > ($msg)=@_; > print $msg."\n" unless defined $opt_q; > > if (defined $t) > { > # make sure we don't get stuck in a loop due to errors > $t->errmode('return'); > > logout() if $logged_in; > $t->close > } > exit 1; >} > >sub fail_usage >{ > ($msg)=@_; > print STDERR $msg."\n" if $msg; > print STDERR "Please use '-h' for usage.\n"; > exit 1; >} > >sub version >{ > print "$pname $FENCE_RELEASE_NAME $BUILD_DATE\n"; > print "$SISTINA_COPYRIGHT\n" if ( $SISTINA_COPYRIGHT ); > exit 0; >} > > >sub login >{ > for (my $i=0; $i<$max_open_tries; $i++) > { > $t->open($opt_a); > ($_) = $t->waitfor($login_prompt); > > # Expect 'User Name : ' > if (! /name/i) { > $t->close; > sleep($open_wait); > next; > } > > $t->print($opt_l); > ($_) = $t->waitfor($login_prompt); > > # Expect 'Password : ' > if (! /password/i ) { > $t->close; > sleep($open_wait); > next; > } > > # Send password > $t->print($opt_p); > > (my $dummy, $_) = $t->waitfor('/(>|(?i:user name|password)\s*:) /'); > if (/> /) > { > $logged_in = 1; > > # send newline to flush prompt > $t->print(""); > > return; > } > else > { > fail "invalid username or password"; > } > } > fail "failed: telnet failed: ". $t->errmsg."\n" >} > ># print_escape_char() -- utility subroutine for sending the 'Esc' character >sub print_escape_char >{ > # The APC menu uses "" to go 'up' menues. We must set > # the output_record_separator to "" so that "\n" is not printed > # after the "" character > > $ors=$t->output_record_separator; > $t->output_record_separator(""); > $t->print("\x1b"); # send escape > $t->output_record_separator("$ors"); >} > > ># Determine if the switch is a working state. Also check to make sure that ># the switch has been specified in the case that there are slave switches ># present. This assumes that we are at the main menu. 
>sub identify_switch >{ > > ($_) = $t->waitfor($cmd_prompt); > print_escape_char(); > > # determine what type of switch we are dealling with > ($_) = $t->waitfor($cmd_prompt); > if ( /Switched Rack PDU: Communication Established/i) > { > # No further test needed > } > elsif ( /MS plus 1 : Serial Communication Established/i ) > { > if ( defined $switchnum ) > { > $masterswitch = $masterswitch . $switchnum; > } > elsif ( /MS plus [^1] : Serial Communication Established/i ) > { > fail "multiple switches detected. 'switch' must be defined."; > } > else > { > $switchnum = 1; > } > } > else > { > fail "APC is in undetermined state" > } > > # send a newline to cause APC to reprint the menu > $t->print(""); >} > > ># Navigate through menus to the appropriate outlet control menu of the apc ># MasterSwitch and 79xx series switches. Uses multi-line (mostly) ># case-insensitive matches to recognise menus and works out what option number ># to select from each menu. >sub navigate >{ > # Limit the ammount of menu depths to 20. We should never be this deep > for(my $i=20; $i ; $i--) > { > # Get the new text from the menu > ($_) = $t->waitfor($cmd_prompt); > # Identify next option > if ( > # "Control Console", "1- Device Manager" > /--\s*control console.*(\d+)\s*-\s*device manager/is || > > # "Device Manager", "2- Outlet Control" > /--\s*device manager.*(\d+)\s*-\s*outlet control/is || > > # > # APC MasterSwitch Menus > # > # "Device Manager", "1- MasterSwitch plus 1" > /--\s*device manager.*(\d+)\s*-\s*$masterswitch/is || > > # "Device Manager", "1- Cluster Node 0 ON" > /--\s*(?:device manager|$masterswitch).*(\d+)\s*-\s+Outlet\s+$switchnum:$opt_n\D[^\n]*\s(?-i:ON|OFF)\*?\s/ism || > > # "MasterSwitch plus 1", "1- Outlet 1:1 Outlet #1 ON" > /--\s*$masterswitch.*(\d+)\s*-\s*Outlet\s+$switchnum:$opt_n\s[^\n]*\s(?-i:ON|OFF)\*?\s/ism || > > # Administrator outlet control menu > /--\s*Outlet $switchnum:$opt_n\D.*(\d+)\s*-\s*outlet control\s*$switchnum:?$opt_n\D/ism || > > > # > # APC 79XX Menus > # > # "3- Outlet Control/Configuration" > /--\s*device manager.*(\d+)\s*-\s*Outlet Control/is || > > # "Device Manager", "1- Cluster Node 0 ON" > /--\s*Outlet Control.*(\d+)\s*-\s+Outlet\s+$opt_n\D[^\n]*\s(?-i:ON|OFF)\*?\s/ism || > > # "Device Manager", "1- ON" > /--\s*Outlet Control.*(\d+)\s*-\s+$opt_n\D[^\n]*\s(?-i:ON|OFF)\*?\s/ism || > > # Administrator Outlet Control menu > /--\s*Outlet $opt_n\D.*(\d+)\s*-\s*control\s*outlet\s+$opt_n\D/ism || > /--\s*Outlet $opt_n\D.*(\d+)\s*-\s*control\s*outlet/ism > > ) { > $t->print($1); > next; > } > > # "Outlet Control X:N", "4- Immediate Reboot" > if ( > /(\d+)\s*-\s*immediate $opt_o/is || > /--\s*$opt_n.*(\d+)\s*-\s*immediate\s*$opt_o/is || > /--\s*Control Outlet\D.*(\d+)\s*-\s*Immediate\s*$opt_o/is > ) { > $t->print($1); > last; > } > > fail "failed: unrecognised menu response\n"; > } >} > > >sub logout >{ > # send a newline to make sure that we refresh the menus > # ($t->waitfor() can hang otherwise) > $t->print(""); > > # Limit the ammount of menu depths to 20. We should never be this deep > for(my $i=20; $i ; $i--) > { > > # Get the new text from the menu > ($_) = $t->waitfor($cmd_prompt); > > if ( > # "Control Console", "4- Logout" > /--\s*control console.*(\d+)\s*-\s*Logout/is > ) { > $t->print($1); > last; > } > else > { > print_escape_char(); > next; > } > } >} > > >sub action >{ > # "Enter 'YES' to continue or to cancel : " > ($_) = $t->waitfor('/: /'); > if (! 
/.*immediate $opt_o.*YES.*to continue/si ) { > fail "failed: unrecognised $opt_o response\n"; > } > > # Test mode? > $t->print($opt_T?'NO':'YES'); > > # "Success", "Press to continue..." > ($_) = $t->waitfor('/continue/'); > $t->print(''); > > if (defined $opt_T) { > logout(); > print "success: test outlet $opt_n $opt_o\n" unless defined $opt_q; > $t->close; > > # Allow the APC some time to clean connection > # before next login. > sleep 1; > > exit 0; > } elsif ( /Success/i ) { > logout(); > print "success: outlet $opt_n $opt_o\n" unless defined $opt_q; > $t->close; > > # Allow the APC some time to clean connection > # before next login. > sleep 1; > > exit 0; > } > > fail "failed: unrecognised action response\n"; >} > > >sub get_options_stdin >{ > my $opt; > my $line = 0; > while( defined($in = <>) ) > { > $_ = $in; > chomp; > > # strip leading and trailing whitespace > s/^\s*//; > s/\s*$//; > > # skip comments > next if /^#/; > > $line+=1; > $opt=$_; > next unless $opt; > > ($name,$val)=split /\s*=\s*/, $opt; > > if ( $name eq "" ) > { > print STDERR "parse error: illegal name in option $line\n"; > exit 2; > } > # DO NOTHING -- this field is used by fenced > elsif ($name eq "agent" ) > { > } > elsif ($name eq "ipaddr" ) > { > $opt_a = $val; > } > elsif ($name eq "login" ) > { > $opt_l = $val; > } > elsif ($name eq "option" ) > { > $opt_o = $val; > } > elsif ($name eq "passwd" ) > { > $opt_p = $val; > } > elsif ($name eq "port" ) > { > $opt_n = $val; > } > elsif ($name eq "switch" ) > { > $switchnum = $val; > } > elsif ($name eq "test" ) > { > $opt_T = $val; > } > elsif ($name eq "verbose" ) > { > $opt_v = $val; > } > # Excess name/vals will fail > else > { > fail "parse error: unknown option \"$opt\""; > } > } >} > > >sub telnet_error >{ > fail "failed: telnet returned: ".$t->errmsg."\n"; >} > > >### MAIN ####################################################### > >if (@ARGV > 0) { > getopts("a:hl:n:o:p:qTvV") || fail_usage ; > > usage if defined $opt_h; > version if defined $opt_V; > > fail_usage "Unkown parameter." if (@ARGV > 0); > > fail_usage "No '-a' flag specified." unless defined $opt_a; > fail_usage "No '-n' flag specified." unless defined $opt_n; > fail_usage "No '-l' flag specified." unless defined $opt_l; > fail_usage "No '-p' flag specified." unless defined $opt_p; > fail_usage "Unrecognised action '$opt_o' for '-o' flag" > unless $opt_o =~ /^(Off|On|Reboot)$/i; > > if ( $opt_n =~ /(\d+):(\d+)/ ) { > $switchnum=($1); > $opt_n = ($2); > } >} else { > get_options_stdin(); > > fail "failed: no IP address" unless defined $opt_a; > fail "failed: no plug number" unless defined $opt_n; > fail "failed: no login name" unless defined $opt_l; > fail "failed: no password" unless defined $opt_p; > fail "failed: unrecognised action: $opt_o" > unless $opt_o =~ /^(Off|On|Reboot)$/i; >} > >$t->timeout($telnet_timeout); >$t->input_log($debuglog) if $opt_v; >$t->errmode('return'); > >&login; > >&identify_switch; > ># Abort on failure beyond here >$t->errmode(\&telnet_error); > >&navigate; >&action; > >exit 0; > > > >------------------------------------------------------------------------ > >-- >Linux-cluster mailing list >Linux-cluster at redhat.com >https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... 
Name: fence_apc_master.py_done URL: From jparsons at redhat.com Thu Jan 11 16:54:34 2007 From: jparsons at redhat.com (Jim Parsons) Date: Thu, 11 Jan 2007 11:54:34 -0500 Subject: [Linux-cluster] Fencing Problem On APC 79x0 References: <1168488605.26513.28.camel@kuli.magnifix.com.my> <45A6638D.8020501@xandros.com> Message-ID: <45A66BCA.9050408@redhat.com> James McOrmond wrote: > > > Ramon van Alteren wrote: > >>>Anyone have ever faced similar problem with me? Here I attached the apclog for reference. >>> >>> >>> >>we're running fencing through an apc 7920 which I assume is similar. >>I've hacked up the fence_apc until it worked, it accepts outlet names >>and numbers if I remember correctly. >>I've attached ours to the mail, it's originally from cluster-1.02 but >>works for me on cluster-1.03 as well. >> > Do you know if this is a general fence_apc issue? I've got a 7900 on > order to do some work with and i'm wondering if I should expect to > have to make these modifications? I maintain fence agents, and apc is one of out most stable agents. We just refactored it to allow for port aliasing and grouping, as well as support for master switch plus series. I have several apc switches of different flavors that I use daily in development clusters in our lab. I posted this latest version of the agent to this list about 20 minutes ago. Please try it out, if you can. -J From ramon at vanalteren.nl Thu Jan 11 16:40:25 2007 From: ramon at vanalteren.nl (Ramon van Alteren) Date: Thu, 11 Jan 2007 17:40:25 +0100 Subject: [Linux-cluster] Fencing Problem On APC 79x0 In-Reply-To: <45A6638D.8020501@xandros.com> References: <1168488605.26513.28.camel@kuli.magnifix.com.my> <45A6638D.8020501@xandros.com> Message-ID: <45A66879.9020305@vanalteren.nl> James McOrmond wrote: > > > Ramon van Alteren wrote: >>> Anyone have ever faced similar problem with me? Here I attached the apclog for reference. >>> >>> >> we're running fencing through an apc 7920 which I assume is similar. >> I've hacked up the fence_apc until it worked, it accepts outlet names >> and numbers if I remember correctly. >> I've attached ours to the mail, it's originally from cluster-1.02 but >> works for me on cluster-1.03 as well. > Do you know if this is a general fence_apc issue? I've got a 7900 on > order to do some work with and i'm wondering if I should expect to > have to make these modifications? Nope sorry, just own 7920's Regards, Ramon From erickson.jon at gmail.com Thu Jan 11 16:41:04 2007 From: erickson.jon at gmail.com (Jon Erickson) Date: Thu, 11 Jan 2007 11:41:04 -0500 Subject: [Linux-cluster] Cluster Project FAQ - GFS tuning section Message-ID: <6a90e4da0701110841x27694fa3l8e5df13550ea0792@mail.gmail.com> I have a couple of question regarding the Cluster Project FAQ ? GFS tuning section (http://sources.redhat.com/cluster/faq.html#gfs_tuning). First: - Use ?r 2048 on gfs_mkfs and mkfs.gfs2 for large file systems. I noticed that when I used the ?r 2048 switch while creating my file system it ended up creating the file system with the 256MB resource group size. When I omitted the ?r flag the file system was created with 2048Mb resource group size. Is there a problem with the ?r flag, and does gfs_mkfs dynamically come up with the best resource group size based on your file system size? Another thing I did which ended up in a problem was executing the gfs_mkfs command while my current GFS file system was mounted. The command completed successfully but when I went into the mount point all the old files and directories still showed up. 
When I attempted to remove files bad things happened?I believe I received invalid metadata blocks error and the cluster went into an infinite loop trying to restart the service. I ended up fixing this problem by un-mounting my file system re-creating the GFS file system and re-mounting. This problem was caused by my user error, but maybe there should be some sort of check that determines whether the file system is currently mounted. Second: - Break file systems up when huge numbers of file are involved. This FAQ states that there is an amount of overhead when dealing with lots (millions) of files. What is a recommended limit of files in a file system? The theoretical limit of 8 exabytes for a file system does not seem at all realistic if you can't have (millions) of files in a file system. I just curious to see what everyone thinks about this. Thanks -- Jon From rpeterso at redhat.com Thu Jan 11 17:56:09 2007 From: rpeterso at redhat.com (Robert Peterson) Date: Thu, 11 Jan 2007 11:56:09 -0600 Subject: [Linux-cluster] Cluster Project FAQ - GFS tuning section In-Reply-To: <6a90e4da0701110841x27694fa3l8e5df13550ea0792@mail.gmail.com> References: <6a90e4da0701110841x27694fa3l8e5df13550ea0792@mail.gmail.com> Message-ID: <45A67A39.3090802@redhat.com> Jon Erickson wrote: > I have a couple of question regarding the Cluster Project FAQ ? GFS > tuning section (http://sources.redhat.com/cluster/faq.html#gfs_tuning). > > First: > - Use ?r 2048 on gfs_mkfs and mkfs.gfs2 for large file systems. > I noticed that when I used the ?r 2048 switch while creating my file > system it ended up creating the file system with the 256MB resource > group size. When I omitted the ?r flag the file system was created > with 2048Mb resource group size. Is there a problem with the ?r flag, > and does gfs_mkfs dynamically come up with the best resource group > size based on your file system size? Another thing I did which ended > up in a problem was executing the gfs_mkfs command while my current > GFS file system was mounted. The command completed successfully but > when I went into the mount point all the old files and directories > still showed up. When I attempted to remove files bad things > happened?I believe I received invalid metadata blocks error and the > cluster went into an infinite loop trying to restart the service. I > ended up fixing this problem by un-mounting my file system re-creating > the GFS file system and re-mounting. This problem was caused by my > user error, but maybe there should be some sort of check that > determines whether the file system is currently mounted. > > Second: > - Break file systems up when huge numbers of file are involved. > This FAQ states that there is an amount of overhead when dealing with > lots (millions) of files. What is a recommended limit of files in a > file system? The theoretical limit of 8 exabytes for a file system > does not seem at all realistic if you can't have (millions) of files > in a file system. > > I just curious to see what everyone thinks about this. Thanks > > Hi Jon, The newer gfs_mkfs (gfs1) and mkfs.gfs2 (gfs2) in the CVS HEAD will create the RG size based on the size of the file system if "-r" is not specified, so that would explain why it used 2048 in the case where you didn't specify -r. The previous versions just always assumed 256MB unless -r was specified. If you specified -r 2048 and it used 256 for its rg size, that would be a bug. What version of the gfs_mkfs code were you running to get this? 
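(For reference, this is how the flag in question is normally passed; the cluster name, journal count and device below are made up for the example.)

# -r sets the resource group size in megabytes, -j the number of journals;
# "mycluster:scratch" and the device path are illustrative only.
gfs_mkfs -p lock_dlm -t mycluster:scratch -j 5 -r 2048 /dev/mapper/vg0-scratch

# and the version string Bob asks about:
gfs_mkfs -V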
I agree that it would be very nice if all the userspace GFS-related tools could make sure the file system is not mounted anywhere first before running. We even have a bugzilla from long ago about this regarding gfs_fsck: https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=156012 It's easy enough to check if the local node (the one running mkfs or fsck) has it mounted, but it's harder to figure out whether other nodes do because the userland tools can't assume access to the cluster infrastructure like the kernel code can. So I guess we haven't thought of an elegant solution to this yet; we almost need to query every node and check its cman_tool services output to see if it is using resources pertaining to the file system, but that would require some kind of socket or connection, (e.g. ssh) but what should it do when it can't contact a node that's powered off, etc? Regarding the number of files in a GFS file system: I don't have any kind of recommendations because I haven't studied the exact performance impact based on the number of inodes. It would be cool if someone could do some tests and see where the performance starts to degrade. The cluster team at Red Hat can work toward improving the performance of GFS (in fact, we are; hence the change to gfs_mkfs for the rg size), but many of the performance issues are already addressed with GFS2, and since GFS2 was accepted by the upstream linux kernel, in a way I think it makes more sense to focus more of our efforts there. One thing I thought about doing was trying to use btrees instead of linked lists for some of our more critical resources, like the RGs and the glocks. We'd have to figure out the impact of doing that; the overhead to do that might also impact performance. Just my $0.02. Regards, Bob Peterson Red Hat Cluster Suite From erickson.jon at gmail.com Thu Jan 11 18:48:38 2007 From: erickson.jon at gmail.com (Jon Erickson) Date: Thu, 11 Jan 2007 13:48:38 -0500 Subject: [Linux-cluster] Cluster Project FAQ - GFS tuning section In-Reply-To: <45A67A39.3090802@redhat.com> References: <6a90e4da0701110841x27694fa3l8e5df13550ea0792@mail.gmail.com> <45A67A39.3090802@redhat.com> Message-ID: <6a90e4da0701111048y82806b6qfbd472bd39e5debf@mail.gmail.com> Robert, > What version of the gfs_mkfs code were you running to get this? gfs_mkfs -V produced the following results: gfs_mkfs 6.1.6 (built May 9 2006 17:48:45) Copyright (C) Red Hat, Inc. 2004-2005 All rights reserved Thanks, Jon On 1/11/07, Robert Peterson wrote: > Jon Erickson wrote: > > I have a couple of question regarding the Cluster Project FAQ ? GFS > > tuning section (http://sources.redhat.com/cluster/faq.html#gfs_tuning). > > > > First: > > - Use ?r 2048 on gfs_mkfs and mkfs.gfs2 for large file systems. > > I noticed that when I used the ?r 2048 switch while creating my file > > system it ended up creating the file system with the 256MB resource > > group size. When I omitted the ?r flag the file system was created > > with 2048Mb resource group size. Is there a problem with the ?r flag, > > and does gfs_mkfs dynamically come up with the best resource group > > size based on your file system size? Another thing I did which ended > > up in a problem was executing the gfs_mkfs command while my current > > GFS file system was mounted. The command completed successfully but > > when I went into the mount point all the old files and directories > > still showed up. 
When I attempted to remove files bad things > > happened?I believe I received invalid metadata blocks error and the > > cluster went into an infinite loop trying to restart the service. I > > ended up fixing this problem by un-mounting my file system re-creating > > the GFS file system and re-mounting. This problem was caused by my > > user error, but maybe there should be some sort of check that > > determines whether the file system is currently mounted. > > > > Second: > > - Break file systems up when huge numbers of file are involved. > > This FAQ states that there is an amount of overhead when dealing with > > lots (millions) of files. What is a recommended limit of files in a > > file system? The theoretical limit of 8 exabytes for a file system > > does not seem at all realistic if you can't have (millions) of files > > in a file system. > > > > I just curious to see what everyone thinks about this. Thanks > > > > > Hi Jon, > > The newer gfs_mkfs (gfs1) and mkfs.gfs2 (gfs2) in the CVS HEAD will > create the RG size based on the size of the file system if "-r" is not > specified, > so that would explain why it used 2048 in the case where you didn't > specify -r. > The previous versions just always assumed 256MB unless -r was specified. > > If you specified -r 2048 and it used 256 for its rg size, that would be > a bug. > What version of the gfs_mkfs code were you running to get this? > > I agree that it would be very nice if all the userspace GFS-related > tools could > make sure the file system is not mounted anywhere first before running. > We even have a bugzilla from long ago about this regarding gfs_fsck: > > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=156012 > > It's easy enough to check if the local node (the one running mkfs or fsck) > has it mounted, but it's harder to figure out whether other nodes do because > the userland tools can't assume access to the cluster infrastructure > like the > kernel code can. So I guess we haven't thought of an elegant solution to > this yet; we almost need to query every node and check its cman_tool > services output to see if it is using resources pertaining to the file > system, > but that would require some kind of socket or connection, > (e.g. ssh) but what should it do when it can't contact a node that's powered > off, etc? > > Regarding the number of files in a GFS file system: I don't have any kind > of recommendations because I haven't studied the exact performance impact > based on the number of inodes. It would be cool if someone could do some > tests and see where the performance starts to degrade. > > The cluster team at Red Hat can work toward improving the performance > of GFS (in fact, we are; hence the change to gfs_mkfs for the rg size), > but many of the performance issues are already addressed with GFS2, > and since GFS2 was accepted by the upstream linux kernel, in a way > I think it makes more sense to focus more of our efforts there. > > One thing I thought about doing was trying to use btrees instead of > linked lists for some of our more critical resources, like the RGs and > the glocks. We'd have to figure out the impact of doing that; the overhead > to do that might also impact performance. Just my $0.02. > > Regards, > > Bob Peterson > Red Hat Cluster Suite > > -- > Linux-cluster mailing list > Linux-cluster at redhat.com > https://www.redhat.com/mailman/listinfo/linux-cluster > -- Jon From cjk at techma.com Thu Jan 11 18:53:34 2007 From: cjk at techma.com (Kovacs, Corey J.) 
Date: Thu, 11 Jan 2007 13:53:34 -0500 Subject: [Linux-cluster] GFS+EXT3 via NFS? Message-ID: <7DCE72B3C36E2A45B7580F887EE4948C18FD84@tmaemail.techma.com> We have a 5 node cluster that is exporting several GFS and EXT3 filesystems distributed among 5 individual services each with it's own failover group etc. For the most part, things work fine. However, sometimes when we move these services around, the node that recieves the service doesn't re-export the filesystems and clients get stale handles until we manually execute "exportfs -ra" to clear this up. Right now each NFS service exports both GFS and EXT3 filesystems concurrently. There is some thought about seperating the filesystems so that a service only exports GFS OR EXT3, buit not both. We'd like some input though to see if this might really be the problem, or maybe something along these lines etc. My "gut" feeling is that since a service is exporting a GFS filesystem, there may be a built in assumption that the filesystem is exported via /etc/exports and that the only thing transitioning is the IP address as per the unofficial NFS cookbook. The cluster is RHEL4.4.2 and the associated cluster/gfs packages. We have a patched kernel based on the patches for the bnx2 driver which are the only changes to the kernel which also happens to keep it from crashing :) Our hardware is HP DL360-G5 machines connected to an EVA8000 SAN via qlogic FC cards. Any input is appreciated. Thanks Corey Kovacs Senior Systems Engineer Technology Management Associates 703.279.6168 (B) 855-6168 (R) -------------- next part -------------- An HTML attachment was scrubbed... URL: From isplist at logicore.net Thu Jan 11 20:21:04 2007 From: isplist at logicore.net (isplist at logicore.net) Date: Thu, 11 Jan 2007 14:21:04 -0600 Subject: [Linux-cluster] Joomla MySQL cluster? Message-ID: <200711114214.669265@leena> Anyone working with Joomla and a GFS based MySQL cluster? Mike From lgodoy at atichile.com Thu Jan 11 20:38:02 2007 From: lgodoy at atichile.com (Luis Godoy Gonzalez) Date: Thu, 11 Jan 2007 17:38:02 -0300 Subject: [Linux-cluster] Unexpected service restart In-Reply-To: <83A2D7D7-125F-4586-A4AE-A0BB37F78ADD@souja.net> References: <45A50282.7060205@fis.unical.it> <83A2D7D7-125F-4586-A4AE-A0BB37F78ADD@souja.net> Message-ID: <45A6A02A.10709@atichile.com> Hi I have a problem with a service (oracle ) , this service is restarted by clurgmgrd without a error message. The "message" log show: ============================================================= Jan 8 04:50:50 eir-db1 clurgmgrd: [2922]: Executing /opt/oracle/OraHome10g/bin/oracle_mgr.sh status ... Jan 8 04:51:20 eir-db1 clurgmgrd: [2922]: Executing /opt/oracle/OraHome10g/bin/oracle_mgr.sh status ... Jan 8 04:51:40 eir-db1 clurgmgrd[2922]: Stopping service Oracle Jan 8 04:51:40 eir-db1 clurgmgrd: [2922]: Removing IPv4 address xxx.xxx.xxx.xxx from bond1 ... Jan 8 04:51:50 eir-db1 clurgmgrd: [2922]: Executing /opt/oracle/OraHome10g/bin/oracle_mgr.sh stop ... Jan 8 04:52:27 eir-db1 clurgmgrd: [2922]: Removing IPv4 address yyy.yyy.yyy.yyy from bond0 Jan 8 04:52:37 eir-db1 clurgmgrd: [2922]: unmounting /data Jan 8 04:52:38 eir-db1 clurgmgrd[2922]: Service Oracle is recovering Jan 8 04:52:38 eir-db1 clurgmgrd[2922]: Recovering failed service Oracle Jan 8 04:52:38 eir-db1 clurgmgrd: [2922]: mounting /dev/cciss/c1d0p1 on /data Jan 8 04:52:38 eir-db1 kernel: kjournald starting. 
Commit interval 5 seconds Jan 8 04:52:38 eir-db1 kernel: EXT3-fs warning: maximal mount count reached, running e2fsck is recommended Jan 8 04:52:38 eir-db1 kernel: EXT3 FS on cciss/c1d0p1, internal journal Jan 8 04:52:38 eir-db1 kernel: EXT3-fs: mounted filesystem with ordered data mode. Jan 8 04:52:38 eir-db1 clurgmgrd: [2922]: Adding IPv4 address xxx.xxx.xxx.xxx to bond0 Jan 8 04:52:39 eir-db1 clurgmgrd: [2922]: Executing /opt/oracle/OraHome10g/bin/oracle_mgr.sh start ... Jan 8 04:52:47 eir-db1 su(pam_unix)[27069]: session closed for user oracle Jan 8 04:52:47 eir-db1 clurgmgrd: [2922]: Adding IPv4 address yyy.yyy.yyy.yyy to bond1 Jan 8 04:52:48 eir-db1 clurgmgrd[2922]: Service Oracle started Jan 8 04:52:50 eir-db1 clurgmgrd: [2922]: Executing /opt/oracle/OraHome10g/bin/oracle_mgr.sh status ... Jan 8 04:53:20 eir-db1 clurgmgrd: [2922]: Executing /opt/oracle/OraHome10g/bin/oracle_mgr.sh status ============================================================================ the "/opt/oracle/OraHome10g/bin/oracle_mgr.sh status" NOT fail ( this is a basic test), so I don't understand the reason for the restart of services. The service configuration is: ==============================================================================================