From rodgersr at yahoo.com  Sun Oct  1 02:52:03 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Sat, 30 Sep 2006 19:52:03 -0700 (PDT)
Subject: [Linux-cluster] ip-tiebreaker quotes need explainging
Message-ID: <20061001025203.76527.qmail@web34209.mail.mud.yahoo.com>

Can anyone explain why this is so. Why is it only used on maintaining
qourum and not startup?


"The IP tiebreaker is typically used to *maintain* a quorum after a node
failure, because there are certain network faults in which two nodes may
see the tiebreaker - but not each other.
-- Lon"


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060930/e1e68714/attachment.htm>

From andre at hudat.com  Sun Oct  1 19:13:44 2006
From: andre at hudat.com (andre at hudat.com)
Date: Sun, 1 Oct 2006 15:13:44 -0400
Subject: [Linux-cluster] Panic
Message-ID: <003401c6e58d$b71dc9d0$0245450a@AndreLaptop>

I have the following panic on two nodes hours apart. Each node Is in a
different state ( as in states of the US ). NO I am not running a cluster
over a WAN, just two separate clusters in two different locations. Files are
written on one cluster and I have a script that does an SCP of the file to
the other cluster. Both machines running the latest RHEL4 with the latest
GFS updates. This just started happening. Happened twice since Friday
morning. Any hints ? What is happening with clvmd here ? What does the
global conflict message mean ?

--
Andre

Oct  1 13:26:47 fs1.fl.apexrad.com kernel: purged 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd mark waiting requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd marked 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd recover event 5 done
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd move flags 0,0,1 ids 2,5,5
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd process held requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd processed 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd resend marked requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd resent 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: clvmd recover event 5 finished
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 total nodes 1
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 rebuild resource directory
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 rebuilt 0 resources
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 recover event 4 done
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 move flags 0,0,1 ids 0,4,4
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 process held requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 processed 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 recover event 4 finished
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 move flags 1,0,0 ids 4,4,4
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 move flags 0,1,0 ids 4,7,4
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 move use event 7
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 recover event 7
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 add node 2
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 total nodes 2
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 rebuild resource directory
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 rebuilt 6 resources
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 purge requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 purged 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 mark waiting requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 marked 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 recover event 7 done
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 move flags 0,0,1 ids 4,7,7
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 process held requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 processed 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 resend marked requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 resent 0 requests
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01 recover event 7 finished
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 245-253 ex
1 own 4158637196, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 254-26c ex
1 own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 26d-27b ex
1 own 4158637196, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 27c-28b ex
1 own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 28c-29b ex
1 own 4158637196, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 29c-2ac ex
1 own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 2ad-2b9 ex
1 own 4158637196, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 2ba-2c7 ex
1 own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-ff ex 0
own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 c8-2c7 ex
0 own 4158638348, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 200-2c7 ex
0 own 4158638348, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1 ex 0
own 4101191756, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158638828, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-2c7 ex 0
own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-fff ex 0
own 4158638348, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 2c8-fff ex
0 own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158638348, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-3f ex 0
own 4158636236, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158638348, pid 444u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-fff ex 1
own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
70000-7ffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
80000-8ffff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
90000-9ffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
a0000-affff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
b0000-bffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
c0000-cffff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
d0000-dffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
e0000-effff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
f0000-fffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
100000-10ffff ex 1 own 4101191756, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
110000-11ffff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
120000-12ffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
130000-13ffff ex 1 own 4158638828, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
140000-14ffff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
150000-15ffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
160000-16ffff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
170000-17ffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
180000-18ffff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
190000-19ffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1a0000-1affff ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1b0000-1bffff ex 1 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c0000-1c2aa7 ex 1 own 4158636236, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-ff ex 0
own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 200-2ff ex
0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c28a8-1c2aa7 ex 0 own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c26da-1c27d9 ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 44b-54a ex
0 own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c0ba5-1c0ca4 ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c0780-1c087f ex 0 own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c12a8-1c13a7 ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c277d-1c287c ex 0 own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c276a-1c2869 ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c10eb-1c11ea ex 0 own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 1c04-1d03
ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 fe00-ffff
ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1 ex 0
own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-fff ex 0
own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0
1c1aa8-1c2aa7 ex 0 own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-fff ex 0
own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-3f ex 0
own 4158637196, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: 4424 global conflict 0 0-1ff ex 0
own 4158638348, pid 296u
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lock_dlm:  Assertion failed on
line 428 of file /usr/src/build/765787-i686/BUIL
D/gfs-kernel-2.6.9-58/smp/src/dlm/lock.c
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lock_dlm:  assertion:  "!error"
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lock_dlm:  time = 185852977
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: lvol01: num=2,684f0dd err=-22
cur=3 req=5 lkf=44
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: ------------[ cut here
]------------
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: kernel BUG at
/usr/src/build/765787-i686/BUILD/gfs-kernel-2.6.9-58/smp/src/dlm/
lock.c:428!
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: invalid operand: 0000 [#1]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: SMP
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: Modules linked in: nfs nfsd
exportfs lockd nfs_acl autofs4 i2c_dev i2c_core loc
k_dlm(U) gfs(U) lock_harness(U) dlm(U) cman(U) md5 ipv6 sunrpc dm_mirror
button battery ac uhci_hcd ehci_hcd hw_random e10
00 floppy sg ext3 jbd dm_mod megaraid_mbox megaraid_mm sd_mod scsi_mod
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: CPU:    0
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: EIP:    0060:[<f8df7779>]    Not
tainted VLI
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: EFLAGS: 00010246
(2.6.9-42.ELsmp)
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: EIP is at do_dlm_lock+0x134/0x14e
[lock_dlm]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: eax: 00000001   ebx: ffffffea
ecx: d18c5dc0   edx: f8dfc221
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: esi: f8df7798   edi: c387e600
ebp: e194f780   esp: d18c5dbc
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: ds: 007b   es: 007b   ss: 0068
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: Process rmdir (pid: 23174,
threadinfo=d18c5000 task=f64f0b30)
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: Stack: f8dfc221 20202020 32202020
20202020 20202020 34383620 64643066 32200018
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:        20202020 e194f780 00000001
00000003 e194f780 f8df7828 00000005 f8dff940
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:        f8919000 f8eba936 00000000
00000001 d16c1dd4 d16c1db8 f8919000 f8eb08fe
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: Call Trace:
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8df7828>]
lm_dlm_lock+0x49/0x52 [lock_dlm]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eba936>]
gfs_lm_lock+0x35/0x4d [gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eb08fe>]
gfs_glock_xmote_th+0x130/0x172 [gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eaffbd>]
rq_promote+0xc8/0x147 [gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eb01a9>] run_queue+0x91/0xc1
[gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eb11b9>]
gfs_glock_nq+0xcf/0x116 [gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eb18f5>] nq_m_sync+0x44/0x64
[gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8eb1a5e>]
gfs_glock_nq_m+0x149/0x15d [gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<f8ec87d9>]
gfs_rmdir+0x6a/0x168 [gfs]
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<c0168a55>]
vfs_rmdir+0x1a3/0x1f1
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<c0168b44>] sys_rmdir+0xa1/0xf4
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<c011ae55>]
do_page_fault+0x0/0x5c6
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  [<c02d4703>]
syscall_call+0x7/0xb
Oct  1 13:26:47 fs1.fl.apexrad.com kernel: Code: 26 50 0f bf 45 24 50 53 ff
75 08 ff 75 04 ff 75 0c ff 77 18 68 4c c3 df f
8 e8 32 b1 32 c7 83 c4 38 68 21 c2 df f8 e8 25 b1 32 c7 <0f> 0b ac 01 5e c1
df f8 68 23 c2 df f8 e8 e0 a8 32 c7 83 c4 20
Oct  1 13:26:47 fs1.fl.apexrad.com kernel:  <0>Fatal exception: panic in 5
seconds
Oct  1 13:49:59 fs1.fl.apexrad.com syslogd 1.4.1: restart.


From mnapolis at redhat.com  Mon Oct  2 00:12:14 2006
From: mnapolis at redhat.com (Isauro Michael Napolis)
Date: Mon, 02 Oct 2006 10:12:14 +1000
Subject: [Linux-cluster] ip-tiebreaker quotes need explainging
In-Reply-To: <20061001025203.76527.qmail@web34209.mail.mud.yahoo.com>
References: <20061001025203.76527.qmail@web34209.mail.mud.yahoo.com>
Message-ID: <1159747934.4661.4.camel@localhost.localdomain>

Hi,

The ip tiebreaker is primarily used during network split-brain
situations in a even numbered (2,4, etc.) cluster.  The ip tiebreaker
provides the extra vote (to form a quorum (50% + 1)) to determine who
should be the next/ new master. 

cheers,
Michael


On Sun, 2006-10-01 at 12:52, Rick Rodgers wrote:
> Can anyone explain why this is so. Why is it only used on maintaining
> qourum and not startup?
> 
> 
> "The IP tiebreaker is typically used to *maintain* a quorum after a node
> failure, because there are certain network faults in which two nodes may
> see the tiebreaker - but not each other.
> -- Lon"
> 
> 
> 
> ______________________________________________________________________
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From rpeterso at redhat.com  Mon Oct  2 00:14:54 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Sun, 01 Oct 2006 19:14:54 -0500
Subject: [Linux-cluster] Panic
In-Reply-To: <003401c6e58d$b71dc9d0$0245450a@AndreLaptop>
References: <003401c6e58d$b71dc9d0$0245450a@AndreLaptop>
Message-ID: <452059FE.3030104@redhat.com>

andre at hudat.com wrote:
> I have the following panic on two nodes hours apart. Each node Is in a
> different state ( as in states of the US ). NO I am not running a cluster
> over a WAN, just two separate clusters in two different locations. Files are
> written on one cluster and I have a script that does an SCP of the file to
> the other cluster. Both machines running the latest RHEL4 with the latest
> GFS updates. This just started happening. Happened twice since Friday
> morning. Any hints ? What is happening with clvmd here ? What does the
> global conflict message mean ?
>
> --
> Andre
>   
Hi Andre,

This might be the same as bugzilla bug 208134.
See https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208134.
There is a patch to try with the bugzilla.

Regards,

Bob Peterson
Red Hat Cluster Suite


From andre at hudat.com  Mon Oct  2 02:24:51 2006
From: andre at hudat.com (Andre Henry)
Date: Sun, 1 Oct 2006 22:24:51 -0400
Subject: [Linux-cluster] Panic
In-Reply-To: <452059FE.3030104@redhat.com>
Message-ID: <003501c6e5c9$f11dd850$0245450a@AndreLaptop>

Says I am not authorized. Even after creating an account. Can I access
bugzilla from my rhn account ?

--
Thanks
Andre

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-
> bounces at redhat.com] On Behalf Of Robert Peterson
> Sent: Sunday, October 01, 2006 8:15 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Panic
> 
> andre at hudat.com wrote:
> > I have the following panic on two nodes hours apart. Each node Is in a
> > different state ( as in states of the US ). NO I am not running a
> cluster
> > over a WAN, just two separate clusters in two different locations. Files
> are
> > written on one cluster and I have a script that does an SCP of the file
> to
> > the other cluster. Both machines running the latest RHEL4 with the
> latest
> > GFS updates. This just started happening. Happened twice since Friday
> > morning. Any hints ? What is happening with clvmd here ? What does the
> > global conflict message mean ?
> >
> > --
> > Andre
> >
> Hi Andre,
> 
> This might be the same as bugzilla bug 208134.
> See https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208134.
> There is a patch to try with the bugzilla.
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From rodgersr at yahoo.com  Mon Oct  2 03:52:38 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Sun, 1 Oct 2006 20:52:38 -0700 (PDT)
Subject: [Linux-cluster] ip-tiebreaker quotes need explainging
Message-ID: <20061002035238.66323.qmail@web34203.mail.mud.yahoo.com>

Thanks for replying to me especically on a Sunday. I was familiar with everything you 
said. The question concerned not being able to use the tiebreaker to form quarum on STARTUP.
Why is this? 

Also Lon sent me a URL  to a doc I need but it seems the URL is stale. Can you help
me? Here is the URL:


http://people.redhat.com/lhh/rhcm-3-internals.odt


Rick


----- Original Message ----
From: Isauro Michael Napolis <mnapolis at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Sent: Sunday, October 1, 2006 5:12:14 PM
Subject: Re: [Linux-cluster] ip-tiebreaker quotes need explainging

Hi,

The ip tiebreaker is primarily used during network split-brain
situations in a even numbered (2,4, etc.) cluster.  The ip tiebreaker
provides the extra vote (to form a quorum (50% + 1)) to determine who
should be the next/ new master. 

cheers,
Michael


On Sun, 2006-10-01 at 12:52, Rick Rodgers wrote:
> Can anyone explain why this is so. Why is it only used on maintaining
> qourum and not startup?
> 
> 
> "The IP tiebreaker is typically used to *maintain* a quorum after a node
> failure, because there are certain network faults in which two nodes may
> see the tiebreaker - but not each other.
> -- Lon"
> 
> 
> 
> ______________________________________________________________________
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061001/1db510d9/attachment.htm>

From rodgersr at yahoo.com  Mon Oct  2 03:59:42 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Sun, 1 Oct 2006 20:59:42 -0700 (PDT)
Subject: [Linux-cluster] How does disk prevent split brain?
Message-ID: <20061002035942.52252.qmail@web34213.mail.mud.yahoo.com>

According to RH documentation and the engineers you can have a safe two node 
cluster without having to have an ip-tiebreaker. They say it will use the disk tie-breaker.

How is this possible?  Since clulockd is running separately on each of the nodes how does
it prevent each node from accessing the disk at the same time and declaring himself the 
Active node? If it is through some sort of locking mechanism that insures this can you please
be technically specific in what it is?


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061001/04ee3773/attachment.htm>

From rodgersr at yahoo.com  Mon Oct  2 05:12:28 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Sun, 1 Oct 2006 22:12:28 -0700 (PDT)
Subject: [Linux-cluster] RH documentation and RH Engineering do not agree
Message-ID: <20061002051228.1904.qmail@web34209.mail.mud.yahoo.com>


I am using a two node cluster without a tiebreaker and find that the 
documentation that RH provides and some of the technical info provided
by RH engineering folks do not agree with each other. Redhaat docs say that if 
all netowrking fails that the nodes will not failover because they still have the
disk to act as heartbeat. Also the docs say that the services will continue. Yet, I tested
theis and it does not aahhpen this way. The system will get stonithed. So what is the the
real story here? Below are the docs I am talking about.

Below are two excerpts from RH documentation that says the following about loss of network connections
in a two node cluster:

----------------------------------------------------------------------------
    Total Network Connection Failure

    A total network connection failure occurs when all the heartbeat network connections between the systems fail.     This can be caused by one of the following:

             All the heartbeat network cables are disconnected from a system.
          All the serial connections and network interfaces used for heartbeat communication fail.

    If a total network connection failure occurs, both systems detect the problem, but they also detect that the SCSI     disk connections are still active. Therefore, services remain running on the systems and are not interrupted.


---------------------------------------------------------------------------------
From a RH FAQ list
----------------------------------------------------------------------------------

    E.4. Common Behaviors: Two Member Cluster with Disk-basedTie-breaker

    Loss of network connectivity to other member, shared media still accessible
    Common Causes: Network connectivity lost.
    Test Case: Disconnect all network cables from a member.
    Expected Behavior: No fail-over unless disk updates are also lost. Services will not be able to be
    relocated in most cases, which is due to the fact that the lock server requires network connectivity.
----------------------------------------------------------------------


However this does not seem to be the case. The systems stop the service or get STONITHed.

Below is some info form this message board with a reply from RH engineering that seems to 
confirm that the nodes will get STONITHed.  THis is followed by more RH engineering that 
conform to the RH docs. ????/

  
  RE: [Linux-cluster] Tiebreaker IP

------------------------------------------------------------------------

    * /From/: <JACOB_LIBERMAN Dell com>
    * /To/: <linux-cluster redhat com>
    * /Subject/: RE: [Linux-cluster] Tiebreaker IP
    * /Date/: Fri, 26 Aug 2005 13:24:39 -0500

------------------------------------------------------------------------

    Rob,

    Heres a summary of what I have observed with this configuration. You may
    want to verify the accuracy of my observations on your own. 

    Starting with RHEL3, the RHCS verified node membership via a network
    heartbeat rather than/in addition to a disk timestamp. The network
    heartbeat traffic moves over the same interface that is used to access
    the network resources. This means that there is no dedicated heartbeat
    interface like you would see in a microsoft cluster.

    The tiebreaker IP is used to prevent a split brain situation in a a
    cluster with an even number of nodes. Lets say you have 2 active cluster
    nodes... say nodeA and nodeB, and nodeA owns an NFS disk resource and
    IP. Then lets say nodeA fails to receive a heartbeat from nodeB over its
    primary interface. This could mean several things: nodeA's interface is
    down, nodeB's interface is down, or their shared switch is down. So if
    nodeA and nodeB stop communicating with eachother, they will both try to
    ping the tiebreaker IP, which is usually your default gateway IP. If
    nodeA gets a response from the tiebreaker IP, it will continue to own
    the resource. If it cant, it will assume its external interface is down
    and fence/reboot itself. The same holds true for nodeB. Unlike RHEL2.1
    which used STONITH, RHEL3 cluster nodes reboot themselves. Therefor,
    even if nodeB can reach the tiebreaker and CANT reach nodeA, it will not
    get the cluster resource until nodeA releases it. This prevents the
    nodes from accessing the shared disk resource concommitantly.

    This configuration prevents split brain by ensuring the resource owner
    doesn't get killed accidentally by its peer. For those that remember,
    ping=ponging was a big problem with RHEL2.1 clusters. If they couldn't
    read their partners disk timestamp update in a timely manner -- due to
    IO latency or whatever -- they would reboot their partner node. On
    reboot, the rebooted node would STONITH the other node, etc.

    Anyway, I hope this answers your questions. It is fairly easy to test.
    Set up a 2 node cluster, then reboot the service owner. If the service
    starts on the other node, you should be configured correctly. Next
    disconnect the service owner from the network. The service owner should
    reboot itself with the watchdog or fail over the resource, depending on
    how its ocnfigured. Repeat this test with the non-service owner. (the
    resources should not move in this case.) then take turns disconnecting
    them from the shared storage. 

    Cheers, jacob


-----------------------------------------------------------------------------
     RE: [Linux-cluster] Tiebreaker IP

------------------------------------------------------------------------

    * /From/: Lon Hohberger <lhh redhat com>
    * /To/: linux clustering <linux-cluster redhat com>
    * /Subject/: RE: [Linux-cluster] Tiebreaker IP
    * /Date/: Mon, 29 Aug 2005 15:19:40 -0400

------------------------------------------------------------------------

    On Fri, 2005-08-26 at 13:24 -0500, JACOB_LIBERMAN Dell com wrote:

    > If it cant, it will assume its external interface is down
    > and fence/reboot itself. The same holds true for nodeB. Unlike RHEL2.1
    > which used STONITH, RHEL3 cluster nodes reboot themselves. 

    Both use STONITH.  RHEL3 cluster nodes are more paranoid about running
    without STONITH.

    If STONITH is configured on a RHEL3 cluster, the node will instead wait
    to be shot -- or for a new quorum to form -- if it loses network
    connectivity.

    > Anyway, I hope this answers your questions. It is fairly easy to test.
    > Set up a 2 node cluster, then reboot the service owner. If the service
    > starts on the other node, you should be configured correctly. Next
    > disconnect the service owner from the network. The service owner should
    > reboot itself with the watchdog or fail over the resource, depending on
    > how its ocnfigured.

    It should reboot itself because it loses quorum, really.  Basically,
    without STONITH, a node thinks like this on RHEL3:

    "I was quorate and now I'm not, and no one can cut me off from shared
    storage... Uh, oh, REBOOT!"

-- Lon

------------------------------------------------------------------------


more
-----------------------------------------------------------------------------  

    >The disk tiebreaker works in a similar way, except that it lets the

    >cluster limp in along in a safe, semi-split-brain (split brain) in a

    >network outage.  What I mean is that because there's state information

    >written to/read from the shared raw partitions, the nodes can actually

    >tell via other means whether or not the other node is "alive" or not as

    >opposed to relying solely on the network traffic.

    >Both nodes update state information on the shared partitions.  When one

    >node detects that the other node has not updated its information for a

    >period of time, that node is "down" according to the disk subsystem.If

    >this coincides with a "down" status from the membership daemon, the node

    >is fenced and services are failed over.  If the node never goes down

    >(and keeps updating its information on the shared partitions), then the

    >node is never fenced and services never fail over.

    -- Lon


14. What is a quorum disk/partition and what does it do for you?

      A quorum disk or partition is a section of a disk that's set up
      for use with components of the cluster project. It has a couple of
      purposes. Again, I'll explain with an example.

      Suppose you have nodes A and B, and node A fails to get several of
      cluster manager's "heartbeat" packets from node B. Node A doesn't
      know why it hasn't received the packets, but there are several
      possibilities: either node B has failed, the network switch or hub
      has failed, node A's network adapter has failed, or maybe just
      because node B was just too busy to send the packet. That can
      happen if your cluster is extremely large, your systems are
      extremely busy or your network is flakey.

      Node A doesn't know which is the case, and it doesn't know whether
      the problem lies within itself or with node B. This is especially
      problematic in a two-node cluster because both nodes, out of touch
      with one another, can try to fence the other.

      So before fencing a node, it would be nice to have another way to
      check if the other node is really alive, even though we can't seem
      to contact it. A quorum disk gives you the ability to do just
      that. Before fencing a node that's out of touch, the cluster
      software can check whether the node is still alive based on
      whether it has written data to the quorum partition.

      In the case of two-node systems, the quorum disk also acts as a
      tie-breaker. If a node has access to the quorum disk and the
      network, that counts as two votes.

      A node that has lost contact with the network or the quorum disk
      has lost a vote, and therefore may safely be fenced.


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061001/34f3c0d0/attachment.htm>

From lhh at redhat.com  Mon Oct  2 13:10:16 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 09:10:16 -0400
Subject: Fwd: Re: [Linux-cluster] Disk tie breaker -how does it work?
In-Reply-To: <20060929214217.2230.qmail@web34210.mail.mud.yahoo.com>
References: <20060929214217.2230.qmail@web34210.mail.mud.yahoo.com>
Message-ID: <1159794616.3047.2.camel@localhost.localdomain>

On Fri, 2006-09-29 at 14:42 -0700, Rick Rodgers wrote:
> Thanks but the page can not be found. 404 error. Do i need to 
> download some keys?

Whoops, typo.

>         http://people.redhat.com/lhh/rhcm-3-internals.odt

http://people.redhat.com/lhh/rhcm-el3-internals.odt

Tested it this time.

-- Lon


From troels at arvin.dk  Mon Oct  2 13:16:37 2006
From: troels at arvin.dk (Troels Arvin)
Date: Mon, 02 Oct 2006 15:16:37 +0200
Subject: [Linux-cluster] Quorum disk: Can it be a partition?
Message-ID: <pan.2006.10.02.13.16.36.469000@arvin.dk>

Hello,

About the new quorum disk feature in RH Cluster Suite Update 4:
Does it have to be a complete, dedicated disk, or is it just as fine to
use a disk partition?

-- 
Greetings from Troels Arvin


From rpeterso at redhat.com  Mon Oct  2 13:46:14 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Mon, 02 Oct 2006 08:46:14 -0500
Subject: [Linux-cluster] Quorum disk: Can it be a partition?
In-Reply-To: <pan.2006.10.02.13.16.36.469000@arvin.dk>
References: <pan.2006.10.02.13.16.36.469000@arvin.dk>
Message-ID: <45211826.1050703@redhat.com>

Troels Arvin wrote:
> Hello,
>
> About the new quorum disk feature in RH Cluster Suite Update 4:
> Does it have to be a complete, dedicated disk, or is it just as fine to
> use a disk partition?
Hi Troels,

It doesn't have to be a disk; a partition is just fine.
http://sources.redhat.com/cluster/faq.html#quorum

Regards,

Bob Peterson
Red Hat Cluster Suite


From teigland at redhat.com  Mon Oct  2 15:31:14 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 2 Oct 2006 10:31:14 -0500
Subject: [Linux-cluster] Panic
In-Reply-To: <452059FE.3030104@redhat.com>
References: <003401c6e58d$b71dc9d0$0245450a@AndreLaptop>
	<452059FE.3030104@redhat.com>
Message-ID: <20061002153114.GD19242@redhat.com>

On Sun, Oct 01, 2006 at 07:14:54PM -0500, Robert Peterson wrote:
> andre at hudat.com wrote:
> >I have the following panic on two nodes hours apart. Each node Is in a
> >different state ( as in states of the US ). NO I am not running a cluster
> >over a WAN, just two separate clusters in two different locations. Files 
> >are
> >written on one cluster and I have a script that does an SCP of the file to
> >the other cluster. Both machines running the latest RHEL4 with the latest
> >GFS updates. This just started happening. Happened twice since Friday
> >morning. Any hints ? What is happening with clvmd here ? What does the
> >global conflict message mean ?

The clvmd and "global conflict" lines are just normal stuff from the debug
buffer that was dumped on the panic, they're not relevant here.

> This might be the same as bugzilla bug 208134.
> See https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208134.
> There is a patch to try with the bugzilla.

Bug 208134 is not related, but bug 199673 may be.

Dave


From Andre at hudat.com  Mon Oct  2 15:58:50 2006
From: Andre at hudat.com (Andre Henry)
Date: Mon, 2 Oct 2006 11:58:50 -0400
Subject: [Linux-cluster] Panic
In-Reply-To: <20061002153114.GD19242@redhat.com>
References: <003401c6e58d$b71dc9d0$0245450a@AndreLaptop>
	<452059FE.3030104@redhat.com> <20061002153114.GD19242@redhat.com>
Message-ID: <c7489bcd74b0bdea21ddd6d8c6a9a195@hudat.com>

This is exactly what we are doing. The system creates and deletes 
several thousand files per day.

I take it this has not made it to a GFS release as yet ? So I would 
have to manually apply the patch and rebuild the DLM kernel ?

--
Thanks
Andre

On Oct 2, 2006, at 11:31 AM, David Teigland wrote:

> On Sun, Oct 01, 2006 at 07:14:54PM -0500, Robert Peterson wrote:
>> andre at hudat.com wrote:
>>> I have the following panic on two nodes hours apart. Each node Is in 
>>> a
>>> different state ( as in states of the US ). NO I am not running a 
>>> cluster
>>> over a WAN, just two separate clusters in two different locations. 
>>> Files
>>> are
>>> written on one cluster and I have a script that does an SCP of the 
>>> file to
>>> the other cluster. Both machines running the latest RHEL4 with the 
>>> latest
>>> GFS updates. This just started happening. Happened twice since Friday
>>> morning. Any hints ? What is happening with clvmd here ? What does 
>>> the
>>> global conflict message mean ?
>
> The clvmd and "global conflict" lines are just normal stuff from the 
> debug
> buffer that was dumped on the panic, they're not relevant here.
>
>> This might be the same as bugzilla bug 208134.
>> See https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208134.
>> There is a patch to try with the bugzilla.
>
> Bug 208134 is not related, but bug 199673 may be.
>
> Dave
>


From teigland at redhat.com  Mon Oct  2 16:07:25 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 2 Oct 2006 11:07:25 -0500
Subject: [Linux-cluster] Panic
In-Reply-To: <c7489bcd74b0bdea21ddd6d8c6a9a195@hudat.com>
References: <003401c6e58d$b71dc9d0$0245450a@AndreLaptop>
	<452059FE.3030104@redhat.com> <20061002153114.GD19242@redhat.com>
	<c7489bcd74b0bdea21ddd6d8c6a9a195@hudat.com>
Message-ID: <20061002160725.GE19242@redhat.com>

On Mon, Oct 02, 2006 at 11:58:50AM -0400, Andre Henry wrote:
> This is exactly what we are doing. The system creates and deletes 
> several thousand files per day.
> 
> I take it this has not made it to a GFS release as yet ? So I would 
> have to manually apply the patch and rebuild the DLM kernel ?

It'll be in the next errata release.  Until then, yes, you can patch and
rebuild the dlm kernel module.  Here's the diff:

http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/dlm-kernel/src/Attic/lkb.c.diff?r1=1.3.2.1&r2=1.3.2.1.12.1&cvsroot=cluster&only_with_tag=RHEL4U4&f=h

Dave


From dbrieck at gmail.com  Mon Oct  2 16:10:34 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Mon, 2 Oct 2006 12:10:34 -0400
Subject: [Linux-cluster] GNBD Ports
Message-ID: <8c1094290610020910s53ac1575gf6c596f4cdef54a8@mail.gmail.com>

Which ports would need to be open on a firewall to use GNBD server?

Nothing is mentioned in this document about them.
http://sources.redhat.com/cluster/faq.html#iptables

Thanks.


From lhh at redhat.com  Mon Oct  2 16:17:39 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 12:17:39 -0400
Subject: [Linux-cluster] ip-tiebreaker quotes need explainging
In-Reply-To: <20061002035238.66323.qmail@web34203.mail.mud.yahoo.com>
References: <20061002035238.66323.qmail@web34203.mail.mud.yahoo.com>
Message-ID: <1159805859.22558.0.camel@rei.boston.devel.redhat.com>

On Sun, 2006-10-01 at 20:52 -0700, Rick Rodgers wrote:
> Thanks for replying to me especically on a Sunday. I was familiar with
> everything you 
> said. The question concerned not being able to use the tiebreaker to
> form quarum on STARTUP.
> Why is this? 
> 
> Also Lon sent me a URL  to a doc I need but it seems the URL is stale.
> Can you help
> me? Here is the URL:
> 
> 
> http://people.redhat.com/lhh/rhcm-3-internals.odt

http://people.redhat.com/lhh/rhcm-el3-internals.odt
                                  ^^^

Typo.

-- Lon


From lhh at redhat.com  Mon Oct  2 16:20:14 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 12:20:14 -0400
Subject: [Linux-cluster] How does disk prevent split brain?
In-Reply-To: <20061002035942.52252.qmail@web34213.mail.mud.yahoo.com>
References: <20061002035942.52252.qmail@web34213.mail.mud.yahoo.com>
Message-ID: <1159806014.22558.3.camel@rei.boston.devel.redhat.com>

On Sun, 2006-10-01 at 20:59 -0700, Rick Rodgers wrote:
> According to RH documentation and the engineers you can have a safe
> two node 
> cluster without having to have an ip-tiebreaker. They say it will use
> the disk tie-breaker.
> 
> How is this possible?  Since clulockd is running separately on each of
> the nodes how does
> it prevent each node from accessing the disk at the same time and
> declaring himself the 
> Active node? If it is through some sort of locking mechanism that
> insures this can you please
> be technically specific in what it is?

Only one will be the lock master, because there is knowledge that both
nodes are actually still online.

You can not disable/move services when the cluster network is dead, but
both nodes can continue running the services they already are running.
Most people should probably use the disk tiebreaker if they want
failover if a node's cluster network gets disconnected.

-- Lon


From rodgersr at yahoo.com  Mon Oct  2 16:38:04 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Mon, 2 Oct 2006 09:38:04 -0700 (PDT)
Subject: [Linux-cluster] How does disk prevent split brain?
Message-ID: <20061002163804.82857.qmail@web34212.mail.mud.yahoo.com>

Thanks for the info. Yes I understand what you are saying. However, if 
you do not specify and ip-tiebreaker I assumed it used the disk tiebreaker
by default.

----- Original Message ----
From: Lon Hohberger <lhh at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Sent: Monday, October 2, 2006 9:20:14 AM
Subject: Re: [Linux-cluster] How does disk prevent split brain?

On Sun, 2006-10-01 at 20:59 -0700, Rick Rodgers wrote:
> According to RH documentation and the engineers you can have a safe
> two node 
> cluster without having to have an ip-tiebreaker. They say it will use
> the disk tie-breaker.
> 
> How is this possible?  Since clulockd is running separately on each of
> the nodes how does
> it prevent each node from accessing the disk at the same time and
> declaring himself the 
> Active node? If it is through some sort of locking mechanism that
> insures this can you please
> be technically specific in what it is?

Only one will be the lock master, because there is knowledge that both
nodes are actually still online.

You can not disable/move services when the cluster network is dead, but
both nodes can continue running the services they already are running.
Most people should probably use the disk tiebreaker if they want
failover if a node's cluster network gets disconnected.

-- Lon


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061002/30374df7/attachment.htm>

From damian.osullivan at hp.com  Mon Oct  2 16:41:37 2006
From: damian.osullivan at hp.com (O'Sullivan, Damian)
Date: Mon, 2 Oct 2006 17:41:37 +0100
Subject: [Linux-cluster] LVM2 cluster problem
Message-ID: <644A0966265D9D40AC7584FCE95611130308B00D@dubexc01.emea.cpqcorp.net>

Hi all,

A few days ago I had to reset a node in the cluster. On reboot I get the
following :

Loading jbd.ko module
Loading ext3.ko module
Loading dm-mirror.ko module
Loading dm-zero.ko module
Loading dm-snapshot.ko module
Making device-mapper control node
Scanning logical volumes
  Unable to open external locking library liblvm2clusterlock.so
  Reading all physical volumes.  This may take a while...
cdrom: open failed.
  Found volume group "VolGroup00" using metadata type lvm2
  Found volume group "MSA_VOL" using metadata type lvm2
Activating logical volumes
  Unable to open external locking library liblvm2clusterlock.so
  Locking inactive: ignoring clustered volume group VolGroup00
ERROR: /bin/lvm exited abnormally! (pid 402)
Creating root device
Mounting root filesystem
mount: error 6 mounting ext3
mount: error 2 mounting none
Switching to new root
switchroot: mount failed: 22
umount /initrd/dev failed: 2
Kernel panic - not syncing: Attempted to kill init!

This is a result of the following commands in my initrd :

insmod /lib/jbd.ko
echo "Loading ext3.ko module"
insmod /lib/ext3.ko
echo "Loading dm-mirror.ko module"
insmod /lib/dm-mirror.ko
echo "Loading dm-zero.ko module"
insmod /lib/dm-zero.ko
echo "Loading dm-snapshot.ko module"
insmod /lib/dm-snapshot.ko
/sbin/udevstart
echo Making device-mapper control node
mkdmnod
echo Scanning logical volumes
lvm vgscan --ignorelockingfailure
echo Activating logical volumes
lvm vgchange -ay --ignorelockingfailure VolGroup00
echo Creating root device
mkrootdev /dev/root
umount /sys
echo Mounting root filesystem
mount -o defaults --ro -t ext3 /dev/root /sysroot
mount -t tmpfs --bind /dev /sysroot/dev
echo Switching to new root
switchroot /sysroot
umount /initrd/dev


The other node is exactly the same and comes up no problem with the same
initrd + kernel. I can access all the logical volumes from a rescue
disk.
All based on Centos 4.4 with latest updates.
Any ideas?

Thanks,

D.

Kernel : 2.6.9-42.0.2.Elsmp
Cluster LVM daemon version: 2.02.06 (2006-05-12)
Protocol version:           0.2.1


From lhh at redhat.com  Mon Oct  2 18:14:54 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 14:14:54 -0400
Subject: [Linux-cluster] How does disk prevent split brain?
In-Reply-To: <20061002163804.82857.qmail@web34212.mail.mud.yahoo.com>
References: <20061002163804.82857.qmail@web34212.mail.mud.yahoo.com>
Message-ID: <1159812894.3103.18.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-02 at 09:38 -0700, Rick Rodgers wrote:
> Thanks for the info. Yes I understand what you are saying. However,
> if 
> you do not specify and ip-tiebreaker I assumed it used the disk
> tiebreaker
> by default.

That's correct.

-- Lon


From lhh at redhat.com  Mon Oct  2 18:16:03 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 14:16:03 -0400
Subject: [Linux-cluster] RH documentation and RH Engineering do not agree
In-Reply-To: <20061002051228.1904.qmail@web34209.mail.mud.yahoo.com>
References: <20061002051228.1904.qmail@web34209.mail.mud.yahoo.com>
Message-ID: <1159812963.3103.20.camel@rei.boston.devel.redhat.com>

On Sun, 2006-10-01 at 22:12 -0700, Rick Rodgers wrote:

> However this does not seem to be the case. The systems stop the
> service or get STONITHed.

That was a bug.

  cludb -p cluquorumd%disk_quorum 1

In a future release, it will be set to this by default.

-- Lon

> 


From Leonardo.Mello at planejamento.gov.br  Mon Oct  2 18:47:30 2006
From: Leonardo.Mello at planejamento.gov.br (Leonardo Rodrigues de Mello)
Date: Mon, 2 Oct 2006 15:47:30 -0300
Subject: RES: [Linux-cluster] RH documentation and RH Engineering do not agree
Message-ID: <1DDCE5B29CB5BC42BC2BFC39E3F1C8A3255BF2@corp-bsa-mp01.planejamento.gov.br>

Lon,

Congratulations for the documentation!!! :-D


-----Mensagem original-----
De:	linux-cluster-bounces at redhat.com em nome de Lon Hohberger
Enviada:	seg 2/10/2006 15:16
Para:	linux clustering
Cc:	
Assunto:	Re: [Linux-cluster] RH documentation and RH Engineering do not agree

On Sun, 2006-10-01 at 22:12 -0700, Rick Rodgers wrote:

> However this does not seem to be the case. The systems stop the
> service or get STONITHed.

That was a bug.

  cludb -p cluquorumd%disk_quorum 1

In a future release, it will be set to this by default.

-- Lon

> 

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2970 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061002/3e651c6a/attachment.bin>

From lhh at redhat.com  Mon Oct  2 20:56:03 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 16:56:03 -0400
Subject: [Linux-cluster] ip-tiebreaker quotes need explainging
In-Reply-To: <20061001025203.76527.qmail@web34209.mail.mud.yahoo.com>
References: <20061001025203.76527.qmail@web34209.mail.mud.yahoo.com>
Message-ID: <1159822563.3103.48.camel@rei.boston.devel.redhat.com>

On Sat, 2006-09-30 at 19:52 -0700, Rick Rodgers wrote:
> Can anyone explain why this is so. Why is it only used on maintaining
> qourum and not startup?
> 
> 
> "The IP tiebreaker is typically used to *maintain* a quorum after a node
> failure, because there are certain network faults in which two nodes may
> see the tiebreaker - but not each other.
> -- Lon"


In certain situations  (ex: ARP storms, switch loops, etc.), it is
possible to see the IP-tiebreaker (an upstream router) but *not* your
peer over the switch.  If this happens to both nodes, you have a split
brain.  I should update the internals big to reflect the 'why'.  You can
change this behavior using cludb.

Note that IP tiebreakers have to be in the cluster communications path -
you can't use heartbeating over a private network and an IP tiebreaker
on another network.

-- Lon


From lhh at redhat.com  Mon Oct  2 21:12:57 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 17:12:57 -0400
Subject: [Linux-cluster] clurmtabd
In-Reply-To: <20060929213741.97281.qmail@web34212.mail.mud.yahoo.com>
References: <20060929213741.97281.qmail@web34212.mail.mud.yahoo.com>
Message-ID: <1159823577.3103.56.camel@rei.boston.devel.redhat.com>

On Fri, 2006-09-29 at 14:37 -0700, Rick Rodgers wrote:
> it does not seem to work that way. I tested it and it only got what
> was mounted on the specified directory. Not the subdirectories.
> Has this changed recently (in the last 2 years?)

http://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=80081

It looks like it was never fixed in the clumanager-1.2.x tree.

It was fixed for RHEL2.1 (clumanager-1.0.x) a long time ago, but not for
RHCS3.  You should file a bugzilla if you need it fixed.

-- Lon


From lhh at redhat.com  Mon Oct  2 21:18:45 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 02 Oct 2006 17:18:45 -0400
Subject: [Linux-cluster] clurmtabd
In-Reply-To: <1159823577.3103.56.camel@rei.boston.devel.redhat.com>
References: <20060929213741.97281.qmail@web34212.mail.mud.yahoo.com>
	<1159823577.3103.56.camel@rei.boston.devel.redhat.com>
Message-ID: <1159823925.3103.58.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-02 at 17:12 -0400, Lon Hohberger wrote:
> On Fri, 2006-09-29 at 14:37 -0700, Rick Rodgers wrote:
> > it does not seem to work that way. I tested it and it only got what
> > was mounted on the specified directory. Not the subdirectories.
> > Has this changed recently (in the last 2 years?)
> 
> http://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=80081
> 
> It looks like it was never fixed in the clumanager-1.2.x tree.
> 
> It was fixed for RHEL2.1 (clumanager-1.0.x) a long time ago, but not for
> RHCS3.  You should file a bugzilla if you need it fixed.

I've filed this as a clone of 80081:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208995

-- Lon


From rodgersr at yahoo.com  Mon Oct  2 21:45:19 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Mon, 2 Oct 2006 14:45:19 -0700 (PDT)
Subject: [Linux-cluster] RH documentation and RH Engineering do not agree
Message-ID: <20061002214519.56791.qmail@web34211.mail.mud.yahoo.com>

Ok thanks for the info. It was all good documentation just left me a little confused.
Thanks for all your feedback it has beenn a great help.

Rick

----- Original Message ----
From: Lon Hohberger <lhh at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Sent: Monday, October 2, 2006 11:16:03 AM
Subject: Re: [Linux-cluster] RH documentation and RH Engineering do not agree

On Sun, 2006-10-01 at 22:12 -0700, Rick Rodgers wrote:

> However this does not seem to be the case. The systems stop the
> service or get STONITHed.

That was a bug.

  cludb -p cluquorumd%disk_quorum 1

In a future release, it will be set to this by default.

-- Lon

> 

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061002/9d054ebf/attachment.htm>

From rodgersr at yahoo.com  Mon Oct  2 21:53:13 2006
From: rodgersr at yahoo.com (Rick Rodgers)
Date: Mon, 2 Oct 2006 14:53:13 -0700 (PDT)
Subject: [Linux-cluster] clurmtabd
Message-ID: <20061002215313.69567.qmail@web34201.mail.mud.yahoo.com>

Ok thanks.At least now I know I am not going crazy :-)

----- Original Message ----
From: Lon Hohberger <lhh at redhat.com>
To: linux clustering <linux-cluster at redhat.com>
Sent: Monday, October 2, 2006 2:12:57 PM
Subject: Re: [Linux-cluster] clurmtabd

On Fri, 2006-09-29 at 14:37 -0700, Rick Rodgers wrote:
> it does not seem to work that way. I tested it and it only got what
> was mounted on the specified directory. Not the subdirectories.
> Has this changed recently (in the last 2 years?)

http://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=80081

It looks like it was never fixed in the clumanager-1.2.x tree.

It was fixed for RHEL2.1 (clumanager-1.0.x) a long time ago, but not for
RHCS3.  You should file a bugzilla if you need it fixed.

-- Lon


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061002/5e064f7e/attachment.htm>

From jprats at cesca.es  Tue Oct  3 06:18:45 2006
From: jprats at cesca.es (Jordi Prats)
Date: Tue, 03 Oct 2006 08:18:45 +0200
Subject: [Linux-cluster] problems relocating services
Message-ID: <452200C5.6080406@cesca.es>

Hi all,
I have a problem relocating services, sometimes fails. I have all 
requiered operations nested, so it shoud not be a race condition. How 
could add some verbosity to syslog to find out why is failing?

Thanks,

-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From lhh at redhat.com  Tue Oct  3 13:50:33 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 03 Oct 2006 09:50:33 -0400
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <452200C5.6080406@cesca.es>
References: <452200C5.6080406@cesca.es>
Message-ID: <1159883433.8020.3.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-03 at 08:18 +0200, Jordi Prats wrote:
> Hi all,
> I have a problem relocating services, sometimes fails. I have all 
> requiered operations nested, so it shoud not be a race condition. How 
> could add some verbosity to syslog to find out why is failing?

Did it go in to the "failed" state, or did it simply "fail" to relocate?


Change /etc/syslog.conf to add the following on all nodes:

   local4.* /var/log/rgmanager

Change your "rm" tag in cluster.conf to enable debugging:

   <rm log_level="7" log_facility="local4">

(don't forget to increment the configuration version)

Run ccs_tool update /etc/cluster/cluster.conf


-- Lon


From lhh at redhat.com  Tue Oct  3 14:30:38 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 03 Oct 2006 10:30:38 -0400
Subject: [Linux-cluster] Red Hat Linux AS 4 U3 Clustering
In-Reply-To: <7BED60E643BD1C4F8A84E3F0B411C14A0F3F31@srit_mail.renaissance-it.com>
References: <7BED60E643BD1C4F8A84E3F0B411C14A0F3F31@srit_mail.renaissance-it.com>
Message-ID: <1159885839.8020.7.camel@rei.boston.devel.redhat.com>

On Sat, 2006-09-30 at 12:13 +0530, Jotheswaran M wrote:
> Hi All,
>  
> I am new to this forum, I have a problem with Red Hat Linux AS 4 U3
> Clustering I have used IBM Xseries 366 servers with two HBA's and
> DS4300 SAN storage.
>  
> I have installed and configured the OS and the clustering with out any
> issues. I am running oracle9i as the database and the same has been
> configured in the cluster and it works fine, I can also fail over it
> works fine.
>  
> The problem is if I shutdown one server or remove the power chord of
> one server the cluster doesn't switch over but if I go through the
> normal shutdown the cluster switches.
>  
> Can you gueys help me to resolve this please.

Ok, first of all - please try updating rgmanager, magma and
magma-plugins to the U4 versions (you don't have to update anything
else). :)

-- Lon


From spatuality at yahoo.ca  Tue Oct  3 14:33:58 2006
From: spatuality at yahoo.ca (Brian)
Date: Tue, 3 Oct 2006 07:33:58 -0700 (PDT)
Subject: [Linux-cluster] fence_drac broken with DRAC/MC 1.3
Message-ID: <20061003143358.38113.qmail@web30809.mail.mud.yahoo.com>

Hi group,

I have submitted a bug report for this problem, but thought it might be useful to let the group know what I've found.

I'm running RHEL 4 Update 4 on Dell PowerEdge 1955 blade servers in a chassis with DRAC/MC 1.3 firmware.

The fence_drac is able to power off/on the blade, but the script is not returning the correct status after the power is switched off/on.

Example command issued:
# fence_drac -a 10.0.0.20 -l username -p password -D debug.txt -m Server-10 -v -o off
detected drac version 'DRAC/MC'
failed: telnet returned: pattern match timed-out

Result:
Server is shut off harshly (ie. about 3 services are shutdown in init 6, then power is cut to the machine). For troubleshooting, running init 6 manually results in a full, normal shutdown of the server. If I run fence_node, with fence_drac as the script to run setup in /etc/cluster/cluster.conf, the missing expected response of server off/on results in the node being power cycled repeatedly.

Problem:
Its great that the server is getting shut down, but the Perl Telnet interface needs a known response to feedback an expected result.

I'm guessing changing the script is fairly trivial to get this working with DRAC/MC 1.3. If anyone else has this working, please pass along the fix. I will try working on this next week to see if I can kick it into working.


Brian


From damian.osullivan at hp.com  Tue Oct  3 15:17:57 2006
From: damian.osullivan at hp.com (O'Sullivan, Damian)
Date: Tue, 3 Oct 2006 16:17:57 +0100
Subject: [Linux-cluster] LVM2 cluster problem
In-Reply-To: <644A0966265D9D40AC7584FCE95611130308B00D@dubexc01.emea.cpqcorp.net>
Message-ID: <644A0966265D9D40AC7584FCE95611130308B58E@dubexc01.emea.cpqcorp.net>

> ERROR: /bin/lvm exited abnormally! (pid 402) Creating root 
> device Mounting root filesystem
> mount: error 6 mounting ext3
> mount: error 2 mounting none
> Switching to new root
> switchroot: mount failed: 22
> umount /initrd/dev failed: 2
> Kernel panic - not syncing: Attempted to kill init!
> 

Just a follow up to my own mail. The local storage was marked as
"clustered" for some reason.
vgs showed this up. I took away the c bit and it works again.

D.


From spatuality at yahoo.ca  Tue Oct  3 21:26:22 2006
From: spatuality at yahoo.ca (Brian)
Date: Tue, 3 Oct 2006 14:26:22 -0700 (PDT)
Subject: [Linux-cluster] fence_drac broken with DRAC/MC 1.3
Message-ID: <20061003212622.67382.qmail@web30807.mail.mud.yahoo.com>

I have fixed the problem and posted the change to Bugzilla. It was due to the $telnet_timeout value of 5 seconds being too short for the 1.3 DRAC/MC firmware. Dell decided to make the telnet connection slower than 1.2 for some reason.

/sbin/fence_drac, line 33:
From:
my $telnet_timeout = 10;      # Seconds to wait for matching telent response

To:
my $telnet_timeout = 10;      # Seconds to wait for matching telent response


Brian


----- Original Message ----
From: Brian <spatuality at yahoo.ca>
To: linux-cluster at redhat.com
Sent: Tuesday, October 3, 2006 10:33:58 AM
Subject: [Linux-cluster] fence_drac broken with DRAC/MC 1.3

Hi group,

I have submitted a bug report for this problem, but thought it might be useful to let the group know what I've found.

I'm running RHEL 4 Update 4 on Dell PowerEdge 1955 blade servers in a chassis with DRAC/MC 1.3 firmware.

The fence_drac is able to power off/on the blade, but the script is not returning the correct status after the power is switched off/on.

Example command issued:
# fence_drac -a 10.0.0.20 -l username -p password -D debug.txt -m Server-10 -v -o off
detected drac version 'DRAC/MC'
failed: telnet returned: pattern match timed-out

Result:
Server is shut off harshly (ie. about 3 services are shutdown in init 6, then power is cut to the machine). For troubleshooting, running init 6 manually results in a full, normal shutdown of the server. If I run fence_node, with fence_drac as the script to run setup in /etc/cluster/cluster.conf, the missing expected response of server off/on results in the node being power cycled repeatedly.

Problem:
Its great that the server is getting shut down, but the Perl Telnet interface needs a known response to feedback an expected result.

I'm guessing changing the script is fairly trivial to get this working with DRAC/MC 1.3. If anyone else has this working, please pass along the fix. I will try working on this next week to see if I can kick it into working.


Brian


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From spatuality at yahoo.ca  Tue Oct  3 21:31:12 2006
From: spatuality at yahoo.ca (Brian)
Date: Tue, 3 Oct 2006 14:31:12 -0700 (PDT)
Subject: [Linux-cluster] fence_drac broken with DRAC/MC 1.3
Message-ID: <20061003213112.40010.qmail@web30810.mail.mud.yahoo.com>

Sorry about that. The change was supposed to be from 5 seconds to 10 seconds. 

Brian


----- Original Message ----
From: Brian <spatuality at yahoo.ca>
To: linux clustering <linux-cluster at redhat.com>
Sent: Tuesday, October 3, 2006 5:26:22 PM
Subject: Re: [Linux-cluster] fence_drac broken with DRAC/MC 1.3

I have fixed the problem and posted the change to Bugzilla. It was due to the $telnet_timeout value of 5 seconds being too short for the 1.3 DRAC/MC firmware. Dell decided to make the telnet connection slower than 1.2 for some reason.

/sbin/fence_drac, line 33:
From:
my $telnet_timeout = 10;      # Seconds to wait for matching telent response

To:
my $telnet_timeout = 10;      # Seconds to wait for matching telent response


Brian


----- Original Message ----
From: Brian <spatuality at yahoo.ca>
To: linux-cluster at redhat.com
Sent: Tuesday, October 3, 2006 10:33:58 AM
Subject: [Linux-cluster] fence_drac broken with DRAC/MC 1.3

Hi group,

I have submitted a bug report for this problem, but thought it might be useful to let the group know what I've found.

I'm running RHEL 4 Update 4 on Dell PowerEdge 1955 blade servers in a chassis with DRAC/MC 1.3 firmware.

The fence_drac is able to power off/on the blade, but the script is not returning the correct status after the power is switched off/on.

Example command issued:
# fence_drac -a 10.0.0.20 -l username -p password -D debug.txt -m Server-10 -v -o off
detected drac version 'DRAC/MC'
failed: telnet returned: pattern match timed-out

Result:
Server is shut off harshly (ie. about 3 services are shutdown in init 6, then power is cut to the machine). For troubleshooting, running init 6 manually results in a full, normal shutdown of the server. If I run fence_node, with fence_drac as the script to run setup in /etc/cluster/cluster.conf, the missing expected response of server off/on results in the node being power cycled repeatedly.

Problem:
Its great that the server is getting shut down, but the Perl Telnet interface needs a known response to feedback an expected result.

I'm guessing changing the script is fairly trivial to get this working with DRAC/MC 1.3. If anyone else has this working, please pass along the fix. I will try working on this next week to see if I can kick it into working.


Brian


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From jprats at cesca.es  Wed Oct  4 07:51:33 2006
From: jprats at cesca.es (Jordi Prats)
Date: Wed, 04 Oct 2006 09:51:33 +0200
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <1159883433.8020.3.camel@rei.boston.devel.redhat.com>
References: <452200C5.6080406@cesca.es>
	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>
Message-ID: <45236805.60709@cesca.es>

Thanks,
It goes to failed state. Today I've been reloacating services but it 
fails to relocate (is not going to failed state):

Oct  4 09:07:49 inf04 clurgmgrd[6299]: <notice> Service projectes is stopped
Oct  4 09:07:49 inf04 clurgmgrd[6299]: <warning> #70: Attempting to 
restart service projectes locally.
Oct  4 09:07:49 inf04 clurgmgrd[6299]: <notice> Starting stopped service 
projectes

On the other node log do not appear anything related to the relocation. 
Maybe is a communication problem between nodes?

Jordi

Lon Hohberger wrote:
> On Tue, 2006-10-03 at 08:18 +0200, Jordi Prats wrote:
>   
>> Hi all,
>> I have a problem relocating services, sometimes fails. I have all 
>> requiered operations nested, so it shoud not be a race condition. How 
>> could add some verbosity to syslog to find out why is failing?
>>     
>
> Did it go in to the "failed" state, or did it simply "fail" to relocate?
>
>
> Change /etc/syslog.conf to add the following on all nodes:
>
>    local4.* /var/log/rgmanager
>
> Change your "rm" tag in cluster.conf to enable debugging:
>
>    <rm log_level="7" log_facility="local4">
>
> (don't forget to increment the configuration version)
>
> Run ccs_tool update /etc/cluster/cluster.conf
>
>
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>   


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From sandra-llistes at fib.upc.edu  Wed Oct  4 12:16:55 2006
From: sandra-llistes at fib.upc.edu (sandra-llistes)
Date: Wed, 04 Oct 2006 14:16:55 +0200
Subject: [Linux-cluster] GFS and samba problem, again
Message-ID: <4523A637.1060706@fib.upc.edu>

Hi,

I sent a mail a few days ago to this list related with GFS+samba problems.

Since the, we have installed a sepparated test environment also with 
two linux servers where we  have tested a samba server with an 
exported share in GFS. The share is read-only and only one server is 
exporting it.

When we try to access from a single windows client it works fine, but 
when we try to access to the same file from 2 or more windows clients 
simoultaneously, windows hangs and samba also does. This seems not to 
happen with concurrent access to different files or with linux clients.

We've also tested to export the same share without GFS and in this 
case it works fine.

It seems to be a locking problem with samba, GFS and windows clients. 
Does any of you have experienced similar problems? Do you have any 
suggestion about this?

Following is the share configuration in smb.conf:

[public]
   comment         = ShareGFS
   path            = /public
   writeable       = No
   read only       = Yes
   write list      = @admsamba
   force group     = admsamba
   create mask     = 0775
   directory mask  = 0775
   oplocks         = No
   locking = Yes
   strict locking = Yes
# I proved with locking/Strick locking=Yes and No. Always happens the 
same problem

I attach some samba logs (Level 3).
Software Versions:
	Fedora 5
	Samba 3.0.23
	GFS  6.1.5
	kernel 2.6.17-1.2187_FC5

Any help will be appreciated.

Sandra Hernandez

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: GFSlog.smbd
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061004/91cb3b0b/attachment.ksh>

From lhh at redhat.com  Wed Oct  4 16:59:54 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 04 Oct 2006 12:59:54 -0400
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <45236805.60709@cesca.es>
References: <452200C5.6080406@cesca.es>
	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>
	<45236805.60709@cesca.es>
Message-ID: <1159981194.12856.14.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-04 at 09:51 +0200, Jordi Prats wrote:
> Thanks,
> It goes to failed state. Today I've been reloacating services but it 
> fails to relocate (is not going to failed state):
> 
> Oct  4 09:07:49 inf04 clurgmgrd[6299]: <notice> Service projectes is stopped
> Oct  4 09:07:49 inf04 clurgmgrd[6299]: <warning> #70: Attempting to 
> restart service projectes locally.
> Oct  4 09:07:49 inf04 clurgmgrd[6299]: <notice> Starting stopped service 
> projectes
> 
> On the other node log do not appear anything related to the relocation. 
> Maybe is a communication problem between nodes?

It sounds like rgmanager isn't running on the other node or something;
check /proc/cluster/services? 

-- Lon


From jprats at cesca.es  Wed Oct  4 18:34:57 2006
From: jprats at cesca.es (Jordi Prats)
Date: Wed, 04 Oct 2006 20:34:57 +0200
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <1159981194.12856.14.camel@rei.boston.devel.redhat.com>
References: <452200C5.6080406@cesca.es>	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>	<45236805.60709@cesca.es>
	<1159981194.12856.14.camel@rei.boston.devel.redhat.com>
Message-ID: <4523FED1.9010404@cesca.es>

It appears to be running on both nodes:

# cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

DLM Lock Space:  "Magma"                             3   4 run       -
[1 2]

User:            "usrm::manager"                     2   3 run       -
[1 2]

# ccs_tool lsnode

Cluster name: dades, config_version: 76

Nodename                        Votes Nodeid Iface Fencetype
inf04                              1
inf05                              1

# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  inf04                                    Online, rgmanager
  inf05                                    Online, Local, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  projectes            inf04                          started
  local                inf04                          started
  mysql                inf05                          started
  postgres             inf05                          started

# ps -fea | grep rg
root      6362     1  0 Oct03 ?        00:00:00 clurgmgrd
root      6364  6362  0 Oct03 ?        00:03:49 clurgmgrd
root     19138  7151  0 09:07 pts/1    00:00:00 tail /var/log/rgmanager
-f -n5000
root      7073  6954  0 20:28 pts/3    00:00:00 grep rg

# ps -fea | grep rg
root      6362     1  0 Oct03 ?        00:00:00 clurgmgrd
root      6364  6362  0 Oct03 ?        00:03:49 clurgmgrd
root     19138  7151  0 09:07 pts/1    00:00:00 tail /var/log/rgmanager
-f -n5000
root      7073  6954  0 20:28 pts/3    00:00:00 grep rg

The same information is displayed on both nodes.

Our version is:

# clustat -v
clustat version 1.9.53
Connected via: CMAN/SM Plugin v1.1.7.1

Any ideas?

Thanks,
Jordi


Lon Hohberger wrote:
> On Wed, 2006-10-04 at 09:51 +0200, Jordi Prats wrote:
>> Thanks,
>> It goes to failed state. Today I've been reloacating services but it 
>> fails to relocate (is not going to failed state):
>>
>> Oct  4 09:07:49 inf04 clurgmgrd[6299]: <notice> Service projectes is stopped
>> Oct  4 09:07:49 inf04 clurgmgrd[6299]: <warning> #70: Attempting to 
>> restart service projectes locally.
>> Oct  4 09:07:49 inf04 clurgmgrd[6299]: <notice> Starting stopped service 
>> projectes
>>
>> On the other node log do not appear anything related to the relocation. 
>> Maybe is a communication problem between nodes?
> 
> It sounds like rgmanager isn't running on the other node or something;
> check /proc/cluster/services? 
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 

-- 
......................................................................
        __
       / /          Jordi Prats Catal?
 C E / S / C A      Departament de Sistemes
     /_/            Centre de Supercomputaci? de Catalunya

 Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
 T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
......................................................................
pgp:0x5D0D1321
......................................................................


From lhh at redhat.com  Thu Oct  5 16:25:29 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 05 Oct 2006 12:25:29 -0400
Subject: [Linux-cluster] Xen virtual machine fencing
Message-ID: <1160065529.18145.12.camel@rei.boston.devel.redhat.com>

Hi,

I committed an updated agent for fencing Xen virtual machines to CVS.
It allows fencing of any virtual machine from any other host in the
cluster, and handles the case where the VM no longer exists.  Note that
there is no 'on' function mostly due to the fact that it would require a
lot of configuration knowledge about the VM which is currently not
available.

The README is not 100% complete, and neither are any of the features
mentioned in TODO. ;)

Basically, here's how to get it running:

  - build (requires nss, openais, cman, & nspr development stuff)
  - install openais + cman
  - generate a key file (e.g. dd if=/dev/urandom
    of=/etc/cluster/fence_xvm.key bs=4096 count=1)
  - scp /etc/cluster/fence_xvm.key to all dom0 cluster nodes.
  - start cman
  - start fence_xvmd with whatever options you like on all members of
    the dom0 cluster (must be started with same options cluster-wide
  - start domU nodes
  - scp /etc/cluster/fence_xvm.key to all domU machines.
  - install fence_xvm on domU nodes
  - fence_xvm -H <name of domU>  || fence_xvm -u -H <uuid>
    (boom)

If anyone wants to take up the ball on anything in the TODO, let me
know.  (If you want to implement the SSL part, you need to use the
nss/nspr libraries, and NOT openssl, due to licensing and other
reasons).

-- Lon


From lhh at redhat.com  Thu Oct  5 19:18:29 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 05 Oct 2006 15:18:29 -0400
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <4523FED1.9010404@cesca.es>
References: <452200C5.6080406@cesca.es>
	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>
	<45236805.60709@cesca.es>
	<1159981194.12856.14.camel@rei.boston.devel.redhat.com>
	<4523FED1.9010404@cesca.es>
Message-ID: <1160075910.18145.18.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-04 at 20:34 +0200, Jordi Prats wrote:
> It appears to be running on both nodes:
> 
> # cat /proc/cluster/services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 run       -
> [1 2]
> 
> DLM Lock Space:  "Magma"                             3   4 run       -
> [1 2]
> 
> User:            "usrm::manager"                     2   3 run       -
> [1 2]
> 
> # ccs_tool lsnode
> 
> Cluster name: dades, config_version: 76

Could you post your service blob?  If you're using a script, it might
not be installed on the other node.  When you do a "relocate", does
anything appear in the logs on the other node?

-- Lon


From danwest at comcast.net  Thu Oct  5 19:35:14 2006
From: danwest at comcast.net (danwest)
Date: Thu, 05 Oct 2006 15:35:14 -0400
Subject: [Linux-cluster] qdiskd vote not represented by cman
Message-ID: <1160076914.3666.2.camel@belmont.site>


Shouldn?t we expect to see the qdisk votes reported in ?Total_votes?
from cman (see cman_tool below).  If we have four nodes with 1 vote each
and a qdisk configured with 1 vote (see cluster.conf snippet below)
shouldn?t we see a total vote count of 5?  With the qdisk config shown
below I would expect to be able to sustain a loss of 2 out of 4 nodes
and still have quorum but in fact the loss of 2 nodes dissolves quorum
every time, effectively locking the cluster.

 
Thanks,

 Dan

 
Node4:~ # cman_tool status

Protocol version: 5.0.1

Config version: 28

Cluster name: testcluster

Cluster ID: 26387

Cluster Member: Yes

Membership state: Cluster-Member

Nodes: 4

Expected_votes: 4

Total_votes: 4

Quorum: 3

Active subsystems: 8

Node name: node4

Node addresses: X.X.X.X

 
<clusternodes>

                <clusternode name="node1" votes="1" nodeid="1">

                        <fence>

                                <method name="single">

                                        <device name="blade_enclosure"
blade="1"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="node2" votes="1" nodeid="2">

                        <fence>

                                <method name="single">

                                        <device name="blade_enclosure"
blade="2"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="node3" votes="1" nodeid="3">

                        <fence>

                                <method name="single">

                                        <device name="blade_enclosure"
blade="3"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="node4" votes="1" nodeid="4">

                        <fence>

                                <method name="single">

                                        <device name="blade_enclosure"
blade="4"/>

                                </method>

                        </fence>

                </clusternode>

        </clusternodes>

        <cman expected_votes="4">

        </cman>

       <quorumd interval="1" tko="15" votes="1"
device="/dev/disk/by-name/360060480000190020528565454444341"

          status_file="/var/log/qdisk_status" log_level="7">

                <heuristic program="ping X.X.X.X -c1 -t1" score="1"
interval="2"/>

                <heuristic program="[ -d / ]" score="2" interval="2"/>

      </quorumd>

 
From isplist at logicore.net  Fri Oct  6 01:25:26 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 5 Oct 2006 20:25:26 -0500
Subject: [Linux-cluster] Cluster.conf
Message-ID: <2006105202526.771406@leena>

I've been messing with GFS for a while now, learning curve kinda high but 
slowly getting it. I'm now at the point where I need to fence things better so 
that reliability becomes better now that I'm getting closer to actually using 
this on something production.

Thing is, I've asked before about cluster.conf file building without replies 
so am still unsure about this part right now. I've seen countless variations, 
many with things I've not even seen before and for the most part, many seem to 
be custom made, like someone's recipe :).

So, the question remains... where can I find VERY good details and information 
that will help me understand the building of this file.

I'm now using a McData ED-5000 switch and need to make sure that fencing is 
working correctly. My (probably silly) cluster. conf file looks like;

<?xml version="1.0"?>
<cluster config_version="41" name="vgcomp">
    <fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
    <clusternode name="cweb92" nodeid="92" votes="1"/>
    <clusternode name="cweb93" nodeid="93" votes="1"/>
    <clusternode name="cweb94" nodeid="94" votes="1"/>
    <clusternode name="dev" nodeid="99" votes="1"/>
    <clusternode name="qm247" nodeid="247" votes="1"/>
    <clusternode name="qm248" nodeid="248" votes="1"/>
    <clusternode name="qm249" nodeid="249" votes="1"/>
    <clusternode name="qm250" nodeid="250" votes="1"/>
</clusternodes>
        <cman/>
<fencedevices>
    <fencedevice agent="fence_mcdata" ipaddr="x.x.x.x" login="xxxxx" 
name="ED5000" passwd="xxxxx"/>
</fencedevices>
        <rm>
                <failoverdomains/>
                <resources/>
        </rm>
</cluster>

Isn't there something missing for fencing in each clusternode line?

Note also that every time I start the cluster, I get quorum'd out until I log 
into another node and run cman_tool expected -e 1 to regain. I've seen a way 
to fix that in my travels but you think I can find it now? Nope :).

ANY help to make this man's crummy conf file work properly would be welcome.

Mike


From pcaulfie at redhat.com  Fri Oct  6 07:12:28 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 06 Oct 2006 08:12:28 +0100
Subject: [Linux-cluster] qdiskd vote not represented by cman
In-Reply-To: <1160076914.3666.2.camel@belmont.site>
References: <1160076914.3666.2.camel@belmont.site>
Message-ID: <452601DC.4030206@redhat.com>

danwest wrote:
> Shouldn?t we expect to see the qdisk votes reported in ?Total_votes?
> from cman (see cman_tool below). 

Yes. if the quorum disk is registered correctly with cman you should see the
votes it contributes and also it's "node name" in cman_tool nodes.


> If we have four nodes with 1 vote each
> and a qdisk configured with 1 vote (see cluster.conf snippet below)
> shouldn?t we see a total vote count of 5?  With the qdisk config shown
> below I would expect to be able to sustain a loss of 2 out of 4 nodes
> and still have quorum but in fact the loss of 2 nodes dissolves quorum
> every time, effectively locking the cluster.

-- 

patrick


From gwaters1 at csc.com  Fri Oct  6 12:03:00 2006
From: gwaters1 at csc.com (Grant Waters)
Date: Fri, 6 Oct 2006 13:03:00 +0100
Subject: [Linux-cluster] Fw: STONITH 
Message-ID: <OFF21F75A0.F25C0C23-ON802571FF.0041ED4C-802571FF.00424F12@csc.com>

 Forgot to say I also get the following msgs in syslog when I telnet to 
the NPS....

Oct  6 12:53:34 node1 cluquorumd[27339]: Cannot log into WTI 
Network/Telnet Power Switch.
Oct  6 12:53:34 node1 cluquorumd[27339]: <err> STONITH: Device at 
xx.xxx.xxx.xxx controlling node2-h FAILED status check: Bad configuration
Oct  6 12:53:47 node1 cluquorumd[2384]: <crit> Error returned from STONITH 
device(s) controlling node1-h. See system logs on node2-h for more 
information.

I obscured the IP address in there - but it is the correct address of the 
NPS.

What could this "Bad Config" be - is it the /etc/cluster.xml?
 
Regards,
GXW  :o)
----- Forwarded by Grant Waters/GIS/CSC on 06/10/2006 13:00 -----

Grant Waters/GIS/CSC 
06/10/2006 12:11

To
linux-cluster at redhat.com
cc

Subject
STONITH 


I had a quick search through your threads but couldn't find an exact hit 
which includes a resolution so I thought I'd try posting this here.

We have a two node RH ES 3.0 cluster which uses an MSA 500 G2 shared array 
with a single LUN, and a crossover cable set up as eth1 for heartbeat.
Both nodes are dual fed through an NPS power switch.

All works fine and has done for 18 months but we've had 2 outages recently 
where the following happens...

We appear to lose eth1, and the MSA 500 G2 starts timing out, and by the 
time I get in in the morning I can see errors on the MSA 500 G2 LCDs 
saying "43 REDUNDANCY FAILED" and "POWER OK" resepctively on the secondary 
and primary controllers.

Both servers are up, but the failover node appears to have been forcibly 
rebooted by STONITH, with 2 plugs in the NPS being turned off & on again.

This leaves neither node able to talk to the shared array, and the service 
down.

Powering cycling both nodes and the array fixes the problem, but I want to 
know whats causing it in the first place.  It doesn't appear to be related 
to load, although I can't rule that out - both outages were at approx 
04:40 on a Friday.

Here are the key msgs from syslog...

Sep 29 04:44:50 node1 kernel: tg3: eth1: Link is down.
Sep 29 04:44:51 node1 kernel: cciss: cmd f79252b0 timedout
.......~100 of these
Sep 29 04:44:51 node1 kernel: cciss: cmd f79216f8 timedout
Sep 29 04:44:53 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full 
duplex.
Sep 29 04:44:53 node1 kernel: tg3: eth1: Flow control is off for TX and 
off for RX.
Sep 29 04:45:03 node1 clumembd[2411]: <info> Membership View #3:0x00000001
Sep 29 04:45:04 node1 cluquorumd[2389]: <warning> --> Commencing STONITH 
<--
Sep 29 04:45:06 node1 cluquorumd[2389]: Power to NPS outlet(s) 6 turned 
/Off.
Sep 29 04:45:07 node1 kernel: tg3: eth1: Link is down.
Sep 29 04:45:08 node1 cluquorumd[2389]: Power to NPS outlet(s) 2 turned 
/Off.
Sep 29 04:45:08 node1 cluquorumd[2389]: <notice> STONITH: node2-h has been 
fenced!
Sep 29 04:45:10 node1 cluquorumd[2389]: Power to NPS outlet(s) 6 turned 
/On.
Sep 29 04:45:12 node1 cluquorumd[2389]: Power to NPS outlet(s) 2 turned 
/On.
Sep 29 04:45:12 node1 cluquorumd[2389]: <notice> STONITH: node2-h is no 
longer fenced off.
Sep 29 04:45:14 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full 
duplex.
Sep 29 04:45:14 node1 kernel: tg3: eth1: Flow control is off for TX and 
off for RX.
Sep 29 04:47:41 node1 kernel: tg3: eth1: Link is down.
Sep 29 04:47:44 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full 
duplex.
Sep 29 04:47:44 node1 kernel: tg3: eth1: Flow control is on for TX and on 
for RX.

I thought it would go again this morning so I turned up the cluster daemon 
loglevels, and unfortunately it didn't crash but I spotted this in the 
debug msgs....

Oct  6 04:39:31 node1 clulockd[2462]: <debug> ioctl(fd,SIOCGARP,ar 
[eth1]): No such device or address
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Connect: Member #1 
(192.168.100.101) [IPv4]
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Processing message on 11
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Received 188 bytes from peer
Oct  6 04:39:31 node1 clulockd[2462]: <debug> LOCK_LOCK | LOCK_TRYLOCK
Oct  6 04:39:31 node1 clulockd[2462]: <debug> lockd_trylock: member #1 
lock 0
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Replying ACK

The point is the cluster is working fine, and fails over and back fine.  I 
can telnet onto the NPS from both nodes so thats OK too.
As far as I can tell eth1 is set up OK, and working across 192.168 
addresses.

Any ideas where to start looking at this?

Regards,
GXW  :o)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061006/2b2d86d6/attachment.htm>

From ilya at nigma.ru  Fri Oct  6 08:53:33 2006
From: ilya at nigma.ru (Ilya M. Slepnev)
Date: Fri, 06 Oct 2006 12:53:33 +0400
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <2006105202526.771406@leena>
References: <2006105202526.771406@leena>
Message-ID: <1160124814.5597.3.camel@localhost.localdomain>

I'd like to know that also!-) I can't find a good manual, explaining me
how to write cluster.conf...

On Thu, 2006-10-05 at 20:25 -0500, isplist at logicore.net wrote:
> I've been messing with GFS for a while now, learning curve kinda high but 
> slowly getting it. I'm now at the point where I need to fence things better so 
> that reliability becomes better now that I'm getting closer to actually using 
> this on something production.
> 
> Thing is, I've asked before about cluster.conf file building without replies 
> so am still unsure about this part right now. I've seen countless variations, 
> many with things I've not even seen before and for the most part, many seem to 
> be custom made, like someone's recipe :).
> 
> So, the question remains... where can I find VERY good details and information 
> that will help me understand the building of this file.
> 
> I'm now using a McData ED-5000 switch and need to make sure that fencing is 
> working correctly. My (probably silly) cluster. conf file looks like;
> 
> <?xml version="1.0"?>
> <cluster config_version="41" name="vgcomp">
>     <fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
> <clusternodes>
>     <clusternode name="cweb92" nodeid="92" votes="1"/>
>     <clusternode name="cweb93" nodeid="93" votes="1"/>
>     <clusternode name="cweb94" nodeid="94" votes="1"/>
>     <clusternode name="dev" nodeid="99" votes="1"/>
>     <clusternode name="qm247" nodeid="247" votes="1"/>
>     <clusternode name="qm248" nodeid="248" votes="1"/>
>     <clusternode name="qm249" nodeid="249" votes="1"/>
>     <clusternode name="qm250" nodeid="250" votes="1"/>
> </clusternodes>
>         <cman/>
> <fencedevices>
>     <fencedevice agent="fence_mcdata" ipaddr="x.x.x.x" login="xxxxx" 
> name="ED5000" passwd="xxxxx"/>
> </fencedevices>
>         <rm>
>                 <failoverdomains/>
>                 <resources/>
>         </rm>
> </cluster>
> 
> Isn't there something missing for fencing in each clusternode line?
> 
> Note also that every time I start the cluster, I get quorum'd out until I log 
> into another node and run cman_tool expected -e 1 to regain. I've seen a way 
> to fix that in my travels but you think I can find it now? Nope :).
> 
> ANY help to make this man's crummy conf file work properly would be welcome.
> 
> Mike
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 191 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061006/78fb9427/attachment.sig>

From gwaters1 at csc.com  Fri Oct  6 11:10:38 2006
From: gwaters1 at csc.com (Grant Waters)
Date: Fri, 6 Oct 2006 12:10:38 +0100
Subject: [Linux-cluster] STONITH 
Message-ID: <OF4F032FF3.3522028D-ON802571FF.003B27EC-802571FF.003D83C8@csc.com>

I had a quick search through your threads but couldn't find an exact hit 
which includes a resolution so I thought I'd try posting this here.

We have a two node RH ES 3.0 cluster which uses an MSA 500 G2 shared array 
with a single LUN, and a crossover cable set up as eth1 for heartbeat.
Both nodes are dual fed through an NPS power switch.

All works fine and has done for 18 months but we've had 2 outages recently 
where the following happens...

We appear to lose eth1, and the MSA 500 G2 starts timing out, and by the 
time I get in in the morning I can see errors on the MSA 500 G2 LCDs 
saying "43 REDUNDANCY FAILED" and "POWER OK" resepctively on the secondary 
and primary controllers.

Both servers are up, but the failover node appears to have been forcibly 
rebooted by STONITH, with 2 plugs in the NPS being turned off & on again.

This leaves neither node able to talk to the shared array, and the service 
down.

Powering cycling both nodes and the array fixes the problem, but I want to 
know whats causing it in the first place.  It doesn't appear to be related 
to load, although I can't rule that out - both outages were at approx 
04:40 on a Friday.

Here are the key msgs from syslog...

Sep 29 04:44:50 node1 kernel: tg3: eth1: Link is down.
Sep 29 04:44:51 node1 kernel: cciss: cmd f79252b0 timedout
.......~100 of these
Sep 29 04:44:51 node1 kernel: cciss: cmd f79216f8 timedout
Sep 29 04:44:53 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full 
duplex.
Sep 29 04:44:53 node1 kernel: tg3: eth1: Flow control is off for TX and 
off for RX.
Sep 29 04:45:03 node1 clumembd[2411]: <info> Membership View #3:0x00000001
Sep 29 04:45:04 node1 cluquorumd[2389]: <warning> --> Commencing STONITH 
<--
Sep 29 04:45:06 node1 cluquorumd[2389]: Power to NPS outlet(s) 6 turned 
/Off.
Sep 29 04:45:07 node1 kernel: tg3: eth1: Link is down.
Sep 29 04:45:08 node1 cluquorumd[2389]: Power to NPS outlet(s) 2 turned 
/Off.
Sep 29 04:45:08 node1 cluquorumd[2389]: <notice> STONITH: node2-h has been 
fenced!
Sep 29 04:45:10 node1 cluquorumd[2389]: Power to NPS outlet(s) 6 turned 
/On.
Sep 29 04:45:12 node1 cluquorumd[2389]: Power to NPS outlet(s) 2 turned 
/On.
Sep 29 04:45:12 node1 cluquorumd[2389]: <notice> STONITH: node2-h is no 
longer fenced off.
Sep 29 04:45:14 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full 
duplex.
Sep 29 04:45:14 node1 kernel: tg3: eth1: Flow control is off for TX and 
off for RX.
Sep 29 04:47:41 node1 kernel: tg3: eth1: Link is down.
Sep 29 04:47:44 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full 
duplex.
Sep 29 04:47:44 node1 kernel: tg3: eth1: Flow control is on for TX and on 
for RX.

I thought it would go again this morning so I turned up the cluster daemon 
loglevels, and unfortunately it didn't crash but I spotted this in the 
debug msgs....

Oct  6 04:39:31 node1 clulockd[2462]: <debug> ioctl(fd,SIOCGARP,ar 
[eth1]): No such device or address
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Connect: Member #1 
(192.168.100.101) [IPv4]
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Processing message on 11
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Received 188 bytes from peer
Oct  6 04:39:31 node1 clulockd[2462]: <debug> LOCK_LOCK | LOCK_TRYLOCK
Oct  6 04:39:31 node1 clulockd[2462]: <debug> lockd_trylock: member #1 
lock 0
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Replying ACK

The point is the cluster is working fine, and fails over and back fine.  I 
can telnet onto the NPS from both nodes so thats OK too.
As far as I can tell eth1 is set up OK, and working across 192.168 
addresses.

Any ideas where to start looking at this?

Regards,
GXW  :o)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061006/2a51f4a7/attachment.htm>

From eric at bootseg.com  Fri Oct  6 14:43:21 2006
From: eric at bootseg.com (Eric Kerin)
Date: Fri, 06 Oct 2006 10:43:21 -0400
Subject: [Linux-cluster] STONITH
In-Reply-To: <OF4F032FF3.3522028D-ON802571FF.003B27EC-802571FF.003D83C8@csc.com>
References: <OF4F032FF3.3522028D-ON802571FF.003B27EC-802571FF.003D83C8@csc.com>
Message-ID: <45266B89.80705@bootseg.com>

Grant Waters wrote:
>
> I had a quick search through your threads but couldn't find an exact 
> hit which includes a resolution so I thought I'd try posting this here.
> We appear to lose eth1, and the MSA 500 G2 starts timing out, and by 
> the time I get in in the morning I can see errors on the MSA 500 G2 
> LCDs saying "43 REDUNDANCY FAILED" and "POWER OK" resepctively on the 
> secondary and primary controllers.
I've had the same problem, although I'm running RHEL 4.  About 3 times 
in the last year I've had failures where the nodes can no longer access 
the MSA 500 G2, with the same errors shown on the controllers.

Each time HP has told me to "Upgrade the firmware"  (and this is over 
the course of a year or more at this point).  Since the problem only 
happens every few months, by the time it happens again HP has a new 
firmware release out and they tell me to upgrade again.

Not much help to fix your fencing problem.  But since your MSA 500 G2 
problem is the same as mine, I figured it was worth a mention.

Thanks,
Eric Kerin


From filipe.miranda at gmail.com  Fri Oct  6 14:50:54 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Fri, 6 Oct 2006 11:50:54 -0300
Subject: [Linux-cluster] IPMI IBM x366 basics
Message-ID: <a6d13c780610060750j3ee8d190nc1b5a3a37b44190e@mail.gmail.com>

Hello there,

I am tryingo to implement a 6 nodes Cluster using RHEL4 + GFS.
We will use the IPMI as our fence device since we are using IBM x366
machines, they are IPMI compliant.
Could someone help me out on how to setup tihs machines to use this IPMI
functionaliy>
Does it have a special NIC for it>
This NIC must me connected to the same physical network as the private
network between these nodes to communicate right>
You I get the some IPMI 101 basics>


Thank you,

-- 
---
Filipe T Miranda
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061006/69dc09ac/attachment.htm>

From cjk at techma.com  Fri Oct  6 15:57:03 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Fri, 6 Oct 2006 11:57:03 -0400
Subject: =?us-ascii?Q?RE:_=5BLinux-cluster=5D_STONITH=20?=
In-Reply-To: <OF4F032FF3.3522028D-ON802571FF.003B27EC-802571FF.003D83C8@csc.com>
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079F18@tmaemail.techma.com>

What exactly do you mean by outage? Power outage? If so, power for what?
Just network gear? As far as I
know the MSA500 shouldn't "timeout" it's a hard scsi connection thats not in
any way network dependant. I 
probably missed something but I'm not clear on your description of what
happened. If it's scsi timeoutes, then
see below about profiles.
 
The MSA will not failover correctly under Linux unless the "profile" for the
connections defined in the controllers
are set up correctly. Even if it's been done in the past, check it again.
I've had the profile setting reset to the 
defaults after updating firmware. Even then, there needs to be I/O going down
the pipe in order for the controllers
to failover correctly.
 
If everything went down, then I can almost gaurentee that the nodes came back
online before the MSA was
operational again. These they're pretty slow booting and I'd bet just about
any computer will boot way before
the MSA will and this not be able to see any of the devices it presents. A
reboot of the nodes then fixes that 
problem.
 
Aside from all of this, you probably need to figure out why the primary
controller failed in the first place. The fact
that the redundancy failed on you is not good. Sounds like it failed over but
you likely have other issues that are
preventing the device paths from being maintained.
 
FInally, if all else is good, try forcelby failing the controllers over by
pulling the active one out and see how long it
takes to recover. Then set your heartbeat tineout slightly longer than that
value.
 
 
Hope the ramble helps.
 
 
Corey
 
 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Grant Waters
Sent: Friday, October 06, 2006 7:11 AM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] STONITH 


I had a quick search through your threads but couldn't find an exact hit
which includes a resolution so I thought I'd try posting this here. 

We have a two node RH ES 3.0 cluster which uses an MSA 500 G2 shared array
with a single LUN, and a crossover cable set up as eth1 for heartbeat. 
Both nodes are dual fed through an NPS power switch. 

All works fine and has done for 18 months but we've had 2 outages recently
where the following happens... 

We appear to lose eth1, and the MSA 500 G2 starts timing out, and by the time
I get in in the morning I can see errors on the MSA 500 G2 LCDs saying "43
REDUNDANCY FAILED" and "POWER OK" resepctively on the secondary and primary
controllers. 

Both servers are up, but the failover node appears to have been forcibly
rebooted by STONITH, with 2 plugs in the NPS being turned off & on again. 

This leaves neither node able to talk to the shared array, and the service
down. 

Powering cycling both nodes and the array fixes the problem, but I want to
know whats causing it in the first place.  It doesn't appear to be related to
load, although I can't rule that out - both outages were at approx 04:40 on a
Friday. 

Here are the key msgs from syslog... 

Sep 29 04:44:50 node1 kernel: tg3: eth1: Link is down. 
Sep 29 04:44:51 node1 kernel: cciss: cmd f79252b0 timedout 
.......~100 of these 
Sep 29 04:44:51 node1 kernel: cciss: cmd f79216f8 timedout 
Sep 29 04:44:53 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full
duplex. 
Sep 29 04:44:53 node1 kernel: tg3: eth1: Flow control is off for TX and off
for RX. 
Sep 29 04:45:03 node1 clumembd[2411]: <info> Membership View #3:0x00000001 
Sep 29 04:45:04 node1 cluquorumd[2389]: <warning> --> Commencing STONITH <-- 
Sep 29 04:45:06 node1 cluquorumd[2389]: Power to NPS outlet(s) 6 turned /Off.

Sep 29 04:45:07 node1 kernel: tg3: eth1: Link is down. 
Sep 29 04:45:08 node1 cluquorumd[2389]: Power to NPS outlet(s) 2 turned /Off.

Sep 29 04:45:08 node1 cluquorumd[2389]: <notice> STONITH: node2-h has been
fenced! 
Sep 29 04:45:10 node1 cluquorumd[2389]: Power to NPS outlet(s) 6 turned /On. 
Sep 29 04:45:12 node1 cluquorumd[2389]: Power to NPS outlet(s) 2 turned /On. 
Sep 29 04:45:12 node1 cluquorumd[2389]: <notice> STONITH: node2-h is no
longer fenced off. 
Sep 29 04:45:14 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full
duplex. 
Sep 29 04:45:14 node1 kernel: tg3: eth1: Flow control is off for TX and off
for RX. 
Sep 29 04:47:41 node1 kernel: tg3: eth1: Link is down. 
Sep 29 04:47:44 node1 kernel: tg3: eth1: Link is up at 1000 Mbps, full
duplex. 
Sep 29 04:47:44 node1 kernel: tg3: eth1: Flow control is on for TX and on for
RX. 

I thought it would go again this morning so I turned up the cluster daemon
loglevels, and unfortunately it didn't crash but I spotted this in the debug
msgs.... 

Oct  6 04:39:31 node1 clulockd[2462]: <debug> ioctl(fd,SIOCGARP,ar [eth1]):
No such device or address 
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Connect: Member #1
(192.168.100.101) [IPv4] 
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Processing message on 11 
Oct  6 04:39:31 node1 clulockd[2462]: <debug> Received 188 bytes from peer 
Oct  6 04:39:31 node1 clulockd[2462]: <debug> LOCK_LOCK | LOCK_TRYLOCK 
Oct  6 04:39:31 node1 clulockd[2462]: <debug> lockd_trylock: member #1 lock 0

Oct  6 04:39:31 node1 clulockd[2462]: <debug> Replying ACK 

The point is the cluster is working fine, and fails over and back fine.  I
can telnet onto the NPS from both nodes so thats OK too. 
As far as I can tell eth1 is set up OK, and working across 192.168 addresses.


Any ideas where to start looking at this? 

Regards,
GXW  :o)


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061006/6493031d/attachment.htm>

From jos at xos.nl  Fri Oct  6 17:05:11 2006
From: jos at xos.nl (Jos Vos)
Date: Fri, 6 Oct 2006 19:05:11 +0200
Subject: [Linux-cluster] IPMI IBM x366 basics
In-Reply-To: <a6d13c780610060750j3ee8d190nc1b5a3a37b44190e@mail.gmail.com>;
	from filipe.miranda@gmail.com on Fri, Oct 06, 2006 at 11:50:54AM
	-0300
References: <a6d13c780610060750j3ee8d190nc1b5a3a37b44190e@mail.gmail.com>
Message-ID: <20061006190511.B5863@xos037.xos.nl>

On Fri, Oct 06, 2006 at 11:50:54AM -0300, Filipe Miranda wrote:

> We will use the IPMI as our fence device since we are using IBM x366
> machines, they are IPMI compliant.
> Could someone help me out on how to setup tihs machines to use this IPMI
> functionaliy>
> Does it have a special NIC for it>

AFAIK, the first NIC of an x366 can act in dual-mode: as IPMI device,
with its own IP address, and as a normal NIC in Linux, with it own address.
Both have their own MAC address.

> I am tryingo to implement a 6 nodes Cluster using RHEL4 + GFS.
> This NIC must me connected to the same physical network as the private
> network between these nodes to communicate right>
> You I get the some IPMI 101 basics>

When you share the first NIC for IPMI and "normal" use, you need to
use the same switch.  If possible, I'd choose to use this NIC
exclusively for IPMI, connected to a dedicated switch for.

In RHCS, just configure that card as "IPMI Lan", with its IP address,
user and password (all changeable from the x366 BIOS) and "password"
as authentication type.
 
-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From dist-list at LEXUM.UMontreal.CA  Fri Oct  6 17:37:39 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Fri, 06 Oct 2006 13:37:39 -0400
Subject: [Linux-cluster] clustering and web throttling/quotas ?
Message-ID: <45269463.6030307@lexum.umontreal.ca>

Hello,
We are using director in front of several web servers.

I'm looking a way to block web client based on download quota , etc ?

I know mod_cband but in a cluster/webfarm setup it does not seems to be
the soltution


From adas at redhat.com  Fri Oct  6 19:56:29 2006
From: adas at redhat.com (Abhijith Das)
Date: Fri, 06 Oct 2006 14:56:29 -0500
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <4523A637.1060706@fib.upc.edu>
References: <4523A637.1060706@fib.upc.edu>
Message-ID: <4526B4ED.9050907@redhat.com>

sandra-llistes wrote:

> Hi,
>
> I sent a mail a few days ago to this list related with GFS+samba 
> problems.
>
> Since the, we have installed a sepparated test environment also with 
> two linux servers where we  have tested a samba server with an 
> exported share in GFS. The share is read-only and only one server is 
> exporting it.
>
> When we try to access from a single windows client it works fine, but 
> when we try to access to the same file from 2 or more windows clients 
> simoultaneously, windows hangs and samba also does. This seems not to 
> happen with concurrent access to different files or with linux clients.
>
> We've also tested to export the same share without GFS and in this 
> case it works fine.
>
> It seems to be a locking problem with samba, GFS and windows clients. 
> Does any of you have experienced similar problems? Do you have any 
> suggestion about this?
>
> Following is the share configuration in smb.conf:
>
> [public]
>   comment         = ShareGFS
>   path            = /public
>   writeable       = No
>   read only       = Yes
>   write list      = @admsamba
>   force group     = admsamba
>   create mask     = 0775
>   directory mask  = 0775
>   oplocks         = No
>   locking = Yes
>   strict locking = Yes
> # I proved with locking/Strick locking=Yes and No. Always happens the 
> same problem
>
> I attach some samba logs (Level 3).
> Software Versions:
>     Fedora 5
>     Samba 3.0.23
>     GFS  6.1.5
>     kernel 2.6.17-1.2187_FC5
>
> Any help will be appreciated.
>
> Sandra Hernandez

Hi Sandra,
I'm not very familiar with the locking of samba, but I did try the 
scenario you described on my test cluster. I'm unable to reproduce your 
problem. I have an identical smb.conf as you've pasted above. Accessing 
(reading a txt file, or playing a video clip) from two windows clients 
simultaneously works just fine without any glitches.
If I understood it right, the test case you describe has one node in a 
cluster exporting a single samba share over a GFS filesystem and you're 
using multiple windows clients to access the same file in this share. 
This is a fairly basic operation IMO and it is quite odd that you should 
see this failure. Maybe you can try the CVS version of cluster suite 
(cvs -d :pserver:cvs at sources.redhat.com:/cvs/cluster checkout -r RHEL4 
cluster) to see if the problem persists. Also, I'd be interested in 
knowing the behavior when you mount GFS on only one node (the one that's 
exporting) and also when you use GFS with lock_nolock on a standalone 
machine.
Thanks,
--Abhi


From filipe.miranda at gmail.com  Sat Oct  7 00:05:07 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Fri, 6 Oct 2006 21:05:07 -0300
Subject: [Linux-cluster] IPMI IBM x366 basics
In-Reply-To: <20061006190511.B5863@xos037.xos.nl>
References: <a6d13c780610060750j3ee8d190nc1b5a3a37b44190e@mail.gmail.com>
	<20061006190511.B5863@xos037.xos.nl>
Message-ID: <a6d13c780610061705m4880dac7m19deb01df28bf112@mail.gmail.com>

Jos,

Thanks a lot,

I entered the Bios setup and found out the BMC configurantion, where I set
the IP address and User+password information.
It worked just fine, thanks for the hint!!!

Regards,
Filipe Miranda

On 10/6/06, Jos Vos <jos at xos.nl> wrote:
>
> On Fri, Oct 06, 2006 at 11:50:54AM -0300, Filipe Miranda wrote:
>
> > We will use the IPMI as our fence device since we are using IBM x366
> > machines, they are IPMI compliant.
> > Could someone help me out on how to setup tihs machines to use this IPMI
> > functionaliy>
> > Does it have a special NIC for it>
>
> AFAIK, the first NIC of an x366 can act in dual-mode: as IPMI device,
> with its own IP address, and as a normal NIC in Linux, with it own
> address.
> Both have their own MAC address.
>
> > I am tryingo to implement a 6 nodes Cluster using RHEL4 + GFS.
> > This NIC must me connected to the same physical network as the private
> > network between these nodes to communicate right>
> > You I get the some IPMI 101 basics>
>
> When you share the first NIC for IPMI and "normal" use, you need to
> use the same switch.  If possible, I'd choose to use this NIC
> exclusively for IPMI, connected to a dedicated switch for.
>
> In RHCS, just configure that card as "IPMI Lan", with its IP address,
> user and password (all changeable from the x366 BIOS) and "password"
> as authentication type.
>
> --
> --    Jos Vos <jos at xos.nl>
> --    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
> --    Amsterdam, The Netherlands        |     Fax: +31 20 6948204
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061006/8277ac26/attachment.htm>

From zvedavec at gmail.com  Sat Oct  7 14:42:35 2006
From: zvedavec at gmail.com (Zvedavec)
Date: Sat, 7 Oct 2006 16:42:35 +0200
Subject: [Linux-cluster] NFS problem ?
Message-ID: <d040935f0610070742n65bca0bbvc7c800f156a85a25@mail.gmail.com>

Dear all

I build a small linux cluster. Based on Fedora Core 5 at master node.
Hardware of the nodes and master are identical identicky -
mainborards: Asus M2NPV-MX
:http://support.asus.com/download/do...&model=M2NPV-MX

During instalation/configuration I did ->>
Step by step :
1. DHCP - working well, give correct IPs to nodes
2. TFTP - file with kernel is loaded to node and boot start
3. NFS - working at master, but during the booting of system I deal
with the problem with mounting NFS rootimage.

I try also SLIM http://slim.cs.hku.hk/vmware/index.html instalation.
Everything is almost same like in my instalation. Everything working
to last screen of instalation guide, but after same error like in my
isnataltion: Error mesage:

Error, fail to mount system image!
It may be due to the network driver fail to load,
NFS server export incorrect or
network failure.

We try many thing to fix it and finaly we find that when we compile
kernel with parameter

CONFIG_ROOT_NFS: Y

It helps a litle bit. We can reach login screen. But during booting we still
have a lot of error messages that system is just readable.

With CD and live distro on node everything work well, I can mount nfs
disk from master.

Thanks for any advice/solution/anything what help. Thank you.

best regards, Skeptik


From jos at xos.nl  Sun Oct  8 13:04:08 2006
From: jos at xos.nl (Jos Vos)
Date: Sun, 08 Oct 2006 15:04:08 +0200
Subject: [Linux-cluster] Distributing cluster.conf with <quorumd> tag
Message-ID: <200610081304.k98D48Z32299@xos037.xos.nl>

Hi,

After manually editting cluster.conf to add a <quorumd> entry, it seems
to be impossible to distribute the config it to the other nodes using
system-config-cluster, because then a new version without the <quorumd>
entry is distributed.

Is there a way to distribute a cluster.conf *with* <quorumd> using ccsd
to all cluster nodes?

Thanks,

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jos at xos.nl  Sun Oct  8 17:10:43 2006
From: jos at xos.nl (Jos Vos)
Date: Sun, 08 Oct 2006 19:10:43 +0200
Subject: [Linux-cluster] Quorum partition size requirements?
Message-ID: <200610081710.k98HAhd02563@xos037.xos.nl>

Hi,

What are the size requirements for a quorum disk (RHEL4 U4 qdisk)?

Mkqdisk seems to write a fixed amount of status blocks (always for
16 nodes) and it doesn't complain when running mkqdisk on just 1 MB,
so I guess the needs are minimal, but I wnat to be sure.

(Back in RHEL 2.1 the old-style quorum disk needed to be 10+ MB.)

Thanks,

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jprats at cesca.es  Mon Oct  9 06:57:54 2006
From: jprats at cesca.es (Jordi Prats)
Date: Mon, 09 Oct 2006 08:57:54 +0200
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <1160075910.18145.18.camel@rei.boston.devel.redhat.com>
References: <452200C5.6080406@cesca.es>	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>	<45236805.60709@cesca.es>	<1159981194.12856.14.camel@rei.boston.devel.redhat.com>	<4523FED1.9010404@cesca.es>
	<1160075910.18145.18.camel@rei.boston.devel.redhat.com>
Message-ID: <4529F2F2.3010903@cesca.es>

Hi,
I'm attaching to you my services configuration. If I disable a service 
on node1 and enable it on node2, it succesfully runs on the other node. 
So, aparently all scripts are installed on both nodes and functional.

Relocating a service do not appear nothing on the other node's log. So, 
must be a communications problem. Where can I start to search any 
problem related to this? Network seems to be ok, and I can do ssh 
between nodes. Sending pings with mtr -i 0.01 does not loose any packet.

Thanks,

Services:
                <service autostart="1" domain="projectes" name="projectes">
                        <fs ref="NFS_Ordal">
                                <fs device="/dev/vg_ordal/tdx" 
force_unmount="1" fstype="ext3" mountpoint="/projectes2/tdx" name="TDX" 
options="">
                                        <fs 
device="/dev/padicat/dades.padicat" force_fsck="0" force_unmount="1" 
fstype="ext3" mountpoint="/projectes2/padicat" name="Dades padicat" 
options="" self_fence="0">
                                                <ip ref="192.168.12.203">
                                                        <script 
file="/etc/init.d/add.nfs.projectes" name="add.nfs.projectes"/>
                                                </ip>
                                        </fs>
                                </fs>
                        </fs>
                </service>
                <service autostart="1" domain="local" name="local">
                        <fs ref="NFS_usr_local">
                                <ip ref="192.168.12.204">
                                        <script 
file="/etc/init.d/add.nfs.local" name="add.nfs.local"/>
                                </ip>
                        </fs>
                </service>
                <service autostart="1" domain="mysql" exclusive="1" 
name="mysql">
                        <ip ref="192.168.12.201">
                                <fs device="/dev/vg_bbdd/mysql01" 
force_unmount="1" fstype="ext3" mountpoint="/CLUSTER/MySQL/mysql-5.0.20" 
name="mysql" options="">
                                        <script 
file="/etc/init.d/mysqld" name="mysql"/>
                                </fs>
                        </ip>
                </service>
                <service autostart="1" domain="postgres" name="postgres">
                        <fs ref="Postgres">
                                <ip ref="192.168.12.202">
                                        <script ref="init.postgres"/>
                                </ip>
                        </fs>
                </service>


Lon Hohberger wrote:
> On Wed, 2006-10-04 at 20:34 +0200, Jordi Prats wrote:
>   
>> It appears to be running on both nodes:
>>
>> # cat /proc/cluster/services
>> Service          Name                              GID LID State     Code
>> Fence Domain:    "default"                           1   2 run       -
>> [1 2]
>>
>> DLM Lock Space:  "Magma"                             3   4 run       -
>> [1 2]
>>
>> User:            "usrm::manager"                     2   3 run       -
>> [1 2]
>>
>> # ccs_tool lsnode
>>
>> Cluster name: dades, config_version: 76
>>     
>
> Could you post your service blob?  If you're using a script, it might
> not be installed on the other node.  When you do a "relocate", does
> anything appear in the logs on the other node?
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>   


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From hirantha at vcs.informatics.lk  Mon Oct  9 09:47:53 2006
From: hirantha at vcs.informatics.lk (Hirantha Wijayawardena)
Date: Mon, 9 Oct 2006 15:17:53 +0530
Subject: [Linux-cluster] Quorum disk: Can it be a partition?
In-Reply-To: <45211826.1050703@redhat.com>
Message-ID: <20061009094721.65AAB27C57@ux-mail.informatics.lk>

Hi Bob,

Just another quick question;
Which disk/partition should be elected as quorum disk/partition? Either
shared storage or individual nodes?

Thanks in advance

- Hirantha

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-
> bounces at redhat.com] On Behalf Of Robert Peterson
> Sent: Monday, October 02, 2006 7:16 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Quorum disk: Can it be a partition?
> 
> Troels Arvin wrote:
> > Hello,
> >
> > About the new quorum disk feature in RH Cluster Suite Update 4:
> > Does it have to be a complete, dedicated disk, or is it just as fine to
> > use a disk partition?
> Hi Troels,
> 
> It doesn't have to be a disk; a partition is just fine.
> http://sources.redhat.com/cluster/faq.html#quorum
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From jos at xos.nl  Mon Oct  9 09:59:27 2006
From: jos at xos.nl (Jos Vos)
Date: Mon, 9 Oct 2006 11:59:27 +0200
Subject: [Linux-cluster] Quorum disk: Can it be a partition?
In-Reply-To: <20061009094721.65AAB27C57@ux-mail.informatics.lk>;
	from hirantha@vcs.informatics.lk on Mon, Oct 09, 2006 at
	03:17:53PM +0530
References: <45211826.1050703@redhat.com>
	<20061009094721.65AAB27C57@ux-mail.informatics.lk>
Message-ID: <20061009115927.C7198@xos037.xos.nl>

On Mon, Oct 09, 2006 at 03:17:53PM +0530, Hirantha Wijayawardena wrote:

> Just another quick question;
> Which disk/partition should be elected as quorum disk/partition? Either
> shared storage or individual nodes?

Shared storage.  The idea of the quorum disk is that others can see that
you still write something to the storage (and thus are still alive) :-).

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From sara_sodagar at yahoo.com  Mon Oct  9 11:55:31 2006
From: sara_sodagar at yahoo.com (sara sodagar)
Date: Mon, 9 Oct 2006 04:55:31 -0700 (PDT)
Subject: [Linux-cluster] heartbeat network
Message-ID: <20061009115531.50994.qmail@web31802.mail.mud.yahoo.com>

We want to implement a web hosting service using
RHEL4. The idea is to load balance several web servers
using a hardware network load balancer and use GFS as
the shared storage. As I know we have to install
redhat cluster suit and redhat GFS to implement shared
storage. My question is : 

 
1)       If we don???t want to set up cluster service,
and just want the shared storage (GFS), do we still
need heartbeat network? 

 
__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From sara_sodagar at yahoo.com  Mon Oct  9 11:57:21 2006
From: sara_sodagar at yahoo.com (sara sodagar)
Date: Mon, 9 Oct 2006 04:57:21 -0700 (PDT)
Subject: [Linux-cluster] heartbeat network
Message-ID: <20061009115721.863.qmail@web31811.mail.mud.yahoo.com>

We want to implement a web hosting service using
RHEL4. The idea is to load balance several web servers
using a hardware network load balancer and use GFS as
the shared storage. As I know we have to install
redhat cluster suit and redhat GFS to implement shared
storage. My question is : 

1)If we don???t want to set up cluster service,and
just want the shared storage (GFS), do we still need
heartbeat network? 

I would be grateful if anyone could help me regarding
this issue.

--Regards
Sara 

 
__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From jprats at cesca.es  Mon Oct  9 14:29:39 2006
From: jprats at cesca.es (Jordi Prats)
Date: Mon, 09 Oct 2006 16:29:39 +0200
Subject: [Linux-cluster] LVS port translation
Message-ID: <452A5CD3.60500@cesca.es>

Hi all,
It's possible to configure piranha to allow to define a different port 
on real servers using NAT?

Using ipvsadm would be:

/sbin/ipvsadm -a -t 123.456.789.1:80 -r 192.168.10.1:82 -m -w 1

On the current version is not possible, there is any patch to allow 
this? If there is not, I could do it. Would it be accepted as a 
contribution on main distribution? Witch roules would I have to follow?

Thanks,

-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From rpeterso at redhat.com  Mon Oct  9 15:11:55 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Mon, 09 Oct 2006 10:11:55 -0500
Subject: [Linux-cluster] NFS problem ?
In-Reply-To: <d040935f0610070742n65bca0bbvc7c800f156a85a25@mail.gmail.com>
References: <d040935f0610070742n65bca0bbvc7c800f156a85a25@mail.gmail.com>
Message-ID: <452A66BB.7080005@redhat.com>

Zvedavec wrote:
> Dear all
>
> I build a small linux cluster. Based on Fedora Core 5 at master node.
> Hardware of the nodes and master are identical identicky -
> mainborards: Asus M2NPV-MX
> :http://support.asus.com/download/do...&model=M2NPV-MX
>
> During instalation/configuration I did ->>
> Step by step :
> 1. DHCP - working well, give correct IPs to nodes
> 2. TFTP - file with kernel is loaded to node and boot start
> 3. NFS - working at master, but during the booting of system I deal
> with the problem with mounting NFS rootimage.
>
> I try also SLIM http://slim.cs.hku.hk/vmware/index.html instalation.
> Everything is almost same like in my instalation. Everything working
> to last screen of instalation guide, but after same error like in my
> isnataltion: Error mesage:
>
> Error, fail to mount system image!
> It may be due to the network driver fail to load,
> NFS server export incorrect or
> network failure.
>
> We try many thing to fix it and finaly we find that when we compile
> kernel with parameter
>
> CONFIG_ROOT_NFS: Y
>
> It helps a litle bit. We can reach login screen. But during booting we 
> still
> have a lot of error messages that system is just readable.
>
> With CD and live distro on node everything work well, I can mount nfs
> disk from master.
>
> Thanks for any advice/solution/anything what help. Thank you.
>
> best regards, Skeptik
Hi Skeptik,

The problem of not being about to mount the root partition might be security
related.  Check the /etc/exports file on your nfs server and try using 
"insecure"
and "no_root_squash" options temporarily to see if your root partition will
mount.  For example, suppose you have a powerpc client 5 that dhcp assigns
an address of 10.0.0.5, do something like this in /etc/exports:

/tftpboot/ppc5           10.0.0.5(rw,insecure,sync,no_root_squash)

Also, you may want to verify that your nfs client version is compatible with
your nfs server version.  For example, if your server exports nfs v3 and 
your
client requires nfs v4.

I am not aware of any problems serving NFS in a cluster.

Regards,

Bob Peterson
Red Hat Cluster Suite


From brentonr at dorm.org  Mon Oct  9 15:18:26 2006
From: brentonr at dorm.org (Brenton Rothchild)
Date: Mon, 09 Oct 2006 10:18:26 -0500
Subject: [Linux-cluster] Re: [Cluster-devel] LVS port translation
In-Reply-To: <452A5CD3.60500@cesca.es>
References: <452A5CD3.60500@cesca.es>
Message-ID: <452A6842.5020608@dorm.org>

What version of piranha are you using?  I have a patch
against piranha-0.8.1 and piranha-0.8.2 that I've been using
in heavy production for about 9 months now.

We just added a "port" value to real server sections in the lvs.cf file, i.e.

virtual server_XYZ {
     address = 123.456.789.1 eth0
     active = 1
     port = 80
     ... (additional options) ...

     server node-X {
         address = 192.168.10.1
         port = 82
         active = 1
         weight = 1
     }

     ...
}

I can send the patch to the list if you're interested - shouldn't be
too hard to patch against other versions (I haven't looked to see if the
RPMs have been updated lately.)

-Brenton Rothchild

Jordi Prats wrote:
> Hi all,
> It's possible to configure piranha to allow to define a different port 
> on real servers using NAT?
> 
> Using ipvsadm would be:
> 
> /sbin/ipvsadm -a -t 123.456.789.1:80 -r 192.168.10.1:82 -m -w 1
> 
> On the current version is not possible, there is any patch to allow 
> this? If there is not, I could do it. Would it be accepted as a 
> contribution on main distribution? Witch roules would I have to follow?
> 
> Thanks,
> 


From rpeterso at redhat.com  Mon Oct  9 15:48:23 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Mon, 09 Oct 2006 10:48:23 -0500
Subject: [Linux-cluster] GNBD Ports
In-Reply-To: <8c1094290610020910s53ac1575gf6c596f4cdef54a8@mail.gmail.com>
References: <8c1094290610020910s53ac1575gf6c596f4cdef54a8@mail.gmail.com>
Message-ID: <452A6F47.30206@redhat.com>

David Brieck Jr. wrote:
> Which ports would need to be open on a firewall to use GNBD server?
>
> Nothing is mentioned in this document about them.
> http://sources.redhat.com/cluster/faq.html#iptables
>
> Thanks. 
Hi David,

For GNBD you need to enable port 14567 (tcp).
I added it to the faq too, so thanks for pointing out the omission. 

Regards,

Bob Peterson
Red Hat Cluster Suite


From rpeterso at redhat.com  Mon Oct  9 15:52:04 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Mon, 09 Oct 2006 10:52:04 -0500
Subject: [Linux-cluster] Quorum partition size requirements?
In-Reply-To: <200610081710.k98HAhd02563@xos037.xos.nl>
References: <200610081710.k98HAhd02563@xos037.xos.nl>
Message-ID: <452A7024.80007@redhat.com>

Jos Vos wrote:
> Hi,
>
> What are the size requirements for a quorum disk (RHEL4 U4 qdisk)?
>
> Mkqdisk seems to write a fixed amount of status blocks (always for
> 16 nodes) and it doesn't complain when running mkqdisk on just 1 MB,
> so I guess the needs are minimal, but I wnat to be sure.
>
> (Back in RHEL 2.1 the old-style quorum disk needed to be 10+ MB.)
>
> Thanks,
>   
Hi Jos,

The official answer is 10MB. The real number is something like 100KB,
but we'd like to reserve 10MB for possible future expansion and features.
I added this to the faq too:

http://sources.redhat.com/cluster/faq.html#quorumdisksize

Regards,

Bob Peterson
Red Hat Cluster Suite


From jos at xos.nl  Mon Oct  9 16:41:18 2006
From: jos at xos.nl (Jos Vos)
Date: Mon, 09 Oct 2006 18:41:18 +0200
Subject: [Linux-cluster] Quorum disks and two-node clusters
Message-ID: <200610091641.k99GfIc13121@xos037.xos.nl>

Hi,

The RHCS 4 U4 release notes say:

   With properly configured heuristics, you could define the following
   cluster behavior:

   * In the event of a network-partition failure, provide a method to
     decide which member wins the fence race in a two-node cluster.

Can someone give a config example of this?  Should the score be
asymmetric in that case?  A scenario of what happens w.r.t. score,
votes, and how the respective cman subsystems react to this would
be nice to have.

Thanks,

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jparsons at redhat.com  Mon Oct  9 16:52:26 2006
From: jparsons at redhat.com (James Parsons)
Date: Mon, 09 Oct 2006 12:52:26 -0400
Subject: [Linux-cluster] Distributing cluster.conf with <quorumd> tag
In-Reply-To: <200610081304.k98D48Z32299@xos037.xos.nl>
References: <200610081304.k98D48Z32299@xos037.xos.nl>
Message-ID: <452A7E4A.4040101@redhat.com>

Jos Vos wrote:

>Hi,
>
>After manually editting cluster.conf to add a <quorumd> entry, it seems
>to be impossible to distribute the config it to the other nodes using
>system-config-cluster, because then a new version without the <quorumd>
>entry is distributed.
>
>Is there a way to distribute a cluster.conf *with* <quorumd> using ccsd
>to all cluster nodes?
>
>Thanks,
>
>--
>--    Jos Vos <jos at xos.nl>
>--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
>--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>  
>
Sorry, Jos - but system-config-cluster support for quorum disk is coming 
in the next rhel4 cs release and in rhel5 cs. You will have to 
distribute your conf file by hand. this is done using two helper tools 
for a DLM cluster.

First you need to run 'ccs_tool update [path to conf file]' with a new 
incremently higher config_version in the file.

Then you will need to run 'cman_tool -r version [the version number you 
chose]'

I am terribly sorry for the inconvenience.

-J


From jos at xos.nl  Mon Oct  9 17:16:41 2006
From: jos at xos.nl (Jos Vos)
Date: Mon, 9 Oct 2006 19:16:41 +0200
Subject: [Linux-cluster] Distributing cluster.conf with <quorumd> tag
In-Reply-To: <452A7E4A.4040101@redhat.com>;
	from jparsons@redhat.com on Mon, Oct 09, 2006 at 12:52:26PM -0400
References: <200610081304.k98D48Z32299@xos037.xos.nl>
	<452A7E4A.4040101@redhat.com>
Message-ID: <20061009191641.B13151@xos037.xos.nl>

On Mon, Oct 09, 2006 at 12:52:26PM -0400, James Parsons wrote:

> Sorry, Jos - but system-config-cluster support for quorum disk is coming 
> in the next rhel4 cs release and in rhel5 cs. You will have to 
> distribute your conf file by hand. this is done using two helper tools 
> for a DLM cluster.

Well, as long as you don't need to manually "scp" to every node I'm
already happy, a command-line is more convenient than a GUI anyway :-)

> First you need to run 'ccs_tool update [path to conf file]' with a new 
> incremently higher config_version in the file.

OK, that worked and seemed to have distributed the config file.

> Then you will need to run 'cman_tool -r version [the version number you 
> chose]'

You mean "cman_tool version -r [...]" I guess?

Do I need to run that on every node or just on one node?

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jparsons at redhat.com  Mon Oct  9 17:33:18 2006
From: jparsons at redhat.com (James Parsons)
Date: Mon, 09 Oct 2006 13:33:18 -0400
Subject: [Linux-cluster] Distributing cluster.conf with <quorumd> tag
In-Reply-To: <20061009191641.B13151@xos037.xos.nl>
References: <200610081304.k98D48Z32299@xos037.xos.nl>	<452A7E4A.4040101@redhat.com>
	<20061009191641.B13151@xos037.xos.nl>
Message-ID: <452A87DE.9090407@redhat.com>

Jos Vos wrote:

>On Mon, Oct 09, 2006 at 12:52:26PM -0400, James Parsons wrote:
>
>  
>
>>Sorry, Jos - but system-config-cluster support for quorum disk is coming 
>>in the next rhel4 cs release and in rhel5 cs. You will have to 
>>distribute your conf file by hand. this is done using two helper tools 
>>for a DLM cluster.
>>    
>>
>
>Well, as long as you don't need to manually "scp" to every node I'm
>already happy, a command-line is more convenient than a GUI anyway :-)
>
>  
>
>>First you need to run 'ccs_tool update [path to conf file]' with a new 
>>incremently higher config_version in the file.
>>    
>>
>
>OK, that worked and seemed to have distributed the config file.
>
>  
>
>>Then you will need to run 'cman_tool -r version [the version number you 
>>chose]'
>>    
>>
>
>You mean "cman_tool version -r [...]" I guess?
>
>Do I need to run that on every node or just on one node?
>
>  
>
Just one node.

One drawback in the design of s-c-cluster, is its inability to handle 
anything but  its own very rigerous schema. This was done to keep the UI 
from choking on an unknown XML tag. When the conf file is loaded, it is 
checked with xmllint --relaxng, and even if it finds problems and you 
select to continue anyway, it will not persist tag sets it does not know 
about in the output file when changes are made and propagated to the 
cluster.

I have been considering adding the ability for users to add their own 
tag sets and attributes, and have the UI persist them in the model and 
write them back out, but not provide a way to edit them in the UI. This 
would address when support of a new feature lags in the UI (like quorum 
disk) as well as user defined specialty tags.

-J


From jprats at cesca.es  Mon Oct  9 19:56:35 2006
From: jprats at cesca.es (Jordi Prats)
Date: Mon, 09 Oct 2006 21:56:35 +0200
Subject: [Linux-cluster] Re: [Cluster-devel] LVS port translation
In-Reply-To: <452A6842.5020608@dorm.org>
References: <452A5CD3.60500@cesca.es> <452A6842.5020608@dorm.org>
Message-ID: <452AA973.3010904@cesca.es>

Hi,
It would be great if you could send it to me! There's any reason why it
is not included in main distribution?

If there's any thing left to do I could try to do it because I prefere
not to depen on a patch to upgrade.

Thank you!
Jordi

Brenton Rothchild wrote:
> What version of piranha are you using?  I have a patch
> against piranha-0.8.1 and piranha-0.8.2 that I've been using
> in heavy production for about 9 months now.
> 
> We just added a "port" value to real server sections in the lvs.cf file,
> i.e.
> 
> virtual server_XYZ {
>     address = 123.456.789.1 eth0
>     active = 1
>     port = 80
>     ... (additional options) ...
> 
>     server node-X {
>         address = 192.168.10.1
>         port = 82
>         active = 1
>         weight = 1
>     }
> 
>     ...
> }
> 
> I can send the patch to the list if you're interested - shouldn't be
> too hard to patch against other versions (I haven't looked to see if the
> RPMs have been updated lately.)
> 
> -Brenton Rothchild
> 
> Jordi Prats wrote:
>> Hi all,
>> It's possible to configure piranha to allow to define a different port
>> on real servers using NAT?
>>
>> Using ipvsadm would be:
>>
>> /sbin/ipvsadm -a -t 123.456.789.1:80 -r 192.168.10.1:82 -m -w 1
>>
>> On the current version is not possible, there is any patch to allow
>> this? If there is not, I could do it. Would it be accepted as a
>> contribution on main distribution? Witch roules would I have to follow?
>>
>> Thanks,
>>
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 

-- 
......................................................................
        __
       / /          Jordi Prats Catal?
 C E / S / C A      Departament de Sistemes
     /_/            Centre de Supercomputaci? de Catalunya

 Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
 T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
......................................................................
pgp:0x5D0D1321
......................................................................


From lhh at redhat.com  Mon Oct  9 20:39:59 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 16:39:59 -0400
Subject: [Linux-cluster] Quorum disks and two-node clusters
In-Reply-To: <200610091641.k99GfIc13121@xos037.xos.nl>
References: <200610091641.k99GfIc13121@xos037.xos.nl>
Message-ID: <1160426400.31581.37.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-09 at 18:41 +0200, Jos Vos wrote:


> Can someone give a config example of this?  Should the score be
> asymmetric in that case?  A scenario of what happens w.r.t. score,
> votes, and how the respective cman subsystems react to this would
> be nice to have.

A simple example is pinging an upstream router, like an "ip tiebreaker"
in RHCS3.  If you can't ping it, you lose.

More complex examples might involve checking network connectivity, then
following up with a check for particular service components.  It's up to
the administrator to make sure the logic makes sense for their
particular installation.

As an example, here's a totally untested, unverified, and probably
broken script that is meant to be the only heuristic (give it 10 points
or something in the configuration).  It does sequential checks of
multiple things.  The idea is that this script should give weight to the
owner of a particular service in the event of a network partition, but
ensures that you have "upstream" network connectivity first (after all,
what's the point of running a network service if no one can reach the
service?).  It assumes a lot of things, like node name == `hostname`,
service having an IP address, etc...

This is just one idea.  I need to write up more/better examples in the
near future.  If anyone else has ideas to add, feel free.

When the score drops below 1/2, qdiskd advertises to CMAN that the
"quorum device" is gone.  CMAN loses local votes and the node becomes
inquorate.  There's a bug in qdisk that prevents qdisk from rebooting
the node the way it is documented to upon this loss of score.  (This is
already fixed in CVS.)

-- Lon

#!/bin/bash
SVC=test
SVCIP=10.1.1.10  # Service IP
TBIP=10.1.1.1    # IP tiebreaker

#
# Check tiebreaker(s).
#
ping -c1 -t1 $TBIP
if [ $? -ne 0 ]; then
        exit 1
fi

#
# If rgmanager is not running, we are quorate.  The administrator either
# didn't start it, or stopped it cleanly (the watchdog should catch
# rgmanager crashes).
#
if ! service rgmanager status; then
        exit 0
fi


#
# Cut up the XML attrs of the service output from clustat into chunks
# XXX is this even possible in a network partition?  It should be
#     with the -f option to clustat, but untested.
#
while read info; do
        declare field val

        field=${info/=*/}
        val=${info/*=/}

        # XXX breaks up last_transition_str.  Fix later.
        if [ "$field" != "$val" ] &&
           [ "$field" != "last_transition_str" ]; then
                eval x_$field=$val
        fi
done < <(clustat -fxs $SVC | grep "name=\"$SVC\"" | \
	 sed -e s/[\\\ ]\\+/\\\n/g)

if [ "$x_service_owner" = "" ] ||
   [ "$x_service_owner" = "none" ]; then
        #
        # Service disabled/failed/stopped/otherwise not running or
        # state unavailable (maybe rgmanager's just booting?)
        #
        exit 0
fi

if [ "$x_service_owner" = "`hostname`" ]; then
        #
        # I own the service.
        #
        exit 0
fi

#
# I don't own the service.
# If we can see the IP and we're not the owner, we're quorate.
#
ping -c1 -t1 $SVCIP
exit $?


From lhh at redhat.com  Mon Oct  9 20:44:13 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 16:44:13 -0400
Subject: [Linux-cluster] heartbeat network
In-Reply-To: <20061009115721.863.qmail@web31811.mail.mud.yahoo.com>
References: <20061009115721.863.qmail@web31811.mail.mud.yahoo.com>
Message-ID: <1160426653.31581.41.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-09 at 04:57 -0700, sara sodagar wrote:
> We want to implement a web hosting service using
> RHEL4. The idea is to load balance several web servers
> using a hardware network load balancer and use GFS as
> the shared storage. As I know we have to install
> redhat cluster suit and redhat GFS to implement shared
> storage. My question is : 
> 
> 1)If we don???t want to set up cluster service,and
> just want the shared storage (GFS), do we still need
> heartbeat network? 

You don't need a private network, but you will want a firewall somewhere
to prevent people from attacking the cluster.  Your hardware load
balancer should do nicely, assuming the nodes are using it as a gateway.
You also still need fencing.

-- Lon


From lhh at redhat.com  Mon Oct  9 20:53:24 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 16:53:24 -0400
Subject: [Linux-cluster] STONITH
In-Reply-To: <OF4F032FF3.3522028D-ON802571FF.003B27EC-802571FF.003D83C8@csc.com>
References: <OF4F032FF3.3522028D-ON802571FF.003B27EC-802571FF.003D83C8@csc.com>
Message-ID: <1160427204.31581.48.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-06 at 12:10 +0100, Grant Waters wrote:

> Powering cycling both nodes and the array fixes the problem, but I
> want to know whats causing it in the first place.  It doesn't appear
> to be related to load, although I can't rule that out - both outages
> were at approx 04:40 on a Friday. 

The tg3 link mysteriously disappearing/reappearing looks like the
culprit.  clumanager doesn't control those kinds of things...

(a) up the failover interval to 30sec.  If it's just a flaky
card/driver/cable/etc., this buys more time.

(b) cludb -p clumembd%rtp 10

If you think it's a scheduling problem.

(c) cludb -p cluster%msgsvc_noarp 1 

Gets rid of "SIOCGARP..." errors.

(d) cludb -p clulockd%loglevel 4

Because clulockd @ debug level is a waste of resources.

-- Lon


From lhh at redhat.com  Mon Oct  9 20:57:28 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 16:57:28 -0400
Subject: [Linux-cluster] Fw: STONITH
In-Reply-To: <OFF21F75A0.F25C0C23-ON802571FF.0041ED4C-802571FF.00424F12@csc.com>
References: <OFF21F75A0.F25C0C23-ON802571FF.0041ED4C-802571FF.00424F12@csc.com>
Message-ID: <1160427448.31581.53.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-06 at 13:03 +0100, Grant Waters wrote:
> 
>  Forgot to say I also get the following msgs in syslog when I telnet
> to the NPS.... 

> What could this "Bad Config" be - is it the /etc/cluster.xml?

It's just pedantic error reporting.  The NPS/IPS/etc. refuses the
connection if someone's logged in, and the STONITH module thinks
something like:

"The IP is refusing connections! It must be wrong!".

If you're logged in to the switch at the time it happens, it's safe to
ignore it ;)

-- Lon


From lhh at redhat.com  Mon Oct  9 20:59:12 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 16:59:12 -0400
Subject: [Linux-cluster] qdiskd vote not represented by cman
In-Reply-To: <452601DC.4030206@redhat.com>
References: <1160076914.3666.2.camel@belmont.site>
	<452601DC.4030206@redhat.com>
Message-ID: <1160427553.31581.55.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-06 at 08:12 +0100, Patrick Caulfield wrote:
> danwest wrote:
> > Shouldn?t we expect to see the qdisk votes reported in ?Total_votes?
> > from cman (see cman_tool below). 
> 
> Yes. if the quorum disk is registered correctly with cman you should see the
> votes it contributes and also it's "node name" in cman_tool nodes.

Yes.

Is qdiskd running?

-- Lon


From lhh at redhat.com  Mon Oct  9 21:11:35 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 17:11:35 -0400
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <2006105202526.771406@leena>
References: <2006105202526.771406@leena>
Message-ID: <1160428295.31581.57.camel@rei.boston.devel.redhat.com>

On Thu, 2006-10-05 at 20:25 -0500, isplist at logicore.net wrote:
> I've been messing with GFS for a while now, learning curve kinda high but 
> slowly getting it. I'm now at the point where I need to fence things better so 
> that reliability becomes better now that I'm getting closer to actually using 
> this on something production.
> 
> Thing is, I've asked before about cluster.conf file building without replies 
> so am still unsure about this part right now. I've seen countless variations, 
> many with things I've not even seen before and for the most part, many seem to 
> be custom made, like someone's recipe :).
> 
> So, the question remains... where can I find VERY good details and information 
> that will help me understand the building of this file.

We'll have this up for you (hopefully) tomorrow morning.  It keeps
slipping through the cracks.

-- Lon


From lhh at redhat.com  Mon Oct  9 21:15:59 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 09 Oct 2006 17:15:59 -0400
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <4529F2F2.3010903@cesca.es>
References: <452200C5.6080406@cesca.es>
	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>
	<45236805.60709@cesca.es>
	<1159981194.12856.14.camel@rei.boston.devel.redhat.com>
	<4523FED1.9010404@cesca.es>
	<1160075910.18145.18.camel@rei.boston.devel.redhat.com>
	<4529F2F2.3010903@cesca.es>
Message-ID: <1160428559.31581.62.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-09 at 08:57 +0200, Jordi Prats wrote:
> Hi,
> I'm attaching to you my services configuration. If I disable a service 
> on node1 and enable it on node2, it succesfully runs on the other node. 
> So, aparently all scripts are installed on both nodes and functional.
> 
> Relocating a service do not appear nothing on the other node's log. So, 
> must be a communications problem. Where can I start to search any 
> problem related to this? Network seems to be ok, and I can do ssh 
> between nodes. Sending pings with mtr -i 0.01 does not loose any packet.
> 
> Thanks,
> 
> Services:
>                 <service autostart="1" domain="projectes" name="projectes">
>                 </service>
>                 <service autostart="1" domain="local" name="local">
>                 </service>
>                 <service autostart="1" domain="mysql" exclusive="1" 
> name="mysql">
>                 </service>
>                 <service autostart="1" domain="postgres" name="postgres">
>                 </service>

* Make sure none of the domains are restricted ("restricted="1""),
unless you really mean it.  This means the service will not run on any
node outside of the failover domain, even if you tell it to!

* Nothing will automatically run on the same node as the mysql service
unless you do manual enable/disable, I think.

-- Lon


From jprats at cesca.es  Mon Oct  9 21:25:45 2006
From: jprats at cesca.es (Jordi Prats)
Date: Mon, 09 Oct 2006 23:25:45 +0200
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <1160428559.31581.62.camel@rei.boston.devel.redhat.com>
References: <452200C5.6080406@cesca.es>	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>	<45236805.60709@cesca.es>	<1159981194.12856.14.camel@rei.boston.devel.redhat.com>	<4523FED1.9010404@cesca.es>	<1160075910.18145.18.camel@rei.boston.devel.redhat.com>	<4529F2F2.3010903@cesca.es>
	<1160428559.31581.62.camel@rei.boston.devel.redhat.com>
Message-ID: <452ABE59.4030501@cesca.es>

Hi Lon,
There is any detailed documentation about all this options and consequences?

Thanks

Lon Hohberger wrote:
> On Mon, 2006-10-09 at 08:57 +0200, Jordi Prats wrote:
>> Hi,
>> I'm attaching to you my services configuration. If I disable a service 
>> on node1 and enable it on node2, it succesfully runs on the other node. 
>> So, aparently all scripts are installed on both nodes and functional.
>>
>> Relocating a service do not appear nothing on the other node's log. So, 
>> must be a communications problem. Where can I start to search any 
>> problem related to this? Network seems to be ok, and I can do ssh 
>> between nodes. Sending pings with mtr -i 0.01 does not loose any packet.
>>
>> Thanks,
>>
>> Services:
>>                 <service autostart="1" domain="projectes" name="projectes">
>>                 </service>
>>                 <service autostart="1" domain="local" name="local">
>>                 </service>
>>                 <service autostart="1" domain="mysql" exclusive="1" 
>> name="mysql">
>>                 </service>
>>                 <service autostart="1" domain="postgres" name="postgres">
>>                 </service>
> 
> * Make sure none of the domains are restricted ("restricted="1""),
> unless you really mean it.  This means the service will not run on any
> node outside of the failover domain, even if you tell it to!
> 
> * Nothing will automatically run on the same node as the mysql service
> unless you do manual enable/disable, I think.
> 
> -- Lon
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 

-- 
......................................................................
        __
       / /          Jordi Prats Catal?
 C E / S / C A      Departament de Sistemes
     /_/            Centre de Supercomputaci? de Catalunya

 Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
 T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
......................................................................
pgp:0x5D0D1321
......................................................................


From rpeterso at redhat.com  Mon Oct  9 22:03:02 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Mon, 09 Oct 2006 17:03:02 -0500
Subject: [Linux-cluster] Quorum disk: Can it be a partition?
In-Reply-To: <20061009115927.C7198@xos037.xos.nl>
References: <45211826.1050703@redhat.com>	<20061009094721.65AAB27C57@ux-mail.informatics.lk>
	<20061009115927.C7198@xos037.xos.nl>
Message-ID: <452AC716.2040207@redhat.com>

Jos Vos wrote:
> On Mon, Oct 09, 2006 at 03:17:53PM +0530, Hirantha Wijayawardena wrote:
>> Just another quick question;
>> Which disk/partition should be elected as quorum disk/partition? Either
>> shared storage or individual nodes?
>>     
> Shared storage.  The idea of the quorum disk is that others can see that
> you still write something to the storage (and thus are still alive) :-).

Yes.  You definitely want to use shared storage.

Regards,

Bob Peterson
Red Hat Cluster Suite


From sandra-llistes at fib.upc.edu  Tue Oct 10 07:43:53 2006
From: sandra-llistes at fib.upc.edu (sandra-llistes)
Date: Tue, 10 Oct 2006 09:43:53 +0200
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <4526B4ED.9050907@redhat.com>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
Message-ID: <452B4F39.60906@fib.upc.edu>

Hi Abhi,

Thanks for the reply.
Please, Could you try to play a video clip from four or more windows 
simultaneously? sometimes with less than tree clients it can work, but 
for four or more always fails.
Could you give more information about your system? is Fedora or RHEL? 
what OS,samba,GFS version? Any special locking configuration in samba 
or linux kernel?

I had tested samba with GFS only mounted in one node, and samba hangs 
likewise, but GFS mounted with no_lock option in node1 works fine.

I download the lastest version for GFS2 and cluster with "cvs -d 
:pserver:cvs at sources.redhat.com:/cvs/cluster checkout cluster"and I'm 
trying to make it work in our test environment. I hope this works.
I don't know if I had said that we have Fedora 5 and not RHEL 
installed. I don't know if it can be some differences.

Also, I tried OCFS2 and works ok with this samba configuration, the 
problem is that doesn't support quotas.
Regards,

Sandra

Abhijith Das wrote:
> sandra-llistes wrote:
> 
>> Hi,
>>
>> I sent a mail a few days ago to this list related with GFS+samba 
>> problems.
>>
>> Since the, we have installed a sepparated test environment also with 
>> two linux servers where we  have tested a samba server with an 
>> exported share in GFS. The share is read-only and only one server is 
>> exporting it.
>>
>> When we try to access from a single windows client it works fine, but 
>> when we try to access to the same file from 2 or more windows clients 
>> simoultaneously, windows hangs and samba also does. This seems not to 
>> happen with concurrent access to different files or with linux clients.
>>
>> We've also tested to export the same share without GFS and in this 
>> case it works fine.
>>
>> It seems to be a locking problem with samba, GFS and windows clients. 
>> Does any of you have experienced similar problems? Do you have any 
>> suggestion about this?
>>
>> Following is the share configuration in smb.conf:
>>
>> [public]
>>   comment         = ShareGFS
>>   path            = /public
>>   writeable       = No
>>   read only       = Yes
>>   write list      = @admsamba
>>   force group     = admsamba
>>   create mask     = 0775
>>   directory mask  = 0775
>>   oplocks         = No
>>   locking = Yes
>>   strict locking = Yes
>> # I proved with locking/Strick locking=Yes and No. Always happens the 
>> same problem
>>
>> I attach some samba logs (Level 3).
>> Software Versions:
>>     Fedora 5
>>     Samba 3.0.23
>>     GFS  6.1.5
>>     kernel 2.6.17-1.2187_FC5
>>
>> Any help will be appreciated.
>>
>> Sandra Hernandez
> 
> Hi Sandra,
> I'm not very familiar with the locking of samba, but I did try the 
> scenario you described on my test cluster. I'm unable to reproduce your 
> problem. I have an identical smb.conf as you've pasted above. Accessing 
> (reading a txt file, or playing a video clip) from two windows clients 
> simultaneously works just fine without any glitches.
> If I understood it right, the test case you describe has one node in a 
> cluster exporting a single samba share over a GFS filesystem and you're 
> using multiple windows clients to access the same file in this share. 
> This is a fairly basic operation IMO and it is quite odd that you should 
> see this failure. Maybe you can try the CVS version of cluster suite 
> (cvs -d :pserver:cvs at sources.redhat.com:/cvs/cluster checkout -r RHEL4 
> cluster) to see if the problem persists. Also, I'd be interested in 
> knowing the behavior when you mount GFS on only one node (the one that's 
> exporting) and also when you use GFS with lock_nolock on a standalone 
> machine.
> Thanks,
> --Abhi


From francisco_javier.pena at roche.com  Tue Oct 10 08:21:11 2006
From: francisco_javier.pena at roche.com (Pena, Francisco Javier)
Date: Tue, 10 Oct 2006 10:21:11 +0200
Subject: [Linux-cluster] Quorum disks and two-node clusters
In-Reply-To: <1160426400.31581.37.camel@rei.boston.devel.redhat.com>
Message-ID: <C0C1791E8EC6F249B5570F01409BD3EE0148523B@rmamsem1.emea.roche.com>

[...]

> When the score drops below 1/2, qdiskd advertises to CMAN 
> that the "quorum device" is gone.  CMAN loses local votes and 
> the node becomes inquorate.  There's a bug in qdisk that 
> prevents qdisk from rebooting the node the way it is 
> documented to upon this loss of score.  (This is already 
> fixed in CVS.)
> 
> -- Lon
> 

Lon, 

Do you know if there are any plans to release an updated cman package, including this bugfix, before U5? My company is planning to start implementing GFS, and this would definitely be required for us.

Regards,

Javier Pe?a


From dbrieck at gmail.com  Tue Oct 10 12:45:27 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Tue, 10 Oct 2006 08:45:27 -0400
Subject: [Linux-cluster] Re: Hard lockups during file transfer to GNBD/GFS
	device
In-Reply-To: <8c1094290609290651r62cec5f9n28278d6a81c3e6ef@mail.gmail.com>
References: <8c1094290609280915o6b6b4962ud0d090e58e5d7fc6@mail.gmail.com>
	<8c1094290609281208i6a5eaf8br70697c6b5d085cf@mail.gmail.com>
	<8c1094290609281227j1303ec11u300932ab8d4953ab@mail.gmail.com>
	<20060928195844.GB25242@redhat.com>
	<8c1094290609290651r62cec5f9n28278d6a81c3e6ef@mail.gmail.com>
Message-ID: <8c1094290610100545j1f91d0a2h519cd9033b439390@mail.gmail.com>

On 9/29/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> On 9/28/06, David Teigland <teigland at redhat.com> wrote:
> >
> > Could you try it without multipath?  You have quite a few layers there.
> > Dave
> >
> >
>
> Thanks for the response. I unloaded gfs, clvm, gnbd and multipath, the
> reloaded gnbd, clvm and gfs. It was only talking to one of the gnbd
> servers and without multipath. Here's the log from this crash. It
> seems to have more info in it.
>
> I'm kinda confused why it still has references to mulitpath though. I
> unloaded the multipath module so I'm not sure why it's still in there.
> SNIP

Since I didn't hear back from anyone I decided to try things a little
differently. Instead of rsyncing on the local machine, I ran the rsync
from another cluster member who also mounts the same partition I was
trying to move things too.

So instead of

rsync -a /path/to/files/ /mnt/http/

I used

rsync -a root at 10.1.1.121::/path/to/files/  /mnt/http/

and I didn't have a crash at all. Why would this not cause a problem
when the first one did? Is this more of an rsync problem maybe? I do
have 2 NFS exports, could those have been causing the problem?

Thanks.


From matt at rebelbase.com  Tue Oct 10 13:38:55 2006
From: matt at rebelbase.com (Matt Eagleson)
Date: Tue, 10 Oct 2006 06:38:55 -0700
Subject: [Linux-cluster] GFS Problem: invalid metadata block
Message-ID: <452BA26F.1070707@rebelbase.com>

Hello,

I have been evaluating a GFS cluster as an NFS solution and have unfortunately run in to a serious problem which I cannot explain.  Both of the GFS filesystems I am exporting became corrupt and unusable.

The system is Redhat AS4 with 2.6.9-42.0.2.ELsmp.  I cannot find anything unusual on the host or the SAN at the time of this error.  Nobody was logged in to the nodes.

Can anyone help me understand what is happening here?

Here are the logs:

Node 2:
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Trying to acquire journal lock...
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Looking at journal...
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Acquiring the transaction lock...
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Replaying journal...
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Replayed 6 of 7 blocks
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: replays = 6, skips = 0, sames = 1
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Journal replayed in 1s
1:14:01 file04 kernel: GFS: fsid=nfs:data.0: jid=1: Done
1:14:14 file04 kernel: GFS: fsid=nfs:data.0: fatal: invalid metadata block
1:14:14 file04 kernel: GFS: fsid=nfs:data.0:   bh = 65578 (magic)
1:14:14 file04 kernel: GFS: fsid=nfs:data.0:   function = gfs_rgrp_read
1:14:14 file04 kernel: GFS: fsid=nfs:data.0:   file = /usr/src/redhat/BUILD/gfs-kernel-2.6.9-58/smp/src/gfs/rgrp.c, line = 830
1:14:14 file04 kernel: GFS: fsid=nfs:data.0:   time = 1159924454
1:14:14 file04 kernel: GFS: fsid=nfs:data.0: about to withdraw from the cluster
1:14:14 file04 kernel: GFS: fsid=nfs:data.0: waiting for outstanding I/O
1:14:14 file04 kernel: GFS: fsid=nfs:data.0: telling LM to withdraw
1:14:14 file04 kernel: lock_dlm: withdraw abandoned memory
1:14:14 file04 kernel: GFS: fsid=nfs:data.0: withdrawn
1:14:41 file04 kernel: GFS: fsid=nfs:home.0: fatal: invalid metadata block
1:14:41 file04 kernel: GFS: fsid=nfs:home.0:   bh = 24 (magic)
1:14:41 file04 kernel: GFS: fsid=nfs:home.0:   function = gfs_get_meta_buffer
1:14:41 file04 kernel: GFS: fsid=nfs:home.0:   file = /usr/src/redhat/BUILD/gfs-kernel-2.6.9-58/smp/src/gfs/dio.c, line = 1223
1:14:41 file04 kernel: GFS: fsid=nfs:home.0:   time = 1159924481
1:14:41 file04 kernel: GFS: fsid=nfs:home.0: about to withdraw from the cluster
1:14:41 file04 kernel: GFS: fsid=nfs:home.0: waiting for outstanding I/O
1:14:41 file04 kernel: GFS: fsid=nfs:home.0: telling LM to withdraw
1:14:43 file04 kernel: lock_dlm: withdraw abandoned memory
1:14:43 file04 kernel: GFS: fsid=nfs:home.0: withdrawn

Node 1:
1:14:46 file03 kernel: GFS: fsid=nfs:data.1: fatal: invalid metadata block
1:14:46 file03 kernel: GFS: fsid=nfs:data.1:   bh = 24 (magic)
1:14:46 file03 kernel: GFS: fsid=nfs:data.1:   function = gfs_get_meta_buffer
1:14:46 file03 kernel: GFS: fsid=nfs:data.1:   file = /usr/src/redhat/BUILD/gfs-kernel-2.6.9-58/smp/src/gfs/dio.c, line = 1223
1:14:46 file03 kernel: GFS: fsid=nfs:data.1:   time = 1159924486
1:14:46 file03 kernel: GFS: fsid=nfs:data.1: about to withdraw from the cluster
1:14:46 file03 kernel: GFS: fsid=nfs:data.1: waiting for outstanding I/O
1:14:46 file03 kernel: GFS: fsid=nfs:data.1: telling LM to withdraw
1:14:48 file03 kernel: lock_dlm: withdraw abandoned memory
1:14:48 file03 kernel: GFS: fsid=nfs:data.1: withdrawn
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Trying to acquire journal lock...
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Looking at journal...
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Acquiring the transaction lock...
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Replaying journal...
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Replayed 4 of 4 blocks
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: replays = 4, skips = 0, sames = 0
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Journal replayed in 1s
1:15:30 file03 kernel: GFS: fsid=nfs:home.1: jid=0: Done
1:15:58 file03 kernel: GFS: fsid=nfs:home.1: fatal: invalid metadata block
1:15:58 file03 kernel: GFS: fsid=nfs:home.1:   bh = 21 (magic)
1:15:58 file03 kernel: GFS: fsid=nfs:home.1:   function = gfs_rgrp_read
1:15:58 file03 kernel: GFS: fsid=nfs:home.1:   file = /usr/src/redhat/BUILD/gfs-kernel-2.6.9-58/smp/src/gfs/rgrp.c, line = 830
1:15:58 file03 kernel: GFS: fsid=nfs:home.1:   time = 1159924558
1:15:58 file03 kernel: GFS: fsid=nfs:home.1: about to withdraw from the cluster
1:15:58 file03 kernel: GFS: fsid=nfs:home.1: waiting for outstanding I/O
1:15:58 file03 kernel: GFS: fsid=nfs:home.1: telling LM to withdraw
1:15:58 file03 kernel: lock_dlm: withdraw abandoned memory
1:15:58 file03 kernel: GFS: fsid=nfs:home.1: withdraw


From brentonr at dorm.org  Tue Oct 10 13:44:24 2006
From: brentonr at dorm.org (Brenton Rothchild)
Date: Tue, 10 Oct 2006 08:44:24 -0500
Subject: [Linux-cluster] LVS port translation
In-Reply-To: <452AA973.3010904@cesca.es>
References: <452A5CD3.60500@cesca.es> <452A6842.5020608@dorm.org>
	<452AA973.3010904@cesca.es>
Message-ID: <452BA3B8.6090009@dorm.org>

Jordi Prats wrote:
> Hi,
> It would be great if you could send it to me! There's any reason why it
> is not included in main distribution?
> 
> If there's any thing left to do I could try to do it because I prefere
> not to depen on a patch to upgrade.
> 
> Thank you!
> Jordi
> 

Jordi,

Attached is the patch I have been using.

It does the following things:

  - Adds a "port" option to real-server blocks in lvs.cf.  This value defaults
    to the virtual service port number if specified, and port 80 if neither are
    specified (this is the default as is currently in piranha).  This port value
    is passed to the "-r" argument for ipvsadm as called by nanny.

  - Adds the "-r" argument to nanny itself, similar to the already existing "-p"
    argument.  The manpage for nanny has been updated to reflect this usage.

  - The special token "%p" has been added to nanny's external check command syntax
    in order to utilize the specific real-server port number in command-line-specified
    external check commands with nanny.  Also see nanny manpage updates.


So basically what you can do is add a "port" value inside your real-server blocks
in lvs.cf, and be on your way :)

I don't know if there's anything left to do in order to have such a patch submitted
to the piranha sources - I've never done that before.  If anyone would like to get
it in there, I would love it for the same reason: no more custom building/patching
on upgrades :)

Comments certainly welcome!

-Brenton Rothchild

-------------- next part --------------
A non-text attachment was scrubbed...
Name: piranha-0.8.2-rport.patch
Type: text/x-patch
Size: 15270 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061010/9696eeec/attachment.bin>

From jparsons at redhat.com  Tue Oct 10 15:08:28 2006
From: jparsons at redhat.com (James Parsons)
Date: Tue, 10 Oct 2006 11:08:28 -0400
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <1160428295.31581.57.camel@rei.boston.devel.redhat.com>
References: <2006105202526.771406@leena>
	<1160428295.31581.57.camel@rei.boston.devel.redhat.com>
Message-ID: <452BB76C.1020606@redhat.com>

Lon Hohberger wrote:

>On Thu, 2006-10-05 at 20:25 -0500, isplist at logicore.net wrote:
>  
>
>>I've been messing with GFS for a while now, learning curve kinda high but 
>>slowly getting it. I'm now at the point where I need to fence things better so 
>>that reliability becomes better now that I'm getting closer to actually using 
>>this on something production.
>>
>>Thing is, I've asked before about cluster.conf file building without replies 
>>so am still unsure about this part right now. I've seen countless variations, 
>>many with things I've not even seen before and for the most part, many seem to 
>>be custom made, like someone's recipe :).
>>
>>So, the question remains... where can I find VERY good details and information 
>>that will help me understand the building of this file.
>>    
>>
>
>We'll have this up for you (hopefully) tomorrow morning.  It keeps
>slipping through the cracks.
>
>-- Lon
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>  
>
This document is now linked off of our sources.redhat.com cluster page at:
http://sources.redhat.com/cluster/doc/cluster_schema.html

It does not have the resource descriptions in it yet...later today, I hope.

-Jim


From jos at xos.nl  Tue Oct 10 15:23:22 2006
From: jos at xos.nl (Jos Vos)
Date: Tue, 10 Oct 2006 17:23:22 +0200
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <452BB76C.1020606@redhat.com>;
	from jparsons@redhat.com on Tue, Oct 10, 2006 at 11:08:28AM -0400
References: <2006105202526.771406@leena>
	<1160428295.31581.57.camel@rei.boston.devel.redhat.com>
	<452BB76C.1020606@redhat.com>
Message-ID: <20061010172322.B25971@xos037.xos.nl>

On Tue, Oct 10, 2006 at 11:08:28AM -0400, James Parsons wrote:

> This document is now linked off of our sources.redhat.com cluster page at:
> http://sources.redhat.com/cluster/doc/cluster_schema.html
> 
> It does not have the resource descriptions in it yet...later today, I hope.

Thanks, but I'm also missing the description for the <quorumd> tag
and its children, which is even more important, as it has to be
written manually now.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From wcheng at redhat.com  Tue Oct 10 19:54:36 2006
From: wcheng at redhat.com (S. Wendy Cheng)
Date: Tue, 10 Oct 2006 15:54:36 -0400
Subject: [Linux-cluster] GFS Problem: invalid metadata block
In-Reply-To: <452BA26F.1070707@rebelbase.com>
References: <452BA26F.1070707@rebelbase.com>
Message-ID: <1160510076.29653.4.camel@dhcp59-234.rdu.redhat.com>

On Tue, 2006-10-10 at 06:38 -0700, Matt Eagleson wrote:
> Hello,
> 
> I have been evaluating a GFS cluster as an NFS solution and have unfortunately run in to a serious problem which I cannot explain.  Both of the GFS filesystems I am exporting became corrupt and unusable.
> 
> The system is Redhat AS4 with 2.6.9-42.0.2.ELsmp.  I cannot find anything unusual on the host or the SAN at the time of this error.  Nobody was logged in to the nodes.
> 
> Can anyone help me understand what is happening here?
> 

By the time this check was hit, the filesystem had been corrupted. Do
you have a way to recreate this issue ? What is the SAN (storage) you
use ? Any third party kernel modules in the system ?

-- Wendy


From adas at redhat.com  Tue Oct 10 20:02:53 2006
From: adas at redhat.com (Abhijith Das)
Date: Tue, 10 Oct 2006 15:02:53 -0500
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <452B4F39.60906@fib.upc.edu>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
	<452B4F39.60906@fib.upc.edu>
Message-ID: <452BFC6D.60902@redhat.com>

sandra-llistes wrote:

> Hi Abhi,
>
> Thanks for the reply.
> Please, Could you try to play a video clip from four or more windows 
> simultaneously? sometimes with less than tree clients it can work, but 
> for four or more always fails.
> Could you give more information about your system? is Fedora or RHEL? 
> what OS,samba,GFS version? Any special locking configuration in samba 
> or linux kernel?
>
> I had tested samba with GFS only mounted in one node, and samba hangs 
> likewise, but GFS mounted with no_lock option in node1 works fine.
>
> I download the lastest version for GFS2 and cluster with "cvs -d 
> :pserver:cvs at sources.redhat.com:/cvs/cluster checkout cluster"and I'm 
> trying to make it work in our test environment. I hope this works.
> I don't know if I had said that we have Fedora 5 and not RHEL 
> installed. I don't know if it can be some differences.
>
> Also, I tried OCFS2 and works ok with this samba configuration, the 
> problem is that doesn't support quotas.
> Regards,
>
> Sandra

Hi Sandra,
I'm in the process of gathering a few windows boxes to run your test. I 
should hopefully have 4 windows clients tomorrow.
A warning first up, I'd recommend that you *not* use GFS2 and the latest 
cluster suite for your tests just yet. With constant development going 
on, some components are unstable and more problems is not what you need 
right now :-). The RHEL4 tag in CVS has stable code from the most recent 
release. I'd suggest you compile gfs and the cluster suite from that CVS 
branch.
I'm running a 3-node x86 cluster with RHEL4. The cluster suite and gfs 
are from the RHEL4 branch of CVS along with some innocuous patches. The 
samba version is 3.0.10-1.4E.2. I'm using an smb.conf almost identical 
to the one you posted in your previous mail. I don't have any other 
kernel/samba locking settings that I'm aware of.
You did mention in an email few weeks ago that you were trying to export 
the same GFS mount over multiple samba servers on multiple nodes 
simultaneously (active-active samba). I'm guessing you achieved this by 
setting the locking and pid directories of samba to be on the shared gfs 
filesystem. (This is a wrong approach and doesn't work. There's a lot of 
debate on this in the samba and samba-technical list archives are 
samba.org). I'm wondering if you still have these directories on the GFS 
filesystem, which could possibly be causing your hang?
Also, do you see anything unusual in /var/log/messages on the GFS node 
when this hang occurs? I'm interested in any kernel-panic/assertion 
failures in GFS that might indicate some problem.
Thanks,
--Abhi


From rpeterso at redhat.com  Tue Oct 10 20:09:45 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Tue, 10 Oct 2006 15:09:45 -0500
Subject: [Linux-cluster] GFS Problem: invalid metadata block
In-Reply-To: <452BA26F.1070707@rebelbase.com>
References: <452BA26F.1070707@rebelbase.com>
Message-ID: <452BFE09.9070304@redhat.com>

Matt Eagleson wrote:
> Hello,
>
> I have been evaluating a GFS cluster as an NFS solution and have 
> unfortunately run in to a serious problem which I cannot explain.  
> Both of the GFS filesystems I am exporting became corrupt and unusable.
>
> The system is Redhat AS4 with 2.6.9-42.0.2.ELsmp.  I cannot find 
> anything unusual on the host or the SAN at the time of this error.  
> Nobody was logged in to the nodes.
>
> Can anyone help me understand what is happening here?
>
> Here are the logs:

Hi Matt,

These errors indicate file system corruption on your SAN.  The "bh =" is 
the
block number where the error was detected.  Two of the errors were found
in GFS resource group data ("RG"), which are areas on disk that indicate 
which
blocks on the SAN are allocated and which aren't.  (Not to be confused 
with the
Resource Groups in rgmanager, which is something completely different.) 
The third error is usually reserved for the quota file inode. 

Corruption in the RG information is extremely rare, and may indicate a 
hardware
problem with your SAN.  The fact that both nodes detected problems in 
different
areas is an indication that the problem might be in the SAN itself 
rather than
the motherboards, fibre channel cards or memory of the nodes, although 
that's
still not guaranteed.  Many things can cause data corruption.

I recommend you:

1. Verify the hardware is working properly in all respects.  One way you 
can do this
    is to make a backup of the raw data to another device and verify the 
copy against
    the original without GFS or any of the cluster software in the mix.
    For example, unmount the file system from all nodes in the cluster, 
then do
    something like "dd if=/dev/my_vg/lvol0 of=/mnt/backup/sanbackup" then:
    "diff /dev/my_vg/lvol0 mnt/backup/sanbackup"  (assuming of course that
    /dev/my_vg/lvol0 is the logical volume you have your GFS partition 
on, and
    /mnt/backup/ is some scratch area big enough to hold that much data.)
    The idea here is simply to test that reading from the SAN give you
    the same data twice.  If that works successfully on one node, try it 
on the other node.
2. Once you verify the hardware is working properly, run gfs_fsck on it.
    The latest version of gfs_fsck can repair most GFS rg corruption.
3. If the file system is fixed okay, you should back it up.
4. You may want to do a similar test, only writing data to the SAN, then 
reading it
    back and verifying the results.  Obviously this will destroy the 
data on your SAN
    unless you are careful, so if this is a production machine, please 
take measures
    to protect the data before trying anything like this.
5. If you can read and write to the SAN reliably from both nodes without 
GFS,
    then try using it again with GFS and see if the problem comes back.

Perhaps someone else (the SAN manufacturer?) can recommend hardware
tests you can run to verify the data integrity.

I realize these kinds of tests take a long time to do, but if it's a 
hardware problem,
you really need to know.  There's a outside chance the problem is somewhere
in the GFS core, but I've personally only seen this type of corruption 
once or twice
so I think it's unlikely.  If you can recreate this kind of corruption 
with some kind of
test, please let us know how.

Regards,

Bob Peterson
Red Hat Cluster Suite


From jstoner at opsource.net  Tue Oct 10 20:27:06 2006
From: jstoner at opsource.net (Jeff Stoner)
Date: Tue, 10 Oct 2006 21:27:06 +0100
Subject: [Linux-cluster] Cluster.conf
Message-ID: <38A48FA2F0103444906AD22E14F1B5A3045A6B8A@mailxchg01.corp.opsource.net>

My thanks to Stephen the intern. His work is much appreciated!

--Jeff
SME - UNIX
OpSource Inc.

PGP Key ID 0x6CB364CA 

> -----Original Message-----
> This document is now linked off of our sources.redhat.com 
> cluster page at:
> http://sources.redhat.com/cluster/doc/cluster_schema.html
> 
> It does not have the resource descriptions in it yet...later 
> today, I hope.


From isplist at logicore.net  Tue Oct 10 20:32:35 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 10 Oct 2006 15:32:35 -0500
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <452BB76C.1020606@redhat.com>
Message-ID: <20061010153235.684112@leena>

> This document is now linked off of our sources.redhat.com cluster page at:
> http://sources.redhat.com/cluster/doc/cluster_schema.html

Very very cool... this is what I've been looking for and I'm guessing, many 
others. 

Can't thank you enough!

Mike


From matt at rebelbase.com  Tue Oct 10 21:00:53 2006
From: matt at rebelbase.com (Matt Eagleson)
Date: Tue, 10 Oct 2006 14:00:53 -0700
Subject: [Linux-cluster] GFS Problem: invalid metadata block
In-Reply-To: <452BFE09.9070304@redhat.com>
References: <452BA26F.1070707@rebelbase.com> <452BFE09.9070304@redhat.com>
Message-ID: <452C0A05.6080200@rebelbase.com>

Thank you Robert and Wendy for taking the time to answer my question -- 
I appreciate it.

As you suggested it was indeed a SAN problem.  Someone else in my 
organization was attempting to use the same section of disk as another 
filesystem on a different host.  I can understand why GFS would be 
unhappy with that.

--Matt

Robert Peterson wrote:
> Matt Eagleson wrote:
>> Hello,
>>
>> I have been evaluating a GFS cluster as an NFS solution and have 
>> unfortunately run in to a serious problem which I cannot explain.  
>> Both of the GFS filesystems I am exporting became corrupt and unusable.
>>
>> The system is Redhat AS4 with 2.6.9-42.0.2.ELsmp.  I cannot find 
>> anything unusual on the host or the SAN at the time of this error.  
>> Nobody was logged in to the nodes.
>>
>> Can anyone help me understand what is happening here?
>>
>> Here are the logs:
> 
> Hi Matt,
> 
> These errors indicate file system corruption on your SAN.  The "bh =" is 
> the
> block number where the error was detected.  Two of the errors were found
> in GFS resource group data ("RG"), which are areas on disk that indicate 
> which
> blocks on the SAN are allocated and which aren't.  (Not to be confused 
> with the
> Resource Groups in rgmanager, which is something completely different.) 
> The third error is usually reserved for the quota file inode.
> Corruption in the RG information is extremely rare, and may indicate a 
> hardware
> problem with your SAN.  The fact that both nodes detected problems in 
> different
> areas is an indication that the problem might be in the SAN itself 
> rather than
> the motherboards, fibre channel cards or memory of the nodes, although 
> that's
> still not guaranteed.  Many things can cause data corruption.
> 
> I recommend you:
> 
> 1. Verify the hardware is working properly in all respects.  One way you 
> can do this
>    is to make a backup of the raw data to another device and verify the 
> copy against
>    the original without GFS or any of the cluster software in the mix.
>    For example, unmount the file system from all nodes in the cluster, 
> then do
>    something like "dd if=/dev/my_vg/lvol0 of=/mnt/backup/sanbackup" then:
>    "diff /dev/my_vg/lvol0 mnt/backup/sanbackup"  (assuming of course that
>    /dev/my_vg/lvol0 is the logical volume you have your GFS partition 
> on, and
>    /mnt/backup/ is some scratch area big enough to hold that much data.)
>    The idea here is simply to test that reading from the SAN give you
>    the same data twice.  If that works successfully on one node, try it 
> on the other node.
> 2. Once you verify the hardware is working properly, run gfs_fsck on it.
>    The latest version of gfs_fsck can repair most GFS rg corruption.
> 3. If the file system is fixed okay, you should back it up.
> 4. You may want to do a similar test, only writing data to the SAN, then 
> reading it
>    back and verifying the results.  Obviously this will destroy the data 
> on your SAN
>    unless you are careful, so if this is a production machine, please 
> take measures
>    to protect the data before trying anything like this.
> 5. If you can read and write to the SAN reliably from both nodes without 
> GFS,
>    then try using it again with GFS and see if the problem comes back.
> 
> Perhaps someone else (the SAN manufacturer?) can recommend hardware
> tests you can run to verify the data integrity.
> 
> I realize these kinds of tests take a long time to do, but if it's a 
> hardware problem,
> you really need to know.  There's a outside chance the problem is somewhere
> in the GFS core, but I've personally only seen this type of corruption 
> once or twice
> so I think it's unlikely.  If you can recreate this kind of corruption 
> with some kind of
> test, please let us know how.
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From filipe.miranda at gmail.com  Tue Oct 10 23:26:04 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Tue, 10 Oct 2006 20:26:04 -0300
Subject: [Linux-cluster] qdiskd vote not represented by cman
In-Reply-To: <1160427553.31581.55.camel@rei.boston.devel.redhat.com>
References: <1160076914.3666.2.camel@belmont.site>
	<452601DC.4030206@redhat.com>
	<1160427553.31581.55.camel@rei.boston.devel.redhat.com>
Message-ID: <a6d13c780610101626x57449ea6of7e62f78bb66e98d@mail.gmail.com>

Isnt the qdisk votes be set to a number higher than the sum of the nodes?
For example: If I have 3 nodes each with 1 vote. My qdisk vote value should
be set to 4.
Am I wrong?

Regards,
Filipe Miranda

On 10/9/06, Lon Hohberger <lhh at redhat.com> wrote:
>
> On Fri, 2006-10-06 at 08:12 +0100, Patrick Caulfield wrote:
> > danwest wrote:
> > > Shouldn't we expect to see the qdisk votes reported in "Total_votes"
> > > from cman (see cman_tool below).
> >
> > Yes. if the quorum disk is registered correctly with cman you should see
> the
> > votes it contributes and also it's "node name" in cman_tool nodes.
>
> Yes.
>
> Is qdiskd running?
>
> -- Lon
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061010/62dc3d89/attachment.htm>

From rhurst at bidmc.harvard.edu  Wed Oct 11 13:09:26 2006
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Wed, 11 Oct 2006 09:09:26 -0400
Subject: [Linux-cluster] GFS bug with fcntl/flock ?
Message-ID: <1160572166.4042.28.camel@WSBID06223>

I am running InterSystems Cach? 5.0.19 for AMD64, and an strace output
from their utility reveals a problem with fcntl/flock on the GFS
filesystem (mount -t gfs /dev/VGSHARE/lvol0 /usr/local):


        open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) =
        3
        fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
        len=0}) = -1 ENOLCK (No locks available)


When I put their files on an ordinary ext3 filesystem, all works fine:


        open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) =
        3
        fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
        len=0}) = 0


The man page on fcntl states that when flock is setup with
whence=SEEK_SET, start=0, and len=0, the lock will occur on the entire
file... so locking zero bytes in this manner is acceptable.  Help?


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061011/9c66f8ab/attachment.htm>

From rpeterso at redhat.com  Wed Oct 11 13:20:39 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Wed, 11 Oct 2006 08:20:39 -0500
Subject: [Linux-cluster] GFS bug with fcntl/flock ?
In-Reply-To: <1160572166.4042.28.camel@WSBID06223>
References: <1160572166.4042.28.camel@WSBID06223>
Message-ID: <452CEFA7.3030300@redhat.com>

Robert Hurst wrote:
> I am running InterSystems Cach? 5.0.19 for AMD64, and an strace output 
> from their utility reveals a problem with fcntl/flock on the GFS 
> filesystem (mount -t gfs /dev/VGSHARE/lvol0 /usr/local):
>
>     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
>     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
>     len=0}) = -1 ENOLCK (No locks available)
>
>
> When I put their files on an ordinary ext3 filesystem, all works fine:
>
>     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
>     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
>     len=0}) = 0
>
>
> The man page on fcntl states that when flock is setup with 
> whence=SEEK_SET, start=0, and len=0, the lock will occur on the entire 
> file... so locking zero bytes in this manner is acceptable.  Help?
Hi Robert,

What version of the cluster software and GFS are you using?

Regards,

Bob Peterson
Red Hat CLuster Suite


From rhurst at bidmc.harvard.edu  Wed Oct 11 13:24:57 2006
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Wed, 11 Oct 2006 09:24:57 -0400
Subject: [Linux-cluster] GFS bug with fcntl/flock ?
In-Reply-To: <452CEFA7.3030300@redhat.com>
References: <1160572166.4042.28.camel@WSBID06223> <452CEFA7.3030300@redhat.com>
Message-ID: <1160573097.4042.42.camel@WSBID06223>

RHEL 4 U4 on dual AMD Opteron servers.  I don't believe this is an
issue, but none of the Software Compatibility packages were installed
either.  Thanks.

ClusterSuite:
ccs-1.0.7-0.src.rpm             ipvsadm-1.24-6.src.rpm
cman-1.0.11-0.src.rpm           magma-1.0.6-0.src.rpm
cman-kernel-2.6.9-45.5.src.rpm  magma-plugins-1.0.6-0.src.rpm
dlm-1.0.1-1.src.rpm             perl-Net-Telnet-3.03-3.noarch.rpm
dlm-kernel-2.6.9-42.13.src.rpm  piranha-0.8.2-1.src.rpm
fence-1.32.25-1.src.rpm         rgmanager-1.9.53-0.src.rpm
gulm-1.0.7-0.src.rpm            system-config-cluster-1.0.25-1.0.src.rpm
iddev-2.0.0-3.src.rpm

GFS:
GFS-6.1.6-1.src.rpm            gnbd-kernel-2.6.9-9.44.src.rpm
GFS-kernel-2.6.9-58.3.src.rpm  libsepol-devel-1.1.1-2.x86_64.rpm
gnbd-1.0.7-0.src.rpm           lvm2-cluster-2.02.06-7.0.RHEL4.src.rpm


On Wed, 2006-10-11 at 08:20 -0500, Robert Peterson wrote:

> Robert Hurst wrote:
> > I am running InterSystems Cach? 5.0.19 for AMD64, and an strace output 
> > from their utility reveals a problem with fcntl/flock on the GFS 
> > filesystem (mount -t gfs /dev/VGSHARE/lvol0 /usr/local):
> >
> >     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
> >     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
> >     len=0}) = -1 ENOLCK (No locks available)
> >
> >
> > When I put their files on an ordinary ext3 filesystem, all works fine:
> >
> >     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
> >     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
> >     len=0}) = 0
> >
> >
> > The man page on fcntl states that when flock is setup with 
> > whence=SEEK_SET, start=0, and len=0, the lock will occur on the entire 
> > file... so locking zero bytes in this manner is acceptable.  Help?


> Hi Robert,
> 
> What version of the cluster software and GFS are you using?
> 
> Regards,
> 
> Bob Peterson
> Red Hat CLuster Suite
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061011/ca9caff4/attachment.htm>

From filipe.miranda at gmail.com  Wed Oct 11 13:25:57 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Wed, 11 Oct 2006 10:25:57 -0300
Subject: [Linux-cluster] OracleRAC10gR2 plus GFS using GULM (lock servers
	need access to the shared storage?)
Message-ID: <a6d13c780610110625n320778edq388f61be598e1c45@mail.gmail.com>

Hello everybody,

I read Oracle RAC10gR2 plus GFS 6.1 (Using RHCS for v4 U3) documentation and
Oracle will only support an implementation of GFS using GULM lock servers in
conjunction with the RAC solution.
The documentation says that the lock servers (3 machines separate from the
RAC nodes, for example) also need access to the storage, why ? Could someone
explain why the lock servers also need access to the shared storage and
acess to which LUNs?

Regards,
-- 
---
Filipe T Miranda
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061011/6be7bf53/attachment.htm>

From lhh at redhat.com  Wed Oct 11 14:27:39 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 11 Oct 2006 10:27:39 -0400
Subject: [Linux-cluster] Quorum disks and two-node clusters
In-Reply-To: <C0C1791E8EC6F249B5570F01409BD3EE0148523B@rmamsem1.emea.roche.com>
References: <C0C1791E8EC6F249B5570F01409BD3EE0148523B@rmamsem1.emea.roche.com>
Message-ID: <1160576894.11134.41.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-10 at 10:21 +0200, Pena, Francisco Javier wrote:
> [...]
> 
> > When the score drops below 1/2, qdiskd advertises to CMAN 
> > that the "quorum device" is gone.  CMAN loses local votes and 
> > the node becomes inquorate.  There's a bug in qdisk that 
> > prevents qdisk from rebooting the node the way it is 
> > documented to upon this loss of score.  (This is already 
> > fixed in CVS.)
> > 
> > -- Lon
> > 
> 
> Lon, 
> 
> Do you know if there are any plans to release an updated cman package, including this bugfix, before U5? My company is planning to start implementing GFS, and this would definitely be required for us.
> 
> Regards,
> 
> Javier Pe?a

Hi Javier,

I do not think there is a plan to release the fix early, but that could
change at any time.  Note that because the changes were very simple, you
can use the checked-out version from CVS and it will work on U4+errata
as a drop-in replacement.

I am interested to know how you are going to use qdiskd such that this
is a show-stopper for your organization.

The only case where the lack of rebooting becomes problematic is if you
have heuristics which are not concerned with network connectivity (note:
this is a perfectly valid case, of course).

That is, if two nodes see each other, but one thinks it is inquorate,
the quorate node will *not* fence the inquorate node, and... well,
things will not work very smoothly (this is the case where a reboot is
needed).

In network outages (one of the things qdiskd was designed to help with),
the nodes will not see each other, so the node which still thinks it is
quorate will fence the inquorate node, and the cluster will cleanly
continue.

-- Lon


From lhh at redhat.com  Wed Oct 11 14:28:03 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 11 Oct 2006 10:28:03 -0400
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <38A48FA2F0103444906AD22E14F1B5A3045A6B8A@mailxchg01.corp.opsource.net>
References: <38A48FA2F0103444906AD22E14F1B5A3045A6B8A@mailxchg01.corp.opsource.net>
Message-ID: <1160576929.11134.43.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-10 at 21:27 +0100, Jeff Stoner wrote:
> My thanks to Stephen the intern. His work is much appreciated!

Yes, yes it is.

-- Lon


From lhh at redhat.com  Wed Oct 11 14:29:08 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 11 Oct 2006 10:29:08 -0400
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <20061010172322.B25971@xos037.xos.nl>
References: <2006105202526.771406@leena>
	<1160428295.31581.57.camel@rei.boston.devel.redhat.com>
	<452BB76C.1020606@redhat.com>  <20061010172322.B25971@xos037.xos.nl>
Message-ID: <1160576999.11134.45.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-10 at 17:23 +0200, Jos Vos wrote:
> On Tue, Oct 10, 2006 at 11:08:28AM -0400, James Parsons wrote:
> 
> > This document is now linked off of our sources.redhat.com cluster page at:
> > http://sources.redhat.com/cluster/doc/cluster_schema.html
> > 
> > It does not have the resource descriptions in it yet...later today, I hope.
> 
> Thanks, but I'm also missing the description for the <quorumd> tag
> and its children, which is even more important, as it has to be
> written manually now.
> 

That's coming, but in the mean time, how is the man page inadequate?
I'd like to know so I can fix it.

-- Lon


From isplist at logicore.net  Wed Oct 11 14:36:39 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 11 Oct 2006 09:36:39 -0500
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <1160576999.11134.45.camel@rei.boston.devel.redhat.com>
Message-ID: <2006101193639.497730@leena>

> That's coming, but in the mean time, how is the man page inadequate?
> I'd like to know so I can fix it.

Suggestion;

How about some examples for us folks who get a better sense of how things work 
by seeing it :). 

Mike


From jos at xos.nl  Wed Oct 11 14:48:33 2006
From: jos at xos.nl (Jos Vos)
Date: Wed, 11 Oct 2006 16:48:33 +0200
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <2006101193639.497730@leena>;
	from isplist@logicore.net on Wed, Oct 11, 2006 at 09:36:39AM -0500
References: <1160576999.11134.45.camel@rei.boston.devel.redhat.com>
	<2006101193639.497730@leena>
Message-ID: <20061011164833.A2772@xos037.xos.nl>

On Wed, Oct 11, 2006 at 09:36:39AM -0500, isplist at logicore.net wrote:

> Suggestion;
> 
> How about some examples for us folks who get a better sense of how things work
> by seeing it :). 

Yes, agreed, that is what's missing.  Also the total picture (how the
algorithm of cman - with/without communication with qdiskd - works
w.r.t. votes, scores, etc. works) is not described very well (well,
that's an understatement :-)).

For the pre-quorum-disk situation that isn't a big issue, as the
heartbeats are internal to the cman subsystem, but with a quorum
disk things get more complex.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From eloy.acosta at fon.com  Wed Oct 11 14:51:00 2006
From: eloy.acosta at fon.com (eloy.acosta at fon.com)
Date: Wed, 11 Oct 2006 14:51:00 +0000 (UTC)
Subject: [Linux-cluster] OracleRAC10gR2 plus GFS using GULM (lock
	servers need access to the shared storage?)
In-Reply-To: <a6d13c780610110625n320778edq388f61be598e1c45@mail.gmail.com>
Message-ID: <12987846.74111160578260181.JavaMail.root@fonesmta01>


Why don?t use ASM??? 

----- Original Message ----- 
From: Filipe Miranda <filipe.miranda at gmail.com> 
To: Linux Cluster <linux-cluster at redhat.com> 
Sent: mi?rcoles 11 de octubre de 2006 15H25 GMT+0100 
Subject: [Linux-cluster] OracleRAC10gR2 plus GFS using GULM (lock servers need access to the shared storage?) 

Hello everybody, 

I read Oracle RAC10gR2 plus GFS 6.1 (Using RHCS for v4 U3) documentation and Oracle will only support an implementation of GFS using GULM lock servers in conjunction with the RAC solution. 
The documentation says that the lock servers (3 machines separate from the RAC nodes, for example) also need access to the storage, why ? Could someone explain why the lock servers also need access to the shared storage and acess to which LUNs? 

Regards, 
-- 
--- 
Filipe T Miranda 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061011/86e6b420/attachment.htm>

From sandra-llistes at fib.upc.edu  Wed Oct 11 15:33:42 2006
From: sandra-llistes at fib.upc.edu (sandra-llistes)
Date: Wed, 11 Oct 2006 17:33:42 +0200
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <452BFC6D.60902@redhat.com>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
	<452B4F39.60906@fib.upc.edu> <452BFC6D.60902@redhat.com>
Message-ID: <452D0ED6.9040605@fib.upc.edu>

Hi Abhi,

 >I'm in the process of gathering a few windows boxes to run your 
test. >I should hopefully have 4 windows clients tomorrow.

Tell me something about the results.

 >A warning first up, I'd recommend that you *not* use GFS2 and the 
 >latest cluster suite for your tests just yet. With constant 
 >development going on, some components are unstable and more problems 
 >is not what you need right now :-) . The RHEL4 tag in CVS has stable 
 >code from the most recent release. I'd suggest you compile gfs and e 
 >cluster suite from that CVS branch.

Yes, I've been installing GFS2 in our test environment, and it seems 
very experimental for production use. I'm almost finishing 
installation, so I'll try it with samba to see how it works.

 >I'm running a 3-node x86 cluster with RHEL4. The cluster suite and 
 >gfs are from the RHEL4 branch of CVS along with some innocuous 
 >patches. The samba version is 3.0.10-1.4E.2. I'm using an smb.conf 
 >almost identical to the one you posted in your previous mail. I 
don't >have any other kernel/samba locking settings that I'm aware of.

Ok, I'm going to try with the CVS version you're proposing.

 >You did mention in an email few weeks ago that you were trying to 
 >export the same GFS mount over multiple samba servers on multiple 
 >nodes simultaneously (active-active samba). I'm guessing you 
achieved >this by setting the locking and pid directories of samba to 
be on the >shared gfs filesystem. (This is a wrong approach and 
doesn't work. >There's a lot of debate on this in the samba and 
samba-technical list >archives are samba.org). I'm wondering if you 
still have these >directories on the GFS filesystem, which could 
possibly be causing >your hang?

Well, this was one of the unsuccessful test I did, but now I have 
samba in ext3 filesystem (locking and pids included). A few days ago, 
in samba-technical list, a proposal for a clustered samba was made. 
Details are in the following document: 
http://wiki.samba.org/index.php/Samba_%26_Clustering if you're 
interested in.
Of course, it's a proposal and I guess It won't be opperative soon.

 >Also, do you see anything unusual in /var/log/messages on the GFS 
 >node when this hang occurs? I'm interested in any 
 >kernel-panic/assertion failures in GFS that might indicate some 
 >problem.

I don't see nothing abnormal in GFS logs when samba hangs occur, but I 
made strace of smbd and I saw a lot of call systems that were 
unfinished until samba is restarted.

4665  11:09:31.242316 <... geteuid32 resumed> ) = 503 <0.000118>
4665  11:09:31.242405 write(19, "close fd=22 fnum=6371 (numopen=2"..., 
34) = 34 <0.000031>
4665  11:09:31.242572 nanosleep({0, 2000001},  <unfinished ...>
4667  11:09:31.245063 kill(4665, SIG_0) = 0 <0.000018>
4665  11:09:31.248047 <... nanosleep resumed> NULL) = 0 <0.005406>
4665  11:09:31.249355 nanosleep({0, 2000001}, NULL) = 0 <0.002621>
4665  11:09:31.252091 nanosleep({0, 2000001}, NULL) = 0 <0.003853>
4665  11:09:31.256088 nanosleep({0, 2000001}, NULL) = 0 <0.003906>
.................. a lot of nanosleeps ..............................
4665  11:10:04.887037 nanosleep({0, 2000001},  <unfinished ...>
4665  11:10:04.887219 <... nanosleep resumed> 0) = ? 
ERESTART_RESTARTBLOCK (To be restarted) <0.000111>
4665  11:10:04.888197 +++ killed by SIGKILL +++
4667  11:10:04.890712 kill(4665, SIG_0 <unfinished ...>
4666  11:10:04.920965 kill(4665, SIG_0) = -1 ESRCH (No such process) 
<0.000017>
4667  11:10:04.934486 kill(4665, SIG_0 <unfinished ...>

Many Thanks,

		Sandra Hern?ndez


From filipe.miranda at gmail.com  Wed Oct 11 16:31:25 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Wed, 11 Oct 2006 13:31:25 -0300
Subject: [Linux-cluster] OracleRAC10gR2 plus GFS using GULM (lock servers
	need access to the shared storage?)
In-Reply-To: <12987846.74111160578260181.JavaMail.root@fonesmta01>
References: <a6d13c780610110625n320778edq388f61be598e1c45@mail.gmail.com>
	<12987846.74111160578260181.JavaMail.root@fonesmta01>
Message-ID: <a6d13c780610110931w690a01f4t13c8bd8fa48d6f4e@mail.gmail.com>

Answering my question, the lock servers need to have access to the shared
LUNs  because CLVM does its locking across all nodes of the cluster, so the
all the nodes need to be able to see the shared storage so when you do CLVM
operations they can do the locking properly.

Now the question is, can I disable the CLVM on the lock servers and the
answer is yes, just use the clvm=0 on the clusternode tag on cluster.conf.

But what if later on, a few months from now, the customer decides to add
access to a GFS LUNs from the lock servers?
Then I will have to use lvm tags to map the LUNs accordinly. I'm still
trying to test this, but no sucess yet.
I found the documentation for it on
http://sources.redhat.com/cgi-bin/cvsweb.cgi/LVM2/doc/tagging.txt?rev=1.1&content-type=text/x-cvsweb-markup&cvsroot=lvm2&only_with_tag=v2_02_10

But I'm having a hard time to setup this...


On 10/11/06, eloy.acosta at fon.com <eloy.acosta at fon.com> wrote:
>
> Why don?t use ASM???
>
> ----- Original Message -----
> From: Filipe Miranda <filipe.miranda at gmail.com>
> To: Linux Cluster <linux-cluster at redhat.com>
> Sent: mi?rcoles 11 de octubre de 2006 15H25 GMT+0100
> Subject: [Linux-cluster] OracleRAC10gR2 plus GFS using GULM (lock servers
> need access to the shared storage?)
>
> Hello everybody,
>
> I read Oracle RAC10gR2 plus GFS 6.1 (Using RHCS for v4 U3) documentation
> and Oracle will only support an implementation of GFS using GULM lock
> servers in conjunction with the RAC solution.
> The documentation says that the lock servers (3 machines separate from the
> RAC nodes, for example) also need access to the storage, why ? Could someone
> explain why the lock servers also need access to the shared storage and
> acess to which LUNs?
>
> Regards,
> --
> ---
> Filipe T Miranda
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061011/41e22298/attachment.htm>

From lhh at redhat.com  Wed Oct 11 16:50:29 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 11 Oct 2006 12:50:29 -0400
Subject: [Linux-cluster] qdiskd vote not represented by cman
In-Reply-To: <a6d13c780610101626x57449ea6of7e62f78bb66e98d@mail.gmail.com>
References: <1160076914.3666.2.camel@belmont.site>
	<452601DC.4030206@redhat.com>
	<1160427553.31581.55.camel@rei.boston.devel.redhat.com>
	<a6d13c780610101626x57449ea6of7e62f78bb66e98d@mail.gmail.com>
Message-ID: <1160585464.11134.86.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-10 at 20:26 -0300, Filipe Miranda wrote:
> the qdisk votes be set to a number higher than the sum of the nodes?
> For example: If I have 3 nodes each with 1 vote. My qdisk vote value
> should be set to 4.
> Am I wrong? 

It's usually set to the sum-total of votes in the cluster, but you can
really use any number >= to that value.

e.g.

In a 4 node cluster, each node contributes one vote.  If qdisk is set to
4 votes, the max expected votes is 8.  Any one node's vote + qdisk votes
is enough for quorum (5 votes).

-- Lon


From spatuality at yahoo.ca  Wed Oct 11 17:38:00 2006
From: spatuality at yahoo.ca (Brian)
Date: Wed, 11 Oct 2006 10:38:00 -0700 (PDT)
Subject: [Linux-cluster] LVM, Cluster, GFS, iSCSI problems
Message-ID: <20061011173800.29903.qmail@web30810.mail.mud.yahoo.com>

Hi group,

I am trying to setup GFS on two systems but there is a clvm / LVM problem. Here's what's happening:

RHEL 4.4 system x86_64 fully updated (except for latest .3 kernel as cluster dependencies are missing). Running cluster suite fully updated. Using 2 Dell PowerEdge 1955 Blade servers connected to a Promise m500i iSCSI disk array unit.

iSCSI is connecting okay to both servers. One server had LVM and GFS mounted properly and working but the other did not. What's happening now with both servers is the following:

# lvdisplay
    Logging initialised at Wed Oct 11 12:00:45 2006
    Set umask to 0077
    Loaded external locking library liblvm2clusterlock.so
    Finding all logical volumes
  --- Logical volume ---
  LV Name                /dev/vg_gfstest/lv_gfstest
  VG Name                vg_gfstest
  LV UUID                seA0f2-DBgZ-D0gX-gNtT-cT4n-dKFM-0XAdzL
  LV Write Access        read/write
  LV Status              NOT available
  LV Size                20.00 GB
  Current LE             5120
  Segments               1
  Allocation             inherit
  Read ahead sectors     0


I'm guessing that the "NOT Available" bit is causing the volume group to not show up in /dev/ :). This is bad. I'm not sure how to fix this.


Running vgchange to activate the volume group doesn't work:
# vgchange -a y
    Logging initialised at Wed Oct 11 13:24:21 2006
    Set umask to 0077
    Loaded external locking library liblvm2clusterlock.so
    Finding all volume groups
    Finding volume group "vg_gfstest"
  Error locking on node javaapps2: Internal lvm error, check syslog
  Error locking on node javaapps1: Internal lvm error, check syslog
  0 logical volume(s) in volume group "vg_gfstest" now active


The log in /var/log/messages:
Oct 11 13:24:23 server1 lvm[6407]: Volume group for uuid not found: yiH21FlyZwB03nTxylMnyNQsRGnvTqbMseA0f2DBgZD0gXgNtTcT4ndKFM0XAdzL


Has anyone seen this before, or know what to do with this? I suspect this is cluster related as I have not come across this on a stand alone system, but do correct me if I am wrong.

Brian


From lhh at redhat.com  Wed Oct 11 17:48:03 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 11 Oct 2006 13:48:03 -0400
Subject: [Linux-cluster] Cluster.conf
In-Reply-To: <20061011164833.A2772@xos037.xos.nl>
References: <1160576999.11134.45.camel@rei.boston.devel.redhat.com>
	<2006101193639.497730@leena>  <20061011164833.A2772@xos037.xos.nl>
Message-ID: <1160588918.11134.100.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-11 at 16:48 +0200, Jos Vos wrote:
> On Wed, Oct 11, 2006 at 09:36:39AM -0500, isplist at logicore.net wrote:
> 
> > Suggestion;
> > 
> > How about some examples for us folks who get a better sense of how things work
> > by seeing it :). 
> 
> Yes, agreed, that is what's missing.  Also the total picture (how the
> algorithm of cman - with/without communication with qdiskd - works
> w.r.t. votes, scores, etc. works) is not described very well (well,
> that's an understatement :-)).

You're right, it's not.  It explains voting restrictions when
configuring qdisk, but... not *why*... e.g.:

   * Cluster node votes should be more or less equal.
   ...
   * The total number of votes assigned to the quorum device should be
     equal to or greater than the total number of node-votes in the
     cluster.  While  it is possible to assign only one (or a few) votes
     to the quorum device, the effects of doing so have not been
     explored.

CMAN has no knowledge of qdisk's internal "score" information nor its
heuristics, or even how it works.  I should make a note of that.

-- Lon


From dharmadeep at hotmail.com  Wed Oct 11 18:08:47 2006
From: dharmadeep at hotmail.com (Dharmadeep Muppalla)
Date: Wed, 11 Oct 2006 23:38:47 +0530
Subject: [Linux-cluster] benchmarks results for gfs with dlm
In-Reply-To: <mailman.34193.1160582521.1982.linux-cluster@redhat.com>
Message-ID: <BAY107-F3546367B90014BD2308B04C8140@phx.gbl>

Hi,

        Are there any benchmark results for gfs with dlm? If so, what 
benchmarks were used ?

Thanks,
dharmadeep


From adas at redhat.com  Wed Oct 11 22:29:20 2006
From: adas at redhat.com (Abhijith Das)
Date: Wed, 11 Oct 2006 17:29:20 -0500
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <452D0ED6.9040605@fib.upc.edu>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
	<452B4F39.60906@fib.upc.edu> <452BFC6D.60902@redhat.com>
	<452D0ED6.9040605@fib.upc.edu>
Message-ID: <452D7040.8090704@redhat.com>

Hi Sandra,
I tried your test with 4 windows machines. 3 real machines and 1 
simulated in vmware - all running windows xp home. Everything runs fine:
smbstatus on the samba server shows me this:
[root at niobe-04 ~]# smbstatus

Samba version 3.0.10-1.4E.2
PID     Username      Group         Machine
-------------------------------------------------------------------
  775   testmonkey    testmonkeys   ccc-t3n2rexrla7 (10.15.80.203)
  777   testmonkey    testmonkeys   migael       (10.15.80.222)
  774   testmonkey    testmonkeys   bbb-34gtsedgprj (10.15.80.6)
  776   testmonkey    testmonkeys   schumi       (10.15.80.209)

Service      pid     machine       Connected at
-------------------------------------------------------
public         774   bbb-34gtsedgprj  Wed Oct 11 15:03:10 2006
public         776   schumi        Wed Oct 11 15:03:11 2006
IPC$           777   migael        Wed Oct 11 15:43:58 2006
public         777   migael        Wed Oct 11 15:03:25 2006
public         775   ccc-t3n2rexrla7  Wed Oct 11 15:03:10 2006

Locked files:
Pid    DenyMode   Access      R/W        Oplock           Name
--------------------------------------------------------------
777    DENY_NONE  0x20089     RDONLY     NONE             
/public/TruthHappens.ogg   Wed Oct 11 17:39:16 2006
775    DENY_NONE  0x20089     RDONLY     NONE             
/public/TruthHappens.ogg   Wed Oct 11 17:39:16 2006
774    DENY_NONE  0x20089     RDONLY     NONE             
/public/TruthHappens.ogg   Wed Oct 11 17:39:07 2006
776    DENY_NONE  0x20089     RDONLY     NONE             
/public/TruthHappens.ogg   Wed Oct 11 17:39:06 2006

My smb.conf looks like this :
[public]
        comment         = ShareGFS
        path            = /public
        writeable       = No
        read only       = Yes
        write list      = @admsamba
        force group     = root
        create mask     = 0775
        directory mask  = 0775
        oplocks         = No
        locking = Yes
        strict locking = Yes

Also, since the share is readonly, there shouldn't be (m)any locks 
involved, which makes your problem seem all the more odd. Let me know if 
there's anything else I can try. Also, I'm curious about your test 
results with gfs2 and gfs1 from the RHEL4 branch.

Regards,
--Abhi


From Alain.Moulle at bull.net  Thu Oct 12 10:11:27 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Thu, 12 Oct 2006 12:11:27 +0200
Subject: [Linux-cluster] CS4 U4 / problem with force_umount option ?
Message-ID: <452E14CF.1030009@bull.net>

Hi

I've tested the CS4 U4. It seems there is a problem with
the force_umount parameter :
if the fs device is mounted on node2 under /test(linked to "appli") and that
I've done "cd /test on node2 ,
and if on node1 I try to do :
clusvcadm -r appli -m node1
the relocate fails.

Any idea ?

My cluster.conf extract is as :

                <resources>
                        <fs device="/dev/sdx" force_unmount="1" fstype="ext2"
mountpoint="/test" name="testHAfs" options=""/>
                </resources>
                <service domain="domainHA" name="HAtest" autostart="0"
checkinterval="60">
                        <script file="/tmp/testHAmanage" name="HAtest"/>
                        <fs ref="testHAfs"/>
                </service>


Thanks
Alain


From eric.lemoine at gmail.com  Thu Oct 12 14:46:51 2006
From: eric.lemoine at gmail.com (Eric Lemoine)
Date: Thu, 12 Oct 2006 16:46:51 +0200
Subject: [Linux-cluster] manual fencing issue
Message-ID: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>

Hi,

I'm a new member to the linux-cluster mailing list.

I'm trying to set up a 2-node cluster using RedHat Cluster based on
OpenAIS, with manual fencing (for now).

When I reboot a node (l6-z-5), I get the following error messages in
/var/log/syslog of the other node (l6-z-12):

Oct 12 16:38:02 l6-z-12 fenced[4508]: fencing node "l6-z-5"
Oct 12 16:38:02 l6-z-12 fenced[4508]: fence "l6-z-5" failed

Probably because of that fencing failure, the service located on
l6-z-5 doesn't failover to l6-z-12. The fenced error messages repeat
until l6-z-5 rejoins the cluster.

Does anyone know what's going on?

My cluster.conf includes this for the fencing part:

        <fencedevices>
                <fencedevice agent="fencemanual" name="last_resort"/>
        </fencedevices>

My distro is Ubuntu Edgy, which has packages for RedHat Cluster and
OpenAIS. The version of cman seems to be 2.20061002, and the version
of openais 0.80.1.

Thanks a lot,

-- 
Eric


From jos at xos.nl  Thu Oct 12 14:52:34 2006
From: jos at xos.nl (Jos Vos)
Date: Thu, 12 Oct 2006 16:52:34 +0200
Subject: [Linux-cluster] RHEL4 cluster source RPMs incomplete
Message-ID: <200610121452.k9CEqYg13019@xos037.xos.nl>

Hi,

Can someone of the cluster team maybe check the completeness of the
RHCS/RHGFS source RPMs in the RHEL4 updates tree on public ftp sites?

It looks like some RHEL4 packages were not propagated to the public
sites.  The last version of system-config-cluster is older than the
RHEL4 version, and I'm also missing the cman-kernel and gnbd-kernel
packages for the -42.0.3.EL kernel, while the new dlm-kernel and
GFS-kernel packages do exist.

Thanks,

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jon at levanta.com  Thu Oct 12 16:51:51 2006
From: jon at levanta.com (Jonathan Biggar)
Date: Thu, 12 Oct 2006 09:51:51 -0700
Subject: [Linux-cluster] Re: manual fencing issue
In-Reply-To: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>
References: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>
Message-ID: <eglrr7$3kb$1@sea.gmane.org>

Eric Lemoine wrote:
> Hi,
> 
> I'm a new member to the linux-cluster mailing list.
> 
> I'm trying to set up a 2-node cluster using RedHat Cluster based on
> OpenAIS, with manual fencing (for now).
> 
> When I reboot a node (l6-z-5), I get the following error messages in
> /var/log/syslog of the other node (l6-z-12):
> 
> Oct 12 16:38:02 l6-z-12 fenced[4508]: fencing node "l6-z-5"
> Oct 12 16:38:02 l6-z-12 fenced[4508]: fence "l6-z-5" failed
> 
> Probably because of that fencing failure, the service located on
> l6-z-5 doesn't failover to l6-z-12. The fenced error messages repeat
> until l6-z-5 rejoins the cluster.
> 
> Does anyone know what's going on?

Did you ever run fence_ack_manual to acknowledge the manual fencing request?

-- 
Jon Biggar
Levanta
jon at levanta.com
650-403-7252


From eric.lemoine at gmail.com  Thu Oct 12 18:13:47 2006
From: eric.lemoine at gmail.com (Eric Lemoine)
Date: Thu, 12 Oct 2006 20:13:47 +0200
Subject: [Linux-cluster] Re: manual fencing issue
In-Reply-To: <eglrr7$3kb$1@sea.gmane.org>
References: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>
	<eglrr7$3kb$1@sea.gmane.org>
Message-ID: <5cac192f0610121113o6e2bc36bi8f2ff0add9d1abb0@mail.gmail.com>

On 10/12/06, Jonathan Biggar <jon at levanta.com> wrote:
> Eric Lemoine wrote:
> > Hi,
> >
> > I'm a new member to the linux-cluster mailing list.
> >
> > I'm trying to set up a 2-node cluster using RedHat Cluster based on
> > OpenAIS, with manual fencing (for now).
> >
> > When I reboot a node (l6-z-5), I get the following error messages in
> > /var/log/syslog of the other node (l6-z-12):
> >
> > Oct 12 16:38:02 l6-z-12 fenced[4508]: fencing node "l6-z-5"
> > Oct 12 16:38:02 l6-z-12 fenced[4508]: fence "l6-z-5" failed
> >
> > Probably because of that fencing failure, the service located on
> > l6-z-5 doesn't failover to l6-z-12. The fenced error messages repeat
> > until l6-z-5 rejoins the cluster.
> >
> > Does anyone know what's going on?
>
> Did you ever run fence_ack_manual to acknowledge the manual fencing request?

Just tried.

(1) reboot l6-z-5
(2) fenced error messages are repeatedly written to /var/log/syslog on l6-z-12
(3) from l6-z-12, ack manual fencing using "fence_ack_manual -n l6-z-5". Gives:

"Warning:  If the node "l6-z-5" has not been manually fenced
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable!  Please verify that the node shown above has
been reset or disconnected from storage.

Are you certain you want to continue? [yN] y
can't open /tmp/fence_manual.fifo: No such device or address"

(4) fenced error messages are still repeatedly written to /var/log/syslog.

So acknowledging the manual fence operation didn't help at all.
Actually I don't think fenced has called fence_manual, because there's
no syslog message indicating that fence_manual was called. And I get
such messages when I manually call fence_manual.

Thanks,

-- 
Eric


From filipe.miranda at gmail.com  Thu Oct 12 18:27:19 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Thu, 12 Oct 2006 15:27:19 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
Message-ID: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>

Hello everybody,

I have the following environment and I need some help please.
3 machines are GULM lock servers (x86 CPUs)
3 machines are ORACLE RAC nodes that will use GFS (x86_64 CPUs)
Only the RAC nodes have access to the shared storage.

Using Red Hat Enterprise Linux 4.3, Red Hat Cluster Suite Update4 and GFS
6.1 Update3

I setup the RHEL4 plus RHCS and things are just fine. I edited the
cluster.conf and add the follwing tag to the lock servers members so they
wont need access to the shared storage: "clvmd=0"  Then I installed the CLVM
package on all nodes.

So far so good I created my Physical Volumes then I created my Volume Group.
The problem is: When I try to create the Logical Volume I get the follwing
error:

[root at rac-node1 ~]# lvcreate -L 1G -n lv01 vg01
  Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1 not
/dev/emcpowerb1
  Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using /dev/emcpowerc1
not /dev/sdr1
  Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1 not
/dev/emcpowerq1
  Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1 not
/dev/emcpowera1
  Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using /dev/emcpowerd1
not /dev/sdq1
  Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1 not
/dev/emcpowerj1
  Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using /dev/emcpowerk1
not /dev/sdj1
  Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1 not
/dev/emcpoweri1
  Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using /dev/emcpowerl1
not /dev/sdi1
  Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1 not
/dev/emcpowerh1
  Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using /dev/emcpowerm1
not /dev/sdh1
  Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1 not
/dev/emcpowerg1
  Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using /dev/emcpowern1
not /dev/sdg1
  Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1 not
/dev/emcpowerf1
  Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using /dev/emcpowero1
not /dev/sdf1
  Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1 not
/dev/emcpowere1
  Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using /dev/emcpowerp1
not /dev/sde1
  Error locking on node rac-node1: Internal lvm error, check syslog
  Error locking on node UNKNOWN [0.0.ffff0000.dfc0800a]: Internal lvm error,
check syslog
  Error locking on node lock-server1: Internal lvm error, check syslog
  Error locking on node lock-server2: Internal lvm error, check syslog
  Error locking on node lock-server3: Internal lvm error, check syslog
  Failed to activate new LV.

The /var/log/message:

Oct 12 13:40:13 rac-node1 lvm[8014]: Volume group g01 metadata is
inconsistent
Oct 12 13:40:13 rac-node1 lvm[8014]: Volume group for uuid not found:
y0GXAB2eSuhEFaUR7DhGCBry5wZXUwGGEhU1pDSl3v0wjmTVPLuJSkHB2HXz4FNV

[root at rac-node1 ~]# vgdisplay vg01
  Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1 not
/dev/emcpowerb1
  Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using /dev/emcpowerc1
not /dev/sdr1
  Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1 not
/dev/emcpowerq1
  Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1 not
/dev/emcpowera1
  Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using /dev/emcpowerd1
not /dev/sdq1
  Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1 not
/dev/emcpowerj1
  Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using /dev/emcpowerk1
not /dev/sdj1
  Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1 not
/dev/emcpoweri1
  Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using /dev/emcpowerl1
not /dev/sdi1
  Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1 not
/dev/emcpowerh1
  Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using /dev/emcpowerm1
not /dev/sdh1
  Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1 not
/dev/emcpowerg1
  Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using /dev/emcpowern1
not /dev/sdg1
  Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1 not
/dev/emcpowerf1
  Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using /dev/emcpowero1
not /dev/sdf1
  Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1 not
/dev/emcpowere1
  Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using /dev/emcpowerp1
not /dev/sde1
  --- Volume group ---
  VG Name               vg01
  System ID
  Format                lvm2
  Metadata Areas        16
  Metadata Sequence No  2
  VG Access             read/write
  VG Status             resizable
  Clustered             yes
  Shared                no
  MAX LV                0
  Cur LV                1
  Open LV               0
  Max PV                0
  Cur PV                16
  Act PV                16
  VG Size               539.00 GB
  PE Size               32.00 MB
  Total PE              17248
  Alloc PE / Size       32 / 1.00 GB
  Free  PE / Size       17216 / 538.00 GB
  VG UUID               y0GXAB-2eSu-hEFa-UR7D-hGCB-ry5w-ZXUwGG

[root at rac-node1 ~]# lvdisplay /dev/vg01/lv01
  Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1 not
/dev/emcpowerb1
  Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using /dev/emcpowerc1
not /dev/sdr1
  Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1 not
/dev/emcpowerq1
  Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1 not
/dev/emcpowera1
  Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using /dev/emcpowerd1
not /dev/sdq1
  Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1 not
/dev/emcpowerj1
  Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using /dev/emcpowerk1
not /dev/sdj1
  Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1 not
/dev/emcpoweri1
  Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using /dev/emcpowerl1
not /dev/sdi1
  Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1 not
/dev/emcpowerh1
  Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using /dev/emcpowerm1
not /dev/sdh1
  Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1 not
/dev/emcpowerg1
  Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using /dev/emcpowern1
not /dev/sdg1
  Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1 not
/dev/emcpowerf1
  Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using /dev/emcpowero1
not /dev/sdf1
  Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1 not
/dev/emcpowere1
  Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using /dev/emcpowerp1
not /dev/sde1
  --- Logical volume ---
  LV Name                /dev/vg01/lv01
  VG Name                vg01
  LV UUID                EhU1pD-Sl3v-0wjm-TVPL-uJSk-HB2H-Xz4FNV
  LV Write Access        read/write
  LV Status              available
  # open                 0
  LV Size                1.00 GB
  Current LE             32
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:7


Any ideas what could it be? Suggestions?

Regards,

-- 
---
Filipe T Miranda
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061012/df1e1770/attachment.htm>

From kanderso at redhat.com  Thu Oct 12 22:06:03 2006
From: kanderso at redhat.com (Kevin Anderson)
Date: Thu, 12 Oct 2006 17:06:03 -0500
Subject: [Linux-cluster] manual fencing issue
In-Reply-To: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>
References: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>
Message-ID: <1160690763.2787.29.camel@dhcp80-204.msp.redhat.com>

On Thu, 2006-10-12 at 16:46 +0200, Eric Lemoine wrote:
> Hi,
> 
> I'm a new member to the linux-cluster mailing list.
> 
> I'm trying to set up a 2-node cluster using RedHat Cluster based on
> OpenAIS, with manual fencing (for now).
> 
> When I reboot a node (l6-z-5), I get the following error messages in
> /var/log/syslog of the other node (l6-z-12):
> 
> Oct 12 16:38:02 l6-z-12 fenced[4508]: fencing node "l6-z-5"
> Oct 12 16:38:02 l6-z-12 fenced[4508]: fence "l6-z-5" failed
> 
> Probably because of that fencing failure, the service located on
> l6-z-5 doesn't failover to l6-z-12. The fenced error messages repeat
> until l6-z-5 rejoins the cluster.
> 
> Does anyone know what's going on?
> 
> My cluster.conf includes this for the fencing part:
> 
>         <fencedevices>
>                 <fencedevice agent="fencemanual" name="last_resort"/>

Try fence_manual as the agent name.

Kevin


From eric.lemoine at gmail.com  Fri Oct 13 05:45:39 2006
From: eric.lemoine at gmail.com (Eric Lemoine)
Date: Fri, 13 Oct 2006 07:45:39 +0200
Subject: [Linux-cluster] manual fencing issue
In-Reply-To: <1160690763.2787.29.camel@dhcp80-204.msp.redhat.com>
References: <5cac192f0610120746h13bd61b1gebb377e1bbbc1c3e@mail.gmail.com>
	<1160690763.2787.29.camel@dhcp80-204.msp.redhat.com>
Message-ID: <5cac192f0610122245g3ca19433v6539459b03c64f58@mail.gmail.com>

On 10/13/06, Kevin Anderson <kanderso at redhat.com> wrote:
> On Thu, 2006-10-12 at 16:46 +0200, Eric Lemoine wrote:
> > Hi,
> >
> > I'm a new member to the linux-cluster mailing list.
> >
> > I'm trying to set up a 2-node cluster using RedHat Cluster based on
> > OpenAIS, with manual fencing (for now).
> >
> > When I reboot a node (l6-z-5), I get the following error messages in
> > /var/log/syslog of the other node (l6-z-12):
> >
> > Oct 12 16:38:02 l6-z-12 fenced[4508]: fencing node "l6-z-5"
> > Oct 12 16:38:02 l6-z-12 fenced[4508]: fence "l6-z-5" failed
> >
> > Probably because of that fencing failure, the service located on
> > l6-z-5 doesn't failover to l6-z-12. The fenced error messages repeat
> > until l6-z-5 rejoins the cluster.
> >
> > Does anyone know what's going on?
> >
> > My cluster.conf includes this for the fencing part:
> >
> >         <fencedevices>
> >                 <fencedevice agent="fencemanual" name="last_resort"/>
>
> Try fence_manual as the agent name.

That works! Thanks,
-- 
Eric


From pcaulfie at redhat.com  Fri Oct 13 07:28:25 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 13 Oct 2006 08:28:25 +0100
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
Message-ID: <452F4019.8040803@redhat.com>

Filipe Miranda wrote:
> Hello everybody,
> 
> I have the following environment and I need some help please.
> 3 machines are GULM lock servers (x86 CPUs)
> 3 machines are ORACLE RAC nodes that will use GFS (x86_64 CPUs)
> Only the RAC nodes have access to the shared storage.
> 
> Using Red Hat Enterprise Linux 4.3, Red Hat Cluster Suite Update4 and
> GFS 6.1 Update3
> 
> I setup the RHEL4 plus RHCS and things are just fine. I edited the
> cluster.conf and add the follwing tag to the lock servers members so
> they wont need access to the shared storage: "clvmd=0"  Then I installed
> the CLVM package on all nodes.
> 
> So far so good I created my Physical Volumes then I created my Volume Group.
> The problem is: When I try to create the Logical Volume I get the
> follwing error:
> 
> [root at rac-node1 ~]# lvcreate -L 1G -n lv01 vg01
>   Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1
> not /dev/emcpowerb1
>   Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using
> /dev/emcpowerc1 not /dev/sdr1
>   Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1
> not /dev/emcpowerq1
>   Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1
> not /dev/emcpowera1
>   Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using
> /dev/emcpowerd1 not /dev/sdq1
>   Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1
> not /dev/emcpowerj1
>   Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using
> /dev/emcpowerk1 not /dev/sdj1
>   Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1
> not /dev/emcpoweri1
>   Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using
> /dev/emcpowerl1 not /dev/sdi1
>   Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1
> not /dev/emcpowerh1
>   Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using
> /dev/emcpowerm1 not /dev/sdh1
>   Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1
> not /dev/emcpowerg1
>   Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using
> /dev/emcpowern1 not /dev/sdg1
>   Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1
> not /dev/emcpowerf1
>   Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using
> /dev/emcpowero1 not /dev/sdf1
>   Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1
> not /dev/emcpowere1
>   Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using
> /dev/emcpowerp1 not /dev/sde1
>   Error locking on node rac-node1: Internal lvm error, check syslog
>   Error locking on node UNKNOWN [0.0.ffff0000.dfc0800a]: Internal lvm
> error, check syslog
>   Error locking on node lock-server1: Internal lvm error, check syslog
>   Error locking on node lock-server2: Internal lvm error, check syslog
>   Error locking on node lock-server3: Internal lvm error, check syslog
>   Failed to activate new LV.
> 
You need to adjust the filters in /etc/lvm/lvm.conf to exclude the /dev/sda
devices

-- 

patrick


From rhurst at bidmc.harvard.edu  Fri Oct 13 11:30:00 2006
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Fri, 13 Oct 2006 07:30:00 -0400
Subject: [Linux-cluster] GFS bug with fcntl/flock ?
In-Reply-To: <1160573097.4042.42.camel@WSBID06223>
References: <1160572166.4042.28.camel@WSBID06223>
	<452CEFA7.3030300@redhat.com> <1160573097.4042.42.camel@WSBID06223>
Message-ID: <1160739000.2575.2.camel@WSBID06223>

Hi Bob,

Was this the information you were looking for?  Short of writing my own,
is there any test suite I can execute to test functionality?


On Wed, 2006-10-11 at 09:24 -0400, Robert Hurst wrote:

> RHEL 4 U4 on dual AMD Opteron servers.  I don't believe this is an
> issue, but none of the Software Compatibility packages were installed
> either.  Thanks.
> 
> ClusterSuite:
> ccs-1.0.7-0.src.rpm             ipvsadm-1.24-6.src.rpm
> cman-1.0.11-0.src.rpm           magma-1.0.6-0.src.rpm
> cman-kernel-2.6.9-45.5.src.rpm  magma-plugins-1.0.6-0.src.rpm
> dlm-1.0.1-1.src.rpm             perl-Net-Telnet-3.03-3.noarch.rpm
> dlm-kernel-2.6.9-42.13.src.rpm  piranha-0.8.2-1.src.rpm
> fence-1.32.25-1.src.rpm         rgmanager-1.9.53-0.src.rpm
> gulm-1.0.7-0.src.rpm
> system-config-cluster-1.0.25-1.0.src.rpm
> iddev-2.0.0-3.src.rpm
> 
> GFS:
> GFS-6.1.6-1.src.rpm            gnbd-kernel-2.6.9-9.44.src.rpm
> GFS-kernel-2.6.9-58.3.src.rpm  libsepol-devel-1.1.1-2.x86_64.rpm
> gnbd-1.0.7-0.src.rpm           lvm2-cluster-2.02.06-7.0.RHEL4.src.rpm
> 
> 
> On Wed, 2006-10-11 at 08:20 -0500, Robert Peterson wrote: 
> 
> > Robert Hurst wrote:
> > > I am running InterSystems Cach? 5.0.19 for AMD64, and an strace output 
> > > from their utility reveals a problem with fcntl/flock on the GFS 
> > > filesystem (mount -t gfs /dev/VGSHARE/lvol0 /usr/local):
> > >
> > >     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
> > >     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
> > >     len=0}) = -1 ENOLCK (No locks available)
> > >
> > >
> > > When I put their files on an ordinary ext3 filesystem, all works fine:
> > >
> > >     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
> > >     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
> > >     len=0}) = 0
> > >
> > >
> > > The man page on fcntl states that when flock is setup with 
> > > whence=SEEK_SET, start=0, and len=0, the lock will occur on the entire 
> > > file... so locking zero bytes in this manner is acceptable.  Help?
> 
> 
> 
> > Hi Robert,
> > 
> > What version of the cluster software and GFS are you using?
> > 
> > Regards,
> > 
> > Bob Peterson
> > Red Hat CLuster Suite
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061013/01eca0a3/attachment.htm>

From eric.lemoine at gmail.com  Fri Oct 13 11:49:17 2006
From: eric.lemoine at gmail.com (Eric Lemoine)
Date: Fri, 13 Oct 2006 13:49:17 +0200
Subject: [Linux-cluster] migratory group
Message-ID: <5cac192f0610130449p1539344ak7ebc2d997ba65223@mail.gmail.com>

Hi,

I'm trying to migrate a Xen domU from one cluster node to another
using clusvcadm. I get the following error message when doing
<clusvcadm -M>:

root at l6-z-5:~# clusvcadm -M domu-0 -m l6-z-12
Trying to relocate service:domu-0 to me...Invalid operation for resource

I looked into the rgmanager code and it seems to me that my group
isn't "migratory", so I get an RG_EINVAL error code.

How do I make my group migratory in the cluster.conf file? I found
neither doc nor examples on the web.

Thanks,
-- 
Eric


From m.catanese at kinetikon.com  Fri Oct 13 12:03:18 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Fri, 13 Oct 2006 14:03:18 +0200
Subject: [Linux-cluster] Unable to connect to cluster infrastructure -
	cluster died
Message-ID: <4BA469DE-8EA6-434E-BCD6-CB799A0DBF86@kinetikon.com>

Hi all,
i had a perfectly working 2-node cluster.

I saw kernel security updates and cluster bugfix update, so i waited  
2 weeks and decided, today, to do the updates

I disabled my cluster service (oracle) , patched both machines and  
rebooted

After reboot i had:

[root at lvzbe1 kernel]# clustat
Could not connect to cluster service

and a bunch of
Oct 13 13:51:55 lvzbe2 ccsd[3381]: Unable to connect to cluster  
infrastructure after 3840 seconds.
Oct 13 13:52:26 lvzbe2 ccsd[3381]: Unable to connect to cluster  
infrastructure after 3870 seconds.
Oct 13 13:52:56 lvzbe2 ccsd[3381]: Unable to connect to cluster  
infrastructure after 3900 seconds.
Oct 13 13:53:26 lvzbe2 ccsd[3381]: Unable to connect to cluster  
infrastructure after 3930 seconds.

Cluster DIED.

I did investigations and i discovered that someone _forgot_ to  
compile dlm-smp and cman-smp for the latest redhat kernel.

this is the "old" kernel:

[root at lvzbe1 kernel]# cd /lib/modules/2.6.9-42.0.2.ELsmp/kernel/
[root at lvzbe1 kernel]# ls -la
total 44
drwxr-xr-x  10 root root 4096 Sep  4 10:17 .
drwxr-xr-x   3 root root 4096 Oct 13 12:56 ..
drwxr-xr-x   3 root root 4096 Sep  4 10:17 arch
drwxr-xr-x   2 root root 4096 Oct 13 12:56 cluster
drwxr-xr-x   2 root root 4096 Sep  4 10:17 crypto
drwxr-xr-x  29 root root 4096 Sep  4 10:17 drivers
drwxr-xr-x  22 root root 4096 Sep  4 10:17 fs
drwxr-xr-x   3 root root 4096 Sep  4 10:17 lib
drwxr-xr-x  13 root root 4096 Sep  4 10:17 net
drwxr-xr-x  10 root root 4096 Sep  4 10:17 sound
[root at lvzbe1 kernel]#


and this is the "new" one:

root at lvzbe1 kernel]# cd /lib/modules/2.6.9-42.0.3.ELsmp/kernel/
[root at lvzbe1 kernel]# ls -la
total 36
drwxr-xr-x   9 root root 4096 Oct 13 12:20 .
drwxr-xr-x   3 root root 4096 Oct 13 12:31 ..
drwxr-xr-x   3 root root 4096 Oct 13 12:20 arch
drwxr-xr-x   2 root root 4096 Oct 13 12:20 crypto
drwxr-xr-x  29 root root 4096 Oct 13 12:20 drivers
drwxr-xr-x  22 root root 4096 Oct 13 12:20 fs
drwxr-xr-x   3 root root 4096 Oct 13 12:20 lib
drwxr-xr-x  13 root root 4096 Oct 13 12:20 net
drwxr-xr-x  10 root root 4096 Oct 13 12:20 sound
[root at lvzbe1 kernel]#


As you can see, the latest kernel does not have the "cluster" directory.

This is the latest cman:

[root at lvzbe1 kernel]# rpm -qil cman-kernel-smp-2.6.9-45.5
Name        : cman-kernel-smp              Relocations: (not  
relocatable)
Version     : 2.6.9                             Vendor: Red Hat, Inc.
Release     : 45.5                          Build Date: Fri 18 Aug  
2006 07:05:34 PM CEST
Install Date: Fri 13 Oct 2006 12:56:36 PM CEST      Build Host: hs20- 
bc1-3.build.redhat.com
Group       : System Environment/Kernel     Source RPM: cman- 
kernel-2.6.9-45.5.src.rpm
Size        : 340198                           License: GPL
Signature   : DSA/SHA1, Tue 22 Aug 2006 09:51:57 PM CEST, Key ID  
219180cddb42a60e
Packager    : Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>
Summary     : cman-kernel-smp - The Cluster Manager kernel smp modules
Description :
cman-kernel-smp - The Cluster Manager kernel smp modules
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/cman.ko
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/cman.symvers
[root at lvzbe1 kernel]#


and this is the latest dlm:

rpm -qil dlm-kernel-smp-2.6.9-44.2
Name        : dlm-kernel-smp               Relocations: (not  
relocatable)
Version     : 2.6.9                             Vendor: Red Hat, Inc.
Release     : 44.2                          Build Date: Tue 26 Sep  
2006 10:49:24 PM CEST
Install Date: Fri 13 Oct 2006 12:20:35 PM CEST      Build Host: hs20- 
bc2-3.build.redhat.com
Group       : System Environment/Kernel     Source RPM: dlm- 
kernel-2.6.9-44.2.src.rpm
Size        : 329858                           License: GPL
Signature   : DSA/SHA1, Thu 28 Sep 2006 09:44:31 PM CEST, Key ID  
219180cddb42a60e
Packager    : Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>
Summary     : dlm-kernel-smp - The Distributed Lock Manager kernel  
modules.
Description :
dlm-kernel-smp - The Distributed Lock Manager kernel-smp modules.
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/dlm.ko
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/dlm.symvers

Luckily this is not (yet) a production system, and i REALLY hope i  
did something wrong, even if im sure i did not.

Can i download cman-kernel-src.rpm and dlm-kernel.src.rpm and compile  
myself, while waiting for answers from you ?


Matteo


From dbrieck at gmail.com  Fri Oct 13 12:16:00 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Fri, 13 Oct 2006 08:16:00 -0400
Subject: [Linux-cluster] Unable to connect to cluster infrastructure -
	cluster died
In-Reply-To: <4BA469DE-8EA6-434E-BCD6-CB799A0DBF86@kinetikon.com>
References: <4BA469DE-8EA6-434E-BCD6-CB799A0DBF86@kinetikon.com>
Message-ID: <8c1094290610130516t68fcdfebtf80f7fd709f9223d@mail.gmail.com>

On 10/13/06, Matteo Catanese <m.catanese at kinetikon.com> wrote:
> Hi all,
> i had a perfectly working 2-node cluster.
>
> I saw kernel security updates and cluster bugfix update, so i waited
> 2 weeks and decided, today, to do the updates
>
> I disabled my cluster service (oracle) , patched both machines and
> rebooted
>
> After reboot i had:
>
> [root at lvzbe1 kernel]# clustat
> Could not connect to cluster service
>
> and a bunch of
> Oct 13 13:51:55 lvzbe2 ccsd[3381]: Unable to connect to cluster
> infrastructure after 3840 seconds.
> Oct 13 13:52:26 lvzbe2 ccsd[3381]: Unable to connect to cluster
> infrastructure after 3870 seconds.
> Oct 13 13:52:56 lvzbe2 ccsd[3381]: Unable to connect to cluster
> infrastructure after 3900 seconds.
> Oct 13 13:53:26 lvzbe2 ccsd[3381]: Unable to connect to cluster
> infrastructure after 3930 seconds.
>
> Cluster DIED.
>
> I did investigations and i discovered that someone _forgot_ to
> compile dlm-smp and cman-smp for the latest redhat kernel.
>
> this is the "old" kernel:
>
> [root at lvzbe1 kernel]# cd /lib/modules/2.6.9-42.0.2.ELsmp/kernel/
> [root at lvzbe1 kernel]# ls -la
> total 44
> drwxr-xr-x  10 root root 4096 Sep  4 10:17 .
> drwxr-xr-x   3 root root 4096 Oct 13 12:56 ..
> drwxr-xr-x   3 root root 4096 Sep  4 10:17 arch
> drwxr-xr-x   2 root root 4096 Oct 13 12:56 cluster
> drwxr-xr-x   2 root root 4096 Sep  4 10:17 crypto
> drwxr-xr-x  29 root root 4096 Sep  4 10:17 drivers
> drwxr-xr-x  22 root root 4096 Sep  4 10:17 fs
> drwxr-xr-x   3 root root 4096 Sep  4 10:17 lib
> drwxr-xr-x  13 root root 4096 Sep  4 10:17 net
> drwxr-xr-x  10 root root 4096 Sep  4 10:17 sound
> [root at lvzbe1 kernel]#
>
>
> and this is the "new" one:
>
> root at lvzbe1 kernel]# cd /lib/modules/2.6.9-42.0.3.ELsmp/kernel/
> [root at lvzbe1 kernel]# ls -la
> total 36
> drwxr-xr-x   9 root root 4096 Oct 13 12:20 .
> drwxr-xr-x   3 root root 4096 Oct 13 12:31 ..
> drwxr-xr-x   3 root root 4096 Oct 13 12:20 arch
> drwxr-xr-x   2 root root 4096 Oct 13 12:20 crypto
> drwxr-xr-x  29 root root 4096 Oct 13 12:20 drivers
> drwxr-xr-x  22 root root 4096 Oct 13 12:20 fs
> drwxr-xr-x   3 root root 4096 Oct 13 12:20 lib
> drwxr-xr-x  13 root root 4096 Oct 13 12:20 net
> drwxr-xr-x  10 root root 4096 Oct 13 12:20 sound
> [root at lvzbe1 kernel]#
>
>
> As you can see, the latest kernel does not have the "cluster" directory.
>
> This is the latest cman:
>
> [root at lvzbe1 kernel]# rpm -qil cman-kernel-smp-2.6.9-45.5
> Name        : cman-kernel-smp              Relocations: (not
> relocatable)
> Version     : 2.6.9                             Vendor: Red Hat, Inc.
> Release     : 45.5                          Build Date: Fri 18 Aug
> 2006 07:05:34 PM CEST
> Install Date: Fri 13 Oct 2006 12:56:36 PM CEST      Build Host: hs20-
> bc1-3.build.redhat.com
> Group       : System Environment/Kernel     Source RPM: cman-
> kernel-2.6.9-45.5.src.rpm
> Size        : 340198                           License: GPL
> Signature   : DSA/SHA1, Tue 22 Aug 2006 09:51:57 PM CEST, Key ID
> 219180cddb42a60e
> Packager    : Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>
> Summary     : cman-kernel-smp - The Cluster Manager kernel smp modules
> Description :
> cman-kernel-smp - The Cluster Manager kernel smp modules
> /lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster
> /lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/cman.ko
> /lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/cman.symvers
> [root at lvzbe1 kernel]#
>
>
> and this is the latest dlm:
>
> rpm -qil dlm-kernel-smp-2.6.9-44.2
> Name        : dlm-kernel-smp               Relocations: (not
> relocatable)
> Version     : 2.6.9                             Vendor: Red Hat, Inc.
> Release     : 44.2                          Build Date: Tue 26 Sep
> 2006 10:49:24 PM CEST
> Install Date: Fri 13 Oct 2006 12:20:35 PM CEST      Build Host: hs20-
> bc2-3.build.redhat.com
> Group       : System Environment/Kernel     Source RPM: dlm-
> kernel-2.6.9-44.2.src.rpm
> Size        : 329858                           License: GPL
> Signature   : DSA/SHA1, Thu 28 Sep 2006 09:44:31 PM CEST, Key ID
> 219180cddb42a60e
> Packager    : Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>
> Summary     : dlm-kernel-smp - The Distributed Lock Manager kernel
> modules.
> Description :
> dlm-kernel-smp - The Distributed Lock Manager kernel-smp modules.
> /lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/dlm.ko
> /lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/dlm.symvers
>
> Luckily this is not (yet) a production system, and i REALLY hope i
> did something wrong, even if im sure i did not.
>
> Can i download cman-kernel-src.rpm and dlm-kernel.src.rpm and compile
> myself, while waiting for answers from you ?
>
>
> Matteo
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>

The cluster packages are kernel specific and lag behind normal kernel
updates. Not sure if they release cluster updates outside the update
cycle though, I haven't been using them for more than two updates.


From m.catanese at kinetikon.com  Fri Oct 13 12:31:33 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Fri, 13 Oct 2006 14:31:33 +0200
Subject: [Linux-cluster] Unable to connect to cluster infrastructure
	-	cluster died
Message-ID: <6425DB9F-59B7-4CF8-95D5-B818C99F01C5@kinetikon.com>

 > The cluster packages are kernel specific and lag behind normal kernel
 > updates. Not sure if they release cluster updates outside the update
 > cycle though, I haven't been using them for more than two updates.

Oh it's a matter of one or two %define in the spec file.

It was a security update and not a feature enhancement, so things  
should not break

I will try to compile (and not install) by myself


Anyway those things  must really not happen.

Matteo


From orkcu at yahoo.com  Fri Oct 13 12:58:16 2006
From: orkcu at yahoo.com (Roger Pe�a Escobio)
Date: Fri, 13 Oct 2006 05:58:16 -0700 (PDT)
Subject: [Linux-cluster] Unable to connect to cluster infrastructure -
	cluster died
In-Reply-To: <4BA469DE-8EA6-434E-BCD6-CB799A0DBF86@kinetikon.com>
Message-ID: <20061013125816.89957.qmail@web50601.mail.yahoo.com>


--- Matteo Catanese <m.catanese at kinetikon.com> wrote:

> This is the latest cman:
> 
> [root at lvzbe1 kernel]# rpm -qil
> cman-kernel-smp-2.6.9-45.5
> Name        : cman-kernel-smp             
> cman-kernel-smp - The Cluster Manager kernel smp
> modules
> /lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster
>
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/cman.ko
>
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/cman.symvers
> [root at lvzbe1 kernel]#
> 
> 
> and this is the latest dlm:
> 
> rpm -qil dlm-kernel-smp-2.6.9-44.2
> Name        : dlm-kernel-smp              
> dlm-kernel-smp - The Distributed Lock Manager
> kernel-smp modules.
>
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/dlm.ko
>
/lib/modules/2.6.9-42.0.2.ELsmp/kernel/cluster/dlm.symvers
> 
> Luckily this is not (yet) a production system, and i
> REALLY hope i  
> did something wrong, even if im sure i did not.

I think you didn't, but maybe next time you might
double check if the lastes cluster kernel module are
as updated as the lastest kernel from redhat ;-)

> 
> Can i download cman-kernel-src.rpm and
> dlm-kernel.src.rpm and compile  
> myself, while waiting for answers from you ?

I think you might try to copy the cluster kernel
module from the old kernel to the new one, them run
depmod (if you are not running the lasted kernel at
that time check the depmod man page to update just the
new kernel's module dependencies)

just a workaround until you get the right package from
redhat

cu
roger


__________________________________________
RedHat Certified Engineer ( RHCE )
Cisco Certified Network Associate ( CCNA )

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From jan.gerrit at kootstra.org.uk  Fri Oct 13 13:14:53 2006
From: jan.gerrit at kootstra.org.uk (Jan Gerrit)
Date: Fri, 13 Oct 2006 15:14:53 +0200
Subject: [Linux-cluster] Does anyone run an Oracle database on a Red Hat
 Cluster without Real Application Cluster?
In-Reply-To: <20061013120327.AAD5A73506@hormel.redhat.com>
References: <20061013120327.AAD5A73506@hormel.redhat.com>
Message-ID: <452F914D.70308@kootstra.org.uk>

Dear list members,


Does anyone on the mailinglist run an Oracle database on a Red Hat 
cluster without using Real Application Cluster (RAC)?

If, so how did you configure it?

If you do use RAC can you give the reasons why you chose to use RAC?

We need to create a database environment that gives a 99.9% uptime based 
on 24 hours per day on a daily bases.


Best regards,


Jan Gerrit Kootstra


From jos at xos.nl  Fri Oct 13 13:40:14 2006
From: jos at xos.nl (Jos Vos)
Date: Fri, 13 Oct 2006 15:40:14 +0200
Subject: [Linux-cluster] Cluster name vs. alias in cluster.conf
Message-ID: <200610131340.k9DDeEn23005@xos037.xos.nl>

Hi,

Since system-config-cluster-1.0.27, the "name" attribute in the "cluster"
node always stays "alpha_cluster" (when editting a new config with the
GUI) and only the "alias" attribute is set to the name of the cluster
specified in the GUI.

Is this a feature or a bug?  Someone also told me that this causes
problems with the locktable names of GFS filesystems, because the
cluster's name (not the alias) is used as part of the locktable name.

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From m.catanese at kinetikon.com  Fri Oct 13 14:58:52 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Fri, 13 Oct 2006 16:58:52 +0200
Subject: [Linux-cluster] Unable to connect to cluster infrastructure
	-	cluster died
Message-ID: <53CDF6B5-73A4-4D26-A98E-C611FA7F1ACF@kinetikon.com>

Just booted with the old kernel and all is working.

Matteo


From jparsons at redhat.com  Fri Oct 13 15:49:22 2006
From: jparsons at redhat.com (James Parsons)
Date: Fri, 13 Oct 2006 11:49:22 -0400
Subject: [Linux-cluster] Cluster name vs. alias in cluster.conf
In-Reply-To: <200610131340.k9DDeEn23005@xos037.xos.nl>
References: <200610131340.k9DDeEn23005@xos037.xos.nl>
Message-ID: <452FB582.2030206@redhat.com>

Jos Vos wrote:

>Hi,
>
>Since system-config-cluster-1.0.27, the "name" attribute in the "cluster"
>node always stays "alpha_cluster" (when editting a new config with the
>GUI) and only the "alias" attribute is set to the name of the cluster
>specified in the GUI.
>
>Is this a feature or a bug?  Someone also told me that this causes
>problems with the locktable names of GFS filesystems, because the
>cluster's name (not the alias) is used as part of the locktable name.
>
>  
>
Great question, Jos.

First of all, the alias attribute for cluster name was added for two 
reasons:
1) First, we started building some tools that made cluster creation 
easier and automated alot of the drudgery. We felt (perhaps incorrectly) 
that the true name of the cluster was probably something most users did 
not care about, so we began using generated alphanumeric strings to name 
clusters and make two clusters created with the deployment tools very 
unlikely, as two clusters with the same name on the same subnet would be 
a very bad thing. So that an administrator could name the clusters 
something convenient to remember what they were (file system, web 
sertvices, etc.) and because of #2 below, we introduced an attribute 
that would only be used by the UI.

2) After a cluster is up and running, changing the name of a cluster 
requires shutting down the cluster and reconfiguring it. If a running 
cluster's purpose were to change, it might be convenient to be able to 
rename it in the UI without having to pull your databases offline for 
such a minor procedure. I guess you would have to change the properties 
on GFS as well, if you were using it. I am not sure if you can do that 
very easily offhand...but it seems a shame to have to remake your FS 
just because you want to call the cluster "Production CNC Services - 
Dept. 1471" one day, instead of "Cluster1"...so this is not meant to be 
smarmy - but rather explain the intended use case for this feature.

So, this is why the alias for cluster name was introduced. Recently I 
was made aware that admins were using the alias'd name of the cluster as 
an arg to mkfs_gfs (or whatever the gfs creation utility is called - i 
forget right now :) and of course, that is a BAD thing. So, I have 
changed the labelling and presentation in the UI to make it very evident 
that the alias'd name is an alias, and in fact we display the true name 
as well.

Now, if you are telling me that creation of any new cluster with 
s-c-cluster produces a cluster named 'alpha_cluster' and not what the 
user specifies, then this is a bug and it will be fixed for the next 
update. The intended action is for the modelbuilder code to check if a 
name exists, and if not, use the alias field for the initial name of the 
cluster.

I will check this out today. Thanks.

-Jim


From dbrieck at gmail.com  Fri Oct 13 17:00:02 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Fri, 13 Oct 2006 13:00:02 -0400
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
In-Reply-To: <8c1094290609181334q7015262csc2479cc6626886fd@mail.gmail.com>
References: <8c1094290609181334q7015262csc2479cc6626886fd@mail.gmail.com>
Message-ID: <8c1094290610131000l7d0a602ey8b1aa551723e8f12@mail.gmail.com>

On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> I've been trying to find more information about using GFS  and MySQL
> to create a simple active-active mysql cluster without the need for
> the actual mysql cluster (wouldn't work for our situation).
>
> The only thing I've seen on the mailing list is the following:
>
> -----
> It is possible to use mysql on shared storage with enabled external locking
> and also disabling the query cache and few other things:
>
> enable-locking
> query_cache_wlock_invalidate
> query_cache_size= 0
> query_cache_type= 0
> delay_key_write = OFF
> flush
>
> in mysqld section
>
>
> this configuration worked for my 10 node cluster .
> ----
>
> but other than that no one has posted anything. I also found this press release:
> http://www.mysql.com/news-and-events/press-release/release_2005_13.html
>
> from mysql and redhat that says:
>
> "MySQL and Red Hat plan to test the MySQL database with Red Hat's
> Cluster Suite and Global File System (GFS). Red Hat GFS allows a
> cluster of MySQL servers to simultaneously read and write data to a
> single shared file system on a SAN, achieving high performance and
> reducing the complexity and overhead of managing redundant data
> copies. With Red Hat Cluster Suite and GFS, MySQL customers can get a
> highly available clustered database solution based on all open source
> technologies."
>
> But once again I can't find any follow up to this. Can anyone give me
> a hand? I'd want to run 3 active mysql servers at least on one set of
> data shared with GFS.
>
> Thanks.
>

Is anyone able to help me find more information on this?


From rpeterso at redhat.com  Fri Oct 13 18:01:23 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Fri, 13 Oct 2006 13:01:23 -0500
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
In-Reply-To: <8c1094290610131000l7d0a602ey8b1aa551723e8f12@mail.gmail.com>
References: <8c1094290609181334q7015262csc2479cc6626886fd@mail.gmail.com>
	<8c1094290610131000l7d0a602ey8b1aa551723e8f12@mail.gmail.com>
Message-ID: <452FD473.1040204@redhat.com>

David Brieck Jr. wrote:
> On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
>> I've been trying to find more information about using GFS  and MySQL
>> to create a simple active-active mysql cluster without the need for
>> the actual mysql cluster (wouldn't work for our situation).
>>
>> The only thing I've seen on the mailing list is the following:
>>
>> -----
>> It is possible to use mysql on shared storage with enabled external 
>> locking
>> and also disabling the query cache and few other things:
>>
>> enable-locking
>> query_cache_wlock_invalidate
>> query_cache_size= 0
>> query_cache_type= 0
>> delay_key_write = OFF
>> flush
>>
>> in mysqld section
>>
>>
>> this configuration worked for my 10 node cluster .
>> ----
>>
>> but other than that no one has posted anything. I also found this 
>> press release:
>> http://www.mysql.com/news-and-events/press-release/release_2005_13.html
>>
>> from mysql and redhat that says:
>>
>> "MySQL and Red Hat plan to test the MySQL database with Red Hat's
>> Cluster Suite and Global File System (GFS). Red Hat GFS allows a
>> cluster of MySQL servers to simultaneously read and write data to a
>> single shared file system on a SAN, achieving high performance and
>> reducing the complexity and overhead of managing redundant data
>> copies. With Red Hat Cluster Suite and GFS, MySQL customers can get a
>> highly available clustered database solution based on all open source
>> technologies."
>>
>> But once again I can't find any follow up to this. Can anyone give me
>> a hand? I'd want to run 3 active mysql servers at least on one set of
>> data shared with GFS.
>>
>> Thanks.
> Is anyone able to help me find more information on this?
Hi David,

Here is everything I know about it at this time:
http://sources.redhat.com/cluster/faq.html#gfs_mysql

Regards,

Bob Peterson
Red Hat Cluster Suite


From jos at xos.nl  Fri Oct 13 18:08:07 2006
From: jos at xos.nl (Jos Vos)
Date: Fri, 13 Oct 2006 20:08:07 +0200
Subject: [Linux-cluster] Cluster name vs. alias in cluster.conf
In-Reply-To: <452FB582.2030206@redhat.com>;
	from jparsons@redhat.com on Fri, Oct 13, 2006 at 11:49:22AM -0400
References: <200610131340.k9DDeEn23005@xos037.xos.nl>
	<452FB582.2030206@redhat.com>
Message-ID: <20061013200807.A24909@xos037.xos.nl>

On Fri, Oct 13, 2006 at 11:49:22AM -0400, James Parsons wrote:

> such a minor procedure. I guess you would have to change the properties 
> on GFS as well, if you were using it. I am not sure if you can do that 
> very easily offhand...but it seems a shame to have to remake your FS 

You can rename the tables (on an unmounted GFS):

   gfs_tool sb <device> table clustername:fsname

> Now, if you are telling me that creation of any new cluster with 
> s-c-cluster produces a cluster named 'alpha_cluster' and not what the 
> user specifies, then this is a bug and it will be fixed for the next 
> update. The intended action is for the modelbuilder code to check if a 

Yes, this (always name it alpha_cluster) seems to be the case :-(.

> I will check this out today. Thanks.

I think I open a bugzilla issue for this, so that I and others can
track it.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From tmornini at engineyard.com  Fri Oct 13 18:48:12 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Fri, 13 Oct 2006 11:48:12 -0700
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
In-Reply-To: <452FD473.1040204@redhat.com>
References: <8c1094290609181334q7015262csc2479cc6626886fd@mail.gmail.com>
	<8c1094290610131000l7d0a602ey8b1aa551723e8f12@mail.gmail.com>
	<452FD473.1040204@redhat.com>
Message-ID: <BC244494-E885-46EA-8073-67A5D3AFEC0E@engineyard.com>

David Brieck Jr. wrote:
> On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
>> I've been trying to find more information about using GFS  and MySQL
>> to create a simple active-active mysql cluster without the need for
>> the actual mysql cluster (wouldn't work for our situation).
>>
>> The only thing I've seen on the mailing list is the following:
>>
>> -----
>> It is possible to use mysql on shared storage with enabled  
>> external locking
>> and also disabling the query cache and few other things:
>>
>> enable-locking
>> query_cache_wlock_invalidate
>> query_cache_size= 0
>> query_cache_type= 0
>> delay_key_write = OFF
>> flush

I too found this, and tried it. It worked well, except for the HUGE  
problem (in my case) that it only works for MyISAM tables.

Additionally, I didn't test it all that well, as it was unsuitable  
otherwise.

-- 
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


From adas at redhat.com  Fri Oct 13 19:15:47 2006
From: adas at redhat.com (Abhijith Das)
Date: Fri, 13 Oct 2006 14:15:47 -0500
Subject: [Linux-cluster] GFS bug with fcntl/flock ?
In-Reply-To: <1160572166.4042.28.camel@WSBID06223>
References: <1160572166.4042.28.camel@WSBID06223>
Message-ID: <452FE5E3.8010603@redhat.com>

Hi Robert,

> I am running InterSystems Cach? 5.0.19 for AMD64, and an strace output 
> from their utility reveals a problem with fcntl/flock on the GFS 
> filesystem (mount -t gfs /dev/VGSHARE/lvol0 /usr/local):
>
>     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
>     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
>     len=0}) = -1 ENOLCK (No locks available)
>
The only place GFS returns ENOLCK is when mandatory locking is turned 
on. It looks like you have the flags for mandatory locking set on this 
particular file. i.e. S_ISGID is set and S_IXGRP is not set. If you do 
an 'ls -al' on this file, you should see something like
    [adas at radium locking]$ ls -al templock
    -rwxr-Sr-x  1 adas adas 6 Oct 13 13:50 templock
The 'S' in the permissions for the group indicate mandatory locking. GFS 
does not support mandatory locking and will refuse to
apply fcntl() or flock() on such files.
You can try doing the same fcntl() with start=0 and len=0 on some other 
regular file on gfs and see if that works for you.

> When I put their files on an ordinary ext3 filesystem, all works fine:
>
>     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
>     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
>     len=0}) = 0
>
When you copied the files over, they might have lost these flags, or 
your ext3 fs supports mandatory locking ('-o mand' mount option)

> The man page on fcntl states that when flock is setup with 
> whence=SEEK_SET, start=0, and len=0, the lock will occur on the entire 
> file... so locking zero bytes in this manner is acceptable.  Help?

This works correctly in my test environment.

Hope this helps.

Regards,
--Abhi


From dbrieck at gmail.com  Fri Oct 13 20:32:01 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Fri, 13 Oct 2006 16:32:01 -0400
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
In-Reply-To: <BC244494-E885-46EA-8073-67A5D3AFEC0E@engineyard.com>
References: <8c1094290609181334q7015262csc2479cc6626886fd@mail.gmail.com>
	<8c1094290610131000l7d0a602ey8b1aa551723e8f12@mail.gmail.com>
	<452FD473.1040204@redhat.com>
	<BC244494-E885-46EA-8073-67A5D3AFEC0E@engineyard.com>
Message-ID: <8c1094290610131332m494b46f2j3a488bf66ffee70e@mail.gmail.com>

On 10/13/06, Tom Mornini <tmornini at engineyard.com> wrote:
> David Brieck Jr. wrote:
> > On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> >> I've been trying to find more information about using GFS  and MySQL
> >> to create a simple active-active mysql cluster without the need for
> >> the actual mysql cluster (wouldn't work for our situation).
> >>
> >> The only thing I've seen on the mailing list is the following:
> >>
> >> -----
> >> It is possible to use mysql on shared storage with enabled
> >> external locking
> >> and also disabling the query cache and few other things:
> >>
> >> enable-locking
> >> query_cache_wlock_invalidate
> >> query_cache_size= 0
> >> query_cache_type= 0
> >> delay_key_write = OFF
> >> flush
>
> I too found this, and tried it. It worked well, except for the HUGE
> problem (in my case) that it only works for MyISAM tables.
>
> Additionally, I didn't test it all that well, as it was unsuitable
> otherwise.
>
> --
> -- Tom Mornini, CTO
> -- Engine Yard, Ruby on Rails Hosting
> -- Reliability, Ease of Use, Scalability
> -- (866) 518-YARD (9273)
>
>

Thanks for your response, I'm trying it as well right now. Not having
InnoDB tables isn't a huge problem for us, we can certainly work
around that with an additional server that just does innodb if we
absolutely need them.

Where there any other configurations you made to get it to work
properly? I'm using LVS to route traffic to the real servers in my
tests.

Thanks


From jan.gerrit at kootstra.org.uk  Fri Oct 13 20:51:16 2006
From: jan.gerrit at kootstra.org.uk (Jan Gerrit)
Date: Fri, 13 Oct 2006 22:51:16 +0200
Subject: [Linux-cluster] Fw: Oracle community responded to my question about
 Oracle cluster without RAC. Use Oracle Data Guard and TAF.
In-Reply-To: <20061013160005.C627773624@hormel.redhat.com>
References: <20061013160005.C627773624@hormel.redhat.com>
Message-ID: <452FFC44.7080704@kootstra.org.uk>

Dear community,


 From Oracle groups I got the suggestion to use Oracle Data Guard 
together with Transparent Application Failover.

Thought to keep you posted and share the info.


Best regards,


Jan Gerrit Kootstra


From tmornini at engineyard.com  Fri Oct 13 21:06:52 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Fri, 13 Oct 2006 14:06:52 -0700
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
In-Reply-To: <8c1094290610131332m494b46f2j3a488bf66ffee70e@mail.gmail.com>
References: <8c1094290609181334q7015262csc2479cc6626886fd@mail.gmail.com>
	<8c1094290610131000l7d0a602ey8b1aa551723e8f12@mail.gmail.com>
	<452FD473.1040204@redhat.com>
	<BC244494-E885-46EA-8073-67A5D3AFEC0E@engineyard.com>
	<8c1094290610131332m494b46f2j3a488bf66ffee70e@mail.gmail.com>
Message-ID: <43A586B5-1ACB-456F-96E1-EBD9FE693A00@engineyard.com>

On Oct 13, 2006, at 1:32 PM, David Brieck Jr. wrote:

> On 10/13/06, Tom Mornini <tmornini at engineyard.com> wrote:
>> David Brieck Jr. wrote:
>> > On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
>> >> I've been trying to find more information about using GFS  and  
>> MySQL
>> >> to create a simple active-active mysql cluster without the need  
>> for
>> >> the actual mysql cluster (wouldn't work for our situation).
>> >>
>> >> The only thing I've seen on the mailing list is the following:
>> >>
>> >> -----
>> >> It is possible to use mysql on shared storage with enabled
>> >> external locking
>> >> and also disabling the query cache and few other things:
>> >>
>> >> enable-locking
>> >> query_cache_wlock_invalidate
>> >> query_cache_size= 0
>> >> query_cache_type= 0
>> >> delay_key_write = OFF
>> >> flush
>>
>> I too found this, and tried it. It worked well, except for the HUGE
>> problem (in my case) that it only works for MyISAM tables.
>>
>> Additionally, I didn't test it all that well, as it was unsuitable
>> otherwise.
>
> Thanks for your response, I'm trying it as well right now. Not having
> InnoDB tables isn't a huge problem for us, we can certainly work
> around that with an additional server that just does innodb if we
> absolutely need them.

You're lucky!

> Where there any other configurations you made to get it to work
> properly? I'm using LVS to route traffic to the real servers in my
> tests.

No. My my.cnf has those exact lines in it.

-- 
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


From mwill at penguincomputing.com  Fri Oct 13 21:10:38 2006
From: mwill at penguincomputing.com (Michael Will)
Date: Fri, 13 Oct 2006 14:10:38 -0700
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
Message-ID: <433093DF7AD7444DA65EFAFE3987879C2B84E4@jellyfish.highlyscyld.com>

So what was the result?

A two node active/active mysql setup operating on the same tables, or
more than just two? 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Tom Mornini
Sent: Friday, October 13, 2006 2:07 PM
To: linux clustering
Subject: Re: [Linux-cluster] Re: Multiple Active MySQL instances

On Oct 13, 2006, at 1:32 PM, David Brieck Jr. wrote:

> On 10/13/06, Tom Mornini <tmornini at engineyard.com> wrote:
>> David Brieck Jr. wrote:
>> > On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
>> >> I've been trying to find more information about using GFS  and
>> MySQL
>> >> to create a simple active-active mysql cluster without the need
>> for
>> >> the actual mysql cluster (wouldn't work for our situation).
>> >>
>> >> The only thing I've seen on the mailing list is the following:
>> >>
>> >> -----
>> >> It is possible to use mysql on shared storage with enabled 
>> >> external locking and also disabling the query cache and few other 
>> >> things:
>> >>
>> >> enable-locking
>> >> query_cache_wlock_invalidate
>> >> query_cache_size= 0
>> >> query_cache_type= 0
>> >> delay_key_write = OFF
>> >> flush
>>
>> I too found this, and tried it. It worked well, except for the HUGE 
>> problem (in my case) that it only works for MyISAM tables.
>>
>> Additionally, I didn't test it all that well, as it was unsuitable 
>> otherwise.
>
> Thanks for your response, I'm trying it as well right now. Not having 
> InnoDB tables isn't a huge problem for us, we can certainly work 
> around that with an additional server that just does innodb if we 
> absolutely need them.

You're lucky!

> Where there any other configurations you made to get it to work 
> properly? I'm using LVS to route traffic to the real servers in my 
> tests.

No. My my.cnf has those exact lines in it.

--
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From tmornini at engineyard.com  Fri Oct 13 21:29:06 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Fri, 13 Oct 2006 14:29:06 -0700
Subject: [Linux-cluster] Re: Multiple Active MySQL instances
In-Reply-To: <433093DF7AD7444DA65EFAFE3987879C2B84E4@jellyfish.highlyscyld.com>
References: <433093DF7AD7444DA65EFAFE3987879C2B84E4@jellyfish.highlyscyld.com>
Message-ID: <D8587AA8-5E53-4E95-8CBE-79ACB27D720F@engineyard.com>

Two nodes, but I'm quite certain more would have worked fine.

On Oct 13, 2006, at 2:10 PM, Michael Will wrote:

> So what was the result?
>
> A two node active/active mysql setup operating on the same tables, or
> more than just two?
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Tom Mornini
> Sent: Friday, October 13, 2006 2:07 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Re: Multiple Active MySQL instances
>
> On Oct 13, 2006, at 1:32 PM, David Brieck Jr. wrote:
>
>> On 10/13/06, Tom Mornini <tmornini at engineyard.com> wrote:
>>> David Brieck Jr. wrote:
>>>> On 9/18/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
>>>>> I've been trying to find more information about using GFS  and
>>> MySQL
>>>>> to create a simple active-active mysql cluster without the need
>>> for
>>>>> the actual mysql cluster (wouldn't work for our situation).
>>>>>
>>>>> The only thing I've seen on the mailing list is the following:
>>>>>
>>>>> -----
>>>>> It is possible to use mysql on shared storage with enabled
>>>>> external locking and also disabling the query cache and few other
>>>>> things:
>>>>>
>>>>> enable-locking
>>>>> query_cache_wlock_invalidate
>>>>> query_cache_size= 0
>>>>> query_cache_type= 0
>>>>> delay_key_write = OFF
>>>>> flush
>>>
>>> I too found this, and tried it. It worked well, except for the HUGE
>>> problem (in my case) that it only works for MyISAM tables.
>>>
>>> Additionally, I didn't test it all that well, as it was unsuitable
>>> otherwise.
>>
>> Thanks for your response, I'm trying it as well right now. Not having
>> InnoDB tables isn't a huge problem for us, we can certainly work
>> around that with an additional server that just does innodb if we
>> absolutely need them.
>
> You're lucky!
>
>> Where there any other configurations you made to get it to work
>> properly? I'm using LVS to route traffic to the real servers in my
>> tests.
>
> No. My my.cnf has those exact lines in it.
>
> --
> -- Tom Mornini, CTO
> -- Engine Yard, Ruby on Rails Hosting
> -- Reliability, Ease of Use, Scalability
> -- (866) 518-YARD (9273)

-- 
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


From filipe.miranda at gmail.com  Fri Oct 13 21:45:52 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Fri, 13 Oct 2006 18:45:52 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <452F4019.8040803@redhat.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
Message-ID: <a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>

Patrick,
The modifications on the /etc/lvm/lvm.conf
From:

 # By default we accept every block device:
    filter = [ "a/.*/" ]
To:
    filter = [ "a/.*/", "r|/dev/sda" ]

But when re restart the machine, and run the command vgdisplay I got this
message:

[root at lock-server1 ~]# vgdisplay
  Failed to create regex device filter
[root at lock-server1 ~]# vgscan
  Failed to create regex device filter

Is the filter correct?


On 10/13/06, Patrick Caulfield <pcaulfie at redhat.com> wrote:
>
> Filipe Miranda wrote:
> > Hello everybody,
> >
> > I have the following environment and I need some help please.
> > 3 machines are GULM lock servers (x86 CPUs)
> > 3 machines are ORACLE RAC nodes that will use GFS (x86_64 CPUs)
> > Only the RAC nodes have access to the shared storage.
> >
> > Using Red Hat Enterprise Linux 4.3, Red Hat Cluster Suite Update4 and
> > GFS 6.1 Update3
> >
> > I setup the RHEL4 plus RHCS and things are just fine. I edited the
> > cluster.conf and add the follwing tag to the lock servers members so
> > they wont need access to the shared storage: "clvmd=0"  Then I installed
> > the CLVM package on all nodes.
> >
> > So far so good I created my Physical Volumes then I created my Volume
> Group.
> > The problem is: When I try to create the Logical Volume I get the
> > follwing error:
> >
> > [root at rac-node1 ~]# lvcreate -L 1G -n lv01 vg01
> >   Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1
> > not /dev/emcpowerb1
> >   Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using
> > /dev/emcpowerc1 not /dev/sdr1
> >   Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1
> > not /dev/emcpowerq1
> >   Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1
> > not /dev/emcpowera1
> >   Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using
> > /dev/emcpowerd1 not /dev/sdq1
> >   Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1
> > not /dev/emcpowerj1
> >   Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using
> > /dev/emcpowerk1 not /dev/sdj1
> >   Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1
> > not /dev/emcpoweri1
> >   Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using
> > /dev/emcpowerl1 not /dev/sdi1
> >   Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1
> > not /dev/emcpowerh1
> >   Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using
> > /dev/emcpowerm1 not /dev/sdh1
> >   Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1
> > not /dev/emcpowerg1
> >   Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using
> > /dev/emcpowern1 not /dev/sdg1
> >   Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1
> > not /dev/emcpowerf1
> >   Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using
> > /dev/emcpowero1 not /dev/sdf1
> >   Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1
> > not /dev/emcpowere1
> >   Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using
> > /dev/emcpowerp1 not /dev/sde1
> >   Error locking on node rac-node1: Internal lvm error, check syslog
> >   Error locking on node UNKNOWN [0.0.ffff0000.dfc0800a]: Internal lvm
> > error, check syslog
> >   Error locking on node lock-server1: Internal lvm error, check syslog
> >   Error locking on node lock-server2: Internal lvm error, check syslog
> >   Error locking on node lock-server3: Internal lvm error, check syslog
> >   Failed to activate new LV.
> >
> You need to adjust the filters in /etc/lvm/lvm.conf to exclude the
> /dev/sda
> devices
>
> --
>
> patrick
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061013/7477437b/attachment.htm>

From filipe.miranda at gmail.com  Fri Oct 13 21:50:24 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Fri, 13 Oct 2006 18:50:24 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
Message-ID: <a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>

Patrick,

Ops, my bad, I'll try this one:

filter = [ "a/.*/", "r|/dev/sda|" ] (forgot to use the close pipe)

I'll keep you posted.


On 10/13/06, Filipe Miranda <filipe.miranda at gmail.com> wrote:
>
> Patrick,
> The modifications on the /etc/lvm/lvm.conf
> From:
>
>  # By default we accept every block device:
>     filter = [ "a/.*/" ]
> To:
>     filter = [ "a/.*/", "r|/dev/sda" ]
>
> But when re restart the machine, and run the command vgdisplay I got this
> message:
>
> [root at lock-server1 ~]# vgdisplay
>   Failed to create regex device filter
> [root at lock-server1 ~]# vgscan
>
>   Failed to create regex device filter
>
> Is the filter correct?
>
>
>
> On 10/13/06, Patrick Caulfield < pcaulfie at redhat.com> wrote:
> >
> > Filipe Miranda wrote:
> > > Hello everybody,
> > >
> > > I have the following environment and I need some help please.
> > > 3 machines are GULM lock servers (x86 CPUs)
> > > 3 machines are ORACLE RAC nodes that will use GFS (x86_64 CPUs)
> > > Only the RAC nodes have access to the shared storage.
> > >
> > > Using Red Hat Enterprise Linux 4.3, Red Hat Cluster Suite Update4 and
> > > GFS 6.1 Update3
> > >
> > > I setup the RHEL4 plus RHCS and things are just fine. I edited the
> > > cluster.conf and add the follwing tag to the lock servers members so
> > > they wont need access to the shared storage: "clvmd=0"  Then I
> > installed
> > > the CLVM package on all nodes.
> > >
> > > So far so good I created my Physical Volumes then I created my Volume
> > Group.
> > > The problem is: When I try to create the Logical Volume I get the
> > > follwing error:
> > >
> > > [root at rac-node1 ~]# lvcreate -L 1G -n lv01 vg01
> > >   Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1
> >
> > > not /dev/emcpowerb1
> > >   Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using
> > > /dev/emcpowerc1 not /dev/sdr1
> > >   Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1
> > > not /dev/emcpowerq1
> > >   Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1
> > > not /dev/emcpowera1
> > >   Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using
> > > /dev/emcpowerd1 not /dev/sdq1
> > >   Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1
> >
> > > not /dev/emcpowerj1
> > >   Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using
> > > /dev/emcpowerk1 not /dev/sdj1
> > >   Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1
> > > not /dev/emcpoweri1
> > >   Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using
> > > /dev/emcpowerl1 not /dev/sdi1
> > >   Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1
> > > not /dev/emcpowerh1
> > >   Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using
> > > /dev/emcpowerm1 not /dev/sdh1
> > >   Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1
> > > not /dev/emcpowerg1
> > >   Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using
> > > /dev/emcpowern1 not /dev/sdg1
> > >   Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1
> > > not /dev/emcpowerf1
> > >   Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using
> > > /dev/emcpowero1 not /dev/sdf1
> > >   Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1
> >
> > > not /dev/emcpowere1
> > >   Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using
> > > /dev/emcpowerp1 not /dev/sde1
> > >   Error locking on node rac-node1: Internal lvm error, check syslog
> > >   Error locking on node UNKNOWN [ 0.0.ffff0000.dfc0800a]: Internal lvm
> > > error, check syslog
> > >   Error locking on node lock-server1: Internal lvm error, check syslog
> > >   Error locking on node lock-server2: Internal lvm error, check syslog
> >
> > >   Error locking on node lock-server3: Internal lvm error, check syslog
> > >   Failed to activate new LV.
> > >
> > You need to adjust the filters in /etc/lvm/lvm.conf to exclude the
> > /dev/sda
> > devices
> >
> > --
> >
> > patrick
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
>
> --
> ---
> Filipe T Miranda
> Red Hat Certified Engineer


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061013/8aa22aa1/attachment.htm>

From agk at redhat.com  Fri Oct 13 21:54:00 2006
From: agk at redhat.com (Alasdair G Kergon)
Date: Fri, 13 Oct 2006 22:54:00 +0100
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
Message-ID: <20061013215400.GK17654@agk.surrey.redhat.com>

On Fri, Oct 13, 2006 at 06:50:24PM -0300, Filipe Miranda wrote:
> filter = [ "a/.*/", "r|/dev/sda|" ] (forgot to use the close pipe)
 
Still won't work - swap the order of the two items; first matches
everything so second isn't looked at.
(man lvm.conf)

Alasdair
-- 
agk at redhat.com


From filipe.miranda at gmail.com  Fri Oct 13 22:04:39 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Fri, 13 Oct 2006 19:04:39 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <20061013215400.GK17654@agk.surrey.redhat.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
	<20061013215400.GK17654@agk.surrey.redhat.com>
Message-ID: <a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>

Alasdair,

Do I need to reboot the machine to test this configurations changes? Or is
there a way do testing it without rebooting the machine?

I will try to swap the order of the filters,

Thanks for the hint

On 10/13/06, Alasdair G Kergon <agk at redhat.com> wrote:
>
> On Fri, Oct 13, 2006 at 06:50:24PM -0300, Filipe Miranda wrote:
> > filter = [ "a/.*/", "r|/dev/sda|" ] (forgot to use the close pipe)
>
> Still won't work - swap the order of the two items; first matches
> everything so second isn't looked at.
> (man lvm.conf)
>
> Alasdair
> --
> agk at redhat.com
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061013/62dda3fd/attachment.htm>

From tmornini at engineyard.com  Fri Oct 13 23:51:09 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Fri, 13 Oct 2006 16:51:09 -0700
Subject: [Linux-cluster] dlm: no version for "kcl_register_service" found:
	kernel tainted
References: <20061013194123.GB25294@proton.souja.net>
Message-ID: <B8176801-8724-49C8-8BA7-7C91A9F7AC4B@engineyard.com>

Hello all.

We're getting problems when adding cluster nodes to our cluster.

Everything has been reasonable and stable until this point.

Here's the syslog with a great deal of context:

Oct 13 04:08:53 ey00-s00017 kernel: VFS: Mounted root (reiserfs  
filesystem) read only.
Oct 13 04:08:53 ey00-s00017 kernel: Adding 262136k swap on /dev/ 
sda2.  Priority: -1 extents:1 across:262136k
Oct 13 04:09:04 ey00-s00017 kernel: CMAN 1.03.00 (built Sep  8 2006  
03:49:59) installed
Oct 13 04:09:04 ey00-s00017 kernel: NET: Registered protocol family 30
Oct 13 04:09:04 ey00-s00017 kernel: CMAN: Waiting to join or form a  
Linux-cluster
Oct 13 04:09:05 ey00-s00017 kernel: CMAN: sending membership request
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00025
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00019
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00030
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00024
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00010
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00016
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00004
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00011
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00005
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00009
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00002
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00015
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00014
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00008
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00003
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00006
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00012
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00013
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00007
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00001
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00000
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-04
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-05
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-03
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-00
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-01
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-02
Oct 13 04:09:06 ey00-s00017 kernel: dlm: no version for  
"kcl_register_service" found: kernel tainted.
Oct 13 04:09:06 ey00-s00017 kernel: DLM 1.03.00 (built Sep  8 2006  
03:50:23) installed
Oct 13 04:09:57 ey00-s00017 kernel: CMAN: node ey00-s00018 rejoining
Oct 13 04:17:18 ey00-s00017 kernel: CMAN: got WAIT barrier not in  
phase 1 TRANSITION.96 (2)

Same thing on a different slice:

Oct 13 04:08:36 ey00-s00018 kernel: VFS: Mounted root (reiserfs  
filesystem) readonly.
Oct 13 04:08:36 ey00-s00018 kernel: Adding 262136k swap on /dev/ 
sda2.  Priority:-1 extents:1 across:262136k
Oct 13 04:08:48 ey00-s00018 kernel: CMAN 1.03.00 (built Sep  8 2006  
03:49:59) installed
Oct 13 04:08:48 ey00-s00018 kernel: NET: Registered protocol family 30
Oct 13 04:08:48 ey00-s00018 kernel: CMAN: Waiting to join or form a  
Linux-cluster
Oct 13 04:08:49 ey00-s00018 kernel: CMAN: sending membership request
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00025
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00019
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00030
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00024
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00010
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00016
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-02
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-01
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-00
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-03
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-05
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-04
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00000
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00001
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00007
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00013
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00012
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00006
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00003
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00008
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00014
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00015
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00002
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00009
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00005
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00011
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00017
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00004
Oct 13 04:08:50 ey00-s00018 kernel: dlm: no version for  
"kcl_register_service" found: kernel tainted.
Oct 13 04:08:50 ey00-s00018 kernel: DLM 1.03.00 (built Sep  8 2006  
03:50:23) installed
Oct 13 04:16:10 ey00-s00018 kernel: CMAN: got WAIT barrier not in  
phase 1 TRANSITION.96 (2)

-- 
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


From filipe.miranda at gmail.com  Sat Oct 14 00:39:25 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Fri, 13 Oct 2006 21:39:25 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
	<20061013215400.GK17654@agk.surrey.redhat.com>
	<a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>
Message-ID: <a6d13c780610131739l2af78833t257aaf55c8a4e84b@mail.gmail.com>

Alasdair,


We did the modifications but still the same error:

Since the lock servers does not have access to the LUNs that will be in use
by the GFS I used the option:

<clusternode name="lock-node3" votes="1" clvmd="0">

Only the rac-nodes have access to the shared LUNs...


[root at rac-node1 ~]# lvcreate -L4G -n lv01 vg01

  Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1 not
/dev/emcpowerb1

  Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using /dev/emcpowerc1
not /dev/sdr1

  Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1 not
/dev/emcpowerq1

  Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1 not
/dev/emcpowera1

  Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using /dev/emcpowerd1
not /dev/sdq1

  Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1 not
/dev/emcpowerj1

  Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using /dev/emcpowerk1
not /dev/sdj1

  Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1 not
/dev/emcpoweri1

  Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using /dev/emcpowerl1
not /dev/sdi1

  Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1 not
/dev/emcpowerh1

  Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using /dev/emcpowerm1
not /dev/sdh1

  Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1 not
/dev/emcpowerg1

  Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using /dev/emcpowern1
not /dev/sdg1

  Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1 not
/dev/emcpowerf1

  Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using /dev/emcpowero1
not /dev/sdf1

  Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1 not
/dev/emcpowere1

  Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using /dev/emcpowerp1
not /dev/sde1

  Error locking on node lock-node1: Internal lvm error, check syslog

  Error locking on node lock-node2: Internal lvm error, check syslog

  Error locking on node lock-node3: Internal lvm error, check syslog
  Failed to activate new LV.

File:
/var/log/messages:

Oct 13 21:20:46 lock-node1 lvm[5478]: Volume group for uuid not found:
UzvBBmBj7m53APMbye1XXztWjdIavfgX8L5rGTOB3i3KGYPazw1AVaGCmWsXZpqR


Here is the cluster.conf

[root at rac-node1 ~]# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="cluster" config_version="6" name="cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="120"/>
        <clusternodes>
                <clusternode name="lock-node1" votes="1" clvmd="0">
                        <fence>
                                <method name="1">
                                        <device name="lock-node1-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="lock-node2" votes="1" clvmd="0">
                        <fence>
                                <method name="1">
                                        <device name="lock-node2-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="lock-node3" votes="1" clvmd="0">
                        <fence>
                                <method name="1">
                                        <device name="lock-node3-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="rac-node1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="rac-node1-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="rac-node2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="rac-node2-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="rac-node3" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="rac-node3-fence"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <gulm>
                <lockserver name="lock-node1"/>
                <lockserver name="lock-node2"/>
                <lockserver name="lock-node3"/>
        </gulm>
        <fencedevices>
                <fencedevice agent="fence_ipmilan" auth="none"
ipaddr="20.20.20.4" login="xxxx" name="lock-node1-fence" passwd="xxxx"/>
                <fencedevice agent="fence_ipmilan" auth="none"
ipaddr="20.20.20.5" login="xxxx" name="lock-node2-fence" passwd="xxxx"/>
                <fencedevice agent="fence_ipmilan" auth="none"
ipaddr="20.20.20.6" login="xxxx" name="lock-node3-fence" passwd="xxxx"/>
                <fencedevice agent="fence_ipmilan" auth="none"
ipaddr="20.20.20.1" login="xxxx" name="rac-node1-fence" passwd="xxxx"/>
                <fencedevice agent="fence_ipmilan" auth="none"
ipaddr="20.20.20.2" login="xxxx" name="rac-node2-fence" passwd="xxxx"/>
                <fencedevice agent="fence_ipmilan" auth="none"
ipaddr="20.20.20.3" login="xxxx" name="rac-node3-fence" passwd="xxxx"/>
        </fencedevices>
        <rm>
                <failoverdomains/>
                <resources/>
        </rm>
</cluster>


/etc/lvm/lvm.conf

# By default we accept every block device:

    # filter = [ "a/.*/" ]

    filter = [ "r|/dev/sda|", "a/.*/" ]


Regards,
Filipe Miranda

On 10/13/06, Filipe Miranda <filipe.miranda at gmail.com> wrote:
>
> Alasdair,
>
> Do I need to reboot the machine to test this configurations changes? Or is
> there a way do testing it without rebooting the machine?
>
> I will try to swap the order of the filters,
>
> Thanks for the hint
>
> On 10/13/06, Alasdair G Kergon <agk at redhat.com> wrote:
> >
> > On Fri, Oct 13, 2006 at 06:50:24PM -0300, Filipe Miranda wrote:
> > > filter = [ "a/.*/", "r|/dev/sda|" ] (forgot to use the close pipe)
> >
> > Still won't work - swap the order of the two items; first matches
> > everything so second isn't looked at.
> > (man lvm.conf)
> >
> > Alasdair
> > --
> > agk at redhat.com
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
>
> --
> ---
> Filipe T Miranda
> Red Hat Certified Engineer
>


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061013/8afbe9b1/attachment.htm>

From linuxr at gmail.com  Sat Oct 14 02:52:23 2006
From: linuxr at gmail.com (Marc )
Date: Fri, 13 Oct 2006 22:52:23 -0400
Subject: [Linux-cluster] Fw: Oracle community responded to my question
	about Oracle cluster without RAC. Use Oracle Data Guard and TAF.
In-Reply-To: <452FFC44.7080704@kootstra.org.uk>
References: <20061013160005.C627773624@hormel.redhat.com>
	<452FFC44.7080704@kootstra.org.uk>
Message-ID: <b60a966f0610131952j6115188aga18559cf7ebb2f0d@mail.gmail.com>

Jan,

That's great that you got a response from that community.  That is, unlike
this community which often doesn't seem to offer specific help when asked.

Marc


On 10/13/06, Jan Gerrit <jan.gerrit at kootstra.org.uk> wrote:
>
> Dear community,
>
>
> From Oracle groups I got the suggestion to use Oracle Data Guard
> together with Transparent Application Failover.
>
> Thought to keep you posted and share the info.
>
>
> Best regards,
>
>
> Jan Gerrit Kootstra
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061013/d819126e/attachment.htm>

From rhurst at bidmc.harvard.edu  Sat Oct 14 13:35:52 2006
From: rhurst at bidmc.harvard.edu (Robert Hurst)
Date: Sat, 14 Oct 2006 09:35:52 -0400
Subject: [Linux-cluster] GFS bug with fcntl/flock ?
In-Reply-To: <452FE5E3.8010603@redhat.com>
References: <1160572166.4042.28.camel@WSBID06223> <452FE5E3.8010603@redhat.com>
Message-ID: <1160832952.2668.3.camel@WSBID06223>

Perfect, thank you!!

[root at net1 ~]# cd /usr/local/etc/cachesys
[root at net1 cachesys]# ll
total 68
-rw-r-Sr--  1 root root  3072 Oct 10 09:22 cache.reg
-r-xr-xr-x  1 root root 35221 Oct 10 09:22 ccontrol
-r-xr-xr-x  1 root root 16609 Oct 10 09:22 csession
[root at net1 cachesys]# chmod -s cache.reg
[root at net1 cachesys]# ll
total 68
-rw-r--r--  1 root root  3072 Oct 10 09:22 cache.reg
-r-xr-xr-x  1 root root 35221 Oct 10 09:22 ccontrol
-r-xr-xr-x  1 root root 16609 Oct 10 09:22 csession
[root at net1 cachesys]# ccontrol all
    Configuration     Version ID        Port   Directory
    ----------------  ----------------  -----
--------------------------------
dn >TOBY              5.0.19.6202.3            /toby/sys


On Fri, 2006-10-13 at 14:15 -0500, Abhijith Das wrote:

> Hi Robert,
> 
> > I am running InterSystems Cach? 5.0.19 for AMD64, and an strace output 
> > from their utility reveals a problem with fcntl/flock on the GFS 
> > filesystem (mount -t gfs /dev/VGSHARE/lvol0 /usr/local):
> >
> >     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
> >     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
> >     len=0}) = -1 ENOLCK (No locks available)
> >
> The only place GFS returns ENOLCK is when mandatory locking is turned 
> on. It looks like you have the flags for mandatory locking set on this 
> particular file. i.e. S_ISGID is set and S_IXGRP is not set. If you do 
> an 'ls -al' on this file, you should see something like
>     [adas at radium locking]$ ls -al templock
>     -rwxr-Sr-x  1 adas adas 6 Oct 13 13:50 templock
> The 'S' in the permissions for the group indicate mandatory locking. GFS 
> does not support mandatory locking and will refuse to
> apply fcntl() or flock() on such files.
> You can try doing the same fcntl() with start=0 and len=0 on some other 
> regular file on gfs and see if that works for you.
> 
> > When I put their files on an ordinary ext3 filesystem, all works fine:
> >
> >     open("/usr/local/etc/cachesys/cache.reg", O_RDONLY|O_NONBLOCK) = 3
> >     fcntl(3, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, start=0,
> >     len=0}) = 0
> >
> When you copied the files over, they might have lost these flags, or 
> your ext3 fs supports mandatory locking ('-o mand' mount option)
> 
> > The man page on fcntl states that when flock is setup with 
> > whence=SEEK_SET, start=0, and len=0, the lock will occur on the entire 
> > file... so locking zero bytes in this manner is acceptable.  Help?
> 
> This works correctly in my test environment.
> 
> Hope this helps.
> 
> Regards,
> --Abhi
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


Robert Hurst, Sr. Cach? Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ? Fax: 617-754-8730 ? Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061014/c467a207/attachment.htm>

From jparsons at redhat.com  Sat Oct 14 16:16:48 2006
From: jparsons at redhat.com (Jim Parsons)
Date: Sat, 14 Oct 2006 12:16:48 -0400
Subject: [Linux-cluster] CLVM error 6 nodes
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>	<452F4019.8040803@redhat.com>	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>	<20061013215400.GK17654@agk.surrey.redhat.com>	<a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>
	<a6d13c780610131739l2af78833t257aaf55c8a4e84b@mail.gmail.com>
Message-ID: <45310D70.2040407@redhat.com>

Filipe Miranda wrote:

> Alasdair,
>
>
> We did the modifications but still the same error:
>
> Since the lock servers does not have access to the LUNs that will be 
> in use by the GFS I used the option:
>
><clusternode name="lock-node3" votes="1" clvmd="0">
>
Um, I may be out on a limb here, but I know of no clvmd attribute under 
clusternode in our cluster.conf schema.

-Jim

> Only the rac-nodes have access to the shared LUNs...
>
>
> [root at rac-node1 ~]# lvcreate -L4G -n lv01 vg01
>
>   Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1 
> not /dev/emcpowerb1
>
>   Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using 
> /dev/emcpowerc1 not /dev/sdr1
>
>   Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1 
> not /dev/emcpowerq1
>
>   Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1 
> not /dev/emcpowera1
>
>   Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using 
> /dev/emcpowerd1 not /dev/sdq1
>
>   Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1 
> not /dev/emcpowerj1
>
>   Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using 
> /dev/emcpowerk1 not /dev/sdj1
>
>   Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1 
> not /dev/emcpoweri1
>
>   Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using 
> /dev/emcpowerl1 not /dev/sdi1
>
>   Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1 
> not /dev/emcpowerh1
>
>   Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using 
> /dev/emcpowerm1 not /dev/sdh1
>
>   Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1 
> not /dev/emcpowerg1
>
>   Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using 
> /dev/emcpowern1 not /dev/sdg1
>
>   Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1 
> not /dev/emcpowerf1
>
>   Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using 
> /dev/emcpowero1 not /dev/sdf1
>
>   Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1 
> not /dev/emcpowere1
>
>   Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using 
> /dev/emcpowerp1 not /dev/sde1
>
>   Error locking on node lock-node1: Internal lvm error, check syslog
>
>   Error locking on node lock-node2: Internal lvm error, check syslog
>
>   Error locking on node lock-node3: Internal lvm error, check syslog
>
>   Failed to activate new LV.
>
> File:
> /var/log/messages:
>
> Oct 13 21:20:46 lock-node1 lvm[5478]: Volume group for uuid not found: 
> UzvBBmBj7m53APMbye1XXztWjdIavfgX8L5rGTOB3i3KGYPazw1AVaGCmWsXZpqR
>
>
> Here is the cluster.conf
>
>[root at rac-node1 ~]# cat /etc/cluster/cluster.conf
><?xml version="1.0
>"?>
><cluster alias="cluster" config_version="6" name="cluster">
>        <fence_daemon post_fail_delay="0" post_join_delay="120"/>
>        <clusternodes>
>
>                <clusternode name="lock-node1" votes="1" clvmd="0">
>                        <fence>
>                                <method name="1">
>
>                                        <device name="lock-node1-fence"/>
>                                </method>
>                        </fence>
>                </clusternode>
>
>                <clusternode name="lock-node2" votes="1" clvmd="0">
>                        <fence>
>
>                                <method name="1">
>                                        <device name="lock-node2-fence"/>
>                                </method>
>                        </fence>
>
>                </clusternode>
>                <clusternode name="lock-node3" votes="1" clvmd="0">
>                        <fence>
>                                <method name="1">
>
>                                        <device name="lock-node3-fence"/>
>                                </method>
>                        </fence>
>                </clusternode>
>
>                <clusternode name="rac-node1" votes="1">
>                        <fence>
>                                <method name="1">
>                                        <device name="rac-node1-fence"/>
>
>                                </method>
>                        </fence>
>                </clusternode>
>                <clusternode name="rac-node2" votes="1">
>
>                        <fence>
>                                <method name="1">
>                                        <device name="rac-node2-fence"/>
>                                </method>
>
>                        </fence>
>                </clusternode>
>                <clusternode name="rac-node3" votes="1">
>                        <fence>
>                                <method name="1">
>
>                                        <device name="rac-node3-fence"/>
>                                </method>
>                        </fence>
>                </clusternode>
>
>        </clusternodes>
>        <gulm>
>                <lockserver name="lock-node1"/>
>                <lockserver name="lock-node2"/>
>                <lockserver name="lock-node3"/>
>
>        </gulm>
>        <fencedevices>
>                <fencedevice agent="fence_ipmilan" auth="none"
>ipaddr="20.20.20.4 <http://20.20.20.4>" login="xxxx" name="lock-node1-fence" passwd="xxxx"/>
>
>                <fencedevice agent="fence_ipmilan" auth="none"
>ipaddr="20.20.20.5 <http://20.20.20.5>" login="xxxx" name="lock-node2-fence" passwd="xxxx"/>
>
>                <fencedevice agent="fence_ipmilan" auth="none"
>ipaddr="20.20.20.6 <http://20.20.20.6>" login="xxxx" name="lock-node3-fence" passwd="xxxx"/>
>
>                <fencedevice agent="fence_ipmilan" auth="none"
>ipaddr="20.20.20.1 <http://20.20.20.1>" login="xxxx" name="rac-node1-fence" passwd="xxxx"/>
>
>                <fencedevice agent="fence_ipmilan" auth="none"
>ipaddr="20.20.20.2 <http://20.20.20.2>" login="xxxx" name="rac-node2-fence" passwd="xxxx"/>
>
>                <fencedevice agent="fence_ipmilan" auth="none"
>ipaddr="20.20.20.3 <http://20.20.20.3>" login="xxxx" name="rac-node3-fence" passwd="xxxx"/>
>
>        </fencedevices>
>        <rm>
>                <failoverdomains/>
>                <resources/>
>        </rm>
></cluster>
>
>
> /etc/lvm/lvm.conf
>
> # By default we accept every block device:
>
>     # filter = [ "a/.*/" ]
>
>     filter = [ "r|/dev/sda|", "a/.*/" ]
>
>  
>
> Regards,
> Filipe Miranda
>
> On 10/13/06, *Filipe Miranda* <filipe.miranda at gmail.com 
> <mailto:filipe.miranda at gmail.com>> wrote:
>
>     Alasdair,
>
>     Do I need to reboot the machine to test this configurations
>     changes? Or is there a way do testing it without rebooting the
>     machine?
>
>     I will try to swap the order of the filters,
>
>     Thanks for the hint
>
>
>     On 10/13/06, *Alasdair G Kergon* < agk at redhat.com
>     <mailto:agk at redhat.com> > wrote:
>
>         On Fri, Oct 13, 2006 at 06:50:24PM -0300, Filipe Miranda wrote:
>> filter = [ "a/.*/", "r|/dev/sda|" ] (forgot to use the close
>         pipe)
>
>         Still won't work - swap the order of the two items; first matches
>         everything so second isn't looked at.
>         (man lvm.conf)
>
>         Alasdair
>         --
>         agk at redhat.com <mailto:agk at redhat.com>
>
>         --
>         Linux-cluster mailing list
>         Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>         https://www.redhat.com/mailman/listinfo/linux-cluster
>         <https://www.redhat.com/mailman/listinfo/linux-cluster>
>
>
>
>
>     -- 
>     ---
>     Filipe T Miranda
>     Red Hat Certified Engineer
>
>
>
>
> -- 
> ---
> Filipe T Miranda
> Red Hat Certified Engineer
>
>
>------------------------------------------------------------------------
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>


From filipe.miranda at gmail.com  Sat Oct 14 17:25:09 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Sat, 14 Oct 2006 14:25:09 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <45310D70.2040407@redhat.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
	<20061013215400.GK17654@agk.surrey.redhat.com>
	<a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>
	<a6d13c780610131739l2af78833t257aaf55c8a4e84b@mail.gmail.com>
	<45310D70.2040407@redhat.com>
Message-ID: <a6d13c780610141025h765fc8b1k69fef384f2c84ea2@mail.gmail.com>

You are right Jim,

It's clvm=0, not clvmd=0!!

I wish I could have a document that has all the possible tags, options that
I could use on cluster.conf...

I Found one on the Cluster Projetc, but I think its not complete...

Thanks for the hint!

Regards,
Filipe Miranda

On 10/14/06, Jim Parsons <jparsons at redhat.com> wrote:
>
> Filipe Miranda wrote:
>
> > Alasdair,
> >
> >
> > We did the modifications but still the same error:
> >
> > Since the lock servers does not have access to the LUNs that will be
> > in use by the GFS I used the option:
> >
> ><clusternode name="lock-node3" votes="1" clvmd="0">
> >
> Um, I may be out on a limb here, but I know of no clvmd attribute under
> clusternode in our cluster.conf schema.
>
> -Jim
>
> > Only the rac-nodes have access to the shared LUNs...
> >
> >
> > [root at rac-node1 ~]# lvcreate -L4G -n lv01 vg01
> >
> >   Found duplicate PV o2Xf8uUmskTL5fiVQAgY0nJ1ZJSMA9U3: using /dev/sds1
> > not /dev/emcpowerb1
> >
> >   Found duplicate PV 3yduFLdX3FWPWWMb9lIbtBf3JIPjYnHF: using
> > /dev/emcpowerc1 not /dev/sdr1
> >
> >   Found duplicate PV Kyha3qI6nVE4odg77UXf7igS3FenGJNn: using /dev/sdd1
> > not /dev/emcpowerq1
> >
> >   Found duplicate PV WS1LyhqQ8HaE2fIuSnXNd5sgTRtNzNAJ: using /dev/sdt1
> > not /dev/emcpowera1
> >
> >   Found duplicate PV DuBJ7dZsS3PIO7n5U6hINxPkWorZDzvx: using
> > /dev/emcpowerd1 not /dev/sdq1
> >
> >   Found duplicate PV ZECZzAtbA0e9pFbl9oL0lZg4q7fkS5x4: using /dev/sdk1
> > not /dev/emcpowerj1
> >
> >   Found duplicate PV bnVVmL6WhS2mesnOFUkT4fEfR0cFhybD: using
> > /dev/emcpowerk1 not /dev/sdj1
> >
> >   Found duplicate PV XyXrg2zdxxMS5jo03f9I4QYtGM3ILLGV: using /dev/sdl1
> > not /dev/emcpoweri1
> >
> >   Found duplicate PV SLE5v2eTD7cJlpRUDGG35xfXkRbW86i1: using
> > /dev/emcpowerl1 not /dev/sdi1
> >
> >   Found duplicate PV acGyUd2wX7FnOF94Cbt0ombp10iUWMSf: using /dev/sdm1
> > not /dev/emcpowerh1
> >
> >   Found duplicate PV ll8eNZ0JRh9katV0eui4BcxSc6HBggSI: using
> > /dev/emcpowerm1 not /dev/sdh1
> >
> >   Found duplicate PV ptGubq8R16LxywZ458P7ebmdG3Fq2aJo: using /dev/sdn1
> > not /dev/emcpowerg1
> >
> >   Found duplicate PV PLQ3uON7pYe7nY16gRmAP94WBaEydRwf: using
> > /dev/emcpowern1 not /dev/sdg1
> >
> >   Found duplicate PV PsVYTeKNy6EcqWYbJwQ4KEbPp2Q8HjWv: using /dev/sdo1
> > not /dev/emcpowerf1
> >
> >   Found duplicate PV hvekbzDAltJ3t23QveOMz1axfhj9Mp2j: using
> > /dev/emcpowero1 not /dev/sdf1
> >
> >   Found duplicate PV 5OhUbKbZLW5bTc3tpJeU4YlH0dTttJHF: using /dev/sdp1
> > not /dev/emcpowere1
> >
> >   Found duplicate PV dFtPhq6pkwFdl41NTrAguAEFB3601CTb: using
> > /dev/emcpowerp1 not /dev/sde1
> >
> >   Error locking on node lock-node1: Internal lvm error, check syslog
> >
> >   Error locking on node lock-node2: Internal lvm error, check syslog
> >
> >   Error locking on node lock-node3: Internal lvm error, check syslog
> >
> >   Failed to activate new LV.
> >
> > File:
> > /var/log/messages:
> >
> > Oct 13 21:20:46 lock-node1 lvm[5478]: Volume group for uuid not found:
> > UzvBBmBj7m53APMbye1XXztWjdIavfgX8L5rGTOB3i3KGYPazw1AVaGCmWsXZpqR
> >
> >
> > Here is the cluster.conf
> >
> >[root at rac-node1 ~]# cat /etc/cluster/cluster.conf
> ><?xml version="1.0
> >"?>
> ><cluster alias="cluster" config_version="6" name="cluster">
> >        <fence_daemon post_fail_delay="0" post_join_delay="120"/>
> >        <clusternodes>
> >
> >                <clusternode name="lock-node1" votes="1" clvmd="0">
> >                        <fence>
> >                                <method name="1">
> >
> >                                        <device name="lock-node1-fence"/>
> >                                </method>
> >                        </fence>
> >                </clusternode>
> >
> >                <clusternode name="lock-node2" votes="1" clvmd="0">
> >                        <fence>
> >
> >                                <method name="1">
> >                                        <device name="lock-node2-fence"/>
> >                                </method>
> >                        </fence>
> >
> >                </clusternode>
> >                <clusternode name="lock-node3" votes="1" clvmd="0">
> >                        <fence>
> >                                <method name="1">
> >
> >                                        <device name="lock-node3-fence"/>
> >                                </method>
> >                        </fence>
> >                </clusternode>
> >
> >                <clusternode name="rac-node1" votes="1">
> >                        <fence>
> >                                <method name="1">
> >                                        <device name="rac-node1-fence"/>
> >
> >                                </method>
> >                        </fence>
> >                </clusternode>
> >                <clusternode name="rac-node2" votes="1">
> >
> >                        <fence>
> >                                <method name="1">
> >                                        <device name="rac-node2-fence"/>
> >                                </method>
> >
> >                        </fence>
> >                </clusternode>
> >                <clusternode name="rac-node3" votes="1">
> >                        <fence>
> >                                <method name="1">
> >
> >                                        <device name="rac-node3-fence"/>
> >                                </method>
> >                        </fence>
> >                </clusternode>
> >
> >        </clusternodes>
> >        <gulm>
> >                <lockserver name="lock-node1"/>
> >                <lockserver name="lock-node2"/>
> >                <lockserver name="lock-node3"/>
> >
> >        </gulm>
> >        <fencedevices>
> >                <fencedevice agent="fence_ipmilan" auth="none"
> >ipaddr="20.20.20.4 <http://20.20.20.4>" login="xxxx"
> name="lock-node1-fence" passwd="xxxx"/>
> >
> >                <fencedevice agent="fence_ipmilan" auth="none"
> >ipaddr="20.20.20.5 <http://20.20.20.5>" login="xxxx"
> name="lock-node2-fence" passwd="xxxx"/>
> >
> >                <fencedevice agent="fence_ipmilan" auth="none"
> >ipaddr="20.20.20.6 <http://20.20.20.6>" login="xxxx"
> name="lock-node3-fence" passwd="xxxx"/>
> >
> >                <fencedevice agent="fence_ipmilan" auth="none"
> >ipaddr="20.20.20.1 <http://20.20.20.1>" login="xxxx"
> name="rac-node1-fence" passwd="xxxx"/>
> >
> >                <fencedevice agent="fence_ipmilan" auth="none"
> >ipaddr="20.20.20.2 <http://20.20.20.2>" login="xxxx"
> name="rac-node2-fence" passwd="xxxx"/>
> >
> >                <fencedevice agent="fence_ipmilan" auth="none"
> >ipaddr="20.20.20.3 <http://20.20.20.3>" login="xxxx"
> name="rac-node3-fence" passwd="xxxx"/>
> >
> >        </fencedevices>
> >        <rm>
> >                <failoverdomains/>
> >                <resources/>
> >        </rm>
> ></cluster>
> >
> >
> > /etc/lvm/lvm.conf
> >
> > # By default we accept every block device:
> >
> >     # filter = [ "a/.*/" ]
> >
> >     filter = [ "r|/dev/sda|", "a/.*/" ]
> >
> >
> >
> > Regards,
> > Filipe Miranda
> >
> > On 10/13/06, *Filipe Miranda* <filipe.miranda at gmail.com
> > <mailto:filipe.miranda at gmail.com>> wrote:
> >
> >     Alasdair,
> >
> >     Do I need to reboot the machine to test this configurations
> >     changes? Or is there a way do testing it without rebooting the
> >     machine?
> >
> >     I will try to swap the order of the filters,
> >
> >     Thanks for the hint
> >
> >
> >     On 10/13/06, *Alasdair G Kergon* < agk at redhat.com
> >     <mailto:agk at redhat.com> > wrote:
> >
> >         On Fri, Oct 13, 2006 at 06:50:24PM -0300, Filipe Miranda wrote:
> >> filter = [ "a/.*/", "r|/dev/sda|" ] (forgot to use the close
> >         pipe)
> >
> >         Still won't work - swap the order of the two items; first
> matches
> >         everything so second isn't looked at.
> >         (man lvm.conf)
> >
> >         Alasdair
> >         --
> >         agk at redhat.com <mailto:agk at redhat.com>
> >
> >         --
> >         Linux-cluster mailing list
> >         Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
> >         https://www.redhat.com/mailman/listinfo/linux-cluster
> >         <https://www.redhat.com/mailman/listinfo/linux-cluster>
> >
> >
> >
> >
> >     --
> >     ---
> >     Filipe T Miranda
> >     Red Hat Certified Engineer
> >
> >
> >
> >
> > --
> > ---
> > Filipe T Miranda
> > Red Hat Certified Engineer
> >
> >
> >------------------------------------------------------------------------
> >
> >--
> >Linux-cluster mailing list
> >Linux-cluster at redhat.com
> >https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
---
Filipe T Miranda
Red Hat Certified Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061014/48ce0762/attachment.htm>

From jparsons at redhat.com  Sat Oct 14 17:15:54 2006
From: jparsons at redhat.com (Jim Parsons)
Date: Sat, 14 Oct 2006 13:15:54 -0400
Subject: [Linux-cluster] CLVM error 6 nodes
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>	<452F4019.8040803@redhat.com>	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>	<20061013215400.GK17654@agk.surrey.redhat.com>	<a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>	<a6d13c780610131739l2af78833t257aaf55c8a4e84b@mail.gmail.com>	<45310D70.2040407@redhat.com>
	<a6d13c780610141025h765fc8b1k69fef384f2c84ea2@mail.gmail.com>
Message-ID: <45311B4A.7070408@redhat.com>

Filipe Miranda wrote:

> You are right Jim,
>
> It's clvm=0, not clvmd=0!!
>
> I wish I could have a document that has all the possible tags, options 
> that I could use on cluster.conf...
>
> I Found one on the Cluster Projetc, but I think its not complete...

Filipe,
That doc went up less than a week ago - I would love to finsh the 
resource tag section (I have all of the data - it just needs formatting) 
but with beta2 looming, it must wait.

I would be DELIGHTED to accept your patches for the document, though! It 
is in the sources CVS under cluster/doc   : )

-Jim


From filipe.miranda at gmail.com  Sat Oct 14 18:01:40 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Sat, 14 Oct 2006 15:01:40 -0300
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <45311B4A.7070408@redhat.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<452F4019.8040803@redhat.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
	<20061013215400.GK17654@agk.surrey.redhat.com>
	<a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>
	<a6d13c780610131739l2af78833t257aaf55c8a4e84b@mail.gmail.com>
	<45310D70.2040407@redhat.com>
	<a6d13c780610141025h765fc8b1k69fef384f2c84ea2@mail.gmail.com>
	<45311B4A.7070408@redhat.com>
Message-ID: <a6d13c780610141101o6a34b2b6v9d31155e165485ea@mail.gmail.com>

Jim,

I'm glad that you have all the data to make the Cluster Projetc FAQ's
cluster.conf tags section complete.

I would love to have something to cooperate in this project, but I'm still a
newbie in this subject and
the Cluster Project FAQ helped me a LOT, and I use it as a reference...

Are you sure that here is no clvm=0 on the clusternode tag of the
cluster.conf file?
Or is there a way to setup the lock servers to not access the shared LUNs ?

I didn't quite understand why yhe lock servers need access to the shared
LUNs... can't the locks be managed through network?

Thank you,

Regards,
Filipe Miranda

On 10/14/06, Jim Parsons <jparsons at redhat.com> wrote:
>
> Filipe Miranda wrote:
>
> > You are right Jim,
> >
> > It's clvm=0, not clvmd=0!!
> >
> > I wish I could have a document that has all the possible tags, options
> > that I could use on cluster.conf...
> >
> > I Found one on the Cluster Projetc, but I think its not complete...
>
> Filipe,
> That doc went up less than a week ago - I would love to finsh the
> resource tag section (I have all of the data - it just needs formatting)
> but with beta2 looming, it must wait.
>
> I would be DELIGHTED to accept your patches for the document, though! It
> is in the sources CVS under cluster/doc   : )
>
> -Jim
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061014/357ee128/attachment.htm>

From filipe.miranda at gmail.com  Sun Oct 15 20:04:35 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Sun, 15 Oct 2006 18:04:35 -0200
Subject: [Linux-cluster] CLVM error 6 nodes
In-Reply-To: <a6d13c780610141101o6a34b2b6v9d31155e165485ea@mail.gmail.com>
References: <a6d13c780610121127r7733326aw1222ff2d4e147c80@mail.gmail.com>
	<a6d13c780610131445t6e81891es62f39b8f6cf38127@mail.gmail.com>
	<a6d13c780610131450v721ea2b0ud5b1fc9bb60e9a7c@mail.gmail.com>
	<20061013215400.GK17654@agk.surrey.redhat.com>
	<a6d13c780610131504l4beab348l4da1dc5e8c01fabe@mail.gmail.com>
	<a6d13c780610131739l2af78833t257aaf55c8a4e84b@mail.gmail.com>
	<45310D70.2040407@redhat.com>
	<a6d13c780610141025h765fc8b1k69fef384f2c84ea2@mail.gmail.com>
	<45311B4A.7070408@redhat.com>
	<a6d13c780610141101o6a34b2b6v9d31155e165485ea@mail.gmail.com>
Message-ID: <a6d13c780610151304n307376f6od695c27ce0a209f3@mail.gmail.com>

After a lot of reading, the reason the lock servers should access the shared
storage is because of CLVM.
CLVM should be running on all nodes and all nodes part of the quorum should
have access to the shared storage.
I'm not sure if it is possible to exclude the lock servers from accessing
the shared storage and letting them manage the locks through network as it
was done on GFS 6.0.

Regards,
Filipe Miranda

On 10/14/06, Filipe Miranda <filipe.miranda at gmail.com> wrote:
>
> Jim,
>
> I'm glad that you have all the data to make the Cluster Projetc FAQ's
> cluster.conf tags section complete.
>
> I would love to have something to cooperate in this project, but I'm still
> a newbie in this subject and
> the Cluster Project FAQ helped me a LOT, and I use it as a reference...
>
> Are you sure that here is no clvm=0 on the clusternode tag of the
> cluster.conf file?
> Or is there a way to setup the lock servers to not access the shared LUNs
> ?
>
> I didn't quite understand why yhe lock servers need access to the shared
> LUNs... can't the locks be managed through network?
>
> Thank you,
>
> Regards,
> Filipe Miranda
>
> On 10/14/06, Jim Parsons <jparsons at redhat.com> wrote:
> >
> > Filipe Miranda wrote:
> >
> > > You are right Jim,
> > >
> > > It's clvm=0, not clvmd=0!!
> > >
> > > I wish I could have a document that has all the possible tags, options
> > > that I could use on cluster.conf.. .
> > >
> > > I Found one on the Cluster Projetc, but I think its not complete...
> >
> > Filipe,
> > That doc went up less than a week ago - I would love to finsh the
> > resource tag section (I have all of the data - it just needs formatting)
> >
> > but with beta2 looming, it must wait.
> >
> > I would be DELIGHTED to accept your patches for the document, though! It
> > is in the sources CVS under cluster/doc   : )
> >
> > -Jim
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061015/2938b86b/attachment.htm>

From katriel at penguin-it.co.il  Mon Oct 16 13:12:56 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Mon, 16 Oct 2006 15:12:56 +0200
Subject: [Linux-cluster] qdisk and 2-node cluster
Message-ID: <45338558.5000303@penguin-it.co.il>

Hello list.

Is there any special considerations to take with a 2-node cluster and qdisk?
If my logic is right, using "two_node=1 expected_votes=1" and qdisk will
not solve a split brain. A failed node will still have 1 vote left, even
after qdisk "evicts" itself.
Should I use "two_node=1 expected_votes=2" ?

Thanks,
-- 
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953


From jwhiter at redhat.com  Mon Oct 16 13:19:37 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Mon, 16 Oct 2006 09:19:37 -0400
Subject: [Linux-cluster] qdisk and 2-node cluster
In-Reply-To: <45338558.5000303@penguin-it.co.il>
References: <45338558.5000303@penguin-it.co.il>
Message-ID: <20061016131937.GA10076@korben.rdu.redhat.com>

With two node clusters you want to remove the "two_node=1 expected_votes=1"
options.  Thank you,

Josef

On Mon, Oct 16, 2006 at 03:12:56PM +0200, Katriel Traum wrote:
> Hello list.
> 
> Is there any special considerations to take with a 2-node cluster and qdisk?
> If my logic is right, using "two_node=1 expected_votes=1" and qdisk will
> not solve a split brain. A failed node will still have 1 vote left, even
> after qdisk "evicts" itself.
> Should I use "two_node=1 expected_votes=2" ?
> 
> Thanks,
> -- 
> Katriel Traum, PenguinIT
> RHCE, CLP
> Mobile: 054-6789953
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From tmornini at engineyard.com  Mon Oct 16 15:24:12 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Mon, 16 Oct 2006 08:24:12 -0700
Subject: [Linux-cluster] CMAN: got WAIT barrier not in phase 1 TRANSITION.96
	(2)
References: <20061013194123.GB25294@proton.souja.net>
Message-ID: <B646F4D3-B6FF-4EF4-8967-633BC1B4D24F@engineyard.com>

Hello all.

We're getting problems when adding cluster nodes to our cluster.

Everything has been reasonable and stable until this point.

I posted these logs earlier with a different subject. I had
misunderstood the warning message and posted the wrong error
message in the subject.

Here's the syslog with a great deal of context:

Oct 13 04:08:53 ey00-s00017 kernel: VFS: Mounted root (reiserfs  
filesystem) read only.
Oct 13 04:08:53 ey00-s00017 kernel: Adding 262136k swap on /dev/ 
sda2.  Priority: -1 extents:1 across:262136k
Oct 13 04:09:04 ey00-s00017 kernel: CMAN 1.03.00 (built Sep  8 2006  
03:49:59) installed
Oct 13 04:09:04 ey00-s00017 kernel: NET: Registered protocol family 30
Oct 13 04:09:04 ey00-s00017 kernel: CMAN: Waiting to join or form a  
Linux-cluster
Oct 13 04:09:05 ey00-s00017 kernel: CMAN: sending membership request
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00025
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00019
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00030
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00024
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00010
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00016
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00004
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00011
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00005
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00009
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00002
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00015
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00014
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00008
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00003
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00006
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00012
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00013
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00007
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00001
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00000
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-04
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-05
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-03
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-00
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-01
Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-02
Oct 13 04:09:06 ey00-s00017 kernel: dlm: no version for  
"kcl_register_service" found: kernel tainted.
Oct 13 04:09:06 ey00-s00017 kernel: DLM 1.03.00 (built Sep  8 2006  
03:50:23) installed
Oct 13 04:09:57 ey00-s00017 kernel: CMAN: node ey00-s00018 rejoining
Oct 13 04:17:18 ey00-s00017 kernel: CMAN: got WAIT barrier not in  
phase 1 TRANSITION.96 (2)

Same thing on a different slice:

Oct 13 04:08:36 ey00-s00018 kernel: VFS: Mounted root (reiserfs  
filesystem) readonly.
Oct 13 04:08:36 ey00-s00018 kernel: Adding 262136k swap on /dev/ 
sda2.  Priority:-1 extents:1 across:262136k
Oct 13 04:08:48 ey00-s00018 kernel: CMAN 1.03.00 (built Sep  8 2006  
03:49:59) installed
Oct 13 04:08:48 ey00-s00018 kernel: NET: Registered protocol family 30
Oct 13 04:08:48 ey00-s00018 kernel: CMAN: Waiting to join or form a  
Linux-cluster
Oct 13 04:08:49 ey00-s00018 kernel: CMAN: sending membership request
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00025
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00019
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00030
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00024
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00010
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00016
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-02
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-01
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-00
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-03
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-05
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-04
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00000
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00001
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00007
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00013
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00012
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00006
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00003
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00008
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00014
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00015
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00002
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00009
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00005
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00011
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00017
Oct 13 04:08:50 ey00-s00018 kernel: CMAN: got node ey00-s00004
Oct 13 04:08:50 ey00-s00018 kernel: dlm: no version for  
"kcl_register_service" found: kernel tainted.
Oct 13 04:08:50 ey00-s00018 kernel: DLM 1.03.00 (built Sep  8 2006  
03:50:23) installed
Oct 13 04:16:10 ey00-s00018 kernel: CMAN: got WAIT barrier not in  
phase 1 TRANSITION.96 (2)

-- 
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


From pcaulfie at redhat.com  Mon Oct 16 15:34:52 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 16 Oct 2006 16:34:52 +0100
Subject: [Linux-cluster] CMAN: got WAIT barrier not in phase 1
	TRANSITION.96 (2)
In-Reply-To: <B646F4D3-B6FF-4EF4-8967-633BC1B4D24F@engineyard.com>
References: <20061013194123.GB25294@proton.souja.net>
	<B646F4D3-B6FF-4EF4-8967-633BC1B4D24F@engineyard.com>
Message-ID: <4533A69C.8060604@redhat.com>

Tom Mornini wrote:
> Hello all.
> 
> We're getting problems when adding cluster nodes to our cluster.
> 
> Everything has been reasonable and stable until this point.
> 
> I posted these logs earlier with a different subject. I had
> misunderstood the warning message and posted the wrong error
> message in the subject.
> 
> Here's the syslog with a great deal of context:
> 
> Oct 13 04:08:53 ey00-s00017 kernel: VFS: Mounted root (reiserfs
> filesystem) read only.
> Oct 13 04:08:53 ey00-s00017 kernel: Adding 262136k swap on /dev/sda2. 
> Priority: -1 extents:1 across:262136k
> Oct 13 04:09:04 ey00-s00017 kernel: CMAN 1.03.00 (built Sep  8 2006
> 03:49:59) installed
> Oct 13 04:09:04 ey00-s00017 kernel: NET: Registered protocol family 30
> Oct 13 04:09:04 ey00-s00017 kernel: CMAN: Waiting to join or form a
> Linux-cluster
> Oct 13 04:09:05 ey00-s00017 kernel: CMAN: sending membership request
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00025
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00019
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00030
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00024
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00010
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00016
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00004
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00011
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00005
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00009
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00002
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00015
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00014
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00008
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00003
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00006
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00012
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00013
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00007
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00001
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00000
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-04
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-05
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-03
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-00
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-01
> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-02
> Oct 13 04:09:06 ey00-s00017 kernel: dlm: no version for
> "kcl_register_service" found: kernel tainted.
> Oct 13 04:09:06 ey00-s00017 kernel: DLM 1.03.00 (built Sep  8 2006
> 03:50:23) installed
> Oct 13 04:09:57 ey00-s00017 kernel: CMAN: node ey00-s00018 rejoining
> Oct 13 04:17:18 ey00-s00017 kernel: CMAN: got WAIT barrier not in phase
> 1 TRANSITION.96 (2)

That message should be harmless. does it prevent the cluster reaching quorum ?

-- 

patrick


From sandra-llistes at fib.upc.edu  Mon Oct 16 15:36:10 2006
From: sandra-llistes at fib.upc.edu (sandra-llistes)
Date: Mon, 16 Oct 2006 17:36:10 +0200
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <452D7040.8090704@redhat.com>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
	<452B4F39.60906@fib.upc.edu> <452BFC6D.60902@redhat.com>
	<452D0ED6.9040605@fib.upc.edu> <452D7040.8090704@redhat.com>
Message-ID: <4533A6EA.6030503@fib.upc.edu>

Hi Abhi,

Your mail astonished me. The only difference between your environment 
and our is that you've RHEL4 and we've Fedora 5 with GFS.
I'm sorry about this but I have to insist. Are you completelly sure 
that your samba access was simoultaneous? because if you probe one 
client, and then another it isn't the same.
I found people that complain about the same problem we have:
http://www.redhat.com/archives/linux-cluster/2004-November/msg00065.html
http://www.centos.org/modules/newbb/viewtopic.php?topic_id=4997

Well, I had finally compiled GFS2 but I was uncapable to start cman 
daemons.
[root at server2 ~]#  /etc/init.d/cman start
Starting cluster:
    Loading modules... done
    Mounting configfs... done
    Starting ccsd... done
    Starting cman... failed
/usr/sbin/cman_tool: aisexec daemon didn't start

And I obtain this when I try: # strace /usr/sbin/cman_tool -t 120 -w join
....
connect(5, {sa_family=AF_FILE, path="/var/run/cman_admin"}, 110) = -1 
ENOENT (No such file or directory)
close(5)
write(2, "/usr/sbin/cman_tool: ", 21/usr/sbin/cman_tool: )   = 21
write(2, "aisexec daemon didn\'t start\n", 28aisexec daemon didn't start
) = 28
exit_group(1)

So I'm stalled by that way.

I'm downloading RHEL 30-days evaluating version and I'm also trying 
samba 3.0.23c compiling with cluster support.
I will tell you something as soon as I finish all test.
Regards,

Sandra

Abhijith Das wrote:
> Hi Sandra,
> I tried your test with 4 windows machines. 3 real machines and 1 
> simulated in vmware - all running windows xp home. Everything runs fine:
> smbstatus on the samba server shows me this:
> [root at niobe-04 ~]# smbstatus
> 
> Samba version 3.0.10-1.4E.2
> PID     Username      Group         Machine
> -------------------------------------------------------------------
>  775   testmonkey    testmonkeys   ccc-t3n2rexrla7 (10.15.80.203)
>  777   testmonkey    testmonkeys   migael       (10.15.80.222)
>  774   testmonkey    testmonkeys   bbb-34gtsedgprj (10.15.80.6)
>  776   testmonkey    testmonkeys   schumi       (10.15.80.209)
> 
> Service      pid     machine       Connected at
> -------------------------------------------------------
> public         774   bbb-34gtsedgprj  Wed Oct 11 15:03:10 2006
> public         776   schumi        Wed Oct 11 15:03:11 2006
> IPC$           777   migael        Wed Oct 11 15:43:58 2006
> public         777   migael        Wed Oct 11 15:03:25 2006
> public         775   ccc-t3n2rexrla7  Wed Oct 11 15:03:10 2006
> 
> Locked files:
> Pid    DenyMode   Access      R/W        Oplock           Name
> --------------------------------------------------------------
> 777    DENY_NONE  0x20089     RDONLY     NONE             
> /public/TruthHappens.ogg   Wed Oct 11 17:39:16 2006
> 775    DENY_NONE  0x20089     RDONLY     NONE             
> /public/TruthHappens.ogg   Wed Oct 11 17:39:16 2006
> 774    DENY_NONE  0x20089     RDONLY     NONE             
> /public/TruthHappens.ogg   Wed Oct 11 17:39:07 2006
> 776    DENY_NONE  0x20089     RDONLY     NONE             
> /public/TruthHappens.ogg   Wed Oct 11 17:39:06 2006
> 
> My smb.conf looks like this :
> [public]
>        comment         = ShareGFS
>        path            = /public
>        writeable       = No
>        read only       = Yes
>        write list      = @admsamba
>        force group     = root
>        create mask     = 0775
>        directory mask  = 0775
>        oplocks         = No
>        locking = Yes
>        strict locking = Yes
> 
> Also, since the share is readonly, there shouldn't be (m)any locks 
> involved, which makes your problem seem all the more odd. Let me know if 
> there's anything else I can try. Also, I'm curious about your test 
> results with gfs2 and gfs1 from the RHEL4 branch.
> 
> Regards,
> --Abhi


From adas at redhat.com  Mon Oct 16 16:56:34 2006
From: adas at redhat.com (Abhijith Das)
Date: Mon, 16 Oct 2006 11:56:34 -0500
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <4533A6EA.6030503@fib.upc.edu>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
	<452B4F39.60906@fib.upc.edu> <452BFC6D.60902@redhat.com>
	<452D0ED6.9040605@fib.upc.edu> <452D7040.8090704@redhat.com>
	<4533A6EA.6030503@fib.upc.edu>
Message-ID: <4533B9C2.8080906@redhat.com>

Hi Sandra,

> Hi Abhi,
>
> Your mail astonished me. The only difference between your environment 
> and our is that you've RHEL4 and we've Fedora 5 with GFS.

I'll get FC5 installed on a test cluster and try it out.

> I'm sorry about this but I have to insist. Are you completelly sure 
> that your samba access was simoultaneous? because if you probe one 
> client, and then another it isn't the same.
> I found people that complain about the same problem we have:
> http://www.redhat.com/archives/linux-cluster/2004-November/msg00065.html
> http://www.centos.org/modules/newbb/viewtopic.php?topic_id=4997

Yes, the access was simultaneous. If you have a specific test, (specific 
types of files or windows programs) I can try that too. I'm not denying 
that you're seeing problems with GFS+samba :-). If I can reproduce your 
problem, I can chase it down and we'll know why this is happening.

> Well, I had finally compiled GFS2 but I was uncapable to start cman 
> daemons.
> [root at server2 ~]#  /etc/init.d/cman start
> Starting cluster:
>    Loading modules... done
>    Mounting configfs... done
>    Starting ccsd... done
>    Starting cman... failed
> /usr/sbin/cman_tool: aisexec daemon didn't start
>
> And I obtain this when I try: # strace /usr/sbin/cman_tool -t 120 -w join
> ....
> connect(5, {sa_family=AF_FILE, path="/var/run/cman_admin"}, 110) = -1 
> ENOENT (No such file or directory)
> close(5)
> write(2, "/usr/sbin/cman_tool: ", 21/usr/sbin/cman_tool: )   = 21
> write(2, "aisexec daemon didn\'t start\n", 28aisexec daemon didn't start
> ) = 28
> exit_group(1)
>
> So I'm stalled by that way.

Looks like the openais component of the cluster is missing. The cluster 
architecture has changed quite a bit for GFS2. Please refer to this for 
help installing GFS2:
http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/doc/usage.txt?rev=1.35&content-type=text/x-cvsweb-markup&cvsroot=cluster

> I'm downloading RHEL 30-days evaluating version and I'm also trying 
> samba 3.0.23c compiling with cluster support.
> I will tell you something as soon as I finish all test.
> Regards,
>
> Sandra

Thanks,
--Abhi


From matthew at arts.usyd.edu.au  Tue Oct 17 07:08:36 2006
From: matthew at arts.usyd.edu.au (Matthew Geier)
Date: Tue, 17 Oct 2006 17:08:36 +1000
Subject: [Linux-cluster] Running MySQL under cluster-suite
Message-ID: <45348174.70600@arts.usyd.edu.au>


  I'm trying to set up an active/passive failover for MySQL and have a 
problem with getting cluster suite to start mysql. It just keeps telling 
me to 'look in the logs' for a problem that isn't there.

  The main issue seems to be that it decides that MySQL has failed and I 
can't 'disable' it to 'enable' it. The disable command fails 'cause 
MySQL isn't actually running anymore.


Oct 17 17:00:40 Aristotle1 clurgmgrd[7495]: <notice> Stopping service MySQL
Oct 17 17:00:40 Aristotle1 clurgmgrd: [7495]: <info> Executing 
/etc/init.d/mysqld stop
Oct 17 17:00:40 Aristotle1 mysqld: Stopping MySQL:  failed
Oct 17 17:00:40 Aristotle1 clurgmgrd[7495]: <notice> stop on script 
"MySQL" returned 1 (generic error)
Oct 17 17:00:40 Aristotle1 clurgmgrd: [7495]: <info> /dev/dm-9 is not 
mounted
Oct 17 17:00:45 Aristotle1 clurgmgrd[7495]: <crit> #12: RG MySQL failed 
to stop; intervention required
Oct 17 17:00:45 Aristotle1 clurgmgrd[7495]: <notice> Service MySQL is failed


  The only way I can get past this is manually mount the database file 
system, manually create the IP address and start MySQL manually - then 
tell the cluster manager to disable MySQL. Only even then it gets in a 
wierd state and one let me re-enable MySQL again - only this time it 
doesn't write anything useful in the logs.

  Any one have any pointers ?. I'm not trying to load balance or 
anything, just having a standby machine to take over if the first fails.


From fajar at telkom.net.id  Tue Oct 17 07:13:31 2006
From: fajar at telkom.net.id (Fajar A. Nugraha)
Date: Tue, 17 Oct 2006 14:13:31 +0700
Subject: [Linux-cluster] Running MySQL under cluster-suite
In-Reply-To: <45348174.70600@arts.usyd.edu.au>
References: <45348174.70600@arts.usyd.edu.au>
Message-ID: <4534829B.6040504@telkom.net.id>

Matthew Geier wrote:
>
>  I'm trying to set up an active/passive failover for MySQL and have a
> problem with getting cluster suite to start mysql. It just keeps
> telling me to 'look in the logs' for a problem that isn't there.
>
>  The main issue seems to be that it decides that MySQL has failed and
> I can't 'disable' it to 'enable' it. The disable command fails 'cause
> MySQL isn't actually running anymore.
I think you should modify mysql startup script so that if you stop it
when it's already dead, it would return 0 instead of 1.

Regards,

Fajar


From rpeterso at redhat.com  Tue Oct 17 14:28:52 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Tue, 17 Oct 2006 09:28:52 -0500
Subject: [Linux-cluster] Running MySQL under cluster-suite
In-Reply-To: <45348174.70600@arts.usyd.edu.au>
References: <45348174.70600@arts.usyd.edu.au>
Message-ID: <4534E8A4.3070706@redhat.com>

Matthew Geier wrote:
>
>  I'm trying to set up an active/passive failover for MySQL and have a 
> problem with getting cluster suite to start mysql. It just keeps 
> telling me to 'look in the logs' for a problem that isn't there.
>
>  The main issue seems to be that it decides that MySQL has failed and 
> I can't 'disable' it to 'enable' it. The disable command fails 'cause 
> MySQL isn't actually running anymore.
>
>
> Oct 17 17:00:40 Aristotle1 clurgmgrd[7495]: <notice> Stopping service 
> MySQL
> Oct 17 17:00:40 Aristotle1 clurgmgrd: [7495]: <info> Executing 
> /etc/init.d/mysqld stop
> Oct 17 17:00:40 Aristotle1 mysqld: Stopping MySQL:  failed
> Oct 17 17:00:40 Aristotle1 clurgmgrd[7495]: <notice> stop on script 
> "MySQL" returned 1 (generic error)
> Oct 17 17:00:40 Aristotle1 clurgmgrd: [7495]: <info> /dev/dm-9 is not 
> mounted
> Oct 17 17:00:45 Aristotle1 clurgmgrd[7495]: <crit> #12: RG MySQL 
> failed to stop; intervention required
> Oct 17 17:00:45 Aristotle1 clurgmgrd[7495]: <notice> Service MySQL is 
> failed
>
>
>  The only way I can get past this is manually mount the database file 
> system, manually create the IP address and start MySQL manually - then 
> tell the cluster manager to disable MySQL. Only even then it gets in a 
> wierd state and one let me re-enable MySQL again - only this time it 
> doesn't write anything useful in the logs.
>
>  Any one have any pointers ?. I'm not trying to load balance or 
> anything, just having a standby machine to take over if the first fails.
Hi Matthew,

See http://sources.redhat.com/cluster/faq.html#gfs_mysql and 
http://sources.redhat.com/cluster/faq.html#rgm_wontrestart

Regards,

Bob Peterson
Red Hat Cluster Suite


From jon.daniels at voxsurf.com  Tue Oct 17 14:50:00 2006
From: jon.daniels at voxsurf.com (Jonathan Daniels)
Date: Tue, 17 Oct 2006 15:50:00 +0100
Subject: [Linux-cluster] Problems with failover and services
In-Reply-To: <4534E8A4.3070706@redhat.com>
References: <45348174.70600@arts.usyd.edu.au> <4534E8A4.3070706@redhat.com>
Message-ID: <4534ED98.2000507@voxsurf.com>

Hi Linux Clusterers,

I have set up the following cluster environment:

2 x HP DL385, with RedHat EL4 Update 3. These are the clustered nodes
RedHat Cluster Suite 4 on each node
Apache 2.2.2 on each node
A dummy daemon on each node

Initial problem:

RHEL4 U3 kernel version 2.6.9-34
CMan kernel/headers 2.6.9-43.8

I created a simple 2 node cluster running Apache httpd server. When it 
started up as normal the virtual IP was in place and the apache daemon 
was running on the 'owning' server. However whenever I failed over (by 
shutting down network services), the floating IP doesn't get assigned to 
the standby server, and the apache daemon never starts on that standby 
server.

I was also having deadlocks between CMan and RGManager and found that 
this was due to a known and fixed bug in RHEL4U3 and Cman so I upgraded 
them to the following:

RHEL4 U3 kernel version 2.6.9-34.0.1
CMan kernel/headers 2.6.9-43.8.3

Now I start up the "system-cluster-config" and see no services at all. I 
also removed GFS but I have known RHCS to run without GFS, and in any 
case the two apache servers and dummy daemons do not share storage - I 
simply want to perform the failover initially.

Anyone have any workarounds?

Many thanks,
Jon


From rpeterso at redhat.com  Tue Oct 17 16:41:48 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Tue, 17 Oct 2006 11:41:48 -0500
Subject: [Linux-cluster] Problems with failover and services
In-Reply-To: <4534ED98.2000507@voxsurf.com>
References: <45348174.70600@arts.usyd.edu.au> <4534E8A4.3070706@redhat.com>
	<4534ED98.2000507@voxsurf.com>
Message-ID: <453507CC.6060907@redhat.com>

Jonathan Daniels wrote:
> Hi Linux Clusterers,
>
> I have set up the following cluster environment:
>
> 2 x HP DL385, with RedHat EL4 Update 3. These are the clustered nodes
> RedHat Cluster Suite 4 on each node
> Apache 2.2.2 on each node
> A dummy daemon on each node
>
> Initial problem:
>
> RHEL4 U3 kernel version 2.6.9-34
> CMan kernel/headers 2.6.9-43.8
>
> I created a simple 2 node cluster running Apache httpd server. When it 
> started up as normal the virtual IP was in place and the apache daemon 
> was running on the 'owning' server. However whenever I failed over (by 
> shutting down network services), the floating IP doesn't get assigned 
> to the standby server, and the apache daemon never starts on that 
> standby server.
>
> I was also having deadlocks between CMan and RGManager and found that 
> this was due to a known and fixed bug in RHEL4U3 and Cman so I 
> upgraded them to the following:
>
> RHEL4 U3 kernel version 2.6.9-34.0.1
> CMan kernel/headers 2.6.9-43.8.3
>
> Now I start up the "system-cluster-config" and see no services at all. 
> I also removed GFS but I have known RHCS to run without GFS, and in 
> any case the two apache servers and dummy daemons do not share storage 
> - I simply want to perform the failover initially.
>
> Anyone have any workarounds?
>
> Many thanks,
> Jon
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
Hi Jon,

You may also be another victim of the init-scripts-not-returning-zero thing.
See: http://sources.redhat.com/cluster/faq.html#rgm_wontrestart

Regards,

Bob Peterson
Red Hat Cluster Suite


From jon.daniels at voxsurf.com  Tue Oct 17 16:54:45 2006
From: jon.daniels at voxsurf.com (Jonathan Daniels)
Date: Tue, 17 Oct 2006 17:54:45 +0100
Subject: [Linux-cluster] Problems with failover and services
In-Reply-To: <453507CC.6060907@redhat.com>
References: <45348174.70600@arts.usyd.edu.au> <4534E8A4.3070706@redhat.com>
	<4534ED98.2000507@voxsurf.com> <453507CC.6060907@redhat.com>
Message-ID: <45350AD5.6070401@voxsurf.com>

Hi Robert,

Thanks for this. However, the init scripts run OK from the command line, 
but the rgmanager never starts them, i.e. it never executes 
/etc/init.d/httpd start (for example), and they were never visible in 
the "Cluster Management" tab.

I think the root of the problem here was some mismatched versioning. I 
had kernel 2.6.9-34.0.1, cman-kernel-smp-2.6.9-43.8.3.i686, 
cman-kernheaders-2.6.9-43.8.3.i686 and rgmanager-1.9.54-1.i386. Rolling 
back to kernel version 2.6.9-43, cman version 2.6.9-43.8, while keeping 
rgmanager-1.9.54-1.i386 seems to yield some better results in terms of 
the ability to view and manage sercvices. I'm still running tests on the 
failover so I will post updates as I get them :)

The RG manager lockup problem seems to resolve itself when I send about 
10 - 15 "kill -9"s to the rgmanager pid.

Thanks,
Jon


Robert Peterson wrote:

> Jonathan Daniels wrote:
>
>> Hi Linux Clusterers,
>>
>> I have set up the following cluster environment:
>>
>> 2 x HP DL385, with RedHat EL4 Update 3. These are the clustered nodes
>> RedHat Cluster Suite 4 on each node
>> Apache 2.2.2 on each node
>> A dummy daemon on each node
>>
>> Initial problem:
>>
>> RHEL4 U3 kernel version 2.6.9-34
>> CMan kernel/headers 2.6.9-43.8
>>
>> I created a simple 2 node cluster running Apache httpd server. When 
>> it started up as normal the virtual IP was in place and the apache 
>> daemon was running on the 'owning' server. However whenever I failed 
>> over (by shutting down network services), the floating IP doesn't get 
>> assigned to the standby server, and the apache daemon never starts on 
>> that standby server.
>>
>> I was also having deadlocks between CMan and RGManager and found that 
>> this was due to a known and fixed bug in RHEL4U3 and Cman so I 
>> upgraded them to the following:
>>
>> RHEL4 U3 kernel version 2.6.9-34.0.1
>> CMan kernel/headers 2.6.9-43.8.3
>>
>> Now I start up the "system-cluster-config" and see no services at 
>> all. I also removed GFS but I have known RHCS to run without GFS, and 
>> in any case the two apache servers and dummy daemons do not share 
>> storage - I simply want to perform the failover initially.
>>
>> Anyone have any workarounds?
>>
>> Many thanks,
>> Jon
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
> Hi Jon,
>
> You may also be another victim of the init-scripts-not-returning-zero 
> thing.
> See: http://sources.redhat.com/cluster/faq.html#rgm_wontrestart
>
> Regards,
>
> Bob Peterson
> Red Hat Cluster Suite
>


From katriel at penguin-it.co.il  Tue Oct 17 18:11:42 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Tue, 17 Oct 2006 20:11:42 +0200
Subject: [Linux-cluster] cman & qdiskd
Message-ID: <45351CDE.3090502@penguin-it.co.il>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hello.

I've seen this subject on the list, but no real solutions.
I'm using Cluster 4 update 4, with qdiskd and a shared disk.
I've understood from the documentation and list that a "cman_tool
status" should reflect the number of votes the quorum daemon holds.

My setup is pretty straight forward, 2-node cluster, shared storage (AoE
for testing).
qdiskd configuration:
<quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
d/e0.0" status_file="/tmp/qdisk-status">
                <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
interval="2"/>
                <heuristic program="ping 192.168.22.60 -c1 -t1"
score="1" interval="2"/>
                <heuristic program="ping 192.168.22.100 -c1 -t1"
score="1" interval="2"/>
        </quorumd>

cman_tool status shows:
[root at n1 ~]# cman_tool status
Protocol version: 5.0.1
Config version: 8
Cluster name: alpha_cluster
Cluster ID: 50356
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 2
Expected_votes: 2
Total_votes: 2
Quorum: 2
Active subsystems: 4
Node name: n1
Node addresses: 192.168.22.201

qdiskd is running, scoring a perfect 3 out of 3, but no votes...
When disconnecting one of the nodes, the other will loose quorum. Am I
missing something?

Any insight appreciated.
- --
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFFNRzZDWy+Hv/461sRApxVAJoCk07nzlRCsMbXk/+3rZcXUg+CLQCg0a24
387gAU9u4ThcOLsrnFi1YkU=
=GaOt
-----END PGP SIGNATURE-----


From jwhiter at redhat.com  Tue Oct 17 18:25:11 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Tue, 17 Oct 2006 14:25:11 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <45351CDE.3090502@penguin-it.co.il>
References: <45351CDE.3090502@penguin-it.co.il>
Message-ID: <20061017182510.GD10076@korben.rdu.redhat.com>

What does you're cluster.conf look like?  What about /proc/cluster/nodes?  Are
you sure qdiskd is starting?  Your quorum stuff looks fine.  Do both nodes see
/dev/etherd/e0.0 as the same disk?

Josef

On Tue, Oct 17, 2006 at 08:11:42PM +0200, Katriel Traum wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Hello.
> 
> I've seen this subject on the list, but no real solutions.
> I'm using Cluster 4 update 4, with qdiskd and a shared disk.
> I've understood from the documentation and list that a "cman_tool
> status" should reflect the number of votes the quorum daemon holds.
> 
> My setup is pretty straight forward, 2-node cluster, shared storage (AoE
> for testing).
> qdiskd configuration:
> <quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
> d/e0.0" status_file="/tmp/qdisk-status">
>                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
> interval="2"/>
>                 <heuristic program="ping 192.168.22.60 -c1 -t1"
> score="1" interval="2"/>
>                 <heuristic program="ping 192.168.22.100 -c1 -t1"
> score="1" interval="2"/>
>         </quorumd>
> 
> cman_tool status shows:
> [root at n1 ~]# cman_tool status
> Protocol version: 5.0.1
> Config version: 8
> Cluster name: alpha_cluster
> Cluster ID: 50356
> Cluster Member: Yes
> Membership state: Cluster-Member
> Nodes: 2
> Expected_votes: 2
> Total_votes: 2
> Quorum: 2
> Active subsystems: 4
> Node name: n1
> Node addresses: 192.168.22.201
> 
> qdiskd is running, scoring a perfect 3 out of 3, but no votes...
> When disconnecting one of the nodes, the other will loose quorum. Am I
> missing something?
> 
> Any insight appreciated.
> - --
> Katriel Traum, PenguinIT
> RHCE, CLP
> Mobile: 054-6789953
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.5 (GNU/Linux)
> Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
> 
> iD8DBQFFNRzZDWy+Hv/461sRApxVAJoCk07nzlRCsMbXk/+3rZcXUg+CLQCg0a24
> 387gAU9u4ThcOLsrnFi1YkU=
> =GaOt
> -----END PGP SIGNATURE-----
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From sgreene at kentz-ojs.com  Tue Oct 17 17:40:12 2006
From: sgreene at kentz-ojs.com (Sherwyn Greene)
Date: Tue, 17 Oct 2006 13:40:12 -0400
Subject: [Linux-cluster] Cluster Help
Message-ID: <200610171840.k9HIeTkl018487@mx3.redhat.com>

Hi All,
 
    Need some information on getting the right storage array to buy for my
cluster configuration can anyone help me with this
 
my Setup so far
 
2 Athlon 2800 Sempron System
1 Wti Network power switch
1 24 Port 10/100 Cat5 switch
2 Adaptec 39160 scsi host adapter pci cards installed in the systems.
 
Yes, I would like to have a shared storage setup, the question is, can I do
it from the scsi cards I have & if so what other equipment I need to have
this setup.

 
Thanks
 
Sherwyn Greene
Planner / I.T. Technician
Project Controls Dept.
Kentz-OJ's E&I Services J.V.
+1 (868) 648-0876
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061017/55927a17/attachment.htm>

From katriel at penguin-it.co.il  Tue Oct 17 18:42:59 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Tue, 17 Oct 2006 20:42:59 +0200
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <20061017182510.GD10076@korben.rdu.redhat.com>
References: <45351CDE.3090502@penguin-it.co.il>
	<20061017182510.GD10076@korben.rdu.redhat.com>
Message-ID: <45352433.3080603@penguin-it.co.il>

qdisk is running:
[root at n1 ~]# service qdiskd status
qdiskd (pid 1199) is running...
[root at n2 ~]# service qdiskd status
qdiskd (pid 873) is running...

/tmp/qdisk-status:
[root at n1 ~]# cat /tmp/qdisk-status
Node ID: 1
Score (current / min req. / max allowed): 3 / 2 / 3
Current state: Master
Current disk state: None
Visible Set: { 1 2 }
Master Node ID: 1
Quorate Set: { 1 2 }


Both nodes see /dev/etherd/e0.0 and can access it (tcpdump shows both
accessing it for timestamps I suppose)
/proc/cluster/nodes shows the same as "cman_tool nodes":
[root at n1 ~]# cman_tool nodes
Node  Votes Exp Sts  Name
   1    1    2   M   n1
   2    1    2   M   n2

Everything looks OK, it's just not working.

cluster.conf:
<?xml version="1.0"?>
<cluster config_version="9" name="alpha_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="n1" votes="1" nodeid="1">
                        <fence>
                                <method name="1">
                                        <device name="man_fence"
nodename="n1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="n2" votes="1" nodeid="2">
                        <fence>
                                <method name="1">
                                        <device name="man_fence"
nodename="n2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman/>
        <fencedevices>
                <fencedevice agent="fence_manual" name="man_fence"/>
        </fencedevices>
        <rm log_level="7">
                <failoverdomains/>
                <resources>
                        <ip address="192.168.22.250" monitor_link="1"/>
                        <script file="/etc/init.d/httpd" name="httpd"/>
                </resources>
                <service autostart="1" name="apache" recovery="relocate">
                        <ip ref="192.168.22.250"/>
                        <script ref="httpd"/>
                </service>
        </rm>
        <quorumd interval="1" tko="5" votes="3" log_level="7"
device="/dev/etherd/e0.0" status_file="/tmp/qdisk-status">
                <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
interval="2"/>
                <heuristic program="ping 192.168.22.60 -c1 -t1"
score="1" interval="2"/>
                <heuristic program="ping 192.168.22.100 -c1 -t1"
score="1" interval="2"/>
        </quorumd>
</cluster>

Katriel

Josef Whiter wrote:
> What does you're cluster.conf look like?  What about /proc/cluster/nodes?  Are
> you sure qdiskd is starting?  Your quorum stuff looks fine.  Do both nodes see
> /dev/etherd/e0.0 as the same disk?
> 
> Josef
> 
> On Tue, Oct 17, 2006 at 08:11:42PM +0200, Katriel Traum wrote:
> Hello.
> 
> I've seen this subject on the list, but no real solutions.
> I'm using Cluster 4 update 4, with qdiskd and a shared disk.
> I've understood from the documentation and list that a "cman_tool
> status" should reflect the number of votes the quorum daemon holds.
> 
> My setup is pretty straight forward, 2-node cluster, shared storage (AoE
> for testing).
> qdiskd configuration:
> <quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
> d/e0.0" status_file="/tmp/qdisk-status">
>                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
> interval="2"/>
>                 <heuristic program="ping 192.168.22.60 -c1 -t1"
> score="1" interval="2"/>
>                 <heuristic program="ping 192.168.22.100 -c1 -t1"
> score="1" interval="2"/>
>         </quorumd>
> 
> cman_tool status shows:
> [root at n1 ~]# cman_tool status
> Protocol version: 5.0.1
> Config version: 8
> Cluster name: alpha_cluster
> Cluster ID: 50356
> Cluster Member: Yes
> Membership state: Cluster-Member
> Nodes: 2
> Expected_votes: 2
> Total_votes: 2
> Quorum: 2
> Active subsystems: 4
> Node name: n1
> Node addresses: 192.168.22.201
> 
> qdiskd is running, scoring a perfect 3 out of 3, but no votes...
> When disconnecting one of the nodes, the other will loose quorum. Am I
> missing something?
> 
> Any insight appreciated.
>>
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953


From jwhiter at redhat.com  Tue Oct 17 18:49:34 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Tue, 17 Oct 2006 14:49:34 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <45352433.3080603@penguin-it.co.il>
References: <45351CDE.3090502@penguin-it.co.il>
	<20061017182510.GD10076@korben.rdu.redhat.com>
	<45352433.3080603@penguin-it.co.il>
Message-ID: <20061017184932.GE10076@korben.rdu.redhat.com>

Hmm well thats odd.  Stop qdiskd on both nodes and then start it on one and
watch /var/log/messages and see what it spits out.  If it doesn't spit out any
obvious errors do the same on the next node.  If it still doesnt work copy the
relevant parts of the logs into something like http://pastebin.com and post the
url so i can take a look.

Josef

On Tue, Oct 17, 2006 at 08:42:59PM +0200, Katriel Traum wrote:
> qdisk is running:
> [root at n1 ~]# service qdiskd status
> qdiskd (pid 1199) is running...
> [root at n2 ~]# service qdiskd status
> qdiskd (pid 873) is running...
> 
> /tmp/qdisk-status:
> [root at n1 ~]# cat /tmp/qdisk-status
> Node ID: 1
> Score (current / min req. / max allowed): 3 / 2 / 3
> Current state: Master
> Current disk state: None
> Visible Set: { 1 2 }
> Master Node ID: 1
> Quorate Set: { 1 2 }
> 
> 
> Both nodes see /dev/etherd/e0.0 and can access it (tcpdump shows both
> accessing it for timestamps I suppose)
> /proc/cluster/nodes shows the same as "cman_tool nodes":
> [root at n1 ~]# cman_tool nodes
> Node  Votes Exp Sts  Name
>    1    1    2   M   n1
>    2    1    2   M   n2
> 
> Everything looks OK, it's just not working.
> 
> cluster.conf:
> <?xml version="1.0"?>
> <cluster config_version="9" name="alpha_cluster">
>         <fence_daemon post_fail_delay="0" post_join_delay="3"/>
>         <clusternodes>
>                 <clusternode name="n1" votes="1" nodeid="1">
>                         <fence>
>                                 <method name="1">
>                                         <device name="man_fence"
> nodename="n1"/>
>                                 </method>
>                         </fence>
>                 </clusternode>
>                 <clusternode name="n2" votes="1" nodeid="2">
>                         <fence>
>                                 <method name="1">
>                                         <device name="man_fence"
> nodename="n2"/>
>                                 </method>
>                         </fence>
>                 </clusternode>
>         </clusternodes>
>         <cman/>
>         <fencedevices>
>                 <fencedevice agent="fence_manual" name="man_fence"/>
>         </fencedevices>
>         <rm log_level="7">
>                 <failoverdomains/>
>                 <resources>
>                         <ip address="192.168.22.250" monitor_link="1"/>
>                         <script file="/etc/init.d/httpd" name="httpd"/>
>                 </resources>
>                 <service autostart="1" name="apache" recovery="relocate">
>                         <ip ref="192.168.22.250"/>
>                         <script ref="httpd"/>
>                 </service>
>         </rm>
>         <quorumd interval="1" tko="5" votes="3" log_level="7"
> device="/dev/etherd/e0.0" status_file="/tmp/qdisk-status">
>                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
> interval="2"/>
>                 <heuristic program="ping 192.168.22.60 -c1 -t1"
> score="1" interval="2"/>
>                 <heuristic program="ping 192.168.22.100 -c1 -t1"
> score="1" interval="2"/>
>         </quorumd>
> </cluster>
> 
> Katriel
> 
> Josef Whiter wrote:
> > What does you're cluster.conf look like?  What about /proc/cluster/nodes?  Are
> > you sure qdiskd is starting?  Your quorum stuff looks fine.  Do both nodes see
> > /dev/etherd/e0.0 as the same disk?
> > 
> > Josef
> > 
> > On Tue, Oct 17, 2006 at 08:11:42PM +0200, Katriel Traum wrote:
> > Hello.
> > 
> > I've seen this subject on the list, but no real solutions.
> > I'm using Cluster 4 update 4, with qdiskd and a shared disk.
> > I've understood from the documentation and list that a "cman_tool
> > status" should reflect the number of votes the quorum daemon holds.
> > 
> > My setup is pretty straight forward, 2-node cluster, shared storage (AoE
> > for testing).
> > qdiskd configuration:
> > <quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
> > d/e0.0" status_file="/tmp/qdisk-status">
> >                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
> > interval="2"/>
> >                 <heuristic program="ping 192.168.22.60 -c1 -t1"
> > score="1" interval="2"/>
> >                 <heuristic program="ping 192.168.22.100 -c1 -t1"
> > score="1" interval="2"/>
> >         </quorumd>
> > 
> > cman_tool status shows:
> > [root at n1 ~]# cman_tool status
> > Protocol version: 5.0.1
> > Config version: 8
> > Cluster name: alpha_cluster
> > Cluster ID: 50356
> > Cluster Member: Yes
> > Membership state: Cluster-Member
> > Nodes: 2
> > Expected_votes: 2
> > Total_votes: 2
> > Quorum: 2
> > Active subsystems: 4
> > Node name: n1
> > Node addresses: 192.168.22.201
> > 
> > qdiskd is running, scoring a perfect 3 out of 3, but no votes...
> > When disconnecting one of the nodes, the other will loose quorum. Am I
> > missing something?
> > 
> > Any insight appreciated.
> >>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> -- 
> Katriel Traum, PenguinIT
> RHCE, CLP
> Mobile: 054-6789953
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From sgreene at kentz-ojs.com  Tue Oct 17 18:18:12 2006
From: sgreene at kentz-ojs.com (Sherwyn Greene)
Date: Tue, 17 Oct 2006 14:18:12 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <45351CDE.3090502@penguin-it.co.il>
Message-ID: <200610171918.k9HJISE6018571@mx1.redhat.com>

Hi, I may not be able to help you with your problem, but maybe you can help
me with one.

1 What type of shared storage you have setup on your cluster?

I'm tring to setup a cluster with 2 nodes and one shared storage for web
pages & ftp 
Can you help me?

Thanks 


Sherwyn Greene
Planner / I.T. Technician
Project Controls Dept.
Kentz-OJ's E&I Services J.V.
+1 (868) 648-0876

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Katriel Traum
Sent: Tuesday, October 17, 2006 2:12 PM
To: linux clustering
Subject: [Linux-cluster] cman & qdiskd

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hello.

I've seen this subject on the list, but no real solutions.
I'm using Cluster 4 update 4, with qdiskd and a shared disk.
I've understood from the documentation and list that a "cman_tool status"
should reflect the number of votes the quorum daemon holds.

My setup is pretty straight forward, 2-node cluster, shared storage (AoE for
testing).
qdiskd configuration:
<quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
d/e0.0" status_file="/tmp/qdisk-status">
                <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
interval="2"/>
                <heuristic program="ping 192.168.22.60 -c1 -t1"
score="1" interval="2"/>
                <heuristic program="ping 192.168.22.100 -c1 -t1"
score="1" interval="2"/>
        </quorumd>

cman_tool status shows:
[root at n1 ~]# cman_tool status
Protocol version: 5.0.1
Config version: 8
Cluster name: alpha_cluster
Cluster ID: 50356
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 2
Expected_votes: 2
Total_votes: 2
Quorum: 2
Active subsystems: 4
Node name: n1
Node addresses: 192.168.22.201

qdiskd is running, scoring a perfect 3 out of 3, but no votes...
When disconnecting one of the nodes, the other will loose quorum. Am I
missing something?

Any insight appreciated.
- --
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFFNRzZDWy+Hv/461sRApxVAJoCk07nzlRCsMbXk/+3rZcXUg+CLQCg0a24
387gAU9u4ThcOLsrnFi1YkU=
=GaOt
-----END PGP SIGNATURE-----

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From katriel at penguin-it.co.il  Tue Oct 17 20:43:37 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Tue, 17 Oct 2006 22:43:37 +0200
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <20061017184932.GE10076@korben.rdu.redhat.com>
References: <45351CDE.3090502@penguin-it.co.il>	<20061017182510.GD10076@korben.rdu.redhat.com>	<45352433.3080603@penguin-it.co.il>
	<20061017184932.GE10076@korben.rdu.redhat.com>
Message-ID: <45354079.8080709@penguin-it.co.il>

Okay. My bad.
Using older cman/dlm kernel modules with newer cman/qdisk binaries
works, but I guess the older kernel module doesn't know how to accept a
"quorum  node".
After using the same version all around, it works.

Thanks for the help
Katriel

Josef Whiter wrote:
> Hmm well thats odd.  Stop qdiskd on both nodes and then start it on one and
> watch /var/log/messages and see what it spits out.  If it doesn't spit out any
> obvious errors do the same on the next node.  If it still doesnt work copy the
> relevant parts of the logs into something like http://pastebin.com and post the
> url so i can take a look.
> 
> Josef
> 
> On Tue, Oct 17, 2006 at 08:42:59PM +0200, Katriel Traum wrote:
>> qdisk is running:
>> [root at n1 ~]# service qdiskd status
>> qdiskd (pid 1199) is running...
>> [root at n2 ~]# service qdiskd status
>> qdiskd (pid 873) is running...
>>
>> /tmp/qdisk-status:
>> [root at n1 ~]# cat /tmp/qdisk-status
>> Node ID: 1
>> Score (current / min req. / max allowed): 3 / 2 / 3
>> Current state: Master
>> Current disk state: None
>> Visible Set: { 1 2 }
>> Master Node ID: 1
>> Quorate Set: { 1 2 }
>>
>>
>> Both nodes see /dev/etherd/e0.0 and can access it (tcpdump shows both
>> accessing it for timestamps I suppose)
>> /proc/cluster/nodes shows the same as "cman_tool nodes":
>> [root at n1 ~]# cman_tool nodes
>> Node  Votes Exp Sts  Name
>>    1    1    2   M   n1
>>    2    1    2   M   n2
>>
>> Everything looks OK, it's just not working.
>>
>> cluster.conf:
>> <?xml version="1.0"?>
>> <cluster config_version="9" name="alpha_cluster">
>>         <fence_daemon post_fail_delay="0" post_join_delay="3"/>
>>         <clusternodes>
>>                 <clusternode name="n1" votes="1" nodeid="1">
>>                         <fence>
>>                                 <method name="1">
>>                                         <device name="man_fence"
>> nodename="n1"/>
>>                                 </method>
>>                         </fence>
>>                 </clusternode>
>>                 <clusternode name="n2" votes="1" nodeid="2">
>>                         <fence>
>>                                 <method name="1">
>>                                         <device name="man_fence"
>> nodename="n2"/>
>>                                 </method>
>>                         </fence>
>>                 </clusternode>
>>         </clusternodes>
>>         <cman/>
>>         <fencedevices>
>>                 <fencedevice agent="fence_manual" name="man_fence"/>
>>         </fencedevices>
>>         <rm log_level="7">
>>                 <failoverdomains/>
>>                 <resources>
>>                         <ip address="192.168.22.250" monitor_link="1"/>
>>                         <script file="/etc/init.d/httpd" name="httpd"/>
>>                 </resources>
>>                 <service autostart="1" name="apache" recovery="relocate">
>>                         <ip ref="192.168.22.250"/>
>>                         <script ref="httpd"/>
>>                 </service>
>>         </rm>
>>         <quorumd interval="1" tko="5" votes="3" log_level="7"
>> device="/dev/etherd/e0.0" status_file="/tmp/qdisk-status">
>>                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
>> interval="2"/>
>>                 <heuristic program="ping 192.168.22.60 -c1 -t1"
>> score="1" interval="2"/>
>>                 <heuristic program="ping 192.168.22.100 -c1 -t1"
>> score="1" interval="2"/>
>>         </quorumd>
>> </cluster>
>>
>> Katriel
>>
>> Josef Whiter wrote:
>>> What does you're cluster.conf look like?  What about /proc/cluster/nodes?  Are
>>> you sure qdiskd is starting?  Your quorum stuff looks fine.  Do both nodes see
>>> /dev/etherd/e0.0 as the same disk?
>>>
>>> Josef
>>>
>>> On Tue, Oct 17, 2006 at 08:11:42PM +0200, Katriel Traum wrote:
>>> Hello.
>>>
>>> I've seen this subject on the list, but no real solutions.
>>> I'm using Cluster 4 update 4, with qdiskd and a shared disk.
>>> I've understood from the documentation and list that a "cman_tool
>>> status" should reflect the number of votes the quorum daemon holds.
>>>
>>> My setup is pretty straight forward, 2-node cluster, shared storage (AoE
>>> for testing).
>>> qdiskd configuration:
>>> <quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
>>> d/e0.0" status_file="/tmp/qdisk-status">
>>>                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
>>> interval="2"/>
>>>                 <heuristic program="ping 192.168.22.60 -c1 -t1"
>>> score="1" interval="2"/>
>>>                 <heuristic program="ping 192.168.22.100 -c1 -t1"
>>> score="1" interval="2"/>
>>>         </quorumd>
>>>
>>> cman_tool status shows:
>>> [root at n1 ~]# cman_tool status
>>> Protocol version: 5.0.1
>>> Config version: 8
>>> Cluster name: alpha_cluster
>>> Cluster ID: 50356
>>> Cluster Member: Yes
>>> Membership state: Cluster-Member
>>> Nodes: 2
>>> Expected_votes: 2
>>> Total_votes: 2
>>> Quorum: 2
>>> Active subsystems: 4
>>> Node name: n1
>>> Node addresses: 192.168.22.201
>>>
>>> qdiskd is running, scoring a perfect 3 out of 3, but no votes...
>>> When disconnecting one of the nodes, the other will loose quorum. Am I
>>> missing something?
>>>
>>> Any insight appreciated.
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> -- 
>> Katriel Traum, PenguinIT
>> RHCE, CLP
>> Mobile: 054-6789953
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953


From katriel at penguin-it.co.il  Tue Oct 17 20:46:04 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Tue, 17 Oct 2006 22:46:04 +0200
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <200610171918.k9HJISE6018571@mx1.redhat.com>
References: <200610171918.k9HJISE6018571@mx1.redhat.com>
Message-ID: <4535410C.509@penguin-it.co.il>

Hi.
This is just a test system, so I've been using AoE's "vblade".
vblade is an AoE device emulated with software. I've only been using a
100M loop device for a quorum disk.

+Katriel

Sherwyn Greene wrote:
> Hi, I may not be able to help you with your problem, but maybe you can help
> me with one.
> 
> 1 What type of shared storage you have setup on your cluster?
> 
> I'm tring to setup a cluster with 2 nodes and one shared storage for web
> pages & ftp 
> Can you help me?
> 
> Thanks 
> 
> 
> Sherwyn Greene
> Planner / I.T. Technician
> Project Controls Dept.
> Kentz-OJ's E&I Services J.V.
> +1 (868) 648-0876
> 
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Katriel Traum
> Sent: Tuesday, October 17, 2006 2:12 PM
> To: linux clustering
> Subject: [Linux-cluster] cman & qdiskd
> 
> Hello.
> 
> I've seen this subject on the list, but no real solutions.
> I'm using Cluster 4 update 4, with qdiskd and a shared disk.
> I've understood from the documentation and list that a "cman_tool status"
> should reflect the number of votes the quorum daemon holds.
> 
> My setup is pretty straight forward, 2-node cluster, shared storage (AoE for
> testing).
> qdiskd configuration:
> <quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
> d/e0.0" status_file="/tmp/qdisk-status">
>                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
> interval="2"/>
>                 <heuristic program="ping 192.168.22.60 -c1 -t1"
> score="1" interval="2"/>
>                 <heuristic program="ping 192.168.22.100 -c1 -t1"
> score="1" interval="2"/>
>         </quorumd>
> 
> cman_tool status shows:
> [root at n1 ~]# cman_tool status
> Protocol version: 5.0.1
> Config version: 8
> Cluster name: alpha_cluster
> Cluster ID: 50356
> Cluster Member: Yes
> Membership state: Cluster-Member
> Nodes: 2
> Expected_votes: 2
> Total_votes: 2
> Quorum: 2
> Active subsystems: 4
> Node name: n1
> Node addresses: 192.168.22.201
> 
> qdiskd is running, scoring a perfect 3 out of 3, but no votes...
> When disconnecting one of the nodes, the other will loose quorum. Am I
> missing something?
> 
> Any insight appreciated.
> --
> Katriel Traum, PenguinIT
> RHCE, CLP
> Mobile: 054-6789953

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953


From jason at monsterjam.org  Wed Oct 18 01:37:15 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Tue, 17 Oct 2006 21:37:15 -0400
Subject: [Linux-cluster] virtual address went down?
Message-ID: <20061018013715.GA31630@monsterjam.org>

so Ive had a test cluster running for quite a while now, both nodes of a 2 node cluster are up, 
but the virtual address seems to have disappeared.. its not pingable, neither server has it 
configured anymore.. The only application I had using the virtual address was apache (just for 
testing it). what logs/information should I be looking at to see what happened and why?

regards,
Jason


From isplist at logicore.net  Wed Oct 18 02:09:11 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 17 Oct 2006 21:09:11 -0500
Subject: [Linux-cluster] Problem starting cluster
Message-ID: <2006101721911.100477@leena>

After changing over to a McData FC switch, I cannot seem to start my cluster 
anymore. Can someone give me some input, ideas?

After loading all appropriate items;

[root at dev new]# cman_tool -t 30 join -w
Timed-out waiting for cluster

[root at dev new]# tail -20 /var/log/messages
Oct 17 20:59:00 dev ccsd[2537]: Connected to cluster infrastruture via: 
CMAN/SM Plugin v1.1.7.1
Oct 17 20:59:00 dev ccsd[2537]: Initial status:: Inquorate
Oct 17 20:59:04 dev kernel: CMAN: sending membership request
Oct 17 20:59:04 dev kernel: CMAN: Cluster membership rejected
Oct 17 20:59:04 dev ccsd[2537]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 17 20:59:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9660 seconds.
Oct 17 20:59:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9690 seconds.
Oct 17 21:00:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9720 seconds.
Oct 17 21:00:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9750 seconds.
Oct 17 21:01:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9780 seconds.
Oct 17 21:01:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9810 seconds.
Oct 17 21:02:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9840 seconds.
Oct 17 21:02:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9870 seconds.
Oct 17 21:03:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9900 seconds.
Oct 17 21:03:56 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9930 seconds.
Oct 17 21:04:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9960 seconds.
Oct 17 21:04:56 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 9990 seconds.
Oct 17 21:05:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 10020 seconds.
Oct 17 21:05:56 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 10050 seconds.
Oct 17 21:06:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
after 10080 seconds.


From jwhiter at redhat.com  Wed Oct 18 02:13:58 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Tue, 17 Oct 2006 22:13:58 -0400
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <2006101721911.100477@leena>
References: <2006101721911.100477@leena>
Message-ID: <20061018021357.GA20018@korben.rdu.redhat.com>

The membership requests are getting rejected, what do the other nodes say during
the same time?

Josef

On Tue, Oct 17, 2006 at 09:09:11PM -0500, isplist at logicore.net wrote:
> After changing over to a McData FC switch, I cannot seem to start my cluster 
> anymore. Can someone give me some input, ideas?
> 
> After loading all appropriate items;
> 
> [root at dev new]# cman_tool -t 30 join -w
> Timed-out waiting for cluster
> 
> [root at dev new]# tail -20 /var/log/messages
> Oct 17 20:59:00 dev ccsd[2537]: Connected to cluster infrastruture via: 
> CMAN/SM Plugin v1.1.7.1
> Oct 17 20:59:00 dev ccsd[2537]: Initial status:: Inquorate
> Oct 17 20:59:04 dev kernel: CMAN: sending membership request
> Oct 17 20:59:04 dev kernel: CMAN: Cluster membership rejected
> Oct 17 20:59:04 dev ccsd[2537]: Cluster manager shutdown.  Attemping to 
> reconnect...
> Oct 17 20:59:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9660 seconds.
> Oct 17 20:59:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9690 seconds.
> Oct 17 21:00:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9720 seconds.
> Oct 17 21:00:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9750 seconds.
> Oct 17 21:01:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9780 seconds.
> Oct 17 21:01:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9810 seconds.
> Oct 17 21:02:25 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9840 seconds.
> Oct 17 21:02:55 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9870 seconds.
> Oct 17 21:03:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9900 seconds.
> Oct 17 21:03:56 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9930 seconds.
> Oct 17 21:04:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9960 seconds.
> Oct 17 21:04:56 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 9990 seconds.
> Oct 17 21:05:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 10020 seconds.
> Oct 17 21:05:56 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 10050 seconds.
> Oct 17 21:06:26 dev ccsd[2537]: Unable to connect to cluster infrastructure 
> after 10080 seconds.
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Wed Oct 18 02:20:01 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 17 Oct 2006 21:20:01 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061018021357.GA20018@korben.rdu.redhat.com>
Message-ID: <2006101721201.277346@leena>

On Tue, 17 Oct 2006 22:13:58 -0400, Josef Whiter wrote:
> The membership requests are getting rejected, what do the other nodes say
> during the same time?

Hi there,

The other machines are in various states of trying to join but here's the 
output of one for example;

Failed to create lockfile.
Hint: ccsd is already running.
Timed-out waiting for cluster
fence_tool: cluster is not active
clvmd could not connect to cluster manager
Consult syslog for more information
  connect() failed on local socket: Connection refused
  Locking type 2 initialisation failed.

PS: Seems to be a typo in the above? :).

And the output of logging;

Oct 17 21:17:57 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:17:57 cweb92 ccsd[2330]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 17 21:17:57 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:17:58 cweb92 kernel: CMAN: sending membership request
Oct 17 21:17:58 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:17:58 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:17:58 cweb92 ccsd[2330]: Connected to cluster infrastruture via: 
CMAN/SM Plugin v1.1.7.1
Oct 17 21:17:58 cweb92 ccsd[2330]: Initial status:: Inquorate
Oct 17 21:18:02 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:02 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:02 cweb92 ccsd[2330]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 17 21:18:02 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:18:03 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:03 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:03 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:18:03 cweb92 ccsd[2330]: Connected to cluster infrastruture via: 
CMAN/SM Plugin v1.1.7.1
Oct 17 21:18:03 cweb92 ccsd[2330]: Initial status:: Inquorate
Oct 17 21:18:06 cweb92 sshd(pam_unix)[3232]: session opened for user root by 
root(uid=0)
Oct 17 21:18:07 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:07 cweb92 ccsd[2330]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 17 21:18:07 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:07 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:18:08 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:08 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:08 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:18:08 cweb92 ccsd[2330]: Connected to cluster infrastruture via: 
CMAN/SM Plugin v1.1.7.1
Oct 17 21:18:08 cweb92 ccsd[2330]: Initial status:: Inquorate
Oct 17 21:18:12 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:12 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:12 cweb92 ccsd[2330]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 17 21:18:12 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:18:13 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:13 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:13 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 17 21:18:13 cweb92 ccsd[2330]: Connected to cluster infrastruture via: 
CMAN/SM Plugin v1.1.7.1
Oct 17 21:18:13 cweb92 ccsd[2330]: Initial status:: Inquorate
Oct 17 21:18:17 cweb92 kernel: CMAN: sending membership request
Oct 17 21:18:17 cweb92 kernel: CMAN: Cluster membership rejected
Oct 17 21:18:17 cweb92 ccsd[2330]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 17 21:18:17 cweb92 clvmd: Can't open cluster manager socket: Network is 
down
Oct 17 21:18:26 cweb92 ccsd[2330]: Unable to connect to cluster infrastructure 
after 11130 seconds.
Oct 17 21:18:56 cweb92 ccsd[2330]: Unable to connect to cluster infrastructure 
after 11160 seconds.

Mike

 
> Josef
> 
> On Tue, Oct 17, 2006 at 09:09:11PM -0500, isplist at logicore.net wrote:
>> After changing over to a McData FC switch, I cannot seem to start my
>> cluster
>> anymore. Can someone give me some input, ideas?
>> 
>> After loading all appropriate items;
>> 
>> [root at dev new]# cman_tool -t 30 join -w
>> Timed-out waiting for cluster
>> 
>> [root at dev new]# tail -20 /var/log/messages
>> Oct 17 20:59:00 dev ccsd[2537]: Connected to cluster infrastruture via:
>> CMAN/SM Plugin v1.1.7.1
>> Oct 17 20:59:00 dev ccsd[2537]: Initial status:: Inquorate
>> Oct 17 20:59:04 dev kernel: CMAN: sending membership request
>> Oct 17 20:59:04 dev kernel: CMAN: Cluster membership rejected
>> Oct 17 20:59:04 dev ccsd[2537]: Cluster manager shutdown.  Attemping to
>> reconnect...
>> Oct 17 20:59:25 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9660 seconds.
>> Oct 17 20:59:55 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9690 seconds.
>> Oct 17 21:00:25 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9720 seconds.
>> Oct 17 21:00:55 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9750 seconds.
>> Oct 17 21:01:25 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9780 seconds.
>> Oct 17 21:01:55 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9810 seconds.
>> Oct 17 21:02:25 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9840 seconds.
>> Oct 17 21:02:55 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9870 seconds.
>> Oct 17 21:03:26 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9900 seconds.
>> Oct 17 21:03:56 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9930 seconds.
>> Oct 17 21:04:26 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9960 seconds.
>> Oct 17 21:04:56 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 9990 seconds.
>> Oct 17 21:05:26 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 10020 seconds.
>> Oct 17 21:05:56 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 10050 seconds.
>> Oct 17 21:06:26 dev ccsd[2537]: Unable to connect to cluster
>> infrastructure
>> after 10080 seconds.
>> 
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster


From pcaulfie at redhat.com  Wed Oct 18 07:27:17 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Wed, 18 Oct 2006 08:27:17 +0100
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <2006101721201.277346@leena>
References: <2006101721201.277346@leena>
Message-ID: <4535D755.2000105@redhat.com>

If all the nodes are showing "rejected" with no reason then it's possible
there's another cluster on the network that shares the same cluster name/number.

Try changing the cluster name of this cluster, and look very carefully around
the network for other clusters!

Otherwise, it's also possible that the cluster.conf files are not in sync
somehow - make sure that they are identical on all nodes.

-- 

patrick


From francisco_javier.pena at roche.com  Wed Oct 18 07:32:08 2006
From: francisco_javier.pena at roche.com (Pena, Francisco Javier)
Date: Wed, 18 Oct 2006 09:32:08 +0200
Subject: [Linux-cluster] Quorum disks and two-node clusters
In-Reply-To: <1160576894.11134.41.camel@rei.boston.devel.redhat.com>
Message-ID: <C0C1791E8EC6F249B5570F01409BD3EE0148524D@rmamsem1.emea.roche.com>

> On Tue, 2006-10-10 at 10:21 +0200, Pena, Francisco Javier wrote:
> > [...]
> > Lon,
> > 
> > Do you know if there are any plans to release an updated 
> cman package, 
> > including this bugfix, before U5? My company is planning to start 
> > implementing GFS, and this would definitely be required for us.
> > 
> > Regards,
> > 
> > Javier Pe?a
> 
> Hi Javier,
> 
> I do not think there is a plan to release the fix early, but 
> that could change at any time.  Note that because the changes 
> were very simple, you can use the checked-out version from 
> CVS and it will work on U4+errata as a drop-in replacement.
> 
> I am interested to know how you are going to use qdiskd such 
> that this is a show-stopper for your organization.
> 
> The only case where the lack of rebooting becomes problematic 
> is if you have heuristics which are not concerned with 
> network connectivity (note: this is a perfectly valid case, 
> of course).
> 
> That is, if two nodes see each other, but one thinks it is 
> inquorate, the quorate node will *not* fence the inquorate 
> node, and... well, things will not work very smoothly (this 
> is the case where a reboot is needed).
> 
> In network outages (one of the things qdiskd was designed to 
> help with), the nodes will not see each other, so the node 
> which still thinks it is quorate will fence the inquorate 
> node, and the cluster will cleanly continue.
> 
> -- Lon
> 

Hi Lon,

After doing some additional checks with my test environment, I think I was too fast in assuming this would be absolutely required. I assumed that having the failed node reboot itself would eliminate the need to fence that node, but it looks like this is not the case. Which makes me wonder, why do we want the server to reboot itself, if it is going to be fenced anyway?

I we could avoid the fencing the failed node, we would be able to solve some problems I found with iLO fencing: if a node loses power completely, the iLO card will not work, so we will never be able to fence the failed node, and the whole cluster will be stopped. If we can assume that an inquorate node will inmediately reboot, we might continue working without any manual interaction.

Thanks for your answer. Regards,

Javier Pe?a


From jon.daniels at voxsurf.com  Wed Oct 18 09:16:35 2006
From: jon.daniels at voxsurf.com (Jonathan Daniels)
Date: Wed, 18 Oct 2006 10:16:35 +0100
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <200610171918.k9HJISE6018571@mx1.redhat.com>
References: <200610171918.k9HJISE6018571@mx1.redhat.com>
Message-ID: <4535F0F3.7000406@voxsurf.com>

Hi Sherwyn,

We use a Hewlett-Packard MSA 1000, connected to our servers (nodes) by 
DLogic 2300 fibre channels, which, if you have a spare ?10,000, is fine. 
The partition I use has been turned into a GFS filesystem (get the GFS 
tools from the RedHat Network) which enables the shared storage to be 
mounted, and more importantly, written to without fear of corruption.

We also use TeraStations for our NFS servers which I believe are 
somewhat cheaper. Depends on the 'criticality' of your setup.

Thanks,
Jon


Sherwyn Greene wrote:

>Hi, I may not be able to help you with your problem, but maybe you can help
>me with one.
>
>1 What type of shared storage you have setup on your cluster?
>
>I'm tring to setup a cluster with 2 nodes and one shared storage for web
>pages & ftp 
>Can you help me?
>
>Thanks 
>
>
>Sherwyn Greene
>Planner / I.T. Technician
>Project Controls Dept.
>Kentz-OJ's E&I Services J.V.
>+1 (868) 648-0876
>
>-----Original Message-----
>From: linux-cluster-bounces at redhat.com
>[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Katriel Traum
>Sent: Tuesday, October 17, 2006 2:12 PM
>To: linux clustering
>Subject: [Linux-cluster] cman & qdiskd
>
>-----BEGIN PGP SIGNED MESSAGE-----
>Hash: SHA1
>
>Hello.
>
>I've seen this subject on the list, but no real solutions.
>I'm using Cluster 4 update 4, with qdiskd and a shared disk.
>I've understood from the documentation and list that a "cman_tool status"
>should reflect the number of votes the quorum daemon holds.
>
>My setup is pretty straight forward, 2-node cluster, shared storage (AoE for
>testing).
>qdiskd configuration:
><quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
>d/e0.0" status_file="/tmp/qdisk-status">
>                <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
>interval="2"/>
>                <heuristic program="ping 192.168.22.60 -c1 -t1"
>score="1" interval="2"/>
>                <heuristic program="ping 192.168.22.100 -c1 -t1"
>score="1" interval="2"/>
>        </quorumd>
>
>cman_tool status shows:
>[root at n1 ~]# cman_tool status
>Protocol version: 5.0.1
>Config version: 8
>Cluster name: alpha_cluster
>Cluster ID: 50356
>Cluster Member: Yes
>Membership state: Cluster-Member
>Nodes: 2
>Expected_votes: 2
>Total_votes: 2
>Quorum: 2
>Active subsystems: 4
>Node name: n1
>Node addresses: 192.168.22.201
>
>qdiskd is running, scoring a perfect 3 out of 3, but no votes...
>When disconnecting one of the nodes, the other will loose quorum. Am I
>missing something?
>
>Any insight appreciated.
>- --
>Katriel Traum, PenguinIT
>RHCE, CLP
>Mobile: 054-6789953
>-----BEGIN PGP SIGNATURE-----
>Version: GnuPG v1.4.5 (GNU/Linux)
>Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org
>
>iD8DBQFFNRzZDWy+Hv/461sRApxVAJoCk07nzlRCsMbXk/+3rZcXUg+CLQCg0a24
>387gAU9u4ThcOLsrnFi1YkU=
>=GaOt
>-----END PGP SIGNATURE-----
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>  
>


From vaibhav.srivastava at celstream.com  Wed Oct 18 10:16:15 2006
From: vaibhav.srivastava at celstream.com (Vaibhav Srivastava)
Date: Wed, 18 Oct 2006 15:46:15 +0530
Subject: [Linux-cluster] Simulating multiple nodes.
In-Reply-To: <4535F0F3.7000406@voxsurf.com>
Message-ID: <CCEE49B9CE065146BA4FE34B2748A5320320A064@CEL-BANGT-M01.celstream-in.com>


Hi,

I am a new member of community. I was just wondering is there any way to
simulate cluster on a stand alone machine? Can that be done using User
Mode Linux?

______________________________________________________________
DISCLAIMER:
This electronic message, and any attachments to this electronic message are
intended for the exclusive use of the addressee(s)named herein and may contain
legally privileged and confidential information. It is the property of Celstream
Technologies Ltd. If you are not the intended recipient, you are hereby
strictly notified not to copy, forward, distribute or use this message or any
attachments thereto. If you have received this message in error, please delete
it and all copies thereof from your system and notify the sender at Celstream
Technologies or administrator at celstream.com immediately
_______________________________________________________________


From jwhiter at redhat.com  Wed Oct 18 12:10:51 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Wed, 18 Oct 2006 08:10:51 -0400
Subject: [Linux-cluster] Quorum disks and two-node clusters
In-Reply-To: <C0C1791E8EC6F249B5570F01409BD3EE0148524D@rmamsem1.emea.roche.com>
References: <1160576894.11134.41.camel@rei.boston.devel.redhat.com>
	<C0C1791E8EC6F249B5570F01409BD3EE0148524D@rmamsem1.emea.roche.com>
Message-ID: <20061018121051.GC20018@korben.rdu.redhat.com>

On Wed, Oct 18, 2006 at 09:32:08AM +0200, Pena, Francisco Javier wrote:
> Hi Lon,
> 
> After doing some additional checks with my test environment, I think I was too fast in assuming this would be absolutely required. I assumed that having the failed node reboot itself would eliminate the need to fence that node, but it looks like this is not the case. Which makes me wonder, why do we want the server to reboot itself, if it is going to be fenced anyway?
>

In the case that you have a seperate network for the heartbeat interface.  If a
node loses its heartbeat connection, it will assume the rest of the cluster is
out and will go down swinging, so you run the risk of having good nodes fenced.
With qdisk, if you have the hueristics setup right, it will recognize that the
node that has lost its heartbeat connection is the bad one and reboot itself,
without trying to fence the other good nodes.
 
> I we could avoid the fencing the failed node, we would be able to solve some problems I found with iLO fencing: if a node loses power completely, the iLO card will not work, so we will never be able to fence the failed node, and the whole cluster will be stopped. If we can assume that an inquorate node will inmediately reboot, we might continue working without any manual interaction.
> 

This is a problem.  You may want to consider adding a second fence level with
manual fencing.  If you cannot connect to the iLo interface chances are the box
is truly down and you don't have to worry about the implications of using manual
fencing, and it will allow the cluster to continue working, you will just have
to remember to do fence_ack_manual on the node that did the fencing.

Josef


From jwhiter at redhat.com  Wed Oct 18 12:11:49 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Wed, 18 Oct 2006 08:11:49 -0400
Subject: [Linux-cluster] Simulating multiple nodes.
In-Reply-To: <CCEE49B9CE065146BA4FE34B2748A5320320A064@CEL-BANGT-M01.celstream-in.com>
References: <4535F0F3.7000406@voxsurf.com>
	<CCEE49B9CE065146BA4FE34B2748A5320320A064@CEL-BANGT-M01.celstream-in.com>
Message-ID: <20061018121148.GD20018@korben.rdu.redhat.com>

You can use xen or vmware, and just have a partition that the individual nodes
share.

Josef

On Wed, Oct 18, 2006 at 03:46:15PM +0530, Vaibhav Srivastava wrote:
> 
> Hi,
> 
> I am a new member of community. I was just wondering is there any way to
> simulate cluster on a stand alone machine? Can that be done using User
> Mode Linux?
> 
> ______________________________________________________________
> DISCLAIMER:
> This electronic message, and any attachments to this electronic message are
> intended for the exclusive use of the addressee(s)named herein and may contain
> legally privileged and confidential information. It is the property of Celstream
> Technologies Ltd. If you are not the intended recipient, you are hereby
> strictly notified not to copy, forward, distribute or use this message or any
> attachments thereto. If you have received this message in error, please delete
> it and all copies thereof from your system and notify the sender at Celstream
> Technologies or administrator at celstream.com immediately
> _______________________________________________________________
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From jvantuyl at qualityhumans.com  Tue Oct 17 17:07:39 2006
From: jvantuyl at qualityhumans.com (Jayson Vantuyl)
Date: Tue, 17 Oct 2006 12:07:39 -0500
Subject: [Linux-cluster] CMAN: got WAIT barrier not in phase 1
	TRANSITION.96 (2)
In-Reply-To: <4533A69C.8060604@redhat.com>
References: <20061013194123.GB25294@proton.souja.net>
	<B646F4D3-B6FF-4EF4-8967-633BC1B4D24F@engineyard.com>
	<4533A69C.8060604@redhat.com>
Message-ID: <20061017170739.GF13291@proton.souja.net>

> > Oct 13 04:17:18 ey00-s00017 kernel: CMAN: got WAIT barrier not in phase
> > 1 TRANSITION.96 (2)
> That message should be harmless. does it prevent the cluster reaching quorum ?
Hello Patrick / list, I've been working with Tom on this problem.

It doesn't prevent quorum, although after this point the nodes
mysteriously can't seem to join the fence domain.  I've checked and it
doesn't appear that anyone is trying to fence anyone else, so I'm at a
bit of a loss to explain what's going on.

The really bizarre thing is that the old nodes don't seem to play with
the new ones despite them being joined into the cluster (i.e. fence
domain on old nodes shows running, fence domain on new node says joining
indefinitely).  If you prod it enough (start enough new nodes),
eventually the existing cluster will blow apart (nodes start kicking
each other for inconsistency and the like).

Let me explain a few things about our cluster:

We are running Xen.

The control VM for each node is in the cluster with 1 vote.

The application VMs are dynamically spawned and are entered into the
cluster.

The application VMs have 0 votes (so as to prevent one physical machine
from accidentally grabbing a quorum of votes if it has too many
application VMs running on it).

We are currently using fence_manual for debugging purposes (we have an
APC MasterSwitch to eventually use for fencing).

We are experiencing the following problems:

After a certain size (about 20 cluster members) we start having serious
issues with the cluster holding together.  Nodes are sometimes kicked
for having an inconsistent view.  There is often a complaint about the
count of members not matching between nodes as well.  Right now we have
the 1.03 version of everything installed (it was packaged and we are
trying to avoid building too much from scratch).

When a node starts up with an old cluster.conf, it never seems to
automatically update to the newer version.  If the file is updated while
a node is down, must it be manually synched up before resuming?

Finally, a random question.  When I'm debugging this stuff, I use
"cman_tool services" to keep tabs on some things.  What does the stuff
in the Code column mean?
-- 
Jayson Vantuyl
Quality Humans, Inc.


From tmornini at engineyard.com  Wed Oct 18 17:05:27 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Wed, 18 Oct 2006 10:05:27 -0700
Subject: [Linux-cluster] CMAN: got WAIT barrier not in phase 1
	TRANSITION.96 (2)
In-Reply-To: <4533A69C.8060604@redhat.com>
References: <20061013194123.GB25294@proton.souja.net>
	<B646F4D3-B6FF-4EF4-8967-633BC1B4D24F@engineyard.com>
	<4533A69C.8060604@redhat.com>
Message-ID: <9E06AF22-3CA0-4BC6-8189-8E5ABCCF0B68@engineyard.com>

On Oct 16, 2006, at 8:34 AM, Patrick Caulfield wrote:

> Tom Mornini wrote:
>> We're getting problems when adding cluster nodes to our cluster.

snip...

>> Oct 13 04:09:04 ey00-s00017 kernel: CMAN: Waiting to join or form a
>> Linux-cluster
>> Oct 13 04:09:05 ey00-s00017 kernel: CMAN: sending membership request
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00025
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00019
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00030
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00024
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00010
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00016
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00004
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00011
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00005
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00009
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00002
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00015
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00014
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00008
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00003
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00006
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00012
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00013
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00007
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00001
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-s00000
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-04
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-05
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-03
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-00
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-01
>> Oct 13 04:09:06 ey00-s00017 kernel: CMAN: got node ey00-02
>> Oct 13 04:09:06 ey00-s00017 kernel: dlm: no version for
>> "kcl_register_service" found: kernel tainted.
>> Oct 13 04:09:06 ey00-s00017 kernel: DLM 1.03.00 (built Sep  8 2006
>> 03:50:23) installed
>> Oct 13 04:09:57 ey00-s00017 kernel: CMAN: node ey00-s00018 rejoining
>> Oct 13 04:17:18 ey00-s00017 kernel: CMAN: got WAIT barrier not in  
>> phase
>> 1 TRANSITION.96 (2)
>
> That message should be harmless. does it prevent the cluster  
> reaching quorum ?

>> Oct 13 04:17:18 ey00-s00017 kernel: CMAN: got WAIT barrier not in  
>> phase
>> 1 TRANSITION.96 (2)
>>
> That message should be harmless. does it prevent the cluster  
> reaching quorum ?

Hello Patrick / list, I've been working with Tom on this problem.

It doesn't prevent quorum, although after this point the nodes
mysteriously can't seem to join the fence domain.  I've checked and it
doesn't appear that anyone is trying to fence anyone else, so I'm at a
bit of a loss to explain what's going on.

The really bizarre thing is that the old nodes don't seem to play with
the new ones despite them being joined into the cluster (i.e. fence
domain on old nodes shows running, fence domain on new node says joining
indefinitely).  If you prod it enough (start enough new nodes),
eventually the existing cluster will blow apart (nodes start kicking
each other for inconsistency and the like).

Let me explain a few things about our cluster:

We are running Xen.

The control VM for each node is in the cluster with 1 vote.

The application VMs are dynamically spawned and are entered into the
cluster.

The application VMs have 0 votes (so as to prevent one physical machine
from accidentally grabbing a quorum of votes if it has too many
application VMs running on it).

We are currently using fence_manual for debugging purposes (we have an
APC MasterSwitch to eventually use for fencing).

We are experiencing the following problems:

After a certain size (about 20 cluster members) we start having serious
issues with the cluster holding together.  Nodes are sometimes kicked
for having an inconsistent view.  There is often a complaint about the
count of members not matching between nodes as well.  Right now we have
the 1.03 version of everything installed (it was packaged and we are
trying to avoid building too much from scratch).

When a node starts up with an old cluster.conf, it never seems to
automatically update to the newer version.  If the file is updated while
a node is down, must it be manually synched up before resuming?

Finally, a random question.  When I'm debugging this stuff, I use
"cman_tool services" to keep tabs on some things.  What does the stuff
in the Code column mean?

-- 

Jayson Vantuyl
Quality Humans, Inc.


From isplist at logicore.net  Wed Oct 18 18:11:47 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 18 Oct 2006 13:11:47 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <4535D755.2000105@redhat.com>
Message-ID: <20061018131147.727308@leena>

> If all the nodes are showing "rejected" with no reason then it's possible
> there's another cluster on the network that shares the same cluster
> name/number.

No other cluster as this is the only one I'm working on.
 
> Try changing the cluster name of this cluster, and look very carefully
> around the network for other clusters!

Tried changing the name in cluster.conf, no difference. 
 
> Otherwise, it's also possible that the cluster.conf files are not in sync
> somehow - make sure that they are identical on all nodes.

I've manually copied them to each node but I'm only trying to start the one 
first. Getting the same error trying to start the cluster on any node.

Mike


From jwhiter at redhat.com  Wed Oct 18 18:16:53 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Wed, 18 Oct 2006 14:16:53 -0400
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061018131147.727308@leena>
References: <4535D755.2000105@redhat.com> <20061018131147.727308@leena>
Message-ID: <20061018181653.GE20018@korben.rdu.redhat.com>

Chkconfig everything off, bring all nodes up, then on all nodes run each of
these commands at the same time

service ccsd start
service cman start
service fenced start

and if any problems occur see what the logs spit out.

Josef

On Wed, Oct 18, 2006 at 01:11:47PM -0500, isplist at logicore.net wrote:
> > If all the nodes are showing "rejected" with no reason then it's possible
> > there's another cluster on the network that shares the same cluster
> > name/number.
> 
> No other cluster as this is the only one I'm working on.
>  
> > Try changing the cluster name of this cluster, and look very carefully
> > around the network for other clusters!
> 
> Tried changing the name in cluster.conf, no difference. 
>  
> > Otherwise, it's also possible that the cluster.conf files are not in sync
> > somehow - make sure that they are identical on all nodes.
> 
> I've manually copied them to each node but I'm only trying to start the one 
> first. Getting the same error trying to start the cluster on any node.
> 
> Mike
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Wed Oct 18 18:22:54 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 18 Oct 2006 13:22:54 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061018181653.GE20018@korben.rdu.redhat.com>
Message-ID: <20061018132254.439486@leena>

I actually have little on for auto start. I was never able to start the 
cluster other than by doing the following;

depmod -a
modprobe dm-mod
modprobe gfs
modprobe lock_dlm
ccsd
cman_tool join -t 120 -w
fence_tool join -t 120 -w
clvmd
vgchange -aly
mount -t gfs /dev/vgcomp/rimfire /var/www/

Mike


On Wed, 18 Oct 2006 14:16:53 -0400, Josef Whiter wrote:
> Chkconfig everything off, bring all nodes up, then on all nodes run each of
> 
> these commands at the same time
> 
> service ccsd start
> service cman start
> service fenced start
> 
> and if any problems occur see what the logs spit out.
> 
> Josef
> 
> On Wed, Oct 18, 2006 at 01:11:47PM -0500, isplist at logicore.net wrote:
>>> If all the nodes are showing "rejected" with no reason then it's
>>> possible
>>> there's another cluster on the network that shares the same cluster
>>> name/number.
>>> 
>> No other cluster as this is the only one I'm working on.
>> 
>>> Try changing the cluster name of this cluster, and look very carefully
>>> around the network for other clusters!
>>> 
>> Tried changing the name in cluster.conf, no difference.
>> 
>>> Otherwise, it's also possible that the cluster.conf files are not in
>>> sync
>>> somehow - make sure that they are identical on all nodes.
>>> 
>> I've manually copied them to each node but I'm only trying to start the
>> one
>> first. Getting the same error trying to start the cluster on any node.
>> 
>> Mike
>> 
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Wed Oct 18 18:34:15 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 18 Oct 2006 13:34:15 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061018181653.GE20018@korben.rdu.redhat.com>
Message-ID: <20061018133415.781485@leena>

> service ccsd start
> service cman start
> service fenced start

Just for the heck of it, I stopped all services to be sure, then started them 
again manually as suggested above.

cman is stuck and failed trying to start, here's what is showing in the log;

Oct 18 13:26:37 cweb92 ccsd[2693]: Starting ccsd 1.0.7:
Oct 18 13:26:37 cweb92 ccsd[2693]:  Built: Aug 25 2006 19:13:15
Oct 18 13:26:37 cweb92 ccsd[2693]:  Copyright (C) Red Hat, Inc.  2004  All 
rights reser                      ved.
Oct 18 13:26:37 cweb92 ccsd:  succeeded
Oct 18 13:26:43 cweb92 ccsd[2693]: cluster.conf (cluster name = comp, version 
= 42) fou                      nd.
Oct 18 13:26:43 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 18 13:26:44 cweb92 ccsd[2693]: Connected to cluster infrastruture via: 
CMAN/SM Plug                      in v1.1.7.1
Oct 18 13:26:44 cweb92 ccsd[2693]: Initial status:: Inquorate
Oct 18 13:27:15 cweb92 kernel: CMAN: forming a new cluster
Oct 18 13:32:15 cweb92 cman: Timed-out waiting for cluster failed

So, it's stuck inquorate before it even get's going?

Here's my cluster.conf which I'm SURE is the problem;

<?xml version="1.0"?>
<cluster config_version="42" name="comp">
    <fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
    <clusternode name="cweb92" votes="1"/>
    <clusternode name="cweb93" votes="1"/>
    <clusternode name="cweb94" votes="1"/>
    <clusternode name="dev" votes="1"/>
    <clusternode name="qm247" votes="1"/>
    <clusternode name="qm248" votes="1"/>
    <clusternode name="qm249" votes="1"/>
    <clusternode name="qm250" votes="1"/>
</clusternodes>
        <cman/>
<fencedevices>
    <fencedevice agent="fence_mcdata" ipaddr="192.168.x.x" login="user" 
name="ED5000" passwd="xxxx"/>
</fencedevices>
        <rm>
                <failoverdomains/>
                <resources/>
        </rm>
</cluster>


From jwhiter at redhat.com  Wed Oct 18 18:52:44 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Wed, 18 Oct 2006 14:52:44 -0400
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061018133415.781485@leena>
References: <20061018181653.GE20018@korben.rdu.redhat.com>
	<20061018133415.781485@leena>
Message-ID: <20061018185243.GF20018@korben.rdu.redhat.com>

It's waiting to form a cluster, which means its waiting for enough members to
join for it to become quorate.  Try starting all the nodes at the same time and
see what happens.

Josef

On Wed, Oct 18, 2006 at 01:34:15PM -0500, isplist at logicore.net wrote:
> > service ccsd start
> > service cman start
> > service fenced start
> 
> Just for the heck of it, I stopped all services to be sure, then started them 
> again manually as suggested above.
> 
> cman is stuck and failed trying to start, here's what is showing in the log;
> 
> Oct 18 13:26:37 cweb92 ccsd[2693]: Starting ccsd 1.0.7:
> Oct 18 13:26:37 cweb92 ccsd[2693]:  Built: Aug 25 2006 19:13:15
> Oct 18 13:26:37 cweb92 ccsd[2693]:  Copyright (C) Red Hat, Inc.  2004  All 
> rights reser                      ved.
> Oct 18 13:26:37 cweb92 ccsd:  succeeded
> Oct 18 13:26:43 cweb92 ccsd[2693]: cluster.conf (cluster name = comp, version 
> = 42) fou                      nd.
> Oct 18 13:26:43 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
> Oct 18 13:26:44 cweb92 ccsd[2693]: Connected to cluster infrastruture via: 
> CMAN/SM Plug                      in v1.1.7.1
> Oct 18 13:26:44 cweb92 ccsd[2693]: Initial status:: Inquorate
> Oct 18 13:27:15 cweb92 kernel: CMAN: forming a new cluster
> Oct 18 13:32:15 cweb92 cman: Timed-out waiting for cluster failed
> 
> So, it's stuck inquorate before it even get's going?
> 
> Here's my cluster.conf which I'm SURE is the problem;
> 
> <?xml version="1.0"?>
> <cluster config_version="42" name="comp">
>     <fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
> <clusternodes>
>     <clusternode name="cweb92" votes="1"/>
>     <clusternode name="cweb93" votes="1"/>
>     <clusternode name="cweb94" votes="1"/>
>     <clusternode name="dev" votes="1"/>
>     <clusternode name="qm247" votes="1"/>
>     <clusternode name="qm248" votes="1"/>
>     <clusternode name="qm249" votes="1"/>
>     <clusternode name="qm250" votes="1"/>
> </clusternodes>
>         <cman/>
> <fencedevices>
>     <fencedevice agent="fence_mcdata" ipaddr="192.168.x.x" login="user" 
> name="ED5000" passwd="xxxx"/>
> </fencedevices>
>         <rm>
>                 <failoverdomains/>
>                 <resources/>
>         </rm>
> </cluster>
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Wed Oct 18 19:01:20 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 18 Oct 2006 14:01:20 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061018185243.GF20018@korben.rdu.redhat.com>
Message-ID: <2006101814120.147149@leena>

Interesting...

Fired up 3 nodes... two ended up waiting for cluster quorum at the fence_tool 
startup, one failed as before. Unblocked the quorum and still, only one node 
is now in the cluster.

Log on one node that has the same problem;

# tail /var/log/messages
Oct 18 14:00:09 cweb93 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 18 14:00:10 cweb93 kernel: CMAN: sending membership request
Oct 18 14:00:10 cweb93 kernel: CMAN: Cluster membership rejected
Oct 18 14:00:10 cweb93 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 18 14:00:10 cweb93 ccsd[2411]: Connected to cluster infrastruture via: 
CMAN/               SM Plugin v1.1.7.1
Oct 18 14:00:10 cweb93 ccsd[2411]: Initial status:: Inquorate
Oct 18 14:00:14 cweb93 kernel: CMAN: sending membership request
Oct 18 14:00:14 cweb93 kernel: CMAN: Cluster membership rejected
Oct 18 14:00:14 cweb93 ccsd[2411]: Cluster manager shutdown.  Attemping to 
recon               nect...
Oct 18 14:00:15 cweb93 clvmd: Can't open cluster manager socket: Network is 
down


From jason at monsterjam.org  Wed Oct 18 19:29:36 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Wed, 18 Oct 2006 15:29:36 -0400
Subject: [Linux-cluster] virtual address went down?
In-Reply-To: <20061018013715.GA31630@monsterjam.org>
References: <20061018013715.GA31630@monsterjam.org>
Message-ID: <20061018192936.GA26836@monsterjam.org>

since no one responded to my question, I tried stopping the services on 
the first box:
service rgmanager stop
service gfs stop
service clvmd stop
service fenced stop
service cman stop
service ccsd stop

everything came down fine.
then I started em back up..
service ccsd start
this seemed to hang for about 2 minutes, then I got a panic..
as shown in the attached graphic..

this is on  2.6.9-34.ELsmp redhat  Enterprise Linux AS release 4 (Nahant 
Update 4)
running ccs-1.0.3-0,
cman-kernel-hugemem-2.6.9-43.8
cman-kernel-2.6.9-43.8
cman-1.0.4-0
cman-kernel-smp-2.6.9-43.8
cman-kernheaders-2.6.9-43.8

 build from sources..

heres my cluster.conf

<?xml version="1.0"?>
<cluster config_version="22" name="progressive">
        <fence_daemon clean_start="0" post_fail_delay="0" 
post_join_delay="3"/>
        <clusternodes>
                <clusternode name="tf1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="1" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="2" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="1" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="2" switch="1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="tf2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="3" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="4" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="3" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="4" switch="1"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_apc" ipaddr="192.168.1.8" 
login="xxx"                                                         
name="apc_power_switch" passwd="xxx"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="httpd" ordered="1" 
restricted="1">
                                <failoverdomainnode name="tf1" 
priority="1"/>
                                <failoverdomainnode name="tf2" 
priority="2"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <script file="/etc/init.d/httpd" 
name="cluster_apache"/>
                        <fs device="/dev/mapper/diskarray-lv1" 
fstype="ext3" mou                                                        
ntpoint="/mnt/gfs/htdocs" name="apache_content"/>
                        <ip address="192.168.1.7" monitor_link="1"/>
                </resources>
                <service autostart="1" domain="httpd" name="Apache 
Service">
                        <ip ref="192.168.1.7"/>
                        <script ref="cluster_apache"/>
                        <fs ref="apache_content"/>
                </service>
        </rm>
</cluster>

ooh and shortly after the first box came back up, the second one got 
rebooted automagically (power fenced from the first one im guessing) for 
good measure.

but now the virtual address is working again.

any help appreciated 


Jason


On Tue, Oct 17, 2006 at 09:37:15PM -0400, jason at monsterjam.org wrote:
> so Ive had a test cluster running for quite a while now, both nodes of a 2 node cluster are up, 
> but the virtual address seems to have disappeared.. its not pingable, neither server has it 
> configured anymore.. The only application I had using the virtual address was apache (just for 
> testing it). what logs/information should I be looking at to see what happened and why?
> 
> regards,
> Jason
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
================================================
|    Jason Welsh   jason at monsterjam.org        |
| http://monsterjam.org    DSS PGP: 0x5E30CC98 |
|    gpg key: http://monsterjam.org/gpg/       |
================================================


From katriel at penguin-it.co.il  Wed Oct 18 19:38:13 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Wed, 18 Oct 2006 21:38:13 +0200
Subject: [Linux-cluster] node fails to stop when inquorate
Message-ID: <453682A5.9070008@penguin-it.co.il>

Hello.

I've been seeing some strange behavior on a failed node that perhaps
some of the forum members have encountered.

A 2-node cluster with qdiskd running. Disconnecting one node from the
network causes it to be "fabric fenced", and the remaining node
continues working as expected.
When trying to restart the failed node, rgmanager's script sends it (the
rgmanager process) into zombie land, which makes the script loop forever.
The (ugly) workaround I've been using is killing the process manually
and then manually removing /var/lock/subsys/rgmanager, which causes "rc"
to skip it.

Is there a better way to restart a failed node? Shouldn't a failed node
be "hard booted" by cman?

Thanks,
-- 
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953


From lhh at redhat.com  Wed Oct 18 20:20:06 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 18 Oct 2006 16:20:06 -0400
Subject: [Linux-cluster] node fails to stop when inquorate
In-Reply-To: <453682A5.9070008@penguin-it.co.il>
References: <453682A5.9070008@penguin-it.co.il>
Message-ID: <1161202806.6685.69.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-18 at 21:38 +0200, Katriel Traum wrote:

> The (ugly) workaround I've been using is killing the process manually
> and then manually removing /var/lock/subsys/rgmanager, which causes "rc"
> to skip it.

> Is there a better way to restart a failed node? Shouldn't a failed node
> be "hard booted" by cman?

Nodes don't "know" they're fenced with fabric-level fencing; it's a
deficiency in the model itself.

The easiest thing to do is 'reboot -fn'.  A fenced node may have
outstanding buffers which never get cleaned up - so you can't "un-fence"
them until they have been rebooted anyway.

Rgmanager's child processes are probably trying to umount the a file
system that has been fenced and are stuck in disk-wait - which may be
"forever", depending on the storage configuration.

There's an patch outstanding for qdiskd which makes it reboot on loss of
score, which triggers a reboot.  However, I don't think this is your
problem.

-- Lon


From lhh at redhat.com  Wed Oct 18 20:22:42 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 18 Oct 2006 16:22:42 -0400
Subject: [Linux-cluster] virtual address went down?
In-Reply-To: <20061018013715.GA31630@monsterjam.org>
References: <20061018013715.GA31630@monsterjam.org>
Message-ID: <1161202963.6685.71.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-17 at 21:37 -0400, jason at monsterjam.org wrote:
> so Ive had a test cluster running for quite a while now, both nodes of a 2 node cluster are up, 
> but the virtual address seems to have disappeared.. its not pingable, neither server has it 
> configured anymore..

You're using 'ip addr list', not 'ifconfig' to look for it, right?

> The only application I had using the virtual address was apache (just for 
> testing it). what logs/information should I be looking at to see what happened and why?

Firewall bits?

What kind of ethernet cards do you have, out of curiosity, and are they
in any sort of bonding configuration?

-- Lon


From lhh at redhat.com  Wed Oct 18 20:25:29 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 18 Oct 2006 16:25:29 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <45351CDE.3090502@penguin-it.co.il>
References: <45351CDE.3090502@penguin-it.co.il>
Message-ID: <1161203129.6685.75.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-17 at 20:11 +0200, Katriel Traum wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Hello.
> 
> I've seen this subject on the list, but no real solutions.
> I'm using Cluster 4 update 4, with qdiskd and a shared disk.
> I've understood from the documentation and list that a "cman_tool
> status" should reflect the number of votes the quorum daemon holds.
> 
> My setup is pretty straight forward, 2-node cluster, shared storage (AoE
> for testing).
> qdiskd configuration:
> <quorumd interval="1" tko="5" votes="3" log_level="7" device="/dev/ether
> d/e0.0" status_file="/tmp/qdisk-status">

What does /tmp/qdisk-status say?

Also, check 'dmesg' -- there should be information in dmesg when the
quorum device is registered/unregistered.

-- Lon


From jason at monsterjam.org  Wed Oct 18 20:32:08 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Wed, 18 Oct 2006 16:32:08 -0400
Subject: [Linux-cluster] virtual address went down?
In-Reply-To: <1161202963.6685.71.camel@rei.boston.devel.redhat.com>
References: <20061018013715.GA31630@monsterjam.org>
	<1161202963.6685.71.camel@rei.boston.devel.redhat.com>
Message-ID: <20061018203208.GA28571@monsterjam.org>

> > so Ive had a test cluster running for quite a while now, both nodes of a 2 node cluster are up, 
> > but the virtual address seems to have disappeared.. its not pingable, neither server has it 
> > configured anymore..
> 
> You're using 'ip addr list', not 'ifconfig' to look for it, right?
actually, I was using ifconfig, I forgot that you need to use ip addr 
list.. my mistake.. but the address was for sure not pingable.

> 
> > The only application I had using the virtual address was apache (just for 
> > testing it). what logs/information should I be looking at to see what happened and why?
> 
> Firewall bits?
not sure what you mean here.. 

> 
> What kind of ethernet cards do you have, out of curiosity, and are they
> in any sort of bonding configuration?
> 
[jason at tf1 ~]$ lspci | grep -i ether
02:0c.0 Ethernet controller: Intel Corporation 82557/8/9 [Ethernet Pro 
100] (rev 0d)
06:07.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit 
Ethernet Controller (rev 05)
07:08.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit 
Ethernet Controller (rev 05)

im not doing any bonding.. 

Jason


> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
================================================
|    Jason Welsh   jason at monsterjam.org        |
| http://monsterjam.org    DSS PGP: 0x5E30CC98 |
|    gpg key: http://monsterjam.org/gpg/       |
================================================


From lhh at redhat.com  Wed Oct 18 20:53:44 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 18 Oct 2006 16:53:44 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <20061017184932.GE10076@korben.rdu.redhat.com>
References: <45351CDE.3090502@penguin-it.co.il>
	<20061017182510.GD10076@korben.rdu.redhat.com>
	<45352433.3080603@penguin-it.co.il>
	<20061017184932.GE10076@korben.rdu.redhat.com>
Message-ID: <1161204824.6685.83.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-17 at 14:49 -0400, Josef Whiter wrote:

> http://pastebin.com and post the
> url so i can take a look.

Me too ;)

> On Tue, Oct 17, 2006 at 08:42:59PM +0200, Katriel Traum wrote:
> > qdisk is running:
> > [root at n1 ~]# service qdiskd status
> > qdiskd (pid 1199) is running...
> > [root at n2 ~]# service qdiskd status
> > qdiskd (pid 873) is running...
> > 
> > /tmp/qdisk-status:
> > [root at n1 ~]# cat /tmp/qdisk-status
> > Node ID: 1
> > Score (current / min req. / max allowed): 3 / 2 / 3
> > Current state: Master
> > Current disk state: None
> > Visible Set: { 1 2 }
> > Master Node ID: 1
> > Quorate Set: { 1 2 }

This indicates that it should be working correctly... The master
election took place, the quorate set is right, and so is your
configuration.

> >         <quorumd interval="1" tko="5" votes="3" log_level="7"
> > device="/dev/etherd/e0.0" status_file="/tmp/qdisk-status">
> >                 <heuristic program="ping 192.168.22.1 -c1 -t1" score="1"
> > interval="2"/>
> >                 <heuristic program="ping 192.168.22.60 -c1 -t1"
> > score="1" interval="2"/>
> >                 <heuristic program="ping 192.168.22.100 -c1 -t1"
> > score="1" interval="2"/>
> >         </quorumd>

Everything looks right.  On my two node cluster, I'm running with
expected votes = 4, 2 votes for qdisk, 1 vote per node... *scratches
head*.  Qdisk seems to be operating correctly, but it apparently didn't
register correctly with cman or something...?

-- Lon


From lhh at redhat.com  Wed Oct 18 20:54:15 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 18 Oct 2006 16:54:15 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <1161203129.6685.75.camel@rei.boston.devel.redhat.com>
References: <45351CDE.3090502@penguin-it.co.il>
	<1161203129.6685.75.camel@rei.boston.devel.redhat.com>
Message-ID: <1161204855.6685.85.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-18 at 16:25 -0400, Lon Hohberger wrote:

> What does /tmp/qdisk-status say?

Disregard.

> Also, check 'dmesg' -- there should be information in dmesg when the
> quorum device is registered/unregistered.

I was incorrect -- only loss of the quorum disk is noted.

-- Lon


From lhh at redhat.com  Wed Oct 18 20:56:01 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 18 Oct 2006 16:56:01 -0400
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <45354079.8080709@penguin-it.co.il>
References: <45351CDE.3090502@penguin-it.co.il>
	<20061017182510.GD10076@korben.rdu.redhat.com>
	<45352433.3080603@penguin-it.co.il>
	<20061017184932.GE10076@korben.rdu.redhat.com>
	<45354079.8080709@penguin-it.co.il>
Message-ID: <1161204961.6685.87.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-17 at 22:43 +0200, Katriel Traum wrote:
> Okay. My bad.
> Using older cman/dlm kernel modules with newer cman/qdisk binaries
> works, but I guess the older kernel module doesn't know how to accept a
> "quorum  node".
> After using the same version all around, it works.

Oh ;)

Man, I gotta read the whole thread before I open my gaping maw.

-- Lon


From katriel at penguin-it.co.il  Wed Oct 18 21:01:46 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Wed, 18 Oct 2006 23:01:46 +0200
Subject: [Linux-cluster] cman & qdiskd
In-Reply-To: <1161204961.6685.87.camel@rei.boston.devel.redhat.com>
References: <45351CDE.3090502@penguin-it.co.il>	<20061017182510.GD10076@korben.rdu.redhat.com>	<45352433.3080603@penguin-it.co.il>	<20061017184932.GE10076@korben.rdu.redhat.com>	<45354079.8080709@penguin-it.co.il>
	<1161204961.6685.87.camel@rei.boston.devel.redhat.com>
Message-ID: <4536963A.4040800@penguin-it.co.il>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

I was gonna say something, but figured you'll notice eventually :-)

+Katriel

Lon Hohberger wrote:
> On Tue, 2006-10-17 at 22:43 +0200, Katriel Traum wrote:
>> Okay. My bad.
>> Using older cman/dlm kernel modules with newer cman/qdisk binaries
>> works, but I guess the older kernel module doesn't know how to accept a
>> "quorum  node".
>> After using the same version all around, it works.
> 
> Oh ;)
> 
> Man, I gotta read the whole thread before I open my gaping maw.
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

- --
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFFNpY6DWy+Hv/461sRAqvpAKCGzujH5KVkacUxqOdMx7u+Z86czQCaA1CZ
IqDVJRochNX17JKfwRZLY6E=
=mwRS
-----END PGP SIGNATURE-----


From isplist at logicore.net  Wed Oct 18 21:02:50 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 18 Oct 2006 16:02:50 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <4535D755.2000105@redhat.com>
Message-ID: <2006101816250.293715@leena>

Anyone else have any thoughts??? I'd love to get this working :).

Mike


From katriel at penguin-it.co.il  Wed Oct 18 21:07:50 2006
From: katriel at penguin-it.co.il (Katriel Traum)
Date: Wed, 18 Oct 2006 23:07:50 +0200
Subject: [Linux-cluster] node fails to stop when inquorate
In-Reply-To: <1161202806.6685.69.camel@rei.boston.devel.redhat.com>
References: <453682A5.9070008@penguin-it.co.il>
	<1161202806.6685.69.camel@rei.boston.devel.redhat.com>
Message-ID: <453697A6.2020103@penguin-it.co.il>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Thanks,
I'll look into it.
I don't think qdiskd rebootign is a good solution for this scenario.
Are there any cases in which cman reboots a machine? maybe this should
be configurable (not only when qdiskd tells it to reboot)

Thanks,
+Katriel

Lon Hohberger wrote:
> On Wed, 2006-10-18 at 21:38 +0200, Katriel Traum wrote:
> 
>> The (ugly) workaround I've been using is killing the process manually
>> and then manually removing /var/lock/subsys/rgmanager, which causes "rc"
>> to skip it.
> 
>> Is there a better way to restart a failed node? Shouldn't a failed node
>> be "hard booted" by cman?
> 
> Nodes don't "know" they're fenced with fabric-level fencing; it's a
> deficiency in the model itself.
> 
> The easiest thing to do is 'reboot -fn'.  A fenced node may have
> outstanding buffers which never get cleaned up - so you can't "un-fence"
> them until they have been rebooted anyway.
> 
> Rgmanager's child processes are probably trying to umount the a file
> system that has been fenced and are stuck in disk-wait - which may be
> "forever", depending on the storage configuration.
> 
> There's an patch outstanding for qdiskd which makes it reboot on loss of
> score, which triggers a reboot.  However, I don't think this is your
> problem.
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

- --
Katriel Traum, PenguinIT
RHCE, CLP
Mobile: 054-6789953
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (GNU/Linux)
Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org

iD8DBQFFNpefDWy+Hv/461sRAlwZAKCGMPfGwsFmsAd09Z0Z3Y3vxmudwQCfd+09
2oGyyKMkxpPV6SSQUH8J4jk=
=rrou
-----END PGP SIGNATURE-----


From rpeterso at redhat.com  Wed Oct 18 21:13:00 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Wed, 18 Oct 2006 16:13:00 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <2006101814120.147149@leena>
References: <2006101814120.147149@leena>
Message-ID: <453698DC.4000604@redhat.com>

isplist at logicore.net wrote:
> Interesting...
>
> Fired up 3 nodes... two ended up waiting for cluster quorum at the fence_tool 
> startup, one failed as before. Unblocked the quorum and still, only one node 
> is now in the cluster.
>
> Log on one node that has the same problem;
>
> # tail /var/log/messages
> Oct 18 14:00:09 cweb93 kernel: CMAN: Waiting to join or form a Linux-cluster
> Oct 18 14:00:10 cweb93 kernel: CMAN: sending membership request
> Oct 18 14:00:10 cweb93 kernel: CMAN: Cluster membership rejected
> Oct 18 14:00:10 cweb93 kernel: CMAN: Waiting to join or form a Linux-cluster
> Oct 18 14:00:10 cweb93 ccsd[2411]: Connected to cluster infrastruture via: 
> CMAN/               SM Plugin v1.1.7.1
> Oct 18 14:00:10 cweb93 ccsd[2411]: Initial status:: Inquorate
> Oct 18 14:00:14 cweb93 kernel: CMAN: sending membership request
> Oct 18 14:00:14 cweb93 kernel: CMAN: Cluster membership rejected
> Oct 18 14:00:14 cweb93 ccsd[2411]: Cluster manager shutdown.  Attemping to 
> recon               nect...
> Oct 18 14:00:15 cweb93 clvmd: Can't open cluster manager socket: Network is 
> down
>   
Hi Mike,

I recommend checking if you have a firewall that's blocking traffic on 
any or all of the nodes.
Perhaps you can temporarily "service iptables stop" and see if it 
connects that way.
If that's your problem, here is a link to which ports to enable:

http://sources.redhat.com/cluster/faq.html#iptables

If that's not your problem, please let everyone know what version of the 
cluster
software you're running.  There are vast differences between the stable 
code
(such as STABLE in cvs and RHEL4, etc) and the development code (HEAD in 
cvs).

Regards,

Bob Peterson
Red Hat Cluster Suite


From isplist at logicore.net  Wed Oct 18 21:20:54 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 18 Oct 2006 16:20:54 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <453698DC.4000604@redhat.com>
Message-ID: <20061018162054.801458@leena>

Thanks for the reply :).

> I recommend checking if you have a firewall that's blocking traffic on
> any or all of the nodes.

Nothing at the firewall, everything is before that and was working before I 
added the McData switch.

> If that's not your problem, please let everyone know what version of the
> cluster software you're running.  There are vast differences between the 
> stable code (such as STABLE in cvs and RHEL4, etc) and the development code 
> (HEAD in cvs).

 	GFS 6.1.6-1.XOS.1 	System Environment/Kernel 	GFS - The Global File System
	GFS-kernel 2.6.9-49.1.5.XOS.1 	System Environment/Kernel 	GFS-kernel - The 
Global File System kernel modules
	GFS-kernel 2.6.9-35.5.XOS.1 	System Environment/Kernel 	GFS-kernel - The 
Global File System kernel modules
	GFS-kernel 2.6.9-58.3.XOS.1 	System Environment/Kernel 	GFS-kernel - The 
Global File System kernel modules
	gnbd 1.0.7-0.XOS.1 	System Environment/Kernel 	gnbd - GFS's Network Block 
Device
	gnbd-kernel 2.6.9-9.31.2.XOS.1 	System Environment/Kernel 	gnbd-kernel - The 
kernel module for GFS's Network Block Device
	gnbd-kernel 2.6.9-9.44.XOS.1 	System Environment/Kernel 	gnbd-kernel - The 
kernel module for GFS's Network Block Device
	gnbd-kernel 2.6.9-8.27.XOS.1 	System Environment/Kernel 	gnbd-kernel - The 
kernel module for GFS's Network Block Device
	gulm 1.0.7-0.XOS.1 	System Environment/Daemons 	gulm - One possible lock 
manager for GFS
ccs 1.0.7-0.XOS.1 	System Environment/Base 	CCS - The Cluster Configuration 
System
	cman 1.0.11-0.XOS.1 	System Environment/Base 	cman - The Cluster Manager
	cman-kernel 2.6.9-36.0.XOS.1 	System Environment/Kernel 	cman-kernel - The 
Cluster Manager kernel modules
	cman-kernel 2.6.9-43.8.5.XOS.1 	System Environment/Kernel 	cman-kernel - The 
Cluster Manager kernel modules
	cman-kernel 2.6.9-45.5.XOS.1 	System Environment/Kernel 	cman-kernel - The 
Cluster Manager kernel modules
	fence 1.32.25-1.XOS.1 	System Environment/Base 	fence - The cluster I/O 
fencing system
	lvm2-cluster 2.02.06-7.0.RHEL4.XOS.1 	System Environment/Base 	Cluster 
extenstions for userland logical volume management tools
	magma 1.0.6-0.XOS.1 	System Environment/Libraries 	A cluster/lock manager API 
abstraction library
	magma-plugins 1.0.9-0.XOS.1 	System Environment/Libraries 	Cluster manager 
plugins for magma
	piranha 0.8.2-1.XOS.1 	System Environment/Base 	Cluster administation tools
	system-config-cluster 1.0.25-1.0.XOS.1 	Applications/System 	
system-config-cluster is a utility which allows you to manage cluster 
configuration in a graphical setting.

Am I missing anything?


From pcaulfie at redhat.com  Thu Oct 19 07:49:07 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 19 Oct 2006 08:49:07 +0100
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <2006101814120.147149@leena>
References: <2006101814120.147149@leena>
Message-ID: <45372DF3.3040405@redhat.com>

isplist at logicore.net wrote:
> Interesting...
> 
> Fired up 3 nodes... two ended up waiting for cluster quorum at the fence_tool 
> startup, one failed as before. Unblocked the quorum and still, only one node 
> is now in the cluster.
> 
> Log on one node that has the same problem;
> 
> # tail /var/log/messages
> Oct 18 14:00:09 cweb93 kernel: CMAN: Waiting to join or form a Linux-cluster
> Oct 18 14:00:10 cweb93 kernel: CMAN: sending membership request
> Oct 18 14:00:10 cweb93 kernel: CMAN: Cluster membership rejected
> Oct 18 14:00:10 cweb93 kernel: CMAN: Waiting to join or form a Linux-cluster
> Oct 18 14:00:10 cweb93 ccsd[2411]: Connected to cluster infrastruture via: 
> CMAN/               SM Plugin v1.1.7.1
> Oct 18 14:00:10 cweb93 ccsd[2411]: Initial status:: Inquorate
> Oct 18 14:00:14 cweb93 kernel: CMAN: sending membership request
> Oct 18 14:00:14 cweb93 kernel: CMAN: Cluster membership rejected

Some other node is rejecting the cluster membership. You need to find that
node and look at the message it has put into syslog.

-- 

patrick


From dicky_nnc at yahoo.com.hk  Thu Oct 19 15:31:21 2006
From: dicky_nnc at yahoo.com.hk (Dicky)
Date: Thu, 19 Oct 2006 23:31:21 +0800
Subject: [Linux-cluster] Cluster Suite 4 failover problem
Message-ID: <45379A49.8060700@yahoo.com.hk>

Hi All,

I have two machines (named node1 -->192.168.0.27 and node2 
-->192.168.0.28) installed Red Hat Cluster Suite 4 with DLM with 1 NIC 
for each machine. I have created a manual fence, a failover domain, two 
services (1st service is "www - listening address is 192.168.0.111" , 
2nd service is "ftp - listening address is 192.168.0.112).

After having the initital setup, everything seems working fine, i can 
relocate the service from node1 to node 2 or vice versa manually, stop 
and start the services.

But when i tried to test the failover capibility, i.e. shutdown the 
network service in one node e.g. shutdown the  eth0 of node1, the failed 
service won't work in most time, following was the scenarios i tested:

Scenario: Running services running in node1, then i shutdown the eth0 of 
node1

Result: Services not failover to node2, and the clustat in node1 shows that:

Member Status: Quorate

  Member Name                      Status
  ------ ----                              ------
  node1                                    Offline
  node2                                    Online, Local, rgmanager

  Service Name     Owner (Last)                   State
  ------- ----         ----- ------                       -----
  ftp                       unkonwn                          started
  www                   unkonwn                          started

Both services were no longer working. when i restarted the eth0 in 
node1, restarted the cman service in node1, it still didn't work. Also, 
when i tried to restart the rgmanager in node1, it only showed that 
"Waiting for services to stop: " and wating forever. Even i tried to 
kill the process of the rgmanager, it didn't work. Finally, i  have to 
reset both machines to get the cluster service back to normal.

I would appreciate if anyone could help or anyone can share if they also 
got such experience before.
I also attached the cluster.conf below for any reference.

======cluster.conf=========
<?xml version="1.0"?>
<cluster config_version="34" name="alpha_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="node1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="Fence" 
nodename="node1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="Fence" 
nodename="node2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_manual" name="Fence"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="aaa" ordered="0" 
restricted="0">
                                <failoverdomainnode name="node1" 
priority="1"/>
                                <failoverdomainnode name="node2" 
priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="192.168.0.111" monitor_link="0"/>
                        <script file="/etc/rc.d/init.d/httpd" name="www"/>
                        <script file="/etc/rc.d/init.d/vsftpd" name="ftp"/>
                        <ip address="192.168.0.112" monitor_link="0"/>
                </resources>
                <service autostart="1" domain="aaa" name="ftp" 
recovery="relocate">
                        <ip ref="192.168.0.112"/>
                        <script ref="ftp"/>
                </service>
                <service autostart="1" domain="aaa" name="www" 
recovery="relocate">
                        <ip ref="192.168.0.111"/>
                        <script ref="www"/>
                </service>
        </rm>
</cluster>
==========END==========

Many Thanks,
Dicky


From jon.daniels at voxsurf.com  Thu Oct 19 15:45:44 2006
From: jon.daniels at voxsurf.com (Jonathan Daniels)
Date: Thu, 19 Oct 2006 16:45:44 +0100
Subject: [Linux-cluster] Cluster Suite 4 failover problem
In-Reply-To: <45379A49.8060700@yahoo.com.hk>
References: <45379A49.8060700@yahoo.com.hk>
Message-ID: <45379DA8.9000103@voxsurf.com>

Hi,

What is output to the "/var/log/messages" files of each node? That 
should provide a clue as to what the problem is.  Also, did you install 
the 'fence' RPM and any Clustered LVM / GFS RPMs?

You also might consider rebooting the "downed" node - this function is 
generally taken care of by fencing devices automatically and, as I 
understand it, "manual fencing" means you gotta reboot :), the 
assumption being that a failed node won't be allowed back in the cluster 
until it's restarted.

Thanks,
Jon


Dicky wrote:

> Hi All,
>
> I have two machines (named node1 -->192.168.0.27 and node2 
> -->192.168.0.28) installed Red Hat Cluster Suite 4 with DLM with 1 NIC 
> for each machine. I have created a manual fence, a failover domain, 
> two services (1st service is "www - listening address is 
> 192.168.0.111" , 2nd service is "ftp - listening address is 
> 192.168.0.112).
>
> After having the initital setup, everything seems working fine, i can 
> relocate the service from node1 to node 2 or vice versa manually, stop 
> and start the services.
>
> But when i tried to test the failover capibility, i.e. shutdown the 
> network service in one node e.g. shutdown the  eth0 of node1, the 
> failed service won't work in most time, following was the scenarios i 
> tested:
>
> Scenario: Running services running in node1, then i shutdown the eth0 
> of node1
>
> Result: Services not failover to node2, and the clustat in node1 shows 
> that:
>
> Member Status: Quorate
>
>  Member Name                      Status
>  ------ ----                              ------
>  node1                                    Offline
>  node2                                    Online, Local, rgmanager
>
>  Service Name     Owner (Last)                   State
>  ------- ----         ----- ------                       -----
>  ftp                       unkonwn                          started
>  www                   unkonwn                          started
>
> Both services were no longer working. when i restarted the eth0 in 
> node1, restarted the cman service in node1, it still didn't work. 
> Also, when i tried to restart the rgmanager in node1, it only showed 
> that "Waiting for services to stop: " and wating forever. Even i tried 
> to kill the process of the rgmanager, it didn't work. Finally, i  have 
> to reset both machines to get the cluster service back to normal.
>
> I would appreciate if anyone could help or anyone can share if they 
> also got such experience before.
> I also attached the cluster.conf below for any reference.
>
> ======cluster.conf=========
> <?xml version="1.0"?>
> <cluster config_version="34" name="alpha_cluster">
>        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
>        <clusternodes>
>                <clusternode name="node1" votes="1">
>                        <fence>
>                                <method name="1">
>                                        <device name="Fence" 
> nodename="node1"/>
>                                </method>
>                        </fence>
>                </clusternode>
>                <clusternode name="node2" votes="1">
>                        <fence>
>                                <method name="1">
>                                        <device name="Fence" 
> nodename="node2"/>
>                                </method>
>                        </fence>
>                </clusternode>
>        </clusternodes>
>        <cman expected_votes="1" two_node="1"/>
>        <fencedevices>
>                <fencedevice agent="fence_manual" name="Fence"/>
>        </fencedevices>
>        <rm>
>                <failoverdomains>
>                        <failoverdomain name="aaa" ordered="0" 
> restricted="0">
>                                <failoverdomainnode name="node1" 
> priority="1"/>
>                                <failoverdomainnode name="node2" 
> priority="1"/>
>                        </failoverdomain>
>                </failoverdomains>
>                <resources>
>                        <ip address="192.168.0.111" monitor_link="0"/>
>                        <script file="/etc/rc.d/init.d/httpd" name="www"/>
>                        <script file="/etc/rc.d/init.d/vsftpd" 
> name="ftp"/>
>                        <ip address="192.168.0.112" monitor_link="0"/>
>                </resources>
>                <service autostart="1" domain="aaa" name="ftp" 
> recovery="relocate">
>                        <ip ref="192.168.0.112"/>
>                        <script ref="ftp"/>
>                </service>
>                <service autostart="1" domain="aaa" name="www" 
> recovery="relocate">
>                        <ip ref="192.168.0.111"/>
>                        <script ref="www"/>
>                </service>
>        </rm>
> </cluster>
> ==========END==========
>
> Many Thanks,
> Dicky
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Thu Oct 19 16:32:30 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 19 Oct 2006 11:32:30 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <4537961F.7080607@redhat.com>
Message-ID: <20061019113230.691224@leena>

I'm not seeing any of these...


Oct 19 11:22:07 dev ccsd[4478]: Initial status:: Inquorate
Oct 19 11:22:11 dev kernel: CMAN: sending membership request
Oct 19 11:22:11 dev kernel: CMAN: Cluster membership rejected
Oct 19 11:22:11 dev ccsd[4478]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 19 11:22:11 dev kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 19 11:22:11 dev kernel: CMAN: sending membership request
Oct 19 11:22:11 dev kernel: CMAN: Cluster membership rejected
Oct 19 11:22:12 dev clvmd: Can't open cluster manager socket: Network is down
Oct 19 11:22:16 dev ccsd[4478]: Unable to connect to cluster infrastructure 
after 30 seconds.
Oct 19 11:22:46 dev ccsd[4478]: Unable to connect to cluster infrastructure 
after 60 seconds.
[root at dev new]#

I tried starting on two nodes. The third, which has the same cluster.conf file 
is now waiting for fence_tool cluster quorum.  This node however is now in the 
cluster but the others don't see it or join.

I'm also getting this lockfile error;

Failed to create lockfile.
Hint: ccsd is already running.
Timed-out waiting for cluster
fence_tool: cluster is not active
clvmd could not connect to cluster manager
Consult syslog for more information
  No volume groups found
mount: special device /dev/vgcomp/rimfire does not exist

(I'm aware of the missing volume, working on the cluster first).

Mike


> There are 5 reasons you can get that message. The rejecting node will say
> exactly what the problem is. It'll be one of:
> 
> - mismatching cluster.conf version numbers
> - mismatch cluster names
> - mismatch cluster number (a hash of the name)
> - node has the wrong node ID (ie it joined with the same
> name and a different node ID or vice versa)
> - protocol version differs (or other software mismatch - there are several
> error messages for these but they boil down to the same thing)
> 
> patrick


From isplist at logicore.net  Thu Oct 19 17:01:41 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 19 Oct 2006 12:01:41 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <4537961F.7080607@redhat.com>
Message-ID: <2006101912141.321604@leena>

Here's a new log after a fresh reboot of the node with matching cluster.conf, 
same as the others;

What's the evil warning about?

Oct 19 11:58:42 cweb92 rgmanager: clurgmgrd startup succeeded
Oct 19 11:58:42 cweb92 ccsd[2318]: Cluster is not quorate.  Refusing 
connection.
Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing connect: Connection 
refused
Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-111).
Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something evil.
Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing get: Invalid request 
descriptor
Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-111).
Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something evil.
Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing get: Invalid request 
descriptor
Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-21).
Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something evil.
Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing disconnect: Invalid 
request descriptor
Oct 19 11:58:42 cweb92 clurgmgrd[2527]: <notice> Resource Group Manager 
Starting
Oct 19 11:58:43 cweb92 clurgmgrd[2527]: <info> Loading Service Data
Oct 19 11:58:45 cweb92 ccsd[2318]: Cluster is not quorate.  Refusing 
connection.
Oct 19 11:58:45 cweb92 ccsd[2318]: Error while processing connect: Connection 
refused
Oct 19 11:58:45 cweb92 clurgmgrd[2527]: <crit> #5: Couldn't connect to ccsd!
Oct 19 11:58:45 cweb92 clurgmgrd[2527]: <crit> #8: Couldn't initialize 
services
Oct 19 11:58:47 cweb92 rc: Starting webmin:  succeeded
Oct 19 11:59:11 cweb92 ccsd[2318]: Unable to connect to cluster infrastructure 
after 60 seconds.
Oct 19 11:59:41 cweb92 ccsd[2318]: Unable to connect to cluster infrastructure 
after 90 seconds.


From dbrieck at gmail.com  Thu Oct 19 18:05:25 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Thu, 19 Oct 2006 14:05:25 -0400
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <2006101912141.321604@leena>
References: <4537961F.7080607@redhat.com> <2006101912141.321604@leena>
Message-ID: <8c1094290610191105j68dd46a2k69c47c7f5408c281@mail.gmail.com>

On 10/19/06, isplist at logicore.net <isplist at logicore.net> wrote:
> Here's a new log after a fresh reboot of the node with matching cluster.conf,
> same as the others;
>
> What's the evil warning about?
>
> Oct 19 11:58:42 cweb92 rgmanager: clurgmgrd startup succeeded
> Oct 19 11:58:42 cweb92 ccsd[2318]: Cluster is not quorate.  Refusing
> connection.
> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing connect: Connection
> refused
> Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-111).
> Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something evil.
> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing get: Invalid request
> descriptor
> Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-111).
> Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something evil.
> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing get: Invalid request
> descriptor
> Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-21).
> Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something evil.
> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing disconnect: Invalid
> request descriptor
> Oct 19 11:58:42 cweb92 clurgmgrd[2527]: <notice> Resource Group Manager
> Starting
> Oct 19 11:58:43 cweb92 clurgmgrd[2527]: <info> Loading Service Data
> Oct 19 11:58:45 cweb92 ccsd[2318]: Cluster is not quorate.  Refusing
> connection.
> Oct 19 11:58:45 cweb92 ccsd[2318]: Error while processing connect: Connection
> refused
> Oct 19 11:58:45 cweb92 clurgmgrd[2527]: <crit> #5: Couldn't connect to ccsd!
> Oct 19 11:58:45 cweb92 clurgmgrd[2527]: <crit> #8: Couldn't initialize
> services
> Oct 19 11:58:47 cweb92 rc: Starting webmin:  succeeded
> Oct 19 11:59:11 cweb92 ccsd[2318]: Unable to connect to cluster infrastructure
> after 60 seconds.
> Oct 19 11:59:41 cweb92 ccsd[2318]: Unable to connect to cluster infrastructure
> after 90 seconds.
>
>

Something that happened to me might apply: If you are using GFS and
DLM, you can't start your fence domain if one of the nodes is trying
to fence another node and it's failing. This has happened to me when I
didn't specify a fence device for one of my nodes. If you the fencing
is a goof, you can try to increase the post join delay to keep new
nodes that are entering the cluster from being fenced before they can
join.

I would suggest that instead of the init scripts just run the command
by hand and increase the verbosity. That will probably shed much more
light on things than the logs.


From jstoner at opsource.net  Thu Oct 19 18:18:56 2006
From: jstoner at opsource.net (Jeff Stoner)
Date: Thu, 19 Oct 2006 19:18:56 +0100
Subject: [Linux-cluster] Cluster Suite 4 failover problem
Message-ID: <38A48FA2F0103444906AD22E14F1B5A3045A7638@mailxchg01.corp.opsource.net>

> -----Original Message-----
> Scenario: Running services running in node1, then i shutdown 
> the eth0 of
> node1
> 
> Result: Services not failover to node2, and the clustat in 
> node1 shows that:
> 
> Member Status: Quorate
> 
>   Member Name                      Status
>   ------ ----                              ------
>   node1                                    Offline
>   node2                                    Online, Local, rgmanager
> 
>   Service Name     Owner (Last)                   State
>   ------- ----         ----- ------                       -----
>   ftp                       unkonwn                          started
>   www                   unkonwn                          started
> 
>        <ip address="192.168.0.111" monitor_link="0"/>
>        <ip address="192.168.0.112" monitor_link="0"/>

Some things to be aware of:

- cluster checks the status of "ftp" and "www" services using the
scripts. If the scripts say the service is running, cluster considers
the service as available. Most service scripts simply look for the
process name or PID to determine if a service is running.

- you have monitor_link turned off on your IP addresses - this means
when you down your eth0 interface, cluster manager isn't going to notice
that those IP addresses aren't working.

With manual fencing, the cluster is going to sit there and wait until
you manually reboot that failed node.


--Jeff
SME - UNIX
OpSource Inc.

PGP Key ID 0x6CB364CA 


From lhh at redhat.com  Thu Oct 19 19:09:10 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 19 Oct 2006 15:09:10 -0400
Subject: [Linux-cluster] node fails to stop when inquorate
In-Reply-To: <453697A6.2020103@penguin-it.co.il>
References: <453682A5.9070008@penguin-it.co.il>
	<1161202806.6685.69.camel@rei.boston.devel.redhat.com>
	<453697A6.2020103@penguin-it.co.il>
Message-ID: <1161284951.6685.93.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-18 at 23:07 +0200, Katriel Traum wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Thanks,
> I'll look into it.
> I don't think qdiskd rebootign is a good solution for this scenario.
> Are there any cases in which cman reboots a machine? maybe this should
> be configurable (not only when qdiskd tells it to reboot)

Qdiskd tells CMAN that the quorum device is gone (or CMAN notices that
quorum device is gone if, say, qdiskd hangs on I/O for a long time).
CMAN currently does not do much with the information, except block all
services locally.  I don't think CMAN itself ever reboots the machine
due to loss of the quorum device..

-- Lon


From lhh at redhat.com  Thu Oct 19 19:14:32 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 19 Oct 2006 15:14:32 -0400
Subject: [Linux-cluster] Cluster Suite 4 failover problem
In-Reply-To: <45379A49.8060700@yahoo.com.hk>
References: <45379A49.8060700@yahoo.com.hk>
Message-ID: <1161285273.6685.98.camel@rei.boston.devel.redhat.com>

On Thu, 2006-10-19 at 23:31 +0800, Dicky wrote:

> Both services were no longer working. when i restarted the eth0 in 
> node1, restarted the cman service in node1, it still didn't work. Also, 
> when i tried to restart the rgmanager in node1, it only showed that 
> "Waiting for services to stop: " and wating forever. Even i tried to 
> kill the process of the rgmanager, it didn't work. Finally, i  have to 
> reset both machines to get the cluster service back to normal.

Sounds like 'fencing' isn't working.  After node2 decides node1 is dead,
you have to power off node1, then run "fence_ack_manual" on node2.  That
should let things fail over.

It looks like there's a typo in clustat, too, but I don't think that's
related :)


> ======cluster.conf=========
>                         <failoverdomain name="aaa" ordered="0" 
> restricted="0">
>                                 <failoverdomainnode name="node1" 
> priority="1"/>
>                                 <failoverdomainnode name="node2" 
> priority="1"/>
>                         </failoverdomain>

FYI, you don't need to define a failover domain if all nodes in the
cluster are equal.

-- Lon


From jason at monsterjam.org  Fri Oct 20 00:40:30 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Thu, 19 Oct 2006 20:40:30 -0400
Subject: [Linux-cluster] virtual address went down? (with panic link)
Message-ID: <20061020004030.GA1723@monsterjam.org>

I think the mailing list doesnt like attachments, so heres a link to the panic that was supposed 
to go along with this post.
http://monsterjam.org/crash/panic.jpg

I tried stopping the services on 
the first box of my 2 node cluster:
service rgmanager stop
service gfs stop
service clvmd stop
service fenced stop
service cman stop
service ccsd stop

everything came down fine.
then I started em back up..
service ccsd start
this seemed to hang for about 2 minutes, then I got a panic..
as shown in the linked above  graphic..

this is on  2.6.9-34.ELsmp redhat  Enterprise Linux AS release 4 (Nahant 
Update 4)
running ccs-1.0.3-0,
cman-kernel-hugemem-2.6.9-43.8
cman-kernel-2.6.9-43.8
cman-1.0.4-0
cman-kernel-smp-2.6.9-43.8
cman-kernheaders-2.6.9-43.8

 built from sources..

heres my cluster.conf

<?xml version="1.0"?>
<cluster config_version="22" name="progressive">
        <fence_daemon clean_start="0" post_fail_delay="0" 
post_join_delay="3"/>
        <clusternodes>
                <clusternode name="tf1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="1" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="2" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="1" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="2" switch="1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="tf2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="3" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        off" 
port="4" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="3" switch="1"/>
                                        <device name="apc_power_switch" 
option="                                                        on" 
port="4" switch="1"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_apc" ipaddr="192.168.1.8" 
login="xxx"                                                         
name="apc_power_switch" passwd="xxx"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="httpd" ordered="1" 
restricted="1">
                                <failoverdomainnode name="tf1" 
priority="1"/>
                                <failoverdomainnode name="tf2" 
priority="2"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <script file="/etc/init.d/httpd" 
name="cluster_apache"/>
                        <fs device="/dev/mapper/diskarray-lv1" 
fstype="ext3" mou                                                        
ntpoint="/mnt/gfs/htdocs" name="apache_content"/>
                        <ip address="192.168.1.7" monitor_link="1"/>
                </resources>
                <service autostart="1" domain="httpd" name="Apache 
Service">
                        <ip ref="192.168.1.7"/>
                        <script ref="cluster_apache"/>
                        <fs ref="apache_content"/>
                </service>
        </rm>
</cluster>

ooh and shortly after the first box came back up, the second one got 
rebooted automagically (power fenced from the first one im guessing) for 
good measure.

any help appreciated 


Jason


On Tue, Oct 17, 2006 at 09:37:15PM -0400, jason at monsterjam.org wrote:
> so Ive had a test cluster running for quite a while now, both nodes of a 2 node cluster are up, 
> but the virtual address seems to have disappeared.. its not pingable, neither server has it 
> configured anymore.. The only application I had using the virtual address was apache (just for 
> testing it). what logs/information should I be looking at to see what happened and why?
> 
> regards,
> Jason
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Fri Oct 20 01:42:07 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 19 Oct 2006 20:42:07 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <8c1094290610191105j68dd46a2k69c47c7f5408c281@mail.gmail.com>
Message-ID: <2006101920427.220327@leena>

> DLM, you can't start your fence domain if one of the nodes is trying
> to fence another node and it's failing. This has happened to me when I

This does not seem to be the case as one node finally started but none of the 
others will.

I keep getting a lockfile error on any of the nodes now? Below is some logging 
also which talks about 'evil' :). I'm guessing I got something stuck on all 
the nodes which I need to remove to get things working again?

Mike


>> Oct 19 11:58:42 cweb92 rgmanager: clurgmgrd startup succeeded
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Cluster is not quorate.  Refusing
>> connection.
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing connect:
>> Connection
>> refused
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-111).
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something
>> evil.
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing get: Invalid
>> request
>> descriptor
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-111).
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something
>> evil.
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing get: Invalid
>> request
>> descriptor
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Invalid descriptor specified (-21).
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Someone may be attempting something
>> evil.
>> Oct 19 11:58:42 cweb92 ccsd[2318]: Error while processing disconnect:
>> Invalid
>> request descriptor
>> Oct 19 11:58:42 cweb92 clurgmgrd[2527]: <notice> Resource Group Manager
>> Starting
>> Oct 19 11:58:43 cweb92 clurgmgrd[2527]: <info> Loading Service Data
>> Oct 19 11:58:45 cweb92 ccsd[2318]: Cluster is not quorate.  Refusing
>> connection.
>> Oct 19 11:58:45 cweb92 ccsd[2318]: Error while processing connect:
>> Connection
>> refused
>> Oct 19 11:58:45 cweb92 clurgmgrd[2527]: <crit> #5: Couldn't connect to
>> ccsd!
>> Oct 19 11:58:45 cweb92 clurgmgrd[2527]: <crit> #8: Couldn't initialize
>> services
>> Oct 19 11:58:47 cweb92 rc: Starting webmin:  succeeded
>> Oct 19 11:59:11 cweb92 ccsd[2318]: Unable to connect to cluster
>> infrastructure
>> after 60 seconds.
>> Oct 19 11:59:41 cweb92 ccsd[2318]: Unable to connect to cluster
>> infrastructure
>> after 90 seconds.
>> 
>> 


From dicky_nnc at yahoo.com.hk  Fri Oct 20 02:10:47 2006
From: dicky_nnc at yahoo.com.hk (Dicky)
Date: Fri, 20 Oct 2006 10:10:47 +0800
Subject: [Linux-cluster] Re: Cluster Suite 4 failover problem
Message-ID: <45383027.9000800@yahoo.com.hk>

Hi,

Thx for the reply. :)

Yes, i have installed the 'fence' rpm, and others according to the Redhat Cluster Suite documenation's "RPM Selection Criteria: Red Hat Cluster Suite with DLM"
, following are the rpms i have installed:

=====RPM Installed=====

ccs, fence, gulm, iddev, magma, magma-plugins, perl-Net-Telnet,	system-config-cluster, ipvsadm,
piranha, ccs-devel, gulm-devel, iddev-devel, magma-devel,

====END=======

I didn't install GFS.

Here is the /var/log/messages output when i try to restart the rgmanager service from the failed node after i re-enable eth0:

===/var/log/messages ==
 
rgmanager: [1074]: <notice> Shutting down Cluster Service Manager...
clurgmgrd[31777]: <err> #50: Unable to obtain cluster lock: Connection timed out
clurgmgrd[31777]: <err> #50: Unable to obtain cluster lock: Connection timed out
clurgmgrd[31777]: <warning> #67: Shutting down uncleanly
clurgmgrd: [31777]: <info> Executing /etc/rc.d/init.d/vsftpd stop
clurgmgrd: [31777]: <info> Executing /etc/rc.d/init.d/httpd stop
vsftpd: vsftpd shutdown succeeded
clurgmgrd: [31777]: <info> Removing IPv4 address 192.168.0.112 from eth0
httpd: httpd shutdown succeeded
clurgmgrd: [31777]: <info> Removing IPv4 address 192.168.0.111 from eth0

=======END============

Then it hanged forver until i manually reset the machine.

I would like to know if the waiting is caused by this line :"
clurgmgrd[31777]: <err> #50: Unable to obtain cluster lock: Connection timed out
" ?? If so, why and how to solve it??

Also, i would like to know even i type " reboot" , it also hanged in this line: "Shutting down Cluster Service Manager...
Waiting for services to stop: " which caused me have press the reset button, which may caused the file system corrupted, so manually press the reset button is dangerous.
Is there anyway for me to shutdown the rgmanager properly?


Second question is, why the cluster didn't failover but the status showed that the services were "started" ??? Is there anything i missed in the configuration process??

Many thanks,
Dicky


> Hi,
> 
> What is output to the "/var/log/messages" files of
> each node? That 
> should provide a clue as to what the problem is. 
> Also, did you install 
> the 'fence' RPM and any Clustered LVM / GFS RPMs?
> 
> You also might consider rebooting the "downed" node
> - this function is 
> generally taken care of by fencing devices
> automatically and, as I 
> understand it, "manual fencing" means you gotta
> reboot :), the 
> assumption being that a failed node won't be allowed
> back in the cluster 
> until it's restarted.
> 
> Thanks,
> Jon


From ramon at vanalteren.nl  Fri Oct 20 07:40:40 2006
From: ramon at vanalteren.nl (Ramon van Alteren)
Date: Fri, 20 Oct 2006 09:40:40 +0200
Subject: [Linux-cluster] GFS filesystem "hang" with cluster-1.03.00
Message-ID: <45387D78.2060203@vanalteren.nl>

Hi List,

I'm hoping someone can provide me with pointers to solve the following 
problem:

I've setup a cluster of 5 nodes with cluster-1.03.00 compiled from 
source. The cluster works fine.
I can fence the nodes and all nodes see each other.

I've created a gfs filesystem on a coraid shared device using clvm

All nodes see the filesystem and see small changes to the filesystem.

Last night I started a stress-test with iozone on all nodes:

mkdir /mnt/$(hostname)
iozone -Rab /home/iozone-$(hostname)-test-${DATE}.xls -i0 -g16G -f 
/mnt/$(hostname)/iozone-test${DATE}

This test started at 4 AM and is still running on all nodes.

If I run the same test on a single node it produces a nice test-report 
indicating that we get an average write performance of 35MB/s.
This is within expectations of the hardware with the current setup.

Most operations on the gfs filesystem take long the first time, gfs_tool 
counters /mnt takes roughly a minute the first time, afterwards they 
react normal, within a second response. The same is true for operations 
like ls, df, etc.

I have no clue why concurrent writes hang and would appreciate any 
pointers on where to start looking.

kernel:
2.6.16-gentoo-r13
glibc:
glibc-2.3.6-r4
gcc:
gcc-3.4.6
Hardware:
x86_64 Intel(R) Xeon(R) CPU 5140  @ 2.33GHz

Thank you,

Ramon van Alteren


From dan.hawker at astrium.eads.net  Fri Oct 20 09:22:50 2006
From: dan.hawker at astrium.eads.net (HAWKER, Dan)
Date: Fri, 20 Oct 2006 10:22:50 +0100
Subject: [Linux-cluster] CVS Tarballs???
Message-ID: <7F6B06837A5DBD49AC6E1650EFF54906012230AF@auk52177.ukr.astrium.corp>


Hi All,

Are there any nightly (or particular milestones) tarballs of CVS & GIT
elements of the RHCS development tree available by any chance???

I'd like to do some preliminary testing for a small cluster we are planning
to setup, however my web access is behind a particularly nasty
proxy/firewall here at work that I have no control over. Hence CVS/GIT
access is not allowed :( Regular http seems (mostly) fine however.

TIA

Dan

--

Dan Hawker
Linux System Administrator
Astrium

-- 

This email (including any attachments) may contain confidential and/or privileged information or information otherwise protected from disclosure. If you are not the intended recipient, please notify the sender immediately, do not copy this message or any attachments and do not use it for any purpose or disclose its content to any person, but delete this message and any attachments from your system. Astrium disclaims any and all liability if this email transmission was virus corrupted, altered or falsified.
---------------------------------------------------------------------
Astrium Limited, Registered in England and Wales No. 2449259
Registered Office: Gunnels Wood Road, Stevenage, Hertfordshire, SG1 2AS, England


From ramon at vanalteren.nl  Fri Oct 20 10:48:29 2006
From: ramon at vanalteren.nl (Ramon van Alteren)
Date: Fri, 20 Oct 2006 12:48:29 +0200
Subject: [Linux-cluster] GFS filesystem "hang" with cluster-1.03.00
In-Reply-To: <45387D78.2060203@vanalteren.nl>
References: <45387D78.2060203@vanalteren.nl>
Message-ID: <4538A97D.4070700@vanalteren.nl>

Some additional info,

I ran gfs_tool counters <mountpoint> on all stuck nodes
They all seem to have a large amount of outstanding BIO calls.

Any way I can find out what is causing this ?
Any other information to look for ?

As far as I can see, there's no bottleneck at the shared coraid storage AoE.

gfs_tool counters output on: mrcluster1

                                   locks 3024
                              locks held 108
                           incore inodes 12
                        metadata buffers 166
                         unlinked inodes 0
                               quota IDs 0
                      incore log buffers 1
                          log space used 0.15%
               meta header cache entries 9998
                      glock dependencies 33
                  glocks on reclaim list 0
                               log wraps 8
                    outstanding LM calls 0
                   outstanding BIO calls 805
                        fh2dentry misses 0
                        glocks reclaimed 5992
                          glock nq calls 4784596
                          glock dq calls 4784236
                    glock prefetch calls 25
                           lm_lock calls 6341
                         lm_unlock calls 5269
                            lm callbacks 11907
                      address operations 254155990
                       dentry operations 1978
                       export operations 0
                         file operations 1457916
                        inode operations 5489
                        super operations 1577443
                           vm operations 0
                         block I/O reads 136200
                        block I/O writes 81780453
gfs_tool counters output on: mrcluster2

                                   locks 3020
                              locks held 107
                           incore inodes 10
                        metadata buffers 604
                         unlinked inodes 0
                               quota IDs 0
                      incore log buffers 1
                          log space used 0.15%
               meta header cache entries 10000
                      glock dependencies 34
                  glocks on reclaim list 0
                               log wraps 8
                    outstanding LM calls 0
                   outstanding BIO calls 490
                        fh2dentry misses 0
                        glocks reclaimed 3725
                          glock nq calls 4755003
                          glock dq calls 4754287
                    glock prefetch calls 12
                           lm_lock calls 4364
                         lm_unlock calls 3017
                            lm callbacks 8140
                      address operations 252523873
                       dentry operations 1957
                       export operations 0
                         file operations 1444785
                        inode operations 5425
                        super operations 1564779
                           vm operations 0
                         block I/O reads 135658
                        block I/O writes 81574696
gfs_tool counters output on: mrcluster3

                                   locks 3018
                              locks held 135
                           incore inodes 9
                        metadata buffers 1
                         unlinked inodes 0
                               quota IDs 0
                      incore log buffers 1
                          log space used 0.15%
               meta header cache entries 9997
                      glock dependencies 20
                  glocks on reclaim list 0
                               log wraps 25
                    outstanding LM calls 0
                   outstanding BIO calls 191
                        fh2dentry misses 0
                        glocks reclaimed 11097
                          glock nq calls 15308139
                          glock dq calls 15307573
                    glock prefetch calls 13
                           lm_lock calls 8734
                         lm_unlock calls 7813
                            lm callbacks 17167
                      address operations 941469125
                       dentry operations 5308
                       export operations 0
                         file operations 4730084
                        inode operations 17157
                        super operations 5170894
                           vm operations 0
                         block I/O reads 333851
                        block I/O writes 4449228
gfs_tool counters output on: mrcluster4

                                   locks 3017
                              locks held 206
                           incore inodes 7
                        metadata buffers 2945
                         unlinked inodes 0
                               quota IDs 2
                      incore log buffers 5
                          log space used 0.24%
               meta header cache entries 9343
                      glock dependencies 54
                  glocks on reclaim list 0
                               log wraps 2
                    outstanding LM calls 0
                   outstanding BIO calls 249
                        fh2dentry misses 0
                        glocks reclaimed 2075
                          glock nq calls 1485236
                          glock dq calls 1485023
                    glock prefetch calls 0
                           lm_lock calls 1967
                         lm_unlock calls 1326
                            lm callbacks 3657
                      address operations 66603382
                       dentry operations 1747
                       export operations 0
                         file operations 457700
                        inode operations 4821
                        super operations 484938
                           vm operations 0
                         block I/O reads 28969
                        block I/O writes 21973543

Grtz Ramon


From jwhiter at redhat.com  Fri Oct 20 12:41:18 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Fri, 20 Oct 2006 08:41:18 -0400
Subject: [Linux-cluster] GFS filesystem "hang" with cluster-1.03.00
In-Reply-To: <4538A97D.4070700@vanalteren.nl>
References: <45387D78.2060203@vanalteren.nl> <4538A97D.4070700@vanalteren.nl>
Message-ID: <20061020124117.GI20018@korben.rdu.redhat.com>

In your previous message you asked about the latency.  With gfs1, there is a
certain amount of latency involved with stat calls, so ls -al, du, df all take a
great deal of time comparitively.  With these calls first you have to traverse
the FS in order to cache all the inforamation about the files, so every lookup
requires a lock on each directory to the file, and then a lock on the file
itself inorder to read its information off of the disk.  Then thats just the
lookup, then we have to grab a shared lock again to get the stat information
from the file.  Each lock mind you requires exporting the lock to all of the
other nodes so they know about it and getting confirmation back on that lock.
So for every stat lookup you are looking at at the very least 2 seperate locks,
one for the lookup and then one for the stat.  Every subsequent call is faster
because the lookups no longer require the locks to lookup the file, as the inode
information is now cached, so we just need the lock for the file.

If gfs_tool counters is stuck, you'll want to get a couple instances of sysrq-t
from all nodes and see if you can see who is hanging, wether its in D state or
if the particular process isn't makeing progress.

Josef

On Fri, Oct 20, 2006 at 12:48:29PM +0200, Ramon van Alteren wrote:
> Some additional info,
> 
> I ran gfs_tool counters <mountpoint> on all stuck nodes
> They all seem to have a large amount of outstanding BIO calls.
> 
> Any way I can find out what is causing this ?
> Any other information to look for ?
> 
> As far as I can see, there's no bottleneck at the shared coraid storage AoE.
> 
> gfs_tool counters output on: mrcluster1
> 
>                                   locks 3024
>                              locks held 108
>                           incore inodes 12
>                        metadata buffers 166
>                         unlinked inodes 0
>                               quota IDs 0
>                      incore log buffers 1
>                          log space used 0.15%
>               meta header cache entries 9998
>                      glock dependencies 33
>                  glocks on reclaim list 0
>                               log wraps 8
>                    outstanding LM calls 0
>                   outstanding BIO calls 805
>                        fh2dentry misses 0
>                        glocks reclaimed 5992
>                          glock nq calls 4784596
>                          glock dq calls 4784236
>                    glock prefetch calls 25
>                           lm_lock calls 6341
>                         lm_unlock calls 5269
>                            lm callbacks 11907
>                      address operations 254155990
>                       dentry operations 1978
>                       export operations 0
>                         file operations 1457916
>                        inode operations 5489
>                        super operations 1577443
>                           vm operations 0
>                         block I/O reads 136200
>                        block I/O writes 81780453
> gfs_tool counters output on: mrcluster2
> 
>                                   locks 3020
>                              locks held 107
>                           incore inodes 10
>                        metadata buffers 604
>                         unlinked inodes 0
>                               quota IDs 0
>                      incore log buffers 1
>                          log space used 0.15%
>               meta header cache entries 10000
>                      glock dependencies 34
>                  glocks on reclaim list 0
>                               log wraps 8
>                    outstanding LM calls 0
>                   outstanding BIO calls 490
>                        fh2dentry misses 0
>                        glocks reclaimed 3725
>                          glock nq calls 4755003
>                          glock dq calls 4754287
>                    glock prefetch calls 12
>                           lm_lock calls 4364
>                         lm_unlock calls 3017
>                            lm callbacks 8140
>                      address operations 252523873
>                       dentry operations 1957
>                       export operations 0
>                         file operations 1444785
>                        inode operations 5425
>                        super operations 1564779
>                           vm operations 0
>                         block I/O reads 135658
>                        block I/O writes 81574696
> gfs_tool counters output on: mrcluster3
> 
>                                   locks 3018
>                              locks held 135
>                           incore inodes 9
>                        metadata buffers 1
>                         unlinked inodes 0
>                               quota IDs 0
>                      incore log buffers 1
>                          log space used 0.15%
>               meta header cache entries 9997
>                      glock dependencies 20
>                  glocks on reclaim list 0
>                               log wraps 25
>                    outstanding LM calls 0
>                   outstanding BIO calls 191
>                        fh2dentry misses 0
>                        glocks reclaimed 11097
>                          glock nq calls 15308139
>                          glock dq calls 15307573
>                    glock prefetch calls 13
>                           lm_lock calls 8734
>                         lm_unlock calls 7813
>                            lm callbacks 17167
>                      address operations 941469125
>                       dentry operations 5308
>                       export operations 0
>                         file operations 4730084
>                        inode operations 17157
>                        super operations 5170894
>                           vm operations 0
>                         block I/O reads 333851
>                        block I/O writes 4449228
> gfs_tool counters output on: mrcluster4
> 
>                                   locks 3017
>                              locks held 206
>                           incore inodes 7
>                        metadata buffers 2945
>                         unlinked inodes 0
>                               quota IDs 2
>                      incore log buffers 5
>                          log space used 0.24%
>               meta header cache entries 9343
>                      glock dependencies 54
>                  glocks on reclaim list 0
>                               log wraps 2
>                    outstanding LM calls 0
>                   outstanding BIO calls 249
>                        fh2dentry misses 0
>                        glocks reclaimed 2075
>                          glock nq calls 1485236
>                          glock dq calls 1485023
>                    glock prefetch calls 0
>                           lm_lock calls 1967
>                         lm_unlock calls 1326
>                            lm callbacks 3657
>                      address operations 66603382
>                       dentry operations 1747
>                       export operations 0
>                         file operations 457700
>                        inode operations 4821
>                        super operations 484938
>                           vm operations 0
>                         block I/O reads 28969
>                        block I/O writes 21973543
> 
> Grtz Ramon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From ramon at vanalteren.nl  Fri Oct 20 13:34:33 2006
From: ramon at vanalteren.nl (Ramon van Alteren)
Date: Fri, 20 Oct 2006 15:34:33 +0200
Subject: [Linux-cluster] GFS filesystem "hang" with cluster-1.03.00
In-Reply-To: <20061020124117.GI20018@korben.rdu.redhat.com>
References: <45387D78.2060203@vanalteren.nl> <4538A97D.4070700@vanalteren.nl>
	<20061020124117.GI20018@korben.rdu.redhat.com>
Message-ID: <4538D069.3050609@vanalteren.nl>

Hi Josef,

Josef Whiter wrote:
> In your previous message you asked about the latency.  With gfs1, there is a
> certain amount of latency involved with stat calls, so ls -al, du, df all take a
> great deal of time comparitively.  With these calls first you have to traverse
> the FS in order to cache all the inforamation about the files, so every lookup
> requires a lock on each directory to the file, and then a lock on the file
> itself inorder to read its information off of the disk.  Then thats just the
> lookup, then we have to grab a shared lock again to get the stat information
> from the file.  Each lock mind you requires exporting the lock to all of the
> other nodes so they know about it and getting confirmation back on that lock.
> So for every stat lookup you are looking at at the very least 2 seperate locks,
> one for the lookup and then one for the stat.  Every subsequent call is faster
> because the lookups no longer require the locks to lookup the file, as the inode
> information is now cached, so we just need the lock for the file.

Yes, this was the previous exchange. In the same exchange I was advised 
by Wendy Cheng to switch to iozone because it would avoid such multiple 
lock calls (less stat on files) instead of bonnie++ which we were 
previously testing with.

My test-run last night started at 4AM with 4 different iozone processes 
using a temp file in different directories on the same filesystem / 
logical volume.

AFAIK this would avoid the problem you mention above ?

All iozone processes were in D (uninteruptable sleep) by the time I woke 
up and had a look 8AM this morning.

I would expect gfs to deal with this gracefully and return a performance 
metric on multiple writes because:

* Not in same directory so no dir-lock to pass around
* Different files

> If gfs_tool counters is stuck, you'll want to get a couple instances of sysrq-t
> from all nodes and see if you can see who is hanging, wether its in D state or
> if the particular process isn't makeing progress.

I interrupted the processes by now and found one node that was hanging.
I'm still completely clueless as to what is causing this.

Any pointers and/or ideas on where to look, testcases to run or any info 
at all that might be helpfull in finding the cause of the problem would 
be much appreciated.

I'm also opening a similar case with the coraid support department to 
see if they have something to say about it.

Thanx,

Ramon


From jwhiter at redhat.com  Fri Oct 20 13:46:42 2006
From: jwhiter at redhat.com (Josef Whiter)
Date: Fri, 20 Oct 2006 09:46:42 -0400
Subject: [Linux-cluster] GFS filesystem "hang" with cluster-1.03.00
In-Reply-To: <4538D069.3050609@vanalteren.nl>
References: <45387D78.2060203@vanalteren.nl> <4538A97D.4070700@vanalteren.nl>
	<20061020124117.GI20018@korben.rdu.redhat.com>
	<4538D069.3050609@vanalteren.nl>
Message-ID: <20061020134641.GK20018@korben.rdu.redhat.com>

On Fri, Oct 20, 2006 at 03:34:33PM +0200, Ramon van Alteren wrote:
> Yes, this was the previous exchange. In the same exchange I was advised 
> by Wendy Cheng to switch to iozone because it would avoid such multiple 
> lock calls (less stat on files) instead of bonnie++ which we were 
> previously testing with.
>

Ahh I mised that, apologies.
 
> My test-run last night started at 4AM with 4 different iozone processes 
> using a temp file in different directories on the same filesystem / 
> logical volume.
> 
> AFAIK this would avoid the problem you mention above ?
> 
> All iozone processes were in D (uninteruptable sleep) by the time I woke 
> up and had a look 8AM this morning.
> 
> I would expect gfs to deal with this gracefully and return a performance 
> metric on multiple writes because:
> 
> * Not in same directory so no dir-lock to pass around
> * Different files
>

Ok if you are hitting this problem then the best thing to do is grab serveral
instances of sysrq-t from all nodes so we can see who/what is hanging and start
poking around.  Thank you,

Josef
 

From isplist at logicore.net  Fri Oct 20 13:59:44 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 20 Oct 2006 08:59:44 -0500
Subject: [Linux-cluster] Updating cluster.conf
Message-ID: <2006102085944.756502@leena>

Can't seem to find notes on how to update cluster.conf.

I keep updating the file but the old one keeps coming back. Since I'm not able 
to start the cluster, I cannot run the commands;

ccs_tool update /etc/cluster/cluster.conf
cman_tool version -r x

Are there any other lists for GFS? I've been trying to start this cluster for 
a week now and can't seem to get anywhere. Are these messages archived for 
searching somewhere?

I wish there was someone on this list who was using McData gear, that's 
another area I need help with, making sure my cluster.conf file is right.

Mike


From jon.daniels at voxsurf.com  Fri Oct 20 09:36:25 2006
From: jon.daniels at voxsurf.com (Jonathan Daniels)
Date: Fri, 20 Oct 2006 10:36:25 +0100
Subject: [Linux-cluster] Re: Cluster Suite 4 failover problem
In-Reply-To: <45383027.9000800@yahoo.com.hk>
References: <45383027.9000800@yahoo.com.hk>
Message-ID: <45389899.8010300@voxsurf.com>

I had a very similar set of problems just recently and found that 
uninstalling the "fence" RPM solved about 90% of them, incuding a 
hanging RGManager which required me to "switch-off-and-switch-on" the 
servers many times. I suspect that the problem was more to do with my 
unfamiliarity with fencing, but I wonder if there are some issues when 
running fencing and having no fence devices in use, and how the fenced 
daemon then interacts with RGManager. I do know that there are (fixed) 
similar lockup issues with RG Manager rgmanager-1.9.46-0, CMAN 
cman-kernel-2.6.9-43.8 and kernel 2.6.9-34 which disappear with an 
upgrade to kernel 2.6.9-34.0.1 and CMan cman-kernel-smp-2.6.9-43.8.3, 
but a new set of problems were introduced for me when I did that so I 
rolled back and uninstalled fenced, et viola!

I still find that on occasion I have to kill -9 the rgmanger process 
(sometimes I have to do it more than once) and I realise that an 
unfenced cluster is unsupported, but it solved the problems for me.

Hope this helps,
Jon


Dicky wrote:

> Hi,
>
> Thx for the reply. :)
>
> Yes, i have installed the 'fence' rpm, and others according to the 
> Redhat Cluster Suite documenation's "RPM Selection Criteria: Red Hat 
> Cluster Suite with DLM"
> , following are the rpms i have installed:
>
> =====RPM Installed=====
>
> ccs, fence, gulm, iddev, magma, magma-plugins, perl-Net-Telnet,    
> system-config-cluster, ipvsadm,
> piranha, ccs-devel, gulm-devel, iddev-devel, magma-devel,
>
> ====END=======
>
> I didn't install GFS.
>
> Here is the /var/log/messages output when i try to restart the 
> rgmanager service from the failed node after i re-enable eth0:
>
> ===/var/log/messages ==
>
> rgmanager: [1074]: <notice> Shutting down Cluster Service Manager...
> clurgmgrd[31777]: <err> #50: Unable to obtain cluster lock: Connection 
> timed out
> clurgmgrd[31777]: <err> #50: Unable to obtain cluster lock: Connection 
> timed out
> clurgmgrd[31777]: <warning> #67: Shutting down uncleanly
> clurgmgrd: [31777]: <info> Executing /etc/rc.d/init.d/vsftpd stop
> clurgmgrd: [31777]: <info> Executing /etc/rc.d/init.d/httpd stop
> vsftpd: vsftpd shutdown succeeded
> clurgmgrd: [31777]: <info> Removing IPv4 address 192.168.0.112 from eth0
> httpd: httpd shutdown succeeded
> clurgmgrd: [31777]: <info> Removing IPv4 address 192.168.0.111 from eth0
>
> =======END============
>
> Then it hanged forver until i manually reset the machine.
>
> I would like to know if the waiting is caused by this line :"
> clurgmgrd[31777]: <err> #50: Unable to obtain cluster lock: Connection 
> timed out
> " ?? If so, why and how to solve it??
>
> Also, i would like to know even i type " reboot" , it also hanged in 
> this line: "Shutting down Cluster Service Manager...
> Waiting for services to stop: " which caused me have press the reset 
> button, which may caused the file system corrupted, so manually press 
> the reset button is dangerous.
> Is there anyway for me to shutdown the rgmanager properly?
>
>
> Second question is, why the cluster didn't failover but the status 
> showed that the services were "started" ??? Is there anything i missed 
> in the configuration process??
>
> Many thanks,
> Dicky
>
>
>
>> Hi,
>>
>> What is output to the "/var/log/messages" files of
>> each node? That should provide a clue as to what the problem is. 
>> Also, did you install the 'fence' RPM and any Clustered LVM / GFS RPMs?
>>
>> You also might consider rebooting the "downed" node
>> - this function is generally taken care of by fencing devices
>> automatically and, as I understand it, "manual fencing" means you gotta
>> reboot :), the assumption being that a failed node won't be allowed
>> back in the cluster until it's restarted.
>>
>> Thanks,
>> Jon
>
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From lists at brimer.org  Fri Oct 20 14:06:27 2006
From: lists at brimer.org (Barry Brimer)
Date: Fri, 20 Oct 2006 09:06:27 -0500 (CDT)
Subject: [Linux-cluster] Updating cluster.conf
In-Reply-To: <2006102085944.756502@leena>
References: <2006102085944.756502@leena>
Message-ID: <Pine.LNX.4.61.0610200903110.27895@localhost.localdomain>

> I keep updating the file but the old one keeps coming back. Since I'm not able
> to start the cluster, I cannot run the commands;
>
> ccs_tool update /etc/cluster/cluster.conf
> cman_tool version -r x

You could scp the cluster.conf to the member machines.  Make sure you are 
updating the version number in your cluster.conf.  If you can use the old 
cluster.conf, you could then use system-config-cluster to update your 
cluster configuration which will propogate the new file for you.  Make 
sure you inspect /var/log/messages .. it can be very helpful.  Make sure 
that no new iptables firewall rules could be getting in your way.

Hope this helps.

Barry


From gforte at leopard.us.udel.edu  Fri Oct 20 14:10:56 2006
From: gforte at leopard.us.udel.edu (Greg Forte)
Date: Fri, 20 Oct 2006 10:10:56 -0400
Subject: [Linux-cluster] Updating cluster.conf
In-Reply-To: <2006102085944.756502@leena>
References: <2006102085944.756502@leena>
Message-ID: <4538D8F0.3020406@leopard.us.udel.edu>

isplist at logicore.net wrote:
> Are there any other lists for GFS? I've been trying to start this cluster for 
> a week now and can't seem to get anywhere. Are these messages archived for 
> searching somewhere?

Have you ever clicked on the link at the bottom of every list message?  ;-)

-g

Greg Forte
gforte at udel.edu
IT - User Services
University of Delaware
302-831-1982
Newark, DE


From ramon at vanalteren.nl  Fri Oct 20 14:16:40 2006
From: ramon at vanalteren.nl (Ramon van Alteren)
Date: Fri, 20 Oct 2006 16:16:40 +0200
Subject: [Linux-cluster] GFS filesystem "hang" with cluster-1.03.00
In-Reply-To: <20061020134641.GK20018@korben.rdu.redhat.com>
References: <45387D78.2060203@vanalteren.nl>
	<4538A97D.4070700@vanalteren.nl>	<20061020124117.GI20018@korben.rdu.redhat.com>	<4538D069.3050609@vanalteren.nl>
	<20061020134641.GK20018@korben.rdu.redhat.com>
Message-ID: <4538DA48.3010609@vanalteren.nl>

Josef Whiter wrote:
> On Fri, Oct 20, 2006 at 03:34:33PM +0200, Ramon van Alteren wrote:
>> Yes, this was the previous exchange. In the same exchange I was advised 
>> by Wendy Cheng to switch to iozone because it would avoid such multiple 
>> lock calls (less stat on files) instead of bonnie++ which we were 
>> previously testing with.
>>
> Ahh I mised that, apologies.

Nevermind ;-)

<snip>
> Ok if you are hitting this problem then the best thing to do is grab serveral
> instances of sysrq-t from all nodes so we can see who/what is hanging and start
> poking around.  Thank you,

Ok, I'll recompile kernels with sysrq included and try to reproduce.

Regards,

Ramon


From Markus at hochholdinger.net  Fri Oct 20 14:20:37 2006
From: Markus at hochholdinger.net (Markus Hochholdinger)
Date: Fri, 20 Oct 2006 16:20:37 +0200
Subject: [Linux-cluster] io scheduler and gnbd
Message-ID: <200610201620.41620.Markus@hochholdinger.net>

hi,

i'm succesfully using gnbd as a single service for a long time. Now i 
discovered a weired problem with the gnbd devices with kernel 2.6.18. I build 
the gnbd.ko module out of the cvs tree.
All works fine if you don't do to much an the gnbds. But if you stress test 
the devices, the gnbds will hang, e.g. reads and writes hang. If you restart 
the gnbd server, the client will continue to read and write until the next 
hang.
So i first checked my gnbd servers and tried from 1.01 till 1.03 and the 
latest cvs. But the problem is still there. From another gnbd client i had no 
problem, with none of these gnbd server versions (i was impressed you can mix 
these versions). Also changing the kernel on the gnbd server didn't helped.

So i was stick to the gnbd client with kernel 2.6.18. I have to use this 
kernel because of the new hardware. So i tried a little and found out that 
changing the default io scheduler for the gnbd devices on the client makes 
the hanging write and reads resume. The default scheduler was cfq and with 
this i can easily reproduce this behavior. With the deadline scheduler it 
doesn't.

So i read a little about io scheduling on linux. And my assumption is a gnbd 
device shouldn't need any io scheduling, because the network has no latency 
when seeking like a hard disk. On the gnbd server there are getting request 
from more than one gnbd client, so scheduled io on the client would mix up 
the scheduling on the server. And also the server does its own io scheduling 
when writing to the real disk.
So i could use the noop scheduler or have i missed something?

Has anyone on the list more info about io scheduling and gnbd?


-- 
greetings

eMHa
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061020/a0e6d037/attachment.sig>

From isplist at logicore.net  Fri Oct 20 14:24:53 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 20 Oct 2006 09:24:53 -0500
Subject: [Linux-cluster] Updating cluster.conf
In-Reply-To: <Pine.LNX.4.61.0610200903110.27895@localhost.localdomain>
Message-ID: <2006102092453.636785@leena>

Thanks for the input.

> You could scp the cluster.conf to the member machines.  Make sure you are
> updating the version number in your cluster.conf.

Thing is, I can't run any cman commands which include updating the conf file. 
Since I cannot start the cluster, I cannot run any cluster commands. Isn't 
that part of my problem, needing to figure out why/if cman is running?

It looks like cman won't start and because of that, fenced won't start either. 
There are plenty of errors in the logs which I've posted many times. I'm no 
longer sure what I'm looking for.

> cluster.conf, you could then use system-config-cluster to update your

system-config-cluster has never worked on any of my machines using ANY version 
of RH. I've posted about that but have not found any answers.

> cluster configuration which will propogate the new file for you.  Make
> sure you inspect /var/log/messages .. it can be very helpful.  Make sure
> that no new iptables firewall rules could be getting in your way.

Well, here's another paste of what's going on now...

First, how I'm starting my cluster, then some logging then my cluster.conf;

depmod -a
modprobe dm-mod
modprobe gfs
modprobe lock_dlm
ccsd
cman_tool join -t 120 -w
fence_tool join -t 120 -w
clvmd
vgchange -aly
mount -t gfs /dev/vgcomp/rimfire /var/www/

Then a snip of my logging;

Oct 20 09:06:03 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 20 09:06:04 cweb92 kernel: CMAN: sending membership request
Oct 20 09:06:04 cweb92 kernel: CMAN: Cluster membership rejected
Oct 20 09:06:04 cweb92 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 20 09:06:04 cweb92 ccsd[4788]: Connected to cluster infrastruture via: 
CMAN/SM Plugin v1.1.7.1
Oct 20 09:06:04 cweb92 ccsd[4788]: Initial status:: Inquorate
Oct 20 09:06:08 cweb92 kernel: CMAN: sending membership request
Oct 20 09:06:08 cweb92 kernel: CMAN: Cluster membership rejected
Oct 20 09:06:08 cweb92 ccsd[4788]: Cluster manager shutdown.  Attemping to 
reconnect...
Oct 20 09:06:23 cweb92 clvmd: Can't open cluster manager socket: Network is 
down
Oct 20 09:06:23 cweb92 ccsd[4788]: Remote copy of cluster.conf is from quorate 
node.
Oct 20 09:06:23 cweb92 ccsd[4788]:  Local version # : 42
Oct 20 09:06:23 cweb92 ccsd[4788]:  Remote version #: 42
Oct 20 09:06:23 cweb92 ccsd[4788]: Remote copy of cluster.conf is from quorate 
node.
Oct 20 09:06:23 cweb92 ccsd[4788]:  Local version # : 42
Oct 20 09:06:23 cweb92 ccsd[4788]:  Remote version #: 42
Oct 20 09:06:27 cweb92 ccsd[4788]: Unable to connect to cluster infrastructure 
after 30 seconds.
Oct 20 09:06:57 cweb92 ccsd[4788]: Unable to connect to cluster infrastructure 
after 60 seconds.

Cluster.conf;

<?xml version="1.0"?>
<cluster config_version="46" name="vgcomp">
<fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
    <clusternodes>
        <clusternode name="cweb92" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="16"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="cweb93" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="17"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="cweb94" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="18"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm250" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="19"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm249" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="20"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm248" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="21"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm247" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="22"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="dev" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_mcdata" port="23"/>
                    </method>
                </fence>
        </clusternode>
    </clusternodes>
<fencedevices>
    <fencedevice agent="fence_mcdata" ipaddr="x.x.x.x" login="user" 
name="ED5000" passwd="xxxx"/>
</fencedevices>
</cluster>

Mike


From isplist at logicore.net  Fri Oct 20 14:25:49 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 20 Oct 2006 09:25:49 -0500
Subject: [Linux-cluster] Updating cluster.conf
In-Reply-To: <4538D8F0.3020406@leopard.us.udel.edu>
Message-ID: <2006102092549.379148@leena>

>> a week now and can't seem to get anywhere. Are these messages archived for
>> searching somewhere?
>> 
> Have you ever clicked on the link at the bottom of every list message?  -)

Never even noticed them, been too caught up reading the replies and trying to 
put the help to use.

Good lead though :).

Mike


From cosimo at streppone.it  Fri Oct 20 14:47:11 2006
From: cosimo at streppone.it (Cosimo Streppone)
Date: Fri, 20 Oct 2006 16:47:11 +0200
Subject: [Linux-cluster] Samba share resource fails to start after upgrade
 to Cluster Suite 4U4
Message-ID: <4538E16F.3060709@streppone.it>

Hi all clustering guys,

I'm experiencing a problem with a 2 node cluster just upgraded
(this morning) to current RHEL 4U4 and CS.

cluster.conf file didn't change in respect to this samba resource.
It is a external smb share that should be mounted on the active
node.

Relevant error messages from system log:

   Oct 20 14:14:05 clu4-2 smbd[7467]:   params.c:OpenConfFile() - Unable to open 
configuration file "/etc/samba/smb.conf.//share/exportdb":
   Oct 20 14:14:06 clu4-2 clurgmgrd: [3205]: <err> Samba service failed: /usr/sbin/smbd -D 
-s "/etc/samba/smb.conf.//share/exportdb"
   Oct 20 14:14:20 clu4-2 clurgmgrd: [3205]: <warning> 
"/etc/samba/smb.conf.//share/exportdb" missing during stop
   Oct 20 14:14:40 clu4-2 clurgmgrd: [3205]: <warning> 
"/etc/samba/smb.conf.//share/exportdb" missing during stop
   Oct 20 14:14:48 clu4-2 clurgmgrd: [3205]: <warning> 
"/etc/samba/smb.conf.//share/exportdb" missing during stop

I don't know why it searches for "/etc/samba/smb.conf.//share/exportdb"...
Maybe something changed in the way smb shares are managed?

Should I change resource name from "//share/exportdb" to "myshare",
I can't seem to find a place where the actual smb path "//share/exportdb"
can be specified.

Can you help me on this?
Thank you.

-- 
Cosimo


From isplist at logicore.net  Fri Oct 20 17:13:24 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 20 Oct 2006 12:13:24 -0500
Subject: [Linux-cluster] Updating cluster.conf
In-Reply-To: <Pine.LNX.4.61.0610200903110.27895@localhost.localdomain>
Message-ID: <20061020121324.211430@leena>

> You could scp the cluster.conf to the member machines.  Make sure you are
> updating the version number in your cluster.conf.  If you can use the old
> cluster.conf, you could then use system-config-cluster to update your
> cluster configuration which will propogate the new file for you.  Make
> sure you inspect /var/log/messages .. it can be very helpful.  Make sure
> that no new iptables firewall rules could be getting in your way.

Yes, I've been doing all of these things and every time I find solutions that 
seem to be related, I still don't have the answer.

[root at cweb92 etc]# ccs_test connect
ccs_connect failed: Connection refused

Nothing much in the log other than;
Oct 20 11:35:28 cweb92 ccsd[5369]: Error while processing connect: Connection 
refused
Oct 20 11:35:46 cweb92 ccsd[5369]: Unable to connect to cluster infrastructure 
after 210 seconds.

Mike


From bmarzins at redhat.com  Sat Oct 21 00:46:15 2006
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 20 Oct 2006 19:46:15 -0500
Subject: [Linux-cluster] io scheduler and gnbd
In-Reply-To: <200610201620.41620.Markus@hochholdinger.net>
References: <200610201620.41620.Markus@hochholdinger.net>
Message-ID: <20061021004614.GB4771@ether.msp.redhat.com>

On Fri, Oct 20, 2006 at 04:20:37PM +0200, Markus Hochholdinger wrote:
> hi,
> 
> i'm succesfully using gnbd as a single service for a long time. Now i 
> discovered a weired problem with the gnbd devices with kernel 2.6.18. I build 
> the gnbd.ko module out of the cvs tree.
> All works fine if you don't do to much an the gnbds. But if you stress test 
> the devices, the gnbds will hang, e.g. reads and writes hang. If you restart 
> the gnbd server, the client will continue to read and write until the next 
> hang.
> So i first checked my gnbd servers and tried from 1.01 till 1.03 and the 
> latest cvs. But the problem is still there. From another gnbd client i had no 
> problem, with none of these gnbd server versions (i was impressed you can mix 
> these versions). Also changing the kernel on the gnbd server didn't helped.
> 
> So i was stick to the gnbd client with kernel 2.6.18. I have to use this 
> kernel because of the new hardware. So i tried a little and found out that 
> changing the default io scheduler for the gnbd devices on the client makes 
> the hanging write and reads resume. The default scheduler was cfq and with 
> this i can easily reproduce this behavior. With the deadline scheduler it 
> doesn't.
> 
> So i read a little about io scheduling on linux. And my assumption is a gnbd 
> device shouldn't need any io scheduling, because the network has no latency 
> when seeking like a hard disk. On the gnbd server there are getting request 
> from more than one gnbd client, so scheduled io on the client would mix up 
> the scheduling on the server. And also the server does its own io scheduling 
> when writing to the real disk.
> So i could use the noop scheduler or have i missed something?
> 
> Has anyone on the list more info about io scheduling and gnbd?
> 
If gnbd isn't working with the latest kernel, it's pretty definitely a bug. I'll
take a look and see if I can't reproduce it.  As far as IO scheduling goes,
if you can work around this with a different scheduler, that's great, but
depending on your IO patterns, gnbd should see a benefit from reordering
requests.  It is more efficient to send a fewer number of larger requests to the
server. There are a couple or reasons why but the big one is that the gnbd
server does not reorder requests itself.

Currently the gnbd server receives a request, performs the request, returns
the result, and then goes on to the next request. There is only one thread per
client per device. Obviously, your need to have the IO complete to disk before
you return the read result. But because gnbd is pretending to be a block device,
when the server says that the data has been written out, the data must be
actually on disk. This means that the request must be synced to disk before the
server returns a write result and goes on to the next request. So the gnbd
server always has it's requests complete to disk before it gets a new one, so
it cannot usefully reorder them.  It can reorder requests if they come in from
different clients, but I don't think that this gets you much.

Now that (I believe) you can do async IO to a device opened with the O_SYNC
flag from userspace, the gnbd server could be rewritten much more effectively.
Unfortuntely, it probably won't happen anytime soon. 

Thanks for the heads up, and if you wouldn't mind filing a bugzilla about this
at bugzilla.redhat.com, that would be helpful.

-Ben

> 
> -- 
> greetings
> 
> eMHa


> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Sat Oct 21 01:39:33 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 20 Oct 2006 20:39:33 -0500
Subject: [Linux-cluster] FC Fabric and GFS
Message-ID: <20061020203933.676281@leena>

Is there some weird problem with McData FC switched and GFS? I finally got my 
cluster working (***finally***) and now I can't see any of the storage.

The McData is a fabric switch. I'm not sure what info someone would need in 
order to give me any feedback so will reply with what ever I am asked for.

This mess has been down for over a week now. Trying to get this McData ED-5000 
switch working with everything has been nearly enough to make me quit 
computers for good.

Please, ANY help in trying to get my storage seen from GFS would be wonderful. 
If I'm on the wrong list, perhaps you can point me to another one.

Mike


From isplist at logicore.net  Sat Oct 21 01:49:04 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 20 Oct 2006 20:49:04 -0500
Subject: [Linux-cluster] GFS Shut down script
Message-ID: <2006102020494.202600@leena>

I have not been able to find a way of shutting down the GFS/Cluster when I 
reboot and so have been messing up my cluster.

Where could I put the commands (which file is part of some other shutdown 
sequence) so that when I hit CTRL-ALT-DEL or type shutdown for example, that 
the commands will run, shutting down my cluster cleanly.

Mike


From rpeterso at redhat.com  Sat Oct 21 04:16:54 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Fri, 20 Oct 2006 23:16:54 -0500
Subject: [Linux-cluster] FC Fabric and GFS
In-Reply-To: <20061020203933.676281@leena>
References: <20061020203933.676281@leena>
Message-ID: <45399F36.2040806@redhat.com>

isplist at logicore.net wrote:
> Is there some weird problem with McData FC switched and GFS? I finally got my 
> cluster working (***finally***) and now I can't see any of the storage.
>
> The McData is a fabric switch. I'm not sure what info someone would need in 
> order to give me any feedback so will reply with what ever I am asked for.
>
> This mess has been down for over a week now. Trying to get this McData ED-5000 
> switch working with everything has been nearly enough to make me quit 
> computers for good.
>
> Please, ANY help in trying to get my storage seen from GFS would be wonderful. 
> If I'm on the wrong list, perhaps you can point me to another one.
>
> Mike
>   
Hi Mike,

I don't know how helpful this will be because I'm not sure exactly what 
you mean by
it not seeing your storage (there are many layers you could be referring 
to),
and I'm not familiar with McData, however:

The first step is to insert the device driver for your Host Bus Adapter and
see if it reports any problems, or if it reports the hardware is there 
in dmesg.
Here's what mine looks like for a QLogic 2400 HBA:

QLogic Fibre Channel HBA Driver
ACPI: PCI interrupt 0000:01:00.0[A] -> GSI 16 (level, low) -> IRQ 169
qla2400 0000:01:00.0: Found an ISP2432, irq 169, iobase 0xffffff0000014000
qla2400 0000:01:00.0: Configuring PCI space...
PCI: Setting latency timer of device 0000:01:00.0 to 64
qla2400 0000:01:00.0: Configure NVRAM parameters...
qla2400 0000:01:00.0: Verifying loaded RISC code...
qla2400 0000:01:00.0: Allocated (1061 KB) for firmware dump...
qla2400 0000:01:00.0: Waiting for LIP to complete...
qla2400 0000:01:00.0: LIP reset occured (f8f7).
qla2400 0000:01:00.0: LIP occured (f8f7).
qla2400 0000:01:00.0: LIP reset occured (f7f7).
qla2400 0000:01:00.0: LOOP UP detected (4 Gbps).
qla2400 0000:01:00.0: Topology - (F_Port), Host Loop address 0x0
scsi2 : qla2xxx
qla2400 0000:01:00.0:
 QLogic Fibre Channel HBA Driver: 8.01.04-d7
  QLogic QLE2460 - PCI-Express to 4Gb FC, Single Channel
  ISP2432: PCIe (2.5Gb/s x4) @ 0000:01:00.0 hdma+, host#=2, fw=4.00.18 [IP]
  Vendor: WINSYS    Model: SA3482            Rev: 347B
  Type:   Direct-Access                      ANSI SCSI revision: 03
qla2400 0000:01:00.0: scsi(2:0:0:0): Enabled tagged queuing, queue depth 32.
SCSI device sdc: 2342664192 512-byte hdwr sectors (1199444 MB)
SCSI device sdc: drive cache: write back
SCSI device sdc: 2342664192 512-byte hdwr sectors (1199444 MB)
SCSI device sdc: drive cache: write back
 sdc: unknown partition table
Attached scsi disk sdc at scsi2, channel 0, id 0, lun 0

 From these messages, you can tell that the SAN is working, and is
recognized as /dev/sdc by the kernel.
If it complains about lip errors and such, it could be your cable or 
connectors.
If the HBA seems to work, but doesn't see the SAN, you may need to
configure the fabric, and I have no idea how to do that on McData.
I've only watched someone else do it--in a hurry--and on a Tornado,
and that was using a web interface.

Next, cat /proc/partitions to make sure your SAN shows up there.
In my case, I see a line that looks like this:
   8    32 1171332096 sdc

Third, make sure your /etc/lvm/lvm.conf isn't filtering out the device,
thereby making it invisible as far as lvm is concerned.  A line like this:
filter = [ "r/sdc/", "r/disk/", "a/.*/" ]
would make /dev/sdc invisible to lvm.

Next, check your /etc/lvm/lvm.conf for locking_type = 2.
You'll need that to share access to the SAN on the cluster.

Next, make sure your clvmd service is started.  If not, do:
service clvmd start (and perhaps chkconfig clvmd on).

Next, try a vgchange -aly and vgscan to get clvmd to look again for it,
although the clvmd service should take care of this.

Next, use the pvdisplay command to see if lvm recognizes the
physical volumes.  If you haven't done this yet, you'll probably
want to create one, like this:
pvcreate /dev/sdc

Use vgs to see if your volume group was found.  If you haven't
created one yet, you may need to create one with vgcreate.
vgcreate mikes_vg /dev/sdc

Finally, use lvs to make sure it sees your logical volume.
If you haven't created one, do something like:
lvcreate -L 39G mikes_vg

I hope this was helpful.  If not, post which part you're stuck on.

Regards,

Bob Peterson
Red Hat Cluster Suite


From orkcu at yahoo.com  Sat Oct 21 04:21:49 2006
From: orkcu at yahoo.com (Roger Pe�a Escobio)
Date: Fri, 20 Oct 2006 21:21:49 -0700 (PDT)
Subject: [Linux-cluster] FC Fabric and GFS
In-Reply-To: <20061020203933.676281@leena>
Message-ID: <20061021042149.38968.qmail@web50606.mail.yahoo.com>


--- "isplist at logicore.net" <isplist at logicore.net>
wrote:

> Is there some weird problem with McData FC switched
> and GFS? I finally got my 
> cluster working (***finally***) and now I can't see
> any of the storage.
>
> Please, ANY help in trying to get my storage seen
> from GFS would be wonderful. 
> If I'm on the wrong list, perhaps you can point me
> to another one.

first, the linux system (kernel) needs to see somehow,
and create the devices (plurar because you may have
several path to reach each LUN created in the san)

second, if you have several path you will need to
"join" those path into one path and create a "virtual"
device  who represent the real-only one LUN :-) ,
maybe dm-multipath is ok with you or you can pay for
powerpath 

finally, you will be able to create the GFS filesystem
in the "virtual" device.

so, do your linux kernel see the LUN ?

cu
roger

__________________________________________
RedHat Certified Engineer ( RHCE )
Cisco Certified Network Associate ( CCNA )

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From rpeterso at redhat.com  Sat Oct 21 04:36:31 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Fri, 20 Oct 2006 23:36:31 -0500
Subject: [Linux-cluster] GFS Shut down script
In-Reply-To: <2006102020494.202600@leena>
References: <2006102020494.202600@leena>
Message-ID: <4539A3CF.9080904@redhat.com>

isplist at logicore.net wrote:
> I have not been able to find a way of shutting down the GFS/Cluster when I 
> reboot and so have been messing up my cluster.
>
> Where could I put the commands (which file is part of some other shutdown 
> sequence) so that when I hit CTRL-ALT-DEL or type shutdown for example, that 
> the commands will run, shutting down my cluster cleanly.
>
> Mike
>   
Hi Mike,

In theory, when you do /sbin/reboot or shutdown, it should automatically 
call the
service shutdown scripts, at least if you're using Fedora or RHEL.
These are the same scripts that start your services.  If you're running RHEL
or some variation of that software (i.e. STABLE cvs branch, etc.) they 
will be:

/etc/rc.d/init.d/ccsd
/etc/rc.d/init.d/cman
/etc/rc.d/init.d/fenced
/etc/rc.d/init.d/clvmd
/etc/rc.d/init.d/gfs

They are started in that order, usually because init finds them in
/etc/rc.d/rc3.d/S* (for runlevel 3) or /etc/rc.d/rc5.d/S* (for runlevel 5).
They are likewise stopped in the opposite order, as dictated by
matching K scripts at runlevel 6: /etc/rc.d/rc6.d/K*

For the HEAD branch (RHEL5, FC6, etc.) there are only 3:

/etc/rc.d/init.d/cman
/etc/rc.d/init.d/clvmd
/etc/rc.d/init.d/gfs

Again, started in that order, and stopped in reverse order.

Those scripts should take care of shutting down the cluster cleanly.
However, if the gfs service can't unmount the gfs mount points
because they're in use, then all bets are off and your cluster may
not shut down cleanly.

You may also want to take a look at the faq, at this question:
http://sources.redhat.com/cluster/faq.html#cman_shutdown

I hope this helps.

Regards,

Bob Peterson
Red Hat Cluster Suite


From dicky_nnc at yahoo.com.hk  Sat Oct 21 13:59:19 2006
From: dicky_nnc at yahoo.com.hk (Dicky)
Date: Sat, 21 Oct 2006 21:59:19 +0800
Subject: [Linux-cluster] RE: Cluster Suite 4 failover problem 
Message-ID: <453A27B7.6050605@yahoo.com.hk>

HI  Jeff & Lon,

Thanks for the reply.

Regarding the didn't failover issue (just displayed the "Owner --> 
unknown"  and "State --> started" but actually none services were 
available), i checked the log and agreed that it should be the 
fence_manual problem. It is because the log message showed that the 
fence_manul was waiting node2 to rejoin the cluster, as soon as i 
executed the command: fence_ack_manual -n node2, the failed services 
failover to node1, all failed service back to normal.

I would like to know if there is any solution or workaround for this 
situation other than buying a fence device :) ????? Can i remove the 
fence.rpm ??? Will it cause any extra problems????? It is because in 
production environment, we never know when will the machine down and 
cannot execute the fence_ack_manual command immediately.

========/var/log/messages======
kernel: CMAN: removing node node2 from the cluster : Missed too many 
heartbeats
fenced[2447]: node2 not a cluster member after 0 sec post_fail_delay
fenced[2447]: fencing node "node2"
fence_manual: Node node2 needs to be reset before recovery can procede.  
Waiting for node2 to rejoin the cluster or for manual acknowledgement 
that it has been reset (i.e. fence_ack_manual -n node2)
=======END================


Regarding the monitor_link issue, i have tried to set the "monitor_link 
=1 " for both resource ip i.e. 192.168.0.111 and 192.168.0.112 , then i 
shutdown eth0 of node2 and re-enable it,  when i tried to restart the 
rgmanager in node2 i.e. the failed node, it still showing the msg 
"Shutting down Cluster Service Manager... Waiting for services to stop: 
", i have to kill the rgmanager's processes or even worse i have to 
reset the machine. Any ideas??

One more thing is even the monitor_link=0 in the cluster.conf, the 
system-config-cluster --> Resource --> IP address's Monitor Link box is 
being ticked!!! Why??

Many thanks,
Dicky


From isplist at logicore.net  Sat Oct 21 19:54:21 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sat, 21 Oct 2006 14:54:21 -0500
Subject: [Linux-cluster] ccs files
Message-ID: <20061021145421.250441@leena>

My cluster *seems* to be working fine with only the /etc/cluster/cluster.conf 
file. However, I read that I need to have nodes.ccs and fence.ccs as well. I 
cannot however, find any info on where to place these files and if they are in 
fact required?

Are there two different setups, one using ccs files and another using .conf 
files?

Mike


From isplist at logicore.net  Sat Oct 21 19:56:14 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sat, 21 Oct 2006 14:56:14 -0500
Subject: [Linux-cluster] FC Fabric and GFS
In-Reply-To: <20061021192801.62486.qmail@web50613.mail.yahoo.com>
Message-ID: <20061021145614.102571@leena>

> I guess you are getting a problem with  FC zone
> configuration or maybe somekind of fabric license
> problem

Yes, you're right, it does not seem to be a GFS issue after all but a 
fabric/Linux issue which I need to figure out.

Thanks.

Mike


From isplist at logicore.net  Sat Oct 21 20:07:08 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sat, 21 Oct 2006 15:07:08 -0500
Subject: [Linux-cluster] FC Fabric and GFS
In-Reply-To: <45399F36.2040806@redhat.com>
Message-ID: <200610211578.183387@leena>

> I don't know how helpful this will be because I'm not sure exactly what
> you mean by it not seeing your storage (there are many layers you could be 
>referring to),

It seems to be a fabric issue. When I re-connect everything over to regular FC 
hub's, everything's fine, storage shows up on all nodes. When I re-connect the 
nodes to the McData fabric switch, no storage shows up.

> Third, make sure your /etc/lvm/lvm.conf isn't filtering out the device,
> thereby making it invisible as far as lvm is concerned.  A line like this:
> filter = [ "r/sdc/", "r/disk/", "a/.*/" ]
> would make /dev/sdc invisible to lvm.

I have the default which seems to allow everything.
   filter = [ "a/.*/" ]

Other's are commented out.

   # filter = [ "r|/dev/cdrom|" ]
   # filter = [ "a/loop/", "r/.*/" ]
   # filter =[ "a|loop|", "r|/dev/hdc|", "a|/dev/ide|", "r|.*|" ]
   # filter = [ "a|^/dev/hda8$|", "r/.*/" ]
 
> Next, check your /etc/lvm/lvm.conf for locking_type = 2.
> You'll need that to share access to the SAN on the cluster.

Yes, locking_type = 2 is in there.
 
> Next, make sure your clvmd service is started.  If not, do:
> service clvmd start (and perhaps chkconfig clvmd on).

Yes, it's running.

I can't get to any of the following since the storage is not seen;
 
> Next, try a vgchange -aly and vgscan to get clvmd to look again for it,
> pvcreate /dev/sdc
> vgcreate mikes_vg /dev/sdc
> lvcreate -L 39G mikes_vg

yup, I can do all of the above when I can see the storage by using the FC 
hub's but the McData is not allowing the storage to be seen. 
 
> I hope this was helpful.  If not, post which part you're stuck on.

Yes, that's all helpful, a nice order of things to check. I'm looking on the 
RH site trying to find reasons why a fabric switch might make a difference and 
not allow the storage to be seen.

Mike


From isplist at logicore.net  Sat Oct 21 20:11:38 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sat, 21 Oct 2006 15:11:38 -0500
Subject: [Linux-cluster] GFS Shut down script
In-Reply-To: <4539A3CF.9080904@redhat.com>
Message-ID: <20061021151138.281725@leena>

> These are the same scripts that start your services.  If you're running RHEL
> or some variation of that software (i.e. STABLE cvs branch, etc.) they
> will be:

Problem is, I was never able to get the cluster up by using auto loading 
services. As such, I created a small script that fires everything up.

//
depmod -a
modprobe dm-mod
modprobe gfs
modprobe lock_dlm

ccsd
cman_tool join -w
fence_tool join -w
clvmd
vgchange -aly
//
Then what ever other things I wanted when firing up a node. As such, now I 
need to shut these things down automatically when I reboot a node because it's 
way too easy to foget and then mess up the cluster.

> /etc/rc.d/init.d/cman
> /etc/rc.d/init.d/clvmd
> /etc/rc.d/init.d/gfs

Could I add my shutdown commands into one of the above?

> However, if the gfs service can't unmount the gfs mount points
> because they're in use, then all bets are off and your cluster may
> not shut down cleanly.

That does seem to happen too regularly, I try to leave a cluster and things 
break.

> http://sources.redhat.com/cluster/faq.html#cman_shutdown

Will do, thanks much.

Mike


From jos at xos.nl  Sat Oct 21 20:11:12 2006
From: jos at xos.nl (Jos Vos)
Date: Sat, 21 Oct 2006 22:11:12 +0200
Subject: [Linux-cluster] ccs files
In-Reply-To: <20061021145421.250441@leena>;
	from isplist@logicore.net on Sat, Oct 21, 2006 at 02:54:21PM -0500
References: <20061021145421.250441@leena>
Message-ID: <20061021221112.C20029@xos037.xos.nl>

On Sat, Oct 21, 2006 at 02:54:21PM -0500, isplist at logicore.net wrote:

> My cluster *seems* to be working fine with only the /etc/cluster/cluster.conf 
> file. However, I read that I need to have nodes.ccs and fence.ccs as well. I 
> cannot however, find any info on where to place these files and if they are in
> fact required?

Where do you read that?  Isn't that old documentation?  I see them
listed in the GFS 6.0 guide, but not in the GFS 6.1 guide.

AFAIK cluster.conf is the only config file used in the latest software
versions of RHCS and GFS.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From isplist at logicore.net  Sat Oct 21 20:13:40 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sat, 21 Oct 2006 15:13:40 -0500
Subject: [Linux-cluster] ccs files
In-Reply-To: <20061021221112.C20029@xos037.xos.nl>
Message-ID: <20061021151340.532466@leena>

> Where do you read that?  Isn't that old documentation?  I see them
> listed in the GFS 6.0 guide, but not in the GFS 6.1 guide.

Yes, searching online, on RH site and the net in general. 
 
> AFAIK cluster.conf is the only config file used in the latest software
> versions of RHCS and GFS.

Ok, so nothing else needed, good to know :)

Mike


From isplist at logicore.net  Sat Oct 21 20:17:49 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sat, 21 Oct 2006 15:17:49 -0500
Subject: [Linux-cluster] GFS Shut down script
In-Reply-To: <4539A3CF.9080904@redhat.com>
Message-ID: <20061021151749.745925@leena>

> You may also want to take a look at the faq, at this question:
> http://sources.redhat.com/cluster/faq.html#cman_shutdown

PS: Just tried this on a node and I get;

My script contains;

//
umount /var/www
vgchange -aln
killall clvmd
fence_tool leave
cman_tool leave remove -w
killall ccsd
//

I get;
cman_tool: Can't leave cluster while there are 3 active subsystems

Plus, I'm confused on why it says 3 active subsystems? Guess I'm not clear why 
only 3 got votes?

# cman_tool nodes
Node  Votes Exp Sts  Name
   1    1    1   M   dev
   2    1    8   M   cweb92
   3    1    1   M   cweb93
   4    1    1   M   cweb94
   5    1    8   M   qm250
   6    1    8   M   qm247
   7    1    8   M   qm248
   8    1    8   M   qm249

Mike


From rpeterso at redhat.com  Sat Oct 21 20:53:46 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Sat, 21 Oct 2006 15:53:46 -0500
Subject: [Linux-cluster] ccs files
In-Reply-To: <20061021145421.250441@leena>
References: <20061021145421.250441@leena>
Message-ID: <453A88DA.5000707@redhat.com>

isplist at logicore.net wrote:
> My cluster *seems* to be working fine with only the /etc/cluster/cluster.conf 
> file. However, I read that I need to have nodes.ccs and fence.ccs as well. I 
> cannot however, find any info on where to place these files and if they are in 
> fact required?
>
> Are there two different setups, one using ccs files and another using .conf 
> files?
>
> Mike
>   
Hi Mike,

You don't need those files for the RHEL4 or STABLE branches.  That's the 
old way of
doing things.

Bob Peterson
Red Hat Cluster Suite


From rpeterso at redhat.com  Sat Oct 21 21:26:47 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Sat, 21 Oct 2006 16:26:47 -0500
Subject: [Linux-cluster] GFS Shut down script
In-Reply-To: <20061021151749.745925@leena>
References: <20061021151749.745925@leena>
Message-ID: <453A9097.3060604@redhat.com>

isplist at logicore.net wrote:
> I get;
> cman_tool: Can't leave cluster while there are 3 active subsystems
>   
Hi Mike,

That message means there is some part of the cluster infrastructure that 
is still
running.  These are just pieces that are talking to each other.  They 
include:

1. clvmd
2. fenced
3. rgmanager
4. gfs

And so forth.  You can find out how many subsystems are active by doing:

cat /proc/cluster/status

Try executing the shutdown scripts manually and check the count after each:
service rgmanager stop; cat /proc/cluster/status
service gfs stop;cat /proc/cluster/status
vgchange -aln
service clvmd stop;cat /proc/cluster/status
service fenced stop;cat /proc/cluster/status

If doing this manually gets your subsystems down to 0, you should be able
to do the service cman stop (or cman_tool leave) without getting that error.
If that's the case, make sure all those service scripts appear as Kxxx files
in /etc/rc.d/rc6.d/

Regards,

Bob Peterson,
Red Hat Cluster SUite


From Markus at hochholdinger.net  Sat Oct 21 23:31:03 2006
From: Markus at hochholdinger.net (Markus Hochholdinger)
Date: Sun, 22 Oct 2006 01:31:03 +0200
Subject: [Linux-cluster] io scheduler and gnbd
In-Reply-To: <20061021004614.GB4771@ether.msp.redhat.com>
References: <200610201620.41620.Markus@hochholdinger.net>
	<20061021004614.GB4771@ether.msp.redhat.com>
Message-ID: <200610220131.07612.Markus@hochholdinger.net>

hi,

Am Samstag, 21. Oktober 2006 02:46 schrieb Benjamin Marzinski:
> On Fri, Oct 20, 2006 at 04:20:37PM +0200, Markus Hochholdinger wrote:
> > i'm succesfully using gnbd as a single service for a long time. Now i
> > discovered a weired problem with the gnbd devices with kernel 2.6.18. I
> > build the gnbd.ko module out of the cvs tree.
> > All works fine if you don't do to much an the gnbds. But if you stress
> > test the devices, the gnbds will hang, e.g. reads and writes hang. If you
> > restart the gnbd server, the client will continue to read and write until
> > the next hang.
> > So i first checked my gnbd servers and tried from 1.01 till 1.03 and the
> > latest cvs. But the problem is still there. From another gnbd client i
> > had no problem, with none of these gnbd server versions (i was impressed
> > you can mix these versions). Also changing the kernel on the gnbd server
> > didn't helped.
> > So i was stick to the gnbd client with kernel 2.6.18. I have to use this
> > kernel because of the new hardware. So i tried a little and found out
> > that changing the default io scheduler for the gnbd devices on the client
> > makes the hanging write and reads resume. The default scheduler was cfq
> > and with this i can easily reproduce this behavior. With the deadline
> > scheduler it doesn't.
> > So i read a little about io scheduling on linux. And my assumption is a
> > gnbd device shouldn't need any io scheduling, because the network has no
> > latency when seeking like a hard disk. On the gnbd server there are
> > getting request from more than one gnbd client, so scheduled io on the
> > client would mix up the scheduling on the server. And also the server
> > does its own io scheduling when writing to the real disk.
> > So i could use the noop scheduler or have i missed something?
> > Has anyone on the list more info about io scheduling and gnbd?
> If gnbd isn't working with the latest kernel, it's pretty definitely a bug.

i've spend the whole saturday to solve this problem. And i figured out that it 
is not the gnbd's fault.
It's the 3ware controller in the gnbd server!
I thought the gnbd server is all ok because my other gnbd clients had no 
problem, but i was wrong. The new gnbd client, which doesn't work 
correctly/smothly, was connected with 1GBit/s network in contrast to the 
other clients with 100MBit/s.
The 3ware controller in the gnbd server has very little performance (i don't 
get it if it is the driver or the hardware). I've JBOD configured in the 
3ware controller and manage 3 hard disks with lvm, which i export over gnbd. 
Even without gnbd i can't write more than 5MB/s on one of the JBOD disks if i 
write more than the system can cache. (Because of a lot of RAM in this gnbd 
server and the stripe set of three disks i didn't notice before.)
When i turn on WRITE CACHE in the 3ware controller (without bbu) i get 75MB/s 
per JBOD disk!
And with this, gnbd all works fine. (I think i've to replace the 3ware 
controller or to buy a bbu.)
So after having a fast gnbd server i couldn't reproduce the problem.


> I'll take a look and see if I can't reproduce it.  As far as IO scheduling
> goes, if you can work around this with a different scheduler, that's great,
> but depending on your IO patterns, gnbd should see a benefit from
> reordering requests.  It is more efficient to send a fewer number of larger
> requests to the server. There are a couple or reasons why but the big one
> is that the gnbd server does not reorder requests itself.
> Currently the gnbd server receives a request, performs the request, returns
> the result, and then goes on to the next request. There is only one thread
> per client per device. Obviously, your need to have the IO complete to disk
> before you return the read result. But because gnbd is pretending to be a
> block device, when the server says that the data has been written out, the
> data must be actually on disk. This means that the request must be synced
> to disk before the server returns a write result and goes on to the next
> request. So the gnbd server always has it's requests complete to disk
> before it gets a new one, so it cannot usefully reorder them.  It can
> reorder requests if they come in from different clients, but I don't think
> that this gets you much.

Well, i've a setup like this:
gnbd-server, 3 disks, lvm with stripes, logical volumes, a lot of gnbd 
clients.
So if one gnbd client reads/writes it's sure another one is also reading and 
writing. So the final scheduling is in the block device of the 3 disks resp. 
if there comes data from more than one client at a time it will be 
rescheduled in the block device.
And i think here was the problem.
Either the io scheduler on the gnbd server gets over(?) filled or the ordering 
coming from the gnbd client has produced some kind of dead lock together with 
the bad performance!?

Many thanks for your explanation of the io and scheduling behavior of the gnbd 
server. So i will go back to use a scheduler for the gnbd devices.
Also i think it should be wise to use the same io scheduler on the gnbd server 
as on the client.

BTW: I've recently set up exactly the same combination of hard disks, gnbd 
server and client like in this case. And there i get great performance over 
1GBit/s network (read/write about 80MB/s).


> Now that (I believe) you can do async IO to a device opened with the O_SYNC
> flag from userspace, the gnbd server could be rewritten much more
> effectively. Unfortuntely, it probably won't happen anytime soon.

In my case i mean the async io comes on the gnbd server from multiple clients 
writing to the same hard disks (physcal volumes) because of the lvm and 
stripes.
So the io scheduler of the real hard disks has to manage pre scheduled events 
from gnbd. And before this email i thought doing scheduling twice will be a 
waste of time because the network doesn't need time to seek like a hard disk. 
But as you explained, the gnbd server and client is optimized for large data 
blocks and therefore scheduling on the client is a good thing.


> Thanks for the heads up, and if you wouldn't mind filing a bugzilla about
> this at bugzilla.redhat.com, that would be helpful.

Hm, i didn't really find a bug. Perhaps i found a bug in the linux io 
scheduler or in some hardware.
But as far as i can see, gnbd behaves like a rock. And if the hard disks 
didn't get managed the data, i think gnbd can't help here.

BTW: Now that i've tested all gnbd kernel modules and all gnbd client and 
server versions against each other (1.01, 1.02, 1.03 and latest cvs) and it 
worked, i must say that you guys make great work :-) It never crashed (oh, 
well, one time i get kernel panics when i loaded the gnbd kernel module with 
all debugging on, but the system kept on running). I was really surprised.

BTW2: I was afraid when i will upgrade my gnbd servers and clients in which 
order i've to update. Now i know it doesn't really matter. Great :-)


-- 
greetings

eMHa
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061022/7442b87f/attachment.sig>

From pcaulfie at redhat.com  Mon Oct 23 08:05:36 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 23 Oct 2006 09:05:36 +0100
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <20061019113230.691224@leena>
References: <20061019113230.691224@leena>
Message-ID: <453C77D0.5060201@redhat.com>

isplist at logicore.net wrote:
> I'm not seeing any of these...
> 

of what?

> 
> Oct 19 11:22:07 dev ccsd[4478]: Initial status:: Inquorate
> Oct 19 11:22:11 dev kernel: CMAN: sending membership request
> Oct 19 11:22:11 dev kernel: CMAN: Cluster membership rejected
> Oct 19 11:22:11 dev ccsd[4478]: Cluster manager shutdown.  Attemping to 
> reconnect...
> Oct 19 11:22:11 dev kernel: CMAN: Waiting to join or form a Linux-cluster
> Oct 19 11:22:11 dev kernel: CMAN: sending membership request
> Oct 19 11:22:11 dev kernel: CMAN: Cluster membership rejected
> Oct 19 11:22:12 dev clvmd: Can't open cluster manager socket: Network is down
> Oct 19 11:22:16 dev ccsd[4478]: Unable to connect to cluster infrastructure 
> after 30 seconds.
> Oct 19 11:22:46 dev ccsd[4478]: Unable to connect to cluster infrastructure 
> after 60 seconds.
> [root at dev new]#

I'll repeat this information, because it's all that actually matters. If a
node is being reject by cman that IS another node that is rejecting that node
and it WILL tell syslog why (unless you have kernel logging switched off for
some reason).

 - mismatching cluster.conf version numbers
 - mismatch cluster names
 - mismatch cluster number (a hash of the name)
 - node has the wrong node ID (ie it joined with the same
   name and a different node ID or vice versa)
 - protocol version differs (or other software mismatch - there are several
   error messages for these but they boil down to the same thing)


Something else you might like to try is changing the port number that this
cluster is using, or changing the cluster name to something totally different.

If you find that things work after doing this then you can be sure there is
another cluster with that name/number on the network. If not, then you need to
double/triple check that the config files really do all match on all nodes.


-- 

patrick


From dbrieck at gmail.com  Mon Oct 23 13:48:34 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Mon, 23 Oct 2006 09:48:34 -0400
Subject: [Linux-cluster] Re: Hard lockups during file transfer to GNBD/GFS
	device
In-Reply-To: <8c1094290610100545j1f91d0a2h519cd9033b439390@mail.gmail.com>
References: <8c1094290609280915o6b6b4962ud0d090e58e5d7fc6@mail.gmail.com>
	<8c1094290609281208i6a5eaf8br70697c6b5d085cf@mail.gmail.com>
	<8c1094290609281227j1303ec11u300932ab8d4953ab@mail.gmail.com>
	<20060928195844.GB25242@redhat.com>
	<8c1094290609290651r62cec5f9n28278d6a81c3e6ef@mail.gmail.com>
	<8c1094290610100545j1f91d0a2h519cd9033b439390@mail.gmail.com>
Message-ID: <8c1094290610230648x7ac8189dy3b2612fb9191125b@mail.gmail.com>

On 10/10/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> On 9/29/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> > On 9/28/06, David Teigland <teigland at redhat.com> wrote:
> > >
> > > Could you try it without multipath?  You have quite a few layers there.
> > > Dave
> > >
> > >
> >
> > Thanks for the response. I unloaded gfs, clvm, gnbd and multipath, the
> > reloaded gnbd, clvm and gfs. It was only talking to one of the gnbd
> > servers and without multipath. Here's the log from this crash. It
> > seems to have more info in it.
> >
> > I'm kinda confused why it still has references to mulitpath though. I
> > unloaded the multipath module so I'm not sure why it's still in there.
> > SNIP
>
> Since I didn't hear back from anyone I decided to try things a little
> differently. Instead of rsyncing on the local machine, I ran the rsync
> from another cluster member who also mounts the same partition I was
> trying to move things too.
>
> So instead of
>
> rsync -a /path/to/files/ /mnt/http/
>
> I used
>
> rsync -a root at 10.1.1.121::/path/to/files/  /mnt/http/
>
> and I didn't have a crash at all. Why would this not cause a problem
> when the first one did? Is this more of an rsync problem maybe? I do
> have 2 NFS exports, could those have been causing the problem?
>
> Thanks.
>

I am now seeing other crashes. Here's one from this weekend. It's huge
and I'm not even sure how to see where things might have gone wrong.
At the time of the crash the system was doing a backup to the gnbd
server (if that helps). I'd also appreciate it if maybe someone could
explain how to read this as far as where the problem actually started.

Thanks again.

Oct 22 00:17:56 db1 kernel: do_IRQ: stack overflow: 476
Oct 22 00:17:56 db1 kernel:  [<02107c6b>] do_IRQ+0x49/0x1ae
Oct 22 00:17:56 db1 kernel:  [<022cb5ea>] _read_unlock_bh+0x3/0x8
Oct 22 00:17:56 db1 kernel:  [<f8fea2bd>] do_bindings+0x9d/0x1e3 [iptable_nat]
Oct 22 00:17:56 db1 kernel:  [<f8fe91c6>] ip_nat_fn+0x1c6/0x1d4 [iptable_nat]
Oct 22 00:17:56 db1 kernel:  [<0228227b>] nf_iterate+0x40/0x81
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<02282581>] nf_hook_slow+0x47/0xbc
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<022929b7>] ip_finish_output+0x1c5/0x1ce
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
Oct 22 00:17:56 db1 kernel:  [<0211df36>] find_busiest_group+0xdd/0x295
Oct 22 00:17:56 db1 kernel:  [<022a62a7>] tcp_v4_send_check+0x7c/0xb9
Oct 22 00:17:56 db1 kernel:  [<022a10b1>] tcp_transmit_skb+0x6bf/0x7d2
Oct 22 00:17:56 db1 kernel:  [<022a19be>] tcp_write_xmit+0x74/0x254
Oct 22 00:17:56 db1 kernel:  [<0229ef8e>] __tcp_data_snd_check+0x3d/0xaa
Oct 22 00:17:56 db1 kernel:  [<0229f471>] tcp_rcv_established+0x166/0x725
Oct 22 00:17:56 db1 kernel:  [<022a72ce>] tcp_v4_do_rcv+0x1b/0xe9
Oct 22 00:17:56 db1 kernel:  [<02274c39>] __release_sock+0x39/0x55
Oct 22 00:17:56 db1 kernel:  [<022751b8>] release_sock+0x1f/0x4f
Oct 22 00:17:56 db1 kernel:  [<02277e42>] sk_stream_wait_memory+0x113/0x1a8
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<02297ca1>] tcp_sendmsg+0xb5c/0xe4d
Oct 22 00:17:56 db1 kernel:  [<022b2f5d>] inet_sendmsg+0x38/0x42
Oct 22 00:17:56 db1 kernel:  [<0227221f>] sock_sendmsg+0xdb/0xf7
Oct 22 00:17:56 db1 kernel:  [<02286b85>] qdisc_restart+0x12/0x1b9
Oct 22 00:17:56 db1 kernel:  [<f88df37d>]
bond_dev_queue_xmit+0x1f3/0x1fb [bonding]
Oct 22 00:17:56 db1 kernel:  [<f88e29bb>]
bond_xmit_roundrobin+0xdb/0xe3 [bonding]
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<f89e95c8>] sock_xmit+0x11d/0x217 [gnbd]
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
Oct 22 00:17:56 db1 kernel:  [<02141db2>] mempool_alloc+0x7b/0x135
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<f89e9938>] __gnbd_send_req+0x276/0x2e7 [gnbd]
Oct 22 00:17:56 db1 kernel:  [<f89e9dc0>] do_gnbd_request+0xfa/0x143 [gnbd]
Oct 22 00:17:56 db1 kernel:  [<02220380>] __generic_unplug_device+0x2b/0x2d
Oct 22 00:17:56 db1 kernel:  [<02220397>] generic_unplug_device+0x15/0x21
Oct 22 00:17:56 db1 kernel:  [<02220992>] get_request_wait+0x51/0xb9
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<022213c1>] __make_request+0x2bf/0x46c
Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:56 db1 kernel:  [<f88f623c>] multipath_map+0x64/0x7f [dm_multipath]
Oct 22 00:17:56 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:56 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
Oct 22 00:17:56 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:56 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
Oct 22 00:17:56 db1 kernel:  [<f8f2c3c1>] diaper_make_request+0x8b/0x91 [gfs]
Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<022217d6>] submit_bio+0xca/0xd2
Oct 22 00:17:56 db1 kernel:  [<0215e681>] bio_alloc+0x100/0x168
Oct 22 00:17:56 db1 kernel:  [<0215e038>] submit_bh+0x141/0x166
Oct 22 00:17:56 db1 kernel:  [<0215e0b9>] ll_rw_block+0x5c/0x72
Oct 22 00:17:56 db1 kernel:  [<f8f2d658>] gfs_dreread+0x65/0xd6 [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f2d5a4>] gfs_dread+0x20/0x36 [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f2e4ec>] gfs_get_meta_buffer+0xb1/0x250 [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f2a647>] gfs_block_map+0x242/0x3de [gfs]
Oct 22 00:17:56 db1 kernel:  [<022cad75>] __cond_resched+0x14/0x39
Oct 22 00:17:56 db1 kernel:  [<f8f49cd6>] get_block+0x2a/0x6d [gfs]
Oct 22 00:17:56 db1 kernel:  [<0215cd70>] __block_prepare_write+0x165/0x3ec
Oct 22 00:17:56 db1 kernel:  [<0215d682>] block_prepare_write+0x16/0x23
Oct 22 00:17:56 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4a258>] gfs_prepare_write+0x122/0x135 [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
Oct 22 00:17:56 db1 kernel:  [<02140ec8>]
generic_file_buffered_write+0x218/0x533
Oct 22 00:17:56 db1 kernel:  [<02126347>] current_fs_time+0x44/0x4c
Oct 22 00:17:56 db1 kernel:  [<0214156c>]
__generic_file_aio_write_nolock+0x389/0x3b7
Oct 22 00:17:56 db1 kernel:  [<021415d3>]
generic_file_aio_write_nolock+0x39/0x7f
Oct 22 00:17:56 db1 kernel:  [<02141736>] generic_file_write_nolock+0x84/0x99
Oct 22 00:17:56 db1 kernel:  [<f8f38055>] gfs_glock_nq+0xe3/0x116 [gfs]
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<f8f58bac>] gfs_trans_begin_i+0xfd/0x15a [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4c0fc>] do_do_write_buf+0x2a6/0x452 [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4c3c3>] do_write_buf+0x11b/0x15e [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4b31c>] walk_vm+0xd7/0x100 [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4c4a7>] __gfs_write+0xa1/0xbb [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4c2a8>] do_write_buf+0x0/0x15e [gfs]
Oct 22 00:17:56 db1 kernel:  [<f8f4c4cc>] gfs_write+0xb/0xe [gfs]
Oct 22 00:17:56 db1 kernel:  [<0215a52f>] vfs_write+0xb6/0xe2
Oct 22 00:17:56 db1 kernel:  [<0215a5f9>] sys_write+0x3c/0x62
Oct 22 00:17:56 db1 kernel:  =======================
Oct 22 00:17:56 db1 kernel: Unable to handle kernel paging request at
virtual address 000104e6
Oct 22 00:17:56 db1 kernel:  printing eip:
Oct 22 00:17:56 db1 kernel: 02105fc0
Oct 22 00:17:56 db1 kernel: *pde = 00004001
Oct 22 00:17:56 db1 kernel: Oops: 0000 [#1]
Oct 22 00:17:56 db1 kernel: SMP
Oct 22 00:17:56 db1 kernel: Modules linked in: ipt_REDIRECT
iptable_nat ip_conntrack iptable_mangle iptable_filter ip_tables
mptctl mptbase dell_rbu parport_
pc lp parport autofs4 i2c_dev i2c_core lock_dlm(U) gfs(U)
lock_harness(U) dm_round_robin gnbd(U) dlm(U) cman(U) sunrpc
ipmi_devintf ipmi_si ipmi_msghandler m
d5 ipv6 dm_mirror dm_multipath button battery ac joydev uhci_hcd
ehci_hcd hw_random e1000 bonding(U) floppy sg ext3 jbd dm_mod
megaraid_mbox megaraid_mm sd_m
od scsi_mod
Oct 22 00:17:56 db1 kernel: CPU:    35683031
Oct 22 00:17:56 db1 kernel: EIP:    0060:[<02105fc0>]    Not tainted VLI
Oct 22 00:17:56 db1 kernel: EFLAGS: 00010093   (2.6.9-42.0.2.ELhugemem)
Oct 22 00:17:56 db1 kernel: EIP is at show_trace+0x11/0x6b
Oct 22 00:17:56 db1 kernel: eax: 00010ffd   ebx: 000104e6   ecx:
a9dae200   edx: 022d65b8
Oct 22 00:17:56 db1 kernel: esi: 000104e6   edi: 00010000   ebp:
00000000   esp: a9dae200
Oct 22 00:17:56 db1 kernel: ds: 007b   es: 007b   ss: 0068
Oct 22 00:17:56 db1 kernel: Process  (pid: 122215904,
threadinfo=a9dae000 task=a9dae094)
Oct 22 00:17:56 db1 kernel: Stack: a9dae000 00005480 02385480 021060a4
0000001c 02107c6b 022d6d40 000001dc
Oct 22 00:17:56 db1 kernel:        c4dbbea0 000000a9 00000000 c4dbbe00
c4dbbea0 00000000 fffecca1 00000000
Oct 22 00:17:56 db1 kernel:        c4dbbea0 f8fee558 c4dbbe00 c4dbbea0
00000000 f8fee558 c4db007b 0000007b
Oct 22 00:17:56 db1 kernel: Call Trace:
Oct 22 00:17:56 db1 kernel:  [<021060a4>] dump_stack+0x11/0x13
Oct 22 00:17:56 db1 kernel:  [<02107c6b>] do_IRQ+0x49/0x1ae
Oct 22 00:17:56 db1 kernel:  [<022cb5ea>] _read_unlock_bh+0x3/0x8
Oct 22 00:17:56 db1 kernel:  [<f8fea2bd>] do_bindings+0x9d/0x1e3 [iptable_nat]
Oct 22 00:17:56 db1 kernel:  [<f8fe91c6>] ip_nat_fn+0x1c6/0x1d4 [iptable_nat]
Oct 22 00:17:56 db1 kernel:  [<0228227b>] nf_iterate+0x40/0x81
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<02282581>] nf_hook_slow+0x47/0xbc
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<022929b7>] ip_finish_output+0x1c5/0x1ce
Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
Oct 22 00:17:56 db1 kernel:  [<0211df36>] find_busiest_group+0xdd/0x295
Oct 22 00:17:56 db1 kernel:  [<022a62a7>] tcp_v4_send_check+0x7c/0xb9
Oct 22 00:17:56 db1 kernel:  [<022a10b1>] tcp_transmit_skb+0x6bf/0x7d2
Oct 22 00:17:56 db1 kernel:  [<022a19be>] tcp_write_xmit+0x74/0x254
Oct 22 00:17:56 db1 kernel:  [<0229ef8e>] __tcp_data_snd_check+0x3d/0xaa
Oct 22 00:17:56 db1 kernel:  [<0229f471>] tcp_rcv_established+0x166/0x725
Oct 22 00:17:56 db1 kernel:  [<022a72ce>] tcp_v4_do_rcv+0x1b/0xe9
Oct 22 00:17:56 db1 kernel:  [<02274c39>] __release_sock+0x39/0x55
Oct 22 00:17:56 db1 kernel:  [<022751b8>] release_sock+0x1f/0x4f
Oct 22 00:17:56 db1 kernel:  [<02277e42>] sk_stream_wait_memory+0x113/0x1a8
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:56 db1 kernel:  [<02297ca1>] tcp_sendmsg+0xb5c/0xe4d
Oct 22 00:17:57 db1 kernel:  [<022b2f5d>] inet_sendmsg+0x38/0x42
Oct 22 00:17:57 db1 kernel:  [<0227221f>] sock_sendmsg+0xdb/0xf7
Oct 22 00:17:57 db1 kernel:  [<02286b85>] qdisc_restart+0x12/0x1b9
Oct 22 00:17:57 db1 kernel:  [<f88df37d>]
bond_dev_queue_xmit+0x1f3/0x1fb [bonding]
Oct 22 00:17:57 db1 kernel:  [<f88e29bb>]
bond_xmit_roundrobin+0xdb/0xe3 [bonding]
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<f89e95c8>] sock_xmit+0x11d/0x217 [gnbd]
Oct 22 00:17:57 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
Oct 22 00:17:57 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
Oct 22 00:17:57 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
Oct 22 00:17:57 db1 kernel:  [<02141db2>] mempool_alloc+0x7b/0x135
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<f89e9938>] __gnbd_send_req+0x276/0x2e7 [gnbd]
Oct 22 00:17:57 db1 kernel:  [<f89e9dc0>] do_gnbd_request+0xfa/0x143 [gnbd]
Oct 22 00:17:57 db1 kernel:  [<02220380>] __generic_unplug_device+0x2b/0x2d
Oct 22 00:17:57 db1 kernel:  [<02220397>] generic_unplug_device+0x15/0x21
Oct 22 00:17:57 db1 kernel:  [<02220992>] get_request_wait+0x51/0xb9
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<022213c1>] __make_request+0x2bf/0x46c
Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:57 db1 kernel:  [<f88f623c>] multipath_map+0x64/0x7f [dm_multipath]
Oct 22 00:17:57 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:57 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
Oct 22 00:17:57 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:57 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
Oct 22 00:17:57 db1 kernel:  [<f8f2c3c1>] diaper_make_request+0x8b/0x91 [gfs]
Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<022217d6>] submit_bio+0xca/0xd2
Oct 22 00:17:57 db1 kernel:  [<0215e681>] bio_alloc+0x100/0x168
Oct 22 00:17:57 db1 kernel:  [<0215e038>] submit_bh+0x141/0x166
Oct 22 00:17:57 db1 kernel:  [<0215e0b9>] ll_rw_block+0x5c/0x72
Oct 22 00:17:57 db1 kernel:  [<f8f2d658>] gfs_dreread+0x65/0xd6 [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f2d5a4>] gfs_dread+0x20/0x36 [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f2e4ec>] gfs_get_meta_buffer+0xb1/0x250 [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f2a647>] gfs_block_map+0x242/0x3de [gfs]
Oct 22 00:17:57 db1 kernel:  [<022cad75>] __cond_resched+0x14/0x39
Oct 22 00:17:57 db1 kernel:  [<f8f49cd6>] get_block+0x2a/0x6d [gfs]
Oct 22 00:17:57 db1 kernel:  [<0215cd70>] __block_prepare_write+0x165/0x3ec
Oct 22 00:17:57 db1 kernel:  [<0215d682>] block_prepare_write+0x16/0x23
Oct 22 00:17:57 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4a258>] gfs_prepare_write+0x122/0x135 [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
Oct 22 00:17:57 db1 kernel:  [<02140ec8>]
generic_file_buffered_write+0x218/0x533
Oct 22 00:17:57 db1 kernel:  [<02126347>] current_fs_time+0x44/0x4c
Oct 22 00:17:57 db1 kernel:  [<0214156c>]
__generic_file_aio_write_nolock+0x389/0x3b7
Oct 22 00:17:57 db1 kernel:  [<021415d3>]
generic_file_aio_write_nolock+0x39/0x7f
Oct 22 00:17:57 db1 kernel:  [<02141736>] generic_file_write_nolock+0x84/0x99
Oct 22 00:17:57 db1 kernel:  [<f8f38055>] gfs_glock_nq+0xe3/0x116 [gfs]
Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
Oct 22 00:17:57 db1 kernel:  [<f8f58bac>] gfs_trans_begin_i+0xfd/0x15a [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4c0fc>] do_do_write_buf+0x2a6/0x452 [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4c3c3>] do_write_buf+0x11b/0x15e [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4b31c>] walk_vm+0xd7/0x100 [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4c4a7>] __gfs_write+0xa1/0xbb [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4c2a8>] do_write_buf+0x0/0x15e [gfs]
Oct 22 00:17:57 db1 kernel:  [<f8f4c4cc>] gfs_write+0xb/0xe [gfs]
Oct 22 00:17:57 db1 kernel:  [<0215a52f>] vfs_write+0xb6/0xe2
Oct 22 00:17:57 db1 kernel:  [<0215a5f9>] sys_write+0x3c/0x62
Oct 22 00:17:57 db1 kernel:  =======================
Oct 22 00:17:57 db1 kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000001
Oct 22 00:17:57 db1 kernel:  printing eip:
Oct 22 00:17:57 db1 kernel: 02105fc0
Oct 22 00:17:57 db1 kernel: *pde = 00004001


From isplist at logicore.net  Mon Oct 23 14:06:55 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Mon, 23 Oct 2006 09:06:55 -0500
Subject: [Linux-cluster] Problem starting cluster
In-Reply-To: <453C77D0.5060201@redhat.com>
Message-ID: <200610239655.593873@leena>

>> I'm not seeing any of these...
> 
> of what?

Sorry, I meant I'm not finding any of the specific things you mentioned.

> node is being reject by cman that IS another node that is rejecting that
> node and it WILL tell syslog why (unless you have kernel logging switched 

> - mismatching cluster.conf version numbers
> - mismatch cluster names
> - mismatch cluster number (a hash of the name)
> - node has the wrong node ID (ie it joined with the same
> name and a different node ID or vice versa)
> - protocol version differs (or other software mismatch - there are several
> error messages for these but they boil down to the same thing)

Right, what I meant was that none of these things seemed to be the reason. I 
was not able to find the reasons. I shut down every machine in the cluster and 
started one back up, one at a time, unblocked three of 8 to get them all up 
and running again and since then, the cluster has been fine. I'm stumped since 
I didn't find any anomalies between cluster.conf files and other things 
mentioned.

> Something else you might like to try is changing the port number that this
> cluster is using, or changing the cluster name to something totally
> different.

Yes, I had also tried this, not the port number but the name.
 
The cluster is back now, it was related to rejections somewhere but I was not 
able to find where. Now I need to figure out why I can't see the FC fabric.

Mike


From bmarzins at redhat.com  Mon Oct 23 17:19:47 2006
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Mon, 23 Oct 2006 12:19:47 -0500
Subject: [Linux-cluster] Re: Hard lockups during file transfer to GNBD/GFS
	device
In-Reply-To: <8c1094290610230648x7ac8189dy3b2612fb9191125b@mail.gmail.com>
References: <8c1094290609280915o6b6b4962ud0d090e58e5d7fc6@mail.gmail.com>
	<8c1094290609281208i6a5eaf8br70697c6b5d085cf@mail.gmail.com>
	<8c1094290609281227j1303ec11u300932ab8d4953ab@mail.gmail.com>
	<20060928195844.GB25242@redhat.com>
	<8c1094290609290651r62cec5f9n28278d6a81c3e6ef@mail.gmail.com>
	<8c1094290610100545j1f91d0a2h519cd9033b439390@mail.gmail.com>
	<8c1094290610230648x7ac8189dy3b2612fb9191125b@mail.gmail.com>
Message-ID: <20061023171946.GA23495@ether.msp.redhat.com>

On Mon, Oct 23, 2006 at 09:48:34AM -0400, David Brieck Jr. wrote:
> On 10/10/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> >On 9/29/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> >> On 9/28/06, David Teigland <teigland at redhat.com> wrote:
> >> >
> >> > Could you try it without multipath?  You have quite a few layers there.
> >> > Dave
> >> >
> >> >
> >>
> >> Thanks for the response. I unloaded gfs, clvm, gnbd and multipath, the
> >> reloaded gnbd, clvm and gfs. It was only talking to one of the gnbd
> >> servers and without multipath. Here's the log from this crash. It
> >> seems to have more info in it.
> >>
> >> I'm kinda confused why it still has references to mulitpath though. I
> >> unloaded the multipath module so I'm not sure why it's still in there.
> >> SNIP
> >
> >Since I didn't hear back from anyone I decided to try things a little
> >differently. Instead of rsyncing on the local machine, I ran the rsync
> >from another cluster member who also mounts the same partition I was
> >trying to move things too.
> >
> >So instead of
> >
> >rsync -a /path/to/files/ /mnt/http/
> >
> >I used
> >
> >rsync -a root at 10.1.1.121::/path/to/files/  /mnt/http/
> >
> >and I didn't have a crash at all. Why would this not cause a problem
> >when the first one did? Is this more of an rsync problem maybe? I do
> >have 2 NFS exports, could those have been causing the problem?
> >
> >Thanks.
> >
> 
> I am now seeing other crashes. Here's one from this weekend. It's huge
> and I'm not even sure how to see where things might have gone wrong.
> At the time of the crash the system was doing a backup to the gnbd
> server (if that helps). I'd also appreciate it if maybe someone could
> explain how to read this as far as where the problem actually started.

I'd bet that the problem started exactly where the crash says it did,
with a stack overflow. This is with running gfs on top of clvm on top of
multipath on to of gnbd, right?  So for IO to complete, it needs to go through
gfs, device-mapper, and gnbd functions. It's possible that some functions in
those modules aren't incredibly efficient with their stack space usage (A
number of functions like this have been found and fixed in GFS over the years).
Since it's a pretty long function stack, it would take too much waste in
a couple of functions to put this over the edge.

Which means that this setup probably needs some testing, and functions need
auditing.

Just to make sure of some things:

This is not a crash on a gnbd server node. correct? More specifically, you
aren't running a gnbd client and server on the same node for the same device.
That is bad. On a gnbd server machine, you cannot gnbd_import the devices that
you just gnbd_exported from that machine, and there's no reason to anyway.

Is anything else running that would effect the gfs file system on this node?
You mentioned NFS earlier. Are the gnbd client machines using NFS to serve
up the gfs file system?

Are you running bonded ethernet? There's nothing wrong with that, it just adds
more functions to the stack. That is an amazingly ugly stack trace, and I'm
trying to figure out what all is on there.

> Thanks again.
> 
> Oct 22 00:17:56 db1 kernel: do_IRQ: stack overflow: 476
> Oct 22 00:17:56 db1 kernel:  [<02107c6b>] do_IRQ+0x49/0x1ae
> Oct 22 00:17:56 db1 kernel:  [<022cb5ea>] _read_unlock_bh+0x3/0x8
> Oct 22 00:17:56 db1 kernel:  [<f8fea2bd>] do_bindings+0x9d/0x1e3 
> [iptable_nat]
> Oct 22 00:17:56 db1 kernel:  [<f8fe91c6>] ip_nat_fn+0x1c6/0x1d4 
> [iptable_nat]
> Oct 22 00:17:56 db1 kernel:  [<0228227b>] nf_iterate+0x40/0x81
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<02282581>] nf_hook_slow+0x47/0xbc
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<022929b7>] ip_finish_output+0x1c5/0x1ce
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
> Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
> Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
> Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
> Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
> Oct 22 00:17:56 db1 kernel:  [<0211df36>] find_busiest_group+0xdd/0x295
> Oct 22 00:17:56 db1 kernel:  [<022a62a7>] tcp_v4_send_check+0x7c/0xb9
> Oct 22 00:17:56 db1 kernel:  [<022a10b1>] tcp_transmit_skb+0x6bf/0x7d2
> Oct 22 00:17:56 db1 kernel:  [<022a19be>] tcp_write_xmit+0x74/0x254
> Oct 22 00:17:56 db1 kernel:  [<0229ef8e>] __tcp_data_snd_check+0x3d/0xaa
> Oct 22 00:17:56 db1 kernel:  [<0229f471>] tcp_rcv_established+0x166/0x725
> Oct 22 00:17:56 db1 kernel:  [<022a72ce>] tcp_v4_do_rcv+0x1b/0xe9
> Oct 22 00:17:56 db1 kernel:  [<02274c39>] __release_sock+0x39/0x55
> Oct 22 00:17:56 db1 kernel:  [<022751b8>] release_sock+0x1f/0x4f
> Oct 22 00:17:56 db1 kernel:  [<02277e42>] sk_stream_wait_memory+0x113/0x1a8
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<02297ca1>] tcp_sendmsg+0xb5c/0xe4d
> Oct 22 00:17:56 db1 kernel:  [<022b2f5d>] inet_sendmsg+0x38/0x42
> Oct 22 00:17:56 db1 kernel:  [<0227221f>] sock_sendmsg+0xdb/0xf7
> Oct 22 00:17:56 db1 kernel:  [<02286b85>] qdisc_restart+0x12/0x1b9
> Oct 22 00:17:56 db1 kernel:  [<f88df37d>]
> bond_dev_queue_xmit+0x1f3/0x1fb [bonding]
> Oct 22 00:17:56 db1 kernel:  [<f88e29bb>]
> bond_xmit_roundrobin+0xdb/0xe3 [bonding]
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<f89e95c8>] sock_xmit+0x11d/0x217 [gnbd]
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
> Oct 22 00:17:56 db1 kernel:  [<02141db2>] mempool_alloc+0x7b/0x135
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<f89e9938>] __gnbd_send_req+0x276/0x2e7 [gnbd]
> Oct 22 00:17:56 db1 kernel:  [<f89e9dc0>] do_gnbd_request+0xfa/0x143 [gnbd]
> Oct 22 00:17:56 db1 kernel:  [<02220380>] __generic_unplug_device+0x2b/0x2d
> Oct 22 00:17:56 db1 kernel:  [<02220397>] generic_unplug_device+0x15/0x21
> Oct 22 00:17:56 db1 kernel:  [<02220992>] get_request_wait+0x51/0xb9
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<022213c1>] __make_request+0x2bf/0x46c
> Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:56 db1 kernel:  [<f88f623c>] multipath_map+0x64/0x7f 
> [dm_multipath]
> Oct 22 00:17:56 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 
> [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:56 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
> Oct 22 00:17:56 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 
> [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
> Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:56 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
> Oct 22 00:17:56 db1 kernel:  [<f8f2c3c1>] diaper_make_request+0x8b/0x91 
> [gfs]
> Oct 22 00:17:56 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<022217d6>] submit_bio+0xca/0xd2
> Oct 22 00:17:56 db1 kernel:  [<0215e681>] bio_alloc+0x100/0x168
> Oct 22 00:17:56 db1 kernel:  [<0215e038>] submit_bh+0x141/0x166
> Oct 22 00:17:56 db1 kernel:  [<0215e0b9>] ll_rw_block+0x5c/0x72
> Oct 22 00:17:56 db1 kernel:  [<f8f2d658>] gfs_dreread+0x65/0xd6 [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f2d5a4>] gfs_dread+0x20/0x36 [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f2e4ec>] gfs_get_meta_buffer+0xb1/0x250 
> [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f2a647>] gfs_block_map+0x242/0x3de [gfs]
> Oct 22 00:17:56 db1 kernel:  [<022cad75>] __cond_resched+0x14/0x39
> Oct 22 00:17:56 db1 kernel:  [<f8f49cd6>] get_block+0x2a/0x6d [gfs]
> Oct 22 00:17:56 db1 kernel:  [<0215cd70>] __block_prepare_write+0x165/0x3ec
> Oct 22 00:17:56 db1 kernel:  [<0215d682>] block_prepare_write+0x16/0x23
> Oct 22 00:17:56 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4a258>] gfs_prepare_write+0x122/0x135 
> [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
> Oct 22 00:17:56 db1 kernel:  [<02140ec8>]
> generic_file_buffered_write+0x218/0x533
> Oct 22 00:17:56 db1 kernel:  [<02126347>] current_fs_time+0x44/0x4c
> Oct 22 00:17:56 db1 kernel:  [<0214156c>]
> __generic_file_aio_write_nolock+0x389/0x3b7
> Oct 22 00:17:56 db1 kernel:  [<021415d3>]
> generic_file_aio_write_nolock+0x39/0x7f
> Oct 22 00:17:56 db1 kernel:  [<02141736>] 
> generic_file_write_nolock+0x84/0x99
> Oct 22 00:17:56 db1 kernel:  [<f8f38055>] gfs_glock_nq+0xe3/0x116 [gfs]
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<f8f58bac>] gfs_trans_begin_i+0xfd/0x15a [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4c0fc>] do_do_write_buf+0x2a6/0x452 [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4c3c3>] do_write_buf+0x11b/0x15e [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4b31c>] walk_vm+0xd7/0x100 [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4c4a7>] __gfs_write+0xa1/0xbb [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4c2a8>] do_write_buf+0x0/0x15e [gfs]
> Oct 22 00:17:56 db1 kernel:  [<f8f4c4cc>] gfs_write+0xb/0xe [gfs]
> Oct 22 00:17:56 db1 kernel:  [<0215a52f>] vfs_write+0xb6/0xe2
> Oct 22 00:17:56 db1 kernel:  [<0215a5f9>] sys_write+0x3c/0x62
> Oct 22 00:17:56 db1 kernel:  =======================
> Oct 22 00:17:56 db1 kernel: Unable to handle kernel paging request at
> virtual address 000104e6
> Oct 22 00:17:56 db1 kernel:  printing eip:
> Oct 22 00:17:56 db1 kernel: 02105fc0
> Oct 22 00:17:56 db1 kernel: *pde = 00004001
> Oct 22 00:17:56 db1 kernel: Oops: 0000 [#1]
> Oct 22 00:17:56 db1 kernel: SMP
> Oct 22 00:17:56 db1 kernel: Modules linked in: ipt_REDIRECT
> iptable_nat ip_conntrack iptable_mangle iptable_filter ip_tables
> mptctl mptbase dell_rbu parport_
> pc lp parport autofs4 i2c_dev i2c_core lock_dlm(U) gfs(U)
> lock_harness(U) dm_round_robin gnbd(U) dlm(U) cman(U) sunrpc
> ipmi_devintf ipmi_si ipmi_msghandler m
> d5 ipv6 dm_mirror dm_multipath button battery ac joydev uhci_hcd
> ehci_hcd hw_random e1000 bonding(U) floppy sg ext3 jbd dm_mod
> megaraid_mbox megaraid_mm sd_m
> od scsi_mod
> Oct 22 00:17:56 db1 kernel: CPU:    35683031
> Oct 22 00:17:56 db1 kernel: EIP:    0060:[<02105fc0>]    Not tainted VLI
> Oct 22 00:17:56 db1 kernel: EFLAGS: 00010093   (2.6.9-42.0.2.ELhugemem)
> Oct 22 00:17:56 db1 kernel: EIP is at show_trace+0x11/0x6b
> Oct 22 00:17:56 db1 kernel: eax: 00010ffd   ebx: 000104e6   ecx:
> a9dae200   edx: 022d65b8
> Oct 22 00:17:56 db1 kernel: esi: 000104e6   edi: 00010000   ebp:
> 00000000   esp: a9dae200
> Oct 22 00:17:56 db1 kernel: ds: 007b   es: 007b   ss: 0068
> Oct 22 00:17:56 db1 kernel: Process  (pid: 122215904,
> threadinfo=a9dae000 task=a9dae094)
> Oct 22 00:17:56 db1 kernel: Stack: a9dae000 00005480 02385480 021060a4
> 0000001c 02107c6b 022d6d40 000001dc
> Oct 22 00:17:56 db1 kernel:        c4dbbea0 000000a9 00000000 c4dbbe00
> c4dbbea0 00000000 fffecca1 00000000
> Oct 22 00:17:56 db1 kernel:        c4dbbea0 f8fee558 c4dbbe00 c4dbbea0
> 00000000 f8fee558 c4db007b 0000007b
> Oct 22 00:17:56 db1 kernel: Call Trace:
> Oct 22 00:17:56 db1 kernel:  [<021060a4>] dump_stack+0x11/0x13
> Oct 22 00:17:56 db1 kernel:  [<02107c6b>] do_IRQ+0x49/0x1ae
> Oct 22 00:17:56 db1 kernel:  [<022cb5ea>] _read_unlock_bh+0x3/0x8
> Oct 22 00:17:56 db1 kernel:  [<f8fea2bd>] do_bindings+0x9d/0x1e3 
> [iptable_nat]
> Oct 22 00:17:56 db1 kernel:  [<f8fe91c6>] ip_nat_fn+0x1c6/0x1d4 
> [iptable_nat]
> Oct 22 00:17:56 db1 kernel:  [<0228227b>] nf_iterate+0x40/0x81
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<02282581>] nf_hook_slow+0x47/0xbc
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<022929b7>] ip_finish_output+0x1c5/0x1ce
> Oct 22 00:17:56 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
> Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
> Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
> Oct 22 00:17:56 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
> Oct 22 00:17:56 db1 kernel:  [<02293093>] ip_queue_xmit+0x395/0x3f9
> Oct 22 00:17:56 db1 kernel:  [<022927d8>] dst_output+0x0/0x1a
> Oct 22 00:17:56 db1 kernel:  [<0211df36>] find_busiest_group+0xdd/0x295
> Oct 22 00:17:56 db1 kernel:  [<022a62a7>] tcp_v4_send_check+0x7c/0xb9
> Oct 22 00:17:56 db1 kernel:  [<022a10b1>] tcp_transmit_skb+0x6bf/0x7d2
> Oct 22 00:17:56 db1 kernel:  [<022a19be>] tcp_write_xmit+0x74/0x254
> Oct 22 00:17:56 db1 kernel:  [<0229ef8e>] __tcp_data_snd_check+0x3d/0xaa
> Oct 22 00:17:56 db1 kernel:  [<0229f471>] tcp_rcv_established+0x166/0x725
> Oct 22 00:17:56 db1 kernel:  [<022a72ce>] tcp_v4_do_rcv+0x1b/0xe9
> Oct 22 00:17:56 db1 kernel:  [<02274c39>] __release_sock+0x39/0x55
> Oct 22 00:17:56 db1 kernel:  [<022751b8>] release_sock+0x1f/0x4f
> Oct 22 00:17:56 db1 kernel:  [<02277e42>] sk_stream_wait_memory+0x113/0x1a8
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:56 db1 kernel:  [<02297ca1>] tcp_sendmsg+0xb5c/0xe4d
> Oct 22 00:17:57 db1 kernel:  [<022b2f5d>] inet_sendmsg+0x38/0x42
> Oct 22 00:17:57 db1 kernel:  [<0227221f>] sock_sendmsg+0xdb/0xf7
> Oct 22 00:17:57 db1 kernel:  [<02286b85>] qdisc_restart+0x12/0x1b9
> Oct 22 00:17:57 db1 kernel:  [<f88df37d>]
> bond_dev_queue_xmit+0x1f3/0x1fb [bonding]
> Oct 22 00:17:57 db1 kernel:  [<f88e29bb>]
> bond_xmit_roundrobin+0xdb/0xe3 [bonding]
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<f89e95c8>] sock_xmit+0x11d/0x217 [gnbd]
> Oct 22 00:17:57 db1 kernel:  [<022929c0>] ip_finish_output2+0x0/0x18d
> Oct 22 00:17:57 db1 kernel:  [<022927e7>] dst_output+0xf/0x1a
> Oct 22 00:17:57 db1 kernel:  [<022825bd>] nf_hook_slow+0x83/0xbc
> Oct 22 00:17:57 db1 kernel:  [<02141db2>] mempool_alloc+0x7b/0x135
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<f89e9938>] __gnbd_send_req+0x276/0x2e7 [gnbd]
> Oct 22 00:17:57 db1 kernel:  [<f89e9dc0>] do_gnbd_request+0xfa/0x143 [gnbd]
> Oct 22 00:17:57 db1 kernel:  [<02220380>] __generic_unplug_device+0x2b/0x2d
> Oct 22 00:17:57 db1 kernel:  [<02220397>] generic_unplug_device+0x15/0x21
> Oct 22 00:17:57 db1 kernel:  [<02220992>] get_request_wait+0x51/0xb9
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<022213c1>] __make_request+0x2bf/0x46c
> Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:57 db1 kernel:  [<f88f623c>] multipath_map+0x64/0x7f 
> [dm_multipath]
> Oct 22 00:17:57 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 
> [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:57 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
> Oct 22 00:17:57 db1 kernel:  [<f88693ca>] __map_bio+0x35/0xb0 [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<f88695cd>] __clone_and_map+0xc2/0x2f0 
> [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<f88698a2>] __split_bio+0xa7/0x103 [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<f8869a19>] dm_request+0x11b/0x130 [dm_mod]
> Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:57 db1 kernel:  [<0215e848>] bio_clone+0x84/0x9c
> Oct 22 00:17:57 db1 kernel:  [<f8f2c3c1>] diaper_make_request+0x8b/0x91 
> [gfs]
> Oct 22 00:17:57 db1 kernel:  [<022216fc>] generic_make_request+0x18e/0x19e
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<022217d6>] submit_bio+0xca/0xd2
> Oct 22 00:17:57 db1 kernel:  [<0215e681>] bio_alloc+0x100/0x168
> Oct 22 00:17:57 db1 kernel:  [<0215e038>] submit_bh+0x141/0x166
> Oct 22 00:17:57 db1 kernel:  [<0215e0b9>] ll_rw_block+0x5c/0x72
> Oct 22 00:17:57 db1 kernel:  [<f8f2d658>] gfs_dreread+0x65/0xd6 [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f2d5a4>] gfs_dread+0x20/0x36 [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f2e4ec>] gfs_get_meta_buffer+0xb1/0x250 
> [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f2a647>] gfs_block_map+0x242/0x3de [gfs]
> Oct 22 00:17:57 db1 kernel:  [<022cad75>] __cond_resched+0x14/0x39
> Oct 22 00:17:57 db1 kernel:  [<f8f49cd6>] get_block+0x2a/0x6d [gfs]
> Oct 22 00:17:57 db1 kernel:  [<0215cd70>] __block_prepare_write+0x165/0x3ec
> Oct 22 00:17:57 db1 kernel:  [<0215d682>] block_prepare_write+0x16/0x23
> Oct 22 00:17:57 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4a258>] gfs_prepare_write+0x122/0x135 
> [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f49cac>] get_block+0x0/0x6d [gfs]
> Oct 22 00:17:57 db1 kernel:  [<02140ec8>]
> generic_file_buffered_write+0x218/0x533
> Oct 22 00:17:57 db1 kernel:  [<02126347>] current_fs_time+0x44/0x4c
> Oct 22 00:17:57 db1 kernel:  [<0214156c>]
> __generic_file_aio_write_nolock+0x389/0x3b7
> Oct 22 00:17:57 db1 kernel:  [<021415d3>]
> generic_file_aio_write_nolock+0x39/0x7f
> Oct 22 00:17:57 db1 kernel:  [<02141736>] 
> generic_file_write_nolock+0x84/0x99
> Oct 22 00:17:57 db1 kernel:  [<f8f38055>] gfs_glock_nq+0xe3/0x116 [gfs]
> Oct 22 00:17:57 db1 kernel:  [<021204e9>] autoremove_wake_function+0x0/0x2d
> Oct 22 00:17:57 db1 kernel:  [<f8f58bac>] gfs_trans_begin_i+0xfd/0x15a [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4c0fc>] do_do_write_buf+0x2a6/0x452 [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4c3c3>] do_write_buf+0x11b/0x15e [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4b31c>] walk_vm+0xd7/0x100 [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4c4a7>] __gfs_write+0xa1/0xbb [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4c2a8>] do_write_buf+0x0/0x15e [gfs]
> Oct 22 00:17:57 db1 kernel:  [<f8f4c4cc>] gfs_write+0xb/0xe [gfs]
> Oct 22 00:17:57 db1 kernel:  [<0215a52f>] vfs_write+0xb6/0xe2
> Oct 22 00:17:57 db1 kernel:  [<0215a5f9>] sys_write+0x3c/0x62
> Oct 22 00:17:57 db1 kernel:  =======================
> Oct 22 00:17:57 db1 kernel: Unable to handle kernel NULL pointer
> dereference at virtual address 00000001
> Oct 22 00:17:57 db1 kernel:  printing eip:
> Oct 22 00:17:57 db1 kernel: 02105fc0
> Oct 22 00:17:57 db1 kernel: *pde = 00004001
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From dbrieck at gmail.com  Mon Oct 23 17:32:48 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Mon, 23 Oct 2006 13:32:48 -0400
Subject: [Linux-cluster] Re: Hard lockups during file transfer to GNBD/GFS
	device
In-Reply-To: <20061023171946.GA23495@ether.msp.redhat.com>
References: <8c1094290609280915o6b6b4962ud0d090e58e5d7fc6@mail.gmail.com>
	<8c1094290609281208i6a5eaf8br70697c6b5d085cf@mail.gmail.com>
	<8c1094290609281227j1303ec11u300932ab8d4953ab@mail.gmail.com>
	<20060928195844.GB25242@redhat.com>
	<8c1094290609290651r62cec5f9n28278d6a81c3e6ef@mail.gmail.com>
	<8c1094290610100545j1f91d0a2h519cd9033b439390@mail.gmail.com>
	<8c1094290610230648x7ac8189dy3b2612fb9191125b@mail.gmail.com>
	<20061023171946.GA23495@ether.msp.redhat.com>
Message-ID: <8c1094290610231032w5184472dn749f0bfbc7456624@mail.gmail.com>

On 10/23/06, Benjamin Marzinski <bmarzins at redhat.com> wrote:
> On Mon, Oct 23, 2006 at 09:48:34AM -0400, David Brieck Jr. wrote:
> > On 10/10/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> > >On 9/29/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> > >> On 9/28/06, David Teigland <teigland at redhat.com> wrote:
> > >> >
> > >> > Could you try it without multipath?  You have quite a few layers there.
> > >> > Dave
> > >> >
> > >> >
> > >>
> > >> Thanks for the response. I unloaded gfs, clvm, gnbd and multipath, the
> > >> reloaded gnbd, clvm and gfs. It was only talking to one of the gnbd
> > >> servers and without multipath. Here's the log from this crash. It
> > >> seems to have more info in it.
> > >>
> > >> I'm kinda confused why it still has references to mulitpath though. I
> > >> unloaded the multipath module so I'm not sure why it's still in there.
> > >> SNIP
> > >
> > >Since I didn't hear back from anyone I decided to try things a little
> > >differently. Instead of rsyncing on the local machine, I ran the rsync
> > >from another cluster member who also mounts the same partition I was
> > >trying to move things too.
> > >
> > >So instead of
> > >
> > >rsync -a /path/to/files/ /mnt/http/
> > >
> > >I used
> > >
> > >rsync -a root at 10.1.1.121::/path/to/files/  /mnt/http/
> > >
> > >and I didn't have a crash at all. Why would this not cause a problem
> > >when the first one did? Is this more of an rsync problem maybe? I do
> > >have 2 NFS exports, could those have been causing the problem?
> > >
> > >Thanks.
> > >
> >
> > I am now seeing other crashes. Here's one from this weekend. It's huge
> > and I'm not even sure how to see where things might have gone wrong.
> > At the time of the crash the system was doing a backup to the gnbd
> > server (if that helps). I'd also appreciate it if maybe someone could
> > explain how to read this as far as where the problem actually started.
>
> I'd bet that the problem started exactly where the crash says it did,
> with a stack overflow. This is with running gfs on top of clvm on top of
> multipath on to of gnbd, right?

Yes, that's correct, GFS, CLVM, Multipath, and GNBD client.

> So for IO to complete, it needs to go through
> gfs, device-mapper, and gnbd functions. It's possible that some functions in
> those modules aren't incredibly efficient with their stack space usage (A
> number of functions like this have been found and fixed in GFS over the years).
> Since it's a pretty long function stack, it would take too much waste in
> a couple of functions to put this over the edge.
>
> Which means that this setup probably needs some testing, and functions need
> auditing.
>
> Just to make sure of some things:
>
> This is not a crash on a gnbd server node. correct? More specifically, you
> aren't running a gnbd client and server on the same node for the same device.
> That is bad. On a gnbd server machine, you cannot gnbd_import the devices that
> you just gnbd_exported from that machine, and there's no reason to anyway.

This is a GNBD node and not a server. My server simply mounts the
device and does not reimport the export.

>
> Is anything else running that would effect the gfs file system on this node?
> You mentioned NFS earlier. Are the gnbd client machines using NFS to serve
> up the gfs file system?

The node that was crashing that started this whole thing does have an
NFS export, however this is a different node which does not have
anything exported or imported via NFS.

>
> Are you running bonded ethernet?

Yes, there are two dual Intel NICs in each of my nodes. 2 are bonded
for the heartbeat, gnbd exports, cluster manager, etc, and the other 2
are bonded for normal network traffic. I have two switches that are
then trunked and setup with 2 vlans, one for cluster stuff, another
for normal traffic. So in effect each bonded channel has one plug in
one switch and one in another, allowing me to loose an entire switch
and stay online.

> There's nothing wrong with that, it just adds
> more functions to the stack. That is an amazingly ugly stack trace, and I'm
> trying to figure out what all is on there.

As far as stack traces goes I'm pretty much a novice, it's honestly
not that often (if ever) that I've had to investigate why a linux box
crashed. This is just amazingly huge compared to the one I posted
before and others I've run into.

>
> > Thanks again.
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster


From dbrieck at gmail.com  Mon Oct 23 17:39:08 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Mon, 23 Oct 2006 13:39:08 -0400
Subject: [Linux-cluster] Re: Hard lockups during file transfer to GNBD/GFS
	device
In-Reply-To: <8c1094290610231032w5184472dn749f0bfbc7456624@mail.gmail.com>
References: <8c1094290609280915o6b6b4962ud0d090e58e5d7fc6@mail.gmail.com>
	<8c1094290609281208i6a5eaf8br70697c6b5d085cf@mail.gmail.com>
	<8c1094290609281227j1303ec11u300932ab8d4953ab@mail.gmail.com>
	<20060928195844.GB25242@redhat.com>
	<8c1094290609290651r62cec5f9n28278d6a81c3e6ef@mail.gmail.com>
	<8c1094290610100545j1f91d0a2h519cd9033b439390@mail.gmail.com>
	<8c1094290610230648x7ac8189dy3b2612fb9191125b@mail.gmail.com>
	<20061023171946.GA23495@ether.msp.redhat.com>
	<8c1094290610231032w5184472dn749f0bfbc7456624@mail.gmail.com>
Message-ID: <8c1094290610231039g739fe237m360f0a213d97e62f@mail.gmail.com>

On 10/23/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> On 10/23/06, Benjamin Marzinski <bmarzins at redhat.com> wrote:
> > On Mon, Oct 23, 2006 at 09:48:34AM -0400, David Brieck Jr. wrote:
> > > On 10/10/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> > > >On 9/29/06, David Brieck Jr. <dbrieck at gmail.com> wrote:
> > > >> On 9/28/06, David Teigland <teigland at redhat.com> wrote:
> > > >> >
> > > >> > Could you try it without multipath?  You have quite a few layers there.
> > > >> > Dave
> > > >> >
> > > >> >
> > > >>
> > > >> Thanks for the response. I unloaded gfs, clvm, gnbd and multipath, the
> > > >> reloaded gnbd, clvm and gfs. It was only talking to one of the gnbd
> > > >> servers and without multipath. Here's the log from this crash. It
> > > >> seems to have more info in it.
> > > >>
> > > >> I'm kinda confused why it still has references to mulitpath though. I
> > > >> unloaded the multipath module so I'm not sure why it's still in there.
> > > >> SNIP
> > > >
> > > >Since I didn't hear back from anyone I decided to try things a little
> > > >differently. Instead of rsyncing on the local machine, I ran the rsync
> > > >from another cluster member who also mounts the same partition I was
> > > >trying to move things too.
> > > >
> > > >So instead of
> > > >
> > > >rsync -a /path/to/files/ /mnt/http/
> > > >
> > > >I used
> > > >
> > > >rsync -a root at 10.1.1.121::/path/to/files/  /mnt/http/
> > > >
> > > >and I didn't have a crash at all. Why would this not cause a problem
> > > >when the first one did? Is this more of an rsync problem maybe? I do
> > > >have 2 NFS exports, could those have been causing the problem?
> > > >
> > > >Thanks.
> > > >
> > >
> > > I am now seeing other crashes. Here's one from this weekend. It's huge
> > > and I'm not even sure how to see where things might have gone wrong.
> > > At the time of the crash the system was doing a backup to the gnbd
> > > server (if that helps). I'd also appreciate it if maybe someone could
> > > explain how to read this as far as where the problem actually started.
> >
> > I'd bet that the problem started exactly where the crash says it did,
> > with a stack overflow. This is with running gfs on top of clvm on top of
> > multipath on to of gnbd, right?
>
> Yes, that's correct, GFS, CLVM, Multipath, and GNBD client.
>
> > So for IO to complete, it needs to go through
> > gfs, device-mapper, and gnbd functions. It's possible that some functions in
> > those modules aren't incredibly efficient with their stack space usage (A
> > number of functions like this have been found and fixed in GFS over the years).
> > Since it's a pretty long function stack, it would take too much waste in
> > a couple of functions to put this over the edge.
> >
> > Which means that this setup probably needs some testing, and functions need
> > auditing.
> >
> > Just to make sure of some things:
> >
> > This is not a crash on a gnbd server node. correct? More specifically, you
> > aren't running a gnbd client and server on the same node for the same device.
> > That is bad. On a gnbd server machine, you cannot gnbd_import the devices that
> > you just gnbd_exported from that machine, and there's no reason to anyway.
>
> This is a GNBD node and not a server. My server simply mounts the
> device and does not reimport the export.
>
> >
> > Is anything else running that would effect the gfs file system on this node?
> > You mentioned NFS earlier. Are the gnbd client machines using NFS to serve
> > up the gfs file system?
>
> The node that was crashing that started this whole thing does have an
> NFS export, however this is a different node which does not have
> anything exported or imported via NFS.
>
> >
> > Are you running bonded ethernet?
>
> Yes, there are two dual Intel NICs in each of my nodes. 2 are bonded
> for the heartbeat, gnbd exports, cluster manager, etc, and the other 2
> are bonded for normal network traffic. I have two switches that are
> then trunked and setup with 2 vlans, one for cluster stuff, another
> for normal traffic. So in effect each bonded channel has one plug in
> one switch and one in another, allowing me to loose an entire switch
> and stay online.
>
> > There's nothing wrong with that, it just adds
> > more functions to the stack. That is an amazingly ugly stack trace, and I'm
> > trying to figure out what all is on there.
>
> As far as stack traces goes I'm pretty much a novice, it's honestly
> not that often (if ever) that I've had to investigate why a linux box
> crashed. This is just amazingly huge compared to the one I posted
> before and others I've run into.
>
> >
> > > Thanks again.
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
>

One other thing I wanted to mention: the bonded interfaces are mode=0
(round robin). Not sure if that would have any effect but it's worth
mentioning.


From tadashi.iwashita at uniadex.co.jp  Tue Oct 24 11:22:50 2006
From: tadashi.iwashita at uniadex.co.jp (tadashi.iwashita at uniadex.co.jp)
Date: Tue, 24 Oct 2006 20:22:50 +0900
Subject: [Linux-cluster] GFS problem in plock.c?
Message-ID: <496258BCD1F8034DB4043D35871A3B4402230E62@ZZ01X1MB02.ixas.unisys.co.jp>

Hello All,

I have experienced system hang-up after running the latest
LTP(http://ltp.sourceforge.net/) tool as a part of durability testings
on our GFS environement. I was using 2 DELL PE1950 servers which
installed CentOS4.3(IA32) and DELL EMC AX150 was setup as a GFS shared
storage connected to each servers. We did "./runltp -d /gfs3" to run the
LTP tool on one server and another server remained just idle. Here are
the extracted /var/log/messages taken when the system was stopped:

Oct  4 13:15:47 centos1 kernel: lock_dlm:  Assertion failed on line 500
of file
/home/buildcentos/rpmbuild/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/plock.c
Oct  4 13:15:47 centos1 kernel: lock_dlm:  assertion:  "!error"
Oct  4 13:15:47 centos1 kernel: lock_dlm:  time = 71704458
Oct  4 13:15:47 centos1 kernel: error=-11
Oct  4 13:15:47 centos1 kernel:
Oct  4 13:15:47 centos1 kernel: ------------[ cut here ]------------
Oct  4 13:15:47 centos1 kernel: kernel BUG at
/home/buildcentos/rpmbuild/BUILD/fs-kernel-2.6.9-49/smp/src/dlm/plock.c:
500!
Oct  4 13:15:47 centos1 kernel: invalid operand: 0000 [#1]
Oct  4 13:15:47 centos1 kernel: SMP
Oct  4 13:15:47 centos1 kernel: Modules linked in: parport_pc lp parport
autofs i2c_dev i2c_core lock_dlm(U) gfs(U) lock_harness(U) dlm(U)
cman(U) sunrpc dm_m      rror dm_multipath dm_mod button battery ac md5
ipv6 joydev uhci_hcd ehci_hcd hw      random shpchp bnx2 ext3 jbd
qla6312 qla2xxx scsi_transport_fc megaraid_sas sd_m      d scsi_mod
Oct  4 13:15:47 centos1 kernel: CPU:    0

Does anyone know whether this is a known problem or not? Or any
suggestion?
It is most likely I will be able to reproduce this problem.

Thanks,
Tadashi
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061024/538fde3e/attachment.htm>

From dist-list at LEXUM.UMontreal.CA  Tue Oct 24 12:16:47 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Tue, 24 Oct 2006 08:16:47 -0400
Subject: [Linux-cluster] LVS Director : Monitoring HTTPS site ?
Message-ID: <453E042F.1020107@lexum.umontreal.ca>

Hello,
On the Director :  I'm trying to monitor  an HTTPS server.

The GET / HTTP/1.0 of course does not ouput the expect HTTP string : HTTP

How do you handle HTTPS sites ?

thanks !


From brentonr at dorm.org  Tue Oct 24 14:08:10 2006
From: brentonr at dorm.org (Brenton Rothchild)
Date: Tue, 24 Oct 2006 09:08:10 -0500
Subject: [Linux-cluster] LVS Director : Monitoring HTTPS site ?
In-Reply-To: <453E042F.1020107@lexum.umontreal.ca>
References: <453E042F.1020107@lexum.umontreal.ca>
Message-ID: <453E1E4A.3050003@dorm.org>

FM wrote:
> Hello,
> On the Director :  I'm trying to monitor  an HTTPS server.
> 
> The GET / HTTP/1.0 of course does not ouput the expect HTTP string : HTTP
> 
> How do you handle HTTPS sites ?
> 

We've used an external check script with nanny to check via 'wget':

------ file: /usr/local/bin/lvs_check_https ------
#!/bin/bash
LINES=`wget -q -O - --no-check-certificate https://$1:$2 | wc -c`
if [ $LINES -gt "0" ]; then
         echo "OK"
else
         echo "FAILURE"
fi
exit 0
------ end file ------

and the corresponding entries in lvs.cf:

------ file: /etc/sysconfig/ha/lvs.cf ------
virtual https_server_XYZ {
     ...
     send_program = "/usr/local/bin/lvs_check_https %h %p"
     expect = "OK"
     ...
}
------ end file ------

This script, of course, assumes that the web server respond with _something_
to indicate it's up, vs. _nothing_ to indicate it's down.

It's worked quite well for us for the last year or so.

-Brenton Rothchild


From tim.post at netkinetics.net  Tue Oct 24 15:28:02 2006
From: tim.post at netkinetics.net (Tim Post)
Date: Tue, 24 Oct 2006 23:28:02 +0800
Subject: [Linux-cluster] LVS Director : Monitoring HTTPS site ?
In-Reply-To: <453E042F.1020107@lexum.umontreal.ca>
References: <453E042F.1020107@lexum.umontreal.ca>
Message-ID: <1161703683.5511.598.camel@tower>

wget / lynx .. if your still getting garbage headers you may want to
take a look at pound :

http://www.apsis.ch/pound/

Not to be used as a LB, but its an excellent , very very tiny reverse
proxy that does very well with https and request / header sanatizing.

Setup a special IP off the cvip to use for monitoring, and you can
happily monitor many (http/https) from that special IP.

HTH,
-Tim

On Tue, 2006-10-24 at 08:16 -0400, FM wrote:
> Hello,
> On the Director :  I'm trying to monitor  an HTTPS server.
> 
> The GET / HTTP/1.0 of course does not ouput the expect HTTP string : HTTP
> 
> How do you handle HTTPS sites ?
> 
> thanks !
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 


From isplist at logicore.net  Tue Oct 24 18:26:51 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 13:26:51 -0500
Subject: [Linux-cluster] Fencing Hardware
Message-ID: <20061024132651.975769@leena>

I bought a huge McData enterprise switch for a song because I figured it would 
be the simplest way of doing fencing on the number of nodes I'd need.

Turns out, this is making me nuts now and I'm wondering if it would be simpler 
to just use hubs over fabric and GNDB servers over specific hardware.

Anyone have any thoughts that might have done both?

Mike


From dbrieck at gmail.com  Tue Oct 24 18:48:55 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Tue, 24 Oct 2006 14:48:55 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024132651.975769@leena>
References: <20061024132651.975769@leena>
Message-ID: <8c1094290610241148p7a8263eduead5921a64d5cf63@mail.gmail.com>

On 10/24/06, isplist at logicore.net <isplist at logicore.net> wrote:
> I bought a huge McData enterprise switch for a song because I figured it would
> be the simplest way of doing fencing on the number of nodes I'd need.
>
> Turns out, this is making me nuts now and I'm wondering if it would be simpler
> to just use hubs over fabric and GNDB servers over specific hardware.
>
> Anyone have any thoughts that might have done both?
>
> Mike
>

Is there a reason you're not considering power fencing? We use APC
switched PDUs for our fence device and they work beautifully.


From isplist at logicore.net  Tue Oct 24 18:51:37 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 13:51:37 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <8c1094290610241148p7a8263eduead5921a64d5cf63@mail.gmail.com>
Message-ID: <20061024135137.274564@leena>

> Is there a reason you're not considering power fencing? We use APC
> switched PDUs for our fence device and they work beautifully.

No, I'll consider anything that's not too costly up front, that allows growth 
and that's fairly easy to work with.

mike


From isplist at logicore.net  Tue Oct 24 18:54:20 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 13:54:20 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <8c1094290610241148p7a8263eduead5921a64d5cf63@mail.gmail.com>
Message-ID: <20061024135420.606158@leena>

> Is there a reason you're not considering power fencing? We use APC
> switched PDUs for our fence device and they work beautifully.

Oh, um, this won't work for me. I'm using blade servers. They have a recessed 
power button, I think a jumper as well but they can't be shut off like a stand 
alone server would.

Mike


From dbrieck at gmail.com  Tue Oct 24 19:00:48 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Tue, 24 Oct 2006 15:00:48 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024135137.274564@leena>
References: <8c1094290610241148p7a8263eduead5921a64d5cf63@mail.gmail.com>
	<20061024135137.274564@leena>
Message-ID: <8c1094290610241200o25942fdfk3236a0bd95c83ecd@mail.gmail.com>

On 10/24/06, isplist at logicore.net <isplist at logicore.net> wrote:
> > Is there a reason you're not considering power fencing? We use APC
> > switched PDUs for our fence device and they work beautifully.
>
> No, I'll consider anything that's not too costly up front, that allows growth
> and that's fairly easy to work with.
>
> mike
>

We buy all our hardware through dell so you might be able to find
better prices than those listed here, but any of the APC Switched PDUs
will work for you. Of course you need to make sure they fit into your
data center's power grid as appropriate, but you can pickup a 24 port
PDU for between $600 and $700 depending on your power setup.

http://accessories.us.dell.com/sna/category.aspx?c=us&category_id=6250&cs=555&l=en&s=biz&nf=18037~0~130093&~ck=anav

We have dual power supples in all our servers so we use two ports per
server since the rack is on two different circuits, but if you used
the Y cable and weren't worried about that level of redundancy then 24
ports would go a long way.


From isplist at logicore.net  Tue Oct 24 19:12:18 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 14:12:18 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <8c1094290610241200o25942fdfk3236a0bd95c83ecd@mail.gmail.com>
Message-ID: <20061024141218.897140@leena>

The blade servers each have three power supplies but I can't turn those off 
since the blades share the power. There might be a jumper on the blade for a 
reset or perhaps cutting off the Ethernet connection, something like that but 
a power cut off would not work.

Mike


> We buy all our hardware through dell so you might be able to find
> better prices than those listed here, but any of the APC Switched PDUs
> will work for you. Of course you need to make sure they fit into your
> data center's power grid as appropriate, but you can pickup a 24 port
> PDU for between $600 and $700 depending on your power setup.
> 
> http://accessories.us.dell.com/sna/category.aspx?c=us&category_id=6250&cs=55
5&
> l=en&s=biz&nf=18037~0~130093&~ck=anav
> 
> We have dual power supples in all our servers so we use two ports per
> server since the rack is on two different circuits, but if you used
> the Y cable and weren't worried about that level of redundancy then 24
> ports would go a long way.


From jparsons at redhat.com  Tue Oct 24 19:15:35 2006
From: jparsons at redhat.com (James Parsons)
Date: Tue, 24 Oct 2006 15:15:35 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <8c1094290610241200o25942fdfk3236a0bd95c83ecd@mail.gmail.com>
References: <8c1094290610241148p7a8263eduead5921a64d5cf63@mail.gmail.com>	<20061024135137.274564@leena>
	<8c1094290610241200o25942fdfk3236a0bd95c83ecd@mail.gmail.com>
Message-ID: <453E6657.1070405@redhat.com>

David Brieck Jr. wrote:

> On 10/24/06, isplist at logicore.net <isplist at logicore.net> wrote:
>
>> > Is there a reason you're not considering power fencing? We use APC
>> > switched PDUs for our fence device and they work beautifully.
>>
>> No, I'll consider anything that's not too costly up front, that 
>> allows growth
>> and that's fairly easy to work with.
>>
>> mike
>>
>
> We buy all our hardware through dell so you might be able to find
> better prices than those listed here, but any of the APC Switched PDUs
> will work for you. Of course you need to make sure they fit into your
> data center's power grid as appropriate, but you can pickup a 24 port
> PDU for between $600 and $700 depending on your power setup.
>
> http://accessories.us.dell.com/sna/category.aspx?c=us&category_id=6250&cs=555&l=en&s=biz&nf=18037~0~130093&~ck=anav 
>
>
> We have dual power supples in all our servers so we use two ports per
> server since the rack is on two different circuits, but if you used
> the Y cable and weren't worried about that level of redundancy then 24
> ports would go a long way.
>
In addition, of all the available agents in the cluster suite, apc has 
alot to be said for it. The agent was tested at the apc facility in 
Rhode Island last year on all of their current 79XX series switches and 
it did great. A new agent is being coded right now that supports outlet 
naming and outlet grouping. There is also a really dependable snmp 
version of the apc fence agent that ships with cluster suite (but 
requires doing a couple of chores to use it -- instructions are provided 
in a README).

We work hard to keep all of the agent scripts current with new firmware 
revs and such -- but I guess the proximity of APC and the dependability 
of the agent (It gets looked at alot) and the switches make them a 
pretty safe fencing choice.

PS: I hold no interest in APCC! My only interest is in reliable fencing 
-- whatever your choice may be :)

-Jim


From redhat at watson-wilson.ca  Tue Oct 24 19:20:21 2006
From: redhat at watson-wilson.ca (Neil Watson)
Date: Tue, 24 Oct 2006 15:20:21 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024132651.975769@leena>
References: <20061024132651.975769@leena>
Message-ID: <20061024192021.GA19197@watson-wilson.ca>

Don't most blade centres come with remote management controllers?  For
example HP's iLO cards.

-- 
Neil Watson             | Debian Linux
System Administrator    | Uptime 16 days
http://watson-wilson.ca


From isplist at logicore.net  Tue Oct 24 19:23:04 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 14:23:04 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024192021.GA19197@watson-wilson.ca>
Message-ID: <2006102414234.695894@leena>

Each blade has one slot for an add on card which I've taken up for Fibre 
Channel cards. In my case at least, there's no room for anything else and I'm 
not sure there are any packages that handle external power controls for my 
machines.

Mike


On Tue, 24 Oct 2006 15:20:21 -0400, Neil Watson wrote:
> Don't most blade centres come with remote management controllers?  For
> 
> example HP's iLO cards.


From gforte at leopard.us.udel.edu  Tue Oct 24 19:25:36 2006
From: gforte at leopard.us.udel.edu (Greg Forte)
Date: Tue, 24 Oct 2006 15:25:36 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <2006102414234.695894@leena>
References: <2006102414234.695894@leena>
Message-ID: <453E68B0.2000406@leopard.us.udel.edu>

check your documentation, or call your vendor.  The power should be 
configurable at the enclosure level, not the individual blades.  There 
should be some sort of interface that you can log into and say "down the 
power on the blade in slot X".  Though this may be an add-on that you 
haven't purchased.

-g

isplist at logicore.net wrote:
> Each blade has one slot for an add on card which I've taken up for Fibre 
> Channel cards. In my case at least, there's no room for anything else and I'm 
> not sure there are any packages that handle external power controls for my 
> machines.
> 
> Mike
> 
> 
> On Tue, 24 Oct 2006 15:20:21 -0400, Neil Watson wrote:
>> Don't most blade centres come with remote management controllers?  For
>>
>> example HP's iLO cards.
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


-- 
Greg Forte
gforte at udel.edu
IT - User Services
University of Delaware
302-831-1982
Newark, DE


From isplist at logicore.net  Tue Oct 24 19:28:04 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 14:28:04 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <453E68B0.2000406@leopard.us.udel.edu>
Message-ID: <2006102414284.101887@leena>

Right, yes, I do have such software but there is no SDK or anything that I 
know of that would allow external control. And, the software would not lend 
itself to external controls for fencing, that I know of.

Anyone using Cubix Blade servers?

Mike


On Tue, 24 Oct 2006 15:25:36 -0400, Greg Forte wrote:
> check your documentation, or call your vendor.  The power should be
> 
> configurable at the enclosure level, not the individual blades.  There
> should be some sort of interface that you can log into and say "down the
> power on the blade in slot X".  Though this may be an add-on that you
> haven't purchased.
> 
> -g
> 
> isplist at logicore.net wrote:
>> Each blade has one slot for an add on card which I've taken up for Fibre
>> Channel cards. In my case at least, there's no room for anything else and
>> I'm
>> not sure there are any packages that handle external power controls for my
>> machines.
>> 
>> Mike
>> 
>> 
>> On Tue, 24 Oct 2006 15:20:21 -0400, Neil Watson wrote:
>>> Don't most blade centres come with remote management controllers?  For
>>> 
>>> example HP's iLO cards.
>> 
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster


From dist-list at LEXUM.UMontreal.CA  Tue Oct 24 19:38:04 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Tue, 24 Oct 2006 15:38:04 -0400
Subject: [Linux-cluster] GFS performance problem 
Message-ID: <453E6B9C.7030009@lexum.umontreal.ca>

Hello,

Here is my setup :
RedHat Enterprise 4 (update 3).
5 web servers connected to a 600 GB  GFS (noatime) on a SAN.
On the GFS : all web sites ROOTS and httpd logs files
All web servers are writing to the same log. No problem here.

The prob is with rsync. When writing on the GSF it is very very slow !
(40 % slower when we are lucky :) )

lots of question :
*) could it be because if I have 1 GFS for the heavy write on the log
and another GFS for the websites' files ?
*) expect for the noatime, is there other technic to speed up GFS ?
*) Can several webserver access EXT3 FS (read only) when only on other
server have RW access to it ?
*) is there a options to tune rsync when using GFS ?
*) we are using DLM as the locking system. All servers are connected
with Gb RJ45. Is DLM using the network to manage the lock. And if it is
the case, could my problem come from the network latency ?

As I said, lots of questions here :)


From isplist at logicore.net  Tue Oct 24 19:53:28 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 14:53:28 -0500
Subject: [Linux-cluster] Cluster vs Distributed?
Message-ID: <20061024145328.951929@leena>

I'm wondering if I'm missing the full potential of what GFS/clustering could 
do for me so figure I'll ask.

I decided to start learning about GFS/Clusters because I really wanted a good 
way of running a distributed environment for some Internet based services.

For example, I'm using GFS and remote storage to provide central areas for 
multiple web, mail and other servers to get/store/share their data. So, each 
server has it's own small drive and OS but each server shares central areas 
for it's services, be it web, mail, etc. If one server goes down, there are 
others there to keep going based on a front end of load balancers. 

Of course, if a machine needs rebuilding, then it's a full rebuild, then copy 
what I need, make it what it's supposed to be and stick it back into the 
cluster. Each server shares it's data with the others.

Now on the other hand, the clustering I think the Linux LVS (is that what it's 
called?) would give me a single single image type cluster would it not? I 
mean, each node would be a part of a single system, just keep adding nodes to 
expand the system and performance.

Which is best then? It seems each does similar things with it's own pluses and 
minuses. Is my use of the GFS clustering a good use or am I missing serious 
benefits?

Mike


From lhh at redhat.com  Tue Oct 24 20:39:17 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 24 Oct 2006 16:39:17 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024132651.975769@leena>
References: <20061024132651.975769@leena>
Message-ID: <1161722357.4518.15.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-24 at 13:26 -0500, isplist at logicore.net wrote:
> I bought a huge McData enterprise switch for a song because I figured it would 
> be the simplest way of doing fencing on the number of nodes I'd need.
> 
> Turns out, this is making me nuts now and I'm wondering if it would be simpler 
> to just use hubs over fabric and GNDB servers over specific hardware.

What's wrong with it?  Does the agent not work with your switch?

-- Lon


From lhh at redhat.com  Tue Oct 24 20:41:32 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 24 Oct 2006 16:41:32 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <2006102414284.101887@leena>
References: <2006102414284.101887@leena>
Message-ID: <1161722492.4518.18.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-24 at 14:28 -0500, isplist at logicore.net wrote:
> Right, yes, I do have such software but there is no SDK or anything that I 
> know of that would allow external control. And, the software would not lend 
> itself to external controls for fencing, that I know of.
> 
> Anyone using Cubix Blade servers?

Most any console that has power management and either an ssh, a telnet,
or a web-accessible port should be able to be coerced in to powering
things off and on.

-- Lon


From lhh at redhat.com  Tue Oct 24 20:55:45 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 24 Oct 2006 16:55:45 -0400
Subject: [Linux-cluster] Cluster vs Distributed?
In-Reply-To: <20061024145328.951929@leena>
References: <20061024145328.951929@leena>
Message-ID: <1161723346.4518.33.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-24 at 14:53 -0500, isplist at logicore.net wrote:

> Of course, if a machine needs rebuilding, then it's a full rebuild, then copy 
> what I need, make it what it's supposed to be and stick it back into the 
> cluster. Each server shares it's data with the others.

You should be able to automate this step, at least, mostly.  If you are
using Red Hat Enterprise Linux or Fedora Core, you can automate most of
the installation and configuration using Kickstart.

> Now on the other hand, the clustering I think the Linux LVS (is that what it's 
> called?) would give me a single single image type cluster would it not? I 
> mean, each node would be a part of a single system, just keep adding nodes to 
> expand the system and performance.

LVS appears to web (or other) network clients to be a high-capacity
single system.  In fact, you probably *are* using LVS right now -- it's
what is likely doing the load balancing.  LVS "real servers" (in your
case, the GFS cluster) do not have to be a part of a cluster, and in
fact, do not even need to know one another exist (of course, this
changes when you introduce GFS :) ).

Contrast to OpenSSI, which gives you a single system image for processes
running on the cluster:

  http://openssi.org/

> Which is best then? It seems each does similar things with it's own pluses and 
> minuses. Is my use of the GFS clustering a good use or am I missing serious 
> benefits?

Not all applications designed to work on a single system are instantly
scalable just because GFS is the back-end store.  Any application which
does not do proper file locking could cause problems for other instances
of that application elsewhere in the cluster, for example.

It all depends on what you're trying to do.  Could you draw us a
picture/diagram?  :)

-- Lon


From isplist at logicore.net  Tue Oct 24 21:08:11 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 16:08:11 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <1161722357.4518.15.camel@rei.boston.devel.redhat.com>
Message-ID: <2006102416811.498091@leena>

>> Turns out, this is making me nuts now and I'm wondering if it would be
>> simpler to just use hubs over fabric and GNDB servers over specific 
>> hardware.
>> 
> What's wrong with it?  Does the agent not work with your switch?

I don't know really, it's now looking like a licensing issue. Either way, I've 
moved everything over to Brocade switches at this point, can't take the McData 
McNonsense anymore.

So, now need to look into fencing_brocade to get things going again.

Mike


From orkcu at yahoo.com  Tue Oct 24 21:30:33 2006
From: orkcu at yahoo.com (Roger Pe�a Escobio)
Date: Tue, 24 Oct 2006 14:30:33 -0700 (PDT)
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <2006102414234.695894@leena>
Message-ID: <20061024213033.23390.qmail@web50615.mail.yahoo.com>


--- "isplist at logicore.net" <isplist at logicore.net>
wrote:

> Each blade has one slot for an add on card which
> I've taken up for Fibre 
> Channel cards. In my case at least, there's no room
> for anything else and I'm 
> not sure there are any packages that handle external
> power controls for my 
> machines.
what kind of blade do you have? HP? Dell? other
branch?

I think you can poweroff and poweron the blade you
want from the enclosure menu-cli-webInterfase
(whatever interfase the enclosure give you)

I am preaty sure you can do it from HP iLo, and
someone  just talk me that Dell have the same feature,
so you can reboot the blase from remote locations

cu
roger


__________________________________________
RedHat Certified Engineer ( RHCE )
Cisco Certified Network Associate ( CCNA )

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From isplist at logicore.net  Tue Oct 24 21:34:41 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 16:34:41 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024213033.23390.qmail@web50615.mail.yahoo.com>
Message-ID: <20061024163441.903943@leena>

Can't be done with the Cubix, there is no method of logging in to control 
individual servers. There might be some third party software that ties into 
the maintenance side of things but none other.

Each blade has one slot for add-on cards, which are already used for the FC 
cards I use.

Mike


On Tue, 24 Oct 2006 14:30:33 -0700 (PDT), Pe?a wrote:
> 
> 
> --- "isplist at logicore.net" <isplist at logicore.net>
> wrote:
> 
>> Each blade has one slot for an add on card which
>> I've taken up for Fibre
>> Channel cards. In my case at least, there's no room
>> for anything else and I'm
>> not sure there are any packages that handle external
>> power controls for my
>> machines.
> what kind of blade do you have? HP? Dell? other
> branch?
> 
> I think you can poweroff and poweron the blade you
> want from the enclosure menu-cli-webInterfase
> (whatever interfase the enclosure give you)
> 
> I am preaty sure you can do it from HP iLo, and
> someone  just talk me that Dell have the same feature,
> so you can reboot the blase from remote locations
> 
> cu
> roger
> 
> 
> __________________________________________
> RedHat Certified Engineer ( RHCE )
> Cisco Certified Network Associate ( CCNA )
> 
> __________________________________________________
> Do You Yahoo!?
> Tired of spam?  Yahoo! Mail has the best spam protection around
> http://mail.yahoo.com


From isplist at logicore.net  Tue Oct 24 21:38:43 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 16:38:43 -0500
Subject: [Linux-cluster] Brocade Fencing
In-Reply-To: <20061024213033.23390.qmail@web50615.mail.yahoo.com>
Message-ID: <20061024163843.984175@leena>

I've had it with the McData switch, it's wasting my time. Rather, I've now 
connected everything into two Brocade switches so am looking for info on 
extending each to talk together now.

I'm guessing I really don't have to get into a fabric zone just to get the 
fencing I'm after?

Mike


From jparsons at redhat.com  Tue Oct 24 21:39:34 2006
From: jparsons at redhat.com (James Parsons)
Date: Tue, 24 Oct 2006 17:39:34 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024213033.23390.qmail@web50615.mail.yahoo.com>
References: <20061024213033.23390.qmail@web50615.mail.yahoo.com>
Message-ID: <453E8816.8070307@redhat.com>

Roger PeXa Escobio wrote:

>--- "isplist at logicore.net" <isplist at logicore.net>
>wrote:
>
>  
>
>>Each blade has one slot for an add on card which
>>I've taken up for Fibre 
>>Channel cards. In my case at least, there's no room
>>for anything else and I'm 
>>not sure there are any packages that handle external
>>power controls for my 
>>machines.
>>    
>>
>what kind of blade do you have? HP? Dell? other
>branch?
>
He is using cubix blades. I checked into their cubix management 
controller 2 device for remote poweroff/on...IT ONLY WORKS WITH INTERNET 
EXPLORER!!! YIKES!

I'm outa here... :)

-j


From isplist at logicore.net  Tue Oct 24 21:53:40 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 16:53:40 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <453E8816.8070307@redhat.com>
Message-ID: <20061024165340.339119@leena>

>> what kind of blade do you have? HP? Dell? other
>> branch?
>> 
> He is using cubix blades. I checked into their cubix management
> controller 2 device for remote poweroff/on...IT ONLY WORKS WITH INTERNET
> EXPLORER!!! YIKES!

I'm not clear of what you're looking at here?
The machines I'm using are www.cubix.com, Cubix Density, 1210's actually.

Either way, I've moved everything over to Brocade switches now so should be 
able to get this going soon.

Mike


From jparsons at redhat.com  Tue Oct 24 21:59:04 2006
From: jparsons at redhat.com (James Parsons)
Date: Tue, 24 Oct 2006 17:59:04 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024165340.339119@leena>
References: <20061024165340.339119@leena>
Message-ID: <453E8CA8.4000806@redhat.com>

isplist at logicore.net wrote:

>>>what kind of blade do you have? HP? Dell? other
>>>branch?
>>>
>>>      
>>>
>>He is using cubix blades. I checked into their cubix management
>>controller 2 device for remote poweroff/on...IT ONLY WORKS WITH INTERNET
>>EXPLORER!!! YIKES!
>>    
>>
>
>I'm not clear of what you're looking at here?
>The machines I'm using are www.cubix.com, Cubix Density, 1210's actually.
>  
>
Yup. I went to their web page - the only management controller for their 
product line was the one mentioned above.


From isplist at logicore.net  Tue Oct 24 22:19:30 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 24 Oct 2006 17:19:30 -0500
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <453E8CA8.4000806@redhat.com>
Message-ID: <20061024171930.861637@leena>

> Yup. I went to their web page - the only management controller for their
> product line was the one mentioned above.

Pretty incredible considering their gear. 

Mike


From basv at sara.nl  Wed Oct 25 07:03:00 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Wed, 25 Oct 2006 09:03:00 +0200
Subject: [Linux-cluster] Again DLM messages and high load
Message-ID: <453F0C24.6080708@sara.nl>

WE are using:
  GFS         : CVS 1.0.3 stable
  kernel      : 2.6.17.11-sara1
  NFS-daemons : 128
  GFS-servers   : 5

This node was the master and when this message was displayed, the load 
will rise to the number of NFS daemons and nfs does not work more. We 
had to reboot the node:
  Oct 25 03:12:31 ifs4 kernel: dlm: lisa_vg5_lv1: cancel reply ret 0
  Oct 25 03:12:31 ifs4 kernel: lock_dlm: unlock sb_status 0 2,a45325d 
flags 0
  Oct 25 03:12:31 ifs4 kernel: dlm: lisa_vg5_lv1: 
process_lockqueue_reply id a50a027c state 0

I had to reboot the master node (ifs4) when the node went down the other 
nodes re-elected another master. 3 nodes use the same master and on one 
node has another master. Is this oke?:

node 1,2,4
Fence Domain:    "default" 1   2 run       - [5 2 1 3 4]

node 3:
Fence Domain:    "default" 1   2 run       - [2 5 1 3 4]

cman_tool nodes is the same for all nodes.


Regards

-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From riaan at obsidian.co.za  Wed Oct 25 08:27:43 2006
From: riaan at obsidian.co.za (Riaan van Niekerk)
Date: Wed, 25 Oct 2006 10:27:43 +0200
Subject: [Linux-cluster] Again DLM messages and high load
In-Reply-To: <453F0C24.6080708@sara.nl>
References: <453F0C24.6080708@sara.nl>
Message-ID: <453F1FFF.8080100@obsidian.co.za>


Bas van der Vlies wrote:
> WE are using:
>  GFS         : CVS 1.0.3 stable
>  kernel      : 2.6.17.11-sara1
>  NFS-daemons : 128
>  GFS-servers   : 5
> 
> This node was the master and when this message was displayed, the load 
> will rise to the number of NFS daemons and nfs does not work more. We 
> had to reboot the node:
>  Oct 25 03:12:31 ifs4 kernel: dlm: lisa_vg5_lv1: cancel reply ret 0
>  Oct 25 03:12:31 ifs4 kernel: lock_dlm: unlock sb_status 0 2,a45325d 
> flags 0
>  Oct 25 03:12:31 ifs4 kernel: dlm: lisa_vg5_lv1: process_lockqueue_reply 
> id a50a027c state 0
> 
> I had to reboot the master node (ifs4) when the node went down the other 
> nodes re-elected another master. 3 nodes use the same master and on one 
> node has another master. Is this oke?:
> 
> node 1,2,4
> Fence Domain:    "default" 1   2 run       - [5 2 1 3 4]
> 
> node 3:
> Fence Domain:    "default" 1   2 run       - [2 5 1 3 4]
> 
> cman_tool nodes is the same for all nodes.
> 
> 
> Regards
> 

good day Bas

We had the EXACT same symptom (load average rising to number of NFSDs, 
NFS then becomes unresponsive - these processes actually become 
defunct), happening about 2x a week.

We have had a service request open with Red Hat for the past 3 months. 
Our biggest problem was with regards to capturing the sysrq T output, 
which we could not provide (since the problem always surfaced so 
quickly, and being a production server, our biggest concern was getting 
the service up, rather than capture debugging info) and therefore could 
not take the issue further.

We still had this problem with the DLM/GFS kernel modules accompanying 
kernel 2.6.9-42.0.EL

We loaded the DLM/GFS kernel modules accompanying kernel 2.6.9-42.0.2.EL:
GFS-kernel-smp-2.6.9-60.1
dlm-kernel-smp-2.6.9-44.2
a week and a half ago, and since then we have not seen this or two other 
problem symptoms.

The bugzilla entry we were tracking (some assertion failures):
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=199673
it was the only significant change  between kernel modules for 42.EL and 
42.0.2.EL .
The DLM errata is in http://rhn.redhat.com/errata/RHBA-2006-0702.html

I am not sure how these versions map to the CVS versions, or if our NFSD 
problem is indeed solved. However, it has never stayed away this long.

If our NFS problem does occur again, I will let you know.

greetings
Riaan
-------------- next part --------------
A non-text attachment was scrubbed...
Name: riaan.vcf
Type: text/x-vcard
Size: 310 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061025/17da9bf0/attachment.vcf>

From riaan at obsidian.co.za  Wed Oct 25 08:45:21 2006
From: riaan at obsidian.co.za (Riaan van Niekerk)
Date: Wed, 25 Oct 2006 10:45:21 +0200
Subject: [Linux-cluster] GFS performance problem
In-Reply-To: <453E6B9C.7030009@lexum.umontreal.ca>
References: <453E6B9C.7030009@lexum.umontreal.ca>
Message-ID: <453F2421.10404@obsidian.co.za>


FM wrote:
> Hello,
> 
> Here is my setup :
> RedHat Enterprise 4 (update 3).
> 5 web servers connected to a 600 GB  GFS (noatime) on a SAN.
> On the GFS : all web sites ROOTS and httpd logs files
> All web servers are writing to the same log. No problem here.
> 
> The prob is with rsync. When writing on the GSF it is very very slow !
> (40 % slower when we are lucky :) )
> 
> lots of question :
> *) could it be because if I have 1 GFS for the heavy write on the log
> and another GFS for the websites' files ?
> *) expect for the noatime, is there other technic to speed up GFS ?

a) we have increased the number of GFS locks cached to bring them closer 
in line with the number of locks in use:
echo "200000" > /proc/cluster/lock_dlm/drop_count
(not sure how much of a performance increase this got us)

b) We have disabled quotas
gfs_tool settune /mnt/san quota_account 0
We saw a 3 - 5 increase in performance

c) if you are using lots of small files, look into section 5.8 in the 
GFS manual, Data Journaling. However,
- I dont know how to change this for existing data on a GFS
- I have asked on the mailing list, and no-one seems to be using it.

> *) Can several webserver access EXT3 FS (read only) when only on other
> server have RW access to it ?

no. the RO server will get confused when things change from under it, 
since it is not expecting things to change

> *) is there a options to tune rsync when using GFS ?
> *) we are using DLM as the locking system. All servers are connected
> with Gb RJ45. Is DLM using the network to manage the lock. And if it is
> the case, could my problem come from the network latency ?

DLM is using the network, yes. not sure about latency. We use GB 
ethernet with RJ45/CAT5+ and have not had any problems related to DLM 
and the network (that we are aware of). As it was explained to me by Red 
Hat Support, DLM is extremely efficient, being able to master/distribute 
thousands of locks per second between nodes.

> 
> As I said, lots of questions here :)
> 

and some answers. Wish I had answers to all your questions.

greetings
Riaan
-------------- next part --------------
A non-text attachment was scrubbed...
Name: riaan.vcf
Type: text/x-vcard
Size: 324 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061025/5fc62eff/attachment.vcf>

From basv at sara.nl  Wed Oct 25 11:14:02 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Wed, 25 Oct 2006 13:14:02 +0200
Subject: [Linux-cluster] Again DLM messages and high load
In-Reply-To: <453F1FFF.8080100@obsidian.co.za>
References: <453F0C24.6080708@sara.nl> <453F1FFF.8080100@obsidian.co.za>
Message-ID: <453F46FA.2090303@sara.nl>

Riaan van Niekerk wrote:
> 
> Bas van der Vlies wrote:
>> WE are using:
>>  GFS         : CVS 1.0.3 stable
>>  kernel      : 2.6.17.11-sara1
>>  NFS-daemons : 128
>>  GFS-servers   : 5
>>
>> This node was the master and when this message was displayed, the load 
>> will rise to the number of NFS daemons and nfs does not work more. We 
>> had to reboot the node:
>>  Oct 25 03:12:31 ifs4 kernel: dlm: lisa_vg5_lv1: cancel reply ret 0
>>  Oct 25 03:12:31 ifs4 kernel: lock_dlm: unlock sb_status 0 2,a45325d 
>> flags 0
>>  Oct 25 03:12:31 ifs4 kernel: dlm: lisa_vg5_lv1: 
>> process_lockqueue_reply id a50a027c state 0
>>
>> I had to reboot the master node (ifs4) when the node went down the 
>> other nodes re-elected another master. 3 nodes use the same master and 
>> on one node has another master. Is this oke?:
>>
>> node 1,2,4
>> Fence Domain:    "default" 1   2 run       - [5 2 1 3 4]
>>
>> node 3:
>> Fence Domain:    "default" 1   2 run       - [2 5 1 3 4]
>>
>> cman_tool nodes is the same for all nodes.
>>
>>
>> Regards
>>
> 
> good day Bas
> 
> We had the EXACT same symptom (load average rising to number of NFSDs, 
> NFS then becomes unresponsive - these processes actually become 
> defunct), happening about 2x a week.
> 
> We have had a service request open with Red Hat for the past 3 months. 
> Our biggest problem was with regards to capturing the sysrq T output, 
> which we could not provide (since the problem always surfaced so 
> quickly, and being a production server, our biggest concern was getting 
> the service up, rather than capture debugging info) and therefore could 
> not take the issue further.
> 
> We still had this problem with the DLM/GFS kernel modules accompanying 
> kernel 2.6.9-42.0.EL
> 
> We loaded the DLM/GFS kernel modules accompanying kernel 2.6.9-42.0.2.EL:
> GFS-kernel-smp-2.6.9-60.1
> dlm-kernel-smp-2.6.9-44.2
> a week and a half ago, and since then we have not seen this or two other 
> problem symptoms.
> 
> The bugzilla entry we were tracking (some assertion failures):
> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=199673
> it was the only significant change  between kernel modules for 42.EL and 
> 42.0.2.EL .
> The DLM errata is in http://rhn.redhat.com/errata/RHBA-2006-0702.html
> 
> I am not sure how these versions map to the CVS versions, or if our NFSD 
> problem is indeed solved. However, it has never stayed away this long.
> 
> If our NFS problem does occur again, I will let you know.
> 
> greetings
> Riaan

Riaan,

  Thanks for the info. We had this problem also several times in a week 
with the previous versions. Now we use the latest version from CVS 
STABLE and hit this bug again, the uptime was 50 days ;-)

Regards


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From dist-list at LEXUM.UMontreal.CA  Wed Oct 25 12:01:10 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Wed, 25 Oct 2006 08:01:10 -0400
Subject: [Linux-cluster] GFS performance problem
In-Reply-To: <453F2421.10404@obsidian.co.za>
References: <453E6B9C.7030009@lexum.umontreal.ca>
	<453F2421.10404@obsidian.co.za>
Message-ID: <453F5206.1080303@lexum.umontreal.ca>

Thank you for the answers.

Another bottleneck that I could have is the way I connect to the SAN : 
device mapper multipath instead of the HP officiel module.

I do not not if the performance can we increase with the official module  ?


Riaan van Niekerk wrote:
> 
> 
> FM wrote:
>> Hello,
>>
>> Here is my setup :
>> RedHat Enterprise 4 (update 3).
>> 5 web servers connected to a 600 GB  GFS (noatime) on a SAN.
>> On the GFS : all web sites ROOTS and httpd logs files
>> All web servers are writing to the same log. No problem here.
>>
>> The prob is with rsync. When writing on the GSF it is very very slow !
>> (40 % slower when we are lucky :) )
>>
>> lots of question :
>> *) could it be because if I have 1 GFS for the heavy write on the log
>> and another GFS for the websites' files ?
>> *) expect for the noatime, is there other technic to speed up GFS ?
> 
> a) we have increased the number of GFS locks cached to bring them closer 
> in line with the number of locks in use:
> echo "200000" > /proc/cluster/lock_dlm/drop_count
> (not sure how much of a performance increase this got us)
> 
> b) We have disabled quotas
> gfs_tool settune /mnt/san quota_account 0
> We saw a 3 - 5 increase in performance
> 
> c) if you are using lots of small files, look into section 5.8 in the 
> GFS manual, Data Journaling. However,
> - I dont know how to change this for existing data on a GFS
> - I have asked on the mailing list, and no-one seems to be using it.
> 
>> *) Can several webserver access EXT3 FS (read only) when only on other
>> server have RW access to it ?
> 
> no. the RO server will get confused when things change from under it, 
> since it is not expecting things to change
> 
>> *) is there a options to tune rsync when using GFS ?
>> *) we are using DLM as the locking system. All servers are connected
>> with Gb RJ45. Is DLM using the network to manage the lock. And if it is
>> the case, could my problem come from the network latency ?
> 
> DLM is using the network, yes. not sure about latency. We use GB 
> ethernet with RJ45/CAT5+ and have not had any problems related to DLM 
> and the network (that we are aware of). As it was explained to me by Red 
> Hat Support, DLM is extremely efficient, being able to master/distribute 
> thousands of locks per second between nodes.
> 
>>
>> As I said, lots of questions here :)
>>
> 
> and some answers. Wish I had answers to all your questions.
> 
> greetings
> Riaan
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From riaan at obsidian.co.za  Wed Oct 25 13:27:40 2006
From: riaan at obsidian.co.za (Riaan van Niekerk)
Date: Wed, 25 Oct 2006 15:27:40 +0200
Subject: [Linux-cluster] GFS performance problem
In-Reply-To: <453F5206.1080303@lexum.umontreal.ca>
References: <453E6B9C.7030009@lexum.umontreal.ca>	<453F2421.10404@obsidian.co.za>
	<453F5206.1080303@lexum.umontreal.ca>
Message-ID: <453F664C.5040508@obsidian.co.za>


FM wrote:
> Thank you for the answers.
> 
> Another bottleneck that I could have is the way I connect to the SAN : 
> device mapper multipath instead of the HP officiel module.
> 
> I do not not if the performance can we increase with the official module  ?
> 

There is no SecurePath for RHEL 4 (assuming that is what you mean by 
"official module". That leaves you with
a) device-mapper-multipath
b) the Emulex or Qlogic HBA-based failover.

I have heard mostly good things about (a) even though I dont have a lot 
of production experience with it myself. It seems the trend is to move 
away from vendor/array and HBA-based multipathing towards OS-based 
multipathing, be it Windows or Linux.

> 
> 
> 
> Riaan van Niekerk wrote:
>>
>>
>> FM wrote:
>>> Hello,
>>>
>>> Here is my setup :
>>> RedHat Enterprise 4 (update 3).
>>> 5 web servers connected to a 600 GB  GFS (noatime) on a SAN.
>>> On the GFS : all web sites ROOTS and httpd logs files
>>> All web servers are writing to the same log. No problem here.
>>>
>>> The prob is with rsync. When writing on the GSF it is very very slow !
>>> (40 % slower when we are lucky :) )
>>>
>>> lots of question :
>>> *) could it be because if I have 1 GFS for the heavy write on the log
>>> and another GFS for the websites' files ?
>>> *) expect for the noatime, is there other technic to speed up GFS ?
>>
>> a) we have increased the number of GFS locks cached to bring them 
>> closer in line with the number of locks in use:
>> echo "200000" > /proc/cluster/lock_dlm/drop_count
>> (not sure how much of a performance increase this got us)
>>
>> b) We have disabled quotas
>> gfs_tool settune /mnt/san quota_account 0
>> We saw a 3 - 5 increase in performance
>>
>> c) if you are using lots of small files, look into section 5.8 in the 
>> GFS manual, Data Journaling. However,
>> - I dont know how to change this for existing data on a GFS
>> - I have asked on the mailing list, and no-one seems to be using it.
>>
>>> *) Can several webserver access EXT3 FS (read only) when only on other
>>> server have RW access to it ?
>>
>> no. the RO server will get confused when things change from under it, 
>> since it is not expecting things to change
>>
>>> *) is there a options to tune rsync when using GFS ?
>>> *) we are using DLM as the locking system. All servers are connected
>>> with Gb RJ45. Is DLM using the network to manage the lock. And if it is
>>> the case, could my problem come from the network latency ?
>>
>> DLM is using the network, yes. not sure about latency. We use GB 
>> ethernet with RJ45/CAT5+ and have not had any problems related to DLM 
>> and the network (that we are aware of). As it was explained to me by 
>> Red Hat Support, DLM is extremely efficient, being able to 
>> master/distribute thousands of locks per second between nodes.
>>
>>>
>>> As I said, lots of questions here :)
>>>
>>
>> and some answers. Wish I had answers to all your questions.
>>
>> greetings
>> Riaan
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
A non-text attachment was scrubbed...
Name: riaan.vcf
Type: text/x-vcard
Size: 324 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061025/173ddd95/attachment.vcf>

From jparsons at redhat.com  Wed Oct 25 14:57:45 2006
From: jparsons at redhat.com (James Parsons)
Date: Wed, 25 Oct 2006 10:57:45 -0400
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <20061024171930.861637@leena>
References: <20061024171930.861637@leena>
Message-ID: <453F7B69.4080504@redhat.com>

isplist at logicore.net wrote:

>>Yup. I went to their web page - the only management controller for their
>>product line was the one mentioned above.
>>    
>>
>
>Pretty incredible considering their gear. 
>
>Mike
>
>  
>
So, Mike - I am pursuing this a bit farther for you because I know how 
much pain you have been going through with your cluster. When the cubix 
web page said their blade management controller worked only with IE, I 
figured it had activeX features built inti it, but I took a chance and 
wrote the customer support people at cubix...I hope you don't mind. I 
asked the following:

>>Hi,

>>Is a web browser the *only* way to access the management controller's 
>>power up/down capability?

>>Is it possible to telnet to it, or to snmp interface with it? I am 
>>seeking a way to power blades up or down from an automated script.

>>Perhaps you do such a thing in your testing lab...

>>Thanks,

>>-Jim

They replied in less than 24 hours with the following:


>Hi Jim,

>It is possible to control the power on/off functionality using http posts
>embedded in a script (possibly cURL) but could also be accomplished by using
>simple HTML. The following post string would power on or off a blade located
>at the CMC 2 IP address "Whatever" chassis 1, Blade 1. 

>Port = Enclosure
>Channel = Blade

>Ports are enumerated from bottom to top on the back of the CMC 2 blade
>Channels from left to right facing the front of the enclosure.

>http://192.168.10.127/control?port=1&channel=1&pwrbtn=press

>Stephen McGowan
>Cubix Technical Support / 800-953-0204

So, that is helpful! The cubix management appears to be a card that will work across their product line -- I cannot be sure as I did not know your exact system description when I wrote cubix, but I would expect it is very likely that it does work with your blade enclosure.

So, if you had this management controller, you could write a simple python or perl script called fence_cubix that would take as args on stdin the addr of the controller card, the port and the channel values that map to the blade you wish to fence. You would put these args in the cluster.conf file like so:
<xml snip 8< >
  <clusternode name="node1" nodeid="1">
   <method name="1">
    <device name="my_cubix_card" port="1" channel="3"/>
   </method>
  </clusternode>
<more xml snipping 8< >
  <fencedevice name="my_cubix_card" agent="fence_cubix" ipaddr="192.168.10.127"/>

So, now inside your python script called fence_cubix, all of the args from the cluster.cong file will be passed to it as key-value pairs when a node needs to be fenced...just sort them out with simple string manipulations, and build the query string to send to the ipaddr as an http request.  Then drop you script in /sbin with the other fence agent scripts.

Look at the other fence agents for hints - most of then also support a command line interface that is handy for testing.

I hope this is helpful to you.

_j


From sandra-llistes at fib.upc.edu  Wed Oct 25 15:31:51 2006
From: sandra-llistes at fib.upc.edu (sandra-llistes)
Date: Wed, 25 Oct 2006 17:31:51 +0200
Subject: [Linux-cluster] GFS and samba problem, again
In-Reply-To: <4533B9C2.8080906@redhat.com>
References: <4523A637.1060706@fib.upc.edu> <4526B4ED.9050907@redhat.com>
	<452B4F39.60906@fib.upc.edu> <452BFC6D.60902@redhat.com>
	<452D0ED6.9040605@fib.upc.edu> <452D7040.8090704@redhat.com>
	<4533A6EA.6030503@fib.upc.edu> <4533B9C2.8080906@redhat.com>
Message-ID: <453F8367.9040606@fib.upc.edu>

Hello,

I've installed RHES4 evaluation version with kernel 2.6.10 and GFS 
from CVS. In this environment with same samba configuration as Fedora, 
We've opened one file simultaneouslly from 14 PCs and it works fine.
I've proved with 2.6.10 kernel because Red Hat came with 2.6.9 kernel, 
but I wasn't able to compile with it, so I've probed next version.

I will try to copy same kernel+GFS and compile it in Fedora 5 to see 
if it works too.
Thanks,

Sandra Hern?ndez

Abhijith Das wrote:
> Hi Sandra,
> 
>> Hi Abhi,
>>
>> Your mail astonished me. The only difference between your environment 
>> and our is that you've RHEL4 and we've Fedora 5 with GFS.
> 
> I'll get FC5 installed on a test cluster and try it out.
> 
>> I'm sorry about this but I have to insist. Are you completelly sure 
>> that your samba access was simoultaneous? because if you probe one 
>> client, and then another it isn't the same.
>> I found people that complain about the same problem we have:
>> http://www.redhat.com/archives/linux-cluster/2004-November/msg00065.html
>> http://www.centos.org/modules/newbb/viewtopic.php?topic_id=4997
> 
> Yes, the access was simultaneous. If you have a specific test, (specific 
> types of files or windows programs) I can try that too. I'm not denying 
> that you're seeing problems with GFS+samba :-). If I can reproduce your 
> problem, I can chase it down and we'll know why this is happening.
> 
>> Well, I had finally compiled GFS2 but I was uncapable to start cman 
>> daemons.
>> [root at server2 ~]#  /etc/init.d/cman start
>> Starting cluster:
>>    Loading modules... done
>>    Mounting configfs... done
>>    Starting ccsd... done
>>    Starting cman... failed
>> /usr/sbin/cman_tool: aisexec daemon didn't start
>>
>> And I obtain this when I try: # strace /usr/sbin/cman_tool -t 120 -w join
>> ....
>> connect(5, {sa_family=AF_FILE, path="/var/run/cman_admin"}, 110) = -1 
>> ENOENT (No such file or directory)
>> close(5)
>> write(2, "/usr/sbin/cman_tool: ", 21/usr/sbin/cman_tool: )   = 21
>> write(2, "aisexec daemon didn\'t start\n", 28aisexec daemon didn't start
>> ) = 28
>> exit_group(1)
>>
>> So I'm stalled by that way.
> 
> Looks like the openais component of the cluster is missing. The cluster 
> architecture has changed quite a bit for GFS2. Please refer to this for 
> help installing GFS2:
> http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/doc/usage.txt?rev=1.35&content-type=text/x-cvsweb-markup&cvsroot=cluster 
> 
> 
>> I'm downloading RHEL 30-days evaluating version and I'm also trying 
>> samba 3.0.23c compiling with cluster support.
>> I will tell you something as soon as I finish all test.
>> Regards,
>>
>> Sandra
> 
> Thanks,
> --Abhi


From Alain.Moulle at bull.net  Wed Oct 25 15:55:01 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 25 Oct 2006 17:55:01 +0200
Subject: [Linux-cluster] CS4 U2 / crash when "less /proc/cluster/status"
Message-ID: <453F88D5.9010101@bull.net>

Hi

Strange, when you do :
less /proc/cluster/status
on one node of a HA pair, the system crashes !
If you use cat /proc/cluster/status, no crash.
Any explanation ?
Is it always true with CS4 U4 ?

Thanks
Alain Moull?


From rpeterso at redhat.com  Wed Oct 25 16:45:36 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Wed, 25 Oct 2006 11:45:36 -0500
Subject: [Linux-cluster] CS4 U2 / crash when "less /proc/cluster/status"
In-Reply-To: <453F88D5.9010101@bull.net>
References: <453F88D5.9010101@bull.net>
Message-ID: <453F94B0.1090006@redhat.com>

Alain Moulle wrote:
> Hi
>
> Strange, when you do :
> less /proc/cluster/status
> on one node of a HA pair, the system crashes !
> If you use cat /proc/cluster/status, no crash.
> Any explanation ?
> Is it always true with CS4 U4 ?
>
> Thanks
> Alain Moull?
>   
Hi Alain,

Please file a new bugzilla for this problem.  Assign it to 
rpeterso at redhat.com
I was able to recreate it on RHEL4 U4, and that surprises me.  
Incidentally,
cat /proc/cluster/status works just fine.  I'll investigate why less 
doesn't.

Regards,

Bob Peterson
Red Hat Cluster Suite


From isplist at logicore.net  Wed Oct 25 17:36:33 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 25 Oct 2006 12:36:33 -0500
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <1161723346.4518.33.camel@rei.boston.devel.redhat.com>
Message-ID: <20061025123633.576140@leena>

> You should be able to automate this step, at least, mostly.  If you are
> using Red Hat Enterprise Linux or Fedora Core, you can automate most of
> the installation and configuration using Kickstart.

I've looked at Kickstart, seems darn complicated in that there are endless 
things you need to make sure are in there. I know after installing a machine 
there is a default one built but is that enough?

> LVS appears to web (or other) network clients to be a high-capacity
> single system.  In fact, you probably *are* using LVS right now -- it's

Right, looks like a single PC to a user for example, maybe I was thinking 
something else like an SSI system where I just add nodes to add resources. 
I've created various clusters for my various services, web, mail, mysql, etc.

> It all depends on what you're trying to do.  Could you draw us a
> picture/diagram?  :)

Well, like anyone, I just want to use my servers in a better manner. Rather 
than constantly having to upgrade, I prefer the idea of just adding a node or 
more when needed. I have a large number of blade servers (left over from a 
kaput ISP) and so have been using those.

As such, I've created multiple clusters which all share GFS filesystems. I 
have web, mail, mysql clusters which all share data. I've written basic 
scripts to allow for the various needs of the different machines. 

Then I use Radware load balancers in front of it all but am thinking of trying 
out some of the various Linux based LB's that are out there. I'd like 
something that does not take a specialist to maintain :).

Things seem to work well but I'm just wondering if I'm missing more of the 
potential of the Linux Cluster.

PS: I saw someone asking about sharing data on MySQL, that's something I'd 
love to do. In fact, I'd like to get rid of the big box IBM servers over using 
smaller blade servers. Problem is, the blade servers don't allow for much 
memory, from 512 to 2GB. the IBM's allow for 5GB's. But I wonder if I could 
still get away with many low memory MySQL servers sharing GFS storage?
I would guess that one or more would write but that many could read.

Mike


From isplist at logicore.net  Wed Oct 25 17:45:39 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 25 Oct 2006 12:45:39 -0500
Subject: [Linux-cluster] Fencing using brocade
In-Reply-To: <45081AD4.5050801@redhat.com>
Message-ID: <20061025124539.507466@leena>

>> I want to use my brocade switch as the fencing device for my cluster. I

> The system-config-cluster application supports brocade fencing. It is a
> two part process - first you define the switch as a fence device; type
> brocade, then you select a node an click "Manage fencing for this node"
> and declare a fence instance.

For what ever reason, I cannot run system-config-cluster on any of my 
machines. Could you look at this and let me know if it looks right? Seems to 
be working.

<?xml version="1.0"?>
<cluster config_version="57" name="vgcomp">
<fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
    <clusternodes>
        <clusternode name="compdev" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="0"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="cweb92" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="1"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="cweb93" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="2"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="cweb94" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="3"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm250" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="4"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm249" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="5"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm248" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="6"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="qm247" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="7"/>
                    </method>
                </fence>
        </clusternode>
        <clusternode name="dev" votes="1">
                <fence>
                    <method name="1">
                        <device name="fence_brocade" port="8"/>
                    </method>
                </fence>
        </clusternode>
    </clusternodes>
<fencedevices>
    <fencedevice agent="fence_brocade" ipaddr="x.x.x.x" login="user" 
name="brocade" passwd="xxx"/>
</fencedevices>
</cluster>


From isplist at logicore.net  Wed Oct 25 17:49:56 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 25 Oct 2006 12:49:56 -0500
Subject: [Linux-cluster] Changing LUN
In-Reply-To: <1159255185.2997.3.camel@localhost.localdomain>
Message-ID: <20061025124956.135188@leena>

>> After adding storage, my cluster comes up with different /dev/sda,
>> /dev/sdb, etc settings. My initial device now comes up as sdc when it used 

> You should be using lvm2 and lvm2_cluster to handle this issue.  LVM2
> handles the name changing of the device on reboot.  This often happens
> depending on the scan order for the devices.  By using a volume manager,

Yes, I am using lvm2 indeed. So, you're saying that it does not matter then? 
Once I create a volume, lvm will know it from something it put on it perhaps 
rather than it's possibly changing LUN id?

Mike


From jparsons at redhat.com  Wed Oct 25 17:53:55 2006
From: jparsons at redhat.com (James Parsons)
Date: Wed, 25 Oct 2006 13:53:55 -0400
Subject: [Linux-cluster] Fencing using brocade
In-Reply-To: <20061025124539.507466@leena>
References: <20061025124539.507466@leena>
Message-ID: <453FA4B3.2090309@redhat.com>

isplist at logicore.net wrote:

>>>I want to use my brocade switch as the fencing device for my cluster. I
>>>      
>>>
>
>  
>
>>The system-config-cluster application supports brocade fencing. It is a
>>two part process - first you define the switch as a fence device; type
>>brocade, then you select a node an click "Manage fencing for this node"
>>and declare a fence instance.
>>    
>>
>
>For what ever reason, I cannot run system-config-cluster on any of my 
>machines. Could you look at this and let me know if it looks right? Seems to 
>be working.
>
><?xml version="1.0"?>
><cluster config_version="57" name="vgcomp">
><fence_daemon clean_start="1" post_fail_delay="0" post_join_delay="3"/>
>    <clusternodes>
>        <clusternode name="compdev" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="0"/>
>
Here you are not referring to the name of the fence device, but the 
agent name. you need to refer to the name you gave the fence device - in 
this case, "brocade". Same for all nodes.

>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="cweb92" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="1"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="cweb93" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="2"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="cweb94" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="3"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="qm250" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="4"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="qm249" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="5"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="qm248" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="6"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="qm247" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="7"/>
>                    </method>
>                </fence>
>        </clusternode>
>        <clusternode name="dev" votes="1">
>                <fence>
>                    <method name="1">
>                        <device name="fence_brocade" port="8"/>
>                    </method>
>                </fence>
>        </clusternode>
>    </clusternodes>
><fencedevices>
>    <fencedevice agent="fence_brocade" ipaddr="x.x.x.x" login="user" 
>name="brocade" passwd="xxx"/>
></fencedevices>
></cluster>
>
>
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>  
>


From isplist at logicore.net  Wed Oct 25 18:00:18 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 25 Oct 2006 13:00:18 -0500
Subject: [Linux-cluster] Fencing using brocade
In-Reply-To: <453FA4B3.2090309@redhat.com>
Message-ID: <2006102513018.124779@leena>

>> <fence>
>> <method name="1">
>> <device name="fence_brocade" port="0"/>
>> 
> Here you are not referring to the name of the fence device, but the
> agent name. you need to refer to the name you gave the fence device - in
> this case, "brocade". Same for all nodes.

Ah, I see, the name in the <fencedevice> section. I could have multiple 
fencing devices and refer to them with the <device name> pointing to different 
<fencedevice>.

Thanks very much.

Mike


>> <fencedevice agent="fence_brocade" ipaddr="x.x.x.x" login="user"
>> name="brocade" passwd="xxx"/>
>> </fencedevices>
>> </cluster>
>> 
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster


From dbrieck at gmail.com  Wed Oct 25 18:33:57 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Wed, 25 Oct 2006 14:33:57 -0400
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <20061025123633.576140@leena>
References: <1161723346.4518.33.camel@rei.boston.devel.redhat.com>
	<20061025123633.576140@leena>
Message-ID: <8c1094290610251133k16a22d72n4452346f76331ebf@mail.gmail.com>

On 10/25/06, isplist at logicore.net <isplist at logicore.net> wrote:

> PS: I saw someone asking about sharing data on MySQL, that's something I'd
> love to do. In fact, I'd like to get rid of the big box IBM servers over using
> smaller blade servers. Problem is, the blade servers don't allow for much
> memory, from 512 to 2GB. the IBM's allow for 5GB's. But I wonder if I could
> still get away with many low memory MySQL servers sharing GFS storage?
> I would guess that one or more would write but that many could read.
>
> Mike
>

So far things seem to be working fairly well with multiple active
MySQL servers. You can't use the query cache (for obvious reasons) and
you can't use innodb tables but for the most part it's working well.
The one thing I ran into that I didn't anticipate was that after you
add, edit or remove a user or grants you need to flush the privileges
on all the server manually. I haven't found a configuration option to
tell MySQL not to cache those values so I'll probably just have to
either modify my scripts to automatically flush after these actions or
just have a cron job running on the nodes every 10 minutes or so to
keep everything in sync.

I did manage to get LVM DR working after some trouble initially. One
thing I should note, you probably want to enable persistence,
otherwise you really seem to take a performance hit.

Here's the script I use to check to see if a server is alive:

#!/bin/sh

TEST=`/usr/bin/mysqladmin --user=piranha --password=piranha ping
--host=$1 | grep -c "mysqld is alive"`

if [ $TEST == "1" ]; then
        echo "OK"
else
        echo "FAIL"
fi


One thing about servers with smaller amounts of RAM: it won't matter
how many small servers you have if you have queries that constantly
have to load large tables (mainly for sorts) into memory and you don't
have that much your server will probably crawl.

I should note we're just running our DNS servers (MyDNS) and our
spamassasin database on it, but so far no problems. It was even
inadvertently tested on night and everything worked perfectly. We'll
know more once some of our larger databases are moved over.


From dist-list at LEXUM.UMontreal.CA  Wed Oct 25 19:23:59 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Wed, 25 Oct 2006 15:23:59 -0400
Subject: [Linux-cluster] weird time stamp  in the message log file
Message-ID: <453FB9CF.4090100@lexum.umontreal.ca>

Our problem righ now is the speed of rsync + GFS.

Today looking in the message log file I saw that :

Oct 25 15:10:04 catanzaro sshd(pam_unix)[28352]: session opened for user
root by (uid=0)
Oct 25 19:12:23 catanzaro rsyncd[28269]: sent 69 bytes  received
11680096 bytes  total size 2019826667
Oct 25 15:12:52 catanzaro rsyncd[28438]: connect from
salerno.lan.lexum.pri (192.168.4.27)
Oct 25 15:12:52 catanzaro rsyncd[28438]: rsync to canlii-jur-pe from
rsync_salerno at salerno.lan.lexum.pri (192.168.4.27)
Oct 25 19:13:03 catanzaro rsyncd[28438]: sent 69 bytes  received 50125
bytes  total size 84377660
Oct 25 15:13:04 catanzaro rsyncd[28443]: connect from
salerno.lan.lexum.pri (192.168.4.27)
Oct 25 15:13:04 catanzaro rsyncd[28443]: rsync to canlii-jur-qc from
rsync_salerno at salerno.lan.lexum.pri (192.168.4.27)
Oct 25 19:14:02 catanzaro rsyncd[28443]: sent 69 bytes  received 3018179
bytes  total size 457621016
Oct 25 15:14:03 catanzaro rsyncd[28462]: connect from
salerno.lan.lexum.pri (192.168.4.27)
Oct 25 15:14:03 catanzaro rsyncd[28462]: rsync to canlii-jur-sk from
rsync_salerno at salerno.lan.lexum.pri (192.168.4.27)


As you can see there is some info from the future !

all servers are using LAN ntp server and are in sync (we are using
kerberos so time is important to us :) )

Does some of you already saw that kind of strange thinks ?


From lhh at redhat.com  Wed Oct 25 19:36:31 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 25 Oct 2006 15:36:31 -0400
Subject: [Linux-cluster] CS4 U2 / crash when "less /proc/cluster/status"
In-Reply-To: <453F94B0.1090006@redhat.com>
References: <453F88D5.9010101@bull.net>  <453F94B0.1090006@redhat.com>
Message-ID: <1161804991.4518.35.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-25 at 11:45 -0500, Robert Peterson wrote:
> Alain Moulle wrote:
> > Hi
> >
> > Strange, when you do :
> > less /proc/cluster/status
> > on one node of a HA pair, the system crashes !
> > If you use cat /proc/cluster/status, no crash.
> > Any explanation ?
> > Is it always true with CS4 U4 ?
> >
> > Thanks
> > Alain Moull?
> >   
> Hi Alain,
> 
> Please file a new bugzilla for this problem.  Assign it to 
> rpeterso at redhat.com
> I was able to recreate it on RHEL4 U4, and that surprises me.  
> Incidentally,
> cat /proc/cluster/status works just fine.  I'll investigate why less 
> doesn't.

I wonder if 'more' uses lseek() and less doesn't?

-- Lon


From mwill at penguincomputing.com  Wed Oct 25 20:02:12 2006
From: mwill at penguincomputing.com (Michael Will)
Date: Wed, 25 Oct 2006 13:02:12 -0700
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <8c1094290610251133k16a22d72n4452346f76331ebf@mail.gmail.com>
References: <1161723346.4518.33.camel@rei.boston.devel.redhat.com>	<20061025123633.576140@leena>
	<8c1094290610251133k16a22d72n4452346f76331ebf@mail.gmail.com>
Message-ID: <453FC2C4.2020109@jellyfish.highlyscyld.com>

Are the actual data files shared in this setup between the active mysql 
daemons?

Last time I looked into this it seemed that with shared-nothing model each
mysql daemon would have to keep it's own copy of the data and updates
would be propagated from active to passive daemons (master-slave model)
or between active daemons (ndb in-ram database model)

Are the mysql daemons running on the GFS I/O nodes that have access to 
shared
storage via SAN or iSCSI and coordinate locking through GFS 
infrastructure, or are
the mysql daemons running on client nodes that use GFS to remotely 
access storage
that is provided by other  GFS I/O nodes that in turn have access to shared
storage via SAN or iSCSI?

Michael

David Brieck Jr. wrote:
> On 10/25/06, isplist at logicore.net <isplist at logicore.net> wrote:
>
>> PS: I saw someone asking about sharing data on MySQL, that's 
>> something I'd
>> love to do. In fact, I'd like to get rid of the big box IBM servers 
>> over using
>> smaller blade servers. Problem is, the blade servers don't allow for 
>> much
>> memory, from 512 to 2GB. the IBM's allow for 5GB's. But I wonder if I 
>> could
>> still get away with many low memory MySQL servers sharing GFS storage?
>> I would guess that one or more would write but that many could read.
>>
>> Mike
>>
>
> So far things seem to be working fairly well with multiple active
> MySQL servers. You can't use the query cache (for obvious reasons) and
> you can't use innodb tables but for the most part it's working well.
> The one thing I ran into that I didn't anticipate was that after you
> add, edit or remove a user or grants you need to flush the privileges
> on all the server manually. I haven't found a configuration option to
> tell MySQL not to cache those values so I'll probably just have to
> either modify my scripts to automatically flush after these actions or
> just have a cron job running on the nodes every 10 minutes or so to
> keep everything in sync.
>
> I did manage to get LVM DR working after some trouble initially. One
> thing I should note, you probably want to enable persistence,
> otherwise you really seem to take a performance hit.
>
> Here's the script I use to check to see if a server is alive:
>
> #!/bin/sh
>
> TEST=`/usr/bin/mysqladmin --user=piranha --password=piranha ping
> --host=$1 | grep -c "mysqld is alive"`
>
> if [ $TEST == "1" ]; then
>        echo "OK"
> else
>        echo "FAIL"
> fi
>
>
>
> One thing about servers with smaller amounts of RAM: it won't matter
> how many small servers you have if you have queries that constantly
> have to load large tables (mainly for sorts) into memory and you don't
> have that much your server will probably crawl.
>
> I should note we're just running our DNS servers (MyDNS) and our
> spamassasin database on it, but so far no problems. It was even
> inadvertently tested on night and everything worked perfectly. We'll
> know more once some of our larger databases are moved over.
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Wed Oct 25 21:43:55 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Wed, 25 Oct 2006 16:43:55 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
Message-ID: <20061025164355.971031@leena>

Ok, here's a problem;

I have three physical volumes;

#pvscan
  PV /dev/sdc1   VG storec    lvm2 [745.76 GB / 0    free]
  PV /dev/sdb1   VG storeb    lvm2 [745.76 GB / 0    free]
  PV /dev/sda1   VG rimfire   lvm2 [572.72 GB / 0    free]
  Total: 3 [2.02 TB] / in use: 3 [2.02 TB] / in no VG: 0 [0   ]

I have three volume groups;

# vgscan
  Reading all physical volumes.  This may take a while...
  Found volume group "storec" using metadata type lvm2
  Found volume group "storeb" using metadata type lvm2
  Found volume group "rimfire" using metadata type lvm2

I have three separate logical volumes;
# lvscan
  ACTIVE            '/dev/storec/web' [745.76 GB] inherit
  ACTIVE            '/dev/storeb/qm' [745.76 GB] inherit
  ACTIVE            '/dev/rimfire/rimfire' [572.72 GB] inherit


# mount -t gfs /dev/rimfire/rimfire /gfs/rimfire/
mount: File exists

Why is it that I can only mount one at a time? I need at least two per node, 
non contiguous, separate.

Mike


From jason at monsterjam.org  Thu Oct 26 00:56:01 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Wed, 25 Oct 2006 20:56:01 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
Message-ID: <20061026005601.GA24086@monsterjam.org>

ok, I was just logging into the 2 nodes of my cluster, tf1 and tf2, I noticed that tf1 was NOT 
available via ssh, but tf2 was. tf1 was pingable, but that was it. I looked on tft2 and 
noticed that he had taken over the cluster virtual ip address 

2: eth0: <BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast qlen 1000
    link/ether 00:11:43:d7:c9:c6 brd ff:ff:ff:ff:ff:ff
    inet 192.168.1.6/24 brd 192.168.1.255 scope global eth0
    inet 192.168.1.7/32 scope global eth0
    inet6 fe80::211:43ff:fed7:c9c6/64 scope link 
       valid_lft forever preferred_lft forever

and in the syslog on tf2, I saw
Oct 25 20:26:00 tf2 kernel: CMAN: removing node tf1 from the cluster : Missed too many 
heartbeats
Oct 25 20:26:00 tf2 fenced[4091]: tf1 not a cluster member after 0 sec post_fail_delay
Oct 25 20:26:00 tf2 fenced[4091]: fencing node "tf1"
Oct 25 20:26:04 tf2 kernel: e100: eth2: e100_watchdog: link down
Oct 25 20:26:08 tf2 fenced[4091]: fence "tf1" success
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Trying to acquire journal 
lock...
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Looking at journal...
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Acquiring the transaction 
lock...
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Replaying journal...
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Replayed 0 of 11 blocks
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: replays = 0, skips = 0, sames 
= 11
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Journal replayed in 1s
Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Done
Oct 25 20:26:27 tf2 clurgmgrd[4903]: <info> Magma Event: Membership Change 
Oct 25 20:26:27 tf2 clurgmgrd[4903]: <info> State change: tf1 DOWN 
Oct 25 20:26:27 tf2 clurgmgrd[4903]: <notice> Starting stopped service Apache Service 
Oct 25 20:26:29 tf2 httpd: httpd startup succeeded
Oct 25 20:26:29 tf2 clurgmgrd[4903]: <notice> Service Apache Service started 
Oct 25 20:26:36 tf2 kernel: e100: eth2: e100_watchdog: link up, 100Mbps, full-duplex
Oct 25 20:28:08 tf2 kernel: e100: eth2: e100_watchdog: link down
Oct 25 20:28:10 tf2 kernel: e100: eth2: e100_watchdog: link up, 100Mbps, full-duplex
Oct 25 20:29:40 tf2 kernel: CMAN: node tf1 rejoining

so i noticed that after a few more mins, tf1 *appeared* to be rebooting,
and I saw this in the syslog of tf2

Oct 25 20:34:25 tf2 kernel: CMAN: too many transition restarts - will die
Oct 25 20:34:25 tf2 kernel: CMAN: we are leaving the cluster. Inconsistent cluster view
Oct 25 20:34:25 tf2 kernel: WARNING: dlm_emergency_shutdown
Oct 25 20:34:25 tf2 clurgmgrd[4903]: <warning> #67: Shutting down uncleanly 
Oct 25 20:34:25 tf2 kernel: WARNING: dlm_emergency_shutdown
Oct 25 20:34:25 tf2 kernel: SM: 00000001 sm_stop: SG still joined
Oct 25 20:34:25 tf2 kernel: SM: 01000002 sm_stop: SG still joined
Oct 25 20:34:25 tf2 kernel: SM: 02000004 sm_stop: SG still joined
Oct 25 20:34:25 tf2 kernel: SM: 03000005 sm_stop: SG still joined
Oct 25 20:34:25 tf2 ccsd[3988]: Cluster manager shutdown.  Attemping to reconnect... 
Oct 25 20:34:26 tf2 httpd: httpd shutdown succeeded
Oct 25 20:34:26 tf2 kernel: parted nodes
Oct 25 20:34:26 tf2 kernel: clvmd rebuilt 0 resources
Oct 25 20:34:26 tf2 kernel: clvmd purge requests
Oct 25 20:34:26 tf2 kernel: clvmd purged 0 requests
Oct 25 20:34:26 tf2 kernel: clvmd mark waiting requests
Oct 25 20:34:26 tf2 kernel: clvmd marked 0 requests
Oct 25 20:34:26 tf2 kernel: clvmd purge locks of departed nodes
Oct 25 20:34:26 tf2 kernel: lv1 purged 1 locks
Oct 25 20:34:26 tf2 kernel: lv1 update remastered resources
Oct 25 20:34:26 tf2 kernel: clvmd purged 0 locks
Oct 25 20:34:26 tf2 kernel: clvmd update remastered resources
Oct 25 20:34:26 tf2 kernel: clvmd updated 1 resources
Oct 25 20:34:26 tf2 kernel: clvmd rebuild locks
Oct 25 20:34:26 tf2 kernel: clvmd rebuilt 0 locks
Oct 25 20:34:26 tf2 kernel: clvmd recover event 7 done
Oct 25 20:34:26 tf2 kernel: Magma move flags 0,0,1 ids 6,7,7
Oct 25 20:34:26 tf2 kernel: Magma process held requests
Oct 25 20:34:26 tf2 kernel: Magma processed 0 requests
Oct 25 20:34:26 tf2 kernel: Magma resend marked requests
Oct 25 20:34:26 tf2 kernel: Magma resend 6403d9 lq 1 flg 200000 node -1/-1 "usrm::vf"
Oct 25 20:34:26 tf2 kernel: Magma resent 1 requests
Oct 25 20:34:26 tf2 kernel: Magma recover event 7 finished
Oct 25 20:34:26 tf2 kernel: clvmd move flags 0,0,1 ids 2,7,7
Oct 25 20:34:26 tf2 kernel: clvmd process held requests
Oct 25 20:34:26 tf2 kernel: clvmd processed 0 requests
Oct 25 20:34:26 tf2 kernel: clvmd resend marked requests
Oct 25 20:34:26 tf2 kernel: clvmd resent 0 requests
Oct 25 20:34:26 tf2 kernel: clvmd recover event 7 finished
Oct 25 20:34:26 tf2 kernel: lv1 updated 525 resources
Oct 25 20:34:26 tf2 kernel: lv1 rebuild locks
Oct 25 20:34:26 tf2 kernel: lv1 rebuilt 0 locks
Oct 25 20:34:26 tf2 kernel: lv1 recover event 7 done
Oct 25 20:34:26 tf2 kernel: lv1 move flags 0,0,1 ids 3,7,7
Oct 25 20:34:26 tf2 kernel: lv1 process held requests
Oct 25 20:34:26 tf2 kernel: lv1 processed 0 requests
Oct 25 20:34:26 tf2 kernel: lv1 resend marked requests
Oct 25 20:34:26 tf2 kernel: lv1 resent 0 requests
Oct 25 20:34:26 tf2 kernel: lv1 recover event 7 finished
Oct 25 20:34:26 tf2 kernel: 4189 pr_start last_stop 0 last_start 4 last_finish 0
Oct 25 20:34:26 tf2 kernel: 4189 pr_start count 2 type 2 event 4 flags 250
Oct 25 20:34:26 tf2 kernel: 4189 claim_jid 1
Oct 25 20:34:26 tf2 kernel: 4189 pr_start 4 done 1
Oct 25 20:34:26 tf2 kernel: 4189 pr_finish flags 5a
Oct 25 20:34:26 tf2 kernel: 4168 recovery_done jid 1 msg 309 a
Oct 25 20:34:26 tf2 kernel: 4168 recovery_done nodeid 2 flg 18
Oct 25 20:34:26 tf2 kernel: 4189 pr_start last_stop 4 last_start 7 last_finish 4
Oct 25 20:34:26 tf2 kernel: 4189 pr_start count 1 type 1 event 7 flags 21a
Oct 25 20:34:26 tf2 kernel: 4189 pr_start cb jid 0 id 1
Oct 25 20:34:26 tf2 kernel: 4189 pr_start 7 done 0
Oct 25 20:34:26 tf2 kernel: 4192 recovery_done jid 0 msg 309 11a
Oct 25 20:34:26 tf2 kernel: 4192 recovery_done nodeid 1 flg 1b
Oct 25 20:34:26 tf2 kernel: 4192 recovery_done start_done 7
Oct 25 20:34:26 tf2 kernel: 4189 pr_finish flags 1a
Oct 25 20:34:26 tf2 kernel: 
Oct 25 20:34:26 tf2 kernel: lock_dlm:  Assertion failed on line 428 of file 
/usr/src/redhat/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/lock.c
Oct 25 20:34:26 tf2 kernel: lock_dlm:  assertion:  "!error"
Oct 25 20:34:26 tf2 kernel: lock_dlm:  time = 623964971
Oct 25 20:34:26 tf2 kernel: lv1: num=2,1a err=-22 cur=-1 req=3 lkf=10000
Oct 25 20:34:26 tf2 kernel: 
Oct 25 20:34:26 tf2 kernel: ------------[ cut here ]------------
Oct 25 20:34:26 tf2 kernel: kernel BUG at 
/usr/src/redhat/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/lock.c:428!
Oct 25 20:34:26 tf2 kernel: invalid operand: 0000 [#1]
Oct 25 20:34:26 tf2 kernel: SMP 
Oct 25 20:34:26 tf2 kernel: Modules linked in: dcdipm(U) dcdbas(U) parport_pc lp parport 
autofs4 i2c_dev i2c_core lock_dlm(U) gfs(U) lock_harness(U) dlm(U) cman(U) md5 ipv6 sunrpc 
button battery ac uhci_hcd ehci_hcd hw_random shpchp eepro100 e100 mii e1000 floppy sg ext3 
jbd dm_mod aic7xxx megaraid_mbox megaraid_mm sd_mod scsi_mod
Oct 25 20:34:26 tf2 kernel: CPU:    2
Oct 25 20:34:26 tf2 kernel: EIP:    0060:[<f8acc779>]    Tainted: P      VLI
Oct 25 20:34:26 tf2 kernel: EFLAGS: 00010246   (2.6.9-34.ELsmp) 
Oct 25 20:34:26 tf2 kernel: EIP is at do_dlm_lock+0x134/0x14e [lock_dlm]
Oct 25 20:34:26 tf2 kernel: eax: 00000001   ebx: ffffffea   ecx: f1be9d50   edx: f8ad115f
Oct 25 20:34:26 tf2 kernel: esi: f8acc798   edi: f7e7da00   ebp: c2355b00   esp: f1be9d4c
Oct 25 20:34:26 tf2 kernel: ds: 007b   es: 007b   ss: 0068
Oct 25 20:34:26 tf2 kernel: Process umount (pid: 13456, threadinfo=f1be9000 task=f66c7230)
Oct 25 20:34:26 tf2 kernel: Stack: f8ad115f 20202020 32202020 20202020 20202020 20202020 
61312020 f1f40018 
Oct 25 20:34:26 tf2 kernel:        f1f422b8 c2355b00 00000003 00000000 c2355b00 f8acc828 
00000003 f8ad4860 
Oct 25 20:34:26 tf2 kernel:        f8b20000 f8bf45b2 00000008 00000001 f4fbc5c4 f4fbc5a8 
f8b20000 f8bea5cd 
Oct 25 20:34:26 tf2 kernel: Call Trace:
Oct 25 20:34:26 tf2 kernel:  [<f8acc828>] lm_dlm_lock+0x49/0x52 [lock_dlm]
Oct 25 20:34:26 tf2 kernel:  [<f8bf45b2>] gfs_lm_lock+0x35/0x4d [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8bea5cd>] gfs_glock_xmote_th+0x130/0x172 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8be9c91>] rq_promote+0xc8/0x147 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8be9e7d>] run_queue+0x91/0xc1 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8beae88>] gfs_glock_nq+0xcf/0x116 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8beb40f>] gfs_glock_nq_init+0x13/0x26 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8c02e64>] gfs_permission+0x0/0x61 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8c02e9e>] gfs_permission+0x3a/0x61 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<f8c02e64>] gfs_permission+0x0/0x61 [gfs]
Oct 25 20:34:26 tf2 kernel:  [<c0165870>] permission+0x2b/0x4f
Oct 25 20:34:26 tf2 kernel:  [<c0165dbf>] __link_path_walk+0x148/0xbb5
Oct 25 20:34:26 tf2 kernel:  [<c016686f>] link_path_walk+0x43/0xbe
Oct 25 20:34:26 tf2 kernel:  [<c0150309>] do_brk+0x1f2/0x22c
Oct 25 20:34:26 tf2 kernel:  [<c0166c04>] path_lookup+0x14b/0x17f
Oct 25 20:34:26 tf2 kernel:  [<c0166d4c>] __user_walk+0x21/0x51
Oct 25 20:34:26 tf2 kernel:  [<c0162460>] sys_readlink+0x20/0x82
Oct 25 20:34:26 tf2 kernel:  [<c0150309>] do_brk+0x1f2/0x22c
Oct 25 20:34:26 tf2 kernel:  [<c011ad21>] do_page_fault+0x0/0x5c6
Oct 25 20:34:26 tf2 kernel:  [<c02d2657>] syscall_call+0x7/0xb
Oct 25 20:34:26 tf2 kernel: Code: 26 50 0f bf 45 24 50 53 ff 75 08 ff 75 04 ff 75 0c ff 77 18 
68 8a 12 ad f8 e8 ce 5e 65 c7 83 c4 38 68 5f 11 ad f8 e8 c1 5e 65 c7 <0f> 0b ac 01 a7 10 ad f8 
68 61 11 ad f8 e8 7c 56 65 c7 83 c4 20 
Oct 25 20:34:26 tf2 kernel:  <0>Fatal exception: panic in 5 seconds

and now tf2 is  unreachable too.. 
ideas? suggestions?
 

Jason


From orkcu at yahoo.com  Thu Oct 26 02:10:16 2006
From: orkcu at yahoo.com (Roger Pe�a Escobio)
Date: Wed, 25 Oct 2006 19:10:16 -0700 (PDT)
Subject: [Linux-cluster] Fencing Hardware
In-Reply-To: <453E8816.8070307@redhat.com>
Message-ID: <20061026021016.12712.qmail@web50602.mail.yahoo.com>


--- James Parsons <jparsons at redhat.com> wrote:

> Roger PeXa Escobio wrote:
> 
> >--- "isplist at logicore.net" <isplist at logicore.net>
> >wrote:
> >
> >  
> >
> >>Each blade has one slot for an add on card which
> >>I've taken up for Fibre 
> >>Channel cards. In my case at least, there's no
> room
> >>for anything else and I'm 
> >>not sure there are any packages that handle
> external
> >>power controls for my 
> >>machines.
> >>    
> >>
> >what kind of blade do you have? HP? Dell? other
> >branch?
> >
> He is using cubix blades. I checked into their cubix
> management 
> controller 2 device for remote poweroff/on...IT ONLY
> WORKS WITH INTERNET 
> EXPLORER!!! YIKES!

after a search in google
I found this page:
http://cubix.com/cv/cvmanagementspecs.php

CubixVision remotely monitors and controls the
following:

    * Blade server reset
    * Blade server power off/on 
although it look like this application run over IE-6
(java application?)
chapter 8 in this url:
http://cubix.com/support/product/chassis/bladestation/
should give details of how manage the blades with this
software
but if this is the case  I think there isn't a way to
implement a fency mechanism, am I correct?

cu
roger

__________________________________________
RedHat Certified Engineer ( RHCE )
Cisco Certified Network Associate ( CCNA )

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From cosimo at streppone.it  Thu Oct 26 06:48:57 2006
From: cosimo at streppone.it (Cosimo Streppone)
Date: Thu, 26 Oct 2006 08:48:57 +0200
Subject: [Linux-cluster] Samba share resource fails to start after upgrade
	to Cluster Suite 4U4
In-Reply-To: <4538E16F.3060709@streppone.it>
References: <4538E16F.3060709@streppone.it>
Message-ID: <45405A59.1060806@streppone.it>

Me wrote:

> Hi all clustering guys,
> 
> I'm experiencing a problem with a 2 node cluster just upgraded
> (this morning) to current RHEL 4U4 and CS.
> 
> I don't know why it searches for "/etc/samba/smb.conf.//share/exportdb"...
> Maybe something changed in the way smb shares are managed?

I'm opening a service request.
I'll try to report the results on list.

-- 
Cosimo


From pcaulfie at redhat.com  Thu Oct 26 07:38:04 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 26 Oct 2006 08:38:04 +0100
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <20061026005601.GA24086@monsterjam.org>
References: <20061026005601.GA24086@monsterjam.org>
Message-ID: <454065DC.8020706@redhat.com>

jason at monsterjam.org wrote:
> ok, I was just logging into the 2 nodes of my cluster, tf1 and tf2, I noticed that tf1 was NOT 
> available via ssh, but tf2 was. tf1 was pingable, but that was it. I looked on tft2 and 
> noticed that he had taken over the cluster virtual ip address 
> 
> 2: eth0: <BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast qlen 1000
>     link/ether 00:11:43:d7:c9:c6 brd ff:ff:ff:ff:ff:ff
>     inet 192.168.1.6/24 brd 192.168.1.255 scope global eth0
>     inet 192.168.1.7/32 scope global eth0
>     inet6 fe80::211:43ff:fed7:c9c6/64 scope link 
>        valid_lft forever preferred_lft forever
> 
> and in the syslog on tf2, I saw
> Oct 25 20:26:00 tf2 kernel: CMAN: removing node tf1 from the cluster : Missed too many 
> heartbeats

That's the important message in all this, all the rest is just consequence.

The node has been kicked out of the cluster for not sending heartbeats in a timely manner. Ether the network is fearfully busy or
the one of nodes is.

-- 

patrick


From pcaulfie at redhat.com  Thu Oct 26 08:10:06 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 26 Oct 2006 09:10:06 +0100
Subject: [Linux-cluster] CS4 U2 / crash when "less /proc/cluster/status"
In-Reply-To: <1161804991.4518.35.camel@rei.boston.devel.redhat.com>
References: <453F88D5.9010101@bull.net> <453F94B0.1090006@redhat.com>
	<1161804991.4518.35.camel@rei.boston.devel.redhat.com>
Message-ID: <45406D5E.9070603@redhat.com>

Lon Hohberger wrote:
> On Wed, 2006-10-25 at 11:45 -0500, Robert Peterson wrote:
>> Alain Moulle wrote:
>>> Hi
>>>
>>> Strange, when you do :
>>> less /proc/cluster/status
>>> on one node of a HA pair, the system crashes !
>>> If you use cat /proc/cluster/status, no crash.
>>> Any explanation ?
>>> Is it always true with CS4 U4 ?
>>>
>>> Thanks
>>> Alain Moull?
>>>   
>> Hi Alain,
>>
>> Please file a new bugzilla for this problem.  Assign it to 
>> rpeterso at redhat.com
>> I was able to recreate it on RHEL4 U4, and that surprises me.  
>> Incidentally,
>> cat /proc/cluster/status works just fine.  I'll investigate why less 
>> doesn't.
> 
> I wonder if 'more' uses lseek() and less doesn't?

less reads the start of the file in a small chunk (64 bytes I think) and the code in proc_cluster_status doesn't stop when it
reaches the 'length' parameter (slapped wrist).

That routine should really be changed to use seq_file like the rest of them.

It probably used to be OK, but more and more stuff has been added to the /proc file over time and it's a lot bigger than it used to be !


-- 

patrick


From riaan at obsidian.co.za  Thu Oct 26 10:10:45 2006
From: riaan at obsidian.co.za (Riaan van Niekerk)
Date: Thu, 26 Oct 2006 12:10:45 +0200
Subject: [Linux-cluster] GFS Data Journaling - answers
Message-ID: <454089A5.5060009@obsidian.co.za>

hi all

I asked Red Hat Global Support Services some questions about GFS Data 
Journaling. From what Red Hat tell me, I can sumarize:

a) only benefits writes, not reads
b) cannot be applied retroactively on non-empty files (I can think of a 
very cludgy way, touch newfile ; [set property using gfstool] ; cat 
oldfile > newfile ; mv newfile oldfile ). but this is not something I 
would like to do on 3 TB of small files
c) we would have to test extensively to tell if it is worth the effort
d) it might even make performance worse since it will place higher I/O 
load on the SAN / SP.
e) it would be extremely difficult to do a representative simulation. It 
might perform better on a non-starved RAID controller or SAN SP, but the 
performance might drop when applying the change on our production, 
starved SP.

(SP = Storage Processor, EMC speak for the thing that handles the I/O in 
the SAN storage array)

E.g. in our environment at least, we can cross this off our list as an 
avenue of exploration for increasing performance.

greetings
Riaan

...

Eduardo:

I have talked to one of the GFS developers regarding this issue. I'll 
answer your questions one by one:


a) is it safe to do?

It is safe, but it is not recommended do be done on a production 
filesystem. You should create a new filesystem and experiment with it.

b) does the change favour mostly writes?

Yes, but Beware! This change will only affect the fsync() calls. The 
developer says that you will have quicker fsync() calls, BUT on the 
other hand, you will have more I/O. Now, if the SAN is saturated on I/O, 
or on CPU needed for I/O, then it might be that it will be actually 
slower for you.

c) any ballpark performance improvement we can expect

Engineering does not give you any measurable numbers. They say that the 
fsyncs are quicker because of the higher i/o, and that in some cases the 
performance can be better or worse, you would need to create a test case 
and see how it pans out for you.

d) I see "gfs_tool setflag jdata" will only have an effect on 
zero-length files. is there any way to run "gfs_tool setflag jdata" 
against existing files with some kind of copy/cat/mv operation to have 
the existing mail data benefit from this changed property?

the developer says that it is not possible.

e) is there any way to see if the flag has been set on a directory / 
file? (I only see setflag/clearflag operations, not any kind "getflag"

The developer says that you can run gfs_tool stat on a file, and see if 
the jdata flag is active for the file:

# gfs_tool stat /mnt/gfs/x | grep jdata
   jdata
   inherit_jdata

Therefore you can see if the jdata, and inherit_jdata is active per inode.


As a last comment on this issue. I have asked the engineers if this 
option gives any performance increase in READs. The answer is NO. So I 
believe that if the system is getting maxed out on reads or 75% reads, 
there is very little performance boost this feature can provide, and 
might even cause drops in performance. The recommended method is to 
create a test environment and see how it pans out.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: riaan.vcf
Type: text/x-vcard
Size: 310 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061026/224adeb3/attachment.vcf>

From hlawatschek at atix.de  Thu Oct 26 10:11:34 2006
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Thu, 26 Oct 2006 12:11:34 +0200
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <20061025123633.576140@leena>
References: <20061025123633.576140@leena>
Message-ID: <200610261211.34421.hlawatschek@atix.de>

Hi Mike,

> Well, like anyone, I just want to use my servers in a better manner. Rather
> than constantly having to upgrade, I prefer the idea of just adding a node
> or more when needed. I have a large number of blade servers (left over from
> a kaput ISP) and so have been using those.

You could create a gfs based diskless shared root cluster. That means, all 
nodes are booting from the same shared storage boot device and share the same 
root device. That'll give you a filesystem based ssi and the flexibility you 
need.
>
> As such, I've created multiple clusters which all share GFS filesystems. I
> have web, mail, mysql clusters which all share data. I've written basic
> scripts to allow for the various needs of the different machines.

To achieve the maximum flexibility, you could create one physical cluster that 
hosts different shared root devices which can be used as shared root devices 
for different "sub" - clusters (web, mail, mysql).
>
> Things seem to work well but I'm just wondering if I'm missing more of the
> potential of the Linux Cluster.

have a look at http://www.open-sharedroot.org
and http://www.redhat.com/magazine/021jul06/features/gfs_update/

Mark

-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From isplist at logicore.net  Thu Oct 26 13:22:14 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 08:22:14 -0500
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <454065DC.8020706@redhat.com>
Message-ID: <2006102682214.930963@leena>

I missed the original on this but I had something interesting happen 
yesterday. I installed a new node, introduced it into the cluster and 
immediately, all of the other nodes died with a kernel panic. Not sure why 
yet.

Mike


On Thu, 26 Oct 2006 08:38:04 +0100, Patrick Caulfield wrote:
> jason at monsterjam.org wrote:
> 
>> ok, I was just logging into the 2 nodes of my cluster, tf1 and tf2, I
>> noticed that tf1 was NOT
>> available via ssh, but tf2 was. tf1 was pingable, but that was it. I
>> looked on tft2 and
>> noticed that he had taken over the cluster virtual ip address
>> 
>> 2: eth0: <BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast qlen 1000
>> link/ether 00:11:43:d7:c9:c6 brd ff:ff:ff:ff:ff:ff
>> inet 192.168.1.6/24 brd 192.168.1.255 scope global eth0
>> inet 192.168.1.7/32 scope global eth0
>> inet6 fe80::211:43ff:fed7:c9c6/64 scope link
>> valid_lft forever preferred_lft forever
>> 
>> and in the syslog on tf2, I saw
>> Oct 25 20:26:00 tf2 kernel: CMAN: removing node tf1 from the cluster :
>> Missed too many
>> heartbeats
>> 
> That's the important message in all this, all the rest is just consequence.
> 
> The node has been kicked out of the cluster for not sending heartbeats in a
> timely manner. Ether the network is fearfully busy or
> the one of nodes is.


From marek.dabrowski at infor.pl  Thu Oct 26 13:35:53 2006
From: marek.dabrowski at infor.pl (Marek Dabrowski)
Date: Thu, 26 Oct 2006 15:35:53 +0200
Subject: [Linux-cluster] GFS+ACL
Message-ID: <4540B9B9.40107@infor.pl>

Hello

I have 2 node cluster with GFS. Some strange problem occur - I set acl 
rigths on some directorires. After about a few minutes, acls whitch I 
set was deleted and appear ald acls. I dont know how this happened? Any 
idea?

Regards and sorry my english
Marek


From pcaulfie at redhat.com  Thu Oct 26 13:37:14 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 26 Oct 2006 14:37:14 +0100
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <2006102682214.930963@leena>
References: <2006102682214.930963@leena>
Message-ID: <4540BA0A.5040305@redhat.com>

isplist at logicore.net wrote:
> I missed the original on this but I had something interesting happen 
> yesterday. I installed a new node, introduced it into the cluster and 
> immediately, all of the other nodes died with a kernel panic. Not sure why 
> yet.

That sounds like the U3 (inconsistent cluster view) bug. it's fixed in the latest builds.

-- 

patrick


From isplist at logicore.net  Thu Oct 26 13:47:37 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 08:47:37 -0500
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <200610261211.34421.hlawatschek@atix.de>
Message-ID: <2006102684737.339594@leena>

> You could create a gfs based diskless shared root cluster. That means, all
> nodes are booting from the same shared storage boot device and share the
> same root device. That'll give you a filesystem based ssi and the 
> flexibility you need.

That is exactly what I'd like to do. The nodes cannot boot from a diskette 
however but I do have network boot available to me.

> To achieve the maximum flexibility, you could create one physical cluster
> that hosts different shared root devices which can be used as shared root 
> devices for different "sub" - clusters (web, mail, mysql).

So, concatenate all of the drive space and form multiple roots? Then just add 
storage space to the overall storage on the fiber channel side? I've been 
doing it the other way around, I've been creating multiple allocations of the 
centralized storage so that I would have a web root, a mail root, etc.
 
> have a look at http://www.open-sharedroot.org
> and http://www.redhat.com/magazine/021jul06/features/gfs_update/

U certainly will, thank you very much for the input Mark.

Mike


From isplist at logicore.net  Thu Oct 26 13:49:25 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 08:49:25 -0500
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <4540BA0A.5040305@redhat.com>
Message-ID: <2006102684925.864940@leena>

I'll update everything, boot the new node and let you know how it goes. 

Mike


>> I missed the original on this but I had something interesting happen
>> yesterday. I installed a new node, introduced it into the cluster and
>> immediately, all of the other nodes died with a kernel panic. Not sure why
>> yet.
>> 
> That sounds like the U3 (inconsistent cluster view) bug. it's fixed in the
> latest builds.


From dbrieck at gmail.com  Thu Oct 26 14:11:09 2006
From: dbrieck at gmail.com (David Brieck Jr.)
Date: Thu, 26 Oct 2006 10:11:09 -0400
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <453FC2C4.2020109@jellyfish.highlyscyld.com>
References: <1161723346.4518.33.camel@rei.boston.devel.redhat.com>
	<20061025123633.576140@leena>
	<8c1094290610251133k16a22d72n4452346f76331ebf@mail.gmail.com>
	<453FC2C4.2020109@jellyfish.highlyscyld.com>
Message-ID: <8c1094290610260711g20c33bfer74042ed7c6b055ac@mail.gmail.com>

On 10/25/06, Michael Will <mwill at penguincomputing.com> wrote:
> Are the actual data files shared in this setup between the active mysql
> daemons?
>
> Last time I looked into this it seemed that with shared-nothing model each
> mysql daemon would have to keep it's own copy of the data and updates
> would be propagated from active to passive daemons (master-slave model)
> or between active daemons (ndb in-ram database model)
>
> Are the mysql daemons running on the GFS I/O nodes that have access to
> shared
> storage via SAN or iSCSI and coordinate locking through GFS
> infrastructure, or are
> the mysql daemons running on client nodes that use GFS to remotely
> access storage
> that is provided by other  GFS I/O nodes that in turn have access to shared
> storage via SAN or iSCSI?
>
> Michael
>

We're using GNBD for the nodes to connect to the storage. We don't
have the fastest storage setup right now, but I'm hopeful that if
everything works well we'll be purchasing a faster storage setup.

As far as MySQL using GFS (excluding anything with active-active) and
using DLM to do locks, here are some comparisons:

Benchmark on GFS

Benchmark DBD suite: 2.15
Date of test:        2006-10-26  9:49:43
Running tests on:    Linux 2.6.9-42.0.2.ELhugemem i686
Arguments:           --small-test --tcpip --fast --fast-insert --lock-tables
Comments:
Limits from:
Server version:      MySQL 4.1.20/
Optimization:        None
Hardware:

alter-table: Total time: 94 wallclock secs ( 0.02 usr  0.01 sys +
0.00 cusr  0.00 csys =  0.03 CPU)
big-tables: Total time:  4 wallclock secs ( 0.13 usr  0.14 sys +  0.00
cusr  0.00 csys =  0.27 CPU)
connect: Total time:  5 wallclock secs ( 0.38 usr  0.53 sys +  0.00
cusr  0.00 csys =  0.91 CPU)
create: Total time:  8 wallclock secs ( 0.02 usr  0.01 sys +  0.00
cusr  0.00 csys =  0.03 CPU)
insert: Total time: 17 wallclock secs ( 2.19 usr  1.99 sys +  0.00
cusr  0.00 csys =  4.18 CPU)
select: Total time: 13 wallclock secs ( 2.36 usr  1.03 sys +  0.00
cusr  0.00 csys =  3.39 CPU)

Benchmark on Local

alter-table: Total time: 70 wallclock secs ( 0.02 usr  0.00 sys +
0.00 cusr  0.00 csys =  0.02 CPU)
big-tables: Total time:  2 wallclock secs ( 0.11 usr  0.14 sys +  0.00
cusr  0.00 csys =  0.25 CPU)
connect: Total time:  4 wallclock secs ( 0.37 usr  0.55 sys +  0.00
cusr  0.00 csys =  0.92 CPU)
create: Total time:  1 wallclock secs ( 0.01 usr  0.00 sys +  0.00
cusr  0.00 csys =  0.01 CPU)
insert: Total time: 13 wallclock secs ( 2.27 usr  1.95 sys +  0.00
cusr  0.00 csys =  4.22 CPU)
select: Total time: 12 wallclock secs ( 2.21 usr  0.97 sys +  0.00
cusr  0.00 csys =  3.18 CPU)

It's pretty darn close and I'm willing to take a small performance hit.

Here's some relevant info: local storage is RAID5 and GFS is RAID10
and shared using CLVM, multipath, and GNBD. So the speed of the test
locally would probably be faster if it were either RAID1 or 10, not 5.


From lhh at redhat.com  Thu Oct 26 14:14:31 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:14:31 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <20061026005601.GA24086@monsterjam.org>
References: <20061026005601.GA24086@monsterjam.org>
Message-ID: <1161872071.4518.55.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-25 at 20:56 -0400, jason at monsterjam.org wrote:
> ok, I was just logging into the 2 nodes of my cluster, tf1 and tf2, I noticed that tf1 was NOT 
> available via ssh, but tf2 was. tf1 was pingable, but that was it. I looked on tft2 and 
> noticed that he had taken over the cluster virtual ip address 
> 
> 2: eth0: <BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast qlen 1000
>     link/ether 00:11:43:d7:c9:c6 brd ff:ff:ff:ff:ff:ff
>     inet 192.168.1.6/24 brd 192.168.1.255 scope global eth0
>     inet 192.168.1.7/32 scope global eth0
>     inet6 fe80::211:43ff:fed7:c9c6/64 scope link 
>        valid_lft forever preferred_lft forever

Well, I can walk through what happened here.

> Oct 25 20:26:00 tf2 kernel: CMAN: removing node tf1 from the cluster : Missed too many 
> heartbeats

Node died for some reason.

> Oct 25 20:26:00 tf2 fenced[4091]: tf1 not a cluster member after 0 sec post_fail_delay
> Oct 25 20:26:00 tf2 fenced[4091]: fencing node "tf1"
> Oct 25 20:26:04 tf2 kernel: e100: eth2: e100_watchdog: link down
> Oct 25 20:26:08 tf2 fenced[4091]: fence "tf1" success

^^ Fence recovery.

> Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Trying to acquire journal 
> lock...
...
> Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Done

^^ GFS recovery

> Oct 25 20:26:27 tf2 clurgmgrd[4903]: <info> Magma Event: Membership Change 
> Oct 25 20:26:27 tf2 clurgmgrd[4903]: <info> State change: tf1 DOWN 

^^ Rgmanager recovery

> Oct 25 20:26:27 tf2 clurgmgrd[4903]: <notice> Starting stopped service Apache Service 
> Oct 25 20:26:29 tf2 httpd: httpd startup succeeded
> Oct 25 20:26:29 tf2 clurgmgrd[4903]: <notice> Service Apache Service started 
> Oct 25 20:26:36 tf2 kernel: e100: eth2: e100_watchdog: link up, 100Mbps, full-duplex
> Oct 25 20:28:08 tf2 kernel: e100: eth2: e100_watchdog: link down
> Oct 25 20:28:10 tf2 kernel: e100: eth2: e100_watchdog: link up, 100Mbps, full-duplex
> Oct 25 20:29:40 tf2 kernel: CMAN: node tf1 rejoining

^^ CMAN restarted on tf1 (rebooted)


> Oct 25 20:34:25 tf2 kernel: CMAN: too many transition restarts - will die
> Oct 25 20:34:25 tf2 kernel: CMAN: we are leaving the cluster. Inconsistent cluster view

Argh.  That's not good.  I *think* this is a bug in CMAN-kernel in U3,
which was fixed in U4.

> Oct 25 20:34:26 tf2 kernel: lock_dlm:  Assertion failed on line 428 of file 
> /usr/src/redhat/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/lock.c
> Oct 25 20:34:26 tf2 kernel: lock_dlm:  assertion:  "!error"
...
> Oct 25 20:34:26 tf2 kernel: ------------[ cut here ]------------
...
> Oct 25 20:34:26 tf2 kernel:  <0>Fatal exception: panic in 5 seconds
> 
> and now tf2 is  unreachable too.. 
> ideas? suggestions?

The panic above is a bug in the dlm-kernel rpm/package; I don't know
much more than that.  When a machine panics, it stops responding to
things over the network.

-- Lon


From lhh at redhat.com  Thu Oct 26 14:18:13 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:18:13 -0400
Subject: [Linux-cluster] virtual address went down?
In-Reply-To: <20061018203208.GA28571@monsterjam.org>
References: <20061018013715.GA31630@monsterjam.org>
	<1161202963.6685.71.camel@rei.boston.devel.redhat.com>
	<20061018203208.GA28571@monsterjam.org>
Message-ID: <1161872293.4518.59.camel@rei.boston.devel.redhat.com>

On Wed, 2006-10-18 at 16:32 -0400, jason at monsterjam.org wrote:
> > > so Ive had a test cluster running for quite a while now, both nodes of a 2 node cluster are up, 
> > > but the virtual address seems to have disappeared.. its not pingable, neither server has it 
> > > configured anymore..
> > 
> > You're using 'ip addr list', not 'ifconfig' to look for it, right?
> actually, I was using ifconfig, I forgot that you need to use ip addr 
> list.. my mistake.. but the address was for sure not pingable.
> 
> > 
> > > The only application I had using the virtual address was apache (just for 
> > > testing it). what logs/information should I be looking at to see what happened and why?
> > 
> > Firewall bits?
> not sure what you mean here.. 

IPTables.  Though, if it was working and it's available via ip addr
list, it should still be working.

When this happens, does the IP still appear in 'ip addr list' ?  What
about running 'clustat' -- is the service started, or did it perhaps
fail?

> > What kind of ethernet cards do you have, out of curiosity, and are they
> > in any sort of bonding configuration?
> > 
> [jason at tf1 ~]$ lspci | grep -i ether
> 02:0c.0 Ethernet controller: Intel Corporation 82557/8/9 [Ethernet Pro 
> 100] (rev 0d)
> 06:07.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit 
> Ethernet Controller (rev 05)
> 07:08.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit 
> Ethernet Controller (rev 05)
> 
> im not doing any bonding.. 

Ok, there was/is a bug with e1000 in bonding configurations if you have
4+ of the cards in the system.  That's what I was looking for.  Since
your configuration has none of the above criteria, it's unlikely to be
that problem.

-- Lon


From lhh at redhat.com  Thu Oct 26 14:36:38 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:36:38 -0400
Subject: [Linux-cluster] RE: Cluster Suite 4 failover problem
In-Reply-To: <453A27B7.6050605@yahoo.com.hk>
References: <453A27B7.6050605@yahoo.com.hk>
Message-ID: <1161873399.4518.78.camel@rei.boston.devel.redhat.com>

On Sat, 2006-10-21 at 21:59 +0800, Dicky wrote:
> HI  Jeff & Lon,
> 
> Thanks for the reply.
> 
> Regarding the didn't failover issue (just displayed the "Owner --> 
> unknown"  and "State --> started" but actually none services were 
> available), i checked the log and agreed that it should be the 
> fence_manual problem. It is because the log message showed that the 
> fence_manul was waiting node2 to rejoin the cluster, as soon as i 
> executed the command: fence_ack_manual -n node2, the failed services 
> failover to node1, all failed service back to normal.

Sweet.

> I would like to know if there is any solution or workaround for this 
> situation other than buying a fence device :) ????? Can i remove the 
> fence.rpm ??? Will it cause any extra problems????? 

AAhhhhh...

Well, *just* file system and data corruption in the case that a node
*didn't* fully die. ;)

> It is because in 
> production environment, we never know when will the machine down and 
> cannot execute the fence_ack_manual command immediately.

Ok, here's the scoop.  Fencing is there to protect your data.  If you
don't care about your data, or you are not sharing data between the
nodes, then you do not need fencing.

It works by preventing a node which is *believed* to be dead from
writing to your shared data.  The key word is *believed*.  Sometimes,
due to load spikes, live hangs, network partitions, or other events
outside of administrative control, a node is believed to be dead but is
not, in actuality, dead.  If it "wakes up" and starts writing happily
along where it left off, your file system(s) and data probably will not
last long.


> Regarding the monitor_link issue, i have tried to set the "monitor_link 
> =1 " for both resource ip i.e. 192.168.0.111 and 192.168.0.112 , then i 
> shutdown eth0 of node2 and re-enable it,  when i tried to restart the 
> rgmanager in node2 i.e. the failed node, it still showing the msg 
> "Shutting down Cluster Service Manager... Waiting for services to stop: 
> ", i have to kill the rgmanager's processes or even worse i have to 
> reset the machine. Any ideas??

If you want to test link monitoring, yank the cable out.  That's what
it's designed to detect. :)

I suspect this is a case of rgmanager trying to take locks.
Unfortunately, I think CMAN and the DLM would still be using the IP you
just pulled off the system.  Rgmanager is probably blocking trying to
take a lock, and hangs.

Are there any log messages?


> One more thing is even the monitor_link=0 in the cluster.conf, the 
> system-config-cluster --> Resource --> IP address's Monitor Link box is 
> being ticked!!! Why??

That sounds like a bug in system-config-cluster.  Bugzilla.

-- Lon


From lhh at redhat.com  Thu Oct 26 14:43:17 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:43:17 -0400
Subject: [Linux-cluster] migratory group
In-Reply-To: <5cac192f0610130449p1539344ak7ebc2d997ba65223@mail.gmail.com>
References: <5cac192f0610130449p1539344ak7ebc2d997ba65223@mail.gmail.com>
Message-ID: <1161873797.4518.80.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-13 at 13:49 +0200, Eric Lemoine wrote:
> Hi,
> 
> I'm trying to migrate a Xen domU from one cluster node to another
> using clusvcadm. I get the following error message when doing
> <clusvcadm -M>:
> 
> root at l6-z-5:~# clusvcadm -M domu-0 -m l6-z-12
> Trying to relocate service:domu-0 to me...Invalid operation for resource
> 
> I looked into the rgmanager code and it seems to me that my group
> isn't "migratory", so I get an RG_EINVAL error code.

> How do I make my group migratory in the cluster.conf file? I found
> neither doc nor examples on the web.

Just use relocate ;) migratory isn't tested yet.

Sorry for the late response!  It's designed for use in the future by the
open source hypervisor spelled with three letters.

-- Lon


From lhh at redhat.com  Thu Oct 26 14:44:35 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:44:35 -0400
Subject: [Linux-cluster] RHEL4 cluster source RPMs incomplete
In-Reply-To: <200610121452.k9CEqYg13019@xos037.xos.nl>
References: <200610121452.k9CEqYg13019@xos037.xos.nl>
Message-ID: <1161873875.4518.82.camel@rei.boston.devel.redhat.com>

On Thu, 2006-10-12 at 16:52 +0200, Jos Vos wrote:
> Hi,
> 
> Can someone of the cluster team maybe check the completeness of the
> RHCS/RHGFS source RPMs in the RHEL4 updates tree on public ftp sites?
> 
> It looks like some RHEL4 packages were not propagated to the public
> sites.  The last version of system-config-cluster is older than the
> RHEL4 version, and I'm also missing the cman-kernel and gnbd-kernel
> packages for the -42.0.3.EL kernel, while the new dlm-kernel and
> GFS-kernel packages do exist.

Ugh, are they still "not there" ?  Sometimes, the packages are a bit
behind.

-- Lon


From dist-list at LEXUM.UMontreal.CA  Thu Oct 26 14:44:49 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Thu, 26 Oct 2006 10:44:49 -0400
Subject: [Linux-cluster] GFS Data Journaling - answers
In-Reply-To: <454089A5.5060009@obsidian.co.za>
References: <454089A5.5060009@obsidian.co.za>
Message-ID: <4540C9E1.7000003@lexum.umontreal.ca>

Thanks !
No I have one less performance test to try :)

Riaan van Niekerk wrote:
> hi all
>
> I asked Red Hat Global Support Services some questions about GFS Data
> Journaling. From what Red Hat tell me, I can sumarize:
>
> a) only benefits writes, not reads
> b) cannot be applied retroactively on non-empty files (I can think of
> a very cludgy way, touch newfile ; [set property using gfstool] ; cat
> oldfile > newfile ; mv newfile oldfile ). but this is not something I
> would like to do on 3 TB of small files
> c) we would have to test extensively to tell if it is worth the effort
> d) it might even make performance worse since it will place higher I/O
> load on the SAN / SP.
> e) it would be extremely difficult to do a representative simulation.
> It might perform better on a non-starved RAID controller or SAN SP,
> but the performance might drop when applying the change on our
> production, starved SP.
>
> (SP = Storage Processor, EMC speak for the thing that handles the I/O
> in the SAN storage array)
>
> E.g. in our environment at least, we can cross this off our list as an
> avenue of exploration for increasing performance.
>
> greetings
> Riaan
>
> ...
>
> Eduardo:
>
> I have talked to one of the GFS developers regarding this issue. I'll
> answer your questions one by one:
>
>
> a) is it safe to do?
>
> It is safe, but it is not recommended do be done on a production
> filesystem. You should create a new filesystem and experiment with it.
>
> b) does the change favour mostly writes?
>
> Yes, but Beware! This change will only affect the fsync() calls. The
> developer says that you will have quicker fsync() calls, BUT on the
> other hand, you will have more I/O. Now, if the SAN is saturated on
> I/O, or on CPU needed for I/O, then it might be that it will be
> actually slower for you.
>
> c) any ballpark performance improvement we can expect
>
> Engineering does not give you any measurable numbers. They say that
> the fsyncs are quicker because of the higher i/o, and that in some
> cases the performance can be better or worse, you would need to create
> a test case and see how it pans out for you.
>
> d) I see "gfs_tool setflag jdata" will only have an effect on
> zero-length files. is there any way to run "gfs_tool setflag jdata"
> against existing files with some kind of copy/cat/mv operation to have
> the existing mail data benefit from this changed property?
>
> the developer says that it is not possible.
>
> e) is there any way to see if the flag has been set on a directory /
> file? (I only see setflag/clearflag operations, not any kind "getflag"
>
> The developer says that you can run gfs_tool stat on a file, and see
> if the jdata flag is active for the file:
>
> # gfs_tool stat /mnt/gfs/x | grep jdata
>   jdata
>   inherit_jdata
>
> Therefore you can see if the jdata, and inherit_jdata is active per
> inode.
>
>
> As a last comment on this issue. I have asked the engineers if this
> option gives any performance increase in READs. The answer is NO. So I
> believe that if the system is getting maxed out on reads or 75% reads,
> there is very little performance boost this feature can provide, and
> might even cause drops in performance. The recommended method is to
> create a test environment and see how it pans out.
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Thu Oct 26 14:47:09 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:47:09 -0400
Subject: [Linux-cluster] NFS Options
In-Reply-To: <20060728134956.GA3323@rover.pcbi.upenn.edu>
References: <44CA0209.3040403@opticalart.de>
	<20060728134956.GA3323@rover.pcbi.upenn.edu>
Message-ID: <1161874029.4518.85.camel@rei.boston.devel.redhat.com>

On Fri, 2006-07-28 at 09:49 -0400, Bryan Cardillo wrote:
> On Fri, Jul 28, 2006 at 02:24:41PM +0200, Frank Hellmann wrote:
> > I couldn't find a reference how pass extra options into the NFS Client
> > portion of the cluster.conf, like
> > 
> > <nfsclient name="MyClient"
> > options="rw,all_squash,anonuid=500,anongid=100" target="*"/>
> 
>         [...]
> 
> > Jul 28 13:42:33 clusty2 clurgmgrd: [3906]: <err> Export Option
> > anonuid=500 invalid
> > Jul 28 13:42:33 clusty2 clurgmgrd: [3906]: <err> Export Option
> > anongid=100 invalid
> > 
> > 
> > Using either anonuid="500" or anonuid\=500 didn't work either.
> > 
> > What is the offical way of including these. Any ideas?

Very late response here, but it's because matching in anonuid/anongid is
wrong in /usr/share/cluster/nfsclient.sh:

                anonuid)
                        ;;
                anongid)
                        ;;

Should be anonuid=*) / anongid=*):

                anonuid=*)
                        ;;
                anongid=*)
                        ;;

If you have time, could you file a bugzilla?

-- Lon


From lhh at redhat.com  Thu Oct 26 14:48:07 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 26 Oct 2006 10:48:07 -0400
Subject: [Linux-cluster] NFS Options
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91E82@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91E82@osdn06.osd.mil>
Message-ID: <1161874087.4518.87.camel@rei.boston.devel.redhat.com>

On Fri, 2006-07-28 at 11:47 -0400, Patton, Matthew F, CTR, OSD-PA&E
wrote:
> Classification: UNCLASSIFIED
> 
> This seems to be a repeating issue with RedHat (?if not other
> distributions?) - startup scripts that try to be smart but in fact are
> are stupid and cause problems for no good reason. If the admin can't
> figure out how to write an options string then the daemon will die and
> spit out errors on its own. Scripts have *no* business making lame
> attempts at checking values.
> 
> </rant>

comment out line #269 of nfsclient.sh ;)

-- Lon


From jos at xos.nl  Thu Oct 26 14:51:24 2006
From: jos at xos.nl (Jos Vos)
Date: Thu, 26 Oct 2006 16:51:24 +0200
Subject: [Linux-cluster] RHEL4 cluster source RPMs incomplete
In-Reply-To: <1161873875.4518.82.camel@rei.boston.devel.redhat.com>;
	from lhh@redhat.com on Thu, Oct 26, 2006 at 10:44:35AM -0400
References: <200610121452.k9CEqYg13019@xos037.xos.nl>
	<1161873875.4518.82.camel@rei.boston.devel.redhat.com>
Message-ID: <20061026165124.A10943@xos037.xos.nl>

On Thu, Oct 26, 2006 at 10:44:35AM -0400, Lon Hohberger wrote:

> On Thu, 2006-10-12 at 16:52 +0200, Jos Vos wrote:

> > Can someone of the cluster team maybe check the completeness of the
> > RHCS/RHGFS source RPMs in the RHEL4 updates tree on public ftp sites?
> > 
> > It looks like some RHEL4 packages were not propagated to the public
> > sites.  The last version of system-config-cluster is older than the
> > RHEL4 version, and I'm also missing the cman-kernel and gnbd-kernel
> > packages for the -42.0.3.EL kernel, while the new dlm-kernel and
> > GFS-kernel packages do exist.
> 
> Ugh, are they still "not there" ?  Sometimes, the packages are a bit
> behind.

The kernel modules in the meantime arrived, system-config-cluster 1.0.27
(this was an U4 update) never appeared... :-(

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From jason at monsterjam.org  Thu Oct 26 14:53:12 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Thu, 26 Oct 2006 10:53:12 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <1161872071.4518.55.camel@rei.boston.devel.redhat.com>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
Message-ID: <20061026145312.GA23304@monsterjam.org>

ok, thanks for the breakdown.. so basically, I just need to rebuild all 
of my packages from 
ftp://updates.redhat.com:/enterprise/4AS/en/RHGFS/SRPMS
and try again, right?

regards,
Jason


On Thu, Oct 26, 2006 at 10:14:31AM -0400, Lon Hohberger wrote:
> On Wed, 2006-10-25 at 20:56 -0400, jason at monsterjam.org wrote:
> > ok, I was just logging into the 2 nodes of my cluster, tf1 and tf2, I noticed that tf1 was NOT 
> > available via ssh, but tf2 was. tf1 was pingable, but that was it. I looked on tft2 and 
> > noticed that he had taken over the cluster virtual ip address 
> > 
> > 2: eth0: <BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast qlen 1000
> >     link/ether 00:11:43:d7:c9:c6 brd ff:ff:ff:ff:ff:ff
> >     inet 192.168.1.6/24 brd 192.168.1.255 scope global eth0
> >     inet 192.168.1.7/32 scope global eth0
> >     inet6 fe80::211:43ff:fed7:c9c6/64 scope link 
> >        valid_lft forever preferred_lft forever
> 
> Well, I can walk through what happened here.
> 
> > Oct 25 20:26:00 tf2 kernel: CMAN: removing node tf1 from the cluster : Missed too many 
> > heartbeats
> 
> Node died for some reason.
> 
> > Oct 25 20:26:00 tf2 fenced[4091]: tf1 not a cluster member after 0 sec post_fail_delay
> > Oct 25 20:26:00 tf2 fenced[4091]: fencing node "tf1"
> > Oct 25 20:26:04 tf2 kernel: e100: eth2: e100_watchdog: link down
> > Oct 25 20:26:08 tf2 fenced[4091]: fence "tf1" success
> 
> ^^ Fence recovery.
> 
> > Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Trying to acquire journal 
> > lock...
> ...
> > Oct 25 20:26:15 tf2 kernel: GFS: fsid=progressive:lv1.1: jid=0: Done
> 
> ^^ GFS recovery
> 
> > Oct 25 20:26:27 tf2 clurgmgrd[4903]: <info> Magma Event: Membership Change 
> > Oct 25 20:26:27 tf2 clurgmgrd[4903]: <info> State change: tf1 DOWN 
> 
> ^^ Rgmanager recovery
> 
> > Oct 25 20:26:27 tf2 clurgmgrd[4903]: <notice> Starting stopped service Apache Service 
> > Oct 25 20:26:29 tf2 httpd: httpd startup succeeded
> > Oct 25 20:26:29 tf2 clurgmgrd[4903]: <notice> Service Apache Service started 
> > Oct 25 20:26:36 tf2 kernel: e100: eth2: e100_watchdog: link up, 100Mbps, full-duplex
> > Oct 25 20:28:08 tf2 kernel: e100: eth2: e100_watchdog: link down
> > Oct 25 20:28:10 tf2 kernel: e100: eth2: e100_watchdog: link up, 100Mbps, full-duplex
> > Oct 25 20:29:40 tf2 kernel: CMAN: node tf1 rejoining
> 
> ^^ CMAN restarted on tf1 (rebooted)
> 
> 
> > Oct 25 20:34:25 tf2 kernel: CMAN: too many transition restarts - will die
> > Oct 25 20:34:25 tf2 kernel: CMAN: we are leaving the cluster. Inconsistent cluster view
> 
> Argh.  That's not good.  I *think* this is a bug in CMAN-kernel in U3,
> which was fixed in U4.
> 
> > Oct 25 20:34:26 tf2 kernel: lock_dlm:  Assertion failed on line 428 of file 
> > /usr/src/redhat/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/lock.c
> > Oct 25 20:34:26 tf2 kernel: lock_dlm:  assertion:  "!error"
> ...
> > Oct 25 20:34:26 tf2 kernel: ------------[ cut here ]------------
> ...
> > Oct 25 20:34:26 tf2 kernel:  <0>Fatal exception: panic in 5 seconds
> > 
> > and now tf2 is  unreachable too.. 
> > ideas? suggestions?
> 
> The panic above is a bug in the dlm-kernel rpm/package; I don't know
> much more than that.  When a machine panics, it stops responding to
> things over the network.
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
================================================
|    Jason Welsh   jason at monsterjam.org        |
| http://monsterjam.org    DSS PGP: 0x5E30CC98 |
|    gpg key: http://monsterjam.org/gpg/       |
================================================


From isplist at logicore.net  Thu Oct 26 14:54:35 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 09:54:35 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <9344F1B4E9BB3E46AA0E76EDD7DCC158140CE8@RSRCNEX02CN1.rsrc.osd.mil>
Message-ID: <2006102695435.356707@leena>

> you probably goofed the VG or LV definitions. post output of vgdisplay and
> lvdisplay

Ah, so many things to learn... all at once no less :).

# vgdisplay|more
  --- Volume group ---
  VG Name               storec
  System ID
  Format                lvm2
  Metadata Areas        1
  Metadata Sequence No  6
  VG Access             read/write
  VG Status             resizable
  Clustered             yes
  Shared                no
  MAX LV                0
  Cur LV                1
  Open LV               0
  Max PV                0
  Cur PV                1
  Act PV                1
  VG Size               745.76 GB
  PE Size               4.00 MB
  Total PE              190914
  Alloc PE / Size       190914 / 745.76 GB
  Free  PE / Size       0 / 0
  VG UUID               iovndV-jXr1-amlr-zVsD-Ili2-h2y5-6RxHwa

  --- Volume group ---
  VG Name               storeb
  System ID
  Format                lvm2
  Metadata Areas        1
  Metadata Sequence No  7
  VG Access             read/write
  VG Status             resizable
  Clustered             yes
  Shared                no
 MAX LV                0
  Cur LV                1
  Open LV               0
  Max PV                0
  Cur PV                1
  Act PV                1
  VG Size               745.76 GB
  PE Size               4.00 MB
  Total PE              190914
  Alloc PE / Size       190914 / 745.76 GB
  Free  PE / Size       0 / 0
  VG UUID               q9aS3c-jOVn-ih9N-tTnl-ufvy-y0NR-t1RFDa

  --- Volume group ---
  VG Name               rimfire
  System ID
  Format                lvm2
  Metadata Areas        1
  Metadata Sequence No  2
  VG Access             read/write
  VG Status             resizable
  Clustered             yes
  Shared                no
  MAX LV                0
  Cur LV                1
  Open LV               0
  Max PV                0
  Cur PV                1
  Act PV                1
  VG Size               572.72 GB
  PE Size               4.00 MB
  Total PE              146616
  Alloc PE / Size       146616 / 572.72 GB
  Free  PE / Size       0 / 0
  VG UUID               meTNO2-CDAL-GZ8w-J7Hv-FuG2-ysfu-Hy3iQc

# lvdisplay|more
  --- Logical volume ---
  LV Name                /dev/storec/web
  VG Name                storec
  LV UUID                uosGKp-PlY4-Z83y-343h-VBo5-ATL3-08qf3C
  LV Write Access        read/write
  LV Status              available
  # open                 0
  LV Size                745.76 GB
  Current LE             190914
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:0

  --- Logical volume ---
  LV Name                /dev/storeb/qm
  VG Name                storeb
  LV UUID                Tbcyi3-6iBd-ND2W-lEFQ-Whp5-asJ1-Q3lMSI
  LV Write Access        read/write
  LV Status              available
  # open                 0
  LV Size                745.76 GB
  Current LE             190914
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:1

  --- Logical volume ---
  LV Name                /dev/rimfire/rimfire
  VG Name                rimfire
  LV UUID                tHbr02-iWJF-F9NY-Ix5z-3csX-pX0G-35wrOy
  LV Write Access        read/write
  LV Status              available
  # open                 0
  LV Size                572.72 GB
  Current LE             146616
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:2


From dist-list at LEXUM.UMontreal.CA  Thu Oct 26 14:57:56 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Thu, 26 Oct 2006 10:57:56 -0400
Subject: [Linux-cluster] GFS performance problem
In-Reply-To: <453F664C.5040508@obsidian.co.za>
References: <453E6B9C.7030009@lexum.umontreal.ca>	<453F2421.10404@obsidian.co.za>	<453F5206.1080303@lexum.umontreal.ca>
	<453F664C.5040508@obsidian.co.za>
Message-ID: <4540CCF4.5020309@lexum.umontreal.ca>

Thanks again :)

I read an interesting Q/A in the cluster project faq :
#  Why is GFS slow doing things like 'ls -lr *' whereas 'ls -r *' is fast?
Mostly due to design constraints. An ls -r * can simply traverse the
directory structures, which is very fast. An ls -lr * has to traverse
the directory, but also has to stat each file to get more details for
the ls. That means it has to acquire and release a cluster lock on each
file, which can be slow. We've tried to address these problems with the
new GFS2 file system.

Because the way rsync works, it must stat all the files and so have to
create a lock for each one.

Our setup is :
*) all GFS are LUN on SAN (RAID 5)
*) 6 web servers write to same logs file.
*) One of these 6 servers writes into the the web data folder (hundreds
of thousand HTML files) , all others are readers. It is in this folder
that we are using rsync

So what about that :
1 GFS with dlm_lock for our web log files (where all servers are writing).
1 GFS with nolock where the web data folder is located where all servers
read but only one write.

Sound good or .... ;-) ?

Thank again

Riaan van Niekerk wrote:
>
>
> FM wrote:
>> Thank you for the answers.
>>
>> Another bottleneck that I could have is the way I connect to the SAN
>> : device mapper multipath instead of the HP officiel module.
>>
>> I do not not if the performance can we increase with the official
>> module  ?
>>
>
> There is no SecurePath for RHEL 4 (assuming that is what you mean by
> "official module". That leaves you with
> a) device-mapper-multipath
> b) the Emulex or Qlogic HBA-based failover.
>
> I have heard mostly good things about (a) even though I dont have a
> lot of production experience with it myself. It seems the trend is to
> move away from vendor/array and HBA-based multipathing towards
> OS-based multipathing, be it Windows or Linux.
>
>>
>>
>>
>> Riaan van Niekerk wrote:
>>>
>>>
>>> FM wrote:
>>>> Hello,
>>>>
>>>> Here is my setup :
>>>> RedHat Enterprise 4 (update 3).
>>>> 5 web servers connected to a 600 GB  GFS (noatime) on a SAN.
>>>> On the GFS : all web sites ROOTS and httpd logs files
>>>> All web servers are writing to the same log. No problem here.
>>>>
>>>> The prob is with rsync. When writing on the GSF it is very very slow !
>>>> (40 % slower when we are lucky :) )
>>>>
>>>> lots of question :
>>>> *) could it be because if I have 1 GFS for the heavy write on the log
>>>> and another GFS for the websites' files ?
>>>> *) expect for the noatime, is there other technic to speed up GFS ?
>>>
>>> a) we have increased the number of GFS locks cached to bring them
>>> closer in line with the number of locks in use:
>>> echo "200000" > /proc/cluster/lock_dlm/drop_count
>>> (not sure how much of a performance increase this got us)
>>>
>>> b) We have disabled quotas
>>> gfs_tool settune /mnt/san quota_account 0
>>> We saw a 3 - 5 increase in performance
>>>
>>> c) if you are using lots of small files, look into section 5.8 in
>>> the GFS manual, Data Journaling. However,
>>> - I dont know how to change this for existing data on a GFS
>>> - I have asked on the mailing list, and no-one seems to be using it.
>>>
>>>> *) Can several webserver access EXT3 FS (read only) when only on other
>>>> server have RW access to it ?
>>>
>>> no. the RO server will get confused when things change from under
>>> it, since it is not expecting things to change
>>>
>>>> *) is there a options to tune rsync when using GFS ?
>>>> *) we are using DLM as the locking system. All servers are connected
>>>> with Gb RJ45. Is DLM using the network to manage the lock. And if
>>>> it is
>>>> the case, could my problem come from the network latency ?
>>>
>>> DLM is using the network, yes. not sure about latency. We use GB
>>> ethernet with RJ45/CAT5+ and have not had any problems related to
>>> DLM and the network (that we are aware of). As it was explained to
>>> me by Red Hat Support, DLM is extremely efficient, being able to
>>> master/distribute thousands of locks per second between nodes.
>>>
>>>>
>>>> As I said, lots of questions here :)
>>>>
>>>
>>> and some answers. Wish I had answers to all your questions.
>>>
>>> greetings
>>> Riaan
>>>
>>> -- 
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From rpeterso at redhat.com  Thu Oct 26 15:29:03 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Thu, 26 Oct 2006 10:29:03 -0500
Subject: [Linux-cluster] CS4 U2 / crash when "less /proc/cluster/status"
In-Reply-To: <45406D5E.9070603@redhat.com>
References: <453F88D5.9010101@bull.net>
	<453F94B0.1090006@redhat.com>	<1161804991.4518.35.camel@rei.boston.devel.redhat.com>
	<45406D5E.9070603@redhat.com>
Message-ID: <4540D43F.1000609@redhat.com>

Patrick Caulfield wrote:
> less reads the start of the file in a small chunk (64 bytes I think) and the code in proc_cluster_status doesn't stop when it
> reaches the 'length' parameter (slapped wrist).
>
> That routine should really be changed to use seq_file like the rest of them.
>
> It probably used to be OK, but more and more stuff has been added to the /proc file over time and it's a lot bigger than it used to be !
Patrick,

Yeah, that's how I'd fix it.  The other proc files that use seq_file 
don't panic, so it probably will work.

Bob Peterson
Red Hat Cluster Suite


From rpeterso at redhat.com  Thu Oct 26 18:41:53 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Thu, 26 Oct 2006 13:41:53 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <20061025164355.971031@leena>
References: <20061025164355.971031@leena>
Message-ID: <45410171.8090901@redhat.com>

isplist at logicore.net wrote:
> # mount -t gfs /dev/rimfire/rimfire /gfs/rimfire/
> mount: File exists
>
> Why is it that I can only mount one at a time? I need at least two per node, 
> non contiguous, separate.
>
> Mike
>   
Hi Mike,

I'm guessing that maybe you gave them the same locking table on 
gfs_mkfs, and they're supposed
to be different.  When you did mkfs, did you use the same -t 
cluster:fsname for more than one?
You can find this out by doing:

gfs_tool sb /dev/rimfire/rimfire table
gfs_tool sb /dev/storec/web table
gfs_tool sb /dev/storeb/qm table

And see if the same value appears. You can change it after the mkfs has 
already been
done with this command:

gfs_tool sb /dev/<device> table cluster:new_name

Regards,

Bob Peterson
Red Hat Cluster Suite


From isplist at logicore.net  Thu Oct 26 18:53:05 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 13:53:05 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45410171.8090901@redhat.com>
Message-ID: <2006102613535.004162@leena>

> When you did mkfs, did you use the same -t
> cluster:fsname for more than one?

Ah, you're correct of course.

# gfs_tool sb /dev/rimfire/rimfire table
current lock table name = "vgcomp:gfs1"
# gfs_tool sb /dev/storec/web table
current lock table name = "vgcomp:gfs1"
# gfs_tool sb /dev/storeb/qm table
current lock table name = "vgcomp:gfs1"

> And see if the same value appears. You can change it after the mkfs has
> already been done with this command:
> 
> gfs_tool sb /dev/<device> table cluster:new_name

I did that one time and totally messed everything up. I might have more 
experience now so might not mess it up. 

Although, I'm now looking at the diskless booting ideas which might make a lot 
more sense for my needs. From what I can tell, I can have multiple roots even 
with a single combined storage system? I'm not sure, still reading, need to 
try some basic two node stuff maybe.

Thanks very much, another one for my notes.

Mike


> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite


From isplist at logicore.net  Thu Oct 26 19:03:20 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 14:03:20 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45410171.8090901@redhat.com>
Message-ID: <2006102614320.878188@leena>

Darn, now I get;

# mount -t gfs /dev/rimfire/rimfire /gfs/rimfire/
mount: permission denied

I think that's where a full range of headaches started last time? :)


On Thu, 26 Oct 2006 13:41:53 -0500, Robert Peterson wrote:
> isplist at logicore.net wrote:
> 
>> # mount -t gfs /dev/rimfire/rimfire /gfs/rimfire/
>> mount: File exists
>> 
>> Why is it that I can only mount one at a time? I need at least two per
>> node,
>> non contiguous, separate.
>> 
>> Mike
>> 
> Hi Mike,
> 
> I'm guessing that maybe you gave them the same locking table on
> gfs_mkfs, and they're supposed
> to be different.  When you did mkfs, did you use the same -t
> cluster:fsname for more than one?
> You can find this out by doing:
> 
> gfs_tool sb /dev/rimfire/rimfire table
> gfs_tool sb /dev/storec/web table
> gfs_tool sb /dev/storeb/qm table
> 
> And see if the same value appears. You can change it after the mkfs has
> already been
> done with this command:
> 
> gfs_tool sb /dev/<device> table cluster:new_name
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite


From rpeterso at redhat.com  Thu Oct 26 19:03:50 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Thu, 26 Oct 2006 14:03:50 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <2006102613535.004162@leena>
References: <2006102613535.004162@leena>
Message-ID: <45410696.4010202@redhat.com>

isplist at logicore.net wrote:
> Although, I'm now looking at the diskless booting ideas which might make a lot 
> more sense for my needs. From what I can tell, I can have multiple roots even 
> with a single combined storage system? I'm not sure, still reading, need to 
> try some basic two node stuff maybe.
>
> Thanks very much, another one for my notes.
>
> Mike
>   
Hi Mike,

Well, I have some experience with this as well, although that was in my 
pre-GFS days.
I was using nfs to serve multiple root directories.  So here's what I'd do:

1. Set up your dhcp server to dole out unique IP addresses to your 
clients based on
MAC address.
2. Mount your GFS partition as /tftpboot
3. In your /tftpboot, create root directories for your clients, and put 
what you need there.
In my case, I had tiny root partitions and my diskless clients were tiny 
(literally) embedded
devices that had to run something like Denx's ELDK, The Embedded Linux 
Development Kit,
(google it) which included tools to get their kernel from tftpboot, and 
boot from the tftp server.
My client root file systems also had separate mount points for commonly 
shared areas that
didn't tend to change a lot: /lib, /usr/lib, /bin, /sbin, and several 
more, etc., so that if I changed
one, I changed them all.

You still need a minimal set of these libs and such on the client root 
fs in order to boot far enough to
where the fstab can actually mount these things though.

Also, the clients need to have enough smarts to each mount their OWN 
root partition
so they don't bump into one another.
4. Of course you need nfs serving out the data, and xinetd accepting 
tftp connections.

For extra credit, you could use a NFS failover service through rgmanager 
for your NFS server.  :)
I wrote a little bit about this in my NFS/GFS Cookbook, but not in much 
detail.

I hope this helps.

Regards,

Bob Peterson
Red Hat Cluster Suite


From isplist at logicore.net  Thu Oct 26 19:17:40 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 14:17:40 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45410696.4010202@redhat.com>
Message-ID: <20061026141740.387533@leena>

> 1. Set up your dhcp server to dole out unique IP addresses to your
> clients based on MAC address.

Ok, so the actual IP's I would have otherwise installed on each server.

> 2. Mount your GFS partition as /tftpboot

My setup includes multiple storage devices. I can either have multiple devices 
or I can create one large filesystem for all of the nodes in this cluster. 
Which do you think is best since I can keep adding or taking from a volume 
group.

> 3. In your /tftpboot, create root directories for your clients, and put
> what you need there.

As in server images and such? This I have to read about, the whole idea is new 
to me as of this morning :).

>You still need a minimal set of these libs and such on the client root fs in 
>order to boot far enough to where the fstab can actually mount these things 
>though.

My machines share CDROM/Floppy so can't boot from those automatically. I do 
have network boot on each server however and of course, I'd like not to have 
to use any drives on any of the blades. Then I could put more into the central 
storage.

> 4. Of course you need nfs serving out the data, and xinetd accepting
> tftp connections.

I'm using fibre channel so all of the nodes see the storage as their own.

Thanks for all this info, I'll continue looking for more and build one of 
these. I am sure it's exactly what I need over having to build multiple 
machines all the time.

Mike


From isplist at logicore.net  Thu Oct 26 19:54:40 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 14:54:40 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <9344F1B4E9BB3E46AA0E76EDD7DCC158140CF2@RSRCNEX02CN1.rsrc.osd.mil>
Message-ID: <20061026145440.766428@leena>

Yes, agreed.

In my case, I already have fibre channel storage set up now. Each node can see 
the storage as it's own so I can avoid things such as NFS for example. 

I like the idea of network boots for two reasons. 

One is that I have a drive in each blade which costs power and yet, I still 
have to add drives into the central storage. Seems that I could save money and 
power by not having any drives in each blade/server. I'd prefer that.

Second is that right now, when I need a new node, I have to install a new 
server, configure the server rather heavily etc, and in the end, that server 
and my time is not being used very effectively.. it seems.

Most of the things I want to run seem to allow for central storage. I am 
running email, web and sql services for the most part, other things might be 
on their own or their own clusters. Web servers can share data, mail servers 
and from what I've seen, sql servers can too. 

So, when I need more resources, it would be a simple matter of configuring the 
net boot options and firing up another blade and I'm done.

Of course there are other aspects but I'm talking about the basics. Am I 
missing something?

Mike


On Thu, 26 Oct 2006 15:40:11 -0400, Patton, Matthew F, CTR, OSD-PA&E wrote:
> Classification: UNCLASSIFIED
> 
> before getting too mired in minutia, read/reread all the howto etc on
> netbooting. things like /usr lend themselves nicely to read-only NFS (or
> GFS) mounts. / (root) should be writable by just the host it applies to. So
> GFS is again, not particularly useful IMO. mailspools and common
> html/jsp/applet sort of directories are candidates for GFS. Before you go
> with clustering, might want to think hard whether it makes sense to go to
> all that bother. a load balancer (eg. piranah or commercial) and machines
> mounting NFS may accomplish what you are really after.


From rpeterso at redhat.com  Thu Oct 26 20:43:08 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Thu, 26 Oct 2006 15:43:08 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <20061026141740.387533@leena>
References: <20061026141740.387533@leena>
Message-ID: <45411DDC.1060803@redhat.com>

isplist at logicore.net wrote:
> My setup includes multiple storage devices. I can either have multiple devices 
> or I can create one large filesystem for all of the nodes in this cluster. 
> Which do you think is best since I can keep adding or taking from a volume 
> group.
> My machines share CDROM/Floppy so can't boot from those automatically. I do 
> have network boot on each server however and of course, I'd like not to have 
> to use any drives on any of the blades. Then I could put more into the central 
> storage.
>   
> I'm using fibre channel so all of the nodes see the storage as their own.
>
> Thanks for all this info, I'll continue looking for more and build one of 
> these. I am sure it's exactly what I need over having to build multiple 
> machines all the time.
>
> Mike
>   
Hi Mike,

Given that all boxes can see the shared storage, I guess I'd recommend 
making
a separate file system for each diskless client's root partition.  That 
way, each can't
interfere with one another's root fs.

I may have to give some more thought to how I'd set this up.  The 
problem is,
your clients aren't going to have access to the GFS file systems until they
start running the init scripts, such as ccsd, cman, fenced, clvmd, etc.
Therefore the kernel won't be able to see it as a boot device when the 
kernel boots.

Since each client needs its own root partition, not to be touched by the 
others anyway,
perhaps you can create tiny ext3 partitions on the SAN for each one to 
boot from.
I'll have to think about that.

You could also look at: 
http://sources.redhat.com/cluster/faq.html#gfs_diskless

Regards,

Bob Peterson
Red Hat Cluster Suite


From isplist at logicore.net  Thu Oct 26 21:58:55 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 16:58:55 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45410171.8090901@redhat.com>
Message-ID: <20061026165855.943665@leena>

> And see if the same value appears. You can change it after the mkfs has 
> already been done with this command:
> 
> gfs_tool sb /dev/<device> table cluster:new_name

Interesting.. everything's gone now? I changed and now all volume groups have 
disappeared. I'm guessing I've lost all my data?

I changed things as follows;

From;

# gfs_tool sb /dev/rimfire/rimfire table
current lock table name = "vgcomp:gfs1"

# gfs_tool sb /dev/storec/web table
current lock table name = "vgcomp:gfs1"

# gfs_tool sb /dev/storeb/qm table
current lock table name = "vgcomp:gfs1"

To;

gfs_tool sb /dev/storeb/qm table qm:gfs1
gfs_tool sb /dev/storec/web table web:gfs1
gfs_tool sb /dev/rimfire/rimfire table rim:gfs1


From rpeterso at redhat.com  Thu Oct 26 23:00:46 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Thu, 26 Oct 2006 18:00:46 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <20061026165855.943665@leena>
References: <20061026165855.943665@leena>
Message-ID: <45413E1E.9020004@redhat.com>

isplist at logicore.net wrote:
> Interesting.. everything's gone now? I changed and now all volume groups have 
> disappeared. I'm guessing I've lost all my data?
>
> I changed things as follows;
>
> From;
>
> # gfs_tool sb /dev/rimfire/rimfire table
> current lock table name = "vgcomp:gfs1"
>
> # gfs_tool sb /dev/storec/web table
> current lock table name = "vgcomp:gfs1"
>
> # gfs_tool sb /dev/storeb/qm table
> current lock table name = "vgcomp:gfs1"
>
> To;
>
> gfs_tool sb /dev/storeb/qm table qm:gfs1
> gfs_tool sb /dev/storec/web table web:gfs1
> gfs_tool sb /dev/rimfire/rimfire table rim:gfs1

Hi,

Well  gfs_tool should be used on the gfs blocks INSIDE of the logical 
volumes,
which are inside the volume groups, which are all part of lvm.
So perhaps the problem is that you specified the physical devices rather 
than
the logical volumes.  I'm very sorry if I misled you here.  You want to 
do something like this:

gfs_tool sb /dev/VolGroup03/lvol0 table qm:gfs1

Where /dev/VolGroup03/lvol0 is a logical volume defined to be in a 
volume group
VolGroup03, which contains the physical volumes that correspond to your
hardware.

Regards,

Bob Peterson
Red Hat Cluster Suite


From isplist at logicore.net  Thu Oct 26 23:26:53 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 18:26:53 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45413E1E.9020004@redhat.com>
Message-ID: <20061026182653.264960@leena>

I didn't actually lose any data but I have to put it all back in so, no big 
deal, part of the learning curve.


On Thu, 26 Oct 2006 18:00:46 -0500, Robert Peterson wrote:
> isplist at logicore.net wrote:
> 
>> Interesting.. everything's gone now? I changed and now all volume groups
>> have
>> disappeared. I'm guessing I've lost all my data?
>> 
>> I changed things as follows;
>> 
>> From;
>> 
>> # gfs_tool sb /dev/rimfire/rimfire table
>> current lock table name = "vgcomp:gfs1"
>> 
>> # gfs_tool sb /dev/storec/web table
>> current lock table name = "vgcomp:gfs1"
>> 
>> # gfs_tool sb /dev/storeb/qm table
>> current lock table name = "vgcomp:gfs1"
>> 
>> To;
>> 
>> gfs_tool sb /dev/storeb/qm table qm:gfs1
>> gfs_tool sb /dev/storec/web table web:gfs1
>> gfs_tool sb /dev/rimfire/rimfire table rim:gfs1
>> 
> Hi,
> 
> Well  gfs_tool should be used on the gfs blocks INSIDE of the logical
> volumes,
> which are inside the volume groups, which are all part of lvm.
> So perhaps the problem is that you specified the physical devices rather
> than
> the logical volumes.  I'm very sorry if I misled you here.  You want to
> do something like this:
> 
> gfs_tool sb /dev/VolGroup03/lvol0 table qm:gfs1
> 
> Where /dev/VolGroup03/lvol0 is a logical volume defined to be in a
> volume group
> VolGroup03, which contains the physical volumes that correspond to your
> hardware.
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite


From isplist at logicore.net  Fri Oct 27 00:17:09 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Thu, 26 Oct 2006 19:17:09 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45413E1E.9020004@redhat.com>
Message-ID: <2006102619179.936306@leena>

FYI,

> So perhaps the problem is that you specified the physical devices rather
> than the logical volumes.  I'm very sorry if I misled you here.  You want to
> do something like this:

Didn't even clue in, too many things going on at once I guess.

One reason this was not understood properly is that I've never used these 
tools. I use the command line programs such as pvcreate to create the storage, 
vgcreate tp create the logical volumes and lvcreate to create the logical 
volumes. Can't wait for that web based tool that's coming out with RHEL5.

Mike


> gfs_tool sb /dev/VolGroup03/lvol0 table qm:gfs1
> 
> Where /dev/VolGroup03/lvol0 is a logical volume defined to be in a
> volume group
> VolGroup03, which contains the physical volumes that correspond to your
> hardware.
> 
> Regards,
> 
> Bob Peterson
> Red Hat Cluster Suite


From jason at monsterjam.org  Fri Oct 27 01:03:15 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Thu, 26 Oct 2006 21:03:15 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <1161872071.4518.55.camel@rei.boston.devel.redhat.com>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
Message-ID: <20061027010315.GA470@monsterjam.org>

ok, heres the rest of the story.. i.e. heres the messages from tf1 (the original master)
Oct 25 20:31:14 tf1 rpcidmapd: rpc.idmapd startup succeeded
Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PERC 4/DC         Rev: 351X
Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 02
Oct 25 20:31:14 tf1 kernel: scsi[1]: scanning scsi channel 1 [Phy 1] for non-raid devices
Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PERC 4/DC         Rev: 351X
Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 02
Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PV22XS            Rev: E.17
Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 03
Oct 25 20:31:14 tf1 kernel: scsi[1]: scanning scsi channel 2 [virtual] for logical drives
Oct 25 20:31:14 tf1 kernel:   Vendor: MegaRAID  Model: LD 0 RAID5  139G  Rev: 351X
Oct 25 20:31:14 tf1 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Oct 25 20:31:14 tf1 kernel: scsi1 (2,0,0) : reservation conflict
Oct 25 20:31:14 tf1 last message repeated 2 times
Oct 25 20:31:14 tf1 kernel: sdb: Unit Not Ready, error = 0x70018
Oct 25 20:31:14 tf1 kernel: SCSI device sdb: 286228480 512-byte hdwr sectors (146549 MB)
Oct 25 20:31:14 tf1 kernel: sdb: asking for cache data failed
Oct 25 20:31:14 tf1 kernel: sdb: assuming drive cache: write through
Oct 25 20:31:14 tf1 kernel: scsi1 (2,0,0) : reservation conflict
Oct 25 20:31:14 tf1 last message repeated 2 times
Oct 25 20:31:14 tf1 kernel: sdb: Unit Not Ready, error = 0x70018
Oct 25 20:31:14 tf1 kernel: SCSI device sdb: 286228480 512-byte hdwr sectors (146549 MB)
Oct 25 20:31:14 tf1 kernel: sdb: asking for cache data failed
Oct 25 20:31:14 tf1 kernel: sdb: assuming drive cache: write through
Oct 25 20:31:14 tf1 kernel:  sdb: sdb1
Oct 25 20:31:14 tf1 kernel: Attached scsi disk sdb at scsi1, channel 2, id 0, lun 0
Oct 25 20:31:14 tf1 kernel: Adaptec aacraid driver (1.1-5[2412])
Oct 25 20:31:14 tf1 kernel: device-mapper: 4.5.0-ioctl (2005-10-04) initialised: dm-devel at redhat.com
Oct 25 20:31:14 tf1 kernel: EXT3-fs: INFO: recovery required on readonly filesystem.
Oct 25 20:31:14 tf1 kernel: EXT3-fs: write access will be enabled during recovery.

so sdb is the gfs volume and is already locked by the other server at this point is my guess.

Oct 25 20:31:14 tf1 kernel: shpchp: acpi_pciehprm:\_SB_.PCI0.PBLO OSHP fails=0x5
Oct 25 20:31:14 tf1 kernel: shpchp: acpi_shpchprm:   Slot sun(0) at s:b:d:f=0x00:04:1f:00
Oct 25 20:31:14 tf1 kernel: shpchp: acpi_pciehprm:\_SB_.PCI0.PBLO OSHP fails=0x5
Oct 25 20:31:14 tf1 last message repeated 4 times
Oct 25 20:31:14 tf1 ccsd[4128]: Starting ccsd 1.0.3: 
Oct 25 20:31:14 tf1 kernel: shpchp: acpi_pciehprm:\_SB_.PCI0.PBLO OSHP fails=0x5
Oct 25 20:31:14 tf1 ccsd[4128]:  Built: May 22 2006 16:15:59 
Oct 25 20:31:14 tf1 kernel: shpchp: acpi_pciehprm:\_SB_.PCI0.PBLO OSHP fails=0x5
Oct 25 20:31:14 tf1 ccsd[4128]:  Copyright (C) Red Hat, Inc.  2004  All rights reserved. 
Oct 25 20:31:14 tf1 kernel: shpchp: acpi_pciehprm:\_SB_.PCI0.VPR0 OSHP fails=0x5
Oct 25 20:31:14 tf1 last message repeated 7 times
Oct 25 20:31:14 tf1 kernel: shpchp: shpc_init : shpc_cap_offset == 0
Oct 25 20:31:14 tf1 last message repeated 8 times
Oct 25 20:31:14 tf1 kernel: shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
Oct 25 20:31:14 tf1 kernel: hw_random hardware driver 1.0.0 loaded

...
Oct 25 20:29:48 tf1 rc.sysinit: Setting clock : Wed Oct 25 20:29:48 EDT 2006 succeeded 
Oct 25 20:29:48 tf1 rc.sysinit: Loading default keymap succeeded 
Oct 25 20:29:48 tf1 rc.sysinit: Setting hostname tf1.localdomain:  succeeded 
Oct 25 20:29:53 tf1 fsck: /: clean, 36754/131616 files, 139153/263056 blocks 
Oct 25 20:29:53 tf1 rc.sysinit: Checking root filesystem succeeded 
Oct 25 20:29:53 tf1 rc.sysinit: Remounting root filesystem in read-write mode:  succeeded 
Oct 25 20:29:53 tf1 lvm.static:   Locking inactive: ignoring clustered volume group diskarray 
Oct 25 20:29:53 tf1 rc.sysinit: Setting up Logical Volume Management: failed 
Oct 25 20:29:53 tf1 fsck: fsck.gfs: invalid option -- a 
Oct 25 20:29:53 tf1 fsck: Please use '-h' for usage. 


Oct 25 20:31:17 tf1 ccsd[4128]: Remote copy of cluster.conf is from quorate node. 
Oct 25 20:31:17 tf1 ccsd[4128]:  Local version # : 22 
Oct 25 20:31:17 tf1 ccsd[4128]:  Remote version #: 22 
Oct 25 20:31:17 tf1 kernel: CMAN: Waiting to join or form a Linux-cluster
Oct 25 20:31:18 tf1 ccsd[4128]: Connected to cluster infrastruture via: CMAN/SM Plugin v1.1.5 
Oct 25 20:31:18 tf1 ccsd[4128]: Initial status:: Inquorate 
Oct 25 20:31:19 tf1 kernel: CMAN: sending membership request
Oct 25 20:31:19 tf1 kernel: CMAN: got node tf2
Oct 25 20:33:17 tf1 cman: Timed-out waiting for cluster failed
Oct 25 20:33:17 tf1 lock_gulmd: no <gulm> section detected in /etc/cluster/cluster.conf succeeded
Oct 25 20:35:17 tf1 fenced: startup failed
Oct 25 20:36:13 tf1 kernel: CMAN: removing node tf2 from the cluster : No response to messages
Oct 25 20:36:13 tf1 kernel: ------------[ cut here ]------------
Oct 25 20:36:13 tf1 kernel: kernel BUG at /usr/src/redhat/BUILD/cman-kernel-2.6.9-43/smp/src/membership.c:3150!
Oct 25 20:36:13 tf1 kernel: invalid operand: 0000 [#1]
Oct 25 20:36:13 tf1 kernel: SMP 
Oct 25 20:36:13 tf1 kernel: Modules linked in: cman(U) md5 ipv6 sunrpc button battery ac uhci_hcd ehci_hcd hw_random shpchp 
eepro100 e100 mii e100
0 floppy sg ext3 jbd dm_mod aic7xxx megaraid_mbox megaraid_mm sd_mod scsi_mod
Oct 25 20:36:13 tf1 kernel: CPU:    3
Oct 25 20:36:13 tf1 kernel: EIP:    0060:[<f896ae2a>]    Not tainted VLI
Oct 25 20:36:13 tf1 kernel: EFLAGS: 00010246   (2.6.9-34.ELsmp) 
Oct 25 20:36:13 tf1 kernel: EIP is at elect_master+0x2e/0x3a [cman]
Oct 25 20:36:13 tf1 kernel: eax: 00000000   ebx: f619dfa0   ecx: 00000080   edx: 00000080
Oct 25 20:36:13 tf1 kernel: esi: f897dfc4   edi: f619dfd8   ebp: 00000000   esp: f619df98
Oct 25 20:36:13 tf1 kernel: ds: 007b   es: 007b   ss: 0068
Oct 25 20:36:13 tf1 kernel: Process cman_memb (pid: 4192, threadinfo=f619d000 task=f757f1b0)
Oct 25 20:36:13 tf1 kernel: Stack: f897de88 f89688d1 c22b9800 f62bd540 f8966eb7 f757f1b0 f757f1b0 f896709a 
Oct 25 20:36:13 tf1 kernel:        0000001f 00000000 f6622bb0 00000000 f757f1b0 c011e71b 00100100 00200200 
Oct 25 20:36:13 tf1 kernel:        00000000 00000000 0000007b f8966ed8 00000000 00000000 c01041f5 00000000 
Oct 25 20:36:13 tf1 kernel: Call Trace:
Oct 25 20:36:13 tf1 kernel:  [<f89688d1>] a_node_just_died+0x13a/0x199 [cman]
Oct 25 20:36:13 tf1 kernel:  [<f8966eb7>] process_dead_nodes+0x4e/0x6f [cman]
Oct 25 20:36:13 tf1 kernel:  [<f896709a>] membership_kthread+0x1c2/0x39d [cman]
Oct 25 20:36:13 tf1 kernel:  [<c011e71b>] default_wake_function+0x0/0xc
Oct 25 20:36:13 tf1 kernel:  [<f8966ed8>] membership_kthread+0x0/0x39d [cman]
Oct 25 20:36:13 tf1 kernel:  [<c01041f5>] kernel_thread_helper+0x5/0xb
Oct 25 20:36:13 tf1 kernel: Code: a8 ed 97 f8 89 c3 ba 01 00 00 00 39 ca 7d 1c a1 ac ed 97 f8 8b 04 90 85 c0 74 0d 83 78 1c 02 75 
07 89 03 8b 40 1
4 eb 0d 42 eb e0 <0f> 0b 4e 0c 68 1d 97 f8 31 c0 5b c3 a1 ac ed 97 f8 e8 79 80 7e 
Oct 25 20:36:13 tf1 kernel:  <0>Fatal exception: panic in 5 seconds
Oct 26 12:19:46 tf1 syslogd 1.4.1: restart.


so my question now is that it appears that I have something misconfigured.. tf1 should come up as secondary while tf2 is running as 
primary, right? or should tf1 come up and take over as primary and tf2 let him?

Jason


From isplist at logicore.net  Fri Oct 27 06:09:43 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 27 Oct 2006 01:09:43 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45410171.8090901@redhat.com>
Message-ID: <200610271943.988010@leena>

> gfs_tool sb /dev/<device> table cluster:new_name

Interesting, so, what does one do when they have more than one GFS drive that 
they want all nodes to have access to? 
My cluster.conf file has vgcomp in it as the cluster name yet I need a unique 
name for each storage if I need to keep them separate. 

# lvscan
  ACTIVE            '/dev/VolGroup03/qm' [745.76 GB] inherit
  ACTIVE            '/dev/VolGroup02/web' [745.76 GB] inherit
  ACTIVE            '/dev/VolGroup01/rimfire' [572.72 GB] inherit

All of my nodes have vgcomp as the name of the cluster yet some need access to 
one or more of the devices above. Can I add or do I need multiple names in the 
cluster.conf file in order to allow the system to mount any of these?

Because, for example, trying to mount the following leads to the denied error, 
it seems, because the cluster is expecting vgcomp.

# mount -t gfs /dev/VolGroup01/rimfire /gfs/rimfire/
mount: permission denied

Mike


From Alain.Moulle at bull.net  Fri Oct 27 06:33:59 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Fri, 27 Oct 2006 08:33:59 +0200
Subject: [Linux-cluster] CS4 U2 & U4 / crash when "less
	/proc/cluster/status"
Message-ID: <4541A857.3070204@bull.net>

Is there already a patch number somewhere for U4 ?
Thanks
Alain Moull?

Patrick Caulfield wrote:

>> less reads the start of the file in a small chunk (64 bytes I think) and the
code in proc_cluster_status doesn't stop when it
>> reaches the 'length' parameter (slapped wrist).
>>
>> That routine should really be changed to use seq_file like the rest of them.
>>
>> It probably used to be OK, but more and more stuff has been added to the
/proc file over time and it's a lot bigger than it used to be !

Patrick,

Yeah, that's how I'd fix it.  The other proc files that use seq_file
don't panic, so it probably will work.

Bob Peterson
Red Hat Cluster Suite


From pcaulfie at redhat.com  Fri Oct 27 07:35:28 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 27 Oct 2006 08:35:28 +0100
Subject: [Linux-cluster] CS4 U2 & U4 / crash when
	"less	/proc/cluster/status"
In-Reply-To: <4541A857.3070204@bull.net>
References: <4541A857.3070204@bull.net>
Message-ID: <4541B6C0.2090204@redhat.com>

Alain Moulle wrote:
> Is there already a patch number somewhere for U4 ?
> Thanks
> Alain Moull?

No, I'm afraid not.

-- 

patrick


From hlawatschek at atix.de  Fri Oct 27 09:37:44 2006
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Fri, 27 Oct 2006 11:37:44 +0200
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45411DDC.1060803@redhat.com>
References: <20061026141740.387533@leena> <45411DDC.1060803@redhat.com>
Message-ID: <200610271137.44843.hlawatschek@atix.de>

Robert,

> Given that all boxes can see the shared storage, I guess I'd recommend
> making
> a separate file system for each diskless client's root partition.  That
> way, each can't
> interfere with one another's root fs.
You can put your complete shared root partition onto gfs.  
Each server has ist own part on the sharedroot where hostdependent files can 
be stored.
You can boot the kernel from a shared storage device.
have a look at the mini-howto:
http://www.open-sharedroot.org/documentation/the-opensharedroot-mini-howto/

> You could also look at:
> http://sources.redhat.com/cluster/faq.html#gfs_diskless
The url mentioned in that faq should be  http://www.open-sharedroot.org/

I also had a speech about that at the redhat summit. Maybe it can be found on 
the summit homepage ?

Mark
-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From dist-list at LEXUM.UMontreal.CA  Fri Oct 27 12:42:54 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Fri, 27 Oct 2006 08:42:54 -0400
Subject: [Linux-cluster] GFS lock_nolock for multiple readers and one
	writers ?
Message-ID: <4541FECE.8070009@lexum.umontreal.ca>

Is it safe to do that kind of config IF I am 100 % sure that only one 
computer will write to the GFS mount ?

Regards,


From johannes.russek at io-consulting.net  Fri Oct 27 13:50:50 2006
From: johannes.russek at io-consulting.net (Johannes russek)
Date: Fri, 27 Oct 2006 15:50:50 +0200
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <8c1094290610260711g20c33bfer74042ed7c6b055ac@mail.gmail.com>
Message-ID: <LGEOIPCNDMCLNENHDMKPGELJCPAA.johannes.russek@io-consulting.net>

i'm sorry to hit just into this, but did i get it right that active/active
mysql does actually work?
regards, johannes

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com]On Behalf Of David Brieck Jr.
> Sent: Thursday, October 26, 2006 4:11 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
>
>
> On 10/25/06, Michael Will <mwill at penguincomputing.com> wrote:
> > Are the actual data files shared in this setup between the active mysql
> > daemons?
> >
> > Last time I looked into this it seemed that with shared-nothing
> model each
> > mysql daemon would have to keep it's own copy of the data and updates
> > would be propagated from active to passive daemons (master-slave model)
> > or between active daemons (ndb in-ram database model)
> >
> > Are the mysql daemons running on the GFS I/O nodes that have access to
> > shared
> > storage via SAN or iSCSI and coordinate locking through GFS
> > infrastructure, or are
> > the mysql daemons running on client nodes that use GFS to remotely
> > access storage
> > that is provided by other  GFS I/O nodes that in turn have
> access to shared
> > storage via SAN or iSCSI?
> >
> > Michael
> >
>
> We're using GNBD for the nodes to connect to the storage. We don't
> have the fastest storage setup right now, but I'm hopeful that if
> everything works well we'll be purchasing a faster storage setup.
>
> As far as MySQL using GFS (excluding anything with active-active) and
> using DLM to do locks, here are some comparisons:
>
> Benchmark on GFS
>
> Benchmark DBD suite: 2.15
> Date of test:        2006-10-26  9:49:43
> Running tests on:    Linux 2.6.9-42.0.2.ELhugemem i686
> Arguments:           --small-test --tcpip --fast --fast-insert
> --lock-tables
> Comments:
> Limits from:
> Server version:      MySQL 4.1.20/
> Optimization:        None
> Hardware:
>
> alter-table: Total time: 94 wallclock secs ( 0.02 usr  0.01 sys +
> 0.00 cusr  0.00 csys =  0.03 CPU)
> big-tables: Total time:  4 wallclock secs ( 0.13 usr  0.14 sys +  0.00
> cusr  0.00 csys =  0.27 CPU)
> connect: Total time:  5 wallclock secs ( 0.38 usr  0.53 sys +  0.00
> cusr  0.00 csys =  0.91 CPU)
> create: Total time:  8 wallclock secs ( 0.02 usr  0.01 sys +  0.00
> cusr  0.00 csys =  0.03 CPU)
> insert: Total time: 17 wallclock secs ( 2.19 usr  1.99 sys +  0.00
> cusr  0.00 csys =  4.18 CPU)
> select: Total time: 13 wallclock secs ( 2.36 usr  1.03 sys +  0.00
> cusr  0.00 csys =  3.39 CPU)
>
> Benchmark on Local
>
> alter-table: Total time: 70 wallclock secs ( 0.02 usr  0.00 sys +
> 0.00 cusr  0.00 csys =  0.02 CPU)
> big-tables: Total time:  2 wallclock secs ( 0.11 usr  0.14 sys +  0.00
> cusr  0.00 csys =  0.25 CPU)
> connect: Total time:  4 wallclock secs ( 0.37 usr  0.55 sys +  0.00
> cusr  0.00 csys =  0.92 CPU)
> create: Total time:  1 wallclock secs ( 0.01 usr  0.00 sys +  0.00
> cusr  0.00 csys =  0.01 CPU)
> insert: Total time: 13 wallclock secs ( 2.27 usr  1.95 sys +  0.00
> cusr  0.00 csys =  4.22 CPU)
> select: Total time: 12 wallclock secs ( 2.21 usr  0.97 sys +  0.00
> cusr  0.00 csys =  3.18 CPU)
>
> It's pretty darn close and I'm willing to take a small performance hit.
>
> Here's some relevant info: local storage is RAID5 and GFS is RAID10
> and shared using CLVM, multipath, and GNBD. So the speed of the test
> locally would probably be faster if it were either RAID1 or 10, not 5.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From wcheng at redhat.com  Fri Oct 27 13:47:22 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Fri, 27 Oct 2006 09:47:22 -0400
Subject: [Linux-cluster] GFS lock_nolock for multiple readers and one
	writers ?
In-Reply-To: <4541FECE.8070009@lexum.umontreal.ca>
References: <4541FECE.8070009@lexum.umontreal.ca>
Message-ID: <45420DEA.1060408@redhat.com>

FM wrote:
> Is it safe to do that kind of config IF I am 100 % sure that only one 
> computer will write to the GFS mount ?
>
> Regards,
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
But your read won't be correct due to Linux write-back design. Is that 
ok with your application ?

-- Wendy


From isplist at logicore.net  Fri Oct 27 14:24:31 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 27 Oct 2006 09:24:31 -0500
Subject: [Linux-cluster] The /etc/lmv Directory and files
Message-ID: <2006102792431.237290@leena>

I've noticed that the /etc/lvm directory ends up containing a lot of old files 
and information. Things I changed long ago and no longer use seem to sit in 
the backup directories and I'm guessing are never used again.

Since I'm learning, I decided to go ahead and nuke all the old stuff because I 
wonder if the lvm get's confused at all by using some of that old info?

Hope I'm explaining this correctly.

Mike


From tmornini at engineyard.com  Fri Oct 27 14:35:04 2006
From: tmornini at engineyard.com (Tom Mornini)
Date: Fri, 27 Oct 2006 07:35:04 -0700
Subject: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
In-Reply-To: <LGEOIPCNDMCLNENHDMKPGELJCPAA.johannes.russek@io-consulting.net>
References: <LGEOIPCNDMCLNENHDMKPGELJCPAA.johannes.russek@io-consulting.net>
Message-ID: <E4CDEC89-7E0F-45B0-AC8D-C0F38BFBD058@engineyard.com>

It only works for the MyISAM table type.

InnoDB, for instance, will not work.

--  
-- Tom Mornini

On Oct 27, 2006, at 6:50 AM, Johannes russek wrote:

> i'm sorry to hit just into this, but did i get it right that active/ 
> active
> mysql does actually work?
> regards, johannes
>
>> -----Original Message-----
>> From: linux-cluster-bounces at redhat.com
>> [mailto:linux-cluster-bounces at redhat.com]On Behalf Of David Brieck  
>> Jr.
>> Sent: Thursday, October 26, 2006 4:11 PM
>> To: linux clustering
>> Subject: Re: [Linux-cluster] Cluster vs Distributed? & MySQL Cluster?
>>
>>
>> On 10/25/06, Michael Will <mwill at penguincomputing.com> wrote:
>>> Are the actual data files shared in this setup between the active  
>>> mysql
>>> daemons?
>>>
>>> Last time I looked into this it seemed that with shared-nothing
>> model each
>>> mysql daemon would have to keep it's own copy of the data and  
>>> updates
>>> would be propagated from active to passive daemons (master-slave  
>>> model)
>>> or between active daemons (ndb in-ram database model)
>>>
>>> Are the mysql daemons running on the GFS I/O nodes that have  
>>> access to
>>> shared
>>> storage via SAN or iSCSI and coordinate locking through GFS
>>> infrastructure, or are
>>> the mysql daemons running on client nodes that use GFS to remotely
>>> access storage
>>> that is provided by other  GFS I/O nodes that in turn have
>> access to shared
>>> storage via SAN or iSCSI?
>>>
>>> Michael
>>>
>>
>> We're using GNBD for the nodes to connect to the storage. We don't
>> have the fastest storage setup right now, but I'm hopeful that if
>> everything works well we'll be purchasing a faster storage setup.
>>
>> As far as MySQL using GFS (excluding anything with active-active) and
>> using DLM to do locks, here are some comparisons:
>>
>> Benchmark on GFS
>>
>> Benchmark DBD suite: 2.15
>> Date of test:        2006-10-26  9:49:43
>> Running tests on:    Linux 2.6.9-42.0.2.ELhugemem i686
>> Arguments:           --small-test --tcpip --fast --fast-insert
>> --lock-tables
>> Comments:
>> Limits from:
>> Server version:      MySQL 4.1.20/
>> Optimization:        None
>> Hardware:
>>
>> alter-table: Total time: 94 wallclock secs ( 0.02 usr  0.01 sys +
>> 0.00 cusr  0.00 csys =  0.03 CPU)
>> big-tables: Total time:  4 wallclock secs ( 0.13 usr  0.14 sys +   
>> 0.00
>> cusr  0.00 csys =  0.27 CPU)
>> connect: Total time:  5 wallclock secs ( 0.38 usr  0.53 sys +  0.00
>> cusr  0.00 csys =  0.91 CPU)
>> create: Total time:  8 wallclock secs ( 0.02 usr  0.01 sys +  0.00
>> cusr  0.00 csys =  0.03 CPU)
>> insert: Total time: 17 wallclock secs ( 2.19 usr  1.99 sys +  0.00
>> cusr  0.00 csys =  4.18 CPU)
>> select: Total time: 13 wallclock secs ( 2.36 usr  1.03 sys +  0.00
>> cusr  0.00 csys =  3.39 CPU)
>>
>> Benchmark on Local
>>
>> alter-table: Total time: 70 wallclock secs ( 0.02 usr  0.00 sys +
>> 0.00 cusr  0.00 csys =  0.02 CPU)
>> big-tables: Total time:  2 wallclock secs ( 0.11 usr  0.14 sys +   
>> 0.00
>> cusr  0.00 csys =  0.25 CPU)
>> connect: Total time:  4 wallclock secs ( 0.37 usr  0.55 sys +  0.00
>> cusr  0.00 csys =  0.92 CPU)
>> create: Total time:  1 wallclock secs ( 0.01 usr  0.00 sys +  0.00
>> cusr  0.00 csys =  0.01 CPU)
>> insert: Total time: 13 wallclock secs ( 2.27 usr  1.95 sys +  0.00
>> cusr  0.00 csys =  4.22 CPU)
>> select: Total time: 12 wallclock secs ( 2.21 usr  0.97 sys +  0.00
>> cusr  0.00 csys =  3.18 CPU)
>>
>> It's pretty darn close and I'm willing to take a small performance  
>> hit.
>>
>> Here's some relevant info: local storage is RAID5 and GFS is RAID10
>> and shared using CLVM, multipath, and GNBD. So the speed of the test
>> locally would probably be faster if it were either RAID1 or 10,  
>> not 5.
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)


From dist-list at LEXUM.UMontreal.CA  Fri Oct 27 14:46:15 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Fri, 27 Oct 2006 10:46:15 -0400
Subject: [Linux-cluster] GFS lock_nolock for multiple readers and one
	writers ?
In-Reply-To: <45420DEA.1060408@redhat.com>
References: <4541FECE.8070009@lexum.umontreal.ca> <45420DEA.1060408@redhat.com>
Message-ID: <45421BB7.6040702@lexum.umontreal.ca>

Thank you for the reply,

All readers are  HTTPD servers reading static HTML files. Copies are
made using RSYNC.

But anyway, I will stay with the dlm_lock for consistency reason

Have a nice day


Wendy Cheng wrote:
> FM wrote:
>> Is it safe to do that kind of config IF I am 100 % sure that only one
>> computer will write to the GFS mount ?
>>
>> Regards,
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> But your read won't be correct due to Linux write-back design. Is that
> ok with your application ?
>
> -- Wendy
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From rpeterso at redhat.com  Fri Oct 27 15:25:43 2006
From: rpeterso at redhat.com (Robert Peterson)
Date: Fri, 27 Oct 2006 10:25:43 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <200610271137.44843.hlawatschek@atix.de>
References: <20061026141740.387533@leena> <45411DDC.1060803@redhat.com>
	<200610271137.44843.hlawatschek@atix.de>
Message-ID: <454224F7.6060100@redhat.com>

Mark Hlawatschek wrote:
> The url mentioned in that faq should be  http://www.open-sharedroot.org/
>   
This fix will go into my next update.  Thanks for the correction.
> I also had a speech about that at the redhat summit. Maybe it can be found on 
> the summit homepage ?
>   
I didn't see it out there.  Do you have a link I can use?
> Mark
>   
Regards,

Bob Peterson
Red Hat Cluster Suite


From hlawatschek at atix.de  Fri Oct 27 15:41:09 2006
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Fri, 27 Oct 2006 17:41:09 +0200
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <454224F7.6060100@redhat.com>
References: <20061026141740.387533@leena>
	<200610271137.44843.hlawatschek@atix.de>
	<454224F7.6060100@redhat.com>
Message-ID: <200610271741.09856.hlawatschek@atix.de>

Hi Robert,

> > I also had a speech about that at the redhat summit. Maybe it can be
> > found on the summit homepage ?
>
> I didn't see it out there.  Do you have a link I can use?

an abstract can be found at:
http://www.atix.de/event-archiv/diskless-shared-root-cluster-scalable-infrastructures-based-on-linux/

The complete slides: 
http://www.atix.de/downloads/vortrage-und-workshops/ATIX_Shared-Root-Cluster.pdf/

Thanks,

Mark


-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From lhh at redhat.com  Fri Oct 27 15:41:27 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 27 Oct 2006 11:41:27 -0400
Subject: [Linux-cluster] problems relocating services
In-Reply-To: <452ABE59.4030501@cesca.es>
References: <452200C5.6080406@cesca.es>
	<1159883433.8020.3.camel@rei.boston.devel.redhat.com>
	<45236805.60709@cesca.es>
	<1159981194.12856.14.camel@rei.boston.devel.redhat.com>
	<4523FED1.9010404@cesca.es>
	<1160075910.18145.18.camel@rei.boston.devel.redhat.com>
	<4529F2F2.3010903@cesca.es>
	<1160428559.31581.62.camel@rei.boston.devel.redhat.com>
	<452ABE59.4030501@cesca.es>
Message-ID: <1161963687.4518.133.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-09 at 23:25 +0200, Jordi Prats wrote:
> Hi Lon,
> There is any detailed documentation about all this options and consequences?

For failover domains:

http://people.redhat.com/lhh/fd.html

The 'exclusive' property means the service must be run a node
exclusively, that is, with no other services (unless the administrator
manually moves the service).  If the service can not have a node to
itself, the service is not runnable, and is stopped or disabled.

-- Lon


From lhh at redhat.com  Fri Oct 27 15:43:02 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 27 Oct 2006 11:43:02 -0400
Subject: [Linux-cluster] RHEL4 cluster source RPMs incomplete
In-Reply-To: <20061026165124.A10943@xos037.xos.nl>
References: <200610121452.k9CEqYg13019@xos037.xos.nl>
	<1161873875.4518.82.camel@rei.boston.devel.redhat.com>
	<20061026165124.A10943@xos037.xos.nl>
Message-ID: <1161963782.4518.135.camel@rei.boston.devel.redhat.com>

On Thu, 2006-10-26 at 16:51 +0200, Jos Vos wrote:

> > Ugh, are they still "not there" ?  Sometimes, the packages are a bit
> > behind.
> 
> The kernel modules in the meantime arrived, system-config-cluster 1.0.27
> (this was an U4 update) never appeared... :-(
> 

Ok, I'll find out about this.


From lhh at redhat.com  Fri Oct 27 15:44:01 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 27 Oct 2006 11:44:01 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <20061026145312.GA23304@monsterjam.org>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061026145312.GA23304@monsterjam.org>
Message-ID: <1161963841.4518.139.camel@rei.boston.devel.redhat.com>

On Thu, 2006-10-26 at 10:53 -0400, jason at monsterjam.org wrote:
> ok, thanks for the breakdown.. so basically, I just need to rebuild all 
> of my packages from 
> ftp://updates.redhat.com:/enterprise/4AS/en/RHGFS/SRPMS
> and try again, right?

Currently, yes, though it might be simpler to just check out CVS / RHEL4
and do it that way.

Make sure to install the kernel-devel package.

-- Lon


From lhh at redhat.com  Fri Oct 27 16:06:01 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 27 Oct 2006 12:06:01 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <20061027010315.GA470@monsterjam.org>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061027010315.GA470@monsterjam.org>
Message-ID: <1161965161.4518.159.camel@rei.boston.devel.redhat.com>

On Thu, 2006-10-26 at 21:03 -0400, jason at monsterjam.org wrote:

> Oct 25 20:31:14 tf1 rpcidmapd: rpc.idmapd startup succeeded
> Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PERC 4/DC         Rev: 351X
> Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 02
> Oct 25 20:31:14 tf1 kernel: scsi[1]: scanning scsi channel 1 [Phy 1] for non-raid devices
> Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PERC 4/DC         Rev: 351X
> Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 02
> Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PV22XS            Rev: E.17
> Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 03
> Oct 25 20:31:14 tf1 kernel: scsi[1]: scanning scsi channel 2 [virtual] for logical drives
> Oct 25 20:31:14 tf1 kernel:   Vendor: MegaRAID  Model: LD 0 RAID5  139G  Rev: 351X
> Oct 25 20:31:14 tf1 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
> Oct 25 20:31:14 tf1 kernel: scsi1 (2,0,0) : reservation conflict

Those things are in "cluster mode", right?


> Oct 25 20:31:14 tf1 kernel: sdb: asking for cache data failed
> Oct 25 20:31:14 tf1 kernel: sdb: assuming drive cache: write through
> Oct 25 20:31:14 tf1 kernel:  sdb: sdb1
> Oct 25 20:31:14 tf1 kernel: Attached scsi disk sdb at scsi1, channel 2, id 0, lun 0
> Oct 25 20:31:14 tf1 kernel: Adaptec aacraid driver (1.1-5[2412])
> Oct 25 20:31:14 tf1 kernel: device-mapper: 4.5.0-ioctl (2005-10-04) initialised: dm-devel at redhat.com
> Oct 25 20:31:14 tf1 kernel: EXT3-fs: INFO: recovery required on readonly filesystem.
> Oct 25 20:31:14 tf1 kernel: EXT3-fs: write access will be enabled during recovery.
> 
> so sdb is the gfs volume and is already locked by the other server at this point is my guess.

GFS doesn't do SCSI reservations.  Both nodes need concurrent write
access to the disks.  More to the point, see below...

> Oct 25 20:36:13 tf1 kernel: ------------[ cut here ]------------
> ...
> Oct 25 20:36:13 tf1 kernel:  <0>Fatal exception: panic in 5 seconds

^^^ Argh.

> so my question now is that it appears that I have something misconfigured.. tf1 should come up as secondary while tf2 is running as 
> primary, right? or should tf1 come up and take over as primary and tf2 let him?

Irrespective of anything you did (or didn't do), the panic above is a
bug in cman (or maybe the kernel, but not likely).

... The node panicked trying to start up the cluster software, before
GFS (or rgmanager, or dlm) was even in the picture.  You'll note that in
the modules list, 'gfs' and 'dlm' are not even listed.

I hope the newer cman-kernel / dlm-kernel fixes it ;) 

-- Lon


From lhh at redhat.com  Fri Oct 27 16:06:36 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 27 Oct 2006 12:06:36 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <1161963841.4518.139.camel@rei.boston.devel.redhat.com>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061026145312.GA23304@monsterjam.org>
	<1161963841.4518.139.camel@rei.boston.devel.redhat.com>
Message-ID: <1161965196.4518.161.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-27 at 11:44 -0400, Lon Hohberger wrote:
> On Thu, 2006-10-26 at 10:53 -0400, jason at monsterjam.org wrote:
> > ok, thanks for the breakdown.. so basically, I just need to rebuild all 
> > of my packages from 
> > ftp://updates.redhat.com:/enterprise/4AS/en/RHGFS/SRPMS
> > and try again, right?
> 
> Currently, yes, though it might be simpler to just check out CVS / RHEL4
> and do it that way.
> 
> Make sure to install the kernel-devel package.

Or, install the CentOS rebuilds:

http://mirror.centos.org/centos/4/csgfs/i386/RPMS/

-- Lon


From isplist at logicore.net  Fri Oct 27 16:20:08 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 27 Oct 2006 11:20:08 -0500
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <4540BA0A.5040305@redhat.com>
Message-ID: <2006102711208.265687@leena>

PS: I then updated all of the software on that node so that it was the same as 
all of the other nodes, fired it up and it's now a new node in the cluster.

Mike


On Thu, 26 Oct 2006 14:37:14 +0100, Patrick Caulfield wrote:
> isplist at logicore.net wrote:
> 
>> I missed the original on this but I had something interesting happen
>> yesterday. I installed a new node, introduced it into the cluster and
>> immediately, all of the other nodes died with a kernel panic. Not sure why
>> yet.
>> 
> That sounds like the U3 (inconsistent cluster view) bug. it's fixed in the
> latest builds.


From lhh at redhat.com  Fri Oct 27 18:44:26 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 27 Oct 2006 14:44:26 -0400
Subject: [Linux-cluster] Samba share resource fails to start after
	upgrade to Cluster Suite 4U4
In-Reply-To: <4538E16F.3060709@streppone.it>
References: <4538E16F.3060709@streppone.it>
Message-ID: <1161974666.4518.208.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-20 at 16:47 +0200, Cosimo Streppone wrote:
> Hi all clustering guys,
> 
> I'm experiencing a problem with a 2 node cluster just upgraded
> (this morning) to current RHEL 4U4 and CS.
> 
> cluster.conf file didn't change in respect to this samba resource.
> It is a external smb share that should be mounted on the active
> node.
> 
> Relevant error messages from system log:
> 
>    Oct 20 14:14:05 clu4-2 smbd[7467]:   params.c:OpenConfFile() - Unable to open 
> configuration file "/etc/samba/smb.conf.//share/exportdb":
>    Oct 20 14:14:06 clu4-2 clurgmgrd: [3205]: <err> Samba service failed: /usr/sbin/smbd -D 
> -s "/etc/samba/smb.conf.//share/exportdb"
>    Oct 20 14:14:20 clu4-2 clurgmgrd: [3205]: <warning> 
> "/etc/samba/smb.conf.//share/exportdb" missing during stop
>    Oct 20 14:14:40 clu4-2 clurgmgrd: [3205]: <warning> 
> "/etc/samba/smb.conf.//share/exportdb" missing during stop
>    Oct 20 14:14:48 clu4-2 clurgmgrd: [3205]: <warning> 
> "/etc/samba/smb.conf.//share/exportdb" missing during stop
> 
> I don't know why it searches for "/etc/samba/smb.conf.//share/exportdb"...
> Maybe something changed in the way smb shares are managed?

Could you post your service config?

-- Lon


From cosimo at streppone.it  Fri Oct 27 21:16:08 2006
From: cosimo at streppone.it (Cosimo Streppone)
Date: Fri, 27 Oct 2006 23:16:08 +0200
Subject: [Linux-cluster] Samba share resource fails to start after	upgrade
	to Cluster Suite 4U4
In-Reply-To: <1161974666.4518.208.camel@rei.boston.devel.redhat.com>
References: <4538E16F.3060709@streppone.it>
	<1161974666.4518.208.camel@rei.boston.devel.redhat.com>
Message-ID: <45427718.5030304@streppone.it>

Lon Hohberger wrote:

> On Fri, 2006-10-20 at 16:47 +0200, Cosimo Streppone wrote:
> 
>>I'm experiencing a problem with a 2 node cluster just upgraded
>>(this morning) to current RHEL 4U4 and CS.

> Could you post your service config?

Here it is, assuming you're talking about cluster.conf.
Now smb share has been temporarily excluded from service because
of these problems.
Before that, I "attached" it as a "shared resource" to my service.

-- 
Cosimo

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cluster.conf.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061027/2ff86a28/attachment.txt>

From jason at monsterjam.org  Sat Oct 28 03:14:02 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Fri, 27 Oct 2006 23:14:02 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <1161965161.4518.159.camel@rei.boston.devel.redhat.com>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061027010315.GA470@monsterjam.org>
	<1161965161.4518.159.camel@rei.boston.devel.redhat.com>
Message-ID: <20061028031402.GB20446@monsterjam.org>

Im 99% sure that these disks are in the shared/clusterd mode.
Ill update my rpms from http://mirror.centos.org/centos/4/csgfs/i386/RPMS/
and see what I get.

Jason

On Fri, Oct 27, 2006 at 12:06:01PM -0400, Lon Hohberger wrote:
> On Thu, 2006-10-26 at 21:03 -0400, jason at monsterjam.org wrote:
> 
> > Oct 25 20:31:14 tf1 rpcidmapd: rpc.idmapd startup succeeded
> > Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PERC 4/DC         Rev: 351X
> > Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 02
> > Oct 25 20:31:14 tf1 kernel: scsi[1]: scanning scsi channel 1 [Phy 1] for non-raid devices
> > Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PERC 4/DC         Rev: 351X
> > Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 02
> > Oct 25 20:31:14 tf1 kernel:   Vendor: DELL      Model: PV22XS            Rev: E.17
> > Oct 25 20:31:14 tf1 kernel:   Type:   Processor                          ANSI SCSI revision: 03
> > Oct 25 20:31:14 tf1 kernel: scsi[1]: scanning scsi channel 2 [virtual] for logical drives
> > Oct 25 20:31:14 tf1 kernel:   Vendor: MegaRAID  Model: LD 0 RAID5  139G  Rev: 351X
> > Oct 25 20:31:14 tf1 kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
> > Oct 25 20:31:14 tf1 kernel: scsi1 (2,0,0) : reservation conflict
> 
> Those things are in "cluster mode", right?
> 
> 
> > Oct 25 20:31:14 tf1 kernel: sdb: asking for cache data failed
> > Oct 25 20:31:14 tf1 kernel: sdb: assuming drive cache: write through
> > Oct 25 20:31:14 tf1 kernel:  sdb: sdb1
> > Oct 25 20:31:14 tf1 kernel: Attached scsi disk sdb at scsi1, channel 2, id 0, lun 0
> > Oct 25 20:31:14 tf1 kernel: Adaptec aacraid driver (1.1-5[2412])
> > Oct 25 20:31:14 tf1 kernel: device-mapper: 4.5.0-ioctl (2005-10-04) initialised: dm-devel at redhat.com
> > Oct 25 20:31:14 tf1 kernel: EXT3-fs: INFO: recovery required on readonly filesystem.
> > Oct 25 20:31:14 tf1 kernel: EXT3-fs: write access will be enabled during recovery.
> > 
> > so sdb is the gfs volume and is already locked by the other server at this point is my guess.
> 
> GFS doesn't do SCSI reservations.  Both nodes need concurrent write
> access to the disks.  More to the point, see below...
> 
> > Oct 25 20:36:13 tf1 kernel: ------------[ cut here ]------------
> > ...
> > Oct 25 20:36:13 tf1 kernel:  <0>Fatal exception: panic in 5 seconds
> 
> ^^^ Argh.
> 
> > so my question now is that it appears that I have something misconfigured.. tf1 should come up as secondary while tf2 is running as 
> > primary, right? or should tf1 come up and take over as primary and tf2 let him?
> 
> Irrespective of anything you did (or didn't do), the panic above is a
> bug in cman (or maybe the kernel, but not likely).
> 
> ... The node panicked trying to start up the cluster software, before
> GFS (or rgmanager, or dlm) was even in the picture.  You'll note that in
> the modules list, 'gfs' and 'dlm' are not even listed.
> 
> I hope the newer cman-kernel / dlm-kernel fixes it ;) 
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
================================================
|    Jason Welsh   jason at monsterjam.org        |
| http://monsterjam.org    DSS PGP: 0x5E30CC98 |
|    gpg key: http://monsterjam.org/gpg/       |
================================================


From jason at monsterjam.org  Sun Oct 29 02:19:52 2006
From: jason at monsterjam.org (jason at monsterjam.org)
Date: Sat, 28 Oct 2006 22:19:52 -0400
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <1161965196.4518.161.camel@rei.boston.devel.redhat.com>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061026145312.GA23304@monsterjam.org>
	<1161963841.4518.139.camel@rei.boston.devel.redhat.com>
	<1161965196.4518.161.camel@rei.boston.devel.redhat.com>
Message-ID: <20061029021952.GA30056@monsterjam.org>

ummm, If I install these centos rpms for GFS and ClusterServer, wont I also need to run the centos set of kernels?
Is this really doable on a Red Hat Enterprise Linux AS release 4 (Nahant Update 4) server?

Jason

On Fri, Oct 27, 
2006 at 12:06:36PM -0400, Lon Hohberger 
wrote:
> On Fri, 2006-10-27 at 11:44 -0400, Lon Hohberger wrote:
> > On Thu, 2006-10-26 at 10:53 -0400, jason at monsterjam.org wrote:
> > > ok, thanks for the breakdown.. so basically, I just need to rebuild all 
> > > of my packages from 
> > > ftp://updates.redhat.com:/enterprise/4AS/en/RHGFS/SRPMS
> > > and try again, right?
> > 
> > Currently, yes, though it might be simpler to just check out CVS / RHEL4
> > and do it that way.
> > 
> > Make sure to install the kernel-devel package.
> 
> Or, install the CentOS rebuilds:
> 
> http://mirror.centos.org/centos/4/csgfs/i386/RPMS/
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
================================================
|    Jason Welsh   jason at monsterjam.org        |
| http://monsterjam.org    DSS PGP: 0x5E30CC98 |
|    gpg key: http://monsterjam.org/gpg/       |
================================================


From eth at ethaniel.com  Sun Oct 29 21:30:48 2006
From: eth at ethaniel.com (Arkadiy Kulev)
Date: Mon, 30 Oct 2006 00:30:48 +0300
Subject: [Linux-cluster] problems with fenced and gman_controld
Message-ID: <796650718.20061030003048@ethaniel.com>

Hello everyone,

I have setted up a one-node cluster and having difficulties with the
fenced and gfs_controld.

I have started "ccsd" (everything went ok). I did "cman_tool join" (it
went ok too).
But when I do "fenced" or "gfs_controld" I get the following errors:

Oct 30 00:57:02 localhost fenced[2586]: group_init error 0 111
Oct 30 01:15:35 localhost gfs_controld[2622]: group_init error 0 111

The messages log:

Oct 30 00:54:11 localhost openais[2506]: [CLM  ] CLM CONFIGURATION CHANGE
Oct 30 00:54:11 localhost openais[2506]: [CLM  ] New Configuration:
Oct 30 00:54:11 localhost openais[2506]: [CLM  ]        r(0) ip(10.0.0.110)
Oct 30 00:54:11 localhost openais[2506]: [CLM  ] Members Left:
Oct 30 00:54:11 localhost openais[2506]: [CLM  ] Members Joined:
Oct 30 00:54:11 localhost openais[2506]: [CLM  ]        r(0) ip(10.0.0.110)
Oct 30 00:54:11 localhost openais[2506]: [CMAN ] quorum regained, resuming activity
Oct 30 00:54:11 localhost openais[2506]: [SYNC ] This node is within the primary component and will provide service.
Oct 30 00:54:11 localhost openais[2506]: [TOTEM] entering OPERATIONAL state.
Oct 30 00:54:11 localhost openais[2506]: [CLM  ] got nodejoin message 10.0.0.110


the cluster.conf:

<?xml version="1.0"?>
<cluster name="clust" config_version="13">
    <cman expected_votes="1">
<multicast addr="224.0.0.1"/>
    </cman>
    <clusternodes>
      <clusternode name="10.0.0.110" votes="1" nodeid="1">
<multicast addr="224.0.0.1" interface="eth0"/>
       <fence>
        <method name="single">
         <device name="human" ipaddr="10.0.0.110"/>
       </method>
      </fence>
     </clusternode>
   </clusternodes>
  <fence_devices>
   <fence_device name="human" agent="fence_manual"/>
  </fence_devices>
 </cluster>


-- 
Best regards,
 Arkadiy                          mailto:eth at ethaniel.com


From diaclau76 at gmail.com  Sun Oct 29 23:40:54 2006
From: diaclau76 at gmail.com (claudia quispe)
Date: Sun, 29 Oct 2006 18:40:54 -0500
Subject: [Linux-cluster] problems with fenced and gman_controld
In-Reply-To: <796650718.20061030003048@ethaniel.com>
References: <796650718.20061030003048@ethaniel.com>
Message-ID: <a44f5690610291540i512f4e68q73c04ff8363ceecf@mail.gmail.com>

holas.. alguien me puede ayudar.. el GFS por primera ves... por fis..
cuales. son la configuracion que tengo que hacer.. gracias...


On 10/29/06, Arkadiy Kulev <eth at ethaniel.com> wrote:
>
> Hello everyone,
>
> I have setted up a one-node cluster and having difficulties with the
> fenced and gfs_controld.
>
> I have started "ccsd" (everything went ok). I did "cman_tool join" (it
> went ok too).
> But when I do "fenced" or "gfs_controld" I get the following errors:
>
> Oct 30 00:57:02 localhost fenced[2586]: group_init error 0 111
> Oct 30 01:15:35 localhost gfs_controld[2622]: group_init error 0 111
>
> The messages log:
>
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ] CLM CONFIGURATION CHANGE
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ] New Configuration:
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ]        r(0) ip(10.0.0.110
> )
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ] Members Left:
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ] Members Joined:
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ]        r(0) ip(10.0.0.110
> )
> Oct 30 00:54:11 localhost openais[2506]: [CMAN ] quorum regained, resuming
> activity
> Oct 30 00:54:11 localhost openais[2506]: [SYNC ] This node is within the
> primary component and will provide service.
> Oct 30 00:54:11 localhost openais[2506]: [TOTEM] entering OPERATIONAL
> state.
> Oct 30 00:54:11 localhost openais[2506]: [CLM  ] got nodejoin message
> 10.0.0.110
>
>
> the cluster.conf:
>
> <?xml version="1.0"?>
> <cluster name="clust" config_version="13">
>    <cman expected_votes="1">
> <multicast addr="224.0.0.1"/>
>    </cman>
>    <clusternodes>
>      <clusternode name="10.0.0.110" votes="1" nodeid="1">
> <multicast addr="224.0.0.1" interface="eth0"/>
>       <fence>
>        <method name="single">
>         <device name="human" ipaddr="10.0.0.110"/>
>       </method>
>      </fence>
>     </clusternode>
>   </clusternodes>
> <fence_devices>
>   <fence_device name="human" agent="fence_manual"/>
> </fence_devices>
> </cluster>
>
>
>
>
> --
> Best regards,
> Arkadiy                          mailto:eth at ethaniel.com
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061029/ca18955e/attachment.htm>

From diaclau76 at gmail.com  Sun Oct 29 23:41:23 2006
From: diaclau76 at gmail.com (claudia quispe)
Date: Sun, 29 Oct 2006 18:41:23 -0500
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <20061029021952.GA30056@monsterjam.org>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061026145312.GA23304@monsterjam.org>
	<1161963841.4518.139.camel@rei.boston.devel.redhat.com>
	<1161965196.4518.161.camel@rei.boston.devel.redhat.com>
	<20061029021952.GA30056@monsterjam.org>
Message-ID: <a44f5690610291541w2a5e66d5y8bf8095f3caa9259@mail.gmail.com>

holas.. alguien me puede ayudar.. el GFS por primera ves... por fis..
cuales. son la configuracion que tengo que hacer.. gracias...


On 10/28/06, jason at monsterjam.org <jason at monsterjam.org> wrote:
>
> ummm, If I install these centos rpms for GFS and ClusterServer, wont I
> also need to run the centos set of kernels?
> Is this really doable on a Red Hat Enterprise Linux AS release 4 (Nahant
> Update 4) server?
>
> Jason
>
> On Fri, Oct 27,
> 2006 at 12:06:36PM -0400, Lon Hohberger
> wrote:
> > On Fri, 2006-10-27 at 11:44 -0400, Lon Hohberger wrote:
> > > On Thu, 2006-10-26 at 10:53 -0400, jason at monsterjam.org wrote:
> > > > ok, thanks for the breakdown.. so basically, I just need to rebuild
> all
> > > > of my packages from
> > > > ftp://updates.redhat.com:/enterprise/4AS/en/RHGFS/SRPMS
> > > > and try again, right?
> > >
> > > Currently, yes, though it might be simpler to just check out CVS /
> RHEL4
> > > and do it that way.
> > >
> > > Make sure to install the kernel-devel package.
> >
> > Or, install the CentOS rebuilds:
> >
> > http://mirror.centos.org/centos/4/csgfs/i386/RPMS/
> >
> > -- Lon
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> ================================================
> |    Jason Welsh   jason at monsterjam.org        |
> | http://monsterjam.org    DSS PGP: 0x5E30CC98 |
> |    gpg key: http://monsterjam.org/gpg/       |
> ================================================
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061029/f42e3aa7/attachment.htm>

From diaclau76 at gmail.com  Sun Oct 29 23:41:41 2006
From: diaclau76 at gmail.com (claudia quispe)
Date: Sun, 29 Oct 2006 18:41:41 -0500
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <200610271741.09856.hlawatschek@atix.de>
References: <20061026141740.387533@leena>
	<200610271137.44843.hlawatschek@atix.de> <454224F7.6060100@redhat.com>
	<200610271741.09856.hlawatschek@atix.de>
Message-ID: <a44f5690610291541w46149f8ct87a346c8fdc1bd34@mail.gmail.com>

holas.. alguien me puede ayudar.. el GFS por primera ves... por fis..
cuales. son la configuracion que tengo que hacer.. gracias...


On 10/27/06, Mark Hlawatschek <hlawatschek at atix.de> wrote:
>
> Hi Robert,
>
> > > I also had a speech about that at the redhat summit. Maybe it can be
> > > found on the summit homepage ?
> >
> > I didn't see it out there.  Do you have a link I can use?
>
> an abstract can be found at:
>
> http://www.atix.de/event-archiv/diskless-shared-root-cluster-scalable-infrastructures-based-on-linux/
>
> The complete slides:
>
> http://www.atix.de/downloads/vortrage-und-workshops/ATIX_Shared-Root-Cluster.pdf/
>
> Thanks,
>
> Mark
>
>
> --
> Gruss / Regards,
>
> Dipl.-Ing. Mark Hlawatschek
> http://www.atix.de/
> http://www.open-sharedroot.org/
>
> **
> ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> Einsteinstr. 10 - 85716 Unterschleissheim - Germany
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061029/d0780c43/attachment.htm>

From diaclau76 at gmail.com  Sun Oct 29 23:42:02 2006
From: diaclau76 at gmail.com (claudia quispe)
Date: Sun, 29 Oct 2006 18:42:02 -0500
Subject: [Linux-cluster] RHEL4 cluster source RPMs incomplete
In-Reply-To: <1161963782.4518.135.camel@rei.boston.devel.redhat.com>
References: <200610121452.k9CEqYg13019@xos037.xos.nl>
	<1161873875.4518.82.camel@rei.boston.devel.redhat.com>
	<20061026165124.A10943@xos037.xos.nl>
	<1161963782.4518.135.camel@rei.boston.devel.redhat.com>
Message-ID: <a44f5690610291542h393ef24cybefaf24f8ac0e7b5@mail.gmail.com>

holas.. alguien me puede ayudar.. el GFS por primera ves... por fis..
cuales. son la configuracion que tengo que hacer.. gracias...


On 10/27/06, Lon Hohberger <lhh at redhat.com> wrote:
>
> On Thu, 2006-10-26 at 16:51 +0200, Jos Vos wrote:
>
> > > Ugh, are they still "not there" ?  Sometimes, the packages are a bit
> > > behind.
> >
> > The kernel modules in the meantime arrived, system-config-cluster 1.0.27
> > (this was an U4 update) never appeared... :-(
> >
>
> Ok, I'll find out about this.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061029/ea16c281/attachment.htm>

From isplist at logicore.net  Mon Oct 30 00:52:04 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sun, 29 Oct 2006 18:52:04 -0600
Subject: [Linux-cluster] Multiple Roots
Message-ID: <2006102918524.237384@leena>

This might be a long shot but... is there a way of taking a single GFS 
filesystem and creating multiple roots that only certain machines in the 
cluster will see?

I'm almost sure I read somewhere that there is a way to do this. The reason 
is, 

I keep having to separate my storage up and I would much rather maintain one 
large storage array (contiguous) and dole out the space as needed to the 
various applications. 

That is an inherent part of fibre channel storage which is what I'm using so 
assume it can be done with GFS.

If I didn't explain some part correctly, by all means, just ask.

Thanks much for any input on this.

Mike


From isplist at logicore.net  Mon Oct 30 01:00:37 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sun, 29 Oct 2006 19:00:37 -0600
Subject: [Linux-cluster] PS: Multiple roots
Message-ID: <2006102919037.243468@leena>

Forgot to mention, I'm asking about multiple roots and not multiple 
partitions. 

I'm guessing the only way would be to create GFS partitions, assign each as 
'use all available space' (what ever that option is called), then keep an eye 
on the overall aggregated storage as a volume group.

Mike


From irwan at magnifix.com.my  Mon Oct 30 02:14:17 2006
From: irwan at magnifix.com.my (Mohd Irwan Jamaluddin)
Date: Mon, 30 Oct 2006 10:14:17 +0800
Subject: [Linux-cluster] Network-attached power switches for RHCS
Message-ID: <1162174457.4330.22.camel@kuli.magnifix.com.my>

Good day guys,

I would like to have your opinion regarding Network-attached power
switches for Red Hat Cluster Suite (RHCS). I know there are 2 main
brands available; APC & WTI. Both of them are having several models.

Below are my finding:

APC: MasterSwitch plus, 1U, 15A, 120V, (8)5-15 (AP9225)
http://www.apc.com/resource/include/techspec_index.cfm?
base_sku=AP9225&ISOCountryCode=us 

APC: MasterSwitch plus exp, 1U, 15A, 120V, (8)5-15 (AP9225EXP)
http://www.apc.com/resource/include/techspec_index.cfm?
base_sku=AP9225EXP# 

WTI: NetReach? Model IPS-800
http://www.wti.com/specsnr/ips8spex.htm

WTI: NetReach? Model IPS-800-CE
http://www.wti.com/specsnr/ips8cespex.htm

>From your experience, which one is better in term or performance,
reliability & value? If you have any other suggestion, please don't
hesitate to let me know.

Thanks.

-- 
Regards,
+--------------------------------+
|       Mohd Irwan Jamaluddin    |
| ##    System Engineer,         |
| (o_   Magnifix Sdn. Bhd.       |
| //\   Tel: +603 42705073       |
| V_/_  Fax: +603 42701960       |
|       http://www.magnifix.com/ |      
+--------------------------------+
| "Every successful side needs   |              
| unsung heroes" - fcbayern.de   |      
+--------------------------------+


From chawkins at bplinux.com  Mon Oct 30 03:48:36 2006
From: chawkins at bplinux.com (Christopher Hawkins)
Date: Sun, 29 Oct 2006 22:48:36 -0500
Subject: [Linux-cluster] Multiple Roots
In-Reply-To: <2006102918524.237384@leena>
Message-ID: <200610300327.k9U3RqAA014580@mail2.ontariocreditcorp.com>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of 
> isplist at logicore.net
> Sent: Sunday, October 29, 2006 7:52 PM
> To: linux-cluster
> Subject: [Linux-cluster] Multiple Roots
> 
> This might be a long shot but... is there a way of taking a 
> single GFS filesystem and creating multiple roots that only 
> certain machines in the cluster will see?
> 
> I'm almost sure I read somewhere that there is a way to do 
> this. The reason is, 
> 
> I keep having to separate my storage up and I would much 
> rather maintain one large storage array (contiguous) and dole 
> out the space as needed to the various applications. 
> 
> That is an inherent part of fibre channel storage which is 
> what I'm using so assume it can be done with GFS.
> 
> If I didn't explain some part correctly, by all means, just ask.
> 
> Thanks much for any input on this.
> 
> Mike
> 
I think you might want cdsl's (context dependent symlinks). Have you worked
with these before?   Like:
Move /etc to /a_node_name/etc and replace /etc with a symlink: 

ln -s /@hostname/etc /etc

Then every node will insert it's hostname into the path and go to the right
place for /etc.   

Chris


From isplist at logicore.net  Mon Oct 30 04:29:44 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Sun, 29 Oct 2006 22:29:44 -0600
Subject: [Linux-cluster] Multiple Roots
In-Reply-To: <200610300327.k9U3RqAA014580@mail2.ontariocreditcorp.com>
Message-ID: <20061029222944.076893@leena>

>> This might be a long shot but... is there a way of taking a
>> single GFS filesystem and creating multiple roots that only
>> certain machines in the cluster will see?

> I think you might want cdsl's (context dependent symlinks). Have you worked
> with these before?   Like:
> Move /etc to /a_node_name/etc and replace /etc with a symlink:

No, I had not looked at this yet. It's only for the nodes data, not so much 
the OS since I'll be looking at shared root nodes in the near future. 

While I'm putting all this together though, I started wondering what might be 
the best way of dealing with storage.

I can create one large volume group and keep adding/removing space as needed 
with one shared root for everything (or the above idea possibly) or create 
multiple volume groups and individual logical volumes for the various needs 
I'll have. 
What I'm saying is, it's not clear to me what the best way of dealing with 
storage is just yet, so that I can fail over the nodes, move storage around, 
etc.

Mike


> 
> ln -s /@hostname/etc /etc
> 
> Then every node will insert it's hostname into the path and go to the right
> place for /etc.
> 
> Chris
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From Alain.Moulle at bull.net  Mon Oct 30 07:28:05 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 30 Oct 2006 08:28:05 +0100
Subject: [Linux-cluster] CS4 U2 & U4 / crash when	"less
	/proc/cluster/status"
Message-ID: <4545A985.5010900@bull.net>

Hi
Is it always useful to create a defect or did you already
create it since you have reproduce the problem and found
how to fix it ?
Alain Moull?

Alain Moulle wrote:

>> Is there already a patch number somewhere for U4 ?
>> Thanks
>> Alain Moull?

> From: Patrick Caulfield
> No, I'm afraid not.
> Patrick


From pcaulfie at redhat.com  Mon Oct 30 09:13:11 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 30 Oct 2006 09:13:11 +0000
Subject: [Linux-cluster] CS4 U2 & U4 / crash
	when	"less	/proc/cluster/status"
In-Reply-To: <4545A985.5010900@bull.net>
References: <4545A985.5010900@bull.net>
Message-ID: <4545C227.9060505@redhat.com>

Alain Moulle wrote:
> Hi
> Is it always useful to create a defect or did you already
> create it since you have reproduce the problem and found
> how to fix it ?

Yes, can you create a bugzilla for this please. it gives us something to 
work against.

Patrick


From lhh at redhat.com  Mon Oct 30 14:24:10 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 30 Oct 2006 09:24:10 -0500
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <20061029021952.GA30056@monsterjam.org>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061026145312.GA23304@monsterjam.org>
	<1161963841.4518.139.camel@rei.boston.devel.redhat.com>
	<1161965196.4518.161.camel@rei.boston.devel.redhat.com>
	<20061029021952.GA30056@monsterjam.org>
Message-ID: <1162218250.4518.210.camel@rei.boston.devel.redhat.com>

On Sat, 2006-10-28 at 22:19 -0400, jason at monsterjam.org wrote:
> ummm, If I install these centos rpms for GFS and ClusterServer, wont I also need to run the centos set of kernels?
> Is this really doable on a Red Hat Enterprise Linux AS release 4 (Nahant Update 4) server?

Good point; I have no idea.  However, the non-kernel RPMs should be
fine.

-- Lon


From hyclak at math.ohiou.edu  Mon Oct 30 14:27:26 2006
From: hyclak at math.ohiou.edu (Matt Hyclak)
Date: Mon, 30 Oct 2006 09:27:26 -0500
Subject: [Linux-cluster] weird happenings on my cluster and another panic.
In-Reply-To: <1162218250.4518.210.camel@rei.boston.devel.redhat.com>
References: <20061026005601.GA24086@monsterjam.org>
	<1161872071.4518.55.camel@rei.boston.devel.redhat.com>
	<20061026145312.GA23304@monsterjam.org>
	<1161963841.4518.139.camel@rei.boston.devel.redhat.com>
	<1161965196.4518.161.camel@rei.boston.devel.redhat.com>
	<20061029021952.GA30056@monsterjam.org>
	<1162218250.4518.210.camel@rei.boston.devel.redhat.com>
Message-ID: <20061030142726.GA23458@math.ohiou.edu>

On Mon, Oct 30, 2006 at 09:24:10AM -0500, Lon Hohberger enlightened us:
> > ummm, If I install these centos rpms for GFS and ClusterServer, wont I also need to run the centos set of kernels?
> > Is this really doable on a Red Hat Enterprise Linux AS release 4 (Nahant Update 4) server?
> 
> Good point; I have no idea.  However, the non-kernel RPMs should be
> fine.
> 

They are built from the same SRPMs, so I don't imagine it would be a problem.

Matt

-- 
Matt Hyclak
Department of Mathematics 
Department of Social Work
Ohio University
(740) 593-1263


From teigland at redhat.com  Mon Oct 30 16:27:00 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 30 Oct 2006 10:27:00 -0600
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <45413E1E.9020004@redhat.com>
References: <20061026165855.943665@leena> <45413E1E.9020004@redhat.com>
Message-ID: <20061030162700.GB24813@redhat.com>

On Thu, Oct 26, 2006 at 06:00:46PM -0500, Robert Peterson wrote:
> isplist at logicore.net wrote:
> >Interesting.. everything's gone now? I changed and now all volume groups 
> >have disappeared. I'm guessing I've lost all my data?
> >
> >I changed things as follows;
> >
> >From;
> >
> ># gfs_tool sb /dev/rimfire/rimfire table
> >current lock table name = "vgcomp:gfs1"
> >
> ># gfs_tool sb /dev/storec/web table
> >current lock table name = "vgcomp:gfs1"
> >
> ># gfs_tool sb /dev/storeb/qm table
> >current lock table name = "vgcomp:gfs1"
> >
> >To;
> >
> >gfs_tool sb /dev/storeb/qm table qm:gfs1
> >gfs_tool sb /dev/storec/web table web:gfs1
> >gfs_tool sb /dev/rimfire/rimfire table rim:gfs1

clustername:fsname

Assuming "vgcomp" is what cluster.conf says, then
vgcomp:qm
vgcomp:web
vgcomp:rim


From teigland at redhat.com  Mon Oct 30 16:29:32 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 30 Oct 2006 10:29:32 -0600
Subject: [Linux-cluster] Multiple Logical Volumes
In-Reply-To: <200610271943.988010@leena>
References: <45410171.8090901@redhat.com> <200610271943.988010@leena>
Message-ID: <20061030162932.GC24813@redhat.com>

On Fri, Oct 27, 2006 at 01:09:43AM -0500, isplist at logicore.net wrote:

> Because, for example, trying to mount the following leads to the denied
> error, it seems, because the cluster is expecting vgcomp.
> 
> # mount -t gfs /dev/VolGroup01/rimfire /gfs/rimfire/
> mount: permission denied

Indeed (see prev mail), and /var/log/messages will usually give you a
specific error re the failed mount.

Dave


From teigland at redhat.com  Mon Oct 30 16:32:19 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 30 Oct 2006 10:32:19 -0600
Subject: [Linux-cluster] problems with fenced and gman_controld
In-Reply-To: <796650718.20061030003048@ethaniel.com>
References: <796650718.20061030003048@ethaniel.com>
Message-ID: <20061030163219.GD24813@redhat.com>

On Mon, Oct 30, 2006 at 12:30:48AM +0300, Arkadiy Kulev wrote:
> Hello everyone,
> 
> I have setted up a one-node cluster and having difficulties with the
> fenced and gfs_controld.
> 
> I have started "ccsd" (everything went ok). I did "cman_tool join" (it
> went ok too).
> But when I do "fenced" or "gfs_controld" I get the following errors:
> 
> Oct 30 00:57:02 localhost fenced[2586]: group_init error 0 111
> Oct 30 01:15:35 localhost gfs_controld[2622]: group_init error 0 111

Looks like groupd isn't running, see the "cman" init script or
http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/doc/usage.txt?cvsroot=cluster

> <?xml version="1.0"?>
> <cluster name="clust" config_version="13">
>     <cman expected_votes="1">
> <multicast addr="224.0.0.1"/>
>     </cman>
>     <clusternodes>
>       <clusternode name="10.0.0.110" votes="1" nodeid="1">
> <multicast addr="224.0.0.1" interface="eth0"/>
>        <fence>
>         <method name="single">
>          <device name="human" ipaddr="10.0.0.110"/>
>        </method>
>       </fence>
>      </clusternode>
>    </clusternodes>
>   <fence_devices>
>    <fence_device name="human" agent="fence_manual"/>
>   </fence_devices>

"fencedevices", "fencedevice"

Dave


From lhh at redhat.com  Mon Oct 30 18:16:27 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 30 Oct 2006 13:16:27 -0500
Subject: [Linux-cluster] Network-attached power switches for RHCS
In-Reply-To: <1162174457.4330.22.camel@kuli.magnifix.com.my>
References: <1162174457.4330.22.camel@kuli.magnifix.com.my>
Message-ID: <1162232187.4518.312.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-30 at 10:14 +0800, Mohd Irwan Jamaluddin wrote:
> Good day guys,
> 
> I would like to have your opinion regarding Network-attached power
> switches for Red Hat Cluster Suite (RHCS). I know there are 2 main
> brands available; APC & WTI. Both of them are having several models.
> 
> Below are my finding:
> 
> APC: MasterSwitch plus, 1U, 15A, 120V, (8)5-15 (AP9225)
> http://www.apc.com/resource/include/techspec_index.cfm?
> base_sku=AP9225&ISOCountryCode=us 
> 
> APC: MasterSwitch plus exp, 1U, 15A, 120V, (8)5-15 (AP9225EXP)
> http://www.apc.com/resource/include/techspec_index.cfm?
> base_sku=AP9225EXP# 
> 
> WTI: NetReach? Model IPS-800
> http://www.wti.com/specsnr/ips8spex.htm
> 
> WTI: NetReach? Model IPS-800-CE
> http://www.wti.com/specsnr/ips8cespex.htm
> 
> >From your experience, which one is better in term or performance,
> reliability & value? If you have any other suggestion, please don't
> hesitate to let me know.

1U APC switches are less expensive and have SNMP capabilities, but to
feed NSPF power to a cluster, you would need at least two of them; the
APC 9225 switches only have one power rail.  This means if you trip over
the switch's power cable, all cluster nodes powered by the APC unit die.
APC switches can also often be daisy chained together, but our agents do
not generally support the configuration.  (I think the CVS version does
on the most recent hardware, but probably not on the 9225...  Jim?)

Ex: single power supplies connected to typical APC devices:

              power1  power2
                |       |
    server A   APC1    APC2   server B
           +----1       1-----+

Ex: dual power supplies connected to typical APC devices:

              power1  
                |     
    server A   APC1           server B
    |      +----1             |      |
    |           2-------------+      |
    |                                |
    |                  APC2          |
    +-------------------1            |
                        2------------+
                        |
                      power2


The WTI IPS800 switches above have two power rails with 4 ports each.
The CE version is 208v; the non-CE is 110~120v, otherwise, they're the
same.  They cost more per unit than the 9225.  You can control dual
power supplies on separate rails, giving you NSPF as far as power-cords
are concerned.  If you pull one of the power sources, the power switch
is still accessible and fencing will still work, because the internal
electronics can run off of either power rail.  (I'm not sure what the
fault mode is if the electronics fail, though; I *think* it leaves the
ports in their current states; contact WTI if you have questions about
this).

Ex: dual power supplies connected to typical WTI devices:

              power1
                |
    server A   IPS   server B
           +----1    |
               ===   |
                5----+
                |
              power2

Ex: dual power supplies connected to typical WTI devices:

              power1
                |
    server A   IPS   server B
    |      +----1    |      |
    |           2----+      |
    |          ===          |
    +-----------5           |
                6-----------+
                |
              power2

>From a reliability standpoint, APC and WTI both make extremely reliable
devices.  I've never had any switch from either vendor go bonkers on me.

Not to state the opinion of any company I may or may not be employed by
or affiliated with, I personally generally prefer the WTI devices over
APC devices because of:

(a) Design - I *totally* dig the dual power rail configuration.  It has
higher power capacity per switch (30A, 15A per rail/4 ports), as well
two power sources (note: it's *two* rails; not a single, redundant rail,
even though the switch control electronics can run off of either rail)

(b) Firmware revisions on APC devices have broken fencing agents on more
than one occasion. (Though, this isn't so much a problem with the newer
APC SNMP fencing agents, but I don't like setting up SNMP...).

Jim Parsons (current fencing maintainer) has differing opinions on the
matter; I believe he prefers APC units over WTI units.

-- Lon


From lhh at redhat.com  Mon Oct 30 20:12:32 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 30 Oct 2006 15:12:32 -0500
Subject: [Linux-cluster] Samba share resource fails to start
	after	upgrade to Cluster Suite 4U4
In-Reply-To: <45427718.5030304@streppone.it>
References: <4538E16F.3060709@streppone.it>
	<1161974666.4518.208.camel@rei.boston.devel.redhat.com>
	<45427718.5030304@streppone.it>
Message-ID: <1162239152.4518.333.camel@rei.boston.devel.redhat.com>

On Fri, 2006-10-27 at 23:16 +0200, Cosimo Streppone wrote:

> Here it is, assuming you're talking about cluster.conf.

Yeah, that's it.

> Now smb share has been temporarily excluded from service because
> of these problems.
> Before that, I "attached" it as a "shared resource" to my service.


> 		<resources>
> 			<ip address="10.1.1.200" monitor_link="1"/>
> 			<fs device="/dev/mapper/mpath0p1" force_fsck="0" force_unmount="0" fsid="4275" fstype="ext2" mountpoint="/mnt/san" name="Slim2k disk" options="noatime,nodiratime" self_fence="1"/>
> 			<smb name="//share/exportdb" workgroup="WORKGROUP"/>

The share name should not contain any slashes.  With a valid name, the
<smb> resource agent generates a brain-dead example configuration for
use as something for the administrator to build on.

Ok, so, "exportdb" or something would be a better name.  Using that,
given the rest your service configuration, the agent will generate an
*example* configuration with two properties:

(1) An export called:

   //exportdb/Slim2k_disk

... which enables /mnt/san to be visible by others.

(2) That listens only on the 10.1.1.200 IP address.

Now, if you edit the configuration file after it is generated by the
cluster (which is fully expected; it's just a template), the <smb> agent
will *not* overwrite your changes.

(3) with workgroup as specified ("WORKGROUP" in your configuration)

If you already have a Samba configuration file that works for you, you
can use it by doing one of the following two things:

(a) Use the <script> facility:

> 			<script file="/etc/rc.d/init.d/smb" name="Samba"/>

^^ e.g. that one

(b) Or, you can rename the configuration file you wish to use
to /etc/samba/smb.conf.<name> and reference it in cluster.conf using:

   <smb name="<name>" />

(The "workgroup" parameter of the <smb> resource is ignored if the
configuration is not generated by the cluster, so whatever is in your
smb.conf.<name> will be used; same goes for netbios name.)

If you go this route, make sure you create appropriate "listen",
"netbios name", and "workgroup" samba properties.


Using both <smb> and <script ... "/etc/init.d/smb" /> probably won't
work ;)  so you should pick one or the other.

-- Lon


From lhh at redhat.com  Mon Oct 30 21:18:31 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 30 Oct 2006 16:18:31 -0500
Subject: [Linux-cluster] Samba share resource fails to start
	after	upgrade to Cluster Suite 4U4
In-Reply-To: <1162239152.4518.333.camel@rei.boston.devel.redhat.com>
References: <4538E16F.3060709@streppone.it>
	<1161974666.4518.208.camel@rei.boston.devel.redhat.com>
	<45427718.5030304@streppone.it>
	<1162239152.4518.333.camel@rei.boston.devel.redhat.com>
Message-ID: <1162243111.4518.336.camel@rei.boston.devel.redhat.com>

On Mon, 2006-10-30 at 15:12 -0500, Lon Hohberger wrote:

> If you go this route, make sure you create appropriate "listen",
> "netbios name", and "workgroup" samba properties.

Sorry, "bind interfaces", not "listen".

-- Lon


From irwan at magnifix.com.my  Tue Oct 31 06:16:26 2006
From: irwan at magnifix.com.my (Mohd Irwan Jamaluddin)
Date: Tue, 31 Oct 2006 14:16:26 +0800
Subject: [Linux-cluster] Network-attached power switches for RHCS
In-Reply-To: <1162232187.4518.312.camel@rei.boston.devel.redhat.com>
References: <1162174457.4330.22.camel@kuli.magnifix.com.my>
	<1162232187.4518.312.camel@rei.boston.devel.redhat.com>
Message-ID: <1162275386.23296.31.camel@kuli.magnifix.com.my>

On Mon, 2006-10-30 at 13:16 -0500, Lon Hohberger wrote:
> 1U APC switches are less expensive and have SNMP capabilities, but to
> feed NSPF power to a cluster, you would need at least two of them; the
> APC 9225 switches only have one power rail.  This means if you trip over
> the switch's power cable, all cluster nodes powered by the APC unit die.
> APC switches can also often be daisy chained together, but our agents do
> not generally support the configuration.  (I think the CVS version does
> on the most recent hardware, but probably not on the 9225...  Jim?)
> 
> Ex: single power supplies connected to typical APC devices:
> 
>               power1  power2
>                 |       |
>     server A   APC1    APC2   server B
>            +----1       1-----+
> 
> Ex: dual power supplies connected to typical APC devices:
> 
>               power1  
>                 |     
>     server A   APC1           server B
>     |      +----1             |      |
>     |           2-------------+      |
>     |                                |
>     |                  APC2          |
>     +-------------------1            |
>                         2------------+
>                         |
>                       power2
> 
> 
> 
> The WTI IPS800 switches above have two power rails with 4 ports each.
> The CE version is 208v; the non-CE is 110~120v, otherwise, they're the
> same.  They cost more per unit than the 9225.  You can control dual
> power supplies on separate rails, giving you NSPF as far as power-cords
> are concerned.  If you pull one of the power sources, the power switch
> is still accessible and fencing will still work, because the internal
> electronics can run off of either power rail.  (I'm not sure what the
> fault mode is if the electronics fail, though; I *think* it leaves the
> ports in their current states; contact WTI if you have questions about
> this).
> 
> Ex: dual power supplies connected to typical WTI devices:
> 
>               power1
>                 |
>     server A   IPS   server B
>            +----1    |
>                ===   |
>                 5----+
>                 |
>               power2
> 
> Ex: dual power supplies connected to typical WTI devices:
> 
>               power1
>                 |
>     server A   IPS   server B
>     |      +----1    |      |
>     |           2----+      |
>     |          ===          |
>     +-----------5           |
>                 6-----------+
>                 |
>               power2
> 
> >From a reliability standpoint, APC and WTI both make extremely reliable
> devices.  I've never had any switch from either vendor go bonkers on me.
> 
> Not to state the opinion of any company I may or may not be employed by
> or affiliated with, I personally generally prefer the WTI devices over
> APC devices because of:
> 
> (a) Design - I *totally* dig the dual power rail configuration.  It has
> higher power capacity per switch (30A, 15A per rail/4 ports), as well
> two power sources (note: it's *two* rails; not a single, redundant rail,
> even though the switch control electronics can run off of either rail)
> 
> (b) Firmware revisions on APC devices have broken fencing agents on more
> than one occasion. (Though, this isn't so much a problem with the newer
> APC SNMP fencing agents, but I don't like setting up SNMP...).
> 
> Jim Parsons (current fencing maintainer) has differing opinions on the
> matter; I believe he prefers APC units over WTI units.
> 
> -- Lon

Thanks Lon for all the superb informations.

Due to support availability, I prefer APC over WTI. It's just that APC
has office in Malaysia and provide support locally.

Now the problem is; I'm not be able to use the APC9225 model due to
output voltage issue. APC9225 uses 120V while we in Malaysia use
220-240V/50Hz. 

APC suggest me to use another model, APC7921[1]. Nevertheless APC7921 is
not included in Red Hat documentation as a preferred (supported???)
model. So, any idea on this issue? Is Red Hat going to support me even
though I'm using unrecommended model?

I have 2 servers to be clustered with a single service. Is it ok to use
1 network power switch, or is it recommended 2 power switches (1 power
switch per server)?

[1] http://www.apc.com/resource/include/techspec_index.cfm?
base_sku=AP7921
[2] http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/s1-
hardware-clustertable.html#TBL-HARDWARE-FENCEDEVS 

-- 
Regards,
+--------------------------------+
|       Mohd Irwan Jamaluddin    |
| ##    System Engineer,         |
| (o_   Magnifix Sdn. Bhd.       |
| //\   Tel: +603 42705073       |
| V_/_  Fax: +603 42701960       |
|       http://www.magnifix.com/ |      
+--------------------------------+
| "Every successful side needs   |              
| unsung heroes" - fcbayern.de   |      
+--------------------------------+


From riaan at obsidian.co.za  Tue Oct 31 10:58:47 2006
From: riaan at obsidian.co.za (Riaan van Niekerk)
Date: Tue, 31 Oct 2006 12:58:47 +0200
Subject: [Linux-cluster] qdisk and multipathing problems
Message-ID: <45472C67.8010604@obsidian.co.za>

I am trying to get qdisk going in a 2-node RHCS 4.4 cluster with EMC 
PowerPath

I was able to format the device:
mkqdisk -c /dev/emcpowera1 -l quorum_disk_1

this command shows multiple copies (4 paths, native devices, and one EMC 
pseudo device)

[jabbah:~]# mkqdisk -L
mkqdisk v0.5
/dev/sdc1:
         Magic:   eb7a62c2
         Label:   quorum_disk_1
         Created: Mon Oct 30 16:15:57 2006
         Host:    sargas.telkomsa.net

/dev/sdd1:
         Magic:   eb7a62c2
         Label:   quorum_disk_1
         Created: Mon Oct 30 16:15:57 2006
         Host:    sargas.telkomsa.net

/dev/sdg1:
         Magic:   eb7a62c2
         Label:   quorum_disk_1
         Created: Mon Oct 30 16:15:57 2006
         Host:    sargas.telkomsa.net

/dev/sdh1:
         Magic:   eb7a62c2
         Label:   quorum_disk_1
         Created: Mon Oct 30 16:15:57 2006
         Host:    sargas.telkomsa.net

/dev/emcpowera1:
         Magic:   eb7a62c2
         Label:   quorum_disk_1
         Created: Mon Oct 30 16:15:57 2006
         Host:    sargas.telkomsa.net


Here is my stanza from cluster.conf

<quorumd interval="1" tko="10" votes="2" device="/dev/emcpowera1" 
label="quorum_disk_1">
  <heuristic program="ping 192.168.222.254 -c1 -t1" score="2" interval="2"/>
</quorumd>

I tell it to use /dev/emcpowera1 but it insists on using the first path, 
  /dev/sdc1

[jabbah:~]# cat /proc/cluster/nodes
Node  Votes Exp Sts  Name
    0    2    0   M   /dev/sdc1
    1    1    2   M   jabbah
    2    1    2   M   sargas

Is there any way to blacklist /dev/sd* devices from being scanned 
(similar to what is done for LVM)? or force qdisk to use /dev/emcpowera1?

using the native device is bad for obvious reasons (if you lose the 
path, you lose the node or even the whole quorum disk, depending on 
where the path failure is)

qdisk seems somewhat unreliable with my setup, with intermittent 
messages like this:

Oct 31 10:22:57 sargas qdiskd[18712]: <crit> Critical Error: More than 
one master found!
Oct 31 10:22:58 sargas qdiskd[11964]: <crit> A master exists, but it's 
not me?!

Also, "mkqdisk -f" does not "Find the cluster quorum disk with the given 
label" but instead just displays the version number

[jabbah:~]# mkqdisk -f  quorum_disk_1
mkqdisk v0.5

is this a bug? I logged it as 
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=213218

greetings
Riaan
-------------- next part --------------
A non-text attachment was scrubbed...
Name: riaan.vcf
Type: text/x-vcard
Size: 310 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061031/b843ee4d/attachment.vcf>

From jos at xos.nl  Tue Oct 31 11:04:17 2006
From: jos at xos.nl (Jos Vos)
Date: Tue, 31 Oct 2006 12:04:17 +0100
Subject: [Linux-cluster] qdisk and multipathing problems
In-Reply-To: <45472C67.8010604@obsidian.co.za>;
	from riaan@obsidian.co.za on Tue, Oct 31, 2006 at 12:58:47PM +0200
References: <45472C67.8010604@obsidian.co.za>
Message-ID: <20061031120417.A18887@xos037.xos.nl>

On Tue, Oct 31, 2006 at 12:58:47PM +0200, Riaan van Niekerk wrote:

> <quorumd interval="1" tko="10" votes="2" device="/dev/emcpowera1" 
> label="quorum_disk_1">
>   <heuristic program="ping 192.168.222.254 -c1 -t1" score="2" interval="2"/>
> </quorumd>
> 
> I tell it to use /dev/emcpowera1 but it insists on using the first path, 
>   /dev/sdc1
> 
> [...]
> 
> Is there any way to blacklist /dev/sd* devices from being scanned 
> (similar to what is done for LVM)? or force qdisk to use /dev/emcpowera1?

I think you should use device=... *or* label=..., not both.  So with
multipath problems, you need to use only device=...

Unfortunately, behavior of qdiskd and mount differs a lot in the way
they handle device scanning to search for labels :-(.  Qdiskd just
sequentially scans *everything* in /proc/partitions, while mount does
honour /etc/fstab.order, but on its turn only scans devices ending on
a digit (partitions), which can be quite annoying.  A bit of a mess,
IMHO, and the behaviour of both mount and qdiskd can be improved for
more consistency and flexibility.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From lhh at redhat.com  Tue Oct 31 14:36:15 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 31 Oct 2006 09:36:15 -0500
Subject: [Linux-cluster] qdisk and multipathing problems
In-Reply-To: <45472C67.8010604@obsidian.co.za>
References: <45472C67.8010604@obsidian.co.za>
Message-ID: <1162305375.4518.348.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-31 at 12:58 +0200, Riaan van Niekerk wrote:

> I tell it to use /dev/emcpowera1 but it insists on using the first path, 
>   /dev/sdc1

Don't specify the label and it will use the device.

> is this a bug? I logged it as 
> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=213218

Sort of, but not really.  It should work with multipath schemes where
the original devices are hidden from userland ;)

-- Lon


From lhh at redhat.com  Tue Oct 31 15:07:20 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 31 Oct 2006 10:07:20 -0500
Subject: [Linux-cluster] qdisk and multipathing problems
In-Reply-To: <20061031120417.A18887@xos037.xos.nl>
References: <45472C67.8010604@obsidian.co.za>
	<20061031120417.A18887@xos037.xos.nl>
Message-ID: <1162307240.4518.357.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-31 at 12:04 +0100, Jos Vos wrote:

> A bit of a mess,
> IMHO, and the behaviour of both mount and qdiskd can be improved for
> more consistency and flexibility.

True, it could be better.  The label= field was added as an afterthought
for people using SCSI devices which might have different numbers on
different nodes.

/me wonders how 'findfs' works in this case...

-- Lon


From isplist at logicore.net  Tue Oct 31 19:53:29 2006
From: isplist at logicore.net (isplist at logicore.net)
Date: Tue, 31 Oct 2006 13:53:29 -0600
Subject: [Linux-cluster] Piranha/LVS/Load balancing
Message-ID: <20061031135329.292530@leena>

Anyone know of a CLEAR and easy to understand document on setting up a Piranha 
LVS load balancing setup? I've read everything I can find and since mine won't 
work, I'm just making it worse now.

I need to put a load balancer in front of the cluster.

Mike


From lhh at redhat.com  Tue Oct 31 20:47:44 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 31 Oct 2006 15:47:44 -0500
Subject: [Linux-cluster] Network-attached power switches for RHCS
In-Reply-To: <1162275386.23296.31.camel@kuli.magnifix.com.my>
References: <1162174457.4330.22.camel@kuli.magnifix.com.my>
	<1162232187.4518.312.camel@rei.boston.devel.redhat.com>
	<1162275386.23296.31.camel@kuli.magnifix.com.my>
Message-ID: <1162327664.4518.361.camel@rei.boston.devel.redhat.com>

On Tue, 2006-10-31 at 14:16 +0800, Mohd Irwan Jamaluddin wrote:

> Due to support availability, I prefer APC over WTI. It's just that APC
> has office in Malaysia and provide support locally.
> 
> Now the problem is; I'm not be able to use the APC9225 model due to
> output voltage issue. APC9225 uses 120V while we in Malaysia use
> 220-240V/50Hz. 
> 
> APC suggest me to use another model, APC7921[1]. Nevertheless APC7921 is
> not included in Red Hat documentation as a preferred (supported???)
> model. So, any idea on this issue? Is Red Hat going to support me even
> though I'm using unrecommended model?

I'm pretty sure the 7921 works with fence_apc on linux-cluster and/or
RHCS4.  If it doesn't, it's a bug.

-- Lon


From g.aulisi at mclink.it  Thu Oct 12 06:13:19 2006
From: g.aulisi at mclink.it (Guido Aulisi)
Date: Thu, 12 Oct 2006 06:13:19 -0000
Subject: [Linux-cluster] Mount hangs mounting gfs file sytem from second node
Message-ID: <1.3.200610120812.24989@mclink.it>

Hi,
I'm using kernel 2.6.17.13 and the latest cvs stable cluster suite.
I can't mount a gfs file system from the second node because mount hangs (state D).

/proc/cluster/services is:
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           2   2 run       -
[1 2]

DLM Lock Space:  "clvmd"                             2   3 run       -
[1 2]

DLM Lock Space:  "oraclefs"                          2   4 join      S-4,4,1
[1 2]

Now the firstnode can't unmount the file system.
can version is 5.0.1
I'm using LVM2 with clvmd

Thanks in advance for helping me
Guido Aulisi


From raf at raf.org  Thu Oct 12 13:09:47 2006
From: raf at raf.org (raf)
Date: Thu, 12 Oct 2006 13:09:47 -0000
Subject: [Linux-cluster] Re: Centralized Cron
Message-ID: <20061012130943.GA29648@raf.org>

nick burnett wrote:
> saju john write:
> > Dear All,
> > 
> > Is there any way to make a centalized cron while using
> > Redhat HA cluster with Sahred storage. I mean to put the
> > crontab entry for a particular user on shared storage, so
> > that when the cluster shifts, on the other node cron should
> > read from the cron file in shared storage.
> > 
> > This setup has the advantage that we don't need to manullay
> > update the cron entry in both nodes.
> > 
> > I tried two ways , but not success
> > 
> > a) Make a soft link from /var/spool/cron/<user> to
> > /path/to/shared/storage/<user>. This will work as long as I
> > didn't make any changes to existing crontab. Once I make
> > changes to crontab, the link is removed and file is created
> > at /var/spool/cron/<user>
> > 
> > b) Soft link the cron directory in /var/spool to
> > /path/to/shared/storage/cron. This is working till the
> > cluster shift. The cron is getting dead when the cluster
> > shifts as it lose the /var/spool/cron link's destination
> > driectory which will be mapped to the other node
> > 
> > Thanks in advance
> > 
> > Saju John
> > 
> You need to add in a heartbeat trigger. The cron daemon runs
> on one server only. When that server goes offline, then
> start the cron daemon on the backup server. This is a
> terrible solution though.

i just noticed the above on the net with no solution proposed.
in case you still one:

the elegant solution to this is to write a little shell
script that takes another command as its arguments, detects
whether or not it is running on the primary server, and if
it is, runs the given command. otherwise it does nothing.
then, run cron on both servers with completely identical
crontabs with the name of this script inserted before every
cronjob.

i've done this on a solaris cluster (the script is called
"clustered" naturally enough) and it works a treat. you
can even install the crontabs together from another host
to keep them always in sync.

i don't know what command you'd need on a linux cluster
to determine whether or not you are running on the primary
host but there must be something that does it.

cheers,
raf


From matteo.catanese at fastwebnet.it  Fri Oct 13 13:21:05 2006
From: matteo.catanese at fastwebnet.it (Matteo Catanese)
Date: Fri, 13 Oct 2006 13:21:05 -0000
Subject: [Linux-cluster] Unable to connect to cluster infrastructure
	-	cluster died
Message-ID: <3BD50E83-8599-4599-B5F1-6197EEF633BC@fastwebnet.it>

I just booted with the old kernel and things went ok again :-)


From mparadis at logicore.net  Fri Oct 27 18:51:20 2006
From: mparadis at logicore.net (Mike Paradis)
Date: Fri, 27 Oct 2006 13:51:20 -0500
Subject: [Linux-cluster] Load balancer
Message-ID: <20061027135120.863609@leena>

Just wondering what others might be using for load balancers? I have radware 
devices but I have a hankering to use a redhat based solution.

Everything is set up using GFS/cluster. I'll be messing with sharedroot 
systems in the near future.

I have three areas I need to balance.
1: A web server cluster
2: A mail cluster
3: MySQL cluster will also be included soon.

Any suggestions on something that's relatively easy to set up and get going 
on? 

Mike