From claudio.tassini at gmail.com  Sat Sep  1 00:57:48 2007
From: claudio.tassini at gmail.com (Claudio Tassini)
Date: Sat, 1 Sep 2007 02:57:48 +0200
Subject: [Linux-cluster] Multipathed quorum disk
Message-ID: <39fdf1c70708311757h75a57fc3r15b740ed8ad0f58b@mail.gmail.com>

Hi,
I recently upgraded a 2-nodes cluster adding two more nodes. I would like a
single node to remain in cluster even if the other three are out of service,
so I'm trying to add a quorum disk to the cluster.

The problem is that the quorum disk is a LUN in a shared storage which has
not the same device name through all the cluster nodes. Moreover, we use
device-mapper AND lvm. I could resolve the problem using an lvm logical
volume, because it would always have the same name and recognize the
underlying "dm" or "sd" device name even if it changes across a reboot, but
I've read that it's not advisable to use a logical volume as quorum device.

Any idea?

-- 
Claudio Tassini
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070901/aa42e09f/attachment.htm>

From ianbrn at gmail.com  Sat Sep  1 10:50:27 2007
From: ianbrn at gmail.com (Ian Brown)
Date: Sat, 1 Sep 2007 13:50:27 +0300
Subject: [Linux-cluster] GFS and GFS2 : two questions: which is bettter;
	gfs_controld error
Message-ID: <d0383f90709010350g2fc01115jd592867c11bf1546@mail.gmail.com>

  - Hello,
   I had installed RHEL5 on two x86_64 machine on the same LAN; afterwards I
   had installed the RHEL5 cluster suite packege (cman-2.0.60-1.el5) and
   openais-0.80.2-1.el5.


   I had also installed kmod-gfs-0.1.16-5.2.6.18_8.el5 and gfs-utils
and gfs2-utils.

   I had crated a 2-node cluster and started the cman service OK on both nodes.

   Now I tried to create a gfs partition with gfs_mkfs (with -p lock_dlm)
   and mount it, and I got errors when trying to mount it (this errors
talk about
   gfs_controld).

   I made a second try with mkfs.gfs2 (also with -p lock_dlm) );
   this time I **could** mounted the gfs2 partition succesfully.

   My questions are:

     - should I be able with this installation to create and mount a gfs
     partition ? in case this is possible - what can be my mistale ?

     - is gfs2 considered safe to work with ? or is it still experimental and
     not recommended ? which features do I have in GFS2 which I don't have in
     GFS?

     Regards,
     Ian


From wcheng at redhat.com  Sat Sep  1 17:38:19 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Sat, 01 Sep 2007 13:38:19 -0400
Subject: [Linux-cluster] GFS and GFS2 : two questions: which is bettter;
	gfs_controld error
In-Reply-To: <d0383f90709010350g2fc01115jd592867c11bf1546@mail.gmail.com>
References: <d0383f90709010350g2fc01115jd592867c11bf1546@mail.gmail.com>
Message-ID: <46D9A38B.50304@redhat.com>

Ian Brown wrote:

>  - Hello,
>   I had installed RHEL5 on two x86_64 machine on the same LAN; afterwards I
>   had installed the RHEL5 cluster suite packege (cman-2.0.60-1.el5) and
>   openais-0.80.2-1.el5.
>
>
>   I had also installed kmod-gfs-0.1.16-5.2.6.18_8.el5 and gfs-utils
>and gfs2-utils.
>
>   I had crated a 2-node cluster and started the cman service OK on both nodes.
>
>   Now I tried to create a gfs partition with gfs_mkfs (with -p lock_dlm)
>   and mount it, and I got errors when trying to mount it (this errors
>talk about
>   gfs_controld).
>  
>
You didn't include the error message here ? This could be a known issue 
where gfs kernel module is not loaded by default (due to a RPM 
dependency problem). To check it out: before mounting the gfs partition ...

1) shell> lsmod
This is to check whether gfs (not gfs2) kernel module is loaded. If yes, 
mount the gfs partition, then read the /var/log/messages file and 
cut-and-paste the print-out (a.k.a the gfs_controld error messages) and 
repost here.

2) shell> cd /lib/modules/'your kernel version'/ extra/gfs
Check if gfs.ko is there. If not, you have installation problems.

3)  shell> insmod gfs.ko
This is to manually load gfs kernel module

4) Retry the mount. If still failing, send us the /var/log/messages file.

>   I made a second try with mkfs.gfs2 (also with -p lock_dlm) );
>   this time I **could** mounted the gfs2 partition succesfully.
>  
>

GFS2 is part of the base kernel, so it doesn't need to worry about RPM 
dependency.

>   My questions are:
>
>     - should I be able with this installation to create and mount a gfs
>     partition ? in case this is possible - what can be my mistale ?
>  
>

See above.

>     - is gfs2 considered safe to work with ? or is it still experimental and
>     not recommended ? which features do I have in GFS2 which I don't have in
>     GFS?
>
>    
>
The advantage of GFS2 are (my personal opinion - not necessarily Red 
Hat's) :
1. It is mainstream and will be well maintained and updated; vs. GFS 
starts to enter maintanence mode. We're hoping to phase out GFS as soon 
as GFS2 is proved to be stable.
2.  It preforms better (faster), particularly for smaller file size, but 
not as stable as GFS.

However, there are tools to facilitate people to migrate from GFS to 
GFS2. So if you want stability, GFS is not a bad choice at this moment.

-- Wendy


From wcheng at redhat.com  Sat Sep  1 17:42:28 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Sat, 01 Sep 2007 13:42:28 -0400
Subject: [Linux-cluster] GFS and GFS2 : two questions: which is bettter;
	gfs_controld error
In-Reply-To: <46D9A38B.50304@redhat.com>
References: <d0383f90709010350g2fc01115jd592867c11bf1546@mail.gmail.com>
	<46D9A38B.50304@redhat.com>
Message-ID: <46D9A484.5050908@redhat.com>


> 2) shell> cd /lib/modules/'your kernel version'/ extra/gfs

Hope you catch I have a (wrong) white space before the "extra" in above 
sentence.

> 3)  shell> insmod gfs.ko
> This is to manually load gfs kernel module

should be "insmod ./gfs.ko"

-- Wendy


From kadlec at sunserv.kfki.hu  Sat Sep  1 19:42:33 2007
From: kadlec at sunserv.kfki.hu (Kadlecsik Jozsi)
Date: Sat, 1 Sep 2007 21:42:33 +0200 (MEST)
Subject: [Linux-cluster] quorum lost in spite of 'leave remove'
In-Reply-To: <Pine.GSO.4.64.0708312146500.12565@sunserv.kfki.hu>
References: <Pine.GSO.4.64.0708311542080.16749@sunserv.kfki.hu>
	<46D828EE.5070103@redhat.com>
	<Pine.GSO.4.64.0708312035250.6149@sunserv.kfki.hu>
	<Pine.GSO.4.64.0708312146500.12565@sunserv.kfki.hu>
Message-ID: <Pine.GSO.4.64.0709012141060.10820@sunserv.kfki.hu>

On Fri, 31 Aug 2007, Kadlecsik Jozsi wrote:

> On Fri, 31 Aug 2007, Kadlecsik Jozsi wrote:
> 
> > > > '/etc/init.d/cman stop' was issued and executed successfully on the tree 
> > > > other nodes.
> > > 
> > > It looks like a bug to me. Congratulations - you're the first person to spot that!

I collected some debug logs on a node when at another one the command 
'cman stop' was issued and attached it. I hope it helps.

Best regards,
Jozsef
--
E-mail : kadlec at sunserv.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary
-------------- next part --------------
waiting for aisexec to start
waiting for aisexec to start
[MAIN ] AIS Executive Service RELEASE 'subrev 1358 version 0.80.3'
[MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
[MAIN ] Copyright (C) 2006 Red Hat, Inc.
[MAIN ] AIS Executive Service: started and ready to provide service.
[MAIN ] Using override node name lxserv0-gfs
[MAIN ] openais component openais_cpg loaded.
[MAIN ] Registering service handler 'openais cluster closed process group service v1.01'
[MAIN ] openais component openais_cfg loaded.
[MAIN ] Registering service handler 'openais configuration service'
[MAIN ] openais component openais_msg loaded.
[MAIN ] Registering service handler 'openais message service B.01.01'
[MAIN ] openais component openais_lck loaded.
[MAIN ] Registering service handler 'openais distributed locking service B.01.01'
[MAIN ] openais component openais_evt loaded.
[MAIN ] Registering service handler 'openais event service B.01.01'
[MAIN ] openais component openais_ckpt loaded.
[MAIN ] Registering service handler 'openais checkpoint service B.01.01'
[MAIN ] openais component openais_amf loaded.
[MAIN ] Registering service handler 'openais availability management framework B.01.01'
[MAIN ] openais component openais_clm loaded.
[MAIN ] Registering service handler 'openais cluster membership service B.01.01'
[MAIN ] openais component openais_evs loaded.
[MAIN ] Registering service handler 'openais extended virtual synchrony service'
[MAIN ] openais component openais_cman loaded.
[MAIN ] Registering service handler 'openais CMAN membership service 2.01'
[TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms)
[TOTEM] token hold (386 ms) retransmits before loss (20 retrans)
[TOTEM] join (60 ms) send_join (0 ms) consensus (4800 ms) merge (200 ms)
[TOTEM] downcheck (1000 ms) fail to recv const (50 msgs)
[TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1500
[TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages)
[TOTEM] send threads (0 threads)
[TOTEM] RRP token expired timeout (495 ms)
[TOTEM] RRP token problem counter (2000 ms)
[TOTEM] RRP threshold (10 problem count)
[TOTEM] RRP mode set to none.
[TOTEM] heartbeat_failures_allowed (0)
[TOTEM] max_network_delay (50 ms)
[TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
[TOTEM] Receive multicast socket recv buffer size (288000 bytes).
[TOTEM] Transmit multicast socket send buffer size (262142 bytes).
[TOTEM] The network interface [192.168.192.15] is now up.
[TOTEM] Created or loaded sequence id 944.192.168.192.15 for this ring.
[TOTEM] entering GATHER state from 15.
[SERV ] Initialising service handler 'openais extended virtual synchrony service'
[SERV ] Initialising service handler 'openais cluster membership service B.01.01'
[SERV ] Initialising service handler 'openais availability management framework B.01.01'
[SERV ] Initialising service handler 'openais checkpoint service B.01.01'
[SERV ] Initialising service handler 'openais event service B.01.01'
[SERV ] Initialising service handler 'openais distributed locking service B.01.01'
[SERV ] Initialising service handler 'openais message service B.01.01'
[SERV ] Initialising service handler 'openais configuration service'
[SERV ] Initialising service handler 'openais cluster closed process group service v1.01'
[SERV ] Initialising service handler 'openais CMAN membership service 2.01'
[CMAN ] CMAN 2.01.00 (built Aug 23 2007 12:19:58) started
[SYNC ] Not using a virtual synchrony filter.
[TOTEM] Creating commit token because I am the rep.
[TOTEM] Saving state aru 0 high seq received 0
[TOTEM] entering COMMIT state.
[TOTEM] entering RECOVERY state.
[TOTEM] position [0] member 192.168.192.15:
[TOTEM] previous ring seq 944 rep 192.168.192.15
[TOTEM] aru 0 high delivered 0 received flag 0
[TOTEM] Did not need to originate any messages in recovery.
[TOTEM] Storing new sequence id for ring 3b4
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is b
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is b
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 4000000b to fd 10
[TOTEM] Sending initial ORF token
[logging.c:0090] daemon: read 0 bytes from fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 7
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 7
[logging.c:0090] memb: command return code is -2
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000007 to fd 10
[CLM  ] CLM CONFIGURATION CHANGE
[CLM  ] New Configuration:
[CLM  ] Members Left:
[CLM  ] Members Joined:
[logging.c:0090] ais: confchg_fn called type = 1, seq=948
[SYNC ] This node is within the primary component and will provide service.
[CLM  ] CLM CONFIGURATION CHANGE
[CLM  ] New Configuration:
[CLM  ] 	r(0) ip(192.168.192.15) 
[CLM  ] Members Left:
[CLM  ] Members Joined:
[CLM  ] 	r(0) ip(192.168.192.15) 
[logging.c:0090] ais: confchg_fn called type = 0, seq=948
[logging.c:0090] ais: last memb_count = 0, current = 1
[logging.c:0090] memb: sending TRANSITION message. cluster_name = kfki
[logging.c:0090] ais: comms send message 0xbfadf2dc len = 65
[logging.c:0090] daemon: sending reply 103 to fd 10
[SYNC ] This node is within the primary component and will provide service.
[TOTEM] entering OPERATIONAL state.
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 1, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 1
[logging.c:0090] memb: add_ais_node ID=1, incarnation = 948
[CLM  ] got nodejoin message 192.168.192.15
[TOTEM] entering GATHER state from 11.
[TOTEM] Saving state aru 9 high seq received 9
[TOTEM] entering COMMIT state.
[TOTEM] entering RECOVERY state.
[TOTEM] position [0] member 192.168.192.6:
[TOTEM] previous ring seq 952 rep 192.168.192.6
[TOTEM] aru 2d high delivered 2d received flag 0
[TOTEM] position [1] member 192.168.192.7:
[TOTEM] previous ring seq 952 rep 192.168.192.6
[TOTEM] aru 2d high delivered 2d received flag 0
[TOTEM] position [2] member 192.168.192.15:
[TOTEM] previous ring seq 948 rep 192.168.192.15
[TOTEM] aru 9 high delivered 9 received flag 0
[TOTEM] position [3] member 192.168.192.17:
[TOTEM] previous ring seq 952 rep 192.168.192.6
[TOTEM] aru 2d high delivered 2d received flag 0
[TOTEM] position [4] member 192.168.192.18:
[TOTEM] previous ring seq 952 rep 192.168.192.6
[TOTEM] aru 2d high delivered 2d received flag 0
[TOTEM] Did not need to originate any messages in recovery.
[TOTEM] Storing new sequence id for ring 3bc
[CLM  ] CLM CONFIGURATION CHANGE
[CLM  ] New Configuration:
[CLM  ] 	r(0) ip(192.168.192.15) 
[CLM  ] Members Left:
[CLM  ] Members Joined:
[logging.c:0090] ais: confchg_fn called type = 1, seq=956
[SYNC ] This node is within the primary component and will provide service.
[CLM  ] CLM CONFIGURATION CHANGE
[CLM  ] New Configuration:
[CLM  ] 	r(0) ip(192.168.192.6) 
[CLM  ] 	r(0) ip(192.168.192.7) 
[CLM  ] 	r(0) ip(192.168.192.15) 
[CLM  ] 	r(0) ip(192.168.192.17) 
[CLM  ] 	r(0) ip(192.168.192.18) 
[CLM  ] Members Left:
[CLM  ] Members Joined:
[CLM  ] 	r(0) ip(192.168.192.6) 
[CLM  ] 	r(0) ip(192.168.192.7) 
[CLM  ] 	r(0) ip(192.168.192.17) 
[CLM  ] 	r(0) ip(192.168.192.18) 
[logging.c:0090] ais: confchg_fn called type = 0, seq=956
[logging.c:0090] ais: last memb_count = 1, current = 5
[logging.c:0090] memb: sending TRANSITION message. cluster_name = kfki
[logging.c:0090] ais: comms send message 0xbfadf2dc len = 65
[logging.c:0090] daemon: sending reply 103 to fd 10
[SYNC ] This node is within the primary component and will provide service.
[TOTEM] entering OPERATIONAL state.
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 2, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 2
[logging.c:0090] memb: add_ais_node ID=2, incarnation = 956
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 1, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 1
[logging.c:0090] memb: add_ais_node ID=1, incarnation = 956
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 3, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 3
[logging.c:0090] memb: add_ais_node ID=3, incarnation = 956
[CMAN ] quorum regained, resuming activity
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 5, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 5
[logging.c:0090] memb: add_ais_node ID=5, incarnation = 956
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 4, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 4
[logging.c:0090] memb: add_ais_node ID=4, incarnation = 956
[CLM  ] got nodejoin message 192.168.192.6
[CLM  ] got nodejoin message 192.168.192.7
[CLM  ] got nodejoin message 192.168.192.15
[CLM  ] got nodejoin message 192.168.192.17
[CLM  ] got nodejoin message 192.168.192.18
[CPG  ] got joinlist message from node 2
[CPG  ] got joinlist message from node 3
[CPG  ] got joinlist message from node 5
[CPG  ] got joinlist message from node 4
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 1
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 1
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000001 to fd 11
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 5
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 5
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000005 to fd 11
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 7
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 7
[logging.c:0090] memb: get_all_members: allocated new buffer (retsize=1024)
[logging.c:0090] memb: get_all_members: retlen = 2120
[logging.c:0090] memb: command return code is 5
[logging.c:0090] daemon: Returning command data. length = 2120
[logging.c:0090] daemon: sending reply 40000007 to fd 11
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 7
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 7
[logging.c:0090] memb: get_all_members: allocated new buffer (retsize=1024)
[logging.c:0090] memb: get_all_members: retlen = 2120
[logging.c:0090] memb: command return code is 5
[logging.c:0090] daemon: Returning command data. length = 2120
[logging.c:0090] daemon: sending reply 40000007 to fd 11
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 7
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 7
[logging.c:0090] memb: get_all_members: allocated new buffer (retsize=1024)
[logging.c:0090] memb: get_all_members: retlen = 2120
[logging.c:0090] memb: command return code is 5
[logging.c:0090] daemon: Returning command data. length = 2120
[logging.c:0090] daemon: sending reply 40000007 to fd 10
[logging.c:0090] daemon: read 0 bytes from fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 91
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 91
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 24
[logging.c:0090] daemon: sending reply 40000091 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 9
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 9
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 16
[logging.c:0090] daemon: sending reply 40000009 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 92
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 92
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 320
[logging.c:0090] daemon: sending reply 40000092 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 5
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 5
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000005 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is d
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is d
[logging.c:0090] memb: command return code is 2
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 4000000d to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 90
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 90
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 424
[logging.c:0090] daemon: sending reply 40000090 to fd 10
[logging.c:0090] daemon: read 0 bytes from fd 10
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 20, source nodeid = 2, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 7 (len = 4)
[logging.c:0090] memb: got LEAVE from node 2, reason = 3
[TOTEM] The token was lost in the OPERATIONAL state.
[TOTEM] Receive multicast socket recv buffer size (288000 bytes).
[TOTEM] Transmit multicast socket send buffer size (262142 bytes).
[TOTEM] entering GATHER state from 2.
[TOTEM] entering GATHER state from 11.
[TOTEM] Saving state aru 43 high seq received 43
[TOTEM] entering COMMIT state.
[TOTEM] entering RECOVERY state.
[TOTEM] position [0] member 192.168.192.6:
[TOTEM] previous ring seq 956 rep 192.168.192.6
[TOTEM] aru 43 high delivered 43 received flag 0
[TOTEM] position [1] member 192.168.192.15:
[TOTEM] previous ring seq 956 rep 192.168.192.6
[TOTEM] aru 43 high delivered 43 received flag 0
[TOTEM] position [2] member 192.168.192.17:
[TOTEM] previous ring seq 956 rep 192.168.192.6
[TOTEM] aru 43 high delivered 43 received flag 0
[TOTEM] position [3] member 192.168.192.18:
[TOTEM] previous ring seq 956 rep 192.168.192.6
[TOTEM] aru 43 high delivered 43 received flag 0
[TOTEM] Did not need to originate any messages in recovery.
[TOTEM] Storing new sequence id for ring 3c4
[CLM  ] CLM CONFIGURATION CHANGE
[CLM  ] New Configuration:
[CLM  ] 	r(0) ip(192.168.192.6) 
[CLM  ] 	r(0) ip(192.168.192.15) 
[CLM  ] 	r(0) ip(192.168.192.17) 
[CLM  ] 	r(0) ip(192.168.192.18) 
[CLM  ] Members Left:
[CLM  ] 	r(0) ip(192.168.192.7) 
[CLM  ] Members Joined:
[logging.c:0090] ais: confchg_fn called type = 1, seq=964
[logging.c:0090] memb: del_ais_node 2
[logging.c:0090] daemon: sending reply 102 to fd 11
[SYNC ] This node is within the primary component and will provide service.
[CLM  ] CLM CONFIGURATION CHANGE
[CLM  ] New Configuration:
[CLM  ] 	r(0) ip(192.168.192.6) 
[CLM  ] 	r(0) ip(192.168.192.15) 
[CLM  ] 	r(0) ip(192.168.192.17) 
[CLM  ] 	r(0) ip(192.168.192.18) 
[CLM  ] Members Left:
[CLM  ] Members Joined:
[logging.c:0090] ais: confchg_fn called type = 0, seq=964
[logging.c:0090] ais: last memb_count = 5, current = 4
[logging.c:0090] memb: sending TRANSITION message. cluster_name = kfki
[logging.c:0090] ais: comms send message 0xbfadf2dc len = 65
[SYNC ] This node is within the primary component and will provide service.
[TOTEM] entering OPERATIONAL state.
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 5
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 5
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000005 to fd 11
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 7
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 7
[logging.c:0090] memb: get_all_members: allocated new buffer (retsize=1024)
[logging.c:0090] memb: get_all_members: retlen = 2120
[logging.c:0090] memb: command return code is 5
[logging.c:0090] daemon: Returning command data. length = 2120
[logging.c:0090] daemon: sending reply 40000007 to fd 11
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is 7
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 7
[logging.c:0090] memb: get_all_members: allocated new buffer (retsize=1024)
[logging.c:0090] memb: get_all_members: retlen = 2120
[logging.c:0090] memb: command return code is 5
[logging.c:0090] daemon: Returning command data. length = 2120
[logging.c:0090] daemon: sending reply 40000007 to fd 11
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 1, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 1
[logging.c:0090] memb: add_ais_node ID=1, incarnation = 964
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 3, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 3
[logging.c:0090] memb: add_ais_node ID=3, incarnation = 964
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 5, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 5
[logging.c:0090] memb: add_ais_node ID=5, incarnation = 964
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 81, source nodeid = 4, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 5 (len = 65)
[logging.c:0090] memb: got TRANSITION from node 4
[logging.c:0090] memb: add_ais_node ID=4, incarnation = 964
[CLM  ] got nodejoin message 192.168.192.6
[CLM  ] got nodejoin message 192.168.192.15
[CLM  ] got nodejoin message 192.168.192.17
[CLM  ] got nodejoin message 192.168.192.18
[CPG  ] got joinlist message from node 3
[CPG  ] got joinlist message from node 5
[CPG  ] got joinlist message from node 4
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 91
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 91
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 24
[logging.c:0090] daemon: sending reply 40000091 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 9
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 9
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 16
[logging.c:0090] daemon: sending reply 40000009 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 92
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 92
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 320
[logging.c:0090] daemon: sending reply 40000092 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 5
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 5
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000005 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is d
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is d
[logging.c:0090] memb: command return code is 2
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 4000000d to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 90
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 90
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 424
[logging.c:0090] daemon: sending reply 40000090 to fd 10
[logging.c:0090] daemon: read 0 bytes from fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 91
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 91
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 24
[logging.c:0090] daemon: sending reply 40000091 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 9
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 9
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 16
[logging.c:0090] daemon: sending reply 40000009 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 92
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 92
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 320
[logging.c:0090] daemon: sending reply 40000092 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 5
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 5
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000005 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is d
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is d
[logging.c:0090] memb: command return code is 2
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 4000000d to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 90
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 90
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 424
[logging.c:0090] daemon: sending reply 40000090 to fd 10
[logging.c:0090] daemon: read 0 bytes from fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 91
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 91
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 24
[logging.c:0090] daemon: sending reply 40000091 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 9
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 9
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 16
[logging.c:0090] daemon: sending reply 40000009 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 92
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 92
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 320
[logging.c:0090] daemon: sending reply 40000092 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 5
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 5
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 40000005 to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is d
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is d
[logging.c:0090] memb: command return code is 2
[logging.c:0090] daemon: Returning command data. length = 0
[logging.c:0090] daemon: sending reply 4000000d to fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 90
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 90
[logging.c:0090] memb: command return code is 0
[logging.c:0090] daemon: Returning command data. length = 424
[logging.c:0090] daemon: sending reply 40000090 to fd 10
[logging.c:0090] daemon: read 0 bytes from fd 10
[logging.c:0090] daemon: read 20 bytes from fd 10
[logging.c:0090] daemon: client command is 800000bb
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is 800000bb
[logging.c:0090] daemon: sending reply 102 to fd 11
[logging.c:0090] memb: command return code is -11
[logging.c:0090] daemon: read 20 bytes from fd 11
[logging.c:0090] daemon: client command is bc
[logging.c:0090] daemon: About to process command
[logging.c:0090] memb: command to process is bc
[logging.c:0090] memb: Shutdown reply is 1
[logging.c:0090] memb: Sending LEAVE, reason 3
[logging.c:0090] ais: comms send message 0xbfae5bdc len = 4
[logging.c:0090] memb: shutdown decision is: 0 (yes=1, no=0) flags=2
[logging.c:0090] memb: command return code is -11
[logging.c:0090] ais: deliver_fn called, iov_len = 1, iov[0].len = 20, source nodeid = 1, conversion reqd=0
[logging.c:0090] memb: Message on port 0 is 7 (len = 4)
[logging.c:0090] memb: got LEAVE from node 1, reason = 3
[logging.c:0090] daemon: send status return: 0
[logging.c:0090] daemon: sending reply c00000bb to fd 10

From carlopmart at gmail.com  Sat Sep  1 20:00:03 2007
From: carlopmart at gmail.com (carlopmart)
Date: Sat, 01 Sep 2007 22:00:03 +0200
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <46D7E431.2020100@gmail.com>
References: <46D7E431.2020100@gmail.com>
Message-ID: <46D9C4C3.3070009@gmail.com>

carlopmart wrote:
> Hi all,
> 
>  I am running standalone xen host using rhel5 with three rhel5 xen guest 
> with cluster-suite. I have setup fence_xvm as a fence device on all 
> three guest. On the host side I have setup fence_xvmd on cluster.conf file.
> 
>  My problems starts when I need to restart xen server host. Every time 
> that reboots, fence_xvmd doesn't starts. If I execute "service cman 
> restart" all its ok: fence_xvmd starts. Why?? How can I fix it??
> 
> Many thanks.
> 
Please I need an answer about this ...


-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From Nick.Couchman at seakr.com  Sat Sep  1 22:17:27 2007
From: Nick.Couchman at seakr.com (Nick Couchman)
Date: Sat, 01 Sep 2007 16:17:27 -0600
Subject: [Linux-cluster] GFS and GFS2 : two questions: which is
	bettter; gfs_controld error
Message-ID: <46D99096.87A6.0099.1@seakr.com>

In my opinion, GFS2 is still not stable enough for production use.  GFS2 is designed to be better than GFS, but still lacks some stability.  GFS2 has better support for certain features (extended attributes, for example), and is supposed to perform better.
 
You can start with a GFS filesystem, then use the gfs2_convert utility when GFS2 becomes stable to move to GFS2.
 
--Nick

>>> On 2007/09/01 at 04:50:27, "Ian Brown" <ianbrn at gmail.com> wrote:
  - Hello,
   I had installed RHEL5 on two x86_64 machine on the same LAN; afterwards I
   had installed the RHEL5 cluster suite packege (cman-2.0.60-1.el5) and
   openais-0.80.2-1.el5.


   I had also installed kmod-gfs-0.1.16-5.2.6.18_8.el5 and gfs-utils
and gfs2-utils.

   I had crated a 2-node cluster and started the cman service OK on both nodes.

   Now I tried to create a gfs partition with gfs_mkfs (with -p lock_dlm)
   and mount it, and I got errors when trying to mount it (this errors
talk about
   gfs_controld).

   I made a second try with mkfs.gfs2 (also with -p lock_dlm) );
   this time I **could** mounted the gfs2 partition succesfully.

   My questions are:

     - should I be able with this installation to create and mount a gfs
     partition ? in case this is possible - what can be my mistale ?

     - is gfs2 considered safe to work with ? or is it still experimental and
     not recommended ? which features do I have in GFS2 which I don't have in
     GFS?

     Regards,
     Ian


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070901/2c2e1782/attachment.htm>

From ianbrn at gmail.com  Sun Sep  2 08:04:23 2007
From: ianbrn at gmail.com (Ian Brown)
Date: Sun, 2 Sep 2007 11:04:23 +0300
Subject: [Linux-cluster] GFS and GFS2 : two questions: which is bettter;
	gfs_controld error
In-Reply-To: <46D9A38B.50304@redhat.com>
References: <d0383f90709010350g2fc01115jd592867c11bf1546@mail.gmail.com>
	<46D9A38B.50304@redhat.com>
Message-ID: <d0383f90709020104h2096c8edhe5d53864f5944b27@mail.gmail.com>

Hello,
I had ran "modprobe gfs" and I see by lsmod the the gfs module is loaded.

also I had verified that under /lib/modules/MyKernelVersion/extra/gfs/ there is
gfs.ko.

Then I try:

gfs_mkfs -p lock_dlm -t myCLuster -j 32 /dev/cciss/c0d1p2
mount /dev/cciss/c0d1p2 /mnt/gfs

The errors I see in the console are:
/sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
/sbin/mount.gfs: error mounting lockproto lock_dlm

The error I see in kernel log is:
gfs_controld[32629]: mount: not in default fence domain

I want to add that the cman service is started succesfully as the
kernel log shows.

I want also to add that "service cman start" performs modprbe of gfs2 module
and not gfs module !

Namely, I ran rmmod gfs; then, after :
service cman stop
and
rmmod lock_dlm
rmmod gfs2

running  lsmod | grep gfs2 shows that
no gfs2 is loaded,
and after "service cman start" I see by
 lsmod | grep gfs2
gfs2                  522965  1 lock_dlm

which means that starting the cman service performed modprobe/insmod
of gfs2 and lock_dlm

Is this how things should be?

rgs,
Ian

On 9/1/07, Wendy Cheng <wcheng at redhat.com> wrote:
> Ian Brown wrote:
>
> >  - Hello,
> >   I had installed RHEL5 on two x86_64 machine on the same LAN; afterwards I
> >   had installed the RHEL5 cluster suite packege (cman-2.0.60-1.el5) and
> >   openais-0.80.2-1.el5.
> >
> >
> >   I had also installed kmod-gfs-0.1.16-5.2.6.18_8.el5 and gfs-utils
> >and gfs2-utils.
> >
> >   I had crated a 2-node cluster and started the cman service OK on both nodes.
> >
> >   Now I tried to create a gfs partition with gfs_mkfs (with -p lock_dlm)
> >   and mount it, and I got errors when trying to mount it (this errors
> >talk about
> >   gfs_controld).
> >
> >
> You didn't include the error message here ? This could be a known issue
> where gfs kernel module is not loaded by default (due to a RPM
> dependency problem). To check it out: before mounting the gfs partition ...
>
> 1) shell> lsmod
> This is to check whether gfs (not gfs2) kernel module is loaded. If yes,
> mount the gfs partition, then read the /var/log/messages file and
> cut-and-paste the print-out (a.k.a the gfs_controld error messages) and
> repost here.
>
> 2) shell> cd /lib/modules/'your kernel version'/ extra/gfs
> Check if gfs.ko is there. If not, you have installation problems.
>
> 3)  shell> insmod gfs.ko
> This is to manually load gfs kernel module
>
> 4) Retry the mount. If still failing, send us the /var/log/messages file.
>
> >   I made a second try with mkfs.gfs2 (also with -p lock_dlm) );
> >   this time I **could** mounted the gfs2 partition succesfully.
> >
> >
>
> GFS2 is part of the base kernel, so it doesn't need to worry about RPM
> dependency.
>
> >   My questions are:
> >
> >     - should I be able with this installation to create and mount a gfs
> >     partition ? in case this is possible - what can be my mistale ?
> >
> >
>
> See above.
>
> >     - is gfs2 considered safe to work with ? or is it still experimental and
> >     not recommended ? which features do I have in GFS2 which I don't have in
> >     GFS?
> >
> >
> >
> The advantage of GFS2 are (my personal opinion - not necessarily Red
> Hat's) :
> 1. It is mainstream and will be well maintained and updated; vs. GFS
> starts to enter maintanence mode. We're hoping to phase out GFS as soon
> as GFS2 is proved to be stable.
> 2.  It preforms better (faster), particularly for smaller file size, but
> not as stable as GFS.
>
> However, there are tools to facilitate people to migrate from GFS to
> GFS2. So if you want stability, GFS is not a bad choice at this moment.
>
> -- Wendy
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From isplist at logicore.net  Sun Sep  2 10:44:04 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Sun, 2 Sep 2007 05:44:04 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
Message-ID: <2007925444.953113@leena>

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


From kmoriwak at redhat.com  Mon Sep  3 01:42:20 2007
From: kmoriwak at redhat.com (Kazuo Moriwaka)
Date: Mon, 03 Sep 2007 10:42:20 +0900
Subject: [Linux-cluster] Discovering the world of clustering
In-Reply-To: <200708312133.14035.mm@yuhu.biz>
References: <s6d804be.026@smtp.prpb.mpf.gov.br>
	<200708312133.14035.mm@yuhu.biz>
Message-ID: <1188783740.4413.141.camel@kmoriwak>

Hi Claudio,

I'm learning gfs 3 node cluster with xen.  There are some
references:

'Virtualization for Dummies' will be great help to use xen.
http://intranet.corp.redhat.com/ic/intranet/RHEL5info

I put some configuration files in svn, you can see them at:
https://trac.nrt.redhat.com/trac/browser/tools/VMconfigs

regards,

2007-08-31 (Fri) ? 21:33 +0300 ? Marian Marinov ????????:
> You can always use Xen virtual machines which can easy migrate from machine to 
> machine.
> 
> http://www.cl.cam.ac.uk/research/srg/netos/xen/
> 
> You can have the Xen virtual machines over a GFS cluster.
> 
> Best regards
>   Marian Marinov
> On Friday 31 August 2007 18:07:49 Augusto Lima wrote:
> > Hi, i'm Augusto and i don't know much about clusters.
> > I'm from brazil, so my english it's not quite good.
> > I have an idea to test in my organization.
> >
> > We have two large DELL servers with 6GB RAM each and a Xeon 3GHz
> > processor each.
> > They have also lots of disk space.
> >
> > We want to cluster the 2 servers and run VMware Server on them, trying
> > to utilize most of the processors and the available RAM all the time.
> > We have plans to make 6 Virtual Machines running on top of them.
> > We also want to take advantage of High Availbility on our
> > configuration, meaning that if one servers goes down, the other have to
> > hold the 6 VMs for a period of time.
> > We can't afford any paid solution, since our organization does'nt
> > support that kind of implementation.
> >
> > So, i'm wondering if anyone can give a opinion about if it is possible
> > and how can i do it using only free solutions.
> >
> > Thanks in advance,
> >
> > Augusto
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From isplist at logicore.net  Sat Sep  1 00:55:07 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 31 Aug 2007 19:55:07 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
Message-ID: <200783119557.749480@leena>

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


From kmoriwak at redhat.com  Mon Sep  3 01:58:04 2007
From: kmoriwak at redhat.com (Kazuo Moriwaka)
Date: Mon, 03 Sep 2007 10:58:04 +0900
Subject: [Linux-cluster] Discovering the world of clustering
In-Reply-To: <1188783740.4413.141.camel@kmoriwak>
References: <s6d804be.026@smtp.prpb.mpf.gov.br>
	<200708312133.14035.mm@yuhu.biz> <1188783740.4413.141.camel@kmoriwak>
Message-ID: <1188784684.4413.153.camel@kmoriwak>

Hi,

I'm very sorry for I mistaked that this list is red hat internal list..
Links which I sent are unavailable from outside from redhat.com.

I'll show some tips when using xen from them.

- use 'w!' attribute for shared block device
ex. disk = [ 'tap:aio:/media/disk/VMImages/rhel5_1,xvda,w',
'tap:aio:/media/disk/VMImages/gfs_disk,xvdb,w!',]

- make a dummy network interface to build  virtual network,
 in /etc/xen/xend-config.sxp: 
  (network-script 'network-bridge bridge=xenbr0 netdev=dummy0')

- dnsmasq is very useful for constructing dns server for virtual
network.
http://www.thekelleys.org.uk/dnsmasq/doc.html

regards,

2007-09-03 (Mon) ? 10:42 +0900 ? Kazuo Moriwaka ????????:
> Hi Claudio,
> 
> I'm learning gfs 3 node cluster with xen.  There are some
> references:
> 
> 'Virtualization for Dummies' will be great help to use xen.
> http://intranet.corp.redhat.com/ic/intranet/RHEL5info
> 
> I put some configuration files in svn, you can see them at:
> https://trac.nrt.redhat.com/trac/browser/tools/VMconfigs
> 
> regards,
> 
> 2007-08-31 (Fri) ? 21:33 +0300 ? Marian Marinov ????????:
> > You can always use Xen virtual machines which can easy migrate from machine to 
> > machine.
> > 
> > http://www.cl.cam.ac.uk/research/srg/netos/xen/
> > 
> > You can have the Xen virtual machines over a GFS cluster.
> > 
> > Best regards
> >   Marian Marinov
> > On Friday 31 August 2007 18:07:49 Augusto Lima wrote:
> > > Hi, i'm Augusto and i don't know much about clusters.
> > > I'm from brazil, so my english it's not quite good.
> > > I have an idea to test in my organization.
> > >
> > > We have two large DELL servers with 6GB RAM each and a Xeon 3GHz
> > > processor each.
> > > They have also lots of disk space.
> > >
> > > We want to cluster the 2 servers and run VMware Server on them, trying
> > > to utilize most of the processors and the available RAM all the time.
> > > We have plans to make 6 Virtual Machines running on top of them.
> > > We also want to take advantage of High Availbility on our
> > > configuration, meaning that if one servers goes down, the other have to
> > > hold the 6 VMs for a period of time.
> > > We can't afford any paid solution, since our organization does'nt
> > > support that kind of implementation.
> > >
> > > So, i'm wondering if anyone can give a opinion about if it is possible
> > > and how can i do it using only free solutions.
> > >
> > > Thanks in advance,
> > >
> > > Augusto
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
> > 
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From carlopmart at gmail.com  Mon Sep  3 06:42:37 2007
From: carlopmart at gmail.com (carlopmart)
Date: Mon, 03 Sep 2007 08:42:37 +0200
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <46D9C4C3.3070009@gmail.com>
References: <46D7E431.2020100@gmail.com> <46D9C4C3.3070009@gmail.com>
Message-ID: <46DBACDD.6060803@gmail.com>

carlopmart wrote:
> carlopmart wrote:
>> Hi all,
>>
>>  I am running standalone xen host using rhel5 with three rhel5 xen 
>> guest with cluster-suite. I have setup fence_xvm as a fence device on 
>> all three guest. On the host side I have setup fence_xvmd on 
>> cluster.conf file.
>>
>>  My problems starts when I need to restart xen server host. Every time 
>> that reboots, fence_xvmd doesn't starts. If I execute "service cman 
>> restart" all its ok: fence_xvmd starts. Why?? How can I fix it??
>>
>> Many thanks.
>>
> Please I need an answer about this ...
> 
> 

Well I think that I found the problem: cman startup script. In this line:


     # Check for presence of Domain-0; if it's not there, we can't
     # run xvmd.
     #
     xm list --long 2> /dev/null | grep -q "Domain-0" || return 1

If it is executed from command line any result is returned:

  [root at xenhost xen]# xm list --long 2> /dev/null | grep -q "Domain-0"
  [root at xenhost xen]#

If I put -X under /etc/sysconfig/cman on FENCE_XVMD_OPTS, nothing 
happens. Is this a bug???

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From maalgi at ono.com  Mon Sep  3 11:03:58 2007
From: maalgi at ono.com (maalgi at ono.com)
Date: Mon, 3 Sep 2007 13:03:58 +0200 (CEST)
Subject: [Linux-cluster] Fence Device (Ethernet)
Message-ID: <18776480.239821188817438698.JavaMail.root@resprs03>

Hi, the first thing, sorry for my english i'm spanish.
I'm trying to mount in an eviorement of test a system cluster, with two pc's.
None PC have fence devices installed.
Every PC has 2 cards of network ethernet, one of wich this one in private network 10.0.0.x, for test to another node.
Another card has an ip 192.168.x.x, of my net and a virtual ip 192.168.x.x1 to the access to services of the cluster.

NODE1
eth0------->192.168.56.15
eth0:1----->192.168.56.24
eth1------->10.0.0.1

NODE2
eth0----->192.168.56.16
eth0:1--->192.168.56.24
eth1------>10.0.0.2

The interfaz "eth0:1" is up when cluster is up, while in the other node interfaz and cluster are down.
Is possible configure the cluster this way??
That type of device fence must i to use??

I probe to configure cluster (eth0:1) with WTI, APC.... fence devices, and i have same ressults.

/etc/init.d/ccsd start OK
/etc/init.d/fenced start FAIL
......

Thank you very much and regards.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070903/b85b723e/attachment.htm>

From gsrlinux at gmail.com  Mon Sep  3 11:24:56 2007
From: gsrlinux at gmail.com (GS R)
Date: Mon, 3 Sep 2007 16:54:56 +0530
Subject: [Linux-cluster] Fence Device (Ethernet)
In-Reply-To: <18776480.239821188817438698.JavaMail.root@resprs03>
References: <18776480.239821188817438698.JavaMail.root@resprs03>
Message-ID: <d765e01f0709030424w31ffcdfy68871a6ddf30e462@mail.gmail.com>

>
> NODE1
> eth0------->192.168.56.15
> eth0:1----->192.168.56.24
> eth1------->10.0.0.1
>
> NODE2
> eth0----->192.168.56.16
> eth0:1--->192.168.56.24
> eth1------>10.0.0.2
>
> The interfaz "eth0:1" is up when cluster is up, while in the other node
> interfaz and cluster are down.
> Is possible configure the cluster this way??


Yes. Assuming that you have a common storage.

That type of device fence must i to use??


You can your GNBD as your fencing device.

I probe to configure cluster (eth0:1) with WTI, APC.... fence devices, and i
> have same ressults.
>
> /etc/init.d/ccsd start OK
> /etc/init.d/fenced start FAIL


Which version of RedHat are you using?
/etc/init.d/cman start
&
There is no /etc/init.d/fenced start use /sbin/fenced instead.

Thanks
Gowrishankar Rajaiyan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070903/f5bd6991/attachment.htm>

From maalgi at ono.com  Mon Sep  3 11:03:58 2007
From: maalgi at ono.com (maalgi at ono.com)
Date: Mon, 3 Sep 2007 13:03:58 +0200 (CEST)
Subject: [Linux-cluster] Fence Device (Ethernet)
Message-ID: <18776480.239821188817438698.JavaMail.root@resprs03>

Hi, the first thing, sorry for my english i'm spanish.
I'm trying to mount in an eviorement of test a system cluster, with two pc's.
None PC have fence devices installed.
Every PC has 2 cards of network ethernet, one of wich this one in private network 10.0.0.x, for test to another node.
Another card has an ip 192.168.x.x, of my net and a virtual ip 192.168.x.x1 to the access to services of the cluster.

NODE1
eth0------->192.168.56.15
eth0:1----->192.168.56.24
eth1------->10.0.0.1

NODE2
eth0----->192.168.56.16
eth0:1--->192.168.56.24
eth1------>10.0.0.2

The interfaz "eth0:1" is up when cluster is up, while in the other node interfaz and cluster are down.
Is possible configure the cluster this way??
That type of device fence must i to use??

I probe to configure cluster (eth0:1) with WTI, APC.... fence devices, and i have same ressults.

/etc/init.d/ccsd start OK
/etc/init.d/fenced start FAIL
......

Thank you very much and regards.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070903/b85b723e/attachment-0001.htm>

From Augusto at prpb.mpf.gov.br  Mon Sep  3 16:26:52 2007
From: Augusto at prpb.mpf.gov.br (Augusto Lima)
Date: Mon, 03 Sep 2007 13:26:52 -0300
Subject: [Linux-cluster] Discovering the world of clustering
Message-ID: <s6dc0ba5.070@smtp.prpb.mpf.gov.br>

Thanks very much for the answers but as i was tolding you, its worth for
me a cluster which share computational powers between the machines, and
not only the HD.

Regards,

Augusto

>>> kmoriwak at redhat.com 02/09/2007 22:58 >>>
Hi,

I'm very sorry for I mistaked that this list is red hat internal
list..
Links which I sent are unavailable from outside from redhat.com.

I'll show some tips when using xen from them.

- use 'w!' attribute for shared block device
ex. disk = [ 'tap:aio:/media/disk/VMImages/rhel5_1,xvda,w',
'tap:aio:/media/disk/VMImages/gfs_disk,xvdb,w!',]

- make a dummy network interface to build  virtual network,
 in /etc/xen/xend-config.sxp: 
  (network-script 'network-bridge bridge=xenbr0 netdev=dummy0')

- dnsmasq is very useful for constructing dns server for virtual
network.
http://www.thekelleys.org.uk/dnsmasq/doc.html 

regards,

2007-09-03 (Mon) ? 10:42 +0900 ? Kazuo Moriwaka ????????:
> Hi Claudio,
> 
> I'm learning gfs 3 node cluster with xen.  There are some
> references:
> 
> 'Virtualization for Dummies' will be great help to use xen.
> http://intranet.corp.redhat.com/ic/intranet/RHEL5info 
> 
> I put some configuration files in svn, you can see them at:
> https://trac.nrt.redhat.com/trac/browser/tools/VMconfigs 
> 
> regards,
> 
> 2007-08-31 (Fri) ? 21:33 +0300 ? Marian Marinov ????????:
> > You can always use Xen virtual machines which can easy migrate from
machine to 
> > machine.
> > 
> > http://www.cl.cam.ac.uk/research/srg/netos/xen/ 
> > 
> > You can have the Xen virtual machines over a GFS cluster.
> > 
> > Best regards
> >   Marian Marinov
> > On Friday 31 August 2007 18:07:49 Augusto Lima wrote:
> > > Hi, i'm Augusto and i don't know much about clusters.
> > > I'm from brazil, so my english it's not quite good.
> > > I have an idea to test in my organization.
> > >
> > > We have two large DELL servers with 6GB RAM each and a Xeon 3GHz
> > > processor each.
> > > They have also lots of disk space.
> > >
> > > We want to cluster the 2 servers and run VMware Server on them,
trying
> > > to utilize most of the processors and the available RAM all the
time.
> > > We have plans to make 6 Virtual Machines running on top of them.
> > > We also want to take advantage of High Availbility on our
> > > configuration, meaning that if one servers goes down, the other
have to
> > > hold the 6 VMs for a period of time.
> > > We can't afford any paid solution, since our organization
does'nt
> > > support that kind of implementation.
> > >
> > > So, i'm wondering if anyone can give a opinion about if it is
possible
> > > and how can i do it using only free solutions.
> > >
> > > Thanks in advance,
> > >
> > > Augusto
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com 
> > > https://www.redhat.com/mailman/listinfo/linux-cluster 
> > 
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com 
> > https://www.redhat.com/mailman/listinfo/linux-cluster 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com 
> https://www.redhat.com/mailman/listinfo/linux-cluster 

--
Linux-cluster mailing list
Linux-cluster at redhat.com 
https://www.redhat.com/mailman/listinfo/linux-cluster 


From wcheng at redhat.com  Mon Sep  3 21:49:48 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Mon, 03 Sep 2007 17:49:48 -0400
Subject: [Linux-cluster] GFS and GFS2 : two questions: which is bettter;
	gfs_controld error
In-Reply-To: <d0383f90709020104h2096c8edhe5d53864f5944b27@mail.gmail.com>
References: <d0383f90709010350g2fc01115jd592867c11bf1546@mail.gmail.com>	
	<46D9A38B.50304@redhat.com>
	<d0383f90709020104h2096c8edhe5d53864f5944b27@mail.gmail.com>
Message-ID: <46DC817C.1050208@redhat.com>

Ian Brown wrote:

>gfs_mkfs -p lock_dlm -t myCLuster -j 32 /dev/cciss/c0d1p2
>  
>
Few things:

First, not sure why gfs_mkfs let you get away without specifying 
filesystem-name (-t option) .. ideally a gfs_mkfs should be dispatched as:
shell> gfs_mkfs -t mycluster:myfs -p lock_dlm -j 2 /dev/vg0/mygfs (see 
the ":" between the cluster name and fs name here ?).

Do a "man gfs_mkfs" to get the correct syntax of "-t" (locktable)

Second, I notice you didn't use (c)lvm partition but a cciss raw device. 
How many nodes do you have in the cluster (or how many nodes do you plan 
to access this particular filesystem) ? If it is planned for multiple 
nodes access, please use (cluster version of) LVM (clvm). If this is for 
single node access, it is probably better using "-p nolock" protocol but 
"-p lock_dlm" should work fine.

>mount /dev/cciss/c0d1p2 /mnt/gfs
>
>The errors I see in the console are:
>/sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
>/sbin/mount.gfs: error mounting lockproto lock_dlm
>
>The error I see in kernel log is:
>gfs_controld[32629]: mount: not in default fence domain
>  
>
In theory, when you do "mount", the gfs-kmod should be loaded 
automatically (assume "service cman start" has been run). Check your 
/etc/cluster/cluster.conf file please! Also make sure "fenced" is up and 
runnning ("service cman start" should bring it up) when you do the mount.

>I want to add that the cman service is started succesfully as the
>kernel log shows.
>
>I want also to add that "service cman start" performs modprbe of gfs2 module
>and not gfs module !
>
>Namely, I ran rmmod gfs; then, after :
>service cman stop
>and
>rmmod lock_dlm
>rmmod gfs2
>
>running  lsmod | grep gfs2 shows that
>no gfs2 is loaded,
>and after "service cman start" I see by
> lsmod | grep gfs2
>gfs2                  522965  1 lock_dlm
>
>which means that starting the cman service performed modprobe/insmod
>of gfs2 and lock_dlm
>
>Is this how things should be?
>
>  
>
Yes, it was the original design for RHEL5 (i.e., gfs2 is the default). 
However, you really shouldn't worry about this module loading business. 
The "mount" should be able to find the correct module and load the 
module behind the scene. If your gfs-kmod correctly exists in 
/lib/modules directory, then I don't have goold clues why things go 
wrong (it works for me). Open a service ticket if you have RHEL 
subscription (so support folks can look into the details). Or maybe GFS 
team's other team member can spot anything that I've missed ?

-- Wendy


From kadlec at sunserv.kfki.hu  Tue Sep  4 09:26:18 2007
From: kadlec at sunserv.kfki.hu (Kadlecsik Jozsi)
Date: Tue, 4 Sep 2007 11:26:18 +0200 (MEST)
Subject: [Linux-cluster] quorum lost in spite of 'leave remove'
In-Reply-To: <Pine.GSO.4.64.0708311542080.16749@sunserv.kfki.hu>
References: <Pine.GSO.4.64.0708311542080.16749@sunserv.kfki.hu>
Message-ID: <Pine.GSO.4.64.0709041116210.3787@sunserv.kfki.hu>

On Fri, 31 Aug 2007, Kadlecsik Jozsi wrote:

> In spite of having 'fence_tool leave' and 'cman_tool leave remove' in the 
> 'cman' init script, when stopping the five-member cluster, it looses 
> quorum when only two machines run the cluster components:
> 
> root at web1:~# cman_tool status
> Version: 6.0.1
> Config Version: 6
> Cluster Name: kfki
> Cluster Id: 1583
> Cluster Member: Yes
> Cluster Generation: 748
> Membership state: Cluster-Member
> Nodes: 2
> Expected votes: 5
> Total votes: 2
> Quorum: 3 Activity blocked
> Active subsystems: 7
> Flags: 
> Ports Bound: 0 11  
> Node name: web1-gfs
> Node ID: 4
> Multicast addresses: 224.0.0.3 
> Node addresses: 192.168.192.6 
> 
> root at web1:~# cman_tool nodes 
> Node  Sts   Inc   Joined               Name
>    1   X    728                        lxserv0-gfs
>    2   M    728   2007-08-31 09:19:09  lxserv1-gfs
>    3   X    728                        web0-gfs
>    4   M    724   2007-08-31 09:18:48  web1-gfs
>    5   X    728                        saturn-gfs
> 
> '/etc/init.d/cman stop' was issued and executed successfully on the tree 
> other nodes.

As I see it happens because the 'expected_votes' of the nodes are not
adjusted when nodes are removed. So even when decreasing of the quorum is 
allowed, the highest expected vote value prevents decreasing the 
value of the quorum.

I wrote the attached patch to adjust expected_votes when a node is removed 
(and when it appears again). Please review it and apply if you agree with 
it.

Best regards,
Jozsef
--
E-mail : kadlec at sunserv.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary
-------------- next part --------------
diff -urN --exclude=deb cluster-2.01.00.orig/cman/daemon/commands.c cluster-2.01.00/cman/daemon/commands.c
--- cluster-2.01.00.orig/cman/daemon/commands.c	2007-06-26 11:09:13.000000000 +0200
+++ cluster-2.01.00/cman/daemon/commands.c	2007-09-04 10:43:27.000000000 +0200
@@ -1867,7 +1867,7 @@
 	}
 }
 
-void override_expected(int newexp)
+void reset_expected(int may_increase, int newexp)
 {
 	struct list *nodelist;
 	struct cluster_node *node;
@@ -1875,13 +1875,12 @@
 	list_iterate(nodelist, &cluster_members_list) {
 		node = list_item(nodelist, struct cluster_node);
 		if (node->state == NODESTATE_MEMBER
-		    && node->expected_votes > newexp) {
+		    && (node->expected_votes > newexp || may_increase)) {
 			node->expected_votes = newexp;
 		}
 	}
 }
 
-
 /* Add a node from CCS, note that it may already exist if user has simply updated the config file */
 void add_ccs_node(char *nodename, int nodeid, int votes, int expected_votes)
 {
@@ -1942,6 +1941,8 @@
 		node->incarnation = incarnation;
 		node->state = NODESTATE_MEMBER;
 		cluster_members++;
+		if ((node->leave_reason & 0xF) == CLUSTER_LEAVEFLAG_REMOVED)
+			reset_expected(1, us->expected_votes + node->votes);
 		recalculate_quorum(0);
 	}
 }
@@ -1983,9 +1984,11 @@
 		node->state = NODESTATE_DEAD;
 		cluster_members--;
 
-		if ((node->leave_reason & 0xF) == CLUSTER_LEAVEFLAG_REMOVED)
+		if ((node->leave_reason & 0xF) == CLUSTER_LEAVEFLAG_REMOVED) {
+			override_expected(us->expected_votes > node->votes ?
+					  us->expected_votes - node->votes : 1);
 			recalculate_quorum(1);
-		else
+		} else
 			recalculate_quorum(0);
 		break;
 
diff -urN --exclude=deb cluster-2.01.00.orig/cman/daemon/commands.h cluster-2.01.00/cman/daemon/commands.h
--- cluster-2.01.00.orig/cman/daemon/commands.h	2006-08-17 15:22:39.000000000 +0200
+++ cluster-2.01.00/cman/daemon/commands.h	2007-09-04 10:28:17.000000000 +0200
@@ -29,12 +29,12 @@
 extern void add_ais_node(int nodeid, uint64_t incarnation, int total_members);
 extern void del_ais_node(int nodeid);
 extern void add_ccs_node(char *name, int nodeid, int votes, int expected_votes);
-extern void override_expected(int expected);
+extern void reset_expected(int may_increase, int expected);
 extern void cman_send_confchg(unsigned int *member_list, int member_list_entries,
 			      unsigned int *left_list, int left_list_entries,
 			      unsigned int *joined_list, int joined_list_entries);
 
-
+#define override_expected(expected)	reset_expected(0, expected)
 
 /* Startup stuff called from cmanccs: */
 extern int cman_set_nodename(char *name);

From alain.richard at equation.fr  Tue Sep  4 09:53:42 2007
From: alain.richard at equation.fr (Alain RICHARD)
Date: Tue, 4 Sep 2007 11:53:42 +0200
Subject: [Linux-cluster] Multipathed quorum disk
In-Reply-To: <39fdf1c70708311757h75a57fc3r15b740ed8ad0f58b@mail.gmail.com>
References: <39fdf1c70708311757h75a57fc3r15b740ed8ad0f58b@mail.gmail.com>
Message-ID: <46724B2A-C44E-4D0C-98C5-34B33CC5F253@equation.fr>


Le 1 sept. 07 ? 02:57, Claudio Tassini a ?crit :

> Hi,
>
> I recently upgraded a 2-nodes cluster adding two more nodes. I  
> would like a single node to remain in cluster even if the other  
> three are out of service, so I'm trying to add a quorum disk to the  
> cluster.
>
> The problem is that the quorum disk is a LUN in a shared storage  
> which has not the same device name through all the cluster nodes.  
> Moreover, we use device-mapper AND lvm. I could resolve the problem  
> using an lvm logical volume, because it would always have the same  
> name and recognize the underlying "dm" or "sd" device name even if  
> it changes across a reboot, but I've read that it's not advisable  
> to use a logical volume as quorum device.
>
> Any idea?
>
> -- 
> Claudio Tassini
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


using multipath -ll you'll see that your multipath device has got a  
unique id (wwid) :

# multipath -ll
/dev/mpath/mpath2 (200c0b60a76000032) dm-3 SNAP,FILEDISK
[size=10M][features=0][hwhandler=0]
\_ round-robin 0 [prio=0][active]
\_ 2:0:0:0 sdd 8:48  [active][ready]
\_ 1:0:0:0 sdc 8:32  [active][ready]

all you have to do then, is to modify your /etc/multipath.conf file  
to ask a fixed name for this multipath device instead of having it  
get a dynamique name (/dev/mpath/mpathx) :

/etc/multipath.conf :

...
multipaths {
         multipath {
                 wwid                    200c0b60a76000032
                 alias                   qdsk1
         }
}

and then :

# multipath -ll
[root at titan2 ~]# multipath -ll
qdsk1 (200c0b60a76000032) dm-4 SNAP,FILEDISK
[size=10M][features=0][hwhandler=0]
\_ round-robin 0 [prio=0][active]
\_ 4:0:0:0 sdf 8:80  [active][ready]
\_ 3:0:0:0 sde 8:64  [active][ready]

(please, be warn that the first time you do it, it rename the  
multipath device to the name you have ask for, but it fails to rename  
the /dev/mpath/ device, so you have to do it manually once).

do it on all your cluster members and they all get the multipath  
device with the same name.

I have also encoutered a problem with cman that refuse to register a  
node with more than 16 chars (qdiskd register the qdisk device as a  
node name). So you must ensure your device path is less than 16 chars  
for a qdisk device (this is why I use /dev/mpath/qdsk1 instead of / 
dev/mpath/qdisk1).

Regards,

-- 
Alain RICHARD <mailto:alain.richard at equation.fr>
EQUATION SA <http://www.equation.fr/>
Tel : +33 477 79 48 00     Fax : +33 477 79 48 01
E-Liance, Op?rateur des entreprises et collectivit?s,
Liaisons Fibre optique, SDSL et ADSL <http://www.e-liance.fr>


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070904/592eeccb/attachment.htm>

From pawel.mastalerz at mainseek.com  Tue Sep  4 11:41:37 2007
From: pawel.mastalerz at mainseek.com (=?ISO-8859-2?Q?Pawe=B3_Mastalerz?=)
Date: Tue, 04 Sep 2007 13:41:37 +0200
Subject: [Linux-cluster] GFS and iscsi problem
Message-ID: <46DD4471.6010304@mainseek.com>

Hi,

I have some problem with gfs cluster and iscsi VTrak M500i.

Cluster structure looks like that:each of 14 nodes is connected to 
vtrack and have sdb7 disc mounted with GFS.Right now 6 machines are 
using thath disc to read&write images.Those 6 machines, on which is that 
site stored, are plugged to LB. Scheme looks like that:

			*iscsi*
			  | |
			<switch>
		  |	|     |	    |	
		node1 node2 node3 node4... etc


config:

<?xml version="1.0"?>
<cluster name="webnews" config_version="1">.
          <clusternodes>
            <clusternode name="www1" votes="1">
              <fence>
                <method name="1">
                  <device name="blade" ipaddr="192.168.3.42" blade="1"/>
                </method>
              </fence>
            </clusternode>
            <clusternode name="www2" votes="1">
              <fence>
                <method name="1">
                  <device name="blade" ipaddr="192.168.3.43" blade="2"/>
                </method>
              </fence>
            </clusternode>

(...)

 From time to time there is a problem on one of those nodes with loosing 
connection to iscsi, when that happens the whole GFS is blocked and the 
rest of the nodes has no access to that partition (sdb7) :(
Question - Why is GFS blocking access to that directory for all nodes if 
on the node (which cause problems) connection to SCSI has been recovered?
I suppose that is GFS fault, but why logs dont show that? The only thing 
i can do now is to reload cluster and GFS.

-- 
Pawel Mastalerz
pawel[dot]mastalerz[at]mainseek[dot]com
http://mainseek.net/


From Alexandre.Racine at mhicc.org  Tue Sep  4 16:57:10 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Tue, 4 Sep 2007 12:57:10 -0400
Subject: [Linux-cluster] GFS and iscsi problem
References: <46DD4471.6010304@mainseek.com>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C344B@cumulonimbus.RG.local>

Hi all, I would like to know that too, since I made some similar tests and GFS seems simply to hang.


My config:
# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster name="grappesgsge" config_version="1" ipaddr="192.168.1.20">

  <cman expected_votes="1">
  </cman>

  <clusternodes>
    <clusternode name="TORQUE1">
      <fence>
        <method name="human">
          <device name="human" ipaddr="TORQUE1"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>

  <clusternodes>
    <clusternode name="TORQUE2">
      <fence>
        <method name="human">
          <device name="human" ipaddr="TORQUE2"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>

  <clusternodes>
    <clusternode name="TORQUE3">
      <fence>
        <method name="human">
          <device name="human" ipaddr="TORQUE3"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>

  <fence_devices>
    <device name="human" agent="fence_manual"/>
  </fence_devices>

</cluster>


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Pawel Mastalerz
Sent: Tue 2007-09-04 07:41
To: linux-cluster at redhat.com
Subject: [Linux-cluster] GFS and iscsi problem
 
Hi,

I have some problem with gfs cluster and iscsi VTrak M500i.

Cluster structure looks like that:each of 14 nodes is connected to 
vtrack and have sdb7 disc mounted with GFS.Right now 6 machines are 
using thath disc to read&write images.Those 6 machines, on which is that 
site stored, are plugged to LB. Scheme looks like that:

			*iscsi*
			  | |
			<switch>
		  |	|     |	    |	
		node1 node2 node3 node4... etc


config:

<?xml version="1.0"?>
<cluster name="webnews" config_version="1">.
          <clusternodes>
            <clusternode name="www1" votes="1">
              <fence>
                <method name="1">
                  <device name="blade" ipaddr="192.168.3.42" blade="1"/>
                </method>
              </fence>
            </clusternode>
            <clusternode name="www2" votes="1">
              <fence>
                <method name="1">
                  <device name="blade" ipaddr="192.168.3.43" blade="2"/>
                </method>
              </fence>
            </clusternode>

(...)

 From time to time there is a problem on one of those nodes with loosing 
connection to iscsi, when that happens the whole GFS is blocked and the 
rest of the nodes has no access to that partition (sdb7) :(
Question - Why is GFS blocking access to that directory for all nodes if 
on the node (which cause problems) connection to SCSI has been recovered?
I suppose that is GFS fault, but why logs dont show that? The only thing 
i can do now is to reload cluster and GFS.

-- 
Pawel Mastalerz
pawel[dot]mastalerz[at]mainseek[dot]com
http://mainseek.net/

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3862 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070904/0c1cd20d/attachment.bin>

From orkcu at yahoo.com  Tue Sep  4 18:16:15 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Tue, 4 Sep 2007 11:16:15 -0700 (PDT)
Subject: [Linux-cluster] GFS and iscsi problem
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C344B@cumulonimbus.RG.local>
Message-ID: <353580.21090.qm@web50606.mail.re2.yahoo.com>


--- Alexandre Racine <Alexandre.Racine at mhicc.org>
wrote:

> Hi all, I would like to know that too, since I made
> some similar tests and GFS seems simply to hang.
> 
 people getting this king of problem usually have a
problem with fencing, your fencing is manual, this is
really bad for production because if there is a
problem with the GFS in one node,the cluster will wait
for that node to be feced and if it fenced by humand
hand.....
until you send the aknowledge that the cluster will be
in standby.

> 
> 
> My config:
> # cat /etc/cluster/cluster.conf

>       <fence>
>         <method name="human">
>           <device name="human" ipaddr="TORQUE1"/>
>         </method>
>       </fence>
>       <fence>
>         <method name="human">
>           <device name="human" ipaddr="TORQUE2"/>
>         </method>
>       </fence>
>       <fence>
>         <method name="human">
>           <device name="human" ipaddr="TORQUE3"/>
>         </method>
>       </fence>
>   <fence_devices>
>     <device name="human" agent="fence_manual"/>
>   </fence_devices>

use a real fence device and try gfs again :-)

cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Moody friends. Drama queens. Your life? Nope! - their life, your story. Play Sims Stories at Yahoo! Games.
http://sims.yahoo.com/  


From pawel.mastalerz at mainseek.com  Tue Sep  4 18:28:10 2007
From: pawel.mastalerz at mainseek.com (=?UTF-8?B?UGF3ZcWCIE1hc3RhbGVyeg==?=)
Date: Tue, 04 Sep 2007 20:28:10 +0200
Subject: [Linux-cluster] GFS and iscsi problem
In-Reply-To: <353580.21090.qm@web50606.mail.re2.yahoo.com>
References: <353580.21090.qm@web50606.mail.re2.yahoo.com>
Message-ID: <46DDA3BA.9010403@mainseek.com>

Roger Pe?a pisze:
> --- Alexandre Racine <Alexandre.Racine at mhicc.org>
> wrote:
> 
>> Hi all, I would like to know that too, since I made
>> some similar tests and GFS seems simply to hang.
>>
>  people getting this king of problem usually have a
> problem with fencing, your fencing is manual, this is
> really bad for production because if there is a
> problem with the GFS in one node,the cluster will wait
> for that node to be feced and if it fenced by humand
> hand.....
> until you send the aknowledge that the cluster will be
> in standby.

Yes, but i use:

<fencedevice name="blade" agent="fence_bladecenter" ipaddr=....

and it's not a problem, fence work fine. Fence work only when one of 
nodes is down or have some other problem with connection to other nodes.


-- 
Pawel Mastalerz
pawel[dot]mastalerz[at]mainseek[dot]com
http://mainseek.net/


From orkcu at yahoo.com  Tue Sep  4 18:38:45 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Tue, 4 Sep 2007 11:38:45 -0700 (PDT)
Subject: [Linux-cluster] GFS and iscsi problem
In-Reply-To: <46DDA3BA.9010403@mainseek.com>
Message-ID: <7398.83425.qm@web50611.mail.re2.yahoo.com>


--- Pawe?? Mastalerz <pawel.mastalerz at mainseek.com>
wrote:

> Roger Pe?a pisze:
> > --- Alexandre Racine <Alexandre.Racine at mhicc.org>
> > wrote:
> > 
> >> Hi all, I would like to know that too, since I
> made
> >> some similar tests and GFS seems simply to hang.
> >>
> >  people getting this king of problem usually have
> a
> > problem with fencing, your fencing is manual, this
> is
> > really bad for production because if there is a
> > problem with the GFS in one node,the cluster will
> wait
> > for that node to be feced and if it fenced by
> humand
> > hand.....
> > until you send the aknowledge that the cluster
> will be
> > in standby.
> 
> Yes, but i use:
> 
> <fencedevice name="blade" agent="fence_bladecenter"
> ipaddr=....
> 
sorry, I didn't read throught your messages, just
looked at Alexandre Racine's configuration


> and it's not a problem, fence work fine. Fence work
> only when one of 
> nodes is down or have some other problem with
> connection to other nodes.
well, I would expect if one node has a problem with
its GFS filesystem ( for example, network failure in
the iscsi scenario), the cluster should-must fence
that node just to avoid filesystem corruption
but I could be wrong...


cu
roger


__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Pinpoint customers who are looking for what you sell. 
http://searchmarketing.yahoo.com/


From carlopmart at gmail.com  Tue Sep  4 19:21:02 2007
From: carlopmart at gmail.com (carlopmart)
Date: Tue, 04 Sep 2007 21:21:02 +0200
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <46DBACDD.6060803@gmail.com>
References: <46D7E431.2020100@gmail.com> <46D9C4C3.3070009@gmail.com>
	<46DBACDD.6060803@gmail.com>
Message-ID: <46DDB01E.4030803@gmail.com>

carlopmart wrote:
> carlopmart wrote:
>> carlopmart wrote:
>>> Hi all,
>>>
>>>  I am running standalone xen host using rhel5 with three rhel5 xen 
>>> guest with cluster-suite. I have setup fence_xvm as a fence device on 
>>> all three guest. On the host side I have setup fence_xvmd on 
>>> cluster.conf file.
>>>
>>>  My problems starts when I need to restart xen server host. Every 
>>> time that reboots, fence_xvmd doesn't starts. If I execute "service 
>>> cman restart" all its ok: fence_xvmd starts. Why?? How can I fix it??
>>>
>>> Many thanks.
>>>
>> Please I need an answer about this ...
>>
>>
> 
> Well I think that I found the problem: cman startup script. In this line:
> 
> 
>     # Check for presence of Domain-0; if it's not there, we can't
>     # run xvmd.
>     #
>     xm list --long 2> /dev/null | grep -q "Domain-0" || return 1
> 
> If it is executed from command line any result is returned:
> 
>  [root at xenhost xen]# xm list --long 2> /dev/null | grep -q "Domain-0"
>  [root at xenhost xen]#
> 
> If I put -X under /etc/sysconfig/cman on FENCE_XVMD_OPTS, nothing 
> happens. Is this a bug???
> 

Please any hints about this???

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From Alexandre.Racine at mhicc.org  Tue Sep  4 19:50:47 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Tue, 4 Sep 2007 15:50:47 -0400
Subject: [Linux-cluster] GFS and iscsi problem
References: <353580.21090.qm@web50606.mail.re2.yahoo.com>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C344E@cumulonimbus.RG.local>

Haaaaaa. I can see the light now. This was the last part of the puzzle I needed and that I had put beside at first. Thanks.

For those who want to have more infos on this:
http://sourceware.org/cluster/faq.html#fence_what


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Roger Pe?a
Sent: Tue 2007-09-04 14:16
To: linux clustering
Subject: RE: [Linux-cluster] GFS and iscsi problem
 

--- Alexandre Racine <Alexandre.Racine at mhicc.org>
wrote:

> Hi all, I would like to know that too, since I made
> some similar tests and GFS seems simply to hang.
> 
 people getting this king of problem usually have a
problem with fencing, your fencing is manual, this is
really bad for production because if there is a
problem with the GFS in one node,the cluster will wait
for that node to be feced and if it fenced by humand
hand.....
until you send the aknowledge that the cluster will be
in standby.

> 
> 
> My config:
> # cat /etc/cluster/cluster.conf

>       <fence>
>         <method name="human">
>           <device name="human" ipaddr="TORQUE1"/>
>         </method>
>       </fence>
>       <fence>
>         <method name="human">
>           <device name="human" ipaddr="TORQUE2"/>
>         </method>
>       </fence>
>       <fence>
>         <method name="human">
>           <device name="human" ipaddr="TORQUE3"/>
>         </method>
>       </fence>
>   <fence_devices>
>     <device name="human" agent="fence_manual"/>
>   </fence_devices>

use a real fence device and try gfs again :-)

cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Moody friends. Drama queens. Your life? Nope! - their life, your story. Play Sims Stories at Yahoo! Games.
http://sims.yahoo.com/  

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3458 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070904/fed75522/attachment.bin>

From lhh at redhat.com  Tue Sep  4 21:03:28 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 4 Sep 2007 17:03:28 -0400
Subject: [Linux-cluster] Fence Device (Ethernet)
In-Reply-To: <18776480.239821188817438698.JavaMail.root@resprs03>
References: <18776480.239821188817438698.JavaMail.root@resprs03>
Message-ID: <20070904210328.GF19477@redhat.com>

On Mon, Sep 03, 2007 at 01:03:58PM +0200, maalgi at ono.com wrote:
> Hi, the first thing, sorry for my english i'm spanish.

We'll try to help anyway.

> I probe to configure cluster (eth0:1) with WTI, APC.... fence devices, and i have same ressults.
> 
> /etc/init.d/ccsd start OK
> /etc/init.d/fenced start FAIL

cman must start before fenced;

/etc/init.d/ccsd start
/etc/init.d/cman start
/etc/init.d/fenced start

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Tue Sep  4 21:04:34 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 4 Sep 2007 17:04:34 -0400
Subject: [Linux-cluster] Fence Device (Ethernet)
In-Reply-To: <d765e01f0709030424w31ffcdfy68871a6ddf30e462@mail.gmail.com>
References: <18776480.239821188817438698.JavaMail.root@resprs03>
	<d765e01f0709030424w31ffcdfy68871a6ddf30e462@mail.gmail.com>
Message-ID: <20070904210434.GG19477@redhat.com>

On Mon, Sep 03, 2007 at 04:54:56PM +0530, GS R wrote:
> Which version of RedHat are you using?
> /etc/init.d/cman start
> &
> There is no /etc/init.d/fenced start use /sbin/fenced instead.

On RHEL4 / cluster-1.0x, there is.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Tue Sep  4 21:10:05 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 4 Sep 2007 17:10:05 -0400
Subject: [Linux-cluster] Multipathed quorum disk
In-Reply-To: <39fdf1c70708311757h75a57fc3r15b740ed8ad0f58b@mail.gmail.com>
References: <39fdf1c70708311757h75a57fc3r15b740ed8ad0f58b@mail.gmail.com>
Message-ID: <20070904211005.GH19477@redhat.com>

On Sat, Sep 01, 2007 at 02:57:48AM +0200, Claudio Tassini wrote:
> Hi,
> I recently upgraded a 2-nodes cluster adding two more nodes. I would like a
> single node to remain in cluster even if the other three are out of service,
> so I'm trying to add a quorum disk to the cluster.
> 
> The problem is that the quorum disk is a LUN in a shared storage which has
> not the same device name through all the cluster nodes. Moreover, we use
> device-mapper AND lvm. I could resolve the problem using an lvm logical
> volume, because it would always have the same name and recognize the
> underlying "dm" or "sd" device name even if it changes across a reboot, but
> I've read that it's not advisable to use a logical volume as quorum device.

You can do it, but if the LVM volume is clustered, you can introduce a
circular dependency:

  * need quorum to access CLVM volume
  * need CLVM volume to become quorate...

You can work around this by making qdisk's votes 1 less than the number
of nodes in the cluster.   Ex: 1 vote for qdisk and 1 vote per node,
expected_votes = 3 & two_node = 0 for CMAN.

Then, both nodes can come online before you start qdisk to eliminate the
"chicken and egg" problem.

This might also be related -

There's a bugzilla open against qdisk; as qdisk doesn't work with
devices which do not have a 512 byte sector size.  I should have a fix
for it this week.

https://bugzilla.redhat.com/show_bug.cgi?id=272861

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Tue Sep  4 21:13:23 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 4 Sep 2007 17:13:23 -0400
Subject: [Linux-cluster] RE: qdisk votes not in cman
In-Reply-To: <30E8283B-B35E-4DE2-A8B6-9D59ED51C3E8@equation.fr>
References: <30E8283B-B35E-4DE2-A8B6-9D59ED51C3E8@equation.fr>
Message-ID: <20070904211323.GI19477@redhat.com>

On Fri, Aug 31, 2007 at 12:46:50PM +0200, Alain RICHARD wrote:
> Perhaps a better error reporting is needed in qdiskd to shows that we  
> have hit this problem. Also using a generic name like "qdisk device"  
> when qdiskd is registering its node to cman is a better approach.

What about using the label instead of the device name, and restricting
the label to 16 chars when advertising to cman?

-- Lon

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Tue Sep  4 21:14:04 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 4 Sep 2007 17:14:04 -0400
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <46DBACDD.6060803@gmail.com>
References: <46D7E431.2020100@gmail.com> <46D9C4C3.3070009@gmail.com>
	<46DBACDD.6060803@gmail.com>
Message-ID: <20070904211404.GJ19477@redhat.com>

On Mon, Sep 03, 2007 at 08:42:37AM +0200, carlopmart wrote:
> carlopmart wrote:
> >carlopmart wrote:
> >>Hi all,
> >>
> >> I am running standalone xen host using rhel5 with three rhel5 xen 
> >>guest with cluster-suite. I have setup fence_xvm as a fence device on 
> >>all three guest. On the host side I have setup fence_xvmd on 
> >>cluster.conf file.
> >>
> >> My problems starts when I need to restart xen server host. Every time 
> >>that reboots, fence_xvmd doesn't starts. If I execute "service cman 
> >>restart" all its ok: fence_xvmd starts. Why?? How can I fix it??
> >>
> >>Many thanks.
> >>
> >Please I need an answer about this ...
> >
> >
> 
> Well I think that I found the problem: cman startup script. In this line:
> 
> 
>     # Check for presence of Domain-0; if it's not there, we can't
>     # run xvmd.
>     #
>     xm list --long 2> /dev/null | grep -q "Domain-0" || return 1
> 
> If it is executed from command line any result is returned:
> 
>  [root at xenhost xen]# xm list --long 2> /dev/null | grep -q "Domain-0"
>  [root at xenhost xen]#
> 
> If I put -X under /etc/sysconfig/cman on FENCE_XVMD_OPTS, nothing 
> happens. Is this a bug???

Yes.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Tue Sep  4 21:15:37 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 4 Sep 2007 17:15:37 -0400
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <46DDB01E.4030803@gmail.com>
References: <46D7E431.2020100@gmail.com> <46D9C4C3.3070009@gmail.com>
	<46DBACDD.6060803@gmail.com> <46DDB01E.4030803@gmail.com>
Message-ID: <20070904211537.GK19477@redhat.com>

On Tue, Sep 04, 2007 at 09:21:02PM +0200, carlopmart wrote:
> carlopmart wrote:
> >carlopmart wrote:
> >>carlopmart wrote:
> >>>Hi all,
> >>>
> >>> I am running standalone xen host using rhel5 with three rhel5 xen 
> >>>guest with cluster-suite. I have setup fence_xvm as a fence device on 
> >>>all three guest. On the host side I have setup fence_xvmd on 
> >>>cluster.conf file.
> >>>
> >>> My problems starts when I need to restart xen server host. Every 
> >>>time that reboots, fence_xvmd doesn't starts. If I execute "service 
> >>>cman restart" all its ok: fence_xvmd starts. Why?? How can I fix it??
> >>>
> >>>Many thanks.
> >>>
> >>Please I need an answer about this ...
> >>
> >>
> >
> >Well I think that I found the problem: cman startup script. In this line:
> >
> >
> >    # Check for presence of Domain-0; if it's not there, we can't
> >    # run xvmd.
> >    #
> >    xm list --long 2> /dev/null | grep -q "Domain-0" || return 1
> >
> >If it is executed from command line any result is returned:
> >
> > [root at xenhost xen]# xm list --long 2> /dev/null | grep -q "Domain-0"
> > [root at xenhost xen]#
> >
> >If I put -X under /etc/sysconfig/cman on FENCE_XVMD_OPTS, nothing 
> >happens. Is this a bug???
> >
> 
> Please any hints about this???

It sounds like a bug that is fixed in 5.1 beta.  fence_xvmd needs xend
to be running.

Now, in 5.0, if xend didn't start, fence_xvmd didn't correctly start.

In 5.1 beta, fence_xvmd will wait for xend to start.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From dwalgamo at gmail.com  Tue Sep  4 21:20:35 2007
From: dwalgamo at gmail.com (David Walgamotte)
Date: Tue, 4 Sep 2007 16:20:35 -0500
Subject: [Linux-cluster] howto
Message-ID: <77ad9a6b0709041420t18101f19vceef4ec5b49c98b5@mail.gmail.com>

any1 know of a good howto for web cluster with gfs. I need good step by step
guide as the redhat docs are not working.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070904/2710338d/attachment.htm>

From rpeterso at redhat.com  Tue Sep  4 21:23:43 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Tue, 04 Sep 2007 16:23:43 -0500
Subject: [Linux-cluster] howto
In-Reply-To: <77ad9a6b0709041420t18101f19vceef4ec5b49c98b5@mail.gmail.com>
References: <77ad9a6b0709041420t18101f19vceef4ec5b49c98b5@mail.gmail.com>
Message-ID: <1188941023.661.2.camel@technetium.msp.redhat.com>

On Tue, 2007-09-04 at 16:20 -0500, David Walgamotte wrote:
> any1 know of a good howto for web cluster with gfs. I need good step
> by step guide as the redhat docs are not working. 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

Hi David,

I don't know which Red Hat docs aren't working for you, but here are
some you can try:

http://sources.redhat.com/cluster/doc/nfscookbook.pdf
http://sources.redhat.com/cluster/doc/usage.txt
http://sources.redhat.com/cluster/faq.html

Regards,

Bob Peterson
Red Hat Cluster Suite


From carlopmart at gmail.com  Wed Sep  5 07:09:15 2007
From: carlopmart at gmail.com (carlopmart)
Date: Wed, 05 Sep 2007 09:09:15 +0200
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <20070904211537.GK19477@redhat.com>
References: <46D7E431.2020100@gmail.com>
	<46D9C4C3.3070009@gmail.com>	<46DBACDD.6060803@gmail.com>
	<46DDB01E.4030803@gmail.com> <20070904211537.GK19477@redhat.com>
Message-ID: <46DE561B.7040109@gmail.com>

Lon Hohberger wrote:
> On Tue, Sep 04, 2007 at 09:21:02PM +0200, carlopmart wrote:
>> carlopmart wrote:
>>> carlopmart wrote:
>>>> carlopmart wrote:
>>>>> Hi all,
>>>>>
>>>>> I am running standalone xen host using rhel5 with three rhel5 xen 
>>>>> guest with cluster-suite. I have setup fence_xvm as a fence device on 
>>>>> all three guest. On the host side I have setup fence_xvmd on 
>>>>> cluster.conf file.
>>>>>
>>>>> My problems starts when I need to restart xen server host. Every 
>>>>> time that reboots, fence_xvmd doesn't starts. If I execute "service 
>>>>> cman restart" all its ok: fence_xvmd starts. Why?? How can I fix it??
>>>>>
>>>>> Many thanks.
>>>>>
>>>> Please I need an answer about this ...
>>>>
>>>>
>>> Well I think that I found the problem: cman startup script. In this line:
>>>
>>>
>>>    # Check for presence of Domain-0; if it's not there, we can't
>>>    # run xvmd.
>>>    #
>>>    xm list --long 2> /dev/null | grep -q "Domain-0" || return 1
>>>
>>> If it is executed from command line any result is returned:
>>>
>>> [root at xenhost xen]# xm list --long 2> /dev/null | grep -q "Domain-0"
>>> [root at xenhost xen]#
>>>
>>> If I put -X under /etc/sysconfig/cman on FENCE_XVMD_OPTS, nothing 
>>> happens. Is this a bug???
>>>
>> Please any hints about this???
> 
> It sounds like a bug that is fixed in 5.1 beta.  fence_xvmd needs xend
> to be running.
> 
> Now, in 5.0, if xend didn't start, fence_xvmd didn't correctly start.
> 
> In 5.1 beta, fence_xvmd will wait for xend to start.
> 

Mant thanks Lon. I will wait until rhel 5.1 is released. Meanwhile, i 
will start fence_xvmd manually from rc.local.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From hlawatschek at atix.de  Wed Sep  5 07:48:32 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Wed, 5 Sep 2007 09:48:32 +0200
Subject: [Linux-cluster] howto
In-Reply-To: <1188941023.661.2.camel@technetium.msp.redhat.com>
References: <77ad9a6b0709041420t18101f19vceef4ec5b49c98b5@mail.gmail.com>
	<1188941023.661.2.camel@technetium.msp.redhat.com>
Message-ID: <200709050948.34577.hlawatschek@atix.de>

On Tuesday 04 September 2007 23:23:43 Bob Peterson wrote:
> On Tue, 2007-09-04 at 16:20 -0500, David Walgamotte wrote:
> > any1 know of a good howto for web cluster with gfs. I need good step
> > by step guide as the redhat docs are not working.
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> Hi David,
>
> I don't know which Red Hat docs aren't working for you, but here are
> some you can try:
>
> http://sources.redhat.com/cluster/doc/nfscookbook.pdf
> http://sources.redhat.com/cluster/doc/usage.txt
> http://sources.redhat.com/cluster/faq.html

There's also a howto for setting up a GFS cluster at 
http://open-sharedroot.org/documentation/the-opensharedroot-mini-howto/

Note, that this howto mainly covers the installation of a GFS bases diskless 
sharedroot cluster.


-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From pawel.mastalerz at mainseek.com  Wed Sep  5 08:41:10 2007
From: pawel.mastalerz at mainseek.com (=?UTF-8?B?UGF3ZcWCIE1hc3RhbGVyeg==?=)
Date: Wed, 05 Sep 2007 10:41:10 +0200
Subject: [Linux-cluster] GFS and iscsi problem
In-Reply-To: <7398.83425.qm@web50611.mail.re2.yahoo.com>
References: <7398.83425.qm@web50611.mail.re2.yahoo.com>
Message-ID: <46DE6BA6.9090308@mainseek.com>

Roger Pe?a pisze:
>> Yes, but i use:
>>
>> <fencedevice name="blade" agent="fence_bladecenter"
>> ipaddr=....
>>
> sorry, I didn't read throught your messages, just
> looked at Alexandre Racine's configuration
> 
> 
>> and it's not a problem, fence work fine. Fence work
>> only when one of 
>> nodes is down or have some other problem with
>> connection to other nodes.
> well, I would expect if one node has a problem with
> its GFS filesystem ( for example, network failure in
> the iscsi scenario), the cluster should-must fence
> that node just to avoid filesystem corruption
> but I could be wrong...
> 

Yes should... :) but when one of nodes lost connection to iscsi nothing 
happen, only i cant access to gfs, and gfs dont write to klog info about 
that...

Plz help :)

-- 
Pawel Mastalerz
pawel[dot]mastalerz[at]mainseek[dot]com
http://mainseek.net/


From mgrac at redhat.com  Wed Sep  5 14:04:08 2007
From: mgrac at redhat.com (Marek 'marx' Grac)
Date: Wed, 05 Sep 2007 16:04:08 +0200
Subject: [Linux-cluster] postgres-8 resource
In-Reply-To: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>
Message-ID: <46DEB758.4070007@redhat.com>

Hi,

Hell, Robert wrote:
>
> Aug 30 19:37:06 pg-ba-001 clurgmgrd: [31089]: <err> Trying to execute 
> sudo -u postgres /usr/bin/postmaster -c 
> config_file=/etc/cluster/postgres-8/postgres-8:postgresql_vts1/postgresql.conf 
> -> *some debugging, works fine when executed manually*
>
> Aug 30 19:37:06 pg-ba-001 clurgmgrd: [31089]: <err> Starting Service 
> postgres-8:postgresql_vts1 > Failed
>
> Aug 30 19:37:06 pg-ba-001 clurgmgrd[31089]: <notice> start on 
> postgres-8 "postgresql_vts1" returned 1 (generic error)
>
> Aug 30 19:37:06 pg-ba-001 clurgmgrd[31089]: <warning> #68: Failed to 
> start service:pg-ba-vts1; return value: 1
>
>  
>
> Any ideas how to determine why it won?t start?
>
Sorry for late response (vacantions :)). You found real problems with 
resourge agent for postgres-8, please fill a bug in bugzilla. In the 
attachment is a patch which should work (extract to /usr/share/cluster; 
it fixes postres-8.sh and utils/config-utils.sh). Fixes problems with 
listen_address, directory for pid file and running postmaster on 
background. If it will work I will put in in the CVS.

       Thanks,
    marx


-- 
Marek Grac
Red Hat Czech s.r.o.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: postgres-ra.tgz
Type: application/x-compressed-tar
Size: 4175 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070905/1d1380b4/attachment.bin>

From beres.laszlo at sys-admin.hu  Wed Sep  5 19:02:27 2007
From: beres.laszlo at sys-admin.hu (BERES Laszlo)
Date: Wed, 05 Sep 2007 21:02:27 +0200
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
In-Reply-To: <2007925444.953113@leena>
References: <2007925444.953113@leena>
Message-ID: <46DEFD43.8090206@sys-admin.hu>

isplist at logicore.net wrote:

> What in the world would cause that? There aren't any external services 
> required to fire up my local cluster, never were, it's always been fine 
> before.

Just a silly question: how about name resolution? Is it independent from
the external DNS? Are the members available without nameservers?

-- 
B?RES L?szl?	 RHCE, RHCX
senior IT engineer, trainer


From Alexandre.Racine at mhicc.org  Wed Sep  5 19:55:15 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Wed, 5 Sep 2007 15:55:15 -0400
Subject: [Linux-cluster] gfs-kernel with 2.6.22
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>
	<46DEB758.4070007@redhat.com>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>


Is there someone who is using 2.6.22 with gfs-kernel? (1.03 or 1.04)
All of this was working fine with 2.6.20 ...

I am using gentoo-2.6.22-r5 and was just wondering if I need to bug hunt or not. Thanks.

Error message below...


make[3]: Entering directory `/usr/src/linux-2.6.22-gentoo-r5'
  CC [M]  /var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.o
/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.c: In function 'nolock_plock_get':
/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.c:250: error: too many arguments to function 'posix_test_lock'
make[4]: *** [/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.o] Error 1
make[3]: *** [_module_/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock] Error 2
make[3]: Leaving directory `/usr/src/linux-2.6.22-gentoo-r5'
make[2]: *** [all] Error 2
make[2]: Leaving directory `/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock'
make[1]: *** [all] Error 2
make[1]: Leaving directory `/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src'
make: *** [all] Error 2

!!! ERROR: sys-cluster/gfs-kernel-1.03.00-r1 failed.
Call stack:
  ebuild.sh, line 1638:   Called dyn_compile
  ebuild.sh, line 985:   Called qa_call 'src_compile'
  ebuild.sh, line 44:   Called src_compile
  gfs-kernel-1.03.00-r1.ebuild, line 59:   Called die


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3011 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070905/77df933c/attachment.bin>

From Christopher.Barry at qlogic.com  Wed Sep  5 22:16:32 2007
From: Christopher.Barry at qlogic.com (Christopher Barry)
Date: Wed, 05 Sep 2007 18:16:32 -0400
Subject: [Linux-cluster] Quorum question / split brain paranoia
Message-ID: <1189030593.5447.94.camel@localhost>


Greetings all,

I'm building a 6-node hybrid virtual cluster, and would like a little
advice about quorum concepts, with the goal being functioning of at
least one virtual node, on one physical box, and the complete inability
of split-brain occurring.

I have Two Physical machines (PM) that will host many virtual machines,
however only three VMs per PM will actually be members of the cluster.

_Each_ PM is running VMware ESX 3, with: 
* 3 es4ud5 cluster node VMs
* director node VM in an Active/Passive config
* various other, non-cluster nodes, out of scope.

The diagram of one physical machine with the relevant VMs, virtual
switches and virtual wiring can be seen below.

Each PM is a mirror image of the other:
  (view this as a fixed font)
+------------------------------+
|     PHYSICAL ESX BOX         |
| +-----------------------+    |
| |   VM Cluster Node1    |--+ |
| +-----------------------+  | |
| +-----------------------+  | |
| |   VM Cluster Node2    |--+ |
| +-----------------------+  | |
| +-----------------------+  | |
| |   VM Cluster Node3    |--+ |
| +-----------------------+  | |
|             +---+---+------' |
| +-----------|---|---|---+    |
| |Cluster Virtual Switch |--+ |
| +--------------|--------+  | |
|  10.0.1.0/24   +-----+     | |
| +--------------------|--+  | |
| | Director VM Node (NAT)|  | |
| +---|-------------------+  | |
|   +-'   10.0.0.0/24        | |
| +-|---------------------+  | |
| |Director Virtual Switch|  | |
| +----------|------------+  | |
|            |               | |
|            |       +-------' |
+===[fc0]===[e0]===[e1]========+
      |      |       |
    to SAN   |       `---> x-over cable to mirror box
           To LAN

The cluster nodes will run GFS, the director will not. Only one director
will be active with a VIP, load will balance across all 6 VMs. The
crossover will actually have VLANs on it that will allow a separate
heartbeat net, but it was getting a bit tricky with ASCII art ;)

Can anyone see any issues that may arise where quorum could create a
split brain scenario? What would be the best way to approach votes, etc.
here?


Thanks all for your time.


-- 
Regards,
-C

Christopher Barry
Systems Engineer, Principal
QLogic Corporation
780 Fifth Avenue, Suite 140
King of Prussia, PA   19406
o/f: 610-233-4870 / 4777
  m: 267-242-9306


From basv at sara.nl  Thu Sep  6 06:02:39 2007
From: basv at sara.nl (Bas van der Vlies)
Date: Thu, 6 Sep 2007 08:02:39 +0200
Subject: [Linux-cluster] gfs-kernel with 2.6.22
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>	<46DEB758.4070007@redhat.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>
Message-ID: <46DF97FF.9090305@sara.nl>

Alexandre Racine wrote:
> 
> Is there someone who is using 2.6.22 with gfs-kernel? (1.03 or 1.04)
> All of this was working fine with 2.6.20 ...
> 
The release versions are tight to a kernel version. I know for 1.04 it is 
2.6.20. For 1.03 it 2.6.16 or 2.6.17. If you want to use a newer kernel you 
have to get the source form cvs STABLE branch. I do not know if this 
version compiles against your kernel version because the STABLE branch has 
not many updates lately.


> I am using gentoo-2.6.22-r5 and was just wondering if I need to bug hunt or not. Thanks.
> 
> Error message below...
> 
> 
> make[3]: Entering directory `/usr/src/linux-2.6.22-gentoo-r5'
>   CC [M]  /var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.o
> /var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.c: In function 'nolock_plock_get':
> /var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.c:250: error: too many arguments to function 'posix_test_lock'
> make[4]: *** [/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock/main.o] Error 1
> make[3]: *** [_module_/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock] Error 2
> make[3]: Leaving directory `/usr/src/linux-2.6.22-gentoo-r5'
> make[2]: *** [all] Error 2
> make[2]: Leaving directory `/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src/nolock'
> make[1]: *** [all] Error 2
> make[1]: Leaving directory `/var/tmp/portage/sys-cluster/gfs-kernel-1.03.00-r1/work/cluster-1.03.00/gfs-kernel/src'
> make: *** [all] Error 2
> 
> !!! ERROR: sys-cluster/gfs-kernel-1.03.00-r1 failed.
> Call stack:
>   ebuild.sh, line 1638:   Called dyn_compile
>   ebuild.sh, line 985:   Called qa_call 'src_compile'
>   ebuild.sh, line 44:   Called src_compile
>   gfs-kernel-1.03.00-r1.ebuild, line 59:   Called die
> 
> 
> Alexandre Racine
> Projets sp?ciaux
> 514-461-1300 poste 3304
> alexandre.racine at mhicc.org
> 
> 
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From pcaulfie at redhat.com  Thu Sep  6 08:10:12 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 06 Sep 2007 09:10:12 +0100
Subject: [Linux-cluster] quorum lost in spite of 'leave remove'
In-Reply-To: <Pine.GSO.4.64.0709012141060.10820@sunserv.kfki.hu>
References: <Pine.GSO.4.64.0708311542080.16749@sunserv.kfki.hu>	<46D828EE.5070103@redhat.com>	<Pine.GSO.4.64.0708312035250.6149@sunserv.kfki.hu>	<Pine.GSO.4.64.0708312146500.12565@sunserv.kfki.hu>
	<Pine.GSO.4.64.0709012141060.10820@sunserv.kfki.hu>
Message-ID: <46DFB5E4.2010108@redhat.com>

Hmm, my outgoing email seems to have eaten while I was away...

I did raise a bugzilla for this last week and it has a patch attached. if you
get chance, you might like to try it.


https://bugzilla.redhat.com/show_bug.cgi?id=271701

-- 
Patrick


From hlawatschek at atix.de  Thu Sep  6 08:58:40 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Thu, 6 Sep 2007 10:58:40 +0200
Subject: [Linux-cluster] GFS profiling result
Message-ID: <200709061058.40741.hlawatschek@atix.de>

Hi,

during a performance analysis and tuning session, I did some profiling with 
oprofile on GFS and dlm. 
I got some weird results ... 

The installed software is:
RHEL4u5, kernel 2.6.9-55.0.2.ELsmp
GFS:  2.6.9-72.2.0.2
DLM: 2.6.9-46.16.0.1

The configuration includes 2 clusternodes.

I put the following load on one cluster node:

100 processes are doing in parallel: 
- create 1000 files with 100kb size each (ie altogether we have 100.000 files)
- flock 1000 files
- unlink 1000 files.

The following oprofile output shows, that the system spends about 49% 
(75%*65%*) of the time in gfs_unlinked_get.
Looking into the code whe can see, that this is related to unlinked.c:
     53 9394211 58.7081 :                       ul = list_entry(tmp, struct 
gfs_unlinked, ul_list);

It can also be observed, that dlm spends more than 50% of its time in 
searching for hashes...

Is this the expected behaviour or can this be tuned somewhere ?

Thanks,

Mark

oprofile shows the following:
# opreport --long-filenames --threshold 1
  samples|      %|
------------------
168187984 75.4905 /gfs
 37896161 17.0095 /usr/lib/debug/lib/modules/2.6.9-55.0.2.ELsmp/vmlinux
 11686302  5.2453 /dlm

# opreport image:/gfs -l --threshold 1
110838927 65.8899  gfs_unlinked_get
12918468  7.6796  gfs_unlinked_hold
10958430  6.5144  scan_glock
9448317   5.6167  examine_bucket
5504188   3.2720  gfs_unlinked_unlock
3795382   2.2562  trylock_on_glock
3368017   2.0022  unlock_on_glock
1939971   1.1532  run_queue

# opreport image:/dlm -l --threshold 1
samples  %        symbol name
5853674  50.0875  search_hashchain
3726276  31.8842  search_bucket
1506327  12.8890  __find_lock_by_id

-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From lhh at redhat.com  Thu Sep  6 12:19:26 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 6 Sep 2007 08:19:26 -0400
Subject: [Linux-cluster] Re: fence_xvmd doesn't starts
In-Reply-To: <46DE561B.7040109@gmail.com>
References: <46D7E431.2020100@gmail.com> <46D9C4C3.3070009@gmail.com>
	<46DBACDD.6060803@gmail.com> <46DDB01E.4030803@gmail.com>
	<20070904211537.GK19477@redhat.com> <46DE561B.7040109@gmail.com>
Message-ID: <20070906121926.GD30969@redhat.com>

On Wed, Sep 05, 2007 at 09:09:15AM +0200, carlopmart wrote:
> >It sounds like a bug that is fixed in 5.1 beta.  fence_xvmd needs xend
> >to be running.
> >
> >Now, in 5.0, if xend didn't start, fence_xvmd didn't correctly start.
> >
> >In 5.1 beta, fence_xvmd will wait for xend to start.
> >
> 
> Mant thanks Lon. I will wait until rhel 5.1 is released. Meanwhile, i 
> will start fence_xvmd manually from rc.local.
> 

No problem, though, beta is available and you should test it if you have
time.  More testing over wider audience = better.

(You can just pull fence_xvmd out of the 5.1 beta cman package, if you
want.)


-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Thu Sep  6 12:22:35 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 6 Sep 2007 08:22:35 -0400
Subject: [Linux-cluster] Quorum question / split brain paranoia
In-Reply-To: <1189030593.5447.94.camel@localhost>
References: <1189030593.5447.94.camel@localhost>
Message-ID: <20070906122235.GE30969@redhat.com>

On Wed, Sep 05, 2007 at 06:16:32PM -0400, Christopher Barry wrote:
> 
> The cluster nodes will run GFS, the director will not. Only one director
> will be active with a VIP, load will balance across all 6 VMs. The
> crossover will actually have VLANs on it that will allow a separate
> heartbeat net, but it was getting a bit tricky with ASCII art ;)
> 
> Can anyone see any issues that may arise where quorum could create a
> split brain scenario? What would be the best way to approach votes, etc.
> here?

So, two physical boxes hosting LVS to virtual machines as the real
servers (how ironic, actually...).  Said real server cluster is using
GFS to share the data?

(I want to make sure I understand the question here)

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From Christopher.Barry at qlogic.com  Thu Sep  6 13:30:55 2007
From: Christopher.Barry at qlogic.com (Christopher Barry)
Date: Thu, 06 Sep 2007 09:30:55 -0400
Subject: [Linux-cluster] Quorum question / split brain paranoia
In-Reply-To: <20070906122235.GE30969@redhat.com>
References: <1189030593.5447.94.camel@localhost>
	<20070906122235.GE30969@redhat.com>
Message-ID: <1189085455.5276.4.camel@localhost>

On Thu, 2007-09-06 at 08:22 -0400, Lon Hohberger wrote:
> On Wed, Sep 05, 2007 at 06:16:32PM -0400, Christopher Barry wrote:
> > 
> > The cluster nodes will run GFS, the director will not. Only one director
> > will be active with a VIP, load will balance across all 6 VMs. The
> > crossover will actually have VLANs on it that will allow a separate
> > heartbeat net, but it was getting a bit tricky with ASCII art ;)
> > 
> > Can anyone see any issues that may arise where quorum could create a
> > split brain scenario? What would be the best way to approach votes, etc.
> > here?
> 
> So, two physical boxes hosting LVS to virtual machines as the real
> servers (how ironic, actually...).  Said real server cluster is using
> GFS to share the data?
> 
> (I want to make sure I understand the question here)
> 


Hi Lon,

It is a bit ironic, isn't it ;) Yes, you are correct; the vm
real-servers are sharing a gfs volume.

-- 
Regards,
-C


From orkcu at yahoo.com  Fri Sep  7 02:41:01 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Thu, 6 Sep 2007 19:41:01 -0700 (PDT)
Subject: [Linux-cluster] GFS and SELinux work together in RHEL4?
Message-ID: <93478.57314.qm@web50605.mail.re2.yahoo.com>

Hi

yesterday I upgrade a RHEL4.4 to RHEL4.5 and the
apacher server start complaining about, the logs point
to selinux support in the GFS filesystem that hold the
DocumentRoot of several VHosts

when I try to see the context of the files and
directories I realice that the version of GFS-kernel
did not support selinux.
I am using centos csgfs over RHEL and because Centos
do not have yet the *-kernel packages for the newest
kernel of rhel4.5 I am still running the old kernel. I
am planing to recompile the srpm of the packages fo
the new kernel but first I am trying to find if
GFS-kernel-2.6.9-72.2 bring SELinux support to GFS
filesystems, I could find any hint to confirm a yes or
no. Well the lack of information subjest a "no" :-)

I found this FAQ entry:
http://sourceware.org/cluster/faq.html#gfs_selinux

but I do not know if it is updated :-)

also I found serveral places where it is mention that
SELinux xattr is supported since the end of last year.

so, the question: RH GFS 4.5 bring SELinux support for
GFS silesystems?

thanks
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


      ____________________________________________________________________________________
Park yourself in front of a world of choices in alternative vehicles. Visit the Yahoo! Auto Green Center.
http://autos.yahoo.com/green_center/ 


From kadlec at sunserv.kfki.hu  Fri Sep  7 08:13:57 2007
From: kadlec at sunserv.kfki.hu (Kadlecsik Jozsi)
Date: Fri, 7 Sep 2007 10:13:57 +0200 (MEST)
Subject: [Linux-cluster] quorum lost in spite of 'leave remove'
In-Reply-To: <46DFB5E4.2010108@redhat.com>
References: <Pine.GSO.4.64.0708311542080.16749@sunserv.kfki.hu>
	<46D828EE.5070103@redhat.com>
	<Pine.GSO.4.64.0708312035250.6149@sunserv.kfki.hu>
	<Pine.GSO.4.64.0708312146500.12565@sunserv.kfki.hu>
	<Pine.GSO.4.64.0709012141060.10820@sunserv.kfki.hu>
	<46DFB5E4.2010108@redhat.com>
Message-ID: <Pine.GSO.4.64.0709071012190.21245@sunserv.kfki.hu>

Hi,

On Thu, 6 Sep 2007, Patrick Caulfield wrote:

> I did raise a bugzilla for this last week and it has a patch attached. if you
> get chance, you might like to try it.
> 
> https://bugzilla.redhat.com/show_bug.cgi?id=271701

I tested it and the patch fixes bug and works fine. Thank you very 
much indeed.

Best regards,
Jozsef
--
E-mail : kadlec at sunserv.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary


From frankie.montenegro at gmail.com  Fri Sep  7 10:28:01 2007
From: frankie.montenegro at gmail.com (Frankie Montenegro)
Date: Fri, 7 Sep 2007 06:28:01 -0400
Subject: [Linux-cluster] Small Cluster, Port Trunking, To use switch or not?
Message-ID: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>

Hi everyone,

I am building a small HPC cluster with two slaves and a master. I can
put together two slave nodes with under 350$ per node, so I don't want
to spend more then 70-80$ for networking.

Buying a gigabit ethernet 4 port switch would be the most
straightforward solution. However, I was hoping to get "port trunking"
set up,  and doubling the network speed. Since this is not supported
by switches that are within my budget, I wondered if I can achieve
port trunking without a switch, just
adding couple of   network cards to the master node  and having a
master node be a switch. WIll I be able to get 2Gbps or is this idea
completely idiotic?

Thanks,
F.


From bob.marcan at interstudio.homeunix.net  Fri Sep  7 10:44:04 2007
From: bob.marcan at interstudio.homeunix.net (Bob Marcan)
Date: Fri, 07 Sep 2007 12:44:04 +0200
Subject: [Linux-cluster] Small Cluster, Port Trunking, To use switch or
	not?
In-Reply-To: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>
References: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>
Message-ID: <46E12B74.6030406@interstudio.homeunix.net>

Frankie Montenegro wrote:
> Hi everyone,
> 
> I am building a small HPC cluster with two slaves and a master. I can
> put together two slave nodes with under 350$ per node, so I don't want
> to spend more then 70-80$ for networking.
> 
> Buying a gigabit ethernet 4 port switch would be the most
> straightforward solution. However, I was hoping to get "port trunking"
> set up,  and doubling the network speed. Since this is not supported
> by switches that are within my budget, I wondered if I can achieve
> port trunking without a switch, just
> adding couple of   network cards to the master node  and having a
> master node be a switch. WIll I be able to get 2Gbps or is this idea
> completely idiotic?
> 
> Thanks,
> F.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

/usr/share/doc/kernel-doc-XXXXXXX/Documentation/networking/bonding.txt
Look for mode.

Regards, Bob
-- 
  Bob Marcan, Consultant                mailto:bob.marcan at snt.si
  S&T Slovenija d.d.                    tel:   +386 (1) 5895-300
  Leskoskova cesta 6                    fax:   +386 (1) 5895-202
  1000 Ljubljana, Slovenia              url:   http://www.snt.si


From Alexandre.Racine at mhicc.org  Fri Sep  7 15:17:04 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Fri, 7 Sep 2007 11:17:04 -0400
Subject: [Linux-cluster] users...
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>	<46DEB758.4070007@redhat.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>
	<46DF97FF.9090305@sara.nl>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3462@cumulonimbus.RG.local>

Hi,

I'll install my first SGE soon! Only two little problems and there we go.

One of them is users. In the docs, it stipulate : "Ensure that all users of the grid engine system have the same user names on all submit and execution hosts."

That's good, but do we need to have passwordless login between servers or it is not needed? Or another question would be, how do you manage user accounts on all the cluster servers?


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2689 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070907/6a6ce715/attachment.bin>

From fait at anl.gov  Fri Sep  7 15:16:29 2007
From: fait at anl.gov (James Fait)
Date: Fri, 07 Sep 2007 10:16:29 -0500
Subject: [Linux-cluster] users...
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3462@cumulonimbus.RG.local>
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>	<46DEB758.4070007@redhat.com>	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>	<46DF97FF.9090305@sara.nl>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3462@cumulonimbus.RG.local>
Message-ID: <46E16B4D.1060500@anl.gov>

Alexandre Racine wrote:
> Hi,
>
> I'll install my first SGE soon! Only two little problems and there we go.
>
> One of them is users. In the docs, it stipulate : "Ensure that all users of the grid engine system have the same user names on all submit and execution hosts."
>
> That's good, but do we need to have passwordless login between servers or it is not needed? Or another question would be, how do you manage user accounts on all the cluster servers?
>
>
>
>
>
> Alexandre Racine
> Projets sp?ciaux
> 514-461-1300 poste 3304
> alexandre.racine at mhicc.org
>
>
>   
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
The easiest way is to set up Fedora Directory Services or equivalent for 
LDAP authentication.  That way you do all user administration at one point.

Sincerely

James Fait, Ph.D.
Beamline Scientist, SER-CAT
APS, ANL
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070907/390ed6a6/attachment.htm>

From frankie.montenegro at gmail.com  Fri Sep  7 15:52:31 2007
From: frankie.montenegro at gmail.com (Frankie Montenegro)
Date: Fri, 7 Sep 2007 11:52:31 -0400
Subject: [Linux-cluster] Small Cluster, Port Trunking,
	To use switch or not?
In-Reply-To: <46E12B74.6030406@interstudio.homeunix.net>
References: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>
	<46E12B74.6030406@interstudio.homeunix.net>
Message-ID: <46b1e5210709070852q2ad913cfx4e5e61c59813b497@mail.gmail.com>

Thanks. That will be very usefull when I start putting things together.

Did I understand this howto correctly: either way, switch or not, my
network devices need to be complient with this 802.3ad protocol if I
want to bond them.  RIght?

Well that's a bummer: I can't use  the network cards on board of my
"cheapo" motherboards, which means I have to buy 2 cards per node (
and the cheapest card with support of this
protocol was around 50$) . I guess I better forget about bonding then.

F.

On 9/7/07, Bob Marcan <bob.marcan at interstudio.homeunix.net> wrote:
> Frankie Montenegro wrote:
> > Hi everyone,
> >
> > I am building a small HPC cluster with two slaves and a master. I can
> > put together two slave nodes with under 350$ per node, so I don't want
> > to spend more then 70-80$ for networking.
> >
> > Buying a gigabit ethernet 4 port switch would be the most
> > straightforward solution. However, I was hoping to get "port trunking"
> > set up,  and doubling the network speed. Since this is not supported
> > by switches that are within my budget, I wondered if I can achieve
> > port trunking without a switch, just
> > adding couple of   network cards to the master node  and having a
> > master node be a switch. WIll I be able to get 2Gbps or is this idea
> > completely idiotic?
> >
> > Thanks,
> > F.
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> /usr/share/doc/kernel-doc-XXXXXXX/Documentation/networking/bonding.txt
> Look for mode.
>
> Regards, Bob
> --
>   Bob Marcan, Consultant                mailto:bob.marcan at snt.si
>   S&T Slovenija d.d.                    tel:   +386 (1) 5895-300
>   Leskoskova cesta 6                    fax:   +386 (1) 5895-202
>   1000 Ljubljana, Slovenia              url:   http://www.snt.si
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From pillai at mathstat.dal.ca  Fri Sep  7 16:17:02 2007
From: pillai at mathstat.dal.ca (Balagopal Pillai)
Date: Fri, 07 Sep 2007 13:17:02 -0300
Subject: [Linux-cluster] Small Cluster, Port Trunking,	To use switch or
	not?
In-Reply-To: <46b1e5210709070852q2ad913cfx4e5e61c59813b497@mail.gmail.com>
References: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>	<46E12B74.6030406@interstudio.homeunix.net>
	<46b1e5210709070852q2ad913cfx4e5e61c59813b497@mail.gmail.com>
Message-ID: <46E1797E.3020308@mathstat.dal.ca>

Hi,

For 802.3ad bonding mode, the switch needs to support lacp. Static 
trunking feature on the switch
is not enough. In your case with no switch support, mode 6 or adaptive 
load balancing is a good option.
Round robin is the only mode that will give you more than an interface 
worth of throughput on a single connection.
But that needs some switch support. (like cisco etherchannel for 
example) Also there is additional overhead
due to out of the order packets. The other modes will give better 
aggregate throughput.

Regards
Balagopal

           
Frankie Montenegro wrote:
> Thanks. That will be very usefull when I start putting things together.
>
> Did I understand this howto correctly: either way, switch or not, my
> network devices need to be complient with this 802.3ad protocol if I
> want to bond them.  RIght?
>
> Well that's a bummer: I can't use  the network cards on board of my
> "cheapo" motherboards, which means I have to buy 2 cards per node (
> and the cheapest card with support of this
> protocol was around 50$) . I guess I better forget about bonding then.
>
> F.
>
> On 9/7/07, Bob Marcan <bob.marcan at interstudio.homeunix.net> wrote:
>   
>> Frankie Montenegro wrote:
>>     
>>> Hi everyone,
>>>
>>> I am building a small HPC cluster with two slaves and a master. I can
>>> put together two slave nodes with under 350$ per node, so I don't want
>>> to spend more then 70-80$ for networking.
>>>
>>> Buying a gigabit ethernet 4 port switch would be the most
>>> straightforward solution. However, I was hoping to get "port trunking"
>>> set up,  and doubling the network speed. Since this is not supported
>>> by switches that are within my budget, I wondered if I can achieve
>>> port trunking without a switch, just
>>> adding couple of   network cards to the master node  and having a
>>> master node be a switch. WIll I be able to get 2Gbps or is this idea
>>> completely idiotic?
>>>
>>> Thanks,
>>> F.
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>       
>> /usr/share/doc/kernel-doc-XXXXXXX/Documentation/networking/bonding.txt
>> Look for mode.
>>
>> Regards, Bob
>> --
>>   Bob Marcan, Consultant                mailto:bob.marcan at snt.si
>>   S&T Slovenija d.d.                    tel:   +386 (1) 5895-300
>>   Leskoskova cesta 6                    fax:   +386 (1) 5895-202
>>   1000 Ljubljana, Slovenia              url:   http://www.snt.si
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>     
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   


From rstevens at internap.com  Fri Sep  7 18:01:49 2007
From: rstevens at internap.com (Rick Stevens)
Date: Fri, 07 Sep 2007 11:01:49 -0700
Subject: [Linux-cluster] users...
In-Reply-To: <46E16B4D.1060500@anl.gov>
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>
	<46DEB758.4070007@redhat.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>
	<46DF97FF.9090305@sara.nl>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3462@cumulonimbus.RG.local>
	<46E16B4D.1060500@anl.gov>
Message-ID: <1189188109.29112.28.camel@prophead.corp.publichost.com>

On Fri, 2007-09-07 at 10:16 -0500, James Fait wrote:
> Alexandre Racine wrote: 
> > Hi,
> > 
> > I'll install my first SGE soon! Only two little problems and there we go.
> > 
> > One of them is users. In the docs, it stipulate : "Ensure that all users of the grid engine system have the same user names on all submit and execution hosts."
> > 
> > That's good, but do we need to have passwordless login between servers or it is not needed? Or another question would be, how do you manage user accounts on all the cluster servers?

> The easiest way is to set up Fedora Directory Services or equivalent
> for LDAP authentication.  That way you do all user administration at
> one point.

LDAP is one solution, so's NIS/NIS+ (and a bit easier to set up IMHO).

----------------------------------------------------------------------
- Rick Stevens, Principal Engineer             rstevens at internap.com -
- CDN Systems, Internap, Inc.                http://www.internap.com -
-                                                                    -
-     Better to understand a little than to misunderstand a lot.     -
----------------------------------------------------------------------


From Alexandre.Racine at mhicc.org  Fri Sep  7 18:48:02 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Fri, 7 Sep 2007 14:48:02 -0400
Subject: [Linux-cluster] users...
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>	<46DEB758.4070007@redhat.com>	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>	<46DF97FF.9090305@sara.nl>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3462@cumulonimbus.RG.local>
	<46E16B4D.1060500@anl.gov>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3465@cumulonimbus.RG.local>

So, if I use the GFS, all UID and GID must be the same on all servers of the cluster, right?


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of James Fait
Sent: Fri 2007-09-07 11:16
To: linux clustering
Subject: Re: [Linux-cluster] users...
 
Alexandre Racine wrote:
> Hi,
>
> I'll install my first SGE soon! Only two little problems and there we go.
>
> One of them is users. In the docs, it stipulate : "Ensure that all users of the grid engine system have the same user names on all submit and execution hosts."
>
> That's good, but do we need to have passwordless login between servers or it is not needed? Or another question would be, how do you manage user accounts on all the cluster servers?
>
>
>
>
>
> Alexandre Racine
> Projets sp?ciaux
> 514-461-1300 poste 3304
> alexandre.racine at mhicc.org
>
>
>   
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
The easiest way is to set up Fedora Directory Services or equivalent for 
LDAP authentication.  That way you do all user administration at one point.

Sincerely

James Fait, Ph.D.
Beamline Scientist, SER-CAT
APS, ANL

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3353 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070907/246bcd5c/attachment.bin>

From hlawatschek at atix.de  Fri Sep  7 18:59:06 2007
From: hlawatschek at atix.de (Mark Hlawatschek)
Date: Fri, 7 Sep 2007 20:59:06 +0200
Subject: [Linux-cluster] users...
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3465@cumulonimbus.RG.local>
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>
	<46E16B4D.1060500@anl.gov>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3465@cumulonimbus.RG.local>
Message-ID: <200709072059.06506.hlawatschek@atix.de>

You could create a shared root configuration. That would mean that all cluster 
nodes use the same user database per concept.

have a look at http://open-sharedroot.org/ for details.

Mark

On Friday 07 September 2007 20:48:02 Alexandre Racine wrote:
> So, if I use the GFS, all UID and GID must be the same on all servers of
> the cluster, right?
>
>
> Alexandre Racine
> Projets sp?ciaux
> 514-461-1300 poste 3304
> alexandre.racine at mhicc.org
>
>
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com on behalf of James Fait
> Sent: Fri 2007-09-07 11:16
> To: linux clustering
> Subject: Re: [Linux-cluster] users...
>
> Alexandre Racine wrote:
> > Hi,
> >
> > I'll install my first SGE soon! Only two little problems and there we go.
> >
> > One of them is users. In the docs, it stipulate : "Ensure that all users
> > of the grid engine system have the same user names on all submit and
> > execution hosts."
> >
> > That's good, but do we need to have passwordless login between servers or
> > it is not needed? Or another question would be, how do you manage user
> > accounts on all the cluster servers?
> >
> >
> >
> >
> >
> > Alexandre Racine
> > Projets sp?ciaux
> > 514-461-1300 poste 3304
> > alexandre.racine at mhicc.org
> >
> >
> >
> > ------------------------------------------------------------------------
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> The easiest way is to set up Fedora Directory Services or equivalent for
> LDAP authentication.  That way you do all user administration at one point.
>
> Sincerely
>
> James Fait, Ph.D.
> Beamline Scientist, SER-CAT
> APS, ANL


-- 
Gruss / Regards,

Dipl.-Ing. Mark Hlawatschek
http://www.atix.de/
http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany


From c_triantafillou at hotmail.com  Fri Sep  7 20:56:58 2007
From: c_triantafillou at hotmail.com (Christos Triantafillou)
Date: Fri, 7 Sep 2007 21:56:58 +0100
Subject: [Linux-cluster] DLM - Lock Value Block error
Message-ID: <BAY123-W547DDF480A3D5D464F99B93C50@phx.gbl>

Hi,
 
I am using RHEL 4.5 and DLM 1.0.3 on a 4-node cluster.
 
I noticed the following regarding the LVB:
1. there are two processes: one that sets the LVB of a resource while holding an EX lock
and another one that has a NL lock on the same resource and is blocked on a dlm_lock_wait
for getting a CR lock and reading the LVB.2. when the first process is interrupted with control-C or killed, the second process getsan invalid LVB error.
It seems that DLM falsely releases the resource after the first process is gone and then
the second process reads an uninitialized LVB.
 
Can you please confirm this error and create a bug report if necessary?
 
Kind regards,
Christos Triantafillou
 
_________________________________________________________________
Explore the seven wonders of the world
http://search.msn.com/results.aspx?q=7+wonders+world&mkt=en-US&form=QBRE
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070907/362107ae/attachment.htm>

From rstevens at internap.com  Fri Sep  7 21:28:45 2007
From: rstevens at internap.com (Rick Stevens)
Date: Fri, 07 Sep 2007 14:28:45 -0700
Subject: [Linux-cluster] users...
In-Reply-To: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3465@cumulonimbus.RG.local>
References: <B710F3299F04664DB6B37C258FDEEB94C6509D@FABAMAIL.fabagl.fabasoft.com>
	<46DEB758.4070007@redhat.com>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3457@cumulonimbus.RG.local>
	<46DF97FF.9090305@sara.nl>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3462@cumulonimbus.RG.local>
	<46E16B4D.1060500@anl.gov>
	<C43CF0825BF59D4FBC1F6A2AF45EB88D3C3465@cumulonimbus.RG.local>
Message-ID: <1189200525.31171.23.camel@prophead.corp.publichost.com>

On Fri, 2007-09-07 at 14:48 -0400, Alexandre Racine wrote:
> So, if I use the GFS, all UID and GID must be the same on all servers
> of the cluster, right?

Yes.  They'll all see the same filesystem, so if the UID/GIDs don't
match across all systems, you'll have permissions and file ownership
problems.

> -----Original Message----- 
> From: linux-cluster-bounces at redhat.com on behalf of James Fait 
> Sent: Fri 2007-09-07 11:16 
> To: linux clustering 
> Subject: Re: [Linux-cluster] users... 
>   
> Alexandre Racine wrote: 
> > Hi, 
> > 
> > I'll install my first SGE soon! Only two little problems and there
> we go. 
> > 
> > One of them is users. In the docs, it stipulate : "Ensure that all
> users of the grid engine system have the same user names on all submit
> and execution hosts."
> 
> > 
> > That's good, but do we need to have passwordless login between
> servers or it is not needed? Or another question would be, how do you
> manage user accounts on all the cluster servers?
> 
> > 
> > 
> > 
> > 
> > 
> > Alexandre Racine 
> > Projets sp?ciaux 
> > 514-461-1300 poste 3304 
> > alexandre.racine at mhicc.org 
> > 
> > 
> >    
> >
> ------------------------------------------------------------------------ 
> > 
> > -- 
> > Linux-cluster mailing list 
> > Linux-cluster at redhat.com 
> > https://www.redhat.com/mailman/listinfo/linux-cluster 
> The easiest way is to set up Fedora Directory Services or equivalent
> for  
> LDAP authentication.  That way you do all user administration at one
> point.
> 
> Sincerely
> 
> James Fait, Ph.D. 
> Beamline Scientist, SER-CAT 
> APS, ANL
> 
----------------------------------------------------------------------
- Rick Stevens, Principal Engineer             rstevens at internap.com -
- CDN Systems, Internap, Inc.                http://www.internap.com -
-                                                                    -
-   To understand recursion, you must first understand recursion.    -
----------------------------------------------------------------------


From mhanafi at csc.com  Sat Sep  8 20:34:37 2007
From: mhanafi at csc.com (Mahmoud Hanafi)
Date: Sat, 8 Sep 2007 16:34:37 -0400
Subject: [Linux-cluster] vip device selection
Message-ID: <OFF2318A81.B16E7F70-ON85257350.0070C612-85257350.00710EAA@csc.com>

Cluster nodes having more than 1 network device how do you select which 
device is used for the VIP. 

Mahmoud Hanafi
Sr. System Administrator
CSC HPC COE
Bld. 676
2435 Fifth Street
WPAFB, Ohio 45433
(937) 255-1536


--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
This is a PRIVATE message. If you are not the intended recipient, please 
delete without copying and kindly advise us by e-mail of the mistake in 
delivery. NOTE: Regardless of content, this e-mail shall not operate to 
bind CSC to any order or other contract unless pursuant to explicit 
written agreement or government initiative expressly permitting the use of 
e-mail for such purpose.
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070908/adf2a7b7/attachment.htm>

From pcaulfie at redhat.com  Mon Sep 10 07:44:28 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 10 Sep 2007 08:44:28 +0100
Subject: [Linux-cluster] DLM - Lock Value Block error
In-Reply-To: <BAY123-W547DDF480A3D5D464F99B93C50@phx.gbl>
References: <BAY123-W547DDF480A3D5D464F99B93C50@phx.gbl>
Message-ID: <46E4F5DC.3080209@redhat.com>

Christos Triantafillou wrote:
> Hi,
>  
> I am using RHEL 4.5 and DLM 1.0.3 on a 4-node cluster.
>  
> I noticed the following regarding the LVB:
> 1. there are two processes: one that sets the LVB of a resource while
> holding an EX lock
> and another one that has a NL lock on the same resource and is blocked
> on a dlm_lock_wait
> for getting a CR lock and reading the LVB.
> 2. when the first process is interrupted with control-C or killed, the
> second process gets
> an invalid LVB error.
> 
> It seems that DLM falsely releases the resource after the first process
> is gone and then
> the second process reads an uninitialized LVB.
>  
> Can you please confirm this error and create a bug report if necessary?

I don't know of this bug in particular, though it might be so. Can you raise a
bug and put as much information as possible into it please (example programs,
sample output, and contents of /proc/cluster/dlm_locks on the master node before
and after the incident).

Thanks.


-- 
Patrick


From Alain.Moulle at bull.net  Mon Sep 10 09:14:47 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 10 Sep 2007 11:14:47 +0200
Subject: [Linux-cluster] CS4 U4 / questions about quorum disk 
Message-ID: <46E50B07.7040807@bull.net>

Hi

Some questions about quorum disk :

1. is the quorum disk working correctly on CS4 Update 4 ?
   or is there any known issue which could lead to problems ?

2. when you have two or three shared disk_array between two
   HA nodes, is it needed to have a quorum disk each disk-array
   or is one quorum disk on only one disk_array sufficient ?
   (I think one is sufficient but just to have your opinion ...)

Thanks for your response.
Regards.
Alain Moull?


From claudio.tassini at gmail.com  Mon Sep 10 11:19:25 2007
From: claudio.tassini at gmail.com (Claudio Tassini)
Date: Mon, 10 Sep 2007 13:19:25 +0200
Subject: [Linux-cluster] GFS: drop_count and drop_period tuning
Message-ID: <39fdf1c70709100418j44935e4sd9bae4da92319a11@mail.gmail.com>

Hi all,
I have a four-nodes GFS cluster on RH 4.5 (last versions, updated
yesterday). There are three GFS filesystems ( 1 TB, 450 GB and 5GB), serving
some mail domains with postfix/courier imap in a "maildir" configuration.

As you can suspect, this is not exactly the best for GFS: we have a lot
(thousands) of very small files (emails) in a very lot of directories. I'm
trying to tune up things to reach the best performance. I found that tuning
the drop_count parameter in /proc/cluster/lock_dlm/drop_period , setting it
to a very large value (it was 500000 and now, after a memory upgrade, I've
set it to 1500000 ), uses a lot of memory (about 10GB out of 16 that I've
installed in every machine) and seems to "boost" performance limiting the
iowait CPU usage.

The bad thing is that when I umount a filesystem, it must clean up all that
locks (I think), and sometimes it causes problems to the whole cluster, with
the other nodes that stop writes to the filesystem while I'm umounting on
one node only.
Is this normal? How can I tune this to clean memory faster when I umount the
FS? I've read something about setting more gfs_glockd daemons per fs with
the num_glockd mount option, but it seems to be quite deprecated because it
shouldn't be necessary..


-- 
Claudio Tassini
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070910/19959a63/attachment.htm>

From Vinoda_Kumar at Satyam.com  Mon Sep 10 11:38:10 2007
From: Vinoda_Kumar at Satyam.com (Vinoda_Kumar)
Date: Mon, 10 Sep 2007 17:08:10 +0530
Subject: [Linux-cluster] Cluster Suite on mainframe (s390x)?
Message-ID: <D52BAE0514D9B048BADBCA7639120A840CE877@bhrmsg001.corp.satyam.ad>

Hi All,

 
Is Cluster Suite bundled with RHEL 5 AP for mainframe (systemZ / s390x)?


Thanks

Vinoda Kumar S

Project Lead,

System Computer Services Limited

+91 80 6658 3215

 
DISCLAIMER:
This email (including any attachments) is intended for the sole use of the intended recipient/s and may contain material that is CONFIDENTIAL AND PRIVATE COMPANY INFORMATION. Any review or reliance by others or copying or distribution or forwarding of any or all of the contents in this message is STRICTLY PROHIBITED. If you are not the intended recipient, please contact the sender by email and delete all copies; your cooperation in this regard is appreciated.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070910/9e632cb4/attachment.htm>

From jwilson at transolutions.net  Mon Sep 10 19:44:10 2007
From: jwilson at transolutions.net (James Wilson)
Date: Mon, 10 Sep 2007 14:44:10 -0500
Subject: [Linux-cluster] Cluster not starting backup after reboot
Message-ID: <46E59E8A.4000407@transolutions.net>

I had 2 host cluster up and going over the weekend. I came in today and 
shutdown the cluster and added 2 more hosts to my current cluster. The 
new hosts are xen domU's. When I rebooted everything the cluster will 
not come back up. And my /var/log/messeges file has a lot of these 
errors below. Does anyone know why I would be getting these errors now? 
Any help is appreciated.

ccsd[8297]: Cluster is not quorate.  Refusing connection.
ccsd[8297]: Error while processing connect: Connection refused


From Michael.Hagmann at hilti.com  Mon Sep 10 21:17:56 2007
From: Michael.Hagmann at hilti.com (Hagmann, Michael)
Date: Mon, 10 Sep 2007 23:17:56 +0200
Subject: [Linux-cluster] GFS: drop_count and drop_period tuning
In-Reply-To: <39fdf1c70709100418j44935e4sd9bae4da92319a11@mail.gmail.com>
References: <39fdf1c70709100418j44935e4sd9bae4da92319a11@mail.gmail.com>
Message-ID: <9C203D6FD2BF9D49BFF3450201DEDA5301EACA71@LI-OWL.hag.hilti.com>

Hi
 
When you are on RHEL4.5 then I highly suggest you to use the new
glock_purge Parameter for every gfs Filesystem add to /etc/rc.local
-------
gfs_tool settune / glock_purge 50
gfs_tool settune /scratch glock_purge 50
-------
 
also this Parameter has to set new on every mount. That mean when you
umount it and then mount it again, run the /etc/rc.local again, otherway
the parameter are gone!
 
maybe also checkout this page -->
http://www.open-sharedroot.org/Members/marc/blog/blog-on-gfs/glock-trimm
ing-patch
 
mike

________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Claudio Tassini
Sent: Montag, 10. September 2007 13:19
To: linux clustering
Subject: [Linux-cluster] GFS: drop_count and drop_period tuning


Hi all, 

I have a four-nodes GFS cluster on RH 4.5 (last versions, updated
yesterday). There are three GFS filesystems ( 1 TB, 450 GB and 5GB),
serving some mail domains with postfix/courier imap in a "maildir"
configuration. 

As you can suspect, this is not exactly the best for GFS: we have a lot
(thousands) of very small files (emails) in a very lot of directories.
I'm trying to tune up things to reach the best performance. I found that
tuning the drop_count parameter in /proc/cluster/lock_dlm/drop_period ,
setting it to a very large value (it was 500000 and now, after a memory
upgrade, I've set it to 1500000 ), uses a lot of memory (about 10GB out
of 16 that I've installed in every machine) and seems to "boost"
performance limiting the iowait CPU usage. 

The bad thing is that when I umount a filesystem, it must clean up all
that locks (I think), and sometimes it causes problems to the whole
cluster, with the other nodes that stop writes to the filesystem while
I'm umounting on one node only.  
Is this normal? How can I tune this to clean memory faster when I umount
the FS? I've read something about setting more gfs_glockd daemons per fs
with the num_glockd mount option, but it seems to be quite deprecated
because it shouldn't be necessary.. 


-- 
Claudio Tassini 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070910/ddb8f24b/attachment.htm>

From lhh at redhat.com  Mon Sep 10 21:30:59 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 10 Sep 2007 17:30:59 -0400
Subject: [Linux-cluster] Quorum question / split brain paranoia
In-Reply-To: <1189085455.5276.4.camel@localhost>
References: <1189030593.5447.94.camel@localhost>
	<20070906122235.GE30969@redhat.com>
	<1189085455.5276.4.camel@localhost>
Message-ID: <20070910213059.GF7563@redhat.com>

On Thu, Sep 06, 2007 at 09:30:55AM -0400, Christopher Barry wrote:
> On Thu, 2007-09-06 at 08:22 -0400, Lon Hohberger wrote:
> > On Wed, Sep 05, 2007 at 06:16:32PM -0400, Christopher Barry wrote:
> > > 
> > > The cluster nodes will run GFS, the director will not. Only one director
> > > will be active with a VIP, load will balance across all 6 VMs. The
> > > crossover will actually have VLANs on it that will allow a separate
> > > heartbeat net, but it was getting a bit tricky with ASCII art ;)
> > > 
> > > Can anyone see any issues that may arise where quorum could create a
> > > split brain scenario? What would be the best way to approach votes, etc.
> > > here?
> > 
> > So, two physical boxes hosting LVS to virtual machines as the real
> > servers (how ironic, actually...).  Said real server cluster is using
> > GFS to share the data?
> > 
> > (I want to make sure I understand the question here)
> > 
> 
> 
> Hi Lon,
> 
> It is a bit ironic, isn't it ;) Yes, you are correct; the vm
> real-servers are sharing a gfs volume.

No real issues, but your qdiskd heuristics should be based on "can I
talk to a physical node in the cluster" or something like that.

Basically, you need to implement a solution which will allow all-but-one
node to fail in the "virtual machine" cluster.  This way, if you lose
half of the VMs, you can still maintain a quorum.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 10 21:42:09 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 10 Sep 2007 17:42:09 -0400
Subject: [Linux-cluster] CS4 U4 / questions about quorum disk
In-Reply-To: <46E50B07.7040807@bull.net>
References: <46E50B07.7040807@bull.net>
Message-ID: <20070910214209.GG7563@redhat.com>

On Mon, Sep 10, 2007 at 11:14:47AM +0200, Alain Moulle wrote:
> Hi
> 
> Some questions about quorum disk :
> 
> 1. is the quorum disk working correctly on CS4 Update 4 ?
>    or is there any known issue which could lead to problems ?

I'd recommend using U5.

> 2. when you have two or three shared disk_array between two
>    HA nodes, is it needed to have a quorum disk each disk-array
>    or is one quorum disk on only one disk_array sufficient ?
>    (I think one is sufficient but just to have your opinion ...)

You can only have quorum disk "device".

(Multipathed / mirrored devices should be fine...)

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 10 21:43:25 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 10 Sep 2007 17:43:25 -0400
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <46E59E8A.4000407@transolutions.net>
References: <46E59E8A.4000407@transolutions.net>
Message-ID: <20070910214325.GH7563@redhat.com>

On Mon, Sep 10, 2007 at 02:44:10PM -0500, James Wilson wrote:
> I had 2 host cluster up and going over the weekend. I came in today and 
> shutdown the cluster and added 2 more hosts to my current cluster. The 
> new hosts are xen domU's. When I rebooted everything the cluster will 
> not come back up. And my /var/log/messeges file has a lot of these 
> errors below. Does anyone know why I would be getting these errors now? 
> Any help is appreciated.
> 
> ccsd[8297]: Cluster is not quorate.  Refusing connection.
> ccsd[8297]: Error while processing connect: Connection refused

You need at least 3 nodes online and the configuration file version #
matching on all of them.  I'd start checking there.

-- Lon

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From smeacham at charter.net  Mon Sep 10 21:59:31 2007
From: smeacham at charter.net (smeacham at charter.net)
Date: Mon, 10 Sep 2007 21:59:31 +0000
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <20070910214325.GH7563@redhat.com>
References: <46E59E8A.4000407@transolutions.net><20070910214325.GH7563@redhat.com>
Message-ID: <1381753941-1189461572-cardhu_decombobulator_blackberry.rim.net-1440139959-@bxe019.bisx.prod.on.blackberry>


Sent via BlackBerry by AT&T

-----Original Message-----
From: Lon Hohberger <lhh at redhat.com>

Date: Mon, 10 Sep 2007 17:43:25 
To:jwilson at transolutions.net,linux clustering <linux-cluster at redhat.com>
Subject: Re: [Linux-cluster] Cluster not starting backup after reboot


On Mon, Sep 10, 2007 at 02:44:10PM -0500, James Wilson wrote:
> I had 2 host cluster up and going over the weekend. I came in today and 
> shutdown the cluster and added 2 more hosts to my current cluster. The 
> new hosts are xen domU's. When I rebooted everything the cluster will 
> not come back up. And my /var/log/messeges file has a lot of these 
> errors below. Does anyone know why I would be getting these errors now? 
> Any help is appreciated.
> 
> ccsd[8297]: Cluster is not quorate.  Refusing connection.
> ccsd[8297]: Error while processing connect: Connection refused

You need at least 3 nodes online and the configuration file version #
matching on all of them.  I'd start checking there.

-- Lon

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From jwilson at transolutions.net  Mon Sep 10 22:18:37 2007
From: jwilson at transolutions.net (James Wilson)
Date: Mon, 10 Sep 2007 17:18:37 -0500
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <1381753941-1189461572-cardhu_decombobulator_blackberry.rim.net-1440139959-@bxe019.bisx.prod.on.blackberry>
References: <46E59E8A.4000407@transolutions.net><20070910214325.GH7563@redhat.com>
	<1381753941-1189461572-cardhu_decombobulator_blackberry.rim.net-1440139959-@bxe019.bisx.prod.on.blackberry>
Message-ID: <46E5C2BD.7090705@transolutions.net>

When I remove the xen domU's from the configuration everything comes up 
fine. Should the domU's be apart of their own cluster? But then I 
wouldn't be able to mount gfs from the dom0 right?

smeacham at charter.net wrote:
> Sent via BlackBerry by AT&T
>
> -----Original Message-----
> From: Lon Hohberger <lhh at redhat.com>
>
> Date: Mon, 10 Sep 2007 17:43:25 
> To:jwilson at transolutions.net,linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] Cluster not starting backup after reboot
>
>
> On Mon, Sep 10, 2007 at 02:44:10PM -0500, James Wilson wrote:
>   
>> I had 2 host cluster up and going over the weekend. I came in today and 
>> shutdown the cluster and added 2 more hosts to my current cluster. The 
>> new hosts are xen domU's. When I rebooted everything the cluster will 
>> not come back up. And my /var/log/messeges file has a lot of these 
>> errors below. Does anyone know why I would be getting these errors now? 
>> Any help is appreciated.
>>
>> ccsd[8297]: Cluster is not quorate.  Refusing connection.
>> ccsd[8297]: Error while processing connect: Connection refused
>>     
>
> You need at least 3 nodes online and the configuration file version #
> matching on all of them.  I'd start checking there.
>
> -- Lon
>
>   


From orkcu at yahoo.com  Tue Sep 11 01:26:33 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Mon, 10 Sep 2007 18:26:33 -0700 (PDT)
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <46E5C2BD.7090705@transolutions.net>
Message-ID: <449347.296.qm@web50604.mail.re2.yahoo.com>


--- James Wilson <jwilson at transolutions.net> wrote:

> When I remove the xen domU's from the configuration
> everything comes up 
> fine. Should the domU's be apart of their own
> cluster? But then I 
> wouldn't be able to mount gfs from the dom0 right?
if you remove the 2 domU, then your cluster will be
quorate with the other 2 nodes but
if you add the 2 domUs, _and_ none of then join the
cluster, the cluster will not be quorate :-(

I sujest adding one-by-one domUs, because if you add
just one domU to the cluster, it become a 3 node
cluster and will quorated with just 2 nodes (the old
ones), ultil your 1es domU join succefully the cluster
don try to add the second domU.

 
check the firewall of the domUs (comunications between
the nodes)

cu
roger


> 
> smeacham at charter.net wrote:
> > Sent via BlackBerry by AT&T
> >
> > -----Original Message-----
> > From: Lon Hohberger <lhh at redhat.com>
> >
> > Date: Mon, 10 Sep 2007 17:43:25 
> > To:jwilson at transolutions.net,linux clustering
> <linux-cluster at redhat.com>
> > Subject: Re: [Linux-cluster] Cluster not starting
> backup after reboot
> >
> >
> > On Mon, Sep 10, 2007 at 02:44:10PM -0500, James
> Wilson wrote:
> >   
> >> I had 2 host cluster up and going over the
> weekend. I came in today and 
> >> shutdown the cluster and added 2 more hosts to my
> current cluster. The 
> >> new hosts are xen domU's. When I rebooted
> everything the cluster will 
> >> not come back up. And my /var/log/messeges file
> has a lot of these 
> >> errors below. Does anyone know why I would be
> getting these errors now? 
> >> Any help is appreciated.
> >>
> >> ccsd[8297]: Cluster is not quorate.  Refusing
> connection.
> >> ccsd[8297]: Error while processing connect:
> Connection refused
> >>     
> >
> > You need at least 3 nodes online and the
> configuration file version #
> > matching on all of them.  I'd start checking
> there.
> >
> > -- Lon
> >
> >   
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Be a better Heartthrob. Get better relationship answers from someone who knows. Yahoo! Answers - Check it out. 
http://answers.yahoo.com/dir/?link=list&sid=396545433


From bernard.chew at muvee.com  Tue Sep 11 03:54:42 2007
From: bernard.chew at muvee.com (Bernard Chew)
Date: Tue, 11 Sep 2007 11:54:42 +0800
Subject: [Linux-cluster] See DLM locks that are held 
Message-ID: <229C73600EB0E54DA818AB599482BCE901AC5A42@shadowfax.sg.muvee.net>

Hi,

I have a cluster with 4 nodes each running RHEL5. I remember able to see
the DLM locks held in RHEL4 by echo the lockspace name into
/proc/cluster/dlm_locks. How do I do this in RHEL5? I cannot see any
cluster related directory in /proc.

Regards,
Bernard Chew


From pcaulfie at redhat.com  Tue Sep 11 07:02:48 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 11 Sep 2007 08:02:48 +0100
Subject: [Linux-cluster] See DLM locks that are held
In-Reply-To: <229C73600EB0E54DA818AB599482BCE901AC5A42@shadowfax.sg.muvee.net>
References: <229C73600EB0E54DA818AB599482BCE901AC5A42@shadowfax.sg.muvee.net>
Message-ID: <46E63D98.9040604@redhat.com>

Bernard Chew wrote:
> Hi,
> 
> I have a cluster with 4 nodes each running RHEL5. I remember able to see
> the DLM locks held in RHEL4 by echo the lockspace name into
> /proc/cluster/dlm_locks. How do I do this in RHEL5? I cannot see any
> cluster related directory in /proc.

Mount debugfs (eg on /debug) then look in

/debug/dlm/<lockspace>/locks

-- 
Patrick


From bernard.chew at muvee.com  Tue Sep 11 08:03:49 2007
From: bernard.chew at muvee.com (Bernard Chew)
Date: Tue, 11 Sep 2007 16:03:49 +0800
Subject: [Linux-cluster] See DLM locks that are held
In-Reply-To: <46E63D98.9040604@redhat.com>
References: <229C73600EB0E54DA818AB599482BCE901AC5A42@shadowfax.sg.muvee.net>
	<46E63D98.9040604@redhat.com>
Message-ID: <229C73600EB0E54DA818AB599482BCE901AC5AEA@shadowfax.sg.muvee.net>

> Bernard Chew wrote:
> Hi,
> 
> I have a cluster with 4 nodes each running RHEL5. I remember able to
see
> the DLM locks held in RHEL4 by echo the lockspace name into
> /proc/cluster/dlm_locks. How do I do this in RHEL5? I cannot see any
> cluster related directory in /proc.
>
> Mount debugfs (eg on /debug) then look in
>
> /debug/dlm/<lockspace>/locks
> 
> -- 
> Patrick

Thanks Patrick!

Regards,
Bernard Chew


From claudio.tassini at gmail.com  Tue Sep 11 08:35:43 2007
From: claudio.tassini at gmail.com (Claudio Tassini)
Date: Tue, 11 Sep 2007 10:35:43 +0200
Subject: [Linux-cluster] GFS: drop_count and drop_period tuning
In-Reply-To: <9C203D6FD2BF9D49BFF3450201DEDA5301EACA71@LI-OWL.hag.hilti.com>
References: <39fdf1c70709100418j44935e4sd9bae4da92319a11@mail.gmail.com>
	<9C203D6FD2BF9D49BFF3450201DEDA5301EACA71@LI-OWL.hag.hilti.com>
Message-ID: <39fdf1c70709110135n7e50bb81p83237ff901b8bc87@mail.gmail.com>

Thanks Michael, I've set this option on my filesystems. How should this
impact to the system performance/behaviour? More/less memory usage? I guess
that, by trimming the 50% of unused locks every 5 secs, it should cut off
memory usage too.. am I right?
If this works, I could also raise the drop_count value?

2007/9/10, Hagmann, Michael <Michael.Hagmann at hilti.com>:
>
>  Hi
>
> When you are on RHEL4.5 then I highly suggest you to use the new
> glock_purge Parameter for every gfs Filesystem add to /etc/rc.local
> -------
> gfs_tool settune / glock_purge 50
> gfs_tool settune /scratch glock_purge 50
> -------
>
> also this Parameter has to set new on every mount. That mean when you
> umount it and then mount it again, run the /etc/rc.local again, otherway the
> parameter are gone!
>
> maybe also checkout this page --> http://www.open-sharedroot.org
> /Members/marc/blog/blog-on-gfs/glock-trimming-patch
>
> mike
>
>  ------------------------------
> *From:* linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat <linux-cluster-bounces at redhat.com>.com<linux-cluster-bounces at redhat.com>]
> *On Behalf Of *Claudio Tassini
> *Sent:* Montag, 10. September 2007 13:19
> *To:* linux clustering
> *Subject:* [Linux-cluster] GFS: drop_count and drop_period tuning
>
> Hi all,
>
> I have a four-nodes GFS cluster on RH 4.5 (last versions, updated
> yesterday). There are three GFS filesystems ( 1 TB, 450 GB and 5GB), serving
> some mail domains with postfix/courier imap in a "maildir" configuration.
>
>
> As you can suspect, this is not exactly the best for GFS: we have a lot
> (thousands) of very small files (emails) in a very lot of directories. I'm
> trying to tune up things to reach the best performance. I found that tuning
> the drop_count parameter in /proc/cluster/lock_dlm/drop_period , setting
> it to a very large value (it was 500000 and now, after a memory upgrade,
> I've set it to 1500000 ), uses a lot of memory (about 10GB out of 16 that
> I've installed in every machine) and seems to "boost" performance limiting
> the iowait CPU usage.
>
>
> The bad thing is that when I umount a filesystem, it must clean up all
> that locks (I think), and sometimes it causes problems to the whole cluster,
> with the other nodes that stop writes to the filesystem while I'm umounting
> on one node only.
> Is this normal? How can I tune this to clean memory faster when I umount
> the FS? I've read something about setting more gfs_glockd daemons per fs
> with the num_glockd mount option, but it seems to be quite deprecated
> because it shouldn't be necessary..
>
>
>
>
> --
> Claudio Tassini
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman<https://www.redhat.com/mailman/listinfo/linux-cluster>
> /listinfo/linux-cluster<https://www.redhat.com/mailman/listinfo/linux-cluster>
>


-- 
Claudio Tassini
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070911/36e0d01d/attachment.htm>

From jwilson at transolutions.net  Tue Sep 11 13:18:08 2007
From: jwilson at transolutions.net (James Wilson)
Date: Tue, 11 Sep 2007 08:18:08 -0500
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <449347.296.qm@web50604.mail.re2.yahoo.com>
References: <449347.296.qm@web50604.mail.re2.yahoo.com>
Message-ID: <46E69590.3070707@transolutions.net>

I think I have the wrong syntax for fencing xen. Do I add <fence_xvmd/> 
to the config file or <xvmd/>?

Roger Pe?a wrote:
> --- James Wilson <jwilson at transolutions.net> wrote:
>
>   
>> When I remove the xen domU's from the configuration
>> everything comes up 
>> fine. Should the domU's be apart of their own
>> cluster? But then I 
>> wouldn't be able to mount gfs from the dom0 right?
>>     
> if you remove the 2 domU, then your cluster will be
> quorate with the other 2 nodes but
> if you add the 2 domUs, _and_ none of then join the
> cluster, the cluster will not be quorate :-(
>
> I sujest adding one-by-one domUs, because if you add
> just one domU to the cluster, it become a 3 node
> cluster and will quorated with just 2 nodes (the old
> ones), ultil your 1es domU join succefully the cluster
> don try to add the second domU.
>
>  
> check the firewall of the domUs (comunications between
> the nodes)
>
> cu
> roger
>
>
>   
>> smeacham at charter.net wrote:
>>     
>>> Sent via BlackBerry by AT&T
>>>
>>> -----Original Message-----
>>> From: Lon Hohberger <lhh at redhat.com>
>>>
>>> Date: Mon, 10 Sep 2007 17:43:25 
>>> To:jwilson at transolutions.net,linux clustering
>>>       
>> <linux-cluster at redhat.com>
>>     
>>> Subject: Re: [Linux-cluster] Cluster not starting
>>>       
>> backup after reboot
>>     
>>> On Mon, Sep 10, 2007 at 02:44:10PM -0500, James
>>>       
>> Wilson wrote:
>>     
>>>   
>>>       
>>>> I had 2 host cluster up and going over the
>>>>         
>> weekend. I came in today and 
>>     
>>>> shutdown the cluster and added 2 more hosts to my
>>>>         
>> current cluster. The 
>>     
>>>> new hosts are xen domU's. When I rebooted
>>>>         
>> everything the cluster will 
>>     
>>>> not come back up. And my /var/log/messeges file
>>>>         
>> has a lot of these 
>>     
>>>> errors below. Does anyone know why I would be
>>>>         
>> getting these errors now? 
>>     
>>>> Any help is appreciated.
>>>>
>>>> ccsd[8297]: Cluster is not quorate.  Refusing
>>>>         
>> connection.
>>     
>>>> ccsd[8297]: Error while processing connect:
>>>>         
>> Connection refused
>>     
>>>>     
>>>>         
>>> You need at least 3 nodes online and the
>>>       
>> configuration file version #
>>     
>>> matching on all of them.  I'd start checking
>>>       
>> there.
>>     
>>> -- Lon
>>>
>>>   
>>>       
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>>
>>     
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   
>
>
> __________________________________________
> RedHat Certified ( RHCE )
> Cisco Certified ( CCNA & CCDA )
>
>
>        
> ____________________________________________________________________________________
> Be a better Heartthrob. Get better relationship answers from someone who knows. Yahoo! Answers - Check it out. 
> http://answers.yahoo.com/dir/?link=list&sid=396545433
>
>   


From orkcu at yahoo.com  Tue Sep 11 14:02:19 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Tue, 11 Sep 2007 07:02:19 -0700 (PDT)
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <46E69590.3070707@transolutions.net>
Message-ID: <517240.11618.qm@web50606.mail.re2.yahoo.com>


--- James Wilson <jwilson at transolutions.net> wrote:

> I think I have the wrong syntax for fencing xen. Do
> I add <fence_xvmd/> 
> to the config file or <xvmd/>?
> 

I think, but I could be wrong, that for the purpose of
"joining a cluster" a fence configuration for a node
is not sooo important

I would try to garantie the comunication between
nodes, and the instalation of proper *-kernel packages
according to the kernel running at the node

just yesterday I had a funny problem, funny because
cman report that que node join the cluster (and the
cluster was quorated with its all 4 nodes in, the
others nodes reported the problematic node as
'online') but ccsd was saying the opposite and refuse
to listen to connections (complaining about "can't
comunicate with cluster infrasture...") so everything
wasn't unable to start (fence, clvmd, rgmanager, etc,
etc)

the problem was:
running kernel-smp in the node but
installed cman-kernel and not cman-kernel-smp ;-)
so cman kernel module was not loaded ...
BTW, cman start succefull without complain about not
able to load its kernel module ....

(I really don't like to top posting but how to follow
the threat if not do it? )

> Roger Pe?a wrote:
> > --- James Wilson <jwilson at transolutions.net>
> wrote:
> >
> >   
> >> When I remove the xen domU's from the
> configuration
> >> everything comes up 
> >> fine. Should the domU's be apart of their own
> >> cluster? But then I 
> >> wouldn't be able to mount gfs from the dom0
> right?
> >>     
> > if you remove the 2 domU, then your cluster will
> be
> > quorate with the other 2 nodes but
> > if you add the 2 domUs, _and_ none of then join
> the
> > cluster, the cluster will not be quorate :-(
> >
> > I sujest adding one-by-one domUs, because if you
> add
> > just one domU to the cluster, it become a 3 node
> > cluster and will quorated with just 2 nodes (the
> old
> > ones), ultil your 1es domU join succefully the
> cluster
> > don try to add the second domU.
> >
> >  
> > check the firewall of the domUs (comunications
> between
> > the nodes)
> >
> > cu
> > roger
> >
> >
> >   
> >> smeacham at charter.net wrote:
> >>     
> >>> Sent via BlackBerry by AT&T
> >>>
> >>> -----Original Message-----
> >>> From: Lon Hohberger <lhh at redhat.com>
> >>>
> >>> Date: Mon, 10 Sep 2007 17:43:25 
> >>> To:jwilson at transolutions.net,linux clustering
> >>>       
> >> <linux-cluster at redhat.com>
> >>     
> >>> Subject: Re: [Linux-cluster] Cluster not
> starting
> >>>       
> >> backup after reboot
> >>     
> >>> On Mon, Sep 10, 2007 at 02:44:10PM -0500, James
> >>>       
> >> Wilson wrote:
> >>     
> >>>   
> >>>       
> >>>> I had 2 host cluster up and going over the
> >>>>         
> >> weekend. I came in today and 
> >>     
> >>>> shutdown the cluster and added 2 more hosts to
> my
> >>>>         
> >> current cluster. The 
> >>     
> >>>> new hosts are xen domU's. When I rebooted
> >>>>         
> >> everything the cluster will 
> >>     
> >>>> not come back up. And my /var/log/messeges file
> >>>>         
> >> has a lot of these 
> >>     
> >>>> errors below. Does anyone know why I would be
> >>>>         
> >> getting these errors now? 
> >>     
> >>>> Any help is appreciated.
> >>>>
> >>>> ccsd[8297]: Cluster is not quorate.  Refusing
> >>>>         
> >> connection.
> >>     
> >>>> ccsd[8297]: Error while processing connect:
> >>>>         
> >> Connection refused
> >>     
> >>>>     
> >>>>         
> >>> You need at least 3 nodes online and the
> >>>       
> >> configuration file version #
> >>     
> >>> matching on all of them.  I'd start checking
> >>>       
> >> there.
> >>     
> >>> -- Lon
cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Need a vacation? Get great deals
to amazing places on Yahoo! Travel.
http://travel.yahoo.com/


From jparsons at redhat.com  Tue Sep 11 14:32:48 2007
From: jparsons at redhat.com (James Parsons)
Date: Tue, 11 Sep 2007 10:32:48 -0400
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <46E69590.3070707@transolutions.net>
References: <449347.296.qm@web50604.mail.re2.yahoo.com>
	<46E69590.3070707@transolutions.net>
Message-ID: <46E6A710.8050408@redhat.com>

James Wilson wrote:

> I think I have the wrong syntax for fencing xen. Do I add 
> <fence_xvmd/> to the config file or <xvmd/>?

The <fence_xvmd/> tag should be in the cluster.conf file as a child of 
the <cluster> tag, in the dom0 cluster. This  just tells the outside 
cluster that vm fencing is going to be employed, so the fence_xvm daemon 
is started and begins listening for distress from the virtual cluster it 
is hosting.

BTW, I am pretty sure that if you include this tag and you do NOT have a 
virtual cluster set up (yet), then nothing bad happens except that a few 
cpu cycles are stolen from the dom0 machine running the daemon for 
nothing. The inverse, however, is not true. A virtual cluster cannot be 
depended upon without the daemon running on the physical host(s).

I hope this explanation buys you some insight. The simple reason for all 
of this, is that DomU machines do not know they are virtual and they 
cannot call 'vm destroy' on another vm even if they did know. They call 
the fence_xvm fence agent when there is trouble, and this agent calls 
out to the fence_xvm daemon running in the outer physical cluster and 
asks it to please shut a particular VM down.

Perhaps xen kernels should include Kierkagaard and Sarte libraries for 
helping them deal with their isolation, alienation, and dreaded lonliness.

-J


From furor_hater at hotmail.com  Tue Sep 11 19:32:07 2007
From: furor_hater at hotmail.com (notol Perc)
Date: Tue, 11 Sep 2007 19:32:07 +0000
Subject: [Linux-cluster] GNBD Problems loading module
Message-ID: <BAY121-F37184D72844EF4310B3C3286C10@phx.gbl>

Using the latest CVS Cluster Source (09-11-2007) I have configured a cluster 
on kernel 2.6.23-rc5 (running under Debian Etch)

I can get everything running short of importing GNBD due to the fact that I 
can not find the kernal module.


I can directly make cluster/gnbd-kernel/src/ I get the following:

make -C /usr/src/linux-2.6.23-rc5 M=/usr/src/cluster/gnbd-kernel/src 
symverfile=/usr/src/linux-2.6.23-rc5/Module.symvers modules USING_KBUILD=yes
make[1]: Entering directory `/usr/src/linux-2.6.23-rc5'
  Building modules, stage 2.
  MODPOST 1 modules
make[1]: Leaving directory `/usr/src/linux-2.6.23-rc5'

then make install

make -C /usr/src/linux-2.6.23-rc5 M=/usr/src/cluster/gnbd-kernel/src 
symverfile=/usr/src/linux-2.6.23-rc5/Module.symvers modules USING_KBUILD=yes
make[1]: Entering directory `/usr/src/linux-2.6.23-rc5'
  Building modules, stage 2.
  MODPOST 1 modules
make[1]: Leaving directory `/usr/src/linux-2.6.23-rc5'
install -d /usr/include/linux
install gnbd.h /usr/include/linux
install -d /lib/modules/`uname -r`/kernel/drivers/block/gnbd
install gnbd.ko /lib/modules/`uname -r`/kernel/drivers/block/gnbd

Ca some one pleas help be get this going?

_________________________________________________________________
Get a FREE small business Web site and more from Microsoft? Office Live! 
http://clk.atdmt.com/MRT/go/aub0930003811mrt/direct/01/


From Abdel.Sadek at lsi.com  Tue Sep 11 21:27:16 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Tue, 11 Sep 2007 15:27:16 -0600
Subject: [Linux-cluster] fence_scsi agent on RHEL 4.5
Message-ID: <C776378855970A4DADE4A476447F6391DEFB64@NAMAIL3.ad.lsil.com>

I am running a 2-node cluster with RHEL 4.5 Native cluster. I am using
scsi persistent reservation as my fencing device. I have noticed when I
shutdown one of the nodes, the fence_scsi agent on the surviving node
fails to fence the dying node. I get the following message:
Sep 11 16:18:13 troy fenced[3614]: agent "fence_scsi" reports: parse
error: unknown option "nodename=porsche"
Sep 11 16:18:13 troy fenced[3614]: fence "porsche" failed
 
it looks like the fence_scsi command is executed using with the nodename
parameter instead of the -n option.
when I run fence_scsi  -h I get the following (there is no nodename
parameter)
Usage
fence_scsi [options]
Options
  -n <node>        IP address or hostname of node to fence
  -h               usage
  -V               version
  -v               verbose
 
But the man page of the fence_scsi command talks about using both the
"-n" and "nodename=" options.
So, how do I make the fence_scsi run with the -n instead of the
nodename= option?
 
Thanks.
Abdel...
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070911/068386c2/attachment.htm>

From Joel.Becker at oracle.com  Tue Sep 11 23:46:08 2007
From: Joel.Becker at oracle.com (Joel Becker)
Date: Tue, 11 Sep 2007 16:46:08 -0700
Subject: [Linux-cluster] changing configuration
Message-ID: <20070911234607.GD27482@tasint.org>

Hey everyone,
	How do I update the IP addresses of existing nodes?
	I have a simple cluster.  I had two nodes on a private network
(10.x.x.x).  I decided to add two more nodes, but they are only on the
public network.  So I wanted to add them as well as change the existing
nodes to use the public network.
	I shut down cman/ccs on all nodes.  I edited cluster.conf.  I
started cman back on one node, and I ensured that cman_tool went to the
new version of the config via "cman_tool version -r N+1".
	The problem is that it still appears to be using the private
network addresses.  I see this in the log and with "cman_tool nodes -a".
	What can I do to fix this, short of hunting down all cman and
openais droppings and removing them?  I want the "right" way :-)

Joel

-- 

"To fall in love is to create a religion that has a fallible god."
        -Jorge Luis Borges

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From orkcu at yahoo.com  Wed Sep 12 01:42:32 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Tue, 11 Sep 2007 18:42:32 -0700 (PDT)
Subject: [Linux-cluster] RHEL4.5, GFS and selinux, are they playing nice?
Message-ID: <724236.51256.qm@web50608.mail.re2.yahoo.com>

Hello everybody ;-)

I keep working in making a web cluster play nice after
the upgrade from RHEL4.4 -> RHEL4.5 
with this upgrade, the relation httpd-selinux become
more strict, my first problem came when the RHGFS4.4
do not support xattr (our web content is in a gfs
filesystem) so I must update RHGFS and RHCS to 4.5
(from centos recompilation)

so now I have support to xattr in ours GFS filesystems
but, here is the problem:
the httpd do not want to start because some config
files (witch reside in another GFS filesystem) have a
forbidden context (httpd can not read file with that
context) (those files are included from the main
apache configuration)
even if I change the context and ls -Z show me that I
change the context for every parent and final dir in
the GFS filesystem.
here are the error from selinux:
{ search } for  pid=2289 comm="httpd" name="/"
dev=dm-7 ino=25  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  
tclass=dir

as you can see, selinux is dening access to httpd
process to make a search in / (root of the filesystem
in device dm-7), with inode 25 and that inode is a
directory, it deny access because the context of that
directory is system_u:object_r:nfs_t 
 am I right?

but, that directory is /opt/soft:
ll -di /opt/soft/
25 drwxr-xr-x  8 root root 3864 Sep 11  2007
/opt/soft/
^^ <--- this is the inode

and it context is system_u:object_r:httpd_config_t:
ll -dZ /opt/soft/
drwxr-xr-x  root     root    
system_u:object_r:httpd_config_t /opt/soft/

so, who is wrong? ls -Z or "global selinux kernel
module" ?
because ls -Z show that the context of that directory
is system_u:object_r:httpd_config_t

if I set selinux to be in permissive mode, then apache
can start, of course, but with some complains like
this:

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.151:38): avc:  denied  { search } for
 pid=2333 comm="httpd" name="/" dev=dm-7 ino=25  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  tclass=dir

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.155:39): avc:  denied  { getattr }
for  pid=2333 comm="httpd" name="apache" dev=dm-7
ino=31  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  tclass=dir

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.155:40): avc:  denied  { read } for 
pid=2333 comm="httpd" name="apache" dev=dm-7 ino=31  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  tclass=dir

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.158:41): avc:  denied  { getattr }
for  pid=2333 comm="httpd" name="httpd.conf" dev=dm-7 

ino=484983 scontext=root:system_r:httpd_t  
tcontext=system_u:object_r:nfs_t tclass=file

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.158:42): avc:  denied  { read } for 
pid=2333 comm="httpd" name="httpd.conf" dev=dm-7  
ino=484983 scontext=root:system_r:httpd_t  
tcontext=system_u:object_r:nfs_t tclass=file

this mean:
access deny to do 
1- search in /opt/soft
2- getattr and read directory /opt/soft/conf/apache
3- getattr and read file httpd.conf

but:
all this files or directory has context 
system_u:object_r:httpd_config_t 

ll -dZ /opt/soft/conf/apache/
drwxr-xr-x  root root system_u:object_r:httpd_config_t
 
/opt/soft/conf/apache/

ll -di /opt/soft/conf/apache/
31 drwxr-xr-x  2 root root 3864 Sep 11 09:44
/opt/soft/conf/apache/


is this related to the fact that selinux policy stated
this:
genfscon gfs /                 system_u:object_r:nfs_t

what do you recomment to solve this complains of
selinux?
mount the gfs filesystem with the option fscontext ?

but that filesystem has other stuff, not related with
apache, so, what context should I use?


thanks
roger


__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


      ____________________________________________________________________________________
Don't let your dream ride pass you by. Make it a reality with Yahoo! Autos.
http://autos.yahoo.com/index.html
 

From alain.richard at equation.fr  Wed Sep 12 05:05:43 2007
From: alain.richard at equation.fr (Alain Richard)
Date: Wed, 12 Sep 2007 07:05:43 +0200
Subject: [Linux-cluster] RE: qdisk votes not in cman
In-Reply-To: <20070904211323.GI19477@redhat.com>
References: <30E8283B-B35E-4DE2-A8B6-9D59ED51C3E8@equation.fr>
	<20070904211323.GI19477@redhat.com>
Message-ID: <CA0AA44E-8956-4826-8083-3FD0976D3D58@equation.fr>


Le 4 sept. 07 ? 23:13, Lon Hohberger a ?crit :

> On Fri, Aug 31, 2007 at 12:46:50PM +0200, Alain RICHARD wrote:
>> Perhaps a better error reporting is needed in qdiskd to shows that we
>> have hit this problem. Also using a generic name like "qdisk device"
>> when qdiskd is registering its node to cman is a better approach.
>
> What about using the label instead of the device name, and restricting
> the label to 16 chars when advertising to cman?
>
> -- Lon

Because when using multipath devices (for example a two paths  
device), all the paths and the multi-path device are recognized as  
having the same label, so qdisk fails to get the good device (the  
multi-path device).

Regards,

-- 
Alain RICHARD <mailto:alain.richard at equation.fr>
EQUATION SA <http://www.equation.fr/>
Tel : +33 477 79 48 00     Fax : +33 477 79 48 01
Applications client/serveur, ing?nierie r?seau et Linux

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070912/6510728f/attachment.htm>

From jprats at cesca.es  Wed Sep 12 07:14:04 2007
From: jprats at cesca.es (Jordi Prats)
Date: Wed, 12 Sep 2007 09:14:04 +0200
Subject: [Linux-cluster] Services timeout
Message-ID: <46E791BC.2090006@cesca.es>

Hi,
I have a NFS server with RedHat Cluster. Sometimes when is on heavy load 
it sets the service status to failed. There's no fs corruption and no 
daemon is down. I suspect this is caused by some timeout while is 
checking the fs is mounted. There is any way to define the check 
interval or the check timeout?

Thank you!
Jordi

-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From pcaulfie at redhat.com  Wed Sep 12 11:45:41 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Wed, 12 Sep 2007 12:45:41 +0100
Subject: [Linux-cluster] DLM - Lock Value Block error
In-Reply-To: <BAY123-W547DDF480A3D5D464F99B93C50@phx.gbl>
References: <BAY123-W547DDF480A3D5D464F99B93C50@phx.gbl>
Message-ID: <46E7D165.4040301@redhat.com>

Christos Triantafillou wrote:
> Hi,
>  
> I am using RHEL 4.5 and DLM 1.0.3 on a 4-node cluster.
>  
> I noticed the following regarding the LVB:
> 1. there are two processes: one that sets the LVB of a resource while
> holding an EX lock
> and another one that has a NL lock on the same resource and is blocked
> on a dlm_lock_wait
> for getting a CR lock and reading the LVB.
> 2. when the first process is interrupted with control-C or killed, the
> second process gets
> an invalid LVB error.
> 
> It seems that DLM falsely releases the resource after the first process
> is gone and then
> the second process reads an uninitialized LVB.
>  
> Can you please confirm this error and create a bug report if necessary?

I've just run the program on VMS and it exhibits exactly the same behaviour.

Therefore I suspect this is not a bug ;-)

-- 
Patrick


From lhh at redhat.com  Wed Sep 12 18:52:49 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 12 Sep 2007 14:52:49 -0400
Subject: [Linux-cluster] RE: qdisk votes not in cman
In-Reply-To: <CA0AA44E-8956-4826-8083-3FD0976D3D58@equation.fr>
References: <30E8283B-B35E-4DE2-A8B6-9D59ED51C3E8@equation.fr>
	<20070904211323.GI19477@redhat.com>
	<CA0AA44E-8956-4826-8083-3FD0976D3D58@equation.fr>
Message-ID: <20070912185249.GL7563@redhat.com>

On Wed, Sep 12, 2007 at 07:05:43AM +0200, Alain Richard wrote:
> 
> Le 4 sept. 07 ? 23:13, Lon Hohberger a ?crit :
> 
> >On Fri, Aug 31, 2007 at 12:46:50PM +0200, Alain RICHARD wrote:
> >>Perhaps a better error reporting is needed in qdiskd to shows that we
> >>have hit this problem. Also using a generic name like "qdisk device"
> >>when qdiskd is registering its node to cman is a better approach.
> >
> >What about using the label instead of the device name, and restricting
> >the label to 16 chars when advertising to cman?

> Because when using multipath devices (for example a two paths  
> device), all the paths and the multi-path device are recognized as  
> having the same label, so qdisk fails to get the good device (the  
> multi-path device).

I meant implementation-wise, using the label instead of the device name
to solve or work around the 16 character limit when talking to CMAN...

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Wed Sep 12 18:54:23 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 12 Sep 2007 14:54:23 -0400
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <46E5C2BD.7090705@transolutions.net>
References: <1381753941-1189461572-cardhu_decombobulator_blackberry.rim.net-1440139959-@bxe019.bisx.prod.on.blackberry>
	<46E5C2BD.7090705@transolutions.net>
Message-ID: <20070912185421.GM7563@redhat.com>

On Mon, Sep 10, 2007 at 05:18:37PM -0500, James Wilson wrote:
> When I remove the xen domU's from the configuration everything comes up 
> fine. Should the domU's be apart of their own cluster? But then I 
> wouldn't be able to mount gfs from the dom0 right?

Yes, I wouldn't mix physical and virtual nodes in the same cluster.
*that* introduces ugly quorum problems :)

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Wed Sep 12 18:57:59 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 12 Sep 2007 14:57:59 -0400
Subject: [Linux-cluster] changing configuration
In-Reply-To: <20070911234607.GD27482@tasint.org>
References: <20070911234607.GD27482@tasint.org>
Message-ID: <20070912185759.GN7563@redhat.com>

On Tue, Sep 11, 2007 at 04:46:08PM -0700, Joel Becker wrote:
> Hey everyone,
> 	How do I update the IP addresses of existing nodes?
> 	I have a simple cluster.  I had two nodes on a private network
> (10.x.x.x).  I decided to add two more nodes, but they are only on the
> public network.  So I wanted to add them as well as change the existing
> nodes to use the public network.

The cluster node names need to resolve to the public network interface
address, and I think 'uname -n' will need to match in some cases.
Otherwise, you can issue:

  'cman_tool join -n <public_hostname>'

-- Lon

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Wed Sep 12 18:59:03 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 12 Sep 2007 14:59:03 -0400
Subject: [Linux-cluster] Services timeout
In-Reply-To: <46E791BC.2090006@cesca.es>
References: <46E791BC.2090006@cesca.es>
Message-ID: <20070912185903.GO7563@redhat.com>

On Wed, Sep 12, 2007 at 09:14:04AM +0200, Jordi Prats wrote:
> Hi,
> I have a NFS server with RedHat Cluster. Sometimes when is on heavy load 
> it sets the service status to failed. There's no fs corruption and no 
> daemon is down. I suspect this is caused by some timeout while is 
> checking the fs is mounted. There is any way to define the check 
> interval or the check timeout?

It shouldn't matter about load - a fail only occurs on fail-to-stop
cases.  Do you have any log messages from the incident?

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From Michael.Hagmann at hilti.com  Wed Sep 12 19:50:20 2007
From: Michael.Hagmann at hilti.com (Hagmann, Michael)
Date: Wed, 12 Sep 2007 21:50:20 +0200
Subject: [Linux-cluster] GFS: drop_count and drop_period tuning
References: <39fdf1c70709100418j44935e4sd9bae4da92319a11@mail.gmail.com><9C203D6FD2BF9D49BFF3450201DEDA5301EACA71@LI-OWL.hag.hilti.com>
	<39fdf1c70709110135n7e50bb81p83237ff901b8bc87@mail.gmail.com>
Message-ID: <9C203D6FD2BF9D49BFF3450201DEDA530D101D@LI-OWL.hag.hilti.com>

Claudio

the Problem is that ( befor glock_purge Parameter ) no real mechanism to release glocks exists, the only limit is the memory size. Because we have a lot of Memory ( min. 32 GB RAM  ) and 6 Nodes, DLM cam on its limit to handle the locks ( over 6 million ) and timed out !

That means for you, maybe you use less memory but was is more important "performance" The DLM has less glocks to handle and is faster! In our Case the Cluster was, wihout this parameter not able to run!

But I don't now how this impact the drop_cout value.

mike


Michael Hagmann
UNIX Systems Engineering
Enterprise Systems Technology

Hilti Corporation
9494 Schaan  Liechtenstein

Department FIBS
Feldkircherstrasse 100   P.O.Box 333
P +423-234 2467  F +423-234 6467
E michael.hagmann at hilti.com
www.hilti.com


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Claudio Tassini
Sent: Tue 9/11/2007 10:35
To: linux clustering
Subject: Re: [Linux-cluster] GFS: drop_count and drop_period tuning
 
Thanks Michael, I've set this option on my filesystems. How should this impact to the system performance/behaviour? More/less memory usage? I guess that, by trimming the 50% of unused locks every 5 secs, it should cut off memory usage too.. am I right?  

If this works, I could also raise the drop_count value?


2007/9/10, Hagmann, Michael < Michael.Hagmann at hilti.com <mailto:Michael.Hagmann at hilti.com> >:

	Hi
	 
	When you are on RHEL4.5 then I highly suggest you to use the new glock_purge Parameter for every gfs Filesystem add to /etc/rc.local
	-------
	gfs_tool settune / glock_purge 50
	gfs_tool settune /scratch glock_purge 50
	-------
	 
	also this Parameter has to set new on every mount. That mean when you umount it and then mount it again, run the /etc/rc.local again, otherway the parameter are gone!
	 
	maybe also checkout this page --> http://www.open-sharedroot.org/Members/marc/blog/blog-on-gfs/glock-trimming-patch <http://www.open-sharedroot.org/Members/marc/blog/blog-on-gfs/glock-trimming-patch> 
	 
	mike

________________________________

	From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat <mailto:linux-cluster-bounces at redhat.com>  .com <mailto:linux-cluster-bounces at redhat.com> ] On Behalf Of Claudio Tassini
	Sent: Montag, 10. September 2007 13:19
	To: linux clustering
	Subject: [Linux-cluster] GFS: drop_count and drop_period tuning
	
	
		Hi all, 

	 
	I have a four-nodes GFS cluster on RH 4.5 (last versions, updated yesterday). There are three GFS filesystems ( 1 TB, 450 GB and 5GB), serving some mail domains with postfix/courier imap in a "maildir" configuration. 

	 
	As you can suspect, this is not exactly the best for GFS: we have a lot (thousands) of very small files (emails) in a very lot of directories. I'm trying to tune up things to reach the best performance. I found that tuning the drop_count parameter in /proc/cluster/lock_dlm/drop_period , setting it to a very large value (it was 500000 and now, after a memory upgrade, I've set it to 1500000 ), uses a lot of memory (about 10GB out of 16 that I've installed in every machine) and seems to "boost" performance limiting the iowait CPU usage. 
	
	 
	The bad thing is that when I umount a filesystem, it must clean up all that locks (I think), and sometimes it causes problems to the whole cluster, with the other nodes that stop writes to the filesystem while I'm umounting on one node only.  
	Is this normal? How can I tune this to clean memory faster when I umount the FS? I've read something about setting more gfs_glockd daemons per fs with the num_glockd mount option, but it seems to be quite deprecated because it shouldn't be necessary.. 

	 
	-- 
	Claudio Tassini 

	--
	Linux-cluster mailing list
	Linux-cluster at redhat.com
	https://www.redhat.com/mailman <https://www.redhat.com/mailman/listinfo/linux-cluster> /listinfo/linux-cluster <https://www.redhat.com/mailman/listinfo/linux-cluster> 
	

-- 
Claudio Tassini 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 4879 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070912/d03f97f7/attachment.bin>

From orkcu at yahoo.com  Wed Sep 12 19:50:27 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Wed, 12 Sep 2007 12:50:27 -0700 (PDT)
Subject: [Linux-cluster] RHEL4.5, GFS and selinux, are they playing nice?
In-Reply-To: <724236.51256.qm@web50608.mail.re2.yahoo.com>
Message-ID: <141914.88451.qm@web50602.mail.re2.yahoo.com>


--- Roger Pe?a <orkcu at yahoo.com> wrote:

> Hello everybody ;-)
> 
> I keep working in making a web cluster play nice
> after
> the upgrade from RHEL4.4 -> RHEL4.5 
> with this upgrade, the relation httpd-selinux become
> more strict

[bla bla bla]

> so now I have support to xattr in ours GFS
> filesystems
> but, here is the problem:
> the httpd do not want to start because some config
> files (witch reside in another GFS filesystem) have
> a
> forbidden context (httpd can not read file with that
> context) (those files are included from the main
> apache configuration)

> here are the error from selinux:
> { search } for  pid=2289 comm="httpd" name="/"
> dev=dm-7 ino=25  
> scontext=root:system_r:httpd_t
> tcontext=system_u:object_r:nfs_t  
> tclass=dir

[bla bla bla]

> but, that directory is /opt/soft:
> ll -di /opt/soft/
> 25 drwxr-xr-x  8 root root 3864 Sep 11  2007
> /opt/soft/
> ^^ <--- this is the inode
> 
> and it context is system_u:object_r:httpd_config_t:
> ll -dZ /opt/soft/
> drwxr-xr-x  root     root    
> system_u:object_r:httpd_config_t /opt/soft/
> 
> so, who is wrong? ls -Z or "global selinux kernel
> module" ?
> because ls -Z show that the context of that
> directory
> is system_u:object_r:httpd_config_t

[lots of bla bla]

> is this related to the fact that selinux policy
> stated
> this:
> genfscon gfs /  system_u:object_r:nfs_t

should I follow what is stated for reiserfs in this
url:
http://james-morris.livejournal.com/3580.html
?

if I should do it, because is the right thing to do,
why:
1- redhat did not do it for the release of 4.5 ?
2- others aren't getting this king of problems?

Am I the only one with GFS-selinux problems ?


cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Yahoo! oneSearch: Finally, mobile search 
that gives answers, not web links. 
http://mobile.yahoo.com/mobileweb/onesearch?refer=1ONXIC


From rohara at redhat.com  Wed Sep 12 20:26:00 2007
From: rohara at redhat.com (Ryan O'Hara)
Date: Wed, 12 Sep 2007 15:26:00 -0500
Subject: [Linux-cluster] RHEL4.5, GFS and selinux, are they playing nice?
In-Reply-To: <141914.88451.qm@web50602.mail.re2.yahoo.com>
References: <141914.88451.qm@web50602.mail.re2.yahoo.com>
Message-ID: <46E84B58.7060209@redhat.com>


Roger Pe?a wrote:

>> is this related to the fact that selinux policy
>> stated
>> this:
>> genfscon gfs /  system_u:object_r:nfs_t

Yes. This is what would be used for a filesystem that does not support 
selinux xattrs. In RHEL4.5, SELinux xattr support was added to GFS. 
However...

> should I follow what is stated for reiserfs in this
> url:
> http://james-morris.livejournal.com/3580.html

Yes. GFS needs to be defined as a filesystem that supports selinux xattrs.

> if I should do it, because is the right thing to do,
> why:
> 1- redhat did not do it for the release of 4.5 ?

The reason that the selinux policy was not updated for RHEL4.5 (in 
regards to selinux xattr support for GFS) is described in BZ 215559, 
comment #3:

"Changing this on the installed environment could have unexpected 
results.  For example, currently all files on gfs are unlabled and 
treated as nfs_t.  If I suddenly make this change, these file would then 
be treated file_t and any domain that was using them would become unable 
to .  This would require a relabel to fix.  And could cause hundreds of 
AVC messages.  I do not feel this is worth it since almost everyone will 
not use the labels on GFS to treat one file differently than another. In 
the future, where you might have /usr mounted on a gfs or gfs2 
partition, this would become more valuable."

> 2- others aren't getting this king of problems?

I'm not sure how many people are using GFS with SELinux enabled. :)

-Ryan


From jwilson at transolutions.net  Wed Sep 12 20:33:21 2007
From: jwilson at transolutions.net (James Wilson)
Date: Wed, 12 Sep 2007 15:33:21 -0500
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <20070912185421.GM7563@redhat.com>
References: <1381753941-1189461572-cardhu_decombobulator_blackberry.rim.net-1440139959-@bxe019.bisx.prod.on.blackberry>
	<46E5C2BD.7090705@transolutions.net>
	<20070912185421.GM7563@redhat.com>
Message-ID: <46E84D11.80305@transolutions.net>

Thanks for the replies.

I have decided to have the dom0's in one cluster and the domU's in 
another. I import the storage into the xen instances as raw storage and 
configure gfs from within the domU and it is working fine that now. The 
only thing is when I test failover the ip does not move over. When I 
checked the service it was still assigned to the instance that got 
fenced. Any ideas?

Lon Hohberger wrote:
> On Mon, Sep 10, 2007 at 05:18:37PM -0500, James Wilson wrote:
>   
>> When I remove the xen domU's from the configuration everything comes up 
>> fine. Should the domU's be apart of their own cluster? But then I 
>> wouldn't be able to mount gfs from the dom0 right?
>>     
>
> Yes, I wouldn't mix physical and virtual nodes in the same cluster.
> *that* introduces ugly quorum problems :)
>
>   


From orkcu at yahoo.com  Wed Sep 12 20:37:03 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Wed, 12 Sep 2007 13:37:03 -0700 (PDT)
Subject: [Linux-cluster] RHEL4.5, GFS and selinux, are they playing nice?
In-Reply-To: <46E84B58.7060209@redhat.com>
Message-ID: <231325.27769.qm@web50607.mail.re2.yahoo.com>


--- Ryan O'Hara <rohara at redhat.com> wrote:

> 
> Roger Pe?a wrote:
> 
> >> is this related to the fact that selinux policy
> >> stated
> >> this:
> >> genfscon gfs /  system_u:object_r:nfs_t
> 
> Yes. This is what would be used for a filesystem
> that does not support 
> selinux xattrs. In RHEL4.5, SELinux xattr support
> was added to GFS. 
> However...
> 
> > should I follow what is stated for reiserfs in
> this
> > url:
> > http://james-morris.livejournal.com/3580.html
> 
> Yes. GFS needs to be defined as a filesystem that
> supports selinux xattrs.
> 
> > if I should do it, because is the right thing to
> do,
> > why:
> > 1- redhat did not do it for the release of 4.5 ?
> 
> The reason that the selinux policy was not updated
> for RHEL4.5 (in 
> regards to selinux xattr support for GFS) is
> described in BZ 215559, 
> comment #3:
> 
> "Changing this on the installed environment could
> have unexpected 
> results.  For example, currently all files on gfs
> are unlabled and 
> treated as nfs_t.  If I suddenly make this change,
> these file would then 
> be treated file_t and any domain that was using them
> would become unable 
> to .  This would require a relabel to fix.  And
> could cause hundreds of 
> AVC messages.  I do not feel this is worth it since
> almost everyone will 
> not use the labels on GFS to treat one file
> differently than another. In 
> the future, where you might have /usr mounted on a
> gfs or gfs2 
> partition, this would become more valuable."

thanks a lot
I had few days looking in the net but never look in
bugzilla :-( jejejeje


> 
> > 2- others aren't getting this king of problems?
> 
> I'm not sure how many people are using GFS with
> SELinux enabled. :)

I was forced to, by httpd, it complain about not able
to open configuration files and documentRoots ....

ok, I will try to follow what stated in the webpage,
and relabel the system, but this after I study a litle
bit more about selinux :-)

thanks again
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


      ____________________________________________________________________________________
Fussy? Opinionated? Impossible to please? Perfect.  Join Yahoo!'s user panel and lay it on us. http://surveylink.yahoo.com/gmrs/yahoo_panel_invite.asp?a=7 


From teigland at redhat.com  Wed Sep 12 20:45:25 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 12 Sep 2007 15:45:25 -0500
Subject: [Linux-cluster] GFS profiling result
In-Reply-To: <200709061058.40741.hlawatschek@atix.de>
References: <200709061058.40741.hlawatschek@atix.de>
Message-ID: <20070912204525.GE5634@redhat.com>

On Thu, Sep 06, 2007 at 10:58:40AM +0200, Mark Hlawatschek wrote:
> Hi,
> 
> during a performance analysis and tuning session, I did some profiling with 
> oprofile on GFS and dlm. 
> I got some weird results ... 
> 
> The installed software is:
> RHEL4u5, kernel 2.6.9-55.0.2.ELsmp
> GFS:  2.6.9-72.2.0.2
> DLM: 2.6.9-46.16.0.1
> 
> The configuration includes 2 clusternodes.
> 
> I put the following load on one cluster node:
> 
> 100 processes are doing in parallel: 
> - create 1000 files with 100kb size each (ie altogether we have 100.000 files)
> - flock 1000 files
> - unlink 1000 files.
> 
> The following oprofile output shows, that the system spends about 49% 
> (75%*65%*) of the time in gfs_unlinked_get.
> Looking into the code whe can see, that this is related to unlinked.c:
>      53 9394211 58.7081 :                       ul = list_entry(tmp, struct 
> gfs_unlinked, ul_list);
> 
> It can also be observed, that dlm spends more than 50% of its time in 
> searching for hashes...
> 
> Is this the expected behaviour or can this be tuned somewhere ?

Thanks for doing this, it's very interesting.  For the dlm
search_hashchain, could you try changing rsbtbl_size to 1024 (the default
is 256).  echo 1024 > /proc/.../rsbtbl_size after loading the dlm module,
but before the lockspace is created.

For gfs, I haven't looked very closely, but the linked list could probably
be simply turned into a hash table.  We'd want to study it more closely to
make sure that the long non-hashed list is really the right thing to fix
(i.e. we don't want to just fix a symptom of something else).

Dave


From david.costakos at gmail.com  Wed Sep 12 21:03:38 2007
From: david.costakos at gmail.com (Dave Costakos)
Date: Wed, 12 Sep 2007 14:03:38 -0700
Subject: [Linux-cluster] RE: qdisk votes not in cman
In-Reply-To: <20070912185249.GL7563@redhat.com>
References: <30E8283B-B35E-4DE2-A8B6-9D59ED51C3E8@equation.fr>
	<20070904211323.GI19477@redhat.com>
	<CA0AA44E-8956-4826-8083-3FD0976D3D58@equation.fr>
	<20070912185249.GL7563@redhat.com>
Message-ID: <6b6836c60709121403x53061da6r2a061627e0cd388c@mail.gmail.com>

For my part, I'd at least like to see an error message logged.  Would've
saved us all some time here.


On 9/12/07, Lon Hohberger <lhh at redhat.com> wrote:
>
> On Wed, Sep 12, 2007 at 07:05:43AM +0200, Alain Richard wrote:
> >
> > Le 4 sept. 07 ? 23:13, Lon Hohberger a ?crit :
> >
> > >On Fri, Aug 31, 2007 at 12:46:50PM +0200, Alain RICHARD wrote:
> > >>Perhaps a better error reporting is needed in qdiskd to shows that we
> > >>have hit this problem. Also using a generic name like "qdisk device"
> > >>when qdiskd is registering its node to cman is a better approach.
> > >
> > >What about using the label instead of the device name, and restricting
> > >the label to 16 chars when advertising to cman?
>
> > Because when using multipath devices (for example a two paths
> > device), all the paths and the multi-path device are recognized as
> > having the same label, so qdisk fails to get the good device (the
> > multi-path device).
>
> I meant implementation-wise, using the label instead of the device name
> to solve or work around the 16 character limit when talking to CMAN...
>
> --
> Lon Hohberger - Software Engineer - Red Hat, Inc.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Dave Costakos
mailto:david.costakos at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070912/08d6607e/attachment.htm>

From Timothy.Ward at itt.com  Wed Sep 12 22:03:20 2007
From: Timothy.Ward at itt.com (Ward, Timothy - SSD)
Date: Wed, 12 Sep 2007 18:03:20 -0400
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC24@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <77E700AE7021314DB6CDF6D6E8F661320396FC25@ACDFWMAIL1.acd.de.ittind.com>

I have successfully setup apache and samba as cluster services.  I am
now trying to setup nfs, but encountering a kernel bug.  Any ideas where
I should start looking to fix this?

Thanks,
Tim


System
------
node1# uname -a
Linux node1.cluster.com 2.6.18-1.2798.fc6 #1 SMP Mon Oct 16 14:39:22 EDT
2006 x86_64 x86_64 x86_64 GNU/Linux


FC6 64bit RPMs
--------------
rpm -ivh fc6_rpm/openais-0.80.1-3.x86_64.rpm

rpm -ivh fc6_rpm/perl-Net-Telnet-3.03-5.noarch.rpm
rpm -ivh fc6_rpm_more/xen-libs-3.0.3-9.fc6.x86_64.rpm
rpm -ivh fc6_rpm_more/bridge-utils-1.1-2.x86_64.rpm
rpm -ivh --nodeps fc6_rpm_more/libvirt-0.2.3-1.fc6.x86_64.rpm
rpm -ivh fc6_rpm_more/libvirt-python-0.2.3-1.fc6.x86_64.rpm
rpm -ivh fc6_rpm_more/python-virtinst-0.95.0-1.fc6.noarch.rpm
rpm -ivh fc6_rpm_more/xen-3.0.3-9.fc6.x86_64.rpm
rpm -ivh fc6_rpm_updates/cman-2.0.60-1.fc6.x86_64.rpm
rpm -ivh fc6_rpm_updates/gfs2-utils-0.1.25-1.fc6.x86_64.rpm
rpm -ivh --force fc6_rpm_updates/device-mapper-1.02.13-1.fc6.x86_64.rpm
rpm -ivh --force fc6_rpm_updates/lvm2-2.02.17-1.fc6.x86_64.rpm
rpm -ivh fc6_rpm_updates/lvm2-cluster-2.02.17-1.fc6.x86_64.rpm
rpm -ivh fc6_rpm/rgmanager-2.0.8-1.fc6.x86_64.rpm

Luci
rpm -ivh conga/python-imaging-1.1.6-3.fc6.x86_64.rpm
rpm -ivh conga/zope-2.9.7-2.fc6.x86_64.rpm rpm -ivh
conga/plone-2.5.3-1.fc6.x86_64.rpm
rpm -ivh conga/luci-0.9.3-2.fc6.x86_64.rpm

Ricci
rpm -ivh --nodeps conga/oddjob-libs-0.27-8.x86_64.rpm
rpm -ivh conga/oddjob-0.27-8.x86_64.rpm
rpm -ivh conga/modcluster-0.9.3-2.fc6.x86_64.rpm
rpm -ivh conga/ricci-0.9.3-2.fc6.x86_64.rpm


/etc/cluster/cluster.conf
-------------------------
<cluster config_version="49" name="test1">
   <clusternodes>
      <clusternode name="node1.cluster.com" nodeid="1" votes="1">
         <fence>
            <method name="1">
               <device name="simnps1" port="1" switch="1"/>
            </method>
         </fence>
      </clusternode>
      <clusternode name="node2.cluster.com" nodeid="2" votes="1">
         <fence>
            <method name="1">
               <device name="simnps1" port="2" switch="1"/>
            </method>
         </fence>
      </clusternode>
      <clusternode name="node3.cluster.com" nodeid="3" votes="1">
         <fence/>
      </clusternode>
   </clusternodes>
   <fencedevices>
      <fencedevice agent="fence_apc" ipaddr="172.20.1.12" login="root"
name="nps1" passwd=""/>
   </fencedevices>
   <rm>
      <failoverdomains>
         <failoverdomain name="web0" ordered="1" restricted="1">
            <failoverdomainnode name="node1.cluster.com" priority="1"/>
            <failoverdomainnode name="node2.cluster.com" priority="2"/>
         </failoverdomain>
      </failoverdomains>
      <resources>
         <script file="/etc/rc.d/init.d/httpd" name="httpd_init"/>
         <ip address="172.20.1.10" monitor_link="1"/>
         <fs device="/dev/sdb1" force_fsck="0" force_unmount="0"
fsid="51920" fstype="ext3" mountpoint="/mnt/disk0" name="disk0"
self_fence="0"/>
         <smb name="ssdfwmsa" workgroup="ACDADM"/>
         <ip address="172.20.1.14" monitor_link="1"/>
         <ip address="172.20.1.16" monitor_link="1"/>
      </resources>
      <service autostart="1" domain="web0" exclusive="0" name="apache0"
recovery="relocate">
         <script ref="httpd_init"/>
         <ip ref="172.20.1.10"/>
         <fs ref="disk0"/>
      </service>
      <service autostart="1" domain="web0" exclusive="0" name="samba0"
recovery="relocate">
         <ip ref="172.20.1.14"/>
         <smb ref="ssdfwmsa"/>
      </service>
      <service autostart="1" domain="web0" exclusive="0" name="nfs3"
recovery="relocate">
         <ip ref="172.20.1.16"/>
      </service>
   </rm>
   <fence_daemon clean_start="0" post_fail_delay="0"
post_join_delay="3"/>
   <cman/>
</cluster>


Commands
--------
node1# clusvcadm -e nfs3
test1# mount 172.20.1.16:/dyntest /mnt/dyntest test1# ll /mnt/dyntest

<command hangs and kernel bug logged>


/var/log/messages
-----------------
Sep 12 14:22:05 node1 clurgmgrd[2751]: <notice> Starting disabled
service service:nfs3
Sep 12 14:22:05 node1 clurgmgrd: [2751]: <info> Adding IPv4 address
172.20.1.16 to eth1
Sep 12 14:22:05 node1 avahi-daemon[2555]: Registering new address record
for 172.20.1.16 on eth1.
Sep 12 14:22:07 node1 clurgmgrd[2751]: <notice> Service service:nfs3
started
Sep 12 14:22:15 node1 mountd[29364]: authenticated mount request from
10.32.144.169:761 for /dyntest (/dyntest)
Sep 12 14:22:21 node1 kernel: original: gfs2_glock_nq_atime+0x152/0x2a2
[gfs2]
Sep 12 14:22:21 node1 kernel: pid : 29354
Sep 12 14:22:21 node1 kernel: lock type : 2 lock state : 1
Sep 12 14:22:21 node1 kernel: new: gfs2_getattr+0x2b/0x63 [gfs2]
Sep 12 14:22:21 node1 kernel: pid : 29354
Sep 12 14:22:21 node1 kernel: lock type : 2 lock state : 1
Sep 12 14:22:21 node1 kernel: ----------- [cut here ] --------- [please
bite here ] ---------
Sep 12 14:22:21 node1 kernel: Kernel BUG at fs/gfs2/glock.c:1193
Sep 12 14:22:21 node1 kernel: invalid opcode: 0000 [1] SMP
Sep 12 14:22:21 node1 kernel: last sysfs file:
/fs/gfs2/test1:gfslv/lock_module/recover_done
Sep 12 14:22:21 node1 kernel: CPU 0
Sep 12 14:22:21 node1 kernel: Modules linked in: nfsd exportfs lockd
nfs_acl ipt_MASQUERADE iptable_nat ip_nat xt_state ip_conntrack
nfnetlink ipt_REJECT xt_tcpudp iptable_filter ip_tables x_tables bridge
autofs4 hidp rfcomm l2cap bluetooth md5 sctp lock_dlm gfs2 dlm configfs
sunrpc dm_mirror dm_multipath dm_mod video sbs i2c_ec button battery
asus_acpi ac ipv6 parport_pc lp parport sg amd_rng i2c_amd8111 ide_cd
i2c_amd756 i2c_core serio_raw pcspkr cdrom e1000 shpchp tg3 floppy
k8_edac edac_mc qla2xxx scsi_transport_fc sd_mod scsi_mod ext3 jbd
ehci_hcd ohci_hcd uhci_hcd
Sep 12 14:22:21 node1 kernel: Pid: 29354, comm: nfsd Not tainted
2.6.18-1.2798.fc6 #1
Sep 12 14:22:21 node1 kernel: RIP: 0010:[<ffffffff88450641>]
[<ffffffff88450641>] :gfs2:gfs2_glock_nq+0x106/0x1f2
Sep 12 14:22:21 node1 kernel: RSP: 0018:ffff8100ea0816d0  EFLAGS:
00010282
Sep 12 14:22:21 node1 kernel: RAX: 0000000000000020 RBX:
ffff8100ea081cc0 RCX: ffffffff806aea40
Sep 12 14:22:21 node1 kernel: RDX: 0000000000000000 RSI:
0000000000000046 RDI: ffffffff80556ef0
Sep 12 14:22:21 node1 kernel: RBP: ffff8100ea081710 R08:
00000000ffffffff R09: 0000000000000400
Sep 12 14:22:21 node1 kernel: R10: 0000000000000000 R11:
0000000000000000 R12: ffff8100d8d98830
Sep 12 14:22:21 node1 kernel: R13: ffff8100d8d98830 R14:
0000000000000000 R15: ffff8100e3c3b000
Sep 12 14:22:21 node1 kernel: FS:  00002aaaab0146f0(0000)
GS:ffffffff80609000(0000) knlGS:00000000f7fd46d0
Sep 12 14:22:21 node1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Sep 12 14:22:21 node1 kernel: CR2: 0000000000669358 CR3:
0000000068f92000 CR4: 00000000000006e0
Sep 12 14:22:21 node1 kernel: Process nfsd (pid: 29354, threadinfo
ffff8100ea080000, task ffff8100f41ee7d0)
Sep 12 14:22:21 node1 kernel: Stack:  0000000100000000 ffff8100cc8ba01c
ffff8100ea0819b0 ffff8100d4539678
Sep 12 14:22:21 node1 kernel:  ffff8100ea0817b0 0000000000000001
ffff8100cc8ba000 ffffffff8845c747
Sep 12 14:22:21 node1 kernel:  ffff8100ea081710 ffff8100ea081710
ffff8100d8d98830 ffff8100f41ee7d0
Sep 12 14:22:21 node1 kernel: Call Trace:
Sep 12 14:22:21 node1 kernel:  [<ffffffff8845c747>]
:gfs2:gfs2_getattr+0x33/0x63
Sep 12 14:22:21 node1 kernel:  [<ffffffff886a4d0b>]
:nfsd:encode_post_op_attr+0x3f/0x213
Sep 12 14:22:21 node1 kernel:  [<ffffffff886a5492>]
:nfsd:encode_entry+0x21d/0x51b
Sep 12 14:22:21 node1 kernel:  [<ffffffff886a57a0>]
:nfsd:nfs3svc_encode_entry_plus+0x10/0x12
Sep 12 14:22:21 node1 kernel:  [<ffffffff8845a2c2>]
:gfs2:filldir_func+0x22/0x86
Sep 12 14:22:21 node1 kernel:  [<ffffffff8844abd1>]
:gfs2:do_filldir_main+0x126/0x16d
Sep 12 14:22:21 node1 kernel:  [<ffffffff8844b102>]
:gfs2:gfs2_dir_read+0x426/0x485
Sep 12 14:22:21 node1 kernel:  [<ffffffff8845aa6f>]
:gfs2:gfs2_readdir+0x9e/0xc4
Sep 12 14:22:21 node1 kernel:  [<ffffffff802350b2>]
vfs_readdir+0x77/0xa9
Sep 12 14:22:21 node1 kernel:  [<ffffffff8869ce5e>]
:nfsd:nfsd_readdir+0x6d/0xc5
Sep 12 14:22:21 node1 kernel:  [<ffffffff886a4621>]
:nfsd:nfsd3_proc_readdirplus+0xf8/0x211
Sep 12 14:22:21 node1 kernel:  [<ffffffff886990e9>]
:nfsd:nfsd_dispatch+0xd7/0x198
Sep 12 14:22:21 node1 kernel:  [<ffffffff883e2437>]
:sunrpc:svc_process+0x42e/0x6ec
Sep 12 14:22:21 node1 kernel:  [<ffffffff88699662>]
:nfsd:nfsd+0x1b5/0x32b
Sep 12 14:22:21 node1 kernel:  [<ffffffff8025cea5>] child_rip+0xa/0x11
Sep 12 14:22:22 node1 kernel: DWARF2 unwinder stuck at
child_rip+0xa/0x11
Sep 12 14:22:22 node1 kernel: Leftover inexact backtrace:
Sep 12 14:22:22 node1 kernel:  [<ffffffff886994ad>] :nfsd:nfsd+0x0/0x32b
Sep 12 14:22:22 node1 kernel:  [<ffffffff886994ad>] :nfsd:nfsd+0x0/0x32b
Sep 12 14:22:22 node1 kernel:  [<ffffffff8025ce9b>] child_rip+0x0/0x11
Sep 12 14:22:22 node1 kernel:
Sep 12 14:22:22 node1 kernel:
Sep 12 14:22:22 node1 kernel: Code: 0f 0b 68 0d 80 46 88 c2 a9 04 48 8b
75 18 49 8b 84 24 90 00
Sep 12 14:22:22 node1 kernel: RIP  [<ffffffff88450641>]
:gfs2:gfs2_glock_nq+0x106/0x1f2
Sep 12 14:22:22 node1 kernel:  RSP <ffff8100ea0816d0>

*****************************************************************
This e-mail and any files transmitted with it may be proprietary 
and are intended solely for the use of the individual or entity to 
whom they are addressed. If you have received this e-mail in 
error please notify the sender. Please note that any views or
opinions presented in this e-mail are solely those of the author 
and do not necessarily represent those of ITT Corporation. The 
recipient should check this e-mail and any attachments for the 
presence of viruses. ITT accepts no liability for any damage 
caused by any virus transmitted by this e-mail.
*******************************************************************


From orkcu at yahoo.com  Thu Sep 13 01:40:20 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Wed, 12 Sep 2007 18:40:20 -0700 (PDT)
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC25@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <131367.99176.qm@web50608.mail.re2.yahoo.com>


--- "Ward, Timothy - SSD" <Timothy.Ward at itt.com>
wrote:

> I have successfully setup apache and samba as
> cluster services.  I am
> now trying to setup nfs, but encountering a kernel
> bug.  Any ideas where
> I should start looking to fix this?

> 
> Sep 12 14:22:21 node1 kernel: new:
> gfs2_getattr+0x2b/0x63 [gfs2]
> Sep 12 14:22:21 node1 kernel: pid : 29354
> Sep 12 14:22:21 node1 kernel: lock type : 2 lock
> state : 1
> Sep 12 14:22:21 node1 kernel: ----------- [cut here
> ] --------- [please
> bite here ] ---------
> Sep 12 14:22:21 node1 kernel: Kernel BUG at
> fs/gfs2/glock.c:1193

but the bug is related to gfs2, how a fs gfs2 is
related to you NFS service?

BTW, remember that GFS2 is not production ready ...

I guess my email is not big help .... :-(

cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Got a little couch potato? 
Check out fun summer activities for kids.
http://search.yahoo.com/search?fr=oni_on_mail&p=summer+activities+for+kids&cs=bz 


From jprats at cesca.es  Thu Sep 13 07:10:12 2007
From: jprats at cesca.es (Jordi Prats)
Date: Thu, 13 Sep 2007 09:10:12 +0200
Subject: [Linux-cluster] Services timeout
In-Reply-To: <20070912185903.GO7563@redhat.com>
References: <46E791BC.2090006@cesca.es> <20070912185903.GO7563@redhat.com>
Message-ID: <46E8E254.8030301@cesca.es>

Hi,
This is all the data I can collect. bpkar is a backup process. It have 
happened while it was indexing (it takes several weeks) and doing a 
backup at the same time.

best regards,

Sep 12 06:09:44 inf04 clurgmgrd[5964]: <notice> Stopping service 
padicat.dades
Sep 12 06:09:45 inf04 clurgmgrd: [5964]: <info> Executing 
/etc/init.d/add.nfs.padicat.dades stop
Sep 12 06:09:45 inf04 clurgmgrd: [5964]: <info> Removing IPv4 address 
192.168.12.205 from bond0
Sep 12 06:09:54 inf04 clurgmgrd: [5964]: <info> Executing 
/etc/init.d/add.nfs.recercat status
Sep 12 06:09:55 inf04 clurgmgrd: [5964]: <info> unmounting 
/projectes/padicat/dades
Sep 12 06:09:55 inf04 clurgmgrd: [5964]: <notice> Forcefully unmounting 
/projectes/padicat/dades
Sep 12 06:09:56 inf04 clurgmgrd: [5964]: <warning> killing process 13266 
(root bpbkar /projectes/padicat/dades)
Sep 12 06:09:56 inf04 clurgmgrd: [5964]: <warning> Dropping node-wide 
NFS locks
Sep 12 06:10:04 inf04 clurgmgrd: [5964]: <info> Executing 
/etc/init.d/add.nfs.padicat.web status
Sep 12 06:10:04 inf04 clurgmgrd: [5964]: <info> Executing 
/etc/init.d/add.nfs.local status
Sep 12 06:10:06 inf04 clurgmgrd: [5964]: <info> unmounting 
/projectes/padicat/dades
Sep 12 06:10:06 inf04 clurgmgrd: [5964]: <notice> Forcefully unmounting 
/projectes/padicat/dades
Sep 12 06:10:07 inf04 clurgmgrd: [5964]: <info> Sending reclaim 
notifications via inf04.cesca.es
Sep 12 06:10:07 inf04 rpc.statd[27045]: Version 1.0.6 Starting
Sep 12 06:10:07 inf04 rpc.statd[27045]: Flags: No-Daemon Notify-Only
Sep 12 06:10:10 inf04 rpc.statd[27045]: Caught signal 15, un-registering 
and exiting.
Sep 12 06:10:10 inf04 clurgmgrd: [5964]: <info> Sending reclaim 
notifications via nfstdx
Sep 12 06:10:10 inf04 rpc.statd[27067]: Version 1.0.6 Starting
Sep 12 06:10:10 inf04 rpc.statd[27067]: Flags: No-Daemon Notify-Only
Sep 12 06:10:13 inf04 rpc.statd[27067]: Caught signal 15, un-registering 
and exiting.
Sep 12 06:10:13 inf04 clurgmgrd: [5964]: <info> Sending reclaim 
notifications via nfspadicatweb
Sep 12 06:10:13 inf04 rpc.statd[27089]: Version 1.0.6 Starting
Sep 12 06:10:13 inf04 rpc.statd[27089]: Flags: No-Daemon Notify-Only
Sep 12 06:10:14 inf04 clurgmgrd: [5964]: <info> Executing 
/etc/init.d/add.nfs.tdx status
Sep 12 06:10:16 inf04 rpc.statd[27089]: Caught signal 15, un-registering 
and exiting.
Sep 12 06:10:16 inf04 clurgmgrd: [5964]: <info> Sending reclaim 
notifications via nfslocal
Sep 12 06:10:16 inf04 rpc.statd[27321]: Version 1.0.6 Starting
Sep 12 06:10:16 inf04 rpc.statd[27321]: Flags: No-Daemon Notify-Only
Sep 12 06:10:19 inf04 rpc.statd[27321]: Caught signal 15, un-registering 
and exiting.
Sep 12 06:10:19 inf04 clurgmgrd: [5964]: <info> Sending reclaim 
notifications via nfsrecercat
Sep 12 06:10:19 inf04 rpc.statd[27343]: Version 1.0.6 Starting
Sep 12 06:10:19 inf04 rpc.statd[27343]: Flags: No-Daemon Notify-Only
Sep 12 06:10:22 inf04 rpc.statd[27343]: Caught signal 15, un-registering 
and exiting.
Sep 12 06:10:22 inf04 clurgmgrd: [5964]: <err> 'umount 
/projectes/padicat/dades' failed, error=0
Sep 12 06:10:22 inf04 clurgmgrd[5964]: <notice> stop on fs 
"PADICAT.dades" returned 2 (invalid argument(s))
Sep 12 06:10:22 inf04 clurgmgrd[5964]: <crit> #12: RG padicat.dades 
failed to stop; intervention required
Sep 12 06:10:22 inf04 clurgmgrd[5964]: <notice> Service padicat.dades is 
failed


Lon Hohberger wrote:
> On Wed, Sep 12, 2007 at 09:14:04AM +0200, Jordi Prats wrote:
>   
>> Hi,
>> I have a NFS server with RedHat Cluster. Sometimes when is on heavy load 
>> it sets the service status to failed. There's no fs corruption and no 
>> daemon is down. I suspect this is caused by some timeout while is 
>> checking the fs is mounted. There is any way to define the check 
>> interval or the check timeout?
>>     
>
> It shouldn't matter about load - a fail only occurs on fail-to-stop
> cases.  Do you have any log messages from the incident?
>
>   


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From grimme at atix.de  Thu Sep 13 07:14:29 2007
From: grimme at atix.de (Marc Grimme)
Date: Thu, 13 Sep 2007 09:14:29 +0200
Subject: [Linux-cluster] GFS profiling result
In-Reply-To: <20070912204525.GE5634@redhat.com>
References: <200709061058.40741.hlawatschek@atix.de>
	<20070912204525.GE5634@redhat.com>
Message-ID: <200709130914.30302.grimme@atix.de>

Hi Dave,
you might also want to have a look at:
http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/oprofile-analysis
Perhaps you have something to add there as well.
Thanks
Marc.
On Wednesday 12 September 2007 22:45:25 David Teigland wrote:
> On Thu, Sep 06, 2007 at 10:58:40AM +0200, Mark Hlawatschek wrote:
> > Hi,
> >
> > during a performance analysis and tuning session, I did some profiling
> > with oprofile on GFS and dlm.
> > I got some weird results ...
> >
> > The installed software is:
> > RHEL4u5, kernel 2.6.9-55.0.2.ELsmp
> > GFS:  2.6.9-72.2.0.2
> > DLM: 2.6.9-46.16.0.1
> >
> > The configuration includes 2 clusternodes.
> >
> > I put the following load on one cluster node:
> >
> > 100 processes are doing in parallel:
> > - create 1000 files with 100kb size each (ie altogether we have 100.000
> > files) - flock 1000 files
> > - unlink 1000 files.
> >
> > The following oprofile output shows, that the system spends about 49%
> > (75%*65%*) of the time in gfs_unlinked_get.
> > Looking into the code whe can see, that this is related to unlinked.c:
> >      53 9394211 58.7081 :                       ul = list_entry(tmp,
> > struct gfs_unlinked, ul_list);
> >
> > It can also be observed, that dlm spends more than 50% of its time in
> > searching for hashes...
> >
> > Is this the expected behaviour or can this be tuned somewhere ?
>
> Thanks for doing this, it's very interesting.  For the dlm
> search_hashchain, could you try changing rsbtbl_size to 1024 (the default
> is 256).  echo 1024 > /proc/.../rsbtbl_size after loading the dlm module,
> but before the lockspace is created.
>
> For gfs, I haven't looked very closely, but the linked list could probably
> be simply turned into a hash table.  We'd want to study it more closely to
> make sure that the long non-hashed list is really the right thing to fix
> (i.e. we don't want to just fix a symptom of something else).
>
> Dave
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 452 3538-14
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax:   +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand: 
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss


From Alain.Moulle at bull.net  Thu Sep 13 12:42:09 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Thu, 13 Sep 2007 14:42:09 +0200
Subject: [Linux-cluster] CS4 U4/ is interval configurable ?
Message-ID: <46E93021.9040201@bull.net>

Hi

don't know exactly what's the meaning of "status" and "monitor"
(I mean the difference ?)

but I would like to increase the interval between each call of
the target "status" of the service's script .

<action name="status" interval="30s" timeout="0"/>
<action name="monitor" interval="30s" timeout="0"/>

Is it possible to modify this parameter via the cluster.conf ?

Thanks
Alain Moull?


From teigland at redhat.com  Thu Sep 13 14:02:24 2007
From: teigland at redhat.com (David Teigland)
Date: Thu, 13 Sep 2007 09:02:24 -0500
Subject: [Linux-cluster] GFS profiling result
In-Reply-To: <200709130914.30302.grimme@atix.de>
References: <200709061058.40741.hlawatschek@atix.de>
	<20070912204525.GE5634@redhat.com>
	<200709130914.30302.grimme@atix.de>
Message-ID: <20070913140224.GA24441@redhat.com>

On Thu, Sep 13, 2007 at 09:14:29AM +0200, Marc Grimme wrote:
> Hi Dave,
> you might also want to have a look at:
> http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/oprofile-analysis
> Perhaps you have something to add there as well.

Thanks, this is great information.  In this case it's the size of the
"lkbtbl" hash table that you could try increasing.  By default it's 1024,
I might start by trying 2048 and see what changes.

This is, of course, all driven by the number of locks that gfs is using.
It would be interesting to see what that number is.  Over the last several
years, since we originally picked the size of these hash tables, the size
of gfs fs's and the amount of memory on machines has grown quite a lot
(the VA Linux machines I was using when first writing this code had 256 MB
of memory.)  So, the number of locks in a gfs cluster has grown, too.  It
may be time to increase the default sizes of these hash tables.

Another problem is the way the dlm creates and uses lock id's.  This isn't
quite as simple to solve.  Because the lock id's are only 32 bits, the
counters easily wrap around, which means that whenever a new lock id is
chosen, we have to search all existing lock id's to prevent duplicates
(these searches are per hash chain).  There may be a smarter technique we
could use to do this more efficiently.  One idea I've had is to keep a
list of deleted lkb's and recycle them -- this would mean that we don't
often search for a new lock id after the system has run for a while.  A
tree structure instead of a hash table may also be helpful.

Dave


From grimme at atix.de  Thu Sep 13 14:32:28 2007
From: grimme at atix.de (Marc Grimme)
Date: Thu, 13 Sep 2007 16:32:28 +0200
Subject: [Linux-cluster] GFS profiling result
In-Reply-To: <20070913140224.GA24441@redhat.com>
References: <200709061058.40741.hlawatschek@atix.de>
	<200709130914.30302.grimme@atix.de>
	<20070913140224.GA24441@redhat.com>
Message-ID: <200709131632.29934.grimme@atix.de>

On Thursday 13 September 2007 16:02:24 David Teigland wrote:
> On Thu, Sep 13, 2007 at 09:14:29AM +0200, Marc Grimme wrote:
> > Hi Dave,
> > you might also want to have a look at:
> > http://www.opensharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-_
> >_find_lock_by_id/oprofile-analysis Perhaps you have something to add there
> > as well.
>
> Thanks, this is great information.  In this case it's the size of the
> "lkbtbl" hash table that you could try increasing.  By default it's 1024,
> I might start by trying 2048 and see what changes.
We already increased it to 4096 ;-) on our testsystem (both lkt and rsb) and 
see way better results!!! Profiling results will follow.
>
> This is, of course, all driven by the number of locks that gfs is using.
> It would be interesting to see what that number is.  Over the last several
> years, since we originally picked the size of these hash tables, the size
> of gfs fs's and the amount of memory on machines has grown quite a lot
> (the VA Linux machines I was using when first writing this code had 256 MB
> of memory.)  So, the number of locks in a gfs cluster has grown, too.  It
> may be time to increase the default sizes of these hash tables.
Or to grow the hash tables if the lists on the tables get too long?
This is quite expensive but still the id finding would stay O(sizeof(list)).
>
> Another problem is the way the dlm creates and uses lock id's.  This isn't
> quite as simple to solve.  Because the lock id's are only 32 bits, the
> counters easily wrap around, which means that whenever a new lock id is
> chosen, we have to search all existing lock id's to prevent duplicates
> (these searches are per hash chain).  There may be a smarter technique we
> could use to do this more efficiently.  One idea I've had is to keep a
> list of deleted lkb's and recycle them -- this would mean that we don't
> often search for a new lock id after the system has run for a while.  A
> tree structure instead of a hash table may also be helpful.
I think the shorter you keep the lists in the hashtables the faster the search 
would be.
But thats up to you.
>
> Dave


-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 452 3538-14
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10 
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax:   +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand: 
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss


From Timothy.Ward at itt.com  Thu Sep 13 14:32:18 2007
From: Timothy.Ward at itt.com (Ward, Timothy - SSD)
Date: Thu, 13 Sep 2007 10:32:18 -0400
Subject: [Linux-cluster] Cluster NFS causes kernel bug
Message-ID: <77E700AE7021314DB6CDF6D6E8F661320396FC27@ACDFWMAIL1.acd.de.ittind.com>

I am trying to setup nfs as a clustered service.  The filesystem nfs is
serving is a GFS2 fs.  So I am trying to find out is the problem with
GFS2, NFS, the SMP processors or something else?

How do I approach this problem and has anyone else seen this yet?

Thanks,
Tim


*****************************************************************
This e-mail and any files transmitted with it may be proprietary 
and are intended solely for the use of the individual or entity to 
whom they are addressed. If you have received this e-mail in 
error please notify the sender. Please note that any views or
opinions presented in this e-mail are solely those of the author 
and do not necessarily represent those of ITT Corporation. The 
recipient should check this e-mail and any attachments for the 
presence of viruses. ITT accepts no liability for any damage 
caused by any virus transmitted by this e-mail.
*******************************************************************


From robert at exa-omicron.nl  Thu Sep 13 14:39:30 2007
From: robert at exa-omicron.nl (Robert Verspuy)
Date: Thu, 13 Sep 2007 16:39:30 +0200
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC27@ACDFWMAIL1.acd.de.ittind.com>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC27@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <46E94BA2.5080303@exa-omicron.nl>

Ward, Timothy - SSD schreef:
> I am trying to setup nfs as a clustered service.  The filesystem nfs is
> serving is a GFS2 fs.  So I am trying to find out is the problem with
> GFS2, NFS, the SMP processors or something else?
>
> How do I approach this problem and has anyone else seen this yet?
>
> Thanks,
> Tim
>   
It depends on the cause or your problems.
I'm also trying to setup a cluster with GFS2, and have some problems
along the way. (not even started with the setup of gfs2 or nfs)
But my problems are related (I think) to spinlock bugs in the kernel-xen
of fedora 7.
And therefor I started with posting on the fedora-xen mailing list

See my 'cry for help' at
http://www.redhat.com/archives/fedora-xen/2007-September/msg00025.html

With kind regards,
Robert Verspuy

-- 
*Exa-Omicron*
Patroonsweg 10
3892 DB Zeewolde
Tel.: 088-OMICRON (66 427 66)
Fax: 088-66 427 69
http://www.exa-omicron.nl
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070913/60053965/attachment.htm>

From kanderso at redhat.com  Thu Sep 13 15:37:19 2007
From: kanderso at redhat.com (Kevin Anderson)
Date: Thu, 13 Sep 2007 10:37:19 -0500
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC27@ACDFWMAIL1.acd.de.ittind.com>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC27@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <1189697839.2836.12.camel@localhost.localdomain>

On Thu, 2007-09-13 at 10:32 -0400, Ward, Timothy - SSD wrote:
> I am trying to setup nfs as a clustered service.  The filesystem nfs is
> serving is a GFS2 fs.  So I am trying to find out is the problem with
> GFS2, NFS, the SMP processors or something else?
> 
We have fixed a number of bugs with GFS2 over the last few weeks, so
would point at GFS2 as being the issue.  Pick up the latest kernel from
Steve Whitehouses GIT tree.  It has a number of fixes, with a number of
additional ones queued up to be pushed shortly.

If you need a stable environment at this time, use GFS1 instead of gfs2.

Kevin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070913/78766bd0/attachment.htm>

From christian.brandes at forschungsgruppe.de  Thu Sep 13 16:10:35 2007
From: christian.brandes at forschungsgruppe.de (Christian Brandes)
Date: Thu, 13 Sep 2007 18:10:35 +0200
Subject: [Linux-cluster] GFS Performance Problem / coherency between GFS
 access speed and network bandwidth
Message-ID: <46E960FB.8020106@forschungsgruppe.de>

I found out, that there must be a coherency between GFS access speed and
network bandwidth allthough i/o is going through FC to the RAID in my 
4-node cluster.

It is 4 nodes each redundantly connected by 4GB-FC to the two
controllers of a FC-to-FC RAID System.
Devices are accessed through multipath-tools' /dev/mapper devices on the
primary path.
Each node has 1 100BaseT and 2 1000BaseTX network interfaces.

Before installing the cluster I did some performance tests such as
writing from /dev/zero to the RAID containing an XFS file system and now
with a GFS on it.

Here are some numbers:

FS      heartbeat-network      access-speed     latency
----------------------------------------------------------
GFS     10MBit/s               50MB/s           very high
GFS     100MBit/s              66MB/s           high
GFS     1000MBit/s             100MB/s          low
XFS                            260MB/s          none

What does GFS do over network when it is accessed by a node.
Does dlm use such a high network bandwith before the first bit is
written through FC to the RAID?

I had a VM on VMWare Server running on GFS that was interrupted every 10
seconds for one ore two seconds.
Now with 1000MBit/s it is not interrupted any more. But do not want to
waste more than 60% of our RAID's performance.

What can I do to speed up GFS?

Hope you have an idea.

Best regards

	Christian


From jbacik at redhat.com  Thu Sep 13 16:10:20 2007
From: jbacik at redhat.com (Josef Bacik)
Date: Thu, 13 Sep 2007 12:10:20 -0400
Subject: [Linux-cluster] GFS Performance Problem / coherency between
	GFS access speed and network bandwidth
In-Reply-To: <46E960FB.8020106@forschungsgruppe.de>
References: <46E960FB.8020106@forschungsgruppe.de>
Message-ID: <20070913161020.GB21270@dhcp-243-37.rdu.redhat.com>

On Thu, Sep 13, 2007 at 06:10:35PM +0200, Christian Brandes wrote:
> I found out, that there must be a coherency between GFS access speed and
> network bandwidth allthough i/o is going through FC to the RAID in my 
> 4-node cluster.
> 
> It is 4 nodes each redundantly connected by 4GB-FC to the two
> controllers of a FC-to-FC RAID System.
> Devices are accessed through multipath-tools' /dev/mapper devices on the
> primary path.
> Each node has 1 100BaseT and 2 1000BaseTX network interfaces.
> 
> Before installing the cluster I did some performance tests such as
> writing from /dev/zero to the RAID containing an XFS file system and now
> with a GFS on it.
> 
> Here are some numbers:
> 
> FS      heartbeat-network      access-speed     latency
> ----------------------------------------------------------
> GFS     10MBit/s               50MB/s           very high
> GFS     100MBit/s              66MB/s           high
> GFS     1000MBit/s             100MB/s          low
> XFS                            260MB/s          none
> 
> What does GFS do over network when it is accessed by a node.
> Does dlm use such a high network bandwith before the first bit is
> written through FC to the RAID?
> 
> I had a VM on VMWare Server running on GFS that was interrupted every 10
> seconds for one ore two seconds.
> Now with 1000MBit/s it is not interrupted any more. But do not want to
> waste more than 60% of our RAID's performance.
> 
> What can I do to speed up GFS?
> 

Are you mounting with noatime?  Everything GFS does requires a lock, so yeah
your network speed is going to directly affect how fast GFS is.  Using noatime
will cut down on the amount of dlm traffic you are doing so that should help
some.

Josef


From Alexandre.Racine at mhicc.org  Thu Sep 13 16:58:50 2007
From: Alexandre.Racine at mhicc.org (Alexandre Racine)
Date: Thu, 13 Sep 2007 12:58:50 -0400
Subject: [Linux-cluster] GFS Performance Problem / coherency betweenGFS
	access speed and network bandwidth
References: <46E960FB.8020106@forschungsgruppe.de>
	<20070913161020.GB21270@dhcp-243-37.rdu.redhat.com>
Message-ID: <C43CF0825BF59D4FBC1F6A2AF45EB88D3C3482@cumulonimbus.RG.local>

GFS Performance Tuning http://sourceware.org/cluster/faq.html#gfs_tuning


Alexandre Racine
Projets sp?ciaux
514-461-1300 poste 3304
alexandre.racine at mhicc.org


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Josef Bacik
Sent: Thu 2007-09-13 12:10
To: linux clustering
Subject: Re: [Linux-cluster] GFS Performance Problem / coherency betweenGFS access speed and network bandwidth
 
On Thu, Sep 13, 2007 at 06:10:35PM +0200, Christian Brandes wrote:
> I found out, that there must be a coherency between GFS access speed and
> network bandwidth allthough i/o is going through FC to the RAID in my 
> 4-node cluster.
> 
> It is 4 nodes each redundantly connected by 4GB-FC to the two
> controllers of a FC-to-FC RAID System.
> Devices are accessed through multipath-tools' /dev/mapper devices on the
> primary path.
> Each node has 1 100BaseT and 2 1000BaseTX network interfaces.
> 
> Before installing the cluster I did some performance tests such as
> writing from /dev/zero to the RAID containing an XFS file system and now
> with a GFS on it.
> 
> Here are some numbers:
> 
> FS      heartbeat-network      access-speed     latency
> ----------------------------------------------------------
> GFS     10MBit/s               50MB/s           very high
> GFS     100MBit/s              66MB/s           high
> GFS     1000MBit/s             100MB/s          low
> XFS                            260MB/s          none
> 
> What does GFS do over network when it is accessed by a node.
> Does dlm use such a high network bandwith before the first bit is
> written through FC to the RAID?
> 
> I had a VM on VMWare Server running on GFS that was interrupted every 10
> seconds for one ore two seconds.
> Now with 1000MBit/s it is not interrupted any more. But do not want to
> waste more than 60% of our RAID's performance.
> 
> What can I do to speed up GFS?
> 

Are you mounting with noatime?  Everything GFS does requires a lock, so yeah
your network speed is going to directly affect how fast GFS is.  Using noatime
will cut down on the amount of dlm traffic you are doing so that should help
some.

Josef

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 4126 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070913/f806ef85/attachment.bin>

From natecars at natecarlson.com  Thu Sep 13 20:15:52 2007
From: natecars at natecarlson.com (Nate Carlson)
Date: Thu, 13 Sep 2007 15:15:52 -0500 (CDT)
Subject: [Linux-cluster] Small Cluster, Port Trunking, To use switch or
	not?
In-Reply-To: <46E1797E.3020308@mathstat.dal.ca>
References: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>
	<46E12B74.6030406@interstudio.homeunix.net>
	<46b1e5210709070852q2ad913cfx4e5e61c59813b497@mail.gmail.com>
	<46E1797E.3020308@mathstat.dal.ca>
Message-ID: <Pine.LNX.4.62.0709131515270.24225@iron.msp.technicality.org>

On Fri, 7 Sep 2007, Balagopal Pillai wrote:
> For 802.3ad bonding mode, the switch needs to support lacp. Static 
> trunking feature on the switch is not enough. In your case with no 
> switch support, mode 6 or adaptive load balancing is a good option. 
> Round robin is the only mode that will give you more than an interface 
> worth of throughput on a single connection. But that needs some switch 
> support. (like cisco etherchannel for example) Also there is additional 
> overhead due to out of the order packets. The other modes will give 
> better aggregate throughput.

Well, if you use crossover cables, you can do it without switch support.. 
;)

------------------------------------------------------------------------
| nate carlson | natecars at natecarlson.com | http://www.natecarlson.com |
|       depriving some poor village of its idiot since 1981            |
------------------------------------------------------------------------


From Timothy.Ward at itt.com  Thu Sep 13 21:34:46 2007
From: Timothy.Ward at itt.com (Ward, Timothy - SSD)
Date: Thu, 13 Sep 2007 17:34:46 -0400
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <20070913160007.BFEE473326@hormel.redhat.com>
Message-ID: <77E700AE7021314DB6CDF6D6E8F661320396FC2C@ACDFWMAIL1.acd.de.ittind.com>

From: Kevin Anderson <kanderso at redhat.com>

On Thu, 2007-09-13 at 10:32 -0400, Ward, Timothy - SSD wrote:
> I am trying to setup nfs as a clustered service.  The filesystem nfs 
> is serving is a GFS2 fs.  So I am trying to find out is the problem 
> with GFS2, NFS, the SMP processors or something else?
> 
We have fixed a number of bugs with GFS2 over the last few weeks, so
would point at GFS2 as being the issue.  Pick up the latest kernel from
Steve Whitehouses GIT tree.  It has a number of fixes, with a number of
additional ones queued up to be pushed shortly.

If you need a stable environment at this time, use GFS1 instead of gfs2.

Kevin


------------------------------
Kevin,

Thank you for the valuable insight!

I used the updated gfs2 RPM that was updated for FC6.  Would you suggest
using FC5 with  gfs1 or is there a gfs1 RPM someplace for FC6?  I tried
to compile the code myself with limited success.

Thanks,
Tim 
*****************************************************************
This e-mail and any files transmitted with it may be proprietary 
and are intended solely for the use of the individual or entity to 
whom they are addressed. If you have received this e-mail in 
error please notify the sender. Please note that any views or
opinions presented in this e-mail are solely those of the author 
and do not necessarily represent those of ITT Corporation. The 
recipient should check this e-mail and any attachments for the 
presence of viruses. ITT accepts no liability for any damage 
caused by any virus transmitted by this e-mail.
*******************************************************************


From Abdel.Sadek at lsi.com  Thu Sep 13 22:09:20 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Thu, 13 Sep 2007 16:09:20 -0600
Subject: [Linux-cluster] SCSI-3 reservation on RHEL 5.0 native cluster
Message-ID: <C776378855970A4DADE4A476447F6391DEFFEF@NAMAIL3.ad.lsil.com>

Are the SCSI-3 reservations used by RedHat native cluster for scsi
fencing single threaded or multithreaded?

 
Thanks.

Abdel...

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070913/b8c20624/attachment.htm>

From kanderso at redhat.com  Thu Sep 13 22:35:06 2007
From: kanderso at redhat.com (Kevin Anderson)
Date: Thu, 13 Sep 2007 17:35:06 -0500
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <77E700AE7021314DB6CDF6D6E8F661320396FC2C@ACDFWMAIL1.acd.de.ittind.com>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC2C@ACDFWMAIL1.acd.de.ittind.com>
Message-ID: <1189722906.2730.37.camel@dhcp80-204.msp.redhat.com>

On Thu, 2007-09-13 at 17:34 -0400, Ward, Timothy - SSD wrote:
> From: Kevin Anderson <kanderso at redhat.com>
> 
> On Thu, 2007-09-13 at 10:32 -0400, Ward, Timothy - SSD wrote:
> > I am trying to setup nfs as a clustered service.  The filesystem nfs 
> > is serving is a GFS2 fs.  So I am trying to find out is the problem 
> > with GFS2, NFS, the SMP processors or something else?
> > 
> We have fixed a number of bugs with GFS2 over the last few weeks, so
> would point at GFS2 as being the issue.  Pick up the latest kernel from
> Steve Whitehouses GIT tree.  It has a number of fixes, with a number of
> additional ones queued up to be pushed shortly.
> 
> If you need a stable environment at this time, use GFS1 instead of gfs2.
> 
> Kevin
> 
> 
> ------------------------------
> Kevin,
> 
> Thank you for the valuable insight!
> 
> I used the updated gfs2 RPM that was updated for FC6.  Would you suggest
> using FC5 with  gfs1 or is there a gfs1 RPM someplace for FC6?  I tried
> to compile the code myself with limited success.
> 
We are working on getting gfs1 to work with FC7.  It is a bit
complicated due to gfs1 needing access to the dlm harness code that is
part of the gfs2 kernel module.  The symbols are not currently exported
in the gfs2 code, so a patch needs to be generated.  The patch isn't
something we believe would be accepted upstream since the symbols aren't
used by any other in kernel module.   Here is the exports you need:
--- linux-2.6.17.noarch/fs/gfs2/locking.c~      2006-08-10
13:33:09.000000000 -0
400
+++ linux-2.6.17.noarch/fs/gfs2/locking.c       2006-08-10
13:33:23.000000000 -0
400
@@ -188,4 +188,6 @@ void __init gfs2_init_lmh(void)
 
 EXPORT_SYMBOL_GPL(gfs2_register_lockproto);
 EXPORT_SYMBOL_GPL(gfs2_unregister_lockproto);
-
+EXPORT_SYMBOL_GPL(gfs2_withdraw_lockproto);
+EXPORT_SYMBOL_GPL(gfs2_mount_lockproto);
+EXPORT_SYMBOL_GPL(gfs2_unmount_lockproto);


This is from our RHEL5 tree, so not sure it will apply directly, but you
can see what gets added.  Am sure there are probably other issues with
building against the FC6 environment, but this is a start.  

We have been primarily focused on getting GFS2 stable, but one of the
guys is going to spend sometime over the next couple of weeks working on
getting FC7 rpms built for gfs.

Kevin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070913/b18c9f7a/attachment.htm>

From frankie.montenegro at gmail.com  Thu Sep 13 23:46:38 2007
From: frankie.montenegro at gmail.com (Frankie Montenegro)
Date: Thu, 13 Sep 2007 11:46:38 -1200
Subject: [Linux-cluster] Small Cluster, Port Trunking,
	To use switch or not?
In-Reply-To: <Pine.LNX.4.62.0709131515270.24225@iron.msp.technicality.org>
References: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com>
	<46E12B74.6030406@interstudio.homeunix.net>
	<46b1e5210709070852q2ad913cfx4e5e61c59813b497@mail.gmail.com>
	<46E1797E.3020308@mathstat.dal.ca>
	<Pine.LNX.4.62.0709131515270.24225@iron.msp.technicality.org>
Message-ID: <46b1e5210709131646o30c13481t1881a6a5f941bc1b@mail.gmail.com>

Will check that out. Thank you guys. My stuff has arrived, I can't wait for
the  weekend to put it together.

BTW, I wondered what FS would be the best
option for the good throughput as well as sharing of the disk space. We have
NFS at school on a 12node dual opteron cluster, and, based on my experience
with it, I'll try to stay as far away from it as possible. GFS or PVFS2 or...?
Thanks again,
Frankie

On 9/13/07, Nate Carlson <natecars at natecarlson.com> wrote:
> On Fri, 7 Sep 2007, Balagopal Pillai wrote:
> > For 802.3ad bonding mode, the switch needs to support lacp. Static
> > trunking feature on the switch is not enough. In your case with no
> > switch support, mode 6 or adaptive load balancing is a good option.
> > Round robin is the only mode that will give you more than an interface
> > worth of throughput on a single connection. But that needs some switch
> > support. (like cisco etherchannel for example) Also there is additional
> > overhead due to out of the order packets. The other modes will give
> > better aggregate throughput.
>
> Well, if you use crossover cables, you can do it without switch support..
> ;)
>
> ------------------------------------------------------------------------
> | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com |
> |       depriving some poor village of its idiot since 1981            |
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From pillai at mathstat.dal.ca  Fri Sep 14 00:20:19 2007
From: pillai at mathstat.dal.ca (Balagopal Pillai)
Date: Thu, 13 Sep 2007 21:20:19 -0300 (ADT)
Subject: [Linux-cluster] Small Cluster, Port Trunking, To use switch or
	not?
In-Reply-To: <46b1e5210709131646o30c13481t1881a6a5f941bc1b@mail.gmail.com>
References: <46b1e5210709070328k41d02047sc0ee4e0730c78376@mail.gmail.com> 
	<46E12B74.6030406@interstudio.homeunix.net> 
	<46b1e5210709070852q2ad913cfx4e5e61c59813b497@mail.gmail.com> 
	<46E1797E.3020308@mathstat.dal.ca>
	<Pine.LNX.4.62.0709131515270.24225@iron.msp.technicality.org>
	<46b1e5210709131646o30c13481t1881a6a5f941bc1b@mail.gmail.com>
Message-ID: <Pine.LNX.4.64.0709132104280.1621@chase.mathstat.dal.ca>

On Thu, 13 Sep 2007, Frankie Montenegro wrote:
Well, i evaluated gfs over gnbd and bonded interfaces a few months ago and 
was quite happy with the performance. But it was crashing quite often 
and i didn't need the complexity of quorum and fencing issues when looking 
for just a cluster file system and not high availability features. I guess 
there is no way around complex fencing when every node accesses the block 
device through the cluster file system. Its the same with ocfs2 too, but 
fencing is much simpler with ocfs2 and less number of daemons to mess 
with compared to gfs, where quite a few daemons need to come up in a 
particular order.

          I have been using Lustre for almost 2 yrs now and quite happy 
with it. I posted a few problems about gnbd errors before crashes, didn't 
get much help. So went back to Lustre and I am still satisfied with the 
choice. I was evaluating the cluster file systems for the two clusters i 
finished in the past 2 months, the first one with one storage server and 6 
TB of storage and second one with >65TB of available storage space with 
multiple storage servers, bonding, parallel i/o and all bells and whistles. Both 
run Lustre. The new version is quite easy to maintain. Since i was already 
comfortable with Lustre, my opinions could be biased. 

        But to be fair, the throughput of gfs over gnbd was very good. But 
lack of parallel i/o and having to deal with quorum issues when all you 
need is a cluster file system, make me go back to Lustre. Hope this 
helps!!

Regards
Balagopal 


> Will check that out. Thank you guys. My stuff has arrived, I can't wait for
> the  weekend to put it together.
> 
> BTW, I wondered what FS would be the best
> option for the good throughput as well as sharing of the disk space. We have
> NFS at school on a 12node dual opteron cluster, and, based on my experience
> with it, I'll try to stay as far away from it as possible. GFS or PVFS2 or...?
> Thanks again,
> Frankie
> 
> On 9/13/07, Nate Carlson <natecars at natecarlson.com> wrote:
> > On Fri, 7 Sep 2007, Balagopal Pillai wrote:
> > > For 802.3ad bonding mode, the switch needs to support lacp. Static
> > > trunking feature on the switch is not enough. In your case with no
> > > switch support, mode 6 or adaptive load balancing is a good option.
> > > Round robin is the only mode that will give you more than an interface
> > > worth of throughput on a single connection. But that needs some switch
> > > support. (like cisco etherchannel for example) Also there is additional
> > > overhead due to out of the order packets. The other modes will give
> > > better aggregate throughput.
> >
> > Well, if you use crossover cables, you can do it without switch support..
> > ;)
> >
> > ------------------------------------------------------------------------
> > | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com |
> > |       depriving some poor village of its idiot since 1981            |
> > ------------------------------------------------------------------------
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> 


From basv at sara.nl  Fri Sep 14 09:54:05 2007
From: basv at sara.nl (Bas van der Vlies)
Date: Fri, 14 Sep 2007 11:54:05 +0200
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <1189722906.2730.37.camel@dhcp80-204.msp.redhat.com>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC2C@ACDFWMAIL1.acd.de.ittind.com>
	<1189722906.2730.37.camel@dhcp80-204.msp.redhat.com>
Message-ID: <46EA5A3D.9020806@sara.nl>

Kevin Anderson wrote:

> We have been primarily focused on getting GFS2 stable, but one of the 
> guys is going to spend sometime over the next couple of weeks working on 
> getting FC7 rpms built for gfs.
> 
> Kevin
> 
Kevin,

  Just a question is the STABLE GFS1 branch still supported or must i wait 
till GFS2 becomes stable? Because i do not see much activity in STABLE GFS1 
branch.

Regards


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From mad at wol.de  Fri Sep 14 10:10:42 2007
From: mad at wol.de (Marc - A. Dahlhaus [ Administration | Westermann GmbH ])
Date: Fri, 14 Sep 2007 12:10:42 +0200
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <46EA5A3D.9020806@sara.nl>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC2C@ACDFWMAIL1.acd.de.ittind.com>
	<1189722906.2730.37.camel@dhcp80-204.msp.redhat.com>
	<46EA5A3D.9020806@sara.nl>
Message-ID: <1189764642.15265.10.camel@marc>

Am Freitag, den 14.09.2007, 11:54 +0200 schrieb Bas van der Vlies:
> Kevin Anderson wrote:
> 
> > We have been primarily focused on getting GFS2 stable, but one of the 
> > guys is going to spend sometime over the next couple of weeks working on 
> > getting FC7 rpms built for gfs.
> > 
> > Kevin
> > 
> Kevin,
> 
>   Just a question is the STABLE GFS1 branch still supported or must i wait 
> till GFS2 becomes stable? Because i do not see much activity in STABLE GFS1 
> branch.
> 
> Regards

I'm currently working on patches against STABLE which will bring STABLE
back in line with RHEL4. I'll send them next week on cluster-devel list.

You can find some conversation on this here:
https://www.redhat.com/archives/cluster-devel/2007-September/msg00045.html


Marc


From jaap at sara.nl  Fri Sep 14 10:14:57 2007
From: jaap at sara.nl (Jaap Dijkshoorn)
Date: Fri, 14 Sep 2007 12:14:57 +0200
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <1189764642.15265.10.camel@marc>
Message-ID: <ADF94D8555C7A246B86A633685E0178A3FD99726C6@planck.ka.sara.nl>

Hi Guys,

It would be nice to have the NFS fixes in STABLE branche then also.

Regards,
Jaap


> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Marc -
> A. Dahlhaus [ Administration | Westermann GmbH ]
> Sent: vrijdag 14 september 2007 12:11
> To: linux clustering
> Subject: Re: [Linux-cluster] Cluster NFS causes kernel bug
>
> Am Freitag, den 14.09.2007, 11:54 +0200 schrieb Bas van der Vlies:
> > Kevin Anderson wrote:
> >
> > > We have been primarily focused on getting GFS2 stable,
> but one of the
> > > guys is going to spend sometime over the next couple of
> weeks working on
> > > getting FC7 rpms built for gfs.
> > >
> > > Kevin
> > >
> > Kevin,
> >
> >   Just a question is the STABLE GFS1 branch still supported
> or must i wait
> > till GFS2 becomes stable? Because i do not see much
> activity in STABLE GFS1
> > branch.
> >
> > Regards
>
> I'm currently working on patches against STABLE which will
> bring STABLE
> back in line with RHEL4. I'll send them next week on
> cluster-devel list.
>
> You can find some conversation on this here:
> https://www.redhat.com/archives/cluster-devel/2007-September/m
> sg00045.html
>
>
> Marc
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From maciej.bogucki at artegence.com  Fri Sep 14 10:22:05 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Fri, 14 Sep 2007 12:22:05 +0200
Subject: [Linux-cluster] SCSI-3 reservation on RHEL 5.0 native cluster
In-Reply-To: <C776378855970A4DADE4A476447F6391DEFFEF@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391DEFFEF@NAMAIL3.ad.lsil.com>
Message-ID: <46EA60CD.2020205@artegence.com>

Sadek, Abdel napisa?(a):
> Are the SCSI-3 reservations used by RedHat native cluster for scsi
> fencing single threaded or multithreaded?

Hello,

In RHCS SCSI-3 reservations fencing is done via two scripts. Bash
/etc/rc.d/init.d/scsi_reserve startup script and perl /sbin/fence_scsi.
Why You need to know if it is single or multithreaded? Here preformance
isn't the problem.

Best Regards
Maciej Bogucki


From maciej.bogucki at artegence.com  Fri Sep 14 10:39:25 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Fri, 14 Sep 2007 12:39:25 +0200
Subject: [Linux-cluster] fence_scsi agent on RHEL 4.5
In-Reply-To: <C776378855970A4DADE4A476447F6391DEFB64@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391DEFB64@NAMAIL3.ad.lsil.com>
Message-ID: <46EA64DD.6000402@artegence.com>

Sadek, Abdel napisa?(a):
> I am running a 2-node cluster with RHEL 4.5 Native cluster. I am using
> scsi persistent reservation as my fencing device. I have noticed when I
> shutdown one of the nodes, the fence_scsi agent on the surviving node
> fails to fence the dying node. I get the following message:
> 
> Sep 11 16:18:13 troy fenced[3614]: agent "fence_scsi" reports: parse
> error: unknown option "nodename=porsche"
> 
> Sep 11 16:18:13 troy fenced[3614]: fence "porsche" failed
> 
>  
> 
> it looks like the fence_scsi command is executed using with the nodename
> parameter instead of the ?n option.
> 
> when I run fence_scsi  -h I get the following (there is no nodename
> parameter)
> 
> Usage
> 
> fence_scsi [options]
> 
> Options
> 
>   -n <node>        IP address or hostname of node to fence
> 
>   -h               usage
> 
>   -V               version
> 
>   -v               verbose
> 
>  
> 
> But the man page of the fence_scsi command talks about using both the
> ??n? and ?nodename=? options.
> 
> So, how do I make the fence_scsi run with the ?n instead of the
> nodename= option?

Hello,

Please attache Your cluster.conf file

Best Regards
Maciej Bogucki


From teigland at redhat.com  Fri Sep 14 14:01:29 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 14 Sep 2007 09:01:29 -0500
Subject: [Linux-cluster] Cluster NFS causes kernel bug
In-Reply-To: <46EA5A3D.9020806@sara.nl>
References: <77E700AE7021314DB6CDF6D6E8F661320396FC2C@ACDFWMAIL1.acd.de.ittind.com>
	<1189722906.2730.37.camel@dhcp80-204.msp.redhat.com>
	<46EA5A3D.9020806@sara.nl>
Message-ID: <20070914140129.GC18955@redhat.com>

On Fri, Sep 14, 2007 at 11:54:05AM +0200, Bas van der Vlies wrote:
> Kevin Anderson wrote:
> 
> >We have been primarily focused on getting GFS2 stable, but one of the 
> >guys is going to spend sometime over the next couple of weeks working on 
> >getting FC7 rpms built for gfs.
> >
> >Kevin
> >
> Kevin,
> 
>  Just a question is the STABLE GFS1 branch still supported or must i wait 
> till GFS2 becomes stable? Because i do not see much activity in STABLE GFS1 

Can't you can simply use GFS1 from the HEAD or RHEL5 branches?

Dave


From Abdel.Sadek at lsi.com  Fri Sep 14 14:37:46 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Fri, 14 Sep 2007 08:37:46 -0600
Subject: [Linux-cluster] fence_scsi agent on RHEL 4.5
In-Reply-To: <46EA64DD.6000402@artegence.com>
Message-ID: <C776378855970A4DADE4A476447F6391DF0082@NAMAIL3.ad.lsil.com>

Hello Maciej;
Please find attached my cluster.conf file.
 
Thanks.
Abdel..
 
-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Maciej Bogucki
Sent: Friday, September 14, 2007 5:39 AM
To: linux clustering
Subject: Re: [Linux-cluster] fence_scsi agent on RHEL 4.5
 
Sadek, Abdel napisa?(a):
> I am running a 2-node cluster with RHEL 4.5 Native cluster. I am using
> scsi persistent reservation as my fencing device. I have noticed when I
> shutdown one of the nodes, the fence_scsi agent on the surviving node
> fails to fence the dying node. I get the following message:
> 
> Sep 11 16:18:13 troy fenced[3614]: agent "fence_scsi" reports: parse
> error: unknown option "nodename=porsche"
> 
> Sep 11 16:18:13 troy fenced[3614]: fence "porsche" failed
> 
>  
> 
> it looks like the fence_scsi command is executed using with the nodename
> parameter instead of the -n option.
> 
> when I run fence_scsi  -h I get the following (there is no nodename
> parameter)
> 
> Usage
> 
> fence_scsi [options]
> 
> Options
> 
>   -n <node>        IP address or hostname of node to fence
> 
>   -h               usage
> 
>   -V               version
> 
>   -v               verbose
> 
>  
> 
> But the man page of the fence_scsi command talks about using both the
> "-n" and "nodename=" options.
> 
> So, how do I make the fence_scsi run with the -n instead of the
> nodename= option?
 
Hello,
 
Please attache Your cluster.conf file
 
Best Regards
Maciej Bogucki
 
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070914/9ce927a9/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: application/octet-stream
Size: 2178 bytes
Desc: cluster.conf
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070914/9ce927a9/attachment.obj>

From christopher.barry at qlogic.com  Fri Sep 14 18:06:38 2007
From: christopher.barry at qlogic.com (Christopher Barry)
Date: Fri, 14 Sep 2007 13:06:38 -0500
Subject: [Linux-cluster] DLM Documentation
Message-ID: <D158540CCC0AB54C8FD4818F823CCB24190668@EPEXCH1.qlogic.org>


Hi,

Can someone point me to some good docs on the DLM?

I'm interested in learning about what it's doing and how it's doing it.


Thanks,
Chris
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070914/6704b746/attachment.htm>

From teigland at redhat.com  Fri Sep 14 18:31:23 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 14 Sep 2007 13:31:23 -0500
Subject: [Linux-cluster] DLM Documentation
In-Reply-To: <D158540CCC0AB54C8FD4818F823CCB24190668@EPEXCH1.qlogic.org>
References: <D158540CCC0AB54C8FD4818F823CCB24190668@EPEXCH1.qlogic.org>
Message-ID: <20070914183123.GF18955@redhat.com>

On Fri, Sep 14, 2007 at 01:06:38PM -0500, Christopher Barry wrote:
> 
> Hi,
> 
> Can someone point me to some good docs on the DLM?
> 
> I'm interested in learning about what it's doing and how it's doing it.

Nothing exists on the implementation details.  If you want to look through
code, use what's in the upstream kernel (linux/fs/dlm).  Otherwise, the
ibm opendlm document is the best I know of: the general dlm info is good
and consistent with ours, and the API is quite similar.

http://opendlm.sourceforge.net/cvsmirror/opendlm/docs/dlmbook_final.pdf

Dave


From Abdel.Sadek at lsi.com  Fri Sep 14 20:36:56 2007
From: Abdel.Sadek at lsi.com (Sadek, Abdel)
Date: Fri, 14 Sep 2007 14:36:56 -0600
Subject: [Linux-cluster] SCSI-3 reservation on RHEL 5.0 native cluster
In-Reply-To: <46EA60CD.2020205@artegence.com>
Message-ID: <C776378855970A4DADE4A476447F6391DF0198@NAMAIL3.ad.lsil.com>

Hello;
we need to understand how it works in order for our developers to make it work with our own multipath driver.

Thanks.
Abdel...
-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Maciej Bogucki
Sent: Friday, September 14, 2007 5:22 AM
To: linux clustering
Subject: Re: [Linux-cluster] SCSI-3 reservation on RHEL 5.0 native cluster

Sadek, Abdel napisa?(a):
> Are the SCSI-3 reservations used by RedHat native cluster for scsi
> fencing single threaded or multithreaded?

Hello,

In RHCS SCSI-3 reservations fencing is done via two scripts. Bash
/etc/rc.d/init.d/scsi_reserve startup script and perl /sbin/fence_scsi.
Why You need to know if it is single or multithreaded? Here preformance
isn't the problem.

Best Regards
Maciej Bogucki

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From kapil_maheshwari at persistent.co.in  Sat Sep 15 13:35:37 2007
From: kapil_maheshwari at persistent.co.in (Kapil Maheshwari)
Date: Sat, 15 Sep 2007 19:05:37 +0530
Subject: [Linux-cluster] required  Vol 41, Issue 14
In-Reply-To: <20070912160006.DC9AA736B0@hormel.redhat.com>
References: <20070912160006.DC9AA736B0@hormel.redhat.com>
Message-ID: <000001c7f79d$4d6bb920$1a44580a@persistent.co.in>


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of
linux-cluster-request at redhat.com
Sent: Wednesday, September 12, 2007 9:30 PM
To: linux-cluster at redhat.com
Subject: Linux-cluster Digest, Vol 41, Issue 14

Send Linux-cluster mailing list submissions to
	linux-cluster at redhat.com

To subscribe or unsubscribe via the World Wide Web, visit
	https://www.redhat.com/mailman/listinfo/linux-cluster
or, via email, send a message with subject or body 'help' to
	linux-cluster-request at redhat.com

You can reach the person managing the list at
	linux-cluster-owner at redhat.com

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Linux-cluster digest..."


Today's Topics:

   1. GNBD Problems loading module (notol Perc)
   2. fence_scsi agent on RHEL 4.5 (Sadek, Abdel)
   3. changing configuration (Joel Becker)
   4. RHEL4.5, GFS and selinux, are they playing nice? (Roger Pe?a)
   5. Re: RE: qdisk votes not in cman (Alain Richard)
   6. Services timeout (Jordi Prats)
   7. Re: DLM - Lock Value Block error (Patrick Caulfield)


----------------------------------------------------------------------

Message: 1
Date: Tue, 11 Sep 2007 19:32:07 +0000
From: "notol Perc" <furor_hater at hotmail.com>
Subject: [Linux-cluster] GNBD Problems loading module
To: linux-cluster at redhat.com
Message-ID: <BAY121-F37184D72844EF4310B3C3286C10 at phx.gbl>
Content-Type: text/plain; format=flowed

Using the latest CVS Cluster Source (09-11-2007) I have configured a cluster

on kernel 2.6.23-rc5 (running under Debian Etch)

I can get everything running short of importing GNBD due to the fact that I 
can not find the kernal module.


I can directly make cluster/gnbd-kernel/src/ I get the following:

make -C /usr/src/linux-2.6.23-rc5 M=/usr/src/cluster/gnbd-kernel/src 
symverfile=/usr/src/linux-2.6.23-rc5/Module.symvers modules USING_KBUILD=yes
make[1]: Entering directory `/usr/src/linux-2.6.23-rc5'
  Building modules, stage 2.
  MODPOST 1 modules
make[1]: Leaving directory `/usr/src/linux-2.6.23-rc5'

then make install

make -C /usr/src/linux-2.6.23-rc5 M=/usr/src/cluster/gnbd-kernel/src 
symverfile=/usr/src/linux-2.6.23-rc5/Module.symvers modules USING_KBUILD=yes
make[1]: Entering directory `/usr/src/linux-2.6.23-rc5'
  Building modules, stage 2.
  MODPOST 1 modules
make[1]: Leaving directory `/usr/src/linux-2.6.23-rc5'
install -d /usr/include/linux
install gnbd.h /usr/include/linux
install -d /lib/modules/`uname -r`/kernel/drivers/block/gnbd
install gnbd.ko /lib/modules/`uname -r`/kernel/drivers/block/gnbd

Ca some one pleas help be get this going?

_________________________________________________________________
Get a FREE small business Web site and more from Microsoft. Office Live! 
http://clk.atdmt.com/MRT/go/aub0930003811mrt/direct/01/


------------------------------

Message: 2
Date: Tue, 11 Sep 2007 15:27:16 -0600
From: "Sadek, Abdel" <Abdel.Sadek at lsi.com>
Subject: [Linux-cluster] fence_scsi agent on RHEL 4.5
To: <Linux-cluster at redhat.com>
Message-ID:
	<C776378855970A4DADE4A476447F6391DEFB64 at NAMAIL3.ad.lsil.com>
Content-Type: text/plain; charset="us-ascii"

I am running a 2-node cluster with RHEL 4.5 Native cluster. I am using
scsi persistent reservation as my fencing device. I have noticed when I
shutdown one of the nodes, the fence_scsi agent on the surviving node
fails to fence the dying node. I get the following message:
Sep 11 16:18:13 troy fenced[3614]: agent "fence_scsi" reports: parse
error: unknown option "nodename=porsche"
Sep 11 16:18:13 troy fenced[3614]: fence "porsche" failed
 
it looks like the fence_scsi command is executed using with the nodename
parameter instead of the -n option.
when I run fence_scsi  -h I get the following (there is no nodename
parameter)
Usage
fence_scsi [options]
Options
  -n <node>        IP address or hostname of node to fence
  -h               usage
  -V               version
  -v               verbose
 
But the man page of the fence_scsi command talks about using both the
"-n" and "nodename=" options.
So, how do I make the fence_scsi run with the -n instead of the
nodename= option?
 
Thanks.
Abdel...
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
https://www.redhat.com/archives/linux-cluster/attachments/20070911/068386c2/
attachment.html

------------------------------

Message: 3
Date: Tue, 11 Sep 2007 16:46:08 -0700
From: Joel Becker <Joel.Becker at oracle.com>
Subject: [Linux-cluster] changing configuration
To: linux-cluster at redhat.com
Message-ID: <20070911234607.GD27482 at tasint.org>
Content-Type: text/plain; charset=us-ascii

Hey everyone,
	How do I update the IP addresses of existing nodes?
	I have a simple cluster.  I had two nodes on a private network
(10.x.x.x).  I decided to add two more nodes, but they are only on the
public network.  So I wanted to add them as well as change the existing
nodes to use the public network.
	I shut down cman/ccs on all nodes.  I edited cluster.conf.  I
started cman back on one node, and I ensured that cman_tool went to the
new version of the config via "cman_tool version -r N+1".
	The problem is that it still appears to be using the private
network addresses.  I see this in the log and with "cman_tool nodes -a".
	What can I do to fix this, short of hunting down all cman and
openais droppings and removing them?  I want the "right" way :-)

Joel

-- 

"To fall in love is to create a religion that has a fallible god."
        -Jorge Luis Borges

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


------------------------------

Message: 4
Date: Tue, 11 Sep 2007 18:42:32 -0700 (PDT)
From: Roger Pe?a <orkcu at yahoo.com>
Subject: [Linux-cluster] RHEL4.5, GFS and selinux, are they playing
	nice?
To: RedHat Cluster Suit <Linux-cluster at redhat.com>
Message-ID: <724236.51256.qm at web50608.mail.re2.yahoo.com>
Content-Type: text/plain; charset=iso-8859-1

Hello everybody ;-)

I keep working in making a web cluster play nice after
the upgrade from RHEL4.4 -> RHEL4.5 
with this upgrade, the relation httpd-selinux become
more strict, my first problem came when the RHGFS4.4
do not support xattr (our web content is in a gfs
filesystem) so I must update RHGFS and RHCS to 4.5
(from centos recompilation)

so now I have support to xattr in ours GFS filesystems
but, here is the problem:
the httpd do not want to start because some config
files (witch reside in another GFS filesystem) have a
forbidden context (httpd can not read file with that
context) (those files are included from the main
apache configuration)
even if I change the context and ls -Z show me that I
change the context for every parent and final dir in
the GFS filesystem.
here are the error from selinux:
{ search } for  pid=2289 comm="httpd" name="/"
dev=dm-7 ino=25  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  
tclass=dir

as you can see, selinux is dening access to httpd
process to make a search in / (root of the filesystem
in device dm-7), with inode 25 and that inode is a
directory, it deny access because the context of that
directory is system_u:object_r:nfs_t 
 am I right?

but, that directory is /opt/soft:
ll -di /opt/soft/
25 drwxr-xr-x  8 root root 3864 Sep 11  2007
/opt/soft/
^^ <--- this is the inode

and it context is system_u:object_r:httpd_config_t:
ll -dZ /opt/soft/
drwxr-xr-x  root     root    
system_u:object_r:httpd_config_t /opt/soft/

so, who is wrong? ls -Z or "global selinux kernel
module" ?
because ls -Z show that the context of that directory
is system_u:object_r:httpd_config_t

if I set selinux to be in permissive mode, then apache
can start, of course, but with some complains like
this:

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.151:38): avc:  denied  { search } for
 pid=2333 comm="httpd" name="/" dev=dm-7 ino=25  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  tclass=dir

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.155:39): avc:  denied  { getattr }
for  pid=2333 comm="httpd" name="apache" dev=dm-7
ino=31  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  tclass=dir

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.155:40): avc:  denied  { read } for 
pid=2333 comm="httpd" name="apache" dev=dm-7 ino=31  
scontext=root:system_r:httpd_t
tcontext=system_u:object_r:nfs_t  tclass=dir

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.158:41): avc:  denied  { getattr }
for  pid=2333 comm="httpd" name="httpd.conf" dev=dm-7 

ino=484983 scontext=root:system_r:httpd_t  
tcontext=system_u:object_r:nfs_t tclass=file

Sep 11 14:18:08 blade26 kernel:
audit(1189534688.158:42): avc:  denied  { read } for 
pid=2333 comm="httpd" name="httpd.conf" dev=dm-7  
ino=484983 scontext=root:system_r:httpd_t  
tcontext=system_u:object_r:nfs_t tclass=file

this mean:
access deny to do 
1- search in /opt/soft
2- getattr and read directory /opt/soft/conf/apache
3- getattr and read file httpd.conf

but:
all this files or directory has context 
system_u:object_r:httpd_config_t 

ll -dZ /opt/soft/conf/apache/
drwxr-xr-x  root root system_u:object_r:httpd_config_t
 
/opt/soft/conf/apache/

ll -di /opt/soft/conf/apache/
31 drwxr-xr-x  2 root root 3864 Sep 11 09:44
/opt/soft/conf/apache/


is this related to the fact that selinux policy stated
this:
genfscon gfs /                 system_u:object_r:nfs_t

what do you recomment to solve this complains of
selinux?
mount the gfs filesystem with the option fscontext ?

but that filesystem has other stuff, not related with
apache, so, what context should I use?


thanks
roger


__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________
________
Don't let your dream ride pass you by. Make it a reality with Yahoo! Autos.
http://autos.yahoo.com/index.html
 

------------------------------

Message: 5
Date: Wed, 12 Sep 2007 07:05:43 +0200
From: Alain Richard <alain.richard at equation.fr>
Subject: Re: [Linux-cluster] RE: qdisk votes not in cman
To: linux clustering <linux-cluster at redhat.com>
Message-ID: <CA0AA44E-8956-4826-8083-3FD0976D3D58 at equation.fr>
Content-Type: text/plain; charset="iso-8859-1"


Le 4 sept. 07 ` 23:13, Lon Hohberger a icrit :

> On Fri, Aug 31, 2007 at 12:46:50PM +0200, Alain RICHARD wrote:
>> Perhaps a better error reporting is needed in qdiskd to shows that we
>> have hit this problem. Also using a generic name like "qdisk device"
>> when qdiskd is registering its node to cman is a better approach.
>
> What about using the label instead of the device name, and restricting
> the label to 16 chars when advertising to cman?
>
> -- Lon

Because when using multipath devices (for example a two paths  
device), all the paths and the multi-path device are recognized as  
having the same label, so qdisk fails to get the good device (the  
multi-path device).

Regards,

-- 
Alain RICHARD <mailto:alain.richard at equation.fr>
EQUATION SA <http://www.equation.fr/>
Tel : +33 477 79 48 00     Fax : +33 477 79 48 01
Applications client/serveur, inginierie riseau et Linux

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
https://www.redhat.com/archives/linux-cluster/attachments/20070912/6510728f/
attachment.html

------------------------------

Message: 6
Date: Wed, 12 Sep 2007 09:14:04 +0200
From: Jordi Prats <jprats at cesca.es>
Subject: [Linux-cluster] Services timeout
To: linux-cluster at redhat.com
Message-ID: <46E791BC.2090006 at cesca.es>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi,
I have a NFS server with RedHat Cluster. Sometimes when is on heavy load 
it sets the service status to failed. There's no fs corruption and no 
daemon is down. I suspect this is caused by some timeout while is 
checking the fs is mounted. There is any way to define the check 
interval or the check timeout?

Thank you!
Jordi

-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputacis de Catalunya

  Gran Capit`, 2-4 (Edifici Nexus) 7 08034 Barcelona
  T. 93 205 6464 7 F.  93 205 6979 7 jprats at cesca.es
...................................................................... 


------------------------------

Message: 7
Date: Wed, 12 Sep 2007 12:45:41 +0100
From: Patrick Caulfield <pcaulfie at redhat.com>
Subject: Re: [Linux-cluster] DLM - Lock Value Block error
To: linux clustering <linux-cluster at redhat.com>
Message-ID: <46E7D165.4040301 at redhat.com>
Content-Type: text/plain; charset=ISO-8859-1

Christos Triantafillou wrote:
> Hi,
>  
> I am using RHEL 4.5 and DLM 1.0.3 on a 4-node cluster.
>  
> I noticed the following regarding the LVB:
> 1. there are two processes: one that sets the LVB of a resource while
> holding an EX lock
> and another one that has a NL lock on the same resource and is blocked
> on a dlm_lock_wait
> for getting a CR lock and reading the LVB.
> 2. when the first process is interrupted with control-C or killed, the
> second process gets
> an invalid LVB error.
> 
> It seems that DLM falsely releases the resource after the first process
> is gone and then
> the second process reads an uninitialized LVB.
>  
> Can you please confirm this error and create a bug report if necessary?

I've just run the program on VMS and it exhibits exactly the same behaviour.

Therefore I suspect this is not a bug ;-)

-- 
Patrick


------------------------------

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

End of Linux-cluster Digest, Vol 41, Issue 14
*********************************************


DISCLAIMER
==========
This e-mail may contain privileged and confidential information which is the property of Persistent Systems Pvt. Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Pvt. Ltd. does not accept any liability for virus infected mails.


From alain.richard at equation.fr  Mon Sep 17 08:06:03 2007
From: alain.richard at equation.fr (Alain Richard)
Date: Mon, 17 Sep 2007 10:06:03 +0200
Subject: [Linux-cluster] Multiple TCP/IP adresses for cman communication
Message-ID: <23A272DA-6127-46B0-8A59-3498D3245BC6@equation.fr>

Hi,

I am currently using a pretty common setup, using a two or more nodes  
cluster,  one or more iSCSI SANs, at least 3 gigabyte interfaces on  
each nodes (one for cluter member cman communication, and 2 for iSCSI  
communication) :


     switch 1
     |       |
node 1      node 2
  |   |      |   |
  |   Switch 2   |
  |     |        |
  |    SAN       |
  |     |        |
  ???Switch 3----|

On each switches, I use a separate network :

10.10.10.0/24 for switch 1, cman communication and for communicating  
with the rest of the world,
10.10.20.0/24 for switch 2 and first port of the SAN and nodes,
10.10.30.0/24 for switch 3 and second port of the SAN and nodes.

My SAN luns are multipathed on the nodes.

This setup works very well for me and I achieve both redundancy and  
load balancing for access from nodes to SAN.

I am using Centos 5 (RHEL 5) on the nodes and cman use the default  
configuration of using multicast on the switch 1 network.

I wonder if it is possible to make cman use the two other interfaces  
as well for its communications between cluster nodes ?

Doing so, I will be able to support the fail of any of the 3 switches  
without having the cluster members fenced in any way.

Regards,

-- 
Alain RICHARD <mailto:alain.richard at equation.fr>
EQUATION SA <http://www.equation.fr/>
Tel : +33 477 79 48 00     Fax : +33 477 79 48 01
Applications client/serveur, ing?nierie r?seau et Linux


From sebastian.walter at fu-berlin.de  Mon Sep 17 10:11:13 2007
From: sebastian.walter at fu-berlin.de (Sebastian Walter)
Date: Mon, 17 Sep 2007 12:11:13 +0200
Subject: [Linux-cluster] Buffer I/O error on device diapered_dm-2
Message-ID: <46EE52C1.7070207@fu-berlin.de>

Dear list,

I get the following errors on my CentOS 4.5/Dell/Xeon Cluster with
Infortrend/QLogic fiber channel SAN:

Sep 17 11:24:21 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340454
Sep 17 11:24:38 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340455
Sep 17 11:24:56 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340456
Sep 17 11:25:14 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340457
Sep 17 11:25:32 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340458
Sep 17 11:25:49 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340459
Sep 17 11:26:07 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340460
Sep 17 11:26:25 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340461
Sep 17 11:26:42 dtm kernel: Buffer I/O error on device diapered_dm-2,
logical block 128340462
Sep 17 11:27:39 dtm clurgmgrd: [4713]: <err> clusterfs:gfs-dtm-home:
Mount point is not accessible!

Then the filesystem is not mountable on this node anymore (it's set up
using rgmanager to mount the gfs). On the other nodes, there are no
problems. How can I debug this if it's dependant on the hardware or
something is wrong in the filesystem?

Thanks, regards
Sebastian


From christian.brandes at forschungsgruppe.de  Mon Sep 17 16:17:29 2007
From: christian.brandes at forschungsgruppe.de (Christian Brandes)
Date: Mon, 17 Sep 2007 18:17:29 +0200
Subject: [Linux-cluster] GFS Performance Problem / coherency between GFS
 access speed, and network bandwidth
Message-ID: <46EEA899.5060808@forschungsgruppe.de>

Thanks for the hints.

No, I do not mount with -o noatime because some of our applications need 
atime. Further this would only speed up reading, not writing.

I also tried the http://sourceware.org/cluster/faq.html#gfs_tuning 
hints. -o noquota gives some aditional performance but only a few percent.

Supprisingly the 1000BaseTX network's bandwidth is not totally used at 
all, just a small percentage.
Can the effect depend on network latency?


From chris at cmiware.com  Mon Sep 17 18:54:06 2007
From: chris at cmiware.com (Chris Harms)
Date: Mon, 17 Sep 2007 13:54:06 -0500
Subject: [Linux-cluster] Cluster Suite 5.1 Beta 2?
Message-ID: <46EECD4E.8010008@cmiware.com>

Is there an  ETA on any new RPMs for 5.1 or 5.1 Beta versions?

Thanks,
Chris


From chris at cmiware.com  Mon Sep 17 22:50:38 2007
From: chris at cmiware.com (Chris Harms)
Date: Mon, 17 Sep 2007 17:50:38 -0500
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
Message-ID: <46EF04BE.5040104@cmiware.com>

Is there an easy way to disable GFS and related kernel modules if one 
does not need GFS?  We are running the 5.1 Beta 1 version of the cluster 
and had a mysterious crash of the cluster suite.  There were issues with 
the GFS and dlm modules.  The kernel panicked on shutdown.

Thanks,
Chris


From Alain.Moulle at bull.net  Tue Sep 18 09:25:30 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 18 Sep 2007 11:25:30 +0200
Subject: [Linux-cluster] CS4 U5 / advised quorumd values ?
Message-ID: <46EF998A.1040808@bull.net>

Hi

First time I will try the quorum disk functionnality ... which values
are recommended for quorumd parameters for a two nodes cluster ?

Is this correct ?
   <quorumd interval="2" tko="10" votes="3" log_level="9" log_facility="local4"
status_file="/tmp/qdisk_status" min_score="3" label="CS4QUORUMDISK">
   </quorumd>

Thanks
Regards
Alain Moull?


From maciej.bogucki at artegence.com  Tue Sep 18 11:58:18 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 18 Sep 2007 13:58:18 +0200
Subject: [Linux-cluster] SCSI-3 reservation on RHEL 5.0 native cluster
In-Reply-To: <C776378855970A4DADE4A476447F6391DF0198@NAMAIL3.ad.lsil.com>
References: <C776378855970A4DADE4A476447F6391DF0198@NAMAIL3.ad.lsil.com>
Message-ID: <46EFBD5A.5030702@artegence.com>

Sadek, Abdel napisa?(a):
> Hello;
> we need to understand how it works in order for our developers to make it work with our own multipath driver.

Hello,

scsi_reserve and fence_scsi do not work with multipath environments.
scsi_reserve do registration and reservation at startup, and fence_scsi
do unregister.

Here is some information form conversation with Ryan O'Hara - person who
wrote fence_scsi and scsi_reserve scripts.

--cut---
There are a few reasons for this. First, it is possible the creating
registrations/reservations at startup and unregistering at fence time
could ping-pong the I/O paths. That is assuming that I/O down either
path will automatically switch the active I/O path. So at best it would
work with only active/active controllers. Second, discovering the I/O
paths in a multipath environment does not work. All devices (I/O paths
in this case) must be registered, so each node must create a
registration for each device (I/O path) using its key. This also does
not work. Finally, fencing is unreliable with scsi reservations in a
multipath environment. For example, suppose an I/O path fails for one
node. Assume that node is selected to fence another node. It will need
to unregister the key for the faulty node for all I/O paths. If the node
responsible for performing fencing does not have all I/O paths
available, it will not be able to unregister the key for the faulty
node. This is very problematic.

These are a few reasons why you cannot use scsi reservations as a fence
method in a multipath environment.

---cut---

For my opinion You have two choices:
1. Change multipath driver to work correctly with scsi fencing
2. Change firmware in You storage where there it only check of host ID.
In this situation WWN of HBA isn't checked and You can connect all of
You pathes.

Please give me also information which multipath driver do You use.

Hope this makes sense.

Best Regards
Maciej Bogucki


From maciej.bogucki at artegence.com  Tue Sep 18 12:54:10 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 18 Sep 2007 14:54:10 +0200
Subject: [Linux-cluster] Multiple TCP/IP adresses for cman communication
In-Reply-To: <23A272DA-6127-46B0-8A59-3498D3245BC6@equation.fr>
References: <23A272DA-6127-46B0-8A59-3498D3245BC6@equation.fr>
Message-ID: <46EFCA72.5060708@artegence.com>

Alain Richard napisa?(a):
> Hi,
> 
> I am currently using a pretty common setup, using a two or more nodes
> cluster,  one or more iSCSI SANs, at least 3 gigabyte interfaces on each
> nodes (one for cluter member cman communication, and 2 for iSCSI
> communication) :
> 
> 
>     switch 1
>     |       |
> node 1      node 2
>  |   |      |   |
>  |   Switch 2   |
>  |     |        |
>  |    SAN       |
>  |     |        |
>  ???Switch 3----|
> 
> On each switches, I use a separate network :
> 
> 10.10.10.0/24 for switch 1, cman communication and for communicating
> with the rest of the world,
> 10.10.20.0/24 for switch 2 and first port of the SAN and nodes,
> 10.10.30.0/24 for switch 3 and second port of the SAN and nodes.
> 
> My SAN luns are multipathed on the nodes.
> 
> This setup works very well for me and I achieve both redundancy and load
> balancing for access from nodes to SAN.
> 
> I am using Centos 5 (RHEL 5) on the nodes and cman use the default
> configuration of using multicast on the switch 1 network.
> 
> I wonder if it is possible to make cman use the two other interfaces as
> well for its communications between cluster nodes ?


> 
> Doing so, I will be able to support the fail of any of the 3 switches
> without having the cluster members fenced in any way.

Yes, it is possible to do . Here is the answer
https://open.datacore.ch/DCwiki.open/Wiki.jsp?page=GFS.usage.txt

Best Regards
Maciej Bogucki


From maciej.bogucki at artegence.com  Tue Sep 18 13:11:01 2007
From: maciej.bogucki at artegence.com (Maciej Bogucki)
Date: Tue, 18 Sep 2007 15:11:01 +0200
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
In-Reply-To: <46EF04BE.5040104@cmiware.com>
References: <46EF04BE.5040104@cmiware.com>
Message-ID: <46EFCE65.9070003@artegence.com>

Chris Harms napisa?(a):
> Is there an easy way to disable GFS and related kernel modules if one
> does not need GFS?  We are running the 5.1 Beta 1 version of the cluster
> and had a mysterious crash of the cluster suite.  There were issues with
> the GFS and dlm modules.  The kernel panicked on shutdown.
> 

rpm -e GFS-kernel-smp GFS cman  dlm cman-devel dlm-devel

Best Regards
Maciej Bogucki


From kanderso at redhat.com  Tue Sep 18 13:53:28 2007
From: kanderso at redhat.com (Kevin Anderson)
Date: Tue, 18 Sep 2007 08:53:28 -0500
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
In-Reply-To: <46EF04BE.5040104@cmiware.com>
References: <46EF04BE.5040104@cmiware.com>
Message-ID: <1190123608.2719.0.camel@dhcp80-204.msp.redhat.com>

On Mon, 2007-09-17 at 17:50 -0500, Chris Harms wrote:
> Is there an easy way to disable GFS and related kernel modules if one 
> does not need GFS?  We are running the 5.1 Beta 1 version of the cluster 
> and had a mysterious crash of the cluster suite.  There were issues with 
> the GFS and dlm modules.  The kernel panicked on shutdown.
> 
Do you have any details on the panic?

Kevin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070918/25b3ac45/attachment.htm>

From chris at cmiware.com  Tue Sep 18 14:34:45 2007
From: chris at cmiware.com (Chris Harms)
Date: Tue, 18 Sep 2007 09:34:45 -0500
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
In-Reply-To: <1190123608.2719.0.camel@dhcp80-204.msp.redhat.com>
References: <46EF04BE.5040104@cmiware.com>
	<1190123608.2719.0.camel@dhcp80-204.msp.redhat.com>
Message-ID: <46EFE205.9010409@cmiware.com>

It said something about an out of memory condition.   This was logged 
just prior to where it would have panicked:

groupd[9639]: found uncontrolled kernel object rgmanager in /sys/kernel/dlm
groupd[9639]: local node must be reset to clear 1 uncontrolled instances 
of gfs and/or dlm
openais[9625]: [CMAN ] cman killed by node 1 because we were killed by 
cman_tool or other application
fenced[9647]: cman_init error 0 111
dlm_controld[9653]: cman_init error 0 111
gfs_controld[9659]: cman_init error 111

There were 2 runaway processes related to GFS / DLM before I tried to 
shut it down.  We had not encountered any issues like this until now.  
The only changes to our setup were a superficial change to some cluster 
services, and an upgrade of the DRBD kernel module.

Kevin Anderson wrote:
> On Mon, 2007-09-17 at 17:50 -0500, Chris Harms wrote:
>> Is there an easy way to disable GFS and related kernel modules if one 
>> does not need GFS?  We are running the 5.1 Beta 1 version of the cluster 
>> and had a mysterious crash of the cluster suite.  There were issues with 
>> the GFS and dlm modules.  The kernel panicked on shutdown.
>>
>>     
> Do you have any details on the panic?
>
> Kevin
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From teigland at redhat.com  Tue Sep 18 14:38:40 2007
From: teigland at redhat.com (David Teigland)
Date: Tue, 18 Sep 2007 09:38:40 -0500
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
In-Reply-To: <46EFE205.9010409@cmiware.com>
References: <46EF04BE.5040104@cmiware.com>
	<1190123608.2719.0.camel@dhcp80-204.msp.redhat.com>
	<46EFE205.9010409@cmiware.com>
Message-ID: <20070918143840.GB15032@redhat.com>

On Tue, Sep 18, 2007 at 09:34:45AM -0500, Chris Harms wrote:
> It said something about an out of memory condition.   This was logged 
> just prior to where it would have panicked:
> 
> groupd[9639]: found uncontrolled kernel object rgmanager in /sys/kernel/dlm
> groupd[9639]: local node must be reset to clear 1 uncontrolled instances 
> of gfs and/or dlm
> openais[9625]: [CMAN ] cman killed by node 1 because we were killed by 
> cman_tool or other application
> fenced[9647]: cman_init error 0 111
> dlm_controld[9653]: cman_init error 0 111
> gfs_controld[9659]: cman_init error 111

These messages mean that the userspace cluster software all exited for
some unknown reason, leaving behind a dlm lockspace (in the kernel) from
rgmanager.  At this point, you needed to reboot the machine, but instead
you restarted the userspace cluster software, which rightly complained
that you hadn't rebooted the machine, and refused to do operate.

This probably doesn't help, though, because it doesn't tell us anything
about the original problem(s) you had.  The original problem(s) probably
caused the cluster software to exit the first time, and was probably
related to the runaway processes.


> There were 2 runaway processes related to GFS / DLM before I tried to 
> shut it down.  We had not encountered any issues like this until now.  
> The only changes to our setup were a superficial change to some cluster 
> services, and an upgrade of the DRBD kernel module.
> 
> Kevin Anderson wrote:
> >On Mon, 2007-09-17 at 17:50 -0500, Chris Harms wrote:
> >>Is there an easy way to disable GFS and related kernel modules if one 
> >>does not need GFS?  We are running the 5.1 Beta 1 version of the cluster 
> >>and had a mysterious crash of the cluster suite.  There were issues with 
> >>the GFS and dlm modules.  The kernel panicked on shutdown.
> >>
> >>    
> >Do you have any details on the panic?
> >
> >Kevin
> >------------------------------------------------------------------------
> >
> >--
> >Linux-cluster mailing list
> >Linux-cluster at redhat.com
> >https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From chris at cmiware.com  Tue Sep 18 15:00:06 2007
From: chris at cmiware.com (Chris Harms)
Date: Tue, 18 Sep 2007 10:00:06 -0500
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
In-Reply-To: <20070918143840.GB15032@redhat.com>
References: <46EF04BE.5040104@cmiware.com>
	<1190123608.2719.0.camel@dhcp80-204.msp.redhat.com>
	<46EFE205.9010409@cmiware.com> <20070918143840.GB15032@redhat.com>
Message-ID: <46EFE7F6.9010801@cmiware.com>

The only other thing I can think of is that I started NTPd and there was 
likely a big time adjustment as it had not been running.

Sep 17 10:27:32 ntpd[1118]: synchronized to 206.222.28.90, stratum 2
Sep 17 15:53:38 ntpd[1118]: time reset +18217.299628 s
Sep 17 15:53:38 ntpd[1118]: kernel time sync enabled 0001
Sep 17 15:53:38 openais[4457]: [TOTEM] The token was lost in the 
OPERATIONAL state.
Sep 17 15:53:38 dlm_controld[4480]: cluster is down, exiting
Sep 17 15:53:38 gfs_controld[4486]: cluster is down, exiting
Sep 17 15:53:38 fenced[4474]: cluster is down, exiting
Sep 17 15:53:38 kernel: dlm: closing connection to node 1
Sep 17 15:53:48 named[8732]: *** POKED TIMER ***
Sep 17 15:53:48 named[8733]: *** POKED TIMER ***
Sep 17 15:54:04 ccsd[4437]: Unable to connect to cluster infrastructure 
after 30 seconds.


David Teigland wrote:
> On Tue, Sep 18, 2007 at 09:34:45AM -0500, Chris Harms wrote:
>   
>> It said something about an out of memory condition.   This was logged 
>> just prior to where it would have panicked:
>>
>> groupd[9639]: found uncontrolled kernel object rgmanager in /sys/kernel/dlm
>> groupd[9639]: local node must be reset to clear 1 uncontrolled instances 
>> of gfs and/or dlm
>> openais[9625]: [CMAN ] cman killed by node 1 because we were killed by 
>> cman_tool or other application
>> fenced[9647]: cman_init error 0 111
>> dlm_controld[9653]: cman_init error 0 111
>> gfs_controld[9659]: cman_init error 111
>>     
>
> These messages mean that the userspace cluster software all exited for
> some unknown reason, leaving behind a dlm lockspace (in the kernel) from
> rgmanager.  At this point, you needed to reboot the machine, but instead
> you restarted the userspace cluster software, which rightly complained
> that you hadn't rebooted the machine, and refused to do operate.
>
> This probably doesn't help, though, because it doesn't tell us anything
> about the original problem(s) you had.  The original problem(s) probably
> caused the cluster software to exit the first time, and was probably
> related to the runaway processes.
>
>
>   
>> There were 2 runaway processes related to GFS / DLM before I tried to 
>> shut it down.  We had not encountered any issues like this until now.  
>> The only changes to our setup were a superficial change to some cluster 
>> services, and an upgrade of the DRBD kernel module.
>>
>> Kevin Anderson wrote:
>>     
>>> On Mon, 2007-09-17 at 17:50 -0500, Chris Harms wrote:
>>>       
>>>> Is there an easy way to disable GFS and related kernel modules if one 
>>>> does not need GFS?  We are running the 5.1 Beta 1 version of the cluster 
>>>> and had a mysterious crash of the cluster suite.  There were issues with 
>>>> the GFS and dlm modules.  The kernel panicked on shutdown.
>>>>
>>>>    
>>>>         
>>> Do you have any details on the panic?
>>>
>>> Kevin
>>> ------------------------------------------------------------------------
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>       
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>     


From sdake at redhat.com  Tue Sep 18 15:35:02 2007
From: sdake at redhat.com (Steven Dake)
Date: Tue, 18 Sep 2007 08:35:02 -0700
Subject: [Linux-cluster] disabling DLM and GFS kernel modules
In-Reply-To: <46EFE7F6.9010801@cmiware.com>
References: <46EF04BE.5040104@cmiware.com>
	<1190123608.2719.0.camel@dhcp80-204.msp.redhat.com>
	<46EFE205.9010409@cmiware.com> <20070918143840.GB15032@redhat.com>
	<46EFE7F6.9010801@cmiware.com>
Message-ID: <1190129704.19471.23.camel@balance>

The nodes were killed as a result of a condition called "AISONLY"
resulting from the ntp timeofday change which caused a change of the
time of greater then 10 seconds.

Totem uses gettimeofday() to determine the time of day difference
between two time periods.  If the time period is greater then 10
seconds, Totem will indicate a network connectivity lost, reform a
cluster, and the higher level software will detect the aisonly flag on
that node.

Then that node must be rebooted to be operational again.

Ideally gettimeofday() shouldn't be used to detect the difference of two
time periods, but instead itimers or something else should be used.  But
for the moment gettimeofday() is used which doesn't notify the user of
ntp adjustments.

Regards
-steve


On Tue, 2007-09-18 at 10:00 -0500, Chris Harms wrote:
> The only other thing I can think of is that I started NTPd and there was 
> likely a big time adjustment as it had not been running.
> 
> Sep 17 10:27:32 ntpd[1118]: synchronized to 206.222.28.90, stratum 2
> Sep 17 15:53:38 ntpd[1118]: time reset +18217.299628 s
> Sep 17 15:53:38 ntpd[1118]: kernel time sync enabled 0001
> Sep 17 15:53:38 openais[4457]: [TOTEM] The token was lost in the 
> OPERATIONAL state.
> Sep 17 15:53:38 dlm_controld[4480]: cluster is down, exiting
> Sep 17 15:53:38 gfs_controld[4486]: cluster is down, exiting
> Sep 17 15:53:38 fenced[4474]: cluster is down, exiting
> Sep 17 15:53:38 kernel: dlm: closing connection to node 1
> Sep 17 15:53:48 named[8732]: *** POKED TIMER ***
> Sep 17 15:53:48 named[8733]: *** POKED TIMER ***
> Sep 17 15:54:04 ccsd[4437]: Unable to connect to cluster infrastructure 
> after 30 seconds.
> 
> 
> 
> David Teigland wrote:
> > On Tue, Sep 18, 2007 at 09:34:45AM -0500, Chris Harms wrote:
> >   
> >> It said something about an out of memory condition.   This was logged 
> >> just prior to where it would have panicked:
> >>
> >> groupd[9639]: found uncontrolled kernel object rgmanager in /sys/kernel/dlm
> >> groupd[9639]: local node must be reset to clear 1 uncontrolled instances 
> >> of gfs and/or dlm
> >> openais[9625]: [CMAN ] cman killed by node 1 because we were killed by 
> >> cman_tool or other application
> >> fenced[9647]: cman_init error 0 111
> >> dlm_controld[9653]: cman_init error 0 111
> >> gfs_controld[9659]: cman_init error 111
> >>     
> >
> > These messages mean that the userspace cluster software all exited for
> > some unknown reason, leaving behind a dlm lockspace (in the kernel) from
> > rgmanager.  At this point, you needed to reboot the machine, but instead
> > you restarted the userspace cluster software, which rightly complained
> > that you hadn't rebooted the machine, and refused to do operate.
> >
> > This probably doesn't help, though, because it doesn't tell us anything
> > about the original problem(s) you had.  The original problem(s) probably
> > caused the cluster software to exit the first time, and was probably
> > related to the runaway processes.
> >
> >
> >   
> >> There were 2 runaway processes related to GFS / DLM before I tried to 
> >> shut it down.  We had not encountered any issues like this until now.  
> >> The only changes to our setup were a superficial change to some cluster 
> >> services, and an upgrade of the DRBD kernel module.
> >>
> >> Kevin Anderson wrote:
> >>     
> >>> On Mon, 2007-09-17 at 17:50 -0500, Chris Harms wrote:
> >>>       
> >>>> Is there an easy way to disable GFS and related kernel modules if one 
> >>>> does not need GFS?  We are running the 5.1 Beta 1 version of the cluster 
> >>>> and had a mysterious crash of the cluster suite.  There were issues with 
> >>>> the GFS and dlm modules.  The kernel panicked on shutdown.
> >>>>
> >>>>    
> >>>>         
> >>> Do you have any details on the panic?
> >>>
> >>> Kevin
> >>> ------------------------------------------------------------------------
> >>>
> >>> --
> >>> Linux-cluster mailing list
> >>> Linux-cluster at redhat.com
> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>>       
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >>     
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From nlam87346 at library.usyd.edu.au  Wed Sep 19 08:15:08 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Wed, 19 Sep 2007 18:15:08 +1000
Subject: [Linux-cluster] TTL on multicast packets set to 1 by default?
Message-ID: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>

Some weeks ago I posted to this list reporting problems with my 2-node
RHEL5 cluster where I had each node on a different VLAN, connected by a
layer 3 router.

The problem was that the OpenAIS (cluster communications) packets were
not reaching the other node, so all sorts of annoying things were
happening.

Well, I've just had someone with extensive multicast experience help me
with some troubleshooting. We've discovered that the problem appears to
be that the packets are going out with a TTL of 1, which means the first
router to receive it will drop it without forwarding.

Here's a piece of tcpdump -v

04:47:23.167506 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
length 74
04:47:23.336194 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
length 74
04:47:23.538871 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
UDP (17), length 146) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
length 118
04:47:23.658161 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
length 74
04:47:23.826268 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
length 74
04:47:24.026863 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
UDP (17), length 146) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
length 118


So, I guess the question is, what is the correct method to set the TTL
to be a bit more reasonable, say 128?

Regards,

Nik Lam


From sdake at redhat.com  Wed Sep 19 08:25:42 2007
From: sdake at redhat.com (Steven Dake)
Date: Wed, 19 Sep 2007 01:25:42 -0700
Subject: [Linux-cluster] TTL on multicast packets set to 1 by default?
In-Reply-To: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>
References: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>
Message-ID: <1190190342.30194.15.camel@balance>

My apologies I must have missed that email.

The code in openais is untested in a routed environment because I dont
have any routed environments to test with.  This is why TTL is not
configurable in the current openais or tested with any values greater
then 1.  I can build you an RPM with ttl set to some greater value to
test with if you like.

Regards
-steve
 
On Wed, 2007-09-19 at 18:15 +1000, Nikolas Lam wrote:
> Some weeks ago I posted to this list reporting problems with my 2-node
> RHEL5 cluster where I had each node on a different VLAN, connected by a
> layer 3 router.
> 
> The problem was that the OpenAIS (cluster communications) packets were
> not reaching the other node, so all sorts of annoying things were
> happening.
> 
> Well, I've just had someone with extensive multicast experience help me
> with some troubleshooting. We've discovered that the problem appears to
> be that the packets are going out with a TTL of 1, which means the first
> router to receive it will drop it without forwarding.
> 
> Here's a piece of tcpdump -v
> 
> 04:47:23.167506 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> length 74
> 04:47:23.336194 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> length 74
> 04:47:23.538871 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> UDP (17), length 146) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> length 118
> 04:47:23.658161 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> length 74
> 04:47:23.826268 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> length 74
> 04:47:24.026863 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> UDP (17), length 146) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> length 118
> 
> 
> So, I guess the question is, what is the correct method to set the TTL
> to be a bit more reasonable, say 128?
> 
> Regards,
> 
> Nik Lam
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From swhiteho at redhat.com  Wed Sep 19 10:26:26 2007
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 19 Sep 2007 11:26:26 +0100
Subject: [Linux-cluster] TTL on multicast packets set to 1 by default?
In-Reply-To: <1190190342.30194.15.camel@balance>
References: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>
	<1190190342.30194.15.camel@balance>
Message-ID: <1190197587.3413.7.camel@menhir.chygwyn.com>

Hi,

On Wed, 2007-09-19 at 01:25 -0700, Steven Dake wrote:
> My apologies I must have missed that email.
> 
> The code in openais is untested in a routed environment because I dont
> have any routed environments to test with.  This is why TTL is not
> configurable in the current openais or tested with any values greater
> then 1.  I can build you an RPM with ttl set to some greater value to
> test with if you like.
> 
> Regards
> -steve

The normal settings for multicast ttl are:
0 - node local
1 - link local
<32 - site local (so I guess a setting of 31 is reasonable here)
>128 - global

I also found a ref with the following extra values which I didn't
remember:
33-64 region local
65-128 continent local

I'm not sure if those last couple are official or not, 

Steve.


>  
> On Wed, 2007-09-19 at 18:15 +1000, Nikolas Lam wrote:
> > Some weeks ago I posted to this list reporting problems with my 2-node
> > RHEL5 cluster where I had each node on a different VLAN, connected by a
> > layer 3 router.
> > 
> > The problem was that the OpenAIS (cluster communications) packets were
> > not reaching the other node, so all sorts of annoying things were
> > happening.
> > 
> > Well, I've just had someone with extensive multicast experience help me
> > with some troubleshooting. We've discovered that the problem appears to
> > be that the packets are going out with a TTL of 1, which means the first
> > router to receive it will drop it without forwarding.
> > 
> > Here's a piece of tcpdump -v
> > 
> > 04:47:23.167506 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> > UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> > length 74
> > 04:47:23.336194 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> > UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> > length 74
> > 04:47:23.538871 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> > UDP (17), length 146) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> > length 118
> > 04:47:23.658161 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> > UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> > length 74
> > 04:47:23.826268 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> > UDP (17), length 102) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> > length 74
> > 04:47:24.026863 IP (tos 0x0, ttl 1, id 0, offset 0, flags [DF], proto
> > UDP (17), length 146) 172.16.99.50.5149 > 239.224.72.11.5405: UDP,
> > length 118
> > 
> > 
> > So, I guess the question is, what is the correct method to set the TTL
> > to be a bit more reasonable, say 128?
> > 
> > Regards,
> > 
> > Nik Lam
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From Alain.Moulle at bull.net  Wed Sep 19 09:50:29 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 19 Sep 2007 11:50:29 +0200
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for two nodes
	cluster ? (contd.)
Message-ID: <46F0F0E5.7050200@bull.net>

Hi

As said before, I'm trying for the first time to
add a quorum disk on my two nodes cluster.
Finally, I've set parameters as below :

        <quorumd interval="1" tko="10" votes="1" log_level="9"
log_facility="local4" status_file="/tmp/qdisk_status" label="CS4QUORUMDISK">
        </quorumd>
and
	<clusternode name="node1" votes="1">
and
	<clusternode name="node2" votes="1">
and
        <cman expected_votes="3" two_node="0"/>

With these parameters values for my two nodes cluster, I have
to launch the Cluster Suite on both nodes, it can't be
launched on one node only because cman detects "not quorate".

Is there a way to avoid this ? to be able to launch CS4 only
on one side ?
(I guess expected_votes="2" but would this value lead to other problems ... ?)

By the way, how can I be shure that quorum disk is really used , where can
I see traces ? (I see nothing in syslog)

Thanks a lot for your help
Alain Moull?


From nlam87346 at library.usyd.edu.au  Wed Sep 19 09:59:33 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Wed, 19 Sep 2007 19:59:33 +1000
Subject: [Linux-cluster] TTL on multicast packets set to 1 by default?
In-Reply-To: <1190197587.3413.7.camel@menhir.chygwyn.com>
References: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>
	<1190190342.30194.15.camel@balance>
	<1190197587.3413.7.camel@menhir.chygwyn.com>
Message-ID: <1190195973.7337.106.camel@lits17.library.usyd.edu.au>

On Wed, 2007-09-19 at 11:26 +0100, Steven Whitehouse wrote:
> Hi,
> 
> On Wed, 2007-09-19 at 01:25 -0700, Steven Dake wrote:
> > My apologies I must have missed that email.
> > 
> > The code in openais is untested in a routed environment because I dont
> > have any routed environments to test with.  This is why TTL is not
> > configurable in the current openais or tested with any values greater
> > then 1.  I can build you an RPM with ttl set to some greater value to
> > test with if you like.
> > 
> > Regards
> > -steve
> 
> The normal settings for multicast ttl are:
> 0 - node local
> 1 - link local
> <32 - site local (so I guess a setting of 31 is reasonable here)
> >128 - global
> 
> I also found a ref with the following extra values which I didn't
> remember:
> 33-64 region local
> 65-128 continent local
> 
> I'm not sure if those last couple are official or not, 
> 
> Steve.

Thanks Steve(s),

We're a pretty big site (a university), so it's possible (but unlikely)
that a legitimate round-about route could consume up to around approx 30
hops. We're using a multicast range which is not being routed outside
the university, so it's probably fine to set it to 128.


Nik


From carlopmart at gmail.com  Wed Sep 19 12:51:46 2007
From: carlopmart at gmail.com (carlopmart)
Date: Wed, 19 Sep 2007 14:51:46 +0200
Subject: [Linux-cluster] Starting up two of three nodes that compose a
	cluster
Message-ID: <46F11B62.2090007@gmail.com>

Hi all,

  I have setup a rhel5 based cluster with three nodes. Sometimes i need 
to start only two of this three nodes, but cluster services that i 
configured doesn't starts (fenced fail). Is it not possible to start up 
only two nodes on a three node cluster?? Maybe I need to adjust votes 
param to two instead of three??

Thanks.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From Alain.Moulle at bull.net  Wed Sep 19 13:17:50 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 19 Sep 2007 15:17:50 +0200
Subject: [Linux-cluster] quorum disk with CS4 U4
Message-ID: <46F1217E.9040401@bull.net>

Hi

I don't remember who recommended a few weeks ago to have the
U5 to use quorum disk but ... which main problems will I have
to use quorum disk with CS4 U4 ?

Thanks
Alain Moull?


From nlam87346 at library.usyd.edu.au  Thu Sep 20 08:49:34 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Thu, 20 Sep 2007 18:49:34 +1000
Subject: [Linux-cluster] TTL on multicast packets set to 1 by default?
In-Reply-To: <1190190342.30194.15.camel@balance>
References: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>
	<1190190342.30194.15.camel@balance>
Message-ID: <1190278174.3624.46.camel@lits17.library.usyd.edu.au>

On Wed, 2007-09-19 at 01:25 -0700, Steven Dake wrote:
> My apologies I must have missed that email.
> 
> The code in openais is untested in a routed environment because I dont
> have any routed environments to test with.  This is why TTL is not
> configurable in the current openais or tested with any values greater
> then 1.  I can build you an RPM with ttl set to some greater value to
> test with if you like.
> 
> Regards
> -steve
>  

In case anyone else is interested, we're trialling using iptables to
increase the TTL on those multicast packets so that they'll get routed.

Nik Lam


From nlam87346 at library.usyd.edu.au  Thu Sep 20 08:59:23 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Thu, 20 Sep 2007 18:59:23 +1000
Subject: [Linux-cluster] TTL on multicast packets set to 1 by default?
In-Reply-To: <1190190342.30194.15.camel@balance>
References: <1190189708.7337.26.camel@lits17.library.usyd.edu.au>
	<1190190342.30194.15.camel@balance>
Message-ID: <1190278763.3624.50.camel@lits17.library.usyd.edu.au>

On Wed, 2007-09-19 at 01:25 -0700, Steven Dake wrote:
> My apologies I must have missed that email.
> 
> The code in openais is untested in a routed environment because I dont
> have any routed environments to test with.  This is why TTL is not
> configurable in the current openais or tested with any values greater
> then 1.  I can build you an RPM with ttl set to some greater value to
> test with if you like.
> 
> Regards
> -steve
>  

In case anyone else is interested, we're trialling using iptables to
increase the TTL on those multicast packets so that they'll get routed.

(Sorry I meant to add this in my previous email)

Here's the rule:

iptables -t mangle -A OUTPUT -d <multicast address> -j TTL --ttl-set 128

Where <multicast address> is replaced by whatever your cman is using (you can
get an idea of this using netstat -gn).

Cheers,

Nik Lam


From jdozarchuk at babcock.com  Thu Sep 20 19:45:29 2007
From: jdozarchuk at babcock.com (Ozarchuk, John D)
Date: Thu, 20 Sep 2007 15:45:29 -0400
Subject: [Linux-cluster] Two-node setup Help (RHES5)
Message-ID: <EE9F27C20409C44AA031C769B3DBCE4102748342@barbpo3.bwes.net>

Hi all,

 
I am trying to configure a two-node cluster that will simple be NFS,
FTP, and SMB fileshares.  I cannot see to get each node to join the
cluster.  I am fencing via the ilo2 ports, and that portion is working.
However, when I try to bring up the services on both nodes they say
"fence failed <hostname>".  

 
I am not sure why the nodes are trying to fence each other, probably
because neither of them can join.  

 
I can resolve hostnames back and forth to each node, they can ping one
another, and they are both on the same VLAN.  

 
Am I missing something?  This was a breeze in RHEL4.  

 
Here is my cluster.conf...

 
<?xml version="1.0"?>

<cluster alias="plm_test" config_version="9" name="plm_test">

        <fence_daemon post_fail_delay="0" post_join_delay="60"/>

        <clusternodes>

                <clusternode name="bplmft12" nodeid="1" votes="1">

                        <fence>

                                <method name="1">

                                        <device name="ilo-bplmft12"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="bplmft11" nodeid="2" votes="1">

                        <fence>

                                <method name="1">

                                        <device name="ilo-bplmft11"/>

                                </method>

                        </fence>

                </clusternode>

        </clusternodes>

        <cman expected_votes="1" two_node="1"/>

        <fencedevices>

                <fencedevice agent="fence_ilo" hostname="ilo-bplmft12"
login="redhat_cluster_user" name="ilo-bplmft12" passwd="redhatcluster"/>

                <fencedevice agent="fence_ilo" hostname="ilo-bplmft11"
login="redhat_cluster_user" name="ilo-bplmft11" passwd="redhatcluster"/>

        </fencedevices>

        <rm>

                <failoverdomains/>

                <resources/>

        </rm>

</cluster>


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070920/72d176de/attachment.htm>

From chris at cmiware.com  Thu Sep 20 22:45:14 2007
From: chris at cmiware.com (Chris Harms)
Date: Thu, 20 Sep 2007 17:45:14 -0500
Subject: [Linux-cluster] Virtual IPs on multiple subnets
Message-ID: <46F2F7FA.9060409@cmiware.com>

Hi,

Based on the IP addresses allocated by our ISP, we will need the cluster 
to manage virtual IP addresses on multiple (likely 3) different subnets, 
one is a private block and 2 public subnets. I would like the cluster 
nodes to have 192.168.xx.yy addresses for cluster traffic, and manage 
the public virtual IPs separately.  I would prefer to not assign public 
static addresses, but can do so if necessary.

To test this setup, I have added an IP resource and created a simple 
service containing only the IP resource.  The service "starts" and the 
cluster thinks it is available, but the IP does not appear in the output 
of the ip command nor is it accessible from other nodes.  If I manually 
add the address via the ip command, it works fine.  It appears the ip.sh 
script does some voodoo to extract / set various information for the 
address, adapter, etc so maybe there is something amiss there.

Any advice is appreciated.

Thanks,
Chris


From nlam87346 at library.usyd.edu.au  Thu Sep 20 23:58:01 2007
From: nlam87346 at library.usyd.edu.au (Nikolas Lam)
Date: Fri, 21 Sep 2007 09:58:01 +1000
Subject: [Linux-cluster] Two-node setup Help (RHES5)
In-Reply-To: <EE9F27C20409C44AA031C769B3DBCE4102748342@barbpo3.bwes.net>
References: <EE9F27C20409C44AA031C769B3DBCE4102748342@barbpo3.bwes.net>
Message-ID: <1190332681.9054.9.camel@lits17.library.usyd.edu.au>

On Thu, 2007-09-20 at 15:45 -0400, Ozarchuk, John D wrote:
> Hi all,
> 
>  
> 
> I am trying to configure a two-node cluster that will simple be NFS,
> FTP, and SMB fileshares.  I cannot see to get each node to join the
> cluster.  I am fencing via the ilo2 ports, and that portion is
> working.  However, when I try to bring up the services on both nodes
> they say ?fence failed <hostname>?.  
> 
>  
> 
> I am not sure why the nodes are trying to fence each other, probably
> because neither of them can join.  
> 
>  
> 
> I can resolve hostnames back and forth to each node, they can ping one
> another, and they are both on the same VLAN.  


Maybe it's overkill, but it's pretty straight-forward to check whether
each of your nodes is receiving multicast cluster packets from the other
node using tcpdump. Assuming you've got everything on eth0 and your
other node's IP address is 192.168.1.100 you would use (with root privs)

tcpdump -v -nn -i eth0 ip multicast and src 192.168.1.100

You should see some packets from your other node sent to the default
cluster multicast group IP address, which is probably 225.0.0.11.

Nik


From sdake at redhat.com  Fri Sep 21 00:11:49 2007
From: sdake at redhat.com (Steven Dake)
Date: Thu, 20 Sep 2007 17:11:49 -0700
Subject: [Linux-cluster] Two-node setup Help (RHES5)
In-Reply-To: <EE9F27C20409C44AA031C769B3DBCE4102748342@barbpo3.bwes.net>
References: <EE9F27C20409C44AA031C769B3DBCE4102748342@barbpo3.bwes.net>
Message-ID: <1190333509.6603.0.camel@balance>

Would you send the output of /var/log/messages please?

Also which brand of switch are you using.  If Cisco what type of
firewall device.

Thanks
-steve

On Thu, 2007-09-20 at 15:45 -0400, Ozarchuk, John D wrote:
> Hi all,
> 
>  
> 
> I am trying to configure a two-node cluster that will simple be NFS,
> FTP, and SMB fileshares.  I cannot see to get each node to join the
> cluster.  I am fencing via the ilo2 ports, and that portion is
> working.  However, when I try to bring up the services on both nodes
> they say ?fence failed <hostname>?.  
> 
>  
> 
> I am not sure why the nodes are trying to fence each other, probably
> because neither of them can join.  
> 
>  
> 
> I can resolve hostnames back and forth to each node, they can ping one
> another, and they are both on the same VLAN.  
> 
>  
> 
> Am I missing something?  This was a breeze in RHEL4.  
> 
>  
> 
> Here is my cluster.conf?
> 
>  
> 
> <?xml version="1.0"?>
> 
> <cluster alias="plm_test" config_version="9" name="plm_test">
> 
>         <fence_daemon post_fail_delay="0" post_join_delay="60"/>
> 
>         <clusternodes>
> 
>                 <clusternode name="bplmft12" nodeid="1" votes="1">
> 
>                         <fence>
> 
>                                 <method name="1">
> 
>                                         <device name="ilo-bplmft12"/>
> 
>                                 </method>
> 
>                         </fence>
> 
>                 </clusternode>
> 
>                 <clusternode name="bplmft11" nodeid="2" votes="1">
> 
>                         <fence>
> 
>                                 <method name="1">
> 
>                                         <device name="ilo-bplmft11"/>
> 
>                                 </method>
> 
>                         </fence>
> 
>                 </clusternode>
> 
>         </clusternodes>
> 
>         <cman expected_votes="1" two_node="1"/>
> 
>         <fencedevices>
> 
>                 <fencedevice agent="fence_ilo" hostname="ilo-bplmft12"
> login="redhat_cluster_user" name="ilo-bplmft12"
> passwd="redhatcluster"/>
> 
>                 <fencedevice agent="fence_ilo" hostname="ilo-bplmft11"
> login="redhat_cluster_user" name="ilo-bplmft11"
> passwd="redhatcluster"/>
> 
>         </fencedevices>
> 
>         <rm>
> 
>                 <failoverdomains/>
> 
>                 <resources/>
> 
>         </rm>
> 
> </cluster>
> 
> 
> 
> ______________________________________________________________________
> 
> This message is intended only for the individual or entity to which it
> is addressed and contains information that is proprietary to The
> Babcock & Wilcox Company and/or its affiliates, or may be otherwise
> confidential. If the reader of this message is not the intended
> recipient, or the employee agent responsible for delivering the
> message to the intended recipient, you are hereby notified that any
> dissemination, distribution or copying of this communication is
> strictly prohibited. If you have received this communication in error,
> please notify the sender immediately by return e-mail and delete this
> message from your computer. Thank you.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From jdozarchuk at babcock.com  Fri Sep 21 10:57:29 2007
From: jdozarchuk at babcock.com (Ozarchuk, John D)
Date: Fri, 21 Sep 2007 06:57:29 -0400
Subject: [Linux-cluster] Two-node setup Help (RHES5)
In-Reply-To: <1190332681.9054.9.camel@lits17.library.usyd.edu.au>
References: <EE9F27C20409C44AA031C769B3DBCE4102748342@barbpo3.bwes.net>
	<1190332681.9054.9.camel@lits17.library.usyd.edu.au>
Message-ID: <EE9F27C20409C44AA031C769B3DBCE4102748365@barbpo3.bwes.net>

Nope, no packets between the two nodes.  This tells me that they cannot
communicate.


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nikolas Lam
Sent: Thursday, September 20, 2007 7:58 PM
To: linux clustering
Subject: Re: [Linux-cluster] Two-node setup Help (RHES5)

On Thu, 2007-09-20 at 15:45 -0400, Ozarchuk, John D wrote:
> Hi all,
> 
>  
> 
> I am trying to configure a two-node cluster that will simple be NFS,
> FTP, and SMB fileshares.  I cannot see to get each node to join the
> cluster.  I am fencing via the ilo2 ports, and that portion is
> working.  However, when I try to bring up the services on both nodes
> they say "fence failed <hostname>".  
> 
>  
> 
> I am not sure why the nodes are trying to fence each other, probably
> because neither of them can join.  
> 
>  
> 
> I can resolve hostnames back and forth to each node, they can ping one
> another, and they are both on the same VLAN.  


Maybe it's overkill, but it's pretty straight-forward to check whether
each of your nodes is receiving multicast cluster packets from the other
node using tcpdump. Assuming you've got everything on eth0 and your
other node's IP address is 192.168.1.100 you would use (with root privs)

tcpdump -v -nn -i eth0 ip multicast and src 192.168.1.100

You should see some packets from your other node sent to the default
cluster multicast group IP address, which is probably 225.0.0.11.

Nik


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
-----------------------------------------
This message is intended only for the individual or entity to which
it is addressed and contains information that is proprietary to The
Babcock & Wilcox Company and/or its affiliates, or may be otherwise
confidential.  If the reader of this message is not the intended
recipient, or the employee agent responsible for delivering the
message to the intended recipient, you are hereby notified that any
dissemination, distribution or copying of this communication is
strictly prohibited.  If you have received this communication in
error, please notify the sender immediately by return e-mail and
delete this message from your computer.  Thank you.


From carlopmart at gmail.com  Thu Sep 20 09:40:55 2007
From: carlopmart at gmail.com (carlopmart)
Date: Thu, 20 Sep 2007 11:40:55 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
Message-ID: <46F24027.7040201@gmail.com>

Please, any hints??

-------- Original Message --------
Subject: Starting up two of three nodes that compose a cluster
Date: Wed, 19 Sep 2007 14:51:46 +0200
From: carlopmart <carlopmart at gmail.com>
To: linux clustering <linux-cluster at redhat.com>

Hi all,

  I have setup a rhel5 based cluster with three nodes. Sometimes i need
to start only two of this three nodes, but cluster services that i
configured doesn't starts (fenced fail). Is it not possible to start up
only two nodes on a three node cluster?? Maybe I need to adjust votes
param to two instead of three??

Thanks.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From teigland at redhat.com  Fri Sep 21 14:28:39 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 09:28:39 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F24027.7040201@gmail.com>
References: <46F24027.7040201@gmail.com>
Message-ID: <20070921142839.GA20746@redhat.com>

On Thu, Sep 20, 2007 at 11:40:55AM +0200, carlopmart wrote:
> Please, any hints??
> 
> -------- Original Message --------
> Subject: Starting up two of three nodes that compose a cluster
> Date: Wed, 19 Sep 2007 14:51:46 +0200
> From: carlopmart <carlopmart at gmail.com>
> To: linux clustering <linux-cluster at redhat.com>
> 
> Hi all,
> 
>  I have setup a rhel5 based cluster with three nodes. Sometimes i need
> to start only two of this three nodes, but cluster services that i
> configured doesn't starts (fenced fail). Is it not possible to start up
> only two nodes on a three node cluster?? Maybe I need to adjust votes
> param to two instead of three??

Could you be more specific about what you run, where, what happens,
what messages you see, etc.

Dave


From carlopmart at gmail.com  Fri Sep 21 15:02:18 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 17:02:18 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921142839.GA20746@redhat.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
Message-ID: <46F3DCFA.5010803@gmail.com>

David Teigland wrote:
> On Thu, Sep 20, 2007 at 11:40:55AM +0200, carlopmart wrote:
>> Please, any hints??
>>
>> -------- Original Message --------
>> Subject: Starting up two of three nodes that compose a cluster
>> Date: Wed, 19 Sep 2007 14:51:46 +0200
>> From: carlopmart <carlopmart at gmail.com>
>> To: linux clustering <linux-cluster at redhat.com>
>>
>> Hi all,
>>
>>  I have setup a rhel5 based cluster with three nodes. Sometimes i need
>> to start only two of this three nodes, but cluster services that i
>> configured doesn't starts (fenced fail). Is it not possible to start up
>> only two nodes on a three node cluster?? Maybe I need to adjust votes
>> param to two instead of three??
> 
> Could you be more specific about what you run, where, what happens,
> what messages you see, etc.
> 
> Dave
> 
> 
Yes,

  First, I attached my cluster.conf. When /etc/init.d/cman starts, 
returns an ok, but when I try to mount my gfs partition returns this error:

[root at haldir cluster]# service mountgfs start
Mounting GFS filesystems:  /sbin/mount.gfs: lock_dlm_join: gfs_controld 
join error: -22
/sbin/mount.gfs: error mounting lockproto lock_dlm
                                                            [FAILED]
[root at haldir cluster]#

And of course any service couldn't start .... And clustat output is:

[root at haldir cluster]# clustat
Member Status: Quorate

   Member Name                        ID   Status
   ------ ----                        ---- ------
   thranduil.hpulabs.org                 1 Online
   haldir.hpulabs.org                    2 Online, Local, rgmanager
   elrond.hpulabs.org                    3 Offline

   Service Name         Owner (Last)                   State
   ------- ----         ----- ------                   -----
   service:rsync-svc    (none)                         stopped
   service:wwwsoft-svc  (none)                         stopped
   service:proxy-svc    (none)                         stopped
   service:mail-svc     (none)                         stopped
[root at haldir cluster]#


P.D: mountgfs it is a simple script that mounts gfs partitions because 
gfs script provided by redhat doesn't works with _netdev param under fstab.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: text/xml
Size: 3491 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070921/648d80fe/attachment.xml>

From teigland at redhat.com  Fri Sep 21 15:14:51 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 10:14:51 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F3DCFA.5010803@gmail.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com>
Message-ID: <20070921151451.GB20746@redhat.com>

On Fri, Sep 21, 2007 at 05:02:18PM +0200, carlopmart wrote:
> David Teigland wrote:
> >On Thu, Sep 20, 2007 at 11:40:55AM +0200, carlopmart wrote:
> >>Please, any hints??
> >>
> >>-------- Original Message --------
> >>Subject: Starting up two of three nodes that compose a cluster
> >>Date: Wed, 19 Sep 2007 14:51:46 +0200
> >>From: carlopmart <carlopmart at gmail.com>
> >>To: linux clustering <linux-cluster at redhat.com>
> >>
> >>Hi all,
> >>
> >> I have setup a rhel5 based cluster with three nodes. Sometimes i need
> >>to start only two of this three nodes, but cluster services that i
> >>configured doesn't starts (fenced fail). Is it not possible to start up
> >>only two nodes on a three node cluster?? Maybe I need to adjust votes
> >>param to two instead of three??
> >
> >Could you be more specific about what you run, where, what happens,
> >what messages you see, etc.
> >
> >Dave
> >
> >
> Yes,
> 
>  First, I attached my cluster.conf. When /etc/init.d/cman starts, 
> returns an ok, but when I try to mount my gfs partition returns this error:
> 
> [root at haldir cluster]# service mountgfs start
> Mounting GFS filesystems:  /sbin/mount.gfs: lock_dlm_join: gfs_controld 
> join error: -22
> /sbin/mount.gfs: error mounting lockproto lock_dlm

So an error is coming back from gfs_controld on mount.  Please do the
steps manually, without init scripts or other scripts, so we know exactly
what steps fail.  And look in /var/log/messages for anything from
gfs_controld.  If there are none, send the output of 'group_tool -v;
group_tool dump gfs' after the failed mount.

Dave


From carlopmart at gmail.com  Fri Sep 21 15:29:22 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 17:29:22 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921151451.GB20746@redhat.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
Message-ID: <46F3E352.2030405@gmail.com>

David Teigland wrote:
> On Fri, Sep 21, 2007 at 05:02:18PM +0200, carlopmart wrote:
>> David Teigland wrote:
>>> On Thu, Sep 20, 2007 at 11:40:55AM +0200, carlopmart wrote:
>>>> Please, any hints??
>>>>
>>>> -------- Original Message --------
>>>> Subject: Starting up two of three nodes that compose a cluster
>>>> Date: Wed, 19 Sep 2007 14:51:46 +0200
>>>> From: carlopmart <carlopmart at gmail.com>
>>>> To: linux clustering <linux-cluster at redhat.com>
>>>>
>>>> Hi all,
>>>>
>>>> I have setup a rhel5 based cluster with three nodes. Sometimes i need
>>>> to start only two of this three nodes, but cluster services that i
>>>> configured doesn't starts (fenced fail). Is it not possible to start up
>>>> only two nodes on a three node cluster?? Maybe I need to adjust votes
>>>> param to two instead of three??
>>> Could you be more specific about what you run, where, what happens,
>>> what messages you see, etc.
>>>
>>> Dave
>>>
>>>
>> Yes,
>>
>>  First, I attached my cluster.conf. When /etc/init.d/cman starts, 
>> returns an ok, but when I try to mount my gfs partition returns this error:
>>
>> [root at haldir cluster]# service mountgfs start
>> Mounting GFS filesystems:  /sbin/mount.gfs: lock_dlm_join: gfs_controld 
>> join error: -22
>> /sbin/mount.gfs: error mounting lockproto lock_dlm
> 
> So an error is coming back from gfs_controld on mount.  Please do the
> steps manually, without init scripts or other scripts, so we know exactly
> what steps fail.  And look in /var/log/messages for anything from
> gfs_controld.  If there are none, send the output of 'group_tool -v;
> group_tool dump gfs' after the failed mount.
> 
> Dave
> 
> 
Hi Dave,

  When I try mount gfs patition fails:

  [root at thranduil log]# mount -t gfs /dev/xvdc1 /data
/sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
/sbin/mount.gfs: error mounting lockproto lock_dlm
[root at thranduil log]#

Output of group_tool command:

[root at thranduil log]# group_tool -v; group_tool dump gfs
type             level name     id       state node id local_done
fence            0     default  00010001 JOIN_START_WAIT 1 100010001 0
[1]
1190386130 listen 1
1190386130 cpg 4
1190386130 groupd 6
1190386130 uevent 7
1190386130 plocks 10
1190386130 setup done
1190386167 client 6: join /data gfs lock_dlm XenDomUcluster:datavol01 rw 
/dev/xvdc1
1190386167 mount: /data gfs lock_dlm XenDomUcluster:datavol01 rw /dev/xvdc1
1190386167 datavol01 cluster name matches: XenDomUcluster
1190386167 mount: not in default fence domain
1190386167 datavol01 do_mount: rv -22
1190386167 client 6 fd 11 dead
1190386167 client 6 fd -1 dead
1190386228 client 6: join /data gfs lock_dlm XenDomUcluster:datavol01 rw 
/dev/xvdc1
1190386228 mount: /data gfs lock_dlm XenDomUcluster:datavol01 rw /dev/xvdc1
1190386228 datavol01 cluster name matches: XenDomUcluster
1190386228 mount: not in default fence domain
1190386228 datavol01 do_mount: rv -22
1190386228 client 6 fd 11 dead
1190386228 client 6 fd -1 dead
1190388485 client 6: join /data gfs lock_dlm XenDomUcluster:datavol01 rw 
/dev/xvdc1
1190388485 mount: /data gfs lock_dlm XenDomUcluster:datavol01 rw /dev/xvdc1
1190388485 datavol01 cluster name matches: XenDomUcluster
1190388485 mount: not in default fence domain
1190388485 datavol01 do_mount: rv -22
1190388485 client 6 fd 11 dead
1190388485 client 6 fd -1 dead
1190388530 client 6: dump
[root at thranduil log]#

  Thanks David.


-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From isplist at logicore.net  Fri Sep 21 14:40:34 2007
From: isplist at logicore.net (isplist at logicore.net)
Date: Fri, 21 Sep 2007 09:40:34 -0500
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
Message-ID: <200792194034.295600@leena>

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


From teigland at redhat.com  Fri Sep 21 15:37:22 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 10:37:22 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F3E352.2030405@gmail.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
	<46F3E352.2030405@gmail.com>
Message-ID: <20070921153722.GC20746@redhat.com>

On Fri, Sep 21, 2007 at 05:29:22PM +0200, carlopmart wrote:
>  [root at thranduil log]# mount -t gfs /dev/xvdc1 /data
> /sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
> /sbin/mount.gfs: error mounting lockproto lock_dlm

This has already been changed to report a descriptive error message,
  "node not a member of the default fence domain"

as is shown in the debug log from gfs_controld below, and I suspect
appears in your /var/log/messages.

> 1190388485 mount: not in default fence domain
> 1190388485 datavol01 do_mount: rv -22

> [root at thranduil log]# group_tool -v; group_tool dump gfs
> type             level name     id       state node id local_done
> fence            0     default  00010001 JOIN_START_WAIT 1 100010001 0
> [1]

This shows it's not in the fence domain yet.  The reason appears to be
that it's trying to fence someone.  Again, look in /var/log/messages to
find out more information about what needs to be fenced, or why fencing
isn't working.

Dave


From carlopmart at gmail.com  Fri Sep 21 15:50:09 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 17:50:09 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921153722.GC20746@redhat.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
	<46F3E352.2030405@gmail.com> <20070921153722.GC20746@redhat.com>
Message-ID: <46F3E831.80700@gmail.com>

David Teigland wrote:
> On Fri, Sep 21, 2007 at 05:29:22PM +0200, carlopmart wrote:
>>  [root at thranduil log]# mount -t gfs /dev/xvdc1 /data
>> /sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
>> /sbin/mount.gfs: error mounting lockproto lock_dlm
> 
> This has already been changed to report a descriptive error message,
>   "node not a member of the default fence domain"
> 
> as is shown in the debug log from gfs_controld below, and I suspect
> appears in your /var/log/messages.
> 
>> 1190388485 mount: not in default fence domain
>> 1190388485 datavol01 do_mount: rv -22
> 
>> [root at thranduil log]# group_tool -v; group_tool dump gfs
>> type             level name     id       state node id local_done
>> fence            0     default  00010001 JOIN_START_WAIT 1 100010001 0
>> [1]
> 
> This shows it's not in the fence domain yet.  The reason appears to be
> that it's trying to fence someone.  Again, look in /var/log/messages to
> find out more information about what needs to be fenced, or why fencing
> isn't working.
> 
> Dave
> 
> 
Correct Dave. Error is:

Sep 21 16:50:30 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:50:30 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:50:35 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:50:35 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:50:40 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:50:40 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:50:45 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:50:45 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:50:50 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:50:50 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:50:55 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:50:55 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:51:00 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:51:00 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:51:05 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:51:05 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:51:10 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:51:10 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:51:15 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:51:15 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:51:20 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:51:20 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
Sep 21 16:51:25 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
Sep 21 16:51:25 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed

  And it is ok. "elrond.hpulabs.org" is the node that I can't startup 
(it is on maintenance hardware until monday). I need to start all other 
cluster services under thranduil and haldir .... Is it possible???

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From teigland at redhat.com  Fri Sep 21 15:51:25 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 10:51:25 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F3E831.80700@gmail.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
	<46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
Message-ID: <20070921155125.GD20746@redhat.com>

On Fri, Sep 21, 2007 at 05:50:09PM +0200, carlopmart wrote:
> David Teigland wrote:
> >On Fri, Sep 21, 2007 at 05:29:22PM +0200, carlopmart wrote:
> >> [root at thranduil log]# mount -t gfs /dev/xvdc1 /data
> >>/sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
> >>/sbin/mount.gfs: error mounting lockproto lock_dlm
> >
> >This has already been changed to report a descriptive error message,
> >  "node not a member of the default fence domain"
> >
> >as is shown in the debug log from gfs_controld below, and I suspect
> >appears in your /var/log/messages.
> >
> >>1190388485 mount: not in default fence domain
> >>1190388485 datavol01 do_mount: rv -22
> >
> >>[root at thranduil log]# group_tool -v; group_tool dump gfs
> >>type             level name     id       state node id local_done
> >>fence            0     default  00010001 JOIN_START_WAIT 1 100010001 0
> >>[1]
> >
> >This shows it's not in the fence domain yet.  The reason appears to be
> >that it's trying to fence someone.  Again, look in /var/log/messages to
> >find out more information about what needs to be fenced, or why fencing
> >isn't working.
> >
> >Dave
> >
> >
> Correct Dave. Error is:
> 
> Sep 21 16:51:25 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
> Sep 21 16:51:25 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
> 
>  And it is ok. "elrond.hpulabs.org" is the node that I can't startup 
> (it is on maintenance hardware until monday). I need to start all other 
> cluster services under thranduil and haldir .... Is it possible???

Two options:

1. Remove that node from cluster.conf so it's not fenced every time the
cluster starts up.

2. Manually override/ack the fencing operation every time it happens with:
fence_ack_manual -n elrond.hpulabs.org.  This will allow things to
continue.

Dave


From carlopmart at gmail.com  Fri Sep 21 16:08:29 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 18:08:29 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921155125.GD20746@redhat.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
	<46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com>
Message-ID: <46F3EC7D.5090108@gmail.com>

David Teigland wrote:
> On Fri, Sep 21, 2007 at 05:50:09PM +0200, carlopmart wrote:
>> David Teigland wrote:
>>> On Fri, Sep 21, 2007 at 05:29:22PM +0200, carlopmart wrote:
>>>> [root at thranduil log]# mount -t gfs /dev/xvdc1 /data
>>>> /sbin/mount.gfs: lock_dlm_join: gfs_controld join error: -22
>>>> /sbin/mount.gfs: error mounting lockproto lock_dlm
>>> This has already been changed to report a descriptive error message,
>>>  "node not a member of the default fence domain"
>>>
>>> as is shown in the debug log from gfs_controld below, and I suspect
>>> appears in your /var/log/messages.
>>>
>>>> 1190388485 mount: not in default fence domain
>>>> 1190388485 datavol01 do_mount: rv -22
>>>> [root at thranduil log]# group_tool -v; group_tool dump gfs
>>>> type             level name     id       state node id local_done
>>>> fence            0     default  00010001 JOIN_START_WAIT 1 100010001 0
>>>> [1]
>>> This shows it's not in the fence domain yet.  The reason appears to be
>>> that it's trying to fence someone.  Again, look in /var/log/messages to
>>> find out more information about what needs to be fenced, or why fencing
>>> isn't working.
>>>
>>> Dave
>>>
>>>
>> Correct Dave. Error is:
>>
>> Sep 21 16:51:25 thranduil fenced[1081]: fencing node "elrond.hpulabs.org"
>> Sep 21 16:51:25 thranduil fenced[1081]: fence "elrond.hpulabs.org" failed
>>
>>  And it is ok. "elrond.hpulabs.org" is the node that I can't startup 
>> (it is on maintenance hardware until monday). I need to start all other 
>> cluster services under thranduil and haldir .... Is it possible???
> 
> Two options:
> 
> 1. Remove that node from cluster.conf so it's not fenced every time the
> cluster starts up.
> 
> 2. Manually override/ack the fencing operation every time it happens with:
> fence_ack_manual -n elrond.hpulabs.org.  This will allow things to
> continue.
> 
> Dave
> 
> 
  First option it isn't possible because I can't restore cluster.conf 
when elrond comes up on the other two nodes.

  Second option returns me this error:

  [root at thranduil ~]# clustat
Member Status: Quorate

   Member Name                        ID   Status
   ------ ----                        ---- ------
   thranduil.hpulabs.org                 1 Online, Local, rgmanager
   haldir.hpulabs.org                    2 Online, rgmanager
   elrond.hpulabs.org                    3 Offline

   Service Name         Owner (Last)                   State
   ------- ----         ----- ------                   -----
   service:rsync-svc    (none)                         stopped
   service:wwwsoft-svc  (none)                         stopped
   service:proxy-svc    (thranduil.hpulabs.org)        stopped
   service:mail-svc     (none)                         stopped
[root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org

Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable!  Please verify that the node shown above has
been reset or disconnected from storage.

Are you certain you want to continue? [yN] y
can't open /tmp/fence_manual.fifo: No such file or directory

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From teigland at redhat.com  Fri Sep 21 16:05:18 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 11:05:18 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F3EC7D.5090108@gmail.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
	<46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com> <46F3EC7D.5090108@gmail.com>
Message-ID: <20070921160518.GA23560@redhat.com>

On Fri, Sep 21, 2007 at 06:08:29PM +0200, carlopmart wrote:
> >1. Remove that node from cluster.conf so it's not fenced every time the
> >cluster starts up.
> >
> >2. Manually override/ack the fencing operation every time it happens with:
> >fence_ack_manual -n elrond.hpulabs.org.  This will allow things to
> >continue.

>  First option it isn't possible because I can't restore cluster.conf 
> when elrond comes up on the other two nodes.
> 
>  Second option returns me this error:
> 
>  [root at thranduil ~]# clustat

What does clustat have to do with any of this?

> [root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org
> 
> Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
> (i.e. power cycled or disconnected from shared storage devices)
> the GFS file system may become corrupted and all its data
> unrecoverable!  Please verify that the node shown above has
> been reset or disconnected from storage.
> 
> Are you certain you want to continue? [yN] y
> can't open /tmp/fence_manual.fifo: No such file or directory

That looks like the old RHEL4/cluster-1.0 version of fence_ack_manual...

Dave


From carlopmart at gmail.com  Fri Sep 21 16:15:37 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 18:15:37 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921160518.GA23560@redhat.com>
References: <46F24027.7040201@gmail.com> <20070921142839.GA20746@redhat.com>
	<46F3DCFA.5010803@gmail.com> <20070921151451.GB20746@redhat.com>
	<46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com> <46F3EC7D.5090108@gmail.com>
	<20070921160518.GA23560@redhat.com>
Message-ID: <46F3EE29.7090200@gmail.com>

David Teigland wrote:
> On Fri, Sep 21, 2007 at 06:08:29PM +0200, carlopmart wrote:
>>> 1. Remove that node from cluster.conf so it's not fenced every time the
>>> cluster starts up.
>>>
>>> 2. Manually override/ack the fencing operation every time it happens with:
>>> fence_ack_manual -n elrond.hpulabs.org.  This will allow things to
>>> continue.
> 
>>  First option it isn't possible because I can't restore cluster.conf 
>> when elrond comes up on the other two nodes.
>>
>>  Second option returns me this error:
>>
>>  [root at thranduil ~]# clustat
> 
> What does clustat have to do with any of this?
> 
>> [root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org
>>
>> Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
>> (i.e. power cycled or disconnected from shared storage devices)
>> the GFS file system may become corrupted and all its data
>> unrecoverable!  Please verify that the node shown above has
>> been reset or disconnected from storage.
>>
>> Are you certain you want to continue? [yN] y
>> can't open /tmp/fence_manual.fifo: No such file or directory
> 
> That looks like the old RHEL4/cluster-1.0 version of fence_ack_manual...
> 
> Dave
> 
> 
And has some solution???

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From teigland at redhat.com  Fri Sep 21 16:18:07 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 11:18:07 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F3EE29.7090200@gmail.com>
References: <20070921142839.GA20746@redhat.com> <46F3DCFA.5010803@gmail.com>
	<20070921151451.GB20746@redhat.com> <46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com> <46F3EC7D.5090108@gmail.com>
	<20070921160518.GA23560@redhat.com> <46F3EE29.7090200@gmail.com>
Message-ID: <20070921161807.GB23560@redhat.com>

On Fri, Sep 21, 2007 at 06:15:37PM +0200, carlopmart wrote:
> >>[root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org
> >>
> >>Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
> >>(i.e. power cycled or disconnected from shared storage devices)
> >>the GFS file system may become corrupted and all its data
> >>unrecoverable!  Please verify that the node shown above has
> >>been reset or disconnected from storage.
> >>
> >>Are you certain you want to continue? [yN] y
> >>can't open /tmp/fence_manual.fifo: No such file or directory
> >
> >That looks like the old RHEL4/cluster-1.0 version of fence_ack_manual...

> And has some solution???

You need to make sure the RHEL4/cluster-1.0 binaries are removed from the
nodes and the new RHEL5/cluster-2.0/openais binaries are installed.  If
you're getting this far, it may only be some fencing binaries that are
incorrect, so first just remove fence_manual and fence_ack_manual and make
sure you have the new fence_ack_manual installed (it's now a bash script).
fence_manual no longer exists in RHEL5/cluster-2.0 code since
fence_ack_manual talks directly with fenced.

Dave


From carlopmart at gmail.com  Fri Sep 21 16:36:04 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 18:36:04 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921161807.GB23560@redhat.com>
References: <20070921142839.GA20746@redhat.com> <46F3DCFA.5010803@gmail.com>
	<20070921151451.GB20746@redhat.com> <46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com> <46F3EC7D.5090108@gmail.com>
	<20070921160518.GA23560@redhat.com> <46F3EE29.7090200@gmail.com>
	<20070921161807.GB23560@redhat.com>
Message-ID: <46F3F2F4.9080007@gmail.com>

David Teigland wrote:
> On Fri, Sep 21, 2007 at 06:15:37PM +0200, carlopmart wrote:
>>>> [root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org
>>>>
>>>> Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
>>>> (i.e. power cycled or disconnected from shared storage devices)
>>>> the GFS file system may become corrupted and all its data
>>>> unrecoverable!  Please verify that the node shown above has
>>>> been reset or disconnected from storage.
>>>>
>>>> Are you certain you want to continue? [yN] y
>>>> can't open /tmp/fence_manual.fifo: No such file or directory
>>> That looks like the old RHEL4/cluster-1.0 version of fence_ack_manual...
> 
>> And has some solution???
> 
> You need to make sure the RHEL4/cluster-1.0 binaries are removed from the
> nodes and the new RHEL5/cluster-2.0/openais binaries are installed.  If
> you're getting this far, it may only be some fencing binaries that are
> incorrect, so first just remove fence_manual and fence_ack_manual and make
> sure you have the new fence_ack_manual installed (it's now a bash script).
> fence_manual no longer exists in RHEL5/cluster-2.0 code since
> fence_ack_manual talks directly with fenced.
> 
> Dave
> 
> 
Sorry??? this three nodes are RHEL5 with lastest patches applied except 
kernel version 2.6.18-8.1.10.

Version of cman is: cman-2.0.64-1.0.1.el5
Version of gfs-utils:
Version of rgmanager: rgmanager-2.0.24-1.el5

  And fence-manual exists on this cluster suite:

[root at haldir xen]# whereis fence_manual
fence_manual: /sbin/fence_manual /usr/share/man/man8/fence_manual.8.gz
[root at haldir xen]# rpm -qf /sbin/fence_manual
cman-2.0.64-1.0.1.el5
[root at smeagol xen]#

And fence_ack_manual it is not a bash script, it is a binary:

[root at haldir xen]# whereis fence_ack_manual
fence_ack_manual: /sbin/fence_ack_manual 
/usr/share/man/man8/fence_ack_manual.8.gz
[root at haldir xen]# cd /sbin
[root at haldir sbin]# file fence_ack_manual
fence_ack_manual: ELF 32-bit LSB executable, Intel 80386, version 1 
(SYSV), for GNU/Linux 2.6.9, dynamically linked (uses shared libs), for 
GNU/Linux 2.6.9, stripped
[root at haldir sbin]#

  Do I need to install rhel5.1 beta to do this?? If it yes i have a very 
very great problem ....


-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From teigland at redhat.com  Fri Sep 21 16:55:11 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 21 Sep 2007 11:55:11 -0500
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose a
	cluster
In-Reply-To: <46F3F2F4.9080007@gmail.com>
References: <20070921151451.GB20746@redhat.com> <46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com> <46F3EC7D.5090108@gmail.com>
	<20070921160518.GA23560@redhat.com> <46F3EE29.7090200@gmail.com>
	<20070921161807.GB23560@redhat.com> <46F3F2F4.9080007@gmail.com>
Message-ID: <20070921165511.GC23560@redhat.com>

On Fri, Sep 21, 2007 at 06:36:04PM +0200, carlopmart wrote:
> David Teigland wrote:
> >On Fri, Sep 21, 2007 at 06:15:37PM +0200, carlopmart wrote:
> >>>>[root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org
> >>>>
> >>>>Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
> >>>>(i.e. power cycled or disconnected from shared storage devices)
> >>>>the GFS file system may become corrupted and all its data
> >>>>unrecoverable!  Please verify that the node shown above has
> >>>>been reset or disconnected from storage.
> >>>>
> >>>>Are you certain you want to continue? [yN] y
> >>>>can't open /tmp/fence_manual.fifo: No such file or directory
> >>>That looks like the old RHEL4/cluster-1.0 version of fence_ack_manual...
> >
> >>And has some solution???
> >
> >You need to make sure the RHEL4/cluster-1.0 binaries are removed from the
> >nodes and the new RHEL5/cluster-2.0/openais binaries are installed.  If
> >you're getting this far, it may only be some fencing binaries that are
> >incorrect, so first just remove fence_manual and fence_ack_manual and make
> >sure you have the new fence_ack_manual installed (it's now a bash script).
> >fence_manual no longer exists in RHEL5/cluster-2.0 code since
> >fence_ack_manual talks directly with fenced.
> >
> >Dave
> >
> >
> Sorry??? this three nodes are RHEL5 with lastest patches applied except 
> kernel version 2.6.18-8.1.10.
> 
> Version of cman is: cman-2.0.64-1.0.1.el5
> Version of gfs-utils:
> Version of rgmanager: rgmanager-2.0.24-1.el5
> 
>  And fence-manual exists on this cluster suite:
> 
> [root at haldir xen]# whereis fence_manual
> fence_manual: /sbin/fence_manual /usr/share/man/man8/fence_manual.8.gz
> [root at haldir xen]# rpm -qf /sbin/fence_manual
> cman-2.0.64-1.0.1.el5
> [root at smeagol xen]#
> 
> And fence_ack_manual it is not a bash script, it is a binary:
> 
> [root at haldir xen]# whereis fence_ack_manual
> fence_ack_manual: /sbin/fence_ack_manual 
> /usr/share/man/man8/fence_ack_manual.8.gz
> [root at haldir xen]# cd /sbin
> [root at haldir sbin]# file fence_ack_manual
> fence_ack_manual: ELF 32-bit LSB executable, Intel 80386, version 1 
> (SYSV), for GNU/Linux 2.6.9, dynamically linked (uses shared libs), for 
> GNU/Linux 2.6.9, stripped
> [root at haldir sbin]#
> 
>  Do I need to install rhel5.1 beta to do this?? If it yes i have a very 
> very great problem ....

Looks like I was wrong about what got into RHEL5, it's a real pity the new
stuff didn't make it.  Looking back at your cluster.conf file it seems
that you're using fence_gnbd for that node, so my next guess is that
fence_gnbd isn't found or isn't working.

I can't find a way to override a failing fence operation in the RHEL5
code, so that probably means you'll have to get fence_gnbd working.

Or, another somewhat dangerous option is to disable startup fencing
altogether by adding this to cluster.conf:
  <fence_daemon clean_start="1"/>

Dave


From Danny.Wall at health-first.org  Fri Sep 21 17:11:04 2007
From: Danny.Wall at health-first.org (Danny Wall)
Date: Fri, 21 Sep 2007 13:11:04 -0400
Subject: [Linux-cluster] Sounds like your cluster communication is happening
	over the T1, and not a local network.
Message-ID: <46F3C2E9.EB03.00C8.0@health-first.org>

Sounds like your cluster communication is happening over the T1, and not a local network.

Message: 11
Date: Fri, 21 Sep 2007 09:40:34 -0500
From: "isplist at logicore.net" <isplist at logicore.net>
Subject: [Linux-cluster] Cluster won't come up when T1 is down???
To: linux-cluster <linux-cluster at redhat.com>
Message-ID: <200792194034.295600 at leena>
Content-Type: text/plain; charset="iso-8859-1"

Here's a very weird one. I have a cluster of web servers outgoing over a T1. 
When the T1 went down this morning, the cluster, which is all internal, non 
routable IP's, would not come back. All of the machines locked up around the 
loading DLM section on bootup.

Once the T1 came back, they all booted just fine and went into cluster mode.

What in the world would cause that? There aren't any external services 
required to fire up my local cluster, never were, it's always been fine 
before.

Mike


#####################################
This message is for the named person's use only.  It may
contain confidential, proprietary, or legally privileged
information.  No confidentiality or privilege is waived or
lost by any mistransmission.  If you receive this message
in error, please immediately delete it and all copies of it
from your system, destroy any hard copies of it, and notify
the sender.  You must not, directly or indirectly, use,
disclose, distribute, print, or copy any part of this message
if you are not the intended recipient.  Health First reserves
the right to monitor all e-mail communications through its
networks.  Any views or opinions expressed in this message
are solely those of the individual sender, except (1) where
the message states such views or opinions are on behalf of
a particular entity;  and (2) the sender is authorized by
the entity to give such views or opinions.
#####################################


From Danny.Wall at health-first.org  Fri Sep 21 17:17:29 2007
From: Danny.Wall at health-first.org (Danny Wall)
Date: Fri, 21 Sep 2007 13:17:29 -0400
Subject: [Linux-cluster] Node will not join cluster.
Message-ID: <46F3C469.EB03.00C8.0@health-first.org>

I have a two node cluster on RHEL4. It was all Update 4, but Red Hat suggested I update the problem node to U5, so now it is mixed.

The problem:
We had a SAN issue last week that took 3 of the 15 LUNs offline. The SAN problem was been fixed, so I tried to rescan the LUNs, and still could not see the LUNs. I restarted one node, and ever since then, it would fail when trying to join the cluster. I disabled the cluster services at boot, and try them manually. Every time I get to CLVMD, it hangs forever. Sometimes I can CTRL-C to break out, sometimes no. There are no relevant messages in /var/log/messages. I have worked with Red Hat and the last thing they suggested was to upgrade the node to U5. I did Tuesday, and it still hangs. I am afraid to take down the other node, because if it does the same thing, I am going to have major problems.

I have also noticed that running lvdisplay, vgdisplay and pvdisplay hang. I am sure it is related. I know dlm and module is loaded.

lock_dlm               47220  0
lock_harness            7216  2 gfs,lock_dlm
dlm                   134404  3 lock_dlm
cman                  139552  11 gfs,lock_dlm,dlm


Any thoughts?

Danny


#####################################
This message is for the named person's use only.  It may
contain confidential, proprietary, or legally privileged
information.  No confidentiality or privilege is waived or
lost by any mistransmission.  If you receive this message
in error, please immediately delete it and all copies of it
from your system, destroy any hard copies of it, and notify
the sender.  You must not, directly or indirectly, use,
disclose, distribute, print, or copy any part of this message
if you are not the intended recipient.  Health First reserves
the right to monitor all e-mail communications through its
networks.  Any views or opinions expressed in this message
are solely those of the individual sender, except (1) where
the message states such views or opinions are on behalf of
a particular entity;  and (2) the sender is authorized by
the entity to give such views or opinions.
#####################################


From carlopmart at gmail.com  Fri Sep 21 17:21:05 2007
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 21 Sep 2007 19:21:05 +0200
Subject: [Linux-cluster] Re: Starting up two of three nodes that compose
	a cluster
In-Reply-To: <20070921165511.GC23560@redhat.com>
References: <20070921151451.GB20746@redhat.com> <46F3E352.2030405@gmail.com>
	<20070921153722.GC20746@redhat.com> <46F3E831.80700@gmail.com>
	<20070921155125.GD20746@redhat.com> <46F3EC7D.5090108@gmail.com>
	<20070921160518.GA23560@redhat.com> <46F3EE29.7090200@gmail.com>
	<20070921161807.GB23560@redhat.com> <46F3F2F4.9080007@gmail.com>
	<20070921165511.GC23560@redhat.com>
Message-ID: <46F3FD81.9030206@gmail.com>

David Teigland wrote:
> On Fri, Sep 21, 2007 at 06:36:04PM +0200, carlopmart wrote:
>> David Teigland wrote:
>>> On Fri, Sep 21, 2007 at 06:15:37PM +0200, carlopmart wrote:
>>>>>> [root at thranduil ~]# fence_ack_manual -n elrond.hpulabs.org
>>>>>>
>>>>>> Warning:  If the node "elrond.hpulabs.org" has not been manually fenced
>>>>>> (i.e. power cycled or disconnected from shared storage devices)
>>>>>> the GFS file system may become corrupted and all its data
>>>>>> unrecoverable!  Please verify that the node shown above has
>>>>>> been reset or disconnected from storage.
>>>>>>
>>>>>> Are you certain you want to continue? [yN] y
>>>>>> can't open /tmp/fence_manual.fifo: No such file or directory
>>>>> That looks like the old RHEL4/cluster-1.0 version of fence_ack_manual...
>>>> And has some solution???
>>> You need to make sure the RHEL4/cluster-1.0 binaries are removed from the
>>> nodes and the new RHEL5/cluster-2.0/openais binaries are installed.  If
>>> you're getting this far, it may only be some fencing binaries that are
>>> incorrect, so first just remove fence_manual and fence_ack_manual and make
>>> sure you have the new fence_ack_manual installed (it's now a bash script).
>>> fence_manual no longer exists in RHEL5/cluster-2.0 code since
>>> fence_ack_manual talks directly with fenced.
>>>
>>> Dave
>>>
>>>
>> Sorry??? this three nodes are RHEL5 with lastest patches applied except 
>> kernel version 2.6.18-8.1.10.
>>
>> Version of cman is: cman-2.0.64-1.0.1.el5
>> Version of gfs-utils:
>> Version of rgmanager: rgmanager-2.0.24-1.el5
>>
>>  And fence-manual exists on this cluster suite:
>>
>> [root at haldir xen]# whereis fence_manual
>> fence_manual: /sbin/fence_manual /usr/share/man/man8/fence_manual.8.gz
>> [root at haldir xen]# rpm -qf /sbin/fence_manual
>> cman-2.0.64-1.0.1.el5
>> [root at smeagol xen]#
>>
>> And fence_ack_manual it is not a bash script, it is a binary:
>>
>> [root at haldir xen]# whereis fence_ack_manual
>> fence_ack_manual: /sbin/fence_ack_manual 
>> /usr/share/man/man8/fence_ack_manual.8.gz
>> [root at haldir xen]# cd /sbin
>> [root at haldir sbin]# file fence_ack_manual
>> fence_ack_manual: ELF 32-bit LSB executable, Intel 80386, version 1 
>> (SYSV), for GNU/Linux 2.6.9, dynamically linked (uses shared libs), for 
>> GNU/Linux 2.6.9, stripped
>> [root at haldir sbin]#
>>
>>  Do I need to install rhel5.1 beta to do this?? If it yes i have a very 
>> very great problem ....
> 
> Looks like I was wrong about what got into RHEL5, it's a real pity the new
> stuff didn't make it.  Looking back at your cluster.conf file it seems
> that you're using fence_gnbd for that node, so my next guess is that
> fence_gnbd isn't found or isn't working.
> 
> I can't find a way to override a failing fence operation in the RHEL5
> code, so that probably means you'll have to get fence_gnbd working.
> 
> Or, another somewhat dangerous option is to disable startup fencing
> altogether by adding this to cluster.conf:
>   <fence_daemon clean_start="1"/>
> 
> Dave
> 
> 
Thanks Dave, but I have tried clean_start without luck ... Error is the 
same. Fence_gnd works ok, almost when three nodes are up. 
(deagol.hpulabs.org is a VMWare virtual machine allocated on a ESX cluster).

Well I will try to do a cron job to change cluster.conf at 00:00 AM on 
Monday ... I think that this is the only option ....

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From jdozarchuk at babcock.com  Fri Sep 21 17:34:19 2007
From: jdozarchuk at babcock.com (Ozarchuk, John D)
Date: Fri, 21 Sep 2007 13:34:19 -0400
Subject: [Linux-cluster] Node will not join cluster.
In-Reply-To: <46F3C469.EB03.00C8.0@health-first.org>
References: <46F3C469.EB03.00C8.0@health-first.org>
Message-ID: <EE9F27C20409C44AA031C769B3DBCE4102748432@barbpo3.bwes.net>

By U5, do you mean Redhat Enterprise 5.0?

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Danny Wall
Sent: Friday, September 21, 2007 1:17 PM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Node will not join cluster.

I have a two node cluster on RHEL4. It was all Update 4, but Red Hat
suggested I update the problem node to U5, so now it is mixed.

The problem:
We had a SAN issue last week that took 3 of the 15 LUNs offline. The SAN
problem was been fixed, so I tried to rescan the LUNs, and still could
not see the LUNs. I restarted one node, and ever since then, it would
fail when trying to join the cluster. I disabled the cluster services at
boot, and try them manually. Every time I get to CLVMD, it hangs
forever. Sometimes I can CTRL-C to break out, sometimes no. There are no
relevant messages in /var/log/messages. I have worked with Red Hat and
the last thing they suggested was to upgrade the node to U5. I did
Tuesday, and it still hangs. I am afraid to take down the other node,
because if it does the same thing, I am going to have major problems.

I have also noticed that running lvdisplay, vgdisplay and pvdisplay
hang. I am sure it is related. I know dlm and module is loaded.

lock_dlm               47220  0
lock_harness            7216  2 gfs,lock_dlm
dlm                   134404  3 lock_dlm
cman                  139552  11 gfs,lock_dlm,dlm


Any thoughts?

Danny


#####################################
This message is for the named person's use only.  It may
contain confidential, proprietary, or legally privileged
information.  No confidentiality or privilege is waived or
lost by any mistransmission.  If you receive this message
in error, please immediately delete it and all copies of it
from your system, destroy any hard copies of it, and notify
the sender.  You must not, directly or indirectly, use,
disclose, distribute, print, or copy any part of this message
if you are not the intended recipient.  Health First reserves
the right to monitor all e-mail communications through its
networks.  Any views or opinions expressed in this message
are solely those of the individual sender, except (1) where
the message states such views or opinions are on behalf of
a particular entity;  and (2) the sender is authorized by
the entity to give such views or opinions.
#####################################

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
-----------------------------------------
This message is intended only for the individual or entity to which
it is addressed and contains information that is proprietary to The
Babcock & Wilcox Company and/or its affiliates, or may be otherwise
confidential.  If the reader of this message is not the intended
recipient, or the employee agent responsible for delivering the
message to the intended recipient, you are hereby notified that any
dissemination, distribution or copying of this communication is
strictly prohibited.  If you have received this communication in
error, please notify the sender immediately by return e-mail and
delete this message from your computer.  Thank you.


From simone.gotti at email.it  Fri Sep 21 17:51:09 2007
From: simone.gotti at email.it (Simone Gotti)
Date: Fri, 21 Sep 2007 19:51:09 +0200
Subject: [Linux-cluster] altname broken?
In-Reply-To: <466FE1C5.8090806@redhat.com>
References: <466FB0B3.1010806@physik.lmu.de> <466FB462.9010705@redhat.com>
	<466FD6A4.7030605@physik.lmu.de>  <466FE1C5.8090806@redhat.com>
Message-ID: <1190397069.3146.4.camel@localhost>

Hi,

Just found the same problem testing multiple heartbeat NICs on RHEL5
update 0. I didn't had time to try cvs HEAD, RHEL51 branch or the U1
beta1 and I cannot find a related bugzilla (just this mail) on it so
dunno if it was fixed.
Was this already reported? or I'll open a new one in the next days.

Thanks!
Bye!


On Wed, 2007-06-13 at 13:23 +0100, Patrick Caulfield wrote:
> Frederik Wagner wrote:
> > Hi Patrick,
> > 
> > On 06/13/2007 11:09 AM, Patrick Caulfield wrote:
> >> Frederik Wagner wrote:
> >>> Are there any known problems regading this option?
> >>>
> >> There's nothing changed in that area that I can recall. Can you track the
> >> startup with "cman_tool join -d" and also CMAN messages that appear in syslog on
> >> the nodes?
> >>
> 
> Ahh, I misread you original message slightly. I though this was an upgrade from
> 4.4 to 4.5 not to 5.0 which is a totally new cluster system!
> 
> I just tried altname on my systems here and it seems to work fine - though it's
> not a very representative test network.
> 
> Firstly, make sure that all your nodes are running the same version of the
> software. I suspect you are but it is important to realise that you can't mix V4
> and v5 clusters.
> 
> The TOTEM messages I'd rather leave to Steve Dake to have a look at if he has time
> 
-- 
Simone Gotti
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070921/cd95255b/attachment.sig>

From orkcu at yahoo.com  Fri Sep 21 20:30:54 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Fri, 21 Sep 2007 13:30:54 -0700 (PDT)
Subject: [Linux-cluster] Node will not join cluster.
In-Reply-To: <46F3C469.EB03.00C8.0@health-first.org>
Message-ID: <1018.53985.qm@web50609.mail.re2.yahoo.com>


--- Danny Wall <Danny.Wall at health-first.org> wrote:

> I have a two node cluster on RHEL4. It was all
> Update 4, but Red Hat suggested I update the problem
> node to U5, so now it is mixed.
> 
> The problem:
> We had a SAN issue last week that took 3 of the 15
> LUNs offline. The SAN problem was been fixed, so I
> tried to rescan the LUNs, and still could not see
> the LUNs. 
after the restart, if booted in runlevel 1, did you
see the LUNs ?
I mean as normal sdX devices
for example:
fdisk -l /dev/sdf
give you the correct information?

>I restarted one node, and ever since then,
> it would fail when trying to join the cluster. I
> disabled the cluster services at boot, and try them
> manually. Every time I get to CLVMD, it hangs
> forever. Sometimes I can CTRL-C to break out,
> sometimes no. There are no relevant messages in
> /var/log/messages. I have worked with Red Hat and
> the last thing they suggested was to upgrade the
> node to U5. I did Tuesday, and it still hangs. I am
> afraid to take down the other node, because if it
> does the same thing, I am going to have major
> problems.

well,  you have a _two_ node cluster, so, you will
have always one node up, right?
if the LUNs are see it by the OS (kernel)

> 
> I have also noticed that running lvdisplay,
> vgdisplay and pvdisplay hang. I am sure it is
> related. I know dlm and module is loaded.

I guess it is a matter of lvm unable to read the LUNs,
"initializated" the VG and LV

but, what about fencing and quorum of the cluster, are
they ok? 
or the node never go over the clvmd start and stay in
that stage forever?

cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Pinpoint customers who are looking for what you sell. 
http://searchmarketing.yahoo.com/


From chris at cmiware.com  Sat Sep 22 00:31:09 2007
From: chris at cmiware.com (Chris Harms)
Date: Fri, 21 Sep 2007 19:31:09 -0500
Subject: [Linux-cluster] rgmanager segfault
Message-ID: <46F4624D.8020206@cmiware.com>

This occurred after stopping the cluster via  Luci and then start via Luci:

kernel: clurgmgrd[22367]: segfault at 0000000000000048 rip 
0000000000419804 rsp 0000000041401030 error 4

5.1 Beta RPMs:
    rgmanager-2.0.28-1.el5
    cman-2.0.70-1.el5
    kernel-2.6.18-36.el5


From jos at xos.nl  Sat Sep 22 16:03:56 2007
From: jos at xos.nl (Jos Vos)
Date: Sat, 22 Sep 2007 18:03:56 +0200
Subject: [Linux-cluster] clurgmgrd doesn't work with quorum disk
Message-ID: <200709221603.l8MG3uAv031551@jasmine.xos.nl>

Hi,

After adding a quorum disk to a two-node configuration (and restarting,
rebooting the cluster), everything *seems* to be ok, except that
clurgmgrd doesn't work properly anymore and that no services are started.
It times out when clustat asks the status, it can't get any service
status anymore, and so it doesn't start any service too...

This is on RHEL 5.0.

Some info (I masked most names/IP address by ****'s):

Old config:

     <cman expected_votes="1" two_node="1"/>

New config (this is the only change in the config!):

     <cman expected_votes="3" two_node="0"/>
     <quorumd interval="1" tko="10" votes="1" label="qdisk1">
             <heuristic program="ping ****** -c1 -t1" score="1" interval="2"/>
             <heuristic program="ping ****** -c1 -t1" score="1" interval="2"/>
             <heuristic program="ping ****** -c1 -t1" score="1" interval="2"/>
     </quorumd>

cman_tool status (old config):

Version: 6.0.1
Config Version: 36
Cluster Name: ***********
Cluster Id: 21428
Cluster Member: Yes
Cluster Generation: 8
Membership state: Cluster-Member
Nodes: 2
Expected votes: 1
Total votes: 2
Quorum: 1  
Active subsystems: 8
Flags: 2node 
Ports Bound: 0 11 177  
Node name: *******
Node ID: 1
Multicast addresses: 239.192.83.8 
Node addresses: ************** 

cman_tool status (new config):

Version: 6.0.1
Config Version: 35
Cluster Name: ***********
Cluster Id: 21428
Cluster Member: Yes
Cluster Generation: 8
Membership state: Cluster-Member
Nodes: 2
Expected votes: 3
Total votes: 3
Quorum: 2  
Active subsystems: 9
Flags: 
Ports Bound: 0 11 177  
Node name: *******
Node ID: 1
Multicast addresses: 239.192.83.8 
Node addresses: ************** 

clustat (old config):

Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  *******                               1 Online, rgmanager
  *******                               2 Online, Local, rgmanager

  Service Name         Owner (Last)                   State         
  ------- ----         ----- ------                   -----         
  service:***********  *******                        started         
  service:***********  *******                        started         
  service:***********  *******                        started         
  service:***********  *******                        started      


clustat (new config):

Timeout waiting for a response from Resource Group Manager
Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  *******                               1 Online
  *******                               2 Online, Local
  /dev/sdc                              0 Online, Quorum Disk

With the new condig, I see in the messages file a.o.:

Sep 22 17:15:15 ******* clurgmgrd[4465]: <err> #34: Cannot get status for service service:***********

Also, clurgmgrd can't be stopped by its service script in that case,
it just doesn't react.

The quorum disk is seen correctly on both systems with "mkqdisk -L"
and it looks like cman etc. do take it into account.

Any clues on what's going on here?
Thanks,

--
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From celso at webbertek.com.br  Mon Sep 24 13:52:49 2007
From: celso at webbertek.com.br (Celso K. Webber)
Date: Mon, 24 Sep 2007 10:52:49 -0300
Subject: [Linux-cluster] Qdiskd issue over EMC CX3-20 Storage + EMC
 PowerPath multipathing software
Message-ID: <46F7C131.7080806@webbertek.com.br>

Hello all,

I'm having an issue with a RHCS4 Cluster. Here are some versioning information:
* Storage: EMC CX3-20, latest FLARE code applied;
* HBAs: 2 x QLogic 2462, latest/certified BIOS by EMC (v1.24);
* Servers: 2 Dell PowerEdge 2950, 2 quad-core processors, 8 GB of RAM, all 
available firmware updates applied;
* OS: RHEL v4 Update 4 with kernel 2.6.9-42.0.10.ELsmp (latest kernel 
certified by EMC for RHEL4). RHEL4u5 is not certified by EMC yet, so we 
installed RHEL4u4 and upgraded the kernel only to the latest certified release;
* Processor Architecture: everything x86_64;
* RH Cluster Suite: latest non-kernel specific packages, the other packages 
(cman-kernel, dlm-kernel) are specific for the 2.6.9-42.0.10.ELsmp kernel;
* Multipath/storage software: EMC PowerPath v5.0.0.157, Navisphere Agent 
v6.24.0.6.13.


We are experiencing a problem during our tests with the multipathing 
software. If we take out the fiber cable from one of the HBAs from one 
server, it removes itself from the Cluster because of losing access to the 
shared partition (this is an expected behaviour). But since we are pointing 
the Qdisk daemon to an EMC Power device (/dev/emcpowerXX), we expected that 
the multipathing should take care of the fibre channel outage.

So, I ask: is there any specific timers I should configure in cman or qdiskd 
so that I can give enough time for PowerPath to reconfigure the available 
paths? The Storage Administrator verified that all storage paths are active 
and functional.

By the way: I'm configuring qdiskd with no heuristics at all, since we 
didn't have any reliable "router" available to work as an IP tiebraker for 
the cluster. Since the Cluster FAQ 
(http://sources.redhat.com/cluster/faq.html#quorumdiskonly) states in 
question #23 (last paragraph) that in RHCS4U5 it is possible to have no 
heuristics at all, we are trying it in this installation for the first time.

Below I post the relevant part of my cluster.conf file:

<?xml version="1.0"?>
<cluster config_version="9" name="clu_xxxxxx">
	<quorumd log_facility="local6" device="/dev/emcpowere1" interval="1" 
min_score="0" tko="10" votes="1"/>
	<fence_daemon post_fail_delay="10" post_join_delay="3"/>
	<clusternodes>
		<clusternode name="node1" votes="1">
			<fence>
				<method name="1">
					<device lanplus="" name="node1-ipmi"/>
				</method>
			</fence>
		</clusternode>
		<clusternode name="node2" votes="1">
			<fence>
				<method name="1">
					<device lanplus="" name="node2-ipmi"/>
				</method>
			</fence>
		</clusternode>
	</clusternodes>
	<cman/>
	<fencedevices>
		<fencedevice agent="fence_ipmilan" auth="none" ipaddr="hercules01-ipmi" 
login="root" name="node1-ipmi" passwd="clusterprosper"/>
		<fencedevice agent="fence_ipmilan" auth="none" ipaddr="hercules02-ipmi" 
login="root" name="node2-ipmi" passwd="clusterprosper"/>
	</fencedevices>
...


Thank you very much for any ideas on this issue.

Regards,

Celso.

-- 
*Celso Kopp Webber*

celso at webbertek.com.br <mailto:celso at webbertek.com.br>

*Webbertek - Opensource Knowledge*
(41) 8813-1919 - celular
(41) 4063-8448, ramal 102 - fixo


-- 
Esta mensagem foi verificada pelo sistema de antiv?rus e
 acredita-se estar livre de perigo.


From jobot at wmdata.com  Mon Sep 24 15:33:30 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Mon, 24 Sep 2007 17:33:30 +0200
Subject: [Linux-cluster] Possible cman init script race condition
Message-ID: <CABF801D13AA444988E62B7AF62C371D0217F346@WMRI000167.corp.wmdata.net>

Hi,

I think there might be some race condition in the cman init script causing fenced to stop working correctly.
I'm able to reliably reproduce the problem using problem using a minimal cluster.conf with two nodes and fence_manual fencing.

Steps to reproduce:
1. Install cluster.conf on two nodes, enable the "cman" service and reboot both nodes.
2. The cluster boots successfully and clustat lists both nodes as online.
3. Power-cycle node prod-db1.
4. On prod-db2 openais detects the missing node but fenced decides to do nothing about it and logs nothing to /var/log/messages (But the fenced process is still running)

Output from "group_tool dump fence" After the test:

[root at prod-db2 ~]# group_tool dump fence
1190645583 our_nodeid 2 our_name prod-db2
1190645583 listen 4 member 5 groupd 7
1190645584 client 3: join default
1190645584 delay post_join 120s post_fail 0s
1190645584 added 2 nodes from ccs
1190645584 setid default 65538
1190645584 start default 1 members 2 
1190645584 do_recovery stop 0 start 1 finish 0
1190645584 node "prod-db1" not a cman member, cn 1
1190645584 add first victim prod-db1
1190645585 node "prod-db1" not a cman member, cn 1
1190645586 node "prod-db1" not a cman member, cn 1
1190645587 node "prod-db1" not a cman member, cn 1
1190645588 node "prod-db1" not a cman member, cn 1
1190645589 node "prod-db1" not a cman member, cn 1
1190645590 node "prod-db1" not a cman member, cn 1
1190645591 node "prod-db1" not a cman member, cn 1
1190645592 node "prod-db1" not a cman member, cn 1
1190645593 node "prod-db1" not a cman member, cn 1
1190645594 node "prod-db1" not a cman member, cn 1
1190645595 node "prod-db1" not a cman member, cn 1
1190645596 node "prod-db1" not a cman member, cn 1
1190645597 node "prod-db1" not a cman member, cn 1
1190645598 node "prod-db1" not a cman member, cn 1
1190645599 node "prod-db1" not a cman member, cn 1
1190645600 reduce victim prod-db1
1190645600 delay of 16s leaves 0 victims
1190645600 finish default 1
1190645600 stop default
1190645600 start default 2 members 1 2 
1190645600 do_recovery stop 1 start 2 finish 1
1190645954 client 3: dump    <--- Before killing prod-db1
1190645985 stop default
1190645985 start default 3 members 2 
1190645985 do_recovery stop 2 start 3 finish 1
1190645985 finish default 3
1190646008 client 3: dump    <--- After killing prod-db1

The reason why I suspect some kind of race condition is because I'm only able to reproduce this when cman is started on boot. If I run "service cman start" manually or add a "sleep 30" line to the init script fenced works as expected.

I'm running this test on two Dell 1955 blades on which it takes quite some time for the Linux kernel to boot and initialize all drivers. So I'm guessing this might cause the cman to be started before some crucial part of the kernel has been loaded/initialized or something.

I tried to reproduce this problem using two xen virtual machines without success, but that kernel probably boots and initializes fast enough to avoid this race condition.

The scary part is that as far as I can tell fenced is the only cman daemon being affected by this. So your cluster appears to work fine. But when a node needs to be fenced the operation it isn't carried out and that can cause gfs filesystem corruption.

Any thoughts?   

Hacking the init script doesn't feel like a very solid solution since that will be overwritten the next time the cman rpm is updated...

OS: RHEL5 Advanced platform
cman: 2.0.64-1.0.1.el5
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cluster.conf.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070924/e95e5c14/attachment.txt>

From teigland at redhat.com  Mon Sep 24 16:10:12 2007
From: teigland at redhat.com (David Teigland)
Date: Mon, 24 Sep 2007 11:10:12 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D0217F346@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F346@WMRI000167.corp.wmdata.net>
Message-ID: <20070924161012.GA14880@redhat.com>

On Mon, Sep 24, 2007 at 05:33:30PM +0200, Borgstr?m Jonas wrote:
> Hi,
> 
> I think there might be some race condition in the cman init script
> causing fenced to stop working correctly.
> I'm able to reliably reproduce the problem using problem using a minimal
> cluster.conf with two nodes and fence_manual fencing.
> 
> Steps to reproduce:
> 1. Install cluster.conf on two nodes, enable the "cman" service and
> reboot both nodes.
> 2. The cluster boots successfully and clustat lists both nodes as online.
> 3. Power-cycle node prod-db1.
> 4. On prod-db2 openais detects the missing node but fenced decides to do
> nothing about it and logs nothing to /var/log/messages (But the fenced
> process is still running)
> 
> Output from "group_tool dump fence" After the test:
> 
> [root at prod-db2 ~]# group_tool dump fence
> 1190645583 our_nodeid 2 our_name prod-db2
> 1190645583 listen 4 member 5 groupd 7
> 1190645584 client 3: join default
> 1190645584 delay post_join 120s post_fail 0s
> 1190645584 added 2 nodes from ccs
> 1190645584 setid default 65538
> 1190645584 start default 1 members 2 
> 1190645584 do_recovery stop 0 start 1 finish 0
> 1190645584 node "prod-db1" not a cman member, cn 1
> 1190645584 add first victim prod-db1
> 1190645585 node "prod-db1" not a cman member, cn 1
> 1190645586 node "prod-db1" not a cman member, cn 1
> 1190645587 node "prod-db1" not a cman member, cn 1
> 1190645588 node "prod-db1" not a cman member, cn 1
> 1190645589 node "prod-db1" not a cman member, cn 1
> 1190645590 node "prod-db1" not a cman member, cn 1
> 1190645591 node "prod-db1" not a cman member, cn 1
> 1190645592 node "prod-db1" not a cman member, cn 1
> 1190645593 node "prod-db1" not a cman member, cn 1
> 1190645594 node "prod-db1" not a cman member, cn 1
> 1190645595 node "prod-db1" not a cman member, cn 1
> 1190645596 node "prod-db1" not a cman member, cn 1
> 1190645597 node "prod-db1" not a cman member, cn 1
> 1190645598 node "prod-db1" not a cman member, cn 1
> 1190645599 node "prod-db1" not a cman member, cn 1
> 1190645600 reduce victim prod-db1
> 1190645600 delay of 16s leaves 0 victims
> 1190645600 finish default 1
> 1190645600 stop default
> 1190645600 start default 2 members 1 2 
> 1190645600 do_recovery stop 1 start 2 finish 1

I think something has gone wrong here, either in groupd or fenced, that's
preventing this start from finishing (we don't get a 'finish default 2'
which we expect).  A 'group_tool -v' here should show the state of the
fence group still in transition.  Could you run that, plus a 'group_tool
dump' at this point, in addition to the 'dump fence' you have.  And please
run those commands on both nodes.

> 1190645954 client 3: dump    <--- Before killing prod-db1
> 1190645985 stop default
> 1190645985 start default 3 members 2 
> 1190645985 do_recovery stop 2 start 3 finish 1
> 1190645985 finish default 3
> 1190646008 client 3: dump    <--- After killing prod-db1

Node 1 isn't fenced here because it never completed joining the fence
group above.

> The scary part is that as far as I can tell fenced is the only cman
> daemon being affected by this. So your cluster appears to work fine. But
> when a node needs to be fenced the operation it isn't carried out and
> that can cause gfs filesystem corruption.

You shouldn't be able to mount gfs on the node where joining the fence
group is stuck.

Thanks for the informative report.

Dave


From lhh at redhat.com  Mon Sep 24 17:15:28 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:15:28 -0400
Subject: [Linux-cluster] Cluster not starting backup after reboot
In-Reply-To: <46E84D11.80305@transolutions.net>
References: <1381753941-1189461572-cardhu_decombobulator_blackberry.rim.net-1440139959-@bxe019.bisx.prod.on.blackberry>
	<46E5C2BD.7090705@transolutions.net>
	<20070912185421.GM7563@redhat.com>
	<46E84D11.80305@transolutions.net>
Message-ID: <20070924171528.GQ10872@redhat.com>

On Wed, Sep 12, 2007 at 03:33:21PM -0500, James Wilson wrote:
> Thanks for the replies.
> 
> I have decided to have the dom0's in one cluster and the domU's in 
> another. I import the storage into the xen instances as raw storage and 
> configure gfs from within the domU and it is working fine that now. The 
> only thing is when I test failover the ip does not move over. When I 
> checked the service it was still assigned to the instance that got 
> fenced. Any ideas?

That's strange - rgmanager could be stuck waiting for a fencing
operation to complete.  If you reset the other VM (the one which was
evicted from the cluster), does the IP come back?

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 24 17:17:25 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:17:25 -0400
Subject: [Linux-cluster] Services timeout
In-Reply-To: <46E8E254.8030301@cesca.es>
References: <46E791BC.2090006@cesca.es> <20070912185903.GO7563@redhat.com>
	<46E8E254.8030301@cesca.es>
Message-ID: <20070924171721.GR10872@redhat.com>

On Thu, Sep 13, 2007 at 09:10:12AM +0200, Jordi Prats wrote:
> Hi,
> This is all the data I can collect. bpkar is a backup process. It have 
> happened while it was indexing (it takes several weeks) and doing a 
> backup at the same time.
> 
> best regards,

That's helpful actually.  Thanks.  I think this is related to the status
queue length bug which will be fixed in 4.6 and 5.1:

https://bugzilla.redhat.com/show_bug.cgi?id=246669

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 24 17:20:32 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:20:32 -0400
Subject: [Linux-cluster] rgmanager segfault
In-Reply-To: <46F4624D.8020206@cmiware.com>
References: <46F4624D.8020206@cmiware.com>
Message-ID: <20070924172032.GS10872@redhat.com>

On Fri, Sep 21, 2007 at 07:31:09PM -0500, Chris Harms wrote:
> This occurred after stopping the cluster via  Luci and then start via Luci:
> 
> kernel: clurgmgrd[22367]: segfault at 0000000000000048 rip 
> 0000000000419804 rsp 0000000041401030 error 4
> 
> 5.1 Beta RPMs:
>    rgmanager-2.0.28-1.el5
>    cman-2.0.70-1.el5
>    kernel-2.6.18-36.el5

Could you file a bugzilla? (segfaults = always bad)

I wonder if rgmanager was running and maybe CMAN got stopped out from
under it.


-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 24 17:20:55 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:20:55 -0400
Subject: [Linux-cluster] clurgmgrd doesn't work with quorum disk
In-Reply-To: <200709221603.l8MG3uAv031551@jasmine.xos.nl>
References: <200709221603.l8MG3uAv031551@jasmine.xos.nl>
Message-ID: <20070924172055.GT10872@redhat.com>

On Sat, Sep 22, 2007 at 06:03:56PM +0200, Jos Vos wrote:
> Hi,
> 
> After adding a quorum disk to a two-node configuration (and restarting,
> rebooting the cluster), everything *seems* to be ok, except that
> clurgmgrd doesn't work properly anymore and that no services are started.
> It times out when clustat asks the status, it can't get any service
> status anymore, and so it doesn't start any service too...
> 
> This is on RHEL 5.0.

Fixed in 5.1 beta.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 24 17:25:06 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:25:06 -0400
Subject: [Linux-cluster] Qdiskd issue over EMC CX3-20 Storage + EMC
	PowerPath multipathing software
In-Reply-To: <46F7C131.7080806@webbertek.com.br>
References: <46F7C131.7080806@webbertek.com.br>
Message-ID: <20070924172506.GU10872@redhat.com>

On Mon, Sep 24, 2007 at 10:52:49AM -0300, Celso K. Webber wrote:
> Hello all,
> 
> I'm having an issue with a RHCS4 Cluster. Here are some versioning 
> information:
> * Storage: EMC CX3-20, latest FLARE code applied;
> * HBAs: 2 x QLogic 2462, latest/certified BIOS by EMC (v1.24);
> * Servers: 2 Dell PowerEdge 2950, 2 quad-core processors, 8 GB of RAM, all 
> available firmware updates applied;
> * OS: RHEL v4 Update 4 with kernel 2.6.9-42.0.10.ELsmp (latest kernel 
> certified by EMC for RHEL4). RHEL4u5 is not certified by EMC yet, so we 
> installed RHEL4u4 and upgraded the kernel only to the latest certified 
> release;
> * Processor Architecture: everything x86_64;
> * RH Cluster Suite: latest non-kernel specific packages, the other packages 
> (cman-kernel, dlm-kernel) are specific for the 2.6.9-42.0.10.ELsmp kernel;
> * Multipath/storage software: EMC PowerPath v5.0.0.157, Navisphere Agent 
> v6.24.0.6.13.
> 
> 
> We are experiencing a problem during our tests with the multipathing 
> software. If we take out the fiber cable from one of the HBAs from one 
> server, it removes itself from the Cluster because of losing access to the 
> shared partition (this is an expected behaviour). But since we are pointing 
> the Qdisk daemon to an EMC Power device (/dev/emcpowerXX), we expected that 
> the multipathing should take care of the fibre channel outage.

Yes, it should.
> 
> So, I ask: is there any specific timers I should configure in cman or 
> qdiskd so that I can give enough time for PowerPath to reconfigure the 
> available paths? The Storage Administrator verified that all storage paths 
> are active and functional.

Yes, you can adjust interval + TKO count.  See the qdisk(5) man page.
Note that qdisk timings should be < (0.5 * cluster_timeout), so you will
need to adjust your cluster timeout accordingly:

   <cman deadnode_timeout="..." .../>

> By the way: I'm configuring qdiskd with no heuristics at all, since we 
> didn't have any reliable "router" available to work as an IP tiebraker for 
> the cluster. Since the Cluster FAQ 
> (http://sources.redhat.com/cluster/faq.html#quorumdiskonly) states in 
> question #23 (last paragraph) that in RHCS4U5 it is possible to have no 
> heuristics at all, we are trying it in this installation for the first time.

Correct, but it's nice to have them :)
> 
> <?xml version="1.0"?>
> <cluster config_version="9" name="clu_xxxxxx">
> 	<quorumd log_facility="local6" device="/dev/emcpowere1" interval="1" 
> min_score="0" tko="10" votes="1"/>

  interval*tko = qdisk timeout (in seconds)

> 	<cman/>

   <cman deadnode_timeout="X"/>

   ... where X = 2 * interval * tko + 1

The qdisk timeout should be set to something which exceeds the Power
Path failure detection timeout; I don't know what that is...

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From lhh at redhat.com  Mon Sep 24 17:26:48 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:26:48 -0400
Subject: [Linux-cluster] CS4 U5 / advised quorumd values ?
In-Reply-To: <46EF998A.1040808@bull.net>
References: <46EF998A.1040808@bull.net>
Message-ID: <20070924172648.GV10872@redhat.com>

On Tue, Sep 18, 2007 at 11:25:30AM +0200, Alain Moulle wrote:
> Hi
> 
> First time I will try the quorum disk functionnality ... which values
> are recommended for quorumd parameters for a two nodes cluster ?
> 
> Is this correct ?
>    <quorumd interval="2" tko="10" votes="3" log_level="9" log_facility="local4"
> status_file="/tmp/qdisk_status" min_score="3" label="CS4QUORUMDISK">
>    </quorumd>

For two node cluster:

  votes = 1
  tko = 10
  interval = 1

Unless you have your cman deadnode_timeout set to something other than
the default (which is 21 seconds).

I recommend against using a status_file unless you're debugging; it's a
potential place for qdiskd to block.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From jos at xos.nl  Mon Sep 24 17:27:41 2007
From: jos at xos.nl (Jos Vos)
Date: Mon, 24 Sep 2007 19:27:41 +0200
Subject: [Linux-cluster] clurgmgrd doesn't work with quorum disk
In-Reply-To: <20070924172055.GT10872@redhat.com>
References: <200709221603.l8MG3uAv031551@jasmine.xos.nl>
	<20070924172055.GT10872@redhat.com>
Message-ID: <20070924172741.GA21329@jasmine.xos.nl>

On Mon, Sep 24, 2007 at 01:20:55PM -0400, Lon Hohberger wrote:

> > After adding a quorum disk to a two-node configuration (and restarting,
> > rebooting the cluster), everything *seems* to be ok, except that
> > clurgmgrd doesn't work properly anymore and that no services are started.
> > It times out when clustat asks the status, it can't get any service
> > status anymore, and so it doesn't start any service too...
> > 
> > This is on RHEL 5.0.
> 
> Fixed in 5.1 beta.

OK, thanks.  Do I need to run the full 5.1 software set (incl. kernel),
or can I try it by replacing only some (user-level) packages, or can I
even only replace "rgmanager"?

And what is the status of this behavior in RHEL4U5?
That is, is the bug RHEL5-specific or not?

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From lhh at redhat.com  Mon Sep 24 17:28:18 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:28:18 -0400
Subject: [Linux-cluster] CS4 U5 / advised quorumd values ?
In-Reply-To: <20070924172648.GV10872@redhat.com>
References: <46EF998A.1040808@bull.net> <20070924172648.GV10872@redhat.com>
Message-ID: <20070924172818.GW10872@redhat.com>

On Mon, Sep 24, 2007 at 01:26:48PM -0400, Lon Hohberger wrote:
> On Tue, Sep 18, 2007 at 11:25:30AM +0200, Alain Moulle wrote:
> > Hi
> > 
> > First time I will try the quorum disk functionnality ... which values
> > are recommended for quorumd parameters for a two nodes cluster ?
> > 
> > Is this correct ?
> >    <quorumd interval="2" tko="10" votes="3" log_level="9" log_facility="local4"
> > status_file="/tmp/qdisk_status" min_score="3" label="CS4QUORUMDISK">
> >    </quorumd>
> 
> For two node cluster:
> 
>   votes = 1
>   tko = 10
>   interval = 1
> 

Note that in this configuration (no heuristics), qdiskd doesn't really
gain you much.  It's probable that you can run without it...

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From jobot at wmdata.com  Mon Sep 24 17:29:01 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Mon, 24 Sep 2007 19:29:01 +0200
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <20070924161012.GA14880@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D0217F346@WMRI000167.corp.wmdata.net>
	<20070924161012.GA14880@redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>

From: David Teigland [mailto:teigland at redhat.com] 
Sent: den 24 september 2007 18:10
To: Borgstr?m Jonas
Cc: linux clustering
Subject: Re: [Linux-cluster] Possible cman init script race condition
*snip*
> > 1190645596 node "prod-db1" not a cman member, cn 1
> > 1190645597 node "prod-db1" not a cman member, cn 1
> > 1190645598 node "prod-db1" not a cman member, cn 1
> > 1190645599 node "prod-db1" not a cman member, cn 1
> > 1190645600 reduce victim prod-db1
> > 1190645600 delay of 16s leaves 0 victims
> > 1190645600 finish default 1
> > 1190645600 stop default
> > 1190645600 start default 2 members 1 2 
> > 1190645600 do_recovery stop 1 start 2 finish 1
> 
> I think something has gone wrong here, either in groupd or fenced, that's
> preventing this start from finishing (we don't get a 'finish default 2'
> which we expect).  A 'group_tool -v' here should show the state of the
> fence group still in transition.  Could you run that, plus a 'group_tool
> dump' at this point, in addition to the 'dump fence' you have.  And please
> run those commands on both nodes.
> 
Hi david, thanks for your fast response. Here's the output you requested:

[root at prod-db1 ~]# group_tool -v
type             level name     id       state node id local_done
fence            0     default  00010001 JOIN_START_WAIT 2 200020001 1
[1 2]

[root at prod-db2 ~]# group_tool -v
type             level name     id       state node id local_done
fence            0     default  00010002 JOIN_START_WAIT 1 100020001 1
[1 2]

I attached "group_tool dump" output as files, since they are quite long.

> > 1190645954 client 3: dump    <--- Before killing prod-db1
> > 1190645985 stop default
> > 1190645985 start default 3 members 2 
> > 1190645985 do_recovery stop 2 start 3 finish 1
> > 1190645985 finish default 3
> > 1190646008 client 3: dump    <--- After killing prod-db1
> 
> Node 1 isn't fenced here because it never completed joining the fence
> group above.
>
> > The scary part is that as far as I can tell fenced is the only cman
> > daemon being affected by this. So your cluster appears to work fine. But
> > when a node needs to be fenced the operation it isn't carried out and
> > that can cause gfs filesystem corruption.
>
> You shouldn't be able to mount gfs on the node where joining the fence
> group is stuck.

My current setup is very stripped down so I haven't configured gfs. But on my original setup where I initially noticed this issue I had no problem mounting gfs filesystems and after a simulated network failure I could still continue to write to the filesystem from both nodes since no node was fenced, and that quickly corrupted the filesystem.

Regards,
Jonas

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: group_dump_db2.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070924/d646b122/attachment.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: group_dump_db1.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070924/d646b122/attachment-0001.txt>

From lhh at redhat.com  Mon Sep 24 17:29:51 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 24 Sep 2007 13:29:51 -0400
Subject: [Linux-cluster] clurgmgrd doesn't work with quorum disk
In-Reply-To: <20070924172741.GA21329@jasmine.xos.nl>
References: <200709221603.l8MG3uAv031551@jasmine.xos.nl>
	<20070924172055.GT10872@redhat.com>
	<20070924172741.GA21329@jasmine.xos.nl>
Message-ID: <20070924172951.GX10872@redhat.com>

On Mon, Sep 24, 2007 at 07:27:41PM +0200, Jos Vos wrote:
> On Mon, Sep 24, 2007 at 01:20:55PM -0400, Lon Hohberger wrote:
> 
> > > After adding a quorum disk to a two-node configuration (and restarting,
> > > rebooting the cluster), everything *seems* to be ok, except that
> > > clurgmgrd doesn't work properly anymore and that no services are started.
> > > It times out when clustat asks the status, it can't get any service
> > > status anymore, and so it doesn't start any service too...
> > > 
> > > This is on RHEL 5.0.
> > 
> > Fixed in 5.1 beta.
> 
> OK, thanks.  Do I need to run the full 5.1 software set (incl. kernel),
> or can I try it by replacing only some (user-level) packages, or can I
> even only replace "rgmanager"?

Certainly for testing, you can just replace rgmanager + cman + deps.
5.1 rgmanager requires 5.1 cman due to changes in the ccs daemon.

-- 
Lon Hohberger - Software Engineer - Red Hat, Inc.


From Alain.Moulle at bull.net  Tue Sep 25 06:59:17 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 25 Sep 2007 08:59:17 +0200
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for a two nodes
	cluster
Message-ID: <46F8B1C5.7010007@bull.net>

Hi

As said before, I'm trying for the first time to
add a quorum disk on my two nodes cluster.
Finally, I've set parameters as below :

        <quorumd interval="1" tko="10" votes="1" log_level="9"
log_facility="local4" status_file="/tmp/qdisk_status" label="CS4QUORUMDISK">
        </quorumd>
and
	<clusternode name="node1" votes="1">
and
	<clusternode name="node2" votes="1">
and
        <cman expected_votes="3" two_node="0"/>

With these parameters values for my two nodes cluster, I have
to launch the Cluster Suite on both nodes, it can't be
launched on one node only because cman detects "not quorate".

Is there a way to avoid this ? to be able to launch CS4 only
on one side ?
Could somebody give me the advised values for such a two-nodes cluster ?

By the way, how can I be shure that quorum disk is really used , where can
I see traces ? (I see nothing in syslog)

Thanks a lot for your help
Alain Moull?


From mad at wol.de  Tue Sep 25 08:37:15 2007
From: mad at wol.de (Marc - A. Dahlhaus [ Administration | Westermann GmbH ])
Date: Tue, 25 Sep 2007 10:37:15 +0200
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for a two
	nodes cluster
In-Reply-To: <46F8B1C5.7010007@bull.net>
References: <46F8B1C5.7010007@bull.net>
Message-ID: <1190709435.23919.11.camel@marc>

Hello,

Am Dienstag, den 25.09.2007, 08:59 +0200 schrieb Alain Moulle:
> Hi
> 
> As said before, I'm trying for the first time to
> add a quorum disk on my two nodes cluster.
> Finally, I've set parameters as below :
> 
>         <quorumd interval="1" tko="10" votes="1" log_level="9"
> log_facility="local4" status_file="/tmp/qdisk_status" label="CS4QUORUMDISK">
>         </quorumd>
> and
> 	<clusternode name="node1" votes="1">
> and
> 	<clusternode name="node2" votes="1">
> and
>         <cman expected_votes="3" two_node="0"/>

Your problem lies here: expected_votes="3"

Your qdisk is not a regular Node, it is a helper in your setup to
provide the own vote to make the system quorate so it can start.

You should calculate your votes like this:
( votes % 2 ) + 1

So you should use this: expected_votes="2"

In addition you should really read the FAQ:

http://sourceware.org/cluster/faq.html


> With these parameters values for my two nodes cluster, I have
> to launch the Cluster Suite on both nodes, it can't be
> launched on one node only because cman detects "not quorate".
> 
> Is there a way to avoid this ? to be able to launch CS4 only
> on one side ?
> Could somebody give me the advised values for such a two-nodes cluster ?
> 
> By the way, how can I be shure that quorum disk is really used , where can
> I see traces ? (I see nothing in syslog)
> 
> Thanks a lot for your help
> Alain Moull?


Marc


From beres.laszlo at sys-admin.hu  Tue Sep 25 09:39:11 2007
From: beres.laszlo at sys-admin.hu (BERES Laszlo)
Date: Tue, 25 Sep 2007 11:39:11 +0200
Subject: [Linux-cluster] RHEL5 and Oracle 10g RAC
Message-ID: <46F8D73F.3090408@sys-admin.hu>

Hello there,

well, I'm not an Oracle expert, but I'd like to know if is it possible
to get working the above under RHEL5? As far as I know RAC is certified
with GULM, but RHEL5's cluster doesn't have it.

Have you ever used them successfully?

Thanks,

-- 
B?RES L?szl?	 RHCE, RHCX
senior IT engineer, trainer


From jprats at cesca.es  Tue Sep 25 11:33:37 2007
From: jprats at cesca.es (Jordi Prats)
Date: Tue, 25 Sep 2007 13:33:37 +0200
Subject: [Linux-cluster] problems starting cman
Message-ID: <46F8F211.3020309@cesca.es>

Hi,
I'm trying to start a two node cluster, but I can't start cman. If I run 
manually the join command it returns 141. (I don't know what dos this 
means) How can I know why it do not  start? /var/log/messages does not 
give me any clue about it.

Here I copy all the messages I get. (I'm using a Fedora 7 with the 
latest packeges)

best regards,
Jordi

[root at inf17 log]# /etc/init.d/cman restart
Stopping cluster:
   Stopping fencing... done
   Stopping cman... done
   Stopping ccsd... done
   Unmounting configfs... done
                                                           [  OK  ]
Starting cluster:
   Loading modules... done
   Mounting configfs... done
   Starting ccsd... done
   Starting cman... failed

                                                           [FAILED]


[root at inf17 log]# /usr/sbin/cman_tool -t 120 -w join
[root at inf17 log]# echo $?
141

Sep 25 13:30:05 inf17 ccsd[4226]: Starting ccsd 2.0.60:
Sep 25 13:30:05 inf17 ccsd[4226]:  Built: Feb 27 2007 17:13:55
Sep 25 13:30:05 inf17 ccsd[4226]:  Copyright (C) Red Hat, Inc.  2004  
All rights reserved.
Sep 25 13:30:05 inf17 ccsd[4226]: cluster.conf (cluster name = boumort, 
version = 3) found.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] AIS Executive Service 
RELEASE 'subrev 1204 version 0.80.1'
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Copyright (C) 2002-2006 
MontaVista Software, Inc and contributors.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Copyright (C) 2006 Red Hat, 
Inc.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Using default multicast 
address of 239.192.52.96
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
openais_cpg loaded.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service handler 
'openais cluster closed process group service v1.01'
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
openais_cfg loaded.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service handler 
'openais configuration service'
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
openais_msg loaded.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service handler 
'openais message service B.01.01'
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
openais_lck loaded.
Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service handler 
'openais distributed locking service B.01.01'
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] AIS Executive Service 
RELEASE 'subrev 1204 version 0.80.1'
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Copyright (C) 2002-2006 
MontaVista Software, Inc and contributors.
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Copyright (C) 2006 Red Hat, 
Inc.
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Using default multicast 
address of 239.192.52.96
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
openais_cpg loaded.
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service handler 
'openais cluster closed process group service v1.01'
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
openais_cfg loaded.
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service handler 
'openais configuration service'
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
openais_msg loaded.
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service handler 
'openais message service B.01.01'
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
openais_lck loaded.
Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service handler 
'openais distributed locking service B.01.01'
Sep 25 13:30:34 inf17 ccsd[4226]: Unable to connect to cluster 
infrastructure after 30 seconds.
Sep 25 13:31:04 inf17 ccsd[4226]: Unable to connect to cluster 
infrastructure after 60 seconds.

<?xml version="1.0"?>
<cluster alias="boumort" config_version="3" name="boumort">
        <fence_daemon clean_start="0" post_fail_delay="0" 
post_join_delay="3"/>
        <clusternodes>
                <clusternode name="inf17" votes="1" nodeid="1">
                        <fence>
                                <method name="1">
                                        <device name="fdinf17"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="inf18" votes="1" nodeid="2">
                        <fence>
                                <method name="1">
                                        <device name="fdinf18"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman/>
        <fencedevices>
                <fencedevice agent="fence_ilo" hostname="172.16.11.17" 
login="root" name="fdinf17" passwd="R9t9t0u1ll3"/>
                <fencedevice agent="fence_ilo" hostname="172.16.11.18" 
login="root" name="fdinf18" passwd="R9t9t0u1ll3"/>
        </fencedevices>
        <rm>
                <failoverdomains/>
                <resources/>
        </rm>
        <cman expected_votes="1" two_node="1" />
</cluster>


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From randy.brown at noaa.gov  Tue Sep 25 11:51:40 2007
From: randy.brown at noaa.gov (Randy Brown)
Date: Tue, 25 Sep 2007 07:51:40 -0400
Subject: [Linux-cluster] problems starting cman
In-Reply-To: <46F8F211.3020309@cesca.es>
References: <46F8F211.3020309@cesca.es>
Message-ID: <46F8F64C.9000907@noaa.gov>

Do you have a host-based firewall running? 

Randy

Jordi Prats wrote:
> Hi,
> I'm trying to start a two node cluster, but I can't start cman. If I 
> run manually the join command it returns 141. (I don't know what dos 
> this means) How can I know why it do not  start? /var/log/messages 
> does not give me any clue about it.
>
> Here I copy all the messages I get. (I'm using a Fedora 7 with the 
> latest packeges)
>
> best regards,
> Jordi
>
> [root at inf17 log]# /etc/init.d/cman restart
> Stopping cluster:
>   Stopping fencing... done
>   Stopping cman... done
>   Stopping ccsd... done
>   Unmounting configfs... done
>                                                           [  OK  ]
> Starting cluster:
>   Loading modules... done
>   Mounting configfs... done
>   Starting ccsd... done
>   Starting cman... failed
>
>                                                           [FAILED]
>
>
> [root at inf17 log]# /usr/sbin/cman_tool -t 120 -w join
> [root at inf17 log]# echo $?
> 141
>
> Sep 25 13:30:05 inf17 ccsd[4226]: Starting ccsd 2.0.60:
> Sep 25 13:30:05 inf17 ccsd[4226]:  Built: Feb 27 2007 17:13:55
> Sep 25 13:30:05 inf17 ccsd[4226]:  Copyright (C) Red Hat, Inc.  2004  
> All rights reserved.
> Sep 25 13:30:05 inf17 ccsd[4226]: cluster.conf (cluster name = 
> boumort, version = 3) found.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] AIS Executive Service 
> RELEASE 'subrev 1204 version 0.80.1'
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Copyright (C) 2002-2006 
> MontaVista Software, Inc and contributors.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Copyright (C) 2006 Red 
> Hat, Inc.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Using default multicast 
> address of 239.192.52.96
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
> openais_cpg loaded.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
> handler 'openais cluster closed process group service v1.01'
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
> openais_cfg loaded.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
> handler 'openais configuration service'
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
> openais_msg loaded.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
> handler 'openais message service B.01.01'
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
> openais_lck loaded.
> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
> handler 'openais distributed locking service B.01.01'
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] AIS Executive Service 
> RELEASE 'subrev 1204 version 0.80.1'
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Copyright (C) 2002-2006 
> MontaVista Software, Inc and contributors.
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Copyright (C) 2006 Red 
> Hat, Inc.
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Using default multicast 
> address of 239.192.52.96
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
> openais_cpg loaded.
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
> handler 'openais cluster closed process group service v1.01'
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
> openais_cfg loaded.
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
> handler 'openais configuration service'
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
> openais_msg loaded.
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
> handler 'openais message service B.01.01'
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
> openais_lck loaded.
> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
> handler 'openais distributed locking service B.01.01'
> Sep 25 13:30:34 inf17 ccsd[4226]: Unable to connect to cluster 
> infrastructure after 30 seconds.
> Sep 25 13:31:04 inf17 ccsd[4226]: Unable to connect to cluster 
> infrastructure after 60 seconds.
>
> <?xml version="1.0"?>
> <cluster alias="boumort" config_version="3" name="boumort">
>        <fence_daemon clean_start="0" post_fail_delay="0" 
> post_join_delay="3"/>
>        <clusternodes>
>                <clusternode name="inf17" votes="1" nodeid="1">
>                        <fence>
>                                <method name="1">
>                                        <device name="fdinf17"/>
>                                </method>
>                        </fence>
>                </clusternode>
>                <clusternode name="inf18" votes="1" nodeid="2">
>                        <fence>
>                                <method name="1">
>                                        <device name="fdinf18"/>
>                                </method>
>                        </fence>
>                </clusternode>
>        </clusternodes>
>        <cman/>
>        <fencedevices>
>                <fencedevice agent="fence_ilo" hostname="172.16.11.17" 
> login="root" name="fdinf17" passwd="R9t9t0u1ll3"/>
>                <fencedevice agent="fence_ilo" hostname="172.16.11.18" 
> login="root" name="fdinf18" passwd="R9t9t0u1ll3"/>
>        </fencedevices>
>        <rm>
>                <failoverdomains/>
>                <resources/>
>        </rm>
>        <cman expected_votes="1" two_node="1" />
> </cluster>
>
>


From Nick.Couchman at seakr.com  Tue Sep 25 12:33:10 2007
From: Nick.Couchman at seakr.com (Nick Couchman)
Date: Tue, 25 Sep 2007 06:33:10 -0600
Subject: [Linux-cluster] NFS Exports in rgmanager
Message-ID: <46F8ABA6.87A6.0099.1@seakr.com>

I'm attempting to set up fail-over NFS services on my RHCS cluster and have a question about the NFS export and NFS client resources.  In our environment, we have a few occasions where we don't want to export the entire volume to a client, we just want to export a certain directory on the volume.  So, for example, by default, when you set up a GFS Filesystem (let's say it's mounted at /mnt/Vol1), an NFS export resource (we'll call it Share1), and an NFS Client Resource (client1 with options rw,root_squash), the entire contents of /mnt/Vol1 is exported to the client.  In our case, we have a directory - /mnt/Vol1/Dir1/ThisDirectory - that we need to export to a client, and we only want the client to have access to that directory.  Is this possible in rgmanager?  If so, how do I go about it?  If not, can someone suggest some alternatives, aside from completely manually managing the NFS stuff with RHCS?  If the answer to that is that I must do it manually, maybe someone can consider adding in a "Directory" resource to rgmanager?
 
Thanks,
Nick
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070925/33c5b996/attachment.htm>

From jprats at cesca.es  Tue Sep 25 12:48:14 2007
From: jprats at cesca.es (Jordi Prats)
Date: Tue, 25 Sep 2007 14:48:14 +0200
Subject: [Linux-cluster] problems starting cman
In-Reply-To: <46F8F64C.9000907@noaa.gov>
References: <46F8F211.3020309@cesca.es> <46F8F64C.9000907@noaa.gov>
Message-ID: <46F9038E.6060406@cesca.es>

No, I have iptables disabled. The other node it's on the same subnet, so 
there's no firewall between them.

Jordi

Randy Brown wrote:
> Do you have a host-based firewall running?
> Randy
>
> Jordi Prats wrote:
>> Hi,
>> I'm trying to start a two node cluster, but I can't start cman. If I 
>> run manually the join command it returns 141. (I don't know what dos 
>> this means) How can I know why it do not  start? /var/log/messages 
>> does not give me any clue about it.
>>
>> Here I copy all the messages I get. (I'm using a Fedora 7 with the 
>> latest packeges)
>>
>> best regards,
>> Jordi
>>
>> [root at inf17 log]# /etc/init.d/cman restart
>> Stopping cluster:
>>   Stopping fencing... done
>>   Stopping cman... done
>>   Stopping ccsd... done
>>   Unmounting configfs... done
>>                                                           [  OK  ]
>> Starting cluster:
>>   Loading modules... done
>>   Mounting configfs... done
>>   Starting ccsd... done
>>   Starting cman... failed
>>
>>                                                           [FAILED]
>>
>>
>> [root at inf17 log]# /usr/sbin/cman_tool -t 120 -w join
>> [root at inf17 log]# echo $?
>> 141
>>
>> Sep 25 13:30:05 inf17 ccsd[4226]: Starting ccsd 2.0.60:
>> Sep 25 13:30:05 inf17 ccsd[4226]:  Built: Feb 27 2007 17:13:55
>> Sep 25 13:30:05 inf17 ccsd[4226]:  Copyright (C) Red Hat, Inc.  2004  
>> All rights reserved.
>> Sep 25 13:30:05 inf17 ccsd[4226]: cluster.conf (cluster name = 
>> boumort, version = 3) found.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] AIS Executive Service 
>> RELEASE 'subrev 1204 version 0.80.1'
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Copyright (C) 2002-2006 
>> MontaVista Software, Inc and contributors.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Copyright (C) 2006 Red 
>> Hat, Inc.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Using default multicast 
>> address of 239.192.52.96
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
>> openais_cpg loaded.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
>> handler 'openais cluster closed process group service v1.01'
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
>> openais_cfg loaded.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
>> handler 'openais configuration service'
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
>> openais_msg loaded.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
>> handler 'openais message service B.01.01'
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] openais component 
>> openais_lck loaded.
>> Sep 25 13:30:08 inf17 openais[4232]: [MAIN ] Registering service 
>> handler 'openais distributed locking service B.01.01'
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] AIS Executive Service 
>> RELEASE 'subrev 1204 version 0.80.1'
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Copyright (C) 2002-2006 
>> MontaVista Software, Inc and contributors.
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Copyright (C) 2006 Red 
>> Hat, Inc.
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Using default multicast 
>> address of 239.192.52.96
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
>> openais_cpg loaded.
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
>> handler 'openais cluster closed process group service v1.01'
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
>> openais_cfg loaded.
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
>> handler 'openais configuration service'
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
>> openais_msg loaded.
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
>> handler 'openais message service B.01.01'
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] openais component 
>> openais_lck loaded.
>> Sep 25 13:30:33 inf17 openais[4239]: [MAIN ] Registering service 
>> handler 'openais distributed locking service B.01.01'
>> Sep 25 13:30:34 inf17 ccsd[4226]: Unable to connect to cluster 
>> infrastructure after 30 seconds.
>> Sep 25 13:31:04 inf17 ccsd[4226]: Unable to connect to cluster 
>> infrastructure after 60 seconds.
>>
>> <?xml version="1.0"?>
>> <cluster alias="boumort" config_version="3" name="boumort">
>>        <fence_daemon clean_start="0" post_fail_delay="0" 
>> post_join_delay="3"/>
>>        <clusternodes>
>>                <clusternode name="inf17" votes="1" nodeid="1">
>>                        <fence>
>>                                <method name="1">
>>                                        <device name="fdinf17"/>
>>                                </method>
>>                        </fence>
>>                </clusternode>
>>                <clusternode name="inf18" votes="1" nodeid="2">
>>                        <fence>
>>                                <method name="1">
>>                                        <device name="fdinf18"/>
>>                                </method>
>>                        </fence>
>>                </clusternode>
>>        </clusternodes>
>>        <cman/>
>>        <fencedevices>
>>                <fencedevice agent="fence_ilo" hostname="172.16.11.17" 
>> login="root" name="fdinf17" passwd="R9t9t0u1ll3"/>
>>                <fencedevice agent="fence_ilo" hostname="172.16.11.18" 
>> login="root" name="fdinf18" passwd="R9t9t0u1ll3"/>
>>        </fencedevices>
>>        <rm>
>>                <failoverdomains/>
>>                <resources/>
>>        </rm>
>>        <cman expected_votes="1" two_node="1" />
>> </cluster>
>>
>>
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From jos at xos.nl  Tue Sep 25 12:47:50 2007
From: jos at xos.nl (Jos Vos)
Date: Tue, 25 Sep 2007 14:47:50 +0200
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for a two
	nodes cluster
In-Reply-To: <1190709435.23919.11.camel@marc>
References: <46F8B1C5.7010007@bull.net> <1190709435.23919.11.camel@marc>
Message-ID: <20070925124750.GA28897@jasmine.xos.nl>

On Tue, Sep 25, 2007 at 10:37:15AM +0200, Marc - A. Dahlhaus [ Administration | Westermann GmbH ] wrote:

> Your problem lies here: expected_votes="3"

[...]

> You should calculate your votes like this:
> ( votes % 2 ) + 1
> 
> So you should use this: expected_votes="2"

Oh... but the FAQ (# 18) explicitly says "nodes + 1" and gives "3"
as the example for a two-node cluster.  And I tried it (accidently)
with expected_votes="2" and the result was that the nodes started
fencing each other in an endless loop.  This was solve by setting
expected_votes="3".  This is on RHEL5. b.t.w., not RHEL4.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From mad at wol.de  Tue Sep 25 13:32:52 2007
From: mad at wol.de (Marc - A. Dahlhaus [ Administration | Westermann GmbH ])
Date: Tue, 25 Sep 2007 15:32:52 +0200
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for a two
	nodes cluster
In-Reply-To: <20070925124750.GA28897@jasmine.xos.nl>
References: <46F8B1C5.7010007@bull.net> <1190709435.23919.11.camel@marc>
	<20070925124750.GA28897@jasmine.xos.nl>
Message-ID: <1190727173.23919.43.camel@marc>

Am Dienstag, den 25.09.2007, 14:47 +0200 schrieb Jos Vos:
> On Tue, Sep 25, 2007 at 10:37:15AM +0200, Marc - A. Dahlhaus [ Administration | Westermann GmbH ] wrote:
> 
> > Your problem lies here: expected_votes="3"
> 
> [...]
> 
> > You should calculate your votes like this:
> > ( votes % 2 ) + 1
> > 
> > So you should use this: expected_votes="2"
> 
> Oh... but the FAQ (# 18) explicitly says "nodes + 1" and gives "3"
> as the example for a two-node cluster.  And I tried it (accidently)
> with expected_votes="2" and the result was that the nodes started
> fencing each other in an endless loop.  This was solve by setting
> expected_votes="3".  This is on RHEL5. b.t.w., not RHEL4.

I was talking about FAQ cman#24 (this is what you want, right? ).
I stated the quorate math, sorry for that. It should be
"qdisc votes = nodes - 1" and "cman expected votes = votes - 1".
You could give <qdisc votes="2"> in combination with <cman
expected_votes="3"> a try then.

If that vote increment in your qdisc doesn't work, FAQ cman#23 could fix
your circular fencing problems for real in a two node cluster. But we
don't use this here at our site as we have more than two nodes.

Marc


From rpeterso at redhat.com  Tue Sep 25 13:46:40 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Tue, 25 Sep 2007 08:46:40 -0500
Subject: [Linux-cluster] problems starting cman
In-Reply-To: <46F8F211.3020309@cesca.es>
References: <46F8F211.3020309@cesca.es>
Message-ID: <1190728000.3038.45.camel@technetium.msp.redhat.com>

On Tue, 2007-09-25 at 13:33 +0200, Jordi Prats wrote:
> Hi,
> I'm trying to start a two node cluster, but I can't start cman. If I run 
> manually the join command it returns 141. (I don't know what dos this 
> means) How can I know why it do not  start? /var/log/messages does not 
> give me any clue about it.
> 
> Here I copy all the messages I get. (I'm using a Fedora 7 with the 
> latest packeges)
> 
> best regards,
> Jordi

Hi Jordi,

Check to make sure you have multicast enabled and if it really works.
We've seen some problems with some network switches not being set
for this properly.  Unfortunately, it varies from switch to switch.
Also, check to see if aisexec is failing and leaving a core dump.
This would be in the form of a "core" file in /var/lib/openais/

Regards,

Bob Peterson
Red Hat Cluster Suite


From jprats at cesca.es  Tue Sep 25 14:11:35 2007
From: jprats at cesca.es (Jordi Prats)
Date: Tue, 25 Sep 2007 16:11:35 +0200
Subject: [Linux-cluster] problems starting cman
In-Reply-To: <1190728000.3038.45.camel@technetium.msp.redhat.com>
References: <46F8F211.3020309@cesca.es>
	<1190728000.3038.45.camel@technetium.msp.redhat.com>
Message-ID: <46F91717.3070905@cesca.es>

Hi,
Using tcpdump on one node I've started cman on the other one (failing 
again). It have captured one UDP packet:

[root at inf18 openais]# tcpdump -nni eth1 'host 239.192.52.96'
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eth1, link-type EN10MB (Ethernet), capture size 96 bytes
06:17:48.937509 IP 192.168.22.17.5149 > 239.192.52.96.5405: UDP, length 148

So should not be a multicast problem. Is not a routing problem because 
I'm just using eth1 and eth0 is disabled, so all traffic go across eth1.

On /var/lib/openais/ I found no core dump. What else could it be?

Regards,
Jordi


Bob Peterson wrote:
> On Tue, 2007-09-25 at 13:33 +0200, Jordi Prats wrote:
>   
>> Hi,
>> I'm trying to start a two node cluster, but I can't start cman. If I run 
>> manually the join command it returns 141. (I don't know what dos this 
>> means) How can I know why it do not  start? /var/log/messages does not 
>> give me any clue about it.
>>
>> Here I copy all the messages I get. (I'm using a Fedora 7 with the 
>> latest packeges)
>>
>> best regards,
>> Jordi
>>     
>
> Hi Jordi,
>
> Check to make sure you have multicast enabled and if it really works.
> We've seen some problems with some network switches not being set
> for this properly.  Unfortunately, it varies from switch to switch.
> Also, check to see if aisexec is failing and leaving a core dump.
> This would be in the form of a "core" file in /var/lib/openais/
>
> Regards,
>
> Bob Peterson
> Red Hat Cluster Suite
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>   


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From jos at xos.nl  Tue Sep 25 14:23:52 2007
From: jos at xos.nl (Jos Vos)
Date: Tue, 25 Sep 2007 16:23:52 +0200
Subject: [Linux-cluster] CS4 U5 / recommended quorumd values for a two
	nodes cluster
In-Reply-To: <1190727173.23919.43.camel@marc>
References: <46F8B1C5.7010007@bull.net> <1190709435.23919.11.camel@marc>
	<20070925124750.GA28897@jasmine.xos.nl>
	<1190727173.23919.43.camel@marc>
Message-ID: <20070925142352.GA29746@jasmine.xos.nl>

On Tue, Sep 25, 2007 at 03:32:52PM +0200, Marc - A. Dahlhaus [ Administration | Westermann GmbH ] wrote:

> I was talking about FAQ cman#24 (this is what you want, right? ).

I don't know, I was not the one starting this topic ;-).

> I stated the quorate math, sorry for that. It should be
> "qdisc votes = nodes - 1" and "cman expected votes = votes - 1".
> You could give <qdisc votes="2"> in combination with <cman
> expected_votes="3"> a try then.

That this (incrementing quorumd votes) is needed to have only one node
in a two-node cluster running, seems to be in contradiction with cman#18.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From steven.bakker at ams-ix.net  Tue Sep 25 16:55:06 2007
From: steven.bakker at ams-ix.net (Steven Bakker)
Date: Tue, 25 Sep 2007 18:55:06 +0200
Subject: [Linux-cluster] problems starting cman
In-Reply-To: <46F91717.3070905@cesca.es>
References: <46F8F211.3020309@cesca.es>
	<1190728000.3038.45.camel@technetium.msp.redhat.com>
	<46F91717.3070905@cesca.es>
Message-ID: <20070925185506.786e4cb4@cluestix.noc.ams-ix.net>

On Tue, 25 Sep 2007 16:11:35 +0200 Jordi Prats wrote:

> On /var/lib/openais/ I found no core dump.

If I'm not mistaken, processes on Fedora 7 run with "ulmit -c 0" by
default, so unless you explicitly specify otherwise, you'll never find
a core dump anywhere.

> What else could it be?

Ah, for that I have no answer. :-/

Cheers,
Steven


From fajar at telkom.net.id  Wed Sep 26 02:01:46 2007
From: fajar at telkom.net.id (Fajar A. Nugraha)
Date: Wed, 26 Sep 2007 09:01:46 +0700
Subject: [Linux-cluster] RHEL5 and Oracle 10g RAC
In-Reply-To: <46F8D73F.3090408@sys-admin.hu>
References: <46F8D73F.3090408@sys-admin.hu>
Message-ID: <46F9BD8A.2060205@telkom.net.id>

BERES Laszlo wrote:
> Hello there,
>
> well, I'm not an Oracle expert, but I'd like to know if is it possible
> to get working the above under RHEL5? 

You should use RHEL5 (or Oracle's Enterprise Linux) with Oracle's
built-in clusterware for RAC, not RedHat Cluster.

Regards,

Fajar


From Michael.Hagmann at hilti.com  Wed Sep 26 04:45:08 2007
From: Michael.Hagmann at hilti.com (Hagmann, Michael)
Date: Wed, 26 Sep 2007 06:45:08 +0200
Subject: [Linux-cluster] RHEL5 and Oracle 10g RAC
References: <46F8D73F.3090408@sys-admin.hu> <46F9BD8A.2060205@telkom.net.id>
Message-ID: <9C203D6FD2BF9D49BFF3450201DEDA530D1028@LI-OWL.hag.hilti.com>

Fajar

do you have any experience with RAC on RHEL5 ? We are heavily testing RAC on RHEL4, but we are thinking about RAC on RHEL5.

thx mike

Michael Hagmann
UNIX Systems Engineering
Enterprise Systems Technology

Hilti Corporation
9494 Schaan  Liechtenstein

Department FIBS
Feldkircherstrasse 100   P.O.Box 333
P +423-234 2467  F +423-234 6467
E michael.hagmann at hilti.com
www.hilti.com


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Fajar A. Nugraha
Sent: Wed 9/26/2007 04:01
To: linux clustering
Subject: Re: [Linux-cluster] RHEL5 and Oracle 10g RAC
 
BERES Laszlo wrote:
> Hello there,
>
> well, I'm not an Oracle expert, but I'd like to know if is it possible
> to get working the above under RHEL5? 

You should use RHEL5 (or Oracle's Enterprise Linux) with Oracle's
built-in clusterware for RAC, not RedHat Cluster.

Regards,

Fajar


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3085 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070926/a7e25e1a/attachment.bin>

From jprats at cesca.es  Wed Sep 26 06:49:48 2007
From: jprats at cesca.es (Jordi Prats)
Date: Wed, 26 Sep 2007 08:49:48 +0200
Subject: [Linux-cluster] problems starting cman
In-Reply-To: <20070925185506.786e4cb4@cluestix.noc.ams-ix.net>
References: <46F8F211.3020309@cesca.es>	<1190728000.3038.45.camel@technetium.msp.redhat.com>	<46F91717.3070905@cesca.es>
	<20070925185506.786e4cb4@cluestix.noc.ams-ix.net>
Message-ID: <46FA010C.5090503@cesca.es>

Hi,
I've set  the maximum size of core files to unlimited:

[root at inf17 lib]# ulimit -c
unlimited

But it does not appears anything on /var/lib/openais... Any idea?

Thanks!
Jordi


Steven Bakker wrote:
> On Tue, 25 Sep 2007 16:11:35 +0200 Jordi Prats wrote:
>
>   
>> On /var/lib/openais/ I found no core dump.
>>     
>
> If I'm not mistaken, processes on Fedora 7 run with "ulmit -c 0" by
> default, so unless you explicitly specify otherwise, you'll never find
> a core dump anywhere.
>
>   
>> What else could it be?
>>     
>
> Ah, for that I have no answer. :-/
>
> Cheers,
> Steven
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>   


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From lining at mail.ustc.edu.cn  Wed Sep 26 08:06:34 2007
From: lining at mail.ustc.edu.cn (lining at mail.ustc.edu.cn)
Date: Wed, 26 Sep 2007 16:06:34 +0800
Subject: [Linux-cluster] help:can't execute Add a failover Domain
Message-ID: <390793994.12818@ustc.edu.cn>

I have a cluster with two nodes ,it has started.
On the conga platform ,when I choose add a failover domain ,
it returned as followed:

Network station Error??
this network station occured an error when handle your request.
the error is :
error type:
    AttributeError
error value:
    getFdomNodes

Enclosed my cluster.conf:
<?xml version="1.0"?>
<cluster alias="tvod" config_version="28" name="tvod">
        <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="25"/>
        <clusternodes>
                <clusternode name="10.0.3.232" nodeid="1" votes="1"/>
                <clusternode name="10.0.0.8" nodeid="2" votes="1"/>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_vixel" ipaddr="10.0.0.8" name="fence"
passwd="123456"/>
        </fencedevices>
        <rm>
                <failoverdomains/>
                <resources>
                        <clusterfs device="/dev/mapper/vod-lv_vod"
force_unmount="1" fsid="Wf9iet-YGHm-dHM7-PSjW-CvVq-uOxJ-Y63hGd" fstype="gfs"
mountpoint="/1/" name="gfs" options="-t gfs"/>
                </resources>
                <service autostart="1" name="httpd">
                        <clusterfs ref="gfs"/>
                </service>
        </rm>
</cluster>

Thanks for helping !


From jobot at wmdata.com  Wed Sep 26 08:33:41 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Wed, 26 Sep 2007 10:33:41 +0200
Subject: [Linux-cluster] Found unlinked inode
Message-ID: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net>

Hi again,

After stress testing a gfs filesystem for 24 hours fsck.gfs complains about "Found unlinked inode".
This scared me so I reran the test again but got the same result. 

My test consists of two nodes running bonnie++, postgresql and pgbench against a single file system. Every five
minutes one of the nodes is shot.

The weird part is that on both occasions the thing fsck.gfs complained about was an "unlinked inode" corresponding to a postgresql pid file. This is a file created (and deleted) every time postgresql is failed over to another node. It is also the last file on the filesystem being deleted when postgresql was shutdown before the filesystem was umounted and fsck.gfs was run.

Can anybody explain why this pid file triggered this fsck error twice and not any of the thousands of files created and deleted by bonnie++?

Does this mean the filesystem is corrupt, or is this an expected behavior for files deleted directly before a filesystem is umounted?

BTW: I'm not able to reproduce this by simply mounting the filesystem, starting/stopping pgsql and umounting. I need to leave the test running over night. I've also performed some tests directly on the SAN device and as far as I can tell it's working as expected.

OS: RHEL5 Advanced platform

[root at test-db1 ~]# fsck.gfs /dev/testdb/pg_fs 
Initializing fsck
Clearing journals (this may take a while).
Journals cleared.
Starting pass1
Pass1 complete      
Starting pass1b
Pass1b complete      
Starting pass1c
Pass1c complete      
Starting pass2
Pass2 complete      
Starting pass3
Pass3 complete      
Starting pass4
Found unlinked inode at 2375183   <-- This is a postgresql pid file
Add unlinked inode to l+f? (y/n)y
l+f directory at 11411
Added inode #2375183 to l+f dir
Pass4 complete      
Starting pass5
Converting 1 unused metadata blocks to free data blocks...
Converting 277 unused metadata blocks to free data blocks...
...
...

Regards,
Jonas


From jos at xos.nl  Wed Sep 26 12:45:24 2007
From: jos at xos.nl (Jos Vos)
Date: Wed, 26 Sep 2007 14:45:24 +0200
Subject: [Linux-cluster] clurgmgrd doesn't work with quorum disk
In-Reply-To: <20070924172951.GX10872@redhat.com>
References: <200709221603.l8MG3uAv031551@jasmine.xos.nl>
	<20070924172055.GT10872@redhat.com>
	<20070924172741.GA21329@jasmine.xos.nl>
	<20070924172951.GX10872@redhat.com>
Message-ID: <20070926124524.GA8653@jasmine.xos.nl>

On Mon, Sep 24, 2007 at 01:29:51PM -0400, Lon Hohberger wrote:

> Certainly for testing, you can just replace rgmanager + cman + deps.
> 5.1 rgmanager requires 5.1 cman due to changes in the ccs daemon.

Note that these (runtime) dependencies are not enforced by the rpms,
not are there proper build-time requirements in the src.rpms.

Anyway, I updated cman, rgmanager, modcluster, openais and the kernel,
but now it fails on "Starting fencing" (after 5 minutes...) :-(.

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From Alain.Moulle at bull.net  Wed Sep 26 13:12:08 2007
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 26 Sep 2007 15:12:08 +0200
Subject: [Linux-cluster] Re: CS4 U5 / recommended quorumd values for a two
	nodes (contd.)
Message-ID: <46FA5AA8.4010205@bull.net>

Thanks Marc and Jos for your pieces of advice, but
it does not seems to work:

I tried your first suggestion with qdisk votes=2 and expected_votes=3:

        <quorumd interval="1" tko="10" votes="2" log_level="9"
log_facility="local4" status_file="/tmp/qdisk_status" label="CS4QUORUMDISK">
        </quorumd>
...
     <clusternode name="node0" votes="1">
...
     <clusternode name="node1" votes="1">
...
     <cman expected_votes="3" two_node="0"/>

and I can't start cman on only one node, it needs cman on second node to be
started, and I don't understand why ...

I tried (just to give it a try because I would have not understood if it had
worked!) also qdisk votes=2 and expected_votes=2, but same result...

Each times, in file "messages" , I see Cluster Inquorate :
Sep 26 15:04:40 s_sys at bali0 ccsd[12224]: Connected to cluster infrastruture via:
CMAN/SM Plugin v1.1.7.4
Sep 26 15:04:40 s_sys at bali0 ccsd[12224]: Initial status:: Inquorate

until the cman of second node is started and then :
Sep 26 15:07:01 s_sys at bali0 ccsd[12224]: Cluster is quorate.  Allowing connections.

I have read and read again the FAQ page, especially the # you mention, but
don't understand why it does not work for me ...
Except if my quorum disk is not working ?
But command mkqisk returns :
#mkqdisk -L
mkqdisk v0.5.1
/dev/sdk:
        Magic:   eb7a62c2
        Label:   CS4QUORUMDISK
        Created: Tue Sep 18 16:33:40 2007
        Host:    node
but is it sufficient to know if Quorum disk is working correctly ?

Any new clue or suggestion is welcome.
Thanks
Regards
Alain


From teigland at redhat.com  Wed Sep 26 13:41:27 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 26 Sep 2007 08:41:27 -0500
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net>
Message-ID: <20070926134127.GD15033@redhat.com>

On Wed, Sep 26, 2007 at 10:33:41AM +0200, Borgstr?m Jonas wrote:
> Hi again,
> 
> After stress testing a gfs filesystem for 24 hours fsck.gfs complains
> about "Found unlinked inode".  This scared me so I reran the test again
> but got the same result. 
> 
> My test consists of two nodes running bonnie++, postgresql and pgbench
> against a single file system. Every five minutes one of the nodes is
> shot.
> 
> The weird part is that on both occasions the thing fsck.gfs complained
> about was an "unlinked inode" corresponding to a postgresql pid file.
> This is a file created (and deleted) every time postgresql is failed
> over to another node. It is also the last file on the filesystem being
> deleted when postgresql was shutdown before the filesystem was umounted
> and fsck.gfs was run.
> 
> Can anybody explain why this pid file triggered this fsck error twice
> and not any of the thousands of files created and deleted by bonnie++?
> 
> Does this mean the filesystem is corrupt, or is this an expected
> behavior for files deleted directly before a filesystem is umounted?

I don't think fsck recovers the journal, and journal recovery would most
likely fix this up.  Do you see this if you do a mount/unmount before
running fsck?

Dave


From orkcu at yahoo.com  Wed Sep 26 13:51:38 2007
From: orkcu at yahoo.com (=?iso-8859-1?Q?Roger_Pe=F1a?=)
Date: Wed, 26 Sep 2007 06:51:38 -0700 (PDT)
Subject: [Linux-cluster] Re: CS4 U5 / recommended quorumd values for a two
	nodes (contd.)
In-Reply-To: <46FA5AA8.4010205@bull.net>
Message-ID: <619444.97073.qm@web50607.mail.re2.yahoo.com>


--- Alain Moulle <Alain.Moulle at bull.net> wrote:

> Thanks Marc and Jos for your pieces of advice, but
> it does not seems to work:
> 
> I tried your first suggestion with qdisk votes=2 and
> expected_votes=3:
> 
>         <quorumd interval="1" tko="10" votes="2"
> log_level="9"
> log_facility="local4"
> status_file="/tmp/qdisk_status"
> label="CS4QUORUMDISK">
>         </quorumd>
> ...
>      <clusternode name="node0" votes="1">
> ...
>      <clusternode name="node1" votes="1">
> ...
>      <cman expected_votes="3" two_node="0"/>
> 
> and I can't start cman on only one node, it needs
> cman on second node to be
> started, and I don't understand why ...
> 
> I tried (just to give it a try because I would have
> not understood if it had
> worked!) also qdisk votes=2 and expected_votes=2,
> but same result...
> 
> Each times, in file "messages" , I see Cluster
> Inquorate :
> Sep 26 15:04:40 s_sys at bali0 ccsd[12224]: Connected
> to cluster infrastruture via:
> CMAN/SM Plugin v1.1.7.4
> Sep 26 15:04:40 s_sys at bali0 ccsd[12224]: Initial
> status:: Inquorate
> 
> until the cman of second node is started and then :
> Sep 26 15:07:01 s_sys at bali0 ccsd[12224]: Cluster is
> quorate.  Allowing connections.
> 
> I have read and read again the FAQ page, especially
> the # you mention, but
> don't understand why it does not work for me ...
> Except if my quorum disk is not working ?
> But command mkqisk returns :
> #mkqdisk -L
> mkqdisk v0.5.1
> /dev/sdk:
>         Magic:   eb7a62c2
>         Label:   CS4QUORUMDISK
>         Created: Tue Sep 18 16:33:40 2007
>         Host:    node
> but is it sufficient to know if Quorum disk is
> working correctly ?
> 
> Any new clue or suggestion is welcome.

when the cluster is properly quorate, find out how
many vote do you have:
clustat

if you have only 2 votes then qdisk is not working

cu
roger

__________________________________________
RedHat Certified ( RHCE )
Cisco Certified ( CCNA & CCDA )


____________________________________________________________________________________
Yahoo! oneSearch: Finally, mobile search 
that gives answers, not web links. 
http://mobile.yahoo.com/mobileweb/onesearch?refer=1ONXIC


From rpeterso at redhat.com  Wed Sep 26 14:01:08 2007
From: rpeterso at redhat.com (Bob Peterson)
Date: Wed, 26 Sep 2007 09:01:08 -0500
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net>
Message-ID: <1190815268.16010.29.camel@technetium.msp.redhat.com>

On Wed, 2007-09-26 at 10:33 +0200, Borgstr?m Jonas wrote:
> Hi again,
> 
> After stress testing a gfs filesystem for 24 hours fsck.gfs complains about "Found unlinked inode".
> This scared me so I reran the test again but got the same result. 
> 
> My test consists of two nodes running bonnie++, postgresql and pgbench against a single file system. Every five
> minutes one of the nodes is shot.
> 
> The weird part is that on both occasions the thing fsck.gfs complained about was an "unlinked inode" corresponding to a postgresql pid file. This is a file created (and deleted) every time postgresql is failed over to another node. It is also the last file on the filesystem being deleted when postgresql was shutdown before the filesystem was umounted and fsck.gfs was run.
> 
> Can anybody explain why this pid file triggered this fsck error twice and not any of the thousands of files created and deleted by bonnie++?
> 
> Does this mean the filesystem is corrupt, or is this an expected behavior for files deleted directly before a filesystem is umounted?
> 
> BTW: I'm not able to reproduce this by simply mounting the filesystem, starting/stopping pgsql and umounting. I need to leave the test running over night. I've also performed some tests directly on the SAN device and as far as I can tell it's working as expected.
> 
> OS: RHEL5 Advanced platform

Hi Jonas,

Well, I can think of one possible explanation.  I can't be sure because
I don't know your test scenario, but this is my theory.  First, a bit
of background:

When a node gets "shot" as you say, the metadata for some of its
recent operations are likely to still be in the journal it was using.
Depending on the circumstances of when it gets shot, that data may
exist only in the journal if it got "shot" before the data was
written to its final destination on disk.

Ordinarily, that's not a big deal because the next time the file
system is mounted, the journal is replayed and that causes the metadata
to be written correctly to its proper place on disk and all is well.
That's the same for most journaling file systems afaik.

A couple years ago, one of my predecessors (before I started) made an
executive decision to make gfs_fsck *clear* the system journals rather
than replay the journals.  I don't know offhand if the code was once
there and got taken out or if it was never written.  At any rate, it
seemed like a good idea at the time and there were several good
reasons to justify that decision:

First, if the user is running gfs_fsck, they must already suspect
file system corruption.  If (and this is a big if) that corruption was
caused by recent operations to the file system, then replaying the
journal can only serve to compound the corruption and cause more
corruption.  That's because what is in the journal may also be based on
the corruption.  This was more of a concern if, for some reason,
GFS bailed out and "withdrew" from the file system because it detected
corruption, suspecting that it must have somehow caused that corruption.

Second, if the user is running gfs_fsck because of corruption, we may
not be able to assume that the journal is good metadata, worthy of
being replayed.

Third, the user always has the option of replaying the journal before
doing gfs_fsck:

1. mount the file system after the crash (to replay the journal)
2. unmount the file system
3. run gfs_fsck

The decision to have gfs_fsck clear the journals was probably made
many years ago, before gfs was stable, and these "withdraw" situations
were more common.

Some people believe that this was a bad decision.  I believe
that it makes more sense to trust the journal and replay it before
doing the rest of the fsck operations because in "normal" cases where
a node dies (often for some reason unrelated to gfs, like getting
shot, fenced, losing power, blowing up a power supply, etc.) you have
the potential to lose metadata unless the journal is replayed.

Other journaling file systems replay their journals during fsck
or else they inform the user, ask them to take steps to replay
the journal (as above), or give them the option to clear them, etc.
So far, gfs_fsck does not do that.  It just clears the journals.

To remedy the situation, I've got an open bugzilla 291551 (which
may be marked "private" because it was opened internally--sorry)
at least in the gfs2_fsck case.  (gfs_fsck will likely be done too).
With that bugzilla, I intend to somehow remedy the situation.
Either I'll ask the user if they want the journal replayed or else
I'll replay them automatically, or try to detect problems with them.

I'm not certain that this is the cause of your corruption, but it's
the only one I can think of at the moment.

Regards,

Bob Peterson
Red Hat Cluster Suite


From jobot at wmdata.com  Wed Sep 26 14:49:41 2007
From: jobot at wmdata.com (=?utf-8?B?Qm9yZ3N0csO2bSBKb25hcw==?=)
Date: Wed, 26 Sep 2007 16:49:41 +0200
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <1190815268.16010.29.camel@technetium.msp.redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net>
	<1190815268.16010.29.camel@technetium.msp.redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>

From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bob Peterson
Sent: den 26 september 2007 16:01
To: linux clustering
Subject: Re: [Linux-cluster] Found unlinked inode

> Hi Jonas,
>
> Well, I can think of one possible explanation.  I can't be sure because
> I don't know your test scenario, but this is my theory.  First, a bit
> of background:
>
> When a node gets "shot" as you say, the metadata for some of its
> recent operations are likely to still be in the journal it was using.
> Depending on the circumstances of when it gets shot, that data may
> exist only in the journal if it got "shot" before the data was
> written to its final destination on disk.
>
> Ordinarily, that's not a big deal because the next time the file
> system is mounted, the journal is replayed and that causes the metadata
> to be written correctly to its proper place on disk and all is well.
> That's the same for most journaling file systems afaik.
>
> A couple years ago, one of my predecessors (before I started) made an
> executive decision to make gfs_fsck *clear* the system journals rather
> than replay the journals.  I don't know offhand if the code was once
> there and got taken out or if it was never written.  At any rate, it
> seemed like a good idea at the time and there were several good
> reasons to justify that decision:
>
> First, if the user is running gfs_fsck, they must already suspect
> file system corruption.  If (and this is a big if) that corruption was
> caused by recent operations to the file system, then replaying the
> journal can only serve to compound the corruption and cause more
> corruption.  That's because what is in the journal may also be based on
> the corruption.  This was more of a concern if, for some reason,
> GFS bailed out and "withdrew" from the file system because it detected
> corruption, suspecting that it must have somehow caused that corruption.
>
> Second, if the user is running gfs_fsck because of corruption, we may
> not be able to assume that the journal is good metadata, worthy of
> being replayed.
>
> Third, the user always has the option of replaying the journal before
> doing gfs_fsck:
>
> 1. mount the file system after the crash (to replay the journal)
> 2. unmount the file system
> 3. run gfs_fsck
>
> The decision to have gfs_fsck clear the journals was probably made
> many years ago, before gfs was stable, and these "withdraw" situations
> were more common.
>
> Some people believe that this was a bad decision.  I believe
> that it makes more sense to trust the journal and replay it before
> doing the rest of the fsck operations because in "normal" cases where
> a node dies (often for some reason unrelated to gfs, like getting
> shot, fenced, losing power, blowing up a power supply, etc.) you have
> the potential to lose metadata unless the journal is replayed.
>
> Other journaling file systems replay their journals during fsck
> or else they inform the user, ask them to take steps to replay
> the journal (as above), or give them the option to clear them, etc.
> So far, gfs_fsck does not do that.  It just clears the journals.
>
> To remedy the situation, I've got an open bugzilla 291551 (which
> may be marked "private" because it was opened internally--sorry)
> at least in the gfs2_fsck case.  (gfs_fsck will likely be done too).
> With that bugzilla, I intend to somehow remedy the situation.
> Either I'll ask the user if they want the journal replayed or else
> I'll replay them automatically, or try to detect problems with them.
>
> I'm not certain that this is the cause of your corruption, but it's
> the only one I can think of at the moment.
>
Hi Bob,

This sounds like a reasonable explanation except for one thing, the filesystem was cleanly umounted on both nodes before I ran gfs_fsck. So there shouldn't be any journal to replay, right?

Anyway, I've restarted the test and if I'm able to recreate this error I'll first take a copy of the filesystem and then check if running "mount + umount" makes this gfs_fsck error go away.

Regards,
Jonas
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From jobot at wmdata.com  Wed Sep 26 15:40:59 2007
From: jobot at wmdata.com (=?utf-8?B?Qm9yZ3N0csO2bSBKb25hcw==?=)
Date: Wed, 26 Sep 2007 17:40:59 +0200
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D021D699F@WMRI000167.corp.wmdata.net><1190815268.16010.29.camel@technetium.msp.redhat.com>
	<CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>
Message-ID: <CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>

Hi again,

I was just able to reproduce the filesystem corruption again. This time four lost zero-sized inodes were found :(
And unfortunately mounting+umounting the filesystem didn't make the lost inodes go away. 
I still have a copy of the corrupted filesystem if there is any more things you want me to test.

Here's the gfs_fsck output:

[root at test-db2 ~]# gfs_fsck -n /dev/testdb/pg_fs 
Initializing fsck
Starting pass1
Pass1 complete      
Starting pass1b
Pass1b complete      
Starting pass1c
Pass1c complete      
Starting pass2
Pass2 complete      
Starting pass3
Pass3 complete      
Starting pass4
Found unlinked inode at 1706623
Unlinked inode has zero size
Unlinked inode left unlinked
Found unlinked inode at 1706620
Unlinked inode left unlinked
Found unlinked inode at 1706622
Unlinked inode has zero size
Unlinked inode left unlinked
Found unlinked inode at 1706621
Unlinked inode has zero size
Unlinked inode left unlinked
Pass4 complete      
Starting pass5
Converting 8457 unused metadata blocks to free data blocks...
Converting 61490 unused metadata blocks to free data blocks...
...
...

Regards,
Jonas

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Borgstr?m Jonas
Sent: den 26 september 2007 16:50
To: rpeterso at redhat.com; linux clustering
Subject: RE: [Linux-cluster] Found unlinked inode

From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bob Peterson
Sent: den 26 september 2007 16:01
To: linux clustering
Subject: Re: [Linux-cluster] Found unlinked inode

> Hi Jonas,
>
> Well, I can think of one possible explanation.  I can't be sure because
> I don't know your test scenario, but this is my theory.  First, a bit
> of background:
>
> When a node gets "shot" as you say, the metadata for some of its
> recent operations are likely to still be in the journal it was using.
> Depending on the circumstances of when it gets shot, that data may
> exist only in the journal if it got "shot" before the data was
> written to its final destination on disk.
>
> Ordinarily, that's not a big deal because the next time the file
> system is mounted, the journal is replayed and that causes the metadata
> to be written correctly to its proper place on disk and all is well.
> That's the same for most journaling file systems afaik.
>
> A couple years ago, one of my predecessors (before I started) made an
> executive decision to make gfs_fsck *clear* the system journals rather
> than replay the journals.  I don't know offhand if the code was once
> there and got taken out or if it was never written.  At any rate, it
> seemed like a good idea at the time and there were several good
> reasons to justify that decision:
>
> First, if the user is running gfs_fsck, they must already suspect
> file system corruption.  If (and this is a big if) that corruption was
> caused by recent operations to the file system, then replaying the
> journal can only serve to compound the corruption and cause more
> corruption.  That's because what is in the journal may also be based on
> the corruption.  This was more of a concern if, for some reason,
> GFS bailed out and "withdrew" from the file system because it detected
> corruption, suspecting that it must have somehow caused that corruption.
>
> Second, if the user is running gfs_fsck because of corruption, we may
> not be able to assume that the journal is good metadata, worthy of
> being replayed.
>
> Third, the user always has the option of replaying the journal before
> doing gfs_fsck:
>
> 1. mount the file system after the crash (to replay the journal)
> 2. unmount the file system
> 3. run gfs_fsck
>
> The decision to have gfs_fsck clear the journals was probably made
> many years ago, before gfs was stable, and these "withdraw" situations
> were more common.
>
> Some people believe that this was a bad decision.  I believe
> that it makes more sense to trust the journal and replay it before
> doing the rest of the fsck operations because in "normal" cases where
> a node dies (often for some reason unrelated to gfs, like getting
> shot, fenced, losing power, blowing up a power supply, etc.) you have
> the potential to lose metadata unless the journal is replayed.
>
> Other journaling file systems replay their journals during fsck
> or else they inform the user, ask them to take steps to replay
> the journal (as above), or give them the option to clear them, etc.
> So far, gfs_fsck does not do that.  It just clears the journals.
>
> To remedy the situation, I've got an open bugzilla 291551 (which
> may be marked "private" because it was opened internally--sorry)
> at least in the gfs2_fsck case.  (gfs_fsck will likely be done too).
> With that bugzilla, I intend to somehow remedy the situation.
> Either I'll ask the user if they want the journal replayed or else
> I'll replay them automatically, or try to detect problems with them.
>
> I'm not certain that this is the cause of your corruption, but it's
> the only one I can think of at the moment.
>
Hi Bob,

This sounds like a reasonable explanation except for one thing, the filesystem was cleanly umounted on both nodes before I ran gfs_fsck. So there shouldn't be any journal to replay, right?

Anyway, I've restarted the test and if I'm able to recreate this error I'll first take a copy of the filesystem and then check if running "mount + umount" makes this gfs_fsck error go away.

Regards,
Jonas
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From teigland at redhat.com  Wed Sep 26 15:52:15 2007
From: teigland at redhat.com (David Teigland)
Date: Wed, 26 Sep 2007 10:52:15 -0500
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>
Message-ID: <20070926155215.GE15033@redhat.com>

On Wed, Sep 26, 2007 at 05:40:59PM +0200, Borgstr??m Jonas wrote:
> Hi again,
> 
> I was just able to reproduce the filesystem corruption again. This time
> four lost zero-sized inodes were found :( And unfortunately
> mounting+umounting the filesystem didn't make the lost inodes go away.
> I still have a copy of the corrupted filesystem if there is any more
> things you want me to test.

I still think this is probably expected and cleaned up properly by gfs.
When you mount you should see something like this:

GFS: fsid=bull:x.2: jid=2: Trying to acquire journal lock...
GFS: fsid=bull:x.2: jid=2: Looking at journal...
GFS: fsid=bull:x.2: jid=2: Done
GFS: fsid=bull:x.2: Scanning for log elements...
GFS: fsid=bull:x.2: Found 48 unlinked inodes
GFS: fsid=bull:x.2: Found quota changes for 0 IDs
GFS: fsid=bull:x.2: Done

It suspect the unlinked inodes found by fsck are the same as those gfs
finds in the journal when mounting.  Note that a cleanly shut down journal
may still have records of unlinked inodes that need to be deallocated.

You may need to mount/unmount all of the journals (so all journals are
replayed).  something like

mount -t gfs /dev/foo /gfs -o lockproto=lock_nolock hostdata=jid=0
umount /gfs
mount -t gfs /dev/foo /gfs -o lockproto=lock_nolock hostdata=jid=1
umount /gfs
mount -t gfs /dev/foo /gfs -o lockproto=lock_nolock hostdata=jid=2
umount /gfs
etc

and then gfs may clean them up (deallocate) asynchronously after the mount
has completed, I'm not sure.

Dave


From wcheng at redhat.com  Wed Sep 26 16:37:02 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Wed, 26 Sep 2007 12:37:02 -0400
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <20070926155215.GE15033@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>	<CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>
	<20070926155215.GE15033@redhat.com>
Message-ID: <46FA8AAE.9000606@redhat.com>

David Teigland wrote:

>On Wed, Sep 26, 2007 at 05:40:59PM +0200, Borgstr??m Jonas wrote:
>  
>
>>Hi again,
>>
>>I was just able to reproduce the filesystem corruption again. This time
>>four lost zero-sized inodes were found :( And unfortunately
>>mounting+umounting the filesystem didn't make the lost inodes go away.
>>I still have a copy of the corrupted filesystem if there is any more
>>things you want me to test.
>>    
>>
>
>I still think this is probably expected and cleaned up properly by gfs.
>When you mount you should see something like this:
>
>GFS: fsid=bull:x.2: jid=2: Trying to acquire journal lock...
>GFS: fsid=bull:x.2: jid=2: Looking at journal...
>GFS: fsid=bull:x.2: jid=2: Done
>GFS: fsid=bull:x.2: Scanning for log elements...
>GFS: fsid=bull:x.2: Found 48 unlinked inodes
>GFS: fsid=bull:x.2: Found quota changes for 0 IDs
>GFS: fsid=bull:x.2: Done
>
>  
>

Just read this mail - not sure the running kernel version where this 
problem occurs . However, GFS1's unlinked inodes are linked into a list 
that are cleaned up by gfs_inoded (a daemon). So there is a time gap 
between the file is deleted and its on-disk inode is actually removed. 
There are two possibilities for this problem to occur:

1. An unclean shutdown where this linked list is not completely walked 
thru and cleaned up.
2. Possible bugs in RHEL5 based kernels (where gfs umount logic may 
accidentally overlook this clean-up logic).

In any case, I don't view this as a filesystem corruption - but the 
unlinked inodes would take some extra disk space that can only be 
cleaned up by fsck (and/or journal replay if the journal stil has the 
file remove transaction).

-- Wendy


From thorsten.henrici at gfd.de  Wed Sep 26 16:50:29 2007
From: thorsten.henrici at gfd.de (thorsten.henrici at gfd.de)
Date: Wed, 26 Sep 2007 18:50:29 +0200
Subject: [Linux-cluster] rgmanager and qdisk in RHEL5 "behavior problems"
Message-ID: <OFCE45FADA.AB5EC862-ONC1257362.005B11EA-C1257362.005C847B@obi.de>


Hello List,
has this fix

http://www.redhat.com/archives/cluster-devel/2007-April/msg00064.html

Rgmanager thinks qdisk is a node (with node ID 0), so it tries to send
VF information to node 0 - which doesn't exist, causing rgmanger to not
work when qdisk is running :(

already been included in one of the RHEL5 errata RPMs, or will it be in
the future?

We seem to have exactly this problem here. cman and qdiskd as such run
fine, but rgmanager won't start.

Syslog provides the following message: <err> #34: Cannot get status for
service service: ...


Mit freundlichen Gr??en / With best regards

Mag. Thorsten Henrici
____________________________________________________________________________________________________

Kontakt:


GfD Gesellschaft f?r Datenverarbeitung mbH
IT-Kommunikation / Rechenzentrum/Appl.-Betrieb

Industriestr.10
D-42929 Wermelskirchen
Deutschland / Germany
phone:
+49-2196-76-1857
fax:
+49-2196-76-1932
e-mail:
thorsten.henrici at gfd.de


Besuchen Sie uns auch im Internet unter www.gfd.de

Amtsgericht K?ln HR B 36326

Gesch?ftsf?hrer:
Sergio Giroldi, Jens Siebenhaar,
Joachim Kr?ger, Matthias B?cker

Bankverbindung:
Deutsche Bank, Remscheid
BLZ 340 700 93, Konto-Nr. 5835921

UST-Idnr.: DE 811 254 705


--
IMPORTANT NOTICE:
This email is confidential, may be legally privileged, and is for the
intended recipient only. Access, disclosure, copying, distribution, or
reliance on any of it by anyone else is prohibited and may be a criminal
offence. Please delete if obtained in error and email confirmation to the sender.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070926/a2d16fa7/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/gif
Size: 2586 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070926/a2d16fa7/attachment.gif>

From jos at xos.nl  Wed Sep 26 17:00:15 2007
From: jos at xos.nl (Jos Vos)
Date: Wed, 26 Sep 2007 19:00:15 +0200
Subject: [Linux-cluster] rgmanager and qdisk in RHEL5 "behavior problems"
In-Reply-To: <OFCE45FADA.AB5EC862-ONC1257362.005B11EA-C1257362.005C847B@obi.de>
References: <OFCE45FADA.AB5EC862-ONC1257362.005B11EA-C1257362.005C847B@obi.de>
Message-ID: <20070926170015.GA11096@jasmine.xos.nl>

On Wed, Sep 26, 2007 at 06:50:29PM +0200, thorsten.henrici at gfd.de wrote:

> http://www.redhat.com/archives/cluster-devel/2007-April/msg00064.html
> 
> Rgmanager thinks qdisk is a node (with node ID 0), so it tries to send
> VF information to node 0 - which doesn't exist, causing rgmanger to not
> work when qdisk is running :(
> 
> already been included in one of the RHEL5 errata RPMs, or will it be in
> the future?

According to RH (see answer to my mail about clurgmgrd a from a few days
ago) this should be in RHEL 5.1 beta, but after upgrading the cluster
packages and the kernel on a test system, I got other problems (fencing
won't start anymore) and as I'm not sure what's causing this (partly
incompatible software or my config), I think I postpone further tests
using quorum disks on RHEL5 till 5.1 final is out...

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From wcheng at redhat.com  Wed Sep 26 17:36:49 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Wed, 26 Sep 2007 13:36:49 -0400
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <46FA8AAE.9000606@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>	<CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>	<20070926155215.GE15033@redhat.com>
	<46FA8AAE.9000606@redhat.com>
Message-ID: <46FA98B1.9070501@redhat.com>

Wendy Cheng wrote:

>
> Just read this mail - not sure the running kernel version where this 
> problem occurs . However, GFS1's unlinked inodes are linked into a 
> list that are cleaned up by gfs_inoded (a daemon). So there is a time 
> gap between the file is deleted and its on-disk inode is actually 
> removed. There are two possibilities for this problem to occur:
>
> 1. An unclean shutdown where this linked list is not completely walked 
> thru and cleaned up.
> 2. Possible bugs in RHEL5 based kernels (where gfs umount logic may 
> accidentally overlook this clean-up logic).
>
> In any case, I don't view this as a filesystem corruption - but the 
> unlinked inodes would take some extra disk space that can only be 
> cleaned up by fsck (and/or journal replay if the journal stil has the 
> file remove transaction).
>

Got some private emails in my folder... look like I didn't explain 
things well... An unlinked inode is an inode that has been removed from 
directory (so you would not be able to access it from operating system's 
lookup routine). If you (as an application) need to access a file with 
the same name, you would have to (re)create it (then gfs1 will assign a 
new inode for that new file).

For this issue, the unlinked inode doesn't affect the correctness of the 
filesystem operations. You can still safely use the filesystem, except 
these orphan inodes will take extra disk space.

-- Wendy


From jobot at wmdata.com  Wed Sep 26 17:31:52 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Wed, 26 Sep 2007 19:31:52 +0200
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <20070926155215.GE15033@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>
	<20070926155215.GE15033@redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D021D6CE1@WMRI000167.corp.wmdata.net>

-----Original Message-----
From: David Teigland [mailto:teigland at redhat.com] 
Sent: den 26 september 2007 17:52
To: Borgstr?m Jonas
Cc: linux-cluster at redhat.com
Subject: Re: [Linux-cluster] Found unlinked inode

> On Wed, Sep 26, 2007 at 05:40:59PM +0200, Borgstr??m Jonas wrote:
> > Hi again,
> > 
> > I was just able to reproduce the filesystem corruption again. This time
> > four lost zero-sized inodes were found :( And unfortunately
> > mounting+umounting the filesystem didn't make the lost inodes go away.
> > I still have a copy of the corrupted filesystem if there is any more
> > things you want me to test.
>
> I still think this is probably expected and cleaned up properly by gfs.
> When you mount you should see something like this:
>
> GFS: fsid=bull:x.2: jid=2: Trying to acquire journal lock...
> GFS: fsid=bull:x.2: jid=2: Looking at journal...
> GFS: fsid=bull:x.2: jid=2: Done
> GFS: fsid=bull:x.2: Scanning for log elements...
> GFS: fsid=bull:x.2: Found 48 unlinked inodes
> GFS: fsid=bull:x.2: Found quota changes for 0 IDs
> GFS: fsid=bull:x.2: Done
>
> It suspect the unlinked inodes found by fsck are the same as those gfs
> finds in the journal when mounting.  Note that a cleanly shut down journal
> may still have records of unlinked inodes that need to be deallocated.
>

This is what I get when mounting the filesystem:

Trying to join cluster "lock_dlm", "test-db:pg_fs"
Joined cluster. Now mounting FS...
GFS: fsid=test-db:pg_fs.0: jid=0: Trying to acquire journal lock...
GFS: fsid=test-db:pg_fs.0: jid=0: Looking at journal...
GFS: fsid=test-db:pg_fs.0: jid=0: Done
GFS: fsid=test-db:pg_fs.0: jid=1: Trying to acquire journal lock...
GFS: fsid=test-db:pg_fs.0: jid=1: Looking at journal...
GFS: fsid=test-db:pg_fs.0: jid=1: Done

> You may need to mount/unmount all of the journals (so all journals are
> replayed).  something like
>
It doesn't appear to make any difference:

mount /dev/testdb/pg_fs /var/lib/pgsql -o lockproto=lock_nolock,hostdata=jid=0

Trying to join cluster "lock_nolock", "test-db:pg_fs"
Lock_Nolock (built Jul 10 2007 06:40:22) installed
Joined cluster. Now mounting FS...
GFS: fsid=test-db:pg_fs.0: jid=0: Trying to acquire journal lock...
GFS: fsid=test-db:pg_fs.0: jid=0: Looking at journal...
GFS: fsid=test-db:pg_fs.0: jid=0: Done
GFS: fsid=test-db:pg_fs.0: jid=1: Trying to acquire journal lock...
GFS: fsid=test-db:pg_fs.0: jid=1: Looking at journal...
GFS: fsid=test-db:pg_fs.0: jid=1: Done

mount /dev/testdb/pg_fs /var/lib/pgsql -o lockproto=lock_nolock,hostdata=jid=1

Trying to join cluster "lock_nolock", "test-db:pg_fs"
Joined cluster. Now mounting FS...
GFS: fsid=test-db:pg_fs.0: jid=0: Trying to acquire journal lock...
GFS: fsid=test-db:pg_fs.0: jid=0: Looking at journal...
GFS: fsid=test-db:pg_fs.0: jid=0: Done
GFS: fsid=test-db:pg_fs.0: jid=1: Trying to acquire journal lock...
GFS: fsid=test-db:pg_fs.0: jid=1: Looking at journal...
GFS: fsid=test-db:pg_fs.0: jid=1: Done

And running gfs_fsck at this points yields the same errors as before.

Regards,
Jonas


From jdozarchuk at babcock.com  Wed Sep 26 17:40:43 2007
From: jdozarchuk at babcock.com (Ozarchuk, John D)
Date: Wed, 26 Sep 2007 13:40:43 -0400
Subject: [Linux-cluster] Cannot join two nodes together
Message-ID: <EE9F27C20409C44AA031C769B3DBCE410298BD2E@barbpo3.bwes.net>

I have two nodes on the same subnet, can ping each other, are both
alive, both are members of a two-node cluster.  When I start cman on
both nodes at the same time it says "X not a cluster member after 60 sec
post_join_delay".  The output of clustat shows that the other node is
"Offline" and the first node is "Online, local".  The nodes are fencing
each other and powering each other off.  

 
Please help determine why I cannot get these nodes to join. Below is
some information from my systems. RedHat Support is not getting
anywhere.  

 
Thanks

 
----------------------------------

 
Node1: bplmft11

Node2: bplmft12

 
uname -a -> Linux bplmft11 2.6.18-8.1.10.el5 #1 SMP Thu Aug 30 20:43:28
EDT 2007 x86_64 x86_64 x86_64 GNU/Linux

 
[root at bplmft11 ~]# clustat

msg_open: No such file or directory

Member Status: Quorate

 
  Member Name                        ID   Status

  ------ ----                        ---- ------

  bplmft12                              1 Offline

  bplmft11                              2 Online, Local

 
/etc/cluster/cluster.conf file (with the fencing levels removed):

 
<?xml version="1.0" ?>

<cluster alias="plm_test" config_version="16" name="plm_test">

        <fence_daemon post_fail_delay="0" post_join_delay="60"/>

        <clusternodes>

                <clusternode name="bplmft12" nodeid="1" votes="1">

                        <fence>

                                <method name="1"/>

                        </fence>

                </clusternode>

                <clusternode name="bplmft11" nodeid="2" votes="1">

                        <fence>

                                <method name="1"/>

                        </fence>

                </clusternode>

        </clusternodes>

        <cman expected_votes="1" two_node="1"/>

        <fencedevices>

                <fencedevice agent="fence_ilo" hostname="ilo-bplmft12"
login="redhat_cluster_user" name="ilo-bplmft12" passwd="PASSWORD"/>

                <fencedevice agent="fence_ilo" hostname="ilo-bplmft11"
login="redhat_cluster_user" name="ilo-bplmft11" passwd="PASSWORD"/>

        </fencedevices>

        <rm>

                <failoverdomains/>

                <resources/>

        </rm>

</cluster>

 
/var/log/messages after doing a "service cman start" on both nodes:

 
ep 26 13:33:04 bplmft11 ccsd[31407]: Cluster is not quorate.  Refusing
connection.

Sep 26 13:33:04 bplmft11 ccsd[31407]: Error while processing connect:
Connection refused

Sep 26 13:33:04 bplmft11 ccsd[31407]: Initial status:: Quorate

Sep 26 13:33:10 bplmft11 snmpd[2616]: Connection from UDP:
[127.0.0.1]:32771

Sep 26 13:33:10 bplmft11 snmpd[2616]: Received SNMP packet(s) from UDP:
[127.0.0.1]:32771

Sep 26 13:33:25 bplmft11 snmpd[2616]: Connection from UDP:
[127.0.0.1]:32771

Sep 26 13:33:55 bplmft11 last message repeated 2 times

Sep 26 13:34:06 bplmft11 fenced[31436]: bplmft12 not a cluster member
after 60 sec post_join_delay

Sep 26 13:34:06 bplmft11 fenced[31436]: fencing node "bplmft12"

Sep 26 13:34:06 bplmft11 fenced[31436]: fence "bplmft12" failed


-----------------------------------------
This message is intended only for the individual or entity to which
it is addressed and contains information that is proprietary to The
Babcock & Wilcox Company and/or its affiliates, or may be otherwise
confidential.  If the reader of this message is not the intended
recipient, or the employee agent responsible for delivering the
message to the intended recipient, you are hereby notified that any
dissemination, distribution or copying of this communication is
strictly prohibited.  If you have received this communication in
error, please notify the sender immediately by return e-mail and
delete this message from your computer.  Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070926/10d752b7/attachment.htm>

From jobot at wmdata.com  Wed Sep 26 17:45:52 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Wed, 26 Sep 2007 19:45:52 +0200
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <46FA98B1.9070501@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>	<CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>	<20070926155215.GE15033@redhat.com><46FA8AAE.9000606@redhat.com>
	<46FA98B1.9070501@redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D021D6CE3@WMRI000167.corp.wmdata.net>

From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Wendy Cheng
Sent: den 26 september 2007 19:37
To: linux clustering
Subject: Re: [Linux-cluster] Found unlinked inode
> 
> Wendy Cheng wrote:
>
> >
> > Just read this mail - not sure the running kernel version where this 
> > problem occurs . However, GFS1's unlinked inodes are linked into a 
> > list that are cleaned up by gfs_inoded (a daemon). So there is a time 
> > gap between the file is deleted and its on-disk inode is actually 
> > removed. There are two possibilities for this problem to occur:
> >
> > 1. An unclean shutdown where this linked list is not completely walked 
> > thru and cleaned up.
> > 2. Possible bugs in RHEL5 based kernels (where gfs umount logic may 
> > accidentally overlook this clean-up logic).
> >
> > In any case, I don't view this as a filesystem corruption - but the 
> > unlinked inodes would take some extra disk space that can only be 
> > cleaned up by fsck (and/or journal replay if the journal stil has the 
> > file remove transaction).
> >
>
> Got some private emails in my folder... look like I didn't explain 
> things well... An unlinked inode is an inode that has been removed from 
> directory (so you would not be able to access it from operating system's 
> lookup routine). If you (as an application) need to access a file with 
> the same name, you would have to (re)create it (then gfs1 will assign a 
> new inode for that new file).
>
> For this issue, the unlinked inode doesn't affect the correctness of the 
> filesystem operations. You can still safely use the filesystem, except 
> these orphan inodes will take extra disk space.

Hi Wendy, thanks for your answer.

To answer your earlier question, the kernel version used is 2.6.18-8.1.8.el5. I just noticed that a never kernel version is available, but as far as I can tell this is a security release and the changelog doesn't mention any changes to the GFS filesystem...

So if I understood you correctly this is a known bug/limitation that might orphan already deleted inodes if a node crashes before gfs_inoded had a chance to free them. And the only downside is some lost disk space which can be reclaimed by running gfs_fsck. This is good news.

Regards,
Jonas
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From raycharles_man at yahoo.com  Wed Sep 26 18:24:02 2007
From: raycharles_man at yahoo.com (Ray Charles)
Date: Wed, 26 Sep 2007 11:24:02 -0700 (PDT)
Subject: [Linux-cluster] Cannot join two nodes together
In-Reply-To: <EE9F27C20409C44AA031C769B3DBCE410298BD2E@barbpo3.bwes.net>
Message-ID: <831131.40504.qm@web32103.mail.mud.yahoo.com>


Hi,

If this is a lab or dev can you turn off iptables if
its running. Then later fine tune your iptables rules.

Sometimes its just a firewall issue.


-R
--- "Ozarchuk, John D" <jdozarchuk at babcock.com> wrote:

> I have two nodes on the same subnet, can ping each
> other, are both
> alive, both are members of a two-node cluster.  When
> I start cman on
> both nodes at the same time it says "X not a cluster
> member after 60 sec
> post_join_delay".  The output of clustat shows that
> the other node is
> "Offline" and the first node is "Online, local". 
> The nodes are fencing
> each other and powering each other off.  
> 
>  
> 
> Please help determine why I cannot get these nodes
> to join. Below is
> some information from my systems. RedHat Support is
> not getting
> anywhere.  
> 
>  
> 
> Thanks
> 
>  
> 
> ----------------------------------
> 
>  
> 
> Node1: bplmft11
> 
> Node2: bplmft12
> 
>  
> 
> uname -a -> Linux bplmft11 2.6.18-8.1.10.el5 #1 SMP
> Thu Aug 30 20:43:28
> EDT 2007 x86_64 x86_64 x86_64 GNU/Linux
> 
>  
> 
> [root at bplmft11 ~]# clustat
> 
> msg_open: No such file or directory
> 
> Member Status: Quorate
> 
>  
> 
>   Member Name                        ID   Status
> 
>   ------ ----                        ---- ------
> 
>   bplmft12                              1 Offline
> 
>   bplmft11                              2 Online,
> Local
> 
>  
> 
>  
> 
> /etc/cluster/cluster.conf file (with the fencing
> levels removed):
> 
>  
> 
> <?xml version="1.0" ?>
> 
> <cluster alias="plm_test" config_version="16"
> name="plm_test">
> 
>         <fence_daemon post_fail_delay="0"
> post_join_delay="60"/>
> 
>         <clusternodes>
> 
>                 <clusternode name="bplmft12"
> nodeid="1" votes="1">
> 
>                         <fence>
> 
>                                 <method name="1"/>
> 
>                         </fence>
> 
>                 </clusternode>
> 
>                 <clusternode name="bplmft11"
> nodeid="2" votes="1">
> 
>                         <fence>
> 
>                                 <method name="1"/>
> 
>                         </fence>
> 
>                 </clusternode>
> 
>         </clusternodes>
> 
>         <cman expected_votes="1" two_node="1"/>
> 
>         <fencedevices>
> 
>                 <fencedevice agent="fence_ilo"
> hostname="ilo-bplmft12"
> login="redhat_cluster_user" name="ilo-bplmft12"
> passwd="PASSWORD"/>
> 
>                 <fencedevice agent="fence_ilo"
> hostname="ilo-bplmft11"
> login="redhat_cluster_user" name="ilo-bplmft11"
> passwd="PASSWORD"/>
> 
>         </fencedevices>
> 
>         <rm>
> 
>                 <failoverdomains/>
> 
>                 <resources/>
> 
>         </rm>
> 
> </cluster>
> 
>  
> 
>  
> 
> /var/log/messages after doing a "service cman start"
> on both nodes:
> 
>  
> 
> ep 26 13:33:04 bplmft11 ccsd[31407]: Cluster is not
> quorate.  Refusing
> connection.
> 
> Sep 26 13:33:04 bplmft11 ccsd[31407]: Error while
> processing connect:
> Connection refused
> 
> Sep 26 13:33:04 bplmft11 ccsd[31407]: Initial
> status:: Quorate
> 
> Sep 26 13:33:10 bplmft11 snmpd[2616]: Connection
> from UDP:
> [127.0.0.1]:32771
> 
> Sep 26 13:33:10 bplmft11 snmpd[2616]: Received SNMP
> packet(s) from UDP:
> [127.0.0.1]:32771
> 
> Sep 26 13:33:25 bplmft11 snmpd[2616]: Connection
> from UDP:
> [127.0.0.1]:32771
> 
> Sep 26 13:33:55 bplmft11 last message repeated 2
> times
> 
> Sep 26 13:34:06 bplmft11 fenced[31436]: bplmft12 not
> a cluster member
> after 60 sec post_join_delay
> 
> Sep 26 13:34:06 bplmft11 fenced[31436]: fencing node
> "bplmft12"
> 
> Sep 26 13:34:06 bplmft11 fenced[31436]: fence
> "bplmft12" failed
> 
> 
> 
> -----------------------------------------
> This message is intended only for the individual or
> entity to which
> it is addressed and contains information that is
> proprietary to The
> Babcock & Wilcox Company and/or its affiliates, or
> may be otherwise
> confidential.  If the reader of this message is not
> the intended
> recipient, or the employee agent responsible for
> delivering the
> message to the intended recipient, you are hereby
> notified that any
> dissemination, distribution or copying of this
> communication is
> strictly prohibited.  If you have received this
> communication in
> error, please notify the sender immediately by
> return e-mail and
> delete this message from your computer.  Thank you.>
--
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster


____________________________________________________________________________________
Got a little couch potato? 
Check out fun summer activities for kids.
http://search.yahoo.com/search?fr=oni_on_mail&p=summer+activities+for+kids&cs=bz 


From rohara at redhat.com  Wed Sep 26 18:30:05 2007
From: rohara at redhat.com (Ryan O'Hara)
Date: Wed, 26 Sep 2007 13:30:05 -0500
Subject: [Linux-cluster] Cannot join two nodes together
In-Reply-To: <EE9F27C20409C44AA031C769B3DBCE410298BD2E@barbpo3.bwes.net>
References: <EE9F27C20409C44AA031C769B3DBCE410298BD2E@barbpo3.bwes.net>
Message-ID: <46FAA52D.50509@redhat.com>

Ozarchuk, John D wrote:
> I have two nodes on the same subnet, can ping each other, are both 
> alive, both are members of a two-node cluster.  When I start cman on 
> both nodes at the same time it says ?X not a cluster member after 60 sec 
> post_join_delay?.  The output of clustat shows that the other node is 
> ?Offline? and the first node is ?Online, local?.  The nodes are fencing 
> each other and powering each other off. 
> 
>  
> 
> Please help determine why I cannot get these nodes to join. Below is 
> some information from my systems. RedHat Support is not getting anywhere. 

You probably want to set the "two_node=1" option in the cluster.conf 
file. A good explanation of the caveats of a two-node cluster can be 
found in the FAQ, found here:

http://sourceware.org/cluster/faq.html

-Ryan


From jdozarchuk at babcock.com  Wed Sep 26 18:32:06 2007
From: jdozarchuk at babcock.com (Ozarchuk, John D)
Date: Wed, 26 Sep 2007 14:32:06 -0400
Subject: [Linux-cluster] Cannot join two nodes together
In-Reply-To: <831131.40504.qm@web32103.mail.mud.yahoo.com>
References: <EE9F27C20409C44AA031C769B3DBCE410298BD2E@barbpo3.bwes.net>
	<831131.40504.qm@web32103.mail.mud.yahoo.com>
Message-ID: <EE9F27C20409C44AA031C769B3DBCE410298BD45@barbpo3.bwes.net>

iptables is not running on either node. 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ray Charles
Sent: Wednesday, September 26, 2007 2:24 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cannot join two nodes together


Hi,

If this is a lab or dev can you turn off iptables if
its running. Then later fine tune your iptables rules.

Sometimes its just a firewall issue.


-R
--- "Ozarchuk, John D" <jdozarchuk at babcock.com> wrote:

> I have two nodes on the same subnet, can ping each
> other, are both
> alive, both are members of a two-node cluster.  When
> I start cman on
> both nodes at the same time it says "X not a cluster
> member after 60 sec
> post_join_delay".  The output of clustat shows that
> the other node is
> "Offline" and the first node is "Online, local". 
> The nodes are fencing
> each other and powering each other off.  
> 
>  
> 
> Please help determine why I cannot get these nodes
> to join. Below is
> some information from my systems. RedHat Support is
> not getting
> anywhere.  
> 
>  
> 
> Thanks
> 
>  
> 
> ----------------------------------
> 
>  
> 
> Node1: bplmft11
> 
> Node2: bplmft12
> 
>  
> 
> uname -a -> Linux bplmft11 2.6.18-8.1.10.el5 #1 SMP
> Thu Aug 30 20:43:28
> EDT 2007 x86_64 x86_64 x86_64 GNU/Linux
> 
>  
> 
> [root at bplmft11 ~]# clustat
> 
> msg_open: No such file or directory
> 
> Member Status: Quorate
> 
>  
> 
>   Member Name                        ID   Status
> 
>   ------ ----                        ---- ------
> 
>   bplmft12                              1 Offline
> 
>   bplmft11                              2 Online,
> Local
> 
>  
> 
>  
> 
> /etc/cluster/cluster.conf file (with the fencing
> levels removed):
> 
>  
> 
> <?xml version="1.0" ?>
> 
> <cluster alias="plm_test" config_version="16"
> name="plm_test">
> 
>         <fence_daemon post_fail_delay="0"
> post_join_delay="60"/>
> 
>         <clusternodes>
> 
>                 <clusternode name="bplmft12"
> nodeid="1" votes="1">
> 
>                         <fence>
> 
>                                 <method name="1"/>
> 
>                         </fence>
> 
>                 </clusternode>
> 
>                 <clusternode name="bplmft11"
> nodeid="2" votes="1">
> 
>                         <fence>
> 
>                                 <method name="1"/>
> 
>                         </fence>
> 
>                 </clusternode>
> 
>         </clusternodes>
> 
>         <cman expected_votes="1" two_node="1"/>
> 
>         <fencedevices>
> 
>                 <fencedevice agent="fence_ilo"
> hostname="ilo-bplmft12"
> login="redhat_cluster_user" name="ilo-bplmft12"
> passwd="PASSWORD"/>
> 
>                 <fencedevice agent="fence_ilo"
> hostname="ilo-bplmft11"
> login="redhat_cluster_user" name="ilo-bplmft11"
> passwd="PASSWORD"/>
> 
>         </fencedevices>
> 
>         <rm>
> 
>                 <failoverdomains/>
> 
>                 <resources/>
> 
>         </rm>
> 
> </cluster>
> 
>  
> 
>  
> 
> /var/log/messages after doing a "service cman start"
> on both nodes:
> 
>  
> 
> ep 26 13:33:04 bplmft11 ccsd[31407]: Cluster is not
> quorate.  Refusing
> connection.
> 
> Sep 26 13:33:04 bplmft11 ccsd[31407]: Error while
> processing connect:
> Connection refused
> 
> Sep 26 13:33:04 bplmft11 ccsd[31407]: Initial
> status:: Quorate
> 
> Sep 26 13:33:10 bplmft11 snmpd[2616]: Connection
> from UDP:
> [127.0.0.1]:32771
> 
> Sep 26 13:33:10 bplmft11 snmpd[2616]: Received SNMP
> packet(s) from UDP:
> [127.0.0.1]:32771
> 
> Sep 26 13:33:25 bplmft11 snmpd[2616]: Connection
> from UDP:
> [127.0.0.1]:32771
> 
> Sep 26 13:33:55 bplmft11 last message repeated 2
> times
> 
> Sep 26 13:34:06 bplmft11 fenced[31436]: bplmft12 not
> a cluster member
> after 60 sec post_join_delay
> 
> Sep 26 13:34:06 bplmft11 fenced[31436]: fencing node
> "bplmft12"
> 
> Sep 26 13:34:06 bplmft11 fenced[31436]: fence
> "bplmft12" failed
> 
> 
> 
> -----------------------------------------
> This message is intended only for the individual or
> entity to which
> it is addressed and contains information that is
> proprietary to The
> Babcock & Wilcox Company and/or its affiliates, or
> may be otherwise
> confidential.  If the reader of this message is not
> the intended
> recipient, or the employee agent responsible for
> delivering the
> message to the intended recipient, you are hereby
> notified that any
> dissemination, distribution or copying of this
> communication is
> strictly prohibited.  If you have received this
> communication in
> error, please notify the sender immediately by
> return e-mail and
> delete this message from your computer.  Thank you.>
--
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster


________________________________________________________________________
____________
Got a little couch potato? 
Check out fun summer activities for kids.
http://search.yahoo.com/search?fr=oni_on_mail&p=summer+activities+for+ki
ds&cs=bz 

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From jdozarchuk at babcock.com  Wed Sep 26 18:46:00 2007
From: jdozarchuk at babcock.com (Ozarchuk, John D)
Date: Wed, 26 Sep 2007 14:46:00 -0400
Subject: [Linux-cluster] Cannot join two nodes together
In-Reply-To: <46FAA52D.50509@redhat.com>
References: <EE9F27C20409C44AA031C769B3DBCE410298BD2E@barbpo3.bwes.net>
	<46FAA52D.50509@redhat.com>
Message-ID: <EE9F27C20409C44AA031C769B3DBCE410298BD4E@barbpo3.bwes.net>

This is a cman_tool nodes from each node...

[root at bplmft11 ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   X     24                        bplmft12
   2   M      4   2007-09-26 13:33:04  bplmft11

Last login: Wed Sep 26 14:35:26 2007 from bplmft11
[root at bplmft12 ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M      4   2007-09-26 14:35:16  bplmft12
   2   X     24                        bplmft11

It's as if each node is locally joined but the other node is not.  

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Ryan O'Hara
Sent: Wednesday, September 26, 2007 2:30 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cannot join two nodes together

Ozarchuk, John D wrote:
> I have two nodes on the same subnet, can ping each other, are both 
> alive, both are members of a two-node cluster.  When I start cman on 
> both nodes at the same time it says "X not a cluster member after 60
sec 
> post_join_delay".  The output of clustat shows that the other node is 
> "Offline" and the first node is "Online, local".  The nodes are
fencing 
> each other and powering each other off. 
> 
>  
> 
> Please help determine why I cannot get these nodes to join. Below is 
> some information from my systems. RedHat Support is not getting
anywhere. 

You probably want to set the "two_node=1" option in the cluster.conf 
file. A good explanation of the caveats of a two-node cluster can be 
found in the FAQ, found here:

http://sourceware.org/cluster/faq.html

-Ryan

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
-----------------------------------------
This message is intended only for the individual or entity to which
it is addressed and contains information that is proprietary to The
Babcock & Wilcox Company and/or its affiliates, or may be otherwise
confidential.  If the reader of this message is not the intended
recipient, or the employee agent responsible for delivering the
message to the intended recipient, you are hereby notified that any
dissemination, distribution or copying of this communication is
strictly prohibited.  If you have received this communication in
error, please notify the sender immediately by return e-mail and
delete this message from your computer.  Thank you.


From wcheng at redhat.com  Wed Sep 26 19:55:04 2007
From: wcheng at redhat.com (Wendy Cheng)
Date: Wed, 26 Sep 2007 15:55:04 -0400
Subject: [Linux-cluster] Found unlinked inode
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D021D6CE3@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D021D6CA9@WMRI000167.corp.wmdata.net>	<CABF801D13AA444988E62B7AF62C371D021D6CC5@WMRI000167.corp.wmdata.net>	<20070926155215.GE15033@redhat.com><46FA8AAE.9000606@redhat.com>
	<46FA98B1.9070501@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D021D6CE3@WMRI000167.corp.wmdata.net>
Message-ID: <46FAB918.2050002@redhat.com>

Borgstr?m Jonas wrote:

>
>Hi Wendy, thanks for your answer.
>
>To answer your earlier question, the kernel version used is 2.6.18-8.1.8.el5. I just noticed that a never kernel version is available, but as far as I can tell this is a security release and the changelog doesn't mention any changes to the GFS filesystem...
>
>So if I understood you correctly this is a known bug/limitation that might orphan already deleted inodes if a node crashes before gfs_inoded had a chance to free them. And the only downside is some lost disk space which can be reclaimed by running gfs_fsck. This is good news.
>
>  
>
Yes..

We'll look into this.

-- Wendy


From johnvge at yahoo.com  Wed Sep 26 19:57:19 2007
From: johnvge at yahoo.com (John Vijoe George)
Date: Wed, 26 Sep 2007 12:57:19 -0700 (PDT)
Subject: [Linux-cluster] adding a node to a cluster
Message-ID: <615221.60186.qm@web54410.mail.yahoo.com>

I am a newbie and am trying to add a node to an existing cluster.
I am running into issues when trying to add a 4th node to a cluster of three RHEL-5 machines. 
Since the /etc/cluster/cluster.conf file does not have the information (clusternode name, fence info etc) of the fourth node, I decided to run the system-config-cluster tool on a machine that is already on the cluster. When I run the system-config-cluster tool on any of the machines already on the cluster, I get the following errors:
"Because this machine is not already part of the cluster, the management tab for this application is not available". To confirm that this machine was on the cluster, I ran the cman_tool join command as follows:

[root at DL365-2 ~]# cman_tool join
cman_tool: Node is already active

1. Why is it that I am not able to obtain the management tab even when I run the system-config-cluster from a machine that is already on the cluster?

Since the above failed, I decided to add the corresponding node and fence detials manually in the cluster.conf file (and copy it to all the machines on cluster). Once I did that and ran the 'service cman start', I got an error saying that the local node name is not present in the cluster.conf. When I checked the conf file again, the node that I added disappeared after I ran cman start?

# service cman start
Starting cluster:
   Loading modules... done
   Mounting configfs... done
   Starting ccsd... done
   Starting cman... failed
cman not started: Can't find local node name in cluster.conf /usr/sbin/cman_tool: aisexec daemon didn't start

2. Why is it that cluster.conf gets reset after I run cman? Is there anyway I could get it to work?

Any help in this regards is appreciated.

John G


      ____________________________________________________________________________________
Tonight's top picks. What will you watch tonight? Preview the hottest shows on Yahoo! TV.
http://tv.yahoo.com/ 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070926/d6adca04/attachment.htm>

From emerhsilva at gmail.com  Wed Sep 26 20:17:49 2007
From: emerhsilva at gmail.com (Emerson Henrique da silva)
Date: Wed, 26 Sep 2007 17:17:49 -0300
Subject: [Linux-cluster] Cluster for DB2 database
Message-ID: <616a369f0709261317w50e49f77je6e4a027aeff9aa9@mail.gmail.com>

Hello all!


Does anybody ever used Red Hat cluster for DB2 Databases ? (performance
clusters)
Is it similar to Oracle's RAC, that have a particular cluster manager ?
In my readings, I fell that RH Cluster is not the right way to achieve the
targets that I've finding. In the IBM's site, the documents are very
confused to locate it.


Thanks for comments.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070926/c7214c9c/attachment.htm>

From jprats at cesca.es  Wed Sep 26 19:38:18 2007
From: jprats at cesca.es (Jordi Prats)
Date: Wed, 26 Sep 2007 21:38:18 +0200
Subject: [Linux-cluster] Cluster for DB2 database
In-Reply-To: <616a369f0709261317w50e49f77je6e4a027aeff9aa9@mail.gmail.com>
References: <616a369f0709261317w50e49f77je6e4a027aeff9aa9@mail.gmail.com>
Message-ID: <46FAB52A.6050005@cesca.es>

Hi,
I'm not a expert on DB2 but I don't think RHcluster will give you a 
performance-oriented cluster of DB2 databases. Instead, I could provide 
you a HA database or a HA-iSCSI target or GFS for your datafiles. 
Remember that if you share the same datafiles DB2 should be cluster-aware.

By the way, Oracle RAC it's a cluster by itself, it does not use RH 
cluster in any way.

Hope this helps.

regards,
Jordi

Emerson Henrique da silva wrote:
> Hello all!
> 
> 
> Does anybody ever used Red Hat cluster for DB2 Databases ? (performance 
> clusters)
> Is it similar to Oracle's RAC, that have a particular cluster manager ?
> In my readings, I fell that RH Cluster is not the right way to achieve 
> the targets that I've finding. In the IBM's site, the documents are very 
> confused to locate it.
> 
> 
> Thanks for comments.
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From Joel.Becker at oracle.com  Wed Sep 26 23:02:08 2007
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 26 Sep 2007 16:02:08 -0700
Subject: [Linux-cluster] propagating expected=N
Message-ID: <20070926230208.GC17474@tasint.org>

Folks,
	Quick question on clusters where nodes are off.  I have a
cluster with four nodes.  They are all off.  I turn one on and start
cman.  The cluster services hang on this node because there is no
quorum.  I then run "cman_tool expected -e 1" to tell cman there is only
one node.  cman becomes quorate and fences the other three nodes.  Now
this node is a happy one-node cluster.
	Then I turn on a second node and start cman.  It is my
understanding that when the second node comes up, the first node will
automatically bump expected to 2.  It does just that, and the first node
is happy.  *However*, the second node does not see expected=2.  It sees
expected=4.  I had to run "cman_tool expected -e 2" on the second node
before it became quorate.
	Shouldn't the second node get the expected value from the first
node?  Shouldn't it just go quorate?

Joel

-- 

"There are some experiences in life which should not be demanded
 twice from any man, and one of them is listening to the Brahms Requiem."
        - George Bernard Shaw

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From JFillman at cucbc.com  Wed Sep 26 23:20:05 2007
From: JFillman at cucbc.com (James Fillman)
Date: Wed, 26 Sep 2007 16:20:05 -0700
Subject: [Linux-cluster] How do you set up a Virtual IP Service??
Message-ID: <66F461DD7EDEEF4AA928FCC80B425B52AFCBF4@c2kp01mail.cucbc.com>

First off, I have to say that the documentation for cluster suite is
horrible. Trying to find explanations and examples of cluster.conf
syntax is impossible. Where do you got learn you got configure services,
resources, failover domains? 

Anyway, I want to set up a vip service that belongs to a failover
domain. How do I do it? This is the config I've come up with so far. It
looks fine based on the information that I've scrapped up, but it
doesn't work. I've got verbose logging turned on and I see no errors.

For a vip that's managed by rgmanager, I assume that it shouldn't be
started and stopped by init upon server start/stops. That would mean NOT
having an entry in /etc/sysconfig/network-scripts/.  Correct??

Are there any command line tools to test this with?

Here's a snippet of my cluster.conf file:

<cluster alias="test cluster" config_version="7" name="test cluster">
        <fence_daemon clean_start="0" post_fail_delay="0"
post_join_delay="3"/>
        <clusternodes>
                <clusternode name="clxp02cluster.cucbc.com" nodeid="1"
votes="1"/>
                <clusternode name="clxp01cluster.cucbc.com" nodeid="2"
votes="1"/>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices/>
        <rm>
                <failoverdomains/>
                <resources/>
                <service autostart="1" name="Network Sysloging"
domain="syslog">
                        <ip address="172.17.54.210" monitor_link="1"/>
                        <script file="/etc/rc.d/init.d/syslog-ng"
name="syslog-ng"/>
                </service>
                <failoverdomains>
                        <failoverdomain name="syslog" ordered="1"
restricted="1">
                                <failoverdomainnode
name="clxp02cluster.cucbc.com" priority="2"/>
                                <failoverdomainnode
name="clxp01cluster.cucbc.com" priority="1"/>
                        </failoverdomain>
                        <failoverdomain name="splunk" ordered="1"
restricted="1">
                                <failoverdomainnode
name="clxp02cluster.cucbc.com" priority="1"/>
                                <failoverdomainnode
name="clxp01cluster.cucbc.com" priority="2"/>
                        </failoverdomain>
                </failoverdomains>
        </rm>
</cluster>

thanks,
--james


From Joel.Becker at oracle.com  Thu Sep 27 00:13:53 2007
From: Joel.Becker at oracle.com (Joel Becker)
Date: Wed, 26 Sep 2007 17:13:53 -0700
Subject: [Linux-cluster] propagating expected=N
In-Reply-To: <20070926230208.GC17474@tasint.org>
References: <20070926230208.GC17474@tasint.org>
Message-ID: <20070927001353.GF17474@tasint.org>

On Wed, Sep 26, 2007 at 04:02:08PM -0700, Joel Becker wrote:
> 	Then I turn on a second node and start cman.  It is my
> understanding that when the second node comes up, the first node will
> automatically bump expected to 2.  It does just that, and the first node

	I was wrong.  The first node leaves "expected" as "1".  Isn't
this incorrect as well?

Joel

-- 

"The opposite of a correct statement is a false statement. The
 opposite of a profound truth may well be another profound truth."
         - Niels Bohr 

Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127


From jos at xos.nl  Thu Sep 27 06:19:29 2007
From: jos at xos.nl (Jos Vos)
Date: Thu, 27 Sep 2007 08:19:29 +0200
Subject: [Linux-cluster] How do you set up a Virtual IP Service??
In-Reply-To: <66F461DD7EDEEF4AA928FCC80B425B52AFCBF4@c2kp01mail.cucbc.com>
References: <66F461DD7EDEEF4AA928FCC80B425B52AFCBF4@c2kp01mail.cucbc.com>
Message-ID: <20070927061929.GA17399@jasmine.xos.nl>

On Wed, Sep 26, 2007 at 04:20:05PM -0700, James Fillman wrote:

> First off, I have to say that the documentation for cluster suite is
> horrible. Trying to find explanations and examples of cluster.conf
> syntax is impossible. Where do you got learn you got configure services,
> resources, failover domains? 

Start reading:

http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/Cluster_Suite_Overview/index.html
http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/en-US/Cluster_Suite_Overview.pdf
(2 alternatives for the same document)

http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/Cluster_Administration/index.html
http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/en-US/Cluster_Administration.pdf
(2 alternatives for the same document)

http://sourceware.org/cluster/faq.html

Then create, given this documentation, a complete cluster.conf using
system-config-cluster.

> For a vip that's managed by rgmanager, I assume that it shouldn't be
> started and stopped by init upon server start/stops. That would mean NOT
> having an entry in /etc/sysconfig/network-scripts/.  Correct??

Correct.  It is handled by the cluster software internally and you can
not see it with "ifconfig", only with "ip addr list".

-- 
--    Jos Vos <jos at xos.nl>
--    X/OS Experts in Open Systems BV   |   Phone: +31 20 6938364
--    Amsterdam, The Netherlands        |     Fax: +31 20 6948204


From changerv at gmail.com  Thu Sep 27 07:31:40 2007
From: changerv at gmail.com (Changer Van)
Date: Thu, 27 Sep 2007 15:31:40 +0800
Subject: [Linux-cluster] service can not be relocated
Message-ID: <9fa3c2e50709270031n187b3403nf0666fdfa9bed4e9@mail.gmail.com>

Hi all,

Httpd service can not be relocated when I performed the command as follows:

# clusvcadm -r httpd
Trying to relocate service:httpd...Failure
service:httpd is now running on node02

Here are cluster status:
Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  node01                                1 Online, Local, rgmanager
  node02                                2 Online, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  service:httpd        node02                         started


Here are the context of cluster.conf:
<?xml version="1.0" ?>
<cluster alias="web_cluster" config_version="5" name="web_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="node01" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="manual"
nodename="node01"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node02" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="manual"
nodename="node02"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_manual" name="manual"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="web" ordered="0"
restricted="0">
                                <failoverdomainnode name="node01"
priority="1"/>
                                <failoverdomainnode name="node02"
priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="192.168.0.101" monitor_link="1"/>
                        <script file="/etc/init.d/httpd" name="httpd"/>
                        <clusterfs device="/dev/gnbd/gfs" force_unmount="1"
fsid="57107" fstype="gfs" mountpoint="/media/gfs" name="gnbd_storage"
options=""/>
                </resources>
                <service autostart="0" domain="web" name="httpd"
recovery="relocate">
                        <ip ref="192.168.0.101">
                                <script ref="httpd"/>
                                <clusterfs ref="gnbd_storage"/>
                        </ip>
                </service>
        </rm>
</cluster>

Any help would be greatly appreciated.


-- 
Regards,
Changer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070927/ddaaeb5c/attachment.htm>

From pcaulfie at redhat.com  Thu Sep 27 09:00:15 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 27 Sep 2007 10:00:15 +0100
Subject: [Linux-cluster] propagating expected=N
In-Reply-To: <20070927001353.GF17474@tasint.org>
References: <20070926230208.GC17474@tasint.org>
	<20070927001353.GF17474@tasint.org>
Message-ID: <46FB711F.1030003@redhat.com>

Joel Becker wrote:
> On Wed, Sep 26, 2007 at 04:02:08PM -0700, Joel Becker wrote:
>> 	Then I turn on a second node and start cman.  It is my
>> understanding that when the second node comes up, the first node will
>> automatically bump expected to 2.  It does just that, and the first node
> 
> 	I was wrong.  The first node leaves "expected" as "1".  Isn't
> this incorrect as well?


Yikes!

Fixed in HEAD.

Patrick


From emerhsilva at gmail.com  Thu Sep 27 11:55:12 2007
From: emerhsilva at gmail.com (Emerson Henrique da silva)
Date: Thu, 27 Sep 2007 08:55:12 -0300
Subject: [Linux-cluster] Cluster for DB2 database
In-Reply-To: <46FAB52A.6050005@cesca.es>
References: <616a369f0709261317w50e49f77je6e4a027aeff9aa9@mail.gmail.com>
	<46FAB52A.6050005@cesca.es>
Message-ID: <616a369f0709270455q324e2b96he6443ef4bf645816@mail.gmail.com>

Hello Jordi,

Thanks for your comments.

Really, I was thinking about this hypothesis too but, I need to improve the
performance overall. Of course, HA is a implied concept on that kind of
Cluster.
GFS or HA-iSCSI, only will  help me to consolidate the Database files but
wont give the growth in scale, that is what I'm  looking for.

Our database partner is insisting to make us to use the IBM's HACMP but, I'm
not sure about this yet.

What do you thing ?


Emerson


2007/9/26, Jordi Prats <jprats at cesca.es>:
>
> Hi,
> I'm not a expert on DB2 but I don't think RHcluster will give you a
> performance-oriented cluster of DB2 databases. Instead, I could provide
> you a HA database or a HA-iSCSI target or GFS for your datafiles.
> Remember that if you share the same datafiles DB2 should be cluster-aware.
>
> By the way, Oracle RAC it's a cluster by itself, it does not use RH
> cluster in any way.
>
> Hope this helps.
>
> regards,
> Jordi
>
> Emerson Henrique da silva wrote:
> > Hello all!
> >
> >
> > Does anybody ever used Red Hat cluster for DB2 Databases ? (performance
> > clusters)
> > Is it similar to Oracle's RAC, that have a particular cluster manager ?
> > In my readings, I fell that RH Cluster is not the right way to achieve
> > the targets that I've finding. In the IBM's site, the documents are very
> > confused to locate it.
> >
> >
> > Thanks for comments.
> >
> >
> > ------------------------------------------------------------------------
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070927/bf7323ba/attachment.htm>

From jparsons at redhat.com  Thu Sep 27 15:45:46 2007
From: jparsons at redhat.com (jim parsons)
Date: Thu, 27 Sep 2007 11:45:46 -0400
Subject: [Linux-cluster] How do you set up a Virtual IP Service??
In-Reply-To: <20070927061929.GA17399@jasmine.xos.nl>
References: <66F461DD7EDEEF4AA928FCC80B425B52AFCBF4@c2kp01mail.cucbc.com>
	<20070927061929.GA17399@jasmine.xos.nl>
Message-ID: <1190907946.3305.3.camel@localhost.localdomain>

On Thu, 2007-09-27 at 08:19 +0200, Jos Vos wrote:
> On Wed, Sep 26, 2007 at 04:20:05PM -0700, James Fillman wrote:
> 
> > First off, I have to say that the documentation for cluster suite is
> > horrible. Trying to find explanations and examples of cluster.conf
> > syntax is impossible. Where do you got learn you got configure services,
> > resources, failover domains? 
> 
> Start reading:
> 
> http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/Cluster_Suite_Overview/index.html
> http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/en-US/Cluster_Suite_Overview.pdf
> (2 alternatives for the same document)
> 
> http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/Cluster_Administration/index.html
> http://www.redhat.com/docs/manuals/enterprise/RHEL-5-manual/en-US/Cluster_Administration.pdf
> (2 alternatives for the same document)
> 
> http://sourceware.org/cluster/faq.html
> 
> Then create, given this documentation, a complete cluster.conf using
> system-config-cluster.
May I also suggest the cluster.conf schema doc here:

> http://sources.redhat.com/cluster/doc/cluster_schema.html
This page is linked off of sourceware.org/cluster

Also, conga is a really easy way of getting a cluster up and running.
You kinda need to know more to use s-c-cluster...with conga, you enter
the proposed nodenames, and the passwords for them and it just creates a
cluster for you. 

-j


From rmccabe at redhat.com  Thu Sep 27 16:18:02 2007
From: rmccabe at redhat.com (Ryan McCabe)
Date: Thu, 27 Sep 2007 12:18:02 -0400
Subject: [Linux-cluster] adding a node to a cluster
In-Reply-To: <615221.60186.qm@web54410.mail.yahoo.com>
References: <615221.60186.qm@web54410.mail.yahoo.com>
Message-ID: <20070927161801.GA847648@redhat.com>

On Wed, Sep 26, 2007 at 12:57:19PM -0700, John Vijoe George wrote:
> 1. Why is it that I am not able to obtain the management tab even when I run the system-config-cluster from a machine that is already on the cluster?

You may be hitting this bug:
https://bugzilla.redhat.com/show_bug.cgi?id=249705. There's an updated
rpm attached to https://bugzilla.redhat.com/show_bug.cgi?id=253843 that
you may want to give a shot. Alternately, you could use conga.

> Since the above failed, I decided to add the corresponding node and fence detials manually in the cluster.conf file (and copy it to all the machines on cluster). Once I did that and ran the 'service cman start', I got an error saying that the local node name is not present in the cluster.conf. When I checked the conf file again, the node that I added disappeared after I ran cman start?

Make sure you bumped the cluster config version number, then run
'ccs_tool update /etc/cluster/cluster.conf' on one node with the
updated cluster.conf file. That will propagate the updated
configuration to the rest of the running nodes and update the
configuration version number.


Ryan


From johnvge at yahoo.com  Thu Sep 27 18:09:40 2007
From: johnvge at yahoo.com (John Vijoe George)
Date: Thu, 27 Sep 2007 11:09:40 -0700 (PDT)
Subject: [Linux-cluster] adding a node to a cluster
Message-ID: <984786.71739.qm@web54408.mail.yahoo.com>


On Wed, Sep 26, 2007 at 12:57:19PM -0700, John Vijoe George wrote:
>> 1. Why is it that I am not able to obtain the management tab even when I run the system-config-cluster from a machine that is already on the cluster?
>
> You may be hitting this bug:
> https://bugzilla.redhat.com/show_bug.cgi?id=249705. There's an updated
> rpm attached to https://bugzilla.redhat.com/show_bug.cgi?id=253843 that
> you may want to give a shot. Alternately, you could use conga.
Yes. After trying the above rpm, system-config-cluster works much much better. Thank you very much Ryan.

>> Since the above failed, I decided to add the corresponding node and fence detials manually in the cluster.conf file (and copy it to all the machines on cluster). Once I did that and ran the 'service cman start', I got an error saying that the local node name is not present in the cluster.conf. When I checked the conf file again, the node that I added disappeared after I ran cman start?

> Make sure you bumped the cluster config version number, then run
> 'ccs_tool update /etc/cluster/cluster.conf' on one node with the
> updated cluster.conf file. That will propagate the updated
> configuration to the rest of the running nodes and update the
> configuration version number.

I figured this late last afternoon, but appreciate you confirming the same for me.


____________________________________________________________________________________
Boardwalk for $500? In 2007? Ha! Play Monopoly Here and Now (it's updated for today's economy) at Yahoo! Games.
http://get.games.yahoo.com/proddesc?gamekey=monopolyherenow  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070927/6182093d/attachment.htm>

From jparsons at redhat.com  Thu Sep 27 18:42:42 2007
From: jparsons at redhat.com (jim parsons)
Date: Thu, 27 Sep 2007 14:42:42 -0400
Subject: [Linux-cluster] adding a node to a cluster
In-Reply-To: <984786.71739.qm@web54408.mail.yahoo.com>
References: <984786.71739.qm@web54408.mail.yahoo.com>
Message-ID: <1190918562.3310.9.camel@localhost.localdomain>

On Thu, 2007-09-27 at 11:09 -0700, John Vijoe George wrote:
> 
> On Wed, Sep 26, 2007 at 12:57:19PM -0700, John Vijoe George wrote:
> >> 1. Why is it that I am not able to obtain the management tab even
> when I run the system-config-cluster from a machine that is already on
> the cluster?
> >
> > You may be hitting this bug:
> > https://bugzilla.redhat.com/show_bug.cgi?id=249705. There's an
> updated
> > rpm attached to https://bugzilla.redhat.com/show_bug.cgi?id=253843
> that
> > you may want to give a shot. Alternately, you could use conga.
> Yes. After trying the above rpm, system-config-cluster works much much
> better. Thank you very much Ryan.
> 
> >> Since the above failed, I decided to add the corresponding node and
> fence detials manually in the cluster.conf file (and copy it to all
> the machines on cluster). Once I did that and ran the 'service cman
> start', I got an error saying that the local node name is not present
> in the cluster.conf. When I checked the conf file again, the node that
> I added disappeared after I ran cman start?
> 
> > Make sure you bumped the cluster config version number, then run
> > 'ccs_tool update /etc/cluster/cluster.conf' on one node with the
> > updated cluster.conf file. That will propagate the updated
> > configuration to the rest of the running nodes and update the
> > configuration version number.
> 
> I figured this late last afternoon, but appreciate you confirming the
> same for me.
BTW, my 2 cents, if you don't mind; system-config-cluster and conga both
do the incrementing for you. It is kind of a hassle to do by hand, imho,
but if gui's don't fit your environment, it is scriptable.
> 
> 
-j


From lhh at redhat.com  Thu Sep 27 19:35:30 2007
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 27 Sep 2007 15:35:30 -0400
Subject: [Linux-cluster] NFS Exports in rgmanager
In-Reply-To: <46F8ABA6.87A6.0099.1@seakr.com>
References: <46F8ABA6.87A6.0099.1@seakr.com>
Message-ID: <1190921730.20722.5.camel@ayanami.boston.devel.redhat.com>

On Tue, 2007-09-25 at 06:33 -0600, Nick Couchman wrote:
> I'm attempting to set up fail-over NFS services on my RHCS cluster and
> have a question about the NFS export and NFS client resources.  In our
> environment, we have a few occasions where we don't want to export the
> entire volume to a client, we just want to export a certain directory
> on the volume.  So, for example, by default, when you set up a GFS
> Filesystem (let's say it's mounted at /mnt/Vol1), an NFS export
> resource (we'll call it Share1), and an NFS Client Resource (client1
> with options rw,root_squash), the entire contents of /mnt/Vol1 is
> exported to the client.  In our case, we have a directory
> - /mnt/Vol1/Dir1/ThisDirectory - that we need to export to a client,
> and we only want the client to have access to that directory.  Is this
> possible in rgmanager?  If so, how do I go about it?  If not, can
> someone suggest some alternatives, aside from completely manually
> managing the NFS stuff with RHCS?  If the answer to that is that I
> must do it manually, maybe someone can consider adding in a
> "Directory" resource to rgmanager?

Yes, just add a private nfsclient resource to the nfsexport and override
the path:

  <nfsclient path="whole-path-to-directory" .../>

Note that you can't reuse the nfsclient resource if you do this (it
breaks inheritance).

-- Lon


From johnvge at yahoo.com  Thu Sep 27 22:23:26 2007
From: johnvge at yahoo.com (John Vijoe George)
Date: Thu, 27 Sep 2007 15:23:26 -0700 (PDT)
Subject: [Linux-cluster] adding a node to a cluster
Message-ID: <465828.33850.qm@web54410.mail.yahoo.com>


> BTW, my 2 cents, if you don't mind; system-config-cluster and conga both
do the incrementing for you. It is kind of a hassle to do by hand, imho,
but if gui's don't fit your environment, it is scriptable.

Sure Jim. Makes sense. 
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


____________________________________________________________________________________
Looking for a deal? Find great prices on flights and hotels with Yahoo! FareChase.
http://farechase.yahoo.com/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070927/e19d1112/attachment.htm>

From jprats at cesca.es  Fri Sep 28 06:46:49 2007
From: jprats at cesca.es (Jordi Prats)
Date: Fri, 28 Sep 2007 08:46:49 +0200
Subject: [Linux-cluster] Cluster for DB2 database
In-Reply-To: <616a369f0709270455q324e2b96he6443ef4bf645816@mail.gmail.com>
References: <616a369f0709261317w50e49f77je6e4a027aeff9aa9@mail.gmail.com>	<46FAB52A.6050005@cesca.es>
	<616a369f0709270455q324e2b96he6443ef4bf645816@mail.gmail.com>
Message-ID: <46FCA359.5040103@cesca.es>

Hi,
HACMP seems to be, like RH-cluster, a active-pasive cluster:

http://www-03.ibm.com/systems/p/software/hacmp.html

So, you will not increase performance, just avaliability. If you want to 
increase performance you should look for a cluster database like 
MySQL-cluster or Oracle RAC. I don't know much about DB2, so I don't 
know if there's a DB2-cluster.

regards,
Jordi


Emerson Henrique da silva wrote:
> Hello Jordi,
>
> Thanks for your comments.
>
> Really, I was thinking about this hypothesis too but, I need to 
> improve the performance overall. Of course, HA is a implied concept on 
> that kind of Cluster.
> GFS or HA-iSCSI, only will  help me to consolidate the Database files 
> but wont give the growth in scale, that is what I'm  looking for.
>
> Our database partner is insisting to make us to use the IBM's HACMP 
> but, I'm not sure about this yet.
>
> What do you thing ?
>
>
>
> Emerson
>
>
>
>
> 2007/9/26, Jordi Prats < jprats at cesca.es <mailto:jprats at cesca.es>>:
>
>     Hi,
>     I'm not a expert on DB2 but I don't think RHcluster will give you a
>     performance-oriented cluster of DB2 databases. Instead, I could
>     provide
>     you a HA database or a HA-iSCSI target or GFS for your datafiles.
>     Remember that if you share the same datafiles DB2 should be
>     cluster-aware.
>
>     By the way, Oracle RAC it's a cluster by itself, it does not use RH
>     cluster in any way.
>
>     Hope this helps.
>
>     regards,
>     Jordi
>
>     Emerson Henrique da silva wrote:
>     > Hello all!
>     >
>     >
>     > Does anybody ever used Red Hat cluster for DB2 Databases ?
>     (performance
>     > clusters)
>     > Is it similar to Oracle's RAC, that have a particular cluster
>     manager ?
>     > In my readings, I fell that RH Cluster is not the right way to
>     achieve
>     > the targets that I've finding. In the IBM's site, the documents
>     are very
>     > confused to locate it.
>     >
>     >
>     > Thanks for comments.
>     >
>     >
>     >
>     ------------------------------------------------------------------------
>
>     >
>     > --
>     > Linux-cluster mailing list
>     > Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     > https://www.redhat.com/mailman/listinfo/linux-cluster
>     <https://www.redhat.com/mailman/listinfo/linux-cluster>
>
>     --
>     Linux-cluster mailing list
>     Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     https://www.redhat.com/mailman/listinfo/linux-cluster
>     <https://www.redhat.com/mailman/listinfo/linux-cluster>
>
>
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From jprats at cesca.es  Fri Sep 28 07:46:26 2007
From: jprats at cesca.es (Jordi Prats)
Date: Fri, 28 Sep 2007 09:46:26 +0200
Subject: [Linux-cluster] Bug on dlm
Message-ID: <46FCB152.2040001@cesca.es>

Hi,
I've found this while starting my server. It's a F7 with the latest 
version avaliable.

Hope this helps :)

Jordi

Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: recover 1
Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: add member 2
Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: total members 1 error 0
Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory
Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory 0 
entries
Jul 26 23:52:51 inf18 kernel:
Jul 26 23:52:51 inf18 kernel: =====================================
Jul 26 23:52:51 inf18 kernel: [ BUG: bad unlock balance detected! ]
Jul 26 23:52:51 inf18 kernel: -------------------------------------
Jul 26 23:52:51 inf18 kernel: dlm_recoverd/2963 is trying to release 
lock (&ls->ls_in_recovery) at:
Jul 26 23:52:51 inf18 kernel: [<ee67b874>] dlm_recoverd+0x265/0x433 [dlm]
Jul 26 23:52:51 inf18 kernel: but there are no more locks to release!
Jul 26 23:52:51 inf18 kernel:
Jul 26 23:52:51 inf18 kernel: other info that might help us debug this:
Jul 26 23:52:51 inf18 kernel: 2 locks held by dlm_recoverd/2963:
Jul 26 23:52:51 inf18 kernel:  #0:  (&ls->ls_recoverd_active){....}, at: 
[<c11f9e7d>] mutex_lock+0x21/0x24
Jul 26 23:52:51 inf18 kernel:  #1:  (&ls->ls_recover_lock){....}, at: 
[<ee67b84d>] dlm_recoverd+0x23e/0x433 [dlm]
Jul 26 23:52:51 inf18 kernel:
Jul 26 23:52:51 inf18 kernel: stack backtrace:
Jul 26 23:52:51 inf18 kernel:  [<c1005e4a>] show_trace_log_lvl+0x1a/0x2f
Jul 26 23:52:51 inf18 kernel:  [<c10063fc>] show_trace+0x12/0x14
Jul 26 23:52:51 inf18 kernel:  [<c1006480>] dump_stack+0x16/0x18
Jul 26 23:52:51 inf18 kernel:  [<c1037321>] 
print_unlock_inbalance_bug+0xec/0xf9
Jul 26 23:52:51 inf18 kernel:  [<c10381e6>] 
lock_release_non_nested+0x95/0x150
Jul 26 23:52:51 inf18 kernel:  [<c10383ed>] lock_release+0x14c/0x189
Jul 26 23:52:51 inf18 kernel:  [<c1033bfc>] up_write+0x16/0x2b
Jul 26 23:52:51 inf18 kernel:  [<ee67b874>] dlm_recoverd+0x265/0x433 [dlm]
Jul 26 23:52:52 inf18 kernel:  [<c1030e47>] kthread+0xb3/0xdc
Jul 26 23:52:52 inf18 kernel:  [<c1005967>] kernel_thread_helper+0x7/0x10
Jul 26 23:52:52 inf18 kernel:  =======================
Jul 26 23:52:52 inf18 kernel: dlm: rgmanager: recover 1 done: 4 ms
Jul 26 23:53:00 inf18 clurgmgrd[2949]: <notice> Resource Group Manager 
Starting

-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From pcaulfie at redhat.com  Fri Sep 28 07:59:15 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 28 Sep 2007 08:59:15 +0100
Subject: [Linux-cluster] Re: [Cluster-devel] Bug on dlm
In-Reply-To: <46FCB152.2040001@cesca.es>
References: <46FCB152.2040001@cesca.es>
Message-ID: <46FCB453.407@redhat.com>

Jordi Prats wrote:
> Hi,
> I've found this while starting my server. It's a F7 with the latest
> version avaliable.
>
> Hope this helps :)
>
> Jordi
>
> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: recover 1
> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: add member 2
> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: total members 1 error 0
> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory
> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory 0
> entries
> Jul 26 23:52:51 inf18 kernel:
> Jul 26 23:52:51 inf18 kernel: =====================================
> Jul 26 23:52:51 inf18 kernel: [ BUG: bad unlock balance detected! ]
> Jul 26 23:52:51 inf18 kernel: -------------------------------------
> Jul 26 23:52:51 inf18 kernel: dlm_recoverd/2963 is trying to release
> lock (&ls->ls_in_recovery) at:
> Jul 26 23:52:51 inf18 kernel: [<ee67b874>] dlm_recoverd+0x265/0x433 [dlm]
> Jul 26 23:52:51 inf18 kernel: but there are no more locks to release!
> Jul 26 23:52:51 inf18 kernel:

Yeah, we know about it. It's not actually a bug, just the lockdep checking code
being a little over-enthusiastic. Unfortunately there aren't any annotations
available to make it quiet either.

The trick is to live with it, or to use kernels that have a little less
debugging compiled in, which you would want to do for production anyway :)


Patrick


From jobot at wmdata.com  Fri Sep 28 09:12:40 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Fri, 28 Sep 2007 11:12:40 +0200
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F346@WMRI000167.corp.wmdata.net><20070924161012.GA14880@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
Message-ID: <CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>

Anyone with an idea why a "sleep 30" is needed for fenced to be able to join the fence group properly?

Even though this workaround appears to work it would be nice to have a more solid solution. Since now I will need to remember to patch the init script every time it's updated.

Regards,
Jonas

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Borgstr?m Jonas
Sent: den 24 september 2007 19:29
To: David Teigland
Cc: linux clustering
Subject: RE: [Linux-cluster] Possible cman init script race condition

From: David Teigland [mailto:teigland at redhat.com]
Sent: den 24 september 2007 18:10
To: Borgstr?m Jonas
Cc: linux clustering
Subject: Re: [Linux-cluster] Possible cman init script race condition
*snip*
> > 1190645596 node "prod-db1" not a cman member, cn 1
> > 1190645597 node "prod-db1" not a cman member, cn 1
> > 1190645598 node "prod-db1" not a cman member, cn 1
> > 1190645599 node "prod-db1" not a cman member, cn 1 1190645600 reduce 
> > victim prod-db1 1190645600 delay of 16s leaves 0 victims 1190645600 
> > finish default 1 1190645600 stop default 1190645600 start default 2 
> > members 1 2 1190645600 do_recovery stop 1 start 2 finish 1
> 
> I think something has gone wrong here, either in groupd or fenced, 
> that's preventing this start from finishing (we don't get a 'finish default 2'
> which we expect).  A 'group_tool -v' here should show the state of the 
> fence group still in transition.  Could you run that, plus a 
> 'group_tool dump' at this point, in addition to the 'dump fence' you 
> have.  And please run those commands on both nodes.
> 
Hi david, thanks for your fast response. Here's the output you requested:

[root at prod-db1 ~]# group_tool -v
type             level name     id       state node id local_done
fence            0     default  00010001 JOIN_START_WAIT 2 200020001 1
[1 2]

[root at prod-db2 ~]# group_tool -v
type             level name     id       state node id local_done
fence            0     default  00010002 JOIN_START_WAIT 1 100020001 1
[1 2]

I attached "group_tool dump" output as files, since they are quite long.

> > 1190645954 client 3: dump    <--- Before killing prod-db1
> > 1190645985 stop default
> > 1190645985 start default 3 members 2
> > 1190645985 do_recovery stop 2 start 3 finish 1
> > 1190645985 finish default 3
> > 1190646008 client 3: dump    <--- After killing prod-db1
> 
> Node 1 isn't fenced here because it never completed joining the fence 
> group above.
>
> > The scary part is that as far as I can tell fenced is the only cman 
> > daemon being affected by this. So your cluster appears to work fine. 
> > But when a node needs to be fenced the operation it isn't carried 
> > out and that can cause gfs filesystem corruption.
>
> You shouldn't be able to mount gfs on the node where joining the fence 
> group is stuck.

My current setup is very stripped down so I haven't configured gfs. But on my original setup where I initially noticed this issue I had no problem mounting gfs filesystems and after a simulated network failure I could still continue to write to the filesystem from both nodes since no node was fenced, and that quickly corrupted the filesystem.

Regards,
Jonas


From jprats at cesca.es  Fri Sep 28 09:50:49 2007
From: jprats at cesca.es (Jordi Prats)
Date: Fri, 28 Sep 2007 11:50:49 +0200
Subject: [Linux-cluster] Different views
Message-ID: <46FCCE79.3070804@cesca.es>

Hi,
I'm getting a strange error: On one node I cannot see the other one, but 
on the other I can see both online. Any one can help me with this?

I'm getting a lot of problems setting up this version of RH cluster (the 
one with openais).

Thanks,

Here I paste some status messages:

[root at inf17 ~]# clustat
Member Status: Inquorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  inf17                                 1 Online, Local
  inf18                                 2 Offline
  inf19                                 3 Offline


[root at inf18 ~]# clustat
Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  inf17                                 1 Online
  inf18                                 2 Online, Local
  inf19                                 3 Offline


[root at inf17 ~]# group_tool
type             level name       id       state
fence            0     default    00010001 JOIN_START_WAIT
[1]
dlm              1     rgmanager  00020001 JOIN_ALL_STOPPED
[1]

[root at inf18 ~]# group_tool
type             level name       id       state
fence            0     default    00000000 JOIN_STOP_WAIT
[1 2]
dlm              1     rgmanager  00010002 JOIN_START_WAIT
[2]

[root at inf17 ~]# cman_tool status
Version: 6.0.1
Config Version: 4
Cluster Name: boumort
Cluster Id: 13356
Cluster Member: Yes
Cluster Generation: 3824
Membership state: Cluster-Member
Nodes: 1
Expected votes: 2
Total votes: 1
Quorum: 2 Activity blocked
Active subsystems: 7
Flags:
Ports Bound: 0
Node name: inf17
Node ID: 1
Multicast addresses: 239.192.52.96
Node addresses: 192.168.22.17


[root at inf18 ~]# cman_tool status
Version: 6.0.1
Config Version: 4
Cluster Name: boumort
Cluster Id: 13356
Cluster Member: Yes
Cluster Generation: 3820
Membership state: Cluster-Member
Nodes: 2
Expected votes: 2
Total votes: 2
Quorum: 2
Active subsystems: 7
Flags:
Ports Bound: 0 177
Node name: inf18
Node ID: 2
Multicast addresses: 239.192.52.96
Node addresses: 192.168.22.18


-- 
......................................................................
         __
        / /          Jordi Prats
  C E / S / C A      Dept. de Sistemes
      /_/            Centre de Supercomputaci? de Catalunya

  Gran Capit?, 2-4 (Edifici Nexus) ? 08034 Barcelona
  T. 93 205 6464 ? F.  93 205 6979 ? jprats at cesca.es
...................................................................... 


From jprats at cesca.es  Fri Sep 28 13:37:31 2007
From: jprats at cesca.es (Jordi Prats)
Date: Fri, 28 Sep 2007 15:37:31 +0200
Subject: [Linux-cluster] Re: [Cluster-devel] Bug on dlm
In-Reply-To: <46FCB453.407@redhat.com>
References: <46FCB152.2040001@cesca.es> <46FCB453.407@redhat.com>
Message-ID: <46FD039B.2060302@cesca.es>

Hi,
This bug could be causing this?


[root at inf17 ~]# clustat
Member Status: Inquorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  inf17                                 1 Online, Local
  inf18                                 2 Offline
  inf19                                 3 Offline


[root at inf18 ~]# clustat
Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  inf17                                 1 Online
  inf18                                 2 Online, Local
  inf19                                 3 Offline


[root at inf17 ~]# group_tool
type             level name       id       state
fence            0     default    00010001 JOIN_START_WAIT
[1]
dlm              1     rgmanager  00020001 JOIN_ALL_STOPPED
[1]

[root at inf18 ~]# group_tool
type             level name       id       state
fence            0     default    00000000 JOIN_STOP_WAIT
[1 2]
dlm              1     rgmanager  00010002 JOIN_START_WAIT
[2]

[root at inf17 ~]# cman_tool status
Version: 6.0.1
Config Version: 4
Cluster Name: boumort
Cluster Id: 13356
Cluster Member: Yes
Cluster Generation: 3824
Membership state: Cluster-Member
Nodes: 1
Expected votes: 2
Total votes: 1
Quorum: 2 Activity blocked
Active subsystems: 7
Flags:
Ports Bound: 0
Node name: inf17
Node ID: 1
Multicast addresses: 239.192.52.96
Node addresses: 192.168.22.17


[root at inf18 ~]# cman_tool status
Version: 6.0.1
Config Version: 4
Cluster Name: boumort
Cluster Id: 13356
Cluster Member: Yes
Cluster Generation: 3820
Membership state: Cluster-Member
Nodes: 2
Expected votes: 2
Total votes: 2
Quorum: 2
Active subsystems: 7
Flags:
Ports Bound: 0 177
Node name: inf18
Node ID: 2
Multicast addresses: 239.192.52.96
Node addresses: 192.168.22.18


Patrick Caulfield wrote:
> Jordi Prats wrote:
>> Hi,
>> I've found this while starting my server. It's a F7 with the latest
>> version avaliable.
>>
>> Hope this helps :)
>>
>> Jordi
>>
>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: recover 1
>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: add member 2
>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: total members 1 error 0
>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory
>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory 0
>> entries
>> Jul 26 23:52:51 inf18 kernel:
>> Jul 26 23:52:51 inf18 kernel: =====================================
>> Jul 26 23:52:51 inf18 kernel: [ BUG: bad unlock balance detected! ]
>> Jul 26 23:52:51 inf18 kernel: -------------------------------------
>> Jul 26 23:52:51 inf18 kernel: dlm_recoverd/2963 is trying to release
>> lock (&ls->ls_in_recovery) at:
>> Jul 26 23:52:51 inf18 kernel: [<ee67b874>] dlm_recoverd+0x265/0x433 [dlm]
>> Jul 26 23:52:51 inf18 kernel: but there are no more locks to release!
>> Jul 26 23:52:51 inf18 kernel:
> 
> Yeah, we know about it. It's not actually a bug, just the lockdep checking code
> being a little over-enthusiastic. Unfortunately there aren't any annotations
> available to make it quiet either.
> 
> The trick is to live with it, or to use kernels that have a little less
> debugging compiled in, which you would want to do for production anyway :)
> 
> 
> Patrick
> 
> 


From pcaulfie at redhat.com  Fri Sep 28 13:51:19 2007
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 28 Sep 2007 14:51:19 +0100
Subject: [Linux-cluster] Re: [Cluster-devel] Bug on dlm
In-Reply-To: <46FD039B.2060302@cesca.es>
References: <46FCB152.2040001@cesca.es> <46FCB453.407@redhat.com>
	<46FD039B.2060302@cesca.es>
Message-ID: <46FD06D7.6020809@redhat.com>

Jordi Prats wrote:
> Hi,
> This bug could be causing this?
> 
> 
> [root at inf17 ~]# clustat
> Member Status: Inquorate
> 
>  Member Name                        ID   Status
>  ------ ----                        ---- ------
>  inf17                                 1 Online, Local
>  inf18                                 2 Offline
>  inf19                                 3 Offline
> 
> 
> [root at inf18 ~]# clustat
> Member Status: Quorate
> 
>  Member Name                        ID   Status
>  ------ ----                        ---- ------
>  inf17                                 1 Online
>  inf18                                 2 Online, Local
>  inf19                                 3 Offline
> 
> 
> [root at inf17 ~]# group_tool
> type             level name       id       state
> fence            0     default    00010001 JOIN_START_WAIT
> [1]
> dlm              1     rgmanager  00020001 JOIN_ALL_STOPPED
> [1]
> 
> [root at inf18 ~]# group_tool
> type             level name       id       state
> fence            0     default    00000000 JOIN_STOP_WAIT
> [1 2]
> dlm              1     rgmanager  00010002 JOIN_START_WAIT
> [2]


No, that's misconfgured fencing.

> [root at inf17 ~]# cman_tool status
> Version: 6.0.1
> Config Version: 4
> Cluster Name: boumort
> Cluster Id: 13356
> Cluster Member: Yes
> Cluster Generation: 3824
> Membership state: Cluster-Member
> Nodes: 1
> Expected votes: 2
> Total votes: 1
> Quorum: 2 Activity blocked
> Active subsystems: 7
> Flags:
> Ports Bound: 0
> Node name: inf17
> Node ID: 1
> Multicast addresses: 239.192.52.96
> Node addresses: 192.168.22.17
> 
> 
> [root at inf18 ~]# cman_tool status
> Version: 6.0.1
> Config Version: 4
> Cluster Name: boumort
> Cluster Id: 13356
> Cluster Member: Yes
> Cluster Generation: 3820
> Membership state: Cluster-Member
> Nodes: 2
> Expected votes: 2
> Total votes: 2
> Quorum: 2
> Active subsystems: 7
> Flags:
> Ports Bound: 0 177
> Node name: inf18
> Node ID: 2
> Multicast addresses: 239.192.52.96
> Node addresses: 192.168.22.18
> 
> 
> Patrick Caulfield wrote:
>> Jordi Prats wrote:
>>> Hi,
>>> I've found this while starting my server. It's a F7 with the latest
>>> version avaliable.
>>>
>>> Hope this helps :)
>>>
>>> Jordi
>>>
>>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: recover 1
>>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: add member 2
>>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: total members 1 error 0
>>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory
>>> Jul 26 23:52:51 inf18 kernel: dlm: rgmanager: dlm_recover_directory 0
>>> entries
>>> Jul 26 23:52:51 inf18 kernel:
>>> Jul 26 23:52:51 inf18 kernel: =====================================
>>> Jul 26 23:52:51 inf18 kernel: [ BUG: bad unlock balance detected! ]
>>> Jul 26 23:52:51 inf18 kernel: -------------------------------------
>>> Jul 26 23:52:51 inf18 kernel: dlm_recoverd/2963 is trying to release
>>> lock (&ls->ls_in_recovery) at:
>>> Jul 26 23:52:51 inf18 kernel: [<ee67b874>] dlm_recoverd+0x265/0x433
>>> [dlm]
>>> Jul 26 23:52:51 inf18 kernel: but there are no more locks to release!
>>> Jul 26 23:52:51 inf18 kernel:
>>
>> Yeah, we know about it. It's not actually a bug, just the lockdep
>> checking code
>> being a little over-enthusiastic. Unfortunately there aren't any
>> annotations
>> available to make it quiet either.
>>
>> The trick is to live with it, or to use kernels that have a little less
>> debugging compiled in, which you would want to do for production
>> anyway :)
>>
>>
>> Patrick
>>
>>
> 


-- 
Patrick

Registered Address: Red Hat UK Ltd, Amberley Place, 107-111 Peascod Street,
Windsor, Berkshire, SL4 ITE, UK.
Registered in England and Wales under Company Registration No. 3798903


From teigland at redhat.com  Fri Sep 28 14:27:30 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 28 Sep 2007 09:27:30 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
Message-ID: <20070928142730.GA7239@redhat.com>

On Fri, Sep 28, 2007 at 11:12:40AM +0200, Borgstr?m Jonas wrote:
> Anyone with an idea why a "sleep 30" is needed for fenced to be able to
> join the fence group properly?
> 
> Even though this workaround appears to work it would be nice to have a
> more solid solution. Since now I will need to remember to patch the init
> script every time it's updated.

We never got to the bottom of what the problem is AFAIK.


> > > 1190645954 client 3: dump    <--- Before killing prod-db1
> > > 1190645985 stop default
> > > 1190645985 start default 3 members 2
> > > 1190645985 do_recovery stop 2 start 3 finish 1
> > > 1190645985 finish default 3
> > > 1190646008 client 3: dump    <--- After killing prod-db1
> > 
> > Node 1 isn't fenced here because it never completed joining the fence 
> > group above.

This is the problem we need to debug.  Here's what I suggested before to
do that:

"A 'group_tool -v' here should show the state of the fence group still in
transition.  Could you run that, plus a 'group_tool dump' at this point,
in addition to the 'dump fence' you have.  And please run those commands
on both nodes."

Dave


From jobot at wmdata.com  Fri Sep 28 14:48:18 2007
From: jobot at wmdata.com (=?iso-8859-1?Q?Borgstr=F6m_Jonas?=)
Date: Fri, 28 Sep 2007 16:48:18 +0200
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <20070928142730.GA7239@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
Message-ID: <CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>

From: David Teigland [mailto:teigland at redhat.com] 
Sent: den 28 september 2007 16:28
To: Borgstr?m Jonas
Cc: linux clustering
Subject: Re: [Linux-cluster] Possible cman init script race condition
> This is the problem we need to debug.  Here's what I suggested before to
> do that:
>
> "A 'group_tool -v' here should show the state of the fence group still in
> transition.  Could you run that, plus a 'group_tool dump' at this point,
> in addition to the 'dump fence' you have.  And please run those commands
> on both nodes."
>
I must have misunderstood you or something, but didn't I already include that info in the message I sent a few days ago?

http://permalink.gmane.org/gmane.linux.redhat.cluster/9999

(The archive inlines the "group_tool dump" output making it a bit hard to read, but hopefully your email client shows them as attachments).

Regards,
Jonas


From teigland at redhat.com  Fri Sep 28 14:58:18 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 28 Sep 2007 09:58:18 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
Message-ID: <20070928145818.GB7239@redhat.com>

On Fri, Sep 28, 2007 at 04:48:18PM +0200, Borgstr?m Jonas wrote:
> I must have misunderstood you or something, but didn't I already include
> that info in the message I sent a few days ago?
> 
> http://permalink.gmane.org/gmane.linux.redhat.cluster/9999
> 
> (The archive inlines the "group_tool dump" output making it a bit hard
> to read, but hopefully your email client shows them as attachments).

I missed that, I'll take a look, thanks.
Dave


From simone.gotti at email.it  Fri Sep 28 16:14:14 2007
From: simone.gotti at email.it (Simone Gotti)
Date: Fri, 28 Sep 2007 18:14:14 +0200
Subject: [Linux-cluster] Some ideas on changes to the lvm.sh agent (or new
	agents).
Message-ID: <1190996054.5802.30.camel@localhost>

Hi,

Trying to use a non cluster vg in redhat cluster I noticed that lvm.sh,
to avoid metadata corruption, is forcing the need of only one lv per vg.

I was thinking that other clusters don't have this limitation as they
let you just use a vg only on one node at a time (and also on one
service group at a time).

To test if this was possible with lvm2 I made little changes to lvm.sh
(just variables renames, use of vgchange instead of lvchange for tag
adding) and using the same changes needed to /etc/lvm/lvm.conf
(volume_list = [ "rootvgname", "@my_hostname" ]) looks like this idea
was working.

I can activate the vg and all of its volume only on the node with the vg
tagged with its hostname and the start on the other nodes is refused.

Now, will this idea be accepted? If so these are a list of possible
needed changes and other ideas:

*) Make <parameter name="vg_name" required="1"> also unique="1" or
better primary="1" and remove the parameter "name" as only one service
can use a vg.

*) What vg_status should do?
  
a) Monitor all the LVs
	or
b) Check only the VG and use ANOTHER resource agent for every lv used by
the cluster? So I can create/remove/modify lvs on that vg that aren't
under rgmanager control without any error reported by the status
functions of the lvm.sh agent.
Also other clusters distinguish between vg and lv and they have 2
different agents for them. 

Creating two new agents will also leave the actual lvm.sh without
changes and keep backward compatibility for who is already using it.

Something like this (lets call lvm_vg and lvm_lv respectively the agents
for the vg and the lv):

<service name="foo">
  <lvm_vg vgname="vg01">
     <lvm_lv lvname="lv01/>
     <lvm_lv lvname="lv01/>
     <script .... />
  </lvm_vg>
</service>


*) Another problem that is present just now is that lvm should be
changed to avoid any operation on a non activable vg or lv. In these
days you cannot be able to start a vg/lv as its not tagged with the
hostname but you can remove/resize it without any problem. :D

I'll be happy to provide patches if these idea are accepted!

Thanks!
Bye!

-- 
Simone Gotti
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070928/3abd513a/attachment.sig>

From teigland at redhat.com  Fri Sep 28 16:45:47 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 28 Sep 2007 11:45:47 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <20070928145818.GB7239@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
Message-ID: <20070928164547.GC7239@redhat.com>

On Fri, Sep 28, 2007 at 09:58:18AM -0500, David Teigland wrote:
> On Fri, Sep 28, 2007 at 04:48:18PM +0200, Borgstr?m Jonas wrote:
> > I must have misunderstood you or something, but didn't I already include
> > that info in the message I sent a few days ago?
> > 
> > http://permalink.gmane.org/gmane.linux.redhat.cluster/9999
> > 
> > (The archive inlines the "group_tool dump" output making it a bit hard
> > to read, but hopefully your email client shows them as attachments).
> 
> I missed that, I'll take a look, thanks.

You've hit a known bug that's been fixed:
  https://bugzilla.redhat.com/show_bug.cgi?id=251966

We may have to move up the release of that fix since people are seeing the
problem.  Be careful when reading that bz because there's a lot of
incorrect diagnosis that was recorded before we figured out what the real
bug was.  Here's the problem, it's very complex:

1. when the nodes start up, they each form a 1-node openais cluster
   independent of the other

   [This shouldn't really happen, but in reality we can't prevent it
    100% of the time.  We try to make it rare, and then deal with it
    sensibly on the rare occasion when it does happen.  You've hit
    the "rare" occasion -- if you're actually seeing this regularly
    then we probably need to fix or adjust something at the openais
    level to make it less common.]

2. fence_tool join is run on each node which creates group state in both
   clusters

3. The two clusters then merge together.  We could handle this merging
   *if* there had been no group activity yet (in this case from fenced).
   But, in this case, divergent group state exists in the two clusters
   that we can't combine.  Cman (above openais) should recognize this [*]
   and continue to treat the nodes separately, even though openais has
   merged them together.

   [*] In RHEL5.0, cman/groupd are *not* smart enough to recognize this.
   The fix in bz 251966 makes cman/groupd recognize this condition by
   introducing a "dirty flag".  What you observe, is groupd trying to
   merge the divergent state, getting confused and stuck.

   After the bug is fixed, what you should observe is the two nodes will
   stay separate (in cman) and will try to fence each other.  One will
   win the fencing race and reboot the other.  When the rebooted node
   returns, it should properly join the existing cluster.

Dave


From teigland at redhat.com  Fri Sep 28 17:03:09 2007
From: teigland at redhat.com (David Teigland)
Date: Fri, 28 Sep 2007 12:03:09 -0500
Subject: [Linux-cluster] Possible cman init script race condition
In-Reply-To: <20070928164547.GC7239@redhat.com>
References: <CABF801D13AA444988E62B7AF62C371D0217F37A@WMRI000167.corp.wmdata.net>
	<CABF801D13AA444988E62B7AF62C371D021D735F@WMRI000167.corp.wmdata.net>
	<20070928142730.GA7239@redhat.com>
	<CABF801D13AA444988E62B7AF62C371D02230545@WMRI000167.corp.wmdata.net>
	<20070928145818.GB7239@redhat.com>
	<20070928164547.GC7239@redhat.com>
Message-ID: <20070928170309.GD7239@redhat.com>

On Fri, Sep 28, 2007 at 11:45:47AM -0500, David Teigland wrote:
> On Fri, Sep 28, 2007 at 09:58:18AM -0500, David Teigland wrote:
> > On Fri, Sep 28, 2007 at 04:48:18PM +0200, Borgstr?m Jonas wrote:
> > > I must have misunderstood you or something, but didn't I already include
> > > that info in the message I sent a few days ago?
> > > 
> > > http://permalink.gmane.org/gmane.linux.redhat.cluster/9999
> > > 
> > > (The archive inlines the "group_tool dump" output making it a bit hard
> > > to read, but hopefully your email client shows them as attachments).
> > 
> > I missed that, I'll take a look, thanks.
> 
> You've hit a known bug that's been fixed:
>   https://bugzilla.redhat.com/show_bug.cgi?id=251966
> 
> We may have to move up the release of that fix since people are seeing the
> problem.  Be careful when reading that bz because there's a lot of
> incorrect diagnosis that was recorded before we figured out what the real
> bug was.  Here's the problem, it's very complex:
> 
> 1. when the nodes start up, they each form a 1-node openais cluster
>    independent of the other
> 
>    [This shouldn't really happen, but in reality we can't prevent it
>     100% of the time.  We try to make it rare, and then deal with it
>     sensibly on the rare occasion when it does happen.  You've hit
>     the "rare" occasion -- if you're actually seeing this regularly
>     then we probably need to fix or adjust something at the openais
>     level to make it less common.]

I'd try to use some sleeps here, before running fence_tool join on either
node, as a work-around.  We're trying to get both nodes merged together
before they do anything else.

Also, how often are you seeing the nodes not merge together right away?
If it's frequent, then we need to fix that.

> 2. fence_tool join is run on each node which creates group state in both
>    clusters
> 
> 3. The two clusters then merge together.  We could handle this merging
>    *if* there had been no group activity yet (in this case from fenced).
>    But, in this case, divergent group state exists in the two clusters
>    that we can't combine.  Cman (above openais) should recognize this [*]
>    and continue to treat the nodes separately, even though openais has
>    merged them together.
> 
>    [*] In RHEL5.0, cman/groupd are *not* smart enough to recognize this.
>    The fix in bz 251966 makes cman/groupd recognize this condition by
>    introducing a "dirty flag".  What you observe, is groupd trying to
>    merge the divergent state, getting confused and stuck.

There's no work-around once you've gotten to this point.

>    After the bug is fixed, what you should observe is the two nodes will
>    stay separate (in cman) and will try to fence each other.  One will
>    win the fencing race and reboot the other.  When the rebooted node
>    returns, it should properly join the existing cluster.