[Linux-cluster] Cluster of XEN guests unstable when rebooting a node under CS5.1

Paolo Marini paolom at prisma-eng.it
Wed Dec 12 18:23:54 UTC 2007


I am repeating my request for help, hoping someone has run into (and 
ideally solved) the same issues.

I am building a cluster of Xen guests, each with its root file system 
residing in a file on a GFS file system (actually on iSCSI).

Each cluster node mounts a GFS file system residing on an iSCSI device.

For performance reasons, both the iSCSI device and the physical nodes 
(which themselves form a cluster) use two Gigabit Ethernet links bonded 
with LACP. On the physical machines, I had to insert a sleep 30 in the 
/etc/init.d/iscsi script before the iSCSI login to wait for the bond 
interface to come up; otherwise the iSCSI devices are not seen and no 
GFS mount is possible.
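
A less fragile alternative to the fixed sleep would be to poll until the 
bond is actually up. Below is a minimal sketch in Python 3, assuming the 
interface is called bond0 and that the kernel exposes its state in 
/sys/class/net/<iface>/operstate. Both are assumptions, and the helper 
name wait_for_interface is made up for illustration.

```python
import time
from pathlib import Path


def wait_for_interface(name="bond0", timeout=30.0, interval=1.0,
                       sysfs_root="/sys/class/net"):
    """Poll the interface's operstate file until it reads 'up'.

    Returns True as soon as the interface is up, or False if the
    timeout (in seconds) expires first. The interface name and the
    sysfs path are assumptions, not taken from the actual setup.
    """
    state_file = Path(sysfs_root) / name / "operstate"
    deadline = time.monotonic() + timeout
    while True:
        try:
            if state_file.read_text().strip() == "up":
                return True
        except OSError:
            # The interface may not exist yet this early in boot.
            pass
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

The init script could call something like this in place of the sleep and 
skip the iSCSI login if it returns False. It keeps the same 30-second 
upper bound but returns as soon as the bond is ready.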

As for the cluster of Xen guests, they work fine: I am able to migrate 
each one to a different physical node without problems on the guest.

When I reboot or fence one of the guests, the guest cluster breaks, i.e. 
quorum is dissolved and I have to fence ALL the nodes and reboot them 
before the cluster will restart.

Could this be caused by the Xen bridge going down and up for longer than 
the heartbeat timeout?

One other problem: my cluster of physical machines is built with 3 nodes. 
If I start 2 of them and form the cluster (with one node missing), 
everything works fine. Then I switch on the third node. As soon as that 
node comes up and cman starts, the quorum of the physical cluster is 
dissolved.
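
For reference, here is the vote arithmetic as I understand it. This is a 
sketch only, assuming one vote per node and the usual simple-majority 
rule (half the expected votes, rounded down, plus one). The function 
names are made up for illustration.

```python
def quorum_threshold(expected_votes):
    """Votes needed for quorum under a simple-majority rule (assumed)."""
    return expected_votes // 2 + 1


def is_quorate(votes_present, expected_votes):
    """True if the members present hold enough votes for quorum."""
    return votes_present >= quorum_threshold(expected_votes)


# Three nodes, one vote each (assumed): two votes are enough.
assert quorum_threshold(3) == 2
assert is_quorate(2, 3)      # two nodes up, one missing
assert is_quorate(3, 3)      # third node joins
assert not is_quorate(1, 3)  # a single node on its own
```

By this count, two nodes out of three are quorate and a third joining 
node can only add votes, so the vote count alone would not explain the 
loss of quorum.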

I hope someone has run into the same issues and can lend a hand.

Paolo