[Linux-cluster] RH Cluster doesn't pass basic acceptance tests - bug in fenced?

Miroslav Zubcic mvz+rhcluster at nimium.hr
Fri Jan 12 12:41:14 UTC 2007


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Josef Whiter wrote:

> This isn't a bug, its working as expected.

IT People from the central bank doesn't think like that. I cannot blame
them, because it is strange to me, and to anybody who has seen this RH
cluster behaviour.

> What you need in qdisk, set it up
> with the proper hueristics and it will force the shutdown of the bad node before
> the bad node has a chance to fence off the working node.

This is just a workaround for lack of communication between clurgmgrd and
fenced daemons, where first is aware of ethernet/network failure and is
trying to disable active service, and fenced which is fencing other node
without any good reason, because it doesn't know that it's node is faulty one.

I have even better workaround (one bonding with native data ethernet and
tagged vlan for fence subnet) for this silly behaviour, but I will really
like to see this thing fixed, because people are laughing on us when
testing our cluster configurations (we are configuring Red Hat machines
and clusters).


- --
Miroslav Zubcic, Nimium d.o.o., email: <mvz at nimium.hr>
Tel: +385 01 4852 639, Fax: +385 01 4852 640, Mobile: +385 098 942 8672
Mrazoviceva 12, 10000 Zagreb, Hrvatska

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iQEVAwUBRaeB6sqzT+8/3SzgAQLkLQf9EMUnXY86JAttXzmIp9DwyHoWl6mB908k
kYVgSTjIUWMMVVnAEgKxnKlVqUnhzmtMtHhkoMs+08D0QDxAl4/T/i0iAF9RwC/9
0jKPCB+rOYvdoKyg9s1yg6ic2DFi4cs0kVd+WbhLvmNd3Q70ATCzSRc1k7aySQ/N
0x8Wn0Mg+4aPAJFBEM1XafYbvOQAvABgX9aRJXH1aS9LVn4sQRMiwcosCR/fZTLH
EWTUlJiKgWQDEKyE4QsoxeOXu290VlISv8Rqx3IHCeAfMiEa1tdVs9/9wUndbqO9
ui3e9l8KrCoI8mJW1YIjHEUY1p7H2X9rT3pm88TDHkf0XA4lnirYlw==
=BxUi
-----END PGP SIGNATURE-----




More information about the Linux-cluster mailing list