[Linux-cluster] qdisk timeouts

Michael Pye michael at ulimit.org
Wed Nov 9 20:23:41 UTC 2011


2 node cluster with quorum disk running rhcs on rh5.7.

Attempting to debug why we get the following qdisk issues:
Nov  7 16:24:09 host532 qdiskd[5750]:  qdiskd: write (system call) has 
hung for 13 seconds
Nov  7 16:24:09 host532 qdiskd[5750]:  In 14 more seconds, we will be 
evicted
Nov  7 16:24:11 host532 openais[5711]: [CMAN ] lost contact with quorum 
device
Nov  7 16:24:29 host532 openais[5711]: [CMAN ] cman killed by node 1 
because we were killed by cman_tool or other application

And the node where the time out happens is usually fenced. Is this 
something to do with my qdisk interval/ko timings of:
quorumd interval="3" label="qdisk" min_score="1" tko="9" votes="1

cluster.conf attached.

Thanks
Michael
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: host532_cluster.conf.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20111109/40102290/attachment.txt>


More information about the Linux-cluster mailing list