[Linux-cluster] Manual fencing doesn't work

Thai Duong thaidn at gmail.com
Mon Apr 3 10:30:16 UTC 2006


Hi all,

I have a two-node GFS 6.1 cluster with the following configuration:

<?xml version="1.0"?>
<cluster name="fccrac" config_version="5">

    <cman two_node="1" expected_votes="1">
    </cman>

    <clusternodes>
      <clusternode name="fcc1" votes="1">
       <fence>
        <method name="single">
         <device name="human" nodename="fcc1"/>
        </method>
       </fence>
      </clusternode>

      <clusternode name="fcc4" votes="1">
       <fence>
        <method name="single">
         <device name="human" nodename="fcc4"/>
        </method>
       </fence>
      </clusternode>
   </clusternodes>

  <fence_devices>
   <fence_device name="human" agent="fence_manual"/>
  </fence_devices>

 </cluster>

It turns out that manual fencing doesn't work as expected. When I force a
power-down of one node, the other node cannot fence it and, worse, the whole
GFS file system freezes, waiting for the downed node to come back up. I get
something like the following in the log:

Apr  2 16:46:28 fcc1 fenced[3444]: fencing node "fcc4"
Apr  2 16:46:28 fcc1 fenced[3444]: fence "fcc4" failed
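
One thing that may be worth checking (an assumption, not something the log
above confirms): to my knowledge the cluster.conf schema spells the fence
device section as <fencedevices> and <fencedevice>, without underscores. With
the element names used in the configuration above, fenced may not find a
device called "human" at all, which would be consistent with the immediate
"failed" message. A sketch of that section with the other spelling, everything
else unchanged:

  <fencedevices>
   <fencedevice name="human" agent="fence_manual"/>
  </fencedevices>

If that is the cause, config_version would also need to be increased when the
file is updated. Even then, with fence_manual the surviving node is expected
to wait for an explicit acknowledgement (fence_ack_manual -n fcc4) once the
failed node has really been reset.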

Some information about GFS and kernel:

[root@fcc1 ~]# rpm -qa | grep GFS
GFS-6.1.3-0
GFS-kernel-2.6.9-45.0.2

[root@fcc1 ~]# uname -a
Linux fcc1 2.6.9-22.0.2.EL #1 SMP Thu Jan 5 17:04:58 EST 2006 ia64 ia64 ia64 GNU/Linux

Please help.

TIA,

Thai Duong.