[Linux-cluster] Physical shutdown of one node causes both nodes to crash in active/passive configuration of 2 node RHEL cluster

Amjad Syed amjadcsu at gmail.com
Wed Sep 10 08:28:00 UTC 2014


On Tue, Sep 9, 2014 at 11:53 AM, Digimer <lists at alteeve.ca> wrote:

> On 09/09/14 03:14 AM, Amjad Syed wrote:
>
>> <device lanplus = "" name="inspuripmi"  action ="reboot"/>
>>
>
> Something is breaking the network during the shutdown, so a fence is being
> called and each node is killing the other, causing a dual fence. So you
> have a set of problems, I think.
>
> First, disable acpid on both nodes.
>
> Second, change the quoted line (only) to:
>
> <device lanplus = "" name="inspuripmi" delay="15" action ="reboot"/>
>
> If I am right, this will mean that 192.168.10.10 will stay up and fence .11.
>
> Third, what bonding mode are you using? I would only use mode=1.
>
> Fourth, please set the node names to match 'uname -n' on both nodes. Be
> sure the names translate to the IPs you want (via /etc/hosts, ideally).
>
> Fifth, as Sivaji suggested, please put switch(es) between the nodes.
>
> If it still tries to fence when a node shuts down (watch /var/log/messages
> and look for 'fencing node ...'), please paste your logs from both nodes.
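
For the log check above, here is a minimal Python sketch that pulls the
'fencing node' lines out of /var/log/messages so they are easy to paste from
both nodes. The function names are made up for illustration, and it only
matches the literal phrase quoted above; it does not interpret the rest of
the line.

```python
from typing import Iterable, List

# Phrase quoted in the advice above; any line containing it is reported.
FENCE_MARKER = "fencing node"


def find_fence_events(lines: Iterable[str]) -> List[str]:
    """Return the log lines that mention a fence attempt, newline stripped."""
    return [line.rstrip("\n") for line in lines if FENCE_MARKER in line]


def scan_log(path: str = "/var/log/messages") -> List[str]:
    """Read a syslog file and return only the fence-related lines."""
    # errors="replace" keeps the scan going if the log contains odd bytes.
    with open(path, "r", errors="replace") as handle:
        return find_fence_events(handle)
```

Calling print("\n".join(scan_log())) on each node, typically as root since
the log is usually not world-readable, should list any fence attempts made
around the time of the shutdown.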
>
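
On the fourth point, a rough way to confirm that the name reported by
'uname -n' appears as a clusternode name and resolves to an IP. This is a
sketch only: it assumes the configuration lives in /etc/cluster/cluster.conf
(the usual location, not stated in this thread), and the function names are
hypothetical.

```python
import os
import socket
import xml.etree.ElementTree as ET
from typing import List, Tuple


def _names(root: ET.Element) -> List[str]:
    """Collect the name attribute of every <clusternode> element."""
    return [node.get("name", "") for node in root.iter("clusternode")]


def cluster_node_names(cluster_conf_xml: str) -> List[str]:
    """Return the node names found in a cluster.conf given as text."""
    return _names(ET.fromstring(cluster_conf_xml))


def check_local_node(
    conf_path: str = "/etc/cluster/cluster.conf",  # assumed path
) -> Tuple[str, bool, str]:
    """Return (local name, whether it is a clusternode name, resolved IP)."""
    local_name = os.uname().nodename  # same value 'uname -n' prints
    names = _names(ET.parse(conf_path).getroot())
    # Raises socket.gaierror if the name does not resolve at all.
    address = socket.gethostbyname(local_name)
    return local_name, local_name in names, address
```

If the second value comes back False, the name in cluster.conf differs from
'uname -n'. The returned address can then be compared by eye with the IP the
cluster is meant to use.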
> --
> Digimer
> Papers and Projects: https://alteeve.ca/w/
> What if the cure for cancer is trapped in the mind of a person without
> access to education?
>