[Linux-cluster] all nodes halt when one lose connection

Jonathan Brassow jbrassow at redhat.com
Thu May 21 15:01:39 UTC 2009


On May 21, 2009, at 9:57 AM, ESGLinux wrote:

> Hello,
>
> these are the logs I get:
>
> In node1:
>
> May 21 11:33:44 NODE1 fenced[3840]: NODE2 not a cluster member after  
> 5 sec post_fail_delay
> May 21 11:33:44 NODE1 fenced[3840]: fencing node "NODE2"
> May 21 11:33:44 NODE1 shutdown[5448]: shutting down for system halt
>
> in node2:
>
> May 21 11:33:45 NODE2 fenced[3843]: NODE1 not a cluster member after  
> 5 sec post_fail_delay
> May 21 11:33:45 NODE2 fenced[3843]: fencing node "NODE1"
> May 21 11:33:45 NODE2 shutdown[5923]: shutting down for system halt
>
>
> what I don´t know is way they lose the connection with the cluster,  
> they are still connected (I only unplug a cable from the service  
> network)

That may be something worth chasing down, as it appears that your  
cluster communication is on a network you don't expect?

Also, are the nodes simply "shutting down", or are they being forcibly  
rebooted.  If it is a casual shutdown, then it would appear that both  
nodes are trying to shutdown simultaneously.

  brassow




More information about the Linux-cluster mailing list