[Linux-cluster] all nodes halt when one lose connection
Jonathan Brassow
jbrassow at redhat.com
Thu May 21 15:01:39 UTC 2009
On May 21, 2009, at 9:57 AM, ESGLinux wrote:
> Hello,
>
> these are the logs I get:
>
> In node1:
>
> May 21 11:33:44 NODE1 fenced[3840]: NODE2 not a cluster member after
> 5 sec post_fail_delay
> May 21 11:33:44 NODE1 fenced[3840]: fencing node "NODE2"
> May 21 11:33:44 NODE1 shutdown[5448]: shutting down for system halt
>
> in node2:
>
> May 21 11:33:45 NODE2 fenced[3843]: NODE1 not a cluster member after
> 5 sec post_fail_delay
> May 21 11:33:45 NODE2 fenced[3843]: fencing node "NODE1"
> May 21 11:33:45 NODE2 shutdown[5923]: shutting down for system halt
>
>
> what I don't know is why they lose the connection with the cluster;
> they are still connected (I only unplugged a cable from the service
> network)
That may be worth chasing down, as it appears that your cluster
communication is running over a network you don't expect.
Also, are the nodes simply "shutting down", or are they being forcibly
rebooted? If it is a graceful shutdown, then it would appear that both
nodes are trying to shut down simultaneously.
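
One rough way to check which network the cluster traffic is on is to
resolve each node name and see which subnet the address falls in. The
sketch below assumes, as is usual with cman, that the node name in
cluster.conf resolves to the address used for cluster communication.
The subnet names and ranges, and the helper names, are hypothetical
placeholders and not taken from your setup.

```python
import ipaddress
import socket

# Hypothetical subnets; replace with the real service and cluster networks.
NETWORKS = {
    "service": ipaddress.ip_network("192.168.1.0/24"),
    "cluster": ipaddress.ip_network("10.0.0.0/24"),
}


def network_for(address, networks=NETWORKS):
    """Return the name of the first subnet containing address, or None."""
    ip = ipaddress.ip_address(address)
    for name, net in networks.items():
        if ip in net:
            return name
    return None


def network_for_host(hostname, networks=NETWORKS):
    """Resolve a node name (e.g. as written in cluster.conf) and classify it."""
    return network_for(socket.gethostbyname(hostname), networks)
```

Running network_for_host("NODE1") on each node (with the real subnets
filled in) should tell you whether the node names point at the service
network; if they do, unplugging that cable would cut cluster
communication as well.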
brassow