[Linux-cluster] problem with rejoining a node
David Teigland
teigland at redhat.com
Thu Aug 11 03:45:14 UTC 2005
On Wed, Aug 10, 2005 at 01:45:26PM +0200, Javi Polo wrote:
> I've made a script that, prior to starting any of the cluster
> infrastructure, enables his SAN port.
I'm not sure if this is related to the rest.
> I can then join the cluster, but when I try to join the fence, it locks
> up there ... :
>
> gfstest1:~# cman_tool services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           0   2 join      S-1,80,3
> []
It's waiting to join the fence domain; the other nodes won't let it in yet.
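If you want to watch that state from a script rather than by eye, something
like the following might do. It is only a sketch: fence_domain_state is a
made-up helper name, and it assumes the column layout of the cman_tool
services output quoted in this thread (State in the sixth whitespace-separated
field, domain name without spaces).

```shell
# Hypothetical helper: read `cman_tool services` output on stdin and print
# the first word of the State column for the fence domain line.
# Assumes the layout quoted in this thread:
#   Fence Domain: "default" <GID> <LID> <State> <Code>
fence_domain_state() {
    awk '/^Fence Domain:/ { print $6 }'
}
```

Usage would be `cman_tool services | fence_domain_state`; for the gfstest1
output quoted above it prints `join`.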
> gfstest1:~# cman_tool nodes
> Node  Votes Exp Sts  Name
>    1    1    3   M   gfstest1
>    2    1    3   M   gfstest2
>    3    1    3   M   gfstest3
>
> from other nodes, I see it as recovering:
> gfstest2:/etc/init.d# cman_tool services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 recover 2 -
> [2 3]
These two appear to be trying to fence gfstest1, but the fencing operation
hasn't completed. They won't let any node join the domain until it
finishes. Check /var/log/messages on gfstest2 and gfstest3 for fencing
messages or errors.
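A rough way to pull those lines out, run on each of the two nodes. This is a
sketch only: fence_log_lines is a made-up helper name, and the pattern "fenc"
(to catch "fence", "fenced" and "fencing") is a guess at how the relevant
messages are worded, so widen it if nothing shows up.

```shell
# Hypothetical helper: print lines that mention fencing from a syslog file.
# Defaults to /var/log/messages; the case-insensitive pattern "fenc" is an
# assumption about the wording of the fence daemon's messages.
fence_log_lines() {
    logfile="${1:-/var/log/messages}"
    grep -i 'fenc' "$logfile"
}
```

For example, `fence_log_lines | tail -n 20` shows the most recent matches.
grep exits non-zero when nothing matches, which can be used to tell an empty
result from a real one.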
Dave