[Linux-cluster] problem with rejoining a node

David Teigland teigland at redhat.com
Thu Aug 11 03:45:14 UTC 2005


On Wed, Aug 10, 2005 at 01:45:26PM +0200, Javi Polo wrote:
> I've made a script that, prior to starting any of the cluster
> infrastructure, enables his SAN port.

I'm not sure if this is related to the rest.

> I can then join the cluster, but when I try to join the fence, it locks
> up there ... :
> 
> gfstest1:~# cman_tool services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           0   2 join      S-1,80,3
> []

it's waiting to join the fence domain, the others won't let him yet...

> gfstest1:~# cman_tool nodes
> Node  Votes Exp Sts  Name
>    1    1    3   M   gfstest1
>    2    1    3   M   gfstest2
>    3    1    3   M   gfstest3
> 
> from other nodes, I see it as recovering:
> gfstest2:/etc/init.d# cman_tool services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 recover 2 -
> [2 3]

These two appear to be trying to fence gfstest1, but the fencing operation
hasn't completed.  They won't let anyone join the domain until they
finish.  You could check /var/log/messages on 2&3 for any fencing messages
or errors.

Dave




More information about the Linux-cluster mailing list