[Linux-cluster] two node cluster, 2nd node hangs in join

Patrick Caulfield pcaulfie at redhat.com
Wed May 4 10:03:58 UTC 2005


Dan B. Phung wrote:
> Hello, hopefully someone has ran into this and it's a quick fix. I'm using
> a vanilla 2.6.9 kernel and the newest (as of tonite)  cvs branch from
> -rRHEL4.  My sequence is to startup ccsd on both nodes, and then I try to
> have both of them join (with a brief wait before I have the 2nd one try).
> Here's what I get from the cman_tool's view of the nodes.
> 
> phung # cman_tool nodes
> Node  Votes Exp Sts  Name
>    3    1    1   J   blade03
>    4    1    1   M   blade04
> 
> and in /var/log/messages, I see this:
>   CMAN: sending membership request
> 
> followed by many:
>   last message repeated 7 times
> 
> In addition I ran a tcpdump, and there seem to be UDP packets flying
> around from node to node, using port 6809, so the network seems fine.
> How would I debug this further?  What kinds of tools are people using
> to debug their config/setup?
>  

It looks like the return join-ack messages are not arriving at blade03. Check you don't have a firewall active that would block
these. cman uses unicast as well as multicast packets so you need to enable both.


-- 

patrick




More information about the Linux-cluster mailing list