[Linux-cluster] two node cluster, 2nd node hangs in join
pcaulfie at redhat.com
Wed May 4 10:03:58 UTC 2005
Dan B. Phung wrote:
> Hello, hopefully someone has ran into this and it's a quick fix. I'm using
> a vanilla 2.6.9 kernel and the newest (as of tonite) cvs branch from
> -rRHEL4. My sequence is to startup ccsd on both nodes, and then I try to
> have both of them join (with a brief wait before I have the 2nd one try).
> Here's what I get from the cman_tool's view of the nodes.
> phung # cman_tool nodes
> Node Votes Exp Sts Name
> 3 1 1 J blade03
> 4 1 1 M blade04
> and in /var/log/messages, I see this:
> CMAN: sending membership request
> followed by many:
> last message repeated 7 times
> In addition I ran a tcpdump, and there seem to be UDP packets flying
> around from node to node, using port 6809, so the network seems fine.
> How would I debug this further? What kinds of tools are people using
> to debug their config/setup?
It looks like the return join-ack messages are not arriving at blade03. Check you don't have a firewall active that would block
these. cman uses unicast as well as multicast packets so you need to enable both.
More information about the Linux-cluster