[Linux-cluster] ccsd problems after update to RHEL 5.2/5.3
gordan at bobich.net
Thu Mar 12 00:49:13 UTC 2009
I have a two-node cluster and ever since I updated the kernel and
cluster components I cannot get more than one node running with GFS.
Here are the package versions I have:
Node 2 starts up OK, but I see this in the syslog:
node2 ccsd: Unable to perform sendto: Cannot assign requested address
When I power up node2, it just gets strange and the whole thing locks up:
node2 openais: [CMAN ] cman killed by node 1 because we rejoined
the cluster without a full restart
node2 groupd: cman_get_nodes error -1 104
node2 gfs_controld: groupd_dispatch error -1 errno 11
node2 gfs_controld: groupd connection died
node2 gfs_controld: cluster is down, exiting
So for some reason node 1's joining makes node 2 get kicked out of the
cluster - but worse, it doesn't seem to initiate fencing. Instead, the
whole cluster just locks up on GFS access.
What am I missing? What should I be looking for in the logs? This
cluster worked fine before the update.
I found this:
but updating cman to 2.0.98 as per the RHBA didn't fix the problem.
More information about the Linux-cluster