[Linux-cluster] Cluster rejoin problem - 4U3, two node cluster

mehmet celik bsd_daemon at msn.com
Wed Jul 4 09:45:44 UTC 2007


hi tomas,

when you do restart. which services run on the node1 ???

>node2 logs following message:
>
>kernel: CMAN: removing node node1 from the cluster : Missed too many 
>heartbeats

when network problem, you get this error.

>kernel: CMAN: too many transition restarts - will die
>kernel: CMAN: we are leaving the cluster. Inconsistent cluster view
>kernel: WARNING: dlm_emergency_shutdown
>clurgmgrd[2848]: <warning> #67: Shutting down uncleanly
>kernel: WARNING: dlm_emergency_shutdown
>kernel: SM: 00000001 sm_stop: SG still joined
>kernel: SM: 01000003 sm_stop: SG still joined
>kernel: SM: 03000002 sm_stop: SG still joined
>ccsd[2242]: Cluster is not quorate.  Refusing connection.
>ccsd[2242]: Error while processing connect: Connection refused
>ccsd[2242]: Invalid descriptor specified (-111).
>ccsd[2242]: Someone may be attempting something evil.
>ccsd[2242]: Error while processing get: Invalid request descriptor
>ccsd[2242]: Invalid descriptor specified (-111).
>ccsd[2242]: Someone may be attempting something evil.
>ccsd[2242]: Error while processing get: Invalid request descriptor
>ccsd[2242]: Invalid descriptor specified (-21).
>
>and again ~1 minute later on node1:
>
>kernel: CMAN: removing node node2 from the cluster : No response to 
>messages
>kernel: ------------[ cut here ]------------
>kernel: kernel BUG at
>/usr/src/build/714635-i686/BUILD/cman-kernel-2.6.9-43/smp/src/membership.c:3150!
>kernel: invalid operand: 0000 [#1]
>kernel: SMP

i thing this error a bug. did you check this error from bugzilla ?

_________________________________________________________________
Local listings, incredible imagery, and driving directions - all in one 
place! http://maps.live.com/?wip=69&FORM=MGAC01




More information about the Linux-cluster mailing list