[Linux-cluster] OpenAIS hangs at shutdown

Steven Dake sdake at redhat.com
Mon Aug 23 23:25:04 UTC 2010


On 08/23/2010 11:15 AM, Andrés Mauricio Mujica Zalamea wrote:
>
> Hi, I'm having an issue with OpenAIS after an update from RHEL 5.3 to
> 5.5.
>
> Since the update, the openais service hangs when I try to stop it. It
> gets stuck on the no connection error, and the only way to shut down
> the server is by force.
>
> These are some relevant logs...
>
>
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] The token was lost in
> the OPERATIONAL state.
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Receive multicast
> socket recv buffer size (320000 bytes).
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Transmit multicast
> socket send buffer size (262142 bytes).
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering GATHER state
> from 2.
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Creating commit token
> because I am the rep.
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Saving state aru 4d
> high seq received 4d
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Storing new sequence id
> for ring 240
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering COMMIT state.
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering RECOVERY
> state.
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] position [0] member
> 10.117.157.135:
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] previous ring seq 572
> rep 10.117.157.135
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] aru 4d high delivered
> 4d received flag 1
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] position [1] member
> 10.117.157.136:
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] previous ring seq 572
> rep 10.117.157.135
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] aru 4d high delivered
> 4d received flag 1
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Did not need to
> originate any messages in recovery.
> Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Sending initial ORF
> token
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] CLM CONFIGURATION
> CHANGE
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] New Configuration:
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.135)
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.136)
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] Members Left:
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] Members Joined:
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] CLM CONFIGURATION
> CHANGE
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] New Configuration:
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.135)
> Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.136)
> Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] Members Left:
> Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] Members Joined:
> Aug 18 23:45:40 cluster01 openais[3370]: [SYNC ] This node is within the
> primary component and will provide service.
> Aug 18 23:45:40 cluster01 openais[3370]: [TOTEM] entering OPERATIONAL
> state.
> Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] got nodejoin message
> 10.117.157.135
> Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] got nodejoin message
> 10.117.157.136
> Aug 18 23:45:40 cluster01 openais[3370]: [CPG ] got joinlist message
> from node 2
> Aug 18 23:45:40 cluster01 openais[3370]: [CPG ] got joinlist message
> from node 1
>
>

Which rpm version are you using?

https://bugzilla.redhat.com/show_bug.cgi?id=566467 may be relevant here;
it was fixed in openais-0.80.6-16.el5.
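
To check whether an installed build is at or past that fixed one, a rough
sketch along these lines may help. It assumes the installed
version-release string has been obtained separately, for example with
rpm -q --qf '%{VERSION}-%{RELEASE}\n' openais. The comparison is a
simplified approximation of RPM's ordering, not the real rpmvercmp, and
the helper names (compare_vr, has_fix) and the sample strings in the
usage lines are hypothetical.

```python
import re

# Build named in this thread as carrying the fix for bug 566467.
FIXED = "0.80.6-16.el5"


def _segments(text):
    """Split a version or release string into runs of digits or letters."""
    return re.findall(r"\d+|[A-Za-z]+", text)


def _cmp_part(a, b):
    """Compare two version (or release) strings segment by segment.

    Simplified rules (an assumption, not the full RPM algorithm):
    numeric segments compare as integers, alphabetic ones as strings,
    a numeric segment sorts after an alphabetic one, and when all
    shared segments are equal the string with more segments is newer.
    """
    sa, sb = _segments(a), _segments(b)
    for x, y in zip(sa, sb):
        if x.isdigit() and y.isdigit():
            xi, yi = int(x), int(y)
            if xi != yi:
                return -1 if xi < yi else 1
        elif x.isdigit() != y.isdigit():
            return 1 if x.isdigit() else -1
        elif x != y:
            return -1 if x < y else 1
    return (len(sa) > len(sb)) - (len(sa) < len(sb))


def compare_vr(a, b):
    """Return -1, 0, or 1 for two 'version-release' strings."""
    va, _, ra = a.partition("-")
    vb, _, rb = b.partition("-")
    return _cmp_part(va, vb) or _cmp_part(ra, rb)


def has_fix(installed):
    """True if the installed build is the fixed build or a later one."""
    return compare_vr(installed, FIXED) >= 0


if __name__ == "__main__":
    # Hypothetical sample strings, for illustration only.
    for sample in ("0.80.3-22.el5", "0.80.6-8.el5", "0.80.6-16.el5"):
        print(sample, "has fix" if has_fix(sample) else "predates fix")
```

If has_fix() returns False for the installed build, updating openais
would be the first thing to try before digging further into the logs.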

Thanks
-steve



