[Linux-cluster] OpenAIS hangs at shutdown

Andrés Mauricio Mujica Zalamea andres.mujica at seaq.com.co
Mon Aug 23 18:15:04 UTC 2010


Hi, I'm having an issue with OpenAIS after an update from RHEL 5.3 to
5.5.

Since the update, the openais service hangs when I try to stop it.
openais gets stuck on a "no connection" error, and the only way to
shut down the server is by force.

These are some relevant logs...


Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] The token was lost in the OPERATIONAL state.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Receive multicast socket recv buffer size (320000 bytes).
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes).
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering GATHER state from 2.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Creating commit token because I am the rep.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Saving state aru 4d high seq received 4d
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Storing new sequence id for ring 240
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering COMMIT state.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering RECOVERY state.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] position [0] member 10.117.157.135:
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] previous ring seq 572 rep 10.117.157.135
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] aru 4d high delivered 4d received flag 1
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] position [1] member 10.117.157.136:
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] previous ring seq 572 rep 10.117.157.135
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] aru 4d high delivered 4d received flag 1
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Did not need to originate any messages in recovery.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Sending initial ORF token
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] CLM CONFIGURATION CHANGE
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] New Configuration:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.135)
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.136)
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] Members Left:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] Members Joined:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] CLM CONFIGURATION CHANGE
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] New Configuration:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.135)
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.136)
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] Members Left:
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] Members Joined:
Aug 18 23:45:40 cluster01 openais[3370]: [SYNC ] This node is within the primary component and will provide service.
Aug 18 23:45:40 cluster01 openais[3370]: [TOTEM] entering OPERATIONAL state.
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] got nodejoin message 10.117.157.135
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] got nodejoin message 10.117.157.136
Aug 18 23:45:40 cluster01 openais[3370]: [CPG ] got joinlist message from node 2
Aug 18 23:45:40 cluster01 openais[3370]: [CPG ] got joinlist message from node 1
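
In case it helps anyone reading the excerpt above, here is a rough sketch of how the TOTEM state transitions could be pulled out of a log like this one. It is only an illustration: the function names parse_records and totem_states are made up for this example, and the regular expression assumes the syslog layout seen in this excerpt (timestamp, host, openais[pid], bracketed service tag). It also rejoins lines that a mail client has wrapped, treating any line without a leading timestamp as a continuation of the previous record.

```python
import re

# Start of a syslog record as it appears in the excerpt above, e.g.
# "Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering COMMIT state."
# Assumption: every openais record follows this layout.
RECORD = re.compile(
    r"^(?P<ts>[A-Z][a-z]{2} +\d+ \d{2}:\d{2}:\d{2}) (?P<host>\S+) "
    r"openais\[(?P<pid>\d+)\]: \[(?P<svc>[A-Z]+) *\] (?P<msg>.*)$"
)


def parse_records(text):
    """Join mail-wrapped continuation lines and return one dict per record."""
    records = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        match = RECORD.match(line)
        if match:
            records.append(match.groupdict())
        elif records:
            # No timestamp: treat the line as the tail of the previous record.
            records[-1]["msg"] = (records[-1]["msg"] + " " + line).strip()
    return records


def totem_states(records):
    """Return (timestamp, state) for each 'entering <STATE> state' message."""
    transitions = []
    for record in records:
        if record["svc"] != "TOTEM":
            continue
        match = re.search(r"entering (\w+) state", record["msg"])
        if match:
            transitions.append((record["ts"], match.group(1)))
    return transitions
```

Run against the excerpt above, totem_states(parse_records(log_text)) lists the sequence GATHER, COMMIT, RECOVERY, OPERATIONAL, which makes it easier to see that the ring re-formed with both members about one second after the token was lost.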



-- 
Andrés Mauricio Mujica Zalamea <andres.mujica at seaq.com.co>
SEAQ SERVICIOS CIA LTDA



