[Linux-cluster] Initiating transition message

Thu Feb 21 19:13:05 UTC 2008

While this explains the situation somewhat, I was trying to bring a
bit more clarity to the problem (without examining th esource).  What
exactly is happening when a 'initiates transition'?

Shawn

On Thu, Feb 21, 2008 at 3:30 AM, Christine Caulfield
<ccaulfie at redhat.com> wrote:
>
> Shawn Hood wrote:
>  > Though one instance of 'Initating transition' message seems to be
>  > normal , what could the behavior shown in the following log indicate?
>  > What exactly is happening during an 'Initating transition' message?
>  >
>  > Shawn
>  >
>  > Feb 14 15:25:55 odin kernel: CMAN: Initiating transition, generation 7
>  > Feb 14 15:26:01 odin kernel: CMAN: removing node munin from the
>  > cluster : No response to messages
>  > Feb 14 15:26:01 odin kernel: CMAN: Initiating transition, generation 8
>  > Feb 14 15:26:16 odin kernel: CMAN: Initiating transition, generation 9
>  > Feb 14 15:26:31 odin kernel: CMAN: Initiating transition, generation 10
>  > Feb 14 15:26:40 odin su(pam_unix)[20082]: session opened for user root
>  > by shood(uid=0)
>  > Feb 14 15:26:46 odin kernel: CMAN: Initiating transition, generation 11
>  > Feb 14 15:27:01 odin kernel: CMAN: Initiating transition, generation 12
>  > Feb 14 15:27:16 odin kernel: CMAN: Initiating transition, generation 13
>  > Feb 14 15:27:31 odin kernel: CMAN: Initiating transition, generation 14
>  > Feb 14 15:27:46 odin kernel: CMAN: Initiating transition, generation 15
>  > Feb 14 15:28:01 odin kernel: CMAN: Initiating transition, generation 16
>  > Feb 14 15:28:16 odin kernel: CMAN: Initiating transition, generation 17
>  > Feb 14 15:28:31 odin kernel: CMAN: Initiating transition, generation 18
>  > Feb 14 15:28:46 odin kernel: CMAN: too many transition restarts - will die
>  > Feb 14 15:28:46 odin kernel: CMAN: we are leaving the cluster.
>  > Inconsistent cluster view
>  > Feb 14 15:28:46 odin kernel: WARNING: dlm_emergency_shutdown
>  > Feb 14 15:28:46 odin kernel: WARNING: dlm_emergency_shutdown
>  > Feb 14 15:28:46 odin kernel: SM: 00000002 sm_stop: SG still joined
>  > Feb 14 15:28:46 odin kernel: SM: 01000004 sm_stop: SG still joined
>  > Feb 14 15:28:46 odin kernel: SM: 02000014 sm_stop: SG still joined
>  > Feb 14 15:28:46 odin ccsd[17392]: Cluster manager shutdown.  Attemping
>
>
>  The usual cause of all those messages (not that it's usual!) is network
>  problems. Often a one-way connection can cause it, eg the node can send
>  messages but not receive them. There are pathological iptables rules
>  that can make that happen too.
>
>  It's hard to be specific without knowing more, but I would investigate
>  the network connections, routers/switches and routing/iptables rules
>
>  Chrissie
>
>  --
>  Linux-cluster mailing list
>  Linux-cluster at redhat.com
>  https://www.redhat.com/mailman/listinfo/linux-cluster
>

-- 
Shawn Hood
(910) 670-1819 Mobile