[Linux-cluster] Service Recovery Failure

Lon Hohberger lhh at redhat.com
Tue Nov 27 14:28:58 UTC 2007


On Mon, 2007-11-26 at 14:36 -0800, Scott Becker wrote:

> openais[9498]: [CLM  ] CLM CONFIGURATION CHANGE
> openais[9498]: [CLM  ] New Configuration:
> kernel: dlm: closing connection to node 3
> fenced[9568]: 205.234.65.133 not a cluster member after 0 sec post_fail_delay
> openais[9498]: [CLM  ]     r(0) ip(205.234.65.132)
> openais[9498]: [CLM  ] Members Left:
> openais[9498]: [CLM  ]     r(0) ip(205.234.65.133)
> openais[9498]: [CLM  ] Members Joined:
> openais[9498]: [CLM  ] CLM CONFIGURATION CHANGE
> openais[9498]: [CLM  ] New Configuration:
> openais[9498]: [CLM  ]     r(0) ip(205.234.65.132)
> openais[9498]: [CLM  ] Members Left:
> openais[9498]: [CLM  ] Members Joined:
> openais[9498]: [SYNC ] This node is within the primary component and will provide service.
> openais[9498]: [TOTEM] entering OPERATIONAL state.
> openais[9498]: [CLM  ] got nodejoin message 205.234.65.132
> openais[9498]: [CPG  ] got joinlist message from node 2

Did it even try to run the fence_apc agent?  fenced should have done
*something* after the node left, but from this log it doesn't look like
it even tried to fence.
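
One quick way to check is to pull out only the fenced lines from the log
and see whether anything beyond the "not a cluster member" notice shows
up.  The following is a rough sketch, not a definitive tool: the helper
names (messages_from, mentions_fence_attempt) are made up for
illustration, and the "fencing node" keyword is an assumption about what
fenced logs when it starts a fence operation, so adjust it to whatever
your version actually prints.

```python
import re

# Matches "daemon[pid]: message", optionally preceded by a mail quote marker.
LINE_RE = re.compile(
    r"^(?:>\s*)?(?P<daemon>[\w.-]+)\[(?P<pid>\d+)\]:\s*(?P<msg>.*)$"
)


def messages_from(daemon, lines):
    """Return the message text of every log line emitted by `daemon`."""
    found = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match and match.group("daemon") == daemon:
            found.append(match.group("msg"))
    return found


def mentions_fence_attempt(messages, keywords=("fencing node",)):
    """True if any message contains one of the keywords.

    The default keyword is an assumption about fenced's wording,
    not something confirmed by the log quoted in this thread.
    """
    return any(k in msg.lower() for msg in messages for k in keywords)


# Three lines taken from the log quoted above.
sample = [
    "> kernel: dlm: closing connection to node 3",
    "> fenced[9568]: 205.234.65.133 not a cluster member after 0 sec post_fail_delay",
    "> openais[9498]: [TOTEM] entering OPERATIONAL state.",
]

fenced_msgs = messages_from("fenced", sample)
for msg in fenced_msgs:
    print(msg)
print("fence attempt logged:", mentions_fence_attempt(fenced_msgs))
```

Run against the full /var/log/messages rather than the excerpt, this
would show whether fenced logged anything after noticing the node was
gone.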

-- Lon



