[Linux-cluster] Service Recovery Failure
Scott Becker
scottb at bxwa.com
Tue Nov 27 17:43:03 UTC 2007
In my case (no GFS), the lack of fencing is only a problem if the
failing NIC fades in and out. The larger problem in real operation
is the lack of service recovery. When I plugged the public NIC back in,
the node was rejected from the cluster, and only then was the service
relocated (late).
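
One way to check whether fenced logged anything beyond the membership notice is to group the log lines by daemon. Below is a minimal Python sketch, assuming the lines have the bare "daemon[pid]: message" shape of the excerpt quoted below (real syslog lines carry a timestamp and host prefix that would have to be stripped first). The names messages_by_daemon and SAMPLE are made up for illustration and are not part of any cluster tool.

```python
import re
from collections import defaultdict

# Assumed line shape: "daemon[pid]: message" or "daemon: message",
# as in the log excerpt quoted in this thread (no timestamp/host prefix).
LINE_RE = re.compile(r"^(?P<daemon>[\w.-]+)(?:\[(?P<pid>\d+)\])?:\s*(?P<msg>.*)$")


def messages_by_daemon(lines):
    """Group log messages by the daemon name at the start of each line."""
    grouped = defaultdict(list)
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            grouped[match.group("daemon")].append(match.group("msg"))
    return dict(grouped)


# A few lines copied from the quoted log, with the wrapped fenced line rejoined.
SAMPLE = [
    "openais[9498]: [CLM ] CLM CONFIGURATION CHANGE",
    "kernel: dlm: closing connection to node 3",
    "fenced[9568]: 205.234.65.133 not a cluster member after 0 sec post_fail_delay",
    "openais[9498]: [TOTEM] entering OPERATIONAL state.",
]

fenced_msgs = messages_by_daemon(SAMPLE).get("fenced", [])
for msg in fenced_msgs:
    print(msg)
```

For these sample lines the only fenced entry is the membership notice; nothing in them shows a fence agent being run.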
scottb
Scott Becker wrote:
>
>
> Lon Hohberger wrote:
>> On Mon, 2007-11-26 at 14:36 -0800, Scott Becker wrote:
>>
>>
>>> openais[9498]: [CLM ] CLM CONFIGURATION CHANGE
>>> openais[9498]: [CLM ] New Configuration:
>>> kernel: dlm: closing connection to node 3
>>> fenced[9568]: 205.234.65.133 not a cluster member after 0 sec post_fail_delay
>>> openais[9498]: [CLM ] r(0) ip(205.234.65.132)
>>> openais[9498]: [CLM ] Members Left:
>>> openais[9498]: [CLM ] r(0) ip(205.234.65.133)
>>> openais[9498]: [CLM ] Members Joined:
>>> openais[9498]: [CLM ] CLM CONFIGURATION CHANGE
>>> openais[9498]: [CLM ] New Configuration:
>>> openais[9498]: [CLM ] r(0) ip(205.234.65.132)
>>> openais[9498]: [CLM ] Members Left:
>>> openais[9498]: [CLM ] Members Joined:
>>> openais[9498]: [SYNC ] This node is within the primary component and will provide service.
>>> openais[9498]: [TOTEM] entering OPERATIONAL state.
>>> openais[9498]: [CLM ] got nodejoin message 205.234.65.132
>>> openais[9498]: [CPG ] got joinlist message from node 2
>>>
>>
>> Did it even try to run the fence_apc agent? It should have done
>> *something* - it didn't even look like it tried to fence.
>>
>> -- Lon
>>
>>
> No sign of an attempt. How do I turn up the verbosity of fenced? I'll
> repeat the test. The only mention I can find is -D but I don't know
> how I can use that. I'll browse the source and see if I can learn
> anything. I'm using 2.0.73.
>
> thanks
> scottb