[Linux-cluster] Service Recovery Failure

Scott Becker scottb at bxwa.com
Tue Nov 27 17:43:03 UTC 2007


Lack of fencing in my case (without GFS) is only a problem if the 
failing NIC fades in and out. The larger problem during real operation 
is the lack of service recovery. I plugged the public nic back in and it 
was rejected as a node and then the service was relocated (late).

    scottb


Scott Becker wrote:
>
>
> Lon Hohberger wrote:
>> On Mon, 2007-11-26 at 14:36 -0800, Scott Becker wrote:
>>
>>   
>>> openais[9498]: [CLM  ] CLM CONFIGURATION CHANGE
>>> openais[9498]: [CLM  ] New Configuration:
>>> kernel: dlm: closing connection to node 3
>>> fenced[9568]: 205.234.65.133 not a cluster member after 0 sec 
>>> post_fail_delay
>>> openais[9498]: [CLM  ]     r(0) ip(205.234.65.132)
>>> openais[9498]: [CLM  ] Members Left:
>>> openais[9498]: [CLM  ]     r(0) ip(205.234.65.133)
>>> openais[9498]: [CLM  ] Members Joined:
>>> openais[9498]: [CLM  ] CLM CONFIGURATION CHANGE
>>> openais[9498]: [CLM  ] New Configuration:
>>> openais[9498]: [CLM  ]     r(0) ip(205.234.65.132)
>>> openais[9498]: [CLM  ] Members Left:
>>> openais[9498]: [CLM  ] Members Joined:
>>> openais[9498]: [SYNC ] This node is within the primary component and 
>>> will provide service.
>>> openais[9498]: [TOTEM] entering OPERATIONAL state.
>>> openais[9498]: [CLM  ] got nodejoin message 205.234.65.132
>>> openais[9498]: [CPG  ] got joinlist message from node 2
>>>     
>>
>> Did it even try to run the fence_apc agent?  It should have done
>> *something* - it didn't even look like it tried to fence.
>>
>> -- Lon
>>
>>   
> No sign of an attempt. How do I turn up the verbosity of fenced? I'll 
> repeat the test. The only mention I can find is -D but I don't know 
> how I can use that. I'll browse the source and see if I can learn 
> anything. I'm using 2.0.73.
>
>     thanks
>     scottb
>
>
>
>
> ------------------------------------------------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071127/a170f3f5/attachment.htm>


More information about the Linux-cluster mailing list