[Linux-cluster] node fenced by dlm_controld on a clean shutdown

Wed Nov 21 10:19:02 UTC 2012

Jacek Konieczny napsal(a):
> On Mon, Nov 19, 2012 at 10:16:48AM +0100, Jacek Konieczny wrote:
>> It goes like that:
>> - resources using the shared storage are properly stopped by Pacemaker.
>> - DRBD is cleanly demoted and unconfigured by Pacemaker
>> - Pacemaker cleanly exits
>> - CLVMD is stopped.
>> – dlm_controld is stopped
>> – corosync is being stopped
>>
>> and at this point the node is fenced (rebooted) by the dlm_controld on
>> the other node. I would expect it continue with a clean shutdown.
>>
>> Any idea how to debug/fix it?
>> Is this '541 cpg_dispatch error 9' the problem?
> 
> I found a workaround: I have added a 10 seconds pause between
> dlm_controld and corosync shutdown. The node shuts down cleanly now (is
> not fenced). '541 cpg_dispatch error 9' is still there in the logs,
> though.
> 
> Greets,
>         Jacek
> 

Hi,
we've discussed this problem with dave, but I would like to get some
information:
- What distro are you using?
- Packages are compiled or disro?
- what you mean by "clean shutdown"? This is something like service
dlm_control stop, or your own script?

Thanks,
  Honza