[Linux-cluster] Problem in clvmd/dlm_recoverd
Nuno Fernandes
npf-mlists at eurotux.com
Fri Nov 14 21:53:13 UTC 2008
On Friday 14 November 2008 16:26:49 David Teigland wrote:
> On Fri, Nov 14, 2008 at 10:00:13AM +0000, Nuno Fernandes wrote:
> > 22236 [dlm_recoverd] dlm_wait_function
> > 25097 [dlm_recoverd] dlm_wait_function
>
> dlm recovery appears to be stuck; this is usually due to a problem at the
> network level. The recovery seems to be caused by a node starting clvmd.
Hi,
I don't know if it helps, but groupd is using all available CPU, but only in 2
of the nodes.
I don't know if it's required to be up.. but we've disabled IPV6..
snip of modprobe.conf:
alias net-pf-10 off
Best regards,
./npf
>
> sysrq-t backtraces from all the nodes could confirm some of this, and
> adding <dlm log_debug="1"/> to cluster.conf would give us more information
> the next time it happens.
>
> Dave
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20081114/871ed8e3/attachment.htm>
More information about the Linux-cluster
mailing list