[Linux-cluster] Recovering from "telling LM to withdraw"

Jeff Sturm jeff.sturm at eprize.com
Thu Jul 2 03:45:00 UTC 2009


> -----Original Message-----
> From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Abhijith Das
> Sent: Wednesday, July 01, 2009 12:43 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Recovering from "telling LM to withdraw"
> 
> https://bugzilla.redhat.com/show_bug.cgi?id=471258
> 
> The assert+withdraw you're seeing seems to be this bug above. I've
> tried to recreate this on my cluster and failed. If you have a recipe
> to create this, could you please post it to the bugzilla?

Thank you for the link.  I'm not confident I can easily reproduce this
yet, as we've had months of continuous uptime without such an incident.

However, if I do learn more about the circumstances leading up to our
crash, I'll certainly post information to the bugzilla page.

In the meantime I'll see if I can install a nagios agent to scan logs
for any GFS problems.  The sooner we know about it, the faster we can
recover if this happens again.
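
A rough sketch of what such a log check might look like, as a
Nagios-style plugin in Python.  The log path is passed on the command
line, and the only pattern matched is the "telling LM to withdraw"
string from the subject of this thread; both the function names and the
message wording are made up for illustration, and other GFS error
strings would need to be added by hand.

```python
#!/usr/bin/env python3
"""Sketch of a Nagios-style check for GFS withdraw messages in a log file."""
import sys

# Standard Nagios plugin exit codes.
OK, CRITICAL, UNKNOWN = 0, 2, 3

# Assumption: this is the only string we look for; extend as needed.
PATTERN = "telling LM to withdraw"


def find_withdraws(lines, pattern=PATTERN):
    """Return every line (newline stripped) that contains the pattern."""
    return [line.rstrip("\n") for line in lines if pattern in line]


def check(lines):
    """Map the scan result to a (exit_code, status_message) pair."""
    hits = find_withdraws(lines)
    if hits:
        return CRITICAL, "GFS CRITICAL: %d withdraw message(s), last: %s" % (
            len(hits), hits[-1])
    return OK, "GFS OK: no withdraw messages found"


def main(path):
    """Scan the log at `path`, print one status line, return the exit code."""
    try:
        with open(path, errors="replace") as log:
            code, message = check(log)
    except OSError as exc:
        code, message = UNKNOWN, "GFS UNKNOWN: cannot read %s: %s" % (path, exc)
    print(message)
    return code


# Only act as a plugin when a log path is given on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(main(sys.argv[1]))
```

Note that this rescans the whole file on every run, so it would keep
alerting on an old withdraw message until the log rotates.  A real
check would probably want to remember its last read offset.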

-Jeff