[Linux-cluster] proper cluster crash procedures?

Mark Chaney macscr at macscr.com
Mon Sep 29 19:44:25 UTC 2008


Thanks for the tips, but Ive found that one of my biggest problems is
getting the services to actually stop. The Cluster Service Manager usually
hangs when I try to stop it. 

 

Also, did you take a look at my editions to this thread? I listed my
cluster.conf and some of the error messages I have been getting.

 

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Alan A
Sent: Monday, September 29, 2008 1:26 PM
To: linux clustering
Subject: Re: [Linux-cluster] proper cluster crash procedures?

 

I am not sure that this will help but try disabling following services
before reboot and then have them join the cluster afterwards:
chkconfig rgmanager off
chkconfig gfs off
chkconfig clvmd off
chkconfig cman off

Then when node is up - start the services in this order
service cman start
service clvmd start
service gfs start
service rgmanager start

Stopping services should go in the reverse order. Don't forget to chkconfig
<servicename> on after you have the cluster working again.

On Mon, Sep 29, 2008 at 3:06 AM, Mark Chaney <macscr at macscr.com> wrote:

I have a 3 node cluster that has shared storage using iscsi san, hence I am
using GFS. Anyway, I had it crash for whatever reason, not sure if something
was rebooted incorrectly or what, but now I have been spending the past 2
hours trying to get the cluster back up. I would think that sampling
rebooting all the nodes would work, but heck, that hasn't. What should I be
doing? Should I just start up one at a time? BTW, I am using ipmi for
fencing if that makes a difference. I can post my cluster.conf if that's
helpful, but I would think there would be general techniques available.

Thanks,
Mark




--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster




-- 
Alan A.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20080929/a77b3ae5/attachment.htm>


More information about the Linux-cluster mailing list