[Linux-cluster] <crit> #12: RG xxcluster failed to stop; intervention required

Omer Faruk Sen omer at faruk.net
Sun Feb 5 17:41:27 UTC 2006


Hi,

I ran a simple test, but redhat-cluster did not survive it. I have a
2-node cluster. I edited httpd.conf so that httpd can't start (by adding
a few stray characters to it), and on node2 I ran

/usr/sbin/clusvcadm -r xxcluster -m clu2

It moved the resources to clu2 perfectly, but when I corrected httpd.conf
on clu1 and issued

/usr/sbin/clusvcadm -r xxcluster -m clu1

on node2, I got the following errors:

Feb  5 16:25:19 clu1 clurgmgrd[2256]: <notice> stop on script "apache" returned 1 (generic error)
Feb  5 16:25:19 clu1 clurgmgrd[2256]: <crit> #12: RG xxcluster failed to stop; intervention required
Feb  5 16:25:19 clu1 clurgmgrd[2256]: <notice> Service xxcluster is failed

This suggests that the httpd service can't be started, but when I start
apache manually it starts without any problem.
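
The log above reports the exit status of the "stop" call on the script, so
one way to narrow this down is to run the same call by hand and look at the
status it returns. The following is only a rough sketch: the helper name
check_stop is made up for illustration, and it assumes that the exit status
of "<script> stop" is what clurgmgrd is reporting.

```shell
#!/bin/sh
# check_stop is a hypothetical helper, not part of rgmanager.
# It runs "<script> stop" and prints the exit status, assuming this is
# the value that clurgmgrd logs as "returned N".
check_stop() {
    "$1" stop >/dev/null 2>&1
    rc=$?
    if [ "$rc" -eq 0 ]; then
        echo "stop OK"
    else
        echo "stop returned $rc"
    fi
    return "$rc"
}
```

On the node that logged the error it could be called as
"check_stop /etc/rc.d/init.d/httpd", once while httpd is running and once
while it is already stopped, to compare the two results.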

Can someone tell me what I am missing? It is a simple test, and redhat
cluster did not pass it.

I use two DL140 servers with fence_ipmilan. I can manually stop, start, and
reboot the servers using the fence_ipmilan command. Here is my cluster.conf:

<?xml version="1.0"?>
<cluster config_version="5" name="xx">
        <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="clu1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="clu1-ilo"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="clu2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="clu2-ilo"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_ipmilan" ipaddr="192.168.1.98" login="admin" name="clu1-ilo" passwd="admin"/>
                <fencedevice agent="fence_ipmilan" ipaddr="192.168.1.99" login="admin" name="clu2-ilo" passwd="admin"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="xx-cluster" ordered="0" restricted="1">
                                <failoverdomainnode name="clu1" priority="1"/>
                                <failoverdomainnode name="clu2" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="192.168.1.54" monitor_link="1"/>
                        <script file="/etc/rc.d/init.d/httpd" name="apache"/>
                        <script file="/etc/rc.d/init.d/vsftpd" name="ftp"/>
                        <script file="/etc/rc.d/init.d/mysqld" name="mysql"/>
                </resources>
                <service autostart="1" domain="xx-cluster" name="xxcluster">
                        <ip ref="192.168.1.54"/>
                        <script ref="apache"/>
                        <script ref="ftp"/>
                        <script ref="mysql"/>
                </service>
        </rm>
</cluster>




-- 
Omer Faruk Sen
http://www.faruk.net



