[Linux-cluster] RHEL 5/CentOS 5 cluster?

Eric Schneider eschneid at uccs.edu
Mon May 7 21:58:59 UTC 2007


Firewall is off.  I can do a ccsd, cman_tool join, cman_tool nodes and get
info.  However, looking in the logs shows that the systems don't even see
each other.

[root at sleepy ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M      4   2007-05-07 15:47:35  sleepy
   2   X      0                        sleepy2

The systems try to fence each other right away.

May  7 14:28:12 sleepy fenced[2655]: fencing node "sleepy2"
May  7 14:28:12 sleepy fence_manual: Node sleepy2 needs to be reset before
recovery can procede.  Waiting for sleepy2 to rejoin the cluster or for
manual acknowledgement that it has been reset (i.e. fence_ack_manual -n
sleepy2)
May  7 15:38:06 sleepy fenced[2745]: sleepy2 not a cluster member after 3
sec post_join_delay


<?xml version="1.0"?>
<cluster config_version="2" name="alpha_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="sleepy" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="human"
nodename="sleepy"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="sleepy2" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="human"
nodename="sleepy2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_manual" name="human"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="www" ordered="0"
restricted="1">
                                <failoverdomainnode name="sleepy"
priority="1"/>
                                <failoverdomainnode name="sleepy2"
priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="128.198.1.218" monitor_link="1"/>
                </resources>
                <service autostart="1" domain="www" name="www"
recovery="relocate">
                        <script file="/etc/init.d/httpd" name="httpd">
                                <ip ref="192.168.1.100"/>
                        </script>
                </service>
        </rm>
</cluster>

Eric

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Steven Dake
Sent: Monday, May 07, 2007 3:22 PM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL 5/CentOS 5 cluster?

On Mon, 2007-05-07 at 15:08 -0600, Eric Schneider wrote:
> I have tried CentOS 5 (i386 and x64) and SL 5 (i386) and I cannot get a 2
> node cluster to startup.  I ask questions in the CentOS IRC channel and on
> their forums, but no one has a solution.  I can get a RHEL 4 cluster
working
> without issues.  Is there something broken in RHEL 5 and clones?
> 
> Eric
> 
> 

Output from /var/log/messages would be helpful.  If you configure a
firewall by default, you must add a firewall rule for port 5405 UDP to
allow connections from other cluster nodes.  This is most likely the
problem you are having.

Regards
-steve

> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster





More information about the Linux-cluster mailing list