Hi!<br><br>Thanks for the advice. It's working properly now. I don't know why rgmanager stops working with a malformed cluster.conf while the other components don't.<br>Anyway, about the cluster votes, I took the info from the cluster FAQ (<a href="http://sources.redhat.com/cluster/wiki/FAQ/CMAN#quorumdiskhow">http://sources.redhat.com/cluster/wiki/FAQ/CMAN#quorumdiskhow</a>), which says:<br>
<br><p class="line874">Note that if you configure a quorum disk/partition,
you don't want two_node="1" or expected_votes="2" since the quorum disk
solves the voting imbalance. You want two_node="0" and
expected_votes="3" (or nodes + 1 if it's not a two-node cluster).
However, since 0 is the default value for two_node, you don't need to
specify it at all. If this is an existing two-node cluster and you're
changing the two_node value from "1" to "0", you'll have to stop the
entire cluster and restart it after the configuration is changed
(normally, the cluster doesn't have to be stopped and restarted for
configuration changes, but two_node is a special case.) Basically, you
want something like this in your /etc/cluster/cluster.conf: <span class="anchor" id="line-255"></span><span class="anchor" id="line-256"></span></p><p class="line867"><span class="anchor" id="line-257"></span><span class="anchor" id="line-258"></span><span class="anchor" id="line-259"></span><span class="anchor" id="line-260"></span><span class="anchor" id="line-261"></span><span class="anchor" id="line-262"></span><span class="anchor" id="line-263"></span></p>
<pre>  <cman two_node="0" expected_votes="3" .../><br>    <clusternodes><br>       <clusternode name="node1" votes="1" .../><br>       <clusternode name="node2" votes="1" .../><br>
    </clusternodes><br>  <quorumd device="/dev/mapper/lun01" votes="1"/></pre>Thx!<br><br><div class="gmail_quote">On Tue, Jan 20, 2009 at 12:42 PM, Stewart Walters <span dir="ltr"><<a href="mailto:stewart@epits.com.au">stewart@epits.com.au</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Ghe Rivero wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div><div></div><div class="Wj3C7c">
Hi everyone,<br>
    I've been fighting with a 2-node cluster for the last few days, and I've finally given up.<br>
I'm having problems with the clurgmgrd daemon. It stops responding when I restart the cluster (just the cluster, not the services or the nodes) and becomes unkillable. The only way to recover is to restart the nodes, but as you can imagine that's not a solution.<br>

<br>
I'm using Conga to configure it. Any ideas?<br>
<br>
Ghe Rivero<br>
<?xml version="1.0"?><br>
<cluster alias="AAA" config_version="14" name="AAA"><br>
        <quorumd interval="3" label="quorumlnx" status_file="/tmp/qdisk-status" tko="23" votes="1"/><br>
        <cman deadnode_timeout="135" expected_nodes="3"/><br>
        <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/><br>
        <clusternodes><br>
                <clusternode name="node1.fqdn" nodeid="1" votes="1"><br>
                        <fence><br>
                                <method name="1"><br>
                                        <device name="iLO-node1"/><br>
                                </method><br>
                        </fence><br>
                </clusternode><br>
                <clusternode name="node2.fqdn" nodeid="2" votes="1"><br>
                        <fence><br>
                                <method name="1"><br>
                                        <device name="iLO-node2"/><br>
                                </method><br>
                        </fence><br>
                </clusternode><br>
        </clusternodes><br>
        <cman expected_votes="3" two_node="0"/><br>
        <fencedevices><br>
                <fencedevice agent="fence_ilo" hostname="10.110.65.6" login="login" name="iLO-node1" passwd="Y"/><br>
                <fencedevice agent="fence_ilo" hostname="10.110.65.7" login="login" name="iLO-node2" passwd="Y"/><br>
        </fencedevices><br>
        <rm><br>
                <failoverdomains><br>
                        <failoverdomain name="Web" ordered="1" restricted="1"><br>
                                <failoverdomainnode name="node1.fqdn" priority="1"/><br>
                                <failoverdomainnode name="node2.fqdn" priority="2"/><br>
                        </failoverdomain><br>
                </failoverdomains><br>
                <resources><br>
                        <script file="/etc/init.d/httpd" name="Apache"/><br>
                        <ip address="10.110.65.30" monitor_link="1"/><br>
                </resources><br>
                <service autostart="1" domain="Web" exclusive="1" name="Web"><br>
                        <script ref="Apache"/><br>
                </service><br>
        </rm><br>
</cluster><br>
<br>
<br>
<br>
<br>
<br>
-- <br></div></div>
.''`.  Pienso, Luego Incordio<br>: :' :<br>`. `'<br>  `-    <a href="http://www.debian.org" target="_blank">www.debian.org</a>    <a href="http://www.hispalinux.es" target="_blank">www.hispalinux.es</a><div class="Ih2E3d">
<br>
<br>
GPG Key: 26F020F7<br>
GPG fingerprint: 4986 39DA D152 050B 4699  9A71 66DB 5A36 26F0 20F7<br></div>
------------------------------------------------------------------------<br>
<br>
--<br>
Linux-cluster mailing list<br>
<a href="mailto:Linux-cluster@redhat.com" target="_blank">Linux-cluster@redhat.com</a><br>
<a href="https://www.redhat.com/mailman/listinfo/linux-cluster" target="_blank">https://www.redhat.com/mailman/listinfo/linux-cluster</a><br>
</blockquote>
<br>
Your cluster.conf looks a little out of whack for a 2-node cluster.  It looks as if it's designed for a 3-node cluster, but you've only defined two nodes.  This will get you into trouble (I know from experience) :-)<br>

<br>
You've got duplicate cman entries which do not look right (although I'm pretty new to RHCS myself so I wouldn't consider me an authority on the matter).  See <cman deadnode_timeout="135" expected_nodes="3"/> and <cman expected_votes="3" two_node="0"/>.<br>
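<br>
A quick way to spot this kind of duplication is to count the top-level elements in the file. The following is only a sketch, assuming Python's standard xml.etree.ElementTree; the helper name and the trimmed sample config are made up for illustration and are not part of any RHCS tool:<br>

```python
import xml.etree.ElementTree as ET

# Trimmed, hypothetical sample mirroring the two <cman> entries quoted above.
SAMPLE_CONF = """
<cluster alias="AAA" config_version="14" name="AAA">
  <cman deadnode_timeout="135" expected_nodes="3"/>
  <clusternodes/>
  <cman expected_votes="3" two_node="0"/>
</cluster>
"""


def top_level_elements(conf_text, tag):
    """Return the attribute dicts of every direct child of <cluster> named `tag`."""
    root = ET.fromstring(conf_text)
    return [dict(el.attrib) for el in root.findall(tag)]


cman_entries = top_level_elements(SAMPLE_CONF, "cman")
if len(cman_entries) > 1:
    print("found %d <cman> elements; expected one:" % len(cman_entries))
    for attrs in cman_entries:
        print("  ", attrs)
```

To check a real file, the same helper could be fed the contents of /etc/cluster/cluster.conf instead of the sample string.<br>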

<br>
I would have thought that should be in a combined cman directive such as <cman deadnode_timeout="135" expected_votes="2" two_node="1"/>.  The expected votes would be 2, because in the event of split brain you'll want 1 node + quorum disk to remain a quorate cluster.<br>
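<br>
The vote arithmetic can be sketched roughly as follows. This assumes the usual majority rule (quorum = expected_votes // 2 + 1) and does not model the two_node="1" special case; the function names are illustrative only and are not part of cman. The numbers used are the ones from the posted config: two nodes with one vote each, a quorum disk with one vote, and expected_votes="3".<br>

```python
def quorum_threshold(expected_votes):
    """Votes needed for quorum under a simple majority rule (assumption)."""
    return expected_votes // 2 + 1


def is_quorate(votes_present, expected_votes):
    """True if the votes currently present reach the quorum threshold."""
    return votes_present >= quorum_threshold(expected_votes)


NODE_VOTES = 1      # votes="1" on each clusternode
QDISK_VOTES = 1     # votes="1" on quorumd
EXPECTED_VOTES = 3  # two nodes + quorum disk

# One surviving node that still holds the quorum disk.
survivor_with_qdisk = is_quorate(NODE_VOTES + QDISK_VOTES, EXPECTED_VOTES)

# One node that has lost both its peer and the quorum disk.
isolated_node = is_quorate(NODE_VOTES, EXPECTED_VOTES)

print("threshold:", quorum_threshold(EXPECTED_VOTES))
print("node + qdisk quorate:", survivor_with_qdisk)
print("lone node quorate:", isolated_node)
```

Under these assumptions a single node plus the quorum disk stays quorate, while a node cut off from both its peer and the disk does not.<br>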

<br>
In my cluster.conf <cman> is defined after the </clusternodes>.  I'm not sure if it makes a difference, but I would suggest removing the topmost cman directive and merging its parameters into the bottom directive.<br>

<br>
Also, do you need a quorum disk?  A two-node cluster can have one, but does not need one to operate.<br>
<br>
If you don't, use expected_votes="1".<br>
<br>
See how you go.<br>
<br>
Regards,<br>
<br>
Stewart<br><font color="#888888">
<br>
</font></blockquote></div><br><br clear="all"><br>-- <br> .''`.  Pienso, Luego Incordio   <br>: :' :  <br>`. `'   <br>  `-    <a href="http://www.debian.org">www.debian.org</a>    <a href="http://www.hispalinux.es">www.hispalinux.es</a><br>
<br>GPG Key: 26F020F7<br>GPG fingerprint: 4986 39DA D152 050B 4699  9A71 66DB 5A36 26F0 20F7<br>