[Linux-cluster] Killing node XXX because it has rejoined thecluster with existing state

BONNETOT Jean-Daniel (EXT THALES) ext.thales.jean-daniel.bonnetot at sncf.fr
Thu Sep 29 08:13:47 UTC 2011


Sorry for double message :)

 

 

 

Jean-Daniel BONNETOT

 

De : linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] De la part de BONNETOT Jean-Daniel (EXT THALES)
Envoyé : jeudi 29 septembre 2011 09:29
À : linux-cluster at redhat.com
Objet : [Linux-cluster] Killing node XXX because it has rejoined thecluster with existing state

 

Hi,

 

I have problem with two node cluster. When I force a node to faile, second node fences first one. When first one rejoin my cluster, cman shutdown on both nodes saying : 

 

Sep 28 17:29:36 s64lmwbig3c openais[7273]: [MAIN ] Killing node s64lmwbig3b because it has rejoined the cluster with existing state

Sep 28 17:29:36 s64lmwbig3c openais[7273]: [CMAN ] cman killed by node 1 because we rejoined the cluster without a full restart

 

 

Logs :

See attached

 

Conf :

<?xml version="1.0"?>

<cluster config_version="12" name="u64lmwbig8r">

        <cman expected_votes="1" two_node="1">

                <multicast addr="239.192.0.11"/>

        </cman>

        <clusternodes>

                <clusternode name="s64lmwbig3b" nodeid="1" votes="1">

                        <fence>

                                <method name="single">

                                        <device name="fenceHP_g3b"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="s64lmwbig3c" nodeid="2" votes="1">

                        <fence>

                                <method name="single">

                                        <device name="fenceHP_g3c"/>

                                </method>

                        </fence>

                </clusternode>

        </clusternodes>

        <fencedevices>

                <fencedevice agent="fence_ipmilan" ipaddr="XXXXX" lanplus="1" login="user" name="fenceHP_g3b" passwd="password" verbose="yes"/>

                <fencedevice agent="fence_ipmilan" ipaddr="XXXXX" lanplus="1" login="user" name="fenceHP_g3c" passwd="password" verbose="yes"/>

        </fencedevices>

        <rm>

                <failoverdomains/>

                <resources/>

        </rm>

        <fence_daemon clean_start="0" post_fail_delay="20" post_join_delay="60"/>

</cluster>

 

Do you know what I missed ?

 

Thanks

Regards,



 

Jean-Daniel BONNETOT

 

-------
Ce message et toutes les pièces jointes sont établis à l'intention exclusive de ses destinataires et sont confidentiels. L'intégrité de ce message n'étant pas assurée sur Internet, la SNCF ne peut être tenue responsable des altérations qui pourraient se produire sur son contenu. Toute publication, utilisation, reproduction, ou diffusion, même partielle, non autorisée préalablement par la SNCF, est strictement interdite. Si vous n'êtes pas le destinataire de ce message, merci d'en avertir immédiatement l'expéditeur et de le détruire.
-------
This message and any attachments are intended solely for the addressees and are confidential. SNCF may not be held responsible for their contents whose accuracy and completeness cannot be guaranteed over the Internet. Unauthorized use, disclosure, distribution, copying, or any part thereof is strictly prohibited. If you are not the intended recipient of this message, please notify the sender immediately and delete it. 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110929/c5fe2315/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.jpg
Type: image/jpeg
Size: 667 bytes
Desc: image001.jpg
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110929/c5fe2315/attachment.jpg>


More information about the Linux-cluster mailing list