[Linux-cluster] Adding a Node back to cluster failing

Zama Ques queszama at yahoo.in
Wed Mar 6 09:37:56 UTC 2013


Thanks emmanuel. 

Please find the cluster config file.

=====
<?xml version="1.0"?>
<cluster alias="exammysqldb" config_version="31" name="exammysqldb">
    <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
    <clusternodes>
        <clusternode name="node1.example.com" nodeid="1" votes="1">
            <fence/>
        </clusternode>
    </clusternodes>
    <cman/>
    <fencedevices/>
    <rm>
        <failoverdomains>
            <failoverdomain name="EXAMDBFA-DM" nofailback="0" ordered="0" restricted="1">
                <failoverdomainnode name="node1.example.com" priority="1"/>
            </failoverdomain>
        </failoverdomains>
        <resources>
            <postgres-8 config_file="/var/lib/pgsql/data/postgresql.conf" name="psql" postmaster_user="postgres" shutdown_wait="0"/>
            <ip address="192.168.2.173" monitor_link="1"/>
            <clusterfs device="/dev/examdb/examdbgfs" force_unmount="1" fsid="48393" fstype="gfs" mountpoint="/var/lib/pgsql/" name="exampsqlDB" self_fence="1"/>
            <script file="/etc/init.d/postgresql" name="psql-ssdg"/>
        </resources>
        <service autostart="1" domain="EXAMDBFA-DM" exclusive="1" name="EXAMDB" recovery="relocate">
            <ip ref="192.168.2.173"/>
            <clusterfs fstype="gfs" ref="exampsqlDB"/>
            <script ref="psql-ssdg"/>
        </service>
    </rm>
    <totem consensus="4800" join="60" token="10000" token_retransmits_before_loss_const="20"/>
</cluster>





________________________________
 From: emmanuel segura <emi2fast at gmail.com>
To: Zama Ques <queszama at yahoo.in>; linux clustering <linux-cluster at redhat.com> 
Sent: Wednesday, 6 March 2013 1:39 PM
Subject: Re: [Linux-cluster] Adding a Node back to cluster failing
 

Hello Zama

Where is your cluster config?

Thanks


2013/3/6 Zama Ques <queszama at yahoo.in>

Hi All,
>
>
>We are using a two node cluster using centOS 5.5 on HP Proliant Servers . One of the our servers in the cluster failed and is not booting up . There was a hardware issue on the server , so we removed the node from the cluster and also removed it from fencing domain. 
>
>
>
>Now our cluster service is now running on a single node cluster . After resolving the hardware issue , we are now trying to add back  the node to the cluster using the luci interface but it is failing with the following  error
>
>
>"Host already member of cluster  testcluster"
>
>
>whereas the host information is not available in cluster config file. 
>
>
># grep host2 /etc/cluster/cluster.conf 
>
>
>
>Restart of cman on the node is showing the following error
>
>
>====
># service cman restart
>Stopping cluster: 
>   Stopping fencing... done
>   Stopping cman... done
>   Stopping ccsd... done
>   Unmounting configfs... done
>                                                           [  OK  ]
>Starting cluster: 
>   Loading modules... done
>   Mounting configfs... done
>   Starting ccsd... done
>   Starting cman... failed
>cman not started: Can't find
 local node name in cluster.conf /usr/sbin/cman_tool: aisexec daemon didn't start
>                                                           [FAILED]
>
>--------------
>
>
>
># tail -f /var/log/messages
>Jul 29 00:04:16 host2
 ccsd[2793]:  Local version # : 31 
>Jul 29 00:04:16 host2 ccsd[2793]:  Remote version #: 31 
>Jul 29 00:04:16 host2 openais[2799]: [MAIN ] AIS Executive Service RELEASE 'subrev 1887 version 0.80.6' 
>Jul 29 00:04:16 host2 openais[2799]: [MAIN ] AIS Executive Service: started and ready to provide service. 
>Jul 29 00:04:16 host2 openais[2799]: [MAIN ] local node name "host2.example.com" not found in cluster.conf 
>Jul 29 00:04:16 host2 openais[2799]: [MAIN ] Error reading CCS info, cannot start 
>Jul 29 00:04:16 host2 openais[2799]: [MAIN ] Error reading config from CCS 
>Jul 29 00:04:16 host2 openais[2799]: [MAIN ] AIS Executive exiting (reason: could not read the main configuration file). 
>=======
>
>
>Please suggest how to add the failed node back to the cluster ? 
>
>
>
>Thanks in Advance
>Zaman
>
>
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
esta es mi vida e me la vivo hasta que dios quiera 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20130306/f31fd20f/attachment.htm>


More information about the Linux-cluster mailing list