[Linux-cluster] can't communicate with fenced -1
GS R
gsrlinux at gmail.com
Wed Jun 25 08:33:51 UTC 2008
>
>
>
>
> 2008/6/25 GS R <gsrlinux at gmail.com>:
>
>>
>>
>> On 6/24/08, Gian Paolo Buono <gpbuono at gmail.com> wrote:
>>
>>> Hi,
>>>
>>> We have two RHEL5.1 boxes installed sharing a
>>> single iscsi emc2 SAN, whitout fence devices. System is configured
>>>
>>>
>>> as a high-availability system of xen guest.
>>>
>>> One of the most repeating problems are fence_tool related.
>>>
>>> # service cman start
>>> Starting cluster:
>>> Loading modules... done
>>> Mounting configfs... done
>>> Starting ccsd... done
>>> Starting cman... done
>>> Starting daemons... done
>>> Starting fencing... fence_tool: can't communicate with fenced -1
>>>
>>>
>>>
>>> # fenced -D
>>> 1204556546 cman_init error 0 111
>>>
>>> # clustat
>>> CMAN is not running.
>>>
>>> # cman_tool join
>>>
>>> # clustat
>>> msg_open: Connection refused
>>>
>>> Member Status: Quorate
>>> Member Name ID Status
>>>
>>> ------ ---- ---- ------
>>> yoda1 1 Online, Local
>>> yoda2 2 Offline
>>>
>>> Sometimes this problem gets solved if the two machines are rebooted at
>>>
>>>
>>>
>>> the same time. But in the current HA configuration, I cannot guarantee
>>> two systems will be rebooted at the same time for every problem we
>>> face. This is my config file:
>>>
>>> ###################################cluster.conf####################################
>>>
>>>
>>>
>>> <?xml version="1.0"?>
>>> <cluster alias="yoda-cl" config_version="2" name="yoda-cl">
>>> <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
>>>
>>>
>>>
>>> <clusternodes>
>>> <clusternode name="yoda2" nodeid="1" votes="1">
>>> <fence/>
>>> </clusternode>
>>> <clusternode name="yoda1" nodeid="2" votes="1">
>>>
>>>
>>>
>>> <fence/>
>>> </clusternode>
>>> </clusternodes>
>>> <cman expected_votes="1" two_node="1"/>
>>> <rm>
>>> <failoverdomains/>
>>>
>>>
>>>
>>> <resources/>
>>> </rm>
>>> <fencedevices/>
>>> </cluster>
>>> ###################################cluster.conf####################################
>>> Regards.
>>>
>>> Hi
>>
>> I configured a two node cluster with no fence device on RHEL5.1.
>> The cluster started and stopped with no issues. The only difference that I
>> see is that I have used FQDN in my cluster.conf
>>
>> i.e., <clusternode name="yoda2*.gsr.com*" nodeid="1" votes="1">
>>
>> Check your /etc/hosts if it has the FQDN in it.
>>
>> Thanks
>> Gowrishankar Rajaiyan
>>
>>
>>
>
On 6/25/08, Gian Paolo Buono <gpbuono at gmail.com> wrote:
>
> Hi,
> the problem of my cluster is that it start-up weel but after two days the
> problem that I have described is running, and this problem gets solved if
> the two machines are rebooted at the same time.
>
> Thanks
> Gian Paolo
Hi Gian
Could you please attach the logs.
Thanks
Gowrishankar Rajaiyan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20080625/805cdbbe/attachment.htm>
More information about the Linux-cluster
mailing list