[Linux-cluster] can't communicate with fenced -1

GS R gsrlinux at gmail.com
Wed Jun 25 08:33:51 UTC 2008


> 2008/6/25 GS R <gsrlinux at gmail.com>:
>
>>
>>
>> On 6/24/08, Gian Paolo Buono <gpbuono at gmail.com> wrote:
>>
>>> Hi,
>>>
>>> We have two RHEL5.1 boxes sharing a single iSCSI EMC2 SAN, without
>>> fence devices. The system is configured as a high-availability
>>> cluster for Xen guests.
>>>
>>> One of the most frequently recurring problems is fence_tool related.
>>>
>>>   # service cman start
>>>   Starting cluster:
>>>      Loading modules... done
>>>      Mounting configfs... done
>>>      Starting ccsd... done
>>>      Starting cman... done
>>>      Starting daemons... done
>>>      Starting fencing... fence_tool: can't communicate with fenced -1
>>>
>>>   # fenced -D
>>>   1204556546 cman_init error 0 111
>>>
>>>   # clustat
>>>   CMAN is not running.
>>>
>>>   # cman_tool join
>>>
>>>   # clustat
>>>   msg_open: Connection refused
>>>
>>>   Member Status: Quorate
>>>     Member Name                        ID   Status
>>>     ------ ----                        ---- ------
>>>     yoda1                             1 Online, Local
>>>     yoda2                             2 Offline
>>>
>>> Sometimes this problem gets solved if the two machines are rebooted at
>>> the same time. But in the current HA configuration, I cannot guarantee
>>> that the two systems will be rebooted at the same time for every problem
>>> we face. This is my config file:
>>>
>>> ###################################cluster.conf####################################
>>>
>>> <?xml version="1.0"?>
>>> <cluster alias="yoda-cl" config_version="2" name="yoda-cl">
>>>         <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
>>>         <clusternodes>
>>>                 <clusternode name="yoda2" nodeid="1" votes="1">
>>>                         <fence/>
>>>                 </clusternode>
>>>                 <clusternode name="yoda1" nodeid="2" votes="1">
>>>                         <fence/>
>>>                 </clusternode>
>>>         </clusternodes>
>>>         <cman expected_votes="1" two_node="1"/>
>>>         <rm>
>>>                 <failoverdomains/>
>>>                 <resources/>
>>>         </rm>
>>>         <fencedevices/>
>>> </cluster>
>>> ###################################cluster.conf####################################
>>> Regards.
>>>
>> Hi
>>
>> I configured a two-node cluster with no fence device on RHEL5.1.
>> The cluster started and stopped with no issues. The only difference that I
>> see is that I have used the FQDN in my cluster.conf,
>>
>> i.e., <clusternode name="yoda2.gsr.com" nodeid="1" votes="1">
>>
>> Check whether your /etc/hosts has the FQDN in it.
>>
>> Thanks
>> Gowrishankar Rajaiyan
>>
>

On 6/25/08, Gian Paolo Buono <gpbuono at gmail.com> wrote:
>
> Hi,
> the problem with my cluster is that it starts up well, but after two days
> the problem that I have described appears, and it gets solved if
> the two machines are rebooted at the same time.
>
> Thanks
> Gian Paolo



Hi Gian

Could you please attach the logs?
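
Alongside the logs, the /etc/hosts check suggested earlier in this thread
can be scripted. What follows is only a rough sketch, not a supported tool:
the function names are made up for illustration, and it assumes the usual
RHEL5 location /etc/cluster/cluster.conf. It only compares the clusternode
names against the names listed in /etc/hosts; it does not query DNS or talk
to cman.

```python
import xml.etree.ElementTree as ET


def cluster_node_names(cluster_conf_text):
    """Return the clusternode names declared in a cluster.conf document."""
    root = ET.fromstring(cluster_conf_text.strip())
    return [node.get("name") for node in root.iter("clusternode")]


def hosts_names(hosts_text):
    """Return every hostname and alias found in /etc/hosts-style text."""
    names = set()
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        fields = line.split()
        if len(fields) >= 2:
            names.update(fields[1:])  # fields[0] is the IP address
    return names


def missing_from_hosts(cluster_conf_text, hosts_text):
    """List cluster node names that have no entry in the hosts text.

    Hypothetical helper: an empty list means every name used in
    cluster.conf (short name or FQDN) also appears in /etc/hosts.
    """
    known = hosts_names(hosts_text)
    return [n for n in cluster_node_names(cluster_conf_text) if n not in known]
```

To use it, read /etc/cluster/cluster.conf and /etc/hosts on each node and
pass both texts to missing_from_hosts(). Any name it returns is one that
cluster.conf uses but /etc/hosts does not list, which is worth fixing on
both nodes before looking further.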

Thanks
Gowrishankar Rajaiyan