[Linux-cluster] nodes don't see each other

Skulkin Dmitry skulkin at mosinfo.ru
Fri Sep 9 07:21:27 UTC 2005


Hi,

I'm tring to make a simple two-node cluster with RHCS4, but nodes don't  
see each other. clustat on node alpha1 shows that alpha1 is online,  
clustat on node alpha2 shows that alpha2 is online, but no information  
about other node. ping alpha1 and ping alpha2 is ok, hostnames are alpha1  
and alpha2. For testing purposes I'm using manual fencing, but with  
unchecked "clean start" on "cluster properties" starting fenced is hang  
and in /var/log/messages:
Sep  9 10:30:57 alpha1 fenced[12327]: fencing node "alpha2"
Sep  9 10:30:57 alpha1 fenced[12327]: fence "alpha2" failed
Sep  9 10:31:02 alpha1 fenced[12327]: fencing node "alpha2"
Sep  9 10:31:02 alpha1 fenced[12327]: fence "alpha2" failed
Sep  9 10:31:07 alpha1 fenced[12327]: fencing node "alpha2"
Sep  9 10:31:07 alpha1 fenced[12327]: fence "alpha2" failed
Sep  9 10:31:12 alpha1 fenced[12327]: fencing node "alpha2"
Sep  9 10:31:12 alpha1 fenced[12327]: fence "alpha2" failed

and so on. I tried fence_ack_manual -n alpha1 and fence_ack_manual -n  
alpha2 on both nodes, but no result:

Warning:  If the node "alpha1" has not been manually fenced
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable!  Please verify that the node shown above has
been reset or disconnected from storage.

Are you certain you want to continue? [yN] y
can't open /tmp/fence_manual.fifo: No such file or directory

With checked "clean start" starting fenced is ok, but nodes don't see each  
other at all.
cluster.conf:
<?xml version="1.0"?>
<cluster config_version="5" name="alpha_cluster">
         <fence_daemon clean_start="1" post_fail_delay="0"  
post_join_delay="3"/>
         <clusternodes>
                 <clusternode name="alpha1" votes="1"/>
                 <clusternode name="alpha2" votes="1"/>
         </clusternodes>
         <cman expected_votes="1" two_node="1"/>
         <fencedevices>
                 <fencedevice agent="fence_manual" name="mf"/>
         </fencedevices>
         <rm>
                 <failoverdomains/>
                 <resources/>
         </rm>
</cluster>

Thanks for any help,
-- 
Best regards,
Dmitry Skulkin




More information about the Linux-cluster mailing list