[Linux-cluster] GFS/cman related problems?

Dan B. Phung phung at cs.columbia.edu
Mon May 2 05:59:56 UTC 2005


I hope I'm sending to the correct list.  I'm having problems starting up
gfs, and hopefully it's just something incorrect with my configuration.

For the sources, I checked out the latest of:
  device-mapper LVM2 cluster
  cvs -d :pserver:cvs at sources.redhat.com:/cvs/cluster 

for the kernel sources, I'm using a vanilla kernel, so I had to
update the cluster/*kernel from old versions.  (this may be my problem)
The kernel built and installed fine.  The kernel modules load fine.

root # modprobe dm-mod
root # device-mapper/scripts/devmap_mknod.sh
root # modprobe gfs
root # modprobe lock_dlm

  Lock_Harness <CVS> (built May  1 2005 15:31:12) installed
  GFS <CVS> (built May  1 2005 15:30:54) 
  CMAN <CVS> (built May  1 2005 14:54:13) 
  NET: Registered protocol family 30
  DLM <CVS> (built May  1 2005 14:54:32) 
  udev[5023]: creating device node '/dev/dlm-control'
  Lock_DLM (built May  1 2005 15:31:03) 

Then I try to startup the daemons

root # route add -net netmask dev eth0

root # ccsd -V
ccsd DEVEL.1114967270 (built May  1 2005 13:07:54)
Copyright (C) Red Hat, Inc.  2004  All rights reserved.

root # ccsd 
Starting ccsd DEVEL.1114967270: 
May  1 16:02:38 localhost ccsd[5035]:  Built: May  1 2005 13:07:54 
May  1 16:02:38 localhost ccsd[5035]:  Copyright (C) Red Hat, Inc.  2004 All rights reserved.

root # cman_tool -V
cman_tool DEVEL.1114967270 (built May  1 2005 13:07:59)
Copyright (C) Red Hat, Inc.  2004  All rights reserved.

root # cman_tool -d join
selected nodename blade1
multicast address
if eth0 for mcast address
setup up interface for address: blade1
cman: CMAN DEVEL.1114967270 (built May  1 2005 20:17:15) started
root # cman: Waiting to join or form a Linux-cluster
cman: forming a new cluster
cman: quorum regained, resuming activity

root # clvmd -V
Cluster LVM daemon version: 2.01.10-cvs (2005-04-04)
Protocol version:           0.2.1

root # clvmd   
clvmd could not connect to cluster manager
Consult syslog for more information

root # syslog | tail -1
Unable to connect to cluster infrastructure after 60 seconds
(there are many of these)

so from here I tried to startup ccsd and cman from my other blade.
'cman_tool join' on the other blade never joins and gives these messages:

  cman: sending membership request

I'm following the instructions from:

Here is my configuration:

<?xml version="1.0"?>
        <cluster name="blade_cluster" config_version="2">
        <cman two_node="1" expected_votes="1">
        <multicast addr=""/>

          <clusternode name="blade03" nodeid="1" votes="1">
          <multicast addr="" interface="eth0"/>
                <method name="human">
                <device name="eth0" ipaddr="blade1"/>

          <clusternode name="blade04" nodeid="2" votes="1">
          <multicast addr="" interface="eth0"/>
                <method name="human">
                <device name="eth0" ipaddr="blade2"/>

          <fencedevice name="human" agent="fence_manual"/>

        <fence_daemon clean_start="0">

any help is much appreciated.


More information about the Linux-cluster mailing list