[Linux-cluster] GFS on CentOS - cman unable to start
Chris Kwall
christiankwall-qsa at yahoo.com
Sat Jan 7 08:22:12 UTC 2012
Hi Wes
Please excuse my poor english - it's not my mother language I'm writing in.
----- Ursprüngliche Message -----
> Howdy, y'all. I'm trying to set up GFS in a cluster on CentOS systems
> running on vmWare. The GFS FS is on a Dell Equilogic SAN.
>
> I keep running into the same problem despite many differently-flavored
> attempts to set up GFS. The problem comes when I try to start cman, the
> cluster management software.
>
> [root at test01]# service cman start
> Starting cluster:
> Loading modules... done
> Mounting configfs... done
> Starting ccsd... done
> Starting cman... failed
> cman not started: Can't find local node name in cluster.conf
> /usr/sbin/cman_tool: aisexec daemon didn't start
> [FAILED]
I don't think that the cluster is your main-problem.
The nodename must "not" present in DNS, but it must be resolvable by files, ldap whatever.
Please verify that "files" is present at /etc/nsswitch.conf.
e.g: hosts: files dns
Did you've check with "ip addr list" that the ip-address matches the same as in /etc/hosts?
>
> [root at test01]# tail /var/log/messages
> Jan 5 13:39:40 testbench06 ccsd[13194]: Unable to connect to
> cluster infrastructure after 1193640 seconds.
> Jan 5 13:40:10 testbench06 ccsd[13194]: Unable to connect to
> cluster infrastructure after 1193670 seconds.
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] AIS Executive
> Service RELEASE 'subrev 1887 version 0.80.6'
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] Copyright (C)
> 2002-2006 MontaVista Software, Inc and contributors.
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] Copyright (C)
> 2006 Red Hat, Inc.
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] AIS Executive
> Service: started and ready to provide service.
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] local node name
> "test01.gdao.ucsc.edu" not found in cluster.conf
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] Error reading CCS
> info, cannot start
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] Error reading
> config from CCS
> Jan 5 13:40:24 testbench06 openais[3939]: [MAIN ] AIS Executive
> exiting (reason: could not read the main configuration file).
>
> Here are details of my configuration:
>
> [root at test01]# rpm -qa | grep cman
> cman-2.0.115-85.el5_7.2
>
> [root at test01]# echo $HOSTNAME
> test01.gdao.ucsc.edu
>
> [root at test01]# hostname
> test01.gdao.ucsc.edu
>
> [root at test01]# cat /etc/hosts
> # Do not remove the following line, or various programs
> # that require network functionality will fail.
> 128.114.31.112 test01 test01.gdao test01.gdao.ucsc.edu
> 128.114.31.113 test02 test02.gdao test02.gdao.ucsc.edu
> 127.0.0.1 localhost.localdomain localhost
> ::1 localhost6.localdomain6 localhost6
>
> [root at test01]# sestatus
> SELinux status: enabled
> SELinuxfs mount: /selinux
> Current mode: permissive
> Mode from config file: permissive
> Policy version: 21
> Policy from config file: targeted
>
> [root at test01]# cat /etc/cluster/cluster.conf
> <?xml version="1.0"?>
> <cluster config_version="25" name="gdao_cluster">
> <fence_daemon post_fail_delay="0"
> post_join_delay="120"/>
> <clusternodes>
> <clusternode name="test01" nodeid="1"
> votes="1">
> <fence>
> <method name="single">
> <device name="gfs_vmware"/>
> </method>
> </fence>
> </clusternode>
> <clusternode name="test02" nodeid="2"
> votes="1">
> <fence>
> <method name="single">
> <device name="gfs_vmware"/>
> </method>
> </fence>
> </clusternode>
> </clusternodes>
> <cman/>
> <fencedevices>
> <fencedevice agent="fence_manual"
> name="gfs1_ipmi"/>
> <fencedevice agent="fence_vmware"
> name="gfs_vmware"
> ipaddr="gdvcenter.ucsc.edu" login="root"
> passwd="1hateAmazon.com"
> vmlogin="root" vmpasswd="esxpass"
> port="/vmfs/volumes/49086551-c64fd83c-0401-001e0bcd6848/eagle1/gfs1.vmx"/>
> </fencedevices>
> <rm>
> <failoverdomains/>
> </rm>
> </cluster>
- Chris
More information about the Linux-cluster
mailing list