[Linux-cluster] startup problem: "cman not started: Can't bind to local cman socket /usr/sbin/cman_tool: aisexec daemon didn't start "

Brieseneck, Arne, VF-Group Arne.Brieseneck at vodafone.com
Wed Apr 25 16:26:29 UTC 2007


it is high likely that anybody already knows this error and knows how to
fix it: 

---snip--- 
[root at box1 ~]# /etc/init.d/cman start 
Starting cluster: 
Loading modules... done 
Mounting configfs... done 
Starting ccsd... done 
Starting cman... failed 
cman not started: Can't bind to local cman socket /usr/sbin/cman_tool:
aisexec daemon didn't start 
[FAILED] 
[root at box1 ~]# 

---snap--- 

the cluster does not start even what is written in the logfileif looks
OK for me:

---snip--- 

Apr 24 19:59:06 box1 ccsd[5129]: Starting ccsd 2.0.60: 
Apr 24 19:59:06 box1 ccsd[5129]: Built: Jan 24 2007 15:31:03 
Apr 24 19:59:06 box1 ccsd[5129]: Copyright (C) Red Hat, Inc. 2004 All
rights reserved. 
Apr 24 19:59:06 box1 ccsd[5129]: cluster.conf (cluster name =
alpha_cluster, version = 6) found. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] AIS Executive Service
RELEASE 'subrev 1204 version 0.80.1' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Copyright (C) 2002-2006
MontaVista Software, Inc and contributors. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Copyright (C) 2006 Red Hat,
Inc. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Using default multicast
address of 239.192.196.121 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_cpg loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais cluster closed process group service v1.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_cfg loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais configuration service' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_msg loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais message service B.01.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_lck loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais distributed locking service B.01.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_evt loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais event service B.01.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_ckpt loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais checkpoint service B.01.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_amf loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais availability management framework B.01.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_clm loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais cluster membership service B.01.01' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_evs loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais extended virtual synchrony service' 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component
openais_cman loaded. 
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler
'openais CMAN membership service 2.01' 
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] entering GATHER state from
12. 
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Creating commit token
because I am the rep. 
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Saving state aru 1e high seq
received 1e 
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Storing new sequence id for
ring 304 
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] entering COMMIT state. 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] entering RECOVERY state. 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [0] member
192.168.50.194: 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep
192.168.50.194 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e
received flag 0 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [1] member
192.168.50.195: 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep
192.168.50.194 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e
received flag 0 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [2] member
192.168.50.196: 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep
192.168.50.194 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e
received flag 0 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] Did not need to originate
any messages in recovery. 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] CLM CONFIGURATION CHANGE 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] New Configuration: 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.194) 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.195) 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.196) 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Left: 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Joined: 
Apr 24 19:59:09 box1 openais[5070]: [SYNC ] This node is within the
primary component and will provide service. 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] CLM CONFIGURATION CHANGE 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] New Configuration: 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.194) 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.195) 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.196) 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Left: 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Joined: 
Apr 24 19:59:09 box1 openais[5070]: [SYNC ] This node is within the
primary component and will provide service. 
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] entering OPERATIONAL state. 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message
192.168.50.195 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message
192.168.50.196 
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message
192.168.50.194 
Apr 24 19:59:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 30 seconds. 
Apr 24 20:00:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 60 seconds. 
Apr 24 20:00:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 90 seconds. 
Apr 24 20:01:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 120 seconds. 
Apr 24 20:01:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 150 seconds. 
Apr 24 20:02:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 180 seconds. 
Apr 24 20:02:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 210 seconds. 
Apr 24 20:03:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 240 seconds. 
Apr 24 20:03:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 270 seconds. 
Apr 24 20:04:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 300 seconds. 
Apr 24 20:04:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 330 seconds. 
Apr 24 20:05:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 360 seconds. 
Apr 24 20:05:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 390 seconds. 
Apr 24 20:06:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 420 seconds. 
Apr 24 20:06:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 450 seconds. 
Apr 24 20:07:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 480 seconds. 
Apr 24 20:07:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 510 seconds. 
Apr 24 20:08:05 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 540 seconds. 
Apr 24 20:08:35 box1 ccsd[5129]: Unable to connect to cluster
infrastructure after 570 seconds. 
Apr 24 20:09:01 box1 ccsd[5129]: Stopping ccsd, SIGTERM received. 

---snap--- 

thats the /etc/cluster/cluster.conf:

---snip--- 
[root at box1 ~]# cat /etc/cluster/cluster.conf 
<?xml version="1.0"?> 
<cluster alias="alpha" config_version="6" name="alpha_cluster"> 
<cman> 
<multicast addr="224.0.0.1"/> 
</cman> 
<fence_daemon post_fail_delay="0" post_join_delay="3"/> 
<clusternodes> 
<clusternode name="box1" nodeid="1"> 
<multicast addr="224.0.0.1" interface="eth0"/> 
<fence> 
<method name="1"> 
<device name="human" nodename="box1.sbe" 
</method> 
</fence> 
</clusternode> 
<clusternode name="box2" votes="1" nodeid="2"> 
<multicast addr="224.0.0.1" interface="eth0"/> 
<fence> 
<method name="1"> 
<device name="human" nodename="box2.sbe" 
</method> 
</fence> 
</clusternode> 
<clusternode name="box3" votes="1" nodeid="4"> 
<multicast addr="224.0.0.1" interface="eth0"/> 
<fence> 
<method name="1"> 
<device name="human" nodename="box3.sbe" 
</method> 
</fence> 
</clusternode> 
</clusternodes> 
<fencedevices> 
<fencedevice agent="fence_manual" name="human"/> 
</fencedevices> 
<rm log_level="7" log_facility="syslog"> 
<failoverdomains/> 
<resources/> 
</rm> 
</cluster> 


---snap---
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070425/687c1b1a/attachment.htm>


More information about the Linux-cluster mailing list