[Linux-cluster] OpenAIS issue on CentOS 5.3

Madison Kelly linux at alteeve.com
Fri Oct 9 17:13:10 UTC 2009


Hi all,

   I've been fussing with a test cluster (2-node) for a bit now. I had 
it working, but I had very little luck with test failure and recovery. 
So I decided to start over and follow the "Redhat" way. Specifically, I 
was following along with their "Configuring and Managing a Red Hat 
Cluster; Red Hat Cluster for Red Hat Enterprise 5" PDF.

   I've gotten to the point where, using luci, the cluster was built. 
However, the nodes haven't joined and trying to use 'have node join 
cluster' fails and generates the following in '/var/log/messages':

---------------------------------------------
Oct  9 13:15:45 vsh02 luci[22301]: Unable to retrieve batch 531050721 
status from vsh02.canadaequity.com:11111: module scheduled for execution
Oct  9 13:15:46 vsh02 ccsd[24724]: Unable to connect to cluster 
infrastructure after 154350 seconds.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] AIS Executive Service 
RELEASE 'subrev 1358 version 0.80.3'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Copyright (C) 2002-2006 
MontaVista Software, Inc and contributors.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Copyright (C) 2006 Red 
Hat, Inc.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] AIS Executive Service: 
started and ready to provide service.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Using default multicast 
address of 239.192.119.37
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_cpg loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais cluster closed process group service v1.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_cfg loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais configuration service'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_msg loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais message service B.01.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_lck loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais distributed locking service B.01.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_evt loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais event service B.01.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_ckpt loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais checkpoint service B.01.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_amf loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais availability management framework B.01.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_clm loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais cluster membership service B.01.01'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_evs loaded.
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service 
handler 'openais extended virtual synchrony service'
Oct  9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component 
openais_cman loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] AIS Executive Service 
RELEASE 'subrev 1358 version 0.80.3'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Copyright (C) 2002-2006 
MontaVista Software, Inc and contributors.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Copyright (C) 2006 Red 
Hat, Inc.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] AIS Executive Service: 
started and ready to provide service.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Using default multicast 
address of 239.192.119.37
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_cpg loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais cluster closed process group service v1.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_cfg loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais configuration service'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_msg loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais message service B.01.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_lck loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais distributed locking service B.01.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_evt loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais event service B.01.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_ckpt loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais checkpoint service B.01.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_amf loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais availability management framework B.01.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_clm loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais cluster membership service B.01.01'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_evs loaded.
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service 
handler 'openais extended virtual synchrony service'
Oct  9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component 
openais_cman loaded.
Oct  9 13:15:51 vsh02 luci[22301]: Unable to retrieve batch 531050721 
status from vsh02.canadaequity.com:11111: service cman start failed:
---------------------------------------------

   When I try to start 'cman' from the command line, I get this error:

---------------------------------------------
# service cman start
Starting cluster:
    Enabling workaround for Xend bridged networking... done
    Loading modules... done
    Mounting configfs... done
    Starting ccsd... done
    Starting cman... failed
/usr/sbin/cman_tool: aisexec daemon didn't start
---------------------------------------------

   This generates the same MAIN: openais errors.

   My cluster is pretty simple;
- Two ASUS servers with three NICs each. One dedicated to a DRBD link.
- IPMI for fencing
- LVM running on DRBD (no SAN, I'm afraid)

   Any insight into what I might be doing wrong?

Thanks!

Madi




More information about the Linux-cluster mailing list