[Linux-cluster] only one service fails-over out of two depended services.
Georgi Stanojevski
glisha at gmail.com
Thu Jun 3 20:46:39 UTC 2010
Hi,
I have configured two services in my two-node cluster (RHEL 5.4).
service1 - with ip, ha-lvm and fs resources.
service2 - with a script resource which depends on service1.
When i manually relocate the services everything works as expected.
But, when i fail one node (halt -f) only service1 gets relocated to
the other node. service2 "stays" on the failed node in started state.
The logs say that only service1 will be taken over from the failed
node. No mention that service2 should be failed to the working node.
Jun 3 22:17:35 node1 clurgmgrd[22963]: <info> Waiting for node #2 to be fenced
Jun 3 22:17:43 node1 clurgmgrd[22963]: <info> Node #2 fenced; continuing
Jun 3 22:17:43 node1 clurgmgrd[22963]: <debug> Evaluating RG
service:service1, state started, owner node2
Jun 3 22:17:43 node1 clurgmgrd[22963]: <debug> Evaluating RG
service:service2, state started, owner node2
Jun 3 22:17:43 node1 clurgmgrd[22963]: <notice> Taking over service
service:service1 from down member node2
...
Jun 3 22:17:45 yeti clurgmgrd[22963]: <notice> Service service:service1 started
Does anyone have an idea if I am mis-configuring something?
Here is clustat when one node is failed:
===
Cluster Status for cluster1 @ Thu Jun 3 22:34:52 2010
Member Status: Quorate
Member Name ID Status
------ ---- ---- ------
node1 1 Online, Local, rgmanager
node2 2 Offline
Service Name Owner (Last)
State
------- ---- ----- ------
-----
service:service1 node1
started
service:service2 node2
started
===
Here is the snippet of cluster.conf regarding the services:
===
<service autostart="1" exclusive="0" name="service1"
recovery="relocate" priority="1">
<ip ref="10.1.1.1"/>
<lvm ref="lvm1"/>
<fs ref="fs1"/>
</service>
<service autostart="1" exclusive="0" name="service2"
recovery="relocate" depend="service:service1" depend_mode="hard"
priority="2">
<script ref="scriptsresource"/>
</service>
===
Whole cluster.conf is attached.
Thank you very much for any input.
--
Glisha
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: application/octet-stream
Size: 2018 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20100603/9d7cc2d6/attachment.obj>
More information about the Linux-cluster
mailing list