[Linux-cluster] only one service fails-over out of two depended services.

Georgi Stanojevski glisha at gmail.com
Thu Jun 3 20:46:39 UTC 2010


Hi,

I have configured two services in my two-node cluster (RHEL 5.4).

service1 - with ip, ha-lvm and fs resources.
service2 - with a script  resource which depends on service1.

When i manually relocate the services everything works as expected.

But, when i fail one node (halt -f) only service1 gets relocated to
the other node. service2 "stays" on the failed node in started state.

The logs say that only service1 will be taken over from the failed
node. No mention that service2 should be failed to the working node.

Jun  3 22:17:35 node1 clurgmgrd[22963]: <info> Waiting for node #2 to be fenced
Jun  3 22:17:43 node1 clurgmgrd[22963]: <info> Node #2 fenced; continuing
Jun  3 22:17:43 node1 clurgmgrd[22963]: <debug> Evaluating RG
service:service1, state started, owner node2
Jun  3 22:17:43 node1 clurgmgrd[22963]: <debug> Evaluating RG
service:service2, state started, owner node2
Jun  3 22:17:43 node1 clurgmgrd[22963]: <notice> Taking over service
service:service1 from down member node2
...
Jun  3 22:17:45 yeti clurgmgrd[22963]: <notice> Service service:service1 started

Does anyone have an idea if I am mis-configuring something?

Here is clustat when one node is failed:
===
Cluster Status for cluster1 @ Thu Jun  3 22:34:52 2010
Member Status: Quorate

Member Name                                   ID   Status
------ ----                                               ---- ------
node1                                              1 Online, Local, rgmanager
node2                                              2 Offline

Service Name                            Owner (Last)
          State
------- ----                                     ----- ------
                           -----
service:service1                         node1
              started
service:service2                         node2
              started
===

Here is the snippet of cluster.conf regarding the services:
===
        <service autostart="1" exclusive="0" name="service1"
recovery="relocate" priority="1">
            <ip ref="10.1.1.1"/>
            <lvm ref="lvm1"/>
            <fs ref="fs1"/>
        </service>
        <service autostart="1" exclusive="0" name="service2"
recovery="relocate" depend="service:service1" depend_mode="hard"
priority="2">
            <script ref="scriptsresource"/>
        </service>
===

Whole cluster.conf is attached.

Thank you very much for any input.

--
Glisha
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: application/octet-stream
Size: 2018 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20100603/9d7cc2d6/attachment.obj>


More information about the Linux-cluster mailing list