[Linux-cluster] Service won't relocate after yum updates

Randy Brown randy.brown at noaa.gov
Thu Dec 6 15:41:17 UTC 2007


I just ran `yum update` on one of the nodes in my two-node cluster, and
now the nfs service won't relocate to the updated node.  Here are the
versions of the relevant packages on each node:

Node 1 (updated node)
[root@nfs1-cluster ~]# rpm -qa | grep -e cman -e lvm -e gfs -e rgmanager -e kernel
kmod-gfs-0.1.16-6.2.6.18_8.1.15.el5
lvm2-2.02.26-3.el5
kmod-gfs-0.1.19-7.el5_1.1
gfs-utils-0.1.12-1.el5
system-config-lvm-1.0.22-1.0.el5
cman-2.0.73-1.el5_1.1
lvm2-cluster-2.02.26-1.el5
rgmanager-2.0.31-1.el5.centos
gfs2-utils-0.1.38-1.el5
kernel-2.6.18-53.1.4.el5
kernel-2.6.18-8.1.15.el5
kernel-headers-2.6.18-53.1.4.el5

Node 2
[root@nfs2-cluster ~]# rpm -qa | grep -e cman -e lvm -e gfs -e rgmanager -e kernel
gfs2-utils-0.1.25-1.el5
kmod-gfs-0.1.16-5.2.6.18_8.1.14.el5
kmod-gfs-0.1.16-6.2.6.18_8.1.15.el5
system-config-lvm-1.0.22-1.0.el5
cman-2.0.64-1.0.1.el5
rgmanager-2.0.24-1.el5.centos
gfs-utils-0.1.11-3.el5
lvm2-2.02.16-3.el5
lvm2-cluster-2.02.16-3.el5
kernel-2.6.18-8.1.14.el5
kernel-2.6.18-8.1.15.el5
kernel-headers-2.6.18-8.1.15.el5
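
To make the differences between the two lists easier to see, here is a minimal Python sketch that splits each `rpm -qa` entry into name and version-release and reports the packages whose installed versions differ between the nodes. The function names are made up for illustration, and the name/version split simply assumes the usual name-version-release layout of the entries shown above.

```python
from collections import defaultdict

NODE1 = """
kmod-gfs-0.1.16-6.2.6.18_8.1.15.el5 lvm2-2.02.26-3.el5 kmod-gfs-0.1.19-7.el5_1.1
gfs-utils-0.1.12-1.el5 system-config-lvm-1.0.22-1.0.el5 cman-2.0.73-1.el5_1.1
lvm2-cluster-2.02.26-1.el5 rgmanager-2.0.31-1.el5.centos gfs2-utils-0.1.38-1.el5
kernel-2.6.18-53.1.4.el5 kernel-2.6.18-8.1.15.el5 kernel-headers-2.6.18-53.1.4.el5
"""

NODE2 = """
gfs2-utils-0.1.25-1.el5 kmod-gfs-0.1.16-5.2.6.18_8.1.14.el5
kmod-gfs-0.1.16-6.2.6.18_8.1.15.el5 system-config-lvm-1.0.22-1.0.el5
cman-2.0.64-1.0.1.el5 rgmanager-2.0.24-1.el5.centos gfs-utils-0.1.11-3.el5
lvm2-2.02.16-3.el5 lvm2-cluster-2.02.16-3.el5 kernel-2.6.18-8.1.14.el5
kernel-2.6.18-8.1.15.el5 kernel-headers-2.6.18-8.1.15.el5
"""


def parse_rpm_list(text):
    """Map package name -> set of 'version-release' strings."""
    packages = defaultdict(set)
    for entry in text.split():
        # Assumes name-version-release; the name itself may contain dashes.
        name, version, release = entry.rsplit("-", 2)
        packages[name].add(f"{version}-{release}")
    return packages


def differing_packages(node_a, node_b):
    """Return {name: (versions on a, versions on b)} where the sets differ."""
    a, b = parse_rpm_list(node_a), parse_rpm_list(node_b)
    return {
        name: (sorted(a.get(name, ())), sorted(b.get(name, ())))
        for name in sorted(set(a) | set(b))
        if a.get(name) != b.get(name)
    }


if __name__ == "__main__":
    for name, (v1, v2) in differing_packages(NODE1, NODE2).items():
        print(f"{name}: node1={v1} node2={v2}")
```

Packages that can have several versions installed at once (kernel, kmod-gfs) are compared as sets, so a package is reported whenever the two nodes do not hold exactly the same versions.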

The cluster will start on the updated node, but the nfs service will
not fail over to it as it did prior to the upgrade.  The messages I see in
/var/log/messages are:
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> Member 2 shutting down
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> Starting stopped service service:nfs
Dec  6 10:14:08 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> start on nfsclient "fs-shared-client" returned 2 (invalid argument(s))
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <warning> #68: Failed to start service:nfs; return value: 1
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> Stopping service service:nfs
Dec  6 10:14:08 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "rfcdata-client" returned 2 (invalid argument(s))
Dec  6 10:14:08 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "fs-shared-client" returned 2 (invalid argument(s))
Dec  6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is recovering
Dec  6 10:14:09 nfs1-cluster clurgmgrd[4455]: <warning> #71: Relocating failed service service:nfs
Dec  6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> Stopping service service:nfs
Dec  6 10:14:09 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "rfcdata-client" returned 2 (invalid argument(s))
Dec  6 10:14:09 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "fs-shared-client" returned 2 (invalid argument(s))
Dec  6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is stopped
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> Starting stopped service service:nfs
Dec  6 10:14:47 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> start on nfsclient "fs-shared-client" returned 2 (invalid argument(s))
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <warning> #68: Failed to start service:nfs; return value: 1
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> Stopping service service:nfs
Dec  6 10:14:47 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "rfcdata-client" returned 2 (invalid argument(s))
Dec  6 10:14:47 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified.
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "fs-shared-client" returned 2 (invalid argument(s))
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is recovering
Dec  6 10:14:47 nfs1-cluster clurgmgrd[4455]: <warning> #71: Relocating failed service service:nfs
Dec  6 10:14:49 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is now running on member 2
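
The excerpt repeats the same failure several times, so a tally per resource can help. The following is a rough Python sketch that counts the nfsclient start/stop operations that returned a non-zero code. The regular expression is written only against the message format visible above, and the function name is hypothetical.

```python
import re
from collections import Counter

# Matches e.g.: <notice> stop on nfsclient "rfcdata-client" returned 2
OPERATION = re.compile(
    r'<notice> (start|stop) on nfsclient "([^"]+)" returned (\d+)'
)

SAMPLE_LOG = """\
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> start on
nfsclient "fs-shared-client" returned 2 (invalid argument(s))
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient
"rfcdata-client" returned 2 (invalid argument(s))
Dec  6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient
"fs-shared-client" returned 2 (invalid argument(s))
"""


def failed_operations(log_text):
    """Count (operation, resource) pairs whose return code is not 0."""
    # Collapse all whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    return Counter(
        (op, name)
        for op, name, code in OPERATION.findall(flat)
        if code != "0"
    )


if __name__ == "__main__":
    for (op, name), count in failed_operations(SAMPLE_LOG).items():
        print(f"{op} {name}: {count} failure(s)")
```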

In system-config-cluster, the export path field of the nfsclient
resource is marked optional, and leaving it blank has not been a problem
in the past.  Has something changed in this regard?

Thanks in advance for any assistance,

Randy

Cluster.conf:
[root@nfs1-cluster cluster]# cat cluster.conf
<?xml version="1.0"?>
<cluster alias="ohd_cluster" config_version="120" name="ohd_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="60"/>
        <clusternodes>
                <clusternode name="nfs1-cluster.nws.noaa.gov" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="nfspower" port="8" switch="1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="nfs2-cluster.nws.noaa.gov" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="nfspower" port="7" switch="1"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <rm>
                <failoverdomains>
                        <failoverdomain name="nfs-failover" ordered="0" restricted="1">
                                <failoverdomainnode name="nfs1-cluster.nws.noaa.gov" priority="1"/>
                                <failoverdomainnode name="nfs2-cluster.nws.noaa.gov" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="140.90.91.244" monitor_link="1"/>
                        <clusterfs device="/dev/VolGroupFS/LogVol-shared" force_unmount="0" fsid="30647" fstype="gfs" mountpoint="/fs/shared" name="fs-shared" options="acl"/>
                        <nfsexport name="fs-shared-exp"/>
                        <nfsclient name="fs-shared-client" options="no_root_squash,rw" path="" target="140.90.91.0/24"/>
                        <clusterfs device="/dev/VolGroupTemp/LogVol-rfcdata" force_unmount="0" fsid="54233" fstype="gfs" mountpoint="/rfcdata" name="rfcdata" options="acl"/>
                        <nfsexport name="rfcdata-exp"/>
                        <nfsclient name="rfcdata-client" options="no_root_squash,rw" path="" target="140.90.91.0/24"/>
                </resources>
                <service autostart="1" domain="nfs-failover" name="nfs">
                        <clusterfs ref="fs-shared">
                                <nfsexport ref="fs-shared-exp">
                                        <nfsclient ref="fs-shared-client"/>
                                </nfsexport>
                        </clusterfs>
                        <ip ref="140.90.91.244"/>
                        <clusterfs ref="rfcdata">
                                <nfsexport ref="rfcdata-exp">
                                        <nfsclient ref="rfcdata-client"/>
                                </nfsexport>
                                <ip ref="140.90.91.244"/>
                        </clusterfs>
                </service>
        </rm>
        <fencedevices>
                <fencedevice agent="fence_apc" ipaddr="192.168.42.30" login="rbrown" name="nfspower" passwd="Tele4m32"/>
        </fencedevices>
</cluster>
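
Since the log complains "No export path specified" and both nfsclient resources above carry path="", here is a minimal Python sketch that lists nfsclient definitions whose path attribute is missing or empty. It only inspects the XML and does not establish whether the updated rgmanager treats an empty path differently from the older version. The function name is made up for illustration, and the embedded sample is an abbreviated excerpt of the configuration above.

```python
import xml.etree.ElementTree as ET

# Abbreviated excerpt of the cluster.conf shown above (resources only).
SAMPLE_CONF = """<cluster name="ohd_cluster">
  <rm>
    <resources>
      <nfsexport name="fs-shared-exp"/>
      <nfsclient name="fs-shared-client" options="no_root_squash,rw"
                 path="" target="140.90.91.0/24"/>
      <nfsexport name="rfcdata-exp"/>
      <nfsclient name="rfcdata-client" options="no_root_squash,rw"
                 path="" target="140.90.91.0/24"/>
    </resources>
    <service autostart="1" domain="nfs-failover" name="nfs">
      <nfsclient ref="fs-shared-client"/>
    </service>
  </rm>
</cluster>"""


def nfsclients_without_path(conf_xml):
    """Return names of nfsclient definitions with a missing or empty path."""
    root = ET.fromstring(conf_xml)
    names = []
    for client in root.iter("nfsclient"):
        # ref="..." entries only point at a definition; skip them.
        if "ref" in client.attrib:
            continue
        if not client.get("path", "").strip():
            names.append(client.get("name", "<unnamed>"))
    return names


if __name__ == "__main__":
    for name in nfsclients_without_path(SAMPLE_CONF):
        print(f"nfsclient {name!r} has no export path set")
```

To run it against a real file, read the file contents and pass them to the function in place of the sample string.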