[Linux-cluster] NFS timeout once service has failover

Antoine Samson antoine.samson at etiam.com
Mon Apr 19 16:06:35 UTC 2010


I have a NFS clustered two nodes service using a VIP.
NFS clients are setup with following options: 
rw,timeo=10,retrans=3,retry=1,soft,intr

When NFS service triggers from one node to another, NFS clients can no 
longer acces NFS mount (sometimes it just comes back after a long period 
of time, much more than 90s NFS gracefull period, sometimes not).

NFS clients reports: xxxxx kernel: nfs: server xxxxxxxxx not responding, 
timed out

tcpdump shows that NFS server is responding (so there should not be any 
arp problem, ping comes back up as soon as NFS service has been started 
on new node)

Clients and servers are 2.6.18-164.el5 #1 SMP Tue Aug 18 15:51:48 EDT 
2009 x86_64 x86_64 x86_64 GNU/Linux

Thanks for your help,


-- 
Antoine




More information about the Linux-cluster mailing list