[rhn-users] strange problem with NFS and/or automount and/or autofs

Brian C. Hill bchill at bch.net
Wed Jan 26 20:36:39 UTC 2005


	Hi Red Hatters,

	Sorry, this is a little long, but maybe this problem will jump
out at you.  This NFS problem is hard to describe, since it seems to
bounce around, but I will try. First, note that all systems involved
are completely up2date.

Basic architecture:

* 1 'server' system with 2 IP addresses running NFS with 2 exported
filesystems (at the same level - /data/a and /data/b)

* 4 'client' systems, each with 1 IP address using autofs via /net to
mount filesystems from the 'server' system running NFS

* all run Red Hat Enterprise ES 3.0

Problem:

We see a variety of problems after rebooting the NFS server that are
resolved in in one or more ways including:

* restarting autofs on the clients
* running exportfs -av on the NFS server
* restarting NFS on the server (even though it has just been rebooted)
* rebooting one or more (though seldom all) client systems

Since the steps necessary to restore access after a reboot vary from
episode to episode, the problem seems to be a moving target and is
really hard to debug. The only error message that seems to appear is
this one on the NFS server:

getfh failed: Operation not permitted

The only other odd symptoms that often presents themselves are:

* sometimes, when the system cannot be accessed via /net, it is still
possible to do a mount command by hand (though not always)

* sometimes, access to only one of the servers IP addresses fails via
/net while both of them are usable with a mount command

* sometimes, both /net and manual mount fail with a 'permission
denied', which is, again, sometimes fixed by an exportfs -a or -r on
the NFS server and sometimes not.

* sometimes /data/a can be accessed, but not /data/b and vice versa.

It is also strange that this usually only seems to affect some of the
clients (usually 2 out of 4). It does generally seem to be the same 2
that fail, but again, it is not always those 2 or only those 2.

I have exhausted many hours trying to figure out this problem (via Red
Hat faqs, google, etc.).

Clues?

Thanks. :)

Brian




More information about the rhn-users mailing list