[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

lockd goes into D state on RHEL5 NFS (xen) server



Hi,

I have a fully patched (firmware plus RHEL distro) Dell PowerEdge 2950 with a moderate I/O load which is currently running a:

Linux name 2.6.18-92.1.10.el5xen #1 SMP Wed Jul 23 04:11:52 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux

kernel. My problem is that my lockd daemon goes into the D state:
ps auxwww | grep lockd
root 27 0.0 0.0 0 0 ? S< Aug07 0:00 [kblockd/0] root 28 0.0 0.0 0 0 ? S< Aug07 0:00 [kblockd/1] root 29 0.0 0.0 0 0 ? S< Aug07 0:00 [kblockd/2] root 30 0.0 0.0 0 0 ? S< Aug07 0:00 [kblockd/3]
root      5183  0.0  0.0      0     0 ?        D    Aug07   0:00 [lockd]

and as a result all NFS clients report that (dmesg says on the client side: lockd: server 192.168.8.2 not responding, still trying) and as all home and group areas are NFS mounted, everything appears to be frozen. I rule out Ethernet problems as all boxes have other services (samba, Apache) running properly and my Cisco switch reports no issues with Layer 2. :-(


Some details:
Amongst other things the server box NFS exports a list of large FSes (from 200 Gig to 3 Tbytes) to a group of Linux RHEL 4 workstations that also fully patched (Linux prometheus.uio.no 2.6.9-78.ELsmp #1 SMP Wed Jul 9 15:39:47 EDT 2008 i686 i686 i386 GNU/Linux) So, my NFS server export options:

/biotek/dias/storage/groupareas biotroll(rw,sync,no_root_squash) panoptis(rw,no_root_squash) intasclepius(rw,sync) biotin(rw,sync) intpoetas(rw,sync,no_root_squash) intfrigg(rw,sync) /biotek/dias/storage/groupareas/donaldson3 hfaistos(rw,sync) intasclepius(rw,sync) intprometheus(rw,sync) biotin(rw,sync) athena(rw,sync) cn1(rw,sync,no_root_squash) cn2(rw,sync)

and my RHEL mount options from the clients fstab:

intdias:/biotek/dias/storage/databases /biotek/dias/storage/databases nfs ro,bg,hard,intr 0 0 intdias:/biotek/dias/storage/groupareas/donaldson3 /biotek/dias/donaldson3 nfs rw,bg,hard,intr,rsize=32768,wsize=32768 0 0

Has anyone seen this issue and be able to advise on whether is a config issue?

GM

--
--
George Magklaras

Senior Computer Systems Engineer/UNIX Systems Administrator
EMBnet Technical Management Board The Biotechnology Centre of Oslo,
University of Oslo
http://folk.uio.no/georgios






[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]