[Linux-cluster] umount hung single node

Daniel McNeil daniel at osdl.org
Wed Mar 16 01:33:49 UTC 2005


On Fri, 2005-03-11 at 00:27, David Teigland wrote:
> On Wed, Mar 09, 2005 at 05:26:34PM -0800, Daniel McNeil wrote:
> > I upgraded to 2.6.11 and the latest cvs a few days ago.
> > I started my tests on Mar  7 16:01 and they hung on Mar  9 12:34.
> > This is a 3 node cluster, but the test that hung only has 1
> > node with gfs mounted and it is trying to unmount:
> > 
> > root     12500 12494  0 12:34 ?        00:00:01 umount /gfs_stripe5
> > 
> > $ cat /proc/12500/wchan
> > .text.lock.ast
> > 
> > dlm_astd is spinning as top shows:
> > 
> > 12302 root      20  -5     0    0    0 R 99.9  0.0 280:28.23 dlm_astd
> 
> I've found and fixed a missing lock around ast_queue.  If that list
> was corrupted it could explain this problem which looks like infinite
> looping through that list.


I upgrade to the latest cvs and I hit the same problem again.

umount is hung:
root     24099 24093  0 Mar14 ?        00:00:02 umount /gfs_stripe5

and dlm_astd is spinning:
23895 root      20  -5     0    0    0 R 99.9  0.0   1479:34 dlm_astd

Any ideas?  Is there any debug info that would be useful?

Daniel




More information about the Linux-cluster mailing list