[Linux-cluster] umount hung single node

David Teigland teigland at redhat.com
Fri Mar 11 08:27:29 UTC 2005


On Wed, Mar 09, 2005 at 05:26:34PM -0800, Daniel McNeil wrote:
> I upgraded to 2.6.11 and the latest cvs a few days ago.
> I started my tests on Mar  7 16:01 and they hung on Mar  9 12:34.
> This is a 3 node cluster, but the test that hung only has 1
> node with gfs mounted and it is trying to unmount:
> 
> root     12500 12494  0 12:34 ?        00:00:01 umount /gfs_stripe5
> 
> $ cat /proc/12500/wchan
> .text.lock.ast
> 
> dlm_astd is spinning as top shows:
> 
> 12302 root      20  -5     0    0    0 R 99.9  0.0 280:28.23 dlm_astd

I've found and fixed a missing lock around ast_queue.  If that list
was corrupted it could explain this problem which looks like infinite
looping through that list.

-- 
Dave Teigland  <teigland at redhat.com>




More information about the Linux-cluster mailing list