[Linux-cluster] waiting for glock: pid does not exists

Steven Whitehouse swhiteho at redhat.com
Mon Jan 10 14:07:46 UTC 2011


Hi,

On Mon, 2011-01-10 at 13:48 +0100, InterNetworX | Hostmaster wrote:
> Hello,
> 
> we are trying to run OpenVZ on a GFS2. We copied a virtual machine to
> the GFS2 storage (on node1) and added the service to cluster.conf. After
> reloading the configuration on all nodes, rgmanager was trying to start
> the virtual machine on node3. That is not working and now the machine is
> hanging with a lock.
> 
> This is the result of the gfs2 hang analyzer:
> 
> There are 4 glocks with waiters.
> node1, pid 2674 is waiting for glock 3/8389396, which is held by pid 6821
> node3, pid 7024 is waiting for glock 3/8389396, which is held by pid 6821
> 
> 
> node1, pid 10188 is waiting for glock 2/1857345, which is held by pid 6821
> node3, pid 6772 is waiting for glock 2/1857345, which is held by pid 6821
> node3, pid 7251 is waiting for glock 2/1857345, which is held by pid 6821
> node3, pid 7289 is waiting for glock 2/1857345, which is held by pid 6821
> 
> 
> carl, pid 23817 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 4243 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7055 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7090 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7129 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7176 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7230 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7270 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7306 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7345 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7369 is waiting for glock 2/394135, which is held by pid 7024
> node3, pid 7402 is waiting for glock 2/394135, which is held by pid 7024
> 
> 
> node3, pid 6821 is waiting for glock 5/8425127, which is held by pid 7258
> 
> 
> 
> The pid 6821 is still running on node3:
> 
> root      6821  0.0  0.0  12216   696 ?        D<   08:29   0:00 /bin/cp
> -fp /etc/hosts /etc/hosts.12
> 
> The problem pid is 7258 - but I can not find this process running on any
> node. Any idea what is the problem here?
> 
> Mario
> 
If pid 7528 has exited, then it is almost certainly not a problem. What
makes you think that this is the issue? Since it is a type 5 glock, it
should not be blocking access to anything,

Steve.

> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster





More information about the Linux-cluster mailing list