[Linux-cluster] waiting for glock: pid does not exists
InterNetworX | Hostmaster
hostmaster at inwx.de
Mon Jan 10 12:48:05 UTC 2011
Hello,
we are trying to run OpenVZ on a GFS2. We copied a virtual machine to
the GFS2 storage (on node1) and added the service to cluster.conf. After
reloading the configuration on all nodes, rgmanager was trying to start
the virtual machine on node3. That is not working and now the machine is
hanging with a lock.
This is the result of the gfs2 hang analyzer:
There are 4 glocks with waiters.
node1, pid 2674 is waiting for glock 3/8389396, which is held by pid 6821
node3, pid 7024 is waiting for glock 3/8389396, which is held by pid 6821
node1, pid 10188 is waiting for glock 2/1857345, which is held by pid 6821
node3, pid 6772 is waiting for glock 2/1857345, which is held by pid 6821
node3, pid 7251 is waiting for glock 2/1857345, which is held by pid 6821
node3, pid 7289 is waiting for glock 2/1857345, which is held by pid 6821
carl, pid 23817 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 4243 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7055 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7090 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7129 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7176 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7230 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7270 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7306 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7345 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7369 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 7402 is waiting for glock 2/394135, which is held by pid 7024
node3, pid 6821 is waiting for glock 5/8425127, which is held by pid 7258
The pid 6821 is still running on node3:
root 6821 0.0 0.0 12216 696 ? D< 08:29 0:00 /bin/cp
-fp /etc/hosts /etc/hosts.12
The problem pid is 7258 - but I can not find this process running on any
node. Any idea what is the problem here?
Mario
More information about the Linux-cluster
mailing list