[Linux-cluster] gfs locking up
kpreslan at redhat.com
Tue Sep 7 19:43:15 UTC 2004
On Tue, Sep 07, 2004 at 07:45:47PM +0100, Brian Marsden wrote:
> I have two machines, hestia and hroth1 which are running Red Hat
> Enterprise Linux 3.0 AS. The two machines are connected via Fibre Channel
> to the same storage group on an EMC CX300 array. I have compiled gfs
> using the latest src.rpm file that is available and the 2.4.21-15 kernel
> patches. All works fine on both nodes for a while (locking is fine, no
> corruption, manual fencing works if a machine dies) but then I
> experience lockups for processes that access any of the mounted gfs
> filesystems. It is hard to reproduce reliably and may occur at any time.
> Classic examples are ls /scratch (where /scratch is a gfs filesystem) or
> even mount or unmount. Once one process has locked up, no other gfs
> filesystems or any commands associated with them work. Only a reboot
> will solve the problem - restarting lock_gulm does not help (and has
> actually given me a kernel panic on one occasion).
When a lockup happens, two things would help in figuring it out: the
output of "ps aux" and the output of "gfs_tool lockdump /mountpoint",
each collected on both nodes.
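
One way to capture both outputs in a single step is sketched below. It
is only an illustration: the function name collect_gfs_diag, the
GFS_TOOL variable and the output file names are made up for this
example and are not part of GFS. The only commands it relies on are
"ps aux" and "gfs_tool lockdump", as named above. It has to be run
separately on each node.

```shell
#!/bin/sh
# Sketch: gather the lockup diagnostics on ONE node.
# collect_gfs_diag, GFS_TOOL and the file names are hypothetical
# helpers for this example, not part of GFS itself.

GFS_TOOL=${GFS_TOOL:-gfs_tool}   # override only for dry runs

collect_gfs_diag() {
    # usage: collect_gfs_diag OUTDIR MOUNTPOINT [MOUNTPOINT...]
    outdir=$1
    shift
    mkdir -p "$outdir" || return 1
    host=$(uname -n)
    status=0

    # Process list, so the hung processes and their states are visible
    ps aux > "$outdir/$host.ps_aux.txt" 2>&1 || status=1

    # One lock dump per GFS mount point
    for mp in "$@"; do
        # Turn the path into a file-name fragment: /mnt/a -> mnt_a
        name=$(printf '%s' "$mp" | sed -e 's|^/*||' -e 's|/|_|g')
        "$GFS_TOOL" lockdump "$mp" \
            > "$outdir/$host.lockdump.$name.txt" 2>&1 || status=1
    done

    # Non-zero if any of the commands above failed
    return $status
}
```

For the setup described in the report this could be called as
collect_gfs_diag /tmp/gfs-diag /scratch
on hestia and again on hroth1. The output directory should be on a
local, non-gfs filesystem, since anything touching the hung gfs mounts
may block as well.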
Ken Preslan <kpreslan at redhat.com>