[Linux-cluster] GFS problem in plock.c?

tadashi.iwashita at uniadex.co.jp tadashi.iwashita at uniadex.co.jp
Tue Oct 24 11:22:50 UTC 2006


Hello All,

I experienced a system hang after running the latest LTP
(http://ltp.sourceforge.net/) test suite as part of durability testing
on our GFS environment. The setup is two DELL PE1950 servers running
CentOS 4.3 (IA32), with a DELL EMC AX150 configured as GFS shared
storage connected to both servers. We ran "./runltp -d /gfs3" on one
server while the other server remained idle. Here is an extract of
/var/log/messages from when the system stopped:

Oct  4 13:15:47 centos1 kernel: lock_dlm:  Assertion failed on line 500
of file
/home/buildcentos/rpmbuild/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/plock.c
Oct  4 13:15:47 centos1 kernel: lock_dlm:  assertion:  "!error"
Oct  4 13:15:47 centos1 kernel: lock_dlm:  time = 71704458
Oct  4 13:15:47 centos1 kernel: error=-11
Oct  4 13:15:47 centos1 kernel:
Oct  4 13:15:47 centos1 kernel: ------------[ cut here ]------------
Oct  4 13:15:47 centos1 kernel: kernel BUG at
/home/buildcentos/rpmbuild/BUILD/gfs-kernel-2.6.9-49/smp/src/dlm/plock.c:500!
Oct  4 13:15:47 centos1 kernel: invalid operand: 0000 [#1]
Oct  4 13:15:47 centos1 kernel: SMP
Oct  4 13:15:47 centos1 kernel: Modules linked in: parport_pc lp parport
autofs i2c_dev i2c_core lock_dlm(U) gfs(U) lock_harness(U) dlm(U)
cman(U) sunrpc dm_mirror dm_multipath dm_mod button battery ac md5
ipv6 joydev uhci_hcd ehci_hcd hw_random shpchp bnx2 ext3 jbd
qla6312 qla2xxx scsi_transport_fc megaraid_sas sd_mod scsi_mod
Oct  4 13:15:47 centos1 kernel: CPU:    0

Does anyone know whether this is a known problem? Any suggestions would
be appreciated.
I will most likely be able to reproduce this problem.
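
For reference, the "error=-11" in the log above appears to be a negated
errno value, and on Linux errno 11 is EAGAIN. The following is only a
small sketch for decoding such values on a Linux host. The helper name
describe_kernel_error is made up for illustration and is not part of
GFS or LTP:

```python
import errno
import os


def describe_kernel_error(code: int) -> str:
    """Map a kernel-style negative error code to 'NAME: message'.

    Hypothetical helper for illustration only. Kernel log lines such as
    "error=-11" usually carry the negated errno value, so the sign is
    dropped before the lookup.
    """
    num = abs(code)
    # errno.errorcode maps numeric values to symbolic names on this host.
    name = errno.errorcode.get(num, "UNKNOWN")
    return f"{name}: {os.strerror(num)}"


if __name__ == "__main__":
    # The value reported by lock_dlm in the log excerpt.
    print(describe_kernel_error(-11))
```

The numeric values are platform specific, so this should be run on a
Linux machine to match the log.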

Thanks,
Tadashi