[Linux-cluster] Segfault in GFS2

Thiago Henrique thiagoh at digirati.com.br
Mon Feb 21 15:19:41 UTC 2011


Hello,

I'm making a simple test with GFS2: I run simultaneously on both nodes,
a script that make write operations in the filesystem. It causes GFS2 to
dump a stack trace and fault.

I have a cluster configured with two nodes like this:
  Ubuntu 10.04.1 LTS 
  Kernel 2.6.35-23-generic 
  drbd8-source-2:8.3.7-1ubuntu2.1
  drbd8-utils-2:8.3.8.1-0ubuntu1
  cman-3.0.2-2ubuntu3.1 
  libcman3-3.0.2-2ubuntu3.1 
  gfs2-tools-3.0.2-2ubuntu3.1


Is this known? What other kind of information could be useful to help
find this issue?

Thanks,
--
Thiago Henrique

STACK TRACE:
################################################################################
/var/log/kern.log:
Feb 20 06:29:39 wcluster1 kernel: [142560.304056] INFO: task
gfs2_quotad:1813 blocked for more than 120 seconds.
Feb 20 06:29:39 wcluster1 kernel: [142560.304075] "echo 0
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Feb 20 06:29:39 wcluster1 kernel: [142560.304089] gfs2_quotad   D
f4887e0c     0  1813      2 0x00000000
Feb 20 06:29:39 wcluster1 kernel: [142560.304098]  f4887e1c 00000046
00000002 f4887e0c f5778744 c05d99e0 c08c3700 c08c3700
Feb 20 06:29:39 wcluster1 kernel: [142560.304114]  e70ea676 00008184
c08c3700 c08c3700 e70c4587 00008184 00000000 c08c3700
Feb 20 06:29:39 wcluster1 kernel: [142560.304123]  c08c3700 f545bf70
00000001 f4887e50 00000000 f4887e58 f4887e24 f85ab73d
Feb 20 06:29:39 wcluster1 kernel: [142560.304133] Call Trace:
Feb 20 06:29:39 wcluster1 kernel: [142560.304174]  [<f85ab73d>]
gfs2_glock_holder_wait+0xd/0x20 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304192]  [<c05c81fd>]
__wait_on_bit+0x4d/0x70
Feb 20 06:29:39 wcluster1 kernel: [142560.304203]  [<f85ab730>] ?
gfs2_glock_holder_wait+0x0/0x20 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304214]  [<f85ab730>] ?
gfs2_glock_holder_wait+0x0/0x20 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304220]  [<c05c82cb>]
out_of_line_wait_on_bit+0xab/0xc0
Feb 20 06:29:39 wcluster1 kernel: [142560.304231]  [<c0165f20>] ?
wake_bit_function+0x0/0x50
Feb 20 06:29:39 wcluster1 kernel: [142560.304242]  [<f85ac7f2>]
gfs2_glock_wait+0x32/0x40 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304254]  [<f85adcfe>]
gfs2_glock_nq+0x29e/0x350 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304266]  [<c012cec8>] ?
default_spin_lock_flags+0x8/0x10
Feb 20 06:29:39 wcluster1 kernel: [142560.304272]  [<c05c977f>] ?
_raw_spin_lock_irqsave+0x2f/0x50
Feb 20 06:29:39 wcluster1 kernel: [142560.304296]  [<f85c5efc>]
gfs2_statfs_sync+0x4c/0x1b0 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304304]  [<c0159319>] ?
del_timer_sync+0x19/0x20
Feb 20 06:29:39 wcluster1 kernel: [142560.304319]  [<f85c5ef4>] ?
gfs2_statfs_sync+0x44/0x1b0 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304323]  [<c0158a90>] ?
process_timeout+0x0/0x10
Feb 20 06:29:39 wcluster1 kernel: [142560.304337]  [<f85bdfce>]
quotad_check_timeo+0x3e/0xa0 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304343]  [<c016603f>] ?
finish_wait+0x4f/0x70
Feb 20 06:29:39 wcluster1 kernel: [142560.304356]  [<f85be23a>]
gfs2_quotad+0x20a/0x250 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304362]  [<c05c75aa>] ?
schedule+0x37a/0x7a0
Feb 20 06:29:39 wcluster1 kernel: [142560.304367]  [<c0165ed0>] ?
autoremove_wake_function+0x0/0x50
Feb 20 06:29:39 wcluster1 kernel: [142560.304380]  [<f85be030>] ?
gfs2_quotad+0x0/0x250 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304386]  [<c0165aa4>] kthread
+0x74/0x80
Feb 20 06:29:39 wcluster1 kernel: [142560.304390]  [<c0165a30>] ?
kthread+0x0/0x80
Feb 20 06:29:39 wcluster1 kernel: [142560.304397]  [<c010363e>]
kernel_thread_helper+0x6/0x10
################################################################################




More information about the Linux-cluster mailing list