[rhelv6-list] Host hung, hung_task_timeout_secs mentioned

Brian Long brilong at cisco.com
Wed Jun 15 15:13:17 UTC 2011


I ran into a server hang last night running 2.6.32-131.2.1.el6.x86_64.
I just installed the latest updates (RHEL 6.1) yesterday morning and I
experienced the hang during my Amanda backups.  I found a RHEL 5 bug
which mention similar problems but no fix:
https://bugzilla.redhat.com/show_bug.cgi?id=605444

I had Opsware monitoring the host and it went offline completely for
about 1 minute.  Has anyone else experienced this?  I'm running a LSI
8708EM2 RAID controller with battery-backed cache.

Jun 15 02:00:01 delenn xinetd[2082]: START: amanda pid=19385 from=x.x.x.x
Jun 15 02:00:31 delenn xinetd[2082]: EXIT: amanda status=0 pid=19385
duration=30(sec)
Jun 15 02:09:45 delenn kernel: INFO: task jbd2/dm-1-8:609 blocked for
more than 120 seconds.
Jun 15 02:09:45 delenn kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jun 15 02:09:45 delenn kernel: jbd2/dm-1-8   D 0000000000000000     0
609      2 0x00000000
Jun 15 02:09:45 delenn kernel: ffff8802629e1c10 0000000000000046
ffff8802629e1bd8 ffff8802629e1bd4
Jun 15 02:09:45 delenn kernel: ffff880263b1d340 ffff88026fc24300
ffff8800282f5f80 0000000103e3d1cb
Jun 15 02:09:45 delenn kernel: ffff880263b1d0b8 ffff8802629e1fd8
000000000000f598 ffff880263b1d0b8
Jun 15 02:09:45 delenn kernel: Call Trace:
Jun 15 02:09:45 delenn kernel: [<ffffffff811a3a90>] ? sync_buffer+0x0/0x50
Jun 15 02:09:45 delenn kernel: [<ffffffff814db013>] io_schedule+0x73/0xc0
Jun 15 02:09:45 delenn kernel: [<ffffffff811a3ad0>] sync_buffer+0x40/0x50
Jun 15 02:09:45 delenn kernel: [<ffffffff814db87f>] __wait_on_bit+0x5f/0x90
Jun 15 02:09:45 delenn kernel: [<ffffffff811a3a90>] ? sync_buffer+0x0/0x50
Jun 15 02:09:45 delenn kernel: [<ffffffff814db928>]
out_of_line_wait_on_bit+0x78/0x90
Jun 15 02:09:45 delenn kernel: [<ffffffff8108e140>] ?
wake_bit_function+0x0/0x50
Jun 15 02:09:45 delenn kernel: [<ffffffff811a3a86>]
__wait_on_buffer+0x26/0x30
Jun 15 02:09:45 delenn kernel: [<ffffffffa00847d1>]
jbd2_journal_commit_transaction+0x1121/0x1490 [jbd2]
Jun 15 02:09:45 delenn kernel: [<ffffffff810096d0>] ? __switch_to+0xd0/0x320
Jun 15 02:09:45 delenn kernel: [<ffffffff8107a11b>] ?
try_to_del_timer_sync+0x7b/0xe0
Jun 15 02:09:45 delenn kernel: [<ffffffffa0089948>]
kjournald2+0xb8/0x220 [jbd2]
Jun 15 02:09:45 delenn kernel: [<ffffffff8108e100>] ?
autoremove_wake_function+0x0/0x40
Jun 15 02:09:45 delenn kernel: [<ffffffffa0089890>] ?
kjournald2+0x0/0x220 [jbd2]
Jun 15 02:09:45 delenn kernel: [<ffffffff8108dd96>] kthread+0x96/0xa0
Jun 15 02:09:45 delenn kernel: [<ffffffff8100c1ca>] child_rip+0xa/0x20
Jun 15 02:09:45 delenn kernel: [<ffffffff8108dd00>] ? kthread+0x0/0xa0
Jun 15 02:09:45 delenn kernel: [<ffffffff8100c1c0>] ? child_rip+0x0/0x20
Jun 15 02:20:25 delenn auditd[1702]: Audit daemon rotating log files
Jun 15 02:43:41 delenn xinetd[2082]: START: amanda pid=20084 from=x.x.x.x
Jun 15 02:44:11 delenn xinetd[2082]: EXIT: amanda status=0 pid=20084
duration=30(sec)

/Brian/
-- 
       Brian Long                             |       |
       Corporate Security Programs Org    . | | | . | | | .
                                              '       '
                                              C I S C O




More information about the rhelv6-list mailing list