log: BUG: soft lockup

ESGLinux esggrupos at gmail.com
Mon Oct 17 08:40:07 UTC 2011


Hi All,

I have a problem with a RHEL server. Sometimes the system stops and I get
this log in /var/log/messages


Oct 17 08:47:16 localhost kernel: BUG: soft lockup - CPU#0 stuck for 600s!
[kjournald:364]
Oct 17 08:47:16 localhost kernel: CPU 0:
Oct 17 08:47:16 localhost kernel: Modules linked in: nfsd exportfs nfs_acl
auth_rpcgss ipv6 xfrm_nalgo crypto_api autofs4 hidp rfcomm l2cap bluetooth
lockd sunrpc dm_multipath scsi_dh video backlight sbs power_meter hwmon
i2c_ec dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc
lp parport floppy joydev snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy
snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss
snd_mixer_oss snd_pcm snd_timer snd soundcore snd_page_alloc ide_cd
i2c_piix4 pcspkr serio_raw cdrom i2c_core virtio_pci virtio_net virtio_ring
dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero
dm_mirror dm_log dm_mod virtio_blk virtio ata_piix libata sd_mod scsi_mod
ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Oct 17 08:47:16 localhost kernel: Pid: 364, comm: kjournald Not tainted
2.6.18-194.26.1.el5 #1
Oct 17 08:47:16 localhost kernel: RIP: 0010:[<ffffffff8001240b>]
 [<ffffffff8001240b>] __do_softirq+0x51/0x133
Oct 17 08:47:16 localhost kernel: RSP: 0018:ffffffff80446f60  EFLAGS:
00000206
Oct 17 08:47:16 localhost kernel: RAX: 0000000000000042 RBX:
0000000000000042 RCX: 0000000000000000
Oct 17 08:47:16 localhost kernel: RDX: ffff81007c6b3fd8 RSI:
0000000000000080 RDI: ffff810037fe10c0
Oct 17 08:47:16 localhost kernel: RBP: ffffffff80446ee0 R08:
0000000000000001 R09: ffffffff8005e2fc
Oct 17 08:47:16 localhost kernel: R10: 0000000000000001 R11:
ffffffff881b253d R12: ffffffff8005dc8e
Oct 17 08:47:16 localhost kernel: R13: 0000000000000046 R14:
ffffffff80078225 R15: ffffffff80446ee0
Oct 17 08:47:16 localhost kernel: FS:  0000000000000000(0000)
GS:ffffffff803ca000(0000) knlGS:0000000000000000
Oct 17 08:47:34 localhost kernel: CS:  0010 DS: 0018 ES: 0018 CR0:
000000008005003b
Oct 17 08:48:47 localhost kernel: CR2: 00002aaaaae099c0 CR3:
00000000769b4000 CR4: 00000000000006e0
Oct 17 08:49:43 localhost kernel:
Oct 17 08:50:33 localhost kernel: Call Trace:
Oct 17 08:51:53 localhost kernel:  <IRQ>  [<ffffffff8005e2fc>]
call_softirq+0x1c/0x28
Oct 17 08:52:55 localhost kernel:  [<ffffffff8006cb8a>] do_softirq+0x2c/0x85
Oct 17 08:53:08 localhost kernel:  [<ffffffff8005dc8e>]
apic_timer_interrupt+0x66/0x6c
Oct 17 08:53:18 localhost kernel:  <EOI>  [<ffffffff881b253d>]
:virtio_pci:vp_notify+0x0/0x1c
Oct 17 08:53:35 localhost kernel:  [<ffffffff8005a86a>]
generic_unplug_device+0x31/0x32
Oct 17 08:53:54 localhost kernel:  [<ffffffff88113c00>]
:dm_mod:dm_table_unplug_all+0x3f/0x83
Oct 17 08:54:45 localhost kernel:  [<ffffffff88112929>]
:dm_mod:dm_request+0x11d/0x124
Oct 17 08:55:27 localhost kernel:  [<ffffffff88111d80>]
:dm_mod:dm_unplug_all+0x1d/0x28
Oct 17 08:55:39 localhost kernel:  [<ffffffff80015561>]
sync_buffer+0x36/0x3f
Oct 17 08:55:39 localhost kernel:  [<ffffffff80063a16>]
__wait_on_bit+0x40/0x6e
Oct 17 08:55:41 localhost kernel:  [<ffffffff8001552b>] sync_buffer+0x0/0x3f
Oct 17 08:55:41 localhost kernel:  [<ffffffff80063ab0>]
out_of_line_wait_on_bit+0x6c/0x78
Oct 17 08:56:23 localhost kernel:  [<ffffffff800a0b44>]
wake_bit_function+0x0/0x23
Oct 17 08:57:06 localhost kernel:  [<ffffffff8003a97f>]
sync_dirty_buffer+0x96/0xcb
Oct 17 08:57:34 localhost kernel:  [<ffffffff8803401e>]
:jbd:journal_commit_transaction+0xbbc/0x1066
Oct 17 08:57:59 localhost kernel:  [<ffffffff8003da83>]
lock_timer_base+0x1b/0x3c
Oct 17 08:58:23 localhost kernel:  [<ffffffff880375d3>]
:jbd:kjournald+0xc1/0x213
Oct 17 08:58:38 localhost kernel:  [<ffffffff800a0b16>]
autoremove_wake_function+0x0/0x2e
Oct 17 08:59:24 localhost kernel:  [<ffffffff800a08fe>]
keventd_create_kthread+0x0/0xc4
Oct 17 08:59:55 localhost kernel:  [<ffffffff88037512>]
:jbd:kjournald+0x0/0x213
Oct 17 09:00:37 localhost kernel:  [<ffffffff800a08fe>]
keventd_create_kthread+0x0/0xc4
Oct 17 09:01:10 localhost kernel:  [<ffffffff8003290a>] kthread+0xfe/0x132
Oct 17 09:01:10 localhost kernel:  [<ffffffff8005dfb1>] child_rip+0xa/0x11
Oct 17 09:02:56 localhost kernel:  [<ffffffff800a08fe>]
keventd_create_kthread+0x0/0xc4
Oct 17 09:08:03 localhost kernel:  [<ffffffff8003280c>] kthread+0x0/0x132
Oct 17 09:08:03 localhost kernel:  [<ffffffff8005dfa7>] child_rip+0x0/0x11

anyone knows about this error?

Thanks in advance,

ESG



More information about the redhat-list mailing list