Tyan 2885 w/ 3w-9xxx oopses [was: CONFIG_HIGHMEM64G=y for x86_64 SMP kernels]

Ed Hill ed at eh3.com
Thu Sep 2 15:58:30 UTC 2004


On Thu, 2004-09-02 at 05:34, Alan Cox wrote:
> 
> Likewise my 2885 is rock solid out of the box with FC2


I wish we could say the same.  Our new Tyan 2885 MB seems to be fine as
long as we don't create a lot of writes to the 3ware 9xxx controller. 
When we do, the machine oopses within ~15--30min.  The 3rd oops I was
not able to find in /var/log/messages but it looked similar to the
second.

Can someone please suggest whats causing these oopses and how we might
fix it?  We're willing to try just about anything including new kernels,
fresh installs, etc.

And many thanks for all the help so far!

Ed

===== 2 oopses writing to 3ware 9xxx RAID array =====


Sep  1 16:05:20 adams kernel: general protection fault: 0000 [1] SMP 
Sep  1 16:05:20 adams kernel: CPU 1 
Sep  1 16:05:20 adams kernel: Modules linked in: sata_sil(U) libata(U)
floppy(U) sg(U) nfsd(U) exportfs(U) md5(U) ipv6(U) parport_pc(U) lp(U)
parport(U) autofs4(U) nfs(U) lockd(U) sunrpc(U) iptable_filter(U)
ip_tables(U) tg3(U) ohci1394(U) ieee1394(U) dm_mod(U) ohci_hcd(U)
button(U) battery(U) asus_acpi(U) ac(U) ext3(U) jbd(U) 3w_9xxx(U)
sd_mod(U) scsi_mod(U)
Sep  1 16:05:20 adams kernel: Pid: 20158, comm: rsync Not tainted
2.6.8-1.533.edhillsmp
Sep  1 16:05:20 adams kernel: RIP: 0010:[<ffffffff80124861>]
<ffffffff80124861>{do_page_fault+1340}
Sep  1 16:05:20 adams kernel: RSP: 0018:0000010031e619d8  EFLAGS:
00010002
Sep  1 16:05:20 adams kernel: RAX: 0000e987f000f0e0 RBX:
0000000000000001 RCX: 000ffffffffff000
Sep  1 16:05:20 adams kernel: RDX: 0000e987f000f000 RSI:
0000010000000000 RDI: 0000010031e61a98
Sep  1 16:05:20 adams kernel: RBP: 000003010383a4b0 R08:
000003010383a4b0 R09: 0000000300000000
Sep  1 16:05:20 adams kernel: R10: 0000000000000000 R11:
0000000000000000 R12: 00000100b7720e90
Sep  1 16:05:20 adams kernel: R13: 0000000000000000 R14:
0000010031e61a98 R15: 00000100d2990940
Sep  1 16:05:20 adams kernel: FS:  0000002a9557bd40(0000)
GS:ffffffff804ee280(0000) knlGS:000000000082e560
Sep  1 16:05:20 adams kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Sep  1 16:05:20 adams kernel: CR2: 000003010383a4b0 CR3:
0000000004862000 CR4: 00000000000006e0
Sep  1 16:05:20 adams kernel: Process rsync (pid: 20158, threadinfo
0000010031e60000, task 00000100b7720e90)
Sep  1 16:05:20 adams kernel: Stack: 00000101050d4e00 0000000000030001
0000010031e61a78 000001002831d130 
Sep  1 16:05:20 adams kernel:        000001002831d240 0000000000000246
00000100d79b7ea0 ffffffff801801b0 
Sep  1 16:05:20 adams kernel:        0000000000000246 0000000000000246 
Sep  1 16:05:20 adams kernel: Call
Trace:<ffffffff801801b0>{__find_get_block+181}
<ffffffff801801e2>{__getblk+43} 
Sep  1 16:05:20 adams kernel:        <ffffffff80111ffd>{error_exit+0}
<ffffffff801348d5>{__wake_up_common+40} 
Sep  1 16:05:20 adams kernel:       
<ffffffff801a2174>{__mark_inode_dirty+40}
<ffffffff80134980>{__wake_up+102} 
Sep  1 16:05:20 adams kernel:       
<ffffffff8015b358>{generic_file_buffered_write+1085} 
Sep  1 16:05:20 adams kernel:       
<ffffffff8015b755>{generic_file_aio_write_nolock+741} 
Sep  1 16:05:20 adams kernel:       
<ffffffff8015b8c0>{generic_file_aio_write+126}
<ffffffffa00471ea>{:ext3:ext3_file_write+22} 
Sep  1 16:05:20 adams kernel:       
<ffffffff8017d200>{do_sync_write+173} <ffffffff80191ed4>{__pollwait+0} 
Sep  1 16:05:20 adams kernel:       
<ffffffff801369e2>{autoremove_wake_function+0}
<ffffffff8019288a>{sys_select+1177} 
Sep  1 16:05:20 adams kernel:        <ffffffff8017d2fc>{vfs_write+208}
<ffffffff8017d3e4>{sys_write+69} 
Sep  1 16:05:20 adams kernel:        <ffffffff80111562>{system_call+126}
Sep  1 16:05:20 adams kernel: 
Sep  1 16:05:20 adams kernel: Code: 48 8b 14 06 f6 c2 01 0f 84 9c fc ff
ff 48 89 e8 48 21 ca 48 
Sep  1 16:05:20 adams kernel: RIP <ffffffff80124861>{do_page_fault+1340}
RSP <0000010031e619d8>


Sep  1 18:04:34 adams kernel: Unable to handle kernel paging request at
000003010383aff0 RIP: 
Sep  1 18:04:34 adams kernel: <ffffffff80130a9e>{__wake_up_common+37}
Sep  1 18:04:34 adams kernel: PML4 0 
Sep  1 18:04:34 adams kernel: Oops: 0000 [1] SMP 
Sep  1 18:04:34 adams kernel: CPU 1 
Sep  1 18:04:34 adams kernel: Modules linked in: 3w_9xxx nfsd exportfs
md5 ipv6 parport_pc lp parport autofs4 nfs lockd sunrpc iptable_filter
ip_tables tg3 ohci1394 ieee1394 dm_mod ohci_hcd button battery asus_acpi
ac ext3 jbd sd_mod scsi_mod
Sep  1 18:04:34 adams kernel: Pid: 62, comm: kswapd1 Not tainted
2.6.7-1.515smp
Sep  1 18:04:34 adams kernel: RIP: 0010:[<ffffffff80130a9e>]
<ffffffff80130a9e>{__wake_up_common+37}
Sep  1 18:04:34 adams kernel: RSP: 0018:00000101ffef7ae8  EFLAGS:
00010046
Sep  1 18:04:34 adams kernel: RAX: 000001010383aff0 RBX:
0000000000000001 RCX: 0000000000000000
Sep  1 18:04:34 adams kernel: RDX: 0000000000000001 RSI:
0000000000000003 RDI: 000001010383afe8
Sep  1 18:04:34 adams kernel: RBP: 00000101ffef7b18 R08:
000003010383aff0 R09: 00000101ffef7a30
Sep  1 18:04:34 adams kernel: R10: 0000000000000206 R11:
0000010102bedb80 R12: 000001010383afe8
Sep  1 18:04:34 adams kernel: R13: 00000100d4ea2e50 R14:
00000100d4ea2e50 R15: 0000010102bedb80
Sep  1 18:04:34 adams kernel: FS:  0000002a9557e800(0000)
GS:ffffffff8047b600(0000) knlGS:0000000000000000
Sep  1 18:04:34 adams kernel: CS:  0010 DS: 0018 ES: 0018 CR0:
000000008005003b
Sep  1 18:04:34 adams kernel: CR2: 000003010383aff0 CR3:
0000000004862000 CR4: 00000000000006e0
Sep  1 18:04:34 adams kernel: Process kswapd1 (pid: 62, threadinfo
00000101ffef6000, task 00000100d9eb9210)
Sep  1 18:04:34 adams kernel: Stack: 0000000300000000 000001010383afe8
0000010100000780 00000100d4ea2e50 
Sep  1 18:04:34 adams kernel:        00000100d4ea2e50 00000101ffef7e58
00000101ffef7b38 ffffffff80130afb 
Sep  1 18:04:34 adams kernel:        0000000000000206 0000000000000001 
Sep  1 18:04:34 adams kernel: Call
Trace:<ffffffff80130afb>{__wake_up+33}
<ffffffff8015a7d5>{shrink_zone+3713} 
Sep  1 18:04:34 adams kernel:       
<ffffffff802e972c>{__down_failed_trylock+53}
<ffffffff80159896>{shrink_slab+119} 
Sep  1 18:04:34 adams kernel:       
<ffffffff8015aeaa>{balance_pgdat+468}
<ffffffff8015acd8>{balance_pgdat+2} 
Sep  1 18:04:34 adams kernel:        <ffffffff8015b06f>{kswapd+257}
<ffffffff80132652>{autoremove_wake_function+0} 
Sep  1 18:04:34 adams kernel:       
<ffffffff80132652>{autoremove_wake_function+0}
<ffffffff8012f2ed>{schedule_tail+11} 
Sep  1 18:04:34 adams kernel:        <ffffffff80110d03>{child_rip+8}
<ffffffff8015af6e>{kswapd+0} 
Sep  1 18:04:34 adams kernel:        <ffffffff80110cfb>{child_rip+0} 
Sep  1 18:04:34 adams kernel: 
Sep  1 18:04:34 adams kernel: Code: 4d 8b 30 49 39 c0 74 28 49 8d 78 e8
45 8b 68 e8 4c 89 f9 8b 
Sep  1 18:04:34 adams kernel: RIP
<ffffffff80130a9e>{__wake_up_common+37} RSP <00000101ffef7ae8>
Sep  1 18:04:34 adams kernel: CR2: 000003010383aff0


===== 2 oopses writing to 3ware 9xxx RAID array =====


-- 
Edward H. Hill III, PhD
office:  MIT Dept. of EAPS;  Rm 54-1424;  77 Massachusetts Ave.
             Cambridge, MA 02139-4307
emails:  eh3 at mit.edu                ed at eh3.com
URLs:    http://web.mit.edu/eh3/    http://eh3.com/
phone:   617-253-0098
fax:     617-253-4464
-------------- next part --------------


Sep  1 16:05:20 adams kernel: general protection fault: 0000 [1] SMP 
Sep  1 16:05:20 adams kernel: CPU 1 
Sep  1 16:05:20 adams kernel: Modules linked in: sata_sil(U) libata(U) floppy(U) sg(U) nfsd(U) exportfs(U) md5(U) ipv6(U) parport_pc(U) lp(U) parport(U) autofs4(U) nfs(U) lockd(U) sunrpc(U) iptable_filter(U) ip_tables(U) tg3(U) ohci1394(U) ieee1394(U) dm_mod(U) ohci_hcd(U) button(U) battery(U) asus_acpi(U) ac(U) ext3(U) jbd(U) 3w_9xxx(U) sd_mod(U) scsi_mod(U)
Sep  1 16:05:20 adams kernel: Pid: 20158, comm: rsync Not tainted 2.6.8-1.533.edhillsmp
Sep  1 16:05:20 adams kernel: RIP: 0010:[<ffffffff80124861>] <ffffffff80124861>{do_page_fault+1340}
Sep  1 16:05:20 adams kernel: RSP: 0018:0000010031e619d8  EFLAGS: 00010002
Sep  1 16:05:20 adams kernel: RAX: 0000e987f000f0e0 RBX: 0000000000000001 RCX: 000ffffffffff000
Sep  1 16:05:20 adams kernel: RDX: 0000e987f000f000 RSI: 0000010000000000 RDI: 0000010031e61a98
Sep  1 16:05:20 adams kernel: RBP: 000003010383a4b0 R08: 000003010383a4b0 R09: 0000000300000000
Sep  1 16:05:20 adams kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 00000100b7720e90
Sep  1 16:05:20 adams kernel: R13: 0000000000000000 R14: 0000010031e61a98 R15: 00000100d2990940
Sep  1 16:05:20 adams kernel: FS:  0000002a9557bd40(0000) GS:ffffffff804ee280(0000) knlGS:000000000082e560
Sep  1 16:05:20 adams kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Sep  1 16:05:20 adams kernel: CR2: 000003010383a4b0 CR3: 0000000004862000 CR4: 00000000000006e0
Sep  1 16:05:20 adams kernel: Process rsync (pid: 20158, threadinfo 0000010031e60000, task 00000100b7720e90)
Sep  1 16:05:20 adams kernel: Stack: 00000101050d4e00 0000000000030001 0000010031e61a78 000001002831d130 
Sep  1 16:05:20 adams kernel:        000001002831d240 0000000000000246 00000100d79b7ea0 ffffffff801801b0 
Sep  1 16:05:20 adams kernel:        0000000000000246 0000000000000246 
Sep  1 16:05:20 adams kernel: Call Trace:<ffffffff801801b0>{__find_get_block+181} <ffffffff801801e2>{__getblk+43} 
Sep  1 16:05:20 adams kernel:        <ffffffff80111ffd>{error_exit+0} <ffffffff801348d5>{__wake_up_common+40} 
Sep  1 16:05:20 adams kernel:        <ffffffff801a2174>{__mark_inode_dirty+40} <ffffffff80134980>{__wake_up+102} 
Sep  1 16:05:20 adams kernel:        <ffffffff8015b358>{generic_file_buffered_write+1085} 
Sep  1 16:05:20 adams kernel:        <ffffffff8015b755>{generic_file_aio_write_nolock+741} 
Sep  1 16:05:20 adams kernel:        <ffffffff8015b8c0>{generic_file_aio_write+126} <ffffffffa00471ea>{:ext3:ext3_file_write+22} 
Sep  1 16:05:20 adams kernel:        <ffffffff8017d200>{do_sync_write+173} <ffffffff80191ed4>{__pollwait+0} 
Sep  1 16:05:20 adams kernel:        <ffffffff801369e2>{autoremove_wake_function+0} <ffffffff8019288a>{sys_select+1177} 
Sep  1 16:05:20 adams kernel:        <ffffffff8017d2fc>{vfs_write+208} <ffffffff8017d3e4>{sys_write+69} 
Sep  1 16:05:20 adams kernel:        <ffffffff80111562>{system_call+126} 
Sep  1 16:05:20 adams kernel: 
Sep  1 16:05:20 adams kernel: Code: 48 8b 14 06 f6 c2 01 0f 84 9c fc ff ff 48 89 e8 48 21 ca 48 
Sep  1 16:05:20 adams kernel: RIP <ffffffff80124861>{do_page_fault+1340} RSP <0000010031e619d8>


Sep  1 18:04:34 adams kernel: Unable to handle kernel paging request at 000003010383aff0 RIP: 
Sep  1 18:04:34 adams kernel: <ffffffff80130a9e>{__wake_up_common+37}
Sep  1 18:04:34 adams kernel: PML4 0 
Sep  1 18:04:34 adams kernel: Oops: 0000 [1] SMP 
Sep  1 18:04:34 adams kernel: CPU 1 
Sep  1 18:04:34 adams kernel: Modules linked in: 3w_9xxx nfsd exportfs md5 ipv6 parport_pc lp parport autofs4 nfs lockd sunrpc iptable_filter ip_tables tg3 ohci1394 ieee1394 dm_mod ohci_hcd button battery asus_acpi ac ext3 jbd sd_mod scsi_mod
Sep  1 18:04:34 adams kernel: Pid: 62, comm: kswapd1 Not tainted 2.6.7-1.515smp
Sep  1 18:04:34 adams kernel: RIP: 0010:[<ffffffff80130a9e>] <ffffffff80130a9e>{__wake_up_common+37}
Sep  1 18:04:34 adams kernel: RSP: 0018:00000101ffef7ae8  EFLAGS: 00010046
Sep  1 18:04:34 adams kernel: RAX: 000001010383aff0 RBX: 0000000000000001 RCX: 0000000000000000
Sep  1 18:04:34 adams kernel: RDX: 0000000000000001 RSI: 0000000000000003 RDI: 000001010383afe8
Sep  1 18:04:34 adams kernel: RBP: 00000101ffef7b18 R08: 000003010383aff0 R09: 00000101ffef7a30
Sep  1 18:04:34 adams kernel: R10: 0000000000000206 R11: 0000010102bedb80 R12: 000001010383afe8
Sep  1 18:04:34 adams kernel: R13: 00000100d4ea2e50 R14: 00000100d4ea2e50 R15: 0000010102bedb80
Sep  1 18:04:34 adams kernel: FS:  0000002a9557e800(0000) GS:ffffffff8047b600(0000) knlGS:0000000000000000
Sep  1 18:04:34 adams kernel: CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Sep  1 18:04:34 adams kernel: CR2: 000003010383aff0 CR3: 0000000004862000 CR4: 00000000000006e0
Sep  1 18:04:34 adams kernel: Process kswapd1 (pid: 62, threadinfo 00000101ffef6000, task 00000100d9eb9210)
Sep  1 18:04:34 adams kernel: Stack: 0000000300000000 000001010383afe8 0000010100000780 00000100d4ea2e50 
Sep  1 18:04:34 adams kernel:        00000100d4ea2e50 00000101ffef7e58 00000101ffef7b38 ffffffff80130afb 
Sep  1 18:04:34 adams kernel:        0000000000000206 0000000000000001 
Sep  1 18:04:34 adams kernel: Call Trace:<ffffffff80130afb>{__wake_up+33} <ffffffff8015a7d5>{shrink_zone+3713} 
Sep  1 18:04:34 adams kernel:        <ffffffff802e972c>{__down_failed_trylock+53} <ffffffff80159896>{shrink_slab+119} 
Sep  1 18:04:34 adams kernel:        <ffffffff8015aeaa>{balance_pgdat+468} <ffffffff8015acd8>{balance_pgdat+2} 
Sep  1 18:04:34 adams kernel:        <ffffffff8015b06f>{kswapd+257} <ffffffff80132652>{autoremove_wake_function+0} 
Sep  1 18:04:34 adams kernel:        <ffffffff80132652>{autoremove_wake_function+0} <ffffffff8012f2ed>{schedule_tail+11} 
Sep  1 18:04:34 adams kernel:        <ffffffff80110d03>{child_rip+8} <ffffffff8015af6e>{kswapd+0} 
Sep  1 18:04:34 adams kernel:        <ffffffff80110cfb>{child_rip+0} 
Sep  1 18:04:34 adams kernel: 
Sep  1 18:04:34 adams kernel: Code: 4d 8b 30 49 39 c0 74 28 49 8d 78 e8 45 8b 68 e8 4c 89 f9 8b 
Sep  1 18:04:34 adams kernel: RIP <ffffffff80130a9e>{__wake_up_common+37} RSP <00000101ffef7ae8>
Sep  1 18:04:34 adams kernel: CR2: 000003010383aff0



More information about the fedora-devel-list mailing list