Broadcom BCM5751 gigabit kernel failure, tg3 driver

Anthony Messina amessina at messinet.com
Wed Apr 19 04:37:21 UTC 2006


i have an up-to-date fc5 install with an onboard gigabit ethernet chip:
02:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5751
Gigabit Ethernet PCI Express (rev 01)

i have, in the last few days been having the problem where the ethernet
connection dies when i use it at gigabit (default) speeds using the tg3
(default) driver.

Apr 14 17:00:01 linux-ws1 kernel: Uhhuh. NMI received. Dazed and
confused, but trying to continue
Apr 14 17:00:01 linux-ws1 kernel: You probably have a hardware problem
with your RAM chips
Apr 14 17:00:11 linux-ws1 kernel: BUG: soft lockup detected on CPU#0!
Apr 14 17:00:11 linux-ws1 kernel:
Apr 14 17:00:11 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:00:11 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:00:11 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:00:11 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:00:11 linux-ws1 kernel: EAX: 442f04f6 EBX: 442f04da ECX:
01398d60 EDX: 0000008e
Apr 14 17:00:11 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:00:11 linux-ws1 kernel: CR0: 8005003b CR2: b2f5333c CR3:
371ec3a0 CR4: 000006f0
Apr 14 17:00:11 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
 [<c01d5685>] __delay+0x9/0xa
Apr 14 17:00:11 linux-ws1 kernel:  [<f89c07a9>] tg3_readphy+0x63/0xc5
[tg3]     [<f89c197b>] tg3_setup_copper_phy+0x2e5/0x97a [tg3]
Apr 14 17:00:11 linux-ws1 kernel:  [<c01367f8>]
hrtimer_forward+0x1b/0x156     [<f89c2c06>] tg3_setup_phy+0xbf6/0xc7b [tg3]
Apr 14 17:00:11 linux-ws1 kernel:  [<c0127715>] getnstimeofday+0xd/0x21
    [<c0136dc8>] ktime_get_ts+0x16/0x44
Apr 14 17:00:14 linux-ws1 kernel:  [<c0136f33>] ktime_get+0xe/0x2f
[<c02f2228>] _spin_unlock_irq+0x5/0x7
Apr 14 17:00:14 linux-ws1 kernel:  [<f89c31ef>] tg3_timer+0x478/0x711
[tg3]     [<c012bbd1>] run_timer_softirq+0x128/0x187
Apr 14 17:00:14 linux-ws1 kernel:  [<f89c2d77>] tg3_timer+0x0/0x711
[tg3]     [<c01280f0>] __do_softirq+0x58/0xc2
Apr 14 17:00:14 linux-ws1 kernel:  [<c0105eef>] do_softirq+0x46/0x4e
 =======================
Apr 14 17:00:14 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24
Apr 14 17:00:14 linux-ws1 kernel:  [<c0102f2e>] mwait_idle+0x1f/0x33
 [<c0102ef6>] cpu_idle+0x8f/0xa8
Apr 14 17:00:14 linux-ws1 kernel:  [<c03c8715>] start_kernel+0x2fe/0x304
   <6>tg3: eth0: Link is down.
Apr 14 17:00:24 linux-ws1 kernel: BUG: soft lockup detected on CPU#0!
Apr 14 17:00:24 linux-ws1 kernel:
Apr 14 17:00:24 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:00:24 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:00:24 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:00:24 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:00:24 linux-ws1 kernel: EAX: 4f705293 EBX: 4f705262 ECX:
01398d60 EDX: 0000008e
Apr 14 17:00:24 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:00:24 linux-ws1 kernel: CR0: 8005003b CR2: 09479008 CR3:
37dd9b20 CR4: 000006f0
Apr 14 17:00:24 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
Apr 14 17:00:24 linux-ws1 kernel:  [<c01d5685>] __delay+0x9/0xa
[<f89c07a9>] tg3_readphy+0x63/0xc5 [tg3]
Apr 14 17:00:24 linux-ws1 kernel:  [<f89c1989>]
tg3_setup_copper_phy+0x2f3/0x97a [tg3]     [<c01367f8>]
hrtimer_forward+0x1b/0x156
Apr 14 17:00:24 linux-ws1 kernel:  [<f89c2c06>]
tg3_setup_phy+0xbf6/0xc7b [tg3]     [<c0127715>] getnstimeofday+0xd/0x21
Apr 14 17:00:27 linux-ws1 kernel:  [<c0136dc8>] ktime_get_ts+0x16/0x44
   [<c0136f33>] ktime_get+0xe/0x2f
Apr 14 17:00:27 linux-ws1 kernel:  [<c02f2228>] _spin_unlock_irq+0x5/0x7
    [<f89c31ef>] tg3_timer+0x478/0x711 [tg3]
Apr 14 17:00:27 linux-ws1 kernel:  [<c012bbd1>]
run_timer_softirq+0x128/0x187     [<f89c2d77>] tg3_timer+0x0/0x711 [tg3]
Apr 14 17:00:27 linux-ws1 kernel:  [<c01280f0>] __do_softirq+0x58/0xc2
   [<c0105eef>] do_softirq+0x46/0x4e
Apr 14 17:00:27 linux-ws1 kernel:  =======================
Apr 14 17:00:27 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24     [<c0102f2e>] mwait_idle+0x1f/0x33
Apr 14 17:00:27 linux-ws1 kernel:  [<c0102ef6>] cpu_idle+0x8f/0xa8
[<c03c8715>] start_kernel+0x2fe/0x304
Apr 14 17:00:37 linux-ws1 kernel: BUG: soft lockup detected on CPU#0!
Apr 14 17:00:37 linux-ws1 kernel:
Apr 14 17:00:37 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:00:37 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:00:37 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:00:37 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:00:37 linux-ws1 kernel: EAX: 5ac858ef EBX: 5ac85868 ECX:
01398d60 EDX: 0000008e
Apr 14 17:00:37 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:00:37 linux-ws1 kernel: CR0: 8005003b CR2: 08e43cc4 CR3:
371ec3a0 CR4: 000006f0
Apr 14 17:00:37 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
 [<c01d5685>] __delay+0x9/0xa
Apr 14 17:00:37 linux-ws1 kernel:  [<f89c07a9>] tg3_readphy+0x63/0xc5
[tg3]     [<f89c197b>] tg3_setup_copper_phy+0x2e5/0x97a [tg3]
Apr 14 17:00:37 linux-ws1 kernel:  [<c01367f8>]
hrtimer_forward+0x1b/0x156     [<f89c2c06>] tg3_setup_phy+0xbf6/0xc7b [tg3]
Apr 14 17:00:37 linux-ws1 kernel:  [<c0127715>] getnstimeofday+0xd/0x21
    [<c0136dc8>] ktime_get_ts+0x16/0x44
Apr 14 17:00:41 linux-ws1 kernel:  [<c0136f33>] ktime_get+0xe/0x2f
[<c02f2228>] _spin_unlock_irq+0x5/0x7
Apr 14 17:00:41 linux-ws1 kernel:  [<f89c31ef>] tg3_timer+0x478/0x711
[tg3]     [<c012bbd1>] run_timer_softirq+0x128/0x187
Apr 14 17:00:41 linux-ws1 kernel:  [<f89c2d77>] tg3_timer+0x0/0x711
[tg3]     [<c01280f0>] __do_softirq+0x58/0xc2
Apr 14 17:00:41 linux-ws1 kernel:  [<c0105eef>] do_softirq+0x46/0x4e
 =======================
Apr 14 17:00:41 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24
Apr 14 17:00:41 linux-ws1 kernel:  [<c0102f2e>] mwait_idle+0x1f/0x33
 [<c0102ef6>] cpu_idle+0x8f/0xa8
Apr 14 17:00:51 linux-ws1 kernel:  [<c03c8715>] start_kernel+0x2fe/0x304
   <3>BUG: soft lockup detected on CPU#0!
Apr 14 17:00:51 linux-ws1 kernel:
Apr 14 17:00:51 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:00:51 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:00:51 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:00:51 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:00:51 linux-ws1 kernel: EAX: 6629fc50 EBX: 6629fc42 ECX:
01398d60 EDX: 0000008e
Apr 14 17:00:51 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:00:51 linux-ws1 kernel: CR0: 8005003b CR2: 0955e954 CR3:
371ec3a0 CR4: 000006f0
Apr 14 17:00:51 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
Apr 14 17:00:51 linux-ws1 kernel:  [<c01d5685>] __delay+0x9/0xa
[<f89c07a9>] tg3_readphy+0x63/0xc5 [tg3]
Apr 14 17:00:51 linux-ws1 kernel:  [<f89c197b>]
tg3_setup_copper_phy+0x2e5/0x97a [tg3]     [<c01367f8>]
hrtimer_forward+0x1b/0x156
Apr 14 17:00:51 linux-ws1 kernel:  [<f89c2c06>]
tg3_setup_phy+0xbf6/0xc7b [tg3]     [<c0127715>] getnstimeofday+0xd/0x21
Apr 14 17:00:54 linux-ws1 kernel:  [<c0136dc8>] ktime_get_ts+0x16/0x44
   [<c0136f33>] ktime_get+0xe/0x2f
Apr 14 17:00:54 linux-ws1 kernel:  [<c02f2228>] _spin_unlock_irq+0x5/0x7
    [<f89c31ef>] tg3_timer+0x478/0x711 [tg3]
Apr 14 17:00:54 linux-ws1 kernel:  [<c012bbd1>]
run_timer_softirq+0x128/0x187     [<f89c2d77>] tg3_timer+0x0/0x711 [tg3]
Apr 14 17:00:54 linux-ws1 kernel:  [<c01280f0>] __do_softirq+0x58/0xc2
   [<c0105eef>] do_softirq+0x46/0x4e
Apr 14 17:00:54 linux-ws1 kernel:  =======================
Apr 14 17:00:54 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24     [<c0102f2e>] mwait_idle+0x1f/0x33
Apr 14 17:00:54 linux-ws1 kernel:  [<c0102ef6>] cpu_idle+0x8f/0xa8
[<c03c8715>] start_kernel+0x2fe/0x304
Apr 14 17:01:04 linux-ws1 kernel: BUG: soft lockup detected on CPU#0!
Apr 14 17:01:04 linux-ws1 kernel:
Apr 14 17:01:04 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:01:04 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:01:04 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:01:04 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:01:04 linux-ws1 kernel: EAX: 718b9fa7 EBX: 718b9f6e ECX:
01398d60 EDX: 0000008e
Apr 14 17:01:04 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:01:04 linux-ws1 kernel: CR0: 8005003b CR2: 094fa270 CR3:
003fc000 CR4: 000006f0
Apr 14 17:01:04 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
 [<c01d5685>] __delay+0x9/0xa
Apr 14 17:01:04 linux-ws1 kernel:  [<f89c07a9>] tg3_readphy+0x63/0xc5
[tg3]     [<f89c197b>] tg3_setup_copper_phy+0x2e5/0x97a [tg3]
Apr 14 17:01:04 linux-ws1 kernel:  [<f884ebeb>]
scsi_io_completion+0x215/0x3c3 [scsi_mod]     [<f89c2c06>]
tg3_setup_phy+0xbf6/0xc7b [tg3]
Apr 14 17:01:04 linux-ws1 kernel:  [<c011da09>]
try_to_wake_up+0x353/0x35d     [<f89c31ef>] tg3_timer+0x478/0x711 [tg3]
Apr 14 17:01:07 linux-ws1 kernel:  [<c012bbd1>]
run_timer_softirq+0x128/0x187     [<f89c2d77>] tg3_timer+0x0/0x711 [tg3]
Apr 14 17:01:07 linux-ws1 kernel:  [<c01280f0>] __do_softirq+0x58/0xc2
   [<c0105eef>] do_softirq+0x46/0x4e
Apr 14 17:01:07 linux-ws1 kernel:  =======================
Apr 14 17:01:07 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24     [<c0102f2e>] mwait_idle+0x1f/0x33
Apr 14 17:01:07 linux-ws1 kernel:  [<c0102ef6>] cpu_idle+0x8f/0xa8
[<c03c8715>] start_kernel+0x2fe/0x304
Apr 14 17:01:17 linux-ws1 kernel: BUG: soft lockup detected on CPU#0!
Apr 14 17:01:17 linux-ws1 kernel:
Apr 14 17:01:17 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:01:17 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:01:17 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:01:17 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:01:17 linux-ws1 kernel: EAX: 7cf52079 EBX: 7cf51fed ECX:
01398d60 EDX: 0000008e
Apr 14 17:01:17 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:01:17 linux-ws1 kernel: CR0: 8005003b CR2: 080a0440 CR3:
371ec3a0 CR4: 000006f0
Apr 14 17:01:17 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
 [<c01d5685>] __delay+0x9/0xa
Apr 14 17:01:17 linux-ws1 kernel:  [<f89c07a9>] tg3_readphy+0x63/0xc5
[tg3]     [<f89c197b>] tg3_setup_copper_phy+0x2e5/0x97a [tg3]
Apr 14 17:01:17 linux-ws1 kernel:  [<c01367f8>]
hrtimer_forward+0x1b/0x156     [<f89c2c06>] tg3_setup_phy+0xbf6/0xc7b [tg3]
Apr 14 17:01:18 linux-ws1 kernel:  [<c0127715>] getnstimeofday+0xd/0x21
    [<c0136dc8>] ktime_get_ts+0x16/0x44
Apr 14 17:01:21 linux-ws1 kernel:  [<c0136f33>] ktime_get+0xe/0x2f
[<c02f2228>] _spin_unlock_irq+0x5/0x7
Apr 14 17:01:21 linux-ws1 kernel:  [<f89c31ef>] tg3_timer+0x478/0x711
[tg3]     [<c012bbd1>] run_timer_softirq+0x128/0x187
Apr 14 17:01:21 linux-ws1 kernel:  [<f89c2d77>] tg3_timer+0x0/0x711
[tg3]     [<c01280f0>] __do_softirq+0x58/0xc2
Apr 14 17:01:21 linux-ws1 kernel:  [<c0105eef>] do_softirq+0x46/0x4e
 =======================
Apr 14 17:01:21 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24
Apr 14 17:01:21 linux-ws1 kernel:  [<c0102f2e>] mwait_idle+0x1f/0x33
 [<c0102ef6>] cpu_idle+0x8f/0xa8
Apr 14 17:01:31 linux-ws1 kernel:  [<c03c8715>] start_kernel+0x2fe/0x304
   <3>BUG: soft lockup detected on CPU#0!
Apr 14 17:01:31 linux-ws1 kernel:
Apr 14 17:01:31 linux-ws1 kernel: Pid: 0, comm:              swapper
Apr 14 17:01:31 linux-ws1 kernel: EIP: 0060:[<c0119558>] CPU: 0
Apr 14 17:01:31 linux-ws1 kernel: EIP is at hpet_readl+0x8/0x9
Apr 14 17:01:31 linux-ws1 kernel:  EFLAGS: 00200286    Not tainted
(2.6.15-1.2054_FC5smp #1)
Apr 14 17:01:31 linux-ws1 kernel: EAX: 8857a387 EBX: 8857a336 ECX:
01398d60 EDX: 0000008e
Apr 14 17:01:31 linux-ws1 kernel: ESI: 0000008e EDI: c0401f4c EBP:
c19f3f60 DS: 007b ES: 007b
Apr 14 17:01:31 linux-ws1 kernel: CR0: 8005003b CR2: 0950ae90 CR3:
003fc000 CR4: 000006f0
Apr 14 17:01:31 linux-ws1 kernel:  [<c0110ba4>] delay_hpet+0x24/0x2d
Apr 14 17:01:31 linux-ws1 kernel:  [<c01d5685>] __delay+0x9/0xa
[<f89c07a9>] tg3_readphy+0x63/0xc5 [tg3]
Apr 14 17:01:31 linux-ws1 kernel:  [<f89c197b>]
tg3_setup_copper_phy+0x2e5/0x97a [tg3]     [<c01367f8>]
hrtimer_forward+0x1b/0x156
Apr 14 17:01:31 linux-ws1 kernel:  [<f89c2c06>]
tg3_setup_phy+0xbf6/0xc7b [tg3]     [<c0127715>] getnstimeofday+0xd/0x21
Apr 14 17:01:34 linux-ws1 kernel:  [<c0136dc8>] ktime_get_ts+0x16/0x44
   [<c0136f33>] ktime_get+0xe/0x2f
Apr 14 17:01:34 linux-ws1 kernel:  [<c02f2228>] _spin_unlock_irq+0x5/0x7
    [<f89c31ef>] tg3_timer+0x478/0x711 [tg3]
Apr 14 17:01:34 linux-ws1 kernel:  [<c012bbd1>]
run_timer_softirq+0x128/0x187     [<f89c2d77>] tg3_timer+0x0/0x711 [tg3]
Apr 14 17:01:34 linux-ws1 kernel:  [<c01280f0>] __do_softirq+0x58/0xc2
   [<c0105eef>] do_softirq+0x46/0x4e
Apr 14 17:01:34 linux-ws1 kernel:  =======================
Apr 14 17:01:34 linux-ws1 kernel:  [<c01047dc>]
apic_timer_interrupt+0x1c/0x24     [<c0102f2e>] mwait_idle+0x1f/0x33
Apr 14 17:01:34 linux-ws1 kernel:  [<c0102ef6>] cpu_idle+0x8f/0xa8
[<c03c8715>] start_kernel+0x2fe/0x304
Apr 14 17:01:35 linux-ws1 kernel: tg3: tg3_abort_hw timed out for eth0,
TX_MODE_ENABLE will not clear MAC_TX_MODE=ffffffff
Apr 14 17:01:36 linux-ws1 kernel: ADDRCONF(NETDEV_UP): eth0: link is not
ready
Apr 14 17:01:40 linux-ws1 kernel: tg3: eth0: Link is up at 1000 Mbps,
full duplex.
Apr 14 17:01:40 linux-ws1 kernel: tg3: eth0: Flow control is on for TX
and on for RX.
Apr 14 17:01:40 linux-ws1 kernel: ADDRCONF(NETDEV_CHANGE): eth0: link
becomes ready
Apr 14 17:02:54 linux-ws1 kernel: tg3: eth0: Link is down.
Apr 14 17:03:50 linux-ws1 kernel: tg3: tg3_abort_hw timed out for eth0,
TX_MODE_ENABLE will not clear MAC_TX_MODE=ffffffff
Apr 14 17:03:50 linux-ws1 kernel: ADDRCONF(NETDEV_UP): eth0: link is not
ready
Apr 14 17:03:54 linux-ws1 kernel: tg3: eth0: Link is up at 1000 Mbps,
full duplex.
Apr 14 17:03:54 linux-ws1 kernel: tg3: eth0: Flow control is on for TX
and on for RX.
Apr 14 17:03:54 linux-ws1 kernel: ADDRCONF(NETDEV_CHANGE): eth0: link
becomes ready

i have used this system with fc4 with gigabit speeds and the tg3 driver
with no problems and also with fc5 until april 14 with no problems.  it
is also noteworthy that this lockup seems to happen more frequently when
i use thunderbird 1.5 and seems to happen at random.  sometimes it
occurs right at login; sometimes i can use the system for an hour before
it happens.

i have had to use ethtool to reduce the speed to 100mbit and the problem
resolves.  i would like to use it at 1000mbit again, however.

i do see the line where it says "You probably have a hardware problem
with your RAM chips" but i'm not sure to what this refers.  i do use the
"extended" boot method of the computer that tests ram prior to loading
the os and have not had a problem with those tests.

my kernel is: 2.6.16-1.2080_FC5smp on i686

where do i begin with this one?  thank you all for any help you can provide.

-- 
Anthony

Website: http://messinet.com
Gallery: http://gallery.messinet.com/main.php?g2_itemId=34

GnuPG Key / Fingerprint:
0xB0014A4E / 8F89 5E72 8DF0 BCF0 10BE 9967 92DC 35DC B001 4A4E

Registered Linux User #389089 - Get counted!: http://counter.li.org

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 191 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/fedora-list/attachments/20060418/b7cff085/attachment-0001.sig>


More information about the fedora-list mailing list