[Linux-cluster] OOPS

Edward Muller edwardam at interlix.com
Thu Nov 10 19:38:27 UTC 2005


On Wednesday 09 November 2005 10:58 pm, Wendy Cheng wrote:
> On Wed, 2005-11-09 at 20:55 -0600, Edward Muller wrote:
> > Anyone have any idea? Need more info?
>
> The Red Hat distribution doesn't include the AOE driver (yet). I would
> suggest trying the linux-kernel mailing list - folks there could have more
> ideas about this new driver.

Well it's one of two things AFAICT:

1) A bug in AOE. But using aoe straight to the block device (no lvm) doesn't
cause the OOPS at all, so I don't think it is, unless the clvm/cman/dlm/etc
stack is triggering a bug in aoe.

2) A bug in the clvm/cman/dlm/etc stack, triggered by aoe.
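
For anyone reading the trace quoted below, here is a minimal Python sketch of
how I read the "Oops: 0002" line. It assumes the oops is a page fault (the
"Unable to handle ..." line isn't in what I pasted) and that the number is the
usual i386 page-fault error code. The function name is just for illustration.

```python
# Decode the number on an i386 "Oops: NNNN" line, ASSUMING it is a
# page-fault error code. decode_oops_error_code is an illustrative name,
# not something from the kernel or any library.

def decode_oops_error_code(code: int) -> dict:
    return {
        # bit 0: 0 = page not present, 1 = protection violation
        "cause": "protection violation" if code & 0x1 else "page not present",
        # bit 1: 0 = read access, 1 = write access
        "access": "write" if code & 0x2 else "read",
        # bit 2: 0 = kernel mode, 1 = user mode
        "mode": "user" if code & 0x4 else "kernel",
    }

# The oops line prints the code in hex.
info = decode_oops_error_code(int("0002", 16))
print(info)
```

If that assumption holds, 0002 is a kernel-mode write to a page that isn't
present.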

>
>
> -- Wendy
>
> > On Wednesday 09 November 2005 12:46 am, Edward Muller wrote:
> > > Using version 1.01.00 of the cluster software with version 2.01.09 of
> > > lvm/clvm on an aoe device I get the following error, repeatable under
> > > high IO (rsync triggers it). When I use the same aoe device with a
> > > normal filesystem (i.e. not through lvm) the oops does not occur.
> > >
> > > Also the OOPS goes away when I disable highmem for my kernel.
> > >
> > > I've been able to create this oops on 2.6.13-gentoo-r5 (yes, I'm using
> > > gentoo) and vanilla 2.6.14.
> > >
> > > If anyone needs any more information please let me know.
> > >
> > > Oops: 0002 [#1]
> > > Modules linked in: dlm cman ipv6 usbcore e1000 aoe
> > > CPU:    0
> > > EIP:    0060:[<f8c1e47d>]    Not tainted VLI
> > > EFLAGS: 00010006   (2.6.14)
> > > EIP is at aoecmd_ata_rsp+0x20d/0x370 [aoe]
> > > eax: 00000400   ebx: f631083c   ecx: 00000100   edx: 00000400
> > > esi: f595d036   edi: 00000400   ebp: f78d3000   esp: c050fdc0
> > > ds: 007b   es: 007b   ss: 0068
> > > Process swapper (pid: 0, threadinfo=c050e000 task=c04a3ba0)
> > > Stack: f78d3000 00000002 00000000 f7edd740 00000003 00000000 00000024 f7c70f80
> > >        00000286 c9eb0634 f7c503c0 00000086 f4500034 41e01640 f7c13900 d6aad39c
> > >        c1bec680 c013f4c3 c1bb1a00 f7c06f48 f7c06f48 d6aad39c 00000092 f7c06f48
> > > Call Trace:
> > >  [<c013f4c3>] mempool_free+0x33/0x80
> > >  [<c02d6e9c>] __freed_request+0x9c/0xb0
> > >  [<c02d4406>] elv_queue_empty+0x26/0x30
> > >  [<c037b6c5>] __alloc_skb+0x55/0x130
> > >  [<f8c1f1f4>] aoenet_rcv+0xd4/0x150 [aoe]
> > >  [<f8b2ee3b>] e1000_alloc_rx_buffers+0x7b/0x3b0 [e1000]
> > >  [<c038171c>] netif_receive_skb+0x22c/0x300
> > >  [<f8b2e5cd>] e1000_clean_rx_irq+0x17d/0x4b0 [e1000]
> > >  [<c02ac47d>] add_timer_randomness+0x15d/0x160
> > >  [<f8b2de8b>] e1000_clean+0x4b/0xf0 [e1000]
> > >  [<c0381954>] net_rx_action+0x74/0x100
> > >  [<c011fa1b>] __do_softirq+0x7b/0x90
> > >  [<c011fa56>] do_softirq+0x26/0x30
> > >  [<c0104dce>] do_IRQ+0x1e/0x30
> > >  [<c010367e>] common_interrupt+0x1a/0x20
> > >  [<c0100e13>] mwait_idle+0x33/0x50
> > >  [<c02a2466>] acpi_processor_idle+0x100/0x299
> > >  [<c0100d50>] cpu_idle+0x50/0x60
> > >  [<c051085d>] start_kernel+0x17d/0x1c0
> > >  [<c05103a0>] unknown_bootoption+0x0/0x1e0
> > > Code: 00 00 00 e9 53 ff ff ff 0f b6 51 02 8b 84 24 bc 00 00 00 8b 48 58
> > > c1 e2 09 8d 41 dc 39 d0 72 1b 89 d1 8b 7b 0c 83 c6 0c c1 e9 02 <f3> a5
> > > 89 d1 83 e1 03 74 02 f3 a4 e9 c3 fe ff ff 89 4c 24 04 c7
> > >  <0>Kernel panic - not syncing: Fatal exception in interrupt

-- 
Edward Muller - Interlix
edwardam at interlix.com
417-862-0573
PGP Key: http://interlix.com/Members/edwardam/pgpkeys