[Linux-cluster] kernel oops

Adam Cassar adam.cassar at netregistry.com.au
Thu Aug 26 07:19:21 UTC 2004


I also got quite a few of these:

dlm: dude: restbl_rsb_update_recv rsb not found 2447
dlm: dude: restbl_rsb_update_recv rsb not found 2448
dlm: dude: restbl_rsb_update_recv rsb not found 2449
dlm: dude: restbl_rsb_update_recv rsb not found 2450
dlm: dude: restbl_rsb_update_recv rsb not found 2451
dlm: dude: restbl_rsb_update_recv rsb not found 2452
dlm: dude: restbl_rsb_update_recv rsb not found 2453
dlm: dude: restbl_rsb_update_recv rsb not found 2454
dlm: dude: restbl_rsb_update_recv rsb not found 2455
dlm: dude: restbl_rsb_update_recv rsb not found 2456
dlm: dude: restbl_rsb_update_recv rsb not found 2457
dlm: dude: restbl_rsb_update_recv rsb not found 2458
dlm: dude: restbl_rsb_update_recv rsb not found 2459
dlm: dude: restbl_rsb_update_recv rsb not found 2460
dlm: dude: restbl_rsb_update_recv rsb not found 2461
dlm: dude: restbl_rsb_update_recv rsb not found 2462
dlm: dude: restbl_rsb_update_recv rsb not found 2463
dlm: dude: restbl_rsb_update_recv rsb not found 2464
dlm: dude: restbl_rsb_update_recv rsb not found 2465
dlm: dude: restbl_rsb_update_recv rsb not found 2466

On Thu, 2004-08-26 at 17:09, Adam Cassar wrote:
> I received the following trying to unmount a GFS partition.
> 
> I tried to unmount a GFS partition shared between three nodes and it
> hung.
> 
> I discovered that one of the nodes had become unresponsive so I manually
> ACKED the fence request and attempted to unmount. The following
> occurred:
> 
> Unable to handle kernel paging request at virtual address 001dae44
>  printing eip:
> f88cda59
> *pde = 00000000
> Oops: 0000 [#1]
> SMP 
> Modules linked in: lock_dlm dlm cman gfs lock_harness 8250 serial_core
> dm_mod
> CPU:    0
> EIP:    0060:[<f88cda59>]    Not tainted
> EFLAGS: 00010286   (2.6.8.1) 
> EIP is at name_to_directory_nodeid+0x15/0xf9 [dlm]
> eax: 001dae00   ebx: e8dc304c   ecx: c1b5ae3c   edx: e8dc304c
> esi: 00000000   edi: 001dae00   ebp: e8dc304c   esp: f7297ec0
> ds: 007b   es: 007b   ss: 0068
> Process dlm_recoverd (pid: 859, threadinfo=f7296000 task=f7125930)
> Stack: f706c000 f706c000 c1b5ae00 00000000 f88db235 00000000 e8dc304c
> c1b5ae00 
>        c1b5aef0 e8dc304c f88cdb5e 001dae00 e8dc30c5 00000018 f88dc727
> e8dc304c 
>        00000003 00000003 f706c000 00000000 001dae00 e8dc304c e8dc304c
> c1b5ae00 
> Call Trace:
>  [<f88db235>] rcom_send_message+0xe1/0x217 [dlm]
>  [<f88cdb5e>] get_directory_nodeid+0x21/0x25 [dlm]
>  [<f88dc727>] rsb_master_lookup+0x1a/0x126 [dlm]
>  [<f88dc9a6>] restbl_rsb_update+0x142/0x165 [dlm]
>  [<f88dcf04>] ls_reconfig+0xd5/0x220 [dlm]
>  [<f88ddf71>] dlm_recoverd+0x0/0x66 [dlm]
>  [<f88ddc99>] do_ls_recovery+0x16c/0x444 [dlm]
>  [<f88ddfbd>] dlm_recoverd+0x4c/0x66 [dlm]
>  [<c012aeb9>] kthread+0xb7/0xbd
>  [<c012ae02>] kthread+0x0/0xbd
>  [<c0102161>] kernel_thread_helper+0x5/0xb
> Code: 83 7f 44 01 74 65 8b 44 24 34 89 44 24 04 8b 44 24 30 89 04 
> Unable to handle kernel paging request at virtual address 001dae44
> f88cda59
> *pde = 00000000
> Oops: 0000 [#1]
> CPU:    0
> EIP:    0060:[<f88cda59>]    Not tainted
> Using defaults from ksymoops -t elf32-i386 -a i386
> EFLAGS: 00010286   (2.6.8.1) 
> eax: 001dae00   ebx: e8dc304c   ecx: c1b5ae3c   edx: e8dc304c
> esi: 00000000   edi: 001dae00   ebp: e8dc304c   esp: f7297ec0
> ds: 007b   es: 007b   ss: 0068
> Stack: f706c000 f706c000 c1b5ae00 00000000 f88db235 00000000 e8dc304c
> c1b5ae00 
>        c1b5aef0 e8dc304c f88cdb5e 001dae00 e8dc30c5 00000018 f88dc727
> e8dc304c 
>        00000003 00000003 f706c000 00000000 001dae00 e8dc304c e8dc304c
> c1b5ae00 
>  [<f88db235>] rcom_send_message+0xe1/0x217 [dlm]
>  [<f88cdb5e>] get_directory_nodeid+0x21/0x25 [dlm]
>  [<f88dc727>] rsb_master_lookup+0x1a/0x126 [dlm]
>  [<f88dc9a6>] restbl_rsb_update+0x142/0x165 [dlm]
>  [<f88dcf04>] ls_reconfig+0xd5/0x220 [dlm]
>  [<f88ddf71>] dlm_recoverd+0x0/0x66 [dlm]
>  [<f88ddc99>] do_ls_recovery+0x16c/0x444 [dlm]
>  [<f88ddfbd>] dlm_recoverd+0x4c/0x66 [dlm]
>  [<c012aeb9>] kthread+0xb7/0xbd
>  [<c012ae02>] kthread+0x0/0xbd
>  [<c0102161>] kernel_thread_helper+0x5/0xb
> Code: 83 7f 44 01 74 65 8b 44 24 34 89 44 24 04 8b 44 24 30 89 04 
> 
> 
> >>EIP; f88cda59 <pg0+38477a59/3fba8000>   <=====
> 
> >>eax; 001dae00 Before first symbol
> >>ebx; e8dc304c <pg0+2896d04c/3fba8000>
> >>ecx; c1b5ae3c <pg0+1704e3c/3fba8000>
> >>edx; e8dc304c <pg0+2896d04c/3fba8000>
> >>edi; 001dae00 Before first symbol
> >>ebp; e8dc304c <pg0+2896d04c/3fba8000>
> >>esp; f7297ec0 <pg0+36e41ec0/3fba8000>
> 
> Code;  f88cda59 <pg0+38477a59/3fba8000>
> 00000000 <_EIP>:
> Code;  f88cda59 <pg0+38477a59/3fba8000>   <=====
>    0:   83 7f 44 01               cmpl   $0x1,0x44(%edi)   <=====
> Code;  f88cda5d <pg0+38477a5d/3fba8000>
>    4:   74 65                     je     6b <_EIP+0x6b> f88cdac4
> <pg0+38477ac4/3fba8000>
> Code;  f88cda5f <pg0+38477a5f/3fba8000>
>    6:   8b 44 24 34               mov    0x34(%esp,1),%eax
> Code;  f88cda63 <pg0+38477a63/3fba8000>
>    a:   89 44 24 04               mov    %eax,0x4(%esp,1)
> Code;  f88cda67 <pg0+38477a67/3fba8000>
>    e:   8b 44 24 30               mov    0x30(%esp,1),%eax
> Code;  f88cda6b <pg0+38477a6b/3fba8000>
>   12:   89 04 00                  mov    %eax,(%eax,%eax,1)
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
-- 
Adam Cassar
IT Manager
NetRegistry Pty Ltd
______________________________________________
http://www.netregistry.com.au
Tel:  02 9699 6099          Fax:  02 9699 6088
PO Box 270    Broadway      NSW   2007

Domains |Business Email|Web Hosting|E-Commerce
Trusted  by  10,000s of  businesses since 1997
______________________________________________





More information about the Linux-cluster mailing list