[Linux-cluster] kernel oops

Adam Cassar adam.cassar at netregistry.com.au
Thu Aug 26 07:09:37 UTC 2004


I received the following trying to unmount a GFS partition.

I tried to unmount a GFS partition shared between three nodes and it
hung.

I discovered that one of the nodes had become unresponsive so I manually
ACKED the fence request and attempted to unmount. The following
occurred:

Unable to handle kernel paging request at virtual address 001dae44
 printing eip:
f88cda59
*pde = 00000000
Oops: 0000 [#1]
SMP 
Modules linked in: lock_dlm dlm cman gfs lock_harness 8250 serial_core
dm_mod
CPU:    0
EIP:    0060:[<f88cda59>]    Not tainted
EFLAGS: 00010286   (2.6.8.1) 
EIP is at name_to_directory_nodeid+0x15/0xf9 [dlm]
eax: 001dae00   ebx: e8dc304c   ecx: c1b5ae3c   edx: e8dc304c
esi: 00000000   edi: 001dae00   ebp: e8dc304c   esp: f7297ec0
ds: 007b   es: 007b   ss: 0068
Process dlm_recoverd (pid: 859, threadinfo=f7296000 task=f7125930)
Stack: f706c000 f706c000 c1b5ae00 00000000 f88db235 00000000 e8dc304c
c1b5ae00 
       c1b5aef0 e8dc304c f88cdb5e 001dae00 e8dc30c5 00000018 f88dc727
e8dc304c 
       00000003 00000003 f706c000 00000000 001dae00 e8dc304c e8dc304c
c1b5ae00 
Call Trace:
 [<f88db235>] rcom_send_message+0xe1/0x217 [dlm]
 [<f88cdb5e>] get_directory_nodeid+0x21/0x25 [dlm]
 [<f88dc727>] rsb_master_lookup+0x1a/0x126 [dlm]
 [<f88dc9a6>] restbl_rsb_update+0x142/0x165 [dlm]
 [<f88dcf04>] ls_reconfig+0xd5/0x220 [dlm]
 [<f88ddf71>] dlm_recoverd+0x0/0x66 [dlm]
 [<f88ddc99>] do_ls_recovery+0x16c/0x444 [dlm]
 [<f88ddfbd>] dlm_recoverd+0x4c/0x66 [dlm]
 [<c012aeb9>] kthread+0xb7/0xbd
 [<c012ae02>] kthread+0x0/0xbd
 [<c0102161>] kernel_thread_helper+0x5/0xb
Code: 83 7f 44 01 74 65 8b 44 24 34 89 44 24 04 8b 44 24 30 89 04 
Unable to handle kernel paging request at virtual address 001dae44
f88cda59
*pde = 00000000
Oops: 0000 [#1]
CPU:    0
EIP:    0060:[<f88cda59>]    Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010286   (2.6.8.1) 
eax: 001dae00   ebx: e8dc304c   ecx: c1b5ae3c   edx: e8dc304c
esi: 00000000   edi: 001dae00   ebp: e8dc304c   esp: f7297ec0
ds: 007b   es: 007b   ss: 0068
Stack: f706c000 f706c000 c1b5ae00 00000000 f88db235 00000000 e8dc304c
c1b5ae00 
       c1b5aef0 e8dc304c f88cdb5e 001dae00 e8dc30c5 00000018 f88dc727
e8dc304c 
       00000003 00000003 f706c000 00000000 001dae00 e8dc304c e8dc304c
c1b5ae00 
 [<f88db235>] rcom_send_message+0xe1/0x217 [dlm]
 [<f88cdb5e>] get_directory_nodeid+0x21/0x25 [dlm]
 [<f88dc727>] rsb_master_lookup+0x1a/0x126 [dlm]
 [<f88dc9a6>] restbl_rsb_update+0x142/0x165 [dlm]
 [<f88dcf04>] ls_reconfig+0xd5/0x220 [dlm]
 [<f88ddf71>] dlm_recoverd+0x0/0x66 [dlm]
 [<f88ddc99>] do_ls_recovery+0x16c/0x444 [dlm]
 [<f88ddfbd>] dlm_recoverd+0x4c/0x66 [dlm]
 [<c012aeb9>] kthread+0xb7/0xbd
 [<c012ae02>] kthread+0x0/0xbd
 [<c0102161>] kernel_thread_helper+0x5/0xb
Code: 83 7f 44 01 74 65 8b 44 24 34 89 44 24 04 8b 44 24 30 89 04 


>>EIP; f88cda59 <pg0+38477a59/3fba8000>   <=====

>>eax; 001dae00 Before first symbol
>>ebx; e8dc304c <pg0+2896d04c/3fba8000>
>>ecx; c1b5ae3c <pg0+1704e3c/3fba8000>
>>edx; e8dc304c <pg0+2896d04c/3fba8000>
>>edi; 001dae00 Before first symbol
>>ebp; e8dc304c <pg0+2896d04c/3fba8000>
>>esp; f7297ec0 <pg0+36e41ec0/3fba8000>

Code;  f88cda59 <pg0+38477a59/3fba8000>
00000000 <_EIP>:
Code;  f88cda59 <pg0+38477a59/3fba8000>   <=====
   0:   83 7f 44 01               cmpl   $0x1,0x44(%edi)   <=====
Code;  f88cda5d <pg0+38477a5d/3fba8000>
   4:   74 65                     je     6b <_EIP+0x6b> f88cdac4
<pg0+38477ac4/3fba8000>
Code;  f88cda5f <pg0+38477a5f/3fba8000>
   6:   8b 44 24 34               mov    0x34(%esp,1),%eax
Code;  f88cda63 <pg0+38477a63/3fba8000>
   a:   89 44 24 04               mov    %eax,0x4(%esp,1)
Code;  f88cda67 <pg0+38477a67/3fba8000>
   e:   8b 44 24 30               mov    0x30(%esp,1),%eax
Code;  f88cda6b <pg0+38477a6b/3fba8000>
  12:   89 04 00                  mov    %eax,(%eax,%eax,1)





More information about the Linux-cluster mailing list