[dm-devel] 2.6 Kernel Oops

christophe varoqui christophe.varoqui at free.fr
Mon Dec 27 10:14:08 UTC 2004


I could reproduce this only with the "queue_if_no_path" feature.
Ranaldo already showed that some places in that code are dusty.
A new patchset is due soon.

Alasdair, I guess there are still some unpinned references to sd_mod on
the map-load error code path. I saw the refcount go crazy when
trying to create a map with a path of the wrong size (dummy 1GB).

regards,
cvaroqui

On Friday, 24 December 2004 at 16:52 +0100, Hergen Lange wrote:
> Hello,
> 
> I am trying to set up multipath to an EMC Clariion device with the following
> infrastructure:
> Kernel 2.6.10-rc3 with the udm2 patch.
> multipath-tools 0.4.1
> 
> I have one qla2310 adapter "Firmware version 3.03.02 IPX, Driver version
> 8.00.00b21-k"
> which is zoned to SPA Port0 and SPB Port0 of the Clariion. So we have
> two paths to the LUN.
> 
> I set up multipath.conf with the following:
> 
> multipath {
>                wwid                 3600601601b121200540b8c1caf4ad911                     
>                alias                   yellow
>                path_grouping_policy    failover
>                path_checker             emc_clariion
>                path_selector           "round-robin 0"
>                getuid_callout  "/sbin/scsi_id -g -u -s %n"
>        }
>        device {
>                vendor                  "DGC     "
>                product                 "RAID 5          "
>                path_grouping_policy    failover
>                getuid_callout          "/sbin/scsi_id -g -u -s /block/%n"
>                path_checker            emc_clariion
>        }
> 
> When I start multipathd I get the following table:
> 
> yellow: 0 20971520 multipath 0 0 2 1 round-robin 0 1 1 8:64 1000 
> round-robin 0 1 1 8:80 1000
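
The table line above can be read field by field. What follows is only a
minimal sketch in Python, assuming the usual dm-multipath table layout
(feature count, hardware-handler count, number of path groups, initial
group, then per group: selector name, selector-argument count, path count,
per-path argument count, and finally each path's device and arguments).
The name parse_multipath_table is hypothetical and not part of
multipath-tools.

```python
SECTOR_SIZE = 512  # device-mapper tables count 512-byte sectors


def parse_multipath_table(line):
    """Split a one-line dm multipath table into its fields (illustrative only)."""
    name, rest = line.split(":", 1)
    tok = rest.split()
    start, length, target = int(tok[0]), int(tok[1]), tok[2]
    if target != "multipath":
        raise ValueError("not a multipath target: " + target)

    def take_counted(pos):
        # Read a count, then that many words; return the words and the new position.
        n = int(tok[pos])
        return tok[pos + 1:pos + 1 + n], pos + 1 + n

    features, pos = take_counted(3)
    hwhandler, pos = take_counted(pos)
    num_groups, next_group = int(tok[pos]), int(tok[pos + 1])
    pos += 2

    groups = []
    for _ in range(num_groups):
        selector = tok[pos]
        selector_args, pos = take_counted(pos + 1)
        num_paths, args_per_path = int(tok[pos]), int(tok[pos + 1])
        pos += 2
        paths = []
        for _ in range(num_paths):
            dev = tok[pos]
            path_args = tok[pos + 1:pos + 1 + args_per_path]
            pos += 1 + args_per_path
            paths.append((dev, path_args))
        groups.append({"selector": selector,
                       "selector_args": selector_args,
                       "paths": paths})

    return {"name": name.strip(), "start": start, "length": length,
            "size_bytes": length * SECTOR_SIZE, "features": features,
            "hwhandler": hwhandler, "next_group": next_group,
            "groups": groups}


# The table from the report, with the wrapped line joined back together.
TABLE = ("yellow: 0 20971520 multipath 0 0 2 1 round-robin 0 1 1 8:64 1000 "
         "round-robin 0 1 1 8:80 1000")

parsed = parse_multipath_table(TABLE)
for index, group in enumerate(parsed["groups"], start=1):
    print(index, group["selector"], group["paths"])
```

Read this way, the map has no features and no hardware handler, and two
path groups of one path each (8:64 and 8:80), which matches the
"multipath -v2 -l" output below. The length of 20971520 sectors is 10 GiB.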
> 
> multipath -v2 -l reports
> 
> yellow (3600601601b121200540b8c1caf4ad911)
> [size=10 GB][features="0"][hwhandler="0"]
> \_ round-robin 0 [enabled][first]
>  \_ 2:0:1:0 sde  8:64    [ready ][active]
> \_ round-robin 0 [enabled]
>  \_ 2:0:2:0 sdf  8:80    [ready ][active]
> 
> I can work with the device:
>     dd if=/dev/yellow of=/dev/null count=1000
> 
> When I disable one port on the switch to the Clariion, or unplug the cable from
> the SP (Clariion), the dd command goes to D+ state (ps ax)
> and I get messages like this:
> 
> Dec 23 14:35:31 lxssantm1 kernel: SCSI error : <2 0 1 0> return code = 
> 0x20000
> Dec 23 14:35:31 lxssantm1 kernel: end_request: I/O error, dev sde, sector 0
> Dec 23 14:35:31 lxssantm1 kernel: end_request: I/O error, dev sde, sector 8
> Dec 23 14:35:31 lxssantm1 multipathd: devmap event on yellow
> Dec 23 14:35:45 lxssantm1 multipathd: 8:64 : emc_clariion_checker: query 
> command indicates error
> Dec 23 14:35:45 lxssantm1 kernel: SCSI error : <2 0 1 0> return code = 
> 0x10000
> Dec 23 14:35:45 lxssantm1 last message repeated 6 times
> Dec 23 14:35:45 lxssantm1 kernel: device-mapper: multipath_message 
> received. argc: 2
> Dec 23 14:35:45 lxssantm1 kernel: Unable to handle kernel paging request 
> at virtual address f8d44080
> Dec 23 14:35:45 lxssantm1 kernel:  printing eip:
> Dec 23 14:35:45 lxssantm1 kernel: f8f47e43
> Dec 23 14:35:45 lxssantm1 kernel: *pde = 1fc86067
> Dec 23 14:35:45 lxssantm1 kernel: Oops: 0000 [#1]
> Dec 23 14:35:45 lxssantm1 kernel: SMP
> Dec 23 14:35:45 lxssantm1 kernel: Modules linked in: dm_round_robin
> dm_multipath joydev st sr_mod ide_cd cdrom nvram edd speedstep_lib
> freq_table processor ipv6 sg intel_agp agpgart i2c_piix4 i2c_core evdev
> tulip dm_mod reiserfs qla2300 qla2xxx scsi_transport_fc gdth aic7xxx
> sd_mod scsi_mod
> Dec 23 14:35:45 lxssantm1 kernel: CPU:    0
> Dec 23 14:35:45 lxssantm1 kernel: EIP:    0060:[<f8f47e43>]    Not 
> tainted VLI
> Dec 23 14:35:45 lxssantm1 kernel: EFLAGS: 00010247   (2.6.10-rc3)
> Dec 23 14:35:45 lxssantm1 kernel: EIP is at trigger_event+0x3/0x10 
> [dm_multipath]
> Dec 23 14:35:45 lxssantm1 kernel: eax: f8d44080   ebx: f31787ec   ecx: 
> dfe30000   edx: 00000216
> Dec 23 14:35:45 lxssantm1 kernel: esi: dfe30000   edi: f31787f0   ebp: 
> f3178760   esp: dfe33f60
> Dec 23 14:35:45 lxssantm1 multipathd: event checker startup : yellow
> Dec 23 14:35:45 lxssantm1 kernel: ds: 007b   es: 007b   ss: 0068
> Dec 23 14:35:45 lxssantm1 kernel: Process events/0 (pid: 6, 
> threadinfo=dfe32000 task=dffefa60)
> Dec 23 14:35:45 lxssantm1 kernel: Stack: c0128a38 dfe30020 f8f47e40 
> 00000216 dfe32000 dfe3000c ffffffff ffffffff
> Dec 23 14:35:45 lxssantm1 kernel:        00000001 00000000 c0116330 
> 00010000 00000000 c18109c0 c1810060 00000000
> Dec 23 14:35:45 lxssantm1 kernel:        00000000 dffefa60 c0116330 
> 00100100 00200200 dffefbc4 00000000 00000000
> Dec 23 14:35:45 lxssantm1 kernel: Call Trace:
> Dec 23 14:35:45 lxssantm1 kernel:  [<c0128a38>] worker_thread+0x1a8/0x230
> Dec 23 14:35:45 lxssantm1 kernel:  [<f8f47e40>] trigger_event+0x0/0x10 
> [dm_multipath]
> Dec 23 14:35:45 lxssantm1 kernel:  [<c0116330>] 
> default_wake_function+0x0/0x10
> Dec 23 14:35:45 lxssantm1 kernel:  [<c0116330>] 
> default_wake_function+0x0/0x10
> Dec 23 14:35:45 lxssantm1 kernel:  [<c0128890>] worker_thread+0x0/0x230
> Dec 23 14:35:45 lxssantm1 kernel:  [<c012c666>] kthread+0x86/0xb0
> Dec 23 14:35:45 lxssantm1 kernel:  [<c012c5e0>] kthread+0x0/0xb0
> Dec 23 14:35:45 lxssantm1 kernel:  [<c01019b5>] 
> kernel_thread_helper+0x5/0x10
> Dec 23 14:35:45 lxssantm1 kernel: Code: 89 f0 e8 61 fc ff ff 8b 7e 38 85 
> ff 75 96 8b 46 28 85 c0 74 96 8b 5e 2c 85 db 75 8f c7 44 24 04 01
> 00 00 00 eb 85 89 f6 8b 40 08 <8b> 00 e9 76 e8 d4 ff 8d b6 00 00 00 00 
> 56 85 d2 53 89 ce 89 c3
> Dec 23 14:35:56 lxssantm1 multipathd: 8:64 : emc_clariion_checker: Path 
> healthy
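
The two "return code" values in the log above can be decoded by hand. This
is a sketch only, assuming the Linux SCSI mid-layer result layout (status
byte in bits 0-7, message byte in bits 8-15, host byte in bits 16-23,
driver byte in bits 24-31) and the host-byte names from
include/scsi/scsi.h. The helper decode_scsi_result is hypothetical.

```python
# Host-byte values as named in the kernel's include/scsi/scsi.h
# (only a subset is listed here).
HOST_BYTE_NAMES = {
    0x00: "DID_OK",
    0x01: "DID_NO_CONNECT",
    0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT",
    0x04: "DID_BAD_TARGET",
    0x05: "DID_ABORT",
    0x07: "DID_ERROR",
    0x08: "DID_RESET",
}


def decode_scsi_result(result):
    """Split a 32-bit SCSI result word into its four bytes (illustrative only)."""
    host = (result >> 16) & 0xFF
    return {
        "status": result & 0xFF,
        "msg": (result >> 8) & 0xFF,
        "host": host,
        "host_name": HOST_BYTE_NAMES.get(host, "unknown"),
        "driver": (result >> 24) & 0xFF,
    }


# The two values reported for <2 0 1 0> in the log.
for code in (0x20000, 0x10000):
    print(hex(code), decode_scsi_result(code)["host_name"])
```

Under these assumptions 0x20000 carries host byte DID_BUS_BUSY and 0x10000
carries DID_NO_CONNECT, which would fit a path whose switch port has just
been disabled.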
> 
> 
> Re-enabling the switch port or plugging the cable back in has no effect. The
> device /dev/yellow remains dead.
> 
> No failover !!!!
> 
-- 
christophe varoqui <christophe.varoqui at free.fr>