[dm-devel] system freeze/console flood when unplugging one path of multipath disk array
Benjamin Marzinski
bmarzins at redhat.com
Tue Nov 28 21:11:21 UTC 2006
On Tue, Nov 28, 2006 at 12:18:51PM +0100, Rainer Toebbicke wrote:
> Hello,
>
> I've got a Sun 3510 disk array and a dual-channel Qlogic FC HBA, in a
> Sun X4100 under RHEL4 U4, each path connected to the array over a
> qlogic 5600 switch.
>
> The devices are seen correctly with multipath -l as far as I can judge.
>
>
> When I disable one of the paths during a stress test (in this case, an
> fsck -y), the system "freezes". Actually, it just sends enough error
> messages (thousands) of the style
>
> end_request: I/O error, dev sdb, sector 279446024
> Buffer I/O error on device sdb1, logical block 279445961

You wouldn't happen to be doing I/O directly to /dev/sdb1, would you? If
so, you need to be doing I/O to a kpartx-created device on top of your
multipathed device.

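
The reason for asking: sdb is only one of the two paths behind mpath1 (sdh is the other), so I/O sent to /dev/sdb1 goes around device-mapper and is neither failed over nor queued when that path drops. As a rough illustration, the Python sketch below parses the `multipath -l` text from the report and flags a target that is a raw path device. The helper names (`paths_by_map`, `owning_map`) are made up for this example, the regular expressions assume the output format shown in the report, and `/dev/mapper/mpath1p1` is only a guess at what kpartx would call the first partition; check /dev/mapper on the actual system.

```python
import re

# `multipath -l` output copied from the report (format assumed to match).
MULTIPATH_L = r"""
mpath2 (3600c0ff0000000000a792a1887b34100)
[size=136 GB][features="1 queue_if_no_path"][hwhandler="0"]
\_ round-robin 0 [active]
\_ 1:0:0:3 sdc 8:32 [active][ready]
\_ 2:0:0:3 sdi 8:128 [active][ready]

mpath1 (3600c0ff0000000000a792a0aac9b4200)
[size=136 GB][features="1 queue_if_no_path"][hwhandler="0"]
\_ round-robin 0 [active]
\_ 1:0:0:1 sdb 8:16 [active][ready]
\_ 2:0:0:1 sdh 8:112 [active][ready]
"""

# "mpath1 (<wwid>)" starts a map; "\_ H:C:T:L sdX maj:min ..." is a path.
MAP_RE = re.compile(r"^(\S+)\s+\(([0-9a-f]+)\)\s*$")
PATH_RE = re.compile(r"\\_\s+\d+:\d+:\d+:\d+\s+(\S+)\s+\d+:\d+")


def paths_by_map(text):
    """Return {path_device: map_name} parsed from `multipath -l` text."""
    owner = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        m = MAP_RE.match(line)
        if m:
            current = m.group(1)
            continue
        p = PATH_RE.search(line)
        if p and current:
            owner[p.group(1)] = current
    return owner


def owning_map(device, owner):
    """Return the map that uses `device` (or its parent disk) as a path."""
    name = device.rsplit("/", 1)[-1]
    disk = name.rstrip("0123456789")  # drop a trailing partition number
    return owner.get(disk) or owner.get(name)


owner = paths_by_map(MULTIPATH_L)
# The second name is a hypothetical kpartx partition device.
for dev in ("/dev/sdb1", "/dev/mapper/mpath1p1"):
    m = owning_map(dev, owner)
    if m:
        print(f"{dev}: single path of {m}; I/O here bypasses multipath")
    else:
        print(f"{dev}: not one of the listed path devices")
```

If the fsck target shows up as a single path, point fsck at the partition device that kpartx creates on top of the corresponding mpath device instead.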
> lost page write due to I/O error on sdb1
> SCSI error : <1 0 0 1> return code = 0x10000
> end_request: I/O error, dev sdb, sector 279446025
> Buffer I/O error on device sdb1, logical block 279445962
> lost page write due to I/O error on sdb1
> SCSI error : <1 0 0 1> return code = 0x10000
> end_request: I/O error, dev sdb, sector 279446026
> Buffer I/O error on device sdb1, logical block 279445963
> lost page write due to I/O error on sdb1
> SCSI error : <1 0 0 1> return code = 0x10000
> end_request: I/O error, dev sdb, sector 279446027
> Buffer I/O error on device sdb1, logical block 279445964
> lost page write due to I/O error on sdb1
> SCSI error : <1 0 0 1> return code = 0x10000
>
>
> to the serial console to make it appear completely stuck for hours.
>
> The fsck does not finish, BTW; it restarts from the beginning as soon
> as I re-enable the port in the switch.
>
> [root@afs26 ~]# multipath -l
> mpath2 (3600c0ff0000000000a792a1887b34100)
> [size=136 GB][features="1 queue_if_no_path"][hwhandler="0"]
> \_ round-robin 0 [active]
> \_ 1:0:0:3 sdc 8:32 [active][ready]
> \_ 2:0:0:3 sdi 8:128 [active][ready]
>
> mpath1 (3600c0ff0000000000a792a0aac9b4200)
> [size=136 GB][features="1 queue_if_no_path"][hwhandler="0"]
> \_ round-robin 0 [active]
> \_ 1:0:0:1 sdb 8:16 [active][ready]
> \_ 2:0:0:1 sdh 8:112 [active][ready]
> ....
>
> /etc/multipath.conf just contains
> defaults {
> default_features "1 queue_if_no_path"
> user_friendly_names yes
>
> }
>
> besides the usual blacklist.
>
> Any ideas?
>
> --
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Rainer Toebbicke
> European Laboratory for Particle Physics(CERN) - Geneva, Switzerland
> Phone: +41 22 767 8985 Fax: +41 22 767 7155
>
> --
> dm-devel mailing list
> dm-devel at redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel