Here is my setup (this is iSCSI); I'm going for multipath failover.<br><br>EqualLogic storage<br>QLogic 4052c HBAs (2 ports)<br>RHEL4.5<br>device-mapper-multipath-0.4.5-21.RHEL4<br>HP-GigE switches<br>/etc/multipath.conf using failover mode:
<br>defaults {<br> multipath_tool "/sbin/multipath -v0"<br> udev_dir /dev<br> #polling_interval 2<br> polling_interval 4<br> selector "round-robin 0"
<br> path_grouping_policy failover <br> getuid_callout "/sbin/scsi_id -g -u -s /block/%n"<br> path_checker readsector0<br> prio_callout "/bin/true"
<br> features "0"<br> #rr_min_io 2<br> rr_min_io 100<br> rr_weight priorities <br> failback immediate
<br> no_path_retry fail<br> user_friendly_name yes<br>}<br><br>Here is the problem:
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 kernel: SCSI error : <2 0 5 0> return code = 0x10000
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 multipathd: 65:16: readsector0 checker reports path is up
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 kernel: device-mapper: dm-multipath: Failing path 8:96.
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 multipathd: 65:16: reinstated
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 multipathd: ban1.banprod: remaining active paths: 2
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 multipathd: 8:96: readsector0 checker reports path is down
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 multipathd: checker failed path 8:96 in map ban1.datafiles.famp
<br>Dec 20 05:28:32 192.168.46.31/192.168.46.31 multipathd: ban1.datafiles.famp: remaining active paths: 1
<br>Dec 20 05:28:35 192.168.46.31/192.168.46.31 kernel: SCSI error : <2 0 8 0> return code = 0x10000
<br>Dec 20 05:28:35 192.168.46.31/192.168.46.31 multipathd: 8:144: readsector0 checker reports path is down
<br>Dec 20 05:28:35 192.168.46.31/192.168.46.31 multipathd: checker failed path 8:144 in map ban1.datafiles.wflw
<br>Dec 20 05:28:35 192.168.46.31/192.168.46.31 kernel: device-mapper: dm-multipath: Failing path 8:144.
<br>Dec 20 05:28:37 192.168.46.31/192.168.46.31 multipathd: ban1.datafiles.wflw: remaining active paths: 1
<br>Dec 20 05:28:37 192.168.46.31/192.168.46.31 multipathd: 8:96: readsector0 checker reports path is up
<br>Dec 20 05:28:37 192.168.46.31/192.168.46.31 multipathd: 8:96: reinstated
<br>Dec 20 05:28:37 192.168.46.31/192.168.46.31 multipathd: ban1.datafiles.famp: remaining active paths: 2
<br>Dec 20 05:28:39 192.168.46.31/192.168.46.31 multipathd: 8:144: readsector0 checker reports path is up
<br>Dec 20 05:28:39 192.168.46.31/192.168.46.31 multipathd: 8:144: reinstated
<br>Dec 20 05:28:39 192.168.46.31/192.168.46.31
multipathd: ban1.datafiles.wflw: remaining active paths: 2<br><br>There is no real pattern to these disconnect/reconnect events. The only setting I'm currently thinking about changing is no_path_retry; I'm thinking of setting it to 2 or some other low number. The EqualLogic array shows load-balancing events,
<br>but they don't always align with the drops/reconnects that I'm seeing on the Linux hosts.<br><br>Is anyone using these HBAs with EqualLogic on RHEL4.5?<br><br>-- <br>:wq!<br>kevin.foote
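<br><br>P.S. For reference, here is a rough sketch of the no_path_retry change I have in mind. It is untested on this box. I'm assuming this multipath-tools build treats a numeric value as the number of path-checker intervals to keep queueing I/O after every path in a map has failed, which with polling_interval 4 would be roughly 8 seconds before I/O is failed. All other defaults would stay as posted above.

```conf
defaults {
        # unchanged from the config above
        polling_interval 4

        # current setting: fail I/O immediately once no paths remain
        #no_path_retry fail

        # proposed: queue I/O for 2 checker intervals before failing it
        # (assumed semantics for this multipath-tools version)
        no_path_retry 2
}
```

<br>Since the log above never shows a map dropping below 1 active path, I don't expect this alone to stop the flapping; it would only change what happens if both paths are down at the same moment.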