<br><font size=3 color=#000080 face="sans-serif">Christophe,</font>
<br><font size=3 color=#000080 face="sans-serif">Unfortunately it does
not appear that the TP9700 is working using the multipath device settings
you provided. </font>
<br>
<br><font size=3 color=#000080 face="sans-serif">Our configuration is such
where the host (a Sun X4600 running RHEL 5.2) is connected to the TP9700
using two Fibrechannel connections:</font>
<br>
<br><img src=cid:_2_0A07001C02B0A068001A7D6E85257474>
<br>
<br><font size=3 color=#000080 face="sans-serif">No FC switches are used,
just simple direct HBA to SP connectivity with two HBAs and two Storage
Processors. LUNs on the RAID are distributed to be owned by either
SPA or SPB to distribute the workload between the SPs and the fibrechannel
connections.</font>
<br>
<br><font size=3 color=#000080 face="sans-serif">The TP9700 can be configured
to present the storage to a host by setting the "Storage Array Host
Type" (Linux, SGIRDAC, SGIAVT, Windows, etc). For my tests,
I've been experimenting with Linux and SGIRDAC. I have been unsuccessful
in determining what the storage array host type "Linux"s failover
method is, but I thought I had come across an article that said the Linux
type is basic AVT. I could be mistaken.</font>
<br>
<br><font size=3 color=#000080 face="sans-serif">Setting the TP9700 Host
Type to "Linux" , I then setup /etc/multipath.conf to mimic the
defaults for the TP9500:</font>
<br>
<br><tt><font size=2 color=#000080> device {</font></tt>
<br><tt><font size=2 color=#000080>
vendor
"SGI"</font></tt>
<br><tt><font size=2 color=#000080>
product
"TP9[457]00"</font></tt>
<br><tt><font size=2 color=#000080>
getuid_callout "/sbin/scsi_id
-g -u -s /block/%n"</font></tt>
<br><tt><font size=2 color=#000080>
prio_callout "/sbin/mpath_prio_tpc
/dev/%n"</font></tt>
<br><tt><font size=2 color=#000080>
features
"0"</font></tt>
<br><tt><font size=2 color=#000080>
hardware_handler "0"</font></tt>
<br><tt><font size=2 color=#000080>
path_grouping_policy group_by_prio</font></tt>
<br><tt><font size=2 color=#000080>
failback
immediate</font></tt>
<br><tt><font size=2 color=#000080>
rr_weight
uniform</font></tt>
<br><tt><font size=2 color=#000080>
rr_min_io
1000</font></tt>
<br><tt><font size=2 color=#000080>
path_checker tur</font></tt>
<br><tt><font size=2 color=#000080> }</font></tt><font size=3 color=#000080 face="sans-serif"><br>
</font><font size=2 face="sans-serif"><br>
</font><font size=3 color=#000080 face="Tahoma">This configuration ran
OK for a while, then began to log multipath failures, and eventually I/O
buffer errors. All LUNs on one SP trespassed to the other SP, and
I had to manually place each trespassed LUN back to its primary path.</font>
<br>
<br><font size=3 color=#000080 face="Tahoma">Changing the TP9700 host type
to SGIRDAC, then trying the configuration you provided me caused the host
to not see the ghost path. Effectively I ended up with a single path.
Disconnecting a FC connection resulted in the inability to see any
of the LUNs assigned to the associated SP.</font>
<br>
<br><font size=3 color=#000080 face="Tahoma">I modified the multipath.conf
a little:</font>
<br><font size=3 color=#000080 face="Tahoma"> </font><tt><font size=2 color=#000080>
device {</font></tt>
<br><tt><font size=2 color=#000080>
vendor "SGI"</font></tt>
<br><tt><font size=2 color=#000080>
product "TP9700"</font></tt>
<br><tt><font size=2 color=#000080>
path_grouping_policy failover</font></tt>
<br><tt><font size=2 color=#000080>
getuid_callout "/sbin/scsi_id
-g -u -s /block/%n"</font></tt>
<br><tt><font size=2 color=#000080>
features "1 queue_if_no_path"</font></tt>
<br><tt><font size=2 color=#000080>
path_checker rdac</font></tt>
<br><tt><font size=2 color=#000080>
prio_callout "/sbin/mpath_prio_tpc
/dev/%n"</font></tt>
<br><tt><font size=2 color=#000080>
hardware_handler "1 rdac"</font></tt>
<br><tt><font size=2 color=#000080>
prio rdac</font></tt>
<br><tt><font size=2 color=#000080>
failback immediate</font></tt>
<br><tt><font size=2 color=#000080> }</font></tt>
<br><tt><font size=2 color=#000080>This worked ok, but I see lots of scsi
sense key errors:</font></tt>
<br>
<br><tt><font size=2 color=#000080>Jun 23 12:16:42 p4dbl03 kernel: sdbk:
Current: sense key: Recovered Error</font></tt>
<br><tt><font size=2 color=#000080>Jun 23 12:16:42 p4dbl03 kernel:
<<vendor>> ASC=0x95 ASCQ=0x1ASC=0x95 ASCQ=0x1</font></tt>
<br><tt><font size=2 color=#000080>Jun 23 12:16:42 p4dbl03 kernel:</font></tt>
<br>
<br><tt><font size=2 color=#000080>I see those error regardless of how
I configured the RAID and multipath.conf, which is worrisome.</font></tt>
<br><tt><font size=2 color=#000080>I especially see those errors if I run
'fdisk -l'. </font></tt>
<br>
<br><tt><font size=2 color=#000080>Disconnecting a FC cable on one HBA
caused the associated volumes to trespass to the other SP, however, during
this process, I noticed buffer I/O errors. Also, I noticed that the
trespassed LUNs did not failback to their original SP when the FC cable
was reconnected. Am I to assume that RDAC or other multipath software
will not tell the storage to failback trepassed LUNs?</font></tt>
<br>
<br><tt><font size=2 color=#000080>Your assistance is appreciated,</font></tt>
<br>
<br><tt><font size=2 color=#000080>- Kevin</font></tt>
<br>
<br>
<br>
<br>