[dm-devel] [PATCH 1/1] multipath-tools: Change path checker for IBM IPR devices

Brian King brking at linux.vnet.ibm.com
Thu Sep 25 16:47:42 UTC 2014


On 09/25/2014 01:29 AM, Hannes Reinecke wrote:
> On 09/24/2014 09:57 PM, wenxiong at linux.vnet.ibm.com wrote:
>> We would like to change path checker to directio for IBM IPR devices.
>>
>> Signed-off-by: Wen Xiong <wenxiong at linux.vnet.ibm.com>
>> ---
>>  libmultipath/hwtable.c |    2 +-
>>  1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> Index: b/libmultipath/hwtable.c
>> ===================================================================
>> --- a/libmultipath/hwtable.c	2014-09-23 17:39:27.000000000 -0500
>> +++ b/libmultipath/hwtable.c	2014-09-23 17:40:31.419667000 -0500
>> @@ -630,7 +630,7 @@ static struct hwentry default_hw[] = {
>>  		.pgfailback    = -FAILBACK_IMMEDIATE,
>>  		.rr_weight     = RR_WEIGHT_NONE,
>>  		.no_path_retry = NO_PATH_RETRY_UNDEF,
>> -		.checker_name  = TUR,
>> +		.checker_name  = DIRECTIO,
>>  		.prio_name     = PRIO_ALUA,
>>  		.prio_args     = NULL,
>>  	},
>>
>> -- -- dm-devel mailing list dm-devel at redhat.com
>> https://www.redhat.com/mailman/listinfo/dm-devel
> Any particular reason for doing so?
> 
> Using 'directio' has the drawback that it might stall under higher
> load, as it's being added to the _tail_ of the request queue.
> TUR is using the 'SG_IO' ioctl, which is added to the head of the
> request_queue and hence won't be stalled ...

The issue we've run into started when this patch started making its
way into distros:

http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/drivers/scsi/scsi_error.c?id=14216561e164671ce147458653b1fea06a4ada1e

That changed the behaviour for user initiated TUR commands. After an ipr
adapter gets reset, all disk array devices require a start unit command
to be issued to them before they will accept commands. So, with the SCSI
EH change, we now end up in a scenario with dual ipr adapters where the
TUR getting issued from the health checker returns with a Not Ready response
and since SCSI EH no longer triggers the Start Unit in this scenario,
the path never recovers.

The alternative solution would be to change the TUR path checker in multipath-tools
to issue a Start Unit if it sees a 02/04/02.

Thanks,

Brian

-- 
Brian King
Power Linux I/O
IBM Linux Technology Center





More information about the dm-devel mailing list