[dm-devel] [PATCH] multipathd: "san_path_err" failure optimization

Benjamin Marzinski bmarzins at redhat.com
Tue Sep 3 23:41:33 UTC 2019


On Tue, Aug 27, 2019 at 12:28:14PM +0000, Chongyun Wu wrote:
> Hi Martin and Ben,
> 
> Cloud you help to view below patch, thanks.
> 
> From a7126e33e7eff8a985600b41b1723ee66b183586 Mon Sep 17 00:00:00 2001
> From: Chongyun Wu <wu.chongyun at h3c.com>
> Date: Tue, 27 Aug 2019 10:23:50 +0800
> Subject: [PATCH] multipathd: "san_path_err" failure optimization
> 
> Let san_path_err_recovery_time path unstable can be
> detected and not reinstate it until this path keep up in
> san_path_err_recovery_time. It will fix heavy IO delay
> caused by parts of paths state shaky in multipath device.
> 
> Test and result:
> Run up eth1 30s and down eth1 30s with 100 loops script to
> make some paths shaky in each multipath devices.
> Using below multipath.conf configure in defaults section:
>     san_path_err_recovery_time 30
>     san_path_err_threshold 2
>     san_path_err_forget_rate 6
> After test, not found any IO delay logs except several logs in the very
> beginning which before san_path_err filter shaky path works .
> If without above config and this patch there will be lots of IO delay
> in syslog and some paths state change from up to down again and again.
> 
> Signed-off-by: Chongyun Wu <wu.chongyun at h3c.com>

Reviewed-by: Benjamin Marzinski <bmarzins at redhat.com>

> ---
>  multipathd/main.c | 17 +++++++++++++++++
>  1 file changed, 17 insertions(+)
> 
> diff --git a/multipathd/main.c b/multipathd/main.c
> index 7a5cd11..8acd080 100644
> --- a/multipathd/main.c
> +++ b/multipathd/main.c
> @@ -1897,6 +1897,18 @@ static int check_path_reinstate_state(struct path * pp) {
>  			condlog(2, "%s : reinstating path early", pp->dev);
>  			goto reinstate_path;
>  		}
> +
> +		/* If path became failed again or continue failed, should reset
> +		 * path san_path_err_forget_rate and path dis_reinstate_time to
> +		 * start a new stable check. 
> +		 */
> +		if ((pp->state != PATH_UP) && (pp->state != PATH_GHOST) &&
> +			(pp->state != PATH_DELAYED)) {
> +			pp->san_path_err_forget_rate =
> +				pp->mpp->san_path_err_forget_rate;
> +			pp->dis_reinstate_time = curr_time.tv_sec;
> +		}
> +
>  		if ((curr_time.tv_sec - pp->dis_reinstate_time ) > pp->mpp->san_path_err_recovery_time) {
>  			condlog(2,"%s : reinstate the path after err recovery time", pp->dev);
>  			goto reinstate_path;
> @@ -2106,6 +2118,11 @@ check_path (struct vectors * vecs, struct path * pp, int ticks)
>  			check_path_reinstate_state(pp)) {
>  		pp->state = PATH_DELAYED;
>  		return 1;
> +	} else if ((newstate != PATH_UP && newstate != PATH_GHOST) &&
> +			(pp->state == PATH_DELAYED)) {
> +		/* If path state become failed again cancel path delay state */
> +		pp->state = newstate;
> +		return 1;
>  	}
>  
>  	if ((newstate == PATH_UP || newstate == PATH_GHOST) &&
> -- 
> 
> Best Regards,
> Chongyun Wu
> 




More information about the dm-devel mailing list