[dm-devel] dm: dm-mpath: Provide high-resolution timer to HST with bio-mpath
Mike Snitzer
snitzer at redhat.com
Mon May 9 19:47:54 UTC 2022
On Wed, Apr 27 2022 at 12:57P -0400,
Gabriel Krisman Bertazi <krisman at collabora.com> wrote:
> The precision loss of reading IO start_time with jiffies_to_nsecs
> instead of using a high resolution timer degrades HST path prediction
> for BIO-based mpath on high load workloads.
>
> Below, I show the utilization percentage of a 10 disk multipath with
> asymmetrical disk access cost, while being exercised by a randwrite FIO
> benchmark with high submission queue depth (depth=64). It is possible
> to see that the HST path selection degrades heavily for high-iops in
> BIO-mpath, underutilizing the slower paths way beyond expected. This
> seems to be caused by the start_time truncation, which makes some IOs
> seem much slower than they actually are. In this scenario ST outperforms
> HST for bio-mpath, but not for mq-mpath, which already uses ktime_get_ns().
>
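To make the truncation effect described above concrete, here is a minimal
user-space sketch. It assumes the start time is rounded down to a tick
boundary (as a jiffies-derived value would be) while the completion time
comes from a high-resolution clock. truncate_to_tick(),
measured_latency_ns() and the 250 ticks/s figure used below are
illustrative assumptions, not kernel code.

```c
#include <assert.h>
#include <stdint.h>

#define NSEC_PER_SEC 1000000000ULL

/*
 * Round a nanosecond timestamp down to tick granularity, mimicking a
 * value that was derived from a tick counter (hypothetical helper).
 */
static uint64_t truncate_to_tick(uint64_t ns, unsigned int hz)
{
	uint64_t tick_ns;

	assert(hz > 0);
	tick_ns = NSEC_PER_SEC / hz;
	return ns - (ns % tick_ns);
}

/*
 * Latency as a selector would see it when the start time is
 * tick-granular but the completion time is high-resolution
 * (an assumption of this sketch).
 */
static uint64_t measured_latency_ns(uint64_t start_ns, uint64_t end_ns,
				    unsigned int hz)
{
	return end_ns - truncate_to_tick(start_ns, hz);
}
```

With 250 ticks per second (4 ms per tick), an IO that starts at 7.9 ms and
completes at 8.0 ms really took 100 us, yet
measured_latency_ns(7900000, 8000000, 250) reports 4 ms, because the start
time is pulled back to the 4 ms tick boundary.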
> The third column shows utilization with this patch applied. It is easy
> to see that now HST prediction is much closer to the ideal distribution
> (calculated considering the real cost of each path).
>
> | | ST | HST (orig) | HST (ktime) | Best |
> | sdd | 0.17 | 0.20 | 0.17 | 0.18 |
> | sde | 0.17 | 0.20 | 0.17 | 0.18 |
> | sdf | 0.17 | 0.20 | 0.17 | 0.18 |
> | sdg | 0.06 | 0.00 | 0.06 | 0.04 |
> | sdh | 0.03 | 0.00 | 0.03 | 0.02 |
> | sdi | 0.03 | 0.00 | 0.03 | 0.02 |
> | sdj | 0.02 | 0.00 | 0.01 | 0.01 |
> | sdk | 0.02 | 0.00 | 0.01 | 0.01 |
> | sdl | 0.17 | 0.20 | 0.17 | 0.18 |
> | sdm | 0.17 | 0.20 | 0.17 | 0.18 |
>
> This issue was originally discussed [1] when we first merged HST, and
> this patch was left as a low hanging fruit to be solved later. I don't
> think anyone is using HST with BIO mpath, but it'd be neat to get it
> sorted out.
>
> Regarding the implementation, as suggested by Mike in that mail thread,
> in order to avoid the overhead of ktime_get_ns for other selectors, this
> patch adds a flag for the selector code to request the high-resolution
> timer.
>
> I tested this using the same benchmark used in the original HST submission.
>
> Full test and benchmark scripts are available here:
>
> https://people.collabora.com/~krisman/HST-BIO-MPATH/
>
> [1] https://lore.kernel.org/lkml/85tv0am9de.fsf@collabora.com/T/
>
> Signed-off-by: Gabriel Krisman Bertazi <krisman at collabora.com>
> Acked-by: Gabriel Krisman Bertazi <krisman at collabora.com>
Overall your code was OK, but I nudged it a bit further to be
in keeping with how 'features' flags have been implemented elsewhere
(e.g. dm_target_type's features) -- by using a helper to test the
flag, etc.
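For readers unfamiliar with the pattern, a rough user-space sketch of a
'features' bitmask tested through a helper follows. Every name in it
(PS_USE_HR_TIMER, struct ps_type, ps_use_hr_timer(), io_start_time_ns()
and the fake clocks) is hypothetical and chosen for illustration; see the
commit linked in this mail for the actual identifiers.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical feature bit: selector wants a high-resolution start time. */
#define PS_USE_HR_TIMER 0x00000001u

/* Hypothetical, stripped-down selector type carrying a features bitmask. */
struct ps_type {
	const char *name;
	unsigned int features;
};

/* Helper that hides the bit test from callers. */
static inline bool ps_use_hr_timer(const struct ps_type *type)
{
	return (type->features & PS_USE_HR_TIMER) != 0;
}

typedef uint64_t (*clock_fn)(void);

/*
 * Pick the clock for the IO start time: only selectors that set the
 * flag pay for the (assumed more expensive) high-resolution clock.
 */
static uint64_t io_start_time_ns(const struct ps_type *type,
				 clock_fn hr_clock, clock_fn coarse_clock)
{
	assert(type && hr_clock && coarse_clock);
	return ps_use_hr_timer(type) ? hr_clock() : coarse_clock();
}

/* Stand-in clocks so the sketch runs outside the kernel. */
static unsigned int hr_calls;

static uint64_t fake_hr_clock(void)
{
	hr_calls++;
	return 7900000ULL;	/* arbitrary fixed value */
}

static uint64_t fake_coarse_clock(void)
{
	return 4000000ULL;	/* same instant, rounded to a 4 ms tick */
}
```

A selector type declared with features = 0 never reaches fake_hr_clock(),
while one declared with PS_USE_HR_TIMER gets the high-resolution value.
Keeping the bit test in one helper means callers do not need to know how
the flag is encoded.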
I also tweaked some other small implementation details. Please see:
https://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm.git/commit/?h=dm-5.19&id=c06dfd124d46df9c482fbd1319b5fe19bcb1a110