[dm-devel] [for-4.16 PATCH v5 0/4] block/dm: allow DM to defer blk_register_queue() until ready
Mike Snitzer
snitzer at redhat.com
Mon Jan 15 22:15:11 UTC 2018
On Mon, Jan 15 2018 at 12:16pm -0500,
Bart Van Assche <Bart.VanAssche at wdc.com> wrote:
> On Fri, 2018-01-12 at 10:06 -0500, Mike Snitzer wrote:
> > I'm submitting this v5 with more feeling now ;)
>
> Hello Mike,
>
> Have these patches been tested with lockdep enabled? The following appeared in
> the kernel log when after I started testing Jens' for-next tree of this morning:
>
> ======================================================
> WARNING: possible circular locking dependency detected
> 4.15.0-rc7-dbg+ #1 Not tainted
> ------------------------------------------------------
> 02-mq/1211 is trying to acquire lock:
> (&q->sysfs_lock){+.+.}, at: [<000000008b65bdad>] queue_attr_store+0x35/0x80
>
> but task is already holding lock:
> (kn->count#213){++++}, at: [<000000007a18ad18>] kernfs_fop_write+0xe5/0x190
>
> which lock already depends on the new lock.
>
>
> the existing dependency chain (in reverse order) is:
>
> -> #1 (kn->count#213){++++}:
> kernfs_remove+0x1a/0x30
> kobject_del.part.3+0xe/0x40
> blk_unregister_queue+0xa7/0xe0
> del_gendisk+0x12f/0x260
> sd_remove+0x58/0xc0 [sd_mod]
> device_release_driver_internal+0x15a/0x220
> bus_remove_device+0xf4/0x170
> device_del+0x12f/0x330
> __scsi_remove_device+0xef/0x120 [scsi_mod]
> scsi_forget_host+0x1b/0x60 [scsi_mod]
> scsi_remove_host+0x6f/0x110 [scsi_mod]
> 0xffffffffc09ed6e4
> process_one_work+0x21c/0x6d0
> worker_thread+0x35/0x380
> kthread+0x117/0x130
> ret_from_fork+0x24/0x30
>
> -> #0 (&q->sysfs_lock){+.+.}:
> __mutex_lock+0x6c/0x9e0
> queue_attr_store+0x35/0x80
> kernfs_fop_write+0x109/0x190
> __vfs_write+0x1e/0x130
> vfs_write+0xb9/0x1b0
> SyS_write+0x40/0xa0
> entry_SYSCALL_64_fastpath+0x23/0x9a
>
> other info that might help us debug this:
>
> Possible unsafe locking scenario:
>
> CPU0 CPU1
> ---- ----
> lock(kn->count#213);
> lock(&q->sysfs_lock);
> lock(kn->count#213);
> lock(&q->sysfs_lock);
>
> *** DEADLOCK ***
>
> 3 locks held by 02-mq/1211:
> #0: (sb_writers#6){.+.+}, at: [<00000000afdb61d3>] vfs_write+0x17f/0x1b0
> #1: (&of->mutex){+.+.}, at: [<00000000b291cabb>] kernfs_fop_write+0xdd/0x190
> #2: (kn->count#213){++++}, at: [<000000007a18ad18>] kernfs_fop_write+0xe5/0x190
sysfs write op calls kernfs_fop_write which takes:
of->mutex then kn->count#213 (no idea what that is)
then q->sysfs_lock (via queue_attr_store)
vs
blk_unregister_queue takes:
q->sysfs_lock then
kernfs_mutex (via kernfs_remove)
seems lockdep thinks "kernfs_mutex" is "kn->count#213"?
Feels like lockdep code in fs/kernfs/file.c and fs/kernfs/dir.c is
triggering false positives.. because these seem like different kernfs
locks yet they are reported as "kn->count#213".
Certainly feeling out of my depth with kernfs's locking though.
Mike
More information about the dm-devel
mailing list