[dm-devel] kernel BUG at lib/percpu-refcount.c:192

Mike Snitzer snitzer at redhat.com
Thu Jun 8 13:38:29 UTC 2017


On Thu, Jun 08 2017 at  4:29am -0400,
Marian Csontos <mcsontos at redhat.com> wrote:

> Mike, Heinz, help please!
> 
> RC4 and still present:
> 
> [  248.815709] CPU: 0 PID: 21 Comm: rcuos/1 Not tainted
> 4.12.0-0.rc4.git2.1.fc27.x86_64 #1
> [  248.816841] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
> 
> ...and it affects RAID1 as well.
> 
> Is this md wrong or is there a more serious issue? And if so is it a
> known one?
> I have not found any reference anywhere except this dm-devel thread.
> Is this a problem in LVM RAID only?

Unclear.
 
> Where to forward this? LKML? Fedora Bugzilla?

Certainly linux-raid (now cc'd).

> Is there anything else I should provide?

The linewrapping makes it hard to read these logs, but I'll defer to
others to weigh-in on any additional info needed.

Mike

> -- Martian
> 
> On 05/18/2017 10:33 AM, Marian Csontos wrote:
> >When running `lvm lvcreate --type raid5 --size 4194304B --name
> >meta_r5 LVMTEST12279nqymhamb_vg --yes` there's a warning followed
> >by the bug.
> >
> >Also at [  192.140142] it looks like there is an uninitialized
> >variable somewhere.
> >
> >env: rawhide, kvm, 2 (V)CPUs
> >lvm2: upstream
> >kernel: 4.12.0-0.rc1.git1.1.fc27.x86_64
> >
> >----
> >
> >[  192.131076] mdX: bitmap file is out of date, doing full recovery
> >
> >[  192.140142] percpu ref (          (null)) <= 0
> >(-9223372036854775807) after switching to atomic
> >
> >[  192.140180] ------------[ cut here ]------------
> >[  192.143734] WARNING: CPU: 0 PID: 10 at
> >lib/percpu-refcount.c:155
> >percpu_ref_switch_to_atomic_rcu+0x1cd/0x1f0
> >[  192.146353] Modules linked in: dm_raid raid456
> >async_raid6_recov async_memcpy async_pq async_xor async_tx xor
> >raid6_pq ...|dm_thin_pool dm_cache_smq dm_cache dm_persistent_data
> >dm_bio_prison libcrc32c loop crct10dif_pclmul crc32_pclmul
> >crc32c_intel     ...|ppdev ghash_clmulni_intel acpi_cpufreq
> >virtio_net virtio_balloon tpm_tis joydev tpm_tis_core parport_pc
> >parport tpm i2c_piix4     ...|virtio_blk cirrus drm_kms_helper ttm
> >drm serio_raw virtio_pci virtio_ring virtio ata_generic pata_acpi
> >[  192.154276] CPU: 0 PID: 10 Comm: rcuos/0 Not tainted
> >4.12.0-0.rc1.git1.1.fc27.x86_64 #1
> >[  192.155684] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
> >[  192.156741] task: ffff8d5bb6e73100 task.stack: ffffa9a3c0360000
> >[  192.157830] RIP: 0010:percpu_ref_switch_to_atomic_rcu+0x1cd/0x1f0
> >[  192.158956] RSP: 0018:ffffa9a3c0363e50 EFLAGS: 00010282
> >[  192.159899] RAX: 0000000000000053 RBX: 8000000000000000 RCX:
> >0000000000000000
> >[  192.161192] RDX: 0000000000000000 RSI: ffff8d5bb6e73d90 RDI:
> >0000000000000247
> >[  192.162483] RBP: ffffa9a3c0363e68 R08: 0000000000000000 R09:
> >0000000000000000
> >[  192.163762] R10: ffffa9a3c0363e00 R11: 0000000000000000 R12:
> >0000000000000000
> >[  192.165144] R13: ffff8d5bb5252580 R14: ffff8d5bb5252580 R15:
> >0000000000000000
> >[  192.166437] FS:  0000000000000000(0000)
> >GS:ffff8d5bb7800000(0000) knlGS:0000000000000000
> >[  192.169392] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> >[  192.171893] CR2: 00007ffc83d87d98 CR3: 0000000013e11000 CR4:
> >00000000000006f0
> >[  192.174692] Call Trace:
> >[  192.176627]  ? percpu_ref_reinit+0x140/0x140
> >[  192.178875]  rcu_nocb_kthread+0x1ae/0x580
> >[  192.181174]  kthread+0x133/0x150
> >[  192.183305]  ? rcu_eqs_enter_common.constprop.68+0x1c0/0x1c0
> >[  192.185813]  ? kthread_create_on_node+0x70/0x70
> >[  192.188110]  ret_from_fork+0x31/0x40
> >[  192.190338] Code: ff ff ff 80 3d a6 71 b9 00 00 0f 85 c1 fe ff
> >ff 49 8b 55 d8 49 8b 75 e8 48 c7 c7 f0 2b ce 8b c6 05 8a 71 b9 00
> >...|01 e8 89 95 d4 ff <0f> ff e9 9f fe ff ff f0 49 83 6d d8 01 0f
> >85 05 ff ff ff 48 89
> >[  192.196842] ---[ end trace fbf926294c454ffa ]---
> >
> >[  192.199425] ------------[ cut here ]------------
> >[  192.200346] kernel BUG at lib/percpu-refcount.c:192!
> >[  192.200346] invalid opcode: 0000 [#1] SMP
> >[  192.200346] Modules linked in: dm_raid raid456
> >async_raid6_recov async_memcpy async_pq async_xor async_tx xor
> >raid6_pq ...|dm_thin_pool dm_cache_smq dm_cache dm_persistent_data
> >dm_bio_prison libcrc32c loop crct10dif_pclmul crc32_pclmul
> >crc32c_intel     ...|ppdev ghash_clmulni_intel acpi_cpufreq
> >virtio_net virtio_balloon tpm_tis joydev tpm_tis_core parport_pc
> >parport tpm i2c_piix4     ...|virtio_blk cirrus drm_kms_helper ttm
> >drm serio_raw virtio_pci virtio_ring virtio ata_generic pata_acpi
> >[  192.211647] CPU: 0 PID: 13767 Comm: mdX_raid5 Tainted: G
> >W   4.12.0-0.rc1.git1.1.fc27.x86_64 #1
> >[  192.211647] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
> >[  192.211647] task: ffff8d5bb15f3100 task.stack: ffffa9a3c0d10000
> >[  192.211647] RIP: 0010:__percpu_ref_switch_mode+0x278/0x280
> >[  192.211647] RSP: 0018:ffffa9a3c0d13ce8 EFLAGS: 00010046
> >[  192.211647] RAX: 0000000000000001 RBX: ffff8d5bb5252558 RCX:
> >0000000000000000
> >[  192.211647] RDX: 0000000000000001 RSI: ffffffff8c05a098 RDI:
> >0000000000000046
> >[  192.211647] RBP: ffffa9a3c0d13d30 R08: 0000000000000001 R09:
> >0000000000000000
> >[  192.211647] R10: ffffa9a3c0d13ca0 R11: ffffffff8c05a098 R12:
> >0000000000000000
> >[  192.211647] R13: 0000000000000000 R14: ffff8d5bb15f3100 R15:
> >ffff8d5bb508f000
> >[  192.211647] FS:  0000000000000000(0000)
> >GS:ffff8d5bb7800000(0000) knlGS:0000000000000000
> >[  192.211647] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> >[  192.211647] CR2: 00007ffc83d87d98 CR3: 0000000013e11000 CR4:
> >00000000000006f0
> >[  192.211647] Call Trace:
> >[  192.211647]  percpu_ref_switch_to_percpu+0x27/0x40
> >[  192.211647]  set_in_sync+0xd4/0xe0
> >[  192.211647]  md_check_recovery+0x1f5/0x4e0
> >[  192.211647]  raid5d+0x56/0x6a0 [raid456]
> >[  192.211647]  ? finish_wait+0x72/0x90
> >[  192.211647]  ? trace_hardirqs_on_caller+0xf4/0x190
> >[  192.211647]  ? trace_hardirqs_on+0xd/0x10
> >[  192.211647]  md_thread+0x138/0x180
> >[  192.211647]  ? md_thread+0x138/0x180
> >[  192.211647]  ? finish_wait+0x90/0x90
> >[  192.211647]  kthread+0x133/0x150
> >[  192.211647]  ? find_pers+0x70/0x70
> >[  192.211647]  ? kthread_create_on_node+0x70/0x70
> >[  192.211647]  ret_from_fork+0x31/0x40
> >[  192.211647] Code: f0 4a c8 8b be d5 03 00 00 48 c7 c7 b1 75 c7
> >8b c6 05 ef 75 b9 00 01 e8 97 d2 c4 ff e9 e8 fe ff ff f0 48 83 03
> >...|01 e9 bf fe ff ff <0f> 0b e8 21 d9 be ff 90 55 48 89 e5 41 55
> >41 54 53 48 89 fb 48
> >[  192.211647] RIP: __percpu_ref_switch_mode+0x278/0x280 RSP:
> >ffffa9a3c0d13ce8
> >[  192.211647] ---[ end trace fbf926294c454ffb ]---
> >
> >[  192.211647] BUG: sleeping function called from invalid context
> >at ./include/linux/percpu-rwsem.h:33
> >[  192.211647] in_atomic(): 1, irqs_disabled(): 1, pid: 13767,
> >name: mdX_raid5
> >[  192.211647] INFO: lockdep is turned off.
> >[  192.211647] irq event stamp: 68
> >[  192.211647] hardirqs last  enabled at (67):
> >[<ffffffff8b983586>] _raw_spin_unlock_irqrestore+0x36/0x60
> >[  192.211647] hardirqs last disabled at (68):
> >[<ffffffff8b983c47>] _raw_spin_lock_irqsave+0x27/0x90
> >[  192.211647] softirqs last  enabled at (0): [<ffffffff8b0ae308>]
> >copy_process.part.26+0x618/0x1ee0
> >[  192.211647] softirqs last disabled at (0): [<          (null)>]
> >(null)
> >[  192.211647] CPU: 0 PID: 13767 Comm: mdX_raid5 Tainted: G      D
> >W   4.12.0-0.rc1.git1.1.fc27.x86_64 #1
> >[  192.211647] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
> >[  192.211647] Call Trace:
> >[  192.211647]  dump_stack+0x8e/0xcd
> >[  192.211647]  ___might_sleep+0x144/0x260
> >[  192.211647]  __might_sleep+0x4a/0x80
> >[  192.211647]  exit_signals+0x33/0x240
> >[  192.211647]  do_exit+0xb4/0xd30
> >[  192.211647]  ? kthread+0x133/0x150
> >[  192.211647]  ? find_pers+0x70/0x70
> >[  192.211647]  rewind_stack_do_exit+0x17/0x20
> >
> >[  192.211647] note: mdX_raid5[13767] exited with preempt_count 2
> >
> >[  220.071005] NMI watchdog: BUG: soft lockup - CPU#1 stuck for
> >22s! [lvm:13678]
> >[  220.071079] Modules linked in: dm_raid raid456
> >async_raid6_recov async_memcpy async_pq async_xor async_tx xor
> >raid6_pq ...|dm_thin_pool dm_cache_smq dm_cache dm_persistent_data
> >dm_bio_prison libcrc32c loop crct10dif_pclmul crc32_pclmul
> >crc32c_intel     ...|ppdev ghash_clmulni_intel acpi_cpufreq
> >virtio_net virtio_balloon tpm_tis joydev tpm_tis_core parport_pc
> >parport tpm i2c_piix4     ...|virtio_blk cirrus drm_kms_helper ttm
> >drm serio_raw virtio_pci virtio_ring virtio ata_generic pata_acpi
> >[  220.083131] irq event stamp: 313716
> >[  220.087117] hardirqs last  enabled at (313715):
> >[<ffffffff8b983da7>] entry_SYSCALL_64_fastpath+0x5/0xbe
> >[  220.091109] hardirqs last disabled at (313716):
> >[<ffffffff8b97afe4>] __schedule+0xc4/0xb40
> >[  220.091109] softirqs last  enabled at (301518):
> >[<ffffffff8b988d52>] __do_softirq+0x382/0x4ed
> >[  220.095114] softirqs last disabled at (301511):
> >[<ffffffff8b0b8e7f>] irq_exit+0x10f/0x120
> >[  220.099150] CPU: 1 PID: 13678 Comm: lvm Tainted: G      D W
> >4.12.0-0.rc1.git1.1.fc27.x86_64 #1
> >[  220.103049] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
> >[  220.103049] task: ffff8d5bb3c03100 task.stack: ffffa9a3c155c000
> >[  220.107103] RIP: 0010:queued_spin_lock_slowpath+0x25/0x1b0
> >[  220.107103] RSP: 0018:ffffa9a3c155f720 EFLAGS: 00000202
> >ORIG_RAX: ffffffffffffff10
> >[  220.111144] RAX: 0000000000000001 RBX: ffff8d5bb5252448 RCX:
> >0000000000000000
> >[  220.115126] RDX: 0000000000000001 RSI: 0000000000000001 RDI:
> >ffff8d5bb5252448
> >[  220.115126] RBP: ffffa9a3c155f720 R08: 0000000000000000 R09:
> >0000000000000000
> >[  220.119130] R10: ffffa9a3c155f6d0 R11: ffff8d5bb3c03100 R12:
> >ffff8d5bb5252448
> >[  220.123139] R13: ffff8d5bb5252448 R14: ffff8d5bb379ca20 R15:
> >ffff8d5bb5252058
> >[  220.127118] FS:  00007f1f8253b880(0000)
> >GS:ffff8d5bb7a00000(0000) knlGS:0000000000000000
> >[  220.127118] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> >[  220.131158] CR2: 00007fe2715ae000 CR3: 0000000075108000 CR4:
> >00000000000006e0
> >[  220.135114] Call Trace:
> >[  220.135114]  do_raw_spin_lock+0xb2/0xc0
> >[  220.139145]  _raw_spin_lock+0x55/0x70
> >[  220.139145]  md_write_start+0x143/0x360
> >[  220.143126]  ? probe_sched_wakeup+0x1c/0x20
> >[  220.143126]  ? check_preempt_curr+0x27/0xb0
> >[  220.147147]  raid5_make_request+0xd0/0x1060 [raid456]
> >[  220.147147]  ? _raw_spin_unlock_irqrestore+0x5b/0x60
> >[  220.151162]  ? try_to_wake_up+0x58/0x520
> >[  220.151162]  ? finish_wait+0x90/0x90
> >[  220.155099]  ? mempool_alloc_slab+0x1d/0x30
> >[  220.155099]  ? finish_wait+0x90/0x90
> >[  220.159136]  raid_map+0x2b/0x40 [dm_raid]
> >[  220.159136]  __map_bio+0xc9/0x2b0
> >[  220.163115]  ? dm_put_table_device+0xd0/0xd0
> >[  220.163115]  __split_and_process_bio+0x293/0x590
> >[  220.167133]  dm_make_request+0x7f/0x120
> >[  220.167133]  ? generic_make_request+0xf9/0x3b0
> >[  220.171131]  generic_make_request+0x126/0x3b0
> >[  220.171131]  ? iov_iter_get_pages+0xc9/0x300
> >[  220.175138]  submit_bio+0x73/0x150
> >[  220.175138]  ? submit_bio+0x73/0x150
> >[  220.179132]  ? bio_iov_iter_get_pages+0xe0/0x120
> >[  220.179132]  ? __blkdev_direct_IO_simple+0x168/0x320
> >[  220.183114]  __blkdev_direct_IO_simple+0x184/0x320
> >[  220.183114]  ? lock_acquire+0xa3/0x1f0
> >[  220.187129]  ? bdput+0x20/0x20
> >[  220.187129]  blkdev_direct_IO+0x3b1/0x3e0
> >[  220.191112]  ? blkdev_direct_IO+0x3b1/0x3e0
> >[  220.191112]  ? current_time+0x1a/0x70
> >[  220.195108]  ? trace_hardirqs_on+0xd/0x10
> >[  220.195108]  generic_file_direct_write+0x9c/0x110
> >[  220.199114]  ? generic_file_direct_write+0x9c/0x110
> >[  220.199114]  __generic_file_write_iter+0xbe/0x1d0
> >[  220.199114]  blkdev_write_iter+0x96/0x110
> >[  220.203115]  __vfs_write+0xe8/0x160
> >[  220.203115]  ? __vfs_write+0xe8/0x160
> >[  220.207211]  vfs_write+0xc6/0x1c0
> >[  220.207211]  SyS_write+0x58/0xc0
> >[  220.207211]  entry_SYSCALL_64_fastpath+0x1f/0xbe
> >[  220.211130] RIP: 0033:0x7f1f81309290
> >[  220.211130] RSP: 002b:00007fffba453af8 EFLAGS: 00000246
> >ORIG_RAX: 0000000000000001
> >[  220.215116] RAX: ffffffffffffffda RBX: 0000000000000001 RCX:
> >00007f1f81309290
> >[  220.215116] RDX: 0000000000001000 RSI: 000055f0c4ab4000 RDI:
> >000000000000000c
> >[  220.219114] RBP: 00007f1f815d5583 R08: fffffffffffffff0 R09:
> >0000000000000000
> >[  220.219114] R10: 00007f1f815d4b38 R11: 0000000000000246 R12:
> >0000000000000001
> >[  220.223148] R13: 0000000000000001 R14: 00007f1f815d5500 R15:
> >00007f1f815d08c0
> >[  220.223148] Code: 1f 80 00 00 00 00 66 66 66 66 90 55 48 89 e5
> >66 66 90 66 90 ba 01 00 00 00 8b 07 85 c0 75 0a f0 0f b1 17 85 c0
> >...|75 f2 5d c3 f3 90 <eb> ec 81 fe 00 01 00 00 0f 84 1a 01 00 00
> >41 b8 01 01 00 00 b9
> >
> >-- 
> >dm-devel mailing list
> >dm-devel at redhat.com
> >https://www.redhat.com/mailman/listinfo/dm-devel
> 




More information about the dm-devel mailing list