On Fri, Oct 16, 2020 at 9:48 PM Shuah Khan <skhan@xxxxxxxxxxxxxxxxxxx> wrote: > > Andrey, > > I am unable to reproduce the problem. I even tweaked the reproducer > to launch 10 procs and 100 threads. > > Can you test the following patch with your setup > > Here it is - also attached. Hi Shuah, I get the crash below with this change. It complains about kthread_stop_put() being called in atomic context. BUG: sleeping function called from invalid context at kernel/sched/completion.c:101 in_atomic(): 1, irqs_disabled(): 1, non_block: 0, pid: 112, name: kworker/u4:2 4 locks held by kworker/u4:2/112: #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: arch_atomic64_set arch/x86/include/asm/atomic64_64.h:34 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: atomic64_set include/asm-generic/atomic-instrumented.h:856 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: atomic_long_set include/asm-generic/atomic-long.h:41 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: set_work_data kernel/workqueue.c:616 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: set_work_pool_and_clear_pending kernel/workqueue.c:643 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: process_one_work+0x841/0x15c0 kernel/workqueue.c:2243 #1: ffffc900001dfdc0 (usbip_work){+.+.}-{0:0}, at: process_one_work+0x874/0x15c0 kernel/workqueue.c:2247 #2: ffff88801371d280 (&vhci_hcd->vhci->lock){....}-{2:2}, at: vhci_shutdown_connection+0xd2/0x360 drivers/usb/usbip/vhci_hcd.c:1019 #3: ffff88801a904890 (&vdev->priv_lock){....}-{2:2}, at: spin_lock include/linux/spinlock.h:354 [inline] #3: ffff88801a904890 (&vdev->priv_lock){....}-{2:2}, at: vhci_shutdown_connection+0xeb/0x360 drivers/usb/usbip/vhci_hcd.c:1020 irq event stamp: 1298372 hardirqs last enabled at (1298371): [<ffffffff81b6ebd6>] kfree+0x236/0x290 mm/slab.c:3759 hardirqs last disabled at (1298372): [<ffffffff8844b5f9>] __raw_spin_lock_irqsave include/linux/spinlock_api_smp.h:108 [inline] hardirqs last disabled at (1298372): [<ffffffff8844b5f9>] _raw_spin_lock_irqsave+0xa9/0xd0 kernel/locking/spinlock.c:159 softirqs last enabled at (1298362): [<ffffffff87fdcae1>] spin_unlock_bh include/linux/spinlock.h:399 [inline] softirqs last enabled at (1298362): [<ffffffff87fdcae1>] batadv_nc_purge_paths+0x2d1/0x400 net/batman-adv/network-coding.c:470 softirqs last disabled at (1298360): [<ffffffff87fdc8ef>] spin_lock_bh include/linux/spinlock.h:359 [inline] softirqs last disabled at (1298360): [<ffffffff87fdc8ef>] batadv_nc_purge_paths+0xdf/0x400 net/batman-adv/network-coding.c:441 Preemption disabled at: [<0000000000000000>] 0x0 CPU: 0 PID: 112 Comm: kworker/u4:2 Not tainted 5.9.0+ #29 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1 04/01/2014 Workqueue: usbip_event event_handler Call Trace: __dump_stack lib/dump_stack.c:77 [inline] dump_stack+0x198/0x1fb lib/dump_stack.c:118 ___might_sleep.cold+0x1ef/0x235 kernel/sched/core.c:7298 __wait_for_common kernel/sched/completion.c:101 [inline] wait_for_common kernel/sched/completion.c:117 [inline] wait_for_completion+0x85/0x270 kernel/sched/completion.c:138 kthread_stop+0x17d/0x720 kernel/kthread.c:596 vhci_shutdown_connection+0x121/0x360 drivers/usb/usbip/vhci_hcd.c:1024 event_handler+0x1a4/0x3b0 drivers/usb/usbip/usbip_event.c:78 process_one_work+0x953/0x15c0 kernel/workqueue.c:2272 worker_thread+0x96/0xe20 kernel/workqueue.c:2418 kthread+0x3b5/0x4a0 kernel/kthread.c:292 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:296 BUG: scheduling while atomic: kworker/u4:2/112/0x00000003 4 locks held by kworker/u4:2/112: #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: arch_atomic64_set arch/x86/include/asm/atomic64_64.h:34 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: atomic64_set include/asm-generic/atomic-instrumented.h:856 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: atomic_long_set include/asm-generic/atomic-long.h:41 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: set_work_data kernel/workqueue.c:616 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: set_work_pool_and_clear_pending kernel/workqueue.c:643 [inline] #0: ffff888012a7b138 ((wq_completion)usbip_event){+.+.}-{0:0}, at: process_one_work+0x841/0x15c0 kernel/workqueue.c:2243 #1: ffffc900001dfdc0 (usbip_work){+.+.}-{0:0}, at: process_one_work+0x874/0x15c0 kernel/workqueue.c:2247 #2: ffff88801371d280 (&vhci_hcd->vhci->lock){+.+.}-{2:2}, at: vhci_shutdown_connection+0xd2/0x360 drivers/usb/usbip/vhci_hcd.c:1019 #3: ffff88801a904890 (&vdev->priv_lock){+.+.}-{2:2}, at: spin_lock include/linux/spinlock.h:354 [inline] #3: ffff88801a904890 (&vdev->priv_lock){+.+.}-{2:2}, at: vhci_shutdown_connection+0xeb/0x360 drivers/usb/usbip/vhci_hcd.c:1020 Modules linked in: Preemption disabled at: [<0000000000000000>] 0x0