On Wed, May 03, 2017 at 05:08:30PM +0000, Bart Van Assche wrote: > On Thu, 2017-05-04 at 00:52 +0800, Ming Lei wrote: > > Looks v4.11 plus your for-linus often triggers the following hang during > > boot, and it seems caused by the change in (blk-mq: unify hctx delayed_run_work > > and run_work) > > > > > > BUG: scheduling while atomic: kworker/0:1H/704/0x00000002 > > Modules linked in: > > Preemption disabled at: > > [<ffffffffaf5607bb>] virtio_queue_rq+0xdb/0x350 > > CPU: 0 PID: 704 Comm: kworker/0:1H Not tainted 4.11.0-04508-ga1f35f46164b #132 > > Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.9.3-1.fc25 04/01/2014 > > Workqueue: kblockd blk_mq_run_work_fn > > Call Trace: > > dump_stack+0x65/0x8f > > ? virtio_queue_rq+0xdb/0x350 > > __schedule_bug+0x76/0xc0 > > __schedule+0x610/0x820 > > ? new_slab+0x2c9/0x590 > > schedule+0x40/0x90 > > schedule_timeout+0x273/0x320 > > ? ___slab_alloc+0x3cb/0x4f0 > > wait_for_completion+0x97/0x100 > > ? wait_for_completion+0x97/0x100 > > ? wake_up_q+0x80/0x80 > > flush_work+0x104/0x1a0 > > ? flush_workqueue_prep_pwqs+0x130/0x130 > > __cancel_work_timer+0xeb/0x160 > > ? vp_notify+0x16/0x20 > > ? virtqueue_add_sgs+0x23c/0x4a0 > > cancel_delayed_work_sync+0x13/0x20 > > blk_mq_stop_hw_queue+0x16/0x20 > > virtio_queue_rq+0x316/0x350 > > blk_mq_dispatch_rq_list+0x194/0x350 > > blk_mq_sched_dispatch_requests+0x118/0x170 > > ? finish_task_switch+0x80/0x1e0 > > __blk_mq_run_hw_queue+0xa3/0xc0 > > blk_mq_run_work_fn+0x2c/0x30 > > process_one_work+0x1e0/0x400 > > worker_thread+0x48/0x3f0 > > kthread+0x109/0x140 > > ? process_one_work+0x400/0x400 > > ? kthread_create_on_node+0x40/0x40 > > ret_from_fork+0x2c/0x40 > > Callers of blk_mq_quiesce_queue() really need blk_mq_stop_hw_queue() to > cancel delayed work synchronously. The above call stack shows that we have > to do something about the blk_mq_stop_hw_queue() calls from inside .queue_rq() > functions for queues for which BLK_MQ_F_BLOCKING has not been set. I'm not > sure what the best approach would be: setting BLK_MQ_F_BLOCKING for queues > that call blk_mq_stop_hw_queue() from inside .queue_rq() or creating two > versions of blk_mq_stop_hw_queue(). I think either synchronize_srcu() or synchronize_rcu() can handle this, since they are waiting for completion of .queue_rq(). So looks cancel_delayed_work_sync() shouldn't be needed? Thanks, Ming