Patch "blk-mq: Fix stall due to recursive flush plug" has been added to the 6.4-stable tree

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



This is a note to let you know that I've just added the patch titled

    blk-mq: Fix stall due to recursive flush plug

to the 6.4-stable tree which can be found at:
    http://www.kernel.org/git/?p=linux/kernel/git/stable/stable-queue.git;a=summary

The filename of the patch is:
     blk-mq-fix-stall-due-to-recursive-flush-plug.patch
and it can be found in the queue-6.4 subdirectory.

If you, or anyone else, feels it should not be added to the stable tree,
please let <stable@xxxxxxxxxxxxxxx> know about it.



commit 71777f122cab2b0f8829ed04895b872bd32ace09
Author: Ross Lagerwall <ross.lagerwall@xxxxxxxxxx>
Date:   Fri Jul 14 11:11:06 2023 +0100

    blk-mq: Fix stall due to recursive flush plug
    
    [ Upstream commit 70904263512a74a3b8941dd9e6e515ca6fc57821 ]
    
    We have seen rare IO stalls as follows:
    
    * blk_mq_plug_issue_direct() is entered with an mq_list containing two
    requests.
    * For the first request, it sets last == false and enters the driver's
    queue_rq callback.
    * The driver queue_rq callback indirectly calls schedule() which calls
    blk_flush_plug(). This may happen if the driver has the
    BLK_MQ_F_BLOCKING flag set and is allowed to sleep in ->queue_rq.
    * blk_flush_plug() handles the remaining request in the mq_list. mq_list
    is now empty.
    * The original call to queue_rq resumes (with last == false).
    * The loop in blk_mq_plug_issue_direct() terminates because there are no
    remaining requests in mq_list.
    
    The IO is now stalled because the last request submitted to the driver
    had last == false and there was no subsequent call to commit_rqs().
    
    Fix this by returning early in blk_mq_flush_plug_list() if rq_count is 0
    which it will be in the recursive case, rather than checking if the
    mq_list is empty. At the same time, adjust one of the callers to skip
    the mq_list empty check as it is not necessary.
    
    Fixes: dc5fc361d891 ("block: attempt direct issue of plug list")
    Signed-off-by: Ross Lagerwall <ross.lagerwall@xxxxxxxxxx>
    Reviewed-by: Bart Van Assche <bvanassche@xxxxxxx>
    Link: https://lore.kernel.org/r/20230714101106.3635611-1-ross.lagerwall@xxxxxxxxxx
    Signed-off-by: Jens Axboe <axboe@xxxxxxxxx>
    Signed-off-by: Sasha Levin <sashal@xxxxxxxxxx>

diff --git a/block/blk-core.c b/block/blk-core.c
index 3fc68b9444791..0434f5a8151fe 100644
--- a/block/blk-core.c
+++ b/block/blk-core.c
@@ -1141,8 +1141,7 @@ void __blk_flush_plug(struct blk_plug *plug, bool from_schedule)
 {
 	if (!list_empty(&plug->cb_list))
 		flush_plug_callbacks(plug, from_schedule);
-	if (!rq_list_empty(plug->mq_list))
-		blk_mq_flush_plug_list(plug, from_schedule);
+	blk_mq_flush_plug_list(plug, from_schedule);
 	/*
 	 * Unconditionally flush out cached requests, even if the unplug
 	 * event came from schedule. Since we know hold references to the
diff --git a/block/blk-mq.c b/block/blk-mq.c
index 73ed8ccb09ce8..58bf41e8e66c7 100644
--- a/block/blk-mq.c
+++ b/block/blk-mq.c
@@ -2754,7 +2754,14 @@ void blk_mq_flush_plug_list(struct blk_plug *plug, bool from_schedule)
 {
 	struct request *rq;
 
-	if (rq_list_empty(plug->mq_list))
+	/*
+	 * We may have been called recursively midway through handling
+	 * plug->mq_list via a schedule() in the driver's queue_rq() callback.
+	 * To avoid mq_list changing under our feet, clear rq_count early and
+	 * bail out specifically if rq_count is 0 rather than checking
+	 * whether the mq_list is empty.
+	 */
+	if (plug->rq_count == 0)
 		return;
 	plug->rq_count = 0;
 



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux