On 6/22/21 11:42 PM, Olivier Langlois wrote: > On Tue, 2021-06-22 at 18:37 -0400, Olivier Langlois wrote: >> On Tue, 2021-06-22 at 21:45 +0100, Pavel Begunkov wrote: >>> On 6/22/21 7:55 PM, Olivier Langlois wrote: >>>> If an asynchronous completion happens before the task is >>>> preparing >>>> itself to wait and set its state to TASK_INTERRUPTIBLE, the >>>> completion >>>> will not wake up the sqp thread. >>>> >>>> Signed-off-by: Olivier Langlois <olivier@xxxxxxxxxxxxxx> >>>> --- >>>> fs/io_uring.c | 2 +- >>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>> >>>> diff --git a/fs/io_uring.c b/fs/io_uring.c >>>> index fc8637f591a6..02f789e07d4c 100644 >>>> --- a/fs/io_uring.c >>>> +++ b/fs/io_uring.c >>>> @@ -6902,7 +6902,7 @@ static int io_sq_thread(void *data) >>>> } >>>> >>>> prepare_to_wait(&sqd->wait, &wait, >>>> TASK_INTERRUPTIBLE); >>>> - if (!io_sqd_events_pending(sqd)) { >>>> + if (!io_sqd_events_pending(sqd) && !current- >>>>> task_works) { >>> >>> Agree that it should be here, but we also lack a good enough >>> task_work_run() around, and that may send the task burn CPU >>> for a while in some cases. Let's do >>> >>> if (!io_sqd_events_pending(sqd) && !io_run_task_work()) >>> ... >> >> I can do that if you want but considering that the function is inline >> and the race condition is a relatively rare occurence, is the cost >> coming with inline expansion really worth it in this case? >>> > On hand, there is the inline expansion concern. > > OTOH, the benefit of going with your suggestion is that completions > generally precedes new submissions so yes, it might be better that way. > > I'm really unsure about this. I'm just raising the concern and I'll let > you make the final decision... It seems it may actually loop infinitely until it gets a signal, so yes. And even if not, rare stalls are nasty, they will ruin some 9s of latency and hard to catch. That part is quite cold anyway, would generate some extra cold instructions, meh -- Pavel Begunkov