Assorted batching improvements, that's it.

Unfortunately, I don't have a decent SSD/setup at hand to benchmark it
properly.

P.S. If the extra 32 pointers on the stack would be a problem, I have
long wanted to put submit_state into ctx itself.

Pavel Begunkov (5):
  io_uring: move io_req_complete() definition
  io_uring: replace list with array for compl batch
  io_uring: batch free in batched completion
  tasks: add put_task_struct_many()
  io_uring: batch put_task_struct()

 fs/io_uring.c              | 129 ++++++++++++++++++++++---------------
 include/linux/sched/task.h |   6 ++
 2 files changed, 82 insertions(+), 53 deletions(-)

-- 
2.24.0