On Thu, Jan 30, 2025 at 12:41 AM Pavel Begunkov <asml.silence@xxxxxxxxx> wrote:
> Ok, then it's an architectural problem and needs more serious
> reengineering, e.g. of how work items are stored and grabbed

Rough unpolished idea: I was thinking about having multiple work lists, each with its own spinlock (separate cache line), and each io-wq thread only uses one of them, while the submitter round-robins through the lists.
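
To make the idea a bit more concrete, here is a rough userspace sketch of it. This is not io-wq code: all names (sharded_wq, work_list, wq_submit, wq_grab, NR_LISTS, CACHE_LINE) are made up for illustration, a C11 atomic_flag spin loop stands in for the kernel's spinlock_t, and the 64-byte cache line size is an assumption.

```c
#include <assert.h>
#include <stdalign.h>
#include <stdatomic.h>
#include <stddef.h>

#define NR_LISTS   4    /* hypothetical: one list per worker thread */
#define CACHE_LINE 64   /* assumed cache line size in bytes */

struct work_item {
    struct work_item *next;
    int payload;
};

/* Alignment pads each list so that every lock sits on its own cache line. */
struct work_list {
    alignas(CACHE_LINE) atomic_flag lock;
    struct work_item *head, *tail;
};

struct sharded_wq {
    struct work_list lists[NR_LISTS];
    atomic_uint next_list;  /* round-robin cursor shared by submitters */
};

static void wq_init(struct sharded_wq *wq)
{
    for (int i = 0; i < NR_LISTS; i++) {
        atomic_flag_clear(&wq->lists[i].lock);
        wq->lists[i].head = wq->lists[i].tail = NULL;
    }
    atomic_init(&wq->next_list, 0);
}

static void list_lock(struct work_list *l)
{
    while (atomic_flag_test_and_set_explicit(&l->lock, memory_order_acquire))
        ;  /* spin until the flag was previously clear */
}

static void list_unlock(struct work_list *l)
{
    atomic_flag_clear_explicit(&l->lock, memory_order_release);
}

/* Submitter side: pick the next list round-robin and append to its tail.
 * Returns the index of the list that received the item. */
static unsigned wq_submit(struct sharded_wq *wq, struct work_item *item)
{
    assert(item != NULL);
    unsigned idx = atomic_fetch_add(&wq->next_list, 1) % NR_LISTS;
    struct work_list *l = &wq->lists[idx];

    item->next = NULL;
    list_lock(l);
    if (l->tail)
        l->tail->next = item;
    else
        l->head = item;
    l->tail = item;
    list_unlock(l);
    return idx;
}

/* Worker side: a worker only ever takes the lock of its own list.
 * Returns NULL if that list is empty. */
static struct work_item *wq_grab(struct sharded_wq *wq, unsigned worker)
{
    struct work_list *l = &wq->lists[worker % NR_LISTS];
    struct work_item *item;

    list_lock(l);
    item = l->head;
    if (item) {
        l->head = item->next;
        if (!l->head)
            l->tail = NULL;
    }
    list_unlock(l);
    return item;
}
```

With this layout a worker contends only with submitters that happen to hit its list, not with the other workers. The sketch deliberately leaves open what happens when one worker is busy or asleep while items pile up on its list; that would need some form of stealing or rebalancing, which is not shown here.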