On 12/16/24 11:08 PM, Jingbo Xu wrote:
>> That's why I asked Jens to weigh in on whether there is a generic
>> block layer solution here. If uring_cmd is faster then maybe a generic
>> uring_cmd I/O interface can be defined without tying applications to
>> device-specific commands. Or maybe the traditional io_uring code path
>> can be optimized so that bypass is no longer attractive.

It's not that the traditional io_uring code path is slower, it's in fact
basically the same thing. It's that all the other jazz that happens
below io_uring slows things down, which is why passthrough ends up being
faster.

-- 
Jens Axboe