On 7/13/20 5:41 PM, Pavel Begunkov wrote: > Different batching improvements, that's it. > > Unfortunately, I don't have a decent SSD/setup at hand to > benchmark it properly. I do though, but I'm not seeing any improvement with this, whereas some of the previous series made nice improvements... If anything maybe it's a bit slower. > p.s. if extra 32 pointers on stack would be a problem, I wanted for > long to put submit_state into ctx itself. It's getting up there... But really depends on how early in the stack, so 32 could _probably_ work, though usually batched on-stack counts are a bit lower than that. -- Jens Axboe