On 1/9/18 9:19 AM, tj@xxxxxxxxxx wrote: > Hello, Bart. > > On Tue, Jan 09, 2018 at 04:12:40PM +0000, Bart Van Assche wrote: >> I'm concerned about the additional CPU cycles needed for the new blk_mq_map_queue() >> call, although I know this call is cheap. Would the timeout code really get that > > So, if that is really a concern, let's cache that mapping instead of > changing synchronization rules for that. > >> much more complicated if the hctx_lock() and hctx_unlock() calls would be changed >> into rcu_read_lock() and rcu_read_unlock() calls? Would it be sufficient if >> "if (has_rcu) synchronize_rcu();" would be changed into "synchronize_rcu();" in >> blk_mq_timeout_work()? > > Code-wise, it won't be too much extra code but I think diverging the > sync methods between issue and completion paths is more fragile and > likely to invite confusions and mistakes in the future. We have the > normal path (issue&completion) synchronizing against the exception > path (timeout). I think it's best to keep the sync constructs aligned > with that conceptual picture. Guys, the map call is FINE. We do it at least once per request anyway, usually multiple times. If it's too expensive, then THAT is what needs fixing, not adding an arbitrary caching of that mapping. Look at the actual code: return q->queue_hw_ctx[q->mq_map[cpu]]; that's it. It's just a double index. So let's put this thing to rest right now - the map call is perfectly fine, especially since the alternatives are either bloating struct request or making the code less maintainable. Foot -> down. -- Jens Axboe