On Thu, Apr 6, 2017 at 7:13 AM, Will Deacon <will.deacon@xxxxxxx> wrote:
>
> We've wrapped this up in the arm64 code as __cmpwait, and we use that
> to build smp_cond_load_acquire. It would be nice to use the same machinery
> for the conditional spinning here, unless you anticipate that we're only
> going to be spinning for a handful of iterations anyway?

I suspect most of these loops aren't set up for the WFE kind of
spinning, because they look for more than one variable.

.. and the ones that _are_ set up for this probably should just be
rewritten to use smp_cond_load_acquire() anyway, because the "wait for
value" special case is fairly special.

In theory x86 could use monitor/mwait for it too, in practice I think
it tends to still be too high latency (because it was originally just
designed for the idle loop). mwait got extended to actually be useful,
but I'm not sure what the latency is for the modern one.

              Linus
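
For reference, a minimal userspace sketch of the single-variable "wait for value" shape that fits smp_cond_load_acquire(). It assumes C11 atomics rather than the kernel's own primitives, and the names wait_until_bits_clear() and cpu_relax_hint() are hypothetical, not kernel API. The point is only that the loop condition depends on one memory location, which is what lets an architecture substitute a WFE-style wait for the busy spin.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/*
 * Hypothetical spin hint (an assumption for this sketch, not kernel code).
 * On x86 with GCC/Clang it emits PAUSE; elsewhere it does nothing.
 * An arm64 kernel build would wait in WFE via __cmpwait instead.
 */
static inline void cpu_relax_hint(void)
{
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
	__builtin_ia32_pause();
#endif
}

/*
 * Spin until (*ptr & mask) == 0, then return the value that satisfied
 * the condition. Each load is an acquire load, so accesses after the
 * wait cannot be reordered before the load that ended it.
 *
 * The loop looks at exactly one variable. A loop that also checked a
 * second variable could not simply sleep on *ptr until it changes.
 */
static inline unsigned int wait_until_bits_clear(atomic_uint *ptr,
						 unsigned int mask)
{
	unsigned int val;

	assert(ptr != NULL);

	for (;;) {
		val = atomic_load_explicit(ptr, memory_order_acquire);
		if ((val & mask) == 0)
			break;
		cpu_relax_hint();
	}
	return val;
}
```

A caller waiting for a hypothetical "locked" bit 0 to clear would write wait_until_bits_clear(&word, 1u) and get back the full word as it was observed when the bit was clear.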