On Tue, Jan 26, 2016 at 01:37:38PM -0800, H. Peter Anvin wrote: > On 01/26/16 00:20, Michael S. Tsirkin wrote: > > On Tue, Jan 12, 2016 at 02:25:24PM -0800, H. Peter Anvin wrote: > > > > Hi hpa, > > Any luck getting some feedback on this one? > > > > Yes. What we know so far is that *most* cases it will work, but there > are apparently a few corner cases where MFENCE or a full-blown > serializing instruction is necessary. We are trying to characterize > those corner cases and see if any of them affect the kernel. It would be very interesting to know your findings. Going over the manual I found one such case, and then going over the kernel code I found some questionable uses of barriers - it would be interesting to find out what some other cases are. So I think it's probably useful to find out the full answer, anyway. Awaiting the answers with interest. > Even if they are, we can probably make those barriers explicitly > different, but we don't want to go ahead with the change until we know > where we need to care. > > -hpa Thanks! Now that you definitely said there are corner cases, I poked some more at the manual and found one: CLFLUSH is only ordered by the MFENCE instruction. It is not guaranteed to be ordered by any other fencing or serializing instructions or by another CLFLUSH instruction. For example, software can use an MFENCE instruction to ensure that previous stores are included in the write-back. There are instances of this in mwait_play_dead, clflush_cache_range, mwait_idle_with_hints, mwait_idle .. A comment near pcommit_sfence includes an example flush_and_commit_buffer code which is interesting - it assumes sfence flushes clflush. So it appears that pcommit_sfence in that file is wrong then? At least on processors where it falls back on clflush. mwait_idle is the only one that calls smp_mb and not mb() I couldn't figure out why - original patches did mb() there. Outside core kernel - drm_cache_flush_clflush, drm_clflush_sg, drm_clflush_virt_range. Then there's gru_start_instruction in drivers/misc/sgi-gru/. But otherwise drivers/misc/sgi-gru/ calls clflush in gru_flush_cache without calling mb() - this could be a bug. Looking at all users, it seems that only mwait_idle calls smp_mb, around clflush, others call mb(). So at least as a first step, maybe it makes sense to scope this down somewhat by changing mwait_idle to call mb() and then optimizing __smp_mb instead of mb? I'll post v3 that does this. -- MST _______________________________________________ Virtualization mailing list Virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx https://lists.linuxfoundation.org/mailman/listinfo/virtualization