On Tue, Jun 09, 2015 at 02:54:01PM -0700, Linus Torvalds wrote: > On Tue, Jun 9, 2015 at 2:14 PM, Dave Hansen <dave.hansen@xxxxxxxxx> wrote: > > > > The 0 cycle TLB miss was also interesting. It goes back up to something > > reasonable if I put the mb()/mfence's back. > > So I've said it before, and I'll say it again: Intel does really well > on TLB fills. Assuming the page tables are cache-hot... And hot here does not mean L3 cache, but higher. But a memory intensive workload can easily violate that. That's why I'm dubious of all these micro benchmarks. They won't be clearing caches. They generate unrealistic conditions in the CPU pipeline and overestimate the cost of the flushes. The only good way to measure TLB costs is macro benchmarks. -Andi -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>