On Sat, Sep 29, 2012 at 02:37:18AM +0300, Kirill A. Shutemov wrote: > Cons: > - increases TLB pressure; I generally don't like using 4k tlb entries ever. This only has the advantage of saving 2MB-4KB RAM (globally), and a chpxchg at the first system-wide zero page fault. I like apps to only use 2M TLB entries whenever possible (that is going to payoff big as the number of 2M TLB entries is going to increase over time). I did some research with tricks using 4k ptes up to half the pmd was filled before converting it to a THP (to save some memory and cache), and it didn't look good, so my rule of thumb was "THP sometime costs, even the switch from half pte filled to transhuge pmd still costs, so to diminish the risk of slowdowns we should use 2M TLB entries immediately, whenever possible". Now the rule of thumb doesn't fully apply here, 1) there's no compaction costs to offset, 2) chances are the zero page isn't very performance critical anyway... only some weird apps uses it (but sometime they have a legitimate reason for using it, this is why we support it). There would be a small cache benefit here... but even then some first level caches are virtually indexed IIRC (always physically tagged to avoid the software to notice) and virtually indexed ones won't get any benefit. It wouldn't provide even the memory saving tradeoff by dropping the zero pmd at the first fault (not at the last). And it's better to replace it at the first fault then the last (that matches the current design). Another point is that the previous patch is easier to port to other archs by not requiring arch features to track the zero pmd. I guess it won't make a whole lot of difference but my preference is for the previous implementation that always guaranteed huge TLB entries whenever possible. Said that I'm fine either ways so if somebody has strong reasons for wanting this one, I'd like to hear about it. Thanks! Andrea -- To unsubscribe from this list: send the line "unsubscribe linux-arch" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html