On Tue, Mar 18, 2014 at 01:10:44PM +0000, Zuckerman, Boris wrote: > X86 cache lines are much smaller than a page. Cache lined are flushed "naturally", but we do not know about that. > How many Dirty pages do we anticipate? What is the performance cost of msync()? Is that higher, if we do page-based accounting? The number of dirty pages is going to depend on the workload. The problem with looking at today's workloads as an approximation of what workloads will look like is that people will optimise their software for persistent memory once persistent memory becomes more widely available. So all we can do is point out "hey, you have a lot of dirty pages, maybe you'd like to change your algorithms". > Reasons and frequency of msync(): > Atomicity: needs barriers, happens frequently, leaves relatively small number of Dirty pages. Here the cost is probably smaller. > Durability of application updates: issued infrequently, leaves many Dirty pages. The cost could be high, right? We have two ways on x86 to implement msync. One is to flush every cacheline and the other is to flush the entire cache. If the user asks to flush a terabyte of memory, it's clearly cheaper to flush the cache. If the user asks to flush 64 bytes, we should clearly just flush a single line. Somewhere in-between there's a cross-over point, and that's going to depend on the size of the CPU's cache, the nature of the workload, and a few other factors. I'm not worrying about where that is right now, because we can't make that decision without first tracking which pages are dirty and which are clean. > Let's assume that at some point we get CPU/Persistent Memory Controller > combinations that support atomicity of multiple updates in hardware. Would > you need to mark pages Dirty in such cases? If not, what is the right > layer build that support for x86? Regardless of future hardware innovations, we need to support the semantics of msync(). That is, if the user calls msync(), sees that it has successfully synced a range of data to media, and then after a reboot discovers that all or part of that msync hadn't actually happened, we're going to have a very unhappy user on our hands. If we have a write-through cache, then we don't need to implement dirty bit tracking. But my impression is that write-through caches are significantly worse performing than write-back, so I don't intend to optimise for them. If there's some fancy new hardware that lets you do an update of multiple cachelines atomically and persistently, then I guess the software will be calling that instead of msync(), so the question about whether msync() would need to flush the cachelines for that page won't actually arise. -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html