On Mon, Jun 02, 2014 at 03:46:10PM +0100, Jan Beulich wrote: > For cold page allocations using the normal clear_highpage() mechanism > may be inefficient on certain architectures, namely due to needlessly > replacing a good part of the data cache contents. Introduce an arch- > overridable clear_cold_highpage() (using streaming non-temporal stores > on x86, where an override gets implemented right away) to make use of > in this specific case. > > Leverage the impovement in the Xen balloon driver, eliminating the > explicit scrub_page() function. Any benchmark data? I've tried non-temporal stores to clear huge pages, but it didn't helped much. I believe it can vary between micro-architectures, but we need numbers. I've played with Westmere that time. -- Kirill A. Shutemov -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>