Re: [PATCH] improve __GFP_COLD/__GFP_ZERO interaction

"Jan Beulich" <JBeulich@xxxxxxxx> · Mon, 02 Jun 2014 16:27:52 +0100

>>> On 02.06.14 at 17:16, <kirill@xxxxxxxxxxxxx> wrote:
> On Mon, Jun 02, 2014 at 03:46:10PM +0100, Jan Beulich wrote:
>> For cold page allocations using the normal clear_highpage() mechanism
>> may be inefficient on certain architectures, namely due to needlessly
>> replacing a good part of the data cache contents. Introduce an arch-
>> overridable clear_cold_highpage() (using streaming non-temporal stores
>> on x86, where an override gets implemented right away) to make use of
>> in this specific case.
>> 
>> Leverage the impovement in the Xen balloon driver, eliminating the
>> explicit scrub_page() function.
> 
> Any benchmark data?
> 
> I've tried non-temporal stores to clear huge pages, but it didn't helped
> much. I believe it can vary between micro-architectures, but we need
> numbers. I've played with Westmere that time.

It's not at all clear to me what to measure here - after all this isn't
about improving the page clearing latency or throughput, but about
avoiding to disturb other operations.

Jan

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href