Re: [PATCH v1] mm: Fix khugepaged activation policy

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 02/07/2024 15:57, David Hildenbrand wrote:
> On 02.07.24 16:46, Ryan Roberts wrote:
>> Since the introduction of mTHP, the docuementation has stated that
>> khugepaged would be enabled when any mTHP size is enabled, and disabled
>> when all mTHP sizes are disabled. There are 2 problems with this; 1.
>> this is not what was implemented by the code and 2. this is not the
>> desirable behavior.
>>
>> Desirable behavior is for khugepaged to be enabled when any PMD-sized
>> THP is enabled, anon or file. (Note that file THP is still controlled by
>> the top-level control so we must always consider that, as well as the
>> PMD-size mTHP control for anon). khugepaged only supports collapsing to
>> PMD-sized THP so there is no value in enabling it when PMD-sized THP is
>> disabled. So let's change the code and documentation to reflect this
>> policy.
>>
>> Further, per-size enabled control modification events were not
>> previously forwarded to khugepaged to give it an opportunity to start or
>> stop. Consequently the following was resulting in khugepaged eroneously
>> not being activated:
>>
>>    echo never > /sys/kernel/mm/transparent_hugepage/enabled
>>    echo always > /sys/kernel/mm/transparent_hugepage/hugepages-2048kB/enabled
>>
>> Signed-off-by: Ryan Roberts <ryan.roberts@xxxxxxx>
>> Fixes: 3485b88390b0 ("mm: thp: introduce multi-size THP sysfs interface")
>> Closes:
>> https://lore.kernel.org/linux-mm/7a0bbe69-1e3d-4263-b206-da007791a5c4@xxxxxxxxxx/
>> Cc: stable@xxxxxxxxxxxxxxx
>> ---
>>
>> Hi All,
>>
>> Applies on top of today's mm-unstable (9bb8753acdd8). No regressions observed in
>> mm selftests.
>>
>> When fixing this I also noticed that khugepaged doesn't get (and never has been)
>> activated/deactivated by `shmem_enabled=`. I'm not sure if khugepaged knows how
>> to collapse shmem - perhaps it should be activated in this case?
>>
> 
> Call me confused.
> 
> khugepaged_scan_mm_slot() and madvise_collapse() only all
> hpage_collapse_scan_file() with ... IS_ENABLED(CONFIG_SHMEM) ?

Looks like khugepaged_scan_mm_slot() was converted from:

  if (shmem_file(vma->vm_file)) {

to:

  if (IS_ENABLED(CONFIG_SHMEM) && vma->vm_file) {

By 99cb0dbd47a15d395bf3faa78dc122bc5efe3fc0 which adds THP collapse support for
non-shmem files. Clearly that looks wrong, but I guess never spotted in practice
because noone disables shemem?

I guess madvise_collapse() was a copy/paste?

> 
> collapse_file() is only called by hpage_collapse_scan_file() ... and there we
> check "shmem_file(file)".
> 
> So why is the IS_ENABLED(CONFIG_SHMEM) check in there if collapse_file() seems
> to "collapse filemap/tmpfs/shmem pages into huge one".
> 
> Anyhow, we certainly can collapse shmem (that's how it all started IIUC).

Yes, thanks for pointing me at it. Should have just searched "shmem" in
khugepaged.c :-/

> 
> Besides that, khugepaged only seems to collapse !shmem with
>   VM_BUG_ON(!IS_ENABLED(CONFIG_READ_ONLY_THP_FOR_FS) && !is_shmem);

That makes sense. I guess I could use IS_ENABLED(CONFIG_READ_ONLY_THP_FOR_FS) to
tighen the (non-shmem) file THP check in hugepage_pmd_enabled() (currently I'm
unconditionally using the top-level enabled setting as a "is THP enabled for
files" check).

But back to my original question, I think hugepage_pmd_enabled() should also be
explicitly checking the appropriate shmem_enabled controls and ORing in the
result? Otherwise in a situation where only shmem is THP enabled (and file/anon
THP is disabled) khugepaged won't run.

> 
> The thp_vma_allowable_order() check tests if we are allowed to collapse a
> PMD_ORDER in that VMA.

I don't follow the relevance of this statement.





[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux