On Mon, Jun 26, 2023 at 8:34 PM Yu Zhao <yuzhao@xxxxxxxxxx> wrote:
On Mon, Jun 26, 2023 at 11:14 AM Ryan Roberts <ryan.roberts@xxxxxxx> wrote:
Opportunistically attempt to allocate high-order folios in highmem,
optionally zeroed. Retry with lower orders all the way to order-0, until
success. Although, of note, order-1 allocations are skipped since a
large folio must be at least order-2 to work with the THP machinery. The
user must check what they got with folio_order().
This will be used to oportunistically allocate large folios for
anonymous memory with a sensible fallback under memory pressure.
For attempts to allocate non-0 orders, we set __GFP_NORETRY to prevent
high latency due to reclaim, instead preferring to just try for a lower
order. The same approach is used by the readahead code when allocating
large folios.
Signed-off-by: Ryan Roberts <ryan.roberts@xxxxxxx>
---
mm/memory.c | 33 +++++++++++++++++++++++++++++++++
1 file changed, 33 insertions(+)
diff --git a/mm/memory.c b/mm/memory.c
index 367bbbb29d91..53896d46e686 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -3001,6 +3001,39 @@ static vm_fault_t fault_dirty_shared_page(struct vm_fault *vmf)
return 0;
}
+static inline struct folio *vma_alloc_movable_folio(struct vm_area_struct *vma,
+ unsigned long vaddr, int order, bool zeroed)
+{
+ gfp_t gfp = order > 0 ? __GFP_NORETRY | __GFP_NOWARN : 0;
+
+ if (zeroed)
+ return vma_alloc_zeroed_movable_folio(vma, vaddr, gfp, order);
+ else
+ return vma_alloc_folio(GFP_HIGHUSER_MOVABLE | gfp, order, vma,
+ vaddr, false);
+}
+
+/*
+ * Opportunistically attempt to allocate high-order folios, retrying with lower
+ * orders all the way to order-0, until success. order-1 allocations are skipped
+ * since a folio must be at least order-2 to work with the THP machinery. The
+ * user must check what they got with folio_order(). vaddr can be any virtual
+ * address that will be mapped by the allocated folio.
+ */
+static struct folio *try_vma_alloc_movable_folio(struct vm_area_struct *vma,
+ unsigned long vaddr, int order, bool zeroed)
+{
+ struct folio *folio;
+
+ for (; order > 1; order--) {
+ folio = vma_alloc_movable_folio(vma, vaddr, order, zeroed);
+ if (folio)
+ return folio;
+ }
+
+ return vma_alloc_movable_folio(vma, vaddr, 0, zeroed);
+}
I'd drop this patch. Instead, in do_anonymous_page():
if (IS_ENABLED(CONFIG_ARCH_WANTS_PTE_ORDER))
folio = vma_alloc_zeroed_movable_folio(vma, addr,
CONFIG_ARCH_WANTS_PTE_ORDER))
if (!folio)
folio = vma_alloc_zeroed_movable_folio(vma, addr, 0);
I meant a runtime function arch_wants_pte_order() (Its default
implementation would return 0.)