On 07/20/2016 09:24 AM, Michal Hocko wrote:
On Tue 19-07-16 22:22:16, zhongjiang wrote:
From: zhong jiang <zhongjiang@xxxxxxxxxx>
when pfn_valid(pfn) return false, pfn should be align with
pageblock_nr_pages other than MAX_ORDER_NR_PAGES in
init_pages_in_zone, because the skipped 2M may be valid pfn,
as a result, early allocated count will not be accurate.
I really do not understand this changelog. I thought that
MAX_ORDER_NR_PAGES and pageblock_nr_pages are the same thing but they
might not be for HUGETLB.
The common situation on x86 is that pageblock is 512 pages (2MB) and
MAX_ORDER_NR_PAGES is 1024 (4MB).
Should init_pages_in_zone depend on something
like HUGETLB? Is this even correct I would have expected that we should
initialize in the page block steps so MAX_ORDER_NR_PAGES. Could you
clarify Joonsoo, please?
On !CONFIG_HOLES_IN_ZONE systems, pfn_valid() should give the same
outcome within MAX_ORDER_NR_PAGES blocks (modulo zone boundaries). So
the ALIGN using MAX_ORDER_NR_PAGES is correct for these systems. What's
somewhat weird is that the rest of the for loop uses pageblock_nr_pages,
but it doesn't affect the outcome.
On CONFIG_HOLES_IN_ZONE the situation is less clear. The hole can be
theoretically anywhere within MAX_ORDER_NR_PAGES, including the first
pfn. If it's the first pfn, init_pages_in_zone() will skip
MAX_ORDER_NR_PAGES. The patch helps if the hole is e.g. the first 2MB of
a 4MB pageblock... then the second 2MB will be picked up after this
patch. But it's still not thorough in all situations. Strictly speaking,
one these systems one would have to avoid the MAX_ORDER_NR_PAGES skip
completely, and just check each pfn one by one to be sure nothing is missed.
But that's potentially costly, so for example, __pageblock_pfn_to_page()
(that originated in compaction) assumes that the hole is in the middle,
and checks first and last pfn of pageblock. So it has a pageblock
granularity like this patch, but still is more restrictive.
I wish there was a better solution that would get used everywhere...
possibly making the CONFIG_HOLES_IN_ZONE configs also declare the
granularity of holes, so we don't need to check each pfn...
Signed-off-by: zhong jiang <zhongjiang@xxxxxxxxxx>
---
mm/page_owner.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/mm/page_owner.c b/mm/page_owner.c
index c6cda3e..aa2c486 100644
--- a/mm/page_owner.c
+++ b/mm/page_owner.c
@@ -310,7 +310,7 @@ static void init_pages_in_zone(pg_data_t *pgdat, struct zone *zone)
*/
for (; pfn < end_pfn; ) {
if (!pfn_valid(pfn)) {
- pfn = ALIGN(pfn + 1, MAX_ORDER_NR_PAGES);
+ pfn = ALIGN(pfn + 1, pageblock_nr_pages);
continue;
}
--
1.8.3.1
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>