On 11/04/2014 01:28 AM, Joonsoo Kim wrote: > On Fri, Oct 31, 2014 at 04:53:44PM +0100, Vlastimil Babka wrote: >> On 10/28/2014 08:08 AM, Joonsoo Kim wrote: >> >> OK, so you don't find a problem with how this patch changes >> migration scanner caching, just the free scanner, right? >> So how about making release_freepages() return the highest freepage >> pfn it encountered (could perhaps do without comparing individual >> pfn's, the list should be ordered so it could be just the pfn of >> first or last page in the list, but need to check that) and updating >> cached free pfn with that? That should ensure rescanning only when >> needed. > > Hello, > > Updating cached free pfn in release_freepages() looks good to me. > > In fact, I guess that migration scanner also has similar problems, but, > it's just my guess. I admit your following arguments in patch description. > > However, the downside is that potentially many pages are rescanned without > successful isolation. At worst, there might be a page where isolation from LRU > succeeds but migration fails (potentially always). > > So, I'm okay if you update cached free pfn in release_freepages(). Hi, here's the patch-fix to update cached free pfn based on release_freepages(). No significant difference in my testing. ------8<------ From: Vlastimil Babka <vbabka@xxxxxxx> Date: Wed, 12 Nov 2014 16:37:21 +0100 Subject: [PATCH] mm-compaction-always-update-cached-scanner-positions-fix This patch-fix addresses Joonsoo Kim's concerns about free pages potentially being skipped when they are isolated and then returned due to migration failure. It does so by setting the cached scanner pfn to the pageblock where where the free page with the highest pfn of all returned free pages resides. A small downside is that release_freepages() no longer returns the number of freed pages, which has been used in a VM_BUG_ON check. I don't think the check was important enough to warrant a more complex solution. Signed-off-by: Vlastimil Babka <vbabka@xxxxxxx> Cc: Minchan Kim <minchan@xxxxxxxxxx> Cc: Mel Gorman <mgorman@xxxxxxx> Cc: Joonsoo Kim <iamjoonsoo.kim@xxxxxxx> Cc: Michal Nazarewicz <mina86@xxxxxxxxxx> Cc: Naoya Horiguchi <n-horiguchi@xxxxxxxxxxxxx> Cc: Christoph Lameter <cl@xxxxxxxxx> Cc: Rik van Riel <riel@xxxxxxxxxx> Cc: David Rientjes <rientjes@xxxxxxxxxx> --- When squashed, the following paragraph should be appended to the fixed patch's changelog: To prevent free scanner from leaving free pages behind after they are returned due to page migration failure, the cached scanner pfn is changed to point to the pageblock of the returned free page with the highest pfn, before leaving compact_zone(). mm/compaction.c | 29 +++++++++++++++++++++++------ 1 file changed, 23 insertions(+), 6 deletions(-) diff --git a/mm/compaction.c b/mm/compaction.c index e0befc3..3669f92 100644 --- a/mm/compaction.c +++ b/mm/compaction.c @@ -41,15 +41,17 @@ static inline void count_compact_events(enum vm_event_item item, long delta) static unsigned long release_freepages(struct list_head *freelist) { struct page *page, *next; - unsigned long count = 0; + unsigned long high_pfn = 0; list_for_each_entry_safe(page, next, freelist, lru) { + unsigned long pfn = page_to_pfn(page); list_del(&page->lru); __free_page(page); - count++; + if (pfn > high_pfn) + high_pfn = pfn; } - return count; + return high_pfn; } static void map_pages(struct list_head *list) @@ -1223,9 +1225,24 @@ static int compact_zone(struct zone *zone, struct compact_control *cc) } out: - /* Release free pages and check accounting */ - cc->nr_freepages -= release_freepages(&cc->freepages); - VM_BUG_ON(cc->nr_freepages != 0); + /* + * Release free pages and update where the free scanner should restart, + * so we don't leave any returned pages behind in the next attempt. + */ + if (cc->nr_freepages > 0) { + unsigned long free_pfn = release_freepages(&cc->freepages); + cc->nr_freepages = 0; + + VM_BUG_ON(free_pfn == 0); + /* The cached pfn is always the first in a pageblock */ + free_pfn &= ~(pageblock_nr_pages-1); + /* + * Only go back, not forward. The cached pfn might have been + * already reset to zone end in compact_finished() + */ + if (free_pfn > zone->compact_cached_free_pfn) + zone->compact_cached_free_pfn = free_pfn; + } trace_mm_compaction_end(ret); -- 2.1.2 -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>