On Thu, Feb 15, 2018 at 04:46:44AM -0800, Matthew Wilcox wrote: > On Thu, Jan 25, 2018 at 03:21:44PM +0800, Aaron Lu wrote: > > When freeing a batch of pages from Per-CPU-Pages(PCP) back to buddy, > > the zone->lock is held and then pages are chosen from PCP's migratetype > > list. While there is actually no need to do this 'choose part' under > > lock since it's PCP pages, the only CPU that can touch them is us and > > irq is also disabled. > > I have no objection to this patch. If you're looking for ideas for > future improvement though, I wonder whether using a LIST_HEAD is the > best way to store these pages temporarily. If you batch them into a > pagevec and then free the entire pagevec, the CPU should be a little > faster scanning a short array than walking a linked list. > > It would also puts a hard boundary on how long zone->lock is held, as > you'd drop it and go back for another batch after 15 pages. That might > be bad, of course. > It's not a guaranteed win. You're trading a list traversal for increased stack usage of 128 bytes (unless you stick a static one in the pgdat and incur a cache miss penalty instead or a per-cpu pagevec and increase memory consumption) and 2 spin lock acquire/releases in the common case which may or may not be contended. It might make more sense if the PCP's themselves where statically sized but that would limit tuning options and increase the per-cpu footprint of the pcp structures. Maybe I'm missing something obvious and it really would be a universal win but right now I find it hard to imagine that it's a win. > Another minor change I'd like to see is free_pcpages_bulk updating > pcp->count itself; all of the callers do it currently. > That should be reasonable, it's not even particularly difficult. -- Mel Gorman SUSE Labs -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>