On Thu, Jun 02, 2016 at 01:29:16PM +0900, Minchan Kim wrote: > On Thu, Jun 02, 2016 at 11:22:43AM +0900, Joonsoo Kim wrote: > > On Thu, Jun 02, 2016 at 09:19:19AM +0800, Wang Sheng-Hui wrote: > > > > > > > > > On 6/1/2016 3:40 PM, Minchan Kim wrote: > > > > On Wed, Jun 01, 2016 at 11:42:29AM +0800, Wang Sheng-Hui wrote: > > > >> Dear, > > > >> > > > >> Sorry to trouble you. > > > >> > > > >> I noticed cma_alloc would turn to __alloc_contig_migrate_range for allocating pages. > > > >> But __alloc_contig_migrate_range calls migrate_prep() at first, even if the requested page > > > >> is single and free, lru_add_drain_all still run (called by migrate_prep())? > > > >> > > > >> Image a large chunk of free contig pages for CMA, various drivers may request a single page from > > > >> the CMA area, we'll get lru_add_drain_all run for each page. > > > >> > > > >> Should we detect if the required pages are free before migrate_prep(), or detect at least for single > > > >> page allocation? > > > > That makes sense to me. > > > > > > > > How about calling migrate_prep once migrate_pages fails in the first trial? > > > > > > Minchan, > > > > > > I tried your patch in my env, and the number of calling migrate_prep() dropped a lot. > > > > > > In my case, CMA reserved 512MB, and the linux will call migrate_prep() 40~ times during bootup, > > > most are single page allocation request to CMA. > > > With your patch, migrate_prep() is not called for the single pages allocation requests as the free > > > pages in CMA area is enough. > > > > > > Will you please push the patch to upstream? > > > > It is not correct. > > > > migrate_prep() is called to move lru pages in lruvec to LRU. In > > isolate_migratepages_range(), non LRU pages are just skipped so if > > page is on the lruvec it will not be isolated and error isn't returned. > > So, "if (ret) migrate_prep()" will not be called and we can't catch > > the page in lruvec. > > Ah,, true. Thanks for correcting. > > Simple fix is to remove migrate_prep in there and retry if test_pages_isolated > found migration is failed at least once. Hmm...much better than before. But, it makes me wonder what his painpoint is. He want to remove migrate_prep() which calls lru_add_drain_all() needlessly. But, we already have one in alloc_contig_range(). So, he will not be happy entirely with following change. Moreover, lru_add_drain_all() is there without any validation. It is there since we need to gather migrated pages in lruvec but I think that it is sufficient to call lru_add_drain_cpu() and drain_local_pages(), respectively. > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c > index 7da8310b86e9..e0aa4a9b573d 100644 > --- a/mm/page_alloc.c > +++ b/mm/page_alloc.c > @@ -7294,8 +7294,6 @@ static int __alloc_contig_migrate_range(struct compact_control *cc, > unsigned int tries = 0; > int ret = 0; > > - migrate_prep(); > - > while (pfn < end || !list_empty(&cc->migratepages)) { > if (fatal_signal_pending(current)) { > ret = -EINTR; > @@ -7355,6 +7353,7 @@ int alloc_contig_range(unsigned long start, unsigned long end, > unsigned long outer_start, outer_end; > unsigned int order; > int ret = 0; > + bool lru_flushed = false; > > struct compact_control cc = { > .nr_migratepages = 0, > @@ -7395,6 +7394,7 @@ int alloc_contig_range(unsigned long start, unsigned long end, > if (ret) > return ret; > > +again: > /* > * In case of -EBUSY, we'd like to know which page causes problem. > * So, just fall through. We will check it in test_pages_isolated(). > @@ -7448,6 +7448,11 @@ int alloc_contig_range(unsigned long start, unsigned long end, > > /* Make sure the range is really isolated. */ > if (test_pages_isolated(outer_start, end, false)) { > + if (!lru_flushed) { > + lru_flushed = true; > + goto again; > + } > + > pr_info("%s: [%lx, %lx) PFNs busy\n", > __func__, outer_start, end); > ret = -EBUSY; > > > > > > Anyway, better optimization for your case should be done in higher > > level. See following patch. It removes useless pageblock isolation and migration > > if possible. In fact, even we can do better than below by inroducing > > alloc_contig_range() light mode that skip migrate_prep() and other high cost things > > but it needs more surgery. I will revisit it soon. > > Yes, there are many rooms to be improved in cma_alloc and I remember > a few years ago, some guys(maybe, graphic) complained cma_alloc for small order page > is really slow so it would be really worth to do. > > I'm looking forward to seeing that. Okay. Will be back soon. Thanks. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>