On Mon, Dec 12, 2011 at 05:46:13PM +0100, Michal Nazarewicz wrote: > On Mon, 12 Dec 2011 17:30:52 +0100, Mel Gorman <mel@xxxxxxxxx> wrote: > > >On Mon, Dec 12, 2011 at 04:22:39PM +0100, Michal Nazarewicz wrote: > >>> <SNIP> > >>> > >>>>+ if (!pfn_valid_within(pfn)) > >>>>+ goto skip; > >>> > >>>The flow of this function in general with gotos of skipped and next > >>>is confusing in comparison to the existing function. For example, > >>>if this PFN is not valid, and no freelist is provided, then we call > >>>__free_page() on a PFN that is known to be invalid. > >>> > >>>>+ ++nr_scanned; > >>>>+ > >>>>+ if (!PageBuddy(page)) { > >>>>+skip: > >>>>+ if (freelist) > >>>>+ goto next; > >>>>+ for (; start < pfn; ++start) > >>>>+ __free_page(pfn_to_page(pfn)); > >>>>+ return 0; > >>>>+ } > >>> > >>>So if a PFN is valid and !PageBuddy and no freelist is provided, we > >>>call __free_page() on it regardless of reference count. That does not > >>>sound safe. > >> > >>Sorry about that. It's a bug in the code which was caught later on. The > >>code should read ???__free_page(pfn_to_page(start))???. > > On Mon, 12 Dec 2011 17:30:52 +0100, Mel Gorman <mel@xxxxxxxxx> wrote: > >That will call free on valid PFNs but why is it safe to call > >__free_page() at all? You say later that CMA requires that all > >pages in the range be valid but if the pages are in use, that does > >not mean that calling __free_page() is safe. I suspect you have not > >seen a problem because the pages in the range were free as expected > >and not in use because of MIGRATE_ISOLATE. > > All pages from [start, pfn) have passed through the loop body which > means that they are valid and they have been removed from buddy (for > caller's use). Also, because of split_free_page(), all of those pages > have been split into 0-order pages. Ah, I see. Even though you are not putting the pages on a freelist, the function still returns with an elevated reference count and it's up to the caller to find them again. > Therefore, in error recovery, to > undo what the loop has done so far, we put give back to buddy by > calling __free_page() on each 0-order page. > Ok. > >>>> /* Found a free page, break it into order-0 pages */ > >>>> isolated = split_free_page(page); > >>>> total_isolated += isolated; > >>>>- for (i = 0; i < isolated; i++) { > >>>>- list_add(&page->lru, freelist); > >>>>- page++; > >>>>+ if (freelist) { > >>>>+ struct page *p = page; > >>>>+ for (i = isolated; i; --i, ++p) > >>>>+ list_add(&p->lru, freelist); > >>>> } > >>>> > >>>>- /* If a page was split, advance to the end of it */ > >>>>- if (isolated) { > >>>>- blockpfn += isolated - 1; > >>>>- cursor += isolated - 1; > >>>>- } > >>>>+next: > >>>>+ pfn += isolated; > >>>>+ page += isolated; > >>> > >>>The name isolated is now confusing because it can mean either > >>>pages isolated or pages scanned depending on context. Your patch > >>>appears to be doing a lot more than is necessary to convert > >>>isolate_freepages_block into isolate_freepages_range and at this point, > >>>it's unclear why you did that. > >> > >>When CMA uses this function, it requires all pages in the range to be valid > >>and free. (Both conditions should be met but you never know.) > > To be clear, I meant that the CMA expects pages to be in buddy when the function > is called but after the function finishes, all the pages in the range are removed > from buddy. This, among other things, is why the call to split_free_page() is > necessary. > Understood. -- Mel Gorman SUSE Labs -- To unsubscribe from this list: send the line "unsubscribe linux-media" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html