On Wed 29-01-20 11:05:11, Yang Shi wrote: > > > On 1/29/20 10:58 AM, Michal Hocko wrote: > > On Thu 30-01-20 02:30:32, Yang Shi wrote: > > > Since commit a49bd4d71637 ("mm, numa: rework do_pages_move"), > > > the semantic of move_pages() has changed to return the number of > > > non-migrated pages if they were result of a non-fatal reasons (usually a > > > busy page). This was an unintentional change that hasn't been noticed > > > except for LTP tests which checked for the documented behavior. > > > > > > There are two ways to go around this change. We can even get back to the > > > original behavior and return -EAGAIN whenever migrate_pages is not able > > > to migrate pages due to non-fatal reasons. Another option would be to > > > simply continue with the changed semantic and extend move_pages > > > documentation to clarify that -errno is returned on an invalid input or > > > when migration simply cannot succeed (e.g. -ENOMEM, -EBUSY) or the > > > number of pages that couldn't have been migrated due to ephemeral > > > reasons (e.g. page is pinned or locked for other reasons). > > > > > > We decided to keep the second option in kernel because this behavior is in > > > place for some time without anybody complaining and possibly new users > > > depending on it. Also it allows to have a slightly easier error handling > > > as the caller knows that it is worth to retry when err > 0. > > > > > > Update man pages to reflect the new semantic. > > Thanks a lot! Looks good to me. I would just add a note that it is > > generally recommended to pre-initialize status array by -1 to have a > > reliable way to check which pages have been migrated. The man page says > > "The array contains valid values only if move_pages() did not return an > > error." > > > > I would just add. "Pre initialization of the array to -1 or similar > > value which cannot represent a real numa node could help to identify > > pages that have been migrated". > > Sure. Adding this note right after the explanation of positive value error > seems fine? I would just follow up on the existing note about valid values. -- Michal Hocko SUSE Labs