On Thu 14-03-19 12:56:43, Takashi Iwai wrote: > On Thu, 14 Mar 2019 12:36:26 +0100, > Michal Hocko wrote: > > > > On Thu 14-03-19 11:30:03, Vlastimil Babka wrote: > > > On 3/14/19 11:15 AM, Michal Hocko wrote: > > > > On Thu 14-03-19 10:42:49, Vlastimil Babka wrote: > > > >> alloc_pages_exact*() allocates a page of sufficient order and then splits it > > > >> to return only the number of pages requested. That makes it incompatible with > > > >> __GFP_COMP, because compound pages cannot be split. > > > >> > > > >> As shown by [1] things may silently work until the requested size (possibly > > > >> depending on user) stops being power of two. Then for CONFIG_DEBUG_VM, BUG_ON() > > > >> triggers in split_page(). Without CONFIG_DEBUG_VM, consequences are unclear. > > > >> > > > >> There are several options here, none of them great: > > > >> > > > >> 1) Don't do the spliting when __GFP_COMP is passed, and return the whole > > > >> compound page. However if caller then returns it via free_pages_exact(), > > > >> that will be unexpected and the freeing actions there will be wrong. > > > >> > > > >> 2) Warn and remove __GFP_COMP from the flags. But the caller wanted it, so > > > >> things may break later somewhere. > > > >> > > > >> 3) Warn and return NULL. However NULL may be unexpected, especially for > > > >> small sizes. > > > >> > > > >> This patch picks option 3, as it's best defined. > > > > > > > > The question is whether callers of alloc_pages_exact do have any > > > > fallback because if they don't then this is forcing an always fail path > > > > and I strongly suspect this is not really what users want. I would > > > > rather go with 2) because "callers wanted it" is much less probable than > > > > "caller is simply confused and more gfp flags is surely better than > > > > fewer". > > > > > > I initially went with 2 as well, as you can see from v1 :) but then I looked at > > > the commit [2] mentioned in [1] and I think ALSA legitimaly uses __GFP_COMP so > > > that the pages are then mapped to userspace. Breaking that didn't seem good. > > > > It used the flag legitimately before because they were allocating > > compound pages but now they don't so this is just a conversion bug. > > We still use __GFP_COMP for allocation of the sound buffers that are > also mmapped to user-space. The mentioned commit above [2] was > reverted later. Yes, I understand that part. __GFP_COMP makes sense on a comound page. But if you are using alloc_pages_exact then the flag doesn't make sense because split out should already do what you want. Unless I am missing something. > But honestly speaking, I'm not sure whether we still need the compound > pages. The change was introduced long time ago (commit f3d48f0373c1 > in 2005). Is it superfluous nowadays...? AFAIU alloc_pages_exact should do do what you need. > > Why should we screw up the helper for that reason? Or put in other words > > why a silent fix up adds any risk? > > IMO, it's good to catch the incompatible usage as early as possible, > so that others won't hit the same failure again like I did. There > aren't so many users of __GFP_COMP in the whole tree, after all. Yes, completely agreed and warning with a fixup sounds like the safest option to me. Returning NULL is risky because it essentially introduces a permanent failure mode as already pointed out. -- Michal Hocko SUSE Labs