On Tue, 28 Sep 2010, Andrew Morton wrote: > > Order-0 allocations, including quicklist_alloc(), are always under > > PAGE_ALLOC_COSTLY_ORDER, so they loop endlessly in the page allocator > > already without the need for __GFP_REPEAT. > > That's only true for the current implementation of the page allocator. > Yes, but in this case it's irrelevant since we're talking about order-0 allocations. The page allocator will never be changed so that order-0 allocations immediately fail if there's no available memory, otherwise we'd only use direct reclaim and the oom killer for high-order allocs or add __GFP_NOFAIL everywhere and that's quite pointless. > If we were to change the page allocator behaviour to not do that (and > we change it daily!) then all those callsites which wanted __GFP_REPEAT > behaviour will get broken. So someone would need to go back and work > out how to unbreak them, if we remembered. > > Plus there's presumably some documentary benefit in leaving the > __GFP_REPEATs in there. > The documentation is one of the problems here, __GFP_REPEAT isn't sanely defined by any of it and it leaves the user guessing as to its behavior unless you peruse the implementation. Intiution would suggest that __GFP_REPEAT would repeat the allocation attempt once it failed. After all, we have __GFP_NOFAIL to try indefinitely. That's not what it does, however. include/linux/gfp.h: * __GFP_REPEAT: Try hard to allocate the memory, but the allocation attempt * _might_ fail. This depends upon the particular VM implementation. Try hard in what way? Sure, it depends on the implementation but does that mean we only reclaim if we have __GFP_REPEAT? This definition also does allow us to change its meaning, so saying it has a specific importance for order-0 allocations in arch code isn't really that compelling. include/linux/slab.h: * %__GFP_REPEAT - If allocation fails initially, try once more before failing. Nope, that's not what it does either. (And, if it did, why would that possibly be helpful unless we know there's something being freed?) The reality is that __GFP_REPEAT continues the allocation until we've reclaimed at least the number of pages we're looking for. For order-0 allocations, it would only repeat if we failed to reclaim any pages. But, if that's the case, the oom killer would have killed something and we implicitly loop anyway in that situation (otherwise, we would have needlessly killed a task!) without even looking at the retry logic. So we can definitely remove __GFP_REPEAT for any order-0 allocation and it's based on its implementation -- poorly defined as it may be -- and the inherit design of any sane page allocator that retries such an allocation if it's going to use reclaim in the first place. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxxx For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>