On Mon 26-09-16 18:20:25, Vlastimil Babka wrote:
> Fragmentation index and the vm.extfrag_threshold sysctl is meant as a heuristic
> to prevent excessive compaction for costly orders (i.e. THP). It's unlikely to
> make any difference for non-costly orders, especially with the default
> threshold. But we cannot afford any uncertainty for the non-costly orders where
> the only alternative to successful reclaim/compaction is OOM. After the recent
> patches we are guaranteed maximum effort without heuristics from compaction
> before deciding OOM, and fragindex is the last remaining heuristic. Therefore
> skip fragindex altogether for non-costly orders.

It would be nicer to reduce this just to the highest compaction priority,
but as your previous attempt shows, this adds a lot of code churn. Not
skipping the compaction for these !costly orders might lead to a higher
allocation latency due to pointless zone scanning, but considering that
the alternative would be order-0 reclaim, which doesn't guarantee any
larger blocks, then doing a more targeted approach sounds quite
reasonable to me.

This patch is not really needed to prevent premature OOMs because
compaction_zonelist_suitable doesn't rely on the fragmentation index
after the previous patch, but it makes sense to me regardless. The
fragindex was quite an obscure measure, and making the !costly orders
easier to understand is valuable imho.
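For illustration only, the effect of the change can be sketched as a small
userspace model. This is not the kernel code: model_compaction_suitable() is
a hypothetical helper, the fragindex is passed in rather than computed by
fragmentation_index(), the enum is cut down to the two values that matter
here, and the constants are assumed to match the usual defaults
(PAGE_ALLOC_COSTLY_ORDER of 3, extfrag threshold of 500).

```c
#include <assert.h>

/* Assumed defaults; in the kernel these come from mmzone.h and a sysctl. */
#define PAGE_ALLOC_COSTLY_ORDER 3
static int sysctl_extfrag_threshold = 500;

/* Reduced stand-in for the kernel's enum compact_result. */
enum compact_result {
	COMPACT_NOT_SUITABLE_ZONE,
	COMPACT_CONTINUE,
};

/*
 * Hypothetical model of the tail of compaction_suitable() after the patch.
 * 'ret' is the result of the earlier watermark checks and 'fragindex' is
 * what fragmentation_index(zone, order) would have returned.
 */
enum compact_result model_compaction_suitable(enum compact_result ret,
					      int order, int fragindex)
{
	assert(order >= 0);

	/* The fragindex heuristic now applies to costly orders only. */
	if (ret == COMPACT_CONTINUE && order > PAGE_ALLOC_COSTLY_ORDER) {
		/* Low index: failure would be due to lack of memory. */
		if (fragindex >= 0 && fragindex <= sysctl_extfrag_threshold)
			return COMPACT_NOT_SUITABLE_ZONE;
	}

	return ret;
}
```

With these assumptions, an order-2 request with a fragindex of 100 still
proceeds to compaction, while an order-9 request with the same index is
rejected as before the patch.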

> Suggested-by: Michal Hocko <mhocko@xxxxxxxx>
> Signed-off-by: Vlastimil Babka <vbabka@xxxxxxx>
> Cc: Michal Hocko <mhocko@xxxxxxxxxx>
> Cc: Mel Gorman <mgorman@xxxxxxxxxxxxxxxxxxx>
> Cc: Joonsoo Kim <iamjoonsoo.kim@xxxxxxx>
> Cc: David Rientjes <rientjes@xxxxxxxxxx>
> Cc: Rik van Riel <riel@xxxxxxxxxx>

Acked-by: Michal Hocko <mhocko@xxxxxxxx>

> ---
>  mm/compaction.c | 9 +++++++--
>  1 file changed, 7 insertions(+), 2 deletions(-)
>
> diff --git a/mm/compaction.c b/mm/compaction.c
> index 5ff7f801c345..badb92bf14b4 100644
> --- a/mm/compaction.c
> +++ b/mm/compaction.c
> @@ -1435,9 +1435,14 @@ enum compact_result compaction_suitable(struct zone *zone, int order,
>  	 * index towards 0 implies failure is due to lack of memory
>  	 * index towards 1000 implies failure is due to fragmentation
>  	 *
> -	 * Only compact if a failure would be due to fragmentation.
> +	 * Only compact if a failure would be due to fragmentation. Also
> +	 * ignore fragindex for non-costly orders where the alternative to
> +	 * a successful reclaim/compaction is OOM. Fragindex and the
> +	 * vm.extfrag_threshold sysctl is meant as a heuristic to prevent
> +	 * excessive compaction for costly orders, but it should not be at the
> +	 * expense of system stability.
>  	 */
> -	if (ret == COMPACT_CONTINUE) {
> +	if (ret == COMPACT_CONTINUE && (order > PAGE_ALLOC_COSTLY_ORDER)) {
>  		fragindex = fragmentation_index(zone, order);
>  		if (fragindex >= 0 && fragindex <= sysctl_extfrag_threshold)
>  			return COMPACT_NOT_SUITABLE_ZONE;
> --
> 2.10.0

--
Michal Hocko
SUSE Labs