On Tue, Dec 14, 2010 at 02:33:06PM -0800, Andrew Morton wrote: > On Fri, 10 Dec 2010 15:46:20 +0000 > Mel Gorman <mel@xxxxxxxxx> wrote: > > > When the allocator enters its slow path, kswapd is woken up to balance the > > node. It continues working until all zones within the node are balanced. For > > order-0 allocations, this makes perfect sense but for higher orders it can > > have unintended side-effects. If the zone sizes are imbalanced, kswapd may > > reclaim heavily within a smaller zone discarding an excessive number of > > pages. > > Why was it doing this? > Partially because of lumpy reclaim but mostly because it simply stays awake. If the zone is unbalanced, kswapd will reclaim in there, shrinking slabs, rotating lists etc. even if ultimately it cannot balance that zone. > > The user-visible behaviour is that kswapd is awake and reclaiming > > even though plenty of pages are free from a suitable zone. > > Suitable for what? I assume you refer to a future allocation which can > be satisfied from more than one of the zones? > Yes. > But what if that allocation wanted to allocate a high-order page from > a zone which we just abandoned? > classzone_idx is taken into account by the series overall and it doesn't count zones above the classzone_idx. > > This patch alters the "balance" logic for high-order reclaim allowing kswapd > > to stop if any suitable zone becomes balanced to reduce the number of pages > > again, suitable for what? > Suitable for a future allocation of the same type that woke kswapd. > > it reclaims from other zones. kswapd still tries to ensure that order-0 > > watermarks for all zones are met before sleeping. > > Handling order-0 pages differently from higher-order pages sounds weird > and wrong. > > I don't think I understand this patch. > The objective is that kswapd will go to sleep again. It has been found when there is a constant source of high-order allocations that kswapd stays awake constantly trying to reclaim even though a suitable zone had free pages. -- Mel Gorman Part-time Phd Student Linux Technology Center University of Limerick IBM Dublin Software Lab -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxxx For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/ Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>