On 11/22/2012 10:53 AM, Fengguang Wu wrote:
Ah it's more likely caused by this logic: if (is_active_lru(lru)) { if (inactive_list_is_low(mz, file)) shrink_active_list(nr_to_scan, mz, sc, priority, file); The active file list won't be scanned at all if it's smaller than the active list. In this case, it's inactive=33586MB > active=25719MB. So the data-1 pages in the active list will never be scanned and reclaimed.
That's it, indeed. The reason we have that code is that otherwise one large streaming IO could easily end up evicting the entire page cache working set. Usually it works well, because the new page cache working set tends to get touched twice while on the inactive list, and the old working set gets demoted from the active list. Only in a few very specific cases, where the inter-reference distance of the new working set is larger than the size of the inactive list, does it fail. Something like Johannes's patches should solve the problem. -- All rights reversed -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>