On Wed, Mar 22, 2017 at 01:41:17PM +0900, Minchan Kim wrote: > Hi Tim, > > On Tue, Mar 21, 2017 at 10:18:26AM -0700, Tim Murray wrote: > > On Sun, Mar 19, 2017 at 10:59 PM, Minchan Kim <minchan@xxxxxxxxxx> wrote: > > > However, I'm not sure your approach is good. It seems your approach just > > > reclaims pages from groups (DEF_PRIORITY - memcg->priority) >= sc->priority. > > > IOW, it is based on *temporal* memory pressure fluctuation sc->priority. > > > > > > Rather than it, I guess pages to be reclaimed should be distributed by > > > memcg->priority. Namely, if global memory pressure happens and VM want to > > > reclaim 100 pages, VM should reclaim 90 pages from memcg-A(priority-10) > > > and 10 pages from memcg-B(prioirty-90). > > > > This is what I debated most while writing this patch. If I'm > > understanding your concern correctly, I think I'm doing more than > > skipping high-priority cgroups: > > Yes, that is my concern. It could give too much pressure lower-priority > group. You already reduced scanning window for high-priority group so > I guess it would be enough for working. > > The rationale from my thining is high-priority group can have cold pages( > for instance, used-once pages, madvise_free pages and so on) so, VM should > age every groups to reclaim cold pages but we can reduce scanning window > for high-priority group to keep more workingset as you did. By that, we > already give more pressure to lower priority group than high-prioirty group. > > > > > - If the scan isn't high priority yet, then skip high-priority cgroups. > > This part is the one I think it's too much ;-) > I think no need to skip but just reduce scanning window by the group's > prioirty. > > > - When the scan is high priority, scan fewer pages from > > higher-priority cgroups (using the priority to modify the shift in > > get_scan_count). > > That sounds lkie a good idea but need to tune more. > > How about this? > > get_scan_count for memcg-A: > .. > size = lruvec_lru_size(lruvec, lru, sc->reclaim_idx) * > (memcg-A / sum(memcg all priorities)) > > get_scan_count for memcg-B: > .. > size = lruvec_lru_size(lruvec, lru, sc->reclaim_idx) * > (memcg-B / sum(memcg all priorities)) > Huh, correction. size = lruvec_lru_size(lruvec, lru, sc->reclaim_idx); scan = size >> sc->priority; scan = scan * (sum(memcg) - memcg A) / sum(memcg); -- To unsubscribe from this list: send the line "unsubscribe cgroups" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html