On Thu 26-07-18 08:41:35, Bruce Merry wrote: > On 26 July 2018 at 02:55, Singh, Balbir <bsingharora@xxxxxxxxx> wrote: > > Do you by any chance have use_hierarch=1? memcg_stat_show should just rely on counters inside the memory cgroup and the the LRU sizes for each node. > > Yes, /sys/fs/cgroup/memory/memory.use_hierarchy is 1. I assume systemd > is doing that. And this is actually good. Non hierarchical behavior is discouraged. The real problem is that we are keeping way too many zombie memcgs around and waiting for memory pressure to reclaim them and so they go away on their own. As I've tried to explain in other email force_empty before removing the memcg should help. Fixing this properly would require quite some heavy lifting AFAICS. We would basically have to move zombies out of the way which is not hard but we do not want to hide their current memory consumption so we would have to somehow move their stats to the parent. And then we are back to reparenting which has been removed by b2052564e66d ("mm: memcontrol: continue cache reclaim from offlined groups"). -- Michal Hocko SUSE Labs