There are three significant concerns about the cgroup aware oom killer as it is implemented in -mm: (1) allows users to evade the oom killer by creating subcontainers or using other controllers since scoring is done per cgroup and not hierarchically, (2) unfairly compares the root mem cgroup using completely different criteria than leaf mem cgroups and allows wildly inaccurate results if oom_score_adj is used, and (3) does not allow the user to influence the decisionmaking, such that important subtrees cannot be preferred or biased. This patchset fixes (1) and (2) completely and, by doing so, introduces a completely extensible user interface that can be expanded in the future. Concern (3) could subsequently be addressed either before or after the cgroup-aware oom killer feature is merged. It preserves all functionality that currently exists in -mm and extends it to be generally useful outside of very specialized usecases. It eliminates the mount option for the cgroup aware oom killer entirely since it is now enabled through the root mem cgroup's oom policy. --- v3: - Rebased to next-20180713 v2: - Rebased to next-20180322 - Fixed get_nr_swap_pages() build bug found by kbuild test robot Documentation/admin-guide/cgroup-v2.rst | 100 ++++++----- include/linux/cgroup-defs.h | 5 - include/linux/memcontrol.h | 21 +++ kernel/cgroup/cgroup.c | 13 +- mm/memcontrol.c | 221 ++++++++++++++---------- 5 files changed, 204 insertions(+), 156 deletions(-)