Recently, many userland daemon prefer to use libcap-ng and drop all privilege just after startup. Because of (1) Almost privilege are necessary only when special file open, and aren't necessary read and write. (2) In general, privilege dropping brings better protection from exploit when bugs are found in the daemon. But, it makes suboptimal oom-killer behavior. CAI Qian reported oom killer killed some important daemon at first on his fedora like distro. Because they've lost CAP_SYS_ADMIN. Of course, we recommend to drop privileges as far as possible instead of keeping them. Thus, oom killer don't have to check any capability. It implicitly suggest wrong programming style. This patch change root process check way from CAP_SYS_ADMIN to just euid==0. Reported-by: CAI Qian <caiqian@xxxxxxxxxx> Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx> --- mm/oom_kill.c | 9 +++++---- 1 files changed, 5 insertions(+), 4 deletions(-) diff --git a/mm/oom_kill.c b/mm/oom_kill.c index e4b0991..f552e39 100644 --- a/mm/oom_kill.c +++ b/mm/oom_kill.c @@ -203,7 +203,7 @@ unsigned int oom_badness(struct task_struct *p, struct mem_cgroup *mem, * Root processes get 3% bonus, just like the __vm_enough_memory() * implementation used by LSMs. */ - if (has_capability_noaudit(p, CAP_SYS_ADMIN)) + if (task_euid(p) == 0) points -= 30; /* @@ -373,7 +373,7 @@ static void dump_tasks(const struct mem_cgroup *mem, const nodemask_t *nodemask) struct task_struct *p; struct task_struct *task; - pr_info("[ pid ] uid tgid total_vm rss cpu oom_adj oom_score_adj name\n"); + pr_info("[ pid ] uid euid tgid total_vm rss cpu oom_adj oom_score_adj name\n"); for_each_process(p) { if (oom_unkillable_task(p, mem, nodemask)) continue; @@ -388,8 +388,9 @@ static void dump_tasks(const struct mem_cgroup *mem, const nodemask_t *nodemask) continue; } - pr_info("[%5d] %5d %5d %8lu %8lu %3u %3d %5d %s\n", - task->pid, task_uid(task), task->tgid, + pr_info("[%5d] %5d %5d %5d %8lu %8lu %3u %3d %5d %s\n", + task->pid, task_uid(task), task_euid(task), + task->tgid, task->mm->total_vm, get_mm_rss(task->mm), task_cpu(task), task->signal->oom_adj, task->signal->oom_score_adj, task->comm); -- 1.7.3.1 -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/ Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>