On 06/08, Andrew Morton wrote: > > On Sun, 6 Jun 2010 15:34:03 -0700 (PDT) > David Rientjes <rientjes@xxxxxxxxxx> wrote: > > > [kosaki.motohiro@xxxxxxxxxxxxxx: use in badness(), __oom_kill_task()] > > Signed-off-by: Oleg Nesterov <oleg@xxxxxxxxxx> > > Signed-off-by: David Rientjes <rientjes@xxxxxxxxxx> > > I assume from the above that we should have a Signed-off-by:kosaki > here. I didn't make that change yet - please advise. Yes. The patch mixes 2 changes: find_lock_task_mm patch + "do not forget about the sub-thread's children". The changelog doesn't match the actual changes. > > @@ -115,12 +126,17 @@ unsigned long badness(struct task_struct *p, unsigned long uptime) > > * child is eating the vast majority of memory, adding only half > > * to the parents will make the child our kill candidate of choice. > > */ > > - list_for_each_entry(child, &p->children, sibling) { > > - task_lock(child); > > - if (child->mm != mm && child->mm) > > - points += child->mm->total_vm/2 + 1; > > - task_unlock(child); > > - } > > + t = p; > > + do { > > + list_for_each_entry(c, &t->children, sibling) { > > + child = find_lock_task_mm(c); > > + if (child) { > > + if (child->mm != p->mm) > > + points += child->mm->total_vm/2 + 1; > > What if 1000 children share the same mm? Doesn't this give a grossly > wrong result? Can't answer. Obviusly it is hard to explain what is the "right" result here. But otoh, without this change we can't account children. Kosaki sent this as a separate change. > > @@ -256,9 +272,6 @@ static struct task_struct *select_bad_process(unsigned long *ppoints, > > for_each_process(p) { > > unsigned long points; > > > > - /* skip tasks that have already released their mm */ > > - if (!p->mm) > > - continue; We shouldn't remove this without removing OR updating the PF_EXITING check below. That is why we had another patch. This change alone allows to trivially disable oom-kill. If we have a process with the dead leader, select_bad_process() will always return -1. We either need another patch from Kosaki's series - if (p->flags & PF_EXITING) + if (p->flags & PF_EXITING && p->mm) or remove this check (David objects). Oleg. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxxx For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>