Re: [PATCH 6/6] mm, oom: fortify task_will_free_mem

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed 01-06-16 00:29:45, Tetsuo Handa wrote:
> Michal Hocko wrote:
> > On Wed 01-06-16 00:03:53, Tetsuo Handa wrote:
[...]
> > > How is it guaranteed that task_will_free_mem() == false && oom_victims > 0
> > > shall not lock up the OOM killer?
> > 
> > But this patch is talking about task_will_free_mem == true. Is the
> > description confusing? Should I reword the changelog?
> 
> The situation I'm talking about is
> 
>   (1) out_of_memory() is called.
>   (2) select_bad_process() is called because task_will_free_mem(current) == false.
>   (3) oom_kill_process() is called because select_bad_process() chose a victim.
>   (4) oom_kill_process() sets TIF_MEMDIE on that victim.
>   (5) oom_kill_process() fails to call wake_oom_reaper() because that victim's
>       memory was shared by use_mm() or global init.
>   (6) other !TIF_MEMDIE threads sharing that victim's memory call out_of_memory().
>   (7) select_bad_process() is called because task_will_free_mem(current) == false.
>   (8) oom_scan_process_thread() returns OOM_SCAN_ABORT because it finds TIF_MEMDIE
>       set at (4).
>   (9) other !TIF_MEMDIE threads sharing that victim's memory fail to get TIF_MEMDIE.
>   (10) How other !TIF_MEMDIE threads sharing that victim's memory will release
>        that memory?
> 
> I'm fine with task_will_free_mem(current) == true case. My question is that
> "doesn't this patch break task_will_free_mem(current) == false case when there is
> already TIF_MEMDIE thread" ?

OK, I see your point now. This is certainly possible, albeit unlikely. I
think calling this a regression would be a bit an overstatement. We are
basically replacing one unreliable heuristic by another one which is
more likely to lead to a deterministic behavior.

If you are worried about locking up the oom killer I have another 2
patches on top of this series which should deal with that (one of them
was already posted [1] and another one was drafted in [2]. Both of them
on top of this series should remove the concern of the lockup. I just
wait to post them until this thread settles down.

[1] http://lkml.kernel.org/r/1464276476-25136-1-git-send-email-mhocko@xxxxxxxxxx
[2] http://lkml.kernel.org/r/20160527133502.GN27686@xxxxxxxxxxxxxx
-- 
Michal Hocko
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@xxxxxxxxx";> email@xxxxxxxxx </a>



[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [ECOS]     [Asterisk Internet PBX]     [Linux API]