On Tue, 16 Dec 2014, Michal Hocko wrote: > > We may want to use memcg to limit the total memory > > footprint of all the processes within the one group. > > This may lead to a situation where any arbitrary > > process cannot get migrated to that one memcg > > because its limits will be breached. Or, process can > > get migrated but even being most recently used > > process, it can get killed by in-cgroup OOM. To > > avoid such scenarios, provide a convenient knob > > by which we can forcefully trigger OOM and make > > a room for upcoming process. > > > > To trigger force OOM, > > $ echo 1 > /<memcg_path>/memory.force_oom > > What would prevent another task deplete that memory shortly after you > triggered OOM and end up in the same situation? E.g. while the moving > task is migrating its charges to the new group... > > Why cannot you simply disable OOM killer in that memcg and handle it > from userspace properly? > The patch is introducing a mechanism to induce a kernel oom kill for a memcg hierarchy to make room for it in the new memcg, not disable the oom killer so the migration fails due to the lower limits. It doesn't have any basis since a SIGKILL coming from userspace should be considered the same as a kernel oom kill from the memcg perspective, i.e. the fatal_signal_pending() checks that allow charge bypass instead of a strict reliance on TIF_MEMDIE being set. It seems to be proposed as a shortcut so that the kernel will determine the best process to kill. That information is available to userspace so it should be able to just SIGKILL the desired process (either in the destination memcg or in the source memcg to allow deletion), so this functionality isn't needed in the kernel. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>