Hello, On Wed, Apr 28, 2021 at 04:37:46PM +0200, Christian Brauner wrote: > > I'd align it with cgroup.procs. Killing is a process-level operation (unlike > > arbitrary signal delivery which I think is another reason to confine this to > > killing) and threaded cgroups should be invisible to process-level > > operations. > > Ok, so we make write to cgroup.kill in threaded cgroups EOPNOTSUPP which > is equivalent what a read on cgroup.procs would yield. Sounds good to me. > Tejun, Roman, Michal, I've been thinking a bit about the escaping > children during fork() when killing a cgroup and I would like to propose > we simply take the write-side of threadgroup_rwsem during cgroup.kill. > > This would give us robust protection against escaping children during > fork() since fork()ing takes the read-side already in cgroup_can_fork(). > And cgroup.kill should be sufficiently rare that this isn't an > additional burden. > > Other ways seems more fragile where the child can potentially escape > being killed. The most obvious case is when CLONE_INTO_CGROUP is not > used. If a cgroup.kill is initiated after cgroup_can_fork() and after > the parent's fatal_signal_pending() check we will wait for the parent to > release the siglock in cgroup_kill(). Once it does we deliver the fatal > signal to the parent. But if we haven't passed cgroup_post_fork() fast > enough the child can be placed into that cgroup right after the kill. > That's not super bad per se since the child isn't technically visible in > the target cgroup anyway but it feels a bit cleaner if it would die > right away. We could minimize the window by raising a flag say CGRP_KILL > say: So, yeah, I wouldn't worry about the case where migration is competing against killing. The order of operations simply isn't defined and any outcome is fine. As for the specific synchronization method to use, my gut feeling is whatever which aligns better with the freezer implementation but I don't have strong feelings otherwise. Roman, what do you think? Thanks. -- tejun