process, so I would argue that they aren't "stealing" anything. While a
higher level process might not know where precisely in the hierarchy the
process is, they'll know it that it must be a sub-cgroup of the one they
were put in (meaning the parent can still impose restrictions without any
issue).
Hmmm... it's not just about the ownership of the process itself. If
it had been, we wouldn't have bothered with permission model on cgroup
hierarchy itself. It's also about who is allowed to modify a given
cgroup and what you're proposing violates that.
I feel like the permission model makes sense in certain cases (the
common ancestor restriction, as well as the ability for a parent to
apply limits to children by setting its own limits). Neither of those
are violated (if you read the commit that introduced the common ancestor
restriction).
Maybe if you give me a usecase of when it might be important that a
process must not be able to move to a sub-cgroup of its current one, I
might be able to understand your concerns? From my perspective, I think
that's actually quite useful.
If you want, we can make it so that an unprivileged user migrating processes
to a child cgroup only works if you're in the same cgroup namespace (and
have CAP_SYS_ADMIN in the pinned user namespace, etc). The current setup
would obviously still work, but you'd add a permission for users that just
want to be able to limit their own processes. IIRC we need to update
cgroup_procs_write_permission() anyway. By having the cgroup namespace
requirement, you'd definitely have to "own" the process in every sense of
the word I can imagine.
Maybe I'm misunderstanding but I can't see how that would change the
situation in a significant way.
Well, it would avoid the issue of a process being moved against *its*
will. The process would have to be complicit in joining (or unsharing) a
cgroup namespace. I'm not sure I really agree with the argument that a
higher level process should be able to stop a process from imposing more
*stringent* limits on itself if the process is complicit in setting
those limits (see above).
The reason I'm doing this is so that we might be able to _practically_
use cgroups as an unprivileged user (something that will almost
certainly be useful to not just the container crowd, but people also
planning on using cgroups as advanced forms of rlimits).
--
Aleksa Sarai
Software Engineer (Containers)
SUSE Linux GmbH
https://www.cyphar.com/
--
To unsubscribe from this list: send the line "unsubscribe cgroups" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html