On Tue, Oct 24, 2023 at 9:46 AM Serge E. Hallyn <serge@xxxxxxxxxx> wrote: > On Sun, Dec 18, 2022 at 08:29:10PM +0100, Stefan Bavendiek wrote: > > When building userspace application sandboxes, one issue that does not seem trivial to solve is the isolation of abstract sockets. > > Veeery late reply. Have you had any productive discussions about this in > other threads or venues? > > > While most IPC mechanism can be isolated by mechanisms like mount namespaces, abstract sockets are part of the network namespace. > > It is possible to isolate abstract sockets by using a new network namespace, however, unprivileged processes can only create a new empty network namespace, which removes network access as well and makes this useless for network clients. > > > > Same linux sandbox projects try to solve this by bridging the existing network interfaces into the new namespace or use something like slirp4netns to archive this, but this does not look like an ideal solution to this problem, especially since sandboxing should reduce the kernel attack surface without introducing more complexity. > > > > Aside from containers using namespaces, sandbox implementations based on seccomp and landlock would also run into the same problem, since landlock only provides file system isolation and seccomp cannot filter the path argument and therefore it can only be used to block new unix domain socket connections completely. > > > > Currently there does not seem to be any way to disable network namespaces in the kernel without also disabling unix domain sockets. > > > > The question is how to solve the issue of abstract socket isolation in a clean and efficient way, possibly even without namespaces. > > What would be the ideal way to implement a mechanism to disable abstract sockets either globally or even better, in the context of a process. > > And would such a patch have a realistic chance to make it into the kernel? > > Disabling them altogether would break lots of things depending on them, > like X :) (@/tmp/.X11-unix/X0). The other path is to reconsider network > namespaces. There are several directions this could lead. For one, as > Dinesh Subhraveti often points out, the current "network" namespace is > really a network device namespace. If we instead namespace at the > bind/connect/etc calls, we end up with much different abilities. The LSM layer supports access controls on abstract sockets, with at least two (AppArmor, SELinux) providing abstract socket access controls, other LSMs may provide controls as well. -- paul-moore.com