On Fri, Jun 30, 2023 at 11:23 AM Mickaël Salaün <mic@xxxxxxxxxxx> wrote: > > > On 30/06/2023 06:18, Jeff Xu wrote: > > On Thu, Jun 29, 2023 at 4:07 AM Mickaël Salaün <mic@xxxxxxxxxxx> wrote: > >> > >> > >> On 29/06/2023 05:18, Jeff Xu wrote: > >>> resend. > >>> > >>> On Wed, Jun 28, 2023 at 12:29 PM Mickaël Salaün <mic@xxxxxxxxxxx> wrote: > >>>> > >>>> > >>>> On 28/06/2023 19:03, Jeff Xu wrote: > >>>>> Hello, > >>>>> > >>>>> Thanks for writing up the example for an incoming TCP connection ! It > >>>>> helps with the context. > >>>>> > >>>>> Since I'm late to this thread, one thing I want to ask: all the APIs > >>>>> proposed so far are at the process level, we don't have any API that > >>>>> applies restriction to socket fd itself, right ? this is what I > >>>>> thought, but I would like to get confirmation. > >>>> > >>>> Restriction are applied to actions, not to already existing/opened FDs. > >>>> We could add a way to restrict opened FDs, but I don't think this is the > >>>> right approach because sandboxing is a deliberate action from a process, > >>>> and it should already take care of its FDs. > >>>> > >>>> > >>>>> > >>>>> On Wed, Jun 28, 2023 at 2:09 AM Günther Noack <gnoack@xxxxxxxxxx> wrote: > >>>>>> > >>>>>> Hello! > >>>>>> > >>>>>> On Mon, Jun 26, 2023 at 05:29:34PM +0200, Mickaël Salaün wrote: > >>>>>>> Here is a design to be able to only allow a set of network protocols and > >>>>>>> deny everything else. This would be complementary to Konstantin's patch > >>>>>>> series which addresses fine-grained access control. > >>>>>>> > >>>>>>> First, I want to remind that Landlock follows an allowed list approach with > >>>>>>> a set of (growing) supported actions (for compatibility reasons), which is > >>>>>>> kind of an allow-list-on-a-deny-list. But with this proposal, we want to be > >>>>>>> able to deny everything, which means: supported, not supported, known and > >>>>>>> unknown protocols. > >>>>>>> > >>>>>>> We could add a new "handled_access_socket" field to the landlock_ruleset > >>>>>>> struct, which could contain a LANDLOCK_ACCESS_SOCKET_CREATE flag. > >>>>>>> > >>>>>>> If this field is set, users could add a new type of rules: > >>>>>>> struct landlock_socket_attr { > >>>>>>> __u64 allowed_access; > >>>>>>> int domain; // see socket(2) > >>>>>>> int type; // see socket(2) > >>>>>>> } > >>>>>>> > >>>>>>> The allowed_access field would only contain LANDLOCK_ACCESS_SOCKET_CREATE at > >>>>>>> first, but it could grow with other actions (which cannot be handled with > >>>>>>> seccomp): > >>>>>>> - use: walk through all opened FDs and mark them as allowed or denied > >>>>>>> - receive: hook on received FDs > >>>>>>> - send: hook on sent FDs > >>>>>>> > >>>>>>> We might also use the same approach for non-socket objects that can be > >>>>>>> identified with some meaningful properties. > >>>>>>> > >>>>>>> What do you think? > >>>>>> > >>>>>> This sounds like a good plan to me - it would make it possible to restrict new > >>>>>> socket creation using protocols that were not intended to be used, and I also > >>>>>> think it would fit the Landlock model nicely. > >>>>>> > >>>>>> Small remark on the side: The security_socket_create() hook does not only get > >>>>>> invoked as a result of socket(2), but also as a part of accept(2) - so this > >>>>>> approach might already prevent new connections very effectively. > >>>>>> > >>>>> That is an interesting aspect that might be worth discussing more. > >>>>> seccomp is per syscall, landlock doesn't necessarily follow the same, > >>>>> another design is to add more logic in Landlock, e.g. > >>>>> LANDLOCK_ACCESS_SOCKET_PROTOCOL which will apply to all of the socket > >>>>> calls (socket/bind/listen/accept/connect). App dev might feel it is > >>>>> easier to use. > >>>> > >>>> seccomp restricts the use of the syscall interface, whereas Landlock > >>>> restricts the use of kernel objects (i.e. the semantic). > >>>> > >>>> We need to find a good tradeoff between a lot of access rights and a few > >>>> grouping different actions. This should make sense from a developer > >>>> point of view according to its knowledge and use of the kernel > >>>> interfaces (potential wrapped with high level libraries), but also to > >>>> the semantic of the sandbox and the security guarantees we want to provide. > >>>> > >>>> We should also keep in mind that high level Landlock libraries can take > >>>> care of potential coarse-grained use of restrictions. > >>>> > >>>> > >>>>> > >>>>>> Spelling out some scenarios, so that we are sure that we are on the same page: > >>>>>> > >>>>>> A) > >>>>>> > >>>>>> A program that does not need networking could specify a ruleset where > >>>>>> LANDLOCK_ACCESS_SOCKET_CREATE is handled, and simply not permit anything. > >>>>>> > >>>>>> B) > >>>>>> > >>>>>> A program that runs a TCP server could specify a ruleset where > >>>>>> LANDLOCK_NET_BIND_TCP, LANDLOCK_NET_CONNECT_TCP and > >>>>>> LANDLOCK_ACCESS_SOCKET_CREATE are handled, and where the following rules are added: > >>>>>> > >>>>>> /* From Konstantin's patch set */ > >>>>>> struct landlock_net_service_attr bind_attr = { > >>>>>> .allowed_access = LANDLOCK_NET_BIND_TCP, > >>>>>> .port = 8080, > >>>>>> }; > >>>>>> > >>>>>> /* From Mickaël's proposal */ > >>>>>> struct landlock_socket_attr sock_inet_attr = { > >>>>>> .allowed_access = LANDLOCK_ACCESS_SOCKET_CREATE, > >>>>>> .domain = AF_INET, > >>>>>> .type = SOCK_STREAM, > >>>>>> } > >>>>>> > >>>>>> struct landlock_socket_attr sock_inet6_attr = { > >>>>>> .allowed_access = LANDLOCK_ACCESS_SOCKET_CREATE, > >>>>>> .domain = AF_INET6, > >>>>>> .type = SOCK_STREAM, > >>>>>> } > >>>>>> > >>>>>> That should then be enough to bind and listen on ports, whereas outgoing > >>>>>> connections with TCP and anything using other network protocols would not be > >>>>>> permitted. > >>>>>> > >>>>> TCP server is an interesting case. From a security perspective, a > >>>>> process cares if it is acting as a server or client in TCP, a server > >>>>> might only want to accept an incoming TCP connection, never initiate > >>>>> an outgoing TCP connection, and a client is the opposite. > >>>>> > >>>>> Processes can restrict outgoing/incoming TCP connection by seccomp for > >>>>> accept(2) or connect(2), though I feel Landlock can do this more > >>>>> naturally for app dev, and at per-protocol level. seccomp doesn't > >>>>> provide per-protocol granularity. > >>>> > >>>> Right, seccomp cannot filter TCP ports. > >>>> > >>>>> > >>>>> For bind(2), iirc, it can be used for a server to assign dst port of > >>>>> incoming TCP connection, also by a client to assign a src port of an > >>>>> outgoing TCP connection. LANDLOCK_NET_BIND_TCP will apply to both > >>>>> cases, right ? this might not be a problem, just something to keep > >>>>> note. > >>>> > >>>> Good point. I think it is in line with the rule definition: to allow to > >>>> bind on a specific port. However, if clients want to set the source port > >>>> to a (legitimate) value, then that would be an issue because we cannot > >>>> allow a whole range of ports (e.g., >= 1024). I'm not sure if this > >>>> practice would be deemed "legitimate" though. Do you know client > >>>> applications using bind? My understanding is that the higher protocol might negotiate and assign both ports for a new connection (I think SIP does this for RTP, but that is UDP. I don't know any case for TCP). > >>>> > >>>> Konstantin, we should have a test for this case anyway. > >> > >> Thinking more about TCP clients binding sockets, a > >> LANDLOCK_ACCESS_NET_LISTEN_TCP would be more useful than > >> LANDLOCK_ACCESS_NET_BIND_TCP, but being able to limit the scope of > >> "bindable" ports is also valuable to forbid a malicious sandboxed > >> process to impersonate a legitimate server process. This also means that > >> it might be interesting to be able to handle port ranges. > >> > >> We already have a LANDLOCK_ACCESS_NET_BIND_TCP implementation and > >> related tests, so I think we should proceed with that. The next > >> network-related patch series should implement this > >> LANDLOCK_ACCESS_NET_LISTEN_TCP access right though, which should not be > >> difficult thanks to the framework implemented with current patch series. > >> > >> Konstantin, would you like to develop the TCP listening access control > >> once this patch series land? > >> > >> > >>>>>> (Alternatively, it could bind() the socket early, *then enable Landlock* and > >>>>>> leave out the rule for BIND_TCP, only permitting SOCKET_CREATE for IPv4 and > >>>>>> IPv6, so that listen() and accept() work on the already-bound socket.) > >>>>>> > >>>>> For this approach, LANDLOCK_ACCESS_SOCKET_PROTOCOL is a better name, > >>>>> so dev is fully aware it is not just applied to socket create. > >>>> > >>>> I don't get the semantic of LANDLOCK_ACCESS_SOCKET_PROTOCOL. What does > >>>> PROTOCOL mean? > >>>> > >>> I meant checking family + type of socket, and apply to all of > >>> socket(2),bind(2),accept(2),connect(2),listen(2), maybe > >>> send(2)/recv(2) too. > >> > >> OK, that would be kind of similar to the LANDLOCK_ACCESS_SOCKET_USE > >> description. However, I think this kind of global approach has several > >> issues: > >> - This covers a lot of different aspects and would increase the cost of > >> development/testing/review. > > True. > > > >> - Whereas it wraps different actions, it will not let user space have a > >> fine-grained access control on these, which could be useful for some use > >> cases. > > Make sense. > > > >> - I don't see the point of restricting accept(2) if we can already > >> restrict bind(2) and listen(2). accept(2) could be useful to identify > >> the remote peer but I'm not convinced this would make sense, and if it > >> would, then this can be postponed until we have a way to identify peers. > > > > I was thinking about a case that the socket was created/bind/listen in > > another process, then passed into the current process, > > > > For example: > > Process A has : > > LANDLOCK_ACCESS_SOCKET_CREATE (family = f1, type = t1) > > socket s1 is created in process A with family = f1, type = t1, and > > bind/listen to port p1. > > > > socket s1 is passed to process B > > Process B has: > > LANDLOCK_ACCESS_SOCKET_CREATE (family =f1, type = t2) (note the type > > is different than A) > > LANDLOCK_ACCESS_NET_{CONNECT,BIND}_TCP (port = p2) > > > > However, those rules in B don't restrict process B from using > > accept(s1), s1 is another type. > > Indeed, but why process A would pass this FD to B? Do you have real use > cases in mind? > > In case of confuse deputy attack, I'm convinced there is much more > chance for B to just ask A to do nasty thing, no need to receive an FD, > just to write data to the socket/IPC. > Yes. It is the confuse deputy attack I was referring to. Though, it is more of a design options type of questions than a real world attack case. > > > > In accept(2), struct sockaddr contains sa_family_t (AF_xx) but no > > type, which is strange to me, the API should either include both, or > > none (accept whatever it is already in socket fd, which is set during > > creation time). > > I think sockaddr defines the minimal requirement to deal with > accept/bind/connect. The sin_family is require to define the type of > address and port according, but the type is not. > > > > > looking into accept(2) implementation: it calls: sock->ops->accept > > iiuc, sock->ops is set during socket(2), allowing each protocol to > > have its own implementation. > > > > When we consider a> our intention to restrict family + type of socket, > > with b> socket can be passed between processes, > > there can be a need to harden the check (family + type) for all of > > bind/listen/accept/connect. Otherwise, there is still a possibility > > that the process to accept a socket of different type unintentionally. > > > > This means: > > LANDLOCK_ACCESS_SOCKET_ATTR_CREATE (family =f1, type = t2) > > LANDLOCK_ACCESS_SOCKET_ATTR_BIND (family =f1, type = t2) > > LANDLOCK_ACCESS_SOCKET_ATTR_ACCEPT (family =f1, type = t2) > > LANDLOCK_ACCESS_SOCKET_ATTR_ LISTEN (family =f1, type = t2) > > LANDLOCK_ACCESS_SOCKET_ATTR_CONNECT (family =f1, type = t2) > > Note: this checks family+type only, not port. > > The check is applied to all protocols, so not specific to TCP/UDP > > The sandboxing/Landlock threat model is to restrict a process when it is > sandboxed, but this sandboxing is a request from the same process (or > one of its parent) that happen when it is more trustworthy (or at least > has more privileges) than after it sandbox itself. > > The process sandboxing itself can use several kernel features, and one > of it is Landlock. In any case, it should take care of closing file > descriptors that should not be passed to the sandboxed process. > Agree. > The limits of sandboxing are the communication channels from and to > outside the sandbox. The peers talking with sandboxed processes should > then not be subject to confused deputy attacks, which means they must > not enable to bypass the user-defined security policy (from which the > Landlock policy is only a part). Receiving file descriptors should then > not be more important than controlling the communication channels. If a > not-sandboxed process is willing to give more right to a sandboxed > process, by passing FDs or just receiving commands, then this > not-sandboxed process need to be fixed. > > This is the rationale to not care about received nor sent file > descriptors. The communication channels and the remote peers must be > trusted to not give more privileges to the sandboxed processes. > > If a peer is malicious, it doesn't need to pass a file descriptor to the > sandboxed process, it can just read (data) commands and apply them to > its file descriptors. I see the reasoning. i.e. sandboxing the process is not more important than securing communication channels, or securing the peer. So in a system that let a peer process to pass a socket into a higher privileged process, when the communication channel or the peer process is compromised, e.g. swapping the fd/socket into a different one that the attacker controls, confuse deputy attack can happen. The recommendation here is to secure peer and communication. I agree with this approach in general. I need to think about how it applies to specific cases. > I think the ability to pass file descriptors > should be seen as a way to improve performance by avoiding a user space > process to act as a proxy receiving read/write commands and managing > file descriptors itself. On the other hand, file descriptors could be > used as real capabilities/tokens to manage access, but senders still > need to be careful to only pass the required ones. > > All this to say that being able to restrict actions on file descriptors > would be useful for senders/services to send a subset of the file > descriptor capabilities (cf. Capsicum), but not the other way around. > In the Landlock kernel doc: Similarly to file access modes (e.g. O_RDWR), Landlock access rights attached to file descriptors are retained even if they are passed between processes (e.g. through a Unix domain socket). Such access rights will then be enforced even if the receiving process is not sandboxed by Landlock. Indeed, this is required to keep a consistent access control over the whole system, and this avoids unattended bypasses through file descriptor passing (i.e. confused deputy attack). iiuc, the design for file and socket in landlock is different. For socket, the access rules are applied only to the current process (more like seccomp), while for file restriction, the rules can be passed into another un-landlocked process. > > > > >> - For performance reasons, we should avoid restricting > >> send/recv/read/write but instead only restrict the control plane: object > >> creation and configuration. > >> > > Performance is a valid concern. > > > > As example of server, usually the main process listens/accepts incoming > > connections, and forked processes do send/recv, the main process can > > be viewed as a control plane, and send/recv can be viewed as a data > > plane. It makes sense that we start with the control plane. > > > > We might like to keep a note that by not restricting send/recv, a > > socket can be created OOP, then passed into current process and call > > send/recv, so the network is not fully disabled by landlock alone > > (still need seccomp) > > Right, the kernel (and then Landlock) is not enough to sandbox a > complete environment, user space needs to be aware and be configured for > that too. > > I understand the desire to restrict as much as possible, but this > require to add more code and then it increase the risk of bugs, whereas > it might not be a big deal for attackers. I don't think the cost is > worth it and I don't want to give a false sense of security that could > let users think their application cannot communicate with the network if > it can communicate with local processes connected to the network. > Agree with the cost/benefit concern. I think the current design is already good enough for the decompression program case in https://cr.yp.to/unix/disablenetwork.html, as mentioned in Günther's response. > > > > Things might get more complicated, say: a forked process is intended > > to send/recv UDP, but was confused and got a TCP socket from > > OOP, etc. This is not different than accept(2) case above. There might > > be an opportunity for Landlock to harden this when we design for > > data-plane. > > > >> I'm not convinced that being able to control all kind of socket bind, > >> listen and connect actions might be worth implementing instead of a > >> fine-grained access control for the main protocols (TCP, UDP, unix and > >> vsock maybe), with the related tests and guarantees. > >> > >> However, this landlock_socket_attr struct could have an allowed_access > >> field that could contain LANDLOCK_ACCESS_NET_{CONNECT,LISTEN,BIND}_TCP > >> rights (which would just not be constrained by any port, except if a > >> landlock_net_port_attr rule matches). It would then make sense to rename > >> LANDLOCK_ACCESS_SOCKET_CREATE to LANDLOCK_ACCESS_NET_CREATE_SOCKET. This > >> right would not be accepted in a landlock_net_port_attr.allowed_access > >> though. > >> > > I'm not sure if my view is fully explained. I don't mean to control > > all kinds of socket bind/listen/connect actions. > > My view is: > > 1> have a rule to check family + type, to make sure the process is > > using the socket type they intend to use, such as > > LANDLOCK_ACCESS_SOCKET_ATTR_{CREATE|CONNECT|BIND|ACCEPT|LISTEN}, as > > discussed in accept(2) case. > > 2> have protocol specific rules, such as LANDLOCK_ACCESS_NET_{CONNECT,BIND}_TCP. > > So bind(2) will be checked by both 1 and 2. > > Right, I understand your point. > Great ! Thanks a lot for the discussion ! > > > > As example of TCP server, the process will use: > > LANDLOCK_ACCESS_SOCKET_ATTR_{CREATE|BIND|ACCEPT|LISTEN} > > LANDLOCK_ACCESS_NET_{BIND}_TCP > > > >>> > >>> s/LANDLOCK_ACCESS_SOCKET_CREATE/LANDLOCK_ACCESS_SOCKET_TYPE. > >>> > >>> This implies the kernel will check on socket fd's property (family + > >>> type) at those calls, this applies to > >>> a - the socket fd is created within the process, after landlock is applied. > >>> b - created in process prior to landlock is applied. > >>> c - created out of process then passed into this process, > >> > >> OK, these are the same rules as for LANDLOCK_ACCESS_NET_{CONNECT,BIND}_TCP. > > > > I don't mean this to be _TCP specific, this is still the family + type > > discussion above. > > Yes, I meant that your a/b/c rules would apply for the current > LANDLOCK_ACCESS_NET_{CONNECT,BIND}_TCP types as well.