On Thu, Nov 7, 2019 at 10:23 AM Andrea Arcangeli <aarcange@xxxxxxxxxx> wrote: > > On Thu, Nov 07, 2019 at 08:15:53AM -0800, Daniel Colascione wrote: > > You're already paying for bounds checking. Receiving a message via a > > datagram socket is basically the same thing as what UFFD's read is > > doing anyway. > > Except it's synchronous and there are no dynamic allocations required > in uffd, while af_netlink and af_unix both all deal with queue of > events in skbs dynamically allocated. Do you have any evidence that skb allocation is a significant cost compared to a page fault and schedule? Regardless: if you don't want to use skbs, don't. My point is that recvmsg is the ideal interface for UFFD and i'm agnostic on the implementation of this interface. > And should then eventfd also become a netlink then? I mean uffd was > supposed to work like eventfd except it requires specialized events. You've raised eventfd as a model for UFFD on several occasions. I don't think eventfd is a good reference point. An eventfd is a single object with 64 bits of state. It can notify interested parties in changes to that state. Eventfd does not provide a queue. UFFD, however, *is* a queue. It provides an arbitrary number of state change notifications to a reader. In this way, UFFD is much more like a socket than it's like eventfd. That is, eventfd is about level-change notifications, but UFFD is about sending messages. > > Programs generally don't go calling recvmsg() on random FDs they get > > from the outside world. They do call read() on those FDs, which is why > > That programs generally don't do something only means the attack is > less probable. > > Programs generally aren't suid. Programs generally don't use > SCM_RIGHTS. Programs generally don't ignore the retval of > open/socket/uffd syscalls. Programs generally don't make assumptions > on the fd ID after one of those syscalls that install fds. > > If all programs generally do the right thing (where the most important > is to not make assumptions on the fd IDs and to check all syscall > retvals), there was never an issue to begin with even in uffd API. "The right thing" is a matter of contracts. If a program calls read() and behaves as if read() has only the effects read() is documented to have, that means that from the kernel's point of view, the program is doing the right thing. That you think certain practices are more prudent than others is irrelevant here. UFFD is a violation of read()'s *contract* and so if programs break after calling read(), it's the *kernel*'s fault. > > read() having unexpected side effects is terrible. > > If having unexpected side effects in read is "terrible" (i.e. I > personally prefer to use terms like terrible when there's at least > something that can be exploited in practice, not for theoretical > issues) for an SCM_RIGHTS receiving daemon, I just don't see how the > exact same unexpected (still theoretical) side effects in recvmsg with > an unexpected nested cmsg->cmsg_type == SCM_RIGHTS message being > returned, isn't terrible too. If a program calls recvmsg on an FD of unknown provenance, it *must* be prepared to receive file descriptors via SCM_RIGHTS. If it doesn't, it's a bug. The contract the kernel makes with userspace for recvmsg() includes the possibility of creating file descriptors. The contract the kernel makes with userspace for read() does not ordinarily involve creating file descriptors, so if the kernel does in fact do that, it's the kernel's problem. > > If you call it with a non-empty ancillary data buffer, you know to > > react to what you get. You're *opting into* the possibility of getting > > file descriptors. Sure, it's theoretically possible that a program > > calls recvmsg on random FDs it gets from unknown sources, sees > > SCM_RIGHTS unexpectedly, and just the SCM_RIGHTS message and its FD > > payload, but that's an outright bug, while calling read() on stdin is > > no bug. > > I'm not talking about stdin and suid. recvmsg might mitigate the > concern for suid (not certain, depends on the suid, if it's generally > doing what you expect most suid to be doing or not), I was talking > about the SCM_RIGHTS receiving daemon instead, the "worse" more > concerning case than the suid. > > I quote below Andy's relevant email: > > ====== > It's worse if SCM_RIGHTS is involved. > ====== > > Not all software will do this after calling recvmsg: > > if (cmsg->cmsg_type == SCM_RIGHTS) { > /* oops we got attacked and an fd was involountarily installed > because we received another AF_UNIX from a malicious attacker > in control of the other end of the SCM_RIGHTS-receiving > AF_UNIX connection instead of our expected socket family > which doesn't even support SCM_RIGHTS so we would never have > noticed an fd was installed after recvmsg */ > } If a program omits this code after calling recvmsg on a file descriptor of unknown provenance and the program breaks, it's the program's fault. It's reasonable to epect that recvmsg might create file descriptors if you call it on an unknown FD. It's unreasonable to expect a program to consider the possibility of read() creating file descriptors because read isn't documented to do that. >