On Fri, Feb 10, 2023 at 7:15 AM Andy Lutomirski <luto@xxxxxxxxxx> wrote: > > Frankly, I really don't like having non-immutable data in a pipe. That statement is completely nonsensical. A pipe is a kernel buffer. If you want the buffer to be immutable, you do "read()" and "write()" on it. End of story. In contrast, "splice()" is literally the "map" operation. It's "mmap()", without the "m", because it turns out that memory mapping has a few problems: (a) mmap fundamentally only works on a page granularity (b) mmap has huge setup and teardown costs with page tables and TLB's and so splice() is just basically "strange mmap with that kernel buffer that is pipe" Really. That is what spice *is*. There's absolutely no question about it. It has some advantages over mmap, in that because it's a software mapping, we can "map" partial pages, which means that you can also map data that might not be in full pages (it might be an incoming network buffer, for example). It has a lot of disadvantages over mmap too, of course. Not using hardware means that it doesn't really show up as a linear mapping, and you don't get the nice hardware lookup accelerations - but it also means that you don't have the downsides (ie TLB costs etc). So I'm not saying that it is the same thing as "mmap", but I very much _am_ saying that there's a very real and direct similarity. There are three fundamental IO models in Unix: read, write, and mmap. And mmap() is very much a "you get a direct window into something that can change under you". And splice() is exactly that same thing. It was literally designed to be "look, we want zero-copy networking, and we could do 'sendfile()' by mmap'ing the file, but mmap - and particularly munmap - is too expensive, so we map things into kernel buffers instead". So saying "I really don't like having non-immutable data in a pipe" is complete nonsense. It's syntactically correct English, but it makes no conceptual sense. You can say "I don't like 'splice()'". That's fine. I used to think splice was a really cool concept, but I kind of hate it these days. Not liking splice() makes a ton of sense. But given splice, saying "I don't like non-immutable data" really is complete nonsense. If you want a stable buffer, use read() and write(). It's that simple. If you want to send data from a file to the network, and want a stable buffer in between the two, then "read()" and "write()" is *exactly* what you should do. With read and write, there's no mmap()/munmap() overhead of the file, and you already have the buffer (iwe call it "user address space"). The only downside is the extra copy. So if you want to send stable, unchanging file contents to the network, there is absolutely *no* reason to ever involve a pipe at all, and you should entirely ignore splice(). The *only* reason to ever use splice() is because you don't want to copy data, and just want a reference to it, and want to keep it all in kernel space, because the kernel<->user boundary ends up either requiring copies, or that page alignment, and is generally fairly expensive. But once you decide to go that way, you need to understand that you don't have "immutable data". You asked for a reference, you got a reference, and it *will* change. That's not something specific to "splice()". It's fundamental to the whole *concept* of zero-copy. If you don't want copies, and the source file changes, then you see those changes. So saying "I really don't like having non-immutable data in a pipe" really is nonsense. Don't use splice. Or do, and realize that it's a *mapping*. Because that is literally the whole - and ONLY - reason for splice in the first place. Linus