On Sun, Feb 02, 2014 at 07:21:04PM +0000, Al Viro wrote: > On Sat, Feb 01, 2014 at 10:43:01PM +0000, Al Viro wrote: > > > * pipe_buffer_operations ->map()/->unmap() should die; let the caller do > > k{,un}map{,_atomic}(). All instances have the same method there and > > there's no point to make it different. PIPE_BUF_FLAG_ATOMIC should also > > go. > > BTW, another pile of code interesting in that respect (i.e. getting that > interface right) is fs/fuse/dev.c; I don't like the way it's playing > with get_user_pages_fast() there, It's trying to work around a deadlock by lock_page() recursion. I.e. fuse daemon is reading a request into a page for which the readpage is being serviced by said request (no, that's not something that happens accidentally). Do all archs set FAULT_FLAG_KILLABLE? If so, that flag along with code depending on the lack of it can go away and we can simply depend on page faults being interruptible by fatal signals. And that would simplify the fuse device I/O substantially. > and I doubt that sharing the code for > read and write side as it's done there makes much sense, but it's > definitely going to be a test for any API of that kind. It *does* > try to unify write-from-iovec with write-from-array-of-pages and > similar for reads; the interesting issue is that unlike the usual > write-to-pagecache we can have many chunks picked from one page and > we'd rather avoid doing kmap_atomic/kunmap_atomic for each of those. > > I suspect that the right answer is, in addition to a primitive that > does copying from iov_iter to have "copy from iov_iter and be ready > to copy more from soon after" + "done copying"; for the "array of > pages" the former would be allowed to leave the current page mapped, > skipping kmap_atomic() on the next call. And the latter would unmap. > of course. The caller is responsible for not blocking or doing > unbalanced map/unmap until it's said "done copying". > > BTW, is there any reason why fuse/dev.c doesn't use atomic kmaps for > everything? After all, as soon as we'd done kmap() in there, we > grab a spinlock and don't drop it until just before kunmap(). With > nothing by memcpy() done in between... Miklos? AFAICS, we only win > from switching to kmap_atomic there - we can't block anyway, we don't > need it to be visible on other CPUs and nesting isn't a problem. > Looks like it'll be cheaper in highmem cases and do exactly the same > thing as now for non-highmem... Comments? We don't hold the spinlock. But regardless, I don't see any reason why it couldn't be atomic kmap. Thanks, Miklos -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html