Hi Miklos, Here is the updated RFC patch for implementing FUSE_HANDLE_KILLPRIV_V2 and enabling SB_NOSEC. Previous discussion was here. https://lore.kernel.org/linux-fsdevel/20200716144032.GC422759@xxxxxxxxxx/ Based on previous discussion, you preferred that enabling SB_NOSEC should dependent on FUSE_HANDLE_KILLPRIV. We also agreed that implementing FUSE_HANDLE_KILLPRIV_V2 probably is a good idea because current version clears suid/sgid even if caller has CAP_FSETID in certain cases. So I decided to give it a try and here are the RFC patches. This is only compile and boot tested. Before I spend more time, I want to make sure I am heading in right direction. I have done some virtiofsd changes as well to enable handle_killpriv_v2 and proof of concept patch is here. https://github.com/rhvgoyal/qemu/commit/5c8b40dd94e094942df3fd2796b1ee468f9d3df3 TODO: I want to set set SB_NOSEC flag after I get a response from file server. But we send init request asynchronously. That means by the time fill_super finishes, we might not get resoponse from server and that means we might not enable SB_NOSEC. Before I improve these patches, I have my doubts that enabling SB_NOSEC should be dependent on FUSE_HANDLE_KILLPRIV_V2. And here is my rationale. If server clears suid/sgid/caps on chown/trunc/write always, then it probably just provides little better coherency in certain cases where client B might have modified file metadata and client A does not know about it. So those who have even stricter coherency requirement can enable FUSE_HANDLE_KILLPRIV_V2. But I am not sure why it should be a pre-requisite for SB_NOSEC. Even now, clearing of suid/sgid happens based on cached state of inode->mode. So even with SB_NOSEC disabled, it is very much possible that client B sets suid/sgid and client A write will not clear it. Only exception seems to be security.capability. Because we don't cache this xattr, currently we always check with file server if this xattr is set. If it is, we clear it. So while suid/sgid clearing is dedendent on cached attributes in client, clearing of caps is not. I feel this is probably more by accident and not design. To me, I can think of following two models. - weak coherency Clearing of suid/sgid/caps is driven by cached attributes in client. If attributes are stale, then suid/sgid/caps might not be cleared. This is the current default (except the case of caps). - strong coherency File server takes care of clearing suid/sgid/caps so that even if client cache is old/stale, server will still be able to clear it. This is what FUSE_HANDLE_KILLPRIV_V2 can achieve based on the use case. IOW, FUSE_HANDLE_KILLPRIV_V2 will help choose between weak/strong coherency model when it comes to clearing suid/sgid/caps. But notion of SB_NOSEC seems to be orthogonal to FUSE_HANDLE_KILLPRIV_V2 and SB_NOSEC should work both with and without FUSE_HANDLE_KILLPRIV_V2. FUSE_HANDLE_KILLPRIV_V2 has its own issues. Right now enabling it does not disable client driven clearing of suid/sgid/caps (file_remove_privs() and other chown/trucnation paths). If we actively work on supressing file_remove_privs/setattr when FUSE_HANDLE_KILLPRIV_V2 is set, then we have the issue of client attributes going stale (suid/sgid), as server might clear suid/sgid upon write but client will have no idea. May be we can keep track of completion of writes and clear suid/sgid in local cache or invalidate attrs etc. Something to think about. In short, I feel more inclined that SB_NOSEC should not be dependent on FUSE_HANDLE_KILLPRIV_V2. It should be a separate feature which works both with and without and user chooses FUSE_HANDLE_KILLPRIV_V2 if they want stronger coherency. If you are concerned about regression w.r.t clear of caps, then we can think of enabling SB_NOSEC conditionally. Say user chooses it as mount option. But given caps is just an outlier and currently we clear suid/sgid based on cache (and not based on state on server), I feel it might not be a huge issue. What do you think? Thanks Vivek Vivek Goyal (5): fuse: Introduce the notion of FUSE_HANDLE_KILLPRIV_V2 fuse: Set FUSE_WRITE_KILL_PRIV in cached write path fuse: Add a flag FUSE_SETATTR_KILL_PRIV fuse: For sending setattr in case of open(O_TRUNC) virtiofs: Support SB_NOSEC flag to improve direct write performance fs/fuse/dir.c | 13 +++++++++---- fs/fuse/file.c | 2 ++ fs/fuse/fuse_i.h | 6 ++++++ fs/fuse/inode.c | 17 ++++++++++++++++- fs/fuse/virtio_fs.c | 3 +++ include/uapi/linux/fuse.h | 18 +++++++++++++++++- 6 files changed, 53 insertions(+), 6 deletions(-) -- 2.25.4