On Sun, Dec 12, 2021 at 09:13:12AM -0500, James Bottomley wrote: > On Fri, 2021-12-10 at 12:49 +0100, Christian Brauner wrote: > > On Thu, Dec 09, 2021 at 02:38:13PM -0500, James Bottomley wrote: > [...] > > > @@ -317,21 +315,15 @@ EXPORT_SYMBOL_GPL(securityfs_create_symlink); > > > void securityfs_remove(struct dentry *dentry) > > > { > > > struct user_namespace *ns = dentry->d_sb->s_user_ns; > > > - struct inode *dir; > > > > > > if (!dentry || IS_ERR(dentry)) > > > return; > > > > > > - dir = d_inode(dentry->d_parent); > > > - inode_lock(dir); > > > if (simple_positive(dentry)) { > > > - if (d_is_dir(dentry)) > > > - simple_rmdir(dir, dentry); > > > - else > > > - simple_unlink(dir, dentry); > > > + d_delete(dentry); > > > > Not, that doesn't work. You can't just call d_delete() and dput() and > > even if I wouldn't advise it. And you also can't do this without > > taking the inode lock on the directory. > > Agreed on that > > > simple_rmdir()/simple_unlink() take care to update various inode > > fields in the parent dir and handle link counts. This really wants to > > be sm like > > > > struct inode *parent_inode; > > > > parent_inode = d_inode(dentry->d_parent); > > inode_lock(parent_inode); > > if (simple_positive(dentry)) { > > dget(dentry); > > if (d_is_dir(dentry) > > simple_unlink(parent_inode, dentry); > > else > > simple_unlink(parent_inode, dentry); > > d_delete(dentry); > > dput(dentry); > > } > > inode_unlock(parent_inode); > > It just slightly annoys me how the simple_ functions change fields in > an inode that is about to be evicted ... it seems redundant; plus we > shouldn't care if the object we're deleting is a directory or file. I > also don't think we need the additional dget because the only consumer > is policy file removal and the opener of that file will have done a > dget. The inode lock now prevents us racing with another remove in the > case of two simultaneous writes. > > How about > > struct inode *parent_inode; > > parent_inode = d_inode(dentry->d_parent); > inode_lock(parent_inode); > if (simple_positive(dentry)) { > drop_nlink(parent_inode); > d_delete(dentry); > dput(dentry); > } > inode_unlock(parent_inode); It doesn't just change fields in an inode that is about to be evicted. It changes fields in the parent inode. If you're deleting any file or directory your function currently fails to update mtime and ctime for the parent directory. What you're doing below also isn't all that future proof or safe for callers other than ima. Consider a future caller that might want to call securityfs_remove() with .open = first_file() .relase = first_release( securityfs_remove(second_file) ) .open = second_file() If your securityfs_remove() is called from the first file's release function while the second_file is still open and thus holds a reference and won't go way during first_release()'s securityfs_remove() call you have just failed to update relevant inode fields of a file that can still be queried via stat* functions and can be used to create other files below it. In addition, if someone accidently calls your securityfs_remove() on a directory that is not-empty you're effectively deleting the directory without deleting the files in that directory first whereas simple_rmdir() would tell you to go away. If a user later needs an .unlink/.rmdir method for securityfs or allows calls of securityfs_remove() on the same dentry from concurrent locations you need the dget() in securityfs_remove() even if the inode_lock() is exclusive otherwise you can end up doing a dput() too many, iirc. I would recommened to not turn this into a nih exercise for simple vfs functionality. Your version isn't even significantly simpler. The securityfs_remove() function doesn't need to be reinvented. void securityfs_remove(struct dentry *dentry) { struct inode *dir; if (WARN_ON(!dentry || IS_ERR(dentry))) return; dir = d_inode(dentry->d_parent); inode_lock(dir); if (simple_positive(dentry)) { dget(dentry); if (d_is_dir(dentry)) simple_rmdir(dir, dentry); else simple_unlink(dir, dentry); d_delete(dentry); dput(dentry); } inode_unlock(dir); } I'm not claiming or trying to make this the most minimal version of securityfs_remove() that we could possibly get but I'm making it one where we don't have to worry that there's a subtle corner case that'll bite us in the future just because we tried to hand-massage a function that isn't in any hotpath.