On Wed, May 31, 2017 at 3:54 PM, Amir Goldstein <amir73il@xxxxxxxxx> wrote: > On Wed, May 31, 2017 at 1:09 PM, Miklos Szeredi <miklos@xxxxxxxxxx> wrote: >> On Tue, May 23, 2017 at 11:50 AM, Amir Goldstein <amir73il@xxxxxxxxx> wrote: >>> Added an i_state flag I_INUSE and helpers to set/clear/test the bit. >>> >>> The 'inuse' lock is an 'advisory' inode lock, which also provides >>> may_delete() protection, so can be used to extend exclusive create >>> protection beyond parent->i_mutex lock among cooperating users. >>> >>> This is going to be used by overlayfs to get exclusive ownership >>> on upper and work dirs among overlayfs mounts. >> >> Not sure I like the delete protection. Any modification of workdir or >> layers while mounted might cause inconsistencies or errors in the >> overlay. So why single out deletion of base directories? >> > > There are a few reasons why 'inuse' inode should not be deleted, > regardless of whether delete protection is needed by overlayfs or not > (I think we don't need it). > > 1. setting INUSE on a FREEING|WILL_FREE inode is not allowed > so preventing delete on INUSE makes the possible states fewer and > easier to manage. Overlayfs keeps a ref on upperdir, so the inode cannot be deleted only unhashed. No ref is kept on workdir, because we don't currently use it for anything other than creating the empty work directory inside, but if we mark it inuse, we should keep a ref on it as well. Maybe the interface should be: struct dentry *d_try_to_use(struct dentry *dentry) { struct inode *inode = d_inode(dentry); spin_lock(&inode->i_lock); if (inode->i_state & I_INUSE) { spin_unlock(&inode->i_lock); return NULL; } inode->i_state |= I_INUSE; spin_unlock(&inode->i_lock); return dget(dentry) } void d_unuse(struct dentry *dentry) { struct inode *inode = d_inode(dentry); WARN_ON(!(inode->i_state & I_INUSE)); spin_lock(&inode->i_lock); inode->i_state &= ~I_INUSE; spin_unlock(&inode->i_lock); dput(dentry); } > > 2. With latest patchset I also implemented wait_on_inode_inuse() > https://github.com/amir73il/linux/blob/ovl-dir-lock/fs/inode.c#L2175 > which is later used by to copy up code for index hardlink. Need to see these patches to see what's going on here. > By preventing delete, I can isolate I_INUSE waiters from I_NEW waiters > and don't need to deal with INUSE waiters and inode delete. > > 3. Backwards justification: the man page for unlink(2) and rmdir(2) > already explain EBUSY in a generic way: > "pathname cannot be unlinked because it is being used by the system ..." > "pathname is currently in use by the system or ..." That's fine. I'm not objecting to the error value. I'm objecting to special casing the root upperdentry wrt. delete/modification protection. Make I_INUSE recursive? I think it would be an overkill. Just let it do the minimal thing that needs to be done to prevent unobvious configuration errors. Removing upperdir or workdir is pretty obviously going to break the overlay, so I don't think we need to worry about that. Thanks, Miklos