On Wed, Dec 1, 2021 at 6:24 PM Chengguang Xu <cgxu519@xxxxxxxxxxxx> wrote: > > ---- 在 星期三, 2021-12-01 21:46:10 Jan Kara <jack@xxxxxxx> 撰写 ---- > > On Wed 01-12-21 09:19:17, Amir Goldstein wrote: > > > On Wed, Dec 1, 2021 at 8:31 AM Chengguang Xu <cgxu519@xxxxxxxxxxxx> wrote: > > > > So the final solution to handle all the concerns looks like accurately > > > > mark overlay inode diry on modification and re-mark dirty only for > > > > mmaped file in ->write_inode(). > > > > > > > > Hi Miklos, Jan > > > > > > > > Will you agree with new proposal above? > > > > > > > > > > Maybe you can still pull off a simpler version by remarking dirty only > > > writably mmapped upper AND inode_is_open_for_write(upper)? > > > > Well, if inode is writeably mapped, it must be also open for write, doesn't > > it? The VMA of the mapping will hold file open. So remarking overlay inode > > dirty during writeback while inode_is_open_for_write(upper) looks like > > reasonably easy and presumably there won't be that many inodes open for > > writing for this to become big overhead? I think it should be ok and a good tradeoff of complexity vs. performance. > > > > > If I am not mistaken, if you always mark overlay inode dirty on ovl_flush() > > > of FMODE_WRITE file, there is nothing that can make upper inode dirty > > > after last close (if upper is not mmaped), so one more inode sync should > > > be enough. No? > > > > But we still need to catch other dirtying events like timestamp updates, > > truncate(2) etc. to mark overlay inode dirty. Not sure how reliably that > > can be done... > > Oh yeh, we have those as well :) All those cases should be covered by ovl_copyattr() that updates the ovl inode ctime/mtime, so always dirty in ovl_copyattr() should be good. I *think* the only case of ovl_copyattr() that should not dirty is in ovl_inode_init(), so need some special helper there. > > To be honest I even don't fully understand what's the ->flush() logic in overlayfs. > Why should we open new underlying file when calling ->flush()? > Is it still correct in the case of opening lower layer first then copy-uped case? > The semantics of flush() are far from being uniform across filesystems. most local filesystems do nothing on close. most network fs only flush dirty data when a writer closes a file but not when a reader closes a file. It is hard to imagine that applications rely on flush-on-close of rdonly fd behavior and I agree that flushing only if original fd was upper makes more sense, so I am not sure if it is really essential for overlayfs to open an upper rdonly fd just to do whatever the upper fs would have done on close of rdonly fd, but maybe there is no good reason to change this behavior either. Thanks, Amir.