On Wed, May 20, 2020 at 4:02 AM cgxu <cgxu519@xxxxxxxxxxxx> wrote: > > On 5/6/20 5:53 PM, Chengguang Xu wrote: > > Current syncfs(2) syscall on overlayfs just calls sync_filesystem() > > on upper_sb to synchronize whole dirty inodes in upper filesystem > > regardless of the overlay ownership of the inode. In the use case of > > container, when multiple containers using the same underlying upper > > filesystem, it has some shortcomings as below. > > > > (1) Performance > > Synchronization is probably heavy because it actually syncs unnecessary > > inodes for target overlayfs. > > > > (2) Interference > > Unplanned synchronization will probably impact IO performance of > > unrelated container processes on the other overlayfs. > > > > This patch tries to only sync target dirty upper inodes which are belong > > to specific overlayfs instance and wait for completion. By doing this, > > it is able to reduce cost of synchronization and will not seriously impact > > IO performance of unrelated processes. > > > > Signed-off-by: Chengguang Xu <cgxu519@xxxxxxxxxxxx> > > Except explicit sycnfs is triggered by user process, there is also implicit > syncfs during umount process of overlayfs instance. Every syncfs will > deliver to upper fs and whole dirty data of upper fs syncs to persistent > device at same time. > > In high density container environment, especially for temporary jobs, > this is quite unwilling behavior. Should we provide an option to > mitigate this effect for containers which don't care about dirty data? > This is not the first time this sort of suggestion has been made: https://lore.kernel.org/linux-unionfs/4bc73729-5d85-36b7-0768-ae5952ae05e9@xxxxxxxxxxxx/T/#md5fc5d51852016da7e042f5d9e5ef7a6d21ea822 At the time, I proposed to use the SHUTDOWN ioctl as a means for containers runtime to communicate careless teardown. I've pushed an uptodate version of ovl-shutdown RFC [1]. It is only lightly tested. It does not take care of OVL_SHUTDOWN_FLAGS_NOSYNC, but this is trivial. I also think it misses some smp_mb__after_atomic() for accessing ofs->goingdown and ofs->creator_cred. I did not address my own comments on the API [2]. And there are no tests at all. If this works for your use case, let me know how you want to proceed. I could re-post the ioctl and access hook patches, leaving out the actual shutdown patch for you to work on. You may add some of your own patched, write tests and post v2. Thanks, Amir. [1] https://github.com/amir73il/linux/commits/ovl-shutdown [2] https://lore.kernel.org/linux-unionfs/CAOQ4uxiau7N6NtMLzjwPzHa0nMKZWi4nu6AwnQkR0GFnKA4nPg@xxxxxxxxxxxxxx/