Re: [PATCH v3] ovl: improving copy-up efficiency for big sparse file

Chengguang Xu <cgxu519@xxxxxxxxxxxx> · Fri, 01 Nov 2019 10:19:13 +0800

 ---- 在 星期四, 2019-10-31 22:14:54 Amir Goldstein <amir73il@xxxxxxxxx> 撰写 ----
 > On Thu, Oct 31, 2019 at 12:47 PM Chengguang Xu <cgxu519@xxxxxxxxxxxx> wrote:
 > >
 > > Current copy-up is not efficient for big sparse file,
 > > It's not only slow but also wasting more disk space
 > > when the target lower file has huge hole inside.
 > > This patch tries to recognize file hole and skip it
 > > during copy-up.
 > >
 > > Detail logic of hole detection as below:
 > > When we detect next data position is larger than current
 > > position we will skip that hole, otherwise we copy
 > > data in the size of OVL_COPY_UP_CHUNK_SIZE. Actually,
 > > it may not recognize all kind of holes and sometimes
 > > only skips partial of hole area. However, it will be
 > > enough for most of the use cases.
 > >
 > > Additionally, this optimization relies on lseek(2)
 > > SEEK_DATA implementation, so for some specific
 > > filesystems which do not support this feature
 > > will behave as before on copy-up.
 > >
 > > Signed-off-by: Chengguang Xu <cgxu519@xxxxxxxxxxxx>
 > 
 > Sorry for so many rounds.
 > With some nits fixed below you may add:
 > Reviewed-by: Amir Goldstein <amir73il@xxxxxxxxx>
 > 
 > > ---
 > > v1->v2:
 > > - Set file size when the hole is in the end of the file.
 > > - Add a code comment for hole copy-up improvement.
 > > - Check SEEK_DATA support before doing hole skip.
 > > - Back to original copy-up when seek data fails(in error case).
 > >
 > > v2->v3:
 > > - Detect big continuous holes in an effective way.
 > > - Modify changelog and code comment.
 > > - Set file size in the end of ovl_copy_up_inode().
 > >
 > >  fs/overlayfs/copy_up.c | 43 ++++++++++++++++++++++++++++++++++++++++--
 > >  1 file changed, 41 insertions(+), 2 deletions(-)
 > >
 > > diff --git a/fs/overlayfs/copy_up.c b/fs/overlayfs/copy_up.c
 > > index b801c6353100..10a2ae452393 100644
 > > --- a/fs/overlayfs/copy_up.c
 > > +++ b/fs/overlayfs/copy_up.c
 > > @@ -123,6 +123,9 @@ static int ovl_copy_up_data(struct path *old, struct path *new, loff_t len)
 > >         loff_t old_pos = 0;
 > >         loff_t new_pos = 0;
 > >         loff_t cloned;
 > > +       loff_t old_next_data_pos;
 > 
 > If you initialize data_pos = -1 ....
 > 
 > > +       loff_t hole_len;
 > > +       bool skip_hole = false;
 > >         int error = 0;
 > >
 > >         if (len == 0)
 > > @@ -144,7 +147,11 @@ static int ovl_copy_up_data(struct path *old, struct path *new, loff_t len)
 > >                 goto out;
 > >         /* Couldn't clone, so now we try to copy the data */
 > >
 > > -       /* FIXME: copy up sparse files efficiently */
 > > +       /* Check if lower fs supports seek operation */
 > > +       if (old_file->f_mode & FMODE_LSEEK &&
 > > +           old_file->f_op->llseek)
 > > +               skip_hole = true;
 > > +
 > >         while (len) {
 > >                 size_t this_len = OVL_COPY_UP_CHUNK_SIZE;
 > >                 long bytes;
 > > @@ -157,6 +164,38 @@ static int ovl_copy_up_data(struct path *old, struct path *new, loff_t len)
 > >                         break;
 > >                 }
 > >
 > > +               /*
 > > +                * Fill zero for hole will cost unnecessary disk space
 > > +                * and meanwhile slow down the copy-up speed, so we do
 > > +                * an optimization for hole during copy-up, it relies
 > > +                * on SEEK_DATA implementation in lower fs so if lower
 > > +                * fs does not support it, copy-up will behave as before.
 > > +                *
 > > +                * Detail logic of hole detection as below:
 > > +                * When we detect next data position is larger than current
 > > +                * position we will skip that hole, otherwise we copy
 > > +                * data in the size of OVL_COPY_UP_CHUNK_SIZE. Actually,
 > > +                * it may not recognize all kind of holes and sometimes
 > > +                * only skips partial of hole area. However, it will be
 > > +                * enough for most of the use cases.
 > > +                */
 > > +
 > > +               if (skip_hole) {
 > 
 > ... you could test (skip_hole && old_pos != data_pos) {
 > 
 > because if (old_pos == data_pos) then we just got here from
 > continue after skipping hole and there is no need to call llseek again.
 > Am I right?

Good point!
I'll check more precise condition like below.

if (skip_hole && data_pos < old_pos) {

Do llseek check

}

 > 
 > > +                       old_next_data_pos = vfs_llseek(old_file,
 > > +                                               old_pos, SEEK_DATA);
 > > +                       if (old_next_data_pos > old_pos) {
 > > +                               hole_len = old_next_data_pos - old_pos;
 > 
 > IMO, if you shorten var name to data_pos, it will not be any less
 > clear what it means and indentation will not be as messy.
 > 

Okay, let's truncate the var name.

Thanks,
Chengguang