Re: [PATCH] delalloc: add quota handling

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, 2008-06-21 at 16:18 +0530, Aneesh Kumar K.V wrote:
> On Fri, Jun 20, 2008 at 06:14:38PM -0700, Mingming wrote:
> > Correct quota handling in delayed allocation. With this patch,
> > the quota for blocks are counted at block reservation time when
> > the fs free blocks counter are updated, instead of at later 
> > block allocation time.
> > 
> > Signed-off-by: Mingming Cao <cmm@xxxxxxxxxx>
> > ---
> >  fs/ext4/balloc.c  |    8 +++++---
> >  fs/ext4/inode.c   |    5 +++++
> >  fs/ext4/mballoc.c |   22 ++++++++++++----------
> >  3 files changed, 22 insertions(+), 13 deletions(-)
> > 
> > Index: linux-2.6.26-rc6/fs/ext4/balloc.c
> > ===================================================================
> > --- linux-2.6.26-rc6.orig/fs/ext4/balloc.c	2008-06-20 18:06:02.000000000 -0700
> > +++ linux-2.6.26-rc6/fs/ext4/balloc.c	2008-06-20 18:06:05.000000000 -0700
> > @@ -1716,7 +1716,8 @@ ext4_fsblk_t ext4_old_new_blocks(handle_
> >  	/*
> >  	 * Check quota for allocation of this block.
> >  	 */
> > -	if (DQUOT_ALLOC_BLOCK(inode, num)) {
> > +	if (!EXT4_I(inode)->i_delalloc_reserved_flag &&
> > +	     DQUOT_ALLOC_BLOCK(inode, num)) {
> >  		*errp = -EDQUOT;
> >  		return 0;
> >  	}
> > @@ -1928,7 +1929,8 @@ allocated:
> > 
> >  	*errp = 0;
> >  	brelse(bitmap_bh);
> > -	DQUOT_FREE_BLOCK(inode, *count-num);
> > +	if (!EXT4_I(inode)->i_delalloc_reserved_flag)
> > +		DQUOT_FREE_BLOCK(inode, *count-num);
> >  	*count = num;
> >  	return ret_block;
> > 
> > @@ -1942,7 +1944,7 @@ out:
> >  	/*
> >  	 * Undo the block allocation
> >  	 */
> > -	if (!performed_allocation)
> > +	if (!EXT4_I(inode)->i_delalloc_reserved_flag && !performed_allocation)
> >  		DQUOT_FREE_BLOCK(inode, *count);
> >  	brelse(bitmap_bh);
> >  	return 0;
> > Index: linux-2.6.26-rc6/fs/ext4/inode.c
> > ===================================================================
> > --- linux-2.6.26-rc6.orig/fs/ext4/inode.c	2008-06-20 18:06:02.000000000 -0700
> > +++ linux-2.6.26-rc6/fs/ext4/inode.c	2008-06-20 18:06:05.000000000 -0700
> > @@ -1450,6 +1450,9 @@ static int ext4_da_reserve_space(struct 
> >  		return -ENOSPC;
> >  	}
> > 
> > +	if (DQUOT_ALLOC_BLOCK(inode, total))
> > +		return -EDQUOT;
> > +
> 
> We should not be counting meta-data blocks for quota. I guess this
> should be 
> 
> 	if (DQUOT_ALLOC_BLOCK(inode, nrblocks))
> 		return -EDQUOT;
> 

Metadata blocks are also accounted for quota today with mballoc and non
mballoc allocation.

> Also i think we should be doing quota check first. 

Okay, I will move the quota check/update before checking fs free blocks

> In mballoc we request
> for less number of blocks if quota is limited. In case of
> ext4_da_reserve_space even though we get called only with one block
> reservation request to make the code correct i guess we should do the
> mballoc equivalent.
> 
> while (ar->len && DQUOT_ALLOC_BLOCK(ar->inode, ar->len)) {
> 	ar->flags |= EXT4_MB_HINT_NOPREALLOC;
> 	ar->len--;
> }
> 
Hmm.. not sure if the requested allocation length can be reduced since
this is all needed for allocating one block. Reduce the length may
result in later ENOSPC.  BTW ar is a temprory variable that only alive
in the path doing block allocation, we need some addition field in inode
to pass the hint flag from write_begin() time down to the writepages().

> 
> Also in mballoc we should do some test for free quota blocks and
> should set the EXT4_MB_HINT_NOPREALLOC even for delalloc.
> 

With this patch, since the quota has already accounted at write_begin()
time  with delayed allocation., Should we check the free quota again
here (at the block allocation time)?

I think doing the quota reservation at write_begin() time for delalloc
is probably the right way, as you suggested on IRC.

Mingming
> >  	/* reduce fs free blocks counter */
> >  	percpu_counter_sub(&sbi->s_freeblocks_counter, total);
> > 
> > @@ -1479,6 +1482,8 @@ void ext4_da_release_space(struct inode 
> > 
> >  	release = to_free + mdb_free;
> > 
> > +	DQUOT_FREE_BLOCK(inode, release);
> 
> 
> This should be
> 	DQUOT_FREE_BLOCK(inode, to_free);
> 
> > +
> >  	/* update fs free blocks counter for truncate case */
> >  	percpu_counter_add(&sbi->s_freeblocks_counter, release);
> > 
> > Index: linux-2.6.26-rc6/fs/ext4/mballoc.c
> > ===================================================================
> > --- linux-2.6.26-rc6.orig/fs/ext4/mballoc.c	2008-06-20 18:06:01.000000000 -0700
> > +++ linux-2.6.26-rc6/fs/ext4/mballoc.c	2008-06-20 18:06:05.000000000 -0700
> > @@ -4037,7 +4037,7 @@ ext4_fsblk_t ext4_mb_new_blocks(handle_t
> >  	struct super_block *sb;
> >  	ext4_fsblk_t block = 0;
> >  	int freed;
> > -	int inquota;
> > +	int inquota = 0;
> > 
> >  	sb = ar->inode->i_sb;
> >  	sbi = EXT4_SB(sb);
> > @@ -4059,15 +4059,17 @@ ext4_fsblk_t ext4_mb_new_blocks(handle_t
> >  		return 0;
> >  	}
> > 
> > -	while (ar->len && DQUOT_ALLOC_BLOCK(ar->inode, ar->len)) {
> > -		ar->flags |= EXT4_MB_HINT_NOPREALLOC;
> > -		ar->len--;
> > -	}
> > -	if (ar->len == 0) {
> > -		*errp = -EDQUOT;
> > -		return 0;
> > +	if (!EXT4_I(ar->inode)->i_delalloc_reserved_flag) {
> > +		while (ar->len && DQUOT_ALLOC_BLOCK(ar->inode, ar->len)) {
> > +			ar->flags |= EXT4_MB_HINT_NOPREALLOC;
> > +			ar->len--;
> > +		}
> 
> 
> We need to set ar->flags even for delalloc. Otherwise we will try to
> normalize the request in mballoc.
> 
> 
> > +		if (ar->len == 0) {
> > +			*errp = -EDQUOT;
> > +			return 0;
> > +		}
> > +		inquota = ar->len;
> >  	}
> > -	inquota = ar->len;
> > 
> >  	if (EXT4_I(ar->inode)->i_delalloc_reserved_flag)
> >  		ar->flags |= EXT4_MB_DELALLOC_RESERVED;
> > @@ -4134,7 +4136,7 @@ repeat:
> >  out2:
> >  	kmem_cache_free(ext4_ac_cachep, ac);
> >  out1:
> > -	if (ar->len < inquota)
> > +	if (!EXT4_I(ar->inode)->i_delalloc_reserved_flag && ar->len < inquota)
> >  		DQUOT_FREE_BLOCK(ar->inode, inquota - ar->len);
> > 
> >  	return block;
> > 
> > 
> 
> -aneesh

--
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Reiser Filesystem Development]     [Ceph FS]     [Kernel Newbies]     [Security]     [Netfilter]     [Bugtraq]     [Linux FS]     [Yosemite National Park]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Samba]     [Device Mapper]     [Linux Media]

  Powered by Linux