Re: [PATCH v2 2/3] shmem: implement user/group quota support for tmpfs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, Nov 23, 2022 at 10:01:37AM +0100, Lukas Czerner wrote:
> On Tue, Nov 22, 2022 at 03:57:57PM -0500, Brian Foster wrote:
> > On Mon, Nov 21, 2022 at 03:28:53PM +0100, Lukas Czerner wrote:
> > > Implement user and group quota support for tmpfs using system quota file
> > > in vfsv0 quota format. Because everything in tmpfs is temporary and as a
> > > result is lost on umount, the quota files are initialized on every
> > > mount. This also goes for quota limits, that needs to be set up after
> > > every mount.
> > > 
> > > The quota support in tmpfs is well separated from the rest of the
> > > filesystem and is only enabled using mount option -o quota (and
> > > usrquota and grpquota for compatibility reasons). Only quota accounting
> > > is enabled this way, enforcement needs to be enable by regular quota
> > > tools (using Q_QUOTAON ioctl).
> > > 
> 
> Hi Brian,
> 
> thanks for the review.
> 
> > 
> > FWIW, just from a first look through, it seems like this could be made a
> > little easier to review by splitting it up into a few smaller patches.
> > For example, could the accounting and enforcement support split into
> > separate patches?
> 
> Maybe a little, it seems a bit pointless to me.
> 

It seems like this is often the case for the patch author. ;)

For the reviewer (or for me at least), it's usually quite helpful to see
things broken down into the smallest possible changes. Not only does it
help a single review pass, but if you have multiple reviewers and expect
multiple review cycles, then it saves overall reviewer time not having
to revisit logic that has been acked and might not have changed since
last posted.

That being said, note that I don't know this code terribly well and so
this might be less relevant to others. In general, feel free to dismiss
any feedback from me that just doesn't make sense. :)

> > 
> > A few more random notes/questions...
> > 
> > > Signed-off-by: Lukas Czerner <lczerner@xxxxxxxxxx>
> > > ---
> > > v2: Use the newly introduced in-memory only quota foramt QFMT_MEM_ONLY
> > > 
> > >  Documentation/filesystems/tmpfs.rst |  12 ++
> > >  fs/quota/dquot.c                    |  10 +-
> > >  include/linux/shmem_fs.h            |   3 +
> > >  mm/shmem.c                          | 200 ++++++++++++++++++++++++----
> > >  4 files changed, 197 insertions(+), 28 deletions(-)
> > > 
> > ...
> > > diff --git a/fs/quota/dquot.c b/fs/quota/dquot.c
> > > index f1a7a03632a2..007604e7eb09 100644
> > > --- a/fs/quota/dquot.c
> > > +++ b/fs/quota/dquot.c
> > > @@ -716,11 +716,11 @@ int dquot_quota_sync(struct super_block *sb, int type)
> > >  	for (cnt = 0; cnt < MAXQUOTAS; cnt++) {
> > >  		if (type != -1 && cnt != type)
> > >  			continue;
> > > -		if (!sb_has_quota_active(sb, cnt))
> > > -			continue;
> > > -		inode_lock(dqopt->files[cnt]);
> > > -		truncate_inode_pages(&dqopt->files[cnt]->i_data, 0);
> > > -		inode_unlock(dqopt->files[cnt]);
> > > +		if (sb_has_quota_active(sb, cnt) && dqopt->files[cnt]) {
> > > +			inode_lock(dqopt->files[cnt]);
> > > +			truncate_inode_pages(&dqopt->files[cnt]->i_data, 0);
> > > +			inode_unlock(dqopt->files[cnt]);
> > > +		}
> > 
> > Perhaps a separate patch with some context for the change in the commit
> > log? (Maybe it's obvious to others, I'm just not familiar with the core
> > quota code.)
> 
> Oops, this hunk needs to be in the first patch. It is making sure that
> we won't touch dquot->files[] if we don't have any quota files which is
> the case for QFMT_MEM_ONLY format. I'll add some comment here.
> 

Ok.

> > 
> > >  	}
> > >  
> > >  	return 0;
> > ...
> > > diff --git a/mm/shmem.c b/mm/shmem.c
> > > index c1d8b8a1aa3b..26f2effd8f7c 100644
> > > --- a/mm/shmem.c
> > > +++ b/mm/shmem.c
...
> > > @@ -353,7 +429,6 @@ static void shmem_recalc_inode(struct inode *inode)
> > >  	freed = info->alloced - info->swapped - inode->i_mapping->nrpages;
> > >  	if (freed > 0) {
> > >  		info->alloced -= freed;
> > > -		inode->i_blocks -= freed * BLOCKS_PER_PAGE;
> > 
> > Did these various ->i_blocks updates get moved somewhere?
> 
> Yes, it is being taken care of by dquot_alloc_block_nodirty() and
> dquot_free_block_nodirty() in dquot_alloc_block_nodirty() and
> dquot_free_block_nodirty() respectively.
> 

Ah, Ok... I assume you mean __inode_[add|sub]_bytes(), called via
__dquot_alloc_space() and friends.

> You're not the first to ask about this, I'll put that into commit
> description.
> 

Ack, thanks.

> > 
> > >  		shmem_inode_unacct_blocks(inode, freed);
> > >  	}
> > >  }
> > ...
> > > @@ -2384,6 +2467,35 @@ static struct inode *shmem_get_inode(struct super_block *sb, struct inode *dir,
> > >  	return inode;
> > >  }
> > >  
> > > +static struct inode *shmem_get_inode(struct super_block *sb, struct inode *dir,
> > > +				     umode_t mode, dev_t dev, unsigned long flags)
> > > +{
> > > +	int err;
> > > +	struct inode *inode;
> > > +
> > > +	inode = shmem_get_inode_noquota(sb, dir, mode, dev, flags);
> > > +	if (inode) {
> > > +		err = dquot_initialize(inode);
> > > +		if (err)
> > > +			goto errout;
> > > +
> > > +		err = dquot_alloc_inode(inode);
> > > +		if (err) {
> > > +			dquot_drop(inode);
> > > +			goto errout;
> > > +		}
> > > +	}
> > > +	return inode;
> > > +
> > > +errout:
> > > +	inode->i_flags |= S_NOQUOTA;
> > 
> > I assume this is here so the free path won't unaccount an inode from the
> > quota that wasn't able to allocate, but is it needed with the
> > dquot_drop() above? If so, a comment might be helpful. :)
> 
> Yes it is needed. Quota code generally expects dquot to be initialized
> for operations such as dquot_free_inode(). It won't be in this case and
> hece we have to avoid quota accounting.
> 

Ok. FWIW, it looks to me that the dquot_free_inode() path checks for and
handles the case of NULL dquots. That said, I see this pattern is used
elsewhere in such error scenarios and on a second look, it seems like
explicitly defensive logic. Particularly to prevent something else from
perhaps trying to initialize the inode again (assuming failures would be
persistent). Makes more sense now, thanks.

Brian

> 
> > 
> > Brian
> 
> Thanks Brian!
> 
> -Lukas
> > 
> > > +	iput(inode);
> > > +	shmem_free_inode(sb);
> > > +	if (err)
> > > +		return ERR_PTR(err);
> > > +	return NULL;
> > > +}
> > > +
> > >  #ifdef CONFIG_USERFAULTFD
> > >  int shmem_mfill_atomic_pte(struct mm_struct *dst_mm,
> > >  			   pmd_t *dst_pmd,
> > > @@ -2403,7 +2515,7 @@ int shmem_mfill_atomic_pte(struct mm_struct *dst_mm,
> > >  	int ret;
> > >  	pgoff_t max_off;
> > >  
> > > -	if (!shmem_inode_acct_block(inode, 1)) {
> > > +	if (shmem_inode_acct_block(inode, 1)) {
> > >  		/*
> > >  		 * We may have got a page, returned -ENOENT triggering a retry,
> > >  		 * and now we find ourselves with -ENOMEM. Release the page, to
> > > @@ -2487,7 +2599,6 @@ int shmem_mfill_atomic_pte(struct mm_struct *dst_mm,
> > >  
> > >  	spin_lock_irq(&info->lock);
> > >  	info->alloced++;
> > > -	inode->i_blocks += BLOCKS_PER_PAGE;
> > >  	shmem_recalc_inode(inode);
> > >  	spin_unlock_irq(&info->lock);
> > >  
> > > @@ -2908,7 +3019,7 @@ shmem_mknod(struct user_namespace *mnt_userns, struct inode *dir,
> > >  	int error = -ENOSPC;
> > >  
> > >  	inode = shmem_get_inode(dir->i_sb, dir, mode, dev, VM_NORESERVE);
> > > -	if (inode) {
> > > +	if (!IS_ERR_OR_NULL(inode)) {
> > >  		error = simple_acl_create(dir, inode);
> > >  		if (error)
> > >  			goto out_iput;
> > > @@ -2924,7 +3035,8 @@ shmem_mknod(struct user_namespace *mnt_userns, struct inode *dir,
> > >  		inode_inc_iversion(dir);
> > >  		d_instantiate(dentry, inode);
> > >  		dget(dentry); /* Extra count - pin the dentry in core */
> > > -	}
> > > +	} else if (IS_ERR(inode))
> > > +		error = PTR_ERR(inode);
> > >  	return error;
> > >  out_iput:
> > >  	iput(inode);
> > > @@ -2939,7 +3051,7 @@ shmem_tmpfile(struct user_namespace *mnt_userns, struct inode *dir,
> > >  	int error = -ENOSPC;
> > >  
> > >  	inode = shmem_get_inode(dir->i_sb, dir, mode, 0, VM_NORESERVE);
> > > -	if (inode) {
> > > +	if (!IS_ERR_OR_NULL(inode)) {
> > >  		error = security_inode_init_security(inode, dir,
> > >  						     NULL,
> > >  						     shmem_initxattrs, NULL);
> > > @@ -2949,7 +3061,8 @@ shmem_tmpfile(struct user_namespace *mnt_userns, struct inode *dir,
> > >  		if (error)
> > >  			goto out_iput;
> > >  		d_tmpfile(file, inode);
> > > -	}
> > > +	} else if (IS_ERR(inode))
> > > +		error = PTR_ERR(inode);
> > >  	return finish_open_simple(file, error);
> > >  out_iput:
> > >  	iput(inode);
> > > @@ -3126,6 +3239,8 @@ static int shmem_symlink(struct user_namespace *mnt_userns, struct inode *dir,
> > >  				VM_NORESERVE);
> > >  	if (!inode)
> > >  		return -ENOSPC;
> > > +	else if (IS_ERR(inode))
> > > +		return PTR_ERR(inode);
> > >  
> > >  	error = security_inode_init_security(inode, dir, &dentry->d_name,
> > >  					     shmem_initxattrs, NULL);
> > > @@ -3443,6 +3558,7 @@ enum shmem_param {
> > >  	Opt_uid,
> > >  	Opt_inode32,
> > >  	Opt_inode64,
> > > +	Opt_quota,
> > >  };
> > >  
> > >  static const struct constant_table shmem_param_enums_huge[] = {
> > > @@ -3464,6 +3580,9 @@ const struct fs_parameter_spec shmem_fs_parameters[] = {
> > >  	fsparam_u32   ("uid",		Opt_uid),
> > >  	fsparam_flag  ("inode32",	Opt_inode32),
> > >  	fsparam_flag  ("inode64",	Opt_inode64),
> > > +	fsparam_flag  ("quota",		Opt_quota),
> > > +	fsparam_flag  ("usrquota",	Opt_quota),
> > > +	fsparam_flag  ("grpquota",	Opt_quota),
> > >  	{}
> > >  };
> > >  
> > > @@ -3547,6 +3666,13 @@ static int shmem_parse_one(struct fs_context *fc, struct fs_parameter *param)
> > >  		ctx->full_inums = true;
> > >  		ctx->seen |= SHMEM_SEEN_INUMS;
> > >  		break;
> > > +	case Opt_quota:
> > > +#ifdef CONFIG_QUOTA
> > > +		ctx->seen |= SHMEM_SEEN_QUOTA;
> > > +#else
> > > +		goto unsupported_parameter;
> > > +#endif
> > > +		break;
> > >  	}
> > >  	return 0;
> > >  
> > > @@ -3646,6 +3772,12 @@ static int shmem_reconfigure(struct fs_context *fc)
> > >  		goto out;
> > >  	}
> > >  
> > > +	if (ctx->seen & SHMEM_SEEN_QUOTA &&
> > > +	    !sb_any_quota_loaded(fc->root->d_sb)) {
> > > +		err = "Cannot enable quota on remount";
> > > +		goto out;
> > > +	}
> > > +
> > >  	if (ctx->seen & SHMEM_SEEN_HUGE)
> > >  		sbinfo->huge = ctx->huge;
> > >  	if (ctx->seen & SHMEM_SEEN_INUMS)
> > > @@ -3728,6 +3860,9 @@ static void shmem_put_super(struct super_block *sb)
> > >  {
> > >  	struct shmem_sb_info *sbinfo = SHMEM_SB(sb);
> > >  
> > > +#ifdef SHMEM_QUOTA_TMPFS
> > > +	shmem_disable_quotas(sb);
> > > +#endif
> > >  	free_percpu(sbinfo->ino_batch);
> > >  	percpu_counter_destroy(&sbinfo->used_blocks);
> > >  	mpol_put(sbinfo->mpol);
> > > @@ -3805,14 +3940,26 @@ static int shmem_fill_super(struct super_block *sb, struct fs_context *fc)
> > >  #endif
> > >  	uuid_gen(&sb->s_uuid);
> > >  
> > > +#ifdef SHMEM_QUOTA_TMPFS
> > > +	if (ctx->seen & SHMEM_SEEN_QUOTA) {
> > > +		sb->dq_op = &dquot_operations;
> > > +		sb->s_qcop = &dquot_quotactl_sysfile_ops;
> > > +		sb->s_quota_types = QTYPE_MASK_USR | QTYPE_MASK_GRP;
> > > +
> > > +		if (shmem_enable_quotas(sb))
> > > +			goto failed;
> > > +	}
> > > +#endif  /* SHMEM_QUOTA_TMPFS */
> > > +
> > >  	inode = shmem_get_inode(sb, NULL, S_IFDIR | sbinfo->mode, 0, VM_NORESERVE);
> > > -	if (!inode)
> > > +	if (IS_ERR_OR_NULL(inode))
> > >  		goto failed;
> > >  	inode->i_uid = sbinfo->uid;
> > >  	inode->i_gid = sbinfo->gid;
> > >  	sb->s_root = d_make_root(inode);
> > >  	if (!sb->s_root)
> > >  		goto failed;
> > > +
> > >  	return 0;
> > >  
> > >  failed:
> > > @@ -3976,7 +4123,12 @@ static const struct super_operations shmem_ops = {
> > >  #ifdef CONFIG_TMPFS
> > >  	.statfs		= shmem_statfs,
> > >  	.show_options	= shmem_show_options,
> > > -#endif
> > > +#ifdef CONFIG_QUOTA
> > > +	.quota_read	= shmem_quota_read,
> > > +	.quota_write	= shmem_quota_write,
> > > +	.get_dquots	= shmem_get_dquots,
> > > +#endif /* CONFIG_QUOTA */
> > > +#endif /* CONFIG_TMPFS */
> > >  	.evict_inode	= shmem_evict_inode,
> > >  	.drop_inode	= generic_delete_inode,
> > >  	.put_super	= shmem_put_super,
> > > @@ -4196,8 +4348,10 @@ static struct file *__shmem_file_setup(struct vfsmount *mnt, const char *name, l
> > >  
> > >  	inode = shmem_get_inode(mnt->mnt_sb, NULL, S_IFREG | S_IRWXUGO, 0,
> > >  				flags);
> > > -	if (unlikely(!inode)) {
> > > +	if (IS_ERR_OR_NULL(inode)) {
> > >  		shmem_unacct_size(flags, size);
> > > +		if (IS_ERR(inode))
> > > +			return (struct file *)inode;
> > >  		return ERR_PTR(-ENOSPC);
> > >  	}
> > >  	inode->i_flags |= i_flags;
> > > -- 
> > > 2.38.1
> > > 
> > > 
> > 
> 




[Index of Archives]     [Linux Ext4 Filesystem]     [Union Filesystem]     [Filesystem Testing]     [Ceph Users]     [Ecryptfs]     [NTFS 3]     [AutoFS]     [Kernel Newbies]     [Share Photos]     [Security]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux Cachefs]     [Reiser Filesystem]     [Linux RAID]     [NTFS 3]     [Samba]     [Device Mapper]     [CEPH Development]

  Powered by Linux