Re: i_version, NFSv4 change attribute

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, Nov 23, 2009 at 01:51:05PM -0500, tytso@xxxxxxx wrote:
> Now, all of this having been said, Feodra 11 and 12 have been using
> ext4 as the default filesystem, and for generic desktop usage, people
> haven't been screaming about the increased CPU overhead implied by
> engaging the jbd2 machinery on every sys_write().
> 
> However, we have had a report that some enterprise database developers
> have noticed the increased overhead in ext4, and this is on our list
> of things that require some performance tuning.  Hence my comments
> about a mount option to adjust s_time_gran for the benefit of database
> workloads, and once we have that moun option, since enabling i_version
> would mean once again needing to update the inode at every single
> write(2) call, we would be back with the same problem.
> 
> Maybe we can find a way to be more clever about doing some (but not
> all) of the jbd2 work on each sys_write(), and deferring as much as
> possible to the commit handling.  We need to do some investigating to
> see if that's possible.  Even if it isn't, though, my gut tells me
> that we will probably be able to enable i_version by default for
> desktop workloads, and tell database server folks that they should
> mount with the mount options "noi_version,time_gran=1s", or some such.
> 
> I'd like to do some testing to confirm my intuition first, of course,
> but that's how I'm currently leaning.   Does that make sense?

I think so, thanks.

So do I have this todo list approximately right?:

	1. Use an atomic type instead of a spinlock for i_version, and
	do some before-and-after benchmarking of writes (following your
	suggestions in
	http://marc.info/?l=linux-ext4&m=125900130605891&w=2)

	2. Turn on i_version by default.  (At this point it shouldn't be
	making things any worse than the high-resolution timestamps
	are.)

	3. Find someone to run database benchmarks, and work on
	noi_version,time_gran=1s (or whatever) options for their case.

I wish I could volunteer at least for #1, but embarassingly don't have
much more than dual-core machines lying around right now to test with.

--b.
--
To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Ext4 Filesystem]     [Union Filesystem]     [Filesystem Testing]     [Ceph Users]     [Ecryptfs]     [AutoFS]     [Kernel Newbies]     [Share Photos]     [Security]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux Cachefs]     [Reiser Filesystem]     [Linux RAID]     [Samba]     [Device Mapper]     [CEPH Development]
  Powered by Linux