Re: [PATCH] NFS: Add acreg{min,max} and acdir{min,max} in milliseconds

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 2009-06-11 at 13:24 -0700, Amit Gud wrote:
> Trond Myklebust wrote:
> > On Jun 10, 2009, at 15:43, Amit Gud <agud@xxxxxxxxxx> wrote:
> > 
> >> Trond Myklebust wrote:
> >>> On Tue, 2009-06-09 at 18:32 -0700, Amit Gud wrote:
> >>>> This patch adds 4 new NFS mount options(acdirminms, acdirmaxms,
> >>>> acregminms,
> >>>> acregmaxms) converting already existing one into a millisecond
> >>>> resolution
> >>>> instead of seconds.
> >>>>
> >>>> Also, modifies the mountstats output to milliseconds instead of
> >>>> seconds.
> >>>
> >>> Why, exactly, do you need to control cache timeouts down to the
> >>> millisecond level?
> >>>
> >>
> >> The problem is to make the updates visible from one client to the other
> >> in less than a second and turning off caching entirely has an
> >> unacceptably high penalty.
> > 
> > Specifics, please: updates of what, exactly? Are we talking attributes,
> > data or directory contents?
> > 
> > What is your application, and why does 1ms constitute an acceptable
> > caching timeout, while 1s does not?
> > 
> 
> We have a system in which machine A needs a change made to an NFS
> directory.  But, the actual change must be executed by machine B, so A
> makes an request to B to make the update.  But, A is allowed to read
> the directory directly, so A can't in general continue until it is
> able to see the effect of the update.  With 1s caching, that means a
> 1s "sleep" after each update.
> 
> Setting caching to "0s" means that consecutive stats performed by
> machine A must always revalidate every node of each path walked.  In
> many cases, we end up wanting to stat every path component in some
> path, which would result in O(N2) getattr or lookup calls for N
> levels of directory depth.
> 
> Using a very small acdirmax (e.g. 10ms) gives us a significant
> performance boost on these consecutive stats without increasing the
> post-update "sleep" to 1s.  (Of course, if the file server delays more
> than 10ms at any step, then we will have to start over for the next
> operation.  But, the file server often has everything necessary in
> cache such that each response is well below a millisecond.)

I'm still not sure I fully understand, but here are a couple of
comments.

Firstly, opendir() will _always_ force a revalidation of the directory
attributes, so if your machine A is relying on doing an 'ls' in order to
figure out the directory contents, then that shouldn't need extra
attribute timeouts.

Secondly, we have already introduced finer control over the lookup
caching as of Linux 2.6.28 and newer:
        - if you want to ensure that machine A always see new files and
        links immediately once they have been created, then
        '-olookupcache=positive' should turn off negative dentry
        caching.
        - if you also want to ensure that it immediately sees renames,
        unlinks and such, then '-olookupcache=none' will turn off lookup
        caching altogether.
Both of these lookup cache options are more reliable than relying on the
directory attribute cache timeouts, and they have the added benefit that
they also work around the problem that Linux servers tend to have poor
mtime resolution. A 1ms cache timeout on the client won't help you much
if the server only registers changes in 1s increments.

Trond

--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux