On Fri, Apr 20, 2012 at 11:13 PM, Jeff Layton <jlayton@xxxxxxxxxx> wrote:
> On Fri, 20 Apr 2012 15:37:26 -0500
> Malahal Naineni <malahal@xxxxxxxxxx> wrote:
>
>> Steve Dickson [SteveD@xxxxxxxxxx] wrote:
>> > > 2) if we assume that it is fairly representative of one, how can we
>> > > achieve retrying indefinitely with NFS, or at least some large finite
>> > > amount?
>> > The amount of looping would be peer speculation. If the problem can
>> > not be handled by one simple retry I would say we simply pass the
>> > error up to the app... Its an application issue...
>>
>> As someone said, ESTALE is an incorrect errno for a path based call.
>> How about turning ESTALE into ENOENT after a retry or few retries?
>>
>
> It's not really the same thing. One could envision an application
> that's repeatedly renaming a new file on top of another one. The file
> is never missing from the namespace of the server, but you could still
> end up getting an ESTALE.
>
> That would break other atomicity guarantees in an even worse way, IMO...

For directory operations ESTALE *is* equivalent to ENOENT if we are
already retrying with LOOKUP_REVAL.  Think about it.  Atomic
replacement by another directory with rename(2) is not an excuse
here, actually.  Local filesystems too can end up with an IS_DEAD
directory after lookup in that case.

For non-directories we basically have getattr and setattr.  NFSv4 can
handle both without retries if we supply the name instead of the
handle (i.e. i_op->getattr_by_name, i_op->setattr_by_name).  Other
protocols can do whatever they want: exponential backoff with a
limited number of retries, whatever.  No looping required in the VFS.

Thanks,
Miklos
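
The directory case described above (retry once with LOOKUP_REVAL, then treat a second ESTALE as ENOENT) could be sketched in user space roughly as follows. This is only an illustration in Python, not kernel code: `dir_op_with_reval`, `make_stale_op`, the `op(path, flags)` callable, and the value given to `LOOKUP_REVAL` are all hypothetical stand-ins, and return values follow the kernel convention of 0 or a negative errno.

```python
import errno

# Stand-in flag for this sketch only; not taken from kernel headers.
LOOKUP_REVAL = 1


def dir_op_with_reval(op, path):
    """Run a directory operation, retrying once with revalidation.

    `op(path, flags)` is a hypothetical callable returning 0 on success
    or a negative errno.
    """
    err = op(path, 0)
    if err != -errno.ESTALE:
        return err
    # Second attempt: ask the operation to revalidate cached lookups.
    err = op(path, LOOKUP_REVAL)
    if err == -errno.ESTALE:
        # Still stale after revalidation: report the path as missing.
        return -errno.ENOENT
    return err


def make_stale_op(stale_calls):
    """Fake operation that returns -ESTALE for its first `stale_calls` calls."""
    state = {"calls": 0}

    def op(path, flags):
        state["calls"] += 1
        return -errno.ESTALE if state["calls"] <= stale_calls else 0

    return op
```

With these fakes, an operation that is stale only on its first call succeeds on the revalidating retry, while one that is stale on both calls yields `-errno.ENOENT`. There is exactly one retry and no loop, which matches the "no looping required in the VFS" position.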