Re: [PATCH RFC v2] nfsd: don't revoke delegations that a client has stated it doesn't have

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Oct 20, 2015 at 02:34:15PM -0400, Andrew W Elble wrote:
> 
> > Are clients really allowed to just lose a delegation?  (Have you seen
> > such a case, other than the duplicate-delegation case which you already
> > fixed?)
> 
> In short, yes, we're still seeing it. We have also been seeing increasing
> stability from the work that has been done (which also increases the
> time between replication).
> 
> The reason for v2 was I got coverage on the destruction path in
> testing and discovered my mistake in v1. This one is extremely
> frustrating to chase down (I've been 14+ hours deep in
> packet captures to try and find the allocation to the recall - I
> keep running out of disk space).

Ugh.  Maybe patching in some well-chosen printk's could help.

Or doing some filtering as you capture.   You could capture just
operations that might have a delegation stateid--OPEN, DELEGRETURN,
CB_RECALL.  And probably SEQUENCE replies with any flags set.

> > Another thing we could do here is finally implement the server-side
> > support for referring triples (I think the client's done?):
> >
> > 	http://wiki.linux-nfs.org/wiki/index.php/Server_4.0_and_4.1_issues#Referring_triples
> > 	https://tools.ietf.org/html/rfc5661#section-2.10.6.3
> >
> > That would eliminate the need for the recall retries.
> >
> > Though that would still leave open the question of how to handle those
> > errors on a recall.  We still not be able to conclude that it's safe for
> > the server to destroy the delegation.
> 
> Would this be more appropriately "fixed" by supporting DELEGPURGE in a
> limited fashion to clear out cl_revoked? (I'm not quite sure
> that's a valid interpretation of RFC5561) to clear out any "lost"
> delegations?

I haven't thought it through, but that's clearly not what's intended
for, so I'm pessimistic.  Note also we'd need to implement it first.

There may be other ways the client could better recover, but we'd still
rather avoid such situations in the first place.  The recovery logic is
already complicated enough, it would be worse if it also needed to
handle a lot of cases that could only occur due to outright bugs.

--b.
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux