Re: [PATCH 3/6] sunrpc: close connection when a request is irretrievably lost.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, Sep 22, 2010 at 09:37:25AM +1000, Neil Brown wrote:
> On Tue, 21 Sep 2010 16:53:43 -0400
> "J. Bruce Fields" <bfields@xxxxxxxxxxxx> wrote:
> 
> > Apologies for the delay getting to the rest of these.
> > 
> > On Thu, Aug 12, 2010 at 05:04:07PM +1000, NeilBrown wrote:
> > > If we drop a request in the sunrpc layer, either due kmalloc failure,
> > > or due to a cache miss when we could not queue the request for later
> > > replay, then close the connection to encourage the client to retry sooner.
> > > 
> > > Note that if the drop happens in the NFS layer, NFSERR_JUKEBOX
> > > (aka NFS4ERR_DELAY) is returned to guide the client concerning
> > > replay.
> > 
> > Looks fine, but:
> > 
> > > diff --git a/net/sunrpc/svc.c b/net/sunrpc/svc.c
> > > index d9017d6..6359c42 100644
> > > --- a/net/sunrpc/svc.c
> > > +++ b/net/sunrpc/svc.c
> > > @@ -1055,6 +1055,9 @@ svc_process_common(struct svc_rqst *rqstp, struct kvec *argv, struct kvec *resv)
> > >  		goto err_bad;
> > >  	case SVC_DENIED:
> > >  		goto err_bad_auth;
> > > +	case SVC_CLOSE:
> > > +		if (test_bit(XPT_TEMP, &rqstp->rq_xprt->xpt_flags))
> > > +			svc_close_xprt(rqstp->rq_xprt);
> > 
> > There are also dropit's later in svc_process_common when xdr encoding
> > fails.  I wonder if we should close there?
> > 
> > Well, it's an odd case.  Seems like it should almost be declared a
> > programming error and made a WARN().
> > 
> > Applying as is.--b.
> 
> Thanks, I was wondering what had happened to these...
> 
> I cannot find them in your git though - not pushed yet, or am I looking in
> the wrong branch (for-2.6.37 and nfsd-next seem the obvious choices).

Yeah, that's the right place, apologies, I just tend to say "applied"
when I add them to my local tree, but then I wait till some tests have
run at least before actually pushing them out.

> For the other 'dropit' cases I think the svc_authorise failure is most likely
> to be interesting.  That can only fail if svcauth_gss_wrap_resp_* fails.  I
> don't really know what that means that ... maybe we should close the
> connection in that case.

If dropping the connection is our way of communicating "I've lost one of
your requests, you're never going to see the reply until you retry",
then perhaps the only time we should drop is when we've deferred.

(In which case, maybe s/SVC_DROP/SVC_DEFERRED/.)

Anyway, they're all pushed out now.  The last one required more fixing
up (thanks to my gratuitous cleanup patch), but I think it's right....

--b.
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux