Re: [PATCH] sunrpc: fix loss of task->tk_status after rpc_delay call in xprt_alloc_slot

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, 2012-05-18 at 16:33 -0400, Jeff Layton wrote:
> On Fri, 18 May 2012 16:16:28 -0400
> Jeff Layton <jlayton@xxxxxxxxxx> wrote:
> 
> > xprt_alloc_slot will call rpc_delay() to make the task wait a bit before
> > retrying when it gets back an -ENOMEM error from xprt_dynamic_alloc_slot.
> > 
> > The problem there is that rpc_delay does this:
> > 
> >     rpc_sleep_on(&delay_queue, task, __rpc_atrun);
> > 
> > ...and __rpc_atrun will reset the task->tk_status back to 0 when the
> > task wakes back up. This makes call_reserveresult abort the task with
> > -EIO.
> > 
> > The RH QA group discovered this problem when testing with low-memory
> > clients and this patch fixed it:
> > 
> >     https://bugzilla.redhat.com/show_bug.cgi?id=822189
> > 
> > Fix this by adding a new rpc_delay_action() variant that makes allows
> > us to pass in a rpc_action to reset the status back to -EAGAIN instead.
> > 
> > Cc: Andy Adamson <andros@xxxxxxxxxx>
> > Signed-off-by: Jeff Layton <jlayton@xxxxxxxxxx>
> > ---
> >  include/linux/sunrpc/sched.h |    1 +
> >  net/sunrpc/sched.c           |    9 +++++++--
> >  net/sunrpc/xprt.c            |    7 ++++++-
> >  3 files changed, 14 insertions(+), 3 deletions(-)
> > 
> > diff --git a/include/linux/sunrpc/sched.h b/include/linux/sunrpc/sched.h
> > index dc0c3cc..6cafdbb 100644
> > --- a/include/linux/sunrpc/sched.h
> > +++ b/include/linux/sunrpc/sched.h
> > @@ -241,6 +241,7 @@ struct rpc_task *rpc_wake_up_first(struct rpc_wait_queue *,
> >  					void *);
> >  void		rpc_wake_up_status(struct rpc_wait_queue *, int);
> >  int		rpc_queue_empty(struct rpc_wait_queue *);
> > +void		rpc_delay_action(struct rpc_task *, unsigned long, rpc_action);
> >  void		rpc_delay(struct rpc_task *, unsigned long);
> >  void *		rpc_malloc(struct rpc_task *, size_t);
> >  void		rpc_free(void *);
> > diff --git a/net/sunrpc/sched.c b/net/sunrpc/sched.c
> > index 994cfea..f2ff98d 100644
> > --- a/net/sunrpc/sched.c
> > +++ b/net/sunrpc/sched.c
> > @@ -613,13 +613,18 @@ static void __rpc_atrun(struct rpc_task *task)
> >  	task->tk_status = 0;
> >  }
> >  
> > +void rpc_delay_action(struct rpc_task *task, unsigned long delay, rpc_action action)
> > +{
> > +	task->tk_timeout = delay;
> > +	rpc_sleep_on(&delay_queue, task, action);
> > +}
> > +
> >  /*
> >   * Run a task at a later time
> >   */
> >  void rpc_delay(struct rpc_task *task, unsigned long delay)
> >  {
> > -	task->tk_timeout = delay;
> > -	rpc_sleep_on(&delay_queue, task, __rpc_atrun);
> > +	rpc_delay_action(task, delay, __rpc_atrun);
> >  }
> >  EXPORT_SYMBOL_GPL(rpc_delay);
> >  
> > diff --git a/net/sunrpc/xprt.c b/net/sunrpc/xprt.c
> > index b239e75..a5dc816 100644
> > --- a/net/sunrpc/xprt.c
> > +++ b/net/sunrpc/xprt.c
> > @@ -969,6 +969,11 @@ static bool xprt_dynamic_free_slot(struct rpc_xprt *xprt, struct rpc_rqst *req)
> >  	return false;
> >  }
> >  
> > +static void xprt_alloc_delay_reset_status(struct rpc_task *task)
> > +{
> > +	task->tk_status = -EAGAIN;
> > +}
> > +
> >  static void xprt_alloc_slot(struct rpc_task *task)
> >  {
> >  	struct rpc_xprt	*xprt = task->tk_xprt;
> > @@ -984,7 +989,7 @@ static void xprt_alloc_slot(struct rpc_task *task)
> >  		goto out_init_req;
> >  	switch (PTR_ERR(req)) {
> >  	case -ENOMEM:
> > -		rpc_delay(task, HZ >> 2);
> > +		rpc_delay_action(task, HZ >> 2, xprt_alloc_delay_reset_status);
> >  		dprintk("RPC:       dynamic allocation of request slot "
> >  				"failed! Retrying\n");
> >  		break;
> 
> 
> Now that I look at this more closely though...I wonder if we're going
> to end up with similar problems in the EAGAIN case. Because we have
> tk_timeout==0 at this point, I think we'll always end up having the
> tk_status set to -ETIMEDOUT in the event that xprt_dynamic_alloc_slot
> returns EAGAIN (due to __rpc_queue_timer_fn).

Am I missing something?

xprt_alloc_slot calls rpc_sleep_on with task->tk_timeout == 0, which
means __rpc_add_timer quits without adding the task to the
queue->timer_list. How would __rpc_queue_timer_fn find it?

> Perhaps we need a similar fix there? Or maybe we should just push the
> error handling for xprt_alloc_slot into call_reserveresult and bypass
> the whole problem that way?
> 
> Thoughts?

See the patch I sent out before I saw this email. We'll solve this in
call_reserveresult.

-- 
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@xxxxxxxxxx
www.netapp.com

��.n��������+%������w��{.n�����{��w���jg��������ݢj����G�������j:+v���w�m������w�������h�����٥



[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux