Re: [PATCH 4/4] NFS: Always send an unlock for FL_CLOSE

On Wed, 2017-02-22 at 10:42 -0500, Jeff Layton wrote:
> On Wed, 2017-02-22 at 09:10 -0500, Benjamin Coddington wrote:
> > On 22 Feb 2017, at 8:20, Jeff Layton wrote:
> > 
> > > On Tue, 2017-02-21 at 10:39 -0500, Benjamin Coddington wrote:
> > > > NFS attempts to wait for read and write completion before unlocking in
> > > > order to ensure that the data returned was protected by the lock.  When
> > > > this waiting is interrupted by a signal, the unlock may be skipped, and
> > > > messages similar to the following are seen in the kernel ring buffer:
> > > > 
> > > > [20.167876] Leaked locks on dev=0x0:0x2b ino=0x8dd4c3:
> > > > [20.168286] POSIX: fl_owner=ffff880078b06940 fl_flags=0x1 fl_type=0x0 fl_pid=20183
> > > > [20.168727] POSIX: fl_owner=ffff880078b06680 fl_flags=0x1 fl_type=0x0 fl_pid=20185
> > > > 
> > > > For NFSv3, the missing unlock will cause the server to refuse conflicting
> > > > locks indefinitely.  For NFSv4, the leftover lock will be removed by the
> > > > server after the lease timeout.
> > > > 
> > > > This patch fixes this issue by skipping the wait in order to immediately
> > > > send the unlock if the FL_CLOSE flag is set when signaled.
> > > > 
> > > > Signed-off-by: Benjamin Coddington <bcodding@xxxxxxxxxx>
> > > > ---
> > > >  fs/nfs/file.c | 10 +++++-----
> > > >  1 file changed, 5 insertions(+), 5 deletions(-)
> > > > 
> > > > diff --git a/fs/nfs/file.c b/fs/nfs/file.c
> > > > index a490f45df4db..df695f52bb9d 100644
> > > > --- a/fs/nfs/file.c
> > > > +++ b/fs/nfs/file.c
> > > > @@ -697,14 +697,14 @@ do_unlk(struct file *filp, int cmd, struct file_lock *fl, int is_local)
> > > >  	if (!IS_ERR(l_ctx)) {
> > > >  		status = nfs_iocounter_wait(l_ctx);
> > > >  		nfs_put_lock_context(l_ctx);
> > > > -		if (status < 0)
> > > > +		/*  NOTE: special case
> > > > +		 * 	If we're signalled while cleaning up locks on process exit, we
> > > > +		 * 	still need to complete the unlock.
> > > > +		 */
> > > > +		if (status < 0 && !(fl->fl_flags & FL_CLOSE))
> > > >  			return status;
> > > 
> > > 
> > > Hmm, I don't know if this is safe...
> > > 
> > > Suppose we have a bunch of buffered writeback going on, and we're
> > > sitting here waiting for it so we can do the unlock. The task catches a
> > > signal, and then issues the unlock while writeback is still going on.
> > > Another client then grabs the lock, and starts doing reads and writes
> > > while this one is still writing back.
> > > 
> > > I think the unlock really needs to wait until writeback is done,
> > > regardless of whether you catch a signal or not.
> > 
> > The only other way to do this is as you've suggested many times -- by
> > putting the LOCKU on a rpc_wait_queue.  But that won't work for NFSv3.
> > 
> 
> IDGI -- why wouldn't that (or something similar) work for v3?
> 
> > For NFSv4, the other thing to do might be to look up the lock state and
> > set NFS_LOCK_LOST, then unlock.  That will cause subsequent writes to
> > not attempt recovery.
> > 
> 
> No! LOCK_LOST is really for the case where we've lost contact with the
> server. We don't want to get kicked into recovery just because of a
> SIGKILL.

I suggest we add a new lock state, NFS_LOCK_UNLOCKED, to allow the
client to recognise that the failed operation should not be retried.

In fact, we might want to have nfs4_set_rw_stateid() scan for that, and
return an error that could be made to fail the operation.
Furthermore, maybe __nfs4_find_lock_state() should also ignore lock
structures in this state so that once the lock is in this state, we
always ignore new attempts.
This could also be made to work with pNFS; although for most layout
types, the locking will not be enforced by the server, we can at least
prevent new I/O requests from being generated after the
NFS_LOCK_UNLOCKED state has been declared.
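Roughly, I'm thinking of something like the following (untested sketch;
NFS_LOCK_UNLOCKED does not exist yet, and the error value is a
placeholder for whatever we settle on):

	/* e.g. somewhere in the nfs4_set_rw_stateid() path */
	lsp = __nfs4_find_lock_state(state, owner);
	if (lsp != NULL && test_bit(NFS_LOCK_UNLOCKED, &lsp->ls_flags)) {
		/* The lock was unlocked while I/O was still
		 * outstanding. Fail the operation rather than
		 * retrying it or falling back to the open stateid.
		 */
		ret = -EBADF;
		goto out;
	}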

You can't do any of the above with NFSv3, since the locking is
decoupled from the actual I/O. Is anyone actually using NLM for mission
critical applications that need this kind of protection? (followup
question: is there a good reason why they are doing so instead of using
NFSv4?)

-- 
Trond Myklebust
Linux NFS client maintainer, PrimaryData
trond.myklebust@xxxxxxxxxxxxxxx



