Re: [PATCH] NLM: add network test when host expire but hold lock at nlm_gc_hosts

Chuck Lever <chuck.lever@xxxxxxxxxx> · Wed, 2 Dec 2009 12:20:20 -0500

On Dec 2, 2009, at 12:09 PM, J. Bruce Fields wrote:
On Wed, Dec 02, 2009 at 09:29:03AM -0500, Trond Myklebust wrote:
On Wed, 2009-12-02 at 07:26 -0500, Jeff Layton wrote:
On Wed, 02 Dec 2009 17:47:04 +0800
Mi Jinlong <mijinlong@xxxxxxxxxxxxxx> wrote:

After a client get lock, it's network partition for some reasons.
other client cannot get lock success forever.

This patch can avoid this problem using rpc_ping to test client's
network when host expired but hold lock.

If the client's network is partition, server will release client's
lock, other client will get lock success.

Signed-off-by: mijinlong@xxxxxxxxxxxxxx

Yikes! That sounds like it'll make locking subject to the  
reliability
of the network. I don't think that's a good idea.

What might be more reasonable is to consider implementing something
like the clear_locks command in Solaris. That is, a way for an  
admin to
remove server-side locks held by a client that he knows is never  
going
to come back. With that, this sort of thing at least becomes a  
willful
act...

Agreed on both counts.

We should not be changing the semantics of either NFSv3 or NLM at  
this
time. That will break existing setups that are treating NFSv3 as  
being a
stable platform.
As I've said in previous correspondence: NFSv4 already offers lease
based locking. If people are worried about network partitions and/or
locks being held by clients that are dead, then they can switch to  
that.

On the other hand, a clear_locks command could be useful in order to
tell a server that a given client is dead. It should be fairly easy  
to
leverage the existing NSM/statd protocol to implement this.

Oh, so all clear_locks does is send an nsm notification?  Yeah, that
sounds like a completely reasonable project for someone.

If you send an SM_NOTIFY to statd, it will ignore it if it doesn't  
recognize the mon_name.  statd also checks the sender's IP address,  
which would be different in this case than that actual peer's IP  
address.

The SM_NOTIFY RPC does not have a return value, so there's no way to  
know whether your command was effective (other than seeing that the  
locks are still held).

clear_locks would have to read /var/lib/nfs/statd/sm/foo to get the  
RPC proc/vers/proc and priv arguments if it were to send an NLM  
downcall.

So, using NSM might be a simple approach, but not a robust one, IMO.

I've always wanted to have the kernel's NSM hosts cache exported via / 
sys (or similar).  That would make it somewhat easier to see what's  
going on, and provide a convenient sysctl-like interface for local  
commands to make adjustments such as this (or for statd to gather more  
information than is available from an SM_MON request).

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com

--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html