Re: [RFC][PATCH] client cannot get lock after other client got lock occur network partition.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 11/11/2009 04:34 AM, Mi Jinlong wrote:
> Hi Trond
> 
> Trond Myklebust 写道:
>> On Tue, 2009-11-10 at 17:38 +0800, Mi Jinlong wrote:
>>> Hi Trond
>>>
>>> Trond Myklebust 写道:
>>>> On Mon, 2009-11-09 at 17:19 +0800, Mi Jinlong wrote:
>>>>> Hi Trond et all
>>>>>
>>>>> There is a bug, when i test NFSv3 file's lock as followed:
>>>>>
>>>>> Step1: ClientA and ClientB open a same nfs file;
>>>>> Step2: ClientA locks file with write lock, it's ok;
>>>>> Step3: Cut off the network between ClientA and Server;
>>>>> Step4: ClientB can not acquire for write lock successful forever, even though
>>>>>        the network partition larger than NLM_HOST_EXPIRE.
>>>>>
>>>>> As i know, If use NFSv4, step4 can success after LEASE_TIME.
>>>>>
>>>>> Is it necessary to fix NFSv3 ? 
>>>>>
>>>>> The attached patch can make this case OK, but i am not sure it's good.
>>>> Unfortunately, NLM (the NFSv2 and v3 locking protocol) is not lease
>>>> based, so the above scenario is truly an unfixable one.
>>>>
>>>> The problem with applying your patch is, in essence, that we risk
>>>> breaking another scenario where a client grabs a lock, and then holds it
>>>> for a while.
>>>> The reason this breaks is that there is no equivalent in the NLM
>>>> protocol of the NFSv4 RENEW operation to tell the server that "This
>>>> client is still alive and wants you to keep its state".
>>> Thanks for your answer!
>>>
>>> This bug seems serious, shouldn't we fix it?
>>
>> Unless you can think of a fix which works with the current NLM protocol,
>> I'd suggest simply encouraging people to move to a protocol with lease
>> based locks: i.e. NFSv4...
> 
> Can we add a process(like NFSv4's nfsd4) to call the nlm_gc_hosts() periodically?
> At nlm_gc_hosts, then call rpc_ping() to check whether network is OK, if not,
> its resource will be release.
> 

This would also violate the semantics that the current NLM has.
If, while holding the lock, the client does not need to contact
the server, it may not even notice the network partition and
will continue to expect that it holds the lock.

It might have been interesting to fix this problem about 20
years ago.  However, nowadays, we just live with it.  If it is
a real problem, then using NFSv4 can be a good solution.

		ps
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux