Re: "Using NFS over UDP on high-speed links such as Gigabit can cause silent data corruption."

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Feb 28, 2012, at 9:35 AM, Chuck Lever wrote:

> 
> On Feb 28, 2012, at 7:46 AM, Jim Rees wrote:
> 
>> Jeff Layton wrote:
>> 
>> On Tue, 28 Feb 2012 16:22:01 +1100
>> Harshula <harshula@xxxxxxxxxx> wrote:
>> 
>>> Hi Steve,
>>> 
>>> The following openSUSE nfs-utils patch, warn-nfs-udp.patch, is not
>>> included upstream:
>>> 
>>> https://build.opensuse.org/package/view_file?file=warn-nfs-udp.patch&package=nfs-utils&project=openSUSE%3AFactory&rev=8e3e60c70e8270cd4afa036e13f6b2bb
>>> 
>>> Please consider including it.
>>> 
>>> Thanks,
>>> #
>>> 
>> 
>> I think that patch looks reasonable and clearly documenting the
>> problems with UDP is a wonderful thing.
>> 
>> It may be best to send it formally to steved and the list as a real
>> [PATCH] with a real description and SoB line.
>> 
>> This feels like the wrong place to document this, since it affects anything
>> that uses udp, not just nfs.
> 
> NFS has a particular sensitivity to unreliable datagram transports, and that is a well-known problem.  NetApp's retired TR-3183 and many Oracle meta documents mention the problems with NFS over UDP.  Most other uses of UDP do not involve such large datagrams.
> 
> My comment is that if the text in the TRANSPORT METHODS section in nfs(5) about UDP reassembly is not adequate it should be updated.  I would rather see the meat of the proposed text merged into that section; otherwise we have two disparate sections discussing the same topic.  That section is where this kind of discussion belongs.

A few more comments.

Any file, including a /proc file, called out in new text should be added to the FILES section, IMO.

If we can't resolve the provenance issue, someone could rewrite the patch from scratch so that it addresses the review comments.

I don't agree with adding in-code warnings.  Mount works silently unless it fails, and this is not a mount failure.  Would such warnings ever be seen for NFS mounts added to /etc/fstab, or performed by automounter?  I think by and large most people type "mount -t nfs" without options and will get our current default transport setting, which is TCP, or UDP if the server does not support TCP.  Isn't that adequate?

We also know that the risk of using UDP is mitigated by using jumbo frames, specifying a small r/wsize, or by reducing the fragment reassembly timeout.  If an admin does those things, she still gets the warning.

It seems needlessly alarmist, and useless for our most common use cases.

-- 
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux