Hi Neil- > On Mar 13, 2022, at 9:04 PM, NeilBrown <neilb@xxxxxxx> wrote: > > > When mounting NFS filesystem in a network namespace using v4, some care > must be taken to ensure a unique and stable client identity. Similar > case is needed for NFS-root and other situations. > > Add documentation explaining the requirements for the NFS identity in > these situations. > > Signed-off-by: NeilBrown <neilb@xxxxxxx> > --- > > I think I've address most of the feedback, but please forgive and remind > if I missed something. > NeilBrown > > utils/mount/nfs.man | 109 +++++++++++++++++++++++++++++++++++++++++++- > 1 file changed, 108 insertions(+), 1 deletion(-) > > diff --git a/utils/mount/nfs.man b/utils/mount/nfs.man > index d9f34df36b42..5f15abe8cf72 100644 > --- a/utils/mount/nfs.man > +++ b/utils/mount/nfs.man > @@ -1,7 +1,7 @@ > .\"@(#)nfs.5" > .TH NFS 5 "9 October 2012" > .SH NAME > -nfs \- fstab format and options for the > +nfs \- fstab format and configuration for the > .B nfs > file systems Suggest "configuration for nfs file systems" (remove "the") > .SH SYNOPSIS > @@ -1844,6 +1844,113 @@ export pathname, but not both, during a remount. For example, > merges the mount option > .B ro > with the mount options already saved on disk for the NFS server mounted at /mnt. > +.SH "NFS CLIENT IDENTIFIER" > +NFSv4 requires that the client present a unique identifier to the server > +to be used to track state such as file locks. By default Linux NFS uses > +the host name, as configured at the time of the first NFS mount, > +together with some fixed content such as the name "Linux NFS" and the > +particular protocol version. When the hostname is guaranteed to be > +unique among all client which access the same server this is sufficient. > +If hostname uniqueness cannot be assumed, extra identity information > +must be provided. The last sentence is made ambiguous by the use of passive voice. Suggest: "When hostname uniqueness cannot be guaranteed, the client administrator must provide extra identity information." I have a problem with basing our default uniqueness guarantee on hostnames "most of the time" hoping it will all work out. There are simply too many common cases where hostname stability can't be relied upon. Our sustaining teams will happily tell us this hope hasn't so far been born out. I also don't feel that nfs(5) is an appropriate place for this level of detail. Documentation/filesystems/nfs/ is more appropriate IMO. In general, man pages are good for quick summaries, not for explainers. Here, it reads like "you, a user, are going to have to do this thing that is like filling out a tax form" -- in reality it should be information that should be: - Ignorable by most folks - Used by distributors to add value by automating set up - Used for debugging large client installations Maybe I'm just stating this to understand the purpose of this patch, but it could also be used as an "Intended audience" disclaimer in this new section. > +.PP > +Some situations which are known to be problematic with respect to unique > +host names include: A little wordy. Suggest: "Situations known to be problematic with respect to unique hostnames include:" If this will eventually become part of nfs(5), I would first run this patch by documentation experts, because they might have a preference for "hostnames" over "host names" and "namespaces" over "name-spaces". Usage of these terms throughout this patch is not consistent. > +.IP \- 2 > +NFS-root (diskless) clients, where the DCHP server (or equivalent) does > +not provide a unique host name. Suggest this addition: .IP \- 2 Dynamically-assigned hostnames, where the hostname can be changed after a client reboot, while the client is booted, or if a client often repeatedly connects to multiple networks (for example if it is moved from home to an office every day). > +.IP \- 2 > +"containers" within a single Linux host. If each container has a separate > +network namespace, but does not use the UTS namespace to provide a unique > +host name, then there can be multiple effective NFS clients with the > +same host name. > +.IP \= 2 .IP \- 2 > +Clients across multiple administrative domains that access a common NFS > +server. If assignment of host name is devolved to separate domains, I don't recognize the phrase "assignment is devolved to separate domains". Can you choose a friendlier way of saying this? > +uniqueness cannot be guaranteed, unless a domain name is included in the > +host name. > +.SS "Increasing Client Uniqueness" > +Apart from the host name, which is the preferred way to differentiate > +NFS clients, there are two mechanisms to add uniqueness to the > +client identifier. > +.TP > +.B nfs.nfs4_unique_id > +This module parameter can be set to an arbitrary string at boot time, or > +when the > +.B nfs > +module is loaded. This might be suitable for configuring diskless clients. Suggest: "This is suitable for" > +.TP > +.B /sys/fs/nfs/client/net/identifier > +This virtual file (available since Linux 5.3) is local to the network > +name-space in which it is accessed and so can provided uniqueness between > +network namespaces (containers) when the hostname remains uniform. ^provided^provide ^between^amongst and the clause at the end confused me. Suggest: "in which it is accessed and thus can provide uniqueness amongst network namespaces (containers)." > +.RS > +.PP > +This value is empty on name-space creation. > +If the value is to be set, that should be done before the first > +mount. If the container system has access to some sort of per-container > +identity then that identity, possibly obfuscated as a UUID is privacy is > +needed, can be used. Combining the identity with the name of the > +container systems would also help. I object to recommending obfuscation via a UUID. 1. This is confusing because there has been no mention of any persistence requirement so far. At this point, a reader might think that the client can simply convert the hostname and netns identifier every time it boots. However this is only OK to do if these things are guaranteed not to change during the lifetime of a client. In a world where a majority of systems get their hostnames dynamically, I think this is a shaky foundation. 2. There's no requirement that this uniquifier be in the form of a UUID anywhere in specifications, and the Linux client itself does not add such a requirement. (You suggested before that we should start by writing down requirements. Using a UUID ain't a requirement). Linux chooses to implement its uniquifer with a UUID because it is assumed we are using a random UUID (rather than a name-based or time-based UUID). A random UUID has strong global uniqueness guarantees, which guarantees the client identifier will always be unique amongst clients in nearly all situations for nearly no cost. If we want to create a good uniquifier here, then combine the hostname, netns identity, and/or the host's machine-id and then hash that blob with a known strong digest algorithm like SHA-256. A man page must not recommend the use of deprecated or insecure obfuscation mechanisms. The man page can suggest a random-based UUID as long as it states plainly that such UUIDs have global uniqueness guarantees that make them suitable for this purpose. We're using a UUID for its global uniqueness properties, not because of its appearance. > For example: > +.RS 4 > +echo "ip-netns:`ip netns identify`" \\ > +.br > + > /sys/fs/nfs/client/net/identifier > +.br > +uuidgen --sha1 --namespace @url \\ > +.br > + -N "nfs:`cat /etc/machine-id`" \\ > +.br > + > /sys/fs/nfs/client/net/identifier > +.RE > +If the container system provides no stable name, > +but does have stable storage, Here's the first mention of "stable". It needs some introduction far above. > then something like > +.RS 4 > +[ -s /etc/nfsv4-uuid ] || uuidgen > /etc/nfsv4-uuid && > +.br > +cat /etc/nfsv4-uuid > /sys/fs/nfs/client/net/identifier > +.RE > +would suffice. > +.PP > +If a container has neither a stable name nor stable (local) storage, > +then it is not possible to provide a stable identifier, so providing > +a random identifier to ensure uniqueness would be best > +.RS 4 > +uuidgen > /sys/fs/nfs/client/net/identifier > +.RE > +.RE > +.SS Consequences of poor identity setting This section provides context to understand the above technical recommendations. I suggest this whole section should be moved to near the opening paragraph. > +Any two concurrent clients that might access the same server must have > +different identifiers for correct operation, and any two consecutive > +instances of the same client should have the same identifier for optimal > +crash recovery. Also recovery from network partitions. > +.PP > +If two different clients present the same identity to a server there are > +two possible scenarios. If the clients use the same credential then the > +server will treat them as the same client which appears to be restarting > +frequently. One client may manage to open some files etc, but as soon > +as the other client does anything the first client will lose access and > +need to re-open everything. This seems fuzzy. 1. If locks are lost, then there is a substantial risk of data corruption. 2. Is the client itself supposed to re-open files, or are applications somehow notified that they need to re-open? Either of these scenarios is fraught -- I don't believe any application is coded to expect to have to re-open a file due to exigent circumstances. > +.PP > +If the clients use different credentials, then the second client to > +establish a connection to the server will be refused access. For > +.B auth=sys > +the credential is based on hostname, so will be the same if the > +identities are the same. With > +.B auth=krb > +the credential is stored in > +.I /etc/krb5.keytab > +and will be the same only if this is copied among hosts. This language implies that copying the keytab is a recommended thing to do. It's not. I mentioned it before because some customers think it's OK to use the same keytab across their client fleet. But obviously that will result in lost open and lock state. I suggest rephrasing this last sentence to describe the negative lease recovery consequence of two clients happening to share the same host principal -- as in "This is why you shouldn't share keytabs..." > +.PP > +If the identity is unique but not stable, for example if it is generated > +randomly on each start up of the NFS client, then crash recovery is > +affected. When a client shuts down uncleanly and restarts, the server > +will normally detect this because the same identity is presented with > +different boot time (or "incarnation verifier"), and will discard old > +state. If the client presents a different identifier, then the server > +cannot discard old state until the lease time has expired, and the new > +client may be delayed in opening or locking files that it was > +previously accessing. > .SH FILES > .TP 1.5i > .I /etc/fstab > -- > 2.35.1 > -- Chuck Lever