On Sun, Oct 18, 2020 at 12:14 AM J. Bruce Fields <bfields@xxxxxxxxxxxx> wrote: > > On Sat, Oct 17, 2020 at 11:40:09PM +0300, Guy Keren wrote: > > according to what you wrote here, an NFS4ERR_DELAY response is > > something that needs to be sent at the level of the entire compound > > request - i.e. the server is not allowed to send a compound response > > where the first few requests have a status of NFS4_OK, while the last > > have a status of NFS4ERR_DELAY. > > Oh, no, it's absolutely fine for a server to do that. > > Sorry, you mentioned persistent sessions, so I assumed somehow this was > about retries after crashes or reboots, where the client may not have > received the reply and doesn't know whether it executed. > > > according to what you say, if the OPEN request is in the middle of the > > compound request, and is preceded by state-modifying requests (e.g. > > creation of other files, writes into other open handles, renames, > > etc.), then the server must avoid processing them until it recalled > > the delegation to the file (i.e. it must process the entire command to > > make sure it doesn't need to send an NFS4ERR_DELAY response due to any > > of the requests inside it, before it starts processing, and it must > > also lock the state of all files involved in the request, to avoid > > another client acquiring a delegation on any of the files in the > > request that have an OPEN request in the same compound. alternatively, > > it must not send an NFS4ERR_DELAY request, and instead just keep the > > request pending until the delegation recall was completed. > > No, sorry for the confusion, you're correct, if the client had a bunch > of non-idempotent ops all in one compound, and got a DELAY partway > through, then, yes, it would have to deal with retrying only the part > that didn't execute. actually, it is my understanding that, with persistent sessions, the client has no way to distinguish between a temporary network connection loss, and a server restart, if the server stores the client state (client_id and all stateids) in persistent store. so suppose that the client sent two 'Open' requests in one compound. the server finished processing the first, but then had a delegation on the 2nd one, so it is supposed to return an NFS4_OK to the first Open and a NFSERR_DELAY for the 2nd open (and this is also the compound response that the server will store in its Duplicate Request Cache). if the server had a temporary network disconnection, or had a server restart, then when the client re-connects and re-sends this compound request, it receives the response from the server's Duplicate Request Cache (with OK for the first open and DELA?Y For the 2nd). than, i presume that the client needs to accept that the first Open already succeeded, and when creating a new session, re-send only the 2nd Open request. does this make sense? > > I don't know of any client that actually does that, for what it's worth. > The Linux client, for example, doesn't send any compounds that I can > think of that have more than one nonidempotent op. does it mean that the linux NFS 4.1 client will also never send two Write requests in the same compound? and never send an Open request which might create a file, with a Write request in the same compound? because, although these are not non-idempotent requests, it could be that one of them was executed while the next one was not (at least according to the spec, the server might return NFS4ERR_DELAY for all of the NFS4.1 Request types)? > > > i would assume that the same mechanism used to create the compound > > request in the first place (adding the PUTFH in front, etc.) could be > > used during a re-building of a smaller compound request - provided > > that the client knows which requests from the compound were already > > completed - and which were not. > > > > but i understand that there's no such mechanism today on the linux NFS > > client kernel - which is what i initially asked - so that clarifies > > things. > > Right, in theory you could imagine clients doing very general things > with compounds. In practice I don't know of any that do. > > (Not that that allows a spec-compliant server to assume they won't.) > > > what about a situation in which instead of a server restart event, the > > client just disconnected before receiving a rename response, and > > re-connected with the same session to the same session? in that case, > > i presume that the Linux NFS client will re-send the compound request, > > and get the results from the server's Duplicate-Request cache, without > > returning errors to the application. correct? > > Right, assuming the client managed to hang on to its lease. right. which will be the case if the server doesn't revoke state immediately upon lease expiration, and no other client performed conflicting requests. > > > and this doesn't answer the original question: how was the "persistent > > sessions" support in the linux NFS 4.1 client tested? > > I don't know, sorry. ok, thanks. > > > on an aside - i see that you are also the maintainer of the pynfs test > > suite. would you be interested in patches fixing its install > > operation, and if yes - should we send them to this mailing list, or > > directly to you? i failed to find a mailing list dedicated to pynfs > > development. > > Just send them to me, cc'd to this list. Thanks! ok. we'll clean-up what we have and send it within a few days. thanks. > > --b. --guy keren Vast Data