On Thu, Nov 12, 2020 at 06:33:45PM +0000, Daire Byrne wrote: > Well, yes NFSv4.2 all the way through works well for us but it's re-exporting a NFSv4.0 server (Linux OR Netapp) that seems to still show the input/output errors when dropping caches. Every other possible combination now seems to be working without ESTALE or input/errors with the lookupp emulation patches. > > So this is still not working when dropping caches on the re-export server: > > NFSv3/4.x NFSv4.0 > client --------> re-export server -------> original server > > The bit specific to the Netapp is simply that our 7-mode only supports NFSv4.0 so I can't actually test NFSv4.1/4.2 on a more modern Netapp firmware release. So I have to use NFSv3 to mount the Netapp and can then happily re-export that using NFSv4.x or NFSv3 (if the filehandles fit in 63 bytes). Oh, got it, thanks, so it's just the minor-version difference (probably the open-by-filehandle stuff that went into 4.1). > There was some discussion about NFS4_CHANGE_TYPE_IS_MONOTONIC_INCR > allowing for the hack/optimisation but I guess that is only for the > case when re-exporting NFSv4 to the eventual clients. It would not > help if you were re-exporting an NFSv3 server with NFSv3 to the > clients? I lack the deeper understanding to say anything more than > that. Oh, right, thanks for the reminder. The CHANGE_TYPE_IS_MONOTONIC_INCR optimization still looks doable to me. How does that help, anyway? I guess it avoids false positives of some kind when rpc's are processed out of order? Looking back at https://lore.kernel.org/linux-nfs/1155061727.42788071.1600777874179.JavaMail.zimbra@xxxxxxxx/ this bothers me: "I'm not exactly sure why, but the iversion of the inode gets changed locally (due to atime modification?) most likely via invocation of method inode_inc_iversion_raw. Each time it gets incremented the following call to validate attributes detects changes causing it to be reloaded from the originating server." The only call to that function outside afs or ceph code is in fs/nfs/write.c, in the write delegation case. The Linux server doesn't support write delegations, Netapp does but this shouldn't be causing cache invalidations. --b.