On Thu, Feb 15, 2024 at 08:40:06PM +0000, Perry, Daniel wrote: > Before we spend more time investigating, I first thought I'd ask if the maintainers would be open to reviewing a set of patches that backport the NFSD filecache improvements to LTS kernel 5.10. From my perspective, these patches are core to nfsd being performant and stable with nfsv4. The changes included in the original patch series are large, but from what I can tell have been relatively bug free since being introduced to the mainline. > > I believe we would not be the only ones who would benefit if these changes were backported to a 5.x LTS kernel. It appears others have attempted to backport some of these changes to their own 5.x kernels (see https://marc.info/?l=linux-kernel&m=167286008910652&w=2, https://marc.info/?l=linux-nfs&m=169269659416487&w=2). Both of these submissions indicate that they encountered some issues after they backported, the latter of which mentioned a later patch resolved (https://marc.info/?l=linux-nfs&m=167293078213110&w=2). However, I'm unsure if this later patch is needed since LTS kernel 6.1 is still without this commit. The above two examples provide some hesitation on our side for backporting these changes without some assistance/guidance. We (Oracle) have been discussing this internally as well. I'm not a big fan of backporting large patch series. Generally, if a stable kernel is not working for you, the best course of action is for you to upgrade. But I know this is not always feasible. In this case Jeff and I never found an adequate reproducer, so we can't nail down exactly where in the series the problem was finally resolved. And I think the community would be better off if we had an upstream-tested backport rather than every distribution rolling their own. Further, the upstream community now has more standardized CI that works for not just the upstream kernel but also the 5.x stable kernels as well. And, I now have some branches in my kernel.org repo where we can collect patches specific to each stable kernel, to organize the testing and review process before we send pull requests to Greg and Sasha. (Perhaps) the bad news is I would like to see the performance and stability issues addressed for all stable kernels between 5.4, where the filecache was introduced, and 6.1, the kernel release just before things stabilized again. Maybe 5.4 is not practical? But I think fixing only 5.10.y is not good enough. As long as the community, and especially the author of these patches, is involved I think we can make this happen. Can we start with v6.1.y, which should be simpler? Do you have testing or CI in place to tell when nfsd is working satisfactorily? -- Chuck Lever