Re: NFS mount lockups since about a month ago

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



That network looks fine to me

I would try v3.  I have had bad luck many times with v4 on a variety
of different kernels.  If the code is recovering from something
related to a bug 45 seconds might be right to decide something that
was working is no longer working.

I am not sure any amount of debugging would help (without having
really verbose kernel debugging).

What is the current kernel you are running and trying a new one might
be worth it.  Though I don't see nfs changes/fixes listed in the
5.14.* or 5.13.* kernels changelog in the rpm file (rpm -q
--changelog) and there are only a few listed at kernel.org for  those
kernels.

On Tue, Oct 5, 2021 at 11:04 AM Terry Barnaby <terry1@xxxxxxxxxxx> wrote:
>
> sar -n EDEV reports all 0's all around then. There are some rxdrop/s of 0.02 occasionally on eno1 through the day (about 20 of these with minute based sampling).
> Today ifconfig lists 39 dropped RX packets out of 2357593. Not sure why there are some dropped packets. "ethtool -S eno1" doesn't seem to list any particular issues.
>
> sar -n DEV does not appear to show anything at 10:51:30:
>
>                  IFACE   rxpck/s   txpck/s    rxkB/s    txkB/s   rxcmp/s   txcmp/s  rxmcst/s   %ifutil
> 10:44:04         eno1     18.29     19.54      5.81      5.25      0.00      0.00      0.00      0.00
> 10:45:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:45:04         eno1     20.45     22.52      5.96      5.79      0.00      0.00      0.00      0.00
> 10:46:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:46:04         eno1     22.50     24.26      7.52      7.88      0.00      0.00      0.00      0.01
> 10:47:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:47:04         eno1     21.53     22.75      7.27      5.71      0.00      0.00      0.00      0.01
> 10:48:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:48:04         eno1    222.03    284.24    173.49    367.55      0.00      0.00      0.00      0.30
> 10:49:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:49:04         eno1     11.83     12.28      2.74      3.98      0.00      0.00      0.00      0.00
> 10:50:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:50:04         eno1     15.72     14.13      4.33      3.80      0.00      0.00      0.00      0.00
> 10:51:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:51:04         eno1     11.00     10.53      3.48      2.63      0.00      0.00      0.00      0.00
> 10:52:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:52:04         eno1     13.48     13.45      4.21      4.56      0.00      0.00      0.00      0.00
> 10:53:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
> 10:53:04         eno1     21.76     23.98      6.99     10.26      0.00      0.00      0.00      0.01
> 10:54:04           lo      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
>
> Also NFV4 uses TCP/IP I think by default and TCP/IP retries would be much quicker than 45 seconds.
> I do feel there is an issue in the NFS code somewhere, but I am biased about the speed of NFS directory access these days !
>
> On 04/10/2021 17:06, Roger Heflin wrote:
>
> Since it is recovering from it, maybe it is losing packets inside the
> network, what does "sar -n DEV" and "sar -n EDEV" look like during
> that time on both client seeing the pause and the server.
>
> EDEV is typically all zeros unless something is lost.  if something is
> being lost and it matches the times the time of hangs that could be
> it.
>
>
> _______________________________________________
> users mailing list -- users@xxxxxxxxxxxxxxxxxxxxxxx
> To unsubscribe send an email to users-leave@xxxxxxxxxxxxxxxxxxxxxxx
> Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
> List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
> List Archives: https://lists.fedoraproject.org/archives/list/users@xxxxxxxxxxxxxxxxxxxxxxx
> Do not reply to spam on the list, report it: https://pagure.io/fedora-infrastructure
_______________________________________________
users mailing list -- users@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe send an email to users-leave@xxxxxxxxxxxxxxxxxxxxxxx
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/users@xxxxxxxxxxxxxxxxxxxxxxx
Do not reply to spam on the list, report it: https://pagure.io/fedora-infrastructure



[Index of Archives]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [EPEL Devel]     [Fedora Magazine]     [Fedora Summer Coding]     [Fedora Laptop]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Desktop]     [Fedora Fonts]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Yosemite News]     [Gnome Users]     [KDE Users]     [Fedora Art]     [Fedora Docs]     [Fedora Sparc]     [Libvirt Users]     [Fedora ARM]

  Powered by Linux