Hello,
I have just deployed a new system with Mellanox ConnectX-4 VPI EDR IB
cards and wanted to set up NFS over RDMA on it.
However, while mounting the FS over RDMA works fine, actually using it
results in the following messages absolutely hammering dmesg on both
client and server:
https://gist.github.com/BtbN/9582e597b6581f552fa15982b0285b80#file-server-log
The spam only stops once I forcibly reboot the client. The filesystem
makes no progress during all this: the retrans counter in nfsstat just
keeps going up, and nothing actually gets done.
This is on Linux 5.4.54, using nfs-utils 2.4.3.
The mlx5 driver has enhanced mode disabled in order to enable IPoIB
connected mode with an MTU of 65520.
Normal NFS 4.2 over TCP works perfectly fine on this setup; it's only
when I mount via RDMA that things go wrong.
Is this an issue on my end, or did I run into a bug somewhere here?
Any pointers, patches and solutions to test are welcome.
Thanks,
Timo Rothenpieler