Folks, I have to apologize here. Flow control was at one point configured correctly, but we had a power outage in our lab two weeks ago. We thought the switch had come back up OK, but late last week we were double-checking all the ports in use and found that one port had lost some settings and no longer had flow control enabled (it had also reverted from 100Gb to 50Gb for some reason). After fixing the port settings we ran IO through the weekend and the early part of this week on a variety of workloads, and we have not been able to reproduce the failure since. It looks like this one may have been caused by the lost flow control setting on the switch. Sorry for the confusion!
Yep, don't expect RoCE to work without flow-control.