Re: rdma_cm segfaults on RoCE with ConnectX-4 [WAS: Re: rping segfault with 4.9.28 on CentOS 7.3]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, May 17, 2017 at 11:07 PM, Leon Romanovsky <leon@xxxxxxxxxx> wrote:
> On Wed, May 17, 2017 at 12:14:18PM -0600, Robert LeBlanc wrote:
>> Since I have a connectX-3 card in this same box, I set it up as
>> Infiniband. I can run all the tests (udaddy, rping, ib_send_bw with -R
>> or -z) using the Infiniband link, but the RoCE ConnectX-4 LX segfault
>> on any rdma_cm communications.
>>
>> I put the ConnectX-3 into Ethernet mode and ran the tests again and it
>> passed all of them while the ConnectX-4 LX cards still failed. We have
>> some ConnectX-4 EN 100 Gb cards in other boxes that have the same
>> problem.
>>
>> It really looks like this problem is specific to ConnectX-4 (mlx5
>> driver) when running in RoCE. I _don't_ have ConnectX-4 IB cards to
>> test. We are also seeing the problem with the Mellanox drivers. I
>> can't find http://www.mellanox.com/page/custom_firmware_table to build
>> a new OEM firmware for my SuperMicro branded cards to test the latest
>> firmware.
>
> Robert,
>
> Please avoid top-posting, It is unreadable.
>
> In regards to your issue, the best way to move forward is to open
> customer issue request and leverage established procedures to get
> proper and prompt customer channel support.
>
> Thanks

Are you saying to open a case with Mellanox? I performed all my tests
with the in-box drivers, which I thought the community would be
interested in. I _also_ ran my tests on the Mellanox OFED driver to
see if it was something specific to the in-box drivers or consistent
across both as an additional point of information to help with
resolving the problem. The problem shows up in the in-box driver and
the Mellanox OFED although a little different.

Thanks

----------------
Robert LeBlanc
PGP Fingerprint 79A2 9CA4 6CC4 45DD A904  C70E E654 3BB2 FA62 B9F1
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux