Re: rdma_cm segfaults on RoCE with ConnectX-4 [WAS: Re: rping segfault with 4.9.28 on CentOS 7.3]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, May 18, 2017 at 09:59:02AM -0600, Robert LeBlanc wrote:
> On Wed, May 17, 2017 at 11:07 PM, Leon Romanovsky <leon@xxxxxxxxxx> wrote:
> > On Wed, May 17, 2017 at 12:14:18PM -0600, Robert LeBlanc wrote:
> >> Since I have a connectX-3 card in this same box, I set it up as
> >> Infiniband. I can run all the tests (udaddy, rping, ib_send_bw with -R
> >> or -z) using the Infiniband link, but the RoCE ConnectX-4 LX segfault
> >> on any rdma_cm communications.
> >>
> >> I put the ConnectX-3 into Ethernet mode and ran the tests again and it
> >> passed all of them while the ConnectX-4 LX cards still failed. We have
> >> some ConnectX-4 EN 100 Gb cards in other boxes that have the same
> >> problem.
> >>
> >> It really looks like this problem is specific to ConnectX-4 (mlx5
> >> driver) when running in RoCE. I _don't_ have ConnectX-4 IB cards to
> >> test. We are also seeing the problem with the Mellanox drivers. I
> >> can't find http://www.mellanox.com/page/custom_firmware_table to build
> >> a new OEM firmware for my SuperMicro branded cards to test the latest
> >> firmware.
> >
> > Robert,
> >
> > Please avoid top-posting, It is unreadable.
> >
> > In regards to your issue, the best way to move forward is to open
> > customer issue request and leverage established procedures to get
> > proper and prompt customer channel support.
> >
> > Thanks
>
> Are you saying to open a case with Mellanox? I performed all my tests
> with the in-box drivers, which I thought the community would be
> interested in. I _also_ ran my tests on the Mellanox OFED driver to
> see if it was something specific to the in-box drivers or consistent
> across both as an additional point of information to help with
> resolving the problem. The problem shows up in the in-box driver and
> the Mellanox OFED although a little different.

Nothing stops you from opening ticket in parallel. You will get prompt
resolution (the customer service measures it), dedicated engineer,
reproduction in-house, custom FW if needed.

Thanks

>
> Thanks
>
> ----------------
> Robert LeBlanc
> PGP Fingerprint 79A2 9CA4 6CC4 45DD A904  C70E E654 3BB2 FA62 B9F1

Attachment: signature.asc
Description: PGP signature


[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux