Regression: Connect-X5 doesn't connect with NVME-of

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

We've experienced a regression with using nvme-of and two Connect-X5s. With v4.15 and v4.14.16 we see the following dmesgs when trying to connect to the target:

[   43.732539] nvme nvme2: creating 16 I/O queues.
[   44.072427] nvmet: adding queue 1 to ctrl 1.
[   44.072553] nvmet: adding queue 2 to ctrl 1.
[   44.072597] nvme nvme2: Connect command failed, error wo/DNR bit: -16402
[   44.072609] nvme nvme2: failed to connect queue: 3 ret=-18
[   44.075421] nvmet_rdma: freeing queue 2
[   44.075792] nvmet_rdma: freeing queue 1
[   44.264293] nvmet_rdma: freeing queue 3
*snip*

(on v4.15 there is additional error panics likely do to some other nvme-of error handling bugs)

And nvme connect returns:

Failed to write to /dev/nvme-fabrics: Invalid cross-device link

The two adapters are the same with the latest available firmware:

	transport:			InfiniBand (0)
	fw_ver:				16.21.2010
	vendor_id:			0x02c9
	vendor_part_id:			4119
	hw_ver:				0x0
	board_id:			MT_0000000010

We bisected to find the commit that broke our setup is:

05e0cc84e00c net/mlx5: Fix get vector affinity helper function

Thanks,

Logan
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]

  Powered by Linux