Re: failed to allocate device WQ

On 2024/12/20 18:10, Holger Kiehl wrote:
Hello,

since upgrading from kernel 6.10 to 6.11 (also 6.12) one Infiniband
card sometimes hits this error:

    kernel: workqueue: Failed to create a rescuer kthread for wq "ipoib_wq": -EINTR
    kernel: ib0: failed to allocate device WQ
    kernel: mlx5_1: failed to initialize device: ib0 port 1 (ret = -12)
    kernel: mlx5_1: couldn't register ipoib port 1; error -12

The system has two cards:

    41:00.0 Infiniband controller: Mellanox Technologies MT28908 Family [ConnectX-6]
    c4:00.0 Infiniband controller: Mellanox Technologies MT28908 Family [ConnectX-6]

If that happens one cannot use that card for TCP/IP communication. It does
not always happen, but when it does it always happens with the second
card mlx5_1. Never with mlx5_0. This happens on four different systems.

Any idea what I can do to stop this from happening?

Regards,
Holger

PS: Firmware for both cards is 20.41.1000

It is quite possible that the firmware is not compatible with the driver. IMO, you could test with Mellanox OFED.

If the driver and the firmware are compatible, this problem should disappear.

Zhu Yanjun
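
For reference when reading the quoted log: the messages mix a symbolic error name (-EINTR) with bare numbers (-12). The following is only a minimal sketch, assuming the kernel's numbering matches the userspace errno values of the machine running it (which holds for these two codes on Linux). It maps both forms to the same notation. The helper name decode_errors and the embedded LOG string are illustrative and not part of any kernel or driver tooling.

```python
import errno
import os
import re

# Log excerpt copied from the report above.
LOG = """\
kernel: workqueue: Failed to create a rescuer kthread for wq "ipoib_wq": -EINTR
kernel: ib0: failed to allocate device WQ
kernel: mlx5_1: failed to initialize device: ib0 port 1 (ret = -12)
kernel: mlx5_1: couldn't register ipoib port 1; error -12
"""


def decode_errors(log):
    """Return a sorted list of (symbolic name, number) for error codes in log.

    Hypothetical helper: it only recognises the three spellings seen in
    this log ("ret = -N", "error -N" and "-ENAME").
    """
    found = set()
    # Numeric spellings such as "ret = -12" or "error -12".
    for match in re.finditer(r"(?:ret = |error )-(\d+)", log):
        number = int(match.group(1))
        found.add((errno.errorcode.get(number, "UNKNOWN"), number))
    # Symbolic spellings such as "-EINTR".
    for match in re.finditer(r"-(E[A-Z]+)\b", log):
        name = match.group(1)
        number = getattr(errno, name, None)
        if number is not None:
            found.add((name, number))
    return sorted(found)


if __name__ == "__main__":
    for name, number in decode_errors(LOG):
        # os.strerror gives the platform's description of the code.
        print(f"-{number} = -{name}: {os.strerror(number)}")
```

Read this way, the -12 reported by mlx5_1 is -ENOMEM, while the workqueue line reports -EINTR when creating the rescuer kthread. The sketch only translates the codes; it does not establish which of the two is the underlying cause.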



