Re: [PATCH rdma-next] RDMA/mlx5: Fix a race for an ODP MR which leads to CQE with error

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Jan 19, 2025 at 02:38:25PM +0200, Leon Romanovsky wrote:
> 
> infiniband rocep8s0f0: dump_cqe:277:(pid 0): WC error: 6,
> Message: memory bind operation error
> cqe_dump: 00000000: 00 00 00 00 00 00 00 00 00 00 00 00 00
>                     00 00 00
> cqe_dump: 00000010: 00 00 00 00 00 00 00 00 00 00 00 00 00
>                     00 00 00
> cqe_dump: 00000020: 00 00 00 00 00 00 00 00 00 00 00 00 00
>                     00 00 00
> cqe_dump: 00000030: 00 00 00 00 08 00 78 06 25 00 11 b9 00
>                     0e dd d2
> ------------[ cut here ]------------
> WARNING: CPU: 15 PID: 1506 at
> drivers/infiniband/hw/mlx5/umr.c:394
> mlx5r_umr_post_send_wait+0x15a/0x2b0 [mlx5_ib]
>           Modules linked in: ip6table_mangle ip6table_nat
> ip6table_filter ip6_tables iptable_mangle xt_conntrack xt_MASQUERADE
> nf_conntrack_netlink nfnetlink xt_addrtype iptable_nat nf_nat
> br_netfilter rpcsec_gss_krb5 auth_rpcgss oid_registry overlay rpcrdma
> rdma_ucm ib_iser libiscsi scsi_transport_iscsi rdma_cm iw_cm ib_umad
> ib_ipoib ib_cm mlx5_ib ib_uverbs ib_core fuse mlx5_core

Don't word wrap dmesg output in commit messages, ignore any checkpatch
warnings.

> Fixes: e6fb246ccafb ("RDMA/mlx5: Consolidate MR destruction to mlx5_ib_dereg_mr()")
> Signed-off-by: Yishai Hadas <yishaih@xxxxxxxxxx>
> Reviewed-by: Artemy Kovalyov <artemyko@xxxxxxxxxx>
> Signed-off-by: Leon Romanovsky <leonro@xxxxxxxxxx>
> ---
>  drivers/infiniband/hw/mlx5/mr.c  | 17 +++++++++++++++--
>  drivers/infiniband/hw/mlx5/odp.c |  2 ++
>  2 files changed, 17 insertions(+), 2 deletions(-)

Applied to for-next, thanks

Jason




[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]

  Powered by Linux