Re: Known SMBDirect issue?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



2023-09-19 7:38 GMT+09:00, Tom Talpey <tom@xxxxxxxxxx>:
> Namjae, after building a 6.6.0-rc2 kernel to test here at the IOLab,
> I was surprised to see the smbdirect connection break during the
> Connectathon "special" tests. The basic tests all work fine, but shortly
> after the special tests begin, I start seeing this on the server (this
> is with softRoCE, though I see similar failures over softiWarp):
I'll try to reproduce it tonight. I found no problems in testing with
the Windows client last week. I will have to check with cifs.ko &
softROCE.

Thanks for your report!
>
> [ 1266.623187] rxe0: qp#17 do_complete: non-flush error status = 2
> [ 1266.623233] ksmbd: smb_direct: Recv error. status='local QP operation
> error (2)' opcode=0
> [ 1266.623605] ksmbd: smb_direct: disconnected
> [ 1266.623610] ksmbd: sock_read failed: -107
> [ 1266.628656] rxe0: qp#18 do_complete: non-flush error status = 2
> [ 1266.628684] ksmbd: smb_direct: Recv error. status='local QP operation
> error (2)' opcode=0
> [ 1266.628820] ksmbd: smb_direct: disconnected
> [ 1266.628824] ksmbd: sock_read failed: -107
> [ 1266.633354] rxe0: qp#19 do_complete: non-flush error status = 2
> [ 1266.633380] ksmbd: smb_direct: Recv error. status='local QP operation
> error (2)' opcode=0
> [ 1266.633583] ksmbd: smb_direct: disconnected
>
> The local QP error 2 is IB_WC_LOC_QP_OP_ERR, which is a buffer error
> of some sort, could be a receive buffer unavailable or maybe a length
> overrun. Both of these seem highly improbable, because the "basic" tests
> run fine. The client sees only a disconnection with IB_WC_REM_OP_ERR,
> which is expected in this case.
>
> OTOH it could be a client send issue, maybe a too-large datagram or an
> smbdirect credit overrun. But it's the server detecting the error, so
> I'm starting there for now.
>
> This worked as recently as 6.5, definitely it was all fine in 6.4. I am
> not yet able to drill down to the level of figuring out what SMB3
> payload was being received by ksmbd.
>
> Steve tells me you test over RDMA semi-often. Have you seen this?
> Any ideas are welcome.
>
> Tom.
>



[Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux