From: Leon Romanovsky <leonro@xxxxxxxxxxxx> Hi, This is user space part of previously sent kernel part. It is marked as RFC because libmlx5 patch is not final and will be rewritten, however the concept is pretty solid. I'm posting the series to the ML in attempt to gather feedback, and I already aware of need to extend man pages to include definitions of struct ibv_ece. Thanks ------------------------------------------------------------------------------------- Enhanced Connection Established or ECE is new negotiation scheme introduced in IBTA v1.4 to exchange extra information about nodes capabilities and later negotiate them at the connection establishment phase. The RDMA-CM messages (REQ, REP, SIDR_REQ and SIDR_REP) were extended to carry two fields, one new and another gained new functionality: * VendorID is a new field that indicates that common subset of vendor option bits are supported as indicated by that VendorID. * AttributeModifier already exists, but overloaded to indicate which vendor options are supported by this VendorID. The general (success) communication flow can be described by the following table: ------------------------------------------------------------------------------ Requester (client) | Responder (server) ----------------------------------------------------------------------------- 1. Create data QP. | 2. Get ECE information about this QP. | 3. Update REQ message with local ECE data.| 4. Send REQ message with rdma_connect(). | | 5. Get REQ message with rdma_get_events. | 6. Read remote ECE data from the REQ message. | 7. Create data QP. | 8. Set in QP the desired ECE options by giving remote ECE data. | 9. Read accepted local ECE options. |10. Modify QP based on those options. |11. Fill local ECE options in REP message. |12. Send REP message with rdma_accept(). 13. Receive REP message. | 14. Read remote ECE data from REP message.| 15. Set in QP remote ECE data | 16. Modify QP based on remote ECE data | ------------------------------------------------------------------------------ In case the server decides to reject connection, the items #9-10 will be replaced with rdma_reject_ece() call that will send REJ message together with "ECE options not supported" reason as described in the IBTA. Thanks Leon Romanovsky (11): Update kernel headers libibverbs: Add interfaces to configure and use ECE mlx5: Implement ECE callbacks libibverbs: Document ECE API debian: Install all available librdmacm man pages librdmacm: Provide interface to use ECE for external QPs librdmacm: Connect rdma_connect to the ECE librdmacm: Return ECE results through rdma_accept librdmacm: Add an option to reject ECE request librdmacm: Implement ECE handshake logic librdmacm: Document ECE API CMakeLists.txt | 2 +- debian/libibverbs1.symbols | 5 +- debian/librdmacm-dev.install | 53 +--------- debian/librdmacm1.symbols | 4 + kernel-headers/rdma/rdma_user_cm.h | 15 ++- libibverbs/CMakeLists.txt | 2 +- libibverbs/driver.h | 2 + libibverbs/dummy_ops.c | 14 +++ libibverbs/libibverbs.map.in | 6 ++ libibverbs/man/CMakeLists.txt | 2 + libibverbs/man/ibv_query_ece.3.md | 56 +++++++++++ libibverbs/man/ibv_set_ece.3.md | 61 ++++++++++++ libibverbs/verbs.c | 15 +++ libibverbs/verbs.h | 18 ++++ librdmacm/CMakeLists.txt | 2 +- librdmacm/cma.c | 130 +++++++++++++++++++++++-- librdmacm/librdmacm.map | 7 ++ librdmacm/man/CMakeLists.txt | 2 + librdmacm/man/rdma_cm.7 | 14 ++- librdmacm/man/rdma_get_remote_ece.3.md | 61 ++++++++++++ librdmacm/man/rdma_set_local_ece.3.md | 62 ++++++++++++ librdmacm/rdma_cma.h | 24 +++++ librdmacm/rdma_cma_abi.h | 15 ++- providers/mlx5/mlx5.c | 6 +- providers/mlx5/mlx5.h | 10 ++ providers/mlx5/qp.c | 33 +++++++ providers/mlx5/verbs.c | 9 ++ 27 files changed, 560 insertions(+), 70 deletions(-) create mode 100644 libibverbs/man/ibv_query_ece.3.md create mode 100644 libibverbs/man/ibv_set_ece.3.md create mode 100644 librdmacm/man/rdma_get_remote_ece.3.md create mode 100644 librdmacm/man/rdma_set_local_ece.3.md -- 2.24.1