Re: [PATCH RFC] svcrdma: Ignore source port when computing DRC hash

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 6/7/2019 11:43 AM, Olga Kornievskaia wrote:
On Thu, Jun 6, 2019 at 2:33 PM Chuck Lever <chuck.lever@xxxxxxxxxx> wrote:
...
@@ -211,9 +211,14 @@ static void handle_connect_req(struct rdma_cm_id *new_cma_id,
         /* Save client advertised inbound read limit for use later in accept. */
         newxprt->sc_ord = param->initiator_depth;

         /* Set the local and remote addresses in the transport */
         sa = (struct sockaddr *)&newxprt->sc_cm_id->route.addr.dst_addr;

Can you add a printk to your server to show the port value
in cm_id->route.addr.dst_addr?

What I see printed here isn't not what I see in the network trace in
wireshark for the UDP port. It's also confusing that
Connection manager communication (in my case) say between source port
55410 (which stays the same on remounts) and dst port 4791. Then NFS
traffic is between source port 63494 (which changes on remounts) and
destination port 4791. Yet, NFS reports that source port was 40403 and
destination port was 20049(!).

This is expected. 4791 is the RoCEv2 protocol destination port, and
it's a constant for all RoCEv2 connections. think of it like a tunnel,
similar to VxLAN.

RoCEv2 then applies its own mapping to demultiplex incoming packets to
the proper queue pair. This is based on the RDMA CM connection service
id's. These are based on the upper layer's requested port.

In other words, you need to apply tracing at the proper layer. Wire
level traces aren't going to be very useful.

Btw, iWARP connections won't remap ports like this. But IB and RoCE
will, since they don't natively have an actual 5-tuple defined.

Tom.

The code that I looked at (as you pointed) was for the connection
manager and that port stays constant but for the NFSoRDMA after that
it's from a new port that changes all the time (even for the Mellanox
card).

Looks like there is no way to get the "real" port thru the rdma layers
on either side.


I also see that there is nothing in the verbs API thru which we
interact with the RDMA drivers will allow us to set the port.

I suspect this would be part of the RDMA Connection Manager
interface, not part of the RDMA driver code.


Unless
we can ask the linux implementation to augment some structures to
allow us to set and query that port or is that unreasonable because
it's not in the standard API.



Signed-off-by: Chuck Lever <chuck.lever@xxxxxxxxxx>
Cc: stable@xxxxxxxxxxxxxxx
---
net/sunrpc/xprtrdma/svc_rdma_transport.c |    7 ++++++-
1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/net/sunrpc/xprtrdma/svc_rdma_transport.c b/net/sunrpc/xprtrdma/svc_rdma_transport.c
index 027a3b0..1b3700b 100644
--- a/net/sunrpc/xprtrdma/svc_rdma_transport.c
+++ b/net/sunrpc/xprtrdma/svc_rdma_transport.c
@@ -211,9 +211,14 @@ static void handle_connect_req(struct rdma_cm_id *new_cma_id,
       /* Save client advertised inbound read limit for use later in accept. */
       newxprt->sc_ord = param->initiator_depth;

-       /* Set the local and remote addresses in the transport */
       sa = (struct sockaddr *)&newxprt->sc_cm_id->route.addr.dst_addr;
       svc_xprt_set_remote(&newxprt->sc_xprt, sa, svc_addr_len(sa));
+       /* The remote port is arbitrary and not under the control of the
+        * ULP. Set it to a fixed value so that the DRC continues to work
+        * after a reconnect.
+        */
+       rpc_set_port((struct sockaddr *)&newxprt->sc_xprt.xpt_remote, 0);
+
       sa = (struct sockaddr *)&newxprt->sc_cm_id->route.addr.src_addr;
       svc_xprt_set_local(&newxprt->sc_xprt, sa, svc_addr_len(sa));



--
Chuck Lever

--
Chuck Lever








[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux