On Tue, 17 Mar 2020 at 03:06, Joe Stringer <joe@xxxxxxxxxxx> wrote: > > On Mon, Mar 16, 2020 at 3:58 PM Martin KaFai Lau <kafai@xxxxxx> wrote: > > > > On Thu, Mar 12, 2020 at 04:36:44PM -0700, Joe Stringer wrote: > > > Add support for TPROXY via a new bpf helper, bpf_sk_assign(). > > > > > > This helper requires the BPF program to discover the socket via a call > > > to bpf_sk*_lookup_*(), then pass this socket to the new helper. The > > > helper takes its own reference to the socket in addition to any existing > > > reference that may or may not currently be obtained for the duration of > > > BPF processing. For the destination socket to receive the traffic, the > > > traffic must be routed towards that socket via local route, the socket > > I also missed where is the local route check in the patch. > > Is it implied by a sk can be found in bpf_sk*_lookup_*()? > > This is a requirement for traffic redirection, it's not enforced by > the patch. If the operator does not configure routing for the relevant > traffic to ensure that the traffic is delivered locally, then after > the eBPF program terminates, it will pass up through ip_rcv() and > friends and be subject to the whims of the routing table. (or > alternatively if the BPF program redirects somewhere else then this > reference will be dropped). Can you elaborate what "an appropriate routing configuration" would be? I'm not well versed with how routing works, sorry. Do you think being subject to the routing table is desirable, or is it an implementation trade-off? > > I think this is a general bpf_sk*_lookup_*() question, previous > discussion[0] settled on avoiding that complexity before a use case > arises, for both TC and XDP versions of these helpers; I still don't > have a specific use case in mind for such functionality. If we were to > do it, I would presume that the socket lookup caller would need to > pass a dedicated flag (supported at TC and likely not at XDP) to > communicate that SO_ATTACH_REUSEPORT_EBPF progs should be respected > and used to select the reuseport socket. I was surprised that both TC and XDP don't run the reuseport program! So far I assumed that TC did pass the skb. I understand that you don't want to tackle this issue, but is it possible to reject reuseport sockets from sk_assign in that case? -- Lorenz Bauer | Systems Engineer 6th Floor, County Hall/The Riverside Building, SE1 7PB, UK www.cloudflare.com