Re: [PATCH net-next v4 1/8] net: gro: decouple GRO from the NAPI layer

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2/7/25 12:56, Alexander Lobakin wrote:
From: Eric Dumazet <edumazet@xxxxxxxxxx>
Date: Thu, 6 Feb 2025 19:35:50 +0100

On Thu, Feb 6, 2025 at 1:15 PM Alexander Lobakin
<aleksander.lobakin@xxxxxxxxx> wrote:

From: Eric Dumazet <edumazet@xxxxxxxxxx>
Date: Wed, 5 Feb 2025 18:48:50 +0100

On Wed, Feb 5, 2025 at 5:46 PM Alexander Lobakin
<aleksander.lobakin@xxxxxxxxx> wrote:

In fact, these two are not tied closely to each other. The only
requirements to GRO are to use it in the BH context and have some
sane limits on the packet batches, e.g. NAPI has a limit of its
budget (64/8/etc.).
Move purely GRO fields into a new tagged group, &gro_node. Embed it
into &napi_struct and adjust all the references. napi_id doesn't
really belong to GRO, but:

1. struct gro_node has a 4-byte padding at the end anyway. If you
    leave napi_id outside, struct napi_struct takes additional 8 bytes
    (u32 napi_id + another 4-byte padding).
2. gro_receive_skb() uses it to mark skbs. We don't want to split it
    into two functions or add an `if`, as this would be less efficient,
    but we need it to be NAPI-independent. The current approach doesn't
    change anything for NAPI-backed GROs; for standalone ones (which
    are less important currently), the embedded napi_id will be just
    zero => no-op.

Three Ethernet drivers use napi_gro_flush() not really meant to be
exported, so move it to <net/gro.h> and add that include there.
napi_gro_receive() is used in more than 100 drivers, keep it
in <linux/netdevice.h>.
This does not make GRO ready to use outside of the NAPI context
yet.

Tested-by: Daniel Xu <dxu@xxxxxxxxx>
Acked-by: Jakub Kicinski <kuba@xxxxxxxxxx>
Reviewed-by: Toke Høiland-Jørgensen <toke@xxxxxxxxxx>
Signed-off-by: Alexander Lobakin <aleksander.lobakin@xxxxxxxxx>
---
  include/linux/netdevice.h                  | 26 +++++---
  include/net/busy_poll.h                    | 11 +++-
  include/net/gro.h                          | 35 +++++++----
  drivers/net/ethernet/brocade/bna/bnad.c    |  1 +
  drivers/net/ethernet/cortina/gemini.c      |  1 +
  drivers/net/wwan/t7xx/t7xx_hif_dpmaif_rx.c |  1 +
  net/core/dev.c                             | 60 ++++++++-----------
  net/core/gro.c                             | 69 +++++++++++-----------
  8 files changed, 112 insertions(+), 92 deletions(-)

diff --git a/include/linux/netdevice.h b/include/linux/netdevice.h
index 2a59034a5fa2..d29b6ebde73f 100644
--- a/include/linux/netdevice.h
+++ b/include/linux/netdevice.h
@@ -340,8 +340,8 @@ struct gro_list {
  };

  /*
- * size of gro hash buckets, must less than bit number of
- * napi_struct::gro_bitmask
+ * size of gro hash buckets, must be <= the number of bits in
+ * gro_node::bitmask
   */
  #define GRO_HASH_BUCKETS       8

@@ -370,7 +370,6 @@ struct napi_struct {
         unsigned long           state;
         int                     weight;
         u32                     defer_hard_irqs_count;
-       unsigned long           gro_bitmask;
         int                     (*poll)(struct napi_struct *, int);
  #ifdef CONFIG_NETPOLL
         /* CPU actively polling if netpoll is configured */
@@ -379,11 +378,14 @@ struct napi_struct {
         /* CPU on which NAPI has been scheduled for processing */
         int                     list_owner;
         struct net_device       *dev;
-       struct gro_list         gro_hash[GRO_HASH_BUCKETS];
         struct sk_buff          *skb;
-       struct list_head        rx_list; /* Pending GRO_NORMAL skbs */
-       int                     rx_count; /* length of rx_list */
-       unsigned int            napi_id; /* protected by netdev_lock */
+       struct_group_tagged(gro_node, gro,
+               unsigned long           bitmask;
+               struct gro_list         hash[GRO_HASH_BUCKETS];
+               struct list_head        rx_list; /* Pending GRO_NORMAL skbs */
+               int                     rx_count; /* length of rx_list */
+               u32                     napi_id; /* protected by netdev_lock */
+

I am old school, I would prefer a proper/standalone old C construct.

struct gro_node  {
                 unsigned long           bitmask;
                struct gro_list         hash[GRO_HASH_BUCKETS];
                struct list_head        rx_list; /* Pending GRO_NORMAL skbs */
                int                     rx_count; /* length of rx_list */
                u32                     napi_id; /* protected by netdev_lock */
};

Really, what struct_group_tagged() can possibly bring here, other than
obfuscation ?

You'd need to adjust every ->napi_id access, which is a lot.
Plus, as I wrote previously, napi_id doesn't really belong here, but
embedding it here eases life.

I'm often an old school, too, but sometimes this helps a lot.
Unless you have very strong preference on this.


Is struct_group_tagged even supported by ctags ?

In terms of maintenance, I am sorry to say this looks bad to me.

I get that it's not in good style to impose "new style" on the old folks
(esp. Founders, Maintainers). I agree that our drivers are just a random
sample in the pool of drivers, and it is not clearly a win-win to have us experiment there with "modern practices". But this particular
contribution from Olek is for benefit of all. It will be also a great
example how to use struct_group :)
I hope that we will get more struct_group usage, it's really great
mechanism to keep/pass only a part of the bigger struct (and zero/don't
care about the rest).


Even without ctags, I find git grep -n "struct xxxx {" quite good.

I also use mostly grep to find stuff, including the ` {` "trick".
And I do add wrappers when it gets tedious, here is a one for tagged
structs:
function sgrep() {     # here is the pattern: vv
	grep -EInr "struct(_group_tagged[(]| )$*( [{]|,)" |
	awk '
		/{$/ { print; normal = 1; next }
		{ tagged[tag++] = $0 }
		END {
			if (!normal)
				for (t = 0; t < tag; t++)
					print tagged[t]
		}'; }

awk resolves the complexity of "struct something," being sometimes
a parameter of a macro


compile_commands.json (already supported natively by Kbuild) + clangd is
not enough?

Elixir correctly tags struct_group()s.

napi->napi_id is used in a lot of core files and drivers, adjusting all
the references is not what I wanted to do in the series which does
completely different things.

Page Pool uses tagged struct groups, as well a ton of other different
files. Do you want to revert all this and adjust a couple thousand
references only due to ctags and grep?

(instead of just clicking on the references generated by clangd)

Thanks,
Olek






[Index of Archives]     [Linux Samsung SoC]     [Linux Rockchip SoC]     [Linux Actions SoC]     [Linux for Synopsys ARC Processors]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]


  Powered by Linux