> -----Original Message----- > From: Shakeel Butt <shakeelb@xxxxxxxxxx> > Sent: Friday, May 12, 2023 1:07 PM > To: Zhang, Cathy <cathy.zhang@xxxxxxxxx> > Cc: Eric Dumazet <edumazet@xxxxxxxxxx>; Linux MM <linux- > mm@xxxxxxxxx>; Cgroups <cgroups@xxxxxxxxxxxxxxx>; Paolo Abeni > <pabeni@xxxxxxxxxx>; davem@xxxxxxxxxxxxx; kuba@xxxxxxxxxx; > Brandeburg@xxxxxxxxxx; Brandeburg, Jesse <jesse.brandeburg@xxxxxxxxx>; > Srinivas, Suresh <suresh.srinivas@xxxxxxxxx>; Chen, Tim C > <tim.c.chen@xxxxxxxxx>; You, Lizhen <lizhen.you@xxxxxxxxx>; > eric.dumazet@xxxxxxxxx; netdev@xxxxxxxxxxxxxxx > Subject: Re: [PATCH net-next 1/2] net: Keep sk->sk_forward_alloc as a proper > size > > On Fri, May 12, 2023 at 03:23:45AM +0000, Zhang, Cathy wrote: > > Remove the invalid mail addr added unintentionally. > > > > Sorry that was my buggy script. > > [...] > > > > > > Hi Shakeel, > > > > > > Run with the temp change you provided, the output shows it comes to > > > drain_stock_1(), Here is the call trace: > > > > > > 8.96% mc-worker [kernel.vmlinux] [k] page_counter_cancel > > > | > > > --8.95%--page_counter_cancel > > > | > > > --8.95%--page_counter_uncharge > > > drain_stock_1 > > > __refill_stock > > > refill_stock > > > | > > > --8.88%--try_charge_memcg > > > mem_cgroup_charge_skmem > > > | > > > --8.87%--__sk_mem_raise_allocated > > > __sk_mem_schedule > > > | > > > |--5.37%--tcp_try_rmem_schedule > > > | tcp_data_queue > > > | tcp_rcv_established > > > | tcp_v4_do_rcv > > > > Thanks a lot. This tells us that one or both of following scenarios are > happening: > > 1. In the softirq recv path, the kernel is processing packets from multiple > memcgs. > > 2. The process running on the CPU belongs to memcg which is different from > the memcgs whose packets are being received on that CPU. Thanks for sharing the points, Shakeel! Is there any trace records you want to collect? > > BTW have you seen this performance issue when you run the client and > server on different machines? I am wondering if RFS would be good enough > for such scenario and we only need to worry about the same machine case. The same test scenario is tried between two machines which are on the same network but with network topology, both of the two machines are with 1Gbit NIC. There is no hot paths observed, but it might because that the systems are not so busy, for memtier shows the average transfer speed is down to a few tenth of original on the same machine.