Hi everyone! We encountered slowdowns in routing, and implemented a simple cache to speed it up: https://lore.kernel.org/netdev/20240307171202.232684-1-leone4fernando@xxxxxxxxx/ Our patch shows considerable improvement in our use-case (hundreds of daddrs), and non-negligible improvement in other scenarios as well. We received some pushback from the kernel community claiming this improvement is modest in real-life scenarios. We measured our changes using udp floods with different numbers of daddrs. The benchmarking setup is comprised of 3 machines: a sender, a forwarder and a receiver. We measured the PPS received by the receiver as the forwarder was running either the mainline kernel or the patched kernel, comparing the results. Does anyone have a good idea for more accurate benchmarking methods?