On Tue, Mar 23, 2021 at 01:12:33PM +0200, Nikolay Aleksandrov wrote:
> On 23/03/2021 01:51, Vladimir Oltean wrote:
> > From: Vladimir Oltean <vladimir.oltean@xxxxxxx>
> >
> > When a switchdev port starts offloading a LAG that is already in a
> > bridge and has an FDB entry pointing to it:
> >
> > ip link set bond0 master br0
> > bridge fdb add dev bond0 00:01:02:03:04:05 master static
> > ip link set swp0 master bond0
> >
> > the switchdev driver will have no idea that this FDB entry is there,
> > because it missed the switchdev event emitted at its creation.
> >
> > Ido Schimmel pointed this out during a discussion about challenges with
> > switchdev offloading of stacked interfaces between the physical port and
> > the bridge, and recommended to just catch that condition and deny the
> > CHANGEUPPER event:
> > https://lore.kernel.org/netdev/20210210105949.GB287766@xxxxxxxxxxxx/
> >
> > But in fact, we might need to deal with the hard thing anyway, which is
> > to replay all FDB addresses relevant to this port, because it isn't just
> > static FDB entries, but also local addresses (ones that are not
> > forwarded but terminated by the bridge). There, we can't just say 'oh
> > yeah, there was an upper already so I'm not joining that'.
> >
> > So, similar to the logic for replaying MDB entries, add a function that
> > must be called by individual switchdev drivers and replays local FDB
> > entries as well as ones pointing towards a bridge port. This time, we
> > use the atomic switchdev notifier block, since that's what FDB entries
> > expect for some reason.
> >
>
> I get the reason to have both bridge and bridge port devices (although the bridge
> is really unnecessary as it can be inferred from the port), but it looks kind of
> weird at first glance, I mean we get all of the port's fdbs and all of the bridge
> fdbs every time (dst == NULL). The code itself is correct and the alternative
> to take only 1 net_device and act based on its type would add another
> step to the process per-port which also doesn't sound good...
> There are a few minor const nits below too, again if there is another version
> please take care of them, for the patch:
>
> Acked-by: Nikolay Aleksandrov <nikolay@xxxxxxxxxx>

Thanks for the review.

For host MDB entries, those are already offloaded to every bridge port
(which yes, is still giving me headaches), so replaying them for every
port that calls br_mdb_replay is at least consistent with that.

For br_fdb_replay, honestly I am not yet sure because mainline DSA does
not yet handle local FDBs, I might end up touching things up a little
when I come back to the "RX filtering in DSA" series (I need to address
Ido's feedback by then too). I would just like to get something started.
It's even possible that by the end of the kernel development cycle, the
end result might not even look anything remotely similar to what we have
here - this is just what I deemed as "good enough as a small first step".

If nobody has objections or sees problems with the current series, I
think I'd prefer to send a follow-up with the const conversions, so I
can spam less people with another 11 emails.
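
For anyone following along who wants the replay semantics discussed above
in concrete form, here is a rough user-space sketch. It is only a toy
model, not the code from the patch: every name in it (toy_fdb_entry,
toy_fdb_replay, toy_count_cb) is made up for illustration, and a dst_port
of -1 is an assumed stand-in for the "dst == NULL" local entries. It walks
a table and hands a callback the entries that point at the given port,
plus all local ones, which is the "we get all of the port's fdbs and all
of the bridge fdbs every time" behaviour.

```c
#include <assert.h>
#include <stddef.h>

/* Toy user-space model, NOT kernel code. All names are hypothetical. */
#define TOY_ETH_ALEN 6
#define TOY_DST_LOCAL (-1)	/* assumed stand-in for "dst == NULL" */

struct toy_fdb_entry {
	unsigned char addr[TOY_ETH_ALEN];
	int dst_port;		/* bridge port index, or TOY_DST_LOCAL */
};

typedef void (*toy_fdb_cb)(const struct toy_fdb_entry *entry, void *ctx);

/*
 * Replay every entry that points towards @port, as well as every local
 * entry, to @cb. Entries for other ports are skipped.
 * Returns the number of entries replayed.
 */
static int toy_fdb_replay(const struct toy_fdb_entry *fdb, size_t count,
			  int port, toy_fdb_cb cb, void *ctx)
{
	int replayed = 0;
	size_t i;

	assert(cb != NULL);

	for (i = 0; i < count; i++) {
		if (fdb[i].dst_port != port &&
		    fdb[i].dst_port != TOY_DST_LOCAL)
			continue;

		cb(&fdb[i], ctx);
		replayed++;
	}

	return replayed;
}

/* Example callback: count the entries seen; @ctx points to an int. */
static void toy_count_cb(const struct toy_fdb_entry *entry, void *ctx)
{
	(void)entry;
	(*(int *)ctx)++;
}
```

Calling toy_fdb_replay() once per joining port delivers the local entries
to each caller again, which is the redundancy Nikolay pointed out.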