On 2/9/2012 6:14 PM, John Fastabend wrote: > On 2/9/2012 1:11 PM, jamal wrote: >> On Thu, 2012-02-09 at 09:52 -0800, John Fastabend wrote: >> >>>>> By netlink_notifier do you mean adding a notifier_block and using atomic_notifier_call_chain() >>>>> probably in rtnl_notify()? Then drivers could register with the notifier chain with >>>>> atomic_notifier_chain_register() and receive the events correctly. Or did I miss >>>>> some notifier chain that already exists? >>>> >>>> Yes. that is what I mean. The callbacks you need may or may not already be present. >> >> I'll go one step further. >> This stuff shouldnt be in the kernel at all. >> The disadvantage is you need a user space app to update the hardware. >> i.e, the same mechanism should be usable for either a switch embedded >> in a NIC or a standalone hardware switch (with/out the s/ware bridge >> presence) >> >> cheers, >> jamal >> > > Hi Jamal, > > The user space app in this case would listen for FDB updates to the SW > bridge and then mirror them at the embedded NIC. In this case it seems > easier to just add a notifier chain and let the kernel keep these in > sync. Otherwise we need a daemon in user space to replicate these. > > On the other hand if you could make the same RTM_NEWNEIGH, RTM_DELNEIGH, > and RTM_GETNEIGH work for the bridge, embedded bridge, and macvlan you > would have one common interface to drive these. But the bridge already > has this protocol/msgtype so that would require either some demux or > new protocol/msgtype pairs to be created. > > Let me think on it. I'm tempted by the simplicity of adding notifier > hooks though. > > .John > Actually because the bridge is adding/removing fdb entries dynamically maybe its best this gets done in kernel. Here's the example case, ---------- --------- | ethx.y | <---- E | veth0 | <--- A ---------- --------- | | | | | | | -------------- | | SW Bridge | <--- B | -------------- | | | | | --------- | | eth0 | <--- C | --------- | | ----------------------------------- | embedded switch | <--- D ----------------------------------- | | G With the flow by letters above hope this is not too difficult to follow. (A) veth0 a virtual device transmits packet destined for ethx.y (B) SW bridge receives frames and updates FDB flooding to C (C) eth0 the PF in this case sends the frame to the HW backed by the embedded bridge (D) The HW embedded switch has a static entry for ethx.y and forwards the frame to the VF or if its a broadcast frame also floods it to the wire and ethx.y (E) ethx.y receives the frame and generates a response to the dest mac of veth0 Now here is the potential issue, (G) The frame transmitted from ethx.y with the destination address of veth0 but the embedded switch is not a learning switch. If the FDB update is done in user space its possible (likely?) that the FDB entry for veth0 has not been added to the embedded switch yet. Now we either have to flood the frame which is not horrible but not ideal or worse if the embedded switch does not support flooding send it to the wire and veth0 never receives it. If the SW bridge pushes the FDB update down into the embedded switch the address is for sure in the embedded switches forwarding tables and the switching works as expected. So to handle this case correctly its probably best IMHO to use a notifier hook. Having a RTM_GETNEIGH for the embedded switch implemented though would be nice for dumping the FDB of the embedded switch and SET/DEL could be used to configure the FDB when its not being driven by the SW switch. Of course we should try to be minimalists here. .John -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html