On 7/5/2019 2:08 AM, Vladimir Oltean wrote: > Hi Andrew, > > On Fri, 5 Jul 2019 at 07:49, Andrew Lunn <andrew@xxxxxxx> wrote: >> >> Hi Vladimir >> >>> - DSA is typically used for discrete switches, switchdev is typically >>> used for embedded ones. >> >> Typically DSA is for discrete switches, but not exclusively. The >> b53/SF2 is embedded in a number of Broadcom SoCs. So this is no >> different to Ocelot, except ARM vs MIPS. Also, i would disagree that >> switchdev is used for embedded ones. Mellonex devices are discrete, on >> a PCIe bus. I believe Netronome devices are also discrete PCIe >> devices. In fact, i think ocelot is the only embedded switchdev >> switch. >> >> So embedded vs discrete plays no role here at all. >> > > drivers/staging/fsl-dpaa2/ethsw/ is another example of switchdev > driver for an embedded switch. > I would give it to you that the sample size is probably too small to > say 'typically', but my point was that in order to support cascaded > switches it makes more sense for those to be discrete. > >>> - The D in DSA is for cascaded switches. Apart from the absence of >>> such a "Ocelot SoC" driver (which maybe can be written, I don't know), >>> I think the switching core itself has some fundamental limitations >>> that make a DSA implementation questionable: >> >> There is no requirement to implement D in DSA. In fact, only Marvell >> does. None of the other switches do. And you will also find that most >> boards with a Marvell switch use a single device. D in DSA is totally >> optional. In fact, DSA is built from the ground up that nearly >> everything is optional. Take a look at mv88e6060, as an example. It >> implements nearly nothing. It cannot even offload a bridge to the >> switch. >> > > Let me see if I get your point. > The D is optional, and the S is optional. So what's left? :) > Also, there's a big difference between "the hardware can't do it" and > "the driver doesn't implement it". If I follow your argument, would > you write a DSA driver for a device that doesn't do L2 switching? > Along that same line, what benefit does the DSA model bring to a > switch that can't do cascading, compared to switchdev? I'm asking this > as a user, not as a developer. As an user, I don't think there are compelling arguments to either switchdev or DSA because the end result is the same: network devices that can offload "stuff". As a developer though, there is much less code to write with DSA than with switchdev to get your HW live. > >>> So my conclusion is that DSA for Felix/Ocelot doesn't make a lot of >>> sense if the whole purpose is to hide the CPU-facing netdev. >> >> You actually convinced me the exact opposite. You described the >> headers which are needed to implement DSA. The switch sounds like it >> can do what DSA requires. So DSA is the correct model. >> >> Andrew > > Somebody actually asked, with the intention of building a board, if > it's possible to cascade the LS1028A embedded switch (Felix) with > discrete SJA1105 devices - Felix being at the top of the switch tree. > Does the DSA model support heterogeneous setups (parsing stacked > headers)? I can't tell if that's how EDSA tags work. With switchdev > for Felix there wouldn't be any problem - it just wouldn't be part of > the DSA tree and its own driver would remove its tags before DSA would > look at the rest. DSA not does not make any particular assumptions about how the stacking is done actually because each slave network device is expected to provided standard Ethernet frames to the network stack. How you get to that point is entirely specific to what the hardware can do. You do what Andrew described about one of my setup (bcm_sf2 w/ tagging enabled and b53 w/o tagging, see more below why [1]]) and both being discrete switch trees, with the master netdev of the b53 being a slave netdev provided by bcm_sf2. If your tagging protocol supports it you can make them part of the same DSA switch tree and just have them have different switch identifiers, that is what Marvell switches do and it works just great. In your case, I suppose you could even use double VLAN tagging to get such cascading to work, that would limit you to a two level of cascading, unless you invent something custom. [1]: The original Broadcom tag format introduced with BCM5325/5365 did support cascading in the same way that Marvell did where a switch identifier can be added in addition to a port number within the tag. The newest Broadcom tag that was introduced with 5395 and newer dropped support for the switch identifier and the switch will "terminate" the first (from start of Ethernet frame) tag that it receives. This is the reason why we need to disable tagging on the outermost B53 device that we are connected to. This means those network devices are mainly configuration endpoints and not passing data (DSA_TAG_PROTO_NONE), though we could use DSA_TAG_PROTO_8021Q and resolve that now. -- Florian