On Tue, 15 Feb 2022 16:10:14 +0000 Jonathan Cameron <Jonathan.Cameron@xxxxxxxxxx> wrote: > On Thu, 27 Jan 2022 16:27:04 -0800 > Ben Widawsky <ben.widawsky@xxxxxxxxx> wrote: > > > CXL switches have HDM decoders just like host bridges and endpoints. > > Their programming works in a similar fashion. > > > > The spec does not prohibit multiple levels of switches, however, those > > are not implemented at this time. > > > > Signed-off-by: Ben Widawsky <ben.widawsky@xxxxxxxxx> > Hi Ben, > > I'm still hammering away at trying to bring up qemu switch emulation. > Even though I know you are reworking this, seems only sensible to point > out issues when I hit them. If no longer relevant you can ignore them! > > With these bits and a few other minor tweaks the decoders now look > to be right - I just need to wire up the QEMU side so that > I don't get hardware exceptions on actually reading and writing once > the region is bound :) QEMU emulation of a very basic CXL switch at: https://gitlab.com/jic23/qemu/-/commits/cxl-v7-draft-for-test Hope that is helpful for testing next version of this. I've been testing with a 4 port switch and 4 type 3 devices and with a few hacks/fixes as mentioned in review feedback it all works (well devmem2 gives me what I expect. :) Jonathan > > Thanks, > > J > > --- > > drivers/cxl/cxl.h | 5 ++++ > > drivers/cxl/region.c | 61 ++++++++++++++++++++++++++++++++++++++++++-- > > 2 files changed, 64 insertions(+), 2 deletions(-) > > > > diff --git a/drivers/cxl/cxl.h b/drivers/cxl/cxl.h > > index 8ace6cca0776..d70d8c85d05f 100644 > > --- a/drivers/cxl/cxl.h > > +++ b/drivers/cxl/cxl.h > > @@ -96,6 +96,11 @@ static inline u8 cxl_to_ig(u16 g) > > return ilog2(g) - 8; > > } > > > > +static inline int cxl_to_ways(u8 ways) > > +{ > > + return 1 << ways; > > +} > > + > > static inline bool cxl_is_interleave_ways_valid(int iw) > > { > > switch (iw) { > > diff --git a/drivers/cxl/region.c b/drivers/cxl/region.c > > index b8982be13bfe..f748060733dd 100644 > > --- a/drivers/cxl/region.c > > +++ b/drivers/cxl/region.c > > @@ -359,6 +359,23 @@ static bool has_switch(const struct cxl_region *cxlr) > > return false; > > } > > > > +static bool has_multi_switch(const struct cxl_region *cxlr) > > +{ > > + struct cxl_memdev *ep; > > + int i; > > + > > + for_each_cxl_endpoint(ep, cxlr, i) > > + if (ep->port->depth > 3) > > + return true; > > + > > + return false; > > +} > > + > > +static struct cxl_port *get_switch(struct cxl_memdev *ep) > > +{ > > + return to_cxl_port(ep->port->dev.parent); > > +} > > + > > static struct cxl_decoder *get_decoder(struct cxl_region *cxlr, > > struct cxl_port *p) > > { > > @@ -409,6 +426,8 @@ static bool region_hb_rp_config_valid(struct cxl_region *cxlr, > > const struct cxl_decoder *rootd, > > bool state_update) > > { > > + const int region_ig = cxl_to_ig(cxlr->config.interleave_granularity); > > + const int region_eniw = cxl_to_eniw(cxlr->config.interleave_ways); > > const int num_root_ports = get_num_root_ports(cxlr); > > struct cxl_port *hbs[CXL_DECODER_MAX_INTERLEAVE]; > > struct cxl_decoder *cxld, *c; > > @@ -416,8 +435,12 @@ static bool region_hb_rp_config_valid(struct cxl_region *cxlr, > > > > hb_count = get_unique_hostbridges(cxlr, hbs); > > > > - /* TODO: Switch support */ > > - if (has_switch(cxlr)) > > + /* TODO: support multiple levels of switches */ > > + if (has_multi_switch(cxlr)) > > + return false; > > + > > + /* TODO: x3 interleave for switches is hard. */ > > + if (has_switch(cxlr) && !is_power_of_2(region_ways(cxlr))) > > return false; > > > > /* > > @@ -470,8 +493,14 @@ static bool region_hb_rp_config_valid(struct cxl_region *cxlr, > > list_for_each_entry(rp, &hb->dports, list) { > > struct cxl_memdev *ep; > > int port_grouping = -1; > > + int target_ndx; > As things currently stand, with a switch connected below a single port > of a host bridge (4 type 3 off the switch) this will program the HB > decoder to have 4 targets, all routed to the switch USP. > > There is an argument that this is correct but its not what I'd expect. > I'd expect to see 1 target only. It's not a problem for small cases, but > with enough rp and switches we can run out of targets. > > > > > for_each_cxl_endpoint_hb(ep, cxlr, hb, idx) { > > + struct cxl_decoder *switch_cxld; > > + struct cxl_dport *target; > > + struct cxl_port *switch_port; > > + bool found = false; > > + > > if (get_rp(ep) != rp) > > continue; > > > > @@ -499,6 +528,34 @@ static bool region_hb_rp_config_valid(struct cxl_region *cxlr, > > > > cxld->interleave_ways++; > > cxld->target[port_grouping] = get_rp(ep); > > + > > + /* > > + * At least one switch is connected here if the endpoint > > + * has a depth > 2 > > + */ > > + if (ep->port->depth == 2) > > + continue; > > + > > + /* Check the staged list to see if this > > + * port has already been added > > + */ > > + switch_port = get_switch(ep); > > + list_for_each_entry(switch_cxld, &cxlr->staged_list, region_link) { > > + if (to_cxl_port(switch_cxld->dev.parent) == switch_port) > > + found = true; > > break; > > > + } > > + > > + if (found) { > > + target = cxl_find_dport_by_dev(switch_port, ep->dev.parent->parent); > > + switch_cxld->target[target_ndx++] = target; > > + continue; > > + } > > + > > + target_ndx = 0; > > + > > + switch_cxld = get_decoder(cxlr, switch_port); > > + switch_cxld->interleave_ways++; > > + switch_cxld->interleave_granularity = cxl_to_ways(region_ig + region_eniw); > > I'm not following this. Perhaps comment on why this particular maths? I was assuming the switch > interleave granularity would that of the region as the switch is last level of decode. > > Need to do the equivalent here of what you do in the if (found) or the first target is missed. > Also need to be updating interleave_ways only in the found path, not here (as the default is 1) > > > } > > } > > } >