On 22-02-14 16:20:37, Jonathan Cameron wrote: > On Thu, 27 Jan 2022 16:27:01 -0800 > Ben Widawsky <ben.widawsky@xxxxxxxxx> wrote: > > > Host bridge root port verification determines if the device ordering in > > an interleave set can be programmed through the host bridges and > > switches. > > > > The algorithm implemented here is based on the CXL Type 3 Memory Device > > Software Guide, chapter 2.13.15. The current version of the guide does > > not yet support x3 interleave configurations, and so that's not > > supported here either. > > > > Signed-off-by: Ben Widawsky <ben.widawsky@xxxxxxxxx> > > > > +static struct cxl_dport *get_rp(struct cxl_memdev *ep) > > +{ > > + struct cxl_port *port, *parent_port = port = ep->port; > > + struct cxl_dport *dport; > > + > > + while (!is_cxl_root(port)) { > > + parent_port = to_cxl_port(port->dev.parent); > > + if (parent_port->depth == 1) > > + list_for_each_entry(dport, &parent_port->dports, list) > > + if (dport->dport == port->uport->parent->parent) > > + return dport; > > + port = parent_port; > > + } > > + > > + BUG(); > > I know you mentioned you were reworking this patch set anyway, but > I thought I'd give some quick debugging related feedback. > > When running against a single switch in qemu (patches out once > things are actually working), I hit this BUG() > printing dev_name for the port->uport->parent->parent gives > pci0000:0c but the matches are sort against > 0000:0c:00.0 etc > > So looks like one too many levels of parent in this case at least. Hmm. This definitely looks dubious now that I see it again. Let me try to figure out how to rework it. I think it would be good to ask Dan as well. Much of the topology relationship works from bottom up, but top down is less easy. Previously I had used pci-isms to do this but Dan has been working on keeping the two domains isolated, which I agree is a good idea. > > The other bug I haven't chased down yet is that if we happen > to have downstream ports of the switch with duplicate ids > (far too easy to do in QEMU as port_num is an optional > parameter for switch DS ports) it's detected and the probe fails > - but then it tries again and we get an infinite loop of new > ports being created and failing to probe... Is this allowed by spec? We shouldn't infinite loop, but I can't imagine the driver could do anything saner than fail to probe for such a case. > I'll get back this one once I have it working with > a valid switch config. Thanks. > > Jonathan > > > + return NULL; > > +}