On Tue, Mar 04, 2025 at 05:11:54PM -0500, Frank Li wrote: > On Tue, Mar 04, 2025 at 11:50:10AM -0600, Bjorn Helgaas wrote: > > On Mon, Mar 03, 2025 at 04:57:29PM -0500, Frank Li wrote: > > > On Wed, Feb 26, 2025 at 06:23:26PM -0600, Bjorn Helgaas wrote: > > > > On Tue, Jan 28, 2025 at 05:07:36PM -0500, Frank Li wrote: > > > > > Introduce `parent_bus_offset` in `resource_entry` and a new API, > > > > > `pci_add_resource_parent_bus_offset()`, to provide necessary information > > > > > for PCI controllers with address translation units. > > > > > > > > > > Typical PCI data flow involves: > > > > > CPU (CPU address) -> Bus Fabric (Intermediate address) -> > > > > > PCI Controller (PCI bus address) -> PCI Bus. > > > > > > > > > > While most bus fabrics preserve address consistency, some modify addresses > > > > > to intermediate values. The `parent_bus_offset` enables PCI controllers to > > > > > translate these intermediate addresses correctly to PCI bus addresses. > > > > > > > > > > Pave the road to remove hardcoded cpu_addr_fixup() and similar patterns in > > > > > PCI controller drivers. > > > > > ... > > > > > > > > > +++ b/drivers/pci/of.c > > > > > @@ -402,7 +402,17 @@ static int devm_of_pci_get_host_bridge_resources(struct device *dev, > > > > > res->flags &= ~IORESOURCE_MEM_64; > > > > > } > > > > > > > > > > - pci_add_resource_offset(resources, res, res->start - range.pci_addr); > > > > > + /* > > > > > + * IORESOURCE_IO res->start is io space start address. > > > > > + * IORESOURCE_MEM res->start is cpu start address, which is the > > > > > + * same as range.cpu_addr. > > > > > + * > > > > > + * Use (range.cpu_addr - range.parent_bus_addr) to align both > > > > > + * IO and MEM's parent_bus_offset always offset to cpu address. > > > > > + */ > > > > > + > > > > > + pci_add_resource_parent_bus_offset(resources, res, res->start - range.pci_addr, > > > > > + range.cpu_addr - range.parent_bus_addr); > > > > > > > > I don't know exactly where it needs to go, but I think we can call > > > > .cpu_addr_fixup() once at startup on the base of the region. This > > > > will tell us the offset that applies to the entire region, i.e., > > > > parent_bus_offset. > > > > > > > > Then we can remove all the .cpu_addr_fixup() calls in > > > > cdns_pcie_host_init_address_translation(), > > > > cdns_pcie_set_outbound_region(), and dw_pcie_prog_outbound_atu(). > > > > > > > > Until we can get rid of all the .cpu_addr_fixup() implementations, > > > > We'll still have that single call at startup (I guess once for cadence > > > > and another for designware), but it should simplify the current > > > > callers quite a bit. > > > > > > I don't think it can simple code. cdns_pcie_set_outbound_region() and > > > dw_pcie_prog_outbound_atu() are called by EP functions, which have not use > > > "resource" to manage outbound windows. > > > > Let's ignore cadence for now. I don't think we need to solve that > > until later. > > > > dw_pcie_prog_outbound_atu() is called by: > > > > - dw_pcie_other_conf_map_bus(): atu.parent_bus_addr = pp->cfg0_base > > > > I think dw_pcie_host_init() can set pp->cfg0_base with the correct > > intermediate address, either via the the of_property_read_reg() or > > .cpu_addr_fixup(). And chicken and egg problem here for artpec6_pcie_cpu_addr_fixup(), which need cfg0_base. But try to use .cpu_addr_fixup() to get cfg0_base's intermediate address. Frank > > > > If dw_pcie_host_init() does this, then we don't need > > .cpu_addr_fixup() in dw_pcie_prog_outbound_atu(). > > > > - dw_pcie_rd_other_conf(): atu.parent_bus_addr = pp->io_base > > > > Similarly, dw_pcie_host_init() should be able to set pp->io_base > > to the intermediate address, so we don't need .cpu_addr_fixup() in > > dw_pcie_prog_outbound_atu(). > > I found some driver's cpu_addr_fixup()'s implement depend on the > initilize sequence. > > for example: > pcie-artpec6.c > > static u64 artpec6_pcie_cpu_addr_fixup(struct dw_pcie *pci, u64 cpu_addr) > { > struct artpec6_pcie *artpec6_pcie = to_artpec6_pcie(pci); > struct dw_pcie_rp *pp = &pci->pp; > struct dw_pcie_ep *ep = &pci->ep; > > switch (artpec6_pcie->mode) { > case DW_PCIE_RC_TYPE: > return cpu_addr - pp->cfg0_base; > case DW_PCIE_EP_TYPE: > return cpu_addr - ep->phys_base; > default: > dev_err(pci->dev, "UNKNOWN device type\n"); > } > return cpu_addr; > } > > This implement require *cfg0_base* and *phys_base*, pp/ep, need set before > call artpec6_pcie_cpu_addr_fixup(). > > static u64 visconti_pcie_cpu_addr_fixup(struct dw_pcie *pci, u64 cpu_addr) > { > struct dw_pcie_rp *pp = &pci->pp; > > return cpu_addr & ~pp->io_base; > } > > this one require *io_base* and *pp* need be set before call > visconti_pcie_cpu_addr_fixup() > > Because I have not such hardware platform, it is not trivial change and > it is hard to involve bugs. > > If move .cpu_addr_fixup() too early, it will cause kernel dump. > > I suggest keep current overall sequent and try to clean up these driver's > cpu_addr_fixup() firstly. > > Frank > > > > > - dw_pcie_iatu_setup(): atu.parent_bus_addr = entry->res->start > > > > Here "entry" iterates through bridge->windows, and we should be > > able to set entry->parent_bus_offset at init-time, using > > .cpu_addr_fixup() if necessary, so we can apply that offset > > unconditionally, regardless of use_parent_dt_ranges, and we won't > > need .cpu_addr_fixup() in dw_pcie_prog_outbound_atu(). > > > > - dw_pcie_pme_turn_off: > > atu.parent_bus_addr = pci->pp.msg_res->start - pci->pp.msg_parent_bus_offset > > > > This should be the same as dw_pcie_iatu_setup() since > > msg_parent_bus_offset comes from the window iteration in > > dw_pcie_host_request_msg_tlp_res(). As long as the windows have > > the correct parent_bus_offset at init-time, we should be all set.