Hi Rob, On Mon, 4 Dec 2023 07:59:09 -0600 Rob Herring <robh@xxxxxxxxxx> wrote: > On Mon, Dec 4, 2023 at 6:43 AM Herve Codina <herve.codina@xxxxxxxxxxx> wrote: > > > > Hi Rob, > > > > On Fri, 1 Dec 2023 16:26:45 -0600 > > Rob Herring <robh@xxxxxxxxxx> wrote: > > > > > On Thu, Nov 30, 2023 at 10:57 AM Herve Codina <herve.codina@xxxxxxxxxxx> wrote: > > > > > > > > Hi, > > > > > > > > The commit 407d1a51921e ("PCI: Create device tree node for bridge") > > > > creates of_node for PCI devices. > > > > During the insertion handling of these new DT nodes done by of_platform, > > > > new devices (struct device) are created. > > > > For each PCI devices a struct device is already present (created and > > > > handled by the PCI core). > > > > Creating a new device from a DT node leads to some kind of wrong struct > > > > device duplication to represent the exact same PCI device. > > > > > > > > This patch series first introduces device_{add,remove}_of_node() in > > > > order to add or remove a newly created of_node to an already existing > > > > device. > > > > Then it fixes the DT node creation for PCI devices to add or remove the > > > > created node to the existing PCI device without any new device creation. > > > > > > I think the simpler solution is to get the DT node created earlier. We > > > are just asking for pain if the DT node is set for the device at > > > different times compared to static DT nodes. > > > > > > The following fixes the lack of of_node link. The DT unittest fails > > > with the change though and I don't see why. > > > > > > Also, no idea if the bridge part works because my qemu setup doesn't > > > create bridges (anyone got a magic cmdline to create them?). > > > > > > diff --git a/drivers/pci/bus.c b/drivers/pci/bus.c > > > index 9c2137dae429..46b252bbe500 100644 > > > --- a/drivers/pci/bus.c > > > +++ b/drivers/pci/bus.c > > > @@ -342,8 +342,6 @@ void pci_bus_add_device(struct pci_dev *dev) > > > */ > > > pcibios_bus_add_device(dev); > > > pci_fixup_device(pci_fixup_final, dev); > > > - if (pci_is_bridge(dev)) > > > - of_pci_make_dev_node(dev); > > > pci_create_sysfs_dev_files(dev); > > > pci_proc_attach_device(dev); > > > pci_bridge_d3_update(dev); > > > diff --git a/drivers/pci/of.c b/drivers/pci/of.c > > > index 51e3dd0ea5ab..e15eaf0127fc 100644 > > > --- a/drivers/pci/of.c > > > +++ b/drivers/pci/of.c > > > @@ -31,6 +31,8 @@ int pci_set_of_node(struct pci_dev *dev) > > > return 0; > > > > > > node = of_pci_find_child_device(dev->bus->dev.of_node, dev->devfn); > > > + if (!node && pci_is_bridge(dev)) > > > + of_pci_make_dev_node(dev); > > > if (!node) > > > return 0; > > > > Maybe it is too early. > > of_pci_make_dev_node() creates a node and fills some properties based on > > some already set values available in the PCI device such as its struct resource > > values. > > We need to have some values set by the PCI infra in order to create our DT node > > with correct values. > > Indeed, that's probably the issue I'm having. In that case, > DECLARE_PCI_FIXUP_HEADER should work. That's later, but still before > device_add(). > > I think modifying sysfs after device_add() is going to race with > userspace. Userspace is notified of a new device, and then the of_node > link may or may not be there when it reads sysfs. Also, not sure if > we'll need DT modaliases with PCI devices, but they won't work if the > DT node is not set before device_add(). > > Rob DECLARE_PCI_FIXUP_HEADER is too early as well as doing the DT node creation just before the device_add() call. Indeed, in order to fill the DT properties, resources need to be assigned (needed for the 'ranges' property used for addresses translation). The resources assignment is done after the call to device_add(). Some PCI sysfs files are already created after adding the device by the pci_create_sysfs_dev_files() call: https://elixir.bootlin.com/linux/v6.6/source/drivers/pci/bus.c#L347 Is it really an issue to add the of_node link to sysfs on an already present device ? Maybe we can add the of_node link before the device_add() call and add the 'ranges' property in the DT node later, once resources are assigned. In that case, the race condition is not fixed but moved from the PCI device to the DT node the device is pointing to. With DT overlays and of_changeset_*(), modifying nodes is a normal behavior. Is that acceptable for the 'ranges' property in this use-case ? Best regards, Hervé