Re: [PATCH v2] PCI: Make sure the bus bridge powered on when scanning bus

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2020/12/12 6:27, Bjorn Helgaas wrote:
> On Fri, Sep 25, 2020 at 06:23:06PM +0800, Yicong Yang wrote:
>> When the bus bridge is runtime suspended, we'll fail to rescan
>> the devices through sysfs as we cannot access the configuration
>> space correctly when the bridge is in D3hot.
>> It can be reproduced like:
>>
>> $ echo 1 > /sys/bus/pci/devices/0000:80:00.0/0000:81:00.1/remove
>> $ echo 1 > /sys/bus/pci/devices/0000:80:00.0/pci_bus/0000:81/rescan
>>
>> 0000:80:00.0 is root port and is runtime suspended and we cannot
>> get 0000:81:00.1 after rescan.
>>
>> Make bridge powered on when scanning the child bus, by adding
>> pm_runtime_get_sync()/pm_runtime_put() in pci_scan_child_bus_extend().
>>
>> A similar issue is met and solved by
>> commit d963f6512e15 ("PCI: Power on bridges before scanning new devices")
>> which rescan the devices through /sys/bus/pci/devices/0000:80:00.0/rescan.
>> The callstack is like:
>>
>> dev_rescan_restore()
>>   pci_rescan_bus()
>>     pci_scan_bridge_extend()
>>       pci_scan_child_bus_extend() /* will wake up the bridge with this patch */
>>
>> With this patch the issue is also resolved, so let's remove the calls of
>> pm_runtime_*() in pci_scan_bridge_extend().
> 
> I'm sorry, I feel like an idiot, but I totally lost whatever
> understanding I had of this patch.  Here's what I *think* I
> understand:
> 
> PCI devices always respond to config transactions unless they're in D3cold,
> but a bridge only forwards config transactions when it is in D0.  When a
> bridge is runtime suspended, we can access config space of the bridge
> itself, but not of anything on its secondary side.  If a bridge is in a
> low-power state, we must bring it back to D0 before enumerating devices
> below it.
> 
> Prior to d963f6512e15 ("PCI: Power on bridges before scanning new
> devices"), this rescan could fail if 00:01.0 were suspended:
> 
>   # echo 1 > /sys/bus/pci/devices/0000:00:01.0/0000:01:00.0/remove
>   # echo 1 > /sys/bus/pci/devices/0000:00:01.0/rescan
> 
> d963f6512e15 fixed this with the following addition (call tree at the time):
> 
>   dev_rescan_store(dev 00:01.0)
>     pci_rescan_bus(bus 00)
>       pci_scan_child_bus(bus 00)
>         for (devfn = 0; devfn < 0x100; devfn += 8)
>           pci_scan_slot(bus 00, dev 00.0, 01.0, etc)
>         list_for_each_entry(dev, &bus->devices)
>           pci_scan_bridge(bus 00, dev 01.0)
>  +          pm_runtime_get_sync(dev 00:01.0)  # enables config below 00:01.0
>             pci_scan_child_bus(bus 01)
>               for (devfn = 0; devfn < 0x100; devfn += 8)
>                 pci_scan_slot(bus 01, dev 00.0, 01.0, etc)
>                   # config read of 01:00.0 fails unless 00:01.0 is in D0
> 
> 
> Now, for *this* patch, I think you're saying that this rescan can
> still fail:
> 
>   # echo 1 > /sys/bus/pci/devices/0000:80:00.0/0000:81:00.1/remove
>   # echo 1 > /sys/bus/pci/devices/0000:80:00.0/pci_bus/0000:81/rescan
> 
> IIUC, it uses this path:
> 
>   bus_rescan_store(bus 81)                  # 81 is not a root bus
>     pci_rescan_bus_bridge_resize(80:00.0)   # (bus 81)->self
>       bus = 80:00.0->subordinate
>       pci_scan_child_bus(bus 81)
>         pci_scan_child_bus_extend
>           for (devfn = 0; devfn < 256; devfn += 8)
>             pci_scan_slot(bus 81, dev 00.0)
>               # config read of 81:00.0 fails unless 80:00.0 is in D0
> 
> 
> Am I making any sense?

yes, it's right. and the bus rescan will also fail when going another path.

when the subordinates of 0000:80:00.0 is partly removed :

  0000:80:00.0 #root port is runtime suspended and 0000:81:00.0 is removed
    0000:81:00.1 #function is runtime suspended

then the bus rescan will go:

  bus_rescan_store (bus 81)
    if (!pci_is_root_bus(bus) && list_empty(&bus->devices))
      pci_rescan_bus_bridge_resize(bus->self);
    else
      pci_rescan_bus(bus 81) # will go this path as !list_empty(&bus->devices)
        pci_scan_child_bus(bus 81)
        ......

the config read of 81:00.0 will fail as bridge 80:00.0 is in D3hot.

> 
>> Signed-off-by: Yicong Yang <yangyicong@xxxxxxxxxxxxx>
>> ---
>> Change since v1:
>> - use an intermediate variable *bridge as suggested
>> - remove the pm_runtime_*() calls in pci_scan_bridge_extend()
>>
>>  drivers/pci/probe.c | 21 ++++++++++++---------
>>  1 file changed, 12 insertions(+), 9 deletions(-)
>>
>> diff --git a/drivers/pci/probe.c b/drivers/pci/probe.c
>> index 03d3712..747a8bc 100644
>> --- a/drivers/pci/probe.c
>> +++ b/drivers/pci/probe.c
>> @@ -1211,12 +1211,6 @@ static int pci_scan_bridge_extend(struct pci_bus *bus, struct pci_dev *dev,
>>  	u8 fixed_sec, fixed_sub;
>>  	int next_busnr;
>>
>> -	/*
>> -	 * Make sure the bridge is powered on to be able to access config
>> -	 * space of devices below it.
>> -	 */
>> -	pm_runtime_get_sync(&dev->dev);
>> -
>>  	pci_read_config_dword(dev, PCI_PRIMARY_BUS, &buses);
>>  	primary = buses & 0xFF;
>>  	secondary = (buses >> 8) & 0xFF;
>> @@ -1418,8 +1412,6 @@ static int pci_scan_bridge_extend(struct pci_bus *bus, struct pci_dev *dev,
>>  out:
>>  	pci_write_config_word(dev, PCI_BRIDGE_CONTROL, bctl);
>>
>> -	pm_runtime_put(&dev->dev);
>> -
>>  	return max;
>>  }
>>
>> @@ -2796,11 +2788,19 @@ static unsigned int pci_scan_child_bus_extend(struct pci_bus *bus,
>>  	unsigned int used_buses, normal_bridges = 0, hotplug_bridges = 0;
>>  	unsigned int start = bus->busn_res.start;
>>  	unsigned int devfn, fn, cmax, max = start;
>> -	struct pci_dev *dev;
>> +	struct pci_dev *dev, *bridge = bus->self;
>>  	int nr_devs;
>>
>>  	dev_dbg(&bus->dev, "scanning bus\n");
>>
>> +	/*
>> +	 * Make sure the bus bridge is powered on, otherwise we may not be
>> +	 * able to scan the devices as we may fail to access the configuration
>> +	 * space of subordinates.
>> +	 */
>> +	if (bridge)
>> +		pm_runtime_get_sync(&bridge->dev);
>> +
>>  	/* Go find them, Rover! */
>>  	for (devfn = 0; devfn < 256; devfn += 8) {
>>  		nr_devs = pci_scan_slot(bus, devfn);
>> @@ -2913,6 +2913,9 @@ static unsigned int pci_scan_child_bus_extend(struct pci_bus *bus,
>>  		}
>>  	}
>>
>> +	if (bridge)
>> +		pm_runtime_put(&bridge->dev);
>> +
>>  	/*
>>  	 * We've scanned the bus and so we know all about what's on
>>  	 * the other side of any bridges that may be on this bus plus
>> --
>> 2.8.1
>>
> 
> .
> 




[Index of Archives]     [DMA Engine]     [Linux Coverity]     [Linux USB]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Greybus]

  Powered by Linux