Re: [PATCH v9 00/26] PCI: Allow BAR movement during boot and hotplug

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Bjorn,

On Thu, 2021-01-28 at 08:53 -0600, Bjorn Helgaas wrote:
> I can certainly see scenarios where this functionality will be
> useful,
> but the series currently doesn't mention bug reports that it
> fixes.  I
> suspect there *are* some related bug reports, e.g., for Thunderbolt
> hotplug.  We should dig them up, include pointers to them, and get
> the
> reporters to test the series and provide feedback.

It never occurred to me to actually search for reports, thanks.

> We had some review traffic on earlier versions, but as far as I can
> see, nobody has stepped up with any actual signs of approval, e.g.,
> Tested-by, Reviewed-by, or Acked-by tags.  That's a problem because
> this touches a lot of sensitive code and I don't want to be stuck
> fixing issues all by myself.  When issues crop up, I look to the
> author and reviewers to help out.

Indeed, I had only a few informal personal responses [from outside of
our company]:
 - some machines have no regressions with the kernel built with Clang-
12+LTO+IAS;
 - NVMe hotplug started working for AMD Epyc with a PEX switch;
 - Another system with several PEX switches is under ongoing
experiments.

Providing an ability to quickly return to the old BARs allocating is
the reason why I keep wrapping the most intrusive code with "if
(pci_can_move_bars)". Originally I wanted it to be disabled by default,
available for those truly desperate. But there was an objection that
the code will not be ever tested this way.

> Along that line, I'm a little concerned about how we will be able to
> reproduce and debug problems.  Issues will likely depend on the
> topology, the resources of the specific devices involved, and a
> specific sequence of hotplug operations.  Those may be hard to
> reproduce even for the reporter.  Maybe this is nothing more than a
> grep pattern to extract relevant events from dmesg.  Ideal, but maybe
> impractical, would be a way to reproduce things in a VM using qemu
> and
> a script to simulate hotplugs.

I haven't yet tried the qemu for PCI debugging, definitely need to take
a look. To help with that, the patchset is supplied with additional
prints, but CONFIG_PCI_DEBUG=y is still preferred. A simulation will
reveal if additional debug messages are needed.

> > The feature is usable not only for hotplug, but also for booting
> > with a
> > complete set of BARs assigned, and for Resizable BARs.
> 
> I'm interested in more details about both of these.  What does "a
> complete set of BARs assigned" mean?  On x86, the BIOS usually
> assigns
> all the BARs ahead of time, but I know that's not the case for all
> arches.

Usually yes, but we have at least one x86 PC that skips some BARs (need
to check out again its particular model and version, IIRC that was
Z370-F) -- not the crucial ones like BAR0, though. That really got me
by surprise, so now it is one more case covered.

> For Resizable BARs, is the point that this allows more flexibility in
> resizing BARs because we can now move things out of the way?

Not only things out of the way, but most importantly the bridge windows
are now floating as well. So it's more freedom for a new BAR to
"choose" a place.

An awfully synthetic example:

|                       RC Address Space                       |
| orig GPU BAR | fixed BAR |                                   |
|              | fixed BAR |     Resized GPU BAR        |      |

> > Tested on a number of x86_64 machines without any special kernel
> > command
> > line arguments (some of them -- with older v8 revision of this
> > patchset):
> >  - PC: i7-5930K + ASUS X99-A;
> >  - PC: i5-8500 + ASUS Z370-F;
> >  - PC: AMD Ryzen 7 3700X + ASRock X570 + AMD 5700 XT (Resizable
> > BARs);
> >  - Supermicro Super Server/H11SSL-i: AMD EPYC 7251;
> >  - HP ProLiant DL380 G5: Xeon X5460;
> >  - Dell Inspiron N5010: i5 M 480;
> >  - Dell Precision M6600: i7-2920XM.
> 
> Does this testing show no change in behavior and no regressions, or
> does it show that this series fixes cases that previously did not
> work?  If the latter, some bugzilla reports with before/after dmesg
> logs would give some insight into how this works and also some good
> justification.

Both, actually, but the dmesg logs were never published, so they will
be the next step -- combined with qemu scripts, if it works.

> > This patchset is a part of our work on adding support for hot-
> > adding
> > chains of chassis full of other bridges, NVME drives, SAS HBAs,
> > GPUs, etc.
> > without special requirements such as Hot-Plug Controller,
> > reservation of
> > bus numbers or memory regions by firmware, etc.
> 
> This is a little bit of a chicken and egg situation.  I suspect many
> people would like this functionality, but currently we either avoid
> it
> because it's "known not to work" or we hack around it with the
> firmware reservations you mention.

The current (v9) patchset became more symbiotic with other hotplug
facilities: it started to respect pci=hpmemsize etc, and only drops it
if such allocation is impossible.

Attention Button for Assisted Hotplug (pciehp driver) was always
supported. It is also fully compatible with DPC.

The goal is to squeeze as much as possible even without special
hw+fw+sw support, when the only available tool is a manual rescan via
sysfs.

Serge




[Index of Archives]     [DMA Engine]     [Linux Coverity]     [Linux USB]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Greybus]

  Powered by Linux