Hey Bjorn, so I have spent a few hours seeing how we could simplify this series. Here are my thoughts. On Fri, 2024-04-26 at 17:01 -0500, Bjorn Helgaas wrote: > On Fri, Apr 26, 2024 at 10:07:02AM +0200, Philipp Stanner wrote: > > On Wed, 2024-04-24 at 15:12 -0500, Bjorn Helgaas wrote: > > > On Mon, Apr 08, 2024 at 10:44:12AM +0200, Philipp Stanner wrote: > > > > ... > > > > PCI's devres API suffers several weaknesses: > > > > > > > > 1. There are functions prefixed with pcim_. Those are always > > > > managed > > > > counterparts to never-managed functions prefixed with pci_ – > > > > or > > > > so one > > > > would like to think. There are some apparently unmanaged > > > > functions > > > > (all region-request / release functions, and pci_intx()) > > > > which > > > > suddenly become managed once the user has initialized the > > > > device > > > > with > > > > pcim_enable_device() instead of pci_enable_device(). This > > > > "sometimes > > > > yes, sometimes no" nature of those functions is confusing > > > > and > > > > therefore bug-provoking. In fact, it has already caused a > > > > bug in > > > > DRM. > > > > The last patch in this series fixes that bug. > > > > 2. iomappings: Instead of giving each mapping its own callback, > > > > the > > > > existing API uses a statically allocated struct tracking one > > > > mapping > > > > per bar. This is not extensible. Especially, you can't > > > > create > > > > _ranged_ managed mappings that way, which many drivers want. > > > > 3. Managed request functions only exist as "plural versions" > > > > with a > > > > bit-mask as a parameter. That's quite over-engineered > > > > considering > > > > that each user only ever mapps one, maybe two bars. > > > > > > > > This series: > > > > - add a set of new "singular" devres functions that use devres > > > > the > > > > way > > > > its intended, with one callback per resource. > > > > - deprecates the existing iomap-table mechanism. > > > > - deprecates the hybrid nature of pci_ functions. > > > > - preserves backwards compatibility so that drivers using the > > > > existing > > > > API won't notice any changes. > > > > - adds documentation, especially some warning users about the > > > > complicated nature of PCI's devres. > > > > > > There's a lot of good work here; thanks for working on it. > > > > Thanks! > > Good to get some more feedback from you > > > > > > > > > Philipp Stanner (10): > > > > PCI: Add new set of devres functions > > > > > > This first patch adds some infrastructure and several new > > > exported > > > interfaces: > > > > > > void __iomem *pcim_iomap_region(struct pci_dev *pdev, int bar, > > > const char *name) > > > void pcim_iounmap_region(struct pci_dev *pdev, int bar) > > > int pcim_request_region(struct pci_dev *pdev, int bar, const > > > char > > > *name) > > > void pcim_release_region(struct pci_dev *pdev, int bar) > > > void __iomem *pcim_iomap_range(struct pci_dev *pdev, int bar, > > > void __iomem *pcim_iomap_region_range(struct pci_dev *pdev, int > > > bar, > > > void pcim_iounmap_region_range(struct pci_dev *pdev, int bar, > > > > > > > PCI: Deprecate iomap-table functions > > > > > > This adds a little bit of infrastructure (add/remove to > > > legacy_table), > > > reimplements these existing interfaces: > > > > > > void __iomem *pcim_iomap(struct pci_dev *pdev, int bar, > > > unsigned > > > long maxlen) > > > void pcim_iounmap(struct pci_dev *pdev, void __iomem *addr) > > > int pcim_iomap_regions(struct pci_dev *pdev, int mask, const > > > char > > > *name) > > > int pcim_iomap_regions_request_all(struct pci_dev *pdev, int > > > mask, > > > void pcim_iounmap_regions(struct pci_dev *pdev, int mask) > > > > > > and adds a couple new exported interfaces: > > > > > > void pcim_release_all_regions(struct pci_dev *pdev) > > > int pcim_request_all_regions(struct pci_dev *pdev, const char > > > *name) > > > > > > There's a lot going on in these two patches, so they're hard to > > > review. I think it would be easier if you could do the fixes to > > > existing interfaces first, > > > > I agree that the patches can be further split into smaller chunks > > to > > make them more atomic and easier to review. I can do that. > > > > BUT I'd need some more details about what you mean by "do the fixes > > first" – which fixes? > > The later patches at least in part rely on the new better functions > > being available. > > > > > followed by adding new things, maybe > > > something like separate patches that: > > > > > > - Add pcim_addr_devres_alloc(), pcim_addr_devres_free(), > > > pcim_addr_devres_clear(). > > > > > > - Add pcim_add_mapping_to_legacy_table(), > > > pcim_remove_mapping_from_legacy_table(), > > > pcim_remove_bar_from_legacy_table(). > > > > > > - Reimplement pcim_iomap(), pcim_iomap_regions(), > > > pcim_iounmap(). > > This is the part I meant by "fixes", but maybe it's not so much a fix > as it is reimplementing based on different infrastructure. The diffs > in "PCI: Deprecate iomap-table functions" for pcim_iomap() and > pcim_iounmap() are fairly straightforward and only depend on the > above. > > pcim_iomap_regions() is a bit more complicated and probably needs > pcim_iomap_region() but not necessarily __pcim_request_region() and > __pcim_request_region_range(). Well, pcim_iomap_region() relies on __pcim_request_region() and __pcim_request_region_range(). The entire architecture circles around those: At the bottom you have generic functions that can reserve you any range, and at the top you have functions that use those to get a whole BAR if needed. That was done very specifically to have an extensible API that doesn't suffer the limitations of the current devres API. It's super easy this way to add all sorts of request() functions should the need ever arise, all you need to do is add a PCIM_ADDR_* counterpart, basically. Right now, I wouldn't even know how I could implement pcim_iomap_region() without my __pcim_* helpers, since the later are (see the comment above them) written as counter parts to the ones in pci.c – which are hybrid again. So I couldn't write it that way, because then we'd have circle where the pcim_ function calls the pci_ function which might call the pcim_ function again... So the one thing I really want is to keep __pcim_request_region_range() and its wrappers. The algorithms for the region ranges in __pcim_request_region_range() can IMO be trusted to work because they are just copies of the existing ones in pci_iomap_region(). Only difference is that they request instead of ioremap(). And the request part in turn is copied from __pci_request_region(), only difference being that I tried to make it more readable by storing the pci_resource_*() return values at the top :) There's really a reason why I did it this way: 1. Add new, better interfaces 2. Reimplement pcim_iomap() & partners 3. Separate that hybrid API as much as possible I really tried to put some thought into the existing approach. Changing that now in v7 would be very work intensive, because I'd have to think everything through again, refactor it, drop APIs again, split commits, readjust commit messages and all the clean up comments inline (which wouldn't fit anymore), test the build commit-by-commit carefully once again with KASAN & Co... Yesterday I also found that abandoning that "inner nature" of the series would actually decrease readability in many cases, because pcim_addr_resources_match() would be added later, whereas the logic added in pcim_addr_resource_release() would be now empty at the beginning and scattered over several patches etc. > > This would be a pretty small patch and defer making them deprecated > until replacements are added. I understand your desire to make those two patches smaller. We can work on making things more readable and reduce the exported APIs Let me suggest this: * I drop luxury functions no one needs (neither internally nor externally) yet (e.g., pcim_request_region_range()). That will already make patch #1 quite a bit smaller. * We don't export anything else which is not needed by users yet (pcim_request_all_regions(), pcim_request_region_exclusive() etc.) * I split patches #1 and #2 into smaller chunks where possible according to your suggestions above. * We keep __pcim_request_region(), and use it as the working horse cleanly separated from the hybrid counterparts in pci.c. Would that work for you? P. > > > > - Add new interfaces like pcim_iomap_region(), > > > pcim_request_region(), etc. > > > > > > AFAICS, except for pcim_iomap_range() (used by vbox), these > > > new > > > interfaces have no users outside drivers/pci, so ... we might > > > defer adding them, or at least defer exposing them via > > > include/linux/pci.h, until we have users for them. > > > > Dropping (the export of) functions like pcim_request_region_range() > > or > > pcim_request_all_regions() is not a problem. > > > > What I quite fundamentally have to disagree with, however, is not > > to > > export the functions > > > > * pcim_request_region() > > * pcim_iomap_region() > > > > the main point of this series is to deprecate that hybrid nature of > > those existing pci_* functions. You can only deprecate something > > when > > you provide users with new, better alternatives. > > Right. But the new alternatives are only better when there are > actual > examples in the tree for people to look at. If there are no users, > more interfaces are at best confusing and at worst dead code. > > Bjorn >