On Wed, Jun 19, 2019 at 10:21:21AM -0600, Logan Gunthorpe wrote: > *(cc'd back Bjorn and the list) > > On 2019-06-19 8:00 a.m., Nicholas Johnson wrote: > > Hi Ben and Logan, > > > > It looks like my git send-email has been not working correctly since I > > started trying to get these patches accepted. I may have remedied this > > now, but I have seen that Logan tried to find these patches and failed. > > So as a courtesy until I post PATCH v7 (hopefully correctly, this time), > > I am forwarding you the patches. I hope you like them. I would love to > > know of any concerns or questions you may have, and / or what happens if > > you test them. Thanks and all the best! > > > > ----- Forwarded message from Nicholas Johnson <nicholas.johnson-opensource@xxxxxxxxxxxxxx> ----- > > > > Date: Thu, 23 May 2019 06:29:27 +0800 > > From: Nicholas Johnson <nicholas.johnson-opensource@xxxxxxxxxxxxxx> > > To: linux-kernel@xxxxxxxxxxxxxxx > > Cc: linux-pci@xxxxxxxxxxxxxxx, bhelgaas@xxxxxxxxxx, mika.westerberg@xxxxxxxxxxxxxxx, corbet@xxxxxxx, Nicholas Johnson <nicholas.johnson-opensource@xxxxxxxxxxxxxx> > > Subject: [PATCH v6 3/4] PCI: Fix bug resulting in double hpmemsize being assigned to MMIO window > > X-Mailer: git-send-email 2.19.1 > > > > Background > > ========================================================================== > > > > Solve bug report: > > https://bugzilla.kernel.org/show_bug.cgi?id=203243 > > This is all kinds of confusing... the bug report just seems to be a copy > of the patch set. The description of the actual symptoms of the problem > appears to be missing from all of it. I believe everything to be there, but I can take another look and add more details. It is possible I lost track of what I had written where. There are common elements which I borrowed from the patchset or vice-versa, like the pin diagram for using the Thunderbolt add-in card for testing. > > > Currently, the kernel can sometimes assign the MMIO_PREF window > > additional size into the MMIO window, resulting in double the MMIO > > additional size, even if the MMIO_PREF window was successful. > > > > This happens if in the first pass, the MMIO_PREF succeeds but the MMIO > > fails. In the next pass, because MMIO_PREF is already assigned, the > > attempt to assign MMIO_PREF returns an error code instead of success > > (nothing more to do, already allocated). > > > > Example of problem (more context can be found in the bug report URL): > > > > Mainline kernel: > > pci 0000:06:01.0: BAR 14: assigned [mem 0x90100000-0xa00fffff] = 256M > > pci 0000:06:04.0: BAR 14: assigned [mem 0xa0200000-0xb01fffff] = 256M > > > > Patched kernel: > > pci 0000:06:01.0: BAR 14: assigned [mem 0x90100000-0x980fffff] = 128M > > pci 0000:06:04.0: BAR 14: assigned [mem 0x98200000-0xa01fffff] = 128M > > > > This was using pci=realloc,hpmemsize=128M,nocrs - on the same machine > > with the same configuration, with a Ubuntu mainline kernel and a kernel > > patched with this patch series. > > > > This patch is vital for the next patch in the series. The next patch > > allows the user to specify MMIO and MMIO_PREF independently. If the > > MMIO_PREF is set to be very large, this bug will end up more than > > doubling the MMIO size. The bug results in the MMIO_PREF being added to > > the MMIO window, which means doubling if MMIO_PREF size == MMIO size. > > With a large MMIO_PREF, without this patch, the MMIO window will likely > > fail to be assigned altogether due to lack of 32-bit address space. > > > > Patch notes > > ========================================================================== > > > > Change find_free_bus_resource() to not skip assigned resources with > > non-null parent. > > > > Add checks in pbus_size_io() and pbus_size_mem() to return success if > > resource returned from find_free_bus_resource() is already allocated. > > > > This avoids pbus_size_io() and pbus_size_mem() returning error code to > > __pci_bus_size_bridges() when a resource has been successfully assigned > > in a previous pass. This fixes the existing behaviour where space for a > > resource could be reserved multiple times in different parent bridge > > windows. This also greatly reduces the number of failed BAR messages in > > dmesg when Linux assigns resources. > > This patch looks like the same bug that I tracked down earlier but I > solved in a slightly different way. See this patch[1] which is still > under review. Can you maybe test it and see if it solves the same problem? I read [1] and it is definitely the same bug, without a doubt. This is fantastic because it means I have somebody to back me up on this. I will test the patch as soon as I can - perhaps after work today. My initial thoughts of [1] patch are that restricting 64-bit BARs to 64-bit windows might break assigning 64-bit BARs on bridges without the optional prefetchable window. My patch should not have that issue - but after I have tested [1], it might turn out to be fine. Correct me if I am wrong about assumptions about windows. My understanding cannot be perfect. As far as I know, 64-bit BARs should always be prefetchable, but I own the Aquantia AQC-107S NIC and it has three 64-bit non-pref BARs. It happens that they are assigned into the 32-bit window. I will see if [1] patch prevents that from happening or not. Cheers > > Thanks, > > Logan > > [1] > https://lore.kernel.org/lkml/20190531171216.20532-2-logang@xxxxxxxxxxxx/T/#u