Hi Bjorn,
On 02.12.2016 22:57, Bjorn Helgaas wrote:
On Fri, Dec 02, 2016 at 03:20:28PM +0100, Tomasz Nowicki wrote:
dmesg from ThunderX pass2.0 (1 socket board) + small fix:
diff --git a/drivers/acpi/pci_mcfg.c b/drivers/acpi/pci_mcfg.c
- THUNDER_PEM_QUIRK(2, 0), /* off-chip devices */
- THUNDER_PEM_QUIRK(2, 1), /* off-chip devices */
+ THUNDER_PEM_QUIRK(2, 0UL), /* off-chip devices */
+ THUNDER_PEM_QUIRK(2, 1UL), /* off-chip devices */
Thanks! I folded this change into my branch (possibly to be updated
if Robert sends something better).
I believe Robert meant:
diff --git a/drivers/acpi/pci_mcfg.c b/drivers/acpi/pci_mcfg.c
index d34d196..8ca8ca8 100644
--- a/drivers/acpi/pci_mcfg.c
+++ b/drivers/acpi/pci_mcfg.c
@@ -76,7 +76,7 @@ static struct mcfg_fixup mcfg_quirks[] = {
HISI_QUAD_DOM("HIP07 ", 12, &hisi_pcie_ops),
#define THUNDER_PEM_RES(addr, node) \
- DEFINE_RES_MEM(addr + (node << 44), 0x39 * SZ_16M)
+ DEFINE_RES_MEM(addr + ((u64)node << 44), 0x39 * SZ_16M)
Which means we can forget UL suffix for THUNDER_PEM_QUIRK macro:
@@ -91,8 +91,8 @@ static struct mcfg_fixup mcfg_quirks[] = {
{ "CAVIUM", "THUNDERX", rev, 9 + (10 * node), MCFG_BUS_ANY,
\
&thunder_pem_ecam_ops, THUNDER_PEM_RES(0x89808f000000UL, node) }
/* SoC pass2.x */
- THUNDER_PEM_QUIRK(1, 0UL),
- THUNDER_PEM_QUIRK(1, 1UL),
+ THUNDER_PEM_QUIRK(1, 0),
+ THUNDER_PEM_QUIRK(1, 1),
Also, my apologies, I should have fixed parentheses in this macro from
the very beginning. For you convenience below incremental patch where I
fixed both issues:
diff --git a/drivers/acpi/pci_mcfg.c b/drivers/acpi/pci_mcfg.c
index d34d196..cf6c321 100644
--- a/drivers/acpi/pci_mcfg.c
+++ b/drivers/acpi/pci_mcfg.c
@@ -76,23 +76,23 @@ static struct mcfg_fixup mcfg_quirks[] = {
HISI_QUAD_DOM("HIP07 ", 12, &hisi_pcie_ops),
#define THUNDER_PEM_RES(addr, node) \
- DEFINE_RES_MEM(addr + (node << 44), 0x39 * SZ_16M)
+ DEFINE_RES_MEM((addr) + ((u64)(node) << 44), 0x39 * SZ_16M)
#define THUNDER_PEM_QUIRK(rev, node) \
- { "CAVIUM", "THUNDERX", rev, 4 + (10 * node), MCFG_BUS_ANY,
\
- &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x88001f000000UL, node)
}, \
- { "CAVIUM", "THUNDERX", rev, 5 + (10 * node), MCFG_BUS_ANY,
\
- &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x884057000000UL, node)
}, \
- { "CAVIUM", "THUNDERX", rev, 6 + (10 * node), MCFG_BUS_ANY,
\
- &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x88808f000000UL, node)
}, \
- { "CAVIUM", "THUNDERX", rev, 7 + (10 * node), MCFG_BUS_ANY,
\
- &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x89001f000000UL, node)
}, \
- { "CAVIUM", "THUNDERX", rev, 8 + (10 * node), MCFG_BUS_ANY,
\
- &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x894057000000UL, node)
}, \
- { "CAVIUM", "THUNDERX", rev, 9 + (10 * node), MCFG_BUS_ANY,
\
- &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x89808f000000UL, node) }
+ { "CAVIUM", "THUNDERX", (rev), 4 + (10 * (node)), MCFG_BUS_ANY,
\
+ &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x88001f000000UL,
(node)) }, \
+ { "CAVIUM", "THUNDERX", (rev), 5 + (10 * (node)), MCFG_BUS_ANY,
\
+ &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x884057000000UL,
(node)) }, \
+ { "CAVIUM", "THUNDERX", (rev), 6 + (10 * (node)), MCFG_BUS_ANY,
\
+ &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x88808f000000UL,
(node)) }, \
+ { "CAVIUM", "THUNDERX", (rev), 7 + (10 * (node)), MCFG_BUS_ANY,
\
+ &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x89001f000000UL,
(node)) }, \
+ { "CAVIUM", "THUNDERX", (rev), 8 + (10 * (node)), MCFG_BUS_ANY,
\
+ &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x894057000000UL,
(node)) }, \
+ { "CAVIUM", "THUNDERX", (rev), 9 + (10 * (node)), MCFG_BUS_ANY,
\
+ &thunder_pem_ecam_ops, THUNDER_PEM_RES(0x89808f000000UL, (node)) }
/* SoC pass2.x */
- THUNDER_PEM_QUIRK(1, 0UL),
- THUNDER_PEM_QUIRK(1, 1UL),
+ THUNDER_PEM_QUIRK(1, 0),
+ THUNDER_PEM_QUIRK(1, 1),
#define THUNDER_ECAM_QUIRK(rev, seg) \
{ "CAVIUM", "THUNDERX", rev, seg, MCFG_BUS_ANY, \
diff --git a/include/linux/pci-ecam.h b/include/linux/pci-ecam.h
index 00eb8eb..643c9e7 100644
--- a/include/linux/pci-ecam.h
+++ b/include/linux/pci-ecam.h
@@ -62,8 +62,9 @@ extern struct pci_ecam_ops pci_generic_ecam_ops;
#if defined(CONFIG_ACPI) && defined(CONFIG_PCI_QUIRKS)
extern struct pci_ecam_ops pci_32b_ops; /* 32-bit
accesses only */
extern struct pci_ecam_ops hisi_pcie_ops; /* HiSilicon */
-extern struct pci_ecam_ops thunder_pem_ecam_ops; /* Cavium ThunderX 1.x
& 2.x */
-extern struct pci_ecam_ops pci_thunder_ecam_ops; /* Cavium ThunderX 1.x */
+extern struct pci_ecam_ops thunder_pem_ecam_ops; /* Cavium ThunderX
pass1.x &
+ pass2.x */
+extern struct pci_ecam_ops pci_thunder_ecam_ops; /* Cavium ThunderX
pass1.x */
#endif
#ifdef CONFIG_PCI_HOST_GENERIC
ACPI: MCFG table detected, 4 entries
ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-1f])
acpi PNP0A08:00: _OSC: OS supports [ExtendedConfig ASPM ClockPM
Segments MSI]
acpi PNP0A08:00: _OSC: platform does not support [PCIeHotplug PME AER]
acpi PNP0A08:00: _OSC: OS now controls [PCIeCapability]
acpi PNP0A08:00: ECAM area [mem 0x848000000000-0x848001ffffff]
reserved by THRX0001:00
acpi PNP0A08:00: ECAM at [mem 0x848000000000-0x848001ffffff] for [bus 00-1f]
I guess we don't need MCFG quirks for these bridges, since I don't see
the "MCFG quirk" message?
Yes, this is ThunderX 1-socket pass2.x so quirks are needed for 4-9 host
bridges only. Below summary of where/how quirks should be applied:
ThunderX pass2.x
NUMA node -> segments -> ECAM compliant note
0 -> 0-3 -> ECAM compliant
0 -> 4-9 -> ECAM quirks (thunder_pem_ecam_ops)
1 (optionally) -> 10-13 -> ECAM compliant
1 (optionally) -> 14-19 -> ECAM quirks (thunder_pem_ecam_ops)
ThunderX pass1.x
NUMA node -> segments -> ECAM compliant note
0 -> 0-3 -> ECAM quirks (pci_thunder_ecam_ops)
0 -> 4-9 -> ECAM quirks (thunder_pem_ecam_ops)
1 (optionally) -> 10-13 -> ECAM quirks (pci_thunder_ecam_ops)
1 (optionally) -> 14-19 -> ECAM quirks (thunder_pem_ecam_ops)
system 00:00: [mem 0x848000000000-0x848001ffffff] could not be reserved
system 00:01: [mem 0x849000000000-0x849001ffffff] could not be reserved
system 00:02: [mem 0x84a000000000-0x84a001ffffff] could not be reserved
system 00:03: [mem 0x84b000000000-0x84b001ffffff] could not be reserved
system 00:04: [mem 0x87e0c0000000-0x87e0c0ffffff] could not be reserved
system 00:04: [mem 0x88001f000000-0x880057ffffff] could not be reserved
system 00:05: [mem 0x87e0c2000000-0x87e0c2ffffff] could not be reserved
system 00:05: [mem 0x88808f000000-0x8880c7ffffff] could not be reserved
Most of these match ECAM spaces, which is good. 00:04 and 00:05 each
have one ECAM space and one "other" space, which might be PEMx host
bridge registers? That all seems good.
Yes, "other" space is PEMx host bridge registers.
But I assume the other bridges (PCIx) also have register space in
addition to ECAM, and that should be reported also.
00:00, 00:01, 00:02, 00:03 are ECAM compliant so for those we need ECAM
region only.
root@ubuntu:~# cat /proc/iomem
...
838000000000-841fffffffff : PCI Bus 0000:00
838000000000-8380003fffff : 0000:03:00.0
Something's missing here: we should have a clue about how we got from
bus 00 to bus 03. From your dmesg, 0000:00:15.0 is a bridge from bus
00 to bus 03, and its windows should appear here. I'd expect
something like:
838000000000-841fffffffff : PCI Bus 0000:00
838000000000-838fffffffff : PCI Bus 0000:03 <- 00:15.0 window
838000000000-8380003fffff : 0000:03:00.0
This window should be inserted by generic code, so I don't know how
this could be broken. Your dmesg should also have something like
this:
pci 0000:00:15.0: PCI bridge to [bus 03]
pci 0000:00:15.0: bridge window [mem 0x838000000000-0x838fffffffff]
I don't see that, maybe because this is actually a console log
collected without "ignore_loglevel"? But that obviously doesn't
affect /proc/iomem, so I'm still puzzled about that.
Enhanced Allocation is used for devices but not for bridges which have
invalid BARs in standard configuration header. Thus devices request
resources from first valid parent bridge (host bridge in this case).
87e024000000-87e024000fff : ARMH0011:00
87e024000000-87e024000fff : ARMH0011:00
This is interesting. This must be a driver reserving these areas?
Normally a driver would use the driver name, not the device name.
Ideally, I think the core should reserve the region with the device
name, and the driver would request it using the driver name, like
this:
843000000000-84303fffffff : 0002:01:00.0 <-- PCI core reservation
843000000000-84303fffffff : thunder-nic <-- driver request
The ACPI core doesn't actually do the reservation, so I assume the
ARMH0011:00 stuff is from the driver, and it's curious that it has the
same region twice.
87e026000000-87e0bfffffff : PCI Bus 0000:00
87e027000000-87e0277fffff : CAVA02A:00
This is interesting, too. CAVA02A:00 looks like an ACPI device, but
apparently it consumes some of the space that we think is routed to
PCI bus 0000:00. I think this happens on some x86 boxes, too, but it
is somewhat confusing.
I will take a look.
Based on this, I don't see any problems with the ThunderX quirks.
I'd like to understand what's going on with the PCI-to-PCI bridge
windows in /proc/iomem, but I doubt that's related to your quirks.
Yes it is not related to quirks.
However, CAVA02A:00 and ARMH0011 things should be investigated.
Thanks,
Tomasz
--
To unsubscribe from this list: send the line "unsubscribe linux-pci" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html