On Fri, 2022-05-06 at 13:33 +0200, Arnd Bergmann wrote:
On Fri, May 6, 2022 at 12:20 PM Maciej W. Rozycki <macro@xxxxxxxxxxx> wrote:
On Thu, 5 May 2022, Arnd Bergmann wrote:
I think I'm missing something here. IIUC we're talking about a PCI/PCIe
bus used with s390 hardware, right?
(It has to be PCI/PCIe, because other than x86/IA-64 host buses there are
only PCI/PCIe and EISA/ISA buses out there that define I/O access cycles
and EISA/ISA have long been obsoleted except perhaps from some niche use.)
If this is PCI/PCIe indeed, then an I/O access is just a different bit
pattern put on the bus/in the TLP in the address phase. So what is there
inherent to the s390 architecture that prevents that different bit pattern
from being used?
The hardware design for PCI on s390 is very different from any other
architecture, and more abstract. Rather than implementing MMIO register
access as pointer dereference, this is a separate CPU instruction that
takes a device/bar plus offset as arguments rather than a pointer, and
Linux encodes this back into a fake __iomem token.
Correct, we have since gained new PCI load/store instructions that
actually do take a virtual address that gets address translated to a
"physical address" for the PCI BARs. The "physical address" we get from
the platform (again via special instructions). I put "physical address"
in quotes because while they are conceptually physical addresses and
they are translated to by MMU translation tables, accessing their
virtual mapping with any instruction other then the special PCI
load/store instructions will cause addressing exceptions. So we still
don't have real MMIO but something that looks a lot more like MMIO than
we used to.
If anything, I could imagine the same limitation as with current POWER9
implementations, that is whatever glue is used to wire PCI/PCIe to the
rest of the system does not implement a way to use said bit pattern (which
has nothing to do with the POWER9 processor instruction set).
But that has nothing to do with the presence or absence of any specific
processor instructions. It's just a limitation of bus glue. So I guess
it's just that all PCI/PCIe glue logic implementations for s390 have such
a limitation, right?
There are separate instructions for PCI memory and config space, but
no instructions for I/O space, or for non-PCI MMIO that it could be mapped
into.
Arnd
The config space is still accessed with the old style PCI load/store
instructions via a special extra BAR.
But yes overall on s390x we can only access PCI(e) devices via special
instructions not via real MMIO and also the OS has no direct access to
the registers of the PHB which are only accessible to firmware.
Maybe as a bit of further background it's also important to note that
on s390x all Operating Systems run inside a hypervisor. On the lowest
level any OS can run this is a non-paging machine level hypervisor. For
PCI that means that we always have a kind of pass-through access though
without paging and hardware support for the memory partitioning this is
of course relatively simple.