Hi Andre,

On 06/23/2015 11:03 AM, Andre Przywara wrote:
> Hi Eric,
>
> I went back reading the code and looked at how the x86 APIC works more
> closely to understand the GSI routing better.
> See below for more ...
>
> On 22/06/15 10:21, Eric Auger wrote:
>> On 06/22/2015 10:40 AM, Andre Przywara wrote:
>>> Hi Eric,
>>>
>>> I briefly looked over the series, the patches themselves look good overall.
>>> I have one or two comments on the actual code, but want to discuss the
>>> general approach first (more a dump of some first thoughts):
>>>
>>> On 18/06/15 18:40, Eric Auger wrote:
>>>> With the advent of GICv3 ITS in-kernel emulation, KVM GSI routing
>>>> appears to be requested. More specifically MSI routing is needed.
>>>> irqchip routing does not sound to be really useful on arm but usage of
>>>> MSI routing also mandates to integrate irqchip routing. The initial
>>>> implementation of irqfd on arm must be upgraded with the integration
>>>> of kvm irqchip.c code and the implementation of its standard hooks
>>>> in the architecture specific part.
>>>>
>>>> The series therefore allows and mandates the usage of KVM_SET_GSI_ROUTING
>>>> ioctl along with KVM_IRQFD. If the userspace does not define any routing
>>>> table, no irqfd injection can happen. The user-space can use
>>>> KVM_CAP_IRQ_ROUTING to detect whether a routing table is needed.
>>>>
>>>> For irqchip routing, the convention is, only SPI can be injected and the
>>>> SPI ID corresponds to irqchip.pin + 32. For MSI routing the interrupt ID
>>>> matches the MSI msg data. The API evolves to support associating a
>>>> device ID to a routing entry.
>>>
>>> So if I get this right, in a guest ITS case we have now three different
>>> IRQ name spaces:
>>> a) the LPI number, which is guest internal. The ITS driver in the guest
>>> maintains it.
>>> We can track assignments and changes when handling the
>>> MAPVI command in the host kernel, but this would stay in the kernel, as
>>> I don't see an efficient way of propagating this to userland.
>>> b) the GSI number, which is used in communication between userland and
>>> the host kernel. The guest kernel does not know about this at all. Also
>>> the ioctl requires us to set the routing for _all_ GSIs, and I read it
>>> that it assumes starting at GSI 0.
>> All injected GSIs must effectively have a routing entry in KVM. Starting
>> at 0 is not required. At qemu level there's just the constraint that the
>> gsi fits within [0, max route number].
>
> Yeah, you are right, I somehow missed that each routing entry has a gsi
> field in it. So we have to allocate all of them at once with one ioctl,
> but they can be sparse.
>
>>> So we cannot even pretend to have
>>> LPIs here, because we would need at least 8192 empty entries then, not
>>> to speak of the possibly sparse allocation above. So we have a
>>> completely distinct name space here.
>> What is done currently at qemu level for other archs - if I understand
>> it correctly - is there is static GSI routing for standard IRQ. For MSI
>> irqfd setup they use spare gsi number not yet used for GSI routing < max
>> route number. So this is sparse for MSI but not for standard IRQs.
>> Effectively we do not plan to have GSI routing for LPIs but only MSI
>> routing.
>
> That seems to make sense to me. Since we already limit the number of
> SPIs to something sensible with our KVM_DEV_ARM_VGIC_GRP_NR_IRQS, we
> could infer an implicit direct routing for those SPIs. KVM could check
> the IRQ number against vgic.nr_irqs to see whether an IRQ is routed or not.
> Any GSI beyond that number would be an MSI with your enhanced DevID:EvID
> pair in it, which gets injected via the ITS emulation code (or the
> respective GICv2m code).
>
> That would be the idea, but if it turns out that not routing SPIs but
> only MSIs requires too many changes to the (core) KVM code (haven't
> looked yet),

I am currently prototyping that. Maybe this is not so much change in the
core. I am attempting to build a default irqchip routing table and just
allow userspace to add MSI entries on top of those. This should be the
topic of RFC v2, I think this week.

> we could require routing entries for SPIs as well.
> After all that's what for instance kvmtool sets up for x86, creating
> default 1:1 mappings for ISA and low APIC IRQs and allocating MSIs on
> demand after that.
>
>>> c) The DevID:EvID pair, which actually identifies an IRQ in all the
>>> three regimes and is the only authoritative ID.
>>>
>>> So that means we need to maintain the connection between all the three,
>>> somehow duplicating the whole ITS mapping again to map GSIs to DevID:EvID.
>> Currently the KVM routing table indeed stores GSI/DevID:EvID mapping.
>>>
>>> So I wonder if we could use DevID:EvID directly.
>>> The KVM_IRQFD ioctl struct has some space, so we could put the DevID
>>> into the pad area.
>>> Also (more forward-looking) KVM_CAP_ASSIGN_DEV_IRQ identifies guest IRQs
>>> by a u32, but again there is quite some padding area available.
>
>> ASSIGN_DEV_IRQ is a deprecated feature. We should not use that API I think.
>
> OK, so do we have other users of the GSI routing besides IRQFD then?

Well KVM_IRQ_LINE as well. That's why I would like to unplug userspace
irqchip routing.

>
> I will go ahead and try to implement some code matching Eric's patches
> in kvmtool to test the GSI routing.
>
> Eric, how did you test the irqchip routing on the Midway?

I used xgmac passthrough on Midway.

>
> Cheers,
> Andre.
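
To make the implicit-routing idea discussed above concrete, here is a minimal C sketch, not code from the series. It assumes that GSIs below a hypothetical nr_spis bound map 1:1 to SPIs (SPI ID = gsi + 32, following the irqchip.pin + 32 convention) and that any other GSI must match a sparse MSI entry carrying a DevID:EvID pair. The types route_kind, msi_route and route_result and the function resolve_gsi are invented for illustration and are not KVM APIs.

```c
#include <stddef.h>
#include <stdint.h>

/* All names below are hypothetical and exist only in this sketch. */
enum route_kind { ROUTE_NONE, ROUTE_SPI, ROUTE_MSI };

/* One sparse MSI routing entry: GSI -> DevID:EvID. */
struct msi_route {
    uint32_t gsi;
    uint32_t devid;
    uint32_t eventid;
};

struct route_result {
    enum route_kind kind;
    uint32_t spi_id;   /* valid when kind == ROUTE_SPI */
    uint32_t devid;    /* valid when kind == ROUTE_MSI */
    uint32_t eventid;  /* valid when kind == ROUTE_MSI */
};

/*
 * Resolve a GSI under the assumed convention:
 *  - gsi < nr_spis: implicit identity routing to SPI ID gsi + 32
 *  - otherwise: look the GSI up in the sparse MSI table
 */
static struct route_result resolve_gsi(uint32_t gsi, uint32_t nr_spis,
                                       const struct msi_route *msi,
                                       size_t nr_msi)
{
    struct route_result r = { ROUTE_NONE, 0, 0, 0 };

    if (gsi < nr_spis) {
        r.kind = ROUTE_SPI;
        r.spi_id = gsi + 32;
        return r;
    }

    for (size_t i = 0; i < nr_msi; i++) {
        if (msi[i].gsi == gsi) {
            r.kind = ROUTE_MSI;
            r.devid = msi[i].devid;
            r.eventid = msi[i].eventid;
            return r;
        }
    }

    return r; /* no routing entry: nothing can be injected */
}
```

A linear scan keeps the sketch short; a real table would more likely be indexed or hashed by GSI.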
>
>> Eric
>>>
>>> In general I am a bit reluctant to introduce just another level of
>>> complexity to the already quite convoluted way of doing IRQs and MSIs on
>>> ARM(64), that's why I will investigate if we can use DevID:EvID to refer
>>> to an interrupt.
>>>
>>> So far,
>>> Andre.
>>>
>>>>
>>>> Known Issues of this RFC:
>>>>
>>>> - One of the biggest is the API inconsistencies on ARM. Blame me.
>>>> Routing should apply to KVM_IRQ_LINE ioctl which is not the case yet
>>>> in this series. It only applies to irqfd.
>>>> on x86 typically this KVM_IRQ_LINE is plugged onto irqchip.c kvm_set_irq
>>>> whereas on ARM we inject directly through kvm_vgic_inject_irq
>>>> x on arm/arm64 gsi has a specific structure:
>>>>   bits:  | 31 ... 24 | 23 ... 16  | 15 ... 0 |
>>>>   field: | irq_type  | vcpu_index | irq_id   |
>>>> where irq_id matches the Interrupt ID
>>>> - for KVM_IRQFD without routing (current implementation) the gsi field
>>>> corresponds to an SPI index = irq_id (above) - 32.
>>>> - as far as I understand qemu integration, gsi is supposed to be within
>>>> [0, KVM_MAX_IRQ_ROUTES]. Difficult to use KVM_IRQ_LINE gsi.
>>>> - to be defined what we choose as a convention when irqchip routing is
>>>> applied: gsi -> irqchip input pin.
>>>> - Or shouldn't we simply rule out any userspace irqchip routing and stick
>>>> to MSI routing? we could define a fixed identity in-kernel irqchip mapping
>>>> and only offer MSI routing.
>>>> - static allocation of chip[KVM_NR_IRQCHIPS][KVM_IRQCHIP_NUM_PINS];
>>>> arbitrarily put KVM_IRQCHIP_NUM_PINS = 1020 - 32 (SPI count). On s390
>>>> this is even bigger.
>>>>
>>>> Currently tested on irqchip routing only (Calxeda midway only),
>>>> ie NOT TESTED on MSI routing yet.
>>>>
>>>> This is a very preliminary RFC to ease the discussion.
>>>>
>>>> Code can be found at https://git.linaro.org/people/eric.auger/linux.git/shortlog/refs/heads/v4.1-rc8-gsi-routing-rfc
>>>>
>>>> It applies on Andre's [PATCH 00/13] arm64: KVM: GICv3 ITS emulation
>>>> (http://www.spinics.net/lists/kvm/msg117402.html)
>>>>
>>>> Eric Auger (6):
>>>>   KVM: api: add kvm_irq_routing_extended_msi
>>>>   KVM: kvm_host: add kvm_extended_msi
>>>>   KVM: irqchip: convey devid to kvm_set_msi
>>>>   KVM: arm/arm64: enable irqchip routing
>>>>   KVM: arm/arm64: enable MSI routing
>>>>   KVM: arm: implement kvm_set_msi by gsi direct mapping
>>>>
>>>>  Documentation/virtual/kvm/api.txt | 20 ++++++--
>>>>  arch/arm/include/asm/kvm_host.h   |  2 +
>>>>  arch/arm/kvm/Kconfig              |  3 ++
>>>>  arch/arm/kvm/Makefile             |  2 +-
>>>>  arch/arm64/include/asm/kvm_host.h |  1 +
>>>>  arch/arm64/kvm/Kconfig            |  2 +
>>>>  arch/arm64/kvm/Makefile           |  2 +-
>>>>  include/kvm/arm_vgic.h            |  9 ----
>>>>  include/linux/kvm_host.h          | 10 ++++
>>>>  include/uapi/linux/kvm.h          |  9 ++++
>>>>  virt/kvm/arm/vgic.c               | 96 +++++++++++++++++++++++++++------------
>>>>  virt/kvm/irqchip.c                | 20 ++++++--
>>>>  12 files changed, 128 insertions(+), 48 deletions(-)
>>>>
>>
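
For reference, the arm/arm64 KVM_IRQ_LINE field layout quoted in the known issues (irq_type in bits 31..24, vcpu_index in bits 23..16, irq_id in bits 15..0) can be sketched in C as below. This is only an illustration of the bit packing and of the "SPI index = irq_id - 32" convention mentioned for KVM_IRQFD without routing. The macro and function names are local to this sketch and are assumptions, not the kernel's uapi definitions.

```c
#include <stdint.h>

/* Layout as quoted in the thread; names are local to this sketch. */
#define IRQ_TYPE_SHIFT  24
#define VCPU_IDX_SHIFT  16
#define FIELD8_MASK     0xffu
#define IRQ_ID_MASK     0xffffu
#define SPI_BASE        32u   /* SPIs start at interrupt ID 32 */

struct irq_line_fields {
    uint32_t irq_type;
    uint32_t vcpu_index;
    uint32_t irq_id;
};

/* Pack the three fields into the 32-bit value. */
static uint32_t encode_irq_line(uint32_t irq_type, uint32_t vcpu_index,
                                uint32_t irq_id)
{
    return ((irq_type & FIELD8_MASK) << IRQ_TYPE_SHIFT) |
           ((vcpu_index & FIELD8_MASK) << VCPU_IDX_SHIFT) |
           (irq_id & IRQ_ID_MASK);
}

/* Split the 32-bit value back into its fields. */
static struct irq_line_fields decode_irq_line(uint32_t irq)
{
    struct irq_line_fields f;

    f.irq_type = (irq >> IRQ_TYPE_SHIFT) & FIELD8_MASK;
    f.vcpu_index = (irq >> VCPU_IDX_SHIFT) & FIELD8_MASK;
    f.irq_id = irq & IRQ_ID_MASK;
    return f;
}

/*
 * Convert an interrupt ID to the SPI index used as gsi by KVM_IRQFD
 * without routing. Returns 0 on success, -1 if irq_id is not an SPI.
 */
static int irq_id_to_irqfd_gsi(uint32_t irq_id, uint32_t *gsi)
{
    if (irq_id < SPI_BASE)
        return -1;
    *gsi = irq_id - SPI_BASE;
    return 0;
}
```

The sketch shows why the two conventions are hard to unify: the KVM_IRQ_LINE value uses its upper bits for type and vcpu, so it generally does not fit in a small [0, KVM_MAX_IRQ_ROUTES] gsi range, whereas the irqfd SPI index does.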