On Tue, Nov 19, 2024 at 11:46 PM Oliver Upton <oliver.upton@xxxxxxxxx> wrote: > > On Tue, Nov 19, 2024 at 03:57:46PM -0800, Jiaqi Yan wrote: > > > > While continuing the discussion here, I think it may make sense I sent > > out a V2 with 2 major updates: > > - add documentation for new SIGBUS feature > > - remove KVM_CAP_ARM_SIGBUS_ON_SEA > > Just wanted to add that QEMU already has a functioning "MCE" injection > implemenation based on signals that deals with the sloppy mess of > coordinating w/ vCPU threads [*]. > > I completely agree with Marc that the UAPI around using signals for this > sort of thing is a giant pile of crap, but it seems to be a semi-understood > pile of crap. And from that perspective, wiring unclaimed SEAs into the > existing infrastructure at least makes the UAPI consistent. > > [*]: https://elixir.bootlin.com/qemu/v9.1.1/C/ident/kvm_on_sigbus_vcpu > > > On Tue, Nov 12, 2024 at 12:51 AM Marc Zyngier <maz@xxxxxxxxxx> wrote: > > > > Do you mean a CAP that VMM can tell KVM the VM guest has RAS ability? > > > > I don't know if there is one for arm64. On x86 there is > > > > KVM_X86_SETUP_MCE. KVM_CAP_ARM_INJECT_EXT_DABT maybe a revelant one > > > > but I don't think it is exactly the one for "RAS ability". > > > > > > Having though about this a bit more, I now think this is independent > > > of the guest supporting RAS. This really is about the VMM asking to be > > > made aware of RAS errors affecting the guest, and it is the signalling > > > back to the guest that needs to be gated by ID_AA64PFR0_EL1.RAS. > > > > Just to make sure I fully catch you. I think ID_AA64PFR0_EL1.RAS > > translates to ARM64_HAS_RAS_EXTN in the kernel. If VMM signals RAS > > error back to the guest with SEA, are you suggesting > > __kvm_arm_vcpu_set_events should check > > cpus_have_final_cap(ARM64_HAS_RAS_EXTN) before it > > kvm_inject_dabt(vcpu)? > > > > If so, how could __kvm_arm_vcpu_set_events know if the error is about > > RAS (e.g. memory error) vs about accessing memory not in a memslot > > (i.e. KVM_EXIT_ARM_NISV)? I guess KVM needs to look at ESR_EL2 again > > (e.g. kvm_vcpu_abt_issea vs kvm_vcpu_dabt_isvalid)? > > Good point. I don't think we can lock down this UAPI after the fact > given the existing use cases. It is ultimately up to the VMM what to do. > > I don't see anything that would stop an implementation without FEAT_RAS > from generating an SEA in this situation. The lack of FEAT_RAS (to me at > least) implies: > > - No ESR injection for vSErrors (already enforced in UAPI) > - No deferral of SErrors / ESBs > - No error record registers in the PE Thanks Oliver, if I read you correctly, __kvm_arm_vcpu_set_events doesn't need to assert ARM64_HAS_RAS_EXTN, and it also doesn't need to differentiate RAS and KVM_EXIT_ARM_NISV. > > Of course, it is very likely you've thought about this more than I have, > Marc. I will send out RFC V2 in a second. Please take another look and comment there, Marc and Oliver, thanks! > > -- > Thanks, > Oliver