Hi Ricardo,

On Tue, Nov 2, 2021 at 4:45 PM Ricardo Koller <ricarkol@xxxxxxxxxx> wrote:
>
> On Tue, Nov 02, 2021 at 09:46:49AM +0000, Oliver Upton wrote:
> > The OS lock blocks all debug exceptions at every EL. To date, KVM has
> > not implemented the OS lock for its guests, despite the fact that it is
> > mandatory per the architecture. Simple context switching between the
> > guest and host is not appropriate, as its effects are not constrained to
> > the guest context.
> >
> > Emulate the OS Lock by clearing MDE and SS in MDSCR_EL1, thereby
> > blocking all but software breakpoint instructions. To handle breakpoint
> > instructions, trap debug exceptions to EL2 and skip the instruction.
> >
> > Signed-off-by: Oliver Upton <oupton@xxxxxxxxxx>
> > ---
> >  arch/arm64/include/asm/kvm_host.h |  4 ++++
> >  arch/arm64/kvm/debug.c            | 20 +++++++++++++++-----
> >  arch/arm64/kvm/handle_exit.c      |  8 ++++++++
> >  arch/arm64/kvm/sys_regs.c         |  6 +++---
> >  4 files changed, 30 insertions(+), 8 deletions(-)
> >
> > diff --git a/arch/arm64/include/asm/kvm_host.h b/arch/arm64/include/asm/kvm_host.h
> > index c98f65c4a1f7..f13b8b79b06d 100644
> > --- a/arch/arm64/include/asm/kvm_host.h
> > +++ b/arch/arm64/include/asm/kvm_host.h
> > @@ -724,6 +724,10 @@ void kvm_arm_vcpu_init_debug(struct kvm_vcpu *vcpu);
> >  void kvm_arm_setup_debug(struct kvm_vcpu *vcpu);
> >  void kvm_arm_clear_debug(struct kvm_vcpu *vcpu);
> >  void kvm_arm_reset_debug_ptr(struct kvm_vcpu *vcpu);
> > +
> > +#define kvm_vcpu_os_lock_enabled(vcpu)		\
> > +	(__vcpu_sys_reg(vcpu, OSLSR_EL1) & SYS_OSLSR_OSLK)
> > +
> >  int kvm_arm_vcpu_arch_set_attr(struct kvm_vcpu *vcpu,
> >  			       struct kvm_device_attr *attr);
> >  int kvm_arm_vcpu_arch_get_attr(struct kvm_vcpu *vcpu,
> > diff --git a/arch/arm64/kvm/debug.c b/arch/arm64/kvm/debug.c
> > index db9361338b2a..5690a9c99c89 100644
> > --- a/arch/arm64/kvm/debug.c
> > +++ b/arch/arm64/kvm/debug.c
> > @@ -95,8 +95,11 @@ static void kvm_arm_setup_mdcr_el2(struct kvm_vcpu *vcpu)
> >  				MDCR_EL2_TDRA |
> >  				MDCR_EL2_TDOSA);
> >
> > -	/* Is the VM being debugged by userspace? */
> > -	if (vcpu->guest_debug)
> > +	/*
> > +	 * Check if the VM is being debugged by userspace or the guest has
> > +	 * enabled the OS lock.
> > +	 */
> > +	if (vcpu->guest_debug || kvm_vcpu_os_lock_enabled(vcpu))
> >  		/* Route all software debug exceptions to EL2 */
> >  		vcpu->arch.mdcr_el2 |= MDCR_EL2_TDE;
> >
> > @@ -160,8 +163,11 @@ void kvm_arm_setup_debug(struct kvm_vcpu *vcpu)
> >
> >  	kvm_arm_setup_mdcr_el2(vcpu);
> >
> > -	/* Is Guest debugging in effect? */
> > -	if (vcpu->guest_debug) {
> > +	/*
> > +	 * Check if the guest is being debugged or if the guest has enabled the
> > +	 * OS lock.
> > +	 */
> > +	if (vcpu->guest_debug || kvm_vcpu_os_lock_enabled(vcpu)) {
> >  		/* Save guest debug state */
> >  		save_guest_debug_regs(vcpu);
> >
> > @@ -223,6 +229,10 @@ void kvm_arm_setup_debug(struct kvm_vcpu *vcpu)
> >  			trace_kvm_arm_set_regset("WAPTS", get_num_wrps(),
> >  						&vcpu->arch.debug_ptr->dbg_wcr[0],
> >  						&vcpu->arch.debug_ptr->dbg_wvr[0]);
> > +		} else if (kvm_vcpu_os_lock_enabled(vcpu)) {
> > +			mdscr = vcpu_read_sys_reg(vcpu, MDSCR_EL1);
> > +			mdscr &= ~DBG_MDSCR_MDE;
> > +			vcpu_write_sys_reg(vcpu, mdscr, MDSCR_EL1);
>
> I think this is missing the case where the guest is being debugged by
> userspace _and_ from inside (the guest) at the same time. In this
> situation, the vmm gets a KVM_EXIT_DEBUG and if it doesn't know what to
> do with it, it injects the exception into the guest (1). With this "else
> if", the guest would still get the debug exception when the os lock is
> enabled.
>
> (1) kvm_arm_handle_debug() is the one doing this in QEMU source code.

I wonder if this is a problem that KVM should even handle. KVM doesn't
do anything to help userspace inject the debug exception into the
guest, and from reading kvm_arm_handle_debug() it would seem that QEMU
is manually injecting the exception to EL1 and setting the PC to the
appropriate vector.

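As a rough illustration of the userspace side, a VMM that injects
hardware debug exceptions (breakpoint, watchpoint, single-step) itself
would have to consult the guest's OS lock state first. Below is a
minimal, untested sketch. The helper names are hypothetical and are not
taken from QEMU or KVM, and how the VMM obtains the guest's OSLSR_EL1
value is deliberately left out. Only the OSLK bit position (bit 1 of
OSLSR_EL1) comes from the architecture.

```c
#include <stdbool.h>
#include <stdint.h>

/* OSLSR_EL1.OSLK is bit 1 in the Arm architecture. */
#define OSLSR_EL1_OSLK	(UINT64_C(1) << 1)

/*
 * Hypothetical helper: true if the guest has locked the OS lock,
 * given the guest's current OSLSR_EL1 value.
 */
static bool guest_os_lock_enabled(uint64_t oslsr_el1)
{
	return (oslsr_el1 & OSLSR_EL1_OSLK) != 0;
}

/*
 * Hypothetical helper: decide whether a hardware debug exception that
 * the VMM does not own should be forwarded to the guest. Assumption:
 * with the OS lock set, the guest must not observe the exception, so
 * the VMM drops it instead of injecting it.
 */
static bool should_inject_hw_debug_exception(uint64_t oslsr_el1)
{
	return !guest_os_lock_enabled(oslsr_el1);
}
```

A VMM would call should_inject_hw_debug_exception() with the vCPU's
OSLSR_EL1 value on a KVM_EXIT_DEBUG that it cannot attribute to its own
breakpoints, and only inject when it returns true.
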
There is an issue, though, with migration: older KVM will not show
OSLSR_EL1 on KVM_GET_REG_LIST. However, in order to provide an
architectural OS Lock, its reset value must be 1 (enabled). This would
all have the effect of discarding the guest's OS lock value and
blocking all debug exceptions intended for the guest until the next
reboot.

So it would seem that userspace needs to know about the OSLK bit to
correctly inject debug exceptions and migrate guests.

If opt-in is heavyweight, we could cure the migration issue by
explicitly documenting the OS lock being disabled out of reset as an
erratum of KVM. Doing so would be consistent with all prior versions of
KVM. Of course, adopting nonarchitected behavior in perpetuity seems a
bit unsavory :-)

--
Oliver