From: Lai Jiangshan <laijs@xxxxxxxxxxxxxxxxx>

In VMX, the NMI handler needs to be invoked after an NMI VM-Exit.

Before commit 1a5488ef0dcf6 ("KVM: VMX: Invoke NMI handler via indirect
call instead of INTn"), this was done with INTn ("int $2"). But INTn
microcode is relatively expensive, so the commit reworked NMI VM-Exit
handling to invoke the kernel handler by function call. INTn also doesn't
set the NMI-blocked flag required by the Linux kernel NMI entry. So moving
away from INTn is very reasonable.

Yet some details were missed. With that commit applied, the NMI entry
pointer is fetched from the IDT and called from the kernel stack. But the
NMI entry pointer installed in the IDT is asm_exc_nmi(), which expects to
be invoked on the IST stack, as the ISA does when it delivers an NMI. It
relies on the "NMI executing" variable on the IST stack to work correctly.
When it is unexpectedly called from the kernel stack, the RSP-located
"NMI executing" variable is also on the kernel stack, is "uninitialized",
and can cause the NMI entry to take the wrong path.

While fixing the problem for KVM, I found that the same problem might
exist in the early boot stage, where the IST is not set up yet.
asm_exc_nmi() must not be used at this stage for the same reason: the
RSP-located "NMI executing" variable.

For both cases, we should use asm_noist_exc_nmi(), which is introduced in
patch 1 by renaming the existing asm_xenpv_exc_nmi() and which is safe on
the kernel stack.
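To illustrate the failure mode, here is a minimal user-space sketch, not
kernel code. It assumes a toy entry that trusts a flag stored one word below
the stack pointer it is entered with, which is only a loose model of the
RSP-located "NMI executing" variable; the real asm_exc_nmi() logic is more
involved. All names (toy_ist_entry(), toy_noist_entry(), toy_demo()) are
hypothetical.

```c
#include <assert.h>
#include <stdint.h>

#define TOY_STACK_WORDS 16

enum toy_nmi_path { TOY_NMI_HANDLED, TOY_NMI_TAKEN_AS_NESTED };

/*
 * Toy stand-in for an IST-style entry (hypothetical, not kernel code):
 * it trusts a flag stored at a fixed offset below the entry stack pointer.
 */
static enum toy_nmi_path toy_ist_entry(uint64_t *sp)
{
	uint64_t *nmi_executing = sp - 1;

	if (*nmi_executing == 1)
		return TOY_NMI_TAKEN_AS_NESTED;	/* handler body is skipped */

	*nmi_executing = 1;	/* mark an NMI as in progress */
	/* ... the real handler would run here ... */
	*nmi_executing = 0;	/* clear the mark on the way out */
	return TOY_NMI_HANDLED;
}

/* Toy stand-in for a non-IST entry: it keeps no state in RSP-relative slots. */
static enum toy_nmi_path toy_noist_entry(uint64_t *sp)
{
	(void)sp;
	return TOY_NMI_HANDLED;
}

/* Returns 1 if the stale-slot misrouting is observed in this model. */
int toy_demo(void)
{
	uint64_t ist_stack[TOY_STACK_WORDS] = { 0 };	/* dedicated stack, slot starts clear */
	uint64_t kernel_stack[TOY_STACK_WORDS] = { 0 };
	uint64_t *ist_sp = &ist_stack[8];
	uint64_t *kernel_sp = &kernel_stack[8];

	kernel_sp[-1] = 1;	/* stale data left behind by unrelated code */

	/* On the dedicated stack the flag is meaningful, so the NMI is handled. */
	assert(toy_ist_entry(ist_sp) == TOY_NMI_HANDLED);
	/* The non-IST entry does not look at the slot, so stale data is harmless. */
	assert(toy_noist_entry(kernel_sp) == TOY_NMI_HANDLED);
	/* The IST-style entry on the ordinary stack misreads the stale word. */
	return toy_ist_entry(kernel_sp) == TOY_NMI_TAKEN_AS_NESTED;
}
```

In this model toy_demo() returns 1: the IST-style entry misroutes only when
it is entered on a stack whose slot it does not own, which is the situation
described above for the KVM call path and for early boot.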
https://lore.kernel.org/lkml/20200915191505.10355-3-sean.j.christopherson@xxxxxxxxx/

Cc: Thomas Gleixner <tglx@xxxxxxxxxxxxx>
Cc: Paolo Bonzini <pbonzini@xxxxxxxxxx>
Cc: Sean Christopherson <seanjc@xxxxxxxxxx>
Cc: Steven Rostedt <rostedt@xxxxxxxxxxx>
Cc: Andi Kleen <ak@xxxxxxxxxxxxxxx>
Cc: Andy Lutomirski <luto@xxxxxxxxxx>
Cc: Vitaly Kuznetsov <vkuznets@xxxxxxxxxx>
Cc: Wanpeng Li <wanpengli@xxxxxxxxxxx>
Cc: Jim Mattson <jmattson@xxxxxxxxxx>
Cc: Joerg Roedel <joro@xxxxxxxxxx>
Cc: kvm@xxxxxxxxxxxxxxx
Cc: Josh Poimboeuf <jpoimboe@xxxxxxxxxx>
Cc: Uros Bizjak <ubizjak@xxxxxxxxx>
Cc: Maxim Levitsky <mlevitsk@xxxxxxxxxx>
Signed-off-by: Lai Jiangshan <laijs@xxxxxxxxxxxxxxxxx>

Lai Jiangshan (4):
  x86/xen/entry: Rename xenpv_exc_nmi to noist_exc_nmi
  x86/entry: Use asm_noist_exc_nmi() for NMI in early booting stage
  KVM/VMX: Invoke NMI non-IST entry instead of IST entry
  KVM/VMX: fold handle_interrupt_nmi_irqoff() into its solo caller

 arch/x86/include/asm/idtentry.h |  4 +---
 arch/x86/kernel/idt.c           |  8 +++++++-
 arch/x86/kernel/nmi.c           | 12 ++++++++++++
 arch/x86/kvm/vmx/vmx.c          | 27 ++++++++++++++-------------
 arch/x86/xen/enlighten_pv.c     |  9 +++------
 arch/x86/xen/xen-asm.S          |  2 +-
 6 files changed, 38 insertions(+), 24 deletions(-)

-- 
2.19.1.6.gb485710b