At init time, KVM does compatibility checks to ensure that all online CPUs support hardware virtualization and a common set of features. But KVM uses hotplugged CPUs without such compatibility checks. On Intel CPUs, this leads to #GP if the hotplugged CPU doesn't support VMX or vmentry failure if the hotplugged CPU doesn't meet minimal feature requirements. Do compatibility checks when onlining a CPU. If any VM is running, KVM hotplug callback returns an error to abort onlining incompatible CPUs. But if no VM is running, onlining incompatible CPUs is allowed. Instead, KVM is prohibited from creating VMs similar to the policy for init-time compatibility checks. CPU hotplug is disabled during hardware_enable_all() to prevent the corner case as shown below. A hotplugged CPU marks itself online in cpu_online_mask (1) and enables interrupt (2) before invoking callbacks registered in ONLINE section (3). So, if hardware_enable_all() is invoked on another CPU right after (2), then on_each_cpu() in hardware_enable_all() invokes hardware_enable_nolock() on the hotplugged CPU before kvm_online_cpu() is called. This makes the CPU escape from compatibility checks, which is risky. start_secondary { ... set_cpu_online(smp_processor_id(), true); <- 1 ... local_irq_enable(); <- 2 ... cpu_startup_entry(CPUHP_AP_ONLINE_IDLE); <- 3 } Keep compatibility checks at KVM init time. It can help to find incompatibility issues earlier and refuse to load arch KVM module (e.g., kvm-intel). Signed-off-by: Chao Gao <chao.gao@xxxxxxxxx> --- virt/kvm/kvm_main.c | 36 ++++++++++++++++++++++++++++++++++-- 1 file changed, 34 insertions(+), 2 deletions(-) diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c index c1054604d1e8..0ff80076d48d 100644 --- a/virt/kvm/kvm_main.c +++ b/virt/kvm/kvm_main.c @@ -106,6 +106,8 @@ LIST_HEAD(vm_list); static cpumask_var_t cpus_hardware_enabled; static int kvm_usage_count; static atomic_t hardware_enable_failed; +/* Set if hardware becomes incompatible after CPU hotplug */ +static bool hardware_incompatible; static struct kmem_cache *kvm_vcpu_cache; @@ -4855,20 +4857,32 @@ static void hardware_enable_nolock(void *junk) static int kvm_online_cpu(unsigned int cpu) { - int ret = 0; + int ret; + ret = kvm_arch_check_processor_compat(); raw_spin_lock(&kvm_count_lock); /* * Abort the CPU online process if hardware virtualization cannot * be enabled. Otherwise running VMs would encounter unrecoverable * errors when scheduled to this CPU. */ - if (kvm_usage_count) { + if (!ret && kvm_usage_count) { hardware_enable_nolock(NULL); if (atomic_read(&hardware_enable_failed)) { ret = -EIO; pr_info("kvm: abort onlining CPU%d", cpu); } + } else if (ret && !kvm_usage_count) { + /* + * Continue onlining an incompatible CPU if no VM is + * running. KVM should reject creating any VM after this + * point. Then this CPU can be still used to run non-VM + * workload. + */ + ret = 0; + hardware_incompatible = true; + pr_info("kvm: prohibit VM creation due to incompatible CPU%d", + cpu); } raw_spin_unlock(&kvm_count_lock); return ret; @@ -4913,8 +4927,24 @@ static int hardware_enable_all(void) { int r = 0; + /* + * During onlining a CPU, cpu_online_mask is set before kvm_online_cpu() + * is called. on_each_cpu() between them includes the CPU. As a result, + * hardware_enable_nolock() may get invoked before kvm_online_cpu(). + * This would enable hardware virtualization on that cpu without + * compatibility checks, which can potentially crash system or break + * running VMs. + * + * Disable CPU hotplug to prevent this case from happening. + */ + cpus_read_lock(); raw_spin_lock(&kvm_count_lock); + if (hardware_incompatible) { + r = -EIO; + goto unlock; + } + kvm_usage_count++; if (kvm_usage_count == 1) { atomic_set(&hardware_enable_failed, 0); @@ -4926,7 +4956,9 @@ static int hardware_enable_all(void) } } +unlock: raw_spin_unlock(&kvm_count_lock); + cpus_read_unlock(); return r; } -- 2.25.1