On Mon, Nov 07, 2022 at 03:06:51PM +0000, Sean Christopherson wrote: > On Mon, Nov 07, 2022, Yu Zhang wrote: > > On Thu, Nov 03, 2022 at 04:53:11PM +0000, Sean Christopherson wrote: > > > Ideally, KVM should NEVER manipulate VMX MSRs in response to guest CPUID changes. > > > That's what I was referring to earlier by commits: > > ... > > > Thanks Sean. Let me try to rephrase my understandings of your statement( > > and pls feel free to correct me): > > > > 1> For now, what vmx_adjust_secondary_exec_control() does, is to enable/ > > disable a feature in VMX MSR(and nVMX MSR) based on cpuid changes. > > 2> What makes sense is, if a feature is > > a. disabled by guest CPUID, it shall not be exposed in guest VMX MSR; > > b. enabled by guest CPUID, it could be either exposed or hidden in > > guest VMX MSR. > > 3> So your previous change is to guarantee 2.a, and userspace VMM can choose > > to follow follow either choices in 2.b(depending on whether it believes this > > feature is correctly supported by KVM in nested). > > > > Is above understanding correct? > > Not quite. Again, in an ideal world, KVM would not modify the VMX MSRs based on > guest CPUID. But it's possible userspace is relying on KVM to hide a feature from > L2 if it's hidden from L1, so to avoid breaking an otherwise valide userspace config, > it's worth enforcing that in KVM. > Sorry, maybe I should understand this way: In theroy, KVM shall not modify guest VMX MSRs in response to the guest CPUID updates. Therefore we shall not enforce the exposure of a feature in guest VMX MSR, just because it is enabled in guest CPUID (e.g., userspace VMM can choose to hide such feature so long as it believes KVM can not provide correct nested support for this feature). But in reverse, it is not reasonable for userspace VMM to expose a feature in guest VMX MSR settings, if such feature is disabled in this guest's CPUID. So KVM shall help to make sure such feature is hidden when guest CPUID changes. BTW, I found my previous understanding of what vmx_adjust_secondary_exec_control() currently does was also wrong. It could also be used for EXITING controls. And for such flags(e.g., SECONDARY_EXEC_RDRAND_EXITING), values for the nested settings (vmx->nested.msrs.secondary_ctls_high) and for the L1 execution controls(*exec_control) could be opposite. So the statement: "1> For now, what vmx_adjust_secondary_exec_control() does, is to enable/ disable a feature in VMX MSR(and nVMX MSR) based on cpuid changes." is wrong. Hopefully we are gonna change vmx_adjust_secondary_exec_control() soon... B.R. Yu