On Mon, Dec 18, 2023 at 07:53:45PM -0800, Jim Mattson <jmattson@xxxxxxxxxx> wrote: > On Mon, Dec 18, 2023 at 5:40 PM Isaku Yamahata > <isaku.yamahata@xxxxxxxxxxxxxxx> wrote: > > > > On Thu, Dec 14, 2023 at 08:41:43AM -0800, > > Sean Christopherson <seanjc@xxxxxxxxxx> wrote: > > > > > On Thu, Dec 14, 2023, Maxim Levitsky wrote: > > > > On Wed, 2023-12-13 at 15:10 -0800, Sean Christopherson wrote: > > > > > Upstream KVM's non-TDX behavior is fine, because KVM doesn't advertise support > > > > > for CPUID 0x15, i.e. doesn't announce to host userspace that it's safe to expose > > > > > CPUID 0x15 to the guest. Because TDX makes exposing CPUID 0x15 mandatory, KVM > > > > > needs to be taught to correctly emulate the guest's APIC bus frequency, a.k.a. > > > > > the TDX guest core crystal frequency of 25Mhz. > > > > > > > > I assume that TDX doesn't allow to change the CPUID 0x15 leaf. > > > > > > Correct. I meant to call that out below, but left my sentence half-finished. It > > > was supposed to say: > > > > > > I halfheartedly floated the idea of "fixing" the TDX module/architecture to either > > > use 1Ghz as the base frequency or to allow configuring the base frequency > > > advertised to the guest. > > > > > > > > I halfheartedly floated the idea of "fixing" the TDX module/architecture to either > > > > > use 1Ghz as the base frequency (off list), but it definitely isn't a hill worth > > > > > dying on since the KVM changes are relatively simple. > > > > > > > > > > https://lore.kernel.org/all/ZSnIKQ4bUavAtBz6@xxxxxxxxxx > > > > > > > > > > > > > Best regards, > > > > Maxim Levitsky > > > > The followings are the updated version of the commit message. > > > > > > KVM: x86: Make the hardcoded APIC bus frequency VM variable > > > > The TDX architecture hard-codes the APIC bus frequency to 25MHz in the > > CPUID leaf 0x15. The > > TDX mandates it to be exposed and doesn't allow the VMM to override > > its value. The KVM APIC timer emulation hard-codes the frequency to > > 1GHz. It doesn't unconditionally enumerate it to the guest unless the > > user space VMM sets the CPUID leaf 0x15 by KVM_SET_CPUID. > > > > If the CPUID leaf 0x15 is enumerated, the guest kernel uses it as the > > APIC bus frequency. If not, the guest kernel measures the frequency > > based on other known timers like the ACPI timer or the legacy PIT. > > The TDX guest kernel gets timer interrupt more times by 1GHz / 25MHz. > > > > To ensure that the guest doesn't have a conflicting view of the APIC > > bus frequency, allow the userspace to tell KVM to use the same > > frequency that TDX mandates instead of the default 1Ghz. > > > > There are several options to address this. > > 1. Make the KVM able to configure APIC bus frequency (This patch). > > Pros: It resembles the existing hardware. The recent Intel CPUs > > adapts 25MHz. > > Cons: Require the VMM to emulate the APIC timer at 25MHz. > > 2. Make the TDX architecture enumerate CPUID 0x15 to configurable > > frequency or not enumerate it. > > Pros: Any APIC bus frequency is allowed. > > Cons: Deviation from the real hardware. > > 3. Make the TDX guest kernel use 1GHz when it's running on KVM. > > Cons: The kernel ignores CPUID leaf 0x15. > > 4. Change CPUID.15H under TDX to report the crystal clock frequency as 1 GHz. > Pro: This has been the virtual APIC frequency for KVM guests for 13 years. > Pro: This requires changing only one hard-coded constant in TDX. > > I see no compelling reason to complicate KVM with support for > configurable APIC frequencies, and I see no advantages to doing so. Because TDX isn't specific to KVM, it should work with other VMM technologies. If we'd like to go for this route, the frequency would be configurable. What frequency should be acceptable securely is obscure. 25MHz has long history with the real hardware. -- Isaku Yamahata <isaku.yamahata@xxxxxxxxxxxxxxx>