On 23/11/2023 14:58, Kirill A. Shutemov wrote: > On Wed, Nov 22, 2023 at 06:01:04PM +0100, Jeremi Piotrowski wrote: >> Check for additional CPUID bits to identify TDX guests running with Trust >> Domain (TD) partitioning enabled. TD partitioning is like nested virtualization >> inside the Trust Domain so there is a L1 TD VM(M) and there can be L2 TD VM(s). >> >> In this arrangement we are not guaranteed that the TDX_CPUID_LEAF_ID is visible >> to Linux running as an L2 TD VM. This is because a majority of TDX facilities >> are controlled by the L1 VMM and the L2 TDX guest needs to use TD partitioning >> aware mechanisms for what's left. So currently such guests do not have >> X86_FEATURE_TDX_GUEST set. >> >> We want the kernel to have X86_FEATURE_TDX_GUEST set for all TDX guests so we >> need to check these additional CPUID bits, but we skip further initialization >> in the function as we aren't guaranteed access to TDX module calls. > > I don't follow. The idea of partitioning is that L2 OS can be > unenlightened and have no idea if it runs indide of TD. But this patch > tries to enumerate TDX anyway. > > Why? > That's not the only idea of partitioning. Partitioning provides different privilege levels within the TD, and unenlightened L2 OS can be made to work but are inefficient. In our case Linux always runs enlightened (both with and without TD partitioning), and uses TDX functionality where applicable (TDX vmcalls, PTE encryption bit). There have been long discussions on LKML about how CoCo features should be supported, I've followed most of them and I believe we've converged on: the kernel is fully aware what kind of guest it is (SNP/TDX) and uses CC_ATTR_XXX to check for specific SNP/TDX features. Right now the guest with TD partitioning is missing out on X86_FEATURE_TDX_GUEST. That's why this patch tries to enumerate TDX. I have posted an alternate version of this patch for discussion here: https://lore.kernel.org/lkml/0799b692-4b26-4e00-9cec-fdc4c929ea58@xxxxxxxxxxxxxxxxxxx/