On Thu, Dec 07, 2023 at 06:06:38PM +0100, Jeremi Piotrowski wrote: > > > >> This doesn't work in partitioning when TDVMCALLs go to L0: TDVMCALL_MAP_GPA bypasses > >> L1 and TDX_ACCEPT_PAGE is L1 responsibility. > >> > >> If you want to see how this is currently supported take a look at arch/x86/hyperv/ivm.c. > >> All memory starts as private and there is a hypercall to notify the paravisor for both > >> TDX (when partitioning) and SNP (when VMPL). This guarantees that all page conversions > >> go through L1. > > > > But L1 guest control anyway during page conversion and it has to manage > > aliases with TDG.MEM.PAGE.ATTR.RD/WR. Why do you need MAP_GPA for that? > > > > When the L2 wants to perform a page conversion it needs to notify L1 of this so that it > can do its part managing the aliases. Without L1 involvement the conversion doesn't > happen. MAP_GPA is not suitable for this purpose as I've described and you've confirmed > above. Memory conversion causes exit to L1 as there will be no aliases in L2 otherwise. There's no need to intercept MAP_GPA for that. See section 21.8 of TD partitioning spec. > > > One possible change I mentioned above: make TDVMCALL exit to L1 for some > > TDVMCALL leafs (or something along the line). > > > > You can explore changes to TDVMCALL handling in the TDX module but I don't see any reason > this would be adopted, because a shared hypercall to control page visibility for SNP & TDX is > already part of Hyper-V ABI and works great for this purpose. > > > I would like to keep it transparent for enlightened TDX Linux guest. It > > should not care if it runs as L1 or as L2 in your environment. > > I understand that is how you would prefer it but, as we've established in these emails, > that doesn't work when the L1 paravisor provides services to the L2 with an L1 specific > protocol and TDVMCALLs are routed to L0 for performance reasons. It can't be done > transparently with TDX 1.5 calls alone and we already have TDX 1.5 deployed to users with > an upstream kernel. TDX 1.5 is not set in stone (yet). The spec is still draft. We can add capabilities if we make case for them. Let's try to shift the discussion to how to make TDX better rather than adding workaround to kernel. -- Kiryl Shutsemau / Kirill A. Shutemov