From: Lai Jiangshan <jiangshan.ljs@xxxxxxxxxxxx> (Request For Help for testing on AMD machine with 32 bit L1 hypervisor, see information below) KVM handles root pages specially for these cases: direct mmu (nonpaping for 32 bit guest): gCR0_PG=0 shadow mmu (shadow paping for 32 bit guest): gCR0_PG=1,gEFER_LMA=0,gCR4_PSE=0 gCR0_PG=1,gEFER_LMA=0,gCR4_PSE=1 direct mmu (NPT for 32bit host): hEFER_LMA=0 shadow nested NPT (for 32bit L1 hypervisor): gCR0_PG=1,gEFER_LMA=0,gCR4_PSE=0,hEFER_LMA=0 gCR0_PG=1,gEFER_LMA=0,gCR4_PSE=1,hEFER_LMA=0 gCR0_PG=1,gEFER_LMA=0,gCR4_PSE={0|1},hEFER_LMA=1,hCR4_LA57={0|1} Shadow nested NPT for 64bit L1 hypervisor: gEFER_LMA=1,gCR4_LA57=0,hEFER_LMA=1,hCR4_LA57=1 They are either using special roots or matched the condition ((mmu->shadow_root_level > mmu->root_level) && !mm->direct_map) (refered as level expansion) or both. All the cases are using special roots except the last one. Many cases are doing level expansion including the last one. When special roots are used, the root page will not be backed by kvm_mmu_page. So they must be treated specially, but not all places is considering this problem, and Sean is adding some code to check this special roots. When level expansion, the kvm treats them silently always. These treaments incur problems or complication, see the changelog of every patch. These patches were made when I reviewed all the usage of shadow_root_level and root_level. Many root level patches are sent and accepted. These patches has not been tested with shadow NPT cases listed above. Because I don't have guest images can act as 32 bit L1 hypervisor, nor I can access to AMD machine with 5 level paging. I'm a bit reluctant to ask for the resource, so I send the patches and wish someone test them and modify them. At least, it provides some thinking and reveals problems of the existing code and of the AMD cases. ( *Request For Help* here.) These patches have been tested with the all cases except the shadow-NPT cases, the code coverage is believed to be more than 95% (hundreds of code related to shadow-NPT are shoved, and be replaced with common role.pae_root and role.glevel code with only 8 line of code is added for shadow-NPT, only 2 line of code is not covered in my tests). Cleanup patches (such as use role.glevel instead of !role.direct) will be sent after this patchset is queued. [V2]: https://lore.kernel.org/lkml/20220329153604.507475-1-jiangshanlai@xxxxxxxxx/ [V1]: https://lore.kernel.org/lkml/20211210092508.7185-1-jiangshanlai@xxxxxxxxx/ Changed from V2: Instroduce role.glevel instead of role.passthrough Changed from V1: Apply Sean's comments and suggestion. (Too much to list. Thanks!) Add some comments. Change changelog for role.pae_root patch. Lai Jiangshan (4): KVM: X86: Add arguement gfn and role to kvm_mmu_alloc_page() KVM: X86: Introduce role.glevel for level expanded pagetable KVM: X86: Alloc role.pae_root shadow page KVM: X86: Use passthrough and pae_root shadow page for 32bit guests Documentation/virt/kvm/mmu.rst | 9 + arch/x86/include/asm/kvm_host.h | 16 +- arch/x86/kvm/mmu/mmu.c | 399 +++++++++----------------------- arch/x86/kvm/mmu/paging_tmpl.h | 15 +- 4 files changed, 138 insertions(+), 301 deletions(-) -- 2.19.1.6.gb485710b