* Sai Praneeth Prakhya <sai.praneeth.prakhya@xxxxxxxxx> wrote: > Ideally, after kernel assumes control of the platform firmware shouldn't > access EFI Boot Services Code/Data regions. But, it's noticed that this > is not so true in many x86 platforms. Hence, during boot, kernel > reserves efi boot services code/data regions [1] and maps [2] them to > efi_pgd so that call to set_virtual_address_map() doesn't fail. After > returning from set_virtual_address_map(), kernel frees the reserved > regions [3] but they still remain mapped. > > This means that any code that's running in efi_pgd address space (e.g: > any efi runtime service) would still be able to access efi boot services > code/data regions but the contents of these regions would have long been > over written by someone else as they are freed by efi_free_boot_services(). > So, it's important to unmap these regions. After unmapping boot services > code/data regions, any illegal access by buggy firmware to these regions > would result in page fault which will be handled by efi specific fault > handler. > > [1] Please see efi_reserve_boot_services() > [2] Please see efi_map_region() -> __map_region() > [3] Please see efi_free_boot_services() > > Signed-off-by: Sai Praneeth Prakhya <sai.praneeth.prakhya@xxxxxxxxx> > Cc: Borislav Petkov <bp@xxxxxxxxx> > Cc: Ingo Molnar <mingo@xxxxxxxxxx> > Cc: Andy Lutomirski <luto@xxxxxxxxxx> > Cc: Dave Hansen <dave.hansen@xxxxxxxxx> > Cc: Bhupesh Sharma <bhsharma@xxxxxxxxxx> > Cc: Thomas Gleixner <tglx@xxxxxxxxxxxxx> > Cc: Peter Zijlstra <peterz@xxxxxxxxxxxxx> > Cc: Ard Biesheuvel <ard.biesheuvel@xxxxxxxxxx> > --- > arch/x86/include/asm/pgtable_types.h | 2 ++ > arch/x86/mm/pageattr.c | 21 +++++++++++++++++++++ > arch/x86/platform/efi/quirks.c | 26 ++++++++++++++++++++++++++ > 3 files changed, 49 insertions(+) > > diff --git a/arch/x86/include/asm/pgtable_types.h b/arch/x86/include/asm/pgtable_types.h > index b64acb08a62b..796476f11151 100644 > --- a/arch/x86/include/asm/pgtable_types.h > +++ b/arch/x86/include/asm/pgtable_types.h > @@ -566,6 +566,8 @@ extern pmd_t *lookup_pmd_address(unsigned long address); > extern phys_addr_t slow_virt_to_phys(void *__address); > extern int kernel_map_pages_in_pgd(pgd_t *pgd, u64 pfn, unsigned long address, > unsigned numpages, unsigned long page_flags); > +extern int kernel_unmap_pages_in_pgd(pgd_t *pgd, u64 pfn, unsigned long address, > + unsigned long numpages); > #endif /* !__ASSEMBLY__ */ > > #endif /* _ASM_X86_PGTABLE_DEFS_H */ > diff --git a/arch/x86/mm/pageattr.c b/arch/x86/mm/pageattr.c > index 51a5a69ecac9..b88ed8e91790 100644 > --- a/arch/x86/mm/pageattr.c > +++ b/arch/x86/mm/pageattr.c > @@ -2147,6 +2147,27 @@ int kernel_map_pages_in_pgd(pgd_t *pgd, u64 pfn, unsigned long address, > return retval; > } > > +int kernel_unmap_pages_in_pgd(pgd_t *pgd, u64 pfn, unsigned long address, > + unsigned long numpages) > +{ > + int retval; > + > + struct cpa_data cpa = { > + .vaddr = &address, > + .pfn = pfn, > + .pgd = pgd, > + .numpages = numpages, > + .mask_set = __pgprot(0), > + .mask_clr = __pgprot(_PAGE_PRESENT | _PAGE_RW), > + .flags = 0, > + }; > + > + retval = __change_page_attr_set_clr(&cpa, 0); > + __flush_tlb_all(); > + > + return retval; > +} That's certainly a creative use of __change_page_attr_set_clr() by EFI used for mapping in pages so far (kernel_map_pages_in_pgd()), and now used for unmapping as well. Doesn't look wrong, just a bit weird as part of CPA. Could you please write the initializer in an easier to read fashion: struct cpa_data cpa = { .vaddr = &address, .pfn = pfn, .pgd = pgd, .numpages = numpages, .mask_set = __pgprot(0), .mask_clr = __pgprot(_PAGE_PRESENT | _PAGE_RW), .flags = 0, }; ? The one bit that is odd is the cpa->pfn field - for unmapped pages that's totally uninteresting and I'm wondering whether setting it to 0 wouldn't be better. Does the CPU _ever_ look look at the PFN if the page is !_PAGE_PRESENT, for example speculatively? If yes then what is the recommended value for the pfn - zero perhaps? Also note that if for whatever reason the PFN range of the EFI boot area gets hot-unplugged, we'd have outright invalid PFNs - although this is probably very unlikely from a platform perspective. > +/* > + * Apart from having VA mappings for efi boot services code/data regions, > + * (duplicate) 1:1 mappings were also created as a catch for buggy firmware. So, > + * unmap both 1:1 and VA mappings. > + */ Speling nits: - please capitalize 'EFI' consistently. - s/catch/quirk ? BTW., are the 1:1 'boot mappings' a buggy firmware quirk, or something required by the EFI spec? (or both? ;-) > +static void __init efi_unmap_pages(efi_memory_desc_t *md) > +{ > + pgd_t *pgd = efi_mm.pgd; > + u64 pfn = md->phys_addr >> PAGE_SHIFT; Note that this md->phys_addr isn't really meaningful once it gets unmapped. > + > + if (kernel_unmap_pages_in_pgd(pgd, pfn, md->phys_addr, md->num_pages)) > + pr_err("Failed to unmap 1:1 mapping: PA 0x%llx -> VA 0x%llx!\n", > + md->phys_addr, md->virt_addr); > + > + if (kernel_unmap_pages_in_pgd(pgd, pfn, md->virt_addr, md->num_pages)) > + pr_err("Failed to unmap VA mapping: PA 0x%llx -> VA 0x%llx!\n", > + md->phys_addr, md->virt_addr); Please keep pr_err()'s in a single line. (and ignore checkpatch.) > +} > + > void __init efi_free_boot_services(void) > { > phys_addr_t new_phys, new_size; > @@ -415,6 +434,13 @@ void __init efi_free_boot_services(void) > } > > free_bootmem_late(start, size); > + > + /* > + * Before calling set_virtual_address_map(), boot services > + * code/data regions were mapped as a catch for buggy firmware. > + * Unmap them from efi_pgd as they have already been freed. > + */ > + efi_unmap_pages(md); Ditto. BTW., the ordering here is wrong: we should unmap any virtual aliases from pagetables _before_ we free the underlying memory. The ordering is probably harmless in this case but overall a good practice. Thanks, Ingo