On 10/02/2013 10:05 AM, Borislav Petkov wrote: > > Btw, Matt just found another issue with the bottom-up approach - due to > different alignment of VA and PA addresses, this messes up the pagetable > in terms of the order in which we're using 4K, 2M, etc pages. > > What can happen is that, you can get a non-2M aligned PA mapped with > 2M-aligned VA which results in a #PF with PF_RSVD set, which most likely > happens because one or more of the bits in the [12:20] slice of the PMD > are reserved but they get set due to the PA having address bits set in > the aforementioned slice and thus a #PF is raised. > So this is a bug in the sense that 2M pages were used when they were not safe to use (matching alignment is part of the requirement for 2M pages being allowable.) However, we of course want to use 2M pages, so see below. > So we changed the mapping method to a more straight-forward one: we map > all EFI regions in the following range: > > [ efi_va - -4G ] > > and we compute efi_va by subtracting the highest EFI region address from > -4G, i.e. 0xffff_ffff_0000_0000. > > Then, each VA is computed by doing efi_va + PA. > > Oh, and the alignment remains compatible this way. > > So this mapping scheme - courtesy of Matt - is very straight-forward > and simple and I like simple. This way we won't need the setup_data > games with kexec tools as we'll be simply doing the same mappings in the > kexec'ed kernel. > > Anyway, I'll clean up the patch and send it out later. > We could achieve the same thing by doing alignment after subtracting the pointer. HOWEVER, it also goes to show that any mapping scheme is inherently fragile (consider if the mapping scheme above ends up consuming too much virtual space in the future), and as a result I really think that explicitly passing the map to the kexec kernel really is the only sane thing to do, as otherwise we have to maintain the same algorithm forever. -hpa -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html