On 05.10.2022 20:11, Demi Marie Obenour wrote: > On Wed, Oct 05, 2022 at 08:15:07AM +0200, Jan Beulich wrote: >> On 04.10.2022 17:46, Demi Marie Obenour wrote: >>> Linux has a function called efi_mem_reserve() that is used to reserve >>> EfiBootServicesData memory that contains e.g. EFI configuration tables. >>> This function does not work under Xen because Xen could have already >>> clobbered the memory. efi_mem_reserve() not working is the whole reason >>> for this thread, as it prevents EFI tables that are in >>> EfiBootServicesData from being used under Xen. >>> >>> A much nicer approach would be for Xen to reserve boot services memory >>> unconditionally, but provide a hypercall that dom0 could used to free >>> the parts of EfiBootServicesData memory that are no longer needed. This >>> would allow efi_mem_reserve() to work normally. >> >> efi_mem_reserve() actually working would be a layering violation; >> controlling the EFI memory map is entirely Xen's job. > > Doing this properly would require Xen to understand all of the EFI > tables that could validly be in EfiBootServices* and which could be of > interest to dom0. We don't need to understand the tables as long as none crosses memory map descriptor boundaries, and as long as they don't contain further pointers. > It might (at least on some very buggy firmware) > require a partial ACPI and/or SMBIOS implementation too, if the firmware > decided to put an ACPI or SMBIOS table in EfiBootServices*. I hope we won't need to go that far; on such systems -mapbs will continue to be needed. >> As to the hypercall you suggest - I wouldn't mind its addition, but only >> for the case when -mapbs is used. As I've indicated before, I'm of the >> opinion that default behavior should be matching the intentions of the >> spec, and the intention of EfiBootServices* is for the space to be >> reclaimed. Plus I'm sure you realize there's a caveat with Dom0 using >> that hypercall: It might use it for regions where data lives which it >> wouldn't care about itself, but which an eventual kexec-ed (or alike) >> entity would later want to consume. Code/data potentially usable by >> _anyone_ between two resets of the system cannot legitimately be freed >> (and hence imo is wrong to live in EfiBootServices* regions). > > I agree, but currently some such data *is* in EfiBootServices* regions, > sadly. When -mapbs is *not* used, I recommend uninstalling all of the > configuration tables that point to EfiBootServicesData memory before > freeing that memory. Hmm, uninstalling isn't nice, as it may limit functionality. Instead we might go through all tables and fiddle with memap descriptors in case a pointer references an EfiBootServices* region (regardless of size, as per the first restriction mentioned above). (A more brute force approach might be to simply behave as if -mapbs was specified in such a case, provided we can reliably determine this early enough, i.e. before first checking the "map_bs" variable.) Tables actually known to us could also be relocated (like you've done for ESRT). Such checking could be extended to the runtime services function pointers. While that wouldn't cover cases where a function entry point is in runtime services space but the function then wrongly calls into or references boot services space, it would cover a few more (broken) systems. This, unlike behaving by default as if -mapbs was given, would be a workaround I'd accept to be enabled unconditionally, as it wouldn't affect well behaved systems (beyond the time it takes to carry out the checks, and provided the checking logic isn't buggy). There's one further caveat towards uninstalling (in a way also for your ESRT relocation code): The final memory map is known to us only when we can't call boot services functions anymore (i.e. in particular InstallConfigurationTable()). Jan