On Wed, Jan 20, 2016 at 02:25:07PM +0900, AKASHI Takahiro wrote: > On 01/19/2016 11:01 PM, Mark Rutland wrote: > >For NUMA topology in !ACPI kernels, we might need to also retain and > >parse memory nodes, but only for toplogy information. The kernel would > >still only use memory as described by the EFI memory map. > > > >There's a horrible edge case I've spotted if performing a chain of > >cross-endian kexecs: LE -> BE -> LE, as the BE kernel would have to > >respect the EFI memory map so as to avoid corrupting it for the > >subsequent LE kernel. Other than this I believe everything should just > >work. > > BE kernel doesn't support UEFI yet and cannot access UEFI memmap table. So, > for LE -> BE, we don't use a dtb generated from /sys/firmware/fdt (or /proc/device-tree) > (as in the case of LE -> LE) and require users to provide a dtb file explicitly. As I mentioned above, the problem exists when memory nodes also exist (for describing NUMA topology). In that case the BE kernel would try to use the information from the memory nodes. > For BE -> LE, BE kernel doesn't know wther UEFI memmap table is available or not > and so use the same (explicitly-provided) dtb (as LE -> LE in !UEFI) See above. The problem I imagine is: LE kernel - uses EFI mmap, takes NUMA information from DT memory nodes v kexec BE kernel - uses DT memory nodes - clobbers EFI runtime regions as it sees them as available v kexec LE kernel - uses EFI mmap, takes NUMA information from DT memory nodes - tries to call EFI runtime services, and explodes. > >>>A kexec'd kernel should simply inherit that. So long as the DTB and/or > >>>UEFI tables in memory are the same, it would be the same as a cold boot. > >> > >>For kexec all memory ranges are same, for kdump we need use original reserved > >>range with crashkernel= as usable memory and all other orignal usable ranges > >>are not usable anymore. > > > >Sure. This is what I believe we should expose with an additional > >property under /chosen, while keeping everything else pristine. > > > >The crash kernel can then limit itself to that region, while it would > >have the information of the full memory map (which it could log and/or > >use to drive other dumping). > > FYI, > all the original usable memory regions used by the 1st kernel are also > described in an ELF core header specified by "elfcorehdr=" parameter to > the crash dump kernel. That only describes what the first kernel parsed and thus believed, not exactly what the firmware described. Thanks, Mark.