On 14.04.20 08:40, Baoquan He wrote: > On 04/13/20 at 08:15am, Eric W. Biederman wrote: >> Baoquan He <bhe@xxxxxxxxxx> writes: >> >>> On 04/12/20 at 02:52pm, Eric W. Biederman wrote: >>>> >>>> The only benefit of kexec_file_load is that it is simple enough from a >>>> kernel perspective that signatures can be checked. >>> >>> We don't have this restriction any more with below commit: >>> >>> commit 99d5cadfde2b ("kexec_file: split KEXEC_VERIFY_SIG into KEXEC_SIG >>> and KEXEC_SIG_FORCE") >>> >>> With KEXEC_SIG_FORCE not set, we can use kexec_load_file to cover both >>> secure boot or legacy system for kexec/kdump. Being simple enough is >>> enough to astract and convince us to use it instead. And kexec_file_load >>> has been in use for several years on systems with secure boot, since >>> added in 2014, on x86_64. >> >> No. Actaully kexec_file_load is the less capable interface, and less >> flexible interface. Which is why it is appropriate for signature >> verification. > > Well, everyone has a stance and the corresponding view. You could have > wider view from long time maintenance and in upstrem position, and think > kexec_file_load is horrible. But I can only see from our work as a front > line engineer to maintain/develop kexec/kdump in RHEL, and think > kexec_file_load is easier to maintain. > > Surely except of multiple kernel image format support. No matter it is > kexec_load and kexec_file_load, e.g in x86_64, we only support bzImage. > This is produced from kerel building by default. We have no way to > support it in our distros and add it into kexec_file_load. > > [RFC PATCH] x86/boot: make ELF kernel multiboot-able > https://lkml.org/lkml/2017/2/15/654 > >> >>>> kexec_load in every other respect is the more capable and functional >>>> interface. It makes no sense to get rid of it. >>>> >>>> It does make sense to reload with a loaded kernel on memory hotplug. >>>> That is simple and easy. If we are going to handle something in the >>>> kernel it should simple an automated unloading of the kernel on memory >>>> hotplug. >>>> >>>> >>>> I think it would be irresponsible to deprecate kexec_load on any >>>> platform. >>>> >>>> I also suspect that kexec_file_load could be taught to copy the dtb >>>> on arm32 if someone wants to deal with signatures. >>>> >>>> We definitely can not even think of deprecating kexec_load until >>>> architecture that supports it also supports kexec_file_load and everyone >>>> is happy with that interface. That is Linus's no regression rule. >>> >>> I should pick a milder word to express our tendency and tell our plan >>> then 'obsolete'. Even though I added 'gradually', seems it doesn't help >>> much. I didn't mean to say 'deprecate' at all when replied. >>> >>> The situation and trend I understand about kexec_load and kexec_file_load >>> are: >>> >>> 1) Supporting kexec_file_load is suggested to add in ARCHes which don't >>> have yet, just as x86_64, arm64 and s390 have done; >>> >>> 2) kexec_file_load is suggested to use, and take precedence over >>> kexec_load in the future, if both are supported in one ARCH. >> >> The deep problem is that kexec_file_load is distinctly less expressive >> than kexec_load. >> >>> 3) Kexec_load is kept being used by ARCHes w/o kexc_file_load support, >>> and by ARCHes for back compatibility w/ kexec_file_load support. >>> >>> For 1) and 2), I think the reason is obvious as Eric said, >>> kexec_file_load is simple enough. And currently, whenever we got a bug >>> report, we may need fix them twice, for kexec_load and kexec_file_load. >>> If kexec_file_load is made by default, e.g on x86_64, we will change it >>> in kernel space only, for kexec_file_load. This is what I meant about >>> 'obsolete gradually'. I think for arm64, s390, they will do these too. >>> Unless there's some critical/blocker bug in kexec_load, to corrupt the >>> old kexec_load interface in old product. >> >> Maybe. The code that kexec_file_load sucked into the kernel is quite >> stable and rarely needs changes except during a port of kexec to >> another architecture. >> >> Last I looked the real maintenance effor of kexec and kexec on panic was >> in the drivers. So I don't think we can use maintenance to do anything. > > Not sure if I got it. But if check Lianbo's patches, a lot of effort has > been taken to make SEV work well on kexec_file_load. And we have > switched to use kexec_file_load in the newly published Fedora release > on x86_64 by default. Before this, Lianbo has investigated and done many > experiments to make sure the switching is safe. We finally made this > decision. Next we will do the switch in Enterprise distros. Once these > are proved safe, we will suggest customers to use kexec_file_load for > kexec rebooting too. In the future, we will only care about > kexec_file_load if everying is going well. But as I have explained > repeatedly, only caring about kexec_file_load means we will leave > kexec_load as is, we will not add new feature or improvement patches > for it. > > commit 6a20bd54473e11011bf2b47efb52d0759d412854 > Author: Lianbo Jiang <lijiang@xxxxxxxxxx> > Date: Thu Jan 16 13:47:35 2020 +0800 > > kdump-lib: switch to the kexec_file_load() syscall on x86_64 by default > >> >>> For 3), people can still use kexec_load and develop/fix for it, if no >>> kexec_file_load supported. But 32-bit arm should be a different one, >>> more like i386, we will leave it as is, and fix anything which could >>> break it. But people really expects to improve or add feature to it? E.g >>> in this patchset, the mem hotplug issue James raised, I assume James is >>> focusing on arm64, x86_64, but not 32-bit arm. As DavidH commented in >>> another reply, people even don't agree to continue supporting memory >>> hotplug on 32-bit system. We ever took effort to fix a memory hotplug >>> bug on i386 with a patch, but people would rather set it as BROKEN. >> >> For memory hotplug just reload. Userspace already gets good events. > > Kexec_file_load is easy to maintain. This is an example. > > Lock the hotplug area where kexed-ed kernel is targeted in this patchset, > it's obviously not right. We can't disable memory hotplug just because > kexec-ed kernel is loaded ahead of time. > > Reloading is also not a good fix. Kexec-ed kernel is targeted at a > movable area, reloading can avoid kexec rebooting corruption if that > area is hot removed. But if that area is not removed, locating kernel > into the hotpluggable area will change the area into ummovable zone. > Unless we decide to not support memory hotplug in kexec-ed kernel, I > guess it's very hard. Now in our distros kexec rebooting has been > supported, the big cloud providers are deploying linux in guest, bugs on > kexec reboot failure has been reported. They need the memory hotplug to > increase/decrease memory. > > The root cause is kexec-ed kernel is targeted at hotpluggable memory > region. Just avoiding the movable area can fix it. In kexec_file_load(), > just checking or picking those unmovable region to put kernel/initrd in > function locate_mem_hole_callback() can fix it. The page or pageblock's > zone is movable or not, it's easy to know. This fix doesn't need to > bother other component. I don't fully agree. E.g., just because memory is onlined to ZONE_NORMAL does not imply that it cannot get offlined and removed e.g., this is heavily used on ppc64, with 16MB sections. -- Thanks, David / dhildenb