"H. Peter Anvin" <hpa@xxxxxxxxx> writes: > Jeremy Fitzhardinge wrote: >> H. Peter Anvin wrote: >>> The more I'm thinking about this whole issue, the more I'm thinking this >>> is really misdirected. >>> >>> I'm starting to think that the right thing to do might very well be as >>> follows: >>> >>> - Change the payload of the bzImage file from a flat binary to an ELF >>> file -- a stripped vmlinux.gz. >>> - Change the decompressor to incorporate a (simple) ELF parser. >>> - Incorporate a couple of pointer fields in the bzImage header that >>> points directly to the payload, the format of which can be identified >>> via its magic number (currently gzip). >> >> I don't see a lot of benefit in doing this. Having some form of >> executable I can just run is probably simpler for me to implement than >> trying to unwrap a vmlinux.gz from an envelope, even if that's what >> we're currently booting. The Xen domain builder simply wants an ELF >> file it can load; if its a self-decompressing kernel image, then that's >> OK. The only real problem is setting up the initial mappings so that >> the booting code can run as if paging is enabled. The simplest answer >> to that is to simply 1:1 map all lowmem and set it running, but having >> specific Phdrs laying out the mappings would be a bit more refined. >> >> Now if the bzImage were simply some 16-bit startup code prepended onto >> a self-decompressing 32-bit ELF kernel file, then that would be fairly >> easy to deal with too, since that would just be a matter of looking at >> the boot_params to find the start of the ELF file and continue from >> there. > > You're focusing entirely on Xen here, but if Xen really is the only user > of this, then I say there isn't much benefit into doing this at all. > Hopefully we're trying to solve something bigger than the Xen domain > builder being way too rigidly written, here. We are. But it is honest to focus on the problems that you see. Jeremy I believe has already committed to changing the Xen domain builder. I haven't seen much that Jeremy has been saying with respect to Xen that I haven't noted when working on /sbin/kexec. However there is one more thing that is probably worth addressing. Having a bImage target where we do not compress the raw kernel binary. For embedded targets like OLPC they can actually store the kernel in less room by using a separate decompressor. I don't think it would be very interesting to our traditional > So it really comes down to: what are the bigger problems we're solving? There are two basic core problems. 1) Cleaning starting a kernel in it's native mode. (32bit x86/64bit x86_64). 2) For bootloaders that boot a diverse set of standalone programs a minimal amount of code needed to support something different. For kexec this is the cross architecture problem. For things like Xen or Grub this is the multiple operating systems. Keeping to a standard format helps here. Reporting information about our selves that a standard query mechanism can use helps here. I think we have seen this problems often enough it is worth solving things relatively cleanly. >> I'm not sure why you say that. Rusty managed to get unmodified >> bzImage's booting in lguest within a few hours of hacking. I don't >> expect Xen would be much different. > > Anything that's a true virtualizer should just be able to load and run a > bzImage file from the 16-bit entrypoint, obviously. That's not what > Rusty is doing, but all he'd need is the bit (already proposed) to > inhibit cli and segment reloads. The refined bit of more accurate headers reporting which memory the kernel can use before it runs may be nice, and in there. Right now I suspect that changing the decompressor in the bzImage so that we are decompressing a vmlinux and then loading that could be a pain. At the moment we don't have code on the table for that. Further if we manually roll our own headers and are not slaves to binutils we get a level of control that we might not have otherwise, and that we may need. In particular we can set ET_DYN and set physical == virtual with no bad effects. binutils won't let us set ET_DYN on the program header when we are ``relocatable'' because it doesn't understand our what we are doing to make the kernel load time relocatable. We can't set physical == virtual on vmlinux or we would not be able to debug the loaded kernel. Setting physical == virtual in the bootloader context removes one area of ambiguity. So we have 3 pieces to this puzzle. - The linux boot parameters enhancement (I proposed some specific things that seemed essentially agreed upon). In particular: - A flag to say skip the cli and segment reload. (We can just kill the cli (redundant) but things are just touchy enough I'm not comfortable with unconditionally throwing out the segment reload code, and a flag that we set by default is cheap) - A word to store the environment we are in. - A pointer to environment specific data. - ELF headers and ELF notes for general flexibility. I think this will take a little more discussion, but we are in essential agreement here. - Where to put the ELF headers and ELF notes. We have one patch on the table. We have an alternative suggestion on the table. The 16bit startup code at this point is modestly interesting, and I think we should have a standard way to find it so that bootloaders that want to use advanced features that can use it have the option. However there is a very common intersection between pc's without an x86 BIOS (so we can't run 16bit code) and bootloaders doing things differently. Eric _______________________________________________ Virtualization mailing list Virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx https://lists.linux-foundation.org/mailman/listinfo/virtualization