Andrew Morton <akpm at linux-foundation.org> writes: > On Wed, 19 Dec 2012 16:57:03 -0800 > ebiederm at xmission.com (Eric W. Biederman) wrote: > >> Andrew Morton <akpm at linux-foundation.org> writes: >> >> > Is there any way in which we can move some of this logic into the >> > kernel? In this case, add some kernel code which uses PageBuddy() on >> > behalf of makedumpfile, rather than replicating the PageBuddy() logic >> > in userspace? >> >> All that exists when makedumpfile runs is a core file. So it would have >> to be something like a share library that builds with the kernel and >> then makedumpfile loads. > > Can we omit free pages from that core file? > > And/or add a section to that core file which flags free pages? Ommitting pages is what makedumpfile does. Very loosely shortly after boot when things are running fine /sbin/kexec runs. /sbin/kexec constructs a set of elf headers that describe where the memory is and load the crashdump kernel an initrd and those elf headers into memory. Years later when the running kernel calls panic. panic calls machine_kexec machine_kexec jmps to the preloaded crashdump kernel. I think it is /proc/vmcore that reads the elf headers out of memory and presents them to userspace. Then we have options. vmcore-to-dmesg will just read the dmesg ring buffer so we have that. makedumpfile reads the kernel data structures and filters out the free pages for people who don't want to write everything to disk. So the basic interface is strongly kernel version agnostic. The challenge is how to filter out undesirable pages from the core dump quickly and reliably. Right now what we have are a set of ELF notes that describe struct page. For my uses I have either had enough disk space that saving everything didn't matter or so little disk space that all I could afford was getting out the dmesg ring buffer. So I don't know how robust the solution adopted by makedumpfile is. Eric