Hi tejun, On 08/21/2013 09:06 PM, Tejun Heo wrote: > Hello, > > On Wed, Aug 21, 2013 at 06:15:35PM +0800, Tang Chen wrote: >> [What are we doing] >> >> We are trying to initialize acip tables as early as possible. But Linux kernel >> allows users to override acpi tables by specifying their own tables in initrd. >> So we have to do acpi_initrd_override() earlier first. > > So, are we now back to making SRAT info as early as possible? What > happened to just co-locating early allocations close to kernel image? > What'd be the benefit of doing this over that? We know you are trying to give the direction to make the change more natural and robust and very thankful for your comments. We have taken your comments and suggestions about co-locating early allocations close to kernel image into consideration, but still we found that not that easy. In current boot order, before we get the SRAT, we have a big consumer of early allocations: we are setting up the page table in top-down (The idea was proposed by HPA, Link: https://lkml.org/lkml/2012/10/4/701). That said, this kind of page table setup will make the page tables as high as possible in memory, since memory at low addresses is precious (for stupid DMA devices, for things like kexec/kdump, and so on.) So if we are trying to make early allocations close to kernel image, we should rewrite the way we are setting up page table totally. That is not a easy thing to do. As for the benefits of the patchset, just as Tang said in this patch, * For memory hotplug, we need ACPI SRAT at early time to be aware of which memory ranges are hotpluggable, and tell the kernel to try to stay away from hotpluggable nodes. This one is the current requirement of us but may be very helpful for future change: * As suggested by Yinghai, we should allocate page tables in local node. This also needs SRAT before direct mapping page tables are setup. * As mentioned by Toshi Kani <toshi.kani@xxxxxx>, ACPI SCPR/DBGP/DBG2 tables allow the OS to initialize serial console/debug ports at early boot time. The earlier it can be initialized, the better this feature will be. These tables are not currently used by Linux due to a licensing issue, but it could be addressed some time soon. So we decided to firstly make ACPI override earlier and use BRK (this is obviously near the kernel image range) to store the found ACPI tables. -- Thanks. Zhang Yanfei -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>