Hello Tejun, On Thu, 2013-10-10 at 11:35 -0400, Tejun Heo wrote: : > > > Are firmware writers gonna be > > > required to split SRAT entries into multiple sub-nodes to support it? > > > > Yes, and that's part of the ACPI spec. That's not something the OS > > requests to do. If a memory range has different attribute, firmware has > > to put it in a separate entry. > > I was referring to having to segment a contiguous hotplug memory area > further to support finer granularity. This is represented by separate > mem devices rather than segmented SRAT entries, right? Hmmm... so we > should parse device nodes before setting up page tables? Yes, a memory device object is the finest granularity of performing memory hotplug on ACPI based platforms. SRAT must be consistent with the memory device object info, but its entry does not have to be segmented by the device granularity. It only needs to be segmented when memory attribute is different. For instance, SRAT may have a single entry for Case A), but Case B) must have two separate entries. In both cases, MEMA & MEMB represent a contiguous memory range. Case A) Both MEMA and MEMB devices are hotpluggable MEMA: _CRS: 0x0000-0x0fff _EJ0: hotpluggable MEMB: _CRS: 0x1000-0x1fff _EJ0: hotpluggable SRAT: 0x0000-0x1ffff hotpluggable Case B) Only MEMB is hotpluggable MEMA: _CRS: 0x0000-0x0fff MEMB: _CRS: 0x1000-0x1fff _EJ0: hotpluggable SRAT: 0x0000-0x0fff 0x1000-0x1fff hotpluggable > > SRAT and _EJ0 method are the only interfaces that define ejectability in > > the standard spec. Are you suggesting us to change the e820 spec or not > > to comply with the spec? I do not think such approaches work. > > It's slower but standards get revised and updated over time. Have no > idea whether there'd be a sane way to do that for e820 tho. I am familiar with the process. Yes, it is slow, but most importantly, it needs some standard group or company to actively maintain the spec in order to update it. I do not think e820 is in such state. > > I think memory hotplug was originally implemented on ia64 with the node > > granularity. I share your concerns, but that's been done a long time > > ago. It's too late to complain the past. This SRAT work is not > > introducing such restriction. > > We're going round and round. You're saying that using SRAT isn't > worse than what came before while failing to illustrate how committing > to invasive changes would eventually lead to something better. "it > isn't worse" isn't much of an argument. We did avoid moving up the ACPI table init function per your suggestion. I guess I do not understand why you still concerned about using SRAT... Thanks, -Toshi -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>