On 02/20/20 at 11:38am, Michal Hocko wrote: > On Thu 20-02-20 12:33:09, Baoquan He wrote: > > Memory sub-section hotplug was added to fix the issue that nvdimm could > > be mapped at non-section aligned starting address. A subsection map is > > added into struct mem_section_usage to implement it. However, sub-section > > is only supported in VMEMMAP case. > > Why? Is there any fundamental reason or just a lack of implementation? > VMEMMAP should be really only an implementation detail unless I am > missing something subtle. Thanks for checking. VMEMMAP is one of two ways to convert a PFN to the corresponding 'struct page' in SPARSE model. I mentioned them as VMEMMAP case, or !VMEMMAP case because we called them like this previously when reviewed patches, hope it won't cause confusion. Currently, config ZONE_DEVICE depends on SPARSEMEM_VMEMMAP. The subsection_map is added to struct mem_section_usage to track which sub section is present, VMEMMAP fills those bits which corresponding sub-sections are present, while !VMEMMAP, namely classic SPARSE, fills the whole map always. As we know, VMEMMAP builds page table to map a cluster of 'struct page' into the corresponding area of 'vmemmap'. Subsection hotplug can be supported naturally, w/o any change, just map needed region related to sub-sections on demand. For !VMEMMAP, it allocates memmap with alloc_pages() or vmalloc, thing is a little complicated, e.g the mixed section, boot memory occupies the starting area, later pmem hot added to the rear part. About !VMEMMAP which doesn't support sub-section hotplog, Dan said it's more because the effort and maintenance burden outweighs the benefit. And the current 64 bit ARCHes all enable SPARSEMEM_VMEMMAP_ENABLE by default. So no need to keep subsection_map and its handling in SPARSE|!VMEMMAP. > > > Hence there's no need to operate > > subsection map in SPARSEMEM|!VMEMMAP case. In this patchset, change > > codes to make sub-section map and the relevant operation only available > > in VMEMMAP case. > > > > And since sub-section hotplug added, the hot add/remove functionality > > have been broken in SPARSEMEM|!VMEMMAP case. Wei Yang and I, each of us > > make one patch to fix one of the failures. In this patchset, the patch > > 1/7 from me is used to fix the hot remove failure. Wei Yang's patch has > > been merged by Andrew. > > Not sure I understand. Are there more issues to be fixed? Only these two. Wei Yang firstly posted the patch to fix the hot add failure in SPARSE|!VMEMMAP. When I reviewed his patch and tested, found hot remove failed too. So the patch 1/7 is to fix the hot remove failure in !VMEMMAP. With these two patches, hot add/remove works well in !VMEMMAP. Not sure if it's clear. > > include/linux/mmzone.h | 2 + > > mm/sparse.c | 178 +++++++++++++++++++++++++++++------------ > > 2 files changed, 127 insertions(+), 53 deletions(-) > > Why do we need to add so much code to remove a functionality from one > memory model? Hmm, Dan also asked this before. The adding mainly happens in patch 2, 3, 4, including the two newly added function defitions, the code comments above them, and those added dummy functions for !VMEMMAP. Thanks Baoquan