On Mon 10-09-18 14:32:16, Pavel Tatashin wrote: > On Mon, Sep 10, 2018 at 10:19 AM Michal Hocko <mhocko@xxxxxxxxxx> wrote: > > > > On Mon 10-09-18 14:11:45, Pavel Tatashin wrote: > > > Hi Michal, > > > > > > It is tricky, but probably can be done. Either change > > > memmap_init_zone() or its caller to also cover the ends and starts of > > > unaligned sections to initialize and reserve pages. > > > > > > The same thing would also need to be done in deferred_init_memmap() to > > > cover the deferred init case. > > > > Well, I am not sure TBH. I have to think about that much more. Maybe it > > would be much more simple to make sure that we will never add incomplete > > memblocks and simply refuse them during the discovery. At least for now. > > On x86 memblocks can be upto 2G on machines with over 64G of RAM. sorry I meant pageblock_nr_pages rather than memblocks. > Also, memory size is way to easy too change via qemu arguments when VM > starts. If we simply disable unaligned trailing memblocks, I am sure > we would get tons of noise of missing memory. > > I think, adding check_hotplug_memory_range() would work to fix the > immediate problem. But, we do need to figure out a better solution. > > memblock design is based on archaic assumption that hotplug units are > physical dimms. VMs and hypervisors changed all of that, and we can > have much finer hotplug requests on machines with huge DIMMs. Yet, we > do not want to pollute sysfs with millions of tiny memory devices. I > am not sure what a long term proper solution for this problem should > be, but I see that linux hotplug/hotremove subsystems must be > redesigned based on the new requirements. Not an easy task though. Anyway, sparse memory modely is highly based on memory sections so it makes some sense to have hotplug section based as well. Memblocks as a higher logical unit on top of that is kinda hack. The userspace API has never been properly thought through I am afraid. -- Michal Hocko SUSE Labs