On Thu 07-06-18 19:55:53, Hanjun Guo wrote: > On 2018/6/7 18:55, Michal Hocko wrote: [...] > > I am not sure I have the full context but pci_acpi_scan_root calls > > kzalloc_node(sizeof(*info), GFP_KERNEL, node) > > and that should fall back to whatever node that is online. Offline node > > shouldn't keep any pages behind. So there must be something else going > > on here and the patch is not the right way to handle it. What does > > faddr2line __alloc_pages_nodemask+0xf0 tells on this kernel? > > The whole context is: > > The system is booted with a NUMA node has no memory attaching to it > (memory-less NUMA node), also with NR_CPUS less than CPUs presented > in MADT, so CPUs on this memory-less node are not brought up, and > this NUMA node will not be online (but SRAT presents this NUMA node); > > Devices attaching to this NUMA node such as PCI host bridge still > return the valid NUMA node via _PXM, but actually that valid NUMA node > is not online which lead to this issue. But we should have other numa nodes on the zonelists so the allocator should fall back to other node. If the zonelist is not intiailized properly, though, then this can indeed show up as a problem. Knowing which exact place has blown up would help get a better picture... -- Michal Hocko SUSE Labs