Hi Tejun, Tejun Heo <tj@xxxxxxxxxx> writes: > Hello, Michael. > > On Tue, Aug 22, 2017 at 11:41:41AM +1000, Michael Ellerman wrote: >> > This is something powerpc needs to fix. >> >> There is no way for us to fix it. > > I don't think that's true. The CPU id used in kernel doesn't have to > match the physical one and arch code should be able to pre-map CPU IDs > to nodes and use the matching one when hotplugging CPUs. I'm not > saying that's the best way to solve the problem tho. We already virtualise the CPU numbers, but not the node IDs. And it's the node IDs that are really the problem. So yeah I guess we might be able to make that work, but I'd have to think about it a bit more. > It could be that the best way forward is making cpu <-> node mapping > dynamic and properly synchronized. We don't need it to be dynamic (at least for this bug). Laurent is booting Qemu with a fixed CPU <-> Node mapping, it's just that because some CPUs aren't present at boot we don't know what the node mapping is. (Correct me if I'm wrong Laurent). So all we need is: - the workqueue code to cope with CPUs that are possible but not online having NUMA_NO_NODE to begin with. - a way to update the workqueue cpumask when the CPU comes online. Which seems reasonable to me? cheers