On Sat 07-01-17 10:37:41, Eric Dumazet wrote: > On Sat, Jan 7, 2017 at 1:27 AM, Michal Hocko <mhocko@xxxxxxxxxx> wrote: > > On Fri 06-01-17 14:14:49, Eric Dumazet wrote: > > >> I believe the intent was to get NUMA spreading, a bit like what we have > >> in alloc_large_system_hash() when hashdist == HASHDIST_DEFAULT > > > > Hmm, I am not sure this works as expected then. Because it is more > > likely that all pages backing the vmallocked area will come from the > > local node than spread around more nodes. Or did I miss your point? > > Well, you missed that vmalloc() is aware of NUMA policies. You are right. I have missed that alloc_page ends up using alloc_pages_current for CONFIG_NUMA rather than alloc_pages_node. > If current process has requested interleave on 2 nodes (as it is done > at boot time on a dual node system), > then vmalloc() of 8 pages will allocate 4 pages on each node. On the other hand alloc_ila_locks does go over a single page when lockdep is enabled and I am wondering whether doing this NUMA subtle magic is any win... Also this seems to be an init code so I assume a modprobe would have to set a non-default policy to make use of it. Does anybody do that out there? alloc_bucket_locks is a bit more complicated because it is not only called from the init context. But considering that rhashtable_shrink is called from the worker context - so no mem policy can be assumed then I am wondering whether the code really works as expected. To me it sounds like it is trying to be clever while the outcome doesn't really do what it is intended. Would you mind if I just convert it to kvmalloc and make it easier to understand? -- Michal Hocko SUSE Labs -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>