From: Tejun Heo <tj@xxxxxxxxxx> Date: Thu, 09 Apr 2009 04:48:12 -0700 > > David Miller wrote: >> void __init setup_per_cpu_areas(void) >> { >> - unsigned long size, i, nr_possible_cpus = num_possible_cpus(); >> - char *ptr; >> + size_t dyn_size, static_size = __per_cpu_end - __per_cpu_start; >> + static struct vm_struct vm; >> + unsigned long delta, cpu; >> + size_t pcpu_unit_size; >> + size_t ptrs_size; >> + >> + pcpur_size = PFN_ALIGN(static_size + PERCPU_MODULE_RESERVE + >> + PERCPU_DYNAMIC_RESERVE); >> + dyn_size = pcpur_size - static_size - PERCPU_MODULE_RESERVE; > > Isn't it better to use embedding allocator for !NUMA cases (one less > TLB entry usage for each CPU)? Heck, the embedding case would probably be optimal for Niagara NUMA systems too. On Niagara systems all of the "possible" cpu numbers are linear and in order. No holes, gaps, or other stuff like this. So just allocating big TLB mapping chunks and chopping them up to the individual cpus is the best scheme possible. Indeed, these are the kinds of things I plan to experiment with. -- To unsubscribe from this list: send the line "unsubscribe sparclinux" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html