On Fri, 13 Nov 2009, Lee Schermerhorn wrote: > Ad hoc measurements on x86_64 using: hackbench 400 process 200 > > 2.6.32-rc5+mmotm-091101 no patch this series > x86_64 avg of 40: 4.605 4.628 ~0.5% Instructions become more efficient here. > Ia64 showed ~1.2% longer time with the series applied. IA64 can use the per cpu TLB entry to get to the numa node id with the platform specific per cpu handling. The per cpu implementation currently requires fallback. IA64 percpu ops could be reworked to avoid consulting the per cpu offset arrray which would make it equivalent to the current implementation. -- To unsubscribe from this list: send the line "unsubscribe linux-arch" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html