On Thu 07-09-23 17:52:12, Wei Xu wrote: [...] > I tested this patch on a machine with 384 CPUs using a microbenchmark > that spawns 10K threads, each reading its memory.stat every 100 > milliseconds. This is rather extreme case but I wouldn't call it utterly insane though. > Most of memory.stat reads take 5ms-10ms in kernel, with > ~5% reads even exceeding 1 second. Just curious, what would numbers look like if the mutex is removed and those threads would be condending on the existing spinlock with lock dropping in place and removed. Would you be willing to give it a shot? -- Michal Hocko SUSE Labs