On Mon, Feb 10, 2025 at 04:38:56PM +0100, Michal Koutný wrote:
...
> The challenge is with nr (assuming they're all runnable during Δt), that
> would need to be sampled from /sys/kernel/debug/sched/debug. But then
> you can get whatever load for individual cfs_rqs from there. Hm, does it
> even make sense to add up run_delays from different CPUs?

The difficulty in aggregating across CPUs is why some and full pressures
are defined the way they are. Ideally, we'd want the full distribution of
stall states across CPUs, but both aggregation and presentation become
challenging, so some/full provide the two extremes. The sum of all
cpu_delay adds one more incomplete signal on top. I don't know how useful
it'd be.

At Meta, we depend on PSI a lot when investigating resource problems and
we've never felt the need for the sum time, so that's one data point, with
the caveat that our focus is usually on mem and io pressures, where the
some and full pressure metrics usually seem to provide sufficient
information.

As the picture provided by the some and full metrics is incomplete, I can
imagine adding the sum being useful. That said, it'd help if Able can
provide more concrete examples of it being useful. Another thing to
consider is whether we should add this across all resources monitored by
PSI - cpu, mem and io.

Thanks.

-- 
tejun