On Mon, Jun 14, 2021 at 02:01:55PM +0200, Vincent Guittot wrote:
> On Sat, 12 Jun 2021 at 13:31, Odin Ugedal <odin@xxxxxxx> wrote:
> >
> > This fixes an issue where fairness is decreased since cfs_rq's can
> > end up not being decayed properly. For two sibling control groups with
> > the same priority, this can often lead to a load ratio of 99/1 (!!).
> >
> > This happen because when a cfs_rq is throttled, all the descendant cfs_rq's
>
> s/happen/happens/
>
> > will be removed from the leaf list. When they initial cfs_rq is
> > unthrottled, it will currently only re add descendant cfs_rq's if they
> > have one or more entities enqueued. This is not a perfect heuristic.
> >
> > Instead, we insert all cfs_rq's that contain one or more enqueued
> > entities, or it its load is not completely decayed.
> >
> > Can often lead to situations like this for equally weighted control
> > groups:
> >
> > $ ps u -C stress
> > USER         PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
> > root       10009 88.8  0.0   3676   100 pts/1    R+   11:04   0:13 stress --cpu 1
> > root       10023  3.0  0.0   3676   104 pts/1    R+   11:04   0:00 stress --cpu 1
> >
> > Fixes: 31bc6aeaab1d ("sched/fair: Optimize update_blocked_averages()")
> > Signed-off-by: Odin Ugedal <odin@xxxxxxx>
>
> minor typo in the commit message otherwise
>
> Reviewed-by: Vincent Guittot <vincent.guittot@xxxxxxxxxx>

Thanks!
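
For readers skimming the thread, the difference between the two re-add heuristics described in the commit message can be sketched as a toy model. This is not the patch and not kernel code: `struct toy_cfs_rq`, its fields, and the `toy_*` helpers are hypothetical names invented here for illustration, and the single `load_sum` field is an assumed stand-in for whatever load state the real code checks.

```c
#include <stdbool.h>

/* Toy stand-in for a cfs_rq; all names here are hypothetical. */
struct toy_cfs_rq {
	unsigned int nr_enqueued;  /* entities currently enqueued */
	unsigned long load_sum;    /* load that has not decayed to zero yet */
	bool on_leaf_list;         /* whether load decay will still visit it */
};

/* Heuristic before the fix, as described: re-add only if something is enqueued. */
static bool toy_readd_old(const struct toy_cfs_rq *rq)
{
	return rq->nr_enqueued > 0;
}

/* Heuristic after the fix, as described: also re-add if load is not fully decayed. */
static bool toy_readd_new(const struct toy_cfs_rq *rq)
{
	return rq->nr_enqueued > 0 || rq->load_sum != 0;
}

/* Model of unthrottling one descendant: decide whether it goes back on the list. */
static void toy_unthrottle(struct toy_cfs_rq *rq, bool use_new)
{
	bool readd = use_new ? toy_readd_new(rq) : toy_readd_old(rq);

	if (readd)
		rq->on_leaf_list = true;
}
```

In this model, an idle descendant with leftover load stays off the list under the old check, so its stale load would never be decayed; the new check puts it back.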