[PATH 6.1.y 0/5] Backport "sched cpuset: Bring back cpuset_mutex"

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

When using KVM on systems that require iTLB multihit mitigation enabled[1],
we're observing very high latency (70ms+) in KVM_CREATE_VM ioctl() in 6.1
kernel in comparison to older stable kernels such as 5.10. This is true even
when using favordynmods mount option.

We debugged this down to the cpuset controller trying to acquire cpuset_rwsem
in cpuset_can_attach(). This happens because KVM creates a worker thread which
calls cgroup_attach_task_all() during KVM_CREATE_VM. I don't know if
favordynmods is supposed to cover this case or not, but removing cpuset_rwsem
certainly solves the issue.

For the backport I tried to pick as many dependent commits as required to avoid
conflicts. I would highly appreciate review from cgroup people.

Tests performed:
 * Measured latency in KVM_CREATE_VM ioctl(), it goes down to less than 1ms
 * Ran the cgroup kselftest tests, got same results with or without this series
    * However, some tests such as test_memcontrol and test_kmem are failing
      in 6.1. This probably needs to be looked at
    * To make test_cpuset_prs.sh work, I had to increase the timeout on line
      592 to 1 second. With this change, the test runs and passes
 * I run our downstream test suite against our downstream 6.1 kernel with this
   series applied, it passed

 [1] For the case where the CPU is not vulnerable to iTLB multihit we can
     simply disable the iTLB multihit mitigation in KVM which avoids this
     whole situation. Disabling the mitigation is possible since upstream
     commit 0b210faf337 which I plan to backport soon

Daniel Vacek (1):
  cgroup/cpuset: no need to explicitly init a global static variable

Juri Lelli (1):
  sched/cpuset: Bring back cpuset_mutex

Waiman Long (3):
  cgroup/cpuset: Optimize cpuset_attach() on v2
  cgroup/cpuset: Skip task update if hotplug doesn't affect current
    cpuset
  cgroup/cpuset: Include offline CPUs when tasks' cpumasks in top_cpuset
    are updated

 include/linux/cpuset.h |   8 +-
 kernel/cgroup/cpuset.c | 211 +++++++++++++++++++++++------------------
 kernel/sched/core.c    |  22 +++--
 3 files changed, 139 insertions(+), 102 deletions(-)

-- 
2.40.1




[Index of Archives]     [Linux Kernel]     [Kernel Development Newbies]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite Hiking]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux