Hi again everyone, I'm not a scheduler hacker, I'm a scheduler hacker's mate. I'm only hacking the scheduler, 'cos trying to run 32-bit applications on systems where not all of the CPUs support it is GREAT. It's Friday 13th, and I'm back with version three of the increasingly popular patches I previously posted here: v1: https://lore.kernel.org/r/20201027215118.27003-1-will@xxxxxxxxxx v2: https://lore.kernel.org/r/20201109213023.15092-1-will@xxxxxxxxxx and which started life as a reimplementation of some patches from Qais: https://lore.kernel.org/r/20201021104611.2744565-1-qais.yousef@xxxxxxx The aim of this series is to allow 32-bit ARM applications to run on arm64 SoCs where not all of the CPUs support the 32-bit instruction set. There are some major changes in v3: * Add some scheduler hooks for restricting a task's affinity mask * Implement these hooks for arm64 so that we can avoid 32-bit tasks running on 64-bit-only cores * Restrict affinity mask of 32-bit tasks on execve() * Prevent hot-unplug of all 32-bit CPUs if we have a mismatched system * Ensure 32-bit EL0 cpumask is zero-initialised (oops) It's worth mentioning that this approach goes directly against my initial proposal for punting the affinity management to userspace, because it turns out that doesn't really work. There are cases where the kernel has to muck with the affinity mask explicitly, such as execve(), CPU hotplug and cpuset balancing. Ensuring that these don't lead to random SIGKILLs as far as userspace is concerned means avoiding any 64-bit-only CPUs appearing in the affinity mask for a 32-bit task, at which point it's easier just to handle everything in the kernel anyway. - Patches 1-6 hack the arm64 CPU feature code to allow 32-bit tasks to run on a mismatched system, but forcing SIGKILL if a task ends up on the wrong CPU. This is gated on a command-line option; without it, a mismatched system will be treated as 64-bit-only. - Patches 7-11 add scheduler functionality necessary to constrain the CPU affinity mask on a per-task basis and hook this up for execve() on arm64. - Patches 12-14 finish off the arm64 plumbing and remove the logic for killing misplaced tasks, as it adds overhead to the context-switch and ret-to-user paths. This seems to do the right thing in my contrived QEMU environment, but as I say, I'm not a scheduler hacker so I'm open to alternative ideas. Cheers, Will Cc: Catalin Marinas <catalin.marinas@xxxxxxx> Cc: Marc Zyngier <maz@xxxxxxxxxx> Cc: Greg Kroah-Hartman <gregkh@xxxxxxxxxxxxxxxxxxx> Cc: Peter Zijlstra <peterz@xxxxxxxxxxxxx> Cc: Morten Rasmussen <morten.rasmussen@xxxxxxx> Cc: Qais Yousef <qais.yousef@xxxxxxx> Cc: Suren Baghdasaryan <surenb@xxxxxxxxxx> Cc: Quentin Perret <qperret@xxxxxxxxxx> Cc: Tejun Heo <tj@xxxxxxxxxx> Cc: Li Zefan <lizefan@xxxxxxxxxx> Cc: Johannes Weiner <hannes@xxxxxxxxxxx> Cc: Ingo Molnar <mingo@xxxxxxxxxx> Cc: Juri Lelli <juri.lelli@xxxxxxxxxx> Cc: Vincent Guittot <vincent.guittot@xxxxxxxxxx> Cc: kernel-team@xxxxxxxxxxx --->8 Will Deacon (14): arm64: cpuinfo: Split AArch32 registers out into a separate struct arm64: Allow mismatched 32-bit EL0 support KVM: arm64: Kill 32-bit vCPUs on systems with mismatched EL0 support arm64: Kill 32-bit applications scheduled on 64-bit-only CPUs arm64: Advertise CPUs capable of running 32-bit applications in sysfs arm64: Hook up cmdline parameter to allow mismatched 32-bit EL0 sched: Introduce restrict_cpus_allowed_ptr() to limit task CPU affinity arm64: exec: Adjust affinity for compat tasks with mismatched 32-bit EL0 cpuset: Don't use the cpu_possible_mask as a last resort for cgroup v1 sched: Introduce arch_cpu_allowed_mask() to limit fallback rq selection sched: Reject CPU affinity changes based on arch_cpu_allowed_mask() arm64: Prevent offlining first CPU with 32-bit EL0 on mismatched system arm64: Implement arch_cpu_allowed_mask() arm64: Remove logic to kill 32-bit tasks on 64-bit-only cores .../ABI/testing/sysfs-devices-system-cpu | 9 + .../admin-guide/kernel-parameters.txt | 7 + arch/arm64/include/asm/cpu.h | 44 ++-- arch/arm64/include/asm/cpucaps.h | 2 +- arch/arm64/include/asm/cpufeature.h | 8 +- arch/arm64/include/asm/mmu_context.h | 12 + arch/arm64/kernel/cpufeature.c | 219 ++++++++++++++---- arch/arm64/kernel/cpuinfo.c | 53 +++-- arch/arm64/kernel/process.c | 17 +- arch/arm64/kvm/arm.c | 11 +- include/linux/sched.h | 1 + kernel/cgroup/cpuset.c | 6 +- kernel/sched/core.c | 90 +++++-- 13 files changed, 370 insertions(+), 109 deletions(-) -- 2.29.2.299.gdc1121823c-goog