Hello, kernel test robot noticed a 23.4% improvement of stress-ng.sigxfsz.ops_per_sec on: commit: ee2a5c3e36093d0ff5709bc8f21d3793cf55f746 ("[PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"") url: https://github.com/intel-lab-lkp/linux/commits/Michal-Koutn/Revert-pid-allow-pid_max-to-be-set-per-pid-namespace/20250222-010942 patch link: https://lore.kernel.org/all/20250221170249.890014-2-mkoutny@xxxxxxxx/ patch subject: [PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace" testcase: stress-ng config: x86_64-rhel-9.4 compiler: gcc-12 test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory parameters: nr_threads: 100% testtime: 60s test: sigxfsz cpufreq_governor: performance In addition to that, the commit also has significant impact on the following tests: +------------------+-------------------------------------------------------------------------------------------+ | testcase: change | stress-ng: stress-ng.mprotect.ops_per_sec 4.5% improvement | | test machine | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory | | test parameters | cpufreq_governor=performance | | | nr_threads=100% | | | test=mprotect | | | testtime=60s | +------------------+-------------------------------------------------------------------------------------------+ | testcase: change | stress-ng: stress-ng.sigrt.ops_per_sec 15.7% improvement | | test machine | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory | | test parameters | cpufreq_governor=performance | | | nr_threads=100% | | | test=sigrt | | | testtime=60s | +------------------+-------------------------------------------------------------------------------------------+ | testcase: change | stress-ng: stress-ng.sigbus.ops_per_sec 20.6% improvement | | test machine | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory | | test parameters | cpufreq_governor=performance | | | nr_threads=100% | | | test=sigbus | | | testtime=60s | +------------------+-------------------------------------------------------------------------------------------+ Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20250310/202503101532.348576bb-lkp@xxxxxxxxx ========================================================================================= compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigxfsz/stress-ng/60s commit: 3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply") ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"") 334426094588f817 ee2a5c3e36093d0ff5709bc8f21 ---------------- --------------------------- %stddev %change %stddev \ | \ 5.11 +1.3 6.43 mpstat.cpu.all.usr% 3737 ± 6% -38.8% 2286 ± 42% proc-vmstat.numa_hint_faults_local 1212920 ± 4% -10.4% 1086901 ± 5% sched_debug.cpu.avg_idle.max 35.50 ± 16% -30.0% 24.83 ± 20% perf-c2c.DRAM.local 1517 ± 4% -46.5% 812.17 ± 3% perf-c2c.DRAM.remote 1808 ± 2% +57.0% 2840 perf-c2c.HITM.local 1360 ± 5% -49.9% 680.83 ± 2% perf-c2c.HITM.remote 5.22 ± 3% +19.8% 6.26 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 53.33 ± 15% +25.0% 66.67 ± 15% perf-sched.wait_and_delay.count.__cond_resched.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe 953.83 ± 3% -16.5% 796.33 ± 7% perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 5.21 ± 3% +20.0% 6.25 ± 7% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 163515 +27.8% 208915 stress-ng.sigxfsz.SIGXFSZ_signals_per_sec 6.668e+08 +23.4% 8.23e+08 stress-ng.sigxfsz.ops 11113966 +23.4% 13716156 stress-ng.sigxfsz.ops_per_sec 3623 -1.4% 3573 stress-ng.time.system_time 163.26 +31.7% 214.98 stress-ng.time.user_time 0.25 -54.7% 0.12 ± 2% perf-stat.i.MPKI 1.125e+10 +22.1% 1.373e+10 perf-stat.i.branch-instructions 0.54 -0.0 0.50 perf-stat.i.branch-miss-rate% 59748239 +10.9% 66264440 perf-stat.i.branch-misses 33.30 -17.9 15.38 ± 2% perf-stat.i.cache-miss-rate% 13040640 -45.8% 7066419 ± 2% perf-stat.i.cache-misses 39047103 +15.5% 45098530 perf-stat.i.cache-references 4.39 -18.2% 3.59 perf-stat.i.cpi 17823 +97.0% 35113 perf-stat.i.cycles-between-cache-misses 5.144e+10 +22.0% 6.275e+10 perf-stat.i.instructions 0.23 +21.3% 0.28 perf-stat.i.ipc 0.25 -55.6% 0.11 ± 2% perf-stat.overall.MPKI 0.53 -0.0 0.48 perf-stat.overall.branch-miss-rate% 33.40 -17.7 15.67 ± 2% perf-stat.overall.cache-miss-rate% 4.40 -18.0% 3.60 perf-stat.overall.cpi 17350 +84.6% 32027 ± 2% perf-stat.overall.cycles-between-cache-misses 0.23 +22.0% 0.28 perf-stat.overall.ipc 1.106e+10 +22.1% 1.35e+10 perf-stat.ps.branch-instructions 58763534 +10.9% 65180843 perf-stat.ps.branch-misses 12827760 -45.8% 6951883 ± 2% perf-stat.ps.cache-misses 38411225 +15.5% 44365626 perf-stat.ps.cache-references 5.06e+10 +22.0% 6.172e+10 perf-stat.ps.instructions 3.106e+12 +21.9% 3.787e+12 perf-stat.total.instructions *************************************************************************************************** lkp-icl-2sp7: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory ========================================================================================= compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/mprotect/stress-ng/60s commit: 3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply") ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"") 334426094588f817 ee2a5c3e36093d0ff5709bc8f21 ---------------- --------------------------- %stddev %change %stddev \ | \ 10205 ± 25% +33.5% 13621 ± 16% numa-meminfo.node0.KernelStack 0.02 ± 37% -37.8% 0.01 ± 13% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 0.82 ± 32% -37.7% 0.51 ± 7% perf-sched.sch_delay.max.ms.schedule_timeout.kcompactd.kthread.ret_from_fork 807.17 ± 5% -8.5% 738.67 ± 5% perf-sched.wait_and_delay.count.__cond_resched.down_write.vma_prepare.__split_vma.vma_modify 433709 +4.9% 454923 ± 5% proc-vmstat.nr_active_anon 61940 ± 3% +31.3% 81315 ± 35% proc-vmstat.nr_shmem 433709 +4.9% 454923 ± 5% proc-vmstat.nr_zone_active_anon 4.903e+08 +4.5% 5.124e+08 stress-ng.mprotect.ops 8163833 +4.5% 8533021 stress-ng.mprotect.ops_per_sec 239.55 +4.7% 250.91 stress-ng.time.user_time 3960356 ± 7% -16.0% 3325457 numa-numastat.node0.local_node 3990670 ± 7% -16.1% 3348370 numa-numastat.node0.numa_hit 2608139 ± 6% +34.5% 3507199 ± 4% numa-numastat.node1.local_node 2644058 ± 6% +34.3% 3550893 ± 4% numa-numastat.node1.numa_hit 3986137 ± 7% -16.0% 3349506 numa-vmstat.node0.numa_hit 3955823 ± 7% -15.9% 3326594 numa-vmstat.node0.numa_local 2639425 ± 6% +34.6% 3552253 ± 4% numa-vmstat.node1.numa_hit 2603506 ± 6% +34.8% 3508559 ± 4% numa-vmstat.node1.numa_local 1.11 ± 20% -38.9% 0.68 ± 31% sched_debug.cfs_rq:/.h_nr_queued.stddev 1.11 ± 19% -38.6% 0.68 ± 31% sched_debug.cfs_rq:/.h_nr_runnable.stddev 5890 ± 6% -10.7% 5262 sched_debug.cfs_rq:/.runnable_avg.max 1064 ± 20% -41.1% 626.67 ± 33% sched_debug.cfs_rq:/.runnable_avg.stddev 1151 -12.2% 1010 sched_debug.cpu.clock_task.stddev 1.11 ± 20% -39.1% 0.68 ± 32% sched_debug.cpu.nr_running.stddev 1.861e+10 +4.5% 1.945e+10 perf-stat.i.branch-instructions 1.264e+08 +4.1% 1.316e+08 perf-stat.i.branch-misses 1.45e+08 +5.3% 1.526e+08 perf-stat.i.cache-references 2.28 -4.3% 2.18 perf-stat.i.cpi 8.533e+10 +4.5% 8.92e+10 perf-stat.i.instructions 0.44 +4.5% 0.46 perf-stat.i.ipc 63.03 +4.5% 65.90 perf-stat.i.metric.K/sec 4035009 +4.5% 4218051 perf-stat.i.page-faults 2.29 -4.4% 2.19 perf-stat.overall.cpi 0.44 +4.6% 0.46 perf-stat.overall.ipc 1.829e+10 +4.5% 1.912e+10 perf-stat.ps.branch-instructions 1.242e+08 +4.1% 1.293e+08 perf-stat.ps.branch-misses 1.424e+08 +5.3% 1.499e+08 perf-stat.ps.cache-references 8.385e+10 +4.6% 8.767e+10 perf-stat.ps.instructions 3966080 +4.6% 4146673 perf-stat.ps.page-faults 5.154e+12 +4.6% 5.389e+12 perf-stat.total.instructions 36.24 -1.9 34.36 ± 2% perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_mprotect_mem 38.30 -1.7 36.58 ± 2% perf-profile.calltrace.cycles-pp.stress_mprotect_mem 14.45 ± 2% -1.7 12.80 ± 2% perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem 17.12 -1.5 15.58 ± 2% perf-profile.calltrace.cycles-pp.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem 17.06 -1.5 15.54 ± 2% perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem 12.44 ± 2% -1.5 10.92 ± 2% perf-profile.calltrace.cycles-pp.do_dec_rlimit_put_ucounts.__sigqueue_free.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode 12.46 ± 2% -1.5 10.94 ± 2% perf-profile.calltrace.cycles-pp.__sigqueue_free.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault 0.54 ± 2% -0.1 0.43 ± 44% perf-profile.calltrace.cycles-pp.up_read.__bad_area.bad_area_access_error.exc_page_fault.asm_exc_page_fault 0.84 -0.1 0.75 ± 4% perf-profile.calltrace.cycles-pp.down_write.__split_vma.vma_modify.vma_modify_flags.mprotect_fixup 1.60 -0.1 1.51 ± 2% perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_sig_handler 1.59 -0.1 1.51 ± 2% perf-profile.calltrace.cycles-pp.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_sig_handler 0.82 ± 3% -0.1 0.74 ± 2% perf-profile.calltrace.cycles-pp.sigprocmask.__x64_sys_rt_sigprocmask.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask 1.44 -0.1 1.37 ± 2% perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_sig_handler 1.03 ± 2% -0.1 0.98 perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigprocmask.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask 1.29 ± 2% -0.1 1.23 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask 0.68 ± 3% -0.0 0.64 ± 2% perf-profile.calltrace.cycles-pp.up_write.vma_complete.__split_vma.vma_modify.vma_modify_flags 0.58 ± 2% -0.0 0.54 ± 3% perf-profile.calltrace.cycles-pp.__bad_area.bad_area_access_error.exc_page_fault.asm_exc_page_fault.stress_mprotect_mem 0.58 ± 2% -0.0 0.56 perf-profile.calltrace.cycles-pp.fpu__clear_user_states.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault 0.62 ± 3% +0.1 0.67 ± 2% perf-profile.calltrace.cycles-pp.mas_prev_slot.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.01 +0.1 1.07 perf-profile.calltrace.cycles-pp.copy_fpstate_to_sigframe.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart 1.23 +0.1 1.30 ± 2% perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.stress_mprotect_mem 0.84 ± 3% +0.1 0.91 ± 2% perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.commit_merge.vma_merge_existing_range.vma_modify 0.84 ± 2% +0.1 0.91 perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.vma_modify.vma_modify_flags.mprotect_fixup 1.75 ± 2% +0.1 1.83 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mprotect 0.59 ± 2% +0.1 0.67 ± 2% perf-profile.calltrace.cycles-pp.simple_dname.perf_event_mmap_event.perf_event_mmap.mprotect_fixup.do_mprotect_pkey 2.41 ± 2% +0.1 2.50 perf-profile.calltrace.cycles-pp.clear_bhb_loop.__mprotect 1.77 +0.1 1.88 perf-profile.calltrace.cycles-pp.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode 2.02 +0.1 2.14 perf-profile.calltrace.cycles-pp.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault 0.98 ± 18% +0.1 1.10 perf-profile.calltrace.cycles-pp.change_protection_range.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64 2.57 +0.1 2.70 perf-profile.calltrace.cycles-pp.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem 3.13 ± 3% +0.2 3.34 ± 2% perf-profile.calltrace.cycles-pp.asm_exc_page_fault.__mprotect 0.00 +0.6 0.55 ± 2% perf-profile.calltrace.cycles-pp.prepend_copy.simple_dname.perf_event_mmap_event.perf_event_mmap.mprotect_fixup 34.00 +1.1 35.12 ± 2% perf-profile.calltrace.cycles-pp.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe 46.05 +1.1 47.19 perf-profile.calltrace.cycles-pp.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect 46.28 +1.2 47.43 perf-profile.calltrace.cycles-pp.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect 48.43 +1.2 49.61 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect 48.86 +1.2 50.06 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mprotect 55.84 +1.6 57.41 perf-profile.calltrace.cycles-pp.__mprotect 39.48 -1.9 37.62 ± 2% perf-profile.children.cycles-pp.asm_exc_page_fault 14.48 ± 2% -1.6 12.83 ± 2% perf-profile.children.cycles-pp.get_signal 18.72 -1.6 17.11 perf-profile.children.cycles-pp.irqentry_exit_to_user_mode 39.92 -1.6 38.32 ± 2% perf-profile.children.cycles-pp.stress_mprotect_mem 18.52 -1.6 16.92 perf-profile.children.cycles-pp.arch_do_signal_or_restart 12.47 ± 2% -1.5 10.94 ± 2% perf-profile.children.cycles-pp.__sigqueue_free 12.44 ± 2% -1.5 10.92 ± 2% perf-profile.children.cycles-pp.do_dec_rlimit_put_ucounts 5.00 -0.2 4.83 ± 2% perf-profile.children.cycles-pp.up_write 0.47 ± 10% -0.1 0.34 ± 7% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt 0.47 ± 10% -0.1 0.34 ± 7% perf-profile.children.cycles-pp.hrtimer_interrupt 1.16 ± 3% -0.1 1.05 perf-profile.children.cycles-pp.recalc_sigpending 0.35 ± 7% -0.1 0.24 ± 6% perf-profile.children.cycles-pp.__hrtimer_run_queues 0.89 ± 6% -0.1 0.79 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irq 0.34 ± 8% -0.1 0.24 ± 6% perf-profile.children.cycles-pp.tick_nohz_handler 0.86 ± 2% -0.1 0.78 perf-profile.children.cycles-pp.sigprocmask 0.28 ± 10% -0.1 0.21 ± 6% perf-profile.children.cycles-pp.update_process_times 1.05 ± 2% -0.1 0.98 perf-profile.children.cycles-pp.__x64_sys_rt_sigprocmask 0.30 ± 3% -0.0 0.26 ± 3% perf-profile.children.cycles-pp.fpregs_mark_activate 0.17 ± 10% -0.0 0.13 ± 6% perf-profile.children.cycles-pp.sched_tick 0.47 ± 3% -0.0 0.43 ± 3% perf-profile.children.cycles-pp.complete_signal 0.54 ± 2% -0.0 0.51 ± 2% perf-profile.children.cycles-pp.up_read 0.58 ± 2% -0.0 0.55 ± 2% perf-profile.children.cycles-pp.__bad_area 0.61 -0.0 0.58 perf-profile.children.cycles-pp.fpu__clear_user_states 0.12 ± 5% +0.0 0.14 ± 4% perf-profile.children.cycles-pp.__get_user_nocheck_4 0.13 ± 3% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.ima_file_mprotect 0.22 ± 5% +0.0 0.24 ± 2% perf-profile.children.cycles-pp.security_file_mprotect 0.25 ± 3% +0.0 0.28 ± 4% perf-profile.children.cycles-pp.stress_mwc16 0.18 ± 5% +0.0 0.20 ± 6% perf-profile.children.cycles-pp.stress_mwc16modn 0.34 ± 3% +0.0 0.37 ± 3% perf-profile.children.cycles-pp.mas_ascend 0.12 ± 4% +0.0 0.15 ± 5% perf-profile.children.cycles-pp.copy_from_kernel_nofault_allowed 0.30 ± 8% +0.0 0.33 ± 2% perf-profile.children.cycles-pp.rcu_all_qs 0.26 ± 4% +0.0 0.29 ± 6% perf-profile.children.cycles-pp.mas_pop_node 0.44 ± 2% +0.0 0.47 perf-profile.children.cycles-pp.vma_set_page_prot 0.49 ± 3% +0.0 0.53 ± 3% perf-profile.children.cycles-pp.save_xstate_epilog 0.66 ± 2% +0.0 0.71 ± 2% perf-profile.children.cycles-pp.native_irq_return_iret 0.02 ± 99% +0.1 0.08 ± 11% perf-profile.children.cycles-pp.anon_vma_clone 1.27 +0.1 1.33 perf-profile.children.cycles-pp.do_user_addr_fault 0.84 +0.1 0.90 perf-profile.children.cycles-pp.mas_prev_slot 1.04 +0.1 1.11 perf-profile.children.cycles-pp.copy_fpstate_to_sigframe 0.73 ± 7% +0.1 0.79 ± 2% perf-profile.children.cycles-pp.__cond_resched 0.46 ± 3% +0.1 0.53 ± 2% perf-profile.children.cycles-pp.copy_from_kernel_nofault 1.30 ± 2% +0.1 1.37 perf-profile.children.cycles-pp.entry_SYSCALL_64 0.50 ± 2% +0.1 0.58 ± 2% perf-profile.children.cycles-pp.prepend_copy 1.68 +0.1 1.75 perf-profile.children.cycles-pp.mas_preallocate 0.61 ± 3% +0.1 0.70 ± 3% perf-profile.children.cycles-pp.simple_dname 2.77 ± 2% +0.1 2.87 perf-profile.children.cycles-pp.clear_bhb_loop 3.27 +0.1 3.37 perf-profile.children.cycles-pp.handle_signal 1.78 +0.1 1.89 perf-profile.children.cycles-pp.get_sigframe 2.05 +0.1 2.16 perf-profile.children.cycles-pp.x64_setup_rt_frame 0.99 ± 18% +0.1 1.11 perf-profile.children.cycles-pp.change_protection_range 7.00 +0.2 7.24 ± 2% perf-profile.children.cycles-pp.vma_prepare 34.09 +1.1 35.22 ± 2% perf-profile.children.cycles-pp.mprotect_fixup 50.17 +1.1 51.31 perf-profile.children.cycles-pp.do_syscall_64 46.24 +1.2 47.39 perf-profile.children.cycles-pp.do_mprotect_pkey 46.33 +1.2 47.49 perf-profile.children.cycles-pp.__x64_sys_mprotect 50.61 +1.2 51.78 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe 55.94 +1.6 57.52 perf-profile.children.cycles-pp.__mprotect 12.44 ± 2% -1.5 10.91 ± 2% perf-profile.self.cycles-pp.do_dec_rlimit_put_ucounts 4.36 -0.1 4.22 ± 2% perf-profile.self.cycles-pp.up_write 1.14 ± 3% -0.1 1.03 perf-profile.self.cycles-pp.recalc_sigpending 0.87 ± 6% -0.1 0.78 ± 5% perf-profile.self.cycles-pp._raw_spin_lock_irq 2.83 -0.1 2.75 perf-profile.self.cycles-pp.down_write 0.28 ± 5% -0.0 0.23 ± 5% perf-profile.self.cycles-pp.fpregs_mark_activate 0.19 ± 10% -0.0 0.14 ± 12% perf-profile.self.cycles-pp.__perf_event_header__init_id 0.40 ± 3% -0.0 0.36 ± 5% perf-profile.self.cycles-pp.complete_signal 0.52 ± 2% -0.0 0.48 ± 2% perf-profile.self.cycles-pp.up_read 0.15 ± 2% -0.0 0.14 ± 3% perf-profile.self.cycles-pp.__send_signal_locked 0.10 ± 4% -0.0 0.09 ± 4% perf-profile.self.cycles-pp.__bad_area_nosemaphore 0.30 ± 3% +0.0 0.33 ± 4% perf-profile.self.cycles-pp.mas_ascend 0.10 ± 5% +0.0 0.12 ± 5% perf-profile.self.cycles-pp.do_user_addr_fault 0.10 ± 4% +0.0 0.12 ± 3% perf-profile.self.cycles-pp.copy_from_kernel_nofault_allowed 0.21 ± 6% +0.0 0.24 ± 4% perf-profile.self.cycles-pp.rwsem_down_write_slowpath 0.40 +0.0 0.43 ± 2% perf-profile.self.cycles-pp.change_protection_range 0.44 +0.0 0.47 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe 0.24 ± 3% +0.0 0.27 ± 6% perf-profile.self.cycles-pp.mas_pop_node 0.34 ± 2% +0.0 0.38 ± 3% perf-profile.self.cycles-pp.mas_preallocate 0.37 ± 8% +0.0 0.41 ± 3% perf-profile.self.cycles-pp.__cond_resched 0.72 +0.0 0.76 ± 2% perf-profile.self.cycles-pp.copy_fpstate_to_sigframe 0.41 +0.0 0.45 ± 3% perf-profile.self.cycles-pp.mas_prev_slot 0.66 ± 2% +0.0 0.71 ± 2% perf-profile.self.cycles-pp.native_irq_return_iret 0.30 ± 4% +0.0 0.35 ± 2% perf-profile.self.cycles-pp.copy_from_kernel_nofault 0.02 ±141% +0.1 0.08 ± 11% perf-profile.self.cycles-pp.anon_vma_clone 1.21 ± 2% +0.1 1.30 ± 2% perf-profile.self.cycles-pp.__mprotect 2.73 ± 2% +0.1 2.83 perf-profile.self.cycles-pp.clear_bhb_loop 2.76 +0.1 2.88 perf-profile.self.cycles-pp.do_mprotect_pkey 3.48 ± 3% +0.3 3.74 ± 2% perf-profile.self.cycles-pp.stress_mprotect_mem *************************************************************************************************** lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory ========================================================================================= compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigrt/stress-ng/60s commit: 3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply") ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"") 334426094588f817 ee2a5c3e36093d0ff5709bc8f21 ---------------- --------------------------- %stddev %change %stddev \ | \ 1345 ± 9% -15.8% 1132 ± 5% perf-c2c.HITM.remote 5328778 +18.0% 6289475 vmstat.system.cs 197362 +2.0% 201296 vmstat.system.in 45.97 ±118% -85.4% 6.71 ± 55% perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio 582.79 ± 39% -39.2% 354.28 ± 31% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait 1260 ± 46% -43.7% 709.74 ± 31% perf-sched.wait_and_delay.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait 45.97 ±118% -85.4% 6.71 ± 55% perf-sched.wait_time.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio 705.59 ± 50% -48.9% 360.90 ± 32% perf-sched.wait_time.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait 83250 -16.0% 69935 stress-ng.sigrt.nanosecs_between_sigqueue_and_sigwaitinfo_completion 3.362e+08 +15.7% 3.89e+08 stress-ng.sigrt.ops 5601334 +15.7% 6480915 stress-ng.sigrt.ops_per_sec 65582158 +17.7% 77176472 stress-ng.time.involuntary_context_switches 3423 -1.4% 3375 stress-ng.time.system_time 335.13 ± 2% +14.5% 383.80 ± 2% stress-ng.time.user_time 2.714e+08 +17.4% 3.185e+08 stress-ng.time.voluntary_context_switches 4202907 ± 15% -24.2% 3184715 ± 12% sched_debug.cfs_rq:/.avg_vruntime.max 82.07 ± 12% +391.9% 403.68 ± 94% sched_debug.cfs_rq:/.load_avg.avg 169.48 ± 8% +1182.4% 2173 ±115% sched_debug.cfs_rq:/.load_avg.stddev 4202907 ± 15% -24.2% 3184715 ± 12% sched_debug.cfs_rq:/.min_vruntime.max 1239 ± 8% +14.2% 1415 ± 12% sched_debug.cfs_rq:/.util_avg.max 2593172 +17.4% 3044316 sched_debug.cpu.nr_switches.avg 1526897 ± 3% +66.4% 2540867 ± 2% sched_debug.cpu.nr_switches.min 606805 -67.2% 198918 ± 9% sched_debug.cpu.nr_switches.stddev 1.902e+10 +14.8% 2.184e+10 perf-stat.i.branch-instructions 1.42e+08 ± 3% +16.2% 1.65e+08 perf-stat.i.branch-misses 6.65 ± 4% -0.9 5.77 ± 7% perf-stat.i.cache-miss-rate% 3.931e+08 ± 9% +17.1% 4.605e+08 ± 6% perf-stat.i.cache-references 5534190 +17.4% 6498045 perf-stat.i.context-switches 2.71 -14.3% 2.33 perf-stat.i.cpi 8.694e+10 +14.8% 9.976e+10 perf-stat.i.instructions 0.39 +14.2% 0.45 perf-stat.i.ipc 86.53 +17.4% 101.60 perf-stat.i.metric.K/sec 6.82 ± 5% -0.9 5.91 ± 9% perf-stat.overall.cache-miss-rate% 2.59 -12.9% 2.26 perf-stat.overall.cpi 0.39 +14.7% 0.44 perf-stat.overall.ipc 1.871e+10 +14.8% 2.149e+10 perf-stat.ps.branch-instructions 1.396e+08 ± 3% +16.2% 1.622e+08 perf-stat.ps.branch-misses 3.868e+08 ± 9% +17.1% 4.53e+08 ± 6% perf-stat.ps.cache-references 5443676 +17.4% 6391319 perf-stat.ps.context-switches 8.552e+10 +14.8% 9.813e+10 perf-stat.ps.instructions 5.251e+12 +14.3% 6e+12 perf-stat.total.instructions *************************************************************************************************** lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory ========================================================================================= compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigbus/stress-ng/60s commit: 3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply") ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"") 334426094588f817 ee2a5c3e36093d0ff5709bc8f21 ---------------- --------------------------- %stddev %change %stddev \ | \ 7.64 +1.7 9.30 mpstat.cpu.all.usr% 36.50 ± 16% -42.9% 20.83 ± 31% perf-c2c.DRAM.local 2312 ± 6% -68.7% 723.17 ± 4% perf-c2c.DRAM.remote 3690 ± 3% +44.9% 5347 ± 6% perf-c2c.HITM.local 2155 ± 6% -71.8% 608.17 ± 4% perf-c2c.HITM.remote 4477 ± 69% -70.3% 1328 ± 35% proc-vmstat.numa_hint_faults 2459 ± 11% -64.8% 866.33 ± 47% proc-vmstat.numa_hint_faults_local 140611 ± 21% -33.6% 93302 ± 45% proc-vmstat.numa_pte_updates 7.197e+08 +20.7% 8.685e+08 proc-vmstat.pgfault 7.201e+08 +20.6% 8.682e+08 stress-ng.sigbus.ops 12001759 +20.6% 14469786 stress-ng.sigbus.ops_per_sec 3526 -1.8% 3461 stress-ng.time.system_time 261.31 +25.4% 327.64 stress-ng.time.user_time 0.03 ± 55% -64.6% 0.01 ± 17% perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.86 ±150% -90.1% 0.09 ±201% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait 0.02 ± 50% -58.7% 0.01 ± 14% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 1.08 ± 18% -34.1% 0.71 ± 14% perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown] 0.31 ± 72% -65.9% 0.11 ± 71% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 0.02 ± 10% -23.4% 0.01 ± 15% perf-sched.sch_delay.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm 1.91 ±218% -99.2% 0.02 ± 11% perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 4.00 ± 49% -71.6% 1.14 ± 56% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 261.25 ± 37% +199.1% 781.43 ± 15% perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 81.02 ± 59% +274.1% 303.13 ± 50% perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait 6.60 ± 2% +16.9% 7.71 ± 3% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 108.83 ± 63% -81.2% 20.50 ±113% perf-sched.wait_and_delay.count.devkmsg_read.vfs_read.ksys_read.do_syscall_64 3107 ± 3% -12.6% 2714 ± 5% perf-sched.wait_and_delay.count.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown] 124.17 ± 63% -70.1% 37.17 ± 60% perf-sched.wait_and_delay.count.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait 751.00 ± 2% -17.0% 623.50 ± 2% perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 1550 ± 31% +119.7% 3406 ± 19% perf-sched.wait_and_delay.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 261.24 ± 37% +199.1% 781.42 ± 15% perf-sched.wait_time.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 80.16 ± 60% +278.0% 303.05 ± 50% perf-sched.wait_time.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait 6.59 ± 2% +17.0% 7.71 ± 3% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 1550 ± 31% +119.7% 3406 ± 19% perf-sched.wait_time.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 0.18 -49.0% 0.09 ± 3% perf-stat.i.MPKI 1.59e+10 +19.7% 1.903e+10 perf-stat.i.branch-instructions 0.28 -0.0 0.25 perf-stat.i.branch-miss-rate% 40989724 +5.3% 43173098 ± 2% perf-stat.i.branch-misses 32.63 -15.8 16.81 ± 2% perf-stat.i.cache-miss-rate% 12733301 ± 2% -40.3% 7597041 ± 3% perf-stat.i.cache-misses 38933806 +14.5% 44591128 perf-stat.i.cache-references 3.17 -16.4% 2.65 perf-stat.i.cpi 18224 +75.2% 31921 perf-stat.i.cycles-between-cache-misses 7.098e+10 +19.6% 8.489e+10 perf-stat.i.instructions 0.32 +19.0% 0.38 perf-stat.i.ipc 184.67 +20.6% 222.65 perf-stat.i.metric.K/sec 11819123 +20.6% 14249011 perf-stat.i.page-faults 0.18 -50.1% 0.09 ± 3% perf-stat.overall.MPKI 0.26 -0.0 0.23 perf-stat.overall.branch-miss-rate% 32.70 -15.7 17.04 ± 3% perf-stat.overall.cache-miss-rate% 3.19 -16.4% 2.66 perf-stat.overall.cpi 17772 ± 2% +67.6% 29795 ± 2% perf-stat.overall.cycles-between-cache-misses 0.31 +19.6% 0.38 perf-stat.overall.ipc 1.564e+10 +19.7% 1.871e+10 perf-stat.ps.branch-instructions 40314687 +5.4% 42478375 ± 2% perf-stat.ps.branch-misses 12525837 ± 2% -40.3% 7473864 ± 3% perf-stat.ps.cache-misses 38300912 +14.5% 43866104 perf-stat.ps.cache-references 6.982e+10 +19.6% 8.35e+10 perf-stat.ps.instructions 11626044 +20.6% 14016280 perf-stat.ps.page-faults 4.284e+12 +19.5% 5.117e+12 perf-stat.total.instructions Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki