Re: [PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"

Hello,

kernel test robot noticed a 23.4% improvement of stress-ng.sigxfsz.ops_per_sec on:


commit: ee2a5c3e36093d0ff5709bc8f21d3793cf55f746 ("[PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"")
url: https://github.com/intel-lab-lkp/linux/commits/Michal-Koutn/Revert-pid-allow-pid_max-to-be-set-per-pid-namespace/20250222-010942
patch link: https://lore.kernel.org/all/20250221170249.890014-2-mkoutny@xxxxxxxx/
patch subject: [PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"

testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: sigxfsz
	cpufreq_governor: performance


In addition, the commit has a significant impact on the following tests:

+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.mprotect.ops_per_sec 4.5% improvement                                |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=mprotect                                                                             |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.sigrt.ops_per_sec 15.7% improvement                                  |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=sigrt                                                                                |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.sigbus.ops_per_sec 20.6% improvement                                 |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=sigbus                                                                               |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+




Details are below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250310/202503101532.348576bb-lkp@xxxxxxxxx

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigxfsz/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      5.11            +1.3        6.43        mpstat.cpu.all.usr%
      3737 ±  6%     -38.8%       2286 ± 42%  proc-vmstat.numa_hint_faults_local
   1212920 ±  4%     -10.4%    1086901 ±  5%  sched_debug.cpu.avg_idle.max
     35.50 ± 16%     -30.0%      24.83 ± 20%  perf-c2c.DRAM.local
      1517 ±  4%     -46.5%     812.17 ±  3%  perf-c2c.DRAM.remote
      1808 ±  2%     +57.0%       2840        perf-c2c.HITM.local
      1360 ±  5%     -49.9%     680.83 ±  2%  perf-c2c.HITM.remote
      5.22 ±  3%     +19.8%       6.26 ±  7%  perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
     53.33 ± 15%     +25.0%      66.67 ± 15%  perf-sched.wait_and_delay.count.__cond_resched.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
    953.83 ±  3%     -16.5%     796.33 ±  7%  perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      5.21 ±  3%     +20.0%       6.25 ±  7%  perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
    163515           +27.8%     208915        stress-ng.sigxfsz.SIGXFSZ_signals_per_sec
 6.668e+08           +23.4%   8.23e+08        stress-ng.sigxfsz.ops
  11113966           +23.4%   13716156        stress-ng.sigxfsz.ops_per_sec
      3623            -1.4%       3573        stress-ng.time.system_time
    163.26           +31.7%     214.98        stress-ng.time.user_time
      0.25           -54.7%       0.12 ±  2%  perf-stat.i.MPKI
 1.125e+10           +22.1%  1.373e+10        perf-stat.i.branch-instructions
      0.54            -0.0        0.50        perf-stat.i.branch-miss-rate%
  59748239           +10.9%   66264440        perf-stat.i.branch-misses
     33.30           -17.9       15.38 ±  2%  perf-stat.i.cache-miss-rate%
  13040640           -45.8%    7066419 ±  2%  perf-stat.i.cache-misses
  39047103           +15.5%   45098530        perf-stat.i.cache-references
      4.39           -18.2%       3.59        perf-stat.i.cpi
     17823           +97.0%      35113        perf-stat.i.cycles-between-cache-misses
 5.144e+10           +22.0%  6.275e+10        perf-stat.i.instructions
      0.23           +21.3%       0.28        perf-stat.i.ipc
      0.25           -55.6%       0.11 ±  2%  perf-stat.overall.MPKI
      0.53            -0.0        0.48        perf-stat.overall.branch-miss-rate%
     33.40           -17.7       15.67 ±  2%  perf-stat.overall.cache-miss-rate%
      4.40           -18.0%       3.60        perf-stat.overall.cpi
     17350           +84.6%      32027 ±  2%  perf-stat.overall.cycles-between-cache-misses
      0.23           +22.0%       0.28        perf-stat.overall.ipc
 1.106e+10           +22.1%   1.35e+10        perf-stat.ps.branch-instructions
  58763534           +10.9%   65180843        perf-stat.ps.branch-misses
  12827760           -45.8%    6951883 ±  2%  perf-stat.ps.cache-misses
  38411225           +15.5%   44365626        perf-stat.ps.cache-references
  5.06e+10           +22.0%  6.172e+10        perf-stat.ps.instructions
 3.106e+12           +21.9%  3.787e+12        perf-stat.total.instructions
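
A note on reading the %change column above: entries ending in "%" are relative changes between the two commits, while bare entries (used for metrics that are already percentages, such as mpstat.cpu.all.usr% or cache-miss-rate%) appear to be absolute differences in percentage points. A minimal Python sketch that reproduces a few rows from the numbers in the table; the function names are illustrative and are not part of the LKP tooling:

```python
def percent_change(base: float, patched: float) -> float:
    """Relative change in percent (rows whose %change entry ends with '%')."""
    return (patched - base) / base * 100.0


def point_delta(base: float, patched: float) -> float:
    """Absolute difference (rows whose metric is itself a percentage)."""
    return patched - base


# Values copied from the sigxfsz table above: (base commit, patched commit).
print(f"ops_per_sec:      {percent_change(11113966, 13716156):+.1f}%")  # +23.4%
print(f"signals_per_sec:  {percent_change(163515, 208915):+.1f}%")      # +27.8%
print(f"cpu.all.usr%:     {point_delta(5.11, 6.43):+.1f}")              # +1.3
print(f"cache-miss-rate%: {point_delta(33.40, 15.67):+.1f}")            # -17.7
```

The last line uses the perf-stat.overall.cache-miss-rate% row; the other three use the rows named in the labels.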


***************************************************************************************************
lkp-icl-2sp7: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/mprotect/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     10205 ± 25%     +33.5%      13621 ± 16%  numa-meminfo.node0.KernelStack
      0.02 ± 37%     -37.8%       0.01 ± 13%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.82 ± 32%     -37.7%       0.51 ±  7%  perf-sched.sch_delay.max.ms.schedule_timeout.kcompactd.kthread.ret_from_fork
    807.17 ±  5%      -8.5%     738.67 ±  5%  perf-sched.wait_and_delay.count.__cond_resched.down_write.vma_prepare.__split_vma.vma_modify
    433709            +4.9%     454923 ±  5%  proc-vmstat.nr_active_anon
     61940 ±  3%     +31.3%      81315 ± 35%  proc-vmstat.nr_shmem
    433709            +4.9%     454923 ±  5%  proc-vmstat.nr_zone_active_anon
 4.903e+08            +4.5%  5.124e+08        stress-ng.mprotect.ops
   8163833            +4.5%    8533021        stress-ng.mprotect.ops_per_sec
    239.55            +4.7%     250.91        stress-ng.time.user_time
   3960356 ±  7%     -16.0%    3325457        numa-numastat.node0.local_node
   3990670 ±  7%     -16.1%    3348370        numa-numastat.node0.numa_hit
   2608139 ±  6%     +34.5%    3507199 ±  4%  numa-numastat.node1.local_node
   2644058 ±  6%     +34.3%    3550893 ±  4%  numa-numastat.node1.numa_hit
   3986137 ±  7%     -16.0%    3349506        numa-vmstat.node0.numa_hit
   3955823 ±  7%     -15.9%    3326594        numa-vmstat.node0.numa_local
   2639425 ±  6%     +34.6%    3552253 ±  4%  numa-vmstat.node1.numa_hit
   2603506 ±  6%     +34.8%    3508559 ±  4%  numa-vmstat.node1.numa_local
      1.11 ± 20%     -38.9%       0.68 ± 31%  sched_debug.cfs_rq:/.h_nr_queued.stddev
      1.11 ± 19%     -38.6%       0.68 ± 31%  sched_debug.cfs_rq:/.h_nr_runnable.stddev
      5890 ±  6%     -10.7%       5262        sched_debug.cfs_rq:/.runnable_avg.max
      1064 ± 20%     -41.1%     626.67 ± 33%  sched_debug.cfs_rq:/.runnable_avg.stddev
      1151           -12.2%       1010        sched_debug.cpu.clock_task.stddev
      1.11 ± 20%     -39.1%       0.68 ± 32%  sched_debug.cpu.nr_running.stddev
 1.861e+10            +4.5%  1.945e+10        perf-stat.i.branch-instructions
 1.264e+08            +4.1%  1.316e+08        perf-stat.i.branch-misses
  1.45e+08            +5.3%  1.526e+08        perf-stat.i.cache-references
      2.28            -4.3%       2.18        perf-stat.i.cpi
 8.533e+10            +4.5%   8.92e+10        perf-stat.i.instructions
      0.44            +4.5%       0.46        perf-stat.i.ipc
     63.03            +4.5%      65.90        perf-stat.i.metric.K/sec
   4035009            +4.5%    4218051        perf-stat.i.page-faults
      2.29            -4.4%       2.19        perf-stat.overall.cpi
      0.44            +4.6%       0.46        perf-stat.overall.ipc
 1.829e+10            +4.5%  1.912e+10        perf-stat.ps.branch-instructions
 1.242e+08            +4.1%  1.293e+08        perf-stat.ps.branch-misses
 1.424e+08            +5.3%  1.499e+08        perf-stat.ps.cache-references
 8.385e+10            +4.6%  8.767e+10        perf-stat.ps.instructions
   3966080            +4.6%    4146673        perf-stat.ps.page-faults
 5.154e+12            +4.6%  5.389e+12        perf-stat.total.instructions
     36.24            -1.9       34.36 ±  2%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_mprotect_mem
     38.30            -1.7       36.58 ±  2%  perf-profile.calltrace.cycles-pp.stress_mprotect_mem
     14.45 ±  2%      -1.7       12.80 ±  2%  perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
     17.12            -1.5       15.58 ±  2%  perf-profile.calltrace.cycles-pp.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
     17.06            -1.5       15.54 ±  2%  perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
     12.44 ±  2%      -1.5       10.92 ±  2%  perf-profile.calltrace.cycles-pp.do_dec_rlimit_put_ucounts.__sigqueue_free.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode
     12.46 ±  2%      -1.5       10.94 ±  2%  perf-profile.calltrace.cycles-pp.__sigqueue_free.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault
      0.54 ±  2%      -0.1        0.43 ± 44%  perf-profile.calltrace.cycles-pp.up_read.__bad_area.bad_area_access_error.exc_page_fault.asm_exc_page_fault
      0.84            -0.1        0.75 ±  4%  perf-profile.calltrace.cycles-pp.down_write.__split_vma.vma_modify.vma_modify_flags.mprotect_fixup
      1.60            -0.1        1.51 ±  2%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_sig_handler
      1.59            -0.1        1.51 ±  2%  perf-profile.calltrace.cycles-pp.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_sig_handler
      0.82 ±  3%      -0.1        0.74 ±  2%  perf-profile.calltrace.cycles-pp.sigprocmask.__x64_sys_rt_sigprocmask.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask
      1.44            -0.1        1.37 ±  2%  perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_sig_handler
      1.03 ±  2%      -0.1        0.98        perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigprocmask.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask
      1.29 ±  2%      -0.1        1.23        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask
      0.68 ±  3%      -0.0        0.64 ±  2%  perf-profile.calltrace.cycles-pp.up_write.vma_complete.__split_vma.vma_modify.vma_modify_flags
      0.58 ±  2%      -0.0        0.54 ±  3%  perf-profile.calltrace.cycles-pp.__bad_area.bad_area_access_error.exc_page_fault.asm_exc_page_fault.stress_mprotect_mem
      0.58 ±  2%      -0.0        0.56        perf-profile.calltrace.cycles-pp.fpu__clear_user_states.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault
      0.62 ±  3%      +0.1        0.67 ±  2%  perf-profile.calltrace.cycles-pp.mas_prev_slot.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.01            +0.1        1.07        perf-profile.calltrace.cycles-pp.copy_fpstate_to_sigframe.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart
      1.23            +0.1        1.30 ±  2%  perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.stress_mprotect_mem
      0.84 ±  3%      +0.1        0.91 ±  2%  perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.commit_merge.vma_merge_existing_range.vma_modify
      0.84 ±  2%      +0.1        0.91        perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.vma_modify.vma_modify_flags.mprotect_fixup
      1.75 ±  2%      +0.1        1.83        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mprotect
      0.59 ±  2%      +0.1        0.67 ±  2%  perf-profile.calltrace.cycles-pp.simple_dname.perf_event_mmap_event.perf_event_mmap.mprotect_fixup.do_mprotect_pkey
      2.41 ±  2%      +0.1        2.50        perf-profile.calltrace.cycles-pp.clear_bhb_loop.__mprotect
      1.77            +0.1        1.88        perf-profile.calltrace.cycles-pp.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode
      2.02            +0.1        2.14        perf-profile.calltrace.cycles-pp.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault
      0.98 ± 18%      +0.1        1.10        perf-profile.calltrace.cycles-pp.change_protection_range.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
      2.57            +0.1        2.70        perf-profile.calltrace.cycles-pp.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
      3.13 ±  3%      +0.2        3.34 ±  2%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault.__mprotect
      0.00            +0.6        0.55 ±  2%  perf-profile.calltrace.cycles-pp.prepend_copy.simple_dname.perf_event_mmap_event.perf_event_mmap.mprotect_fixup
     34.00            +1.1       35.12 ±  2%  perf-profile.calltrace.cycles-pp.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe
     46.05            +1.1       47.19        perf-profile.calltrace.cycles-pp.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect
     46.28            +1.2       47.43        perf-profile.calltrace.cycles-pp.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect
     48.43            +1.2       49.61        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect
     48.86            +1.2       50.06        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mprotect
     55.84            +1.6       57.41        perf-profile.calltrace.cycles-pp.__mprotect
     39.48            -1.9       37.62 ±  2%  perf-profile.children.cycles-pp.asm_exc_page_fault
     14.48 ±  2%      -1.6       12.83 ±  2%  perf-profile.children.cycles-pp.get_signal
     18.72            -1.6       17.11        perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
     39.92            -1.6       38.32 ±  2%  perf-profile.children.cycles-pp.stress_mprotect_mem
     18.52            -1.6       16.92        perf-profile.children.cycles-pp.arch_do_signal_or_restart
     12.47 ±  2%      -1.5       10.94 ±  2%  perf-profile.children.cycles-pp.__sigqueue_free
     12.44 ±  2%      -1.5       10.92 ±  2%  perf-profile.children.cycles-pp.do_dec_rlimit_put_ucounts
      5.00            -0.2        4.83 ±  2%  perf-profile.children.cycles-pp.up_write
      0.47 ± 10%      -0.1        0.34 ±  7%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      0.47 ± 10%      -0.1        0.34 ±  7%  perf-profile.children.cycles-pp.hrtimer_interrupt
      1.16 ±  3%      -0.1        1.05        perf-profile.children.cycles-pp.recalc_sigpending
      0.35 ±  7%      -0.1        0.24 ±  6%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.89 ±  6%      -0.1        0.79 ±  5%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      0.34 ±  8%      -0.1        0.24 ±  6%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.86 ±  2%      -0.1        0.78        perf-profile.children.cycles-pp.sigprocmask
      0.28 ± 10%      -0.1        0.21 ±  6%  perf-profile.children.cycles-pp.update_process_times
      1.05 ±  2%      -0.1        0.98        perf-profile.children.cycles-pp.__x64_sys_rt_sigprocmask
      0.30 ±  3%      -0.0        0.26 ±  3%  perf-profile.children.cycles-pp.fpregs_mark_activate
      0.17 ± 10%      -0.0        0.13 ±  6%  perf-profile.children.cycles-pp.sched_tick
      0.47 ±  3%      -0.0        0.43 ±  3%  perf-profile.children.cycles-pp.complete_signal
      0.54 ±  2%      -0.0        0.51 ±  2%  perf-profile.children.cycles-pp.up_read
      0.58 ±  2%      -0.0        0.55 ±  2%  perf-profile.children.cycles-pp.__bad_area
      0.61            -0.0        0.58        perf-profile.children.cycles-pp.fpu__clear_user_states
      0.12 ±  5%      +0.0        0.14 ±  4%  perf-profile.children.cycles-pp.__get_user_nocheck_4
      0.13 ±  3%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.ima_file_mprotect
      0.22 ±  5%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.security_file_mprotect
      0.25 ±  3%      +0.0        0.28 ±  4%  perf-profile.children.cycles-pp.stress_mwc16
      0.18 ±  5%      +0.0        0.20 ±  6%  perf-profile.children.cycles-pp.stress_mwc16modn
      0.34 ±  3%      +0.0        0.37 ±  3%  perf-profile.children.cycles-pp.mas_ascend
      0.12 ±  4%      +0.0        0.15 ±  5%  perf-profile.children.cycles-pp.copy_from_kernel_nofault_allowed
      0.30 ±  8%      +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.rcu_all_qs
      0.26 ±  4%      +0.0        0.29 ±  6%  perf-profile.children.cycles-pp.mas_pop_node
      0.44 ±  2%      +0.0        0.47        perf-profile.children.cycles-pp.vma_set_page_prot
      0.49 ±  3%      +0.0        0.53 ±  3%  perf-profile.children.cycles-pp.save_xstate_epilog
      0.66 ±  2%      +0.0        0.71 ±  2%  perf-profile.children.cycles-pp.native_irq_return_iret
      0.02 ± 99%      +0.1        0.08 ± 11%  perf-profile.children.cycles-pp.anon_vma_clone
      1.27            +0.1        1.33        perf-profile.children.cycles-pp.do_user_addr_fault
      0.84            +0.1        0.90        perf-profile.children.cycles-pp.mas_prev_slot
      1.04            +0.1        1.11        perf-profile.children.cycles-pp.copy_fpstate_to_sigframe
      0.73 ±  7%      +0.1        0.79 ±  2%  perf-profile.children.cycles-pp.__cond_resched
      0.46 ±  3%      +0.1        0.53 ±  2%  perf-profile.children.cycles-pp.copy_from_kernel_nofault
      1.30 ±  2%      +0.1        1.37        perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.50 ±  2%      +0.1        0.58 ±  2%  perf-profile.children.cycles-pp.prepend_copy
      1.68            +0.1        1.75        perf-profile.children.cycles-pp.mas_preallocate
      0.61 ±  3%      +0.1        0.70 ±  3%  perf-profile.children.cycles-pp.simple_dname
      2.77 ±  2%      +0.1        2.87        perf-profile.children.cycles-pp.clear_bhb_loop
      3.27            +0.1        3.37        perf-profile.children.cycles-pp.handle_signal
      1.78            +0.1        1.89        perf-profile.children.cycles-pp.get_sigframe
      2.05            +0.1        2.16        perf-profile.children.cycles-pp.x64_setup_rt_frame
      0.99 ± 18%      +0.1        1.11        perf-profile.children.cycles-pp.change_protection_range
      7.00            +0.2        7.24 ±  2%  perf-profile.children.cycles-pp.vma_prepare
     34.09            +1.1       35.22 ±  2%  perf-profile.children.cycles-pp.mprotect_fixup
     50.17            +1.1       51.31        perf-profile.children.cycles-pp.do_syscall_64
     46.24            +1.2       47.39        perf-profile.children.cycles-pp.do_mprotect_pkey
     46.33            +1.2       47.49        perf-profile.children.cycles-pp.__x64_sys_mprotect
     50.61            +1.2       51.78        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     55.94            +1.6       57.52        perf-profile.children.cycles-pp.__mprotect
     12.44 ±  2%      -1.5       10.91 ±  2%  perf-profile.self.cycles-pp.do_dec_rlimit_put_ucounts
      4.36            -0.1        4.22 ±  2%  perf-profile.self.cycles-pp.up_write
      1.14 ±  3%      -0.1        1.03        perf-profile.self.cycles-pp.recalc_sigpending
      0.87 ±  6%      -0.1        0.78 ±  5%  perf-profile.self.cycles-pp._raw_spin_lock_irq
      2.83            -0.1        2.75        perf-profile.self.cycles-pp.down_write
      0.28 ±  5%      -0.0        0.23 ±  5%  perf-profile.self.cycles-pp.fpregs_mark_activate
      0.19 ± 10%      -0.0        0.14 ± 12%  perf-profile.self.cycles-pp.__perf_event_header__init_id
      0.40 ±  3%      -0.0        0.36 ±  5%  perf-profile.self.cycles-pp.complete_signal
      0.52 ±  2%      -0.0        0.48 ±  2%  perf-profile.self.cycles-pp.up_read
      0.15 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.__send_signal_locked
      0.10 ±  4%      -0.0        0.09 ±  4%  perf-profile.self.cycles-pp.__bad_area_nosemaphore
      0.30 ±  3%      +0.0        0.33 ±  4%  perf-profile.self.cycles-pp.mas_ascend
      0.10 ±  5%      +0.0        0.12 ±  5%  perf-profile.self.cycles-pp.do_user_addr_fault
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.self.cycles-pp.copy_from_kernel_nofault_allowed
      0.21 ±  6%      +0.0        0.24 ±  4%  perf-profile.self.cycles-pp.rwsem_down_write_slowpath
      0.40            +0.0        0.43 ±  2%  perf-profile.self.cycles-pp.change_protection_range
      0.44            +0.0        0.47        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.24 ±  3%      +0.0        0.27 ±  6%  perf-profile.self.cycles-pp.mas_pop_node
      0.34 ±  2%      +0.0        0.38 ±  3%  perf-profile.self.cycles-pp.mas_preallocate
      0.37 ±  8%      +0.0        0.41 ±  3%  perf-profile.self.cycles-pp.__cond_resched
      0.72            +0.0        0.76 ±  2%  perf-profile.self.cycles-pp.copy_fpstate_to_sigframe
      0.41            +0.0        0.45 ±  3%  perf-profile.self.cycles-pp.mas_prev_slot
      0.66 ±  2%      +0.0        0.71 ±  2%  perf-profile.self.cycles-pp.native_irq_return_iret
      0.30 ±  4%      +0.0        0.35 ±  2%  perf-profile.self.cycles-pp.copy_from_kernel_nofault
      0.02 ±141%      +0.1        0.08 ± 11%  perf-profile.self.cycles-pp.anon_vma_clone
      1.21 ±  2%      +0.1        1.30 ±  2%  perf-profile.self.cycles-pp.__mprotect
      2.73 ±  2%      +0.1        2.83        perf-profile.self.cycles-pp.clear_bhb_loop
      2.76            +0.1        2.88        perf-profile.self.cycles-pp.do_mprotect_pkey
      3.48 ±  3%      +0.3        3.74 ±  2%  perf-profile.self.cycles-pp.stress_mprotect_mem
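
The cpi and ipc rows in the mprotect table above are two views of the same measurement: instructions per cycle is the reciprocal of cycles per instruction. A short Python check against the perf-stat.overall values, assuming the usual definitions of both metrics (the helper name is hypothetical):

```python
def ipc_from_cpi(cpi: float) -> float:
    """Instructions per cycle as the reciprocal of cycles per instruction."""
    if cpi <= 0:
        raise ValueError("cpi must be positive")
    return 1.0 / cpi


# perf-stat.overall.cpi from the mprotect table: 2.29 (base), 2.19 (patched).
base_ipc = ipc_from_cpi(2.29)
patched_ipc = ipc_from_cpi(2.19)

print(f"base ipc:    {base_ipc:.2f}")     # 0.44
print(f"patched ipc: {patched_ipc:.2f}")  # 0.46
print(f"ipc change:  {(patched_ipc - base_ipc) / base_ipc * 100:+.1f}%")  # +4.6%
```

The computed values agree with the perf-stat.overall.ipc row (0.44, 0.46, +4.6%), so the two rows carry no independent information.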



***************************************************************************************************
lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigrt/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      1345 ±  9%     -15.8%       1132 ±  5%  perf-c2c.HITM.remote
   5328778           +18.0%    6289475        vmstat.system.cs
    197362            +2.0%     201296        vmstat.system.in
     45.97 ±118%     -85.4%       6.71 ± 55%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
    582.79 ± 39%     -39.2%     354.28 ± 31%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait
      1260 ± 46%     -43.7%     709.74 ± 31%  perf-sched.wait_and_delay.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait
     45.97 ±118%     -85.4%       6.71 ± 55%  perf-sched.wait_time.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
    705.59 ± 50%     -48.9%     360.90 ± 32%  perf-sched.wait_time.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait
     83250           -16.0%      69935        stress-ng.sigrt.nanosecs_between_sigqueue_and_sigwaitinfo_completion
 3.362e+08           +15.7%   3.89e+08        stress-ng.sigrt.ops
   5601334           +15.7%    6480915        stress-ng.sigrt.ops_per_sec
  65582158           +17.7%   77176472        stress-ng.time.involuntary_context_switches
      3423            -1.4%       3375        stress-ng.time.system_time
    335.13 ±  2%     +14.5%     383.80 ±  2%  stress-ng.time.user_time
 2.714e+08           +17.4%  3.185e+08        stress-ng.time.voluntary_context_switches
   4202907 ± 15%     -24.2%    3184715 ± 12%  sched_debug.cfs_rq:/.avg_vruntime.max
     82.07 ± 12%    +391.9%     403.68 ± 94%  sched_debug.cfs_rq:/.load_avg.avg
    169.48 ±  8%   +1182.4%       2173 ±115%  sched_debug.cfs_rq:/.load_avg.stddev
   4202907 ± 15%     -24.2%    3184715 ± 12%  sched_debug.cfs_rq:/.min_vruntime.max
      1239 ±  8%     +14.2%       1415 ± 12%  sched_debug.cfs_rq:/.util_avg.max
   2593172           +17.4%    3044316        sched_debug.cpu.nr_switches.avg
   1526897 ±  3%     +66.4%    2540867 ±  2%  sched_debug.cpu.nr_switches.min
    606805           -67.2%     198918 ±  9%  sched_debug.cpu.nr_switches.stddev
 1.902e+10           +14.8%  2.184e+10        perf-stat.i.branch-instructions
  1.42e+08 ±  3%     +16.2%   1.65e+08        perf-stat.i.branch-misses
      6.65 ±  4%      -0.9        5.77 ±  7%  perf-stat.i.cache-miss-rate%
 3.931e+08 ±  9%     +17.1%  4.605e+08 ±  6%  perf-stat.i.cache-references
   5534190           +17.4%    6498045        perf-stat.i.context-switches
      2.71           -14.3%       2.33        perf-stat.i.cpi
 8.694e+10           +14.8%  9.976e+10        perf-stat.i.instructions
      0.39           +14.2%       0.45        perf-stat.i.ipc
     86.53           +17.4%     101.60        perf-stat.i.metric.K/sec
      6.82 ±  5%      -0.9        5.91 ±  9%  perf-stat.overall.cache-miss-rate%
      2.59           -12.9%       2.26        perf-stat.overall.cpi
      0.39           +14.7%       0.44        perf-stat.overall.ipc
 1.871e+10           +14.8%  2.149e+10        perf-stat.ps.branch-instructions
 1.396e+08 ±  3%     +16.2%  1.622e+08        perf-stat.ps.branch-misses
 3.868e+08 ±  9%     +17.1%   4.53e+08 ±  6%  perf-stat.ps.cache-references
   5443676           +17.4%    6391319        perf-stat.ps.context-switches
 8.552e+10           +14.8%  9.813e+10        perf-stat.ps.instructions
 5.251e+12           +14.3%      6e+12        perf-stat.total.instructions
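
In the sigrt table above, the context-switch counters rise by roughly the same percentage as the op count. One way to check this is to divide total context switches by ops for each commit. The Python sketch below does that with the stress-ng.time.* and stress-ng.sigrt.ops rows. It is only a rough reading: values printed in scientific notation are already rounded to four significant digits, and the helper name is illustrative.

```python
def switches_per_op(voluntary: float, involuntary: float, ops: float) -> float:
    """Total context switches divided by completed stress-ng ops."""
    return (voluntary + involuntary) / ops


# (voluntary_context_switches, involuntary_context_switches, sigrt.ops)
base = switches_per_op(2.714e8, 65582158, 3.362e8)
patched = switches_per_op(3.185e8, 77176472, 3.89e8)

print(f"base:    {base:.2f} switches/op")     # 1.00
print(f"patched: {patched:.2f} switches/op")  # 1.02
```

Both commits come out at about one context switch per op, which suggests the higher vmstat.system.cs figure tracks the throughput gain rather than a separate scheduling change.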



***************************************************************************************************
lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigbus/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      7.64            +1.7        9.30        mpstat.cpu.all.usr%
     36.50 ± 16%     -42.9%      20.83 ± 31%  perf-c2c.DRAM.local
      2312 ±  6%     -68.7%     723.17 ±  4%  perf-c2c.DRAM.remote
      3690 ±  3%     +44.9%       5347 ±  6%  perf-c2c.HITM.local
      2155 ±  6%     -71.8%     608.17 ±  4%  perf-c2c.HITM.remote
      4477 ± 69%     -70.3%       1328 ± 35%  proc-vmstat.numa_hint_faults
      2459 ± 11%     -64.8%     866.33 ± 47%  proc-vmstat.numa_hint_faults_local
    140611 ± 21%     -33.6%      93302 ± 45%  proc-vmstat.numa_pte_updates
 7.197e+08           +20.7%  8.685e+08        proc-vmstat.pgfault
 7.201e+08           +20.6%  8.682e+08        stress-ng.sigbus.ops
  12001759           +20.6%   14469786        stress-ng.sigbus.ops_per_sec
      3526            -1.8%       3461        stress-ng.time.system_time
    261.31           +25.4%     327.64        stress-ng.time.user_time
      0.03 ± 55%     -64.6%       0.01 ± 17%  perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.86 ±150%     -90.1%       0.09 ±201%  perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      0.02 ± 50%     -58.7%       0.01 ± 14%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1.08 ± 18%     -34.1%       0.71 ± 14%  perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.31 ± 72%     -65.9%       0.11 ± 71%  perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      0.02 ± 10%     -23.4%       0.01 ± 15%  perf-sched.sch_delay.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
      1.91 ±218%     -99.2%       0.02 ± 11%  perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      4.00 ± 49%     -71.6%       1.14 ± 56%  perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    261.25 ± 37%    +199.1%     781.43 ± 15%  perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     81.02 ± 59%    +274.1%     303.13 ± 50%  perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      6.60 ±  2%     +16.9%       7.71 ±  3%  perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
    108.83 ± 63%     -81.2%      20.50 ±113%  perf-sched.wait_and_delay.count.devkmsg_read.vfs_read.ksys_read.do_syscall_64
      3107 ±  3%     -12.6%       2714 ±  5%  perf-sched.wait_and_delay.count.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown]
    124.17 ± 63%     -70.1%      37.17 ± 60%  perf-sched.wait_and_delay.count.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
    751.00 ±  2%     -17.0%     623.50 ±  2%  perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      1550 ± 31%    +119.7%       3406 ± 19%  perf-sched.wait_and_delay.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    261.24 ± 37%    +199.1%     781.42 ± 15%  perf-sched.wait_time.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     80.16 ± 60%    +278.0%     303.05 ± 50%  perf-sched.wait_time.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      6.59 ±  2%     +17.0%       7.71 ±  3%  perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      1550 ± 31%    +119.7%       3406 ± 19%  perf-sched.wait_time.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.18           -49.0%       0.09 ±  3%  perf-stat.i.MPKI
  1.59e+10           +19.7%  1.903e+10        perf-stat.i.branch-instructions
      0.28            -0.0        0.25        perf-stat.i.branch-miss-rate%
  40989724            +5.3%   43173098 ±  2%  perf-stat.i.branch-misses
     32.63           -15.8       16.81 ±  2%  perf-stat.i.cache-miss-rate%
  12733301 ±  2%     -40.3%    7597041 ±  3%  perf-stat.i.cache-misses
  38933806           +14.5%   44591128        perf-stat.i.cache-references
      3.17           -16.4%       2.65        perf-stat.i.cpi
     18224           +75.2%      31921        perf-stat.i.cycles-between-cache-misses
 7.098e+10           +19.6%  8.489e+10        perf-stat.i.instructions
      0.32           +19.0%       0.38        perf-stat.i.ipc
    184.67           +20.6%     222.65        perf-stat.i.metric.K/sec
  11819123           +20.6%   14249011        perf-stat.i.page-faults
      0.18           -50.1%       0.09 ±  3%  perf-stat.overall.MPKI
      0.26            -0.0        0.23        perf-stat.overall.branch-miss-rate%
     32.70           -15.7       17.04 ±  3%  perf-stat.overall.cache-miss-rate%
      3.19           -16.4%       2.66        perf-stat.overall.cpi
     17772 ±  2%     +67.6%      29795 ±  2%  perf-stat.overall.cycles-between-cache-misses
      0.31           +19.6%       0.38        perf-stat.overall.ipc
 1.564e+10           +19.7%  1.871e+10        perf-stat.ps.branch-instructions
  40314687            +5.4%   42478375 ±  2%  perf-stat.ps.branch-misses
  12525837 ±  2%     -40.3%    7473864 ±  3%  perf-stat.ps.cache-misses
  38300912           +14.5%   43866104        perf-stat.ps.cache-references
 6.982e+10           +19.6%   8.35e+10        perf-stat.ps.instructions
  11626044           +20.6%   14016280        perf-stat.ps.page-faults
 4.284e+12           +19.5%  5.117e+12        perf-stat.total.instructions
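
For readers checking the table by hand: the %change column appears to be the relative difference between the base commit (left) and the patched commit (right), while rate metrics such as cache-miss-rate% are shown as an absolute difference in percentage points. The sketch below recomputes two rows under that assumption. The helper names are hypothetical and are not part of lkp-tests, and the idea that cache-miss-rate% is cache-misses divided by cache-references is an assumption inferred from the figures above.

```python
# Minimal sketch: recompute two rows of the comparison table above.
# pct_change and miss_rate are illustrative helpers, not lkp-tests functions.

def pct_change(base: float, patched: float) -> float:
    """Relative change from base to patched, in percent."""
    return (patched - base) / base * 100.0

def miss_rate(misses: float, references: float) -> float:
    """Cache miss rate in percent (assumed to be misses / references)."""
    return misses / references * 100.0

# stress-ng.sigbus.ops_per_sec: base 12001759, patched 14469786
ops_change = pct_change(12001759, 14469786)

# perf-stat.ps.cache-misses / perf-stat.ps.cache-references for each commit
base_rate = miss_rate(12525837, 38300912)
patched_rate = miss_rate(7473864, 43866104)

# Rate metrics are compared in percentage points, not as a relative change
rate_delta_pp = patched_rate - base_rate

print(f"ops_per_sec change: {ops_change:+.1f}%")
print(f"cache-miss-rate: {base_rate:.2f} -> {patched_rate:.2f} ({rate_delta_pp:+.1f} pp)")
```

Under these assumptions the results line up with the stress-ng.sigbus.ops_per_sec and perf-stat.overall.cache-miss-rate% rows.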





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki




