[linus:master] [mm] 249608ee47: stress-ng.mmap.ops_per_sec 13.1% regression

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




Hello,

we reported
"[linus:master] [mm]  249608ee47:  will-it-scale.per_thread_ops 50.1% improvement"
in
https://lore.kernel.org/all/202412122346.ea54d461-lkp@xxxxxxxxx/

now we noticed a regression from stress-ng.mmap tests. just FYI.


kernel test robot noticed a 13.1% regression of stress-ng.mmap.ops_per_sec on:


commit: 249608ee47132cab3b1adacd9e463548f57bd316 ("mm: respect mmap hint address when aligning for THP")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

[test failed on linus/master      f932fb9b40749d1c9a539d89bb3e288c077aafe5]
[test failed on linux-next/master 4176cf5c5651c33769de83bb61b0287f4ec7719f]

testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: mmap
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+---------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_thread_ops 51.6% improvement |
| test machine     | 104 threads 2 sockets (Skylake) with 192G memory              |
| test parameters  | cpufreq_governor=performance                                  |
|                  | mode=thread                                                   |
|                  | nr_task=100%                                                  |
|                  | test=brk2                                                     |
+------------------+---------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_thread_ops 50.1% improvement |
| test machine     | 104 threads 2 sockets (Skylake) with 192G memory              |
| test parameters  | cpufreq_governor=performance                                  |
|                  | mode=thread                                                   |
|                  | nr_task=100%                                                  |
|                  | test=brk1                                                     |
+------------------+---------------------------------------------------------------+


If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@xxxxxxxxx>
| Closes: https://lore.kernel.org/oe-lkp/202412241643.57d4b342-lkp@xxxxxxxxx


Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241224/202412241643.57d4b342-lkp@xxxxxxxxx

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/mmap/stress-ng/60s

commit: 
  89dd878282 ("mm: memcg: declare do_memsw_account inline")
  249608ee47 ("mm: respect mmap hint address when aligning for THP")

89dd878282881306 249608ee47132cab3b1adacd9e4 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      2875 ± 10%    +306.7%      11691 ± 30%  meminfo.Mlocked
      4187            -4.6%       3996        vmstat.system.cs
      0.36 ±  4%      -0.0        0.32 ±  4%  mpstat.cpu.all.irq%
     11.47            -1.5        9.98        mpstat.cpu.all.soft%
      1477 ± 12%    +303.6%       5961 ± 29%  numa-meminfo.node0.Mlocked
      1532 ± 16%    +273.3%       5722 ± 28%  numa-meminfo.node1.Mlocked
    356.07 ± 14%    +291.1%       1392 ± 29%  numa-vmstat.node0.nr_mlock
    375.78 ± 15%    +304.8%       1521 ± 24%  numa-vmstat.node1.nr_mlock
   8538445 ±  5%      -8.4%    7819829 ±  5%  sched_debug.cfs_rq:/.avg_vruntime.avg
   8538448 ±  5%      -8.4%    7819830 ±  5%  sched_debug.cfs_rq:/.min_vruntime.avg
    125824           -13.1%     109373        stress-ng.mmap.ops
      2096           -13.1%       1822        stress-ng.mmap.ops_per_sec
    100307            -3.7%      96634        stress-ng.time.involuntary_context_switches
 1.412e+08            +4.7%  1.478e+08        stress-ng.time.minor_page_faults
      5404            +1.9%       5508        stress-ng.time.percent_of_cpu_this_job_got
      3178            +2.2%       3248        stress-ng.time.system_time
     71.10           -10.4%      63.67        stress-ng.time.user_time
      3130 ±  4%     +14.8%       3594 ±  2%  perf-sched.wait_and_delay.count.__cond_resched.__tlb_batch_free_encoded_pages.tlb_finish_mmu.vms_clear_ptes.part
    177.30 ±  9%     -21.3%     139.60 ±  9%  perf-sched.wait_and_delay.count.__cond_resched.down_write.anon_vma_clone.__split_vma.vms_gather_munmap_vmas
    120.50 ± 10%     -34.8%      78.60 ± 50%  perf-sched.wait_and_delay.count.__cond_resched.down_write.vma_expand.vma_merge_new_range.__mmap_region
    603.00 ±  6%     -35.9%     386.50 ±  7%  perf-sched.wait_and_delay.count.__cond_resched.folio_zero_user.vma_alloc_anon_folio_pmd.__do_huge_pmd_anonymous_page.__handle_mm_fault
    190.30 ±  6%     -20.7%     150.90 ± 10%  perf-sched.wait_and_delay.count.__cond_resched.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_preallocate.__split_vma
    869.80 ±  7%     -18.1%     712.20 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.unmap_vmas.vms_clear_ptes.part.0
    347.50 ±  6%     -20.0%     277.90 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
    217.40 ± 78%     -33.1%     145.50 ±  5%  perf-sched.wait_and_delay.count.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
    704.39 ± 15%    +305.6%       2856 ± 27%  proc-vmstat.nr_mlock
 3.029e+08           -12.8%  2.643e+08        proc-vmstat.pgalloc_normal
 1.417e+08            +4.7%  1.483e+08        proc-vmstat.pgfault
 3.025e+08           -12.8%  2.638e+08        proc-vmstat.pgfree
    228212           -34.6%     149297        proc-vmstat.thp_deferred_split_page
    228277           -34.6%     149362        proc-vmstat.thp_fault_alloc
    228276           -34.6%     149361        proc-vmstat.thp_split_pmd
  12441114           -14.2%   10672430        proc-vmstat.unevictable_pgs_culled
  12441103           -14.2%   10672446        proc-vmstat.unevictable_pgs_mlocked
  12441101           -14.2%   10672045        proc-vmstat.unevictable_pgs_munlocked
  12441100           -14.2%   10672016        proc-vmstat.unevictable_pgs_rescued
      5.22            -6.2%       4.90        perf-stat.i.MPKI
 1.477e+10            -7.4%  1.367e+10        perf-stat.i.branch-instructions
 1.137e+08 ±  3%     -10.0%  1.023e+08        perf-stat.i.branch-misses
     79.12            -1.7       77.44        perf-stat.i.cache-miss-rate%
 3.868e+08           -13.5%  3.346e+08        perf-stat.i.cache-misses
 4.885e+08           -11.6%  4.316e+08        perf-stat.i.cache-references
      4083            -5.1%       3876        perf-stat.i.context-switches
      2.63            +8.5%       2.85        perf-stat.i.cpi
    503.35           +15.6%     582.12        perf-stat.i.cycles-between-cache-misses
 7.402e+10            -7.8%  6.823e+10        perf-stat.i.instructions
      0.38            -7.8%       0.35        perf-stat.i.ipc
     72.64            +2.7%      74.59        perf-stat.i.metric.K/sec
   2325817            +2.6%    2387093        perf-stat.i.minor-faults
   2325817            +2.6%    2387093        perf-stat.i.page-faults
      5.22            -6.2%       4.90        perf-stat.overall.MPKI
     79.16            -1.7       77.50        perf-stat.overall.cache-miss-rate%
      2.63            +8.4%       2.86        perf-stat.overall.cpi
    504.25           +15.6%     582.68        perf-stat.overall.cycles-between-cache-misses
      0.38            -7.8%       0.35        perf-stat.overall.ipc
 1.452e+10            -7.4%  1.344e+10        perf-stat.ps.branch-instructions
  1.12e+08 ±  3%      -9.9%   1.01e+08        perf-stat.ps.branch-misses
 3.802e+08           -13.5%   3.29e+08        perf-stat.ps.cache-misses
 4.803e+08           -11.6%  4.245e+08        perf-stat.ps.cache-references
      4017            -5.0%       3817        perf-stat.ps.context-switches
 7.278e+10            -7.8%  6.711e+10        perf-stat.ps.instructions
   2286433            +2.7%    2347115        perf-stat.ps.minor-faults
   2286433            +2.7%    2347115        perf-stat.ps.page-faults
 4.455e+12            -7.7%  4.111e+12        perf-stat.total.instructions
     17.85            -2.1       15.72        perf-profile.calltrace.cycles-pp.stress_mmap_set_light
     16.59            -2.1       14.48        perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault.stress_mmap_set_light
     16.56            -2.1       14.45        perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.stress_mmap_set_light
     15.84            -2.1       13.76        perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.stress_mmap_set_light
     15.59            -2.1       13.52        perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      5.18            -2.0        3.19        perf-profile.calltrace.cycles-pp.__do_huge_pmd_anonymous_page.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
     17.19            -2.0       15.21        perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_mmap_set_light
      5.08            -2.0        3.11        perf-profile.calltrace.cycles-pp.vma_alloc_anon_folio_pmd.__do_huge_pmd_anonymous_page.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      4.88            -1.9        2.98        perf-profile.calltrace.cycles-pp.folio_zero_user.vma_alloc_anon_folio_pmd.__do_huge_pmd_anonymous_page.__handle_mm_fault.handle_mm_fault
     10.48            -1.8        8.70        perf-profile.calltrace.cycles-pp.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      4.60            -1.8        2.82        perf-profile.calltrace.cycles-pp.clear_page_erms.folio_zero_user.vma_alloc_anon_folio_pmd.__do_huge_pmd_anonymous_page.__handle_mm_fault
      9.96            -1.6        8.40        perf-profile.calltrace.cycles-pp.__mmap
      8.90            -1.5        7.36        perf-profile.calltrace.cycles-pp.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      8.78            -1.4        7.39        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
      8.76            -1.4        7.37        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      7.90            -1.4        6.53        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      8.58            -1.4        7.22        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      7.42            -1.3        6.12        perf-profile.calltrace.cycles-pp.__mmap_region.do_mmap.vm_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.90            -1.0        5.91 ±  4%  perf-profile.calltrace.cycles-pp.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      6.88            -1.0        5.89 ±  4%  perf-profile.calltrace.cycles-pp.run_ksoftirqd.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      6.88            -1.0        5.89 ±  4%  perf-profile.calltrace.cycles-pp.handle_softirqs.run_ksoftirqd.smpboot_thread_fn.kthread.ret_from_fork
      6.92            -1.0        5.94 ±  4%  perf-profile.calltrace.cycles-pp.kthread.ret_from_fork.ret_from_fork_asm
      6.88            -1.0        5.89 ±  4%  perf-profile.calltrace.cycles-pp.rcu_core.handle_softirqs.run_ksoftirqd.smpboot_thread_fn.kthread
      6.92            -1.0        5.94 ±  4%  perf-profile.calltrace.cycles-pp.ret_from_fork.ret_from_fork_asm
      6.92            -1.0        5.94 ±  4%  perf-profile.calltrace.cycles-pp.ret_from_fork_asm
      6.86            -1.0        5.87 ±  4%  perf-profile.calltrace.cycles-pp.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd.smpboot_thread_fn
      5.09            -0.7        4.35 ±  4%  perf-profile.calltrace.cycles-pp.kmem_cache_free.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd
      4.27            -0.7        3.54        perf-profile.calltrace.cycles-pp.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      3.10            -0.6        2.54        perf-profile.calltrace.cycles-pp.vma_merge_new_range.__mmap_region.do_mmap.vm_mmap_pgoff.do_syscall_64
      2.85            -0.5        2.35        perf-profile.calltrace.cycles-pp.vm_area_dup.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      2.70            -0.5        2.22        perf-profile.calltrace.cycles-pp.vma_expand.vma_merge_new_range.__mmap_region.do_mmap.vm_mmap_pgoff
      3.27            -0.4        2.84 ±  3%  perf-profile.calltrace.cycles-pp.__slab_free.kmem_cache_free.rcu_do_batch.rcu_core.handle_softirqs
      2.47            -0.4        2.04        perf-profile.calltrace.cycles-pp.commit_merge.vma_expand.vma_merge_new_range.__mmap_region.do_mmap
      2.42            -0.4        1.99        perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap
      2.41            -0.4        2.00 ±  2%  perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      2.35            -0.4        1.95        perf-profile.calltrace.cycles-pp.__mmap_new_vma.__mmap_region.do_mmap.vm_mmap_pgoff.do_syscall_64
      2.28            -0.4        1.88        perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      2.22            -0.4        1.83 ±  2%  perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap
      0.62 ±  2%      -0.4        0.26 ±100%  perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_preallocate.__split_vma.vms_gather_munmap_vmas
      1.72 ±  2%      -0.3        1.42 ±  2%  perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_store_prealloc.vma_complete.__split_vma.vms_gather_munmap_vmas
      1.72            -0.3        1.44        perf-profile.calltrace.cycles-pp.free_pgtables.vms_clear_ptes.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      0.58 ±  2%      -0.3        0.31 ± 81%  perf-profile.calltrace.cycles-pp.__slab_free.kmem_cache_free.vm_area_free_rcu_cb.rcu_do_batch.rcu_core
      1.68            -0.2        1.43        perf-profile.calltrace.cycles-pp.asm_exc_page_fault.__mmap
      1.38 ±  2%      -0.2        1.15 ±  2%  perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas
      1.45            -0.2        1.22        perf-profile.calltrace.cycles-pp.unlink_anon_vmas.free_pgtables.vms_clear_ptes.vms_complete_munmap_vmas.do_vmi_align_munmap
      1.40            -0.2        1.17        perf-profile.calltrace.cycles-pp.mas_store_prealloc.commit_merge.vma_expand.vma_merge_new_range.__mmap_region
      1.43 ±  2%      -0.2        1.22 ±  4%  perf-profile.calltrace.cycles-pp.anon_vma_clone.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      1.07            -0.2        0.86        perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      3.80            -0.2        3.59        perf-profile.calltrace.cycles-pp.vma_alloc_folio_noprof.alloc_anon_folio.do_anonymous_page.__handle_mm_fault.handle_mm_fault
      3.73            -0.2        3.53        perf-profile.calltrace.cycles-pp.folio_alloc_mpol_noprof.vma_alloc_folio_noprof.alloc_anon_folio.do_anonymous_page.__handle_mm_fault
      3.72            -0.2        3.52        perf-profile.calltrace.cycles-pp.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof.alloc_anon_folio.do_anonymous_page
      1.23            -0.2        1.03        perf-profile.calltrace.cycles-pp.flush_tlb_mm_range.tlb_finish_mmu.vms_clear_ptes.vms_complete_munmap_vmas.do_vmi_align_munmap
      3.65            -0.2        3.45        perf-profile.calltrace.cycles-pp.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof.alloc_anon_folio
      1.22            -0.2        1.03        perf-profile.calltrace.cycles-pp.mas_store_prealloc.__mmap_new_vma.__mmap_region.do_mmap.vm_mmap_pgoff
      3.53            -0.2        3.34        perf-profile.calltrace.cycles-pp.get_page_from_freelist.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof
      0.94 ±  2%      -0.2        0.77        perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      0.90            -0.2        0.74        perf-profile.calltrace.cycles-pp.__mmap_prepare.__mmap_region.do_mmap.vm_mmap_pgoff.do_syscall_64
      0.87            -0.2        0.71        perf-profile.calltrace.cycles-pp.__vmf_anon_prepare.do_anonymous_page.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      0.82            -0.2        0.67        perf-profile.calltrace.cycles-pp.__anon_vma_prepare.__vmf_anon_prepare.do_anonymous_page.__handle_mm_fault.handle_mm_fault
      1.08            -0.1        0.93 ±  2%  perf-profile.calltrace.cycles-pp.vm_area_free_rcu_cb.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd
      0.70            -0.1        0.56 ±  2%  perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_preallocate.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap
      0.62 ±  2%      -0.1        0.49 ± 33%  perf-profile.calltrace.cycles-pp.rcu_cblist_dequeue.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd
      0.71 ±  2%      -0.1        0.58 ±  2%  perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap
      0.84            -0.1        0.71        perf-profile.calltrace.cycles-pp.mas_wr_bnode.mas_store_prealloc.__mmap_new_vma.__mmap_region.do_mmap
      0.90 ±  4%      -0.1        0.77 ±  5%  perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.anon_vma_clone.__split_vma.vms_gather_munmap_vmas.do_vmi_align_munmap
      0.78            -0.1        0.65        perf-profile.calltrace.cycles-pp.mas_split.mas_wr_bnode.mas_store_prealloc.__mmap_new_vma.__mmap_region
      0.88            -0.1        0.76 ±  2%  perf-profile.calltrace.cycles-pp.kmem_cache_free.vm_area_free_rcu_cb.rcu_do_batch.rcu_core.handle_softirqs
      0.75            -0.1        0.63        perf-profile.calltrace.cycles-pp.perf_event_mmap.__mmap_region.do_mmap.vm_mmap_pgoff.do_syscall_64
      0.73            -0.1        0.62        perf-profile.calltrace.cycles-pp.mas_wr_spanning_store.mas_store_prealloc.commit_merge.vma_expand.vma_merge_new_range
      0.77            -0.1        0.66        perf-profile.calltrace.cycles-pp.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.vms_clear_ptes.vms_complete_munmap_vmas
      0.67            -0.1        0.56        perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.__mmap_region.do_mmap.vm_mmap_pgoff
      0.80 ±  2%      -0.1        0.70 ±  7%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__slab_free.kmem_cache_free.rcu_do_batch.rcu_core
      0.65 ±  2%      -0.1        0.54 ±  2%  perf-profile.calltrace.cycles-pp.kmem_cache_free.unlink_anon_vmas.free_pgtables.vms_clear_ptes.vms_complete_munmap_vmas
      0.68            -0.1        0.58        perf-profile.calltrace.cycles-pp.native_flush_tlb_one_user.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.vms_clear_ptes
      2.55            -0.1        2.45        perf-profile.calltrace.cycles-pp.clear_page_erms.get_page_from_freelist.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof
      0.64            -0.1        0.54        perf-profile.calltrace.cycles-pp.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc.commit_merge.vma_expand
      0.69 ±  3%      -0.1        0.60 ±  8%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__slab_free.kmem_cache_free.rcu_do_batch
      0.73 ±  2%      -0.1        0.65 ±  3%  perf-profile.calltrace.cycles-pp.rmqueue.get_page_from_freelist.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof
      0.64 ±  2%      -0.1        0.56 ±  3%  perf-profile.calltrace.cycles-pp.__rmqueue_pcplist.rmqueue.get_page_from_freelist.__alloc_pages_noprof.alloc_pages_mpol_noprof
      1.32 ±  4%      +0.1        1.44 ±  4%  perf-profile.calltrace.cycles-pp.kmem_cache_free.rcu_do_batch.rcu_core.handle_softirqs.__irq_exit_rcu
      1.63 ±  4%      +0.1        1.76 ±  4%  perf-profile.calltrace.cycles-pp.rcu_do_batch.rcu_core.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt
      1.65 ±  4%      +0.1        1.77 ±  4%  perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.folios_put_refs.free_pages_and_swap_cache
      1.64 ±  4%      +0.1        1.77 ±  4%  perf-profile.calltrace.cycles-pp.rcu_core.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
      1.64 ±  4%      +0.1        1.77 ±  4%  perf-profile.calltrace.cycles-pp.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.folios_put_refs
      1.72 ±  4%      +0.1        1.86 ±  3%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.folios_put_refs.free_pages_and_swap_cache.__tlb_batch_free_encoded_pages
      1.73 ±  4%      +0.1        1.87 ±  3%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.folios_put_refs.free_pages_and_swap_cache.__tlb_batch_free_encoded_pages.tlb_finish_mmu
      0.96 ±  3%      +0.3        1.25        perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.__folio_batch_add_and_move.do_anonymous_page.__handle_mm_fault
      0.93 ±  3%      +0.3        1.22        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.__folio_batch_add_and_move
      0.95 ±  3%      +0.3        1.25        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.__folio_batch_add_and_move.do_anonymous_page
      1.50            +0.3        1.80        perf-profile.calltrace.cycles-pp.folio_batch_move_lru.__folio_batch_add_and_move.do_anonymous_page.__handle_mm_fault.handle_mm_fault
      1.55            +0.3        1.86        perf-profile.calltrace.cycles-pp.__folio_batch_add_and_move.do_anonymous_page.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      1.82 ±  5%      +0.4        2.19 ±  3%  perf-profile.calltrace.cycles-pp.__munlock_folio.mlock_folio_batch.munlock_folio.zap_present_ptes.zap_pte_range
      1.78 ±  5%      +0.4        2.15 ±  3%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.folio_lruvec_lock_irq.__munlock_folio.mlock_folio_batch.munlock_folio
      1.79 ±  5%      +0.4        2.16 ±  4%  perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irq.__munlock_folio.mlock_folio_batch.munlock_folio.zap_present_ptes
      1.93 ±  5%      +0.4        2.30 ±  4%  perf-profile.calltrace.cycles-pp.munlock_folio.zap_present_ptes.zap_pte_range.zap_pmd_range.unmap_page_range
      1.91 ±  5%      +0.4        2.28 ±  4%  perf-profile.calltrace.cycles-pp.mlock_folio_batch.munlock_folio.zap_present_ptes.zap_pte_range.zap_pmd_range
      1.73 ±  5%      +0.4        2.11 ±  4%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irq.folio_lruvec_lock_irq.__munlock_folio.mlock_folio_batch
      0.00            +0.5        0.51        perf-profile.calltrace.cycles-pp.__mm_populate.vm_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      0.00            +0.5        0.51        perf-profile.calltrace.cycles-pp.populate_vma_page_range.__mm_populate.vm_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     64.14            +4.7       68.84        perf-profile.calltrace.cycles-pp.__munmap
     63.37            +4.8       68.18        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
     63.33            +4.8       68.14        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     62.96            +4.9       67.82        perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     62.93            +4.9       67.81        perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     62.63            +4.9       67.55        perf-profile.calltrace.cycles-pp.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
     62.24            +5.0       67.21        perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
     31.45 ±  2%      +7.4       38.84        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.__page_cache_release.folios_put_refs
     32.36 ±  2%      +7.4       39.77        perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.__page_cache_release.folios_put_refs.free_pages_and_swap_cache.__tlb_batch_free_encoded_pages
     32.32 ±  2%      +7.4       39.73        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.__page_cache_release.folios_put_refs.free_pages_and_swap_cache
     33.49 ±  2%      +7.4       40.92        perf-profile.calltrace.cycles-pp.__page_cache_release.folios_put_refs.free_pages_and_swap_cache.__tlb_batch_free_encoded_pages.tlb_finish_mmu
     47.19            +7.5       54.71        perf-profile.calltrace.cycles-pp.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
     46.32            +7.7       53.97        perf-profile.calltrace.cycles-pp.vms_clear_ptes.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
     38.42            +7.7       46.15        perf-profile.calltrace.cycles-pp.folios_put_refs.free_pages_and_swap_cache.__tlb_batch_free_encoded_pages.tlb_finish_mmu.vms_clear_ptes
     40.48            +7.9       48.40        perf-profile.calltrace.cycles-pp.tlb_finish_mmu.vms_clear_ptes.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
     38.90            +8.2       47.07        perf-profile.calltrace.cycles-pp.__tlb_batch_free_encoded_pages.tlb_finish_mmu.vms_clear_ptes.vms_complete_munmap_vmas.do_vmi_align_munmap
     38.74            +8.2       46.94        perf-profile.calltrace.cycles-pp.free_pages_and_swap_cache.__tlb_batch_free_encoded_pages.tlb_finish_mmu.vms_clear_ptes.vms_complete_munmap_vmas
     18.73            -2.1       16.61        perf-profile.children.cycles-pp.stress_mmap_set_light
     16.65            -2.1       14.52        perf-profile.children.cycles-pp.exc_page_fault
     16.62            -2.1       14.50        perf-profile.children.cycles-pp.do_user_addr_fault
     18.10            -2.1       15.99        perf-profile.children.cycles-pp.asm_exc_page_fault
      5.64            -2.1        3.56        perf-profile.children.cycles-pp.__do_huge_pmd_anonymous_page
      5.53            -2.1        3.48        perf-profile.children.cycles-pp.vma_alloc_anon_folio_pmd
     16.35            -2.0       14.30        perf-profile.children.cycles-pp.handle_mm_fault
     16.10            -2.0       14.06        perf-profile.children.cycles-pp.__handle_mm_fault
      5.34            -2.0        3.35        perf-profile.children.cycles-pp.folio_zero_user
      7.67            -1.9        5.72        perf-profile.children.cycles-pp.clear_page_erms
     10.52            -1.8        8.74        perf-profile.children.cycles-pp.vms_gather_munmap_vmas
     12.77            -1.7       11.08        perf-profile.children.cycles-pp.rcu_core
     12.78            -1.7       11.10        perf-profile.children.cycles-pp.handle_softirqs
     12.76            -1.7       11.08        perf-profile.children.cycles-pp.rcu_do_batch
     12.35            -1.6       10.71        perf-profile.children.cycles-pp.kmem_cache_free
     10.03            -1.6        8.45        perf-profile.children.cycles-pp.__mmap
      8.94            -1.5        7.39        perf-profile.children.cycles-pp.__split_vma
      7.92            -1.4        6.55        perf-profile.children.cycles-pp.do_mmap
      8.60            -1.4        7.24        perf-profile.children.cycles-pp.vm_mmap_pgoff
      7.44            -1.3        6.15        perf-profile.children.cycles-pp.__mmap_region
      5.71            -1.0        4.69        perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
      6.90            -1.0        5.91 ±  4%  perf-profile.children.cycles-pp.smpboot_thread_fn
      6.88            -1.0        5.89 ±  4%  perf-profile.children.cycles-pp.run_ksoftirqd
      6.92            -1.0        5.94 ±  4%  perf-profile.children.cycles-pp.kthread
      6.92            -1.0        5.94 ±  4%  perf-profile.children.cycles-pp.ret_from_fork
      6.92            -1.0        5.94 ±  4%  perf-profile.children.cycles-pp.ret_from_fork_asm
      6.07            -0.8        5.23        perf-profile.children.cycles-pp.__slab_free
      4.87            -0.8        4.03        perf-profile.children.cycles-pp.mas_wr_node_store
      4.86            -0.8        4.05        perf-profile.children.cycles-pp.mas_store_prealloc
      4.58            -0.8        3.81        perf-profile.children.cycles-pp.mas_store_gfp
      6.28            -0.7        5.54 ±  3%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      6.23            -0.7        5.50 ±  3%  perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
      5.91            -0.7        5.21 ±  3%  perf-profile.children.cycles-pp.__irq_exit_rcu
      3.12            -0.6        2.55        perf-profile.children.cycles-pp.vma_merge_new_range
      2.86            -0.5        2.36        perf-profile.children.cycles-pp.vm_area_dup
      2.36            -0.5        1.87 ±  2%  perf-profile.children.cycles-pp.___slab_alloc
      2.71            -0.5        2.23        perf-profile.children.cycles-pp.vma_expand
      2.36            -0.5        1.91        perf-profile.children.cycles-pp.mas_alloc_nodes
      2.48            -0.4        2.04        perf-profile.children.cycles-pp.commit_merge
      2.49            -0.4        2.06 ±  2%  perf-profile.children.cycles-pp.vma_complete
      2.63 ±  2%      -0.4        2.21 ±  2%  perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
      2.36            -0.4        1.96        perf-profile.children.cycles-pp.__mmap_new_vma
      2.00 ±  2%      -0.4        1.62        perf-profile.children.cycles-pp.mas_preallocate
      3.62 ±  2%      -0.4        3.24 ±  2%  perf-profile.children.cycles-pp.free_unref_page_commit
      3.54 ±  2%      -0.4        3.17 ±  2%  perf-profile.children.cycles-pp.free_unref_page
      3.50 ±  2%      -0.4        3.13 ±  2%  perf-profile.children.cycles-pp.free_pcppages_bulk
      2.81 ±  2%      -0.3        2.47 ±  3%  perf-profile.children.cycles-pp.__put_partials
      4.36            -0.3        4.07        perf-profile.children.cycles-pp.alloc_pages_mpol_noprof
      1.75            -0.3        1.46        perf-profile.children.cycles-pp.free_pgtables
      4.27            -0.3        3.98        perf-profile.children.cycles-pp.__alloc_pages_noprof
      4.12            -0.3        3.84        perf-profile.children.cycles-pp.get_page_from_freelist
      1.84            -0.3        1.58        perf-profile.children.cycles-pp.vm_area_free_rcu_cb
      1.37            -0.3        1.11 ±  2%  perf-profile.children.cycles-pp.allocate_slab
      1.47            -0.2        1.23        perf-profile.children.cycles-pp.unlink_anon_vmas
      1.45            -0.2        1.22        perf-profile.children.cycles-pp.mas_find
      3.93            -0.2        3.71        perf-profile.children.cycles-pp.folio_alloc_mpol_noprof
      3.99            -0.2        3.77        perf-profile.children.cycles-pp.vma_alloc_folio_noprof
      1.45 ±  2%      -0.2        1.23 ±  4%  perf-profile.children.cycles-pp.anon_vma_clone
      1.35            -0.2        1.14        perf-profile.children.cycles-pp.__call_rcu_common
      1.32 ±  2%      -0.2        1.10        perf-profile.children.cycles-pp.__memcg_slab_free_hook
      1.25            -0.2        1.05        perf-profile.children.cycles-pp.flush_tlb_mm_range
      1.25            -0.2        1.06        perf-profile.children.cycles-pp.mas_wr_bnode
      0.82            -0.2        0.64 ±  2%  perf-profile.children.cycles-pp.__cond_resched
      1.08 ±  2%      -0.2        0.89 ±  2%  perf-profile.children.cycles-pp.mod_objcg_state
      1.14            -0.2        0.96 ±  2%  perf-profile.children.cycles-pp.down_write
      0.94            -0.2        0.76 ±  2%  perf-profile.children.cycles-pp.shuffle_freelist
      1.13            -0.2        0.96        perf-profile.children.cycles-pp.mas_spanning_rebalance
      0.91            -0.2        0.75        perf-profile.children.cycles-pp.__mmap_prepare
      0.88            -0.2        0.72 ±  2%  perf-profile.children.cycles-pp.__vmf_anon_prepare
      0.83            -0.2        0.67 ±  2%  perf-profile.children.cycles-pp.__anon_vma_prepare
      1.10 ±  3%      -0.2        0.95 ±  3%  perf-profile.children.cycles-pp.rmqueue
      0.99 ±  3%      -0.2        0.84 ±  3%  perf-profile.children.cycles-pp.__rmqueue_pcplist
      1.03            -0.2        0.88        perf-profile.children.cycles-pp.rcu_cblist_dequeue
      0.96            -0.1        0.81        perf-profile.children.cycles-pp.mas_wr_spanning_store
      1.10            -0.1        0.96        perf-profile.children.cycles-pp.mas_walk
      0.72 ±  2%      -0.1        0.58 ±  2%  perf-profile.children.cycles-pp.kmem_cache_alloc_bulk_noprof
      0.69 ±  2%      -0.1        0.56        perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
      0.84            -0.1        0.70        perf-profile.children.cycles-pp.mas_split
      0.71 ±  4%      -0.1        0.58 ±  4%  perf-profile.children.cycles-pp.rmqueue_bulk
      0.76            -0.1        0.63        perf-profile.children.cycles-pp.perf_event_mmap
      0.87 ±  4%      -0.1        0.75 ±  5%  perf-profile.children.cycles-pp.obj_cgroup_charge
      0.79            -0.1        0.67        perf-profile.children.cycles-pp.flush_tlb_func
      0.61 ±  2%      -0.1        0.49        perf-profile.children.cycles-pp.vma_prepare
      0.69            -0.1        0.58        perf-profile.children.cycles-pp.perf_event_mmap_event
      0.68            -0.1        0.58        perf-profile.children.cycles-pp.native_flush_tlb_one_user
      0.56            -0.1        0.46        perf-profile.children.cycles-pp.mas_wr_store_type
      0.56            -0.1        0.46        perf-profile.children.cycles-pp.vm_area_alloc
      0.43 ±  3%      -0.1        0.33 ±  2%  perf-profile.children.cycles-pp.folio_remove_rmap_ptes
      0.58 ±  2%      -0.1        0.49 ±  2%  perf-profile.children.cycles-pp.mas_pop_node
      0.58            -0.1        0.49 ±  2%  perf-profile.children.cycles-pp.mas_prev_slot
      0.25 ±  4%      -0.1        0.16 ±  4%  perf-profile.children.cycles-pp.get_partial_node
      0.47            -0.1        0.38        perf-profile.children.cycles-pp.mas_update_gap
      0.57 ±  2%      -0.1        0.48 ±  3%  perf-profile.children.cycles-pp.up_write
      0.47 ±  2%      -0.1        0.39        perf-profile.children.cycles-pp.perf_iterate_sb
      0.46            -0.1        0.38 ±  2%  perf-profile.children.cycles-pp.mas_push_data
      0.49            -0.1        0.42 ±  2%  perf-profile.children.cycles-pp.mas_next_slot
      0.48 ±  5%      -0.1        0.40 ±  3%  perf-profile.children.cycles-pp.__lruvec_stat_mod_folio
      0.46 ±  4%      -0.1        0.38 ±  3%  perf-profile.children.cycles-pp.__memcpy
      0.50            -0.1        0.43        perf-profile.children.cycles-pp.clear_bhb_loop
      0.47            -0.1        0.39 ±  2%  perf-profile.children.cycles-pp.mab_mas_cp
      0.45 ±  2%      -0.1        0.38 ±  2%  perf-profile.children.cycles-pp.mas_topiary_replace
      0.36 ±  2%      -0.1        0.29        perf-profile.children.cycles-pp.mas_leaf_max_gap
      0.34 ±  2%      -0.1        0.28        perf-profile.children.cycles-pp.__put_anon_vma
      0.29 ±  2%      -0.1        0.23 ±  3%  perf-profile.children.cycles-pp.setup_object
      0.30 ±  2%      -0.1        0.24 ±  2%  perf-profile.children.cycles-pp.rcu_all_qs
      0.19            -0.1        0.13 ±  3%  perf-profile.children.cycles-pp.vma_adjust_trans_huge
      0.33 ±  2%      -0.0        0.28 ±  2%  perf-profile.children.cycles-pp.mas_rebalance
      0.33 ±  2%      -0.0        0.28 ±  2%  perf-profile.children.cycles-pp.perf_event_mmap_output
      0.28            -0.0        0.23 ±  3%  perf-profile.children.cycles-pp.mas_destroy
      0.14 ±  3%      -0.0        0.10 ±  5%  perf-profile.children.cycles-pp.__split_huge_pmd
      0.14 ±  2%      -0.0        0.10 ±  5%  perf-profile.children.cycles-pp.__split_huge_pmd_locked
      0.12 ±  3%      -0.0        0.08 ±  3%  perf-profile.children.cycles-pp.folio_add_anon_rmap_ptes
      0.33 ±  5%      -0.0        0.28 ±  6%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      0.23 ±  3%      -0.0        0.19 ±  3%  perf-profile.children.cycles-pp.__anon_vma_interval_tree_remove
      0.32 ±  4%      -0.0        0.28 ±  5%  perf-profile.children.cycles-pp.hrtimer_interrupt
      0.28 ±  2%      -0.0        0.24 ±  2%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.26 ±  4%      -0.0        0.22 ±  2%  perf-profile.children.cycles-pp.call_rcu
      0.26 ±  2%      -0.0        0.22 ±  3%  perf-profile.children.cycles-pp.rcu_segcblist_enqueue
      0.30 ±  2%      -0.0        0.26        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      0.38 ±  2%      -0.0        0.34 ±  2%  perf-profile.children.cycles-pp.__pte_offset_map_lock
      0.21 ±  3%      -0.0        0.17 ±  3%  perf-profile.children.cycles-pp.mas_put_in_tree
      0.25 ±  4%      -0.0        0.21 ±  5%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.24 ±  2%      -0.0        0.21 ±  2%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.31 ±  2%      -0.0        0.27 ±  2%  perf-profile.children.cycles-pp.__free_one_page
      0.20 ±  2%      -0.0        0.17 ±  2%  perf-profile.children.cycles-pp.mt_find
      0.22 ±  2%      -0.0        0.18 ±  4%  perf-profile.children.cycles-pp.find_mergeable_anon_vma
      0.14 ±  3%      -0.0        0.10 ±  2%  perf-profile.children.cycles-pp.prep_compound_page
      0.24            -0.0        0.20 ±  3%  perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.18 ±  3%      -0.0        0.15 ±  3%  perf-profile.children.cycles-pp.stress_mmap_child
      0.24 ±  4%      -0.0        0.20 ±  5%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.15 ±  3%      -0.0        0.12 ±  4%  perf-profile.children.cycles-pp.can_vma_merge_right
      0.22 ±  3%      -0.0        0.19 ±  3%  perf-profile.children.cycles-pp.mas_mab_cp
      0.20 ±  2%      -0.0        0.17 ±  2%  perf-profile.children.cycles-pp.stress_mmap_slow_munmap
      0.10 ±  4%      -0.0        0.07 ±  5%  perf-profile.children.cycles-pp.get_any_partial
      0.22            -0.0        0.19 ±  3%  perf-profile.children.cycles-pp.mas_prev_node
      0.23 ±  2%      -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.mas_ascend
      0.16 ±  3%      -0.0        0.13 ±  3%  perf-profile.children.cycles-pp.mas_prev
      0.18 ±  2%      -0.0        0.15 ±  2%  perf-profile.children.cycles-pp.refill_obj_stock
      0.18 ±  2%      -0.0        0.15 ±  2%  perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.18 ±  3%      -0.0        0.15 ±  3%  perf-profile.children.cycles-pp._find_next_bit
      0.17 ±  4%      -0.0        0.14 ±  3%  perf-profile.children.cycles-pp.down_write_killable
      0.11 ±  4%      -0.0        0.09 ±  5%  perf-profile.children.cycles-pp.can_vma_merge_after
      0.16 ±  2%      -0.0        0.13 ±  3%  perf-profile.children.cycles-pp.__get_unmapped_area
      0.11 ±  2%      -0.0        0.09 ±  6%  perf-profile.children.cycles-pp.kmem_cache_free_bulk
      0.16 ±  2%      -0.0        0.14 ±  3%  perf-profile.children.cycles-pp.mas_next_node
      0.13 ±  3%      -0.0        0.11 ±  2%  perf-profile.children.cycles-pp.mas_split_final_node
      0.14 ±  4%      -0.0        0.12 ±  4%  perf-profile.children.cycles-pp.__perf_event_header__init_id
      0.11 ±  2%      -0.0        0.09 ±  3%  perf-profile.children.cycles-pp.anon_vma_interval_tree_insert
      0.13 ±  2%      -0.0        0.11 ±  2%  perf-profile.children.cycles-pp.mas_wr_store_entry
      0.12 ±  4%      -0.0        0.11 ±  4%  perf-profile.children.cycles-pp.mast_fill_bnode
      0.12 ±  4%      -0.0        0.11 ±  4%  perf-profile.children.cycles-pp.security_vm_enough_memory_mm
      0.40            -0.0        0.38        perf-profile.children.cycles-pp.lock_vma_under_rcu
      0.10 ±  3%      -0.0        0.08 ±  5%  perf-profile.children.cycles-pp.perf_output_begin
      0.11 ±  4%      -0.0        0.09 ±  4%  perf-profile.children.cycles-pp.tlb_gather_mmu
      0.08 ±  3%      -0.0        0.06 ±  6%  perf-profile.children.cycles-pp.mas_prev_setup
      0.13 ±  2%      -0.0        0.11 ±  3%  perf-profile.children.cycles-pp.sched_tick
      0.11 ±  2%      -0.0        0.09 ±  4%  perf-profile.children.cycles-pp.downgrade_write
      0.16 ±  3%      -0.0        0.14 ±  3%  perf-profile.children.cycles-pp.__mod_node_page_state
      0.12 ±  4%      -0.0        0.10 ±  4%  perf-profile.children.cycles-pp.init_multi_vma_prep
      0.19            -0.0        0.18 ±  2%  perf-profile.children.cycles-pp.__perf_sw_event
      0.32 ±  2%      -0.0        0.30        perf-profile.children.cycles-pp.lru_add_drain
      0.09 ±  4%      -0.0        0.08 ±  3%  perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
      0.07            -0.0        0.06 ±  5%  perf-profile.children.cycles-pp.kfree
      0.07            -0.0        0.06        perf-profile.children.cycles-pp.discard_slab
      0.46 ±  3%      +0.0        0.51        perf-profile.children.cycles-pp.__get_user_pages
      0.46 ±  3%      +0.0        0.51        perf-profile.children.cycles-pp.populate_vma_page_range
      0.46 ±  3%      +0.0        0.51        perf-profile.children.cycles-pp.__mm_populate
      1.61            +0.3        1.91        perf-profile.children.cycles-pp.__folio_batch_add_and_move
      1.64 ±  2%      +0.3        1.96        perf-profile.children.cycles-pp.folio_batch_move_lru
      1.93 ±  5%      +0.4        2.31 ±  3%  perf-profile.children.cycles-pp.munlock_folio
      0.00            +0.5        0.54 ±  4%  perf-profile.children.cycles-pp.mlock_drain_local
      1.81 ±  5%      +0.8        2.66 ±  3%  perf-profile.children.cycles-pp.folio_lruvec_lock_irq
      1.80 ±  5%      +0.9        2.66 ±  3%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      1.82 ±  5%      +0.9        2.68 ±  3%  perf-profile.children.cycles-pp.__munlock_folio
      1.92 ±  5%      +0.9        2.82 ±  3%  perf-profile.children.cycles-pp.mlock_folio_batch
     72.44            +3.4       75.83        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     72.28            +3.4       75.69        perf-profile.children.cycles-pp.do_syscall_64
     64.28            +4.7       68.96        perf-profile.children.cycles-pp.__munmap
     62.96            +4.9       67.83        perf-profile.children.cycles-pp.__x64_sys_munmap
     62.95            +4.9       67.82        perf-profile.children.cycles-pp.__vm_munmap
     62.65            +4.9       67.56        perf-profile.children.cycles-pp.do_vmi_munmap
     62.25            +5.0       67.22        perf-profile.children.cycles-pp.do_vmi_align_munmap
     38.34 ±  2%      +7.1       45.41        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
     33.52 ±  2%      +7.4       40.96        perf-profile.children.cycles-pp.__page_cache_release
     47.24            +7.5       54.75        perf-profile.children.cycles-pp.vms_complete_munmap_vmas
     46.34            +7.7       54.00        perf-profile.children.cycles-pp.vms_clear_ptes
     33.47 ±  2%      +7.7       41.19        perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
     38.56            +7.7       46.30        perf-profile.children.cycles-pp.folios_put_refs
     40.50            +7.9       48.41        perf-profile.children.cycles-pp.tlb_finish_mmu
     38.93 ±  2%      +7.9       46.87        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     38.92            +8.2       47.09        perf-profile.children.cycles-pp.__tlb_batch_free_encoded_pages
     38.76            +8.2       46.96        perf-profile.children.cycles-pp.free_pages_and_swap_cache
      7.11            -1.8        5.32        perf-profile.self.cycles-pp.clear_page_erms
      5.03            -0.7        4.34        perf-profile.self.cycles-pp.__slab_free
      2.38 ±  2%      -0.4        1.98 ±  2%  perf-profile.self.cycles-pp.mas_wr_node_store
      1.33 ±  4%      -0.2        1.13 ±  3%  perf-profile.self.cycles-pp.__memcg_slab_post_alloc_hook
      0.98            -0.2        0.82 ±  2%  perf-profile.self.cycles-pp.kmem_cache_alloc_noprof
      0.99            -0.2        0.83        perf-profile.self.cycles-pp.__call_rcu_common
      1.02            -0.2        0.87        perf-profile.self.cycles-pp.rcu_cblist_dequeue
      0.91 ±  2%      -0.1        0.76 ±  2%  perf-profile.self.cycles-pp.mod_objcg_state
      0.81            -0.1        0.66 ±  2%  perf-profile.self.cycles-pp.shuffle_freelist
      0.88            -0.1        0.74 ±  2%  perf-profile.self.cycles-pp.down_write
      1.01            -0.1        0.88        perf-profile.self.cycles-pp.mas_walk
      0.84            -0.1        0.72        perf-profile.self.cycles-pp.kmem_cache_free
      0.73 ±  2%      -0.1        0.61        perf-profile.self.cycles-pp.__memcg_slab_free_hook
      0.48            -0.1        0.36        perf-profile.self.cycles-pp.__cond_resched
      0.62 ±  2%      -0.1        0.51        perf-profile.self.cycles-pp.___slab_alloc
      0.67            -0.1        0.58        perf-profile.self.cycles-pp.native_flush_tlb_one_user
      0.49            -0.1        0.40        perf-profile.self.cycles-pp.mas_wr_store_type
      0.52 ±  2%      -0.1        0.45 ±  3%  perf-profile.self.cycles-pp.mas_pop_node
      0.49 ±  2%      -0.1        0.42 ±  2%  perf-profile.self.cycles-pp.up_write
      0.49            -0.1        0.42        perf-profile.self.cycles-pp.clear_bhb_loop
      0.37 ±  2%      -0.1        0.30        perf-profile.self.cycles-pp.zap_present_ptes
      0.41 ±  4%      -0.1        0.34 ±  3%  perf-profile.self.cycles-pp.__memcpy
      0.35            -0.1        0.30        perf-profile.self.cycles-pp.mab_mas_cp
      0.36            -0.1        0.30        perf-profile.self.cycles-pp.mas_store_gfp
      0.37 ±  3%      -0.1        0.32 ±  4%  perf-profile.self.cycles-pp.vm_area_dup
      0.43 ±  5%      -0.1        0.37 ±  5%  perf-profile.self.cycles-pp.obj_cgroup_charge
      0.32 ±  2%      -0.1        0.26        perf-profile.self.cycles-pp.mas_leaf_max_gap
      0.33            -0.1        0.28        perf-profile.self.cycles-pp.mas_prev_slot
      0.30            -0.0        0.25        perf-profile.self.cycles-pp.mas_next_slot
      0.23 ±  3%      -0.0        0.18 ±  2%  perf-profile.self.cycles-pp.vma_prepare
      0.30            -0.0        0.25        perf-profile.self.cycles-pp.vm_area_free_rcu_cb
      0.32            -0.0        0.28        perf-profile.self.cycles-pp.vms_gather_munmap_vmas
      0.27 ±  2%      -0.0        0.23 ±  2%  perf-profile.self.cycles-pp.mas_store_prealloc
      0.30 ±  2%      -0.0        0.26 ±  2%  perf-profile.self.cycles-pp.tlb_finish_mmu
      0.21 ±  2%      -0.0        0.17        perf-profile.self.cycles-pp.flush_tlb_mm_range
      0.30 ±  2%      -0.0        0.26        perf-profile.self.cycles-pp.mas_topiary_replace
      0.27            -0.0        0.23 ±  2%  perf-profile.self.cycles-pp.anon_vma_clone
      0.28 ±  2%      -0.0        0.24        perf-profile.self.cycles-pp.mas_find
      0.11 ±  4%      -0.0        0.07 ±  6%  perf-profile.self.cycles-pp.folio_add_anon_rmap_ptes
      0.25 ±  2%      -0.0        0.21 ±  2%  perf-profile.self.cycles-pp.mas_preallocate
      0.20 ±  3%      -0.0        0.16        perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk
      0.17 ±  3%      -0.0        0.13        perf-profile.self.cycles-pp.folio_remove_rmap_ptes
      0.26            -0.0        0.22 ±  2%  perf-profile.self.cycles-pp.mas_spanning_rebalance
      0.24 ±  2%      -0.0        0.20 ±  3%  perf-profile.self.cycles-pp.rmqueue_bulk
      0.21 ±  3%      -0.0        0.17 ±  3%  perf-profile.self.cycles-pp.unmap_page_range
      0.19 ±  4%      -0.0        0.15 ±  4%  perf-profile.self.cycles-pp.mas_put_in_tree
      0.13 ±  3%      -0.0        0.09 ±  5%  perf-profile.self.cycles-pp.prep_compound_page
      0.24 ±  2%      -0.0        0.20 ±  2%  perf-profile.self.cycles-pp.rcu_segcblist_enqueue
      0.30 ±  2%      -0.0        0.26 ±  2%  perf-profile.self.cycles-pp.__free_one_page
      0.19 ±  2%      -0.0        0.16        perf-profile.self.cycles-pp.mt_find
      0.24            -0.0        0.21        perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.12 ±  3%      -0.0        0.09 ±  5%  perf-profile.self.cycles-pp.get_partial_node
      0.17 ±  2%      -0.0        0.14 ±  4%  perf-profile.self.cycles-pp.__anon_vma_interval_tree_remove
      0.19 ±  3%      -0.0        0.16 ±  2%  perf-profile.self.cycles-pp.__split_vma
      0.16 ±  3%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.mas_update_gap
      0.20 ±  3%      -0.0        0.17 ±  2%  perf-profile.self.cycles-pp.rcu_all_qs
      0.14 ±  2%      -0.0        0.11 ±  2%  perf-profile.self.cycles-pp.perf_iterate_sb
      0.18 ±  2%      -0.0        0.15 ±  2%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.21 ±  2%      -0.0        0.18 ±  2%  perf-profile.self.cycles-pp.__mmap_region
      0.19 ±  2%      -0.0        0.16 ±  2%  perf-profile.self.cycles-pp.do_vmi_align_munmap
      0.13 ±  4%      -0.0        0.10 ±  3%  perf-profile.self.cycles-pp.can_vma_merge_right
      0.21            -0.0        0.18 ±  2%  perf-profile.self.cycles-pp.mas_ascend
      0.14 ±  5%      -0.0        0.12 ±  5%  perf-profile.self.cycles-pp.__lruvec_stat_mod_folio
      0.15 ±  3%      -0.0        0.13 ±  2%  perf-profile.self.cycles-pp._find_next_bit
      0.15 ±  4%      -0.0        0.13 ±  3%  perf-profile.self.cycles-pp.mas_alloc_nodes
      0.14 ±  3%      -0.0        0.11 ±  2%  perf-profile.self.cycles-pp.stress_mmap_child
      0.16 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.refill_obj_stock
      0.17            -0.0        0.15 ±  3%  perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.14 ±  2%      -0.0        0.12        perf-profile.self.cycles-pp.mas_mab_cp
      0.17            -0.0        0.15 ±  3%  perf-profile.self.cycles-pp.stress_mmap_slow_munmap
      0.11            -0.0        0.09 ±  4%  perf-profile.self.cycles-pp.vma_merge_new_range
      0.16 ±  2%      -0.0        0.14 ±  4%  perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.15            -0.0        0.13 ±  3%  perf-profile.self.cycles-pp.mas_push_data
      0.14 ±  3%      -0.0        0.12 ±  4%  perf-profile.self.cycles-pp.unlink_anon_vmas
      0.09 ±  3%      -0.0        0.07        perf-profile.self.cycles-pp.free_pages_and_swap_cache
      0.14 ±  5%      -0.0        0.12 ±  2%  perf-profile.self.cycles-pp.stress_munmap_retry_enomem
      0.16 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.zap_pte_range
      0.10            -0.0        0.08        perf-profile.self.cycles-pp.can_vma_merge_after
      0.15 ±  3%      -0.0        0.13 ±  3%  perf-profile.self.cycles-pp.__vm_munmap
      0.13 ±  3%      -0.0        0.11 ±  4%  perf-profile.self.cycles-pp.lru_add_drain
      0.12 ±  3%      -0.0        0.10 ±  3%  perf-profile.self.cycles-pp.vms_complete_munmap_vmas
      0.14 ±  3%      -0.0        0.12 ±  3%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.09 ±  4%      -0.0        0.07 ±  4%  perf-profile.self.cycles-pp.perf_event_mmap_event
      0.09 ±  5%      -0.0        0.08 ±  5%  perf-profile.self.cycles-pp.zap_pmd_range
      0.12 ±  3%      -0.0        0.10        perf-profile.self.cycles-pp.do_syscall_64
      0.09            -0.0        0.07 ±  6%  perf-profile.self.cycles-pp.vma_complete
      0.10 ±  4%      -0.0        0.09 ±  4%  perf-profile.self.cycles-pp.down_write_killable
      0.11 ±  3%      -0.0        0.09 ±  4%  perf-profile.self.cycles-pp.init_multi_vma_prep
      0.11 ±  4%      -0.0        0.09 ±  3%  perf-profile.self.cycles-pp.mas_wr_store_entry
      0.09 ±  4%      -0.0        0.08 ±  5%  perf-profile.self.cycles-pp.free_pgtables
      0.09 ±  5%      -0.0        0.08 ±  3%  perf-profile.self.cycles-pp.mas_prev
      0.06            -0.0        0.04 ± 33%  perf-profile.self.cycles-pp.mas_prev_setup
      0.09 ±  4%      -0.0        0.08 ±  5%  perf-profile.self.cycles-pp.tlb_gather_mmu
      0.08 ±  3%      -0.0        0.06 ±  7%  perf-profile.self.cycles-pp.unmap_vmas
      0.08 ±  5%      -0.0        0.06 ±  7%  perf-profile.self.cycles-pp.vms_clear_ptes
      0.10 ±  4%      -0.0        0.09        perf-profile.self.cycles-pp.flush_tlb_func
      0.07            -0.0        0.06 ±  8%  perf-profile.self.cycles-pp.__put_partials
      0.09 ±  5%      -0.0        0.08        perf-profile.self.cycles-pp.anon_vma_interval_tree_insert
      0.09 ±  4%      -0.0        0.08 ±  3%  perf-profile.self.cycles-pp.do_mmap
      0.08 ±  5%      -0.0        0.07 ±  4%  perf-profile.self.cycles-pp.perf_output_begin
      0.09 ±  5%      -0.0        0.07 ±  6%  perf-profile.self.cycles-pp.rcu_do_batch
      0.07 ±  5%      -0.0        0.06        perf-profile.self.cycles-pp.__mmap
      0.09 ±  4%      -0.0        0.08        perf-profile.self.cycles-pp.downgrade_write
      0.07            -0.0        0.06 ±  5%  perf-profile.self.cycles-pp.mas_destroy
      0.12 ±  4%      -0.0        0.10 ±  4%  perf-profile.self.cycles-pp.up_read
      0.07            -0.0        0.06        perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
      0.07            -0.0        0.06        perf-profile.self.cycles-pp.discard_slab
      0.17 ±  2%      +0.0        0.18        perf-profile.self.cycles-pp.lru_gen_add_folio
     38.93 ±  2%      +7.9       46.86        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath


***************************************************************************************************
lkp-skl-fpga01: 104 threads 2 sockets (Skylake) with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-9.4/thread/100%/debian-12-x86_64-20240206.cgz/lkp-skl-fpga01/brk2/will-it-scale

commit: 
  89dd878282 ("mm: memcg: declare do_memsw_account inline")
  249608ee47 ("mm: respect mmap hint address when aligning for THP")

89dd878282881306 249608ee47132cab3b1adacd9e4 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 3.415e+09 ±  5%     -18.3%  2.791e+09 ±  8%  cpuidle..time
    117810            +2.1%     120255        vmstat.system.in
     10.66 ±  4%      -2.0        8.69 ±  8%  mpstat.cpu.all.idle%
      0.10            -0.0        0.08 ±  2%  mpstat.cpu.all.soft%
      0.31            +0.1        0.37 ±  2%  mpstat.cpu.all.usr%
   1679216 ±  5%     -30.5%    1166751 ±  9%  numa-numastat.node0.local_node
   1728543 ±  4%     -29.7%    1214908 ±  8%  numa-numastat.node0.numa_hit
   2318360 ±  3%     -30.9%    1600917 ±  6%  numa-numastat.node1.local_node
   2376686 ±  2%     -30.1%    1660471 ±  5%  numa-numastat.node1.numa_hit
   1726631 ±  4%     -29.7%    1214257 ±  8%  numa-vmstat.node0.numa_hit
   1677304 ±  5%     -30.5%    1166100 ±  9%  numa-vmstat.node0.numa_local
   2374815 ±  2%     -30.1%    1659314 ±  5%  numa-vmstat.node1.numa_hit
   2316489 ±  3%     -30.9%    1599760 ±  6%  numa-vmstat.node1.numa_local
    198860           +51.6%     301493 ±  2%  will-it-scale.104.threads
     10.10           -22.5%       7.82 ±  2%  will-it-scale.104.threads_idle
      1911           +51.6%       2898 ±  2%  will-it-scale.per_thread_ops
    198860           +51.6%     301493 ±  2%  will-it-scale.workload
    506.67 ±  6%     +50.9%     764.67 ±  3%  perf-c2c.DRAM.local
      5447           +27.1%       6925 ±  3%  perf-c2c.DRAM.remote
      5367 ±  2%     +18.6%       6364        perf-c2c.HITM.local
      3830           +17.8%       4513 ±  3%  perf-c2c.HITM.remote
      9197           +18.3%      10877 ±  2%  perf-c2c.HITM.total
     23736            -1.8%      23303        proc-vmstat.nr_mapped
    108712            -2.0%     106548        proc-vmstat.nr_slab_unreclaimable
   4105528           -30.0%    2875907        proc-vmstat.numa_hit
   3997875           -30.8%    2768196        proc-vmstat.numa_local
    236448 ± 14%     -25.0%     177254 ± 12%  proc-vmstat.numa_pte_updates
   7242851           -34.3%    4757136        proc-vmstat.pgalloc_normal
   7071106           -35.1%    4589946        proc-vmstat.pgfree
  19917807 ±  2%     +24.3%   24752419 ±  3%  sched_debug.cfs_rq:/.avg_vruntime.avg
  38832674 ±  6%     +31.8%   51167079 ±  8%  sched_debug.cfs_rq:/.avg_vruntime.max
   5538759 ±  3%     +56.3%    8659607 ± 16%  sched_debug.cfs_rq:/.avg_vruntime.stddev
  19917807 ±  2%     +24.3%   24752418 ±  3%  sched_debug.cfs_rq:/.min_vruntime.avg
  38832674 ±  6%     +31.8%   51167093 ±  8%  sched_debug.cfs_rq:/.min_vruntime.max
   5538759 ±  3%     +56.3%    8659606 ± 16%  sched_debug.cfs_rq:/.min_vruntime.stddev
    894.81 ±  7%     +11.9%       1001 ±  8%  sched_debug.cfs_rq:/.util_est.max
      5560 ±  6%     -40.7%       3294 ±  3%  sched_debug.cpu.avg_idle.min
      0.52 ±  3%     +21.7%       0.63 ±  3%  perf-stat.i.MPKI
  17623556            -6.6%   16458641 ±  3%  perf-stat.i.branch-misses
     37.96            +3.6       41.59        perf-stat.i.cache-miss-rate%
  14340737 ±  3%     +22.2%   17528616 ±  2%  perf-stat.i.cache-misses
  38069590 ±  2%     +11.5%   42445235 ±  2%  perf-stat.i.cache-references
      9.24            +2.6%       9.48        perf-stat.i.cpi
 2.602e+11            +2.4%  2.665e+11        perf-stat.i.cpu-cycles
     18443 ±  3%     -17.1%      15286 ±  2%  perf-stat.i.cycles-between-cache-misses
      0.51 ±  2%     +22.2%       0.63 ±  2%  perf-stat.overall.MPKI
      0.32            -0.0        0.29 ±  2%  perf-stat.overall.branch-miss-rate%
     37.63            +3.6       41.25        perf-stat.overall.cache-miss-rate%
      9.28            +2.4%       9.50        perf-stat.overall.cpi
     18154 ±  2%     -16.2%      15205 ±  2%  perf-stat.overall.cycles-between-cache-misses
      0.11            -2.3%       0.11        perf-stat.overall.ipc
  42574383           -33.8%   28187632 ±  2%  perf-stat.overall.path-length
  17580646            -6.7%   16398374 ±  3%  perf-stat.ps.branch-misses
  14294844 ±  3%     +22.2%   17469729 ±  2%  perf-stat.ps.cache-misses
  37981661 ±  2%     +11.5%   42347645 ±  2%  perf-stat.ps.cache-references
 2.593e+11            +2.4%  2.655e+11        perf-stat.ps.cpu-cycles
      0.00 ±147%    +500.0%       0.01 ± 14%  perf-sched.sch_delay.avg.ms.__cond_resched.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof
      0.11 ±  8%     -32.5%       0.08 ± 23%  perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      0.00 ±223%  +10641.7%       0.21 ± 55%  perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      0.00 ±179%   +2890.9%       0.05 ± 53%  perf-sched.sch_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
      0.01 ±135%    +390.2%       0.07 ±100%  perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas
      0.00 ±223%   +1475.0%       0.01 ± 71%  perf-sched.sch_delay.avg.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
      0.00 ±223%   +9837.5%       0.13 ±121%  perf-sched.sch_delay.avg.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_write_begin
      0.00 ± 14%   +1830.0%       0.06 ± 97%  perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
      0.01 ±  8%   +2452.0%       0.21 ± 64%  perf-sched.sch_delay.avg.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.01 ± 16%    +870.6%       0.08 ± 84%  perf-sched.sch_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
      0.01 ±  6%    +823.9%       0.07 ± 31%  perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00 ±100%    +411.1%       0.01 ±  9%  perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
      0.02 ± 34%   +3178.5%       0.71 ± 32%  perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
      0.01 ± 75%   +1602.7%       0.10 ±143%  perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown].[unknown]
      0.12 ±150%     -87.6%       0.02 ± 45%  perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
      0.00 ±150%   +1047.1%       0.03 ±105%  perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
      0.00 ± 30%    +346.7%       0.01 ± 20%  perf-sched.sch_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
      0.02 ± 68%   +1050.0%       0.19 ± 27%  perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.do_poll.constprop.0.do_sys_poll
      0.01 ± 14%    +376.8%       0.04 ±105%  perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      0.01 ±  9%    +138.9%       0.01 ± 12%  perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      0.01         +2033.3%       0.13 ± 33%  perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
      0.01 ± 11%    +216.7%       0.03 ± 83%  perf-sched.sch_delay.avg.ms.schedule_timeout.kcompactd.kthread.ret_from_fork
      0.01 ±  5%    +172.1%       0.02 ± 11%  perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      0.01 ± 61%    +173.4%       0.03 ± 46%  perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.00 ±147%    +787.5%       0.01 ± 37%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof
      0.03 ±223%   +4840.4%       1.24 ± 64%  perf-sched.sch_delay.max.ms.__cond_resched.__do_fault.do_read_fault.do_pte_missing.__handle_mm_fault
      0.00 ±223%  +41625.0%       0.83 ± 60%  perf-sched.sch_delay.max.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      0.16 ±213%    +813.2%       1.48 ± 78%  perf-sched.sch_delay.max.ms.__cond_resched.down_write.vma_expand.vma_merge_new_range.do_brk_flags
      0.00 ±167%  +43144.0%       1.80 ± 59%  perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
      0.00 ±223%  +22188.9%       0.33 ±216%  perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock.fdget_pos.ksys_write.do_syscall_64
      0.00 ±223%   +2458.3%       0.05 ±154%  perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
      0.00 ±223%  +68268.8%       1.82 ± 71%  perf-sched.sch_delay.max.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_write_begin
      0.00 ± 11%  +15918.5%       0.72 ±101%  perf-sched.sch_delay.max.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
      0.01 ± 12%   +5779.5%       0.72 ± 50%  perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.02 ± 53%   +2545.4%       0.48 ± 73%  perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
      0.02 ± 18%  +15675.3%       2.45 ± 11%  perf-sched.sch_delay.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00 ±100%   +1100.0%       0.02 ± 76%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
      0.22 ± 70%   +1725.7%       3.94 ±  4%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
      0.01 ± 72%   +3737.3%       0.33 ±114%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown].[unknown]
      0.00 ±141%  +25095.7%       0.97 ±144%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
      0.58 ± 79%    +423.4%       3.03 ± 43%  perf-sched.sch_delay.max.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
      0.91 ± 75%    +324.0%       3.84 ±  3%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range.do_poll.constprop.0.do_sys_poll
      0.02 ± 49%  +18885.6%       3.51 ± 21%  perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
      0.06 ±  5%   +3199.2%       2.01        perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      0.93 ±115%    +238.9%       3.16 ± 52%  perf-sched.sch_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      5.53 ±  3%     +35.2%       7.48 ±  3%  perf-sched.total_wait_and_delay.average.ms
    330090           -37.0%     207837 ±  4%  perf-sched.total_wait_and_delay.count.ms
      5.52 ±  3%     +35.2%       7.46 ±  3%  perf-sched.total_wait_time.average.ms
      6.70 ±  4%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
    167.82 ± 96%     -92.4%      12.75 ± 78%  perf-sched.wait_and_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
      1.20 ±  4%     -58.9%       0.49 ±  4%  perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
    280.09 ±  3%     +36.1%     381.15 ±  3%  perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    606.50 ±  6%    -100.0%       0.00        perf-sched.wait_and_delay.count.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
    320972           -38.3%     197924 ±  4%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      3118 ±  2%     -24.6%       2352 ±  2%  perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    693.67            -9.8%     626.00        perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      1000          -100.0%       0.00        perf-sched.wait_and_delay.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
    167.82 ± 96%     -91.5%      14.30 ± 56%  perf-sched.wait_time.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
      0.55 ±223%    +762.9%       4.74 ±117%  perf-sched.wait_time.avg.ms.__cond_resched.mutex_lock.fdget_pos.ksys_write.do_syscall_64
      0.61 ±  3%     +24.0%       0.76 ±  8%  perf-sched.wait_time.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.26 ±221%   +3041.2%       8.22 ±129%  perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
      1.20 ±  4%     -59.9%       0.48 ±  4%  perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      0.91           +45.7%       1.32 ±  6%  perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
    280.07 ±  3%     +36.1%     381.13 ±  3%  perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.43 ±223%    +525.8%       2.69 ± 57%  perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      3.29 ±223%   +1258.4%      44.70 ± 98%  perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.fdget_pos.ksys_write.do_syscall_64
     29.75 ±  9%     +42.0%      42.24 ± 16%  perf-sched.wait_time.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.52 ±222%  +67466.8%     350.90 ±131%  perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
      3.60 ±  5%    +106.8%       7.43 ± 11%  perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
      5.04           +36.0%       6.86 ±  4%  perf-sched.wait_time.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      1.72 ±  3%      -0.2        1.47 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
      1.73 ±  3%      -0.2        1.48 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      1.72 ±  3%      -0.2        1.47 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
      1.82 ±  3%      -0.2        1.57 ±  3%  perf-profile.calltrace.cycles-pp.common_startup_64
      1.80 ±  3%      -0.2        1.56 ±  3%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
      1.80 ±  3%      -0.2        1.56 ±  3%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      1.80 ±  3%      -0.2        1.56 ±  3%  perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
      0.63 ±  3%      -0.2        0.43 ± 44%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable
      0.73            -0.1        0.59        perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
      0.82            -0.1        0.71        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      0.63 ±  3%      -0.1        0.54 ±  4%  perf-profile.calltrace.cycles-pp.intel_idle_ibrs.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
     97.85            +0.2       98.02        perf-profile.calltrace.cycles-pp.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     97.87            +0.2       98.04        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     96.68            +0.2       96.85        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk.do_syscall_64
     97.90            +0.2       98.09        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.brk
     96.79            +0.2       96.99        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
     96.82            +0.2       97.04        perf-profile.calltrace.cycles-pp.down_write_killable.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     95.68            +0.2       95.91        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
     98.06            +0.3       98.32        perf-profile.calltrace.cycles-pp.brk
      0.00            +0.6        0.60 ±  3%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      0.56 ±  4%      -0.4        0.16 ±  4%  perf-profile.children.cycles-pp.intel_idle_irq
      1.06 ±  3%      -0.4        0.70 ±  4%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      1.73 ±  3%      -0.2        1.49 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter
      1.73 ±  3%      -0.2        1.49 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter_state
      1.74 ±  3%      -0.2        1.50 ±  3%  perf-profile.children.cycles-pp.cpuidle_idle_call
      1.82 ±  3%      -0.2        1.57 ±  3%  perf-profile.children.cycles-pp.common_startup_64
      1.82 ±  3%      -0.2        1.57 ±  3%  perf-profile.children.cycles-pp.cpu_startup_entry
      1.82 ±  3%      -0.2        1.57 ±  3%  perf-profile.children.cycles-pp.do_idle
      1.80 ±  3%      -0.2        1.56 ±  3%  perf-profile.children.cycles-pp.start_secondary
      0.21            -0.2        0.05 ±  7%  perf-profile.children.cycles-pp.mas_store_gfp
      0.73            -0.1        0.59        perf-profile.children.cycles-pp.do_vmi_align_munmap
      0.69 ±  2%      -0.1        0.56 ±  4%  perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
      0.58 ±  3%      -0.1        0.47 ±  4%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      0.83            -0.1        0.72        perf-profile.children.cycles-pp.rwsem_spin_on_owner
      0.17 ±  2%      -0.1        0.07 ±  7%  perf-profile.children.cycles-pp.mas_store_prealloc
      0.58 ±  3%      -0.1        0.47 ±  4%  perf-profile.children.cycles-pp.hrtimer_interrupt
      0.17 ±  2%      -0.1        0.07 ±  6%  perf-profile.children.cycles-pp.vma_complete
      0.49 ±  3%      -0.1        0.39 ±  4%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.63 ±  4%      -0.1        0.55 ±  4%  perf-profile.children.cycles-pp.intel_idle_ibrs
      0.44 ±  4%      -0.1        0.36 ±  4%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.39 ±  3%      -0.1        0.32 ±  4%  perf-profile.children.cycles-pp.update_process_times
      0.32            -0.0        0.28        perf-profile.children.cycles-pp.__split_vma
      0.36            -0.0        0.31        perf-profile.children.cycles-pp.vms_gather_munmap_vmas
      0.24 ±  4%      -0.0        0.20 ±  3%  perf-profile.children.cycles-pp.sched_tick
      0.19 ±  7%      -0.0        0.16 ±  2%  perf-profile.children.cycles-pp.task_tick_fair
      0.06 ±  6%      -0.0        0.03 ± 70%  perf-profile.children.cycles-pp.smpboot_thread_fn
      0.12 ±  4%      -0.0        0.10 ±  6%  perf-profile.children.cycles-pp.rcu_do_batch
      0.13 ±  3%      -0.0        0.10 ±  3%  perf-profile.children.cycles-pp.rcu_core
      0.14 ±  2%      -0.0        0.12 ±  4%  perf-profile.children.cycles-pp.handle_softirqs
      0.08 ±  4%      -0.0        0.06 ± 11%  perf-profile.children.cycles-pp.get_jiffies_update
      0.08 ±  5%      -0.0        0.06 ± 11%  perf-profile.children.cycles-pp.tmigr_requires_handle_remote
      0.14 ±  2%      -0.0        0.12 ±  3%  perf-profile.children.cycles-pp.kmem_cache_free
      0.07 ±  7%      -0.0        0.05        perf-profile.children.cycles-pp.kthread
      0.07 ±  7%      -0.0        0.05        perf-profile.children.cycles-pp.ret_from_fork
      0.07 ±  7%      -0.0        0.05        perf-profile.children.cycles-pp.ret_from_fork_asm
      0.10 ±  7%      -0.0        0.08 ±  4%  perf-profile.children.cycles-pp.update_cfs_group
      0.06            -0.0        0.05        perf-profile.children.cycles-pp.__slab_free
      0.05            +0.0        0.07 ±  5%  perf-profile.children.cycles-pp.commit_merge
      0.06 ±  9%      +0.0        0.08 ±  4%  perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.06 ±  6%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.vma_expand
      0.08 ±  4%      +0.0        0.11 ±  5%  perf-profile.children.cycles-pp.up_write
      0.06 ±  6%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.05 ±  7%      +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.anon_vma_clone
      0.07 ±  5%      +0.0        0.11 ±  4%  perf-profile.children.cycles-pp.vma_merge_new_range
      0.06 ±  9%      +0.0        0.09        perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
      0.08 ±  5%      +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.vms_clear_ptes
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.unlink_anon_vmas
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.11 ±  4%      +0.1        0.17 ±  2%  perf-profile.children.cycles-pp.do_brk_flags
      0.00            +0.1        0.06 ±  6%  perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
      0.00            +0.1        0.06 ±  6%  perf-profile.children.cycles-pp.free_pgtables
      0.00            +0.1        0.06        perf-profile.children.cycles-pp.vm_area_dup
      0.17 ±  2%      +0.1        0.23 ±  2%  perf-profile.children.cycles-pp.vms_complete_munmap_vmas
      0.00            +0.1        0.07 ±  7%  perf-profile.children.cycles-pp.mas_wr_node_store
      0.00            +0.1        0.12 ±  3%  perf-profile.children.cycles-pp.poll_idle
      0.46 ±  4%      +0.1        0.60 ±  3%  perf-profile.children.cycles-pp.intel_idle
     97.85            +0.2       98.02        perf-profile.children.cycles-pp.__do_sys_brk
     97.90            +0.2       98.08        perf-profile.children.cycles-pp.do_syscall_64
     96.68            +0.2       96.86        perf-profile.children.cycles-pp.rwsem_optimistic_spin
     97.94            +0.2       98.12        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     96.79            +0.2       96.99        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
     96.82            +0.2       97.04        perf-profile.children.cycles-pp.down_write_killable
     95.71            +0.2       95.94        perf-profile.children.cycles-pp.osq_lock
     98.06            +0.3       98.32        perf-profile.children.cycles-pp.brk
      0.54 ±  4%      -0.4        0.15 ±  3%  perf-profile.self.cycles-pp.intel_idle_irq
      0.82            -0.1        0.71        perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.63 ±  4%      -0.1        0.55 ±  4%  perf-profile.self.cycles-pp.intel_idle_ibrs
      0.08 ±  4%      -0.0        0.06 ± 11%  perf-profile.self.cycles-pp.get_jiffies_update
      0.10 ±  7%      -0.0        0.08 ±  4%  perf-profile.self.cycles-pp.update_cfs_group
      0.06            -0.0        0.05        perf-profile.self.cycles-pp.ktime_get_update_offsets_now
      0.06 ±  9%      +0.0        0.08 ±  4%  perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.06            +0.0        0.09 ±  6%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.13 ±  3%      +0.1        0.18 ±  2%  perf-profile.self.cycles-pp.rwsem_optimistic_spin
      0.00            +0.1        0.06 ±  6%  perf-profile.self.cycles-pp.up_write
      0.00            +0.1        0.12 ±  4%  perf-profile.self.cycles-pp.poll_idle
      0.46 ±  4%      +0.1        0.60 ±  3%  perf-profile.self.cycles-pp.intel_idle
     95.11            +0.3       95.44        perf-profile.self.cycles-pp.osq_lock



***************************************************************************************************
lkp-skl-fpga01: 104 threads 2 sockets (Skylake) with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-9.4/thread/100%/debian-12-x86_64-20240206.cgz/lkp-skl-fpga01/brk1/will-it-scale

commit: 
  89dd878282 ("mm: memcg: declare do_memsw_account inline")
  249608ee47 ("mm: respect mmap hint address when aligning for THP")

89dd878282881306 249608ee47132cab3b1adacd9e4 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 3.271e+09 ± 11%     -23.6%  2.499e+09 ±  4%  cpuidle..time
    534782 ±  3%      -9.8%     482625        meminfo.Shmem
      7292 ± 10%     -16.8%       6068        uptime.idle
    117230            +3.0%     120705        vmstat.system.in
     10.21 ± 10%      -2.5        7.74 ±  4%  mpstat.cpu.all.idle%
      0.10            -0.0        0.08        mpstat.cpu.all.soft%
      0.30 ±  8%      +0.1        0.38 ±  2%  mpstat.cpu.all.usr%
   1562083 ±  5%     -28.9%    1111214 ±  6%  numa-numastat.node0.local_node
   1600171 ±  5%     -27.1%    1165935 ±  5%  numa-numastat.node0.numa_hit
   2469533 ±  5%     -36.7%    1562269 ±  7%  numa-numastat.node1.local_node
   2538689 ±  5%     -36.4%    1615104 ±  7%  numa-numastat.node1.numa_hit
   1599764 ±  5%     -27.2%    1165290 ±  5%  numa-vmstat.node0.numa_hit
   1561676 ±  5%     -28.9%    1110570 ±  6%  numa-vmstat.node0.numa_local
   2537854 ±  5%     -36.4%    1613883 ±  7%  numa-vmstat.node1.numa_hit
   2468697 ±  5%     -36.8%    1561112 ±  7%  numa-vmstat.node1.numa_local
    517.00 ±  6%     +44.8%     748.67 ±  5%  perf-c2c.DRAM.local
      5599 ±  3%     +22.8%       6877 ±  3%  perf-c2c.DRAM.remote
      5356 ±  2%     +17.2%       6277 ±  4%  perf-c2c.HITM.local
      3995 ±  3%     +12.9%       4512 ±  2%  perf-c2c.HITM.remote
    207757 ±  3%     +50.1%     311758 ±  4%  will-it-scale.104.threads
      9.27 ±  4%     -19.6%       7.45 ±  4%  will-it-scale.104.threads_idle
      1997 ±  3%     +50.1%       2997 ±  4%  will-it-scale.per_thread_ops
    207757 ±  3%     +50.1%     311758 ±  4%  will-it-scale.workload
  20771245 ±  7%     +19.8%   24875862 ±  5%  sched_debug.cfs_rq:/.avg_vruntime.avg
   6013540 ±  9%     +29.6%    7795227 ± 15%  sched_debug.cfs_rq:/.avg_vruntime.stddev
  20771245 ±  7%     +19.8%   24875862 ±  5%  sched_debug.cfs_rq:/.min_vruntime.avg
   6013540 ±  9%     +29.6%    7795227 ± 15%  sched_debug.cfs_rq:/.min_vruntime.stddev
      5286 ±  5%     -32.3%       3580 ±  9%  sched_debug.cpu.avg_idle.min
    304791            -4.4%     291399        proc-vmstat.nr_active_anon
   1009858            -1.3%     996889        proc-vmstat.nr_file_pages
     23935            -4.3%      22912        proc-vmstat.nr_mapped
    133626 ±  3%      -9.7%     120653        proc-vmstat.nr_shmem
    108257            -1.7%     106463        proc-vmstat.nr_slab_unreclaimable
    304791            -4.4%     291399        proc-vmstat.nr_zone_active_anon
   4140560           -32.8%    2781620 ±  2%  proc-vmstat.numa_hit
   4033316           -33.7%    2674065 ±  2%  proc-vmstat.numa_local
   7314624 ±  2%     -37.7%    4554492 ±  3%  proc-vmstat.pgalloc_normal
   1102175            -2.4%    1075842        proc-vmstat.pgfault
   7136742 ±  2%     -38.5%    4391328 ±  3%  proc-vmstat.pgfree
      0.49 ±  6%     +23.1%       0.60 ±  6%  perf-stat.i.MPKI
     37.67            +4.2       41.92        perf-stat.i.cache-miss-rate%
  13495545 ±  3%     +26.4%   17064915 ±  6%  perf-stat.i.cache-misses
  36075782 ±  2%     +14.0%   41135363 ±  5%  perf-stat.i.cache-references
      9.29            +2.5%       9.52        perf-stat.i.cpi
 2.621e+11            +2.5%  2.685e+11        perf-stat.i.cpu-cycles
    212.81            -1.4%     209.80        perf-stat.i.cpu-migrations
     19736 ±  4%     -19.1%      15958 ±  7%  perf-stat.i.cycles-between-cache-misses
      0.11 ±  2%      -3.3%       0.11        perf-stat.i.ipc
      0.48 ±  4%     +25.9%       0.60 ±  6%  perf-stat.overall.MPKI
     37.35            +4.0       41.40        perf-stat.overall.cache-miss-rate%
      9.33            +2.0%       9.52        perf-stat.overall.cpi
     19440 ±  3%     -18.7%      15809 ±  7%  perf-stat.overall.cycles-between-cache-misses
      0.11            -2.0%       0.11        perf-stat.overall.ipc
  40994713 ±  3%     -33.4%   27301203 ±  4%  perf-stat.overall.path-length
  13453027 ±  3%     +26.4%   17009626 ±  6%  perf-stat.ps.cache-misses
  36008186 ±  2%     +14.0%   41056969 ±  5%  perf-stat.ps.cache-references
 2.612e+11            +2.5%  2.676e+11        perf-stat.ps.cpu-cycles
    212.16            -1.4%     209.13        perf-stat.ps.cpu-migrations
      0.00 ±143%    +614.3%       0.01 ± 38%  perf-sched.sch_delay.avg.ms.__cond_resched.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof
      0.00 ±223%  +12311.1%       0.19 ±115%  perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      0.00         +2575.0%       0.05 ± 92%  perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas
      0.04 ±175%    +275.8%       0.15 ± 89%  perf-sched.sch_delay.avg.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.02 ±120%    +669.0%       0.15 ± 89%  perf-sched.sch_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
      0.01 ± 32%    +657.1%       0.07 ± 51%  perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.15 ±114%    +559.8%       1.00 ± 19%  perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
      0.00 ± 55%    +229.2%       0.01 ± 22%  perf-sched.sch_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
      0.04 ± 61%    +378.2%       0.19 ± 15%  perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.do_poll.constprop.0.do_sys_poll
      0.01 ± 15%    +160.3%       0.03 ±109%  perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      0.01 ± 30%    +216.1%       0.02 ± 12%  perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      0.03 ±163%    +448.7%       0.18 ± 24%  perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
      0.01 ± 30%     +96.7%       0.02 ± 11%  perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      0.01 ± 86%    +234.6%       0.05 ± 60%  perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.00 ±143%    +700.0%       0.01 ± 33%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.vma_alloc_folio_noprof
      0.00 ±223%  +50788.9%       0.76 ±137%  perf-sched.sch_delay.max.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      1.05 ±141%    +326.0%       4.46 ± 67%  perf-sched.sch_delay.max.ms.__cond_resched.down_write.vma_expand.vma_merge_new_range.do_brk_flags
      0.60 ±186%    +271.1%       2.25 ± 74%  perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
      0.02 ± 97%  +14710.9%       2.72 ± 47%  perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas
      0.17 ±208%    +228.7%       0.54 ± 80%  perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.10 ±150%   +2829.8%       2.93 ± 34%  perf-sched.sch_delay.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.73 ± 99%    +137.5%       4.10 ±  5%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
      0.05 ±162%   +3038.5%       1.62 ± 72%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown]
      0.18 ±174%   +1759.9%       3.30 ± 41%  perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
      2.19 ± 69%     +74.8%       3.82 ±  6%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range.do_poll.constprop.0.do_sys_poll
      1.16 ± 95%    +211.8%       3.61 ±  8%  perf-sched.sch_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.01 ± 25%    +200.0%       0.02 ± 11%  perf-sched.total_sch_delay.average.ms
      5.20 ±  7%     +55.1%       8.06 ±  7%  perf-sched.total_wait_and_delay.average.ms
    338197 ±  7%     -43.5%     190977 ±  7%  perf-sched.total_wait_and_delay.count.ms
      5.19 ±  7%     +54.9%       8.04 ±  7%  perf-sched.total_wait_time.average.ms
      6.72 ±  6%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
     70.88 ±162%    +311.9%     292.00 ± 22%  perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.91 ± 15%     -43.6%       0.51 ±  3%  perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
    279.25 ± 11%     +24.7%     348.09 ±  5%  perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    607.00 ±  6%    -100.0%       0.00        perf-sched.wait_and_delay.count.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
    328796 ±  8%     -45.0%     180683 ±  7%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      3211 ±  6%     -20.9%       2541 ±  7%  perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1001          -100.0%       0.00        perf-sched.wait_and_delay.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      0.00 ±223%  +52555.6%       0.79 ± 31%  perf-sched.wait_time.avg.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      0.00 ±142%  +1.2e+05%       1.79 ± 90%  perf-sched.wait_time.avg.ms.__cond_resched.down_write.vma_prepare.commit_merge.vma_expand
     70.88 ±162%    +312.0%     291.99 ± 22%  perf-sched.wait_time.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.91 ± 16%     -45.1%       0.50 ±  3%  perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
      0.98 ± 11%     +43.4%       1.40 ± 25%  perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
    279.22 ± 11%     +24.7%     348.08 ±  5%  perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.00 ±223%  +1.5e+05%       2.21 ± 63%  perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.wait_for_completion_state.kernel_clone.__x64_sys_vfork
      0.00 ±145%  +2.2e+05%       3.74 ± 71%  perf-sched.wait_time.max.ms.__cond_resched.down_write.vma_prepare.commit_merge.vma_expand
      0.05 ±161%   +3018.3%       1.62 ± 72%  perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown]
      0.59 ±  3%      -0.3        0.27 ±100%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable
      0.57 ±  6%      -0.3        0.26 ±100%  perf-profile.calltrace.cycles-pp.intel_idle_ibrs.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      1.70 ±  4%      -0.2        1.49 ±  3%  perf-profile.calltrace.cycles-pp.common_startup_64
      1.61 ±  4%      -0.2        1.40 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
      1.61 ±  4%      -0.2        1.40 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
      1.62 ±  4%      -0.2        1.42 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      1.68 ±  4%      -0.2        1.47 ±  3%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      1.68 ±  4%      -0.2        1.48 ±  3%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
      1.68 ±  4%      -0.2        1.48 ±  3%  perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
      0.72            -0.1        0.58 ±  2%  perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
      0.81            -0.1        0.70        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
     97.96            +0.1       98.08        perf-profile.calltrace.cycles-pp.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     97.98            +0.1       98.11        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     96.80            +0.1       96.94        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk.do_syscall_64
     98.01            +0.1       98.16        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.brk
     96.91            +0.2       97.07        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
     96.94            +0.2       97.12        perf-profile.calltrace.cycles-pp.down_write_killable.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     95.81            +0.2       96.00        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write_killable.__do_sys_brk
     98.17            +0.2       98.40        perf-profile.calltrace.cycles-pp.brk
      0.00            +0.6        0.59 ±  2%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      0.53 ±  6%      -0.4        0.17 ±  8%  perf-profile.children.cycles-pp.intel_idle_irq
      1.00 ±  4%      -0.3        0.70 ±  3%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      1.70 ±  4%      -0.2        1.49 ±  3%  perf-profile.children.cycles-pp.common_startup_64
      1.70 ±  4%      -0.2        1.49 ±  3%  perf-profile.children.cycles-pp.cpu_startup_entry
      1.63 ±  4%      -0.2        1.42 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter
      1.63 ±  4%      -0.2        1.42 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter_state
      1.64 ±  4%      -0.2        1.43 ±  3%  perf-profile.children.cycles-pp.cpuidle_idle_call
      1.70 ±  4%      -0.2        1.49 ±  3%  perf-profile.children.cycles-pp.do_idle
      1.68 ±  4%      -0.2        1.48 ±  3%  perf-profile.children.cycles-pp.start_secondary
      0.21 ±  2%      -0.2        0.05        perf-profile.children.cycles-pp.mas_store_gfp
      0.72            -0.1        0.58 ±  2%  perf-profile.children.cycles-pp.do_vmi_align_munmap
      0.82            -0.1        0.70        perf-profile.children.cycles-pp.rwsem_spin_on_owner
      0.17 ±  2%      -0.1        0.06 ±  7%  perf-profile.children.cycles-pp.mas_store_prealloc
      0.17 ±  2%      -0.1        0.07 ±  5%  perf-profile.children.cycles-pp.vma_complete
      0.58 ±  6%      -0.1        0.49 ±  9%  perf-profile.children.cycles-pp.intel_idle_ibrs
      0.64 ±  3%      -0.1        0.56 ±  3%  perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
      0.54 ±  3%      -0.1        0.47 ±  4%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      0.54 ±  4%      -0.1        0.47 ±  4%  perf-profile.children.cycles-pp.hrtimer_interrupt
      0.45 ±  3%      -0.1        0.39 ±  4%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.41 ±  4%      -0.1        0.36 ±  5%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.35            -0.0        0.31 ±  3%  perf-profile.children.cycles-pp.vms_gather_munmap_vmas
      0.32            -0.0        0.27 ±  3%  perf-profile.children.cycles-pp.__split_vma
      0.36 ±  2%      -0.0        0.31 ±  5%  perf-profile.children.cycles-pp.update_process_times
      0.14 ±  6%      -0.0        0.12 ±  4%  perf-profile.children.cycles-pp.handle_softirqs
      0.23 ±  2%      -0.0        0.20 ±  4%  perf-profile.children.cycles-pp.sched_tick
      0.13 ±  6%      -0.0        0.10 ±  4%  perf-profile.children.cycles-pp.rcu_core
      0.13 ±  5%      -0.0        0.10 ±  4%  perf-profile.children.cycles-pp.rcu_do_batch
      0.15 ±  3%      -0.0        0.12 ±  3%  perf-profile.children.cycles-pp.kmem_cache_free
      0.06 ±  6%      -0.0        0.04 ± 44%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.06 ± 11%      -0.0        0.05        perf-profile.children.cycles-pp.kthread
      0.06 ± 11%      -0.0        0.05        perf-profile.children.cycles-pp.ret_from_fork
      0.06 ± 11%      -0.0        0.05        perf-profile.children.cycles-pp.ret_from_fork_asm
      0.06 ±  7%      -0.0        0.05        perf-profile.children.cycles-pp.smpboot_thread_fn
      0.06            -0.0        0.05        perf-profile.children.cycles-pp.__slab_free
      0.06 ±  7%      +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.vma_expand
      0.07 ±  7%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.08 ±  6%      +0.0        0.10        perf-profile.children.cycles-pp.vma_merge_new_range
      0.06 ±  9%      +0.0        0.08 ±  4%  perf-profile.children.cycles-pp.anon_vma_clone
      0.08 ±  5%      +0.0        0.11 ±  6%  perf-profile.children.cycles-pp.up_write
      0.06 ±  8%      +0.0        0.09 ±  8%  perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.05 ±  7%      +0.0        0.09 ±  7%  perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
      0.08 ±  5%      +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.vms_clear_ptes
      0.12 ±  4%      +0.0        0.16 ±  2%  perf-profile.children.cycles-pp.do_brk_flags
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.unlink_anon_vmas
      0.00            +0.1        0.06 ±  9%  perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.00            +0.1        0.06 ±  8%  perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
      0.00            +0.1        0.06 ±  8%  perf-profile.children.cycles-pp.vm_area_dup
      0.00            +0.1        0.06        perf-profile.children.cycles-pp.free_pgtables
      0.16 ±  4%      +0.1        0.22 ±  3%  perf-profile.children.cycles-pp.vms_complete_munmap_vmas
      0.00            +0.1        0.07 ±  5%  perf-profile.children.cycles-pp.mas_wr_node_store
      0.00            +0.1        0.11 ±  4%  perf-profile.children.cycles-pp.poll_idle
     97.96            +0.1       98.08        perf-profile.children.cycles-pp.__do_sys_brk
     98.02            +0.1       98.14        perf-profile.children.cycles-pp.do_syscall_64
     96.80            +0.1       96.94        perf-profile.children.cycles-pp.rwsem_optimistic_spin
     98.05            +0.1       98.19        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.45 ±  4%      +0.2        0.60 ±  2%  perf-profile.children.cycles-pp.intel_idle
     96.91            +0.2       97.07        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
     96.94            +0.2       97.12        perf-profile.children.cycles-pp.down_write_killable
     95.84            +0.2       96.02        perf-profile.children.cycles-pp.osq_lock
     98.18            +0.2       98.40        perf-profile.children.cycles-pp.brk
      0.50 ±  6%      -0.3        0.16 ±  9%  perf-profile.self.cycles-pp.intel_idle_irq
      0.81            -0.1        0.70        perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.58 ±  6%      -0.1        0.49 ±  9%  perf-profile.self.cycles-pp.intel_idle_ibrs
      0.06 ±  8%      +0.0        0.08 ±  5%  perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.06 ±  7%      +0.0        0.09 ±  4%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.00            +0.1        0.05 ±  7%  perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.13 ±  2%      +0.1        0.18 ±  2%  perf-profile.self.cycles-pp.rwsem_optimistic_spin
      0.00            +0.1        0.06 ±  6%  perf-profile.self.cycles-pp.up_write
      0.00            +0.1        0.11 ±  4%  perf-profile.self.cycles-pp.poll_idle
      0.45 ±  4%      +0.2        0.60 ±  2%  perf-profile.self.cycles-pp.intel_idle
     95.28            +0.3       95.53        perf-profile.self.cycles-pp.osq_lock





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki





[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux