[linus:master] [mm/mmap] 6852c46c78: stress-ng.spawn.ops_per_sec 2.9% improvement

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




Hello,

kernel test robot noticed a 2.9% improvement of stress-ng.spawn.ops_per_sec on:


commit: 6852c46c783d20a4c0153d14d2990040e5e6e47e ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 100%
	disk: 1HDD
	testtime: 60s
	class: exec_spawn
	test: spawn
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | unixbench: unixbench.score 6.4% improvement                                                        |
| test machine     | 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory           |
| test parameters  | cpufreq_governor=performance                                                                       |
|                  | nr_task=100%                                                                                       |
|                  | runtime=300s                                                                                       |
|                  | test=shell8                                                                                        |
+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | vm-scalability: vm-scalability.throughput 23.5% improvement                                        |
| test machine     | 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory |
| test parameters  | cpufreq_governor=performance                                                                       |
|                  | runtime=300s                                                                                       |
|                  | test=small-allocs                                                                                  |
+------------------+----------------------------------------------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20230921/202309211530.ca9c00a1-oliver.sang@xxxxxxxxx

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  exec_spawn/gcc-12/performance/1HDD/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp8/spawn/stress-ng/60s

commit: 
  73e791d738 ("mm: remove clear_page_idle()")
  6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     10609 ± 11%     -19.9%       8493 ±  9%  numa-meminfo.node0.PageTables
      2666 ± 11%     -22.4%       2069 ±  9%  numa-vmstat.node0.nr_page_table_pages
     73.76            -1.5%      72.67        iostat.cpu.system
     11.69            +3.3%      12.07        iostat.cpu.user
      9706 ±  5%     +35.6%      13159 ± 22%  turbostat.POLL
     86.46            +0.9%      87.20        turbostat.RAMWatt
    301.24 ± 14%     -16.3%     252.23 ±  5%  sched_debug.cfs_rq:/.runnable_avg.stddev
    -60827           +81.7%    -110508        sched_debug.cfs_rq:/.spread0.avg
      1216 ± 13%     +33.8%       1627 ± 12%  sched_debug.cpu.nr_uninterruptible.max
   1282527            +2.9%    1319688        stress-ng.spawn.ops
     21375            +2.9%      21994        stress-ng.spawn.ops_per_sec
    593917           +46.2%     868090        stress-ng.time.major_page_faults
 2.721e+08            +2.9%  2.799e+08        stress-ng.time.minor_page_faults
      2943            -1.5%       2899        stress-ng.time.system_time
    434.40            +4.3%     453.01        stress-ng.time.user_time
    602081 ±  2%      +4.2%     627620        proc-vmstat.nr_anon_pages
    823789 ±  2%      +4.2%     858619        proc-vmstat.nr_inactive_anon
    219961            +3.6%     227832        proc-vmstat.nr_mapped
    346359            +2.1%     353585        proc-vmstat.nr_shmem
    823789 ±  2%      +4.2%     858619        proc-vmstat.nr_zone_inactive_anon
 1.826e+08            +3.0%  1.881e+08        proc-vmstat.numa_hit
 1.825e+08            +3.0%   1.88e+08        proc-vmstat.numa_local
    425017            +2.2%     434312        proc-vmstat.pgactivate
 1.874e+08            +3.0%  1.931e+08        proc-vmstat.pgalloc_normal
 2.767e+08            +3.0%   2.85e+08        proc-vmstat.pgfault
 1.857e+08            +3.0%  1.913e+08        proc-vmstat.pgfree
      5030            +2.6%       5159        proc-vmstat.thp_split_pmd
      0.01 ±  9%     +30.2%       0.01 ± 17%  perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_alloc.alloc_bprm.do_execveat_common
      0.54 ± 22%     -46.9%       0.29 ± 52%  perf-sched.sch_delay.max.ms.__cond_resched.filemap_read.__kernel_read.search_binary_handler.exec_binprm
      0.15 ± 80%     -67.6%       0.05 ± 41%  perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc.__anon_vma_prepare.do_cow_fault.do_fault
      0.63 ± 12%     -32.5%       0.42 ± 12%  perf-sched.sch_delay.max.ms.__cond_resched.remove_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      0.03 ±  2%     +15.9%       0.03 ±  4%  perf-sched.wait_and_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
      4292 ±  5%     -14.2%       3683 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
      3397 ±  4%     +16.8%       3966 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
      5346 ±  4%     +14.1%       6100 ±  2%  perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_exc_page_fault
     27421 ±  2%     +49.2%      40919        perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
     18122 ±  5%     -13.9%      15598 ±  3%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
     78347 ±  5%     -14.6%      66894 ±  3%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.unlink_file_vma
      2.02 ± 27%     -46.2%       1.09 ± 14%  perf-sched.wait_and_delay.max.ms.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
      0.01 ± 23%    +109.1%       0.02 ± 63%  perf-sched.wait_time.avg.ms.__cond_resched.__alloc_pages.pte_alloc_one.__do_fault.do_cow_fault
      0.02 ±  6%     +68.4%       0.03 ± 29%  perf-sched.wait_time.avg.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
      0.02 ± 21%     +49.0%       0.03 ± 26%  perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.getname_flags.part.0
      0.02 ±  6%     +19.5%       0.03 ± 10%  perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_dup.__split_vma.mprotect_fixup
      0.01 ±  3%     +20.3%       0.01 ±  7%  perf-sched.wait_time.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
      0.04 ± 54%    +138.4%       0.10 ± 68%  perf-sched.wait_time.max.ms.__cond_resched.down_read_killable.iterate_dir.__x64_sys_getdents64.do_syscall_64
      0.08 ± 52%    +290.1%       0.33 ± 61%  perf-sched.wait_time.max.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
      0.03 ± 70%    +101.1%       0.06 ± 16%  perf-sched.wait_time.max.ms.__cond_resched.down_write.exit_mmap.__mmput.exit_mm
      0.54 ±  7%     -48.8%       0.27 ± 69%  perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.vm_mmap_pgoff.elf_map.load_elf_interp
      0.54 ±  3%     +18.9%       0.64 ± 16%  perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open
      2.30 ± 18%     -22.0%       1.79 ±  8%  perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
     13.82            +1.5%      14.03        perf-stat.i.MPKI
      1.93            +0.0        1.96        perf-stat.i.branch-miss-rate%
 4.182e+08            +2.3%   4.28e+08        perf-stat.i.branch-misses
 2.549e+08            +1.9%  2.598e+08        perf-stat.i.cache-misses
 1.545e+09            +2.5%  1.584e+09        perf-stat.i.cache-references
      1.86            -2.0%       1.82        perf-stat.i.cpi
    945.76 ±  3%      -5.3%     895.17 ±  2%  perf-stat.i.cycles-between-cache-misses
 1.249e+10            +2.7%  1.283e+10        perf-stat.i.dTLB-stores
      0.55            +3.0%       0.56        perf-stat.i.ipc
      9557           +46.8%      14033        perf-stat.i.major-faults
      2025            +2.4%       2074        perf-stat.i.metric.K/sec
    996.43            +1.3%       1009        perf-stat.i.metric.M/sec
   4413113            +2.7%    4533786        perf-stat.i.minor-faults
  12764008 ±  2%      +3.7%   13236279        perf-stat.i.node-loads
  47197681            +2.6%   48424472        perf-stat.i.node-stores
   4422671            +2.8%    4547820        perf-stat.i.page-faults
     14.16            +1.6%      14.39        perf-stat.overall.MPKI
      1.92            +0.0        1.95        perf-stat.overall.branch-miss-rate%
      1.87            -1.8%       1.84        perf-stat.overall.cpi
    810.88            -2.8%     788.33        perf-stat.overall.cycles-between-cache-misses
      0.53            +1.8%       0.54        perf-stat.overall.ipc
 4.094e+08            +2.3%   4.19e+08        perf-stat.ps.branch-misses
 1.511e+09            +2.5%  1.549e+09        perf-stat.ps.cache-references
     17556 ±  2%      +3.9%      18239        perf-stat.ps.cpu-migrations
 1.226e+10            +2.7%   1.26e+10        perf-stat.ps.dTLB-stores
      9359           +46.0%      13666        perf-stat.ps.major-faults
   4325409            +2.8%    4445394        perf-stat.ps.minor-faults
  12743809 ±  2%      +4.1%   13262041        perf-stat.ps.node-loads
  46090814            +2.7%   47343915        perf-stat.ps.node-stores
   4334769            +2.9%    4459060        perf-stat.ps.page-faults
     17.36            -1.2       16.11        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables
     14.04            -1.1       12.96        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
     69.96            -0.9       69.02        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
     70.01            -0.9       69.08        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
     12.28            -0.9       11.37        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma
      9.72            -0.8        8.90        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.exit_mmap
     10.90 ±  2%      -0.8       10.10        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
      9.96            -0.8        9.16        perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.exit_mmap.__mmput
      7.46 ±  2%      -0.8        6.69        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
      7.33 ±  2%      -0.8        6.56        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
     10.95            -0.8       10.20        perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.exit_mmap.__mmput.exit_mm
     12.30            -0.7       11.60        perf-profile.calltrace.cycles-pp.free_pgtables.exit_mmap.__mmput.exit_mm.do_exit
      8.52 ±  2%      -0.5        8.03        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.do_vmi_align_munmap
      8.72 ±  2%      -0.5        8.24        perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      7.36            -0.4        6.92        perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      7.95            -0.4        7.52        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.unmap_region
      5.72 ±  2%      -0.4        5.30        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
      3.96 ±  2%      -0.4        3.55        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.mprotect_fixup
      8.40            -0.4        7.98        perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap
      8.12            -0.4        7.71        perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap
      4.10 ±  2%      -0.4        3.70        perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
      8.68            -0.4        8.28        perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      5.16            -0.3        4.84        perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
      9.61            -0.3        9.30        perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
      7.33            -0.2        7.14        perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
      2.84            -0.2        2.66        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
      2.18            -0.2        2.02        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
      1.18            -0.1        1.11        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
      0.84 ±  3%      -0.1        0.77 ±  3%  perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.elf_map.load_elf_interp.load_elf_binary.search_binary_handler
      0.79 ±  3%      -0.1        0.73 ±  3%  perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.elf_map.load_elf_interp
      0.82            -0.0        0.78        perf-profile.calltrace.cycles-pp.mas_store_prealloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.56            +0.0        0.58        perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap
      0.60            +0.0        0.63        perf-profile.calltrace.cycles-pp.__sysconf
      0.57 ±  2%      +0.0        0.60        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.wait4
      0.55 ±  2%      +0.0        0.58        perf-profile.calltrace.cycles-pp.kernel_wait4.__do_sys_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
      0.57 ±  2%      +0.0        0.60        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
      0.67            +0.0        0.70        perf-profile.calltrace.cycles-pp.sync_regs.asm_exc_page_fault
      0.55            +0.0        0.58 ±  2%  perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
      0.61            +0.0        0.64        perf-profile.calltrace.cycles-pp.copy_process.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.04            +0.0        1.08        perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      1.08            +0.0        1.12        perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.56            +0.0        0.60 ±  3%  perf-profile.calltrace.cycles-pp.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.64            +0.0        0.68        perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
      0.54            +0.0        0.58        perf-profile.calltrace.cycles-pp.do_set_pte.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
      0.54 ±  2%      +0.0        0.58        perf-profile.calltrace.cycles-pp.finish_fault.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
      0.93            +0.0        0.98        perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
      0.94            +0.0        0.99        perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.63 ±  2%      +0.1        0.68        perf-profile.calltrace.cycles-pp._compound_head.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
      1.06            +0.1        1.12        perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.00            +0.1        1.06        perf-profile.calltrace.cycles-pp._dl_addr
      1.07            +0.1        1.12        perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.58 ±  2%      +0.1        0.63 ±  2%  perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
      0.93            +0.1        0.99        perf-profile.calltrace.cycles-pp.kernfs_fop_readdir.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.03            +0.1        1.09        perf-profile.calltrace.cycles-pp.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
      1.03            +0.1        1.09        perf-profile.calltrace.cycles-pp.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
      0.97            +0.1        1.04        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.getdents64
      0.97            +0.1        1.04        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      0.96            +0.1        1.03        perf-profile.calltrace.cycles-pp.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      0.98            +0.1        1.04        perf-profile.calltrace.cycles-pp.getdents64
      0.97            +0.1        1.03        perf-profile.calltrace.cycles-pp.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      1.37            +0.1        1.44        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__clone
      1.33 ±  2%      +0.1        1.40        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
      0.96            +0.1        1.04        perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      1.50            +0.1        1.58        perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
      1.30            +0.1        1.38        perf-profile.calltrace.cycles-pp.wp_page_copy.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
      1.51            +0.1        1.60        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
      1.51            +0.1        1.60        perf-profile.calltrace.cycles-pp.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
      1.52            +0.1        1.61        perf-profile.calltrace.cycles-pp.execve.__clone
      1.52            +0.1        1.60        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.execve.__clone
      1.60            +0.1        1.70        perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      1.26 ±  2%      +0.1        1.36        perf-profile.calltrace.cycles-pp.page_remove_rmap.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
      1.20            +0.1        1.30        perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      1.52            +0.1        1.62        perf-profile.calltrace.cycles-pp.next_uptodate_page.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
      0.43 ± 44%      +0.1        0.55 ±  3%  perf-profile.calltrace.cycles-pp.mm_init.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64
      1.86 ±  2%      +0.2        2.03        perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput
      3.24            +0.2        3.42        perf-profile.calltrace.cycles-pp.__clone
      2.30            +0.2        2.50        perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput.exit_mm
      2.33            +0.2        2.52        perf-profile.calltrace.cycles-pp.tlb_finish_mmu.exit_mmap.__mmput.exit_mm.do_exit
      2.64            +0.2        2.84        perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault
      2.83            +0.2        3.03        perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      3.13            +0.2        3.34        perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap
      3.34            +0.2        3.55        perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap.__mmput
      3.41            +0.2        3.63        perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.exit_mmap.__mmput.exit_mm
      2.02            +0.2        2.24        perf-profile.calltrace.cycles-pp.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      3.62            +0.2        3.86        perf-profile.calltrace.cycles-pp.unmap_vmas.exit_mmap.__mmput.exit_mm.do_exit
      4.88            +0.4        5.31        perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
      0.00            +0.5        0.51 ±  2%  perf-profile.calltrace.cycles-pp.copy_page.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
      7.28            +0.6        7.86        perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      7.68            +0.6        8.29        perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      8.90            +0.7        9.59        perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      8.97            +0.7        9.67        perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
      9.85            +0.7       10.58        perf-profile.calltrace.cycles-pp.asm_exc_page_fault
     39.11            -3.0       36.10        perf-profile.children.cycles-pp.rwsem_optimistic_spin
     39.79            -3.0       36.79        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
     40.99            -2.9       38.11        perf-profile.children.cycles-pp.down_write
     31.42            -2.5       28.90        perf-profile.children.cycles-pp.osq_lock
     19.60            -1.2       18.41        perf-profile.children.cycles-pp.unlink_file_vma
     21.27            -1.1       20.14        perf-profile.children.cycles-pp.free_pgtables
     40.67            -1.0       39.70        perf-profile.children.cycles-pp.mmap_region
     41.06            -1.0       40.10        perf-profile.children.cycles-pp.do_mmap
     41.28            -0.9       40.34        perf-profile.children.cycles-pp.vm_mmap_pgoff
     79.55            -0.8       78.73        perf-profile.children.cycles-pp.do_syscall_64
     79.62            -0.8       78.82        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     17.54            -0.7       16.80        perf-profile.children.cycles-pp.vma_prepare
      6.88            -0.5        6.38        perf-profile.children.cycles-pp.rwsem_spin_on_owner
     24.95            -0.5       24.48        perf-profile.children.cycles-pp.do_vmi_align_munmap
     25.10            -0.5       24.64        perf-profile.children.cycles-pp.do_vmi_munmap
     21.02            -0.4       20.64        perf-profile.children.cycles-pp.__split_vma
     10.25            -0.3        9.94        perf-profile.children.cycles-pp.unmap_region
      2.79 ±  4%      -0.2        2.54 ±  3%  perf-profile.children.cycles-pp.elf_map
      0.39            -0.1        0.30        perf-profile.children.cycles-pp.mas_wr_walk
      1.50            -0.0        1.46        perf-profile.children.cycles-pp.mas_wr_store_entry
      0.11            +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.security_file_alloc
      0.23 ±  2%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.__percpu_counter_sum
      0.16 ±  2%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.init_file
      0.16 ±  3%      +0.0        0.17        perf-profile.children.cycles-pp.unmap_single_vma
      0.25            +0.0        0.27        perf-profile.children.cycles-pp.__memcpy
      0.24 ±  3%      +0.0        0.26        perf-profile.children.cycles-pp.native_flush_tlb_one_user
      0.21 ±  2%      +0.0        0.23        perf-profile.children.cycles-pp.mas_push_data
      0.16            +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.free_unref_page_list
      0.14 ±  3%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.padzero
      0.38            +0.0        0.40        perf-profile.children.cycles-pp.d_path
      0.30            +0.0        0.31        perf-profile.children.cycles-pp.__vm_area_free
      0.29            +0.0        0.31        perf-profile.children.cycles-pp.mas_find
      0.45            +0.0        0.47        perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.34 ±  2%      +0.0        0.35 ±  2%  perf-profile.children.cycles-pp.find_idlest_group
      0.30            +0.0        0.32        perf-profile.children.cycles-pp.update_sg_wakeup_stats
      0.36 ±  2%      +0.0        0.37        perf-profile.children.cycles-pp.find_idlest_cpu
      0.29 ±  2%      +0.0        0.31 ±  2%  perf-profile.children.cycles-pp.__filemap_get_folio
      0.34            +0.0        0.36        perf-profile.children.cycles-pp.link_path_walk
      0.44 ±  2%      +0.0        0.46        perf-profile.children.cycles-pp.__mem_cgroup_charge
      0.41            +0.0        0.43        perf-profile.children.cycles-pp.unlink_anon_vmas
      0.31 ±  2%      +0.0        0.33        perf-profile.children.cycles-pp.alloc_empty_file
      0.25 ±  2%      +0.0        0.27        perf-profile.children.cycles-pp.wake_up_new_task
      0.41 ±  2%      +0.0        0.43 ±  2%  perf-profile.children.cycles-pp.__mmdrop
      0.35 ±  2%      +0.0        0.37        perf-profile.children.cycles-pp.mas_next_slot
      0.35 ±  2%      +0.0        0.37        perf-profile.children.cycles-pp.flush_tlb_mm_range
      0.33            +0.0        0.35        perf-profile.children.cycles-pp.find_vma
      0.45 ±  2%      +0.0        0.48 ±  2%  perf-profile.children.cycles-pp.__rb_erase_color
      0.28 ±  2%      +0.0        0.30 ±  2%  perf-profile.children.cycles-pp.filemap_get_entry
      0.27            +0.0        0.29 ±  3%  perf-profile.children.cycles-pp.up_read
      0.51            +0.0        0.53        perf-profile.children.cycles-pp.mas_store_gfp
      0.41 ±  2%      +0.0        0.43        perf-profile.children.cycles-pp.free_swap_cache
      0.44            +0.0        0.47        perf-profile.children.cycles-pp.___perf_sw_event
      0.05 ± 13%      +0.0        0.08 ± 14%  perf-profile.children.cycles-pp.get_any_partial
      0.35            +0.0        0.38 ±  2%  perf-profile.children.cycles-pp.mt_find
      0.37 ±  2%      +0.0        0.39 ±  2%  perf-profile.children.cycles-pp.vfs_read
      0.33 ±  2%      +0.0        0.35 ±  2%  perf-profile.children.cycles-pp.vma_interval_tree_augment_rotate
      0.50 ±  2%      +0.0        0.52 ±  2%  perf-profile.children.cycles-pp.copy_page
      0.33 ±  2%      +0.0        0.36 ±  2%  perf-profile.children.cycles-pp.__percpu_counter_init
      0.24 ±  5%      +0.0        0.26 ±  3%  perf-profile.children.cycles-pp.down_read
      0.31            +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.lru_add_fn
      0.37            +0.0        0.40        perf-profile.children.cycles-pp.__open64_nocancel
      0.39 ±  2%      +0.0        0.42 ±  2%  perf-profile.children.cycles-pp.ksys_read
      0.37 ±  2%      +0.0        0.40        perf-profile.children.cycles-pp.__mod_lruvec_page_state
      0.53            +0.0        0.56        perf-profile.children.cycles-pp.__perf_sw_event
      0.43 ±  2%      +0.0        0.46        perf-profile.children.cycles-pp.free_pages_and_swap_cache
      0.66            +0.0        0.68        perf-profile.children.cycles-pp.get_page_from_freelist
      0.62            +0.0        0.65        perf-profile.children.cycles-pp.__sysconf
      0.55 ±  2%      +0.0        0.58        perf-profile.children.cycles-pp.kernel_wait4
      0.59 ±  2%      +0.0        0.62        perf-profile.children.cycles-pp.wait4
      0.47 ±  2%      +0.0        0.50 ±  3%  perf-profile.children.cycles-pp.try_to_wake_up
      0.53            +0.0        0.56        perf-profile.children.cycles-pp.mas_split
      0.30            +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.folio_add_lru_vma
      0.06 ± 11%      +0.0        0.09 ±  6%  perf-profile.children.cycles-pp.io_schedule
      0.51            +0.0        0.54        perf-profile.children.cycles-pp.lock_mm_and_find_vma
      0.47            +0.0        0.50 ±  2%  perf-profile.children.cycles-pp.__x64_sys_munmap
      0.41            +0.0        0.44 ±  3%  perf-profile.children.cycles-pp.pcpu_alloc
      0.60            +0.0        0.64        perf-profile.children.cycles-pp.mas_wr_bnode
      0.06 ± 11%      +0.0        0.10 ±  5%  perf-profile.children.cycles-pp.folio_wait_bit_common
      0.61 ±  2%      +0.0        0.65        perf-profile.children.cycles-pp.copy_process
      0.48            +0.0        0.51        perf-profile.children.cycles-pp.page_add_file_rmap
      0.75            +0.0        0.79        perf-profile.children.cycles-pp.native_irq_return_iret
      0.69            +0.0        0.73        perf-profile.children.cycles-pp.vm_area_alloc
      0.56            +0.0        0.60 ±  2%  perf-profile.children.cycles-pp._raw_spin_lock
      0.54            +0.0        0.58        perf-profile.children.cycles-pp.lock_vma_under_rcu
      0.46 ±  2%      +0.0        0.50 ±  2%  perf-profile.children.cycles-pp.__pte_offset_map_lock
      0.51            +0.0        0.55 ±  2%  perf-profile.children.cycles-pp.mm_init
      0.15 ±  2%      +0.0        0.19 ±  3%  perf-profile.children.cycles-pp.__wake_up_common
      0.61            +0.0        0.65        perf-profile.children.cycles-pp.mas_walk
      0.67            +0.0        0.71        perf-profile.children.cycles-pp.vma_alloc_folio
      0.88            +0.0        0.92        perf-profile.children.cycles-pp.sync_regs
      0.83            +0.0        0.87        perf-profile.children.cycles-pp.perf_event_mmap_output
      0.56            +0.0        0.60 ±  2%  perf-profile.children.cycles-pp.alloc_bprm
      0.83            +0.0        0.87        perf-profile.children.cycles-pp.mas_wr_node_store
      0.58            +0.0        0.62        perf-profile.children.cycles-pp.__rb_insert_augmented
      0.79            +0.0        0.83 ±  2%  perf-profile.children.cycles-pp.memcg_slab_post_alloc_hook
      0.06 ±  8%      +0.0        0.10        perf-profile.children.cycles-pp.folio_wake_bit
      0.92            +0.0        0.97        perf-profile.children.cycles-pp.__alloc_pages
      0.75            +0.0        0.80        perf-profile.children.cycles-pp.mtree_range_walk
      1.04            +0.0        1.09        perf-profile.children.cycles-pp.__slab_free
      0.77            +0.0        0.82        perf-profile.children.cycles-pp.vm_area_dup
      0.55 ±  2%      +0.0        0.60        perf-profile.children.cycles-pp.finish_fault
      1.01            +0.1        1.06        perf-profile.children.cycles-pp._dl_addr
      1.06            +0.1        1.12        perf-profile.children.cycles-pp.perf_iterate_sb
      0.75 ±  2%      +0.1        0.81        perf-profile.children.cycles-pp._compound_head
      1.03            +0.1        1.09        perf-profile.children.cycles-pp.__do_sys_clone
      1.03            +0.1        1.09        perf-profile.children.cycles-pp.kernel_clone
      0.96            +0.1        1.03        perf-profile.children.cycles-pp.iterate_dir
      0.94            +0.1        1.00        perf-profile.children.cycles-pp.kernfs_fop_readdir
      0.98            +0.1        1.04        perf-profile.children.cycles-pp.getdents64
      0.97            +0.1        1.03        perf-profile.children.cycles-pp.__x64_sys_getdents64
      0.36 ±  4%      +0.1        0.42 ±  2%  perf-profile.children.cycles-pp.__unfreeze_partials
      0.01 ±223%      +0.1        0.08 ±  6%  perf-profile.children.cycles-pp.wake_page_function
      0.36            +0.1        0.43 ±  2%  perf-profile.children.cycles-pp.filemap_fault
      1.81            +0.1        1.89        perf-profile.children.cycles-pp.perf_event_mmap_event
      1.42            +0.1        1.49        perf-profile.children.cycles-pp.do_sys_openat2
      0.56            +0.1        0.63        perf-profile.children.cycles-pp.__do_fault
      1.87            +0.1        1.95        perf-profile.children.cycles-pp.perf_event_mmap
      0.76 ±  2%      +0.1        0.84        perf-profile.children.cycles-pp.kmem_cache_free_bulk
      1.42            +0.1        1.50        perf-profile.children.cycles-pp.__x64_sys_openat
      0.32 ±  3%      +0.1        0.40 ±  5%  perf-profile.children.cycles-pp.get_partial_node
      1.40            +0.1        1.48        perf-profile.children.cycles-pp.path_openat
      1.42            +0.1        1.50        perf-profile.children.cycles-pp.do_filp_open
      0.70            +0.1        0.79 ±  2%  perf-profile.children.cycles-pp.lru_add_drain_cpu
      0.71            +0.1        0.79        perf-profile.children.cycles-pp.lru_add_drain
      1.39            +0.1        1.48        perf-profile.children.cycles-pp.wp_page_copy
      1.43            +0.1        1.53        perf-profile.children.cycles-pp.do_set_pte
      3.10            +0.1        3.20        perf-profile.children.cycles-pp.mas_store_prealloc
      2.00            +0.1        2.11        perf-profile.children.cycles-pp.kmem_cache_alloc
      1.35 ±  2%      +0.1        1.46        perf-profile.children.cycles-pp.page_remove_rmap
      0.94            +0.1        1.04        perf-profile.children.cycles-pp.folio_batch_move_lru
      0.76 ±  2%      +0.1        0.87 ±  3%  perf-profile.children.cycles-pp.___slab_alloc
      1.22            +0.1        1.33        perf-profile.children.cycles-pp.mas_destroy
      0.98            +0.1        1.11 ±  2%  perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
      1.11            +0.1        1.24 ±  2%  perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
      0.96 ±  3%      +0.1        1.09        perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
      1.38            +0.1        1.52        perf-profile.children.cycles-pp.mas_preallocate
      1.43            +0.1        1.58        perf-profile.children.cycles-pp.mas_alloc_nodes
      2.02            +0.2        2.19        perf-profile.children.cycles-pp.release_pages
      3.25            +0.2        3.43        perf-profile.children.cycles-pp.__clone
      3.39            +0.2        3.58        perf-profile.children.cycles-pp.vma_complete
      2.40            +0.2        2.60        perf-profile.children.cycles-pp.tlb_batch_pages_flush
      4.42            +0.2        4.62        perf-profile.children.cycles-pp.vma_interval_tree_insert
      2.82            +0.2        3.02        perf-profile.children.cycles-pp.next_uptodate_page
      2.65            +0.2        2.86        perf-profile.children.cycles-pp.tlb_finish_mmu
      3.50            +0.2        3.73        perf-profile.children.cycles-pp.zap_pte_range
      2.10            +0.2        2.33        perf-profile.children.cycles-pp.do_cow_fault
      3.59            +0.2        3.82        perf-profile.children.cycles-pp.zap_pmd_range
      3.73            +0.2        3.97        perf-profile.children.cycles-pp.unmap_page_range
      1.61 ±  2%      +0.3        1.87 ±  2%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      3.99            +0.3        4.26        perf-profile.children.cycles-pp.unmap_vmas
      1.33 ±  2%      +0.3        1.60 ±  2%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      5.00            +0.3        5.33        perf-profile.children.cycles-pp.filemap_map_pages
      5.37            +0.4        5.72        perf-profile.children.cycles-pp.do_read_fault
      7.51            +0.6        8.09        perf-profile.children.cycles-pp.do_fault
     10.64            +0.8       11.41        perf-profile.children.cycles-pp.__handle_mm_fault
     11.19            +0.8       12.00        perf-profile.children.cycles-pp.handle_mm_fault
     12.75            +0.9       13.66        perf-profile.children.cycles-pp.do_user_addr_fault
     12.82            +0.9       13.73        perf-profile.children.cycles-pp.exc_page_fault
     14.29            +1.0       15.27        perf-profile.children.cycles-pp.asm_exc_page_fault
     30.81            -2.5       28.33        perf-profile.self.cycles-pp.osq_lock
      6.72            -0.5        6.23        perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.34 ±  2%      -0.1        0.26        perf-profile.self.cycles-pp.mas_wr_walk
      0.09            +0.0        0.10        perf-profile.self.cycles-pp.access_error
      0.20            +0.0        0.21        perf-profile.self.cycles-pp.__memcpy
      0.11 ±  3%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.__unfreeze_partials
      0.38            +0.0        0.40        perf-profile.self.cycles-pp.___perf_sw_event
      0.37            +0.0        0.39        perf-profile.self.cycles-pp.free_swap_cache
      0.26            +0.0        0.28 ±  4%  perf-profile.self.cycles-pp.up_read
      0.47            +0.0        0.49 ±  2%  perf-profile.self.cycles-pp.kmem_cache_free_bulk
      0.32 ±  2%      +0.0        0.34 ±  2%  perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate
      0.22 ±  4%      +0.0        0.24 ±  3%  perf-profile.self.cycles-pp.down_read
      0.49            +0.0        0.51 ±  2%  perf-profile.self.cycles-pp.copy_page
      0.36            +0.0        0.38        perf-profile.self.cycles-pp.do_set_pte
      0.37            +0.0        0.40        perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk
      0.41            +0.0        0.44 ±  3%  perf-profile.self.cycles-pp.___slab_alloc
      0.06 ±  7%      +0.0        0.09        perf-profile.self.cycles-pp.try_to_wake_up
      0.49            +0.0        0.52        perf-profile.self.cycles-pp.memcg_slab_post_alloc_hook
      0.50            +0.0        0.52 ±  2%  perf-profile.self.cycles-pp._raw_spin_lock
      0.41            +0.0        0.44        perf-profile.self.cycles-pp.page_add_file_rmap
      0.58 ±  2%      +0.0        0.62        perf-profile.self.cycles-pp.kmem_cache_alloc
      0.75            +0.0        0.79        perf-profile.self.cycles-pp.native_irq_return_iret
      0.50            +0.0        0.53        perf-profile.self.cycles-pp.rwsem_down_write_slowpath
      0.55            +0.0        0.59        perf-profile.self.cycles-pp.__rb_insert_augmented
      0.77            +0.0        0.81        perf-profile.self.cycles-pp.filemap_map_pages
      0.87            +0.0        0.91        perf-profile.self.cycles-pp.sync_regs
      0.72            +0.0        0.76        perf-profile.self.cycles-pp.mtree_range_walk
      1.03            +0.0        1.07 ±  2%  perf-profile.self.cycles-pp.__slab_free
      0.89            +0.0        0.94        perf-profile.self.cycles-pp._dl_addr
      1.10            +0.1        1.15        perf-profile.self.cycles-pp.zap_pte_range
      0.69 ±  2%      +0.1        0.74        perf-profile.self.cycles-pp._compound_head
      1.08            +0.1        1.16        perf-profile.self.cycles-pp.release_pages
      1.14 ±  2%      +0.1        1.23        perf-profile.self.cycles-pp.page_remove_rmap
      1.10 ±  2%      +0.1        1.20        perf-profile.self.cycles-pp.down_write
      4.32            +0.2        4.52        perf-profile.self.cycles-pp.vma_interval_tree_insert
      2.72            +0.2        2.92        perf-profile.self.cycles-pp.next_uptodate_page
      1.33 ±  2%      +0.3        1.60 ±  2%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath


***************************************************************************************************
lkp-spr-2sp3: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/300s/lkp-spr-2sp3/shell8/unixbench

commit: 
  73e791d738 ("mm: remove clear_page_idle()")
  6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    128621 ± 44%     -35.5%      83021 ± 18%  sched_debug.cfs_rq:/.spread0.stddev
     68.67            -2.7%      66.83        turbostat.PkgTmp
     30.36            +1.2%      30.74        turbostat.RAMWatt
     23470 ±  2%      +6.4%      24970 ±  2%  unixbench.score
    799587 ±  3%     +11.6%     892414 ±  3%  unixbench.time.involuntary_context_switches
     83795 ±  8%     +39.5%     116914 ±  5%  unixbench.time.major_page_faults
 1.003e+08 ±  2%      +6.5%  1.068e+08 ±  2%  unixbench.time.minor_page_faults
    328.24 ±  2%      +6.1%     348.20        unixbench.time.user_time
    894180 ±  2%      +6.7%     953859 ±  2%  unixbench.workload
  70334849 ±  2%      +6.3%   74748722 ±  2%  proc-vmstat.numa_hit
  70084753 ±  2%      +6.3%   74504737 ±  2%  proc-vmstat.numa_local
     54187 ±  2%      +6.3%      57597 ±  2%  proc-vmstat.pgactivate
  74859673 ±  2%      +6.3%   79582401 ±  2%  proc-vmstat.pgalloc_normal
 1.024e+08 ±  2%      +6.4%   1.09e+08 ±  2%  proc-vmstat.pgfault
  74344826 ±  2%      +6.4%   79067399 ±  2%  proc-vmstat.pgfree
   4212400 ±  2%      +6.2%    4474364 ±  2%  proc-vmstat.pgreuse
      3869 ±  2%      +7.1%       4145 ±  3%  proc-vmstat.thp_fault_alloc
   1612101 ±  2%      +6.3%    1713519 ±  2%  proc-vmstat.unevictable_pgs_culled
      5.69 ±  2%      +5.1%       5.98 ±  2%  perf-stat.i.MPKI
      0.94 ±  2%      +0.0        0.98        perf-stat.i.branch-miss-rate%
 2.934e+08 ±  2%      +5.8%  3.105e+08 ±  2%  perf-stat.i.branch-misses
 2.844e+08 ±  2%      +6.1%  3.016e+08 ±  2%  perf-stat.i.cache-misses
  9.76e+08 ±  2%      +5.8%  1.033e+09 ±  2%  perf-stat.i.cache-references
      2.99            -1.3%       2.95        perf-stat.i.cpi
 5.226e+11            -0.9%  5.179e+11        perf-stat.i.cpu-cycles
  33882826 ±  2%      +7.2%   36317869 ±  2%  perf-stat.i.dTLB-store-misses
 9.478e+09 ±  2%      +5.9%  1.004e+10 ±  2%  perf-stat.i.dTLB-stores
      0.36            +2.1%       0.36        perf-stat.i.ipc
      1312 ±  8%     +39.6%       1831 ±  5%  perf-stat.i.major-faults
      2.33            -0.9%       2.31        perf-stat.i.metric.GHz
    513.19 ±  2%      +6.0%     544.03 ±  2%  perf-stat.i.metric.K/sec
   1545804 ±  2%      +6.5%    1646269 ±  2%  perf-stat.i.minor-faults
  60502067 ±  2%      +5.4%   63798162 ±  2%  perf-stat.i.node-load-misses
  14871403 ±  2%      +6.2%   15796551 ±  2%  perf-stat.i.node-loads
   1547117 ±  2%      +6.5%    1648101 ±  2%  perf-stat.i.page-faults
      5.78 ±  2%      +5.4%       6.10 ±  2%  perf-stat.overall.MPKI
      0.87 ±  2%      +0.0        0.92 ±  2%  perf-stat.overall.branch-miss-rate%
      3.10            -1.3%       3.06        perf-stat.overall.cpi
      1839 ±  3%      -6.6%       1718 ±  2%  perf-stat.overall.cycles-between-cache-misses
  12132105 ±  2%      -5.5%   11462241 ±  2%  perf-stat.overall.path-length
 2.901e+08 ±  2%      +5.8%   3.07e+08 ±  2%  perf-stat.ps.branch-misses
 2.815e+08 ±  2%      +6.1%  2.986e+08 ±  2%  perf-stat.ps.cache-misses
 9.661e+08 ±  2%      +5.8%  1.022e+09 ±  2%  perf-stat.ps.cache-references
 5.174e+11            -0.9%  5.128e+11        perf-stat.ps.cpu-cycles
  33544567 ±  2%      +7.2%   35957170 ±  2%  perf-stat.ps.dTLB-store-misses
 9.376e+09 ±  2%      +5.9%  9.933e+09 ±  2%  perf-stat.ps.dTLB-stores
      1299 ±  8%     +39.6%       1813 ±  5%  perf-stat.ps.major-faults
   1529961 ±  2%      +6.5%    1629550 ±  2%  perf-stat.ps.minor-faults
  59908353 ±  2%      +5.4%   63173333 ±  2%  perf-stat.ps.node-load-misses
  14730891 ±  2%      +6.2%   15646714 ±  2%  perf-stat.ps.node-loads
   1531260 ±  2%      +6.5%    1631363 ±  2%  perf-stat.ps.page-faults



***************************************************************************************************
lkp-cpl-4sp2: 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/runtime/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-8.3/debian-11.1-x86_64-20220510.cgz/300s/lkp-cpl-4sp2/small-allocs/vm-scalability

commit: 
  73e791d738 ("mm: remove clear_page_idle()")
  6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     42167           +23.5%      52085        vm-scalability.median
     55.63            +4.8       60.48 ±  5%  vm-scalability.stddev%
   9445724           +23.5%   11667563        vm-scalability.throughput
    380.73            +5.2%     400.43        vm-scalability.time.elapsed_time
    380.73            +5.2%     400.43        vm-scalability.time.elapsed_time.max
    106067           +51.4%     160541        vm-scalability.time.involuntary_context_switches
 6.307e+08           +23.6%  7.795e+08        vm-scalability.time.minor_page_faults
     78306            +4.6%      81916        vm-scalability.time.system_time
    777.26           +22.8%     954.12        vm-scalability.time.user_time
   2562168 ±  2%     -19.3%    2068789 ±  3%  vm-scalability.time.voluntary_context_switches
 2.838e+09           +23.6%  3.507e+09        vm-scalability.workload
 5.815e+09           +10.4%  6.421e+09 ±  3%  cpuidle..time
      6.66            +0.3        6.99 ±  3%  turbostat.C1%
     17067 ±  2%     -16.7%      14220 ±  2%  vmstat.system.cs
      0.05            -0.0        0.04 ±  3%  mpstat.cpu.all.soft%
      0.96            +0.1        1.11        mpstat.cpu.all.usr%
      2261 ±  5%     +22.8%       2776 ±  6%  perf-c2c.HITM.local
      1329 ±  8%     +20.8%       1605 ±  4%  perf-c2c.HITM.remote
  18499960           +19.0%   22006663        meminfo.Memused
   2990186           +26.8%    3790943        meminfo.PageTables
  10350121           +26.1%   13050399        meminfo.SUnreclaim
  10515083           +25.7%   13215478        meminfo.Slab
  18692470           +19.1%   22260889        meminfo.max_used_kB
    407798 ± 17%     +67.2%     682010 ± 20%  sched_debug.cfs_rq:/.load.max
    869895 ± 16%     -34.9%     566399 ± 18%  sched_debug.cfs_rq:/.spread0.avg
   -743993           +53.2%   -1139615        sched_debug.cfs_rq:/.spread0.min
     17559 ±  2%     -12.8%      15312 ±  2%  sched_debug.cpu.nr_switches.avg
     13015 ±  2%     -10.6%      11629 ±  2%  sched_debug.cpu.nr_switches.min
      4612           -18.2%       3772 ± 10%  sched_debug.cpu.nr_switches.stddev
    223.38 ±  8%     -15.7%     188.21 ± 12%  sched_debug.cpu.nr_uninterruptible.max
   1169323           +13.4%    1326004 ±  4%  numa-numastat.node0.local_node
   1239338           +13.9%    1411769 ±  2%  numa-numastat.node0.numa_hit
   1115777 ±  2%     +16.7%    1302136 ±  3%  numa-numastat.node1.local_node
   1181029 ±  2%     +19.6%    1412041 ±  3%  numa-numastat.node1.numa_hit
     65251 ± 31%     +68.4%     109907 ±  7%  numa-numastat.node1.other_node
   1111909 ±  5%     +15.5%    1284183 ±  2%  numa-numastat.node2.local_node
   1218212 ±  5%     +11.5%    1358029        numa-numastat.node2.numa_hit
   1285007 ±  2%     +25.7%    1615828 ±  2%  numa-numastat.node3.local_node
   1391326 ±  2%     +21.8%    1694543 ±  2%  numa-numastat.node3.numa_hit
    137025 ±  2%      +6.1%     145442        proc-vmstat.nr_active_anon
     95233            -1.5%      93787        proc-vmstat.nr_anon_pages
   4427600            -2.0%    4339914        proc-vmstat.nr_dirty_background_threshold
   8866026            -2.0%    8690440        proc-vmstat.nr_dirty_threshold
  44521360            -2.0%   43643224        proc-vmstat.nr_free_pages
    747466           +26.8%     948001        proc-vmstat.nr_page_table_pages
    168941 ±  2%      +4.9%     177164        proc-vmstat.nr_shmem
   2587999           +26.1%    3264111        proc-vmstat.nr_slab_unreclaimable
    137025 ±  2%      +6.1%     145442        proc-vmstat.nr_zone_active_anon
   5032565           +16.8%    5879446        proc-vmstat.numa_hit
   4684671           +18.1%    5530835        proc-vmstat.numa_local
   7168214           +18.3%    8479220        proc-vmstat.pgalloc_normal
 6.322e+08           +23.5%   7.81e+08        proc-vmstat.pgfault
   6881231           +18.9%    8181469        proc-vmstat.pgfree
   3627776            +4.5%    3791232        proc-vmstat.unevictable_pgs_scanned
   3935473 ±  5%     +34.6%    5297695 ± 16%  numa-meminfo.node0.MemUsed
    748076           +26.7%     947801        numa-meminfo.node0.PageTables
   2597779           +26.1%    3275316        numa-meminfo.node0.SUnreclaim
   2625903           +26.1%    3310684        numa-meminfo.node0.Slab
    747754           +26.9%     948714        numa-meminfo.node1.PageTables
   2580884           +26.5%    3264576        numa-meminfo.node1.SUnreclaim
   2652930           +25.8%    3336480        numa-meminfo.node1.Slab
    141326 ± 10%     -21.1%     111532 ± 19%  numa-meminfo.node2.Inactive(anon)
   3886953 ±  2%     +20.1%    4666698        numa-meminfo.node2.MemUsed
    751265           +26.3%     949188        numa-meminfo.node2.PageTables
   2591514           +25.6%    3255894        numa-meminfo.node2.SUnreclaim
   2622178           +24.9%    3274703        numa-meminfo.node2.Slab
    327366 ± 10%     +39.0%     454988 ± 20%  numa-meminfo.node3.Active
    327339 ± 10%     +39.0%     454931 ± 20%  numa-meminfo.node3.Active(anon)
     17238 ±  7%     +16.9%      20158 ± 11%  numa-meminfo.node3.Mapped
   4217474           +22.9%    5184418 ±  3%  numa-meminfo.node3.MemUsed
    745042           +27.0%     946381        numa-meminfo.node3.PageTables
   2586773           +26.0%    3259792        numa-meminfo.node3.SUnreclaim
    379407 ± 18%     +30.5%     495065 ± 18%  numa-meminfo.node3.Shmem
   2620922           +25.9%    3298794        numa-meminfo.node3.Slab
      0.88 ± 71%     -80.8%       0.17 ± 52%  perf-sched.sch_delay.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      0.13 ± 53%    +311.7%       0.53 ± 55%  perf-sched.sch_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
      0.20 ± 13%     +32.6%       0.26 ± 15%  perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
      0.10 ±  9%    +110.7%       0.21 ±  8%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.06 ± 28%    +138.4%       0.13 ± 29%  perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      0.01        +14036.1%       1.70 ±221%  perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      9.50 ± 34%    +476.1%      54.73 ± 84%  perf-sched.sch_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
      2.31 ± 39%     -78.8%       0.49 ±216%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
    138.30 ± 14%     +48.7%     205.63 ± 23%  perf-sched.sch_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
     13.72 ± 12%    +100.6%      27.53 ± 35%  perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      9.63 ± 18%    +114.0%      20.60 ± 10%  perf-sched.sch_delay.max.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      2.26 ±  6%     +17.9%       2.66 ±  3%  perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    213.13 ±  2%     -11.2%     189.35 ±  2%  perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1074 ±  4%     +65.8%       1781 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
     39318 ±  7%     -17.4%      32472 ±  5%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    171.51 ± 29%     +87.8%     322.16 ± 43%  perf-sched.wait_and_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
      2.06 ±  6%     +16.5%       2.40 ±  3%  perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    213.03 ±  2%     -11.2%     189.14 ±  2%  perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     10.68 ± 19%     +53.1%      16.34 ± 21%  perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
      0.02 ± 29%     -42.3%       0.01 ±  6%  perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
     27.23 ± 10%     +56.4%      42.58 ± 17%  perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
     72.93 ± 46%    +126.7%     165.37 ± 43%  perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    186850           +26.8%     236835        numa-vmstat.node0.nr_page_table_pages
    648927           +26.1%     818461        numa-vmstat.node0.nr_slab_unreclaimable
   1239327           +13.9%    1411687 ±  2%  numa-vmstat.node0.numa_hit
   1169311           +13.4%    1325922 ±  4%  numa-vmstat.node0.numa_local
    186769           +26.9%     237070        numa-vmstat.node1.nr_page_table_pages
    644727           +26.5%     815732        numa-vmstat.node1.nr_slab_unreclaimable
   1180822 ±  2%     +19.6%    1411883 ±  3%  numa-vmstat.node1.numa_hit
   1115570 ±  2%     +16.7%    1301978 ±  3%  numa-vmstat.node1.numa_local
     65251 ± 31%     +68.4%     109907 ±  7%  numa-vmstat.node1.numa_other
     35329 ± 10%     -21.1%      27874 ± 19%  numa-vmstat.node2.nr_inactive_anon
    187645           +26.4%     237184        numa-vmstat.node2.nr_page_table_pages
    647359           +25.7%     813590        numa-vmstat.node2.nr_slab_unreclaimable
     35330 ± 10%     -21.1%      27874 ± 19%  numa-vmstat.node2.nr_zone_inactive_anon
   1218056 ±  5%     +11.5%    1357863        numa-vmstat.node2.numa_hit
   1111753 ±  5%     +15.5%    1284018 ±  2%  numa-vmstat.node2.numa_local
     81831 ± 10%     +38.9%     113699 ± 20%  numa-vmstat.node3.nr_active_anon
      4376 ±  8%     +17.1%       5125 ± 10%  numa-vmstat.node3.nr_mapped
    186088           +27.1%     236505        numa-vmstat.node3.nr_page_table_pages
     94854 ± 18%     +30.5%     123744 ± 18%  numa-vmstat.node3.nr_shmem
    646172           +26.1%     814570        numa-vmstat.node3.nr_slab_unreclaimable
     81831 ± 10%     +38.9%     113699 ± 20%  numa-vmstat.node3.nr_zone_active_anon
   1391229 ±  2%     +21.8%    1694455 ±  2%  numa-vmstat.node3.numa_hit
   1284909 ±  2%     +25.7%    1615740 ±  2%  numa-vmstat.node3.numa_local
 2.182e+10            +5.7%  2.307e+10        perf-stat.i.branch-instructions
     67.44            -1.4       66.01        perf-stat.i.cache-miss-rate%
 1.084e+08            +5.0%  1.138e+08 ±  3%  perf-stat.i.cache-references
     17046 ±  2%     -17.1%      14132 ±  2%  perf-stat.i.context-switches
      8.89            -2.0%       8.71        perf-stat.i.cpi
    488.32            +5.3%     514.19        perf-stat.i.cpu-migrations
      0.03            +0.0        0.04 ±  2%  perf-stat.i.dTLB-load-miss-rate%
   9099499           +14.7%   10433460 ±  2%  perf-stat.i.dTLB-load-misses
 2.527e+10            +4.3%  2.636e+10        perf-stat.i.dTLB-loads
   1732945           +15.5%    2001701        perf-stat.i.dTLB-store-misses
 2.005e+09           +12.5%  2.256e+09        perf-stat.i.dTLB-stores
     93.66            -3.6       90.02        perf-stat.i.iTLB-load-miss-rate%
    431003 ±  5%     +42.8%     615338        perf-stat.i.iTLB-loads
 9.349e+10            +4.1%  9.729e+10        perf-stat.i.instructions
     17597            +4.2%      18333 ±  2%  perf-stat.i.instructions-per-iTLB-miss
      0.12            +4.5%       0.12        perf-stat.i.ipc
    218.73            +5.1%     229.81        perf-stat.i.metric.M/sec
   1645267           +16.2%    1911350        perf-stat.i.minor-faults
     80.15            +1.7       81.87        perf-stat.i.node-load-miss-rate%
  22474764 ±  2%      +3.5%   23258402        perf-stat.i.node-load-misses
   6215674 ±  3%      -6.3%    5824416        perf-stat.i.node-loads
   5207478           +17.4%    6113199        perf-stat.i.node-store-misses
    831077           +17.9%     979621        perf-stat.i.node-stores
   1645267           +16.2%    1911350        perf-stat.i.page-faults
     68.37            -2.1       66.31        perf-stat.overall.cache-miss-rate%
      8.49            -4.5%       8.11        perf-stat.overall.cpi
      0.04            +0.0        0.04 ±  2%  perf-stat.overall.dTLB-load-miss-rate%
      0.09            +0.0        0.09        perf-stat.overall.dTLB-store-miss-rate%
     93.41            -2.5       90.92        perf-stat.overall.iTLB-load-miss-rate%
     15175            +3.4%      15687        perf-stat.overall.instructions-per-iTLB-miss
      0.12            +4.7%       0.12        perf-stat.overall.ipc
     78.20            +1.6       79.79        perf-stat.overall.node-load-miss-rate%
     12543           -11.2%      11140        perf-stat.overall.path-length
 2.183e+10            +6.1%  2.316e+10        perf-stat.ps.branch-instructions
 1.102e+08            +5.6%  1.164e+08 ±  2%  perf-stat.ps.cache-references
     17070 ±  2%     -16.9%      14191 ±  2%  perf-stat.ps.context-switches
    486.54            +5.7%     514.33        perf-stat.ps.cpu-migrations
   9294872           +15.8%   10767066 ±  2%  perf-stat.ps.dTLB-load-misses
 2.525e+10            +4.5%  2.639e+10        perf-stat.ps.dTLB-loads
   1747511           +16.6%    2038370        perf-stat.ps.dTLB-store-misses
 2.024e+09           +13.5%  2.298e+09        perf-stat.ps.dTLB-stores
    434045 ±  5%     +42.8%     619721        perf-stat.ps.iTLB-loads
 9.339e+10            +4.3%  9.739e+10        perf-stat.ps.instructions
   1658111           +17.4%    1946167        perf-stat.ps.minor-faults
  22728635 ±  2%      +4.0%   23636858        perf-stat.ps.node-load-misses
   6338224 ±  3%      -5.5%    5988664        perf-stat.ps.node-loads
   5139882           +17.1%    6017185        perf-stat.ps.node-store-misses
    840111           +18.9%     998845        perf-stat.ps.node-stores
   1658111           +17.4%    1946167        perf-stat.ps.page-faults
  3.56e+13            +9.8%  3.907e+13        perf-stat.total.instructions
     94.07            -1.6       92.46        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
     93.61            -1.6       92.01        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
     94.14            -1.6       92.57        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
     94.18            -1.6       92.62        perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
     94.87            -1.3       93.61        perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
     94.94            -1.2       93.69        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     94.95            -1.2       93.70        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     94.99            -1.2       93.76        perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     96.35            -0.9       95.49        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
     96.35            -0.9       95.49        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     96.39            -0.9       95.53        perf-profile.calltrace.cycles-pp.__mmap
      0.77            +0.2        0.98        perf-profile.calltrace.cycles-pp.do_rw_once
      0.55            +0.3        0.81 ±  2%  perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.do_access
      0.56            +0.3        0.82 ±  2%  perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault.do_access
      0.80            +0.3        1.12        perf-profile.calltrace.cycles-pp.asm_exc_page_fault.do_access
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
      1.33            +0.4        1.70 ±  5%  perf-profile.calltrace.cycles-pp.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      1.16            +0.4        1.53 ±  6%  perf-profile.calltrace.cycles-pp.vma_migratable.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare
      1.07            +0.4        1.48        perf-profile.calltrace.cycles-pp.do_access
     94.07            -1.6       92.46        perf-profile.children.cycles-pp.rwsem_optimistic_spin
     93.63            -1.6       92.03        perf-profile.children.cycles-pp.osq_lock
     94.14            -1.6       92.57        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
     94.18            -1.6       92.62        perf-profile.children.cycles-pp.down_write
     94.88            -1.3       93.61        perf-profile.children.cycles-pp.mmap_region
     94.94            -1.2       93.69        perf-profile.children.cycles-pp.do_mmap
     94.95            -1.2       93.70        perf-profile.children.cycles-pp.vm_mmap_pgoff
     94.99            -1.2       93.76        perf-profile.children.cycles-pp.ksys_mmap_pgoff
     96.38            -0.9       95.52        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     96.38            -0.9       95.52        perf-profile.children.cycles-pp.do_syscall_64
     96.39            -0.9       95.54        perf-profile.children.cycles-pp.__mmap
      0.38            -0.0        0.35        perf-profile.children.cycles-pp.rwsem_spin_on_owner
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.mas_walk
      0.05            +0.0        0.06 ±  6%  perf-profile.children.cycles-pp.vm_unmapped_area
      0.05 ±  8%      +0.0        0.07 ±  7%  perf-profile.children.cycles-pp.__perf_sw_event
      0.06 ±  8%      +0.0        0.07        perf-profile.children.cycles-pp.lock_mm_and_find_vma
      0.06 ±  7%      +0.0        0.08        perf-profile.children.cycles-pp.sync_regs
      0.08            +0.0        0.10 ±  3%  perf-profile.children.cycles-pp.mtree_range_walk
      0.10            +0.0        0.13 ±  3%  perf-profile.children.cycles-pp.native_irq_return_iret
      0.06 ±  7%      +0.0        0.09 ±  7%  perf-profile.children.cycles-pp.__unfreeze_partials
      0.17 ±  2%      +0.0        0.21 ±  8%  perf-profile.children.cycles-pp.update_cfs_group
      0.12 ±  4%      +0.0        0.16 ±  3%  perf-profile.children.cycles-pp.lock_vma_under_rcu
      0.06            +0.0        0.10 ± 10%  perf-profile.children.cycles-pp.kmem_cache_free_bulk
      0.28 ±  2%      +0.0        0.33 ±  8%  perf-profile.children.cycles-pp.task_tick_fair
      0.07 ±  7%      +0.0        0.11 ±  8%  perf-profile.children.cycles-pp.___slab_alloc
      0.13 ±  3%      +0.0        0.18 ±  6%  perf-profile.children.cycles-pp.mas_store_prealloc
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.try_to_wake_up
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.wake_up_q
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.perf_event_mmap
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.perf_event_mmap_event
      0.07 ±  7%      +0.1        0.12 ±  7%  perf-profile.children.cycles-pp.mas_destroy
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.___perf_sw_event
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.mt_find
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.mas_empty_area_rev
      0.00            +0.1        0.05 ±  8%  perf-profile.children.cycles-pp.find_vma
      0.10            +0.1        0.16 ±  6%  perf-profile.children.cycles-pp.mas_preallocate
      0.10            +0.1        0.16 ±  6%  perf-profile.children.cycles-pp.mas_alloc_nodes
      0.09            +0.1        0.14 ±  6%  perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
      0.00            +0.1        0.06 ± 13%  perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
      0.09            +0.1        0.15 ±  7%  perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
      0.10 ±  4%      +0.1        0.16 ±  4%  perf-profile.children.cycles-pp.rwsem_wake
      0.00            +0.1        0.06 ± 11%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      0.00            +0.1        0.07 ± 11%  perf-profile.children.cycles-pp.get_partial_node
      0.15 ±  5%      +0.1        0.22 ±  3%  perf-profile.children.cycles-pp.up_write
      0.17 ±  2%      +0.1        0.24        perf-profile.children.cycles-pp.vma_interval_tree_insert
      0.14 ±  6%      +0.1        0.25 ±  5%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.08            +0.1        0.20 ±  6%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      0.28 ±  2%      +0.2        0.45 ±  3%  perf-profile.children.cycles-pp.__handle_mm_fault
      0.34 ±  2%      +0.2        0.51 ±  2%  perf-profile.children.cycles-pp.handle_mm_fault
      2.24 ±  3%      +0.2        2.47 ±  5%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      0.90            +0.2        1.15        perf-profile.children.cycles-pp.do_rw_once
      0.56            +0.3        0.81        perf-profile.children.cycles-pp.do_user_addr_fault
      0.57            +0.3        0.82        perf-profile.children.cycles-pp.exc_page_fault
      0.83            +0.3        1.16        perf-profile.children.cycles-pp.asm_exc_page_fault
      1.36            +0.4        1.73 ±  5%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      1.39            +0.4        1.78 ±  5%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      1.38            +0.4        1.77 ±  5%  perf-profile.children.cycles-pp.exit_to_user_mode_loop
      1.38            +0.4        1.77 ±  5%  perf-profile.children.cycles-pp.task_work_run
      1.36            +0.4        1.75 ±  5%  perf-profile.children.cycles-pp.task_numa_work
      1.18            +0.4        1.58 ±  5%  perf-profile.children.cycles-pp.vma_migratable
      1.46            +0.5        1.96        perf-profile.children.cycles-pp.do_access
     92.91            -1.6       91.28        perf-profile.self.cycles-pp.osq_lock
      0.38            -0.0        0.35        perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.06 ±  7%      +0.0        0.08        perf-profile.self.cycles-pp.sync_regs
      0.10            +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.native_irq_return_iret
      0.07 ±  6%      +0.0        0.10 ±  3%  perf-profile.self.cycles-pp.mtree_range_walk
      0.06            +0.0        0.09 ±  5%  perf-profile.self.cycles-pp.lock_vma_under_rcu
      0.08 ±  5%      +0.0        0.11 ±  6%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.17 ±  2%      +0.0        0.21 ±  8%  perf-profile.self.cycles-pp.update_cfs_group
      0.02 ±141%      +0.0        0.06 ±  7%  perf-profile.self.cycles-pp.rwsem_optimistic_spin
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.___perf_sw_event
      0.00            +0.1        0.06 ±  6%  perf-profile.self.cycles-pp.down_write
      0.17 ±  2%      +0.1        0.24        perf-profile.self.cycles-pp.vma_interval_tree_insert
      0.08            +0.1        0.20 ±  6%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      0.55            +0.1        0.70        perf-profile.self.cycles-pp.do_access
      0.24 ±  3%      +0.2        0.39 ±  3%  perf-profile.self.cycles-pp.__handle_mm_fault
      0.68            +0.2        0.86        perf-profile.self.cycles-pp.do_rw_once
      1.16            +0.4        1.56 ±  6%  perf-profile.self.cycles-pp.vma_migratable





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki





[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux