Hello, kernel test robot noticed a 3.9% improvement of will-it-scale.per_process_ops on: commit: fd892593d44d8b649caf30a67f0c7696d976d901 ("mm: change do_vmi_align_munmap() tracking of VMAs to remove") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master testcase: will-it-scale test machine: 104 threads 2 sockets (Skylake) with 192G memory parameters: nr_task: 100% mode: process test: brk2 cpufreq_governor: performance Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20231026/202310261659.18c0c7e-oliver.sang@xxxxxxxxx ========================================================================================= compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase: gcc-12/performance/x86_64-rhel-8.3/process/100%/debian-11.1-x86_64-20220510.cgz/lkp-skl-fpga01/brk2/will-it-scale commit: 8c314f3b55 ("maple_tree: add benchmarking for mas_prev()") fd892593d4 ("mm: change do_vmi_align_munmap() tracking of VMAs to remove") 8c314f3b55fbc422 fd892593d44d8b649caf30a67f0 ---------------- --------------------------- %stddev %change %stddev \ | \ 44.35 +1.3% 44.94 turbostat.RAMWatt 7472 +2.2% 7638 vmstat.system.cs 13163009 +3.9% 13672076 will-it-scale.104.processes 126566 +3.9% 131461 will-it-scale.per_process_ops 13163009 +3.9% 13672076 will-it-scale.workload 3.703e+08 +2.3% 3.789e+08 proc-vmstat.numa_hit 3.702e+08 +2.3% 3.788e+08 proc-vmstat.numa_local 7.24e+08 +2.3% 7.407e+08 proc-vmstat.pgalloc_normal 7.23e+08 +2.3% 7.397e+08 proc-vmstat.pgfree 5.72 ± 6% -100.0% 0.00 perf-sched.wait_and_delay.avg.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk 442.80 ± 5% +9.9% 486.50 ± 5% perf-sched.wait_and_delay.count.__cond_resched.kmem_cache_alloc.mas_alloc_nodes.mas_preallocate.do_brk_flags 407.80 ± 6% +12.5% 458.90 ± 4% perf-sched.wait_and_delay.count.__cond_resched.kmem_cache_alloc.vm_area_dup.__split_vma.do_vmi_align_munmap 311.30 ± 11% -20.7% 246.80 ± 5% perf-sched.wait_and_delay.count.__cond_resched.remove_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64 1114 ± 3% -100.0% 0.00 perf-sched.wait_and_delay.count.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk 28.01 ± 36% -100.0% 0.00 perf-sched.wait_and_delay.max.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk 6.22 ± 19% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap 5.72 ± 6% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk 14.25 ± 10% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap 28.01 ± 36% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk 2.49 +3.5% 2.58 perf-stat.i.MPKI 3.407e+08 +3.5% 3.527e+08 perf-stat.i.cache-misses 5.063e+08 +3.7% 5.25e+08 perf-stat.i.cache-references 7446 +2.3% 7616 perf-stat.i.context-switches 248.21 +2.2% 253.65 perf-stat.i.cpu-migrations 817.56 -3.4% 789.61 perf-stat.i.cycles-between-cache-misses 0.04 +0.0 0.04 perf-stat.i.dTLB-load-miss-rate% 14854767 +3.6% 15392456 perf-stat.i.dTLB-load-misses 13577896 +4.9% 14238965 perf-stat.i.iTLB-load-misses 24153428 +3.4% 24977848 perf-stat.i.iTLB-loads 10082 -4.6% 9618 perf-stat.i.instructions-per-iTLB-miss 1532 +3.0% 1578 perf-stat.i.metric.K/sec 43693423 +2.7% 44860627 perf-stat.i.node-loads 86115914 +2.6% 88355886 perf-stat.i.node-stores 2.49 +3.5% 2.58 perf-stat.overall.MPKI 817.24 -3.4% 789.30 perf-stat.overall.cycles-between-cache-misses 0.04 +0.0 0.04 perf-stat.overall.dTLB-load-miss-rate% 10079 -4.6% 9614 perf-stat.overall.instructions-per-iTLB-miss 3134760 -3.7% 3019810 perf-stat.overall.path-length 3.396e+08 +3.5% 3.515e+08 perf-stat.ps.cache-misses 5.046e+08 +3.7% 5.232e+08 perf-stat.ps.cache-references 7420 +2.3% 7589 perf-stat.ps.context-switches 247.36 +2.2% 252.76 perf-stat.ps.cpu-migrations 14810795 +3.6% 15350854 perf-stat.ps.dTLB-load-misses 13531716 +4.9% 14190673 perf-stat.ps.iTLB-load-misses 24072354 +3.4% 24894366 perf-stat.ps.iTLB-loads 43550846 +2.7% 44720688 perf-stat.ps.node-loads 85827526 +2.6% 88063334 perf-stat.ps.node-stores 36.62 -1.0 35.60 perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk 1.94 -0.8 1.18 perf-profile.calltrace.cycles-pp.mas_wr_store_entry.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64 5.74 -0.7 5.01 perf-profile.calltrace.cycles-pp.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 2.89 ± 3% -0.5 2.41 ± 5% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__slab_free.rcu_do_batch.rcu_core.__do_softirq 2.84 ± 3% -0.5 2.36 ± 5% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__slab_free.rcu_do_batch.rcu_core 5.17 ± 2% -0.5 4.70 ± 3% perf-profile.calltrace.cycles-pp.__slab_free.rcu_do_batch.rcu_core.__do_softirq.run_ksoftirqd 4.47 -0.4 4.04 perf-profile.calltrace.cycles-pp.mas_destroy.mas_store_prealloc.do_brk_flags.__do_sys_brk.do_syscall_64 3.42 -0.4 3.06 perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.49 -0.4 1.14 ± 3% perf-profile.calltrace.cycles-pp.__unfreeze_partials.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc.do_brk_flags 2.54 -0.3 2.21 perf-profile.calltrace.cycles-pp.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc.do_brk_flags.__do_sys_brk 1.10 -0.3 0.82 ± 5% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__unfreeze_partials.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc 1.09 -0.3 0.82 ± 4% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__unfreeze_partials.kmem_cache_free_bulk.mas_destroy 4.53 -0.3 4.28 ± 2% perf-profile.calltrace.cycles-pp.__kmem_cache_alloc_bulk.kmem_cache_alloc_bulk.mas_alloc_nodes.mas_preallocate.__split_vma 4.88 -0.3 4.63 perf-profile.calltrace.cycles-pp.kmem_cache_alloc_bulk.mas_alloc_nodes.mas_preallocate.__split_vma.do_vmi_align_munmap 5.53 -0.2 5.31 perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_preallocate.__split_vma.do_vmi_align_munmap.__do_sys_brk 5.64 -0.2 5.42 perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64 0.60 -0.1 0.47 ± 33% perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_wr_bnode.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk 0.85 -0.1 0.72 perf-profile.calltrace.cycles-pp.mas_next_slot.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.10 -0.1 1.00 ± 3% perf-profile.calltrace.cycles-pp.__unfreeze_partials.mas_destroy.mas_store_prealloc.do_brk_flags.__do_sys_brk 0.81 ± 2% -0.1 0.72 ± 5% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__unfreeze_partials.mas_destroy.mas_store_prealloc.do_brk_flags 1.09 -0.1 1.01 perf-profile.calltrace.cycles-pp.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc.vma_complete.__split_vma 0.53 +0.0 0.56 perf-profile.calltrace.cycles-pp.mas_store_b_node.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk 0.58 +0.0 0.61 perf-profile.calltrace.cycles-pp.mab_mas_cp.mas_split_final_node.mas_split.mas_wr_bnode.mas_store_prealloc 0.75 +0.0 0.78 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.61 +0.0 0.65 ± 2% perf-profile.calltrace.cycles-pp.get_unmapped_area.check_brk_limits.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.71 +0.0 0.75 perf-profile.calltrace.cycles-pp.check_brk_limits.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk 0.71 +0.0 0.75 perf-profile.calltrace.cycles-pp.memcg_slab_post_alloc_hook.kmem_cache_alloc.vm_area_dup.__split_vma.do_vmi_align_munmap 1.10 +0.0 1.14 perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.do_brk_flags 1.08 +0.0 1.12 perf-profile.calltrace.cycles-pp.mas_descend_adopt.mas_wmb_replace.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc 0.71 ± 2% +0.1 0.76 ± 2% perf-profile.calltrace.cycles-pp.mas_pop_node.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete 1.00 +0.1 1.05 perf-profile.calltrace.cycles-pp.__call_rcu_common.mas_wmb_replace.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc 0.76 +0.1 0.81 perf-profile.calltrace.cycles-pp.mas_mab_cp.mas_store_b_node.mas_wr_bnode.mas_store_prealloc.vma_complete 0.74 ± 2% +0.1 0.80 perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap 1.48 +0.1 1.55 perf-profile.calltrace.cycles-pp.__entry_text_start.brk 1.02 +0.1 1.08 perf-profile.calltrace.cycles-pp.mas_split_final_node.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete 0.91 +0.1 0.98 ± 2% perf-profile.calltrace.cycles-pp.mas_store_b_node.mas_wr_bnode.mas_store_prealloc.vma_complete.__split_vma 0.62 +0.1 0.69 perf-profile.calltrace.cycles-pp.mas_wr_walk_index.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk 1.55 +0.1 1.62 perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.do_brk_flags.__do_sys_brk 1.18 +0.1 1.25 perf-profile.calltrace.cycles-pp.mas_descend_adopt.mas_wmb_replace.mas_split.mas_wr_bnode.mas_store_prealloc 1.65 +0.1 1.74 perf-profile.calltrace.cycles-pp.mast_fill_bnode.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete 0.72 +0.1 0.82 perf-profile.calltrace.cycles-pp.mas_push_data.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete 2.28 +0.1 2.38 perf-profile.calltrace.cycles-pp.mas_wmb_replace.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags 2.09 +0.1 2.20 perf-profile.calltrace.cycles-pp.mas_wmb_replace.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete 2.49 +0.1 2.63 perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_dup.__split_vma.do_vmi_align_munmap.__do_sys_brk 3.36 +0.1 3.49 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk 14.95 +0.1 15.08 perf-profile.calltrace.cycles-pp.mas_store_prealloc.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 2.89 +0.1 3.03 perf-profile.calltrace.cycles-pp.vm_area_dup.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64 3.85 +0.2 4.01 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.brk 6.96 +0.3 7.29 perf-profile.calltrace.cycles-pp.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk 6.29 +0.5 6.78 ± 2% perf-profile.calltrace.cycles-pp.__unfreeze_partials.rcu_do_batch.rcu_core.__do_softirq.run_ksoftirqd 9.84 +0.5 10.35 perf-profile.calltrace.cycles-pp.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk.do_syscall_64 8.18 +0.5 8.69 perf-profile.calltrace.cycles-pp.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete.__split_vma 4.09 ± 2% +0.6 4.67 ± 4% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.free_pcppages_bulk.free_unref_page.__unfreeze_partials 4.11 ± 2% +0.6 4.69 ± 4% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.free_pcppages_bulk.free_unref_page.__unfreeze_partials.rcu_do_batch 4.22 ± 2% +0.6 4.81 ± 4% perf-profile.calltrace.cycles-pp.free_pcppages_bulk.free_unref_page.__unfreeze_partials.rcu_do_batch.rcu_core 4.25 ± 2% +0.6 4.84 ± 4% perf-profile.calltrace.cycles-pp.free_unref_page.__unfreeze_partials.rcu_do_batch.rcu_core.__do_softirq 9.74 +0.6 10.34 perf-profile.calltrace.cycles-pp.mas_wr_bnode.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap 13.31 +0.6 13.92 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.__do_sys_brk 0.00 +0.6 0.63 perf-profile.calltrace.cycles-pp.kmem_cache_alloc.mas_alloc_nodes.mas_preallocate.do_brk_flags.__do_sys_brk 13.46 +0.6 14.10 perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64 22.96 +0.7 23.64 perf-profile.calltrace.cycles-pp.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe 36.73 -1.0 35.71 perf-profile.children.cycles-pp.do_vmi_align_munmap 27.38 -1.0 26.40 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 27.64 -1.0 26.67 perf-profile.children.cycles-pp._raw_spin_lock_irqsave 2.68 -0.7 1.94 perf-profile.children.cycles-pp.mas_wr_store_entry 5.80 -0.7 5.06 perf-profile.children.cycles-pp.mas_store_gfp 14.46 -0.7 13.77 perf-profile.children.cycles-pp.mas_alloc_nodes 7.52 -0.5 7.06 perf-profile.children.cycles-pp.mas_destroy 3.66 -0.4 3.25 perf-profile.children.cycles-pp.kmem_cache_free_bulk 0.80 -0.4 0.43 ± 2% perf-profile.children.cycles-pp.free_pgtables 3.45 -0.4 3.09 perf-profile.children.cycles-pp.unmap_region 1.85 -0.3 1.57 perf-profile.children.cycles-pp.mas_find 1.48 -0.3 1.22 perf-profile.children.cycles-pp.mas_walk 1.49 -0.2 1.25 perf-profile.children.cycles-pp.mtree_range_walk 1.09 -0.2 0.85 perf-profile.children.cycles-pp.mas_next_slot 2.68 -0.1 2.55 perf-profile.children.cycles-pp.kmem_cache_free 4.33 -0.1 4.20 perf-profile.children.cycles-pp.kmem_cache_alloc 1.90 -0.1 1.79 ± 2% perf-profile.children.cycles-pp.mas_pop_node 0.22 ± 2% -0.1 0.17 ± 3% perf-profile.children.cycles-pp.remove_vma 0.22 ± 2% -0.0 0.17 ± 3% perf-profile.children.cycles-pp.tlb_gather_mmu 0.30 ± 2% -0.0 0.26 ± 3% perf-profile.children.cycles-pp.__pte_offset_map_lock 0.13 ± 3% -0.0 0.11 ± 4% perf-profile.children.cycles-pp.downgrade_write 0.20 ± 3% -0.0 0.18 ± 3% perf-profile.children.cycles-pp.mas_nomem 0.08 ± 3% -0.0 0.06 ± 5% perf-profile.children.cycles-pp.mas_node_count_gfp 0.14 ± 2% +0.0 0.16 ± 4% perf-profile.children.cycles-pp.discard_slab 0.48 +0.0 0.51 perf-profile.children.cycles-pp.rcu_nocb_try_bypass 0.58 +0.0 0.61 perf-profile.children.cycles-pp.rcu_segcblist_enqueue 0.10 ± 6% +0.0 0.12 ± 3% perf-profile.children.cycles-pp.unmap_single_vma 0.74 +0.0 0.78 perf-profile.children.cycles-pp.mas_wr_walk 0.66 +0.0 0.70 perf-profile.children.cycles-pp.vma_prepare 0.86 +0.0 0.89 ± 2% perf-profile.children.cycles-pp.mas_prev_slot 1.11 +0.0 1.15 perf-profile.children.cycles-pp.perf_event_mmap_output 0.11 ± 3% +0.0 0.15 ± 4% perf-profile.children.cycles-pp.anon_vma_clone 0.62 +0.0 0.67 perf-profile.children.cycles-pp.get_unmapped_area 1.22 +0.0 1.26 perf-profile.children.cycles-pp.vm_area_free_rcu_cb 0.71 +0.0 0.76 perf-profile.children.cycles-pp.check_brk_limits 0.94 +0.1 1.00 perf-profile.children.cycles-pp.__cond_resched 0.76 +0.1 0.82 perf-profile.children.cycles-pp.zap_pmd_range 1.02 +0.1 1.09 perf-profile.children.cycles-pp.mas_split_final_node 1.02 +0.1 1.08 perf-profile.children.cycles-pp.memcpy_orig 0.63 +0.1 0.69 perf-profile.children.cycles-pp.mas_wr_walk_index 0.29 ± 2% +0.1 0.36 ± 3% perf-profile.children.cycles-pp.mas_next_sibling 1.56 +0.1 1.63 perf-profile.children.cycles-pp.perf_iterate_sb 1.87 +0.1 1.94 perf-profile.children.cycles-pp.__entry_text_start 1.66 +0.1 1.75 perf-profile.children.cycles-pp.mast_fill_bnode 0.73 +0.1 0.83 perf-profile.children.cycles-pp.mas_push_data 1.63 +0.1 1.74 perf-profile.children.cycles-pp.rcu_cblist_dequeue 2.27 +0.1 2.38 perf-profile.children.cycles-pp.mas_descend_adopt 3.42 +0.1 3.53 perf-profile.children.cycles-pp.mab_mas_cp 1.85 +0.1 1.97 perf-profile.children.cycles-pp.mas_store_b_node 2.51 +0.1 2.63 perf-profile.children.cycles-pp.__call_rcu_common 3.38 +0.1 3.52 perf-profile.children.cycles-pp.syscall_exit_to_user_mode 2.90 +0.1 3.04 perf-profile.children.cycles-pp.vm_area_dup 2.43 +0.2 2.58 perf-profile.children.cycles-pp.mas_mab_cp 3.88 +0.2 4.05 perf-profile.children.cycles-pp.syscall_return_via_sysret 4.40 +0.2 4.62 perf-profile.children.cycles-pp.mas_wmb_replace 13.50 +0.3 13.77 perf-profile.children.cycles-pp.__unfreeze_partials 7.00 +0.3 7.32 perf-profile.children.cycles-pp.mas_spanning_rebalance 28.03 +0.5 28.49 perf-profile.children.cycles-pp.rcu_core 9.86 +0.5 10.37 perf-profile.children.cycles-pp.mas_wr_spanning_store 8.26 +0.5 8.77 perf-profile.children.cycles-pp.mas_split 12.98 +0.6 13.62 perf-profile.children.cycles-pp.mas_wr_bnode 13.58 +0.7 14.23 perf-profile.children.cycles-pp.vma_complete 23.02 +0.7 23.70 perf-profile.children.cycles-pp.__split_vma 28.32 +0.7 29.06 perf-profile.children.cycles-pp.mas_store_prealloc 27.38 -1.0 26.40 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 0.76 -0.2 0.53 perf-profile.self.cycles-pp.mas_next_slot 1.31 -0.2 1.09 perf-profile.self.cycles-pp.mtree_range_walk 1.54 -0.1 1.40 perf-profile.self.cycles-pp.kmem_cache_alloc 2.27 -0.1 2.13 perf-profile.self.cycles-pp.kmem_cache_free 1.70 -0.1 1.58 perf-profile.self.cycles-pp.mas_pop_node 0.48 -0.1 0.40 ± 2% perf-profile.self.cycles-pp.unmap_page_range 0.52 -0.1 0.45 perf-profile.self.cycles-pp.mas_alloc_nodes 0.16 ± 3% -0.1 0.11 ± 3% perf-profile.self.cycles-pp.free_pgtables 0.31 ± 2% -0.0 0.26 ± 2% perf-profile.self.cycles-pp.mas_destroy 0.11 ± 4% -0.0 0.07 ± 7% perf-profile.self.cycles-pp.remove_vma 0.19 ± 2% -0.0 0.15 ± 2% perf-profile.self.cycles-pp.tlb_gather_mmu 0.75 -0.0 0.71 perf-profile.self.cycles-pp.mas_wr_store_entry 0.82 -0.0 0.78 perf-profile.self.cycles-pp.___slab_alloc 0.12 ± 4% -0.0 0.09 ± 3% perf-profile.self.cycles-pp.__pte_offset_map_lock 0.12 ± 3% -0.0 0.10 ± 5% perf-profile.self.cycles-pp.downgrade_write 0.17 ± 2% -0.0 0.15 ± 2% perf-profile.self.cycles-pp.unmap_vmas 0.24 -0.0 0.22 ± 2% perf-profile.self.cycles-pp.unmap_region 0.08 ± 6% -0.0 0.06 ± 7% perf-profile.self.cycles-pp.can_vma_merge_after 0.18 ± 2% +0.0 0.20 ± 3% perf-profile.self.cycles-pp.percpu_counter_add_batch 0.13 ± 3% +0.0 0.15 ± 3% perf-profile.self.cycles-pp.get_unmapped_area 0.08 ± 5% +0.0 0.11 ± 4% perf-profile.self.cycles-pp.unmap_single_vma 0.17 ± 2% +0.0 0.19 ± 3% perf-profile.self.cycles-pp.zap_pmd_range 0.36 ± 2% +0.0 0.39 ± 2% perf-profile.self.cycles-pp.mas_push_data 0.79 +0.0 0.82 perf-profile.self.cycles-pp.do_vmi_align_munmap 0.99 ± 2% +0.0 1.03 perf-profile.self.cycles-pp.perf_event_mmap_output 0.08 ± 5% +0.0 0.12 ± 4% perf-profile.self.cycles-pp.anon_vma_clone 0.61 +0.0 0.65 perf-profile.self.cycles-pp.__cond_resched 0.83 +0.0 0.87 perf-profile.self.cycles-pp.mast_fill_bnode 0.20 ± 4% +0.0 0.24 ± 2% perf-profile.self.cycles-pp.vma_complete 1.22 +0.0 1.26 perf-profile.self.cycles-pp.mas_wr_spanning_store 0.88 +0.0 0.93 perf-profile.self.cycles-pp.memcpy_orig 1.26 +0.0 1.30 perf-profile.self.cycles-pp.__call_rcu_common 0.21 ± 2% +0.0 0.26 ± 2% perf-profile.self.cycles-pp.mas_next_sibling 1.12 +0.1 1.18 perf-profile.self.cycles-pp.kmem_cache_free_bulk 0.78 +0.1 0.83 perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk 0.79 +0.1 0.85 perf-profile.self.cycles-pp.__do_sys_brk 0.56 +0.1 0.61 perf-profile.self.cycles-pp.mas_wr_walk_index 0.51 ± 2% +0.1 0.57 perf-profile.self.cycles-pp.__split_vma 1.65 +0.1 1.72 perf-profile.self.cycles-pp.__entry_text_start 1.32 +0.1 1.39 perf-profile.self.cycles-pp.mas_spanning_rebalance 2.30 +0.1 2.37 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe 2.04 +0.1 2.12 perf-profile.self.cycles-pp.mas_descend_adopt 1.77 +0.1 1.86 perf-profile.self.cycles-pp.mas_mab_cp 1.61 +0.1 1.71 perf-profile.self.cycles-pp.rcu_cblist_dequeue 3.13 +0.1 3.27 perf-profile.self.cycles-pp.syscall_exit_to_user_mode 3.88 +0.2 4.04 perf-profile.self.cycles-pp.syscall_return_via_sysret 4.75 +0.2 4.94 perf-profile.self.cycles-pp.__slab_free Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki