Hello, kernel test robot noticed a 7.2% improvement of stress-ng.dynlib.ops_per_sec on: commit: 93faf426e3cc000c95f1a5d3510b77ce99adac52 ("vfs: shave work on failed file open") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master testcase: stress-ng test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory parameters: nr_threads: 10% disk: 1HDD testtime: 60s fs: ext4 class: os test: dynlib cpufreq_governor: performance Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20240129/202401291500.8546fbc3-oliver.sang@xxxxxxxxx ========================================================================================= class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: os/gcc-12/performance/1HDD/ext4/x86_64-rhel-8.3/10%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp7/dynlib/stress-ng/60s commit: 6036c5f131 ("fs: simplify misleading code to remove ambiguity regarding ihold()/iput()") 93faf426e3 ("vfs: shave work on failed file open") 6036c5f131752689 93faf426e3cc000c95f1a5d3510 ---------------- --------------------------- %stddev %change %stddev \ | \ 8.49 -0.9% 8.41 iostat.cpu.system 0.61 -0.2 0.39 mpstat.cpu.all.soft% 55.50 ± 11% -44.7% 30.67 ± 21% perf-c2c.DRAM.local 165542 ± 2% -9.7% 149471 numa-meminfo.node1.Active 161615 -9.0% 147100 ± 2% numa-meminfo.node1.Active(anon) 40405 -9.0% 36775 ± 2% numa-vmstat.node1.nr_active_anon 40405 -9.0% 36775 ± 2% numa-vmstat.node1.nr_zone_active_anon 51245 ± 35% -42.9% 29277 ± 7% sched_debug.cfs_rq:/.avg_vruntime.max 9910 ± 54% -48.0% 5156 ± 8% sched_debug.cfs_rq:/.avg_vruntime.stddev 51247 ± 35% -42.9% 29277 ± 7% sched_debug.cfs_rq:/.min_vruntime.max 9911 ± 54% -48.0% 5156 ± 8% sched_debug.cfs_rq:/.min_vruntime.stddev 249.51 -6.7% 232.83 stress-ng.dynlib.nanosecs_per_dlsym_lookup 287979 +7.2% 308763 stress-ng.dynlib.ops 4799 +7.2% 5146 stress-ng.dynlib.ops_per_sec 2343 ± 5% -39.9% 1409 ± 5% stress-ng.time.involuntary_context_switches 20176141 +7.4% 21671738 stress-ng.time.minor_page_faults 535.17 +2.6% 548.83 stress-ng.time.percent_of_cpu_this_job_got 291.91 +2.2% 298.23 stress-ng.time.system_time 40770 -7.3% 37774 ± 2% proc-vmstat.nr_active_anon 3098 ± 2% +3.4% 3204 proc-vmstat.nr_inactive_file 34934 -5.1% 33139 proc-vmstat.nr_mapped 70157 -6.1% 65866 proc-vmstat.nr_shmem 42286 -3.3% 40888 proc-vmstat.nr_slab_unreclaimable 40770 -7.3% 37774 ± 2% proc-vmstat.nr_zone_active_anon 3098 ± 2% +3.4% 3204 proc-vmstat.nr_zone_inactive_file 15514950 -3.9% 14905053 proc-vmstat.numa_hit 15448751 -3.9% 14838822 proc-vmstat.numa_local 124008 -5.2% 117617 proc-vmstat.pgactivate 18884464 -11.2% 16772814 proc-vmstat.pgalloc_normal 20593501 +7.2% 22078068 proc-vmstat.pgfault 18735781 -11.2% 16638429 proc-vmstat.pgfree 0.36 ±179% -94.5% 0.02 ± 44% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 165.10 ±215% -97.9% 3.40 ± 37% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 0.11 ± 40% -100.0% 0.00 perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64 0.13 ± 25% -39.4% 0.08 ± 21% perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault 23.83 ± 16% -100.0% 0.00 perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64 545.33 ± 4% -14.0% 468.83 ± 7% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 1.19 ± 41% -100.0% 0.00 perf-sched.wait_and_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64 0.46 ±124% -99.3% 0.00 ±146% perf-sched.wait_time.avg.ms.__cond_resched.dput.open_last_lookups.path_openat.do_filp_open 0.09 ± 63% -92.8% 0.01 ± 77% perf-sched.wait_time.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode 0.11 ± 23% -37.4% 0.07 ± 20% perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault 2.02 ± 6% -29.8% 1.42 ± 44% perf-sched.wait_time.avg.ms.syslog_print.do_syslog.kmsg_read.vfs_read 0.56 ±104% -99.3% 0.00 ±156% perf-sched.wait_time.max.ms.__cond_resched.dput.open_last_lookups.path_openat.do_filp_open 0.44 ± 81% -98.5% 0.01 ± 76% perf-sched.wait_time.max.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode 4.02 ± 6% -12.3% 3.53 ± 8% perf-sched.wait_time.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64 1.18 ± 42% -63.7% 0.43 ± 83% perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64 3.24 ± 22% -44.2% 1.81 ± 19% perf-sched.wait_time.max.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault 4.05 ± 6% -29.8% 2.84 ± 44% perf-sched.wait_time.max.ms.syslog_print.do_syslog.kmsg_read.vfs_read 0.58 -12.8% 0.51 ± 3% perf-stat.i.MPKI 9185336 ± 2% -11.2% 8152378 ± 3% perf-stat.i.cache-misses 51941695 -13.4% 44958941 perf-stat.i.cache-references 1.29 -2.8% 1.26 perf-stat.i.cpi 2.039e+10 -0.9% 2.022e+10 perf-stat.i.cpu-cycles 107.66 ± 3% -8.6% 98.35 ± 5% perf-stat.i.cpu-migrations 2258 +11.1% 2508 ± 3% perf-stat.i.cycles-between-cache-misses 0.05 ± 2% +0.0 0.06 ± 2% perf-stat.i.dTLB-load-miss-rate% 2224314 ± 2% +5.6% 2349507 ± 2% perf-stat.i.dTLB-load-misses 0.08 +0.0 0.08 perf-stat.i.dTLB-store-miss-rate% 1755339 +7.6% 1889357 perf-stat.i.dTLB-store-misses 0.78 +2.8% 0.80 perf-stat.i.ipc 0.32 -0.8% 0.32 perf-stat.i.metric.GHz 965.70 -11.8% 851.51 perf-stat.i.metric.K/sec 326896 +7.3% 350610 perf-stat.i.minor-faults 89.04 +3.5 92.53 perf-stat.i.node-load-miss-rate% 617886 ± 7% -34.0% 407524 ± 12% perf-stat.i.node-loads 49.79 ± 3% +6.2 56.00 perf-stat.i.node-store-miss-rate% 1915596 ± 3% -9.6% 1731427 ± 4% perf-stat.i.node-store-misses 2010814 ± 4% -29.8% 1410941 ± 2% perf-stat.i.node-stores 327425 +7.3% 351195 perf-stat.i.page-faults 0.58 -13.1% 0.51 ± 3% perf-stat.overall.MPKI 1.30 -2.9% 1.26 perf-stat.overall.cpi 2219 +11.8% 2482 ± 3% perf-stat.overall.cycles-between-cache-misses 0.05 ± 2% +0.0 0.06 ± 2% perf-stat.overall.dTLB-load-miss-rate% 0.08 +0.0 0.08 perf-stat.overall.dTLB-store-miss-rate% 0.77 +3.0% 0.79 perf-stat.overall.ipc 87.69 +5.5 93.21 ± 3% perf-stat.overall.node-load-miss-rate% 48.82 ± 3% +6.3 55.14 ± 2% perf-stat.overall.node-store-miss-rate% 9052155 ± 2% -11.3% 8027092 ± 3% perf-stat.ps.cache-misses 51114488 -13.4% 44240106 perf-stat.ps.cache-references 2.008e+10 -0.9% 1.991e+10 perf-stat.ps.cpu-cycles 106.26 ± 3% -8.7% 97.05 ± 5% perf-stat.ps.cpu-migrations 2190693 ± 2% +5.6% 2313598 ± 2% perf-stat.ps.dTLB-load-misses 1728462 +7.6% 1860472 perf-stat.ps.dTLB-store-misses 321900 +7.3% 345241 perf-stat.ps.minor-faults 621192 ± 7% -34.3% 407816 ± 12% perf-stat.ps.node-loads 1885823 ± 3% -9.6% 1703871 ± 4% perf-stat.ps.node-store-misses 1978205 ± 4% -30.0% 1385178 ± 2% perf-stat.ps.node-stores 322420 +7.3% 345817 perf-stat.ps.page-faults 6.48 -4.9 1.63 ± 4% perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.66 -4.8 1.81 ± 4% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.41 -4.8 1.57 ± 4% perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.06 -4.5 1.53 ± 4% perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64 5.03 -3.6 1.41 ± 5% perf-profile.calltrace.cycles-pp.__fput.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode 9.04 ± 2% -2.3 6.78 perf-profile.calltrace.cycles-pp.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat 6.84 ± 2% -1.6 5.24 perf-profile.calltrace.cycles-pp.init_file.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2 72.38 -0.9 71.44 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe 72.16 -0.9 71.21 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.99 ± 3% -0.7 1.31 ± 3% perf-profile.calltrace.cycles-pp.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2 0.65 ± 6% -0.4 0.27 ±100% perf-profile.calltrace.cycles-pp.__do_softirq.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state 0.69 ± 5% -0.3 0.38 ± 70% perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter 1.37 ± 4% -0.1 1.22 ± 6% perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap 1.45 ± 4% -0.1 1.32 ± 6% perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff 1.69 ± 3% -0.1 1.57 ± 6% perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff 0.74 ± 5% -0.1 0.63 ± 9% perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region 0.67 ± 3% +0.1 0.72 perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region 0.94 ± 3% +0.1 0.99 ± 2% perf-profile.calltrace.cycles-pp.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap 0.94 ± 3% +0.1 1.01 perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap 0.75 ± 4% +0.1 0.83 ± 2% perf-profile.calltrace.cycles-pp.rcu_segcblist_enqueue.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap 0.89 ± 3% +0.1 0.97 ± 2% perf-profile.calltrace.cycles-pp.up_write.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap 1.06 ± 4% +0.1 1.14 ± 3% perf-profile.calltrace.cycles-pp.vma_expand.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff 1.10 ± 2% +0.1 1.20 ± 2% perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.unmap_region.do_vmi_align_munmap 1.06 ± 4% +0.1 1.16 ± 2% perf-profile.calltrace.cycles-pp.link_path_walk.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat 1.74 +0.1 1.88 perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap 2.44 +0.2 2.58 ± 2% perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region 2.43 ± 2% +0.2 2.60 perf-profile.calltrace.cycles-pp.flush_tlb_mm_range.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap 2.79 +0.2 2.96 ± 3% perf-profile.calltrace.cycles-pp.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception 2.84 +0.2 3.01 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe._dl_catch_exception 2.78 +0.2 2.95 ± 3% perf-profile.calltrace.cycles-pp.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception 2.17 ± 2% +0.2 2.34 perf-profile.calltrace.cycles-pp.native_flush_tlb_one_user.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.unmap_region 2.82 +0.2 3.00 ± 3% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception 2.91 ± 2% +0.2 3.09 ± 3% perf-profile.calltrace.cycles-pp.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.58 ± 6% +0.2 1.76 ± 6% perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap 2.22 ± 2% +0.2 2.40 perf-profile.calltrace.cycles-pp.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.unmap_region.do_vmi_align_munmap 0.26 ±100% +0.3 0.54 ± 3% perf-profile.calltrace.cycles-pp.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault.asm_exc_page_fault 1.86 ± 4% +0.3 2.14 ± 4% perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region 3.32 ± 2% +0.3 3.60 ± 3% perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region 2.44 ± 3% +0.3 2.73 ± 4% perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap 4.31 +0.3 4.62 perf-profile.calltrace.cycles-pp.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap 5.40 +0.3 5.73 ± 3% perf-profile.calltrace.cycles-pp.next_uptodate_folio.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault 8.33 ± 3% +0.4 8.68 ± 4% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify 8.40 ± 3% +0.4 8.75 ± 4% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify 7.06 +0.4 7.47 ± 3% perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault 7.11 +0.4 7.53 ± 3% perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault 8.33 +0.4 8.78 ± 2% perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault 1.94 ± 12% +0.5 2.40 ± 3% perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64 9.37 +0.5 9.84 ± 2% perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault 15.46 +0.5 15.94 perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap 9.70 +0.5 10.20 perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault 10.75 +0.5 11.30 perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault 10.79 +0.5 11.34 perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault 7.83 +0.6 8.43 perf-profile.calltrace.cycles-pp.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap 11.55 +0.6 12.16 perf-profile.calltrace.cycles-pp.asm_exc_page_fault 20.74 +0.7 21.44 perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64 20.88 +0.7 21.58 perf-profile.calltrace.cycles-pp.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe 20.95 +0.7 21.67 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe 20.95 +0.7 21.67 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.00 +0.7 0.74 ± 5% perf-profile.calltrace.cycles-pp.kmem_cache_free.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat 3.49 ± 2% +0.8 4.31 perf-profile.calltrace.cycles-pp.apparmor_file_alloc_security.security_file_alloc.init_file.alloc_empty_file.path_openat 3.96 +0.8 4.78 perf-profile.calltrace.cycles-pp.security_file_alloc.init_file.alloc_empty_file.path_openat.do_filp_open 11.55 +0.9 12.50 ± 2% perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap.vm_mmap_pgoff 11.78 +1.0 12.74 ± 2% perf-profile.calltrace.cycles-pp.do_vmi_munmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff 23.77 +1.3 25.09 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe 21.46 +1.3 22.79 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64 22.25 +1.3 23.58 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe 23.91 +1.3 25.25 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe 14.68 +1.5 16.19 perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64 14.99 +1.6 16.54 perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe 17.95 +1.7 19.64 perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe 18.05 +1.7 19.74 perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.00 +2.5 2.54 ± 5% perf-profile.calltrace.cycles-pp.apparmor_file_free_security.security_file_free.release_empty_file.path_openat.do_filp_open 0.00 +2.6 2.56 ± 5% perf-profile.calltrace.cycles-pp.security_file_free.release_empty_file.path_openat.do_filp_open.do_sys_openat2 0.00 +3.2 3.21 ± 4% perf-profile.calltrace.cycles-pp.release_empty_file.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat 6.53 -4.9 1.67 ± 4% perf-profile.children.cycles-pp.exit_to_user_mode_prepare 6.44 -4.9 1.58 ± 4% perf-profile.children.cycles-pp.exit_to_user_mode_loop 6.72 -4.8 1.88 ± 3% perf-profile.children.cycles-pp.syscall_exit_to_user_mode 6.10 -4.6 1.54 ± 4% perf-profile.children.cycles-pp.task_work_run 5.05 -3.6 1.42 ± 5% perf-profile.children.cycles-pp.__fput 7.18 ± 2% -2.6 4.60 ± 4% perf-profile.children.cycles-pp.__do_softirq 6.77 ± 2% -2.6 4.20 ± 4% perf-profile.children.cycles-pp.rcu_core 6.72 ± 2% -2.6 4.17 ± 4% perf-profile.children.cycles-pp.rcu_do_batch 7.08 ± 2% -2.5 4.59 ± 3% perf-profile.children.cycles-pp.__irq_exit_rcu 9.07 ± 2% -2.3 6.81 perf-profile.children.cycles-pp.alloc_empty_file 10.42 -2.1 8.34 ± 4% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt 10.17 -2.1 8.09 ± 4% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt 6.86 ± 2% -1.6 5.26 perf-profile.children.cycles-pp.init_file 3.66 ± 4% -0.9 2.80 ± 4% perf-profile.children.cycles-pp.apparmor_file_free_security 3.70 ± 4% -0.9 2.84 ± 5% perf-profile.children.cycles-pp.security_file_free 1.21 ± 3% -0.7 0.50 ± 7% perf-profile.children.cycles-pp.rcu_cblist_dequeue 1.55 ± 2% -0.6 0.94 ± 4% perf-profile.children.cycles-pp.___slab_alloc 4.97 ± 2% -0.6 4.38 ± 2% perf-profile.children.cycles-pp.kmem_cache_alloc 2.62 ± 2% -0.6 2.05 ± 4% perf-profile.children.cycles-pp.__slab_free 2.26 -0.5 1.71 perf-profile.children.cycles-pp.__call_rcu_common 0.58 ± 6% -0.5 0.10 ± 19% perf-profile.children.cycles-pp.file_free_rcu 0.63 ± 2% -0.4 0.18 ± 4% perf-profile.children.cycles-pp.fput 0.38 ± 4% -0.3 0.08 ± 12% perf-profile.children.cycles-pp.task_work_add 0.66 ± 2% -0.3 0.37 ± 7% perf-profile.children.cycles-pp.allocate_slab 0.46 ± 3% -0.2 0.27 ± 8% perf-profile.children.cycles-pp.shuffle_freelist 0.29 ± 5% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.inc_slabs_node 0.18 ± 5% -0.1 0.05 ± 47% perf-profile.children.cycles-pp._raw_spin_lock_irq 0.17 ± 10% -0.1 0.07 ± 48% perf-profile.children.cycles-pp.smpboot_thread_fn 0.15 ± 10% -0.1 0.06 ± 51% perf-profile.children.cycles-pp.run_ksoftirqd 0.20 ± 3% -0.1 0.12 ± 9% perf-profile.children.cycles-pp.rcu_nocb_try_bypass 0.20 ± 7% -0.1 0.13 ± 5% perf-profile.children.cycles-pp.__unfreeze_partials 1.16 ± 2% -0.1 1.10 perf-profile.children.cycles-pp.rcu_segcblist_enqueue 0.29 ± 5% -0.1 0.24 ± 5% perf-profile.children.cycles-pp.refill_obj_stock 0.22 ± 4% -0.0 0.17 ± 11% perf-profile.children.cycles-pp.get_page_from_freelist 0.14 ± 10% -0.0 0.09 ± 9% perf-profile.children.cycles-pp.setup_object 0.18 ± 7% -0.0 0.15 ± 8% perf-profile.children.cycles-pp.__kmem_cache_alloc_node 0.09 ± 14% -0.0 0.06 ± 11% perf-profile.children.cycles-pp.free_unref_page 0.09 ± 7% +0.0 0.11 ± 6% perf-profile.children.cycles-pp.cp_new_stat 0.20 ± 9% +0.0 0.24 ± 9% perf-profile.children.cycles-pp.generic_file_mmap 0.53 ± 4% +0.0 0.57 ± 2% perf-profile.children.cycles-pp.lock_vma_under_rcu 0.27 ± 5% +0.0 0.31 ± 6% perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave 0.16 ± 12% +0.0 0.21 ± 7% perf-profile.children.cycles-pp.put_unused_fd 0.24 ± 6% +0.1 0.29 ± 7% perf-profile.children.cycles-pp.path_init 0.26 ± 7% +0.1 0.31 ± 10% perf-profile.children.cycles-pp.inode_permission 0.01 ±223% +0.1 0.07 ± 11% perf-profile.children.cycles-pp.__irqentry_text_end 0.62 ± 3% +0.1 0.67 ± 3% perf-profile.children.cycles-pp.__rb_insert_augmented 0.11 ± 31% +0.1 0.18 ± 17% perf-profile.children.cycles-pp.tick_sched_do_timer 1.06 ± 4% +0.1 1.15 ± 3% perf-profile.children.cycles-pp.vma_expand 0.12 ± 22% +0.1 0.21 ± 28% perf-profile.children.cycles-pp.ktime_get_update_offsets_now 1.14 ± 2% +0.1 1.24 ± 2% perf-profile.children.cycles-pp.release_pages 1.09 ± 4% +0.1 1.19 ± 2% perf-profile.children.cycles-pp.link_path_walk 2.12 ± 2% +0.1 2.24 ± 2% perf-profile.children.cycles-pp.vma_interval_tree_remove 0.51 ± 5% +0.1 0.64 ± 4% perf-profile.children.cycles-pp.percpu_counter_add_batch 1.75 +0.1 1.90 perf-profile.children.cycles-pp.tlb_batch_pages_flush 3.61 ± 2% +0.2 3.77 ± 2% perf-profile.children.cycles-pp._dl_catch_exception 2.66 ± 2% +0.2 2.83 perf-profile.children.cycles-pp.flush_tlb_mm_range 2.91 ± 2% +0.2 3.09 ± 3% perf-profile.children.cycles-pp.mprotect_fixup 3.36 +0.2 3.54 perf-profile.children.cycles-pp.vma_complete 2.28 +0.2 2.46 perf-profile.children.cycles-pp.kmem_cache_free 2.32 ± 2% +0.2 2.51 perf-profile.children.cycles-pp.native_flush_tlb_one_user 3.38 +0.2 3.58 ± 2% perf-profile.children.cycles-pp.__x64_sys_mprotect 3.38 +0.2 3.58 ± 2% perf-profile.children.cycles-pp.do_mprotect_pkey 2.41 ± 2% +0.2 2.62 perf-profile.children.cycles-pp.flush_tlb_func 3.09 ± 2% +0.2 3.33 ± 2% perf-profile.children.cycles-pp.up_write 3.12 ± 2% +0.3 3.37 ± 3% perf-profile.children.cycles-pp.vma_interval_tree_insert 4.52 +0.3 4.83 perf-profile.children.cycles-pp.tlb_finish_mmu 0.21 ± 62% +0.4 0.57 ± 60% perf-profile.children.cycles-pp.tick_irq_enter 5.76 +0.4 6.13 ± 3% perf-profile.children.cycles-pp.next_uptodate_folio 4.70 ± 2% +0.5 5.15 ± 3% perf-profile.children.cycles-pp.vma_prepare 7.52 +0.5 7.98 ± 2% perf-profile.children.cycles-pp.filemap_map_pages 7.56 +0.5 8.03 ± 2% perf-profile.children.cycles-pp.do_read_fault 8.78 +0.5 9.27 ± 2% perf-profile.children.cycles-pp.do_fault 9.84 +0.5 10.37 perf-profile.children.cycles-pp.__handle_mm_fault 10.18 +0.6 10.74 perf-profile.children.cycles-pp.handle_mm_fault 11.30 +0.6 11.90 perf-profile.children.cycles-pp.exc_page_fault 11.27 +0.6 11.87 perf-profile.children.cycles-pp.do_user_addr_fault 12.10 +0.7 12.76 perf-profile.children.cycles-pp.asm_exc_page_fault 20.95 +0.7 21.67 perf-profile.children.cycles-pp.__vm_munmap 20.95 +0.7 21.67 perf-profile.children.cycles-pp.__x64_sys_munmap 10.11 +0.7 10.85 perf-profile.children.cycles-pp.__split_vma 18.12 +0.8 18.88 perf-profile.children.cycles-pp.unmap_region 3.51 ± 2% +0.8 4.34 perf-profile.children.cycles-pp.apparmor_file_alloc_security 3.98 +0.8 4.81 perf-profile.children.cycles-pp.security_file_alloc 24.22 +1.3 25.56 perf-profile.children.cycles-pp.vm_mmap_pgoff 22.69 +1.3 24.04 perf-profile.children.cycles-pp.do_mmap 21.92 +1.4 23.27 perf-profile.children.cycles-pp.mmap_region 23.92 +1.4 25.27 perf-profile.children.cycles-pp.ksys_mmap_pgoff 14.74 +1.5 16.27 perf-profile.children.cycles-pp.path_openat 15.02 +1.6 16.58 perf-profile.children.cycles-pp.do_filp_open 32.60 +1.7 34.26 perf-profile.children.cycles-pp.do_vmi_align_munmap 32.93 +1.7 34.60 perf-profile.children.cycles-pp.do_vmi_munmap 18.02 +1.7 19.71 perf-profile.children.cycles-pp.do_sys_openat2 18.07 +1.7 19.77 perf-profile.children.cycles-pp.__x64_sys_openat 0.00 +3.2 3.22 ± 4% perf-profile.children.cycles-pp.release_empty_file 2.63 ± 4% -2.2 0.41 ± 6% perf-profile.self.cycles-pp.init_file 3.44 ± 4% -0.8 2.68 ± 4% perf-profile.self.cycles-pp.apparmor_file_free_security 1.20 ± 3% -0.7 0.49 ± 7% perf-profile.self.cycles-pp.rcu_cblist_dequeue 2.54 ± 2% -0.5 2.01 ± 4% perf-profile.self.cycles-pp.__slab_free 0.57 ± 6% -0.5 0.09 ± 13% perf-profile.self.cycles-pp.file_free_rcu 0.78 ± 3% -0.3 0.44 ± 4% perf-profile.self.cycles-pp.__call_rcu_common 0.30 ± 6% -0.2 0.07 ± 14% perf-profile.self.cycles-pp.task_work_add 0.42 ± 6% -0.1 0.28 ± 3% perf-profile.self.cycles-pp.___slab_alloc 0.28 ± 5% -0.1 0.14 ± 4% perf-profile.self.cycles-pp.inc_slabs_node 0.34 ± 6% -0.1 0.20 ± 7% perf-profile.self.cycles-pp.shuffle_freelist 0.23 ± 3% -0.1 0.10 ± 12% perf-profile.self.cycles-pp.fput 0.16 ± 5% -0.1 0.04 ± 71% perf-profile.self.cycles-pp._raw_spin_lock_irq 0.14 ± 4% -0.1 0.08 ± 14% perf-profile.self.cycles-pp.rcu_nocb_try_bypass 1.13 ± 2% -0.1 1.08 perf-profile.self.cycles-pp.rcu_segcblist_enqueue 0.27 ± 4% -0.0 0.22 ± 6% perf-profile.self.cycles-pp.refill_obj_stock 0.10 ± 8% -0.0 0.06 ± 6% perf-profile.self.cycles-pp.__unfreeze_partials 0.11 ± 8% -0.0 0.08 ± 10% perf-profile.self.cycles-pp.rcu_do_batch 0.11 ± 4% +0.0 0.14 ± 10% perf-profile.self.cycles-pp.mas_walk 0.07 ± 16% +0.0 0.10 ± 6% perf-profile.self.cycles-pp.atime_needs_update 0.21 ± 5% +0.0 0.25 ± 6% perf-profile.self.cycles-pp.vm_area_dup 0.89 +0.0 0.93 perf-profile.self.cycles-pp.zap_pte_range 0.40 ± 5% +0.0 0.44 ± 2% perf-profile.self.cycles-pp.__split_vma 0.44 ± 3% +0.0 0.48 ± 4% perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate 0.15 ± 8% +0.0 0.20 ± 7% perf-profile.self.cycles-pp.syscall_enter_from_user_mode 0.17 ± 11% +0.0 0.22 ± 7% perf-profile.self.cycles-pp.path_init 0.52 ± 3% +0.0 0.57 ± 3% perf-profile.self.cycles-pp.__rb_insert_augmented 0.20 ± 8% +0.1 0.26 ± 4% perf-profile.self.cycles-pp.get_obj_cgroup_from_current 0.01 ±223% +0.1 0.06 ± 14% perf-profile.self.cycles-pp.__irqentry_text_end 0.49 ± 4% +0.1 0.55 ± 4% perf-profile.self.cycles-pp.free_swap_cache 0.60 ± 5% +0.1 0.66 ± 3% perf-profile.self.cycles-pp.mmap_region 0.74 ± 3% +0.1 0.81 perf-profile.self.cycles-pp._raw_spin_lock 0.08 ± 53% +0.1 0.15 ± 20% perf-profile.self.cycles-pp.tick_sched_do_timer 0.11 ± 24% +0.1 0.19 ± 29% perf-profile.self.cycles-pp.ktime_get_update_offsets_now 0.35 ± 11% +0.1 0.44 ± 5% perf-profile.self.cycles-pp.rwsem_down_write_slowpath 0.44 ± 7% +0.1 0.58 ± 3% perf-profile.self.cycles-pp.percpu_counter_add_batch 1.33 ± 2% +0.2 1.48 ± 3% perf-profile.self.cycles-pp.down_write 1.92 ± 2% +0.2 2.09 ± 2% perf-profile.self.cycles-pp.vma_interval_tree_remove 2.29 ± 2% +0.2 2.48 perf-profile.self.cycles-pp.native_flush_tlb_one_user 1.34 +0.2 1.58 ± 2% perf-profile.self.cycles-pp.kmem_cache_free 2.90 ± 2% +0.2 3.14 ± 2% perf-profile.self.cycles-pp.up_write 0.00 +0.3 0.26 ± 3% perf-profile.self.cycles-pp.release_empty_file 2.89 +0.3 3.20 ± 3% perf-profile.self.cycles-pp.vma_interval_tree_insert 2.54 ± 3% +0.3 2.85 ± 4% perf-profile.self.cycles-pp.rwsem_spin_on_owner 5.28 +0.4 5.73 ± 3% perf-profile.self.cycles-pp.next_uptodate_folio 3.22 +0.9 4.07 perf-profile.self.cycles-pp.apparmor_file_alloc_security Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki