Hello, kernel test robot noticed a 4.4% improvement of aim7.jobs-per-min on: commit: 7bd5bc3ce9632aefd0eed33a19212a2e55c0f873 ("mm: memcg: normalize the value passed into memcg_rstat_updated()") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master testcase: aim7 test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory parameters: disk: 1BRD_48G fs: ext4 test: disk_rr load: 3000 cpufreq_governor: performance In addition to that, the commit also has significant impact on the following tests: +------------------+--------------------------------------------------------------------------------------------+ | testcase: change | aim7: aim7.jobs-per-min 2.2% improvement | | test machine | 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory | | test parameters | cpufreq_governor=performance | | | disk=1BRD_48G | | | fs=ext4 | | | load=3000 | | | test=disk_cp | +------------------+--------------------------------------------------------------------------------------------+ Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20231121/202311211612.33647109-oliver.sang@xxxxxxxxx ========================================================================================= compiler/cpufreq_governor/disk/fs/kconfig/load/rootfs/tbox_group/test/testcase: gcc-12/performance/1BRD_48G/ext4/x86_64-rhel-8.3/3000/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp2/disk_rr/aim7 commit: ff841a06c8 ("mm: memcg: refactor page state unit helpers") 7bd5bc3ce9 ("mm: memcg: normalize the value passed into memcg_rstat_updated()") ff841a06c844b055 7bd5bc3ce9632aefd0eed33a192 ---------------- --------------------------- %stddev %change %stddev \ | \ 838.50 ± 27% -47.7% 438.67 ± 50% perf-sched.wait_and_delay.count.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity 21.24 ± 22% +36.9% 29.06 ± 20% sched_debug.cfs_rq:/.util_est_enqueued.avg 620370 +4.4% 647378 aim7.jobs-per-min 153681 ± 6% -8.1% 141236 ± 4% aim7.time.involuntary_context_switches 2286 -6.8% 2132 aim7.time.system_time 2.79 -4.6% 2.66 perf-stat.overall.cpi 0.01 ± 19% -0.0 0.01 ± 13% perf-stat.overall.dTLB-load-miss-rate% 0.36 +4.9% 0.38 perf-stat.overall.ipc 21.96 -0.8 21.14 perf-profile.calltrace.cycles-pp.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin 22.01 -0.8 21.19 perf-profile.calltrace.cycles-pp.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write 21.40 -0.8 20.58 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru 21.42 -0.8 20.61 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio 21.41 -0.8 20.60 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio 23.15 -0.8 22.37 perf-profile.calltrace.cycles-pp.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 24.45 -0.7 23.76 perf-profile.calltrace.cycles-pp.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write 24.40 -0.7 23.75 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode 24.38 -0.7 23.72 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release 24.40 -0.7 23.74 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range 27.97 -0.6 27.32 perf-profile.calltrace.cycles-pp.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill 25.20 -0.6 24.56 perf-profile.calltrace.cycles-pp.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict 30.21 -0.5 29.69 perf-profile.calltrace.cycles-pp.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill.dentry_kill 30.32 -0.5 29.81 perf-profile.calltrace.cycles-pp.evict.__dentry_kill.dentry_kill.dput.__fput 30.31 -0.5 29.80 perf-profile.calltrace.cycles-pp.ext4_evict_inode.evict.__dentry_kill.dentry_kill.dput 30.36 -0.5 29.85 perf-profile.calltrace.cycles-pp.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close 30.34 -0.5 29.83 perf-profile.calltrace.cycles-pp.dentry_kill.dput.__fput.__x64_sys_close.do_syscall_64 30.38 -0.5 29.87 perf-profile.calltrace.cycles-pp.__close 30.35 -0.5 29.84 perf-profile.calltrace.cycles-pp.dput.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe 30.37 -0.5 29.87 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__close 30.37 -0.5 29.87 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close 30.36 -0.5 29.86 perf-profile.calltrace.cycles-pp.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close 30.33 -0.5 29.83 perf-profile.calltrace.cycles-pp.__dentry_kill.dentry_kill.dput.__fput.__x64_sys_close 27.52 -0.5 27.03 perf-profile.calltrace.cycles-pp.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 32.56 -0.5 32.09 perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64 33.62 -0.4 33.22 perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe 34.80 -0.3 34.52 perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write 0.76 -0.2 0.54 perf-profile.calltrace.cycles-pp.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write 0.76 -0.2 0.54 perf-profile.calltrace.cycles-pp.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags 0.78 -0.2 0.56 perf-profile.calltrace.cycles-pp.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter 0.80 -0.2 0.59 perf-profile.calltrace.cycles-pp.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.75 -0.2 0.54 perf-profile.calltrace.cycles-pp.cgroup_rstat_flush_locked.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages 1.16 -0.2 0.98 perf-profile.calltrace.cycles-pp.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 0.56 ± 2% +0.0 0.58 perf-profile.calltrace.cycles-pp.workingset_age_nonresident.workingset_activation.folio_mark_accessed.filemap_read.vfs_read 0.74 ± 2% +0.0 0.77 perf-profile.calltrace.cycles-pp.workingset_activation.folio_mark_accessed.filemap_read.vfs_read.ksys_read 1.10 +0.0 1.14 perf-profile.calltrace.cycles-pp.__filemap_add_folio.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write 0.64 +0.0 0.68 perf-profile.calltrace.cycles-pp.copyin.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.54 +0.0 0.58 perf-profile.calltrace.cycles-pp.folio_alloc.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 0.98 +0.0 1.03 perf-profile.calltrace.cycles-pp.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 0.58 +0.0 0.63 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek 0.95 +0.1 1.00 perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write 0.84 +0.1 0.89 perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 0.75 +0.1 0.81 perf-profile.calltrace.cycles-pp.filemap_get_read_batch.filemap_get_pages.filemap_read.vfs_read.ksys_read 0.71 +0.1 0.77 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.llseek 0.90 +0.1 0.97 perf-profile.calltrace.cycles-pp.filemap_get_pages.filemap_read.vfs_read.ksys_read.do_syscall_64 0.91 +0.1 0.98 ± 2% perf-profile.calltrace.cycles-pp.ext4_da_map_blocks.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write 0.93 +0.1 1.00 ± 2% perf-profile.calltrace.cycles-pp.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 2.17 +0.1 2.25 perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 1.63 +0.1 1.72 perf-profile.calltrace.cycles-pp.truncate_cleanup_folio.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill 1.58 +0.1 1.69 perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read 1.42 +0.1 1.53 perf-profile.calltrace.cycles-pp.llseek 1.71 +0.1 1.83 perf-profile.calltrace.cycles-pp._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read.ksys_read 1.81 +0.1 1.93 perf-profile.calltrace.cycles-pp.copy_page_to_iter.filemap_read.vfs_read.ksys_read.do_syscall_64 2.74 +0.2 2.90 perf-profile.calltrace.cycles-pp.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write 29.25 +0.3 29.56 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read 29.38 +0.3 29.71 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read 30.13 +0.4 30.51 perf-profile.calltrace.cycles-pp.read 70.96 -1.7 69.29 perf-profile.children.cycles-pp._raw_spin_lock_irqsave 70.80 -1.7 69.14 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 70.75 -1.6 69.12 perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave 47.66 -1.0 46.66 perf-profile.children.cycles-pp.folio_batch_move_lru 22.05 -0.8 21.23 perf-profile.children.cycles-pp.folio_add_lru 23.15 -0.8 22.38 perf-profile.children.cycles-pp.filemap_add_folio 24.49 -0.7 23.80 perf-profile.children.cycles-pp.__filemap_get_folio 27.97 -0.6 27.32 perf-profile.children.cycles-pp.__folio_batch_release 25.44 -0.6 24.83 perf-profile.children.cycles-pp.release_pages 30.21 -0.5 29.70 perf-profile.children.cycles-pp.truncate_inode_pages_range 30.32 -0.5 29.81 perf-profile.children.cycles-pp.evict 30.34 -0.5 29.83 perf-profile.children.cycles-pp.__dentry_kill 30.31 -0.5 29.80 perf-profile.children.cycles-pp.ext4_evict_inode 30.35 -0.5 29.84 perf-profile.children.cycles-pp.dput 30.36 -0.5 29.85 perf-profile.children.cycles-pp.__fput 30.34 -0.5 29.83 perf-profile.children.cycles-pp.dentry_kill 30.36 -0.5 29.86 perf-profile.children.cycles-pp.__x64_sys_close 30.38 -0.5 29.87 perf-profile.children.cycles-pp.__close 27.56 -0.5 27.07 perf-profile.children.cycles-pp.ext4_da_write_begin 32.68 -0.5 32.20 perf-profile.children.cycles-pp.generic_perform_write 33.67 -0.4 33.28 perf-profile.children.cycles-pp.ext4_buffered_write_iter 34.90 -0.3 34.61 perf-profile.children.cycles-pp.vfs_write 0.95 ± 4% -0.2 0.72 ± 4% perf-profile.children.cycles-pp.cgroup_rstat_updated 0.76 -0.2 0.54 perf-profile.children.cycles-pp.do_flush_stats 97.02 -0.2 96.80 perf-profile.children.cycles-pp.do_syscall_64 0.76 -0.2 0.54 perf-profile.children.cycles-pp.cgroup_rstat_flush 0.78 -0.2 0.56 perf-profile.children.cycles-pp.mem_cgroup_wb_stats 0.75 -0.2 0.54 perf-profile.children.cycles-pp.cgroup_rstat_flush_locked 0.80 -0.2 0.59 perf-profile.children.cycles-pp.balance_dirty_pages 97.31 -0.2 97.12 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe 1.19 -0.2 1.02 perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags 0.56 -0.2 0.40 perf-profile.children.cycles-pp.mem_cgroup_css_rstat_flush 1.00 ± 2% -0.1 0.87 ± 2% perf-profile.children.cycles-pp.__mod_memcg_lruvec_state 1.20 -0.1 1.14 ± 2% perf-profile.children.cycles-pp.__mod_lruvec_page_state 0.29 ± 3% -0.0 0.24 ± 3% perf-profile.children.cycles-pp.__count_memcg_events 0.33 ± 3% -0.0 0.30 perf-profile.children.cycles-pp.charge_memcg 0.40 -0.0 0.37 perf-profile.children.cycles-pp.__mem_cgroup_charge 0.08 ± 9% -0.0 0.05 ± 8% perf-profile.children.cycles-pp.cgroup_base_stat_flush 0.14 +0.0 0.15 perf-profile.children.cycles-pp.filemap_unaccount_folio 0.05 +0.0 0.06 perf-profile.children.cycles-pp.generic_file_read_iter 0.12 +0.0 0.13 perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare 0.08 +0.0 0.09 perf-profile.children.cycles-pp.timestamp_truncate 0.06 +0.0 0.07 perf-profile.children.cycles-pp.try_charge_memcg 0.14 ± 2% +0.0 0.15 perf-profile.children.cycles-pp.up_write 0.11 +0.0 0.12 ± 3% perf-profile.children.cycles-pp.node_dirty_ok 0.10 ± 5% +0.0 0.11 perf-profile.children.cycles-pp.ext4_dirty_inode 0.21 ± 2% +0.0 0.23 ± 3% perf-profile.children.cycles-pp.ext4_generic_write_checks 0.19 +0.0 0.21 ± 2% perf-profile.children.cycles-pp.syscall_enter_from_user_mode 0.16 ± 2% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.xas_start 0.19 ± 2% +0.0 0.21 ± 2% perf-profile.children.cycles-pp.__mod_node_page_state 0.12 ± 4% +0.0 0.14 ± 2% perf-profile.children.cycles-pp.tlb_finish_mmu 0.12 ± 4% +0.0 0.14 ± 2% perf-profile.children.cycles-pp.tlb_batch_pages_flush 0.26 +0.0 0.27 ± 2% perf-profile.children.cycles-pp.__mod_lruvec_state 0.25 +0.0 0.27 perf-profile.children.cycles-pp.inode_needs_update_time 0.23 ± 2% +0.0 0.25 ± 2% perf-profile.children.cycles-pp.__list_del_entry_valid_or_report 0.16 ± 4% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.__mark_inode_dirty 0.24 +0.0 0.26 ± 2% perf-profile.children.cycles-pp.ext4_da_reserve_space 0.56 ± 2% +0.0 0.58 perf-profile.children.cycles-pp.workingset_age_nonresident 0.12 ± 6% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.generic_update_time 0.44 +0.0 0.46 perf-profile.children.cycles-pp.filemap_get_entry 0.15 ± 3% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.__mmput 0.15 ± 3% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.exit_mmap 0.30 +0.0 0.32 ± 2% perf-profile.children.cycles-pp.disk_rr 0.15 ± 3% +0.0 0.17 ± 2% perf-profile.children.cycles-pp.exit_mm 0.38 ± 2% +0.0 0.40 ± 2% perf-profile.children.cycles-pp.get_page_from_freelist 0.38 +0.0 0.41 ± 2% perf-profile.children.cycles-pp.fault_in_readable 0.16 ± 3% +0.0 0.18 ± 3% perf-profile.children.cycles-pp.__x64_sys_exit_group 0.16 ± 3% +0.0 0.18 ± 3% perf-profile.children.cycles-pp.do_group_exit 0.16 ± 3% +0.0 0.18 ± 3% perf-profile.children.cycles-pp.do_exit 0.35 +0.0 0.38 ± 2% perf-profile.children.cycles-pp.atime_needs_update 0.35 +0.0 0.38 ± 2% perf-profile.children.cycles-pp.__cond_resched 0.44 +0.0 0.46 perf-profile.children.cycles-pp.fault_in_iov_iter_readable 0.40 +0.0 0.42 perf-profile.children.cycles-pp.delete_from_page_cache_batch 0.41 ± 2% +0.0 0.44 perf-profile.children.cycles-pp.syscall_exit_to_user_mode 0.28 +0.0 0.32 perf-profile.children.cycles-pp.xas_store 0.37 ± 2% +0.0 0.40 ± 2% perf-profile.children.cycles-pp.ksys_lseek 0.46 +0.0 0.50 perf-profile.children.cycles-pp.__alloc_pages 0.42 +0.0 0.45 ± 2% perf-profile.children.cycles-pp.touch_atime 0.20 ± 3% +0.0 0.24 ± 5% perf-profile.children.cycles-pp.ext4_file_read_iter 0.54 +0.0 0.58 perf-profile.children.cycles-pp.folio_alloc 0.20 ± 2% +0.0 0.23 ± 5% perf-profile.children.cycles-pp.ext4_file_write_iter 1.12 +0.0 1.16 perf-profile.children.cycles-pp.__filemap_add_folio 0.69 +0.0 0.73 perf-profile.children.cycles-pp.copyin 0.42 ± 3% +0.0 0.46 ± 2% perf-profile.children.cycles-pp.__fget_light 0.40 ± 2% +0.0 0.44 perf-profile.children.cycles-pp.apparmor_file_permission 0.46 +0.0 0.50 perf-profile.children.cycles-pp.file_modified 0.62 +0.0 0.66 perf-profile.children.cycles-pp.xas_load 0.32 ± 2% +0.0 0.36 ± 4% perf-profile.children.cycles-pp.percpu_counter_add_batch 0.52 +0.0 0.56 perf-profile.children.cycles-pp.security_file_permission 0.98 +0.0 1.03 perf-profile.children.cycles-pp.memset_orig 0.98 +0.0 1.03 perf-profile.children.cycles-pp.zero_user_segments 0.90 +0.1 0.95 perf-profile.children.cycles-pp.copy_page_from_iter_atomic 0.59 ± 2% +0.1 0.64 ± 2% perf-profile.children.cycles-pp.__fdget_pos 0.80 +0.1 0.85 perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack 0.77 +0.1 0.83 perf-profile.children.cycles-pp.filemap_get_read_batch 0.92 +0.1 0.99 ± 2% perf-profile.children.cycles-pp.ext4_da_map_blocks 0.91 +0.1 0.98 perf-profile.children.cycles-pp.filemap_get_pages 0.94 +0.1 1.01 ± 2% perf-profile.children.cycles-pp.ext4_da_get_block_prep 1.01 +0.1 1.08 perf-profile.children.cycles-pp.__entry_text_start 1.64 +0.1 1.72 perf-profile.children.cycles-pp.truncate_cleanup_folio 1.63 +0.1 1.74 perf-profile.children.cycles-pp.copyout 1.73 +0.1 1.84 perf-profile.children.cycles-pp._copy_to_iter 1.82 +0.1 1.95 perf-profile.children.cycles-pp.copy_page_to_iter 1.68 +0.1 1.81 perf-profile.children.cycles-pp.llseek 2.75 +0.2 2.92 perf-profile.children.cycles-pp.ext4_block_write_begin 30.38 +0.4 30.79 perf-profile.children.cycles-pp.read 70.80 -1.7 69.14 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 0.81 ± 5% -0.2 0.61 ± 5% perf-profile.self.cycles-pp.cgroup_rstat_updated 0.54 -0.2 0.39 perf-profile.self.cycles-pp.mem_cgroup_css_rstat_flush 0.08 ± 4% -0.0 0.05 ± 8% perf-profile.self.cycles-pp.cgroup_rstat_flush_locked 0.08 ± 9% -0.0 0.05 ± 8% perf-profile.self.cycles-pp.cgroup_base_stat_flush 0.19 ± 2% -0.0 0.17 ± 2% perf-profile.self.cycles-pp._raw_spin_lock_irqsave 0.19 ± 2% +0.0 0.20 ± 2% perf-profile.self.cycles-pp.ext4_block_write_begin 0.18 +0.0 0.19 perf-profile.self.cycles-pp.__mod_node_page_state 0.20 +0.0 0.21 perf-profile.self.cycles-pp.block_invalidate_folio 0.23 +0.0 0.24 perf-profile.self.cycles-pp.ext4_da_do_write_end 0.17 +0.0 0.18 ± 2% perf-profile.self.cycles-pp.__fdget_pos 0.26 +0.0 0.27 perf-profile.self.cycles-pp.generic_perform_write 0.21 ± 2% +0.0 0.23 perf-profile.self.cycles-pp.xas_load 0.21 +0.0 0.22 ± 2% perf-profile.self.cycles-pp.__cond_resched 0.15 ± 3% +0.0 0.17 ± 2% perf-profile.self.cycles-pp.xas_store 0.28 +0.0 0.30 perf-profile.self.cycles-pp.__filemap_get_folio 0.17 ± 2% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.folio_mark_accessed 0.37 +0.0 0.39 perf-profile.self.cycles-pp.fault_in_readable 0.24 +0.0 0.26 ± 2% perf-profile.self.cycles-pp.syscall_return_via_sysret 0.29 +0.0 0.31 ± 2% perf-profile.self.cycles-pp.read 0.21 +0.0 0.23 ± 2% perf-profile.self.cycles-pp.do_syscall_64 0.56 ± 2% +0.0 0.58 perf-profile.self.cycles-pp.workingset_age_nonresident 0.27 +0.0 0.29 perf-profile.self.cycles-pp.llseek 0.27 +0.0 0.29 perf-profile.self.cycles-pp.__entry_text_start 0.22 ± 2% +0.0 0.25 ± 3% perf-profile.self.cycles-pp.ext4_da_write_begin 0.31 +0.0 0.33 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe 0.28 ± 2% +0.0 0.31 perf-profile.self.cycles-pp.apparmor_file_permission 0.46 +0.0 0.49 perf-profile.self.cycles-pp.vfs_read 0.27 ± 3% +0.0 0.30 ± 3% perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags 0.40 ± 3% +0.0 0.43 ± 3% perf-profile.self.cycles-pp.__fget_light 0.19 ± 3% +0.0 0.23 ± 4% perf-profile.self.cycles-pp.ext4_file_read_iter 0.19 ± 2% +0.0 0.22 ± 5% perf-profile.self.cycles-pp.ext4_file_write_iter 0.67 +0.0 0.72 perf-profile.self.cycles-pp.__block_commit_write 0.55 +0.0 0.59 perf-profile.self.cycles-pp.vfs_write 0.28 ± 2% +0.0 0.32 ± 4% perf-profile.self.cycles-pp.percpu_counter_add_batch 0.68 +0.0 0.72 perf-profile.self.cycles-pp.copyin 0.50 +0.0 0.54 perf-profile.self.cycles-pp.filemap_get_read_batch 0.98 +0.0 1.02 perf-profile.self.cycles-pp.memset_orig 0.78 +0.0 0.82 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack 0.64 +0.0 0.68 perf-profile.self.cycles-pp.filemap_read 0.29 ± 5% +0.0 0.34 ± 4% perf-profile.self.cycles-pp.__mod_memcg_lruvec_state 0.00 +0.1 0.05 perf-profile.self.cycles-pp.try_charge_memcg 1.61 +0.1 1.72 perf-profile.self.cycles-pp.copyout *************************************************************************************************** lkp-icl-2sp2: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory ========================================================================================= compiler/cpufreq_governor/disk/fs/kconfig/load/rootfs/tbox_group/test/testcase: gcc-12/performance/1BRD_48G/ext4/x86_64-rhel-8.3/3000/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp2/disk_cp/aim7 commit: ff841a06c8 ("mm: memcg: refactor page state unit helpers") 7bd5bc3ce9 ("mm: memcg: normalize the value passed into memcg_rstat_updated()") ff841a06c844b055 7bd5bc3ce9632aefd0eed33a192 ---------------- --------------------------- %stddev %change %stddev \ | \ 1.19 ± 5% +9.1% 1.30 ± 3% iostat.cpu.user 1.25 ± 5% +0.1 1.38 ± 4% mpstat.cpu.all.usr% 11.57 ± 62% -65.1% 4.04 ± 65% perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm 39.78 ± 4% +12.9 52.64 ± 9% turbostat.PKG_% 895233 +2.2% 915336 aim7.jobs-per-min 48750 ± 7% -21.3% 38348 ± 9% aim7.time.involuntary_context_switches 1176 -5.7% 1109 aim7.time.system_time 8891 ± 32% -78.2% 1936 ± 16% sched_debug.cfs_rq:/.load.avg 886484 ± 42% -96.6% 30376 ± 20% sched_debug.cfs_rq:/.load.max 78636 ± 40% -92.5% 5891 ± 12% sched_debug.cfs_rq:/.load.stddev 2.18 ± 2% -4.5% 2.09 perf-stat.i.cpi 0.54 ± 4% +8.8% 0.59 perf-stat.i.ipc 1.90 -4.3% 1.82 ± 2% perf-stat.overall.MPKI 1.75 -5.1% 1.66 perf-stat.overall.cpi 0.57 +5.4% 0.60 perf-stat.overall.ipc 62989 ± 18% -38.7% 38627 ± 33% numa-vmstat.node0.nr_anon_pages 63952 ± 18% -38.0% 39673 ± 32% numa-vmstat.node0.nr_inactive_anon 63952 ± 18% -38.0% 39673 ± 32% numa-vmstat.node0.nr_zone_inactive_anon 64470 ± 18% +39.2% 89730 ± 14% numa-vmstat.node1.nr_anon_pages 66787 ± 17% +36.8% 91333 ± 14% numa-vmstat.node1.nr_inactive_anon 11264 ± 5% +216.1% 35608 ± 48% numa-vmstat.node1.nr_kernel_stack 834.42 ± 17% +2010.2% 17608 ± 67% numa-vmstat.node1.nr_page_table_pages 66787 ± 17% +36.8% 91333 ± 14% numa-vmstat.node1.nr_zone_inactive_anon 251945 ± 18% -38.7% 154531 ± 33% numa-meminfo.node0.AnonPages 305057 ± 15% -28.9% 216862 ± 22% numa-meminfo.node0.AnonPages.max 14698 ± 9% -13.9% 12660 ± 7% numa-meminfo.node0.Dirty 271685 ± 17% -36.2% 173249 ± 29% numa-meminfo.node0.Inactive 255806 ± 18% -38.0% 158709 ± 32% numa-meminfo.node0.Inactive(anon) 291694 ± 10% -25.8% 216523 ± 19% numa-meminfo.node0.Slab 257857 ± 18% +39.2% 358935 ± 14% numa-meminfo.node1.AnonPages 282059 ± 16% +34.8% 380109 ± 13% numa-meminfo.node1.Inactive 267122 ± 17% +36.8% 365352 ± 14% numa-meminfo.node1.Inactive(anon) 11264 ± 5% +216.2% 35619 ± 48% numa-meminfo.node1.KernelStack 3337 ± 17% +2011.4% 70463 ± 67% numa-meminfo.node1.PageTables 24.42 -2.3 22.14 ± 2% perf-profile.calltrace.cycles-pp.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill 21.77 -2.2 19.59 ± 2% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release 21.80 -2.2 19.62 ± 2% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range 21.81 -2.2 19.64 ± 2% perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode 23.12 -2.2 20.96 perf-profile.calltrace.cycles-pp.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict 28.82 -2.0 26.78 perf-profile.calltrace.cycles-pp.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill.dentry_kill 29.06 -2.0 27.04 perf-profile.calltrace.cycles-pp.evict.__dentry_kill.dentry_kill.dput.__fput 29.03 -2.0 27.01 perf-profile.calltrace.cycles-pp.ext4_evict_inode.evict.__dentry_kill.dentry_kill.dput 29.08 -2.0 27.06 perf-profile.calltrace.cycles-pp.__dentry_kill.dentry_kill.dput.__fput.__x64_sys_close 29.08 -2.0 27.06 perf-profile.calltrace.cycles-pp.dentry_kill.dput.__fput.__x64_sys_close.do_syscall_64 29.09 -2.0 27.07 perf-profile.calltrace.cycles-pp.dput.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe 29.14 -2.0 27.12 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__close 29.14 -2.0 27.12 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close 29.14 -2.0 27.12 perf-profile.calltrace.cycles-pp.__close 29.12 -2.0 27.11 perf-profile.calltrace.cycles-pp.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close 29.11 -2.0 27.09 perf-profile.calltrace.cycles-pp.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close 20.28 -1.3 18.93 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio 23.94 -1.3 22.60 perf-profile.calltrace.cycles-pp.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 20.23 -1.3 18.89 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru 20.26 -1.3 18.92 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio 21.38 -1.3 20.05 perf-profile.calltrace.cycles-pp.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write 21.29 -1.3 19.96 perf-profile.calltrace.cycles-pp.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin 26.64 -1.2 25.46 perf-profile.calltrace.cycles-pp.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write 33.56 -0.7 32.84 perf-profile.calltrace.cycles-pp.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 0.80 -0.2 0.61 perf-profile.calltrace.cycles-pp.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write 0.79 -0.2 0.61 perf-profile.calltrace.cycles-pp.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags 0.83 -0.2 0.66 ± 2% perf-profile.calltrace.cycles-pp.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter 0.78 -0.2 0.61 ± 2% perf-profile.calltrace.cycles-pp.cgroup_rstat_flush_locked.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages 0.88 -0.2 0.71 ± 2% perf-profile.calltrace.cycles-pp.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write 1.24 ± 2% -0.1 1.12 ± 2% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu 1.24 ± 2% -0.1 1.12 ± 2% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release 1.24 ± 2% -0.1 1.12 ± 2% perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.truncate_inode_pages_range 1.26 -0.1 1.15 ± 2% perf-profile.calltrace.cycles-pp.lru_add_drain_cpu.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict 1.26 ± 2% -0.1 1.14 ± 2% perf-profile.calltrace.cycles-pp.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode 1.88 -0.1 1.80 perf-profile.calltrace.cycles-pp.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 0.74 ± 3% -0.1 0.68 ± 2% perf-profile.calltrace.cycles-pp.charge_memcg.__mem_cgroup_charge.__filemap_add_folio.filemap_add_folio.__filemap_get_folio 0.92 -0.1 0.87 perf-profile.calltrace.cycles-pp.__mem_cgroup_charge.__filemap_add_folio.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin 0.84 ± 2% -0.1 0.79 perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.folio_account_dirtied.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write 0.72 ± 2% -0.0 0.68 perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.folio_account_cleaned.__folio_cancel_dirty.try_to_free_buffers.truncate_cleanup_folio 1.40 +0.0 1.44 perf-profile.calltrace.cycles-pp.folio_account_dirtied.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write.block_write_end 0.52 ± 2% +0.0 0.55 perf-profile.calltrace.cycles-pp.security_file_permission.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.15 +0.0 1.19 perf-profile.calltrace.cycles-pp.folio_create_empty_buffers.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 0.80 +0.0 0.85 perf-profile.calltrace.cycles-pp.delete_from_page_cache_batch.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill 0.86 +0.0 0.91 ± 2% perf-profile.calltrace.cycles-pp.get_page_from_freelist.__alloc_pages.folio_alloc.__filemap_get_folio.ext4_da_write_begin 0.83 +0.1 0.88 perf-profile.calltrace.cycles-pp.fault_in_iov_iter_readable.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 0.80 +0.1 0.86 perf-profile.calltrace.cycles-pp.filemap_get_entry.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 1.73 +0.1 1.79 perf-profile.calltrace.cycles-pp.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end 0.73 ± 3% +0.1 0.79 ± 2% perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.filemap_read.vfs_read.ksys_read 1.04 +0.1 1.10 perf-profile.calltrace.cycles-pp.__alloc_pages.folio_alloc.__filemap_get_folio.ext4_da_write_begin.generic_perform_write 0.82 ± 3% +0.1 0.88 ± 2% perf-profile.calltrace.cycles-pp.touch_atime.filemap_read.vfs_read.ksys_read.do_syscall_64 0.78 ± 3% +0.1 0.84 ± 2% perf-profile.calltrace.cycles-pp.ext4_file_read_iter.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.21 +0.1 1.28 perf-profile.calltrace.cycles-pp.folio_alloc.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 1.40 +0.1 1.48 perf-profile.calltrace.cycles-pp.copyin.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write 2.25 +0.1 2.34 perf-profile.calltrace.cycles-pp.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write 2.21 +0.1 2.30 perf-profile.calltrace.cycles-pp.try_to_free_buffers.truncate_cleanup_folio.truncate_inode_pages_range.ext4_evict_inode.evict 1.74 ± 2% +0.1 1.84 perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read 1.13 ± 3% +0.1 1.24 ± 2% perf-profile.calltrace.cycles-pp.file_modified.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64 1.82 +0.1 1.93 perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 2.01 ± 2% +0.1 2.13 perf-profile.calltrace.cycles-pp._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read.ksys_read 2.38 +0.1 2.52 perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write 2.44 +0.1 2.58 perf-profile.calltrace.cycles-pp.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 2.22 ± 2% +0.1 2.36 perf-profile.calltrace.cycles-pp.copy_page_to_iter.filemap_read.vfs_read.ksys_read.do_syscall_64 1.98 ± 3% +0.1 2.13 ± 2% perf-profile.calltrace.cycles-pp.ext4_da_map_blocks.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write 2.03 ± 3% +0.2 2.18 ± 2% perf-profile.calltrace.cycles-pp.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter 3.12 +0.2 3.28 perf-profile.calltrace.cycles-pp.truncate_cleanup_folio.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill 3.92 +0.2 4.10 perf-profile.calltrace.cycles-pp.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 4.12 +0.2 4.32 perf-profile.calltrace.cycles-pp.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.34 ± 70% +0.2 0.55 ± 2% perf-profile.calltrace.cycles-pp.inode_needs_update_time.file_modified.ext4_buffered_write_iter.vfs_write.ksys_write 4.84 +0.2 5.08 perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 6.05 +0.4 6.40 perf-profile.calltrace.cycles-pp.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.68 +0.4 1.09 ± 22% perf-profile.calltrace.cycles-pp.__entry_text_start.write 0.08 ±223% +0.4 0.53 ± 2% perf-profile.calltrace.cycles-pp.ext4_da_reserve_space.ext4_da_map_blocks.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin 0.68 +0.6 1.23 ± 2% perf-profile.calltrace.cycles-pp.__entry_text_start.read 8.05 ± 4% +0.8 8.84 ± 4% perf-profile.calltrace.cycles-pp.filemap_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe 10.85 ± 3% +1.0 11.87 ± 3% perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read 11.49 ± 3% +1.1 12.55 ± 3% perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read 12.03 ± 2% +1.1 13.12 ± 3% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read 12.29 ± 2% +1.1 13.40 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read 13.83 ± 2% +1.2 15.02 ± 2% perf-profile.calltrace.cycles-pp.read 43.88 -3.7 40.20 perf-profile.children.cycles-pp._raw_spin_lock_irqsave 43.63 -3.7 39.96 ± 2% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 43.54 -3.6 39.91 ± 2% perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave 24.42 -2.3 22.14 ± 2% perf-profile.children.cycles-pp.__folio_batch_release 23.44 -2.2 21.28 perf-profile.children.cycles-pp.release_pages 28.83 -2.0 26.79 perf-profile.children.cycles-pp.truncate_inode_pages_range 29.06 -2.0 27.04 perf-profile.children.cycles-pp.evict 29.03 -2.0 27.01 perf-profile.children.cycles-pp.ext4_evict_inode 29.08 -2.0 27.06 perf-profile.children.cycles-pp.__dentry_kill 29.08 -2.0 27.06 perf-profile.children.cycles-pp.dentry_kill 29.10 -2.0 27.08 perf-profile.children.cycles-pp.dput 29.14 -2.0 27.12 perf-profile.children.cycles-pp.__close 29.12 -2.0 27.11 perf-profile.children.cycles-pp.__x64_sys_close 29.11 -2.0 27.09 perf-profile.children.cycles-pp.__fput 22.60 -1.5 21.15 perf-profile.children.cycles-pp.folio_batch_move_lru 23.95 -1.3 22.61 perf-profile.children.cycles-pp.filemap_add_folio 21.41 -1.3 20.07 perf-profile.children.cycles-pp.folio_add_lru 26.72 -1.2 25.56 perf-profile.children.cycles-pp.__filemap_get_folio 33.65 -0.7 32.94 perf-profile.children.cycles-pp.ext4_da_write_begin 1.79 ± 3% -0.4 1.42 ± 2% perf-profile.children.cycles-pp.cgroup_rstat_updated 1.78 ± 2% -0.2 1.58 perf-profile.children.cycles-pp.__mod_memcg_lruvec_state 0.80 -0.2 0.61 ± 2% perf-profile.children.cycles-pp.do_flush_stats 0.79 -0.2 0.61 perf-profile.children.cycles-pp.cgroup_rstat_flush 0.78 -0.2 0.61 perf-profile.children.cycles-pp.cgroup_rstat_flush_locked 0.83 -0.2 0.66 ± 2% perf-profile.children.cycles-pp.mem_cgroup_wb_stats 0.88 -0.2 0.71 ± 2% perf-profile.children.cycles-pp.balance_dirty_pages 0.59 -0.1 0.45 ± 2% perf-profile.children.cycles-pp.mem_cgroup_css_rstat_flush 1.29 ± 2% -0.1 1.17 ± 2% perf-profile.children.cycles-pp.lru_add_drain_cpu 2.65 -0.1 2.56 perf-profile.children.cycles-pp.__mod_lruvec_page_state 0.58 ± 3% -0.1 0.49 ± 2% perf-profile.children.cycles-pp.__count_memcg_events 1.96 -0.1 1.88 perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags 0.76 ± 3% -0.1 0.70 ± 2% perf-profile.children.cycles-pp.charge_memcg 0.94 ± 2% -0.1 0.88 perf-profile.children.cycles-pp.__mem_cgroup_charge 0.07 +0.0 0.08 perf-profile.children.cycles-pp.bdev_getblk 0.09 +0.0 0.10 ± 3% perf-profile.children.cycles-pp.ext4_fill_raw_inode 0.13 ± 2% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.ext4_do_update_inode 0.29 ± 2% +0.0 0.31 perf-profile.children.cycles-pp.up_write 0.09 ± 6% +0.0 0.10 ± 4% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack 0.15 ± 3% +0.0 0.16 ± 3% perf-profile.children.cycles-pp.jbd2_journal_grab_journal_head 0.15 ± 3% +0.0 0.17 ± 2% perf-profile.children.cycles-pp.ext4_mark_iloc_dirty 0.14 ± 4% +0.0 0.16 ± 3% perf-profile.children.cycles-pp.rcu_core 0.36 +0.0 0.38 perf-profile.children.cycles-pp.generic_write_checks 0.19 ± 2% +0.0 0.20 ± 2% perf-profile.children.cycles-pp.jbd2_journal_try_to_free_buffers 0.11 ± 4% +0.0 0.12 ± 4% perf-profile.children.cycles-pp.folio_memcg_lock 0.24 ± 2% +0.0 0.26 perf-profile.children.cycles-pp.aa_file_perm 0.29 +0.0 0.31 ± 2% perf-profile.children.cycles-pp.folio_unlock 0.47 ± 2% +0.0 0.49 perf-profile.children.cycles-pp.ext4_es_insert_delayed_block 0.28 ± 3% +0.0 0.30 perf-profile.children.cycles-pp.__slab_free 0.21 ± 2% +0.0 0.23 ± 2% perf-profile.children.cycles-pp.__ext4_mark_inode_dirty 0.38 ± 2% +0.0 0.40 ± 3% perf-profile.children.cycles-pp.syscall_return_via_sysret 0.13 ± 3% +0.0 0.15 ± 2% perf-profile.children.cycles-pp.try_charge_memcg 0.30 ± 2% +0.0 0.33 ± 2% perf-profile.children.cycles-pp.disk_cp 0.58 +0.0 0.61 ± 2% perf-profile.children.cycles-pp.syscall_exit_to_user_mode 0.37 ± 3% +0.0 0.40 ± 3% perf-profile.children.cycles-pp.ext4_es_lookup_extent 0.42 +0.0 0.46 ± 2% perf-profile.children.cycles-pp.__mod_lruvec_state 0.02 ±141% +0.0 0.06 ± 9% perf-profile.children.cycles-pp.update_load_avg 0.55 ± 2% +0.0 0.59 ± 3% perf-profile.children.cycles-pp.xas_descend 0.26 ± 7% +0.0 0.30 ± 3% perf-profile.children.cycles-pp.ext4_dirty_inode 1.42 +0.0 1.46 perf-profile.children.cycles-pp.folio_account_dirtied 1.16 +0.0 1.20 perf-profile.children.cycles-pp.folio_create_empty_buffers 0.58 +0.0 0.62 ± 2% perf-profile.children.cycles-pp.xas_store 0.75 +0.0 0.79 ± 2% perf-profile.children.cycles-pp.fault_in_readable 0.51 ± 3% +0.0 0.55 ± 2% perf-profile.children.cycles-pp.ext4_da_reserve_space 0.82 +0.0 0.86 perf-profile.children.cycles-pp.delete_from_page_cache_batch 0.88 +0.0 0.92 ± 2% perf-profile.children.cycles-pp.get_page_from_freelist 0.56 ± 2% +0.0 0.61 ± 2% perf-profile.children.cycles-pp.inode_needs_update_time 0.29 ± 3% +0.0 0.34 ± 7% perf-profile.children.cycles-pp.ext4_nonda_switch 0.86 +0.1 0.92 ± 2% perf-profile.children.cycles-pp.fault_in_iov_iter_readable 1.04 +0.1 1.10 perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack 0.72 +0.1 0.77 ± 2% perf-profile.children.cycles-pp.__cond_resched 1.75 +0.1 1.80 perf-profile.children.cycles-pp.__folio_mark_dirty 0.32 ± 7% +0.1 0.38 ± 4% perf-profile.children.cycles-pp.generic_update_time 0.84 +0.1 0.90 perf-profile.children.cycles-pp.filemap_get_entry 0.78 ± 3% +0.1 0.85 ± 2% perf-profile.children.cycles-pp.atime_needs_update 0.43 ± 6% +0.1 0.50 ± 3% perf-profile.children.cycles-pp.__mark_inode_dirty 1.06 +0.1 1.12 perf-profile.children.cycles-pp.__alloc_pages 0.79 ± 3% +0.1 0.86 perf-profile.children.cycles-pp.ext4_file_read_iter 1.22 +0.1 1.29 perf-profile.children.cycles-pp.folio_alloc 0.84 ± 3% +0.1 0.91 ± 2% perf-profile.children.cycles-pp.touch_atime 0.89 ± 2% +0.1 0.96 ± 2% perf-profile.children.cycles-pp.apparmor_file_permission 1.24 +0.1 1.32 perf-profile.children.cycles-pp.xas_load 2.28 +0.1 2.37 perf-profile.children.cycles-pp.mark_buffer_dirty 1.49 +0.1 1.58 perf-profile.children.cycles-pp.copyin 1.13 ± 2% +0.1 1.22 ± 2% perf-profile.children.cycles-pp.security_file_permission 2.23 +0.1 2.32 perf-profile.children.cycles-pp.try_to_free_buffers 1.84 ± 2% +0.1 1.94 perf-profile.children.cycles-pp.copyout 1.37 +0.1 1.47 perf-profile.children.cycles-pp.__entry_text_start 1.17 ± 3% +0.1 1.28 ± 2% perf-profile.children.cycles-pp.file_modified 1.92 +0.1 2.04 perf-profile.children.cycles-pp.copy_page_from_iter_atomic 2.04 ± 2% +0.1 2.16 perf-profile.children.cycles-pp._copy_to_iter 2.45 +0.1 2.58 perf-profile.children.cycles-pp.memset_orig 2.45 +0.1 2.58 perf-profile.children.cycles-pp.zero_user_segments 2.25 ± 2% +0.1 2.39 perf-profile.children.cycles-pp.copy_page_to_iter 2.04 ± 3% +0.2 2.19 ± 2% perf-profile.children.cycles-pp.ext4_da_get_block_prep 0.86 ± 3% +0.2 1.01 ± 2% perf-profile.children.cycles-pp.percpu_counter_add_batch 2.00 ± 3% +0.2 2.16 ± 2% perf-profile.children.cycles-pp.ext4_da_map_blocks 3.13 +0.2 3.28 perf-profile.children.cycles-pp.truncate_cleanup_folio 3.95 +0.2 4.14 perf-profile.children.cycles-pp.__block_commit_write 4.17 +0.2 4.37 perf-profile.children.cycles-pp.block_write_end 4.89 +0.2 5.14 perf-profile.children.cycles-pp.ext4_da_do_write_end 6.08 +0.3 6.43 perf-profile.children.cycles-pp.ext4_block_write_begin 8.12 ± 4% +0.8 8.92 ± 4% perf-profile.children.cycles-pp.filemap_read 10.92 ± 3% +1.0 11.96 ± 3% perf-profile.children.cycles-pp.vfs_read 11.55 ± 3% +1.1 12.61 ± 3% perf-profile.children.cycles-pp.ksys_read 14.32 ± 2% +1.2 15.55 ± 2% perf-profile.children.cycles-pp.read 43.63 -3.7 39.96 ± 2% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 1.56 ± 4% -0.3 1.24 ± 2% perf-profile.self.cycles-pp.cgroup_rstat_updated 0.57 -0.1 0.44 ± 3% perf-profile.self.cycles-pp.mem_cgroup_css_rstat_flush 0.10 -0.0 0.07 ± 6% perf-profile.self.cycles-pp.cgroup_rstat_flush_locked 0.12 ± 3% +0.0 0.14 ± 3% perf-profile.self.cycles-pp.folio_account_cleaned 0.22 ± 2% +0.0 0.24 ± 3% perf-profile.self.cycles-pp.mod_objcg_state 0.16 ± 2% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.drop_buffers 0.20 +0.0 0.22 ± 2% perf-profile.self.cycles-pp.aa_file_perm 0.17 ± 2% +0.0 0.19 ± 3% perf-profile.self.cycles-pp.folio_create_empty_buffers 0.34 ± 2% +0.0 0.36 ± 2% perf-profile.self.cycles-pp.block_invalidate_folio 0.23 ± 2% +0.0 0.25 ± 3% perf-profile.self.cycles-pp.__fdget_pos 0.28 +0.0 0.29 perf-profile.self.cycles-pp.folio_unlock 0.09 ± 5% +0.0 0.10 ± 7% perf-profile.self.cycles-pp.try_charge_memcg 0.20 ± 5% +0.0 0.22 ± 3% perf-profile.self.cycles-pp.kmem_cache_free 0.28 ± 3% +0.0 0.30 perf-profile.self.cycles-pp.__slab_free 0.31 ± 2% +0.0 0.33 ± 3% perf-profile.self.cycles-pp.xas_store 0.37 ± 2% +0.0 0.40 ± 2% perf-profile.self.cycles-pp.syscall_return_via_sysret 0.44 +0.0 0.47 ± 2% perf-profile.self.cycles-pp.__cond_resched 0.29 ± 2% +0.0 0.32 ± 3% perf-profile.self.cycles-pp.do_syscall_64 0.36 +0.0 0.39 ± 2% perf-profile.self.cycles-pp.filemap_get_entry 0.42 +0.0 0.46 ± 2% perf-profile.self.cycles-pp.copy_page_from_iter_atomic 0.42 +0.0 0.45 ± 2% perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe 0.54 +0.0 0.57 ± 2% perf-profile.self.cycles-pp.generic_perform_write 0.42 ± 2% +0.0 0.46 ± 2% perf-profile.self.cycles-pp.xas_load 0.47 ± 2% +0.0 0.51 perf-profile.self.cycles-pp.ext4_da_do_write_end 0.47 ± 2% +0.0 0.51 ± 3% perf-profile.self.cycles-pp.xas_descend 0.72 +0.0 0.77 ± 2% perf-profile.self.cycles-pp.fault_in_readable 0.56 +0.0 0.60 ± 2% perf-profile.self.cycles-pp.__filemap_get_folio 0.28 ± 2% +0.0 0.32 ± 7% perf-profile.self.cycles-pp.ext4_nonda_switch 1.01 +0.0 1.06 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack 0.40 +0.0 0.44 ± 2% perf-profile.self.cycles-pp.__entry_text_start 0.40 ± 4% +0.1 0.45 ± 4% perf-profile.self.cycles-pp.atime_needs_update 0.63 ± 3% +0.1 0.68 ± 4% perf-profile.self.cycles-pp.apparmor_file_permission 0.56 ± 2% +0.1 0.62 ± 2% perf-profile.self.cycles-pp.ext4_da_write_begin 0.50 ± 6% +0.1 0.56 ± 5% perf-profile.self.cycles-pp.__mod_memcg_lruvec_state 0.77 ± 3% +0.1 0.83 ± 2% perf-profile.self.cycles-pp.ext4_file_read_iter 0.92 +0.1 0.99 ± 2% perf-profile.self.cycles-pp.vfs_read 1.46 +0.1 1.54 perf-profile.self.cycles-pp.copyin 1.53 +0.1 1.62 perf-profile.self.cycles-pp.__block_commit_write 1.81 ± 2% +0.1 1.90 perf-profile.self.cycles-pp.copyout 2.42 +0.1 2.56 perf-profile.self.cycles-pp.memset_orig 0.78 ± 2% +0.1 0.91 ± 2% perf-profile.self.cycles-pp.percpu_counter_add_batch 1.31 ± 2% +0.1 1.45 perf-profile.self.cycles-pp.vfs_write 1.73 ± 4% +0.2 1.90 ± 5% perf-profile.self.cycles-pp.filemap_read Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki