Hello, kernel test robot noticed a 22.9% improvement of stress-ng.metamix.ops_per_sec on: commit: c7fc0366c65628fd69bfc310affec4918199aae2 ("ext4: partial zero eof block on unaligned inode size extension") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master testcase: stress-ng config: x86_64-rhel-9.4 compiler: gcc-12 test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (Ivy Bridge) with 16G memory parameters: nr_threads: 100% disk: 1HDD testtime: 60s fs: ext4 test: metamix cpufreq_governor: performance Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20241206/202412061635.d962a313-lkp@xxxxxxxxx ========================================================================================= compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: gcc-12/performance/1HDD/ext4/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-ivb-d01/metamix/stress-ng/60s commit: 25f51ea8ac ("ext4: disambiguate the return value of ext4_dio_write_end_io()") c7fc0366c6 ("ext4: partial zero eof block on unaligned inode size extension") 25f51ea8ac8144af c7fc0366c65628fd69bfc310aff ---------------- --------------------------- %stddev %change %stddev \ | \ 553.38 ± 4% +12.1% 620.15 ± 2% uptime.idle 26348 ± 3% -40.4% 15711 ± 7% meminfo.Dirty 16924 ± 2% -32.5% 11430 ± 7% meminfo.Writeback 6.91 ± 9% +163.1% 18.17 ± 7% iostat.cpu.idle 90.85 -13.0% 79.04 iostat.cpu.iowait 1.45 +33.6% 1.93 iostat.cpu.system 1.117e+08 ± 4% +15.9% 1.295e+08 ± 6% sched_debug.cfs_rq:/.avg_vruntime.avg 8990780 ± 48% +201.7% 27121462 ± 23% sched_debug.cfs_rq:/.avg_vruntime.min 1.117e+08 ± 4% +15.9% 1.295e+08 ± 6% sched_debug.cfs_rq:/.min_vruntime.avg 8990780 ± 48% +201.7% 27121462 ± 23% sched_debug.cfs_rq:/.min_vruntime.min 4.41 ± 14% +11.6 16.01 ± 9% mpstat.cpu.all.idle% 93.54 -12.1 81.39 mpstat.cpu.all.iowait% 0.23 ± 2% +0.1 0.31 ± 2% mpstat.cpu.all.irq% 0.07 ± 2% +0.0 0.07 mpstat.cpu.all.soft% 1.13 +0.4 1.54 mpstat.cpu.all.sys% 6.94 ± 9% +162.5% 18.21 ± 7% vmstat.cpu.id 90.85 -13.0% 79.05 vmstat.cpu.wa 31576 +20.8% 38129 vmstat.io.bo 64.71 ± 3% -33.6% 42.98 ± 8% vmstat.procs.b 2893 +2.9% 2976 vmstat.system.in 1911 ± 2% +21.8% 2328 stress-ng.metamix.ops 30.37 +22.9% 37.32 ± 2% stress-ng.metamix.ops_per_sec 4111968 ± 2% +20.4% 4952304 stress-ng.time.file_system_outputs 24323 +8.5% 26386 stress-ng.time.minor_page_faults 6.17 ± 6% +54.1% 9.50 ± 5% stress-ng.time.percent_of_cpu_this_job_got 23964 ± 2% +18.6% 28423 stress-ng.time.voluntary_context_switches 514496 ± 2% +20.4% 619656 proc-vmstat.nr_dirtied 6606 ± 3% -42.4% 3807 ± 5% proc-vmstat.nr_dirty 4338 +3.4% 4486 ± 2% proc-vmstat.nr_page_table_pages 4266 ± 2% -33.9% 2820 ± 7% proc-vmstat.nr_writeback 514089 ± 2% +20.5% 619358 proc-vmstat.nr_written 10850 ± 3% -39.1% 6607 ± 6% proc-vmstat.nr_zone_write_pending 666231 ± 2% +15.4% 769072 proc-vmstat.numa_hit 666231 ± 2% +15.5% 769184 proc-vmstat.numa_local 698850 ± 2% +14.8% 802298 proc-vmstat.pgalloc_normal 678127 ± 3% +15.7% 784461 ± 2% proc-vmstat.pgfree 2088116 ± 2% +19.9% 2503923 proc-vmstat.pgpgout 94023007 +15.2% 1.083e+08 perf-stat.i.branch-instructions 2405282 +15.0% 2765956 perf-stat.i.branch-misses 14.62 ± 2% -1.2 13.38 perf-stat.i.cache-miss-rate% 583290 ± 2% +17.5% 685524 ± 2% perf-stat.i.cache-misses 3648640 ± 2% +29.2% 4714050 perf-stat.i.cache-references 5.461e+08 +24.7% 6.808e+08 ± 2% perf-stat.i.cpu-cycles 164.88 ± 2% +24.0% 204.37 perf-stat.i.cpu-migrations 833.08 +7.0% 891.37 ± 2% perf-stat.i.cycles-between-cache-misses 4.545e+08 +16.6% 5.301e+08 perf-stat.i.instructions 0.61 -1.1% 0.60 perf-stat.i.ipc 15.99 -1.4 14.54 perf-stat.overall.cache-miss-rate% 1.20 +6.9% 1.28 perf-stat.overall.cpi 936.52 ± 2% +6.1% 993.24 perf-stat.overall.cycles-between-cache-misses 0.83 -6.4% 0.78 perf-stat.overall.ipc 92548187 +15.2% 1.066e+08 perf-stat.ps.branch-instructions 2367842 +15.0% 2723489 perf-stat.ps.branch-misses 574185 ± 2% +17.5% 674749 ± 2% perf-stat.ps.cache-misses 3591758 ± 2% +29.2% 4640021 perf-stat.ps.cache-references 5.375e+08 +24.7% 6.701e+08 ± 2% perf-stat.ps.cpu-cycles 162.31 ± 2% +23.9% 201.17 perf-stat.ps.cpu-migrations 4.474e+08 +16.6% 5.218e+08 perf-stat.ps.instructions 2.874e+10 +15.8% 3.328e+10 ± 2% perf-stat.total.instructions 0.02 ± 9% +25.2% 0.02 ± 6% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity 0.48 ± 44% +216.9% 1.52 ±106% perf-sched.sch_delay.avg.ms.io_schedule.bit_wait_io.__wait_on_bit.out_of_line_wait_on_bit 0.00 ± 16% +148.1% 0.01 ± 76% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone 0.02 ± 10% +26.8% 0.03 ± 8% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 7.94 ± 52% -76.1% 1.90 ±131% perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 0.02 ± 55% +1263.0% 0.21 ±185% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone 252.86 ± 13% -46.6% 134.96 ± 19% perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range 291.92 ± 4% -38.6% 179.21 ± 17% perf-sched.wait_and_delay.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle 45.27 ± 9% -11.1% 40.23 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 26.86 ± 2% +22.0% 32.76 ± 6% perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 292.00 ± 12% +71.9% 501.83 ± 16% perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range 323.83 ± 7% +29.0% 417.83 ± 12% perf-sched.wait_and_delay.count.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync 2817 ± 2% -17.3% 2329 ± 8% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 999.12 ± 4% +71.7% 1715 ± 20% perf-sched.wait_and_delay.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle 909.77 ± 11% +88.7% 1716 ± 20% perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file 970.08 ± 4% +51.7% 1471 ± 28% perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync 252.77 ± 13% -46.6% 134.90 ± 19% perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range 291.02 ± 4% -38.8% 178.11 ± 17% perf-sched.wait_time.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle 21.25 ±192% +2117.6% 471.24 ± 90% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat 45.22 ± 9% -11.1% 40.19 ± 7% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread 26.84 ± 2% +22.0% 32.73 ± 6% perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 998.61 ± 4% +71.3% 1710 ± 20% perf-sched.wait_time.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle 908.10 ± 12% +88.5% 1711 ± 20% perf-sched.wait_time.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file 969.71 ± 4% +51.4% 1468 ± 28% perf-sched.wait_time.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync 28.45 ±194% +3281.7% 962.20 ± 56% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat 40.18 ± 3% -4.3 35.85 ± 2% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64 40.18 ± 3% -4.3 35.85 ± 2% perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64 44.88 ± 3% -4.3 40.58 perf-profile.calltrace.cycles-pp.common_startup_64 40.10 ± 3% -4.3 35.80 ± 2% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64 36.18 ± 4% -4.2 31.96 ± 2% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary 39.12 ± 3% -4.2 34.91 ± 2% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64 39.55 ± 4% -4.1 35.44 perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry 22.45 ± 4% -1.7 20.71 ± 4% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle 3.22 ± 14% -1.6 1.58 ± 15% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle 6.97 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix 6.97 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync 6.96 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe 7.03 ± 7% -1.6 5.46 ± 6% perf-profile.calltrace.cycles-pp.fdatasync.stress_metamix 6.99 ± 7% -1.6 5.43 ± 6% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix 6.99 ± 7% -1.6 5.43 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix 6.17 ± 7% -1.5 4.71 ± 7% perf-profile.calltrace.cycles-pp.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64 5.84 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync 5.84 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync 5.83 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file 5.82 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range 5.82 ± 9% -1.3 4.50 ± 7% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range 5.87 ± 9% -0.9 4.99 ± 6% perf-profile.calltrace.cycles-pp.llseek.stress_metamix 3.42 ± 6% -0.6 2.78 ± 5% perf-profile.calltrace.cycles-pp.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc 3.20 ± 5% -0.6 2.64 ± 14% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 2.42 ± 6% -0.5 1.96 ± 15% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 2.09 ± 8% -0.4 1.66 ± 9% perf-profile.calltrace.cycles-pp.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages 1.00 ± 16% -0.4 0.59 ± 45% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc 1.79 ± 5% -0.4 1.39 ± 15% perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.llseek.stress_metamix 1.37 ± 16% -0.3 1.03 ± 11% perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.llseek.stress_metamix 1.62 ± 8% -0.3 1.34 ± 9% perf-profile.calltrace.cycles-pp.mpage_submit_folio.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages 0.89 ± 16% -0.2 0.68 ± 17% perf-profile.calltrace.cycles-pp.sched_balance_rq.sched_balance_domains.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt 0.00 +0.6 0.62 ± 17% perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 0.00 +0.8 0.76 ± 13% perf-profile.calltrace.cycles-pp.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write 0.00 +0.8 0.76 ± 25% perf-profile.calltrace.cycles-pp.__ext4_get_inode_loc.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end 0.00 +0.8 0.76 ± 24% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2 0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write 0.00 +0.8 0.78 ± 19% perf-profile.calltrace.cycles-pp.crc32c_pcl_intel_update.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode 0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread 0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork 0.19 ±142% +0.9 1.10 ± 21% perf-profile.calltrace.cycles-pp.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork.ret_from_fork_asm 0.19 ±142% +0.9 1.10 ± 21% perf-profile.calltrace.cycles-pp.kjournald2.kthread.ret_from_fork.ret_from_fork_asm 0.00 +1.0 0.96 ± 9% perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.00 +1.0 0.97 ± 16% perf-profile.calltrace.cycles-pp.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 0.00 +1.0 1.03 ± 16% perf-profile.calltrace.cycles-pp.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 0.00 +1.0 1.03 ± 16% perf-profile.calltrace.cycles-pp.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.00 +1.1 1.08 ± 8% perf-profile.calltrace.cycles-pp.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty 0.00 +1.1 1.08 ± 18% perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write 0.00 +1.2 1.15 ± 9% perf-profile.calltrace.cycles-pp.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty 0.00 +1.2 1.16 ± 19% perf-profile.calltrace.cycles-pp.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 0.00 +1.7 1.70 ± 11% perf-profile.calltrace.cycles-pp.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end 0.00 +1.7 1.71 ± 10% perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 0.00 +1.8 1.78 ± 9% perf-profile.calltrace.cycles-pp.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.00 +1.9 1.88 ± 13% perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write 0.00 +2.2 2.16 ± 9% perf-profile.calltrace.cycles-pp.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write 0.00 +2.3 2.34 ± 8% perf-profile.calltrace.cycles-pp.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter 0.00 +3.4 3.42 ± 5% perf-profile.calltrace.cycles-pp.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write 43.79 ± 4% +5.7 49.45 perf-profile.calltrace.cycles-pp.stress_metamix 13.98 ± 10% +8.6 22.60 ± 3% perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64 15.52 ± 10% +8.7 24.25 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write.stress_metamix 15.46 ± 10% +8.7 24.21 ± 3% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix 14.55 ± 10% +8.8 23.32 ± 2% perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe 14.97 ± 10% +8.8 23.79 ± 2% perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write 15.16 ± 10% +8.8 24.01 ± 2% perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix 17.88 ± 10% +8.9 26.79 ± 2% perf-profile.calltrace.cycles-pp.write.stress_metamix 0.00 +10.0 10.04 ± 6% perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write 40.18 ± 3% -4.3 35.85 ± 2% perf-profile.children.cycles-pp.start_secondary 44.88 ± 3% -4.3 40.58 perf-profile.children.cycles-pp.common_startup_64 44.88 ± 3% -4.3 40.58 perf-profile.children.cycles-pp.cpu_startup_entry 44.87 ± 3% -4.3 40.57 perf-profile.children.cycles-pp.do_idle 43.72 ± 3% -4.2 39.52 perf-profile.children.cycles-pp.cpuidle_idle_call 40.17 ± 4% -4.1 36.03 perf-profile.children.cycles-pp.cpuidle_enter_state 40.30 ± 4% -4.1 36.18 perf-profile.children.cycles-pp.cpuidle_enter 22.45 ± 4% -1.7 20.71 ± 4% perf-profile.children.cycles-pp.intel_idle 3.44 ± 14% -1.7 1.79 ± 16% perf-profile.children.cycles-pp.poll_idle 7.44 ± 7% -1.6 5.84 ± 6% perf-profile.children.cycles-pp.do_fsync 7.43 ± 7% -1.6 5.84 ± 6% perf-profile.children.cycles-pp.ext4_sync_file 7.01 ± 7% -1.6 5.45 ± 6% perf-profile.children.cycles-pp.__x64_sys_fdatasync 7.07 ± 7% -1.5 5.54 ± 6% perf-profile.children.cycles-pp.fdatasync 6.20 ± 7% -1.5 4.74 ± 7% perf-profile.children.cycles-pp.file_write_and_wait_range 5.89 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.do_writepages 5.87 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.__filemap_fdatawrite_range 5.87 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.filemap_fdatawrite_wbc 5.86 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.ext4_writepages 6.07 ± 9% -0.9 5.19 ± 6% perf-profile.children.cycles-pp.llseek 3.42 ± 6% -0.6 2.78 ± 6% perf-profile.children.cycles-pp.mpage_map_and_submit_extent 3.21 ± 5% -0.6 2.64 ± 14% perf-profile.children.cycles-pp.worker_thread 3.42 ± 10% -0.5 2.90 ± 9% perf-profile.children.cycles-pp.handle_softirqs 2.42 ± 6% -0.5 1.96 ± 15% perf-profile.children.cycles-pp.process_one_work 2.09 ± 8% -0.4 1.66 ± 9% perf-profile.children.cycles-pp.mpage_map_and_submit_buffers 0.34 ± 16% -0.2 0.13 ± 55% perf-profile.children.cycles-pp.mpage_process_page_bufs 0.50 ± 23% -0.2 0.33 ± 28% perf-profile.children.cycles-pp.native_irq_return_iret 0.24 ± 31% -0.1 0.13 ± 36% perf-profile.children.cycles-pp.ext4_file_read_iter 0.25 ± 20% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.khugepaged 0.24 ± 17% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.hpage_collapse_scan_pmd 0.24 ± 17% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.khugepaged_scan_mm_slot 0.23 ± 13% -0.1 0.15 ± 12% perf-profile.children.cycles-pp.collapse_huge_page 0.17 ± 13% -0.0 0.14 ± 7% perf-profile.children.cycles-pp.__collapse_huge_page_copy 0.07 ± 51% +0.1 0.14 ± 27% perf-profile.children.cycles-pp.tmigr_quick_check 0.11 ± 48% +0.1 0.21 ± 15% perf-profile.children.cycles-pp.tmigr_requires_handle_remote 0.00 +0.1 0.14 ± 41% perf-profile.children.cycles-pp.jbd2_journal_dirty_metadata 0.12 ± 72% +0.2 0.27 ± 19% perf-profile.children.cycles-pp.inode_to_bdi 0.13 ± 55% +0.2 0.32 ± 25% perf-profile.children.cycles-pp.jbd2_journal_get_write_access 0.01 ±223% +0.2 0.21 ± 45% perf-profile.children.cycles-pp.add_transaction_credits 0.30 ± 42% +0.2 0.50 ± 14% perf-profile.children.cycles-pp.__find_get_block 0.06 ± 75% +0.2 0.27 ± 20% perf-profile.children.cycles-pp.folio_mark_accessed 0.31 ± 41% +0.2 0.54 ± 13% perf-profile.children.cycles-pp.bdev_getblk 0.03 ±145% +0.2 0.28 ± 27% perf-profile.children.cycles-pp.__ext4_handle_dirty_metadata 0.01 ±223% +0.2 0.26 ± 27% perf-profile.children.cycles-pp.stop_this_handle 0.05 ±165% +0.3 0.31 ± 13% perf-profile.children.cycles-pp.ext4_inode_journal_mode 0.00 +0.3 0.28 ± 26% perf-profile.children.cycles-pp.filemap_dirty_folio 0.37 ± 15% +0.3 0.65 ± 23% perf-profile.children.cycles-pp.__cond_resched 0.14 ± 52% +0.3 0.43 ± 18% perf-profile.children.cycles-pp.__ext4_journal_get_write_access 0.00 +0.3 0.29 ± 33% perf-profile.children.cycles-pp.folio_redirty_for_writepage 0.36 ± 25% +0.4 0.75 ± 10% perf-profile.children.cycles-pp.filemap_get_entry 0.10 ± 80% +0.4 0.55 ± 9% perf-profile.children.cycles-pp.jbd2_journal_stop 0.00 +0.5 0.45 ± 18% perf-profile.children.cycles-pp.jbd2_journal_file_inode 0.10 ± 80% +0.5 0.61 ± 11% perf-profile.children.cycles-pp.__ext4_journal_stop 0.22 ± 57% +0.6 0.86 ± 27% perf-profile.children.cycles-pp.__ext4_get_inode_loc 0.22 ± 58% +0.6 0.87 ± 27% perf-profile.children.cycles-pp.ext4_get_inode_loc 0.44 ± 27% +0.7 1.10 ± 21% perf-profile.children.cycles-pp.kjournald2 0.44 ± 28% +0.7 1.10 ± 21% perf-profile.children.cycles-pp.jbd2_journal_commit_transaction 0.05 ±103% +0.7 0.72 ± 11% perf-profile.children.cycles-pp.start_this_handle 0.42 ± 15% +0.8 1.18 ± 13% perf-profile.children.cycles-pp.crc32c_pcl_intel_update 0.00 +0.8 0.78 ± 23% perf-profile.children.cycles-pp.ext4_normal_submit_inode_data_buffers 0.28 ± 41% +1.0 1.23 ± 17% perf-profile.children.cycles-pp.ext4_reserve_inode_write 0.08 ± 74% +1.0 1.04 ± 6% perf-profile.children.cycles-pp.jbd2__journal_start 0.26 ± 23% +1.0 1.23 ± 5% perf-profile.children.cycles-pp.ext4_inode_csum 0.28 ± 19% +1.0 1.32 ± 6% perf-profile.children.cycles-pp.ext4_inode_csum_set 3.99 ± 13% +1.1 5.13 ± 6% perf-profile.children.cycles-pp.__filemap_get_folio 3.80 ± 18% +1.2 4.95 ± 6% perf-profile.children.cycles-pp.memset_orig 3.80 ± 18% +1.4 5.20 ± 5% perf-profile.children.cycles-pp.zero_user_segments 0.40 ± 19% +1.6 1.96 ± 11% perf-profile.children.cycles-pp.ext4_fill_raw_inode 0.00 +1.8 1.80 ± 9% perf-profile.children.cycles-pp.ext4_zero_partial_blocks 0.55 ± 16% +2.0 2.55 ± 7% perf-profile.children.cycles-pp.ext4_do_update_inode 0.58 ± 19% +2.2 2.78 ± 6% perf-profile.children.cycles-pp.ext4_mark_iloc_dirty 0.85 ± 16% +3.2 4.03 ± 4% perf-profile.children.cycles-pp.__ext4_mark_inode_dirty 0.00 +3.6 3.61 ± 9% perf-profile.children.cycles-pp.__ext4_block_zero_page_range 38.33 ± 4% +5.5 43.88 ± 2% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe 38.13 ± 4% +5.6 43.69 ± 2% perf-profile.children.cycles-pp.do_syscall_64 43.79 ± 4% +5.7 49.45 perf-profile.children.cycles-pp.stress_metamix 13.98 ± 10% +8.6 22.61 ± 3% perf-profile.children.cycles-pp.generic_perform_write 14.57 ± 10% +8.8 23.33 ± 2% perf-profile.children.cycles-pp.ext4_buffered_write_iter 15.14 ± 9% +8.8 23.92 ± 2% perf-profile.children.cycles-pp.vfs_write 15.35 ± 9% +8.8 24.15 ± 2% perf-profile.children.cycles-pp.ksys_write 18.28 ± 9% +8.8 27.11 ± 2% perf-profile.children.cycles-pp.write 0.00 +10.0 10.04 ± 6% perf-profile.children.cycles-pp.ext4_da_do_write_end 22.43 ± 4% -1.7 20.71 ± 4% perf-profile.self.cycles-pp.intel_idle 3.24 ± 15% -1.6 1.69 ± 18% perf-profile.self.cycles-pp.poll_idle 0.34 ± 16% -0.2 0.13 ± 53% perf-profile.self.cycles-pp.mpage_process_page_bufs 0.50 ± 23% -0.2 0.33 ± 28% perf-profile.self.cycles-pp.native_irq_return_iret 0.24 ± 31% -0.1 0.12 ± 30% perf-profile.self.cycles-pp.ext4_file_read_iter 0.01 ±223% +0.1 0.10 ± 33% perf-profile.self.cycles-pp.mmap_region 0.00 +0.1 0.11 ± 34% perf-profile.self.cycles-pp.jbd2_journal_dirty_metadata 0.01 ±223% +0.1 0.15 ± 55% perf-profile.self.cycles-pp.jbd2_journal_get_write_access 0.10 ± 85% +0.1 0.24 ± 20% perf-profile.self.cycles-pp.inode_to_bdi 0.00 +0.1 0.14 ± 32% perf-profile.self.cycles-pp.jbd2__journal_start 0.00 +0.1 0.14 ± 25% perf-profile.self.cycles-pp.jbd2_journal_file_inode 0.18 ± 43% +0.2 0.34 ± 11% perf-profile.self.cycles-pp.__cond_resched 0.05 ±102% +0.2 0.22 ± 26% perf-profile.self.cycles-pp.ext4_do_update_inode 0.07 ± 78% +0.2 0.24 ± 43% perf-profile.self.cycles-pp.__ext4_get_inode_loc 0.06 ± 75% +0.2 0.24 ± 33% perf-profile.self.cycles-pp.folio_mark_accessed 0.00 +0.2 0.18 ± 43% perf-profile.self.cycles-pp.ext4_da_do_write_end 0.00 +0.2 0.19 ± 38% perf-profile.self.cycles-pp.ext4_mark_iloc_dirty 0.03 ±142% +0.2 0.22 ± 32% perf-profile.self.cycles-pp.ext4_inode_csum 0.00 +0.2 0.20 ± 45% perf-profile.self.cycles-pp.add_transaction_credits 0.00 +0.2 0.20 ± 38% perf-profile.self.cycles-pp.__ext4_block_zero_page_range 0.00 +0.2 0.21 ± 37% perf-profile.self.cycles-pp.stop_this_handle 0.71 ± 19% +0.2 0.93 ± 15% perf-profile.self.cycles-pp._raw_spin_lock 0.10 ± 55% +0.2 0.34 ± 9% perf-profile.self.cycles-pp.filemap_get_entry 0.05 ±165% +0.2 0.29 ± 12% perf-profile.self.cycles-pp.ext4_inode_journal_mode 0.07 ± 50% +0.3 0.34 ± 32% perf-profile.self.cycles-pp.zero_user_segments 0.00 +0.3 0.34 ± 7% perf-profile.self.cycles-pp.start_this_handle 0.05 ±103% +0.5 0.50 ± 24% perf-profile.self.cycles-pp.ext4_fill_raw_inode 0.19 ± 19% +0.7 0.89 ± 9% perf-profile.self.cycles-pp.crc32c_pcl_intel_update Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki