Hello, kernel test robot noticed a -85.6% regression of filebench.sum_operations/s on: commit: 49b29a573da83b65d5f4ecf2db6619bab7aa910c ("nfs: add support for large folios") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master testcase: filebench test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz (Ice Lake) with 128G memory parameters: disk: 1HDD fs: ext4 fs2: nfsv4 test: randomrw.f cpufreq_governor: performance If you fix the issue in a separate patch/commit (i.e. not just a new version of the same patch/commit), kindly add following tags | Reported-by: kernel test robot <oliver.sang@xxxxxxxxx> | Closes: https://lore.kernel.org/oe-lkp/202408081514.106c770e-oliver.sang@xxxxxxxxx Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20240808/202408081514.106c770e-oliver.sang@xxxxxxxxx ========================================================================================= compiler/cpufreq_governor/disk/fs2/fs/kconfig/rootfs/tbox_group/test/testcase: gcc-13/performance/1HDD/nfsv4/ext4/x86_64-rhel-8.3/debian-12-x86_64-20240206.cgz/lkp-icl-2sp6/randomrw.f/filebench commit: 9aac777aaf ("filemap: Convert generic_perform_write() to support large folios") 49b29a573d ("nfs: add support for large folios") 9aac777aaf945978 49b29a573da83b65d5f4ecf2db6 ---------------- --------------------------- %stddev %change %stddev \ | \ 98.36 -1.0% 97.34 iostat.cpu.idle 1.28 ± 21% +89.3% 2.42 ± 10% iostat.cpu.iowait 203868 ± 9% -25.7% 151373 ± 17% numa-meminfo.node0.SUnreclaim 1026057 ± 40% -76.5% 241071 ± 70% numa-meminfo.node1.Dirty 1693537 ± 24% -74.2% 437697 ± 27% sched_debug.cpu.nr_switches.max 191731 ± 18% -61.9% 73070 ± 25% sched_debug.cpu.nr_switches.stddev 1849853 ± 66% +175.6% 5097620 ± 32% numa-vmstat.node0.nr_dirtied 50982 ± 9% -25.8% 37842 ± 17% numa-vmstat.node0.nr_slab_unreclaimable 1849844 ± 66% +175.6% 5097604 ± 32% numa-vmstat.node0.nr_written 256902 ± 40% -76.9% 59457 ± 71% numa-vmstat.node1.nr_dirty 43810 +35.9% 59529 ± 5% vmstat.io.bo 1.65 ± 22% +93.6% 3.19 ± 9% vmstat.procs.b 1.54 ± 3% -11.2% 1.37 vmstat.procs.r 4341 ± 3% -18.4% 3543 ± 4% vmstat.system.in 3713739 ± 3% +10.7% 4110439 ± 3% meminfo.Active 3688730 ± 2% +10.7% 4085171 ± 3% meminfo.Active(file) 1631748 ± 3% -63.1% 602285 ± 12% meminfo.Dirty 398961 -23.8% 304003 meminfo.SUnreclaim 655165 -16.2% 549118 meminfo.Slab 1411642 ± 3% +56.0% 2202614 ± 5% meminfo.Writeback 1.29 ± 21% +1.2 2.44 ± 10% mpstat.cpu.all.iowait% 0.02 ± 2% +0.0 0.02 ± 5% mpstat.cpu.all.soft% 0.28 ± 5% -0.1 0.15 ± 6% mpstat.cpu.all.sys% 0.05 ± 2% -0.0 0.04 ± 6% mpstat.cpu.all.usr% 22.67 ±184% -86.8% 3.00 mpstat.max_utilization.seconds 7.61 ± 9% -12.7% 6.65 mpstat.max_utilization_pct 919.42 ± 16% -85.6% 132.50 ± 34% filebench.sum_bytes_mb/s 7061641 ± 16% -85.6% 1017633 ± 34% filebench.sum_operations 117684 ± 16% -85.6% 16959 ± 34% filebench.sum_operations/s 59443 ± 16% -85.7% 8524 ± 34% filebench.sum_reads/s 0.02 ± 13% +611.2% 0.12 ± 31% filebench.sum_time_ms/op 58241 ± 16% -85.5% 8435 ± 34% filebench.sum_writes/s 21810424 +37.4% 29966336 ± 11% filebench.time.file_system_outputs 14251 -5.6% 13458 ± 2% filebench.time.minor_page_faults 17.67 ± 5% -90.6% 1.67 ± 66% filebench.time.percent_of_cpu_this_job_got 41.30 ± 5% -88.1% 4.90 ± 43% filebench.time.system_time 3582819 ± 16% -85.8% 509229 ± 33% filebench.time.voluntary_context_switches 921497 ± 2% +11.0% 1022743 ± 3% proc-vmstat.nr_active_file 5452715 +37.4% 7491524 ± 11% proc-vmstat.nr_dirtied 407500 ± 2% -62.8% 151456 ± 13% proc-vmstat.nr_dirty 1653629 -5.4% 1564747 ± 2% proc-vmstat.nr_inactive_file 64005 -4.2% 61291 proc-vmstat.nr_slab_reclaimable 99730 -23.8% 75999 proc-vmstat.nr_slab_unreclaimable 352757 ± 3% +55.8% 549420 ± 5% proc-vmstat.nr_writeback 5452662 +37.4% 7491481 ± 11% proc-vmstat.nr_written 921497 ± 2% +11.0% 1022743 ± 3% proc-vmstat.nr_zone_active_file 1653629 -5.4% 1564747 ± 2% proc-vmstat.nr_zone_inactive_file 1655 ± 15% -42.7% 947.83 ± 33% proc-vmstat.numa_hint_faults 1546 ± 16% -53.2% 723.33 ± 38% proc-vmstat.numa_hint_faults_local 4381534 ± 6% -25.2% 3276231 ± 16% proc-vmstat.numa_hit 4248725 ± 6% -26.0% 3143651 ± 17% proc-vmstat.numa_local 10725 ± 54% -57.7% 4532 ± 35% proc-vmstat.numa_pte_updates 1271493 ± 2% +3.6% 1317523 proc-vmstat.pgactivate 10906410 +37.4% 14983963 ± 11% proc-vmstat.pgpgout 1.982e+08 ± 5% -35.7% 1.274e+08 ± 3% perf-stat.i.branch-instructions 7.83 ± 3% -1.4 6.40 ± 7% perf-stat.i.cache-miss-rate% 5551160 ± 7% -48.1% 2879590 ± 5% perf-stat.i.cache-misses 25389854 ± 6% -30.6% 17609995 ± 2% perf-stat.i.cache-references 1.92 +3.1% 1.98 perf-stat.i.cpi 1.685e+09 ± 4% -35.6% 1.086e+09 ± 5% perf-stat.i.cpu-cycles 175.05 ± 3% -15.6% 147.75 ± 2% perf-stat.i.cpu-migrations 1277 ± 3% +27.3% 1625 ± 8% perf-stat.i.cycles-between-cache-misses 8.992e+08 ± 5% -32.6% 6.062e+08 ± 3% perf-stat.i.instructions 0.57 -3.2% 0.55 perf-stat.i.ipc 0.24 ± 14% -27.6% 0.17 ± 23% perf-stat.i.metric.K/sec 6.17 ± 2% -22.8% 4.76 ± 3% perf-stat.overall.MPKI 1.95 ± 4% +0.8 2.76 ± 4% perf-stat.overall.branch-miss-rate% 21.85 ± 2% -5.5 16.37 ± 4% perf-stat.overall.cache-miss-rate% 304.50 ± 4% +23.8% 377.10 ± 4% perf-stat.overall.cycles-between-cache-misses 0.53 ± 2% +4.7% 0.56 ± 4% perf-stat.overall.ipc 1.975e+08 ± 5% -35.7% 1.27e+08 ± 3% perf-stat.ps.branch-instructions 3849663 -8.9% 3506691 ± 4% perf-stat.ps.branch-misses 5532498 ± 7% -48.0% 2879018 ± 5% perf-stat.ps.cache-misses 25301762 ± 6% -30.5% 17579581 perf-stat.ps.cache-references 1.68e+09 ± 4% -35.4% 1.084e+09 ± 5% perf-stat.ps.cpu-cycles 174.27 ± 3% -15.6% 147.10 ± 2% perf-stat.ps.cpu-migrations 8.961e+08 ± 5% -32.5% 6.045e+08 ± 3% perf-stat.ps.instructions 2.211e+11 ± 4% -32.0% 1.504e+11 ± 6% perf-stat.total.instructions 52.52 ± 12% -45.4 7.08 ± 20% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 51.90 ± 12% -45.0 6.86 ± 20% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm 79.10 ± 9% -43.9 35.15 ± 29% perf-profile.calltrace.cycles-pp.ret_from_fork.ret_from_fork_asm 79.10 ± 9% -43.9 35.15 ± 29% perf-profile.calltrace.cycles-pp.ret_from_fork_asm 79.10 ± 9% -43.9 35.15 ± 29% perf-profile.calltrace.cycles-pp.kthread.ret_from_fork.ret_from_fork_asm 42.11 ± 12% -42.1 0.00 perf-profile.calltrace.cycles-pp.rpc_async_release.process_one_work.worker_thread.kthread.ret_from_fork 42.11 ± 12% -42.1 0.00 perf-profile.calltrace.cycles-pp.rpc_free_task.rpc_async_release.process_one_work.worker_thread.kthread 28.67 ± 19% -28.7 0.00 perf-profile.calltrace.cycles-pp.nfs_write_completion.rpc_free_task.rpc_async_release.process_one_work.worker_thread 16.89 ± 31% -16.9 0.00 perf-profile.calltrace.cycles-pp.nfs_page_end_writeback.nfs_write_completion.rpc_free_task.rpc_async_release.process_one_work 16.28 ± 32% -16.3 0.00 perf-profile.calltrace.cycles-pp.folio_end_writeback.nfs_page_end_writeback.nfs_write_completion.rpc_free_task.rpc_async_release 15.82 ± 32% -15.8 0.00 perf-profile.calltrace.cycles-pp.__folio_end_writeback.folio_end_writeback.nfs_page_end_writeback.nfs_write_completion.rpc_free_task 13.99 ± 35% -14.0 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__folio_end_writeback.folio_end_writeback.nfs_page_end_writeback.nfs_write_completion 13.55 ± 35% -13.5 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__folio_end_writeback.folio_end_writeback.nfs_page_end_writeback 13.42 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp.nfs_commit_release.rpc_free_task.rpc_async_release.process_one_work.worker_thread 13.42 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp.nfs_commit_release_pages.nfs_commit_release.rpc_free_task.rpc_async_release.process_one_work 10.58 ± 16% -10.6 0.00 perf-profile.calltrace.cycles-pp.nfs_request_add_commit_list.nfs_write_completion.rpc_free_task.rpc_async_release.process_one_work 10.48 ± 15% -10.5 0.00 perf-profile.calltrace.cycles-pp.nfs_inode_remove_request.nfs_commit_release_pages.nfs_commit_release.rpc_free_task.rpc_async_release 8.89 ± 17% -8.9 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock.nfs_inode_remove_request.nfs_commit_release_pages.nfs_commit_release.rpc_free_task 8.19 ± 17% -8.2 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.nfs_inode_remove_request.nfs_commit_release_pages.nfs_commit_release 7.56 ± 24% -7.6 0.00 perf-profile.calltrace.cycles-pp.__mutex_lock.nfs_request_add_commit_list.nfs_write_completion.rpc_free_task.rpc_async_release 6.57 ± 27% -6.6 0.00 perf-profile.calltrace.cycles-pp.osq_lock.__mutex_lock.nfs_request_add_commit_list.nfs_write_completion.rpc_free_task 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit.nfsd4_commit 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit 4.45 ± 19% -2.1 2.31 ± 25% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc 4.05 ± 13% -1.3 2.74 ± 21% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle 0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.cpu_startup_entry.rest_init.start_kernel.x86_64_start_reservations.x86_64_start_kernel 0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.rest_init.start_kernel.x86_64_start_reservations 0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.rest_init.start_kernel.x86_64_start_reservations.x86_64_start_kernel.common_startup_64 0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.start_kernel.x86_64_start_reservations.x86_64_start_kernel.common_startup_64 0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.x86_64_start_kernel.common_startup_64 0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.x86_64_start_reservations.x86_64_start_kernel.common_startup_64 0.00 +0.6 0.59 ± 14% perf-profile.calltrace.cycles-pp.tick_nohz_idle_exit.do_idle.cpu_startup_entry.start_secondary.common_startup_64 0.00 +1.1 1.12 ± 34% perf-profile.calltrace.cycles-pp.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending.__flush_smp_call_function_queue 0.00 +1.2 1.16 ± 32% perf-profile.calltrace.cycles-pp.activate_task.ttwu_do_activate.sched_ttwu_pending.__flush_smp_call_function_queue.flush_smp_call_function_queue 0.31 ±102% +1.4 1.68 ± 15% perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.common_startup_64 0.22 ±141% +1.4 1.62 ± 15% perf-profile.calltrace.cycles-pp.__schedule.schedule_idle.do_idle.cpu_startup_entry.start_secondary 0.00 +1.6 1.60 ± 25% perf-profile.calltrace.cycles-pp.ttwu_do_activate.sched_ttwu_pending.__flush_smp_call_function_queue.flush_smp_call_function_queue.do_idle 0.12 ±223% +2.4 2.50 ± 23% perf-profile.calltrace.cycles-pp.sched_ttwu_pending.__flush_smp_call_function_queue.flush_smp_call_function_queue.do_idle.cpu_startup_entry 0.23 ±146% +3.1 3.36 ± 23% perf-profile.calltrace.cycles-pp.__flush_smp_call_function_queue.flush_smp_call_function_queue.do_idle.cpu_startup_entry.start_secondary 0.00 +3.1 3.13 ± 95% perf-profile.calltrace.cycles-pp.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range.file_write_and_wait_range.ext4_sync_file 0.00 +3.8 3.77 ± 94% perf-profile.calltrace.cycles-pp.folio_wait_writeback.__filemap_fdatawait_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit 1.51 ± 30% +4.3 5.78 ± 48% perf-profile.calltrace.cycles-pp.flush_smp_call_function_queue.do_idle.cpu_startup_entry.start_secondary.common_startup_64 0.00 +4.5 4.53 ± 94% perf-profile.calltrace.cycles-pp.__filemap_fdatawait_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit.nfsd4_commit 0.71 ± 24% +10.0 10.76 ± 40% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle 10.00 ± 10% +12.6 22.59 ± 13% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64 8.46 ± 10% +12.7 21.13 ± 14% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary 8.27 ± 11% +13.2 21.46 ± 14% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry 12.87 ± 10% +18.6 31.52 ± 18% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64 12.89 ± 10% +18.7 31.56 ± 18% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64 12.90 ± 10% +18.7 31.60 ± 18% perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64 13.40 ± 10% +19.0 32.35 ± 17% perf-profile.calltrace.cycles-pp.common_startup_64 52.52 ± 12% -45.4 7.08 ± 20% perf-profile.children.cycles-pp.worker_thread 51.90 ± 12% -45.0 6.86 ± 20% perf-profile.children.cycles-pp.process_one_work 79.10 ± 9% -43.9 35.15 ± 29% perf-profile.children.cycles-pp.kthread 79.12 ± 9% -43.9 35.18 ± 29% perf-profile.children.cycles-pp.ret_from_fork 79.12 ± 9% -43.9 35.18 ± 29% perf-profile.children.cycles-pp.ret_from_fork_asm 42.11 ± 12% -42.0 0.08 ± 19% perf-profile.children.cycles-pp.rpc_async_release 42.11 ± 12% -42.0 0.08 ± 19% perf-profile.children.cycles-pp.rpc_free_task 28.68 ± 19% -28.6 0.06 ± 19% perf-profile.children.cycles-pp.nfs_write_completion 23.08 ± 18% -21.5 1.54 ±113% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 16.89 ± 31% -16.9 0.00 perf-profile.children.cycles-pp.nfs_page_end_writeback 16.54 ± 31% -15.9 0.62 ± 42% perf-profile.children.cycles-pp.__folio_end_writeback 17.05 ± 31% -14.9 2.20 ± 93% perf-profile.children.cycles-pp.folio_end_writeback 15.65 ± 32% -14.4 1.30 ± 49% perf-profile.children.cycles-pp._raw_spin_lock_irqsave 13.42 ± 15% -13.4 0.00 perf-profile.children.cycles-pp.nfs_commit_release 13.42 ± 15% -13.4 0.00 perf-profile.children.cycles-pp.nfs_commit_release_pages 10.58 ± 16% -10.6 0.00 perf-profile.children.cycles-pp.nfs_request_add_commit_list 10.48 ± 15% -10.5 0.00 perf-profile.children.cycles-pp.nfs_inode_remove_request 9.54 ± 16% -7.8 1.70 ± 31% perf-profile.children.cycles-pp._raw_spin_lock 7.59 ± 24% -7.6 0.00 perf-profile.children.cycles-pp.__mutex_lock 7.33 ± 14% -4.7 2.62 ± 24% perf-profile.children.cycles-pp.do_writepages 2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.__writeback_inodes_wb 2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.__writeback_single_inode 2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.nfs_writepages 2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.wb_do_writeback 2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.wb_writeback 2.73 ± 25% -2.5 0.18 ± 48% perf-profile.children.cycles-pp.write_cache_pages 2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.writeback_sb_inodes 2.73 ± 25% -2.5 0.18 ± 48% perf-profile.children.cycles-pp.wb_workfn 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.__filemap_fdatawrite_range 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.ext4_do_writepages 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.ext4_writepages 4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.filemap_fdatawrite_wbc 4.46 ± 19% -2.1 2.32 ± 25% perf-profile.children.cycles-pp.mpage_prepare_extent_to_map 2.27 ± 28% -2.1 0.16 ± 46% perf-profile.children.cycles-pp.nfs_writepages_callback 2.20 ± 29% -2.0 0.16 ± 46% perf-profile.children.cycles-pp.nfs_page_async_flush 4.16 ± 12% -1.2 2.94 ± 21% perf-profile.children.cycles-pp.intel_idle 1.48 ± 28% -0.8 0.68 ± 22% perf-profile.children.cycles-pp.__folio_start_writeback 1.30 ± 14% -0.7 0.58 ± 19% perf-profile.children.cycles-pp.sched_balance_newidle 0.85 ± 29% -0.7 0.15 ± 37% perf-profile.children.cycles-pp.kmem_cache_free 1.44 ± 13% -0.7 0.77 ± 19% perf-profile.children.cycles-pp.sched_balance_rq 1.31 ± 13% -0.7 0.65 ± 18% perf-profile.children.cycles-pp.sched_balance_find_src_group 1.30 ± 14% -0.6 0.64 ± 17% perf-profile.children.cycles-pp.update_sd_lb_stats 1.16 ± 13% -0.6 0.57 ± 19% perf-profile.children.cycles-pp.update_sg_lb_stats 0.79 ± 22% -0.6 0.22 ±121% perf-profile.children.cycles-pp.nfs_folio_find_private_request 1.25 ± 13% -0.5 0.77 ± 22% perf-profile.children.cycles-pp.pick_next_task_fair 0.55 ± 20% -0.3 0.22 ± 39% perf-profile.children.cycles-pp.tick_nohz_stop_tick 0.56 ± 20% -0.3 0.23 ± 37% perf-profile.children.cycles-pp.tick_nohz_idle_stop_tick 0.34 ± 29% -0.3 0.05 ± 76% perf-profile.children.cycles-pp.__slab_free 0.68 ± 21% -0.3 0.40 ± 24% perf-profile.children.cycles-pp.__lruvec_stat_mod_folio 0.34 ± 25% -0.3 0.08 ± 27% perf-profile.children.cycles-pp.__xa_clear_mark 0.54 ± 10% -0.2 0.30 ± 17% perf-profile.children.cycles-pp.percpu_counter_add_batch 0.22 ± 15% -0.2 0.03 ± 70% perf-profile.children.cycles-pp.seq_read 0.38 ± 11% -0.2 0.20 ± 12% perf-profile.children.cycles-pp.seq_read_iter 0.21 ± 15% -0.2 0.04 ±102% perf-profile.children.cycles-pp.blk_mq_run_work_fn 0.62 ± 8% -0.2 0.45 ± 9% perf-profile.children.cycles-pp.ksys_read 0.24 ± 12% -0.2 0.07 ± 55% perf-profile.children.cycles-pp.blk_mq_sched_dispatch_requests 0.23 ± 12% -0.2 0.07 ± 55% perf-profile.children.cycles-pp.__blk_mq_sched_dispatch_requests 0.25 ± 22% -0.2 0.10 ± 25% perf-profile.children.cycles-pp.submit_bio_noacct_nocheck 0.22 ± 15% -0.2 0.07 ± 55% perf-profile.children.cycles-pp.__blk_mq_do_dispatch_sched 0.66 ± 8% -0.1 0.51 ± 9% perf-profile.children.cycles-pp.read 0.24 ± 25% -0.1 0.10 ± 26% perf-profile.children.cycles-pp.__submit_bio 0.24 ± 25% -0.1 0.10 ± 26% perf-profile.children.cycles-pp.blk_mq_submit_bio 0.21 ± 19% -0.1 0.07 ± 31% perf-profile.children.cycles-pp.__common_interrupt 0.19 ± 21% -0.1 0.05 ± 50% perf-profile.children.cycles-pp.__handle_irq_event_percpu 0.21 ± 20% -0.1 0.07 ± 28% perf-profile.children.cycles-pp.handle_edge_irq 0.20 ± 21% -0.1 0.06 ± 53% perf-profile.children.cycles-pp.handle_irq_event 0.18 ± 22% -0.1 0.05 ± 50% perf-profile.children.cycles-pp.ahci_single_level_irq_intr 0.24 ± 33% -0.1 0.10 ± 13% perf-profile.children.cycles-pp.idle_cpu 0.24 ± 18% -0.1 0.12 ± 30% perf-profile.children.cycles-pp.__get_next_timer_interrupt 0.28 ± 17% -0.1 0.16 ± 26% perf-profile.children.cycles-pp.__mod_memcg_lruvec_state 0.16 ± 26% -0.1 0.04 ± 73% perf-profile.children.cycles-pp.ahci_handle_port_intr 0.28 ± 22% -0.1 0.17 ± 10% perf-profile.children.cycles-pp.svc_send 0.28 ± 24% -0.1 0.17 ± 9% perf-profile.children.cycles-pp.svc_tcp_sendto 0.23 ± 19% -0.1 0.12 ± 19% perf-profile.children.cycles-pp.__mod_lruvec_state 0.16 ± 11% -0.1 0.06 ± 47% perf-profile.children.cycles-pp.blk_mq_dispatch_rq_list 0.27 ± 10% -0.1 0.17 ± 30% perf-profile.children.cycles-pp.tick_nohz_restart_sched_tick 0.14 ± 17% -0.1 0.04 ±112% perf-profile.children.cycles-pp.nfs_unlock_and_release_request 0.16 ± 13% -0.1 0.06 ± 47% perf-profile.children.cycles-pp.scsi_queue_rq 0.16 ± 22% -0.1 0.06 ± 50% perf-profile.children.cycles-pp.blk_mq_flush_plug_list 0.25 ± 26% -0.1 0.16 ± 4% perf-profile.children.cycles-pp.svc_tcp_sendmsg 0.19 ± 24% -0.1 0.10 ± 12% perf-profile.children.cycles-pp.__mod_node_page_state 0.16 ± 20% -0.1 0.06 ± 50% perf-profile.children.cycles-pp.blk_mq_dispatch_plug_list 0.21 ± 13% -0.1 0.13 ± 33% perf-profile.children.cycles-pp.tick_nohz_next_event 0.18 ± 25% -0.1 0.11 ± 25% perf-profile.children.cycles-pp.xas_start 0.11 ± 38% -0.1 0.04 ± 75% perf-profile.children.cycles-pp.inode_to_bdi 0.14 ± 23% -0.1 0.07 ± 52% perf-profile.children.cycles-pp.kick_pool 0.16 ± 27% -0.1 0.10 ± 15% perf-profile.children.cycles-pp.xs_stream_data_receive_workfn 0.19 ± 18% -0.1 0.12 ± 23% perf-profile.children.cycles-pp.__queue_work 0.10 ± 10% -0.1 0.04 ± 71% perf-profile.children.cycles-pp._find_next_and_bit 0.09 ± 11% -0.1 0.03 ±102% perf-profile.children.cycles-pp.hrtimer_cancel 0.17 ± 17% -0.1 0.12 ± 25% perf-profile.children.cycles-pp.sched_balance_update_blocked_averages 0.09 ± 18% -0.1 0.04 ±105% perf-profile.children.cycles-pp.__lock_sock 0.12 ± 10% -0.1 0.06 ± 45% perf-profile.children.cycles-pp.__hrtimer_start_range_ns 0.12 ± 19% -0.0 0.07 ± 34% perf-profile.children.cycles-pp.lock_sock_nested 0.04 ± 73% +0.0 0.09 ± 28% perf-profile.children.cycles-pp.copy_mc_enhanced_fast_string 0.07 ± 14% +0.1 0.12 ± 22% perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags 0.01 ±223% +0.1 0.06 ± 21% perf-profile.children.cycles-pp.pick_next_task_idle 0.00 +0.1 0.06 ± 19% perf-profile.children.cycles-pp.sched_clock_noinstr 0.08 ± 20% +0.1 0.14 ± 20% perf-profile.children.cycles-pp.ct_kernel_enter 0.03 ±143% +0.1 0.10 ± 23% perf-profile.children.cycles-pp.__collapse_huge_page_copy 0.00 +0.1 0.07 ± 15% perf-profile.children.cycles-pp.__dequeue_entity 0.02 ±141% +0.1 0.08 ± 23% perf-profile.children.cycles-pp.nohz_run_idle_balance 0.10 ± 17% +0.1 0.17 ± 18% perf-profile.children.cycles-pp.cpuidle_governor_latency_req 0.12 ± 19% +0.1 0.19 ± 17% perf-profile.children.cycles-pp.update_curr 0.12 ± 15% +0.1 0.20 ± 12% perf-profile.children.cycles-pp.read_tsc 0.02 ±142% +0.1 0.11 ± 24% perf-profile.children.cycles-pp.call_cpuidle 0.01 ±223% +0.1 0.10 ± 20% perf-profile.children.cycles-pp.avg_vruntime 0.00 +0.1 0.10 ± 34% perf-profile.children.cycles-pp.place_entity 0.02 ±142% +0.1 0.12 ± 23% perf-profile.children.cycles-pp.ct_kernel_exit_state 0.06 ± 63% +0.1 0.17 ± 37% perf-profile.children.cycles-pp.__update_load_avg_se 0.12 ± 14% +0.1 0.23 ± 12% perf-profile.children.cycles-pp.___perf_sw_event 0.02 ±144% +0.1 0.16 ± 69% perf-profile.children.cycles-pp.llist_add_batch 0.08 ± 64% +0.2 0.23 ± 17% perf-profile.children.cycles-pp.set_next_entity 0.02 ±223% +0.2 0.18 ± 24% perf-profile.children.cycles-pp.__wrgsbase_inactive 0.04 ±101% +0.2 0.22 ± 7% perf-profile.children.cycles-pp.tick_nohz_idle_enter 0.13 ± 20% +0.2 0.33 ± 15% perf-profile.children.cycles-pp.prepare_task_switch 0.15 ± 15% +0.2 0.35 ± 20% perf-profile.children.cycles-pp.sched_clock_cpu 0.11 ± 10% +0.2 0.30 ± 16% perf-profile.children.cycles-pp.local_clock_noinstr 0.40 ± 11% +0.2 0.62 ± 14% perf-profile.children.cycles-pp.tick_nohz_idle_exit 0.09 ± 23% +0.2 0.32 ± 21% perf-profile.children.cycles-pp.tick_nohz_stop_idle 0.07 ± 50% +0.2 0.31 ± 21% perf-profile.children.cycles-pp.nr_iowait_cpu 0.11 ± 24% +0.2 0.35 ± 6% perf-profile.children.cycles-pp.__switch_to_asm 0.15 ± 23% +0.2 0.39 ± 48% perf-profile.children.cycles-pp.update_rq_clock_task 0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.rest_init 0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.start_kernel 0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.x86_64_start_kernel 0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.x86_64_start_reservations 0.09 ± 28% +0.3 0.35 ± 18% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq 0.10 ± 81% +0.3 0.36 ± 36% perf-profile.children.cycles-pp.syscall_exit_to_user_mode 0.08 ± 32% +0.3 0.36 ± 46% perf-profile.children.cycles-pp.wake_affine 0.08 ± 33% +0.3 0.40 ± 44% perf-profile.children.cycles-pp.available_idle_cpu 0.22 ± 22% +0.4 0.60 ± 16% perf-profile.children.cycles-pp.dequeue_entity 0.21 ± 21% +0.4 0.61 ± 31% perf-profile.children.cycles-pp.select_task_rq_fair 0.12 ± 30% +0.4 0.52 ± 10% perf-profile.children.cycles-pp.__switch_to 0.26 ± 17% +0.4 0.67 ± 16% perf-profile.children.cycles-pp.dequeue_task_fair 0.22 ± 32% +0.5 0.68 ± 18% perf-profile.children.cycles-pp.update_load_avg 0.17 ± 44% +0.5 0.66 ± 19% perf-profile.children.cycles-pp.sched_clock 0.14 ± 25% +0.5 0.62 ± 18% perf-profile.children.cycles-pp.update_rq_clock 1.82 ± 9% +0.5 2.35 ± 12% perf-profile.children.cycles-pp.schedule 0.10 ± 50% +0.6 0.70 ± 20% perf-profile.children.cycles-pp.llist_reverse_order 0.22 ± 31% +0.6 0.83 ± 11% perf-profile.children.cycles-pp.native_sched_clock 0.19 ± 41% +0.6 0.83 ± 19% perf-profile.children.cycles-pp.select_task_rq 0.11 ± 30% +0.7 0.82 ± 40% perf-profile.children.cycles-pp.call_function_single_prep_ipi 0.24 ± 31% +0.8 1.01 ± 32% perf-profile.children.cycles-pp.enqueue_entity 0.15 ± 29% +0.8 1.00 ± 42% perf-profile.children.cycles-pp.__smp_call_single_queue 0.29 ± 26% +0.9 1.19 ± 31% perf-profile.children.cycles-pp.enqueue_task_fair 0.31 ± 23% +0.9 1.24 ± 30% perf-profile.children.cycles-pp.activate_task 0.18 ± 30% +1.0 1.16 ± 40% perf-profile.children.cycles-pp.ttwu_queue_wakelist 0.21 ± 26% +1.1 1.32 ±130% perf-profile.children.cycles-pp._raw_spin_lock_irq 0.58 ± 17% +1.1 1.73 ± 14% perf-profile.children.cycles-pp.schedule_idle 0.35 ± 24% +1.3 1.66 ± 24% perf-profile.children.cycles-pp.ttwu_do_activate 2.40 ± 8% +1.6 3.98 ± 9% perf-profile.children.cycles-pp.__schedule 0.76 ± 14% +1.7 2.45 ± 30% perf-profile.children.cycles-pp.try_to_wake_up 0.43 ± 33% +2.1 2.53 ± 22% perf-profile.children.cycles-pp.sched_ttwu_pending 1.10 ± 19% +2.3 3.37 ± 85% perf-profile.children.cycles-pp.folio_wait_bit_common 4.11 ± 24% +2.7 6.81 ± 14% perf-profile.children.cycles-pp.rwsem_spin_on_owner 0.56 ± 32% +2.8 3.40 ± 22% perf-profile.children.cycles-pp.__flush_smp_call_function_queue 0.93 ±159% +3.6 4.53 ± 89% perf-profile.children.cycles-pp.intel_idle_irq 0.00 +3.8 3.78 ± 94% perf-profile.children.cycles-pp.folio_wait_writeback 1.51 ± 30% +4.3 5.79 ± 48% perf-profile.children.cycles-pp.flush_smp_call_function_queue 0.00 +4.5 4.54 ± 94% perf-profile.children.cycles-pp.__filemap_fdatawait_range 0.72 ± 24% +10.0 10.77 ± 40% perf-profile.children.cycles-pp.poll_idle 10.42 ± 9% +12.8 23.27 ± 13% perf-profile.children.cycles-pp.cpuidle_idle_call 8.76 ± 10% +12.9 21.66 ± 14% perf-profile.children.cycles-pp.cpuidle_enter_state 8.81 ± 10% +12.9 21.73 ± 14% perf-profile.children.cycles-pp.cpuidle_enter 12.90 ± 10% +18.7 31.60 ± 18% perf-profile.children.cycles-pp.start_secondary 13.38 ± 10% +18.9 32.30 ± 17% perf-profile.children.cycles-pp.do_idle 13.40 ± 10% +19.0 32.35 ± 17% perf-profile.children.cycles-pp.common_startup_64 13.40 ± 10% +19.0 32.35 ± 17% perf-profile.children.cycles-pp.cpu_startup_entry 23.04 ± 18% -21.5 1.54 ±112% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 4.16 ± 12% -1.2 2.94 ± 21% perf-profile.self.cycles-pp.intel_idle 1.38 ± 16% -1.2 0.17 ± 70% perf-profile.self.cycles-pp.__folio_end_writeback 0.70 ± 22% -0.6 0.08 ± 84% perf-profile.self.cycles-pp.nfs_folio_find_private_request 0.87 ± 15% -0.4 0.44 ± 19% perf-profile.self.cycles-pp.update_sg_lb_stats 0.50 ± 21% -0.4 0.10 ±105% perf-profile.self.cycles-pp.folio_end_writeback 0.37 ± 29% -0.3 0.04 ±105% perf-profile.self.cycles-pp.kmem_cache_free 0.33 ± 30% -0.3 0.04 ±102% perf-profile.self.cycles-pp.__slab_free 0.49 ± 11% -0.2 0.26 ± 17% perf-profile.self.cycles-pp.percpu_counter_add_batch 0.26 ± 32% -0.2 0.10 ± 34% perf-profile.self.cycles-pp.mpage_prepare_extent_to_map 0.30 ± 26% -0.1 0.17 ± 33% perf-profile.self.cycles-pp.__folio_start_writeback 0.20 ± 20% -0.1 0.10 ± 14% perf-profile.self.cycles-pp.idle_cpu 0.18 ± 24% -0.1 0.09 ± 15% perf-profile.self.cycles-pp.__mod_node_page_state 0.20 ± 22% -0.1 0.11 ± 25% perf-profile.self.cycles-pp.__mod_memcg_lruvec_state 0.09 ± 17% -0.1 0.03 ±100% perf-profile.self.cycles-pp._find_next_and_bit 0.04 ± 73% +0.0 0.09 ± 28% perf-profile.self.cycles-pp.copy_mc_enhanced_fast_string 0.12 ± 16% +0.1 0.20 ± 15% perf-profile.self.cycles-pp.read_tsc 0.02 ±141% +0.1 0.09 ± 30% perf-profile.self.cycles-pp.sched_ttwu_pending 0.02 ±141% +0.1 0.09 ± 35% perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags 0.01 ±223% +0.1 0.08 ± 25% perf-profile.self.cycles-pp.schedule 0.01 ±223% +0.1 0.09 ± 30% perf-profile.self.cycles-pp.select_task_rq_fair 0.00 +0.1 0.08 ± 19% perf-profile.self.cycles-pp.nohz_run_idle_balance 0.03 ±101% +0.1 0.12 ± 20% perf-profile.self.cycles-pp.dequeue_entity 0.09 ± 26% +0.1 0.18 ± 16% perf-profile.self.cycles-pp.___perf_sw_event 0.01 ±223% +0.1 0.10 ± 17% perf-profile.self.cycles-pp.avg_vruntime 0.02 ±141% +0.1 0.10 ± 26% perf-profile.self.cycles-pp.call_cpuidle 0.00 +0.1 0.09 ± 24% perf-profile.self.cycles-pp.cpu_startup_entry 0.05 ± 88% +0.1 0.15 ± 36% perf-profile.self.cycles-pp.__update_load_avg_se 0.02 ±141% +0.1 0.12 ± 22% perf-profile.self.cycles-pp.ct_kernel_exit_state 0.06 ± 52% +0.1 0.17 ± 77% perf-profile.self.cycles-pp.ext4_finish_bio 0.02 ±142% +0.1 0.12 ± 36% perf-profile.self.cycles-pp.prepare_task_switch 0.02 ±141% +0.1 0.13 ± 27% perf-profile.self.cycles-pp.pick_next_task_fair 0.02 ±141% +0.1 0.13 ± 41% perf-profile.self.cycles-pp.ttwu_queue_wakelist 0.05 ± 79% +0.1 0.16 ± 20% perf-profile.self.cycles-pp.generic_perform_write 0.07 ± 15% +0.1 0.19 ± 24% perf-profile.self.cycles-pp.update_load_avg 0.09 ± 14% +0.1 0.22 ± 21% perf-profile.self.cycles-pp.cpuidle_idle_call 0.04 ± 73% +0.1 0.18 ± 45% perf-profile.self.cycles-pp.enqueue_task_fair 0.02 ±144% +0.1 0.16 ± 71% perf-profile.self.cycles-pp.llist_add_batch 0.01 ±223% +0.2 0.17 ± 47% perf-profile.self.cycles-pp.__flush_smp_call_function_queue 0.02 ±223% +0.2 0.18 ± 24% perf-profile.self.cycles-pp.__wrgsbase_inactive 0.05 ± 46% +0.2 0.23 ± 28% perf-profile.self.cycles-pp.enqueue_entity 0.05 ± 74% +0.2 0.26 ± 23% perf-profile.self.cycles-pp.do_idle 0.10 ± 23% +0.2 0.34 ± 8% perf-profile.self.cycles-pp.__switch_to_asm 0.07 ± 50% +0.2 0.31 ± 21% perf-profile.self.cycles-pp.nr_iowait_cpu 0.09 ± 28% +0.2 0.34 ± 17% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq 0.11 ± 26% +0.2 0.36 ± 54% perf-profile.self.cycles-pp.update_rq_clock_task 0.03 ±163% +0.3 0.30 ± 35% perf-profile.self.cycles-pp.select_task_rq 0.07 ± 50% +0.3 0.36 ± 91% perf-profile.self.cycles-pp._raw_spin_lock_irq 0.00 +0.3 0.29 ± 98% perf-profile.self.cycles-pp.__filemap_fdatawait_range 0.08 ± 33% +0.3 0.39 ± 45% perf-profile.self.cycles-pp.available_idle_cpu 0.02 ±141% +0.3 0.34 ± 27% perf-profile.self.cycles-pp.flush_smp_call_function_queue 0.06 ± 57% +0.4 0.42 ± 18% perf-profile.self.cycles-pp.update_rq_clock 0.14 ± 22% +0.4 0.50 ± 6% perf-profile.self.cycles-pp.__schedule 0.12 ± 33% +0.4 0.50 ± 11% perf-profile.self.cycles-pp.__switch_to 0.01 ±223% +0.4 0.41 ± 55% perf-profile.self.cycles-pp.ttwu_do_activate 0.22 ± 28% +0.6 0.80 ± 11% perf-profile.self.cycles-pp.native_sched_clock 0.10 ± 50% +0.6 0.70 ± 20% perf-profile.self.cycles-pp.llist_reverse_order 0.00 +0.6 0.62 ± 89% perf-profile.self.cycles-pp.folio_wait_writeback 0.11 ± 30% +0.7 0.82 ± 41% perf-profile.self.cycles-pp.call_function_single_prep_ipi 0.20 ± 20% +1.1 1.27 ± 73% perf-profile.self.cycles-pp.folio_wait_bit_common 0.91 ±163% +3.6 4.50 ± 89% perf-profile.self.cycles-pp.intel_idle_irq 0.71 ± 25% +9.9 10.64 ± 40% perf-profile.self.cycles-pp.poll_idle Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki