hi, Shakeel Butt, below report is just FYI that we observed expected performance improvement by this commit when it merges into mainline. Hello, kernel test robot noticed a 12.7% improvement of will-it-scale.per_thread_ops on: commit: fefc6e6631ff43427e81f08c8e49f7787ff0213a ("memcg: rearrange fields of mem_cgroup_per_node") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master testcase: will-it-scale test machine: 104 threads 2 sockets (Skylake) with 192G memory parameters: nr_task: 100% mode: thread test: fallocate1 cpufreq_governor: performance Details are as below: --------------------------------------------------------------------------------------------------> The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20240808/202408081325.cba32176-oliver.sang@xxxxxxxxx ========================================================================================= compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase: gcc-13/performance/x86_64-rhel-8.3/thread/100%/debian-12-x86_64-20240206.cgz/lkp-skl-fpga01/fallocate1/will-it-scale commit: 16540dae95 ("mm/hugetlb: mm/memory_hotplug: use a folio in scan_movable_pages()") fefc6e6631 ("memcg: rearrange fields of mem_cgroup_per_node") 16540dae959d8629 fefc6e6631ff43427e81f08c8e4 ---------------- --------------------------- %stddev %change %stddev \ | \ 3379 +3.7% 3505 vmstat.system.cs 0.03 +0.0 0.03 mpstat.cpu.all.soft% 2.43 ± 2% +0.3 2.74 mpstat.cpu.all.usr% 1.175e+09 ± 4% +12.7% 1.324e+09 ± 2% numa-numastat.node1.local_node 1.176e+09 ± 4% +12.7% 1.325e+09 ± 2% numa-numastat.node1.numa_hit 1.176e+09 ± 4% +12.7% 1.325e+09 ± 2% numa-vmstat.node1.numa_hit 1.175e+09 ± 4% +12.7% 1.324e+09 ± 2% numa-vmstat.node1.numa_local 512.33 ± 7% +18.1% 605.00 ± 4% perf-c2c.DRAM.remote 11617 ± 3% -36.0% 7432 ± 3% perf-c2c.HITM.local 11729 ± 3% -35.5% 7560 ± 3% perf-c2c.HITM.total 3888543 ± 2% +12.7% 4381027 will-it-scale.104.threads 37389 ± 2% +12.7% 42124 will-it-scale.per_thread_ops 3888543 ± 2% +12.7% 4381027 will-it-scale.workload 108778 +1.9% 110794 proc-vmstat.nr_active_anon 125909 +1.6% 127870 proc-vmstat.nr_shmem 108778 +1.9% 110794 proc-vmstat.nr_zone_active_anon 2.346e+09 ± 2% +12.7% 2.645e+09 proc-vmstat.numa_hit 2.346e+09 ± 2% +12.7% 2.644e+09 proc-vmstat.numa_local 2.344e+09 ± 2% +12.7% 2.642e+09 proc-vmstat.pgalloc_normal 2.344e+09 ± 2% +12.7% 2.642e+09 proc-vmstat.pgfree 0.38 ± 3% +14.5% 0.44 ± 3% perf-stat.i.MPKI 9.03e+09 ± 2% +9.8% 9.913e+09 perf-stat.i.branch-instructions 62094186 ± 2% +9.4% 67900586 perf-stat.i.branch-misses 9.12 ± 12% +3.4 12.55 ± 4% perf-stat.i.cache-miss-rate% 16493860 ± 4% +26.3% 20825485 ± 3% perf-stat.i.cache-misses 3351 +3.8% 3478 perf-stat.i.context-switches 6.65 ± 2% -9.4% 6.02 perf-stat.i.cpi 17670 ± 4% -21.0% 13968 ± 3% perf-stat.i.cycles-between-cache-misses 4.357e+10 ± 2% +10.3% 4.806e+10 perf-stat.i.instructions 0.15 ± 2% +10.2% 0.17 perf-stat.i.ipc 0.38 ± 3% +14.6% 0.43 ± 4% perf-stat.overall.MPKI 9.08 ± 12% +3.4 12.53 ± 4% perf-stat.overall.cache-miss-rate% 6.65 ± 2% -9.4% 6.03 perf-stat.overall.cpi 17578 ± 4% -20.9% 13910 ± 3% perf-stat.overall.cycles-between-cache-misses 0.15 ± 2% +10.3% 0.17 perf-stat.overall.ipc 3381489 -2.1% 3309583 perf-stat.overall.path-length 8.999e+09 ± 2% +9.8% 9.879e+09 perf-stat.ps.branch-instructions 61855583 ± 2% +9.4% 67639923 perf-stat.ps.branch-misses 16442095 ± 4% +26.3% 20763410 ± 3% perf-stat.ps.cache-misses 3339 +3.8% 3465 perf-stat.ps.context-switches 4.343e+10 ± 2% +10.3% 4.79e+10 perf-stat.ps.instructions 1.315e+13 ± 2% +10.3% 1.45e+13 perf-stat.total.instructions 41.20 -2.4 38.78 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.__page_cache_release.folios_put_refs.shmem_undo_range.shmem_setattr 41.17 -2.4 38.76 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.__page_cache_release.folios_put_refs 41.18 -2.4 38.78 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.__page_cache_release.folios_put_refs.shmem_undo_range 42.62 -2.4 40.24 perf-profile.calltrace.cycles-pp.folios_put_refs.shmem_undo_range.shmem_setattr.notify_change.do_truncate 50.27 -2.3 47.96 perf-profile.calltrace.cycles-pp.shmem_undo_range.shmem_setattr.notify_change.do_truncate.do_ftruncate 50.30 -2.3 47.99 perf-profile.calltrace.cycles-pp.do_ftruncate.__x64_sys_ftruncate.do_syscall_64.entry_SYSCALL_64_after_hwframe.ftruncate64 50.29 -2.3 47.98 perf-profile.calltrace.cycles-pp.notify_change.do_truncate.do_ftruncate.__x64_sys_ftruncate.do_syscall_64 50.28 -2.3 47.98 perf-profile.calltrace.cycles-pp.shmem_setattr.notify_change.do_truncate.do_ftruncate.__x64_sys_ftruncate 50.30 -2.3 47.99 perf-profile.calltrace.cycles-pp.__x64_sys_ftruncate.do_syscall_64.entry_SYSCALL_64_after_hwframe.ftruncate64 50.30 -2.3 48.00 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.ftruncate64 50.29 -2.3 47.99 perf-profile.calltrace.cycles-pp.do_truncate.do_ftruncate.__x64_sys_ftruncate.do_syscall_64.entry_SYSCALL_64_after_hwframe 50.30 -2.3 48.00 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.ftruncate64 50.33 -2.3 48.03 perf-profile.calltrace.cycles-pp.ftruncate64 41.67 -2.3 39.39 perf-profile.calltrace.cycles-pp.__page_cache_release.folios_put_refs.shmem_undo_range.shmem_setattr.notify_change 4.78 -0.2 4.60 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.shmem_undo_range 4.77 -0.2 4.60 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release 4.77 -0.2 4.60 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu 4.83 -0.2 4.66 perf-profile.calltrace.cycles-pp.lru_add_drain_cpu.__folio_batch_release.shmem_undo_range.shmem_setattr.notify_change 4.83 -0.2 4.67 perf-profile.calltrace.cycles-pp.__folio_batch_release.shmem_undo_range.shmem_setattr.notify_change.do_truncate 4.82 -0.2 4.66 perf-profile.calltrace.cycles-pp.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.shmem_undo_range.shmem_setattr 0.58 +0.1 0.68 ± 2% perf-profile.calltrace.cycles-pp.__alloc_pages_noprof.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.shmem_alloc_folio.shmem_alloc_and_add_folio 1.90 ± 3% +0.1 2.00 ± 2% perf-profile.calltrace.cycles-pp.__filemap_remove_folio.filemap_remove_folio.truncate_inode_folio.shmem_undo_range.shmem_setattr 0.74 +0.1 0.86 ± 2% perf-profile.calltrace.cycles-pp.alloc_pages_mpol_noprof.folio_alloc_mpol_noprof.shmem_alloc_folio.shmem_alloc_and_add_folio.shmem_get_folio_gfp 0.76 +0.1 0.89 ± 2% perf-profile.calltrace.cycles-pp.folio_alloc_mpol_noprof.shmem_alloc_folio.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate 2.12 ± 3% +0.1 2.26 perf-profile.calltrace.cycles-pp.filemap_remove_folio.truncate_inode_folio.shmem_undo_range.shmem_setattr.notify_change 0.74 +0.1 0.88 ± 2% perf-profile.calltrace.cycles-pp.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate 0.88 +0.1 1.02 perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.fallocate64 0.91 +0.2 1.06 ± 2% perf-profile.calltrace.cycles-pp.shmem_alloc_folio.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate 1.24 +0.2 1.40 ± 2% perf-profile.calltrace.cycles-pp.__lruvec_stat_mod_folio.shmem_add_to_page_cache.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate 0.97 +0.2 1.14 ± 2% perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.fallocate64 2.42 ± 3% +0.2 2.60 perf-profile.calltrace.cycles-pp.truncate_inode_folio.shmem_undo_range.shmem_setattr.notify_change.do_truncate 1.87 +0.3 2.14 perf-profile.calltrace.cycles-pp.shmem_add_to_page_cache.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate 0.93 ± 4% +0.4 1.32 ± 3% perf-profile.calltrace.cycles-pp.__mem_cgroup_charge.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate 0.17 ±141% +0.5 0.64 perf-profile.calltrace.cycles-pp.lru_add_fn.folio_batch_move_lru.folio_add_lru.shmem_alloc_and_add_folio.shmem_get_folio_gfp 0.08 ±223% +0.6 0.64 ± 5% perf-profile.calltrace.cycles-pp.get_mem_cgroup_from_mm.__mem_cgroup_charge.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate 0.08 ±223% +0.6 0.73 ± 7% perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.__lruvec_stat_mod_folio.filemap_unaccount_folio.__filemap_remove_folio.filemap_remove_folio 0.00 +0.8 0.76 ± 3% perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.__lruvec_stat_mod_folio.shmem_add_to_page_cache.shmem_alloc_and_add_folio.shmem_get_folio_gfp 44.65 +1.5 46.16 perf-profile.calltrace.cycles-pp.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate 45.05 +1.6 46.62 perf-profile.calltrace.cycles-pp.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64 45.84 +1.7 47.56 perf-profile.calltrace.cycles-pp.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe 46.08 +1.7 47.83 perf-profile.calltrace.cycles-pp.vfs_fallocate.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64 46.28 +1.8 48.05 perf-profile.calltrace.cycles-pp.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64 46.42 +1.8 48.20 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64 46.96 +1.9 48.83 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fallocate64 49.32 +2.3 51.59 perf-profile.calltrace.cycles-pp.fallocate64 42.73 -2.4 40.38 perf-profile.children.cycles-pp.folios_put_refs 50.30 -2.3 47.99 perf-profile.children.cycles-pp.do_ftruncate 50.29 -2.3 47.98 perf-profile.children.cycles-pp.notify_change 50.28 -2.3 47.98 perf-profile.children.cycles-pp.shmem_setattr 50.27 -2.3 47.96 perf-profile.children.cycles-pp.shmem_undo_range 50.30 -2.3 47.99 perf-profile.children.cycles-pp.__x64_sys_ftruncate 50.29 -2.3 47.99 perf-profile.children.cycles-pp.do_truncate 50.33 -2.3 48.03 perf-profile.children.cycles-pp.ftruncate64 41.70 -2.3 39.43 perf-profile.children.cycles-pp.__page_cache_release 84.80 -2.3 82.54 perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave 84.78 -2.2 82.54 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 84.77 -2.2 82.53 perf-profile.children.cycles-pp._raw_spin_lock_irqsave 96.84 -0.5 96.31 perf-profile.children.cycles-pp.do_syscall_64 97.39 -0.4 96.95 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe 4.85 -0.2 4.68 perf-profile.children.cycles-pp.lru_add_drain_cpu 4.83 -0.2 4.67 perf-profile.children.cycles-pp.__folio_batch_release 0.41 ± 3% -0.0 0.38 ± 2% perf-profile.children.cycles-pp.__mod_lruvec_state 0.05 +0.0 0.06 perf-profile.children.cycles-pp.rcu_all_qs 0.05 +0.0 0.06 perf-profile.children.cycles-pp.syscall_exit_to_user_mode 0.07 +0.0 0.08 ± 4% perf-profile.children.cycles-pp.xas_create 0.06 ± 6% +0.0 0.08 ± 6% perf-profile.children.cycles-pp.__errno_location 0.09 ± 4% +0.0 0.10 ± 3% perf-profile.children.cycles-pp._raw_spin_trylock 0.08 ± 4% +0.0 0.09 ± 4% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack 0.12 ± 4% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.file_modified 0.08 ± 8% +0.0 0.10 ± 4% perf-profile.children.cycles-pp.security_file_permission 0.09 ± 4% +0.0 0.10 ± 4% perf-profile.children.cycles-pp.inode_add_bytes 0.12 ± 4% +0.0 0.13 ± 3% perf-profile.children.cycles-pp.folio_unlock 0.10 ± 5% +0.0 0.11 ± 4% perf-profile.children.cycles-pp.xas_find_conflict 0.04 ± 44% +0.0 0.06 perf-profile.children.cycles-pp.__rmqueue_pcplist 0.06 ± 7% +0.0 0.08 ± 5% perf-profile.children.cycles-pp.__mod_zone_page_state 0.10 ± 4% +0.0 0.12 ± 3% perf-profile.children.cycles-pp.uncharge_folio 0.10 ± 5% +0.0 0.12 ± 6% perf-profile.children.cycles-pp.xas_start 0.12 ± 3% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.try_charge_memcg 0.12 +0.0 0.14 ± 2% perf-profile.children.cycles-pp.noop_dirty_folio 0.14 ± 3% +0.0 0.17 ± 2% perf-profile.children.cycles-pp.__fdget 0.12 ± 4% +0.0 0.14 ± 2% perf-profile.children.cycles-pp.shmem_recalc_inode 0.17 ± 5% +0.0 0.19 ± 3% perf-profile.children.cycles-pp.xas_init_marks 0.08 ± 17% +0.0 0.10 ± 9% perf-profile.children.cycles-pp.mem_cgroup_update_lru_size 0.16 ± 2% +0.0 0.19 ± 3% perf-profile.children.cycles-pp.__cond_resched 0.14 ± 3% +0.0 0.16 ± 4% perf-profile.children.cycles-pp.uncharge_batch 0.22 ± 5% +0.0 0.25 ± 2% perf-profile.children.cycles-pp.xas_clear_mark 0.10 ± 8% +0.0 0.13 ± 5% perf-profile.children.cycles-pp.cap_vm_enough_memory 0.18 ± 3% +0.0 0.21 ± 2% perf-profile.children.cycles-pp.filemap_get_entry 0.18 ± 14% +0.0 0.21 perf-profile.children.cycles-pp._raw_spin_lock_irq 0.21 ± 4% +0.0 0.24 perf-profile.children.cycles-pp.truncate_cleanup_folio 0.13 ± 5% +0.0 0.16 ± 4% perf-profile.children.cycles-pp.security_vm_enough_memory_mm 0.29 ± 4% +0.0 0.32 ± 2% perf-profile.children.cycles-pp.free_unref_folios 0.29 ± 3% +0.0 0.32 ± 2% perf-profile.children.cycles-pp.find_lock_entries 0.27 ± 2% +0.0 0.32 ± 2% perf-profile.children.cycles-pp._raw_spin_lock 0.16 ± 3% +0.0 0.20 ± 3% perf-profile.children.cycles-pp.cgroup_rstat_updated 0.24 ± 2% +0.0 0.28 ± 2% perf-profile.children.cycles-pp.__dquot_alloc_space 0.22 ± 3% +0.0 0.26 ± 3% perf-profile.children.cycles-pp.rmqueue 0.25 ± 2% +0.0 0.30 ± 2% perf-profile.children.cycles-pp.xas_load 0.25 ± 4% +0.0 0.30 ± 3% perf-profile.children.cycles-pp.__mem_cgroup_uncharge_folios 0.15 ± 3% +0.1 0.20 ± 5% perf-profile.children.cycles-pp.__count_memcg_events 0.34 ± 2% +0.1 0.40 perf-profile.children.cycles-pp.get_page_from_freelist 0.40 ± 2% +0.1 0.47 ± 3% perf-profile.children.cycles-pp.entry_SYSCALL_64 0.61 ± 2% +0.1 0.71 ± 2% perf-profile.children.cycles-pp.__alloc_pages_noprof 1.91 ± 3% +0.1 2.02 ± 2% perf-profile.children.cycles-pp.__filemap_remove_folio 0.65 ± 2% +0.1 0.76 ± 2% perf-profile.children.cycles-pp.xas_store 0.29 +0.1 0.40 ± 2% perf-profile.children.cycles-pp.mem_cgroup_commit_charge 0.75 ± 2% +0.1 0.87 ± 2% perf-profile.children.cycles-pp.alloc_pages_mpol_noprof 0.77 ± 2% +0.1 0.90 ± 2% perf-profile.children.cycles-pp.folio_alloc_mpol_noprof 2.41 +0.1 2.54 ± 2% perf-profile.children.cycles-pp.__lruvec_stat_mod_folio 2.13 ± 3% +0.1 2.27 perf-profile.children.cycles-pp.filemap_remove_folio 0.76 +0.2 0.91 ± 2% perf-profile.children.cycles-pp.shmem_inode_acct_blocks 0.92 +0.2 1.08 ± 2% perf-profile.children.cycles-pp.shmem_alloc_folio 0.94 +0.2 1.10 ± 2% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack 0.99 +0.2 1.16 ± 2% perf-profile.children.cycles-pp.syscall_return_via_sysret 0.51 ± 5% +0.2 0.67 perf-profile.children.cycles-pp.lru_add_fn 2.42 ± 3% +0.2 2.60 perf-profile.children.cycles-pp.truncate_inode_folio 0.43 ± 9% +0.2 0.64 ± 5% perf-profile.children.cycles-pp.get_mem_cgroup_from_mm 1.90 +0.3 2.18 perf-profile.children.cycles-pp.shmem_add_to_page_cache 0.94 ± 4% +0.4 1.33 ± 3% perf-profile.children.cycles-pp.__mem_cgroup_charge 1.28 ± 4% +0.7 2.01 ± 5% perf-profile.children.cycles-pp.__mod_memcg_lruvec_state 44.72 +1.5 46.24 perf-profile.children.cycles-pp.shmem_alloc_and_add_folio 45.09 +1.6 46.66 perf-profile.children.cycles-pp.shmem_get_folio_gfp 45.86 +1.7 47.57 perf-profile.children.cycles-pp.shmem_fallocate 46.09 +1.7 47.83 perf-profile.children.cycles-pp.vfs_fallocate 46.28 +1.8 48.06 perf-profile.children.cycles-pp.__x64_sys_fallocate 49.38 +2.3 51.66 perf-profile.children.cycles-pp.fallocate64 84.78 -2.2 82.54 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 1.19 ± 5% -0.4 0.76 ± 6% perf-profile.self.cycles-pp.__lruvec_stat_mod_folio 0.40 ± 2% -0.2 0.22 ± 2% perf-profile.self.cycles-pp.folios_put_refs 0.48 ± 3% -0.0 0.44 perf-profile.self.cycles-pp.folio_batch_move_lru 0.13 ± 5% -0.0 0.10 ± 3% perf-profile.self.cycles-pp.__mod_lruvec_state 0.09 +0.0 0.10 ± 3% perf-profile.self.cycles-pp.alloc_pages_mpol_noprof 0.10 ± 3% +0.0 0.12 ± 4% perf-profile.self.cycles-pp.__cond_resched 0.10 ± 4% +0.0 0.11 perf-profile.self.cycles-pp.__folio_cancel_dirty 0.08 ± 6% +0.0 0.09 perf-profile.self.cycles-pp.xas_find_conflict 0.09 ± 5% +0.0 0.10 ± 3% perf-profile.self.cycles-pp._raw_spin_trylock 0.09 ± 5% +0.0 0.11 ± 3% perf-profile.self.cycles-pp.rmqueue 0.08 +0.0 0.10 ± 5% perf-profile.self.cycles-pp.folio_add_lru 0.08 ± 4% +0.0 0.10 ± 4% perf-profile.self.cycles-pp.truncate_cleanup_folio 0.08 ± 4% +0.0 0.10 ± 5% perf-profile.self.cycles-pp.shmem_alloc_folio 0.12 ± 3% +0.0 0.14 ± 3% perf-profile.self.cycles-pp.shmem_get_folio_gfp 0.12 +0.0 0.14 ± 3% perf-profile.self.cycles-pp.get_page_from_freelist 0.09 +0.0 0.11 ± 6% perf-profile.self.cycles-pp.xas_start 0.06 ± 6% +0.0 0.08 perf-profile.self.cycles-pp.__mod_zone_page_state 0.14 ± 3% +0.0 0.16 ± 2% perf-profile.self.cycles-pp.free_unref_folios 0.13 ± 4% +0.0 0.15 ± 4% perf-profile.self.cycles-pp.vfs_fallocate 0.11 ± 4% +0.0 0.13 ± 2% perf-profile.self.cycles-pp.folio_unlock 0.14 ± 2% +0.0 0.16 ± 4% perf-profile.self.cycles-pp.__fdget 0.13 ± 4% +0.0 0.15 ± 2% perf-profile.self.cycles-pp._raw_spin_lock_irq 0.12 ± 3% +0.0 0.14 ± 3% perf-profile.self.cycles-pp.fallocate64 0.10 ± 3% +0.0 0.12 ± 3% perf-profile.self.cycles-pp.uncharge_folio 0.12 +0.0 0.14 ± 2% perf-profile.self.cycles-pp.noop_dirty_folio 0.15 ± 4% +0.0 0.18 ± 5% perf-profile.self.cycles-pp.__dquot_alloc_space 0.10 ± 9% +0.0 0.12 ± 7% perf-profile.self.cycles-pp.cap_vm_enough_memory 0.07 ± 25% +0.0 0.10 ± 8% perf-profile.self.cycles-pp.mem_cgroup_update_lru_size 0.14 ± 3% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.shmem_alloc_and_add_folio 0.24 ± 4% +0.0 0.27 ± 3% perf-profile.self.cycles-pp.find_lock_entries 0.13 ± 3% +0.0 0.17 ± 4% perf-profile.self.cycles-pp.cgroup_rstat_updated 0.17 ± 4% +0.0 0.20 ± 2% perf-profile.self.cycles-pp.xas_load 0.14 ± 5% +0.0 0.18 ± 4% perf-profile.self.cycles-pp.__filemap_remove_folio 0.20 ± 2% +0.0 0.24 ± 3% perf-profile.self.cycles-pp.__alloc_pages_noprof 0.14 ± 3% +0.0 0.18 ± 6% perf-profile.self.cycles-pp.shmem_inode_acct_blocks 0.26 ± 2% +0.0 0.30 ± 2% perf-profile.self.cycles-pp._raw_spin_lock 0.12 ± 4% +0.0 0.17 ± 6% perf-profile.self.cycles-pp.__count_memcg_events 0.10 ± 3% +0.0 0.15 ± 5% perf-profile.self.cycles-pp.__mem_cgroup_charge 0.27 ± 3% +0.0 0.32 ± 3% perf-profile.self.cycles-pp.shmem_fallocate 0.15 ± 5% +0.1 0.20 perf-profile.self.cycles-pp.__page_cache_release 0.24 ± 2% +0.1 0.30 ± 3% perf-profile.self.cycles-pp.shmem_add_to_page_cache 0.36 ± 3% +0.1 0.41 perf-profile.self.cycles-pp.xas_store 0.35 ± 2% +0.1 0.41 ± 2% perf-profile.self.cycles-pp.entry_SYSCALL_64 0.12 ± 3% +0.1 0.19 ± 7% perf-profile.self.cycles-pp.mem_cgroup_commit_charge 0.22 ± 2% +0.1 0.28 perf-profile.self.cycles-pp.lru_add_fn 0.56 +0.1 0.65 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe 0.93 +0.2 1.09 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack 0.99 +0.2 1.16 ± 2% perf-profile.self.cycles-pp.syscall_return_via_sysret 0.42 ± 9% +0.2 0.64 ± 5% perf-profile.self.cycles-pp.get_mem_cgroup_from_mm 1.18 ± 5% +0.7 1.88 ± 5% perf-profile.self.cycles-pp.__mod_memcg_lruvec_state Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki