On Tue, Mar 18, 2025 at 04:17:54PM -0700, Luis Chamberlain wrote: > Ah, then that LTP test isn't going to easily reproduce bugs around > compaction bug. To help proactively find compaction bugs more > deterministically we wrote generic/750 and indeed we can easily see > issues creep up with a SOAK_DURATION=9000 on ext4 on linux-next as of > yesterday next-20250317. Umm .. this is an entirely separate bug. How much COMFIG_DEBUG do you have enabled (ie is this a consequence of something that we have an assert for, but you've disabled?) > BUG: unable to handle page fault for address: ffff9d5640010c48 > #PF: supervisor read access in kernel mode > #PF: error_code(0x0000) - not-present page > PGD 38601067 P4D 38601067 PUD 0 > Oops: Oops: 0000 [#1] PREEMPT SMP NOPTI > CPU: 0 UID: 0 PID: 74 Comm: kcompactd0 Not tainted 6.14.0-rc7-next-20250317 #30 > Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 2024.11-5 01/28/2025 > RIP: 0010:__zone_watermark_ok+0x4e/0x1e0 > Code: 00 00 00 41 f7 c0 38 02 00 00 0f 85 2c 01 00 00 48 8b 4f 30 48 63 d2 48 01 ca 85 db 0f 84 f3 00 00 00 49 29 d1 bb 80 00 00 00 <4c> 03 54 f7 38 31 d2 4d 39 ca 0f 8d d2 00 00 00 ba 01 00 00 00 85 > RSP: 0018:ffffbf47c02b7c78 EFLAGS: 00010202 > RAX: 0000000000000000 RBX: 0000000000000080 RCX: 0000000000000000 > RDX: 0000000000000000 RSI: 0000000000002f52 RDI: ffff9d563fff9180 > RBP: 0000000000000009 R08: 0000000000000080 R09: 00000000000030a1 > R10: 0000000000000be4 R11: 0000000000000be4 R12: 0000000000000002 > R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000002f52 2a:* 4c 03 54 f7 38 add 0x38(%rdi,%rsi,8),%r10 <-- trapping instruction Not quite sure what this is. Perhaps running this through decode_stacktrace.sh would be helpful? > FS: 0000000000000000(0000) GS:ffff9d56b6cce000(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: ffff9d5640010c48 CR3: 0000000115920006 CR4: 0000000000772ef0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 > PKRU: 55555554 > Call Trace: > <TASK> > ? __die_body.cold+0x19/0x28 > ? page_fault_oops+0xa1/0x230 > ? search_module_extables+0x40/0x60 > ? __zone_watermark_ok+0x4e/0x1e0 > ? search_bpf_extables+0x5b/0x80 > ? exc_page_fault+0x16d/0x190 > ? __zone_watermark_ok+0x4e/0x1e0 > ? hrtimer_try_to_cancel+0x78/0x110 > compaction_suit_allocation_order+0x8f/0x110 > kcompactd_do_work+0xbc/0x260 > kcompactd+0x396/0x3e0 > ? __pfx_autoremove_wake_function+0x10/0x10 > ? __pfx_kcompactd+0x10/0x10 > kthread+0xf6/0x240 > ? __pfx_kthread+0x10/0x10 > ? _raw_spin_unlock+0x15/0x30 > ? finish_task_switch.isra.0+0x94/0x290 > ? __pfx_kthread+0x10/0x10 > ret_from_fork+0x2d/0x50 > ? __pfx_kthread+0x10/0x10 > ret_from_fork_asm+0x1a/0x30 > </TASK> > Modules linked in: exfat xfs ext2 loop sunrpc 9p nls_iso8859_1 nls_cp437 crc32c_generic vfat fat kvm_intel kvm ghash_clmulni_intel sha512_ssse3 sha512_generic sha256_ssse3 sha1_ssse3 aesni_intel gf128mul crypto_simd cryptd 9pnet_virtio virtio_console virtio_balloon button joydev evdev serio_raw nvme_fabrics dm_mod nvme_core drm vsock_loopback vmw_vsock_virtio_transport_common vsock nfnetlink autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic efivarfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 md_mod virtio_net net_failover failover virtio_blk psmouse virtio_pci virtio_pci_legacy_dev virtio_pci_modern_dev virtio virtio_ring > CR2: ffff9d5640010c48 > ---[ end trace 0000000000000000 ]--- > RIP: 0010:__zone_watermark_ok+0x4e/0x1e0 > Code: 00 00 00 41 f7 c0 38 02 00 00 0f 85 2c 01 00 00 48 8b 4f 30 48 63 d2 48 01 ca 85 db 0f 84 f3 00 00 00 49 29 d1 bb 80 00 00 00 <4c> 03 54 f7 38 31 d2 4d 39 ca 0f 8d d2 00 00 00 ba 01 00 00 00 85 > RSP: 0018:ffffbf47c02b7c78 EFLAGS: 00010202 > RAX: 0000000000000000 RBX: 0000000000000080 RCX: 0000000000000000 > RDX: 0000000000000000 RSI: 0000000000002f52 RDI: ffff9d563fff9180 > RBP: 0000000000000009 R08: 0000000000000080 R09: 00000000000030a1 > R10: 0000000000000be4 R11: 0000000000000be4 R12: 0000000000000002 > R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000002f52 > FS: 0000000000000000(0000) GS:ffff9d56b6cce000(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: ffff9d5640010c48 CR3: 0000000115920006 CR4: 0000000000772ef0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 > PKRU: 55555554 > note: kcompactd0[74] exited with irqs disabled