Just patched it, then I rerun the reproduce program, and I got this output from console: BUG: Bad page state in process POC pfn:0eb8d page:ffffea00003ae340 refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0xeb8d flags: 0xfff00000001000(reserved|node=0|zone=1|lastcpupid=0x7ff) page_type: 0xffffffff() raw: 00fff00000001000 ffffea00003ae348 ffffea00003ae348 0000000000000000 raw: 0000000000000000 0000000000000000 00000000ffffffff 0000000000000000 page dumped because: PAGE_FLAGS_CHECK_AT_FREE flag(s) set page_owner info is not present (never set?) Modules linked in: CPU: 0 PID: 7959 Comm: POC Not tainted 6.5.0-rc2 #2 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.12.0-59-gc9ba5276e321-prebuilt.qemu.org 04/01/2014 Call Trace: <TASK> __dump_stack lib/dump_stack.c:88 [inline] dump_stack_lvl+0xd4/0xf0 lib/dump_stack.c:106 bad_page+0x71/0x1a0 mm/page_alloc.c:533 free_page_is_bad_report mm/page_alloc.c:974 [inline] free_page_is_bad mm/page_alloc.c:984 [inline] free_pages_prepare mm/page_alloc.c:1153 [inline] free_unref_page_prepare+0x5f3/0xb50 mm/page_alloc.c:2348 free_unref_page+0x2f/0x3c0 mm/page_alloc.c:2443 __folio_put_small mm/swap.c:106 [inline] __folio_put+0xa2/0x110 mm/swap.c:129 folio_put include/linux/mm.h:1423 [inline] put_page include/linux/mm.h:1492 [inline] extract_user_to_sg lib/scatterlist.c:1151 [inline] extract_iter_to_sg lib/scatterlist.c:1349 [inline] extract_iter_to_sg+0x11ec/0x1570 lib/scatterlist.c:1339 hash_sendmsg+0x487/0xf50 crypto/algif_hash.c:119 sock_sendmsg_nosec net/socket.c:725 [inline] sock_sendmsg+0xcf/0x170 net/socket.c:748 ____sys_sendmsg+0x676/0x860 net/socket.c:2494 ___sys_sendmsg+0x109/0x1a0 net/socket.c:2548 __sys_sendmsg+0xe4/0x1b0 net/socket.c:2577 do_syscall_x64 arch/x86/entry/common.c:50 [inline] do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80 entry_SYSCALL_64_after_hwframe+0x63/0xcd RIP: 0033:0x7fbd79539f29 Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 37 8f 0d 00 f7 d8 64 89 01 48 RSP: 002b:00007ffeed5b63d8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fbd79539f29 RDX: 0000000000000000 RSI: 00000000200001c0 RDI: 0000000000000004 RBP: 00007ffeed5b63f0 R08: 00007ffeed5b63f0 R09: 00007ffeed5b63f0 R10: 00007ffeed5b63f0 R11: 0000000000000246 R12: 000055d8a44b91a0 R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 </TASK> page:ffffea00003ae340 refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0xeb8d flags: 0xfff00000001000(reserved|node=0|zone=1|lastcpupid=0x7ff) page_type: 0xffffffff() raw: 00fff00000001000 ffffea00003ae348 ffffea00003ae348 0000000000000000 raw: 0000000000000000 0000000000000000 00000000ffffffff 0000000000000000 page dumped because: VM_WARN_ON_ONCE_FOLIO(folio_ref_count(folio) <= 0) page_owner info is not present (never set?) ------------[ cut here ]------------ WARNING: CPU: 0 PID: 7962 at mm/gup.c:229 try_grab_page+0x307/0x3c0 mm/gup.c:229 Modules linked in: CPU: 0 PID: 7962 Comm: POC Tainted: G B 6.5.0-rc2 #2 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.12.0-59-gc9ba5276e321-prebuilt.qemu.org 04/01/2014 RIP: 0010:try_grab_page+0x307/0x3c0 mm/gup.c:229 Code: 80 3d 61 0e 82 0b 00 41 bc f4 ff ff ff 75 b4 e8 3f 96 cb ff 48 c7 c6 40 83 57 89 48 89 ef e8 60 a7 ff ff c6 05 3e 0e 82 0b 01 <0f> 0b eb 95 e8 20 96 cb ff be 04 00 00 00 4c 89 e7 e8 93 fa 13 00 RSP: 0018:ffffc90002927178 EFLAGS: 00010293 RAX: 0000000000000000 RBX: ffffea00003ae340 RCX: 0000000000000000 RDX: ffff88801ab18000 RSI: ffffffff81ad81e0 RDI: ffffffff8af7ea00 RBP: ffffea00003ae340 R08: 0000000000000000 R09: fffffbfff1a8a74a R10: ffffffff8d453a57 R11: 6e776f5f65676170 R12: 00000000fffffff4 R13: 0000000000290000 R14: ffffea00003ae340 R15: ffffea00003ae340 FS: 00007fbd7961a540(0000) GS:ffff888063e00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00007fbd794d03d0 CR3: 0000000019855000 CR4: 0000000000750ef0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 PKRU: 55555554 Call Trace: <TASK> follow_page_pte+0x18c/0x1610 mm/gup.c:651 follow_pmd_mask mm/gup.c:727 [inline] follow_pud_mask mm/gup.c:765 [inline] follow_p4d_mask mm/gup.c:782 [inline] follow_page_mask+0x2e4/0xbd0 mm/gup.c:839 __get_user_pages+0x3fa/0xcf0 mm/gup.c:1256 __get_user_pages_locked mm/gup.c:1487 [inline] __gup_longterm_locked+0x5fa/0x1ec0 mm/gup.c:2181 internal_get_user_pages_fast+0x119b/0x2690 mm/gup.c:3179 pin_user_pages_fast+0x95/0xe0 mm/gup.c:3285 iov_iter_extract_user_pages lib/iov_iter.c:1768 [inline] iov_iter_extract_pages+0x24c/0x1600 lib/iov_iter.c:1831 extract_user_to_sg lib/scatterlist.c:1123 [inline] extract_iter_to_sg lib/scatterlist.c:1349 [inline] extract_iter_to_sg+0x21a/0x1570 lib/scatterlist.c:1339 hash_sendmsg+0x487/0xf50 crypto/algif_hash.c:119 sock_sendmsg_nosec net/socket.c:725 [inline] sock_sendmsg+0xcf/0x170 net/socket.c:748 ____sys_sendmsg+0x676/0x860 net/socket.c:2494 ___sys_sendmsg+0x109/0x1a0 net/socket.c:2548 __sys_sendmsg+0xe4/0x1b0 net/socket.c:2577 do_syscall_x64 arch/x86/entry/common.c:50 [inline] do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80 entry_SYSCALL_64_after_hwframe+0x63/0xcd RIP: 0033:0x7fbd79539f29 Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 37 8f 0d 00 f7 d8 64 89 01 48 RSP: 002b:00007ffeed5b63d8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fbd79539f29 RDX: 0000000000000000 RSI: 00000000200001c0 RDI: 0000000000000004 RBP: 00007ffeed5b63f0 R08: 00007ffeed5b63f0 R09: 00007ffeed5b63f0 R10: 00007ffeed5b63f0 R11: 0000000000000246 R12: 000055d8a44b91a0 R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 </TASK> Modules linked in: CPU: 0 PID: 7962 Comm: POC Tainted: G B 6.5.0-rc2 #2 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.12.0-59-gc9ba5276e321-prebuilt.qemu.org 04/01/2014 RIP: 0010:try_grab_page+0x307/0x3c0 mm/gup.c:229 Code: 80 3d 61 0e 82 0b 00 41 bc f4 ff ff ff 75 b4 e8 3f 96 cb ff 48 c7 c6 40 83 57 89 48 89 ef e8 60 a7 ff ff c6 05 3e 0e 82 0b 01 <0f> 0b eb 95 e8 20 96 cb ff be 04 00 00 00 4c 89 e7 e8 93 fa 13 00 RSP: 0018:ffffc90002927178 EFLAGS: 00010293 RAX: 0000000000000000 RBX: ffffea00003ae340 RCX: 0000000000000000 RDX: ffff88801ab18000 RSI: ffffffff81ad81e0 RDI: ffffffff8af7ea00 RBP: ffffea00003ae340 R08: 0000000000000000 R09: fffffbfff1a8a74a R10: ffffffff8d453a57 R11: 6e776f5f65676170 R12: 00000000fffffff4 R13: 0000000000290000 R14: ffffea00003ae340 R15: ffffea00003ae340 FS: 00007fbd7961a540(0000) GS:ffff888063e00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00007fbd794d03d0 CR3: 0000000019855000 CR4: 0000000000750ef0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 PKRU: 55555554 Call Trace: <TASK> follow_page_pte+0x18c/0x1610 mm/gup.c:651 follow_pmd_mask mm/gup.c:727 [inline] follow_pud_mask mm/gup.c:765 [inline] follow_p4d_mask mm/gup.c:782 [inline] follow_page_mask+0x2e4/0xbd0 mm/gup.c:839 __get_user_pages+0x3fa/0xcf0 mm/gup.c:1256 __get_user_pages_locked mm/gup.c:1487 [inline] __gup_longterm_locked+0x5fa/0x1ec0 mm/gup.c:2181 internal_get_user_pages_fast+0x119b/0x2690 mm/gup.c:3179 pin_user_pages_fast+0x95/0xe0 mm/gup.c:3285 iov_iter_extract_user_pages lib/iov_iter.c:1768 [inline] iov_iter_extract_pages+0x24c/0x1600 lib/iov_iter.c:1831 extract_user_to_sg lib/scatterlist.c:1123 [inline] extract_iter_to_sg lib/scatterlist.c:1349 [inline] extract_iter_to_sg+0x21a/0x1570 lib/scatterlist.c:1339 hash_sendmsg+0x487/0xf50 crypto/algif_hash.c:119 sock_sendmsg_nosec net/socket.c:725 [inline] sock_sendmsg+0xcf/0x170 net/socket.c:748 ____sys_sendmsg+0x676/0x860 net/socket.c:2494 ___sys_sendmsg+0x109/0x1a0 net/socket.c:2548 __sys_sendmsg+0xe4/0x1b0 net/socket.c:2577 do_syscall_x64 arch/x86/entry/common.c:50 [inline] do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80 entry_SYSCALL_64_after_hwframe+0x63/0xcd RIP: 0033:0x7fbd79539f29 Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 37 8f 0d 00 f7 d8 64 89 01 48 RSP: 002b:00007ffeed5b63d8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fbd79539f29 RDX: 0000000000000000 RSI: 00000000200001c0 RDI: 0000000000000004 RBP: 00007ffeed5b63f0 R08: 00007ffeed5b63f0 R09: 00007ffeed5b63f0 R10: 00007ffeed5b63f0 R11: 0000000000000246 R12: 000055d8a44b91a0 R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 </TASK> Kernel panic - not syncing: kernel: panic_on_warn set ... CPU: 0 PID: 7962 Comm: POC Tainted: G B 6.5.0-rc2 #2 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.12.0-59-gc9ba5276e321-prebuilt.qemu.org 04/01/2014 Call Trace: <TASK> __dump_stack lib/dump_stack.c:88 [inline] dump_stack_lvl+0x92/0xf0 lib/dump_stack.c:106 panic+0x570/0x620 kernel/panic.c:340 check_panic_on_warn+0x8e/0x90 kernel/panic.c:236 __warn+0xee/0x340 kernel/panic.c:673 __report_bug lib/bug.c:199 [inline] report_bug+0x25d/0x460 lib/bug.c:219 handle_bug+0x3c/0x70 arch/x86/kernel/traps.c:324 exc_invalid_op+0x14/0x40 arch/x86/kernel/traps.c:345 asm_exc_invalid_op+0x16/0x20 arch/x86/include/asm/idtentry.h:568 RIP: 0010:try_grab_page+0x307/0x3c0 mm/gup.c:229 Code: 80 3d 61 0e 82 0b 00 41 bc f4 ff ff ff 75 b4 e8 3f 96 cb ff 48 c7 c6 40 83 57 89 48 89 ef e8 60 a7 ff ff c6 05 3e 0e 82 0b 01 <0f> 0b eb 95 e8 20 96 cb ff be 04 00 00 00 4c 89 e7 e8 93 fa 13 00 RSP: 0018:ffffc90002927178 EFLAGS: 00010293 RAX: 0000000000000000 RBX: ffffea00003ae340 RCX: 0000000000000000 RDX: ffff88801ab18000 RSI: ffffffff81ad81e0 RDI: ffffffff8af7ea00 RBP: ffffea00003ae340 R08: 0000000000000000 R09: fffffbfff1a8a74a R10: ffffffff8d453a57 R11: 6e776f5f65676170 R12: 00000000fffffff4 R13: 0000000000290000 R14: ffffea00003ae340 R15: ffffea00003ae340 follow_page_pte+0x18c/0x1610 mm/gup.c:651 follow_pmd_mask mm/gup.c:727 [inline] follow_pud_mask mm/gup.c:765 [inline] follow_p4d_mask mm/gup.c:782 [inline] follow_page_mask+0x2e4/0xbd0 mm/gup.c:839 __get_user_pages+0x3fa/0xcf0 mm/gup.c:1256 __get_user_pages_locked mm/gup.c:1487 [inline] __gup_longterm_locked+0x5fa/0x1ec0 mm/gup.c:2181 internal_get_user_pages_fast+0x119b/0x2690 mm/gup.c:3179 pin_user_pages_fast+0x95/0xe0 mm/gup.c:3285 iov_iter_extract_user_pages lib/iov_iter.c:1768 [inline] iov_iter_extract_pages+0x24c/0x1600 lib/iov_iter.c:1831 extract_user_to_sg lib/scatterlist.c:1123 [inline] extract_iter_to_sg lib/scatterlist.c:1349 [inline] extract_iter_to_sg+0x21a/0x1570 lib/scatterlist.c:1339 hash_sendmsg+0x487/0xf50 crypto/algif_hash.c:119 sock_sendmsg_nosec net/socket.c:725 [inline] sock_sendmsg+0xcf/0x170 net/socket.c:748 ____sys_sendmsg+0x676/0x860 net/socket.c:2494 ___sys_sendmsg+0x109/0x1a0 net/socket.c:2548 __sys_sendmsg+0xe4/0x1b0 net/socket.c:2577 do_syscall_x64 arch/x86/entry/common.c:50 [inline] do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80 entry_SYSCALL_64_after_hwframe+0x63/0xcd RIP: 0033:0x7fbd79539f29 Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 37 8f 0d 00 f7 d8 64 89 01 48 RSP: 002b:00007ffeed5b63d8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fbd79539f29 RDX: 0000000000000000 RSI: 00000000200001c0 RDI: 0000000000000004 RBP: 00007ffeed5b63f0 R08: 00007ffeed5b63f0 R09: 00007ffeed5b63f0 R10: 00007ffeed5b63f0 R11: 0000000000000246 R12: 000055d8a44b91a0 R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 </TASK> Dumping ftrace buffer: (ftrace buffer empty) Kernel Offset: disabled Rebooting in 1 seconds.. --------------------------------------------------------------------------------------------- I think the previous question you mentioned about ioctl() is triggered because of another crash WARNING in kvm_arch_vcpu_ioctl_run, I think somehow these two crashes triggered at one time. But I cannot figure out why it happened. after I tried to fixed that problem, and rerun C reproducer on this issue, I got different output from console as above. Matthew Wilcox <willy@xxxxxxxxxxxxx> 于2023年8月3日周四 21:19写道: > > On Thu, Aug 03, 2023 at 04:56:03PM +0800, Yikebaer Aizezi wrote: > > console output: > > https://drive.google.com/file/d/1Lq71bFwtEDix82PEf_193CLG6uh1Pjj9/view?usp=drive_link > > I dug through this, and what I found troubles me. > > ------------[ cut here ]------------ > WARNING: CPU: 0 PID: 13067 at mm/gup.c:229 try_grab_page+0x2dd/0x3a0 > Modules linked in: > CPU: 0 PID: 13067 Comm: syz-executor Tainted: G B 6.5.0-rc2 #1 > Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.12.0-59-gc9ba5276e321-prebuilt.qemu.org 04/01/2014 > RIP: 0010:try_grab_page+0x2dd/0x3a0 > Code: ff be 04 00 00 00 4c 89 e7 e8 cf fa 13 00 f0 41 ff 04 24 e8 65 96 cb ff 45 31 e4 5b 44 89 e0 5d 41 5c 41 5d c3 e8 53 96 cb ff <0f> 0b e8 4c 96 cb ff 41 bc f4 ff ff ff 5b 44 89 e0 5d 41 5c 41 5d > RSP: 0018:ffffc9000c2777e0 EFLAGS: 00010212 > RAX: 0000000000000247 RBX: ffffea00003ae340 RCX: ffffc90002bb1000 > RDX: 0000000000040000 RSI: ffffffff81ad81ed RDI: ffffea00003ae374 > RBP: ffffea00003ae340 R08: 0000000000000000 R09: fffff94000075c6e > R10: ffffea00003ae377 R11: 0000000000084001 R12: ffffea00003ae374 > R13: 0000000000210002 R14: ffffea00003ae340 R15: 000000000eb8d225 > FS: 00007f5841a13640(0000) GS:ffff888063e00000(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: 0000000000500310 CR3: 0000000018d0c000 CR4: 0000000000750ef0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 > PKRU: 55555554 > Call Trace: > <TASK> > ? __warn+0xe2/0x340 > ? try_grab_page+0x2dd/0x3a0 > ? report_bug+0x25d/0x460 > ? handle_bug+0x3c/0x70 > ? exc_invalid_op+0x14/0x40 > ? asm_exc_invalid_op+0x16/0x20 > ? try_grab_page+0x2dd/0x3a0 > ? try_grab_page+0x2dd/0x3a0 > follow_page_pte+0x18c/0x1610 > ? try_grab_page+0x3a0/0x3a0 > ? rcu_is_watching+0xe/0xb0 > follow_page_mask+0x2e4/0xbd0 > __get_user_pages+0x3fa/0xcf0 > ? follow_page_mask+0xbd0/0xbd0 > ? down_read_killable+0x146/0x4f0 > ? down_read_interruptible+0x4f0/0x4f0 > ? rcu_is_watching+0xe/0xb0 > __gup_longterm_locked+0x5fa/0x1ec0 > ? io_schedule_timeout+0x150/0x150 > ? rcu_is_watching+0xe/0xb0 > ? get_user_pages_unlocked+0x580/0x580 > ? lock_release+0x4f7/0x670 > ? internal_get_user_pages_fast+0xe27/0x2690 > ? lock_downgrade+0x690/0x690 > ? preempt_schedule_common+0x45/0xb0 > ? pud_huge+0x9c/0xe0 > ? pmd_huge+0xe0/0xe0 > internal_get_user_pages_fast+0x119b/0x2690 > ? mtree_load+0x1df/0x980 > ? __gup_device_huge+0x530/0x530 > ? rcu_is_watching+0xe/0xb0 > ? lock_release+0x4f7/0x670 > get_user_pages_fast+0x95/0xe0 > ? get_user_pages_fast_only+0xe0/0xe0 > do_get_mempolicy+0x50c/0xd20 > ? sp_delete+0xf0/0xf0 > ? seccomp_notify_ioctl+0xd80/0xd80 > __x64_sys_get_mempolicy+0x187/0x2a0 > ? __ia32_sys_migrate_pages+0xf0/0xf0 > ? __secure_computing+0x1ff/0x360 > do_syscall_64+0x35/0xb0 > entry_SYSCALL_64_after_hwframe+0x63/0xcd > RIP: 0033:0x47959d > Code: 02 b8 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b4 ff ff ff f7 d8 64 89 01 48 > RSP: 002b:00007f5841a13068 EFLAGS: 00000246 ORIG_RAX: 00000000000000ef > RAX: ffffffffffffffda RBX: 000000000059c0a0 RCX: 000000000047959d > RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 > RBP: 000000000059c0a0 R08: 0000000000000003 R09: 0000000000000000 > R10: 0000000020ff9000 R11: 0000000000000246 R12: 000000000059c0ac > R13: 000000000000000b R14: 0000000000437250 R15: 00007f58419f3000 > </TASK> > Kernel panic - not syncing: kernel: panic_on_warn set ... > > > WARNING: CPU: 0 PID: 13067 at mm/gup.c:229 try_grab_page+0x2dd/0x3a0 > > That's this line: > if (WARN_ON_ONCE(folio_ref_count(folio) <= 0)) > Called from: > follow_page_pte+0x18c/0x1610 > > That did: > ptep = pte_offset_map_lock(mm, pmd, address, &ptl); > pte = ptep_get(ptep); > page = vm_normal_page(vma, address, pte); > ret = try_grab_page(page, flags); > > So we grabbed the PTE lock, looked up the PTE, translated that into > a page ... and found a page with a zero (or negative) refcount. > That's Really Bad. I think it was a zero refcount because r08 is 0 > and I don't see any other registers which have a plausible negative > 32-bit number in them. > > Yikebaer, could I trouble you to add this: > > +++ b/mm/gup.c > @@ -226,7 +226,7 @@ int __must_check try_grab_page(struct page *page, unsigned int flags) > { > struct folio *folio = page_folio(page); > > - if (WARN_ON_ONCE(folio_ref_count(folio) <= 0)) > + if (VM_WARN_ON_ONCE_FOLIO(folio_ref_count(folio) <= 0, folio)) > return -ENOMEM; > > if (unlikely(!(flags & FOLL_PCI_P2PDMA) && is_pci_p2pdma_page(page))) > > and rerun the syzkaller? That'll give us some more information about > what has happened, although it won't tell us why it happened. > > We might need to catch someone decrementing the refcount to lower than > the mapcount to catch this ... which will be tricky, given the other > things we reuse the mapcount for.