ntfs3 folio related traceback

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello, I'm including some kernel stack traces below.
They were spurious, so I hesitate to file bug. I have also filed some similar bugs before, which were back then more sure to occur. 
I also see many recent posts mentioning similar kernel stack traces and related to ntfs3 and folios.
Are there any bugs fixes/issue-nos related to this that are being addressed, already committed to kernel-6.1, 6.2, 6.3 that I should be aware of?
thx -Gana


Host: fedora-37/kernel-6.0.15-300.fc37.x86_64
qemu: 7.2.0
Guest: fedora-37/kernel-6.0.15-300.fc37.x86_64

Prior similar bugs: 
Bug 215460 fs/ntfs3: page_cache_Ra_unbounded on rsync from ntfs3 to ext4
  https://bugzilla.kernel.org/show_bug.cgi?id=215460
Bug 215563 during rsync, vm guest kernel trace arising from memcg_kmem_charge_page alloca_pages
  https://bugzilla.kernel.org/show_bug.cgi?id=215563

Essentially, I do the same standard steps as mentioned in those bugs.
The bug I am really stress testing for is https://gitlab.com/qemu-project/qemu/-/issues/727
In it, two vdisks are connected to a qemu vm. The destination vdisk starts of as an empty dynamic vdisk, the source vdisk contains synthetic generated data  
The empty dynamic vdisk is inflated from a tiny size to 85Gib in under 16 minutes. 

reproducibility: Spurious
2 out of 6 testruns went to completion. 

Steps to reproduce:
same as before
On a qcow2-dynamic vdisk, that starts of empty, stress test copy about 85Gib of data to inflate its size. 
src: ntfs3/sdc/vhdx/ntfs3/hdd
dst: ntfs3/sdb/qcow2-dynamic/ntfs3/ssd

Below are 6 testruns with 2 strack-traces.
Prior to these I have tested the destination vdisk being 93Gib-raw-img-fixed, fixed-93Gib-qemu-fixed and 93Gib-vhdx-fixed. These did not give any problem.
The write stress on the expanding qcow2-dynamic vdisk is required. 

RUN1:
guest:
[root@sirius ~]# ls -l /mnt/a17/gkpics01.qcow2
-rw-r--r--. 1 gana gana 198096 Jan  3 15:27 /mnt/a17/gkpics01.qcow2

[root@sirius ~]# ls -l /mnt/a17/gkpics01.qcow2 # after gdisk
-rw-r--r--. 1 gana gana 524288 Jan  3 15:34 /mnt/a17/gkpics01.qcow2

[root@sirius ~]# run the rsync step
: 
:
photos001/D0038_16ab9e88/File_0583.bin
Tue Jan  3 03:42:36 PM IST 2023
rsync: connection unexpectedly closed (635379 bytes received so far) [generator]
rsync error: error in rsync protocol data stream (code 12) at io.c(231) [generator=3.2.7]
Tue Jan  3 03:57:57 PM IST 2023

host:
[root@sirius ~]# ls -l /mnt/a17/gkpics01.qcow2
-rw-r--r--. 1 gana gana 90386464768 Jan  3 15:28 /mnt/a17/gkpics01.qcow2



:
[  995.145136] BUG: unable to handle page fault for address: 00000000486501e3
[  995.145162] #PF: supervisor write access in kernel mode
[  995.145167] #PF: error_code(0x0002) - not-present page
[  995.145172] PGD 0 P4D 0 
[  995.145188] Oops: 0002 [#1] PREEMPT SMP PTI
[  995.145199] CPU: 7 PID: 2531 Comm: rsync Not tainted 6.0.15-300.fc37.x86_64 #1
[  995.145211] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS edk2-20221117gitfff6d81270b5-8.fc37 unknown
[  995.145216] RIP: 0010:io_schedule+0x41/0x70
[  995.145231] Code: 8b ba 28 0d 00 00 89 c3 83 c8 04 c0 eb 02 88 82 90 09 00 00 83 e3 01 48 85 ff 74 0a be 01 00 00 00 e8 e3 58 8b ff e8 be fe ff <ff> 83 e3 01 65 48 8b 14 25 c0 fb 01 00 0f b6 82 90 09 00 00 c1 e3
[  995.145237] RSP: 0018:ffffb70c81343b98 EFLAGS: 00010246
[  995.145243] RAX: 0000000000400000 RBX: 0000000000000000 RCX: 0000000000000000
[  995.145248] RDX: 0000000000000001 RSI: ffffffffac7aac9a RDI: 00000000ffffffff
[  995.145253] RBP: 0000000000000002 R08: 0000000000000000 R09: 0000000000000000
[  995.145256] R10: 0000000000000001 R11: 0000000000000007 R12: 0000000000000102
[  995.145260] R13: ffffffffad008f18 R14: ffffb70c81343c00 R15: fffffb1d808c62c0
[  995.145265] FS:  00007fba69929c40(0000) GS:ffff9b2b3bdc0000(0000) knlGS:0000000000000000
[  995.145271] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  995.145276] CR2: 00000000486501e3 CR3: 0000000176606000 CR4: 00000000000006e0
[  995.145285] Call Trace:
[  995.145290]  <TASK>
[  995.145296]  folio_wait_bit_common+0x12d/0x3a0
[  995.145311]  ? filemap_alloc_folio+0xc0/0xc0
[  995.145322]  filemap_get_pages+0x5f7/0x670
[  995.145333]  ? copy_page_to_iter+0x8a/0x2e0
[  995.145342]  ? workingset_activation+0xb1/0x1a0
[  995.145352]  filemap_read+0xbe/0x3d0
[  995.145365]  vfs_read+0x21a/0x2f0
[  995.145374]  ksys_read+0x5b/0xd0
[  995.145380]  do_syscall_64+0x5b/0x80
[  995.145389]  ? ksys_write+0x97/0xd0
[  995.145392]  ? syscall_exit_to_user_mode+0x17/0x40
[  995.145397]  ? do_syscall_64+0x67/0x80
[  995.145402]  ? do_syscall_64+0x67/0x80
[  995.145407]  ? syscall_exit_to_user_mode+0x17/0x40
[  995.145411]  ? do_syscall_64+0x67/0x80
[  995.145416]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  995.145423] RIP: 0033:0x7fba6931e021
[  995.148377] Code: 31 c0 e9 b2 fe ff ff 50 48 8d 3d 32 35 0a 00 e8 35 08 02 00 0f 1f 44 00 00 f3 0f 1e fa 80 3d dd 05 0e 00 00 74 13 31 c0 0f 05 <48> 3d 00 f0 ff ff 77 57 c3 66 0f 1f 44 00 00 48 83 ec 28 48 89 54
[  995.148384] RSP: 002b:00007fff3a2ff5f8 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[  995.148392] RAX: ffffffffffffffda RBX: 000055f6ad9b9a10 RCX: 00007fba6931e021
[  995.148396] RDX: 000000000003c5ff RSI: 000055f6ada138c0 RDI: 0000000000000003
[  995.148399] RBP: 000000000003c5ff R08: 000000000003c5ff R09: 000000000003c5ff
[  995.148403] R10: 000000000000000f R11: 0000000000000246 R12: 0000000000000000
[  995.148406] R13: 0000000000000000 R14: 0000000000000000 R15: 000000000003c5ff
[  995.148417]  </TASK>
[  995.148427] Modules linked in: ntfs3 uinput snd_seq_dummy snd_hrtimer snd_seq snd_seq_device snd_timer snd soundcore nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 rfkill ip_set nf_tables nfnetlink qrtr vfat fat ppdev iTCO_wdt intel_pmc_bxt iTCO_vendor_support joydev parport_pc parport i2c_i801 pcspkr i2c_smbus lpc_ich zram virtio_net net_failover virtio_console virtio_gpu failover serio_raw virtio_dma_buf aacraid mvsas libsas vmw_pvscsi megaraid_sas megaraid smartpqi pmcraid uas vhci_hcd usbip_core xhci_plat_hcd usb_storage nvme nvme_core nvme_common mptspi scsi_transport_spi mptsas mptscsih mptbase scsi_transport_sas sunrpc be2iscsi bnx2i cnic uio cxgb4i cxgb4 tls cxgb3i cxgb3 mdio libcxgbi libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip6_tables ip_tables fuse
[  995.149789]  qemu_fw_cfg
[  995.149807] CR2: 00000000486501e3
[  995.149883] ---[ end trace 0000000000000000 ]---
[  995.149889] RIP: 0010:io_schedule+0x41/0x70
[  995.149898] Code: 8b ba 28 0d 00 00 89 c3 83 c8 04 c0 eb 02 88 82 90 09 00 00 83 e3 01 48 85 ff 74 0a be 01 00 00 00 e8 e3 58 8b ff e8 be fe ff <ff> 83 e3 01 65 48 8b 14 25 c0 fb 01 00 0f b6 82 90 09 00 00 c1 e3
[  995.149902] RSP: 0018:ffffb70c81343b98 EFLAGS: 00010246
[  995.149906] RAX: 0000000000400000 RBX: 0000000000000000 RCX: 0000000000000000
[  995.149909] RDX: 0000000000000001 RSI: ffffffffac7aac9a RDI: 00000000ffffffff
[  995.149912] RBP: 0000000000000002 R08: 0000000000000000 R09: 0000000000000000
[  995.149914] R10: 0000000000000001 R11: 0000000000000007 R12: 0000000000000102
[  995.149916] R13: ffffffffad008f18 R14: ffffb70c81343c00 R15: fffffb1d808c62c0
[  995.149918] FS:  00007fba69929c40(0000) GS:ffff9b2b3bdc0000(0000) knlGS:0000000000000000
[  995.149920] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  995.149922] CR2: 00000000486501e3 CR3: 0000000176606000 CR4: 00000000000006e0

RUN2:
system freeze

RUN3:
sort of freeze, gui responsive, but can't start new terminal/shell/new process 

RUN4:
traceback but operable 
[  131.484082] BUG: unable to handle page fault for address: 0000000000001001
[  131.484091] #PF: supervisor write access in kernel mode
[  131.484094] #PF: error_code(0x0002) - not-present page
[  131.484096] PGD 0 P4D 0 
[  131.484104] Oops: 0002 [#1] PREEMPT SMP PTI
[  131.484110] CPU: 0 PID: 2558 Comm: rsync Not tainted 6.0.15-300.fc37.x86_64 #1
[  131.484115] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS edk2-20221117gitfff6d81270b5-8.fc37 unknown
[  131.484117] RIP: 0010:copy_page_from_iter+0x76/0x170
[  131.484123] Code: 4c 89 e2 4c 29 fe 4c 39 f6 49 0f 47 f6 48 2b 3d 40 ab 18 01 48 c1 ff 06 48 c1 e7 0c 48 03 3d 41 ab 18 01 4c 01 ff e8 09 fa ff <ff> 48 01 c5 49 29 c6 74 5f 48 85 c0 74 5a 49 01 c7 49 81 ff 00 10
[  131.484125] RSP: 0018:ffffb70000c97bc8 EFLAGS: 00010286
[  131.484127] RAX: 0000000000001000 RBX: ffffe5cfc0dc30c0 RCX: 0000000000004008
[  131.484129] RDX: 0000000000004000 RSI: 00005616cc757138 RDI: ffff99ed770c4000
[  131.484130] RBP: 0000000000003000 R08: 326e73452c4e6cb7 R09: 7227a8304c2683b2
[  131.484131] R10: cb3f3c018be49eba R11: 219097ff2b911ec9 R12: ffffb70000c97d18
[  131.484132] R13: 0000000000001000 R14: 0000000000005000 R15: 0000000000000000
[  131.484134] FS:  00007f1634972c40(0000) GS:ffff99eebbc00000(0000) knlGS:0000000000000000
[  131.484135] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  131.484137] CR2: 0000000000001001 CR3: 0000000019ec2000 CR4: 00000000000006f0
[  131.484141] Call Trace:
[  131.484143]  <TASK>
[  131.484147]  skb_copy_datagram_from_iter+0x10e/0x1e0
[  131.484153]  unix_stream_sendmsg+0x29d/0x700
[  131.484158]  sock_sendmsg+0x5f/0x70
[  131.484161]  sock_write_iter+0x89/0xe0
[  131.484164]  vfs_write+0x34e/0x3e0
[  131.484172]  ksys_write+0x97/0xd0
[  131.484174]  do_syscall_64+0x5b/0x80
[  131.484179]  ? do_pselect.constprop.0+0xe9/0x150
[  131.484184]  ? syscall_exit_to_user_mode+0x17/0x40
[  131.484187]  ? do_syscall_64+0x67/0x80
[  131.484189]  ? exit_to_user_mode_prepare+0x18f/0x1f0
[  131.484196]  ? syscall_exit_to_user_mode+0x17/0x40
[  131.484198]  ? do_syscall_64+0x67/0x80
[  131.484200]  ? do_syscall_64+0x67/0x80
[  131.484203]  ? do_syscall_64+0x67/0x80
[  131.484206]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  131.484209] RIP: 0033:0x7f163431e0c4
[  131.484233] Code: 15 71 7d 0d 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 80 3d 3d 05 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
[  131.484234] RSP: 002b:00007fff4a53a0f8 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
[  131.484237] RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 00007f163431e0c4
[  131.484238] RDX: 0000000000008008 RSI: 00005616cc753130 RDI: 0000000000000004
[  131.484239] RBP: 00005616ca8dc080 R08: 0000000000008008 R09: 0000000000000000
[  131.484248] R10: 00007fff4a53a1d0 R11: 0000000000000202 R12: 00007fff4a53a150
[  131.484250] R13: 0000000000000002 R14: 00007fff4a53a250 R15: 00007fff4a53a1d0
[  131.484254]  </TASK>
[  131.484267] Modules linked in: ntfs3 uinput snd_seq_dummy snd_hrtimer snd_seq snd_seq_device snd_timer snd soundcore nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set rfkill nf_tables nfnetlink qrtr vfat fat iTCO_wdt ppdev intel_pmc_bxt iTCO_vendor_support parport_pc i2c_i801 parport pcspkr i2c_smbus joydev lpc_ich zram virtio_net virtio_gpu net_failover serio_raw failover virtio_console virtio_dma_buf aacraid mvsas libsas vmw_pvscsi megaraid_sas megaraid smartpqi pmcraid uas vhci_hcd usbip_core xhci_plat_hcd usb_storage nvme nvme_core nvme_common mptspi scsi_transport_spi mptsas mptscsih mptbase scsi_transport_sas sunrpc be2iscsi bnx2i cnic uio cxgb4i cxgb4 tls cxgb3i cxgb3 mdio libcxgbi libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip6_tables ip_tables fuse
[  131.484334]  qemu_fw_cfg
[  131.484342] CR2: 0000000000001001
[  131.484347] ---[ end trace 0000000000000000 ]---
[  131.484349] RIP: 0010:copy_page_from_iter+0x76/0x170
[  131.484353] Code: 4c 89 e2 4c 29 fe 4c 39 f6 49 0f 47 f6 48 2b 3d 40 ab 18 01 48 c1 ff 06 48 c1 e7 0c 48 03 3d 41 ab 18 01 4c 01 ff e8 09 fa ff <ff> 48 01 c5 49 29 c6 74 5f 48 85 c0 74 5a 49 01 c7 49 81 ff 00 10
[  131.484355] RSP: 0018:ffffb70000c97bc8 EFLAGS: 00010286
[  131.484357] RAX: 0000000000001000 RBX: ffffe5cfc0dc30c0 RCX: 0000000000004008
[  131.484358] RDX: 0000000000004000 RSI: 00005616cc757138 RDI: ffff99ed770c4000
[  131.484359] RBP: 0000000000003000 R08: 326e73452c4e6cb7 R09: 7227a8304c2683b2
[  131.484360] R10: cb3f3c018be49eba R11: 219097ff2b911ec9 R12: ffffb70000c97d18
[  131.484361] R13: 0000000000001000 R14: 0000000000005000 R15: 0000000000000000
[  131.484363] FS:  00007f1634972c40(0000) GS:ffff99eebbc00000(0000) knlGS:0000000000000000
[  131.484364] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  131.484365] CR2: 0000000000001001 CR3: 0000000019ec2000 CR4: 00000000000006f0


RUN5: success
copy:
Tue Jan  3 07:16:29 PM IST 2023
Tue Jan  3 07:32:07 PM IST 2023
16 min
verify:
Tue Jan  3 07:32:53 PM IST 2023
Tue Jan  3 07:48:19 PM IST 2023
Tue Jan  3 07:57:29 PM IST 2023
Tue Jan  3 08:13:25 PM IST 2023
16 min

[root@sirius ~]# ls -l /mnt/a17/gkpics01.qcow2  ; df | grep sda17
-rw-r--r--. 1 gana gana 91077017600 Jan  3 19:56 /mnt/a17/gkpics01.qcow2
/dev/sda17       97517564   89051588   8465976  92% /mnt/a17



RUN6: success
mounted src vdisk using ntfs-3g fuseblk driver.
src: ntfs3g-fuseblk/sdc/vhdx/ntfs3/hdd
dst: ntfs3/sdb/qcow2-dynamic/ntfs3/ssd

copy
Tue Jan  3 09:12:17 PM IST 2023
Tue Jan  3 09:28:51 PM IST 2023
16 min
verify
Tue Jan  3 09:30:23 PM IST 2023
Tue Jan  3 09:51:04 PM IST 2023
Tue Jan  3 10:14:27 PM IST 2023
Tue Jan  3 10:29:20 PM IST 2023
15 min





[Index of Archives]     [Linux Driver Backports]     [DMA Engine]     [Linux GPIO]     [Linux SPI]     [Video for Linux]     [Linux USB Devel]     [Linux Coverity]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [Yosemite Backpacking]
  Powered by Linux