On 7/9/22 11:07, Hillf Danton wrote:
> On Sat, 9 Jul 2022 07:33:51 +0200 Helge Deller <deller@xxxxxx> wrote:
>> On the debian buildd servers for the parisc architecture I'm facing
>> the following warning, after which it gets stuck so that I need to
>> reboot the server.
>> This happens rarely, but I can reproduce it after some time.
>> Filesystem is tmpfs.
>>
>> I'm happy to add debug code if necessary...
>
> Spin with the diff below applied if it makes two-cent sense to you.
>[...]
> +++ b/fs/namei.c
> @@ -3349,8 +3349,8 @@ static const char *open_last_lookups(str
>  	struct dentry *dir = nd->path.dentry;
>  	int open_flag = op->open_flag;
>  	bool got_write = false;
> -	unsigned seq;
> -	struct inode *inode;
> +	unsigned seq = 0;
> +	struct inode *inode = NULL;
>  	struct dentry *dentry;
>  	const char *res;

That patch didn't help.
I hit another crash a few days ago (see below), but right now I can't
trigger it any longer.
I've added some more debug info and switched to v5.18.11, so when it
happens again I can probably provide more input...

[71492.491336] ------------[ cut here ]------------
[71492.544917] WARNING: CPU: 1 PID: 16312 at fs/dcache.c:365 dentry_free+0x100/0x128
[71492.632868] Modules linked in: ipt_REJECT(E) nf_reject_ipv4(E) xt_multiport(E) nft_compat(E) nf_tables(E) nfnetlink(E) dm_mod(E) sunrpc(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E) sg(E) fuse(E) configfs(E) ip_tables(E) x_tables(E) autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) btrfs(E) blake2b_generic(E) xor(E) raid6_pq(E) zstd_compress(E) libcrc32c(E) crc32c_generic(E) sd_mod(E) t10_pi(E) crc64_rocksoft(E) crc64(E) crc_t10dif(E) crct10dif_generic(E) crct10dif_common(E) ata_generic(E) ohci_pci(E) sata_sil(E) mptspi(E) mptscsih(E) ohci_hcd(E) pata_sil680(E) ehci_pci(E) mptbase(E) ehci_hcd(E) scsi_transport_spi(E) libata(E) e1000(E) scsi_mod(E) usbcore(E) usb_common(E) scsi_common(E)
[71493.366569] CPU: 1 PID: 16312 Comm: rm Tainted: G E 5.18.10+ #28
[71493.454274] Hardware name: 9000/785/C8000
[71493.500913]
[71493.516909] YZrvWESTHLNXBCVMcbcbcbcbOGFRQPDI
[71493.572891] PSW: 00001000000001001111111100001111 Tainted: G E
[71493.662405] r00-03 000000ff0804ff0f 000000005b2c0490 000000004073bea0 000000005b2c0530
[71493.758449] r04-07 00000000410e1a00 00000003050366c0 000000016b97b718 0000000305036718
[71493.854493] r08-11 000000005b2c0460 0000000305036718 00000000410dae60 0000000000000001
[71493.950539] r12-15 0000000000800000 0000000041147200 0000000000000001 0000000000000000
[71494.046582] r16-19 0000000041146200 0000000000000000 0000000041146200 0000000002c55000
[71494.142628] r20-23 000000000800000f 000000004f9ca578 000000000800000f 000000016b97b760
[71494.238670] r24-27 0000000305036750 0000000305036750 00000003050366c0 00000000410e1a00
[71494.334714] r28-31 00000000416bf380 000000005b2c0500 000000005b2c05b0 fffffffffffff89e
[71494.430759] sr00-03 0000000001df3800 0000000000000000 0000000000000000 0000000001df3800
[71494.527845] sr04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
[71494.624929]
[71494.642842] IASQ: 0000000000000000 0000000000000000 IAOQ: 000000004073b3e4 000000004073b3e8
[71494.743059] IIR: 03ffe01f ISR: 0000000010340000 IOR: 0000006cb02c0538
[71494.825560] CPU: 1 CR30: 00000000584d0ce0 CR31: ffffffffffffffff
[71494.908063] ORIG_R28: 0000000042e5e388
[71494.954102] IAOQ[0]: dentry_free+0x100/0x128
[71495.006400] IAOQ[1]: dentry_free+0x104/0x128
[71495.058695] RP(r2): __dentry_kill+0x284/0x2e8
[71495.112026] Backtrace:
[71495.140360] [<000000004073bea0>] __dentry_kill+0x284/0x2e8
[71495.207240] [<000000004073ec50>] shrink_dentry_list+0xe4/0x1e8
[71495.278284] [<000000004073f044>] shrink_dcache_parent+0x1a0/0x250
[71495.352452] [<000000004072489c>] vfs_rmdir+0x268/0x468
[71495.412868] [<000000004072ce08>] do_rmdir+0x39c/0x418
[71495.476826] [<000000004072d644>] sys_unlinkat+0xd0/0x110
[71495.540869] [<0000000040303e30>] syscall_exit+0x0/0x10
[71495.604330]
[71495.622248] ---[ end trace 0000000000000000 ]---
[71583.296513] Backtrace:
[71583.300330] [<0000000040834460>] proc_fill_cache+0x194/0x2b8
[71583.300330] [<000000004083d678>] proc_readfd_common+0x16c/0x3c0
[71583.300330] [<000000004083d904>] proc_readfd+0x38/0x68
[71583.300330] [<0000000040733cf0>] iterate_dir+0xec/0x2f0
[71583.300330] [<0000000040734ae4>] sys_getdents64+0xd8/0x1e0
[71583.300330] [<0000000040303e30>] syscall_exit+0x0/0x10
[71583.300330]
[71583.300330]
[71583.300330] Page fault: bad address: Code=15 (Data TLB miss fault) at addr 800002e76660997f
[71583.300330] CPU: 1 PID: 16466 Comm: ssh Tainted: G W E 5.18.10+ #28
[71583.300330] Hardware name: 9000/785/C8000
[71583.300330]
[71583.300330] YZrvWESTHLNXBCVMcbcbcbcbOGFRQPDI
[71583.300330] PSW: 00001000000001101111111100001111 Tainted: G W E
[71583.300330] r00-03 000000ff0806ff0f 0000000041146a00 000000004073fd50 0000000057d7c490
[71583.300330] r04-07 00000000410e1a00 8d00dae766609a0b 000000005f0ea600 0000000057d7c3b0
[71583.300330] r08-11 0000000000000001 0000000057d7c5e0 0000000000000002 00000000416bf380
[71583.300330] r12-15 0000000057d7c4c0 0000000000000000 0000000000000000 0000000041412ad0
[71583.300330] r16-19 000000005f0ea900 00000000a0c15ff4 00000000416483e0 000000000800000f
[71583.300330] r20-23 000000005f0eaa50 000000000800000f 0000000055555556 000000005f0ea9a0
[71583.300330] r24-27 0000000057d7c4c0 0000000057d7c3b0 000000005f0ea900 00000000410e1a00
[71583.300330] r28-31 0000000000000000 0000000057d7c5e0 0000000057d7c610 0000000305036771
[71583.300330] sr00-03 0000000001e35000 0000000000000000 0000000000000000 0000000001e35000
[71583.300330] sr04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
[71583.300330]
[71583.300330] IASQ: 0000000000000000 0000000000000000 IAOQ: 000000004073fdec 000000004073fdf0
[71583.300330] IIR: 48bc3ee9 ISR: 000000000d00d800 IOR: 800002e76660997f
[71583.300330] CPU: 1 CR30: 000000007fed0ce0 CR31: ffffffffffffffff
[71583.300330] ORIG_R28: 0000000000000cc0
[71583.300330] IAOQ[0]: d_alloc_parallel+0x220/0x748
[71583.300330] IAOQ[1]: d_alloc_parallel+0x224/0x748
[71583.300330] RP(r2): d_alloc_parallel+0x184/0x748
[71583.300330] Backtrace:
[71585.616335] [<0000000040834460>] proc_fill_cache+0x194/0x2b8
[71585.616335] [<000000004083d678>] proc_readfd_common+0x16c/0x3c0
[71585.616335] [<000000004083d904>] proc_readfd+0x38/0x68
[71585.616335] [<0000000040733cf0>] iterate_dir+0xec/0x2f0
[71585.616335] [<0000000040734ae4>] sys_getdents64+0xd8/0x1e0
[71585.616335] [<0000000040303e30>] syscall_exit+0x0/0x10
[71585.616335] <Cpu1> 0300109101e00000 0000000000000000 CC_PROCS_ENTRY_OUT
[71586.384319] Kernel panic - not syncing: Page fault: bad address