Hi again, > Hi linux-i2c. > > I'm daily-driving the linux from Linus' git (recompiling every day or > two), and yesterday it stopped booting. Below is the dmesg from > pstore. I'm currently at > https://github.com/torvalds/linux/commit/6b75d88fa81b122cce37ebf17428a849ccd3d0f1 > > My cmdline > BOOT_IMAGE=/vmlinuz-5.15.0+ > root=UUID=8759fa14-93a4-4dc1-87e6-aa6f5cdbb2ff ro nosplash > mitigations=off no_file_caps apparmor=0 selinux=0 audit=0 > amdgpu.ppfeaturemask=0xffffffff hugepagesz=1G default_hugepagesz=1G > hugepages=8 amd_iommu=on iommu=pt > vfio-pci.ids=10de:1f02,10de:10f9,10de:1ada,10de:1adb > drm.edid_firmware=DP-1:edid/sam-g9.edid isolcpus=1,2,5,6,17,18,21,22 > tsc=unstable > > I can try bisecting if you think it might, but I haven't done it so > far with the Linux kernel, so wondering if you can see anything from > the dmesg. I can also test some patches. This introduced the bug: 0c5c62ddf88c34bc83b66e4ac9beb2bb0e1887d4 https://github.com/torvalds/linux/commit/0c5c62ddf88c34bc83b66e4ac9beb2bb0e1887d4 CCing Bjorn from https://git.kernel.org/pub/scm/linux/kernel/git/helgaas/pci.git > The controller is on the RTX6800RX. > > But, when I compile out the designware driver, problems appear > elsewhere in the i2c code, so it seems to be something more > fundamental to the i2c subsystem. > > 0c:00.3 Serial bus controller [0c80]: Advanced Micro Devices, Inc. > [AMD/ATI] Navi 21 USB > Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Device 0408 > Flags: bus master, fast devsel, latency 0, IRQ 66, IOMMU group 30 > Memory at fbb20000 (64-bit, non-prefetchable) [size=16K] > Capabilities: [48] Vendor Specific Information: Len=08 <?> > Capabilities: [50] Power Management version 3 > Capabilities: [64] Express Endpoint, MSI 00 > Capabilities: [a0] MSI: Enable+ Count=1/2 Maskable- 64bit+ > Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 > Len=010 <?> > Capabilities: [150] Advanced Error Reporting > Capabilities: [2a0] Access Control Services > Kernel driver in use: i2c-designware-pci > Kernel modules: i2c_designware_pci > > grep 447 dmesg.txt > <1>[ 1.431369][ T447] BUG: kernel NULL pointer dereference, > address: 0000000000000540 > <1>[ 1.431371][ T447] #PF: supervisor read access in kernel mode > <1>[ 1.431375][ T447] #PF: error_code(0x0000) - not-present page > <6>[ 1.431378][ T447] PGD 0 P4D 0 > <4>[ 1.431384][ T447] Oops: 0000 [#1] PREEMPT SMP NOPTI > <4>[ 1.431388][ T447] CPU: 12 PID: 447 Comm: systemd-udevd > Tainted: G E 5.15.0+ #91 > <4>[ 1.431391][ T447] Hardware name: ASUS System Product Name/ROG > CROSSHAIR VIII FORMULA, BIOS 3801 07/30/2021 > <4>[ 1.431392][ T447] RIP: 0010:i2c_dw_pci_resume+0x8/0x40 > [i2c_designware_pci] > <4>[ 1.431399][ T447] Code: 00 00 00 00 66 66 2e 0f 1f 84 00 00 00 > 00 00 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 84 00 00 00 00 00 53 48 > 8b 5f 78 48 89 df <ff> 93 40 05 00 00 c6 83 c0 05 00 00 00 5b c3 66 66 > 2e 0f 1f 84 00 > <4>[ 1.431401][ T447] RSP: 0018:ffffb3e740a13ba8 EFLAGS: 00010286 > <4>[ 1.431403][ T447] RAX: 0000000000000000 RBX: 0000000000000000 > RCX: 0000000000000000 > <4>[ 1.431404][ T447] RDX: ffffffffc07311c0 RSI: 0000000000000000 > RDI: 0000000000000000 > <4>[ 1.431406][ T447] RBP: ffffa13f41ea3000 R08: 0000000000000002 > R09: 0000000000008008 > <4>[ 1.431406][ T447] R10: 0000000000000000 R11: 000000000000000c > R12: ffffffffc0732660 > <4>[ 1.431407][ T447] R13: 0000000000000000 R14: 0000000000000001 > R15: 0000000000000000 > <4>[ 1.431409][ T447] FS: 00007fb5a80228c0(0000) > GS:ffffa1464ed00000(0000) knlGS:0000000000000000 > <4>[ 1.431411][ T447] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > <4>[ 1.431412][ T447] CR2: 0000000000000540 CR3: 000000010e630000 > CR4: 0000000000750ee0 > <4>[ 1.431413][ T447] PKRU: 55555554 > <4>[ 1.431416][ T447] Call Trace: > <4>[ 1.431421][ T447] <TASK> > <4>[ 1.431422][ T447] pci_pm_runtime_resume+0xaa/0x100 > <4>[ 1.431433][ T447] ? pci_pm_thaw+0xc0/0xc0 > <4>[ 1.431434][ T447] __rpm_callback+0x3c/0x100 > <4>[ 1.431440][ T447] ? pci_pm_thaw+0xc0/0xc0 > <4>[ 1.431442][ T447] rpm_callback+0x54/0x80 > <4>[ 1.431443][ T447] ? pci_pm_thaw+0xc0/0xc0 > <4>[ 1.431445][ T447] rpm_resume+0x410/0x700 > <4>[ 1.431447][ T447] ? kernfs_activate+0x11/0x100 > <4>[ 1.431452][ T447] ? kernfs_add_one+0xdd/0x140 > <4>[ 1.431455][ T447] __pm_runtime_resume+0x45/0x80 > <4>[ 1.431457][ T447] pci_device_probe+0xa2/0x140 > <4>[ 1.431459][ T447] really_probe+0x1e4/0x400 > <4>[ 1.431464][ T447] __driver_probe_device+0xf9/0x180 > <4>[ 1.431466][ T447] driver_probe_device+0x19/0xc0 > <4>[ 1.431468][ T447] __driver_attach+0xb8/0x1c0 > <4>[ 1.431470][ T447] ? __device_attach_driver+0x100/0x100 > <4>[ 1.431472][ T447] ? __device_attach_driver+0x100/0x100 > <4>[ 1.431473][ T447] bus_for_each_dev+0x6c/0xc0 > <4>[ 1.431475][ T447] bus_add_driver+0x13f/0x200 > <4>[ 1.431478][ T447] driver_register+0x8a/0x100 > <4>[ 1.431481][ T447] ? 0xffffffffc0736000 > <4>[ 1.431482][ T447] do_one_initcall+0x44/0x1c0 > <4>[ 1.431487][ T447] ? load_module+0x9f8/0xac0 > <4>[ 1.431491][ T447] ? kmem_cache_alloc+0x14e/0x340 > <4>[ 1.431496][ T447] do_init_module+0x51/0x240 > <4>[ 1.431498][ T447] __do_sys_finit_module+0xaf/0x140 > <4>[ 1.431501][ T447] do_syscall_64+0x35/0x80 > <4>[ 1.431505][ T447] entry_SYSCALL_64_after_hwframe+0x44/0xae > <4>[ 1.431509][ T447] RIP: 0033:0x7fb5a84d97b9 > <4>[ 1.431511][ T447] Code: 48 8d 3d 5a a0 0c 00 0f 05 eb a5 66 0f > 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c > 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 87 56 0c 00 f7 > d8 64 89 01 48 > <4>[ 1.431512][ T447] RSP: 002b:00007ffcafbc6258 EFLAGS: 00000246 > ORIG_RAX: 0000000000000139 > <4>[ 1.431513][ T447] RAX: ffffffffffffffda RBX: 0000563806da65d0 > RCX: 00007fb5a84d97b9 > <4>[ 1.431514][ T447] RDX: 0000000000000000 RSI: 00007fb5a8679eed > RDI: 000000000000000c > <4>[ 1.431515][ T447] RBP: 0000000000020000 R08: 0000000000000000 > R09: 0000563806daecc0 > <4>[ 1.431516][ T447] R10: 000000000000000c R11: 0000000000000246 > R12: 00007fb5a8679eed > <4>[ 1.431518][ T447] R13: 0000000000000000 R14: 0000563806d9ca00 > R15: 0000563806da65d0 > <4>[ 1.431520][ T447] </TASK> > <4>[ 1.431520][ T447] Modules linked in: i2c_designware_pci(E+) > aesni_intel(E+) i2c_piix4(E) i2c_designware_core(E) backlight(E) > xhci_pci(E+) ice(E+) xhci_pci_renesas(E) nvme(E) wmi(E) > <4>[ 1.431525][ T447] CR2: 0000000000000540 > <4>[ 1.431528][ T447] ---[ end trace d5eecb89365b3d11 ]--- > <4>[ 1.580240][ T447] RIP: 0010:i2c_dw_pci_resume+0x8/0x40 > [i2c_designware_pci] > <4>[ 1.587403][ T447] Code: 00 00 00 00 66 66 2e 0f 1f 84 00 00 00 > 00 00 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 84 00 00 00 00 00 53 48 > 8b 5f 78 48 89 df <ff> 93 40 05 00 00 c6 83 c0 05 00 00 00 5b c3 66 66 > 2e 0f 1f 84 00 > <4>[ 1.588236][ T447] RSP: 0018:ffffb3e740a13ba8 EFLAGS: 00010286 > <4>[ 1.589626][ T447] > <4>[ 1.589626][ T447] RAX: 0000000000000000 RBX: 0000000000000000 > RCX: 0000000000000000 > <4>[ 1.590245][ T447] RDX: ffffffffc07311c0 RSI: 0000000000000000 > RDI: 0000000000000000 > <4>[ 1.590245][ T447] RBP: ffffa13f41ea3000 R08: 0000000000000002 > R09: 0000000000008008 > <4>[ 1.590245][ T447] R10: 0000000000000000 R11: 000000000000000c > R12: ffffffffc0732660 > <4>[ 1.590245][ T447] R13: 0000000000000000 R14: 0000000000000001 > R15: 0000000000000000 > <4>[ 1.594239][ T447] FS: 00007fb5a80228c0(0000) > GS:ffffa1464ed00000(0000) knlGS:0000000000000000 > <4>[ 1.594239][ T447] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > <4>[ 1.594239][ T447] CR2: 0000000000000540 CR3: 000000010e630000 > CR4: 0000000000750ee0 > <4>[ 1.596236][ T447] PKRU: 55555554 > <1>[ 1.431369][ T447] BUG: kernel NULL pointer dereference, > address: 0000000000000540 > <1>[ 1.431371][ T447] #PF: supervisor read access in kernel mode > <1>[ 1.431375][ T447] #PF: error_code(0x0000) - not-present page > <6>[ 1.431378][ T447] PGD 0 P4D 0 > <4>[ 1.431384][ T447] Oops: 0000 [#1] PREEMPT SMP NOPTI > <4>[ 1.431388][ T447] CPU: 12 PID: 447 Comm: systemd-udevd > Tainted: G E 5.15.0+ #91 > <4>[ 1.431391][ T447] Hardware name: ASUS System Product Name/ROG > CROSSHAIR VIII FORMULA, BIOS 3801 07/30/2021 > <4>[ 1.431392][ T447] RIP: 0010:i2c_dw_pci_resume+0x8/0x40 > [i2c_designware_pci] > <4>[ 1.431399][ T447] Code: 00 00 00 00 66 66 2e 0f 1f 84 00 00 00 > 00 00 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 84 00 00 00 00 00 53 48 > 8b 5f 78 48 89 df <ff> 93 40 05 00 00 c6 83 c0 05 00 00 00 5b c3 66 66 > 2e 0f 1f 84 00 > <4>[ 1.431401][ T447] RSP: 0018:ffffb3e740a13ba8 EFLAGS: 00010286 > <4>[ 1.431403][ T447] RAX: 0000000000000000 RBX: 0000000000000000 > RCX: 0000000000000000 > <4>[ 1.431404][ T447] RDX: ffffffffc07311c0 RSI: 0000000000000000 > RDI: 0000000000000000 > <4>[ 1.431406][ T447] RBP: ffffa13f41ea3000 R08: 0000000000000002 > R09: 0000000000008008 > <4>[ 1.431406][ T447] R10: 0000000000000000 R11: 000000000000000c > R12: ffffffffc0732660 > <4>[ 1.431407][ T447] R13: 0000000000000000 R14: 0000000000000001 > R15: 0000000000000000 > <4>[ 1.431409][ T447] FS: 00007fb5a80228c0(0000) > GS:ffffa1464ed00000(0000) knlGS:0000000000000000 > <4>[ 1.431411][ T447] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > <4>[ 1.431412][ T447] CR2: 0000000000000540 CR3: 000000010e630000 > CR4: 0000000000750ee0 > <4>[ 1.431413][ T447] PKRU: 55555554 > <4>[ 1.431416][ T447] Call Trace: > <4>[ 1.431421][ T447] <TASK> > <4>[ 1.431422][ T447] pci_pm_runtime_resume+0xaa/0x100 > <4>[ 1.431433][ T447] ? pci_pm_thaw+0xc0/0xc0 > <4>[ 1.431434][ T447] __rpm_callback+0x3c/0x100 > <4>[ 1.431440][ T447] ? pci_pm_thaw+0xc0/0xc0 > <4>[ 1.431442][ T447] rpm_callback+0x54/0x80 > <4>[ 1.431443][ T447] ? pci_pm_thaw+0xc0/0xc0 > <4>[ 1.431445][ T447] rpm_resume+0x410/0x700 > <4>[ 1.431447][ T447] ? kernfs_activate+0x11/0x100 > <4>[ 1.431452][ T447] ? kernfs_add_one+0xdd/0x140 > <4>[ 1.431455][ T447] __pm_runtime_resume+0x45/0x80 > <4>[ 1.431457][ T447] pci_device_probe+0xa2/0x140 > <4>[ 1.431459][ T447] really_probe+0x1e4/0x400 > <4>[ 1.431464][ T447] __driver_probe_device+0xf9/0x180 > <4>[ 1.431466][ T447] driver_probe_device+0x19/0xc0 > <4>[ 1.431468][ T447] __driver_attach+0xb8/0x1c0 > <4>[ 1.431470][ T447] ? __device_attach_driver+0x100/0x100 > <4>[ 1.431472][ T447] ? __device_attach_driver+0x100/0x100 > <4>[ 1.431473][ T447] bus_for_each_dev+0x6c/0xc0 > <4>[ 1.431475][ T447] bus_add_driver+0x13f/0x200 > <4>[ 1.431478][ T447] driver_register+0x8a/0x100 > <4>[ 1.431481][ T447] ? 0xffffffffc0736000 > <4>[ 1.431482][ T447] do_one_initcall+0x44/0x1c0 > <4>[ 1.431487][ T447] ? load_module+0x9f8/0xac0 > <4>[ 1.431491][ T447] ? kmem_cache_alloc+0x14e/0x340 > <4>[ 1.431496][ T447] do_init_module+0x51/0x240 > <4>[ 1.431498][ T447] __do_sys_finit_module+0xaf/0x140 > <4>[ 1.431501][ T447] do_syscall_64+0x35/0x80 > <4>[ 1.431505][ T447] entry_SYSCALL_64_after_hwframe+0x44/0xae > <4>[ 1.431509][ T447] RIP: 0033:0x7fb5a84d97b9 > <4>[ 1.431511][ T447] Code: 48 8d 3d 5a a0 0c 00 0f 05 eb a5 66 0f > 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c > 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 87 56 0c 00 f7 > d8 64 89 01 48 > <4>[ 1.431512][ T447] RSP: 002b:00007ffcafbc6258 EFLAGS: 00000246 > ORIG_RAX: 0000000000000139 > <4>[ 1.431513][ T447] RAX: ffffffffffffffda RBX: 0000563806da65d0 > RCX: 00007fb5a84d97b9 > <4>[ 1.431514][ T447] RDX: 0000000000000000 RSI: 00007fb5a8679eed > RDI: 000000000000000c > <4>[ 1.431515][ T447] RBP: 0000000000020000 R08: 0000000000000000 > R09: 0000563806daecc0 > <4>[ 1.431516][ T447] R10: 000000000000000c R11: 0000000000000246 > R12: 00007fb5a8679eed > <4>[ 1.431518][ T447] R13: 0000000000000000 R14: 0000563806d9ca00 > R15: 0000563806da65d0 > <4>[ 1.431520][ T447] </TASK> > <4>[ 1.431520][ T447] Modules linked in: i2c_designware_pci(E+) > aesni_intel(E+) i2c_piix4(E) i2c_designware_core(E) backlight(E) > xhci_pci(E+) ice(E+) xhci_pci_renesas(E) nvme(E) wmi(E) > <4>[ 1.431525][ T447] CR2: 0000000000000540 > <4>[ 1.431528][ T447] ---[ end trace d5eecb89365b3d11 ]--- > > Hope this helps. -- Robert Święcki