amdgpu triggers two resource sanity checks on every boot with NAVI22 GPU

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hey,
not sure this is a bug, but since I have installed a new AMD Radeon RX 6700 XT (Navy Flounder/Navi22) the amdgpu module triggers two "resource sanity checks" on each boot (by the way: not sure what is supposed to cause the "G" taint to the kernel – according to `lsmod | awk '{print $1}' | xargs modinfo -F intree` all modules say they are "intree" and this is a clean build of 5.18 from kernel.org with no modifications; but I do run an unsigned kernel, so the E seems right):

[ +0.000006] resource sanity check: requesting [mem 0x7800000000-0x7bffffffff], which spans more than BOOTFB [mem 0x7800000000-0x78002fffff]
[  +0.000003] ------------[ cut here ]------------
[ +0.000000] WARNING: CPU: 23 PID: 318 at arch/x86/mm/pat/memtype.c:776 memtype_reserve_io+0x115/0x130 [ +0.000003] Modules linked in: amdgpu(E+) sd_mod(E+) gpu_sched(E) t10_pi(E) i2c_algo_bit(E) drm_dp_helper(E) crc64_rocksoft_generic(E) cec(E) crc64_rocksoft(E) rc_core(E) crc_t10dif(E) sr_mod(E+) drm_ttm_helper(E) crct10dif_generic(E) crc64(E) cdrom(E) ttm(E) hid_generic(E) usbhid(E) hid(E) crct10dif_pclmul(E) c> [ +0.000013] CPU: 23 PID: 318 Comm: systemd-udevd Tainted: G E 5.18.0-citadel #1 [ +0.000002] Hardware name: ASUS System Product Name/TUF GAMING B550-PLUS WIFI II, BIOS 2604 02/25/2022
[  +0.000000] RIP: 0010:memtype_reserve_io+0x115/0x130
[ +0.000002] Code: 89 ef e8 5e f7 ff ff e9 68 ff ff ff 8b 54 24 0c 4c 89 f6 48 89 ef e8 0a fe ff ff 85 c0 78 ca 8b 44 24 0c 89 03 e9 4a ff ff ff <0f> 0b e9 29 ff ff ff e8 af 18 90 00 66 66 2e 0f 1f 84 00 00 00 00
[  +0.000000] RSP: 0018:ffffb342c098fad0 EFLAGS: 00010286
[  +0.000001] RAX: 00000000ffffffff RBX: ffffb342c098fb24 RCX: 0000000000000000
[  +0.000001] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  +0.000000] RBP: 0000007800000000 R08: 0000000000000000 R09: 0000000000000000
[  +0.000001] R10: 0000000000000000 R11: 0000000000000000 R12: 0000007c00000000
[  +0.000000] R13: 0000000000000001 R14: 0000000400000000 R15: ffff8cfb918a6040
[ +0.000001] FS: 00007f2b807a38c0(0000) GS:ffff8d024edc0000(0000) knlGS:0000000000000000
[  +0.000001] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  +0.000000] CR2: 0000562bbd54b0f8 CR3: 000000013232a000 CR4: 0000000000750ee0
[  +0.000001] PKRU: 55555554
[  +0.000000] Call Trace:
[  +0.000001]  <TASK>
[  +0.000001]  ? _dev_info+0x6c/0x96
[  +0.000003]  arch_io_reserve_memtype_wc+0x2e/0x50
[  +0.000001]  amdgpu_bo_init+0x2b/0x70 [amdgpu]
[  +0.000111]  gmc_v10_0_sw_init+0x2ef/0x460 [amdgpu]
[  +0.000107]  amdgpu_device_init.cold+0x1434/0x1be2 [amdgpu]
[  +0.000131]  ? _raw_spin_unlock_irqrestore+0x23/0x50
[  +0.000002]  ? pci_bus_read_config_word+0x49/0x90
[  +0.000004]  amdgpu_driver_load_kms+0x15/0x120 [amdgpu]
[  +0.000105]  amdgpu_pci_probe+0x1cb/0x3e0 [amdgpu]
[  +0.000097]  local_pci_probe+0x45/0x90
[  +0.000002]  ? pci_match_device+0xdf/0x140
[  +0.000002]  pci_device_probe+0xc5/0x240
[  +0.000002]  really_probe+0x1aa/0x390
[  +0.000002]  __driver_probe_device+0x112/0x190
[  +0.000001]  driver_probe_device+0x1e/0x90
[  +0.000001]  __driver_attach+0xc8/0x1e0
[  +0.000001]  ? __device_attach_driver+0xf0/0xf0
[  +0.000001]  ? __device_attach_driver+0xf0/0xf0
[  +0.000001]  bus_for_each_dev+0x7c/0xd0
[  +0.000001]  bus_add_driver+0x154/0x200
[  +0.000001]  driver_register+0x8f/0xf0
[  +0.000001]  ? 0xffffffffc08c0000
[  +0.000001]  do_one_initcall+0x44/0x210
[  +0.000002]  ? kmem_cache_alloc_trace+0x177/0x2b0
[  +0.000002]  do_init_module+0x4c/0x260
[  +0.000002]  __do_sys_finit_module+0xb7/0x130
[  +0.000002]  do_syscall_64+0x3b/0x90
[  +0.000002]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[  +0.000002] RIP: 0033:0x7f2b80c5af79
[ +0.000001] Code: 48 8d 3d da db 0d 00 0f 05 eb a5 66 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c7 9e 0d 00 f7 d8 64 89 01 48
[  +0.000000] RSP: 002b:00007fffd1afde58 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[  +0.000001] RAX: ffffffffffffffda RBX: 0000562bbd54f130 RCX: 00007f2b80c5af79
[  +0.000001] RDX: 0000000000000000 RSI: 0000562bbd548890 RDI: 0000000000000018
[  +0.000000] RBP: 0000000000020000 R08: 0000000000000000 R09: 0000562bbd544e20
[  +0.000001] R10: 0000000000000018 R11: 0000000000000246 R12: 0000562bbd548890
[  +0.000000] R13: 0000000000000000 R14: 0000562bbd546d70 R15: 0000562bbd54f130
[  +0.000001]  </TASK>
[  +0.000001] ---[ end trace 0000000000000000 ]---

and then right below that:

[ +0.000021] resource sanity check: requesting [mem 0x7800000000-0x7afeffffff], which spans more than BOOTFB [mem 0x7800000000-0x78002fffff]
[  +0.000001] caller amdgpu_ttm_init+0x127/0x450 [amdgpu] mapping multiple BARs

Can this be fixed to make the boot slient again? Or is there some setting in the UEFI I should change? I do have resizable BAR active and I do have the SR-IOV setting set to "auto". The mainboard runs a B550 chipset with a Ryzen 9 5900X plugged into it.
There should be enough RAM available, since I have 32 GB installed.

Let me know if you need more information.

Cheers,
Kai

P.S.: Attached to this e-mail you find the dmesg filtered for amdgpu and drm messages.
[  +0.047180] [drm] amdgpu kernel modesetting enabled.
[  +0.002899] amdgpu: Ignoring ACPI CRAT on non-APU system
[  +0.000002] amdgpu: Virtual CRAT table created for CPU
[  +0.000009] amdgpu: Topology: Add CPU node
[  +0.000043] amdgpu 0000:07:00.0: vgaarb: deactivate vga console
[  +0.000021] amdgpu 0000:07:00.0: enabling device (0006 -> 0007)
[  +0.000024] [drm] initializing kernel modesetting (NAVY_FLOUNDER 0x1002:0x73DF 0x1DA2:0xE445 0xC5).
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: Trusted Memory Zone (TMZ) feature not supported
[  +0.000005] [drm] register mmio base: 0xFCB00000
[  +0.000001] [drm] register mmio size: 1048576
[  +0.001741] [drm] add ip block number 0 <nv_common>
[  +0.000001] [drm] add ip block number 1 <gmc_v10_0>
[  +0.000001] [drm] add ip block number 2 <navi10_ih>
[  +0.000000] [drm] add ip block number 3 <psp>
[  +0.000000] [drm] add ip block number 4 <smu>
[  +0.000001] [drm] add ip block number 5 <dm>
[  +0.000000] [drm] add ip block number 6 <gfx_v10_0>
[  +0.000000] [drm] add ip block number 7 <sdma_v5_2>
[  +0.000001] [drm] add ip block number 8 <vcn_v3_0>
[  +0.000000] [drm] add ip block number 9 <jpeg_v3_0>
[  +0.000008] amdgpu 0000:07:00.0: amdgpu: Fetched VBIOS from VFCT
[  +0.000001] amdgpu: ATOM BIOS: 113-D5122100-S03
[  +0.000004] [drm] VCN(0) decode is enabled in VM mode
[  +0.000000] [drm] VCN(0) encode is enabled in VM mode
[  +0.000001] [drm] JPEG decode is enabled in VM mode
[  +0.000022] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
[  +0.000003] amdgpu 0000:07:00.0: amdgpu: VRAM: 12272M 0x0000008000000000 - 0x00000082FEFFFFFF (12272M used)
[  +0.000002] amdgpu 0000:07:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: AGP: 267894784M 0x0000008400000000 - 0x0000FFFFFFFFFFFF
[  +0.000006] resource sanity check: requesting [mem 0x7800000000-0x7bffffffff], which spans more than BOOTFB [mem 0x7800000000-0x78002fffff]
[  +0.000003] ------------[ cut here ]------------
[  +0.000000] WARNING: CPU: 23 PID: 318 at arch/x86/mm/pat/memtype.c:776 memtype_reserve_io+0x115/0x130
[  +0.000003] Modules linked in: amdgpu(E+) sd_mod(E+) gpu_sched(E) t10_pi(E) i2c_algo_bit(E) drm_dp_helper(E) crc64_rocksoft_generic(E) cec(E) crc64_rocksoft(E) rc_core(E) crc_t10dif(E) sr_mod(E+) drm_ttm_helper(E) crct10dif_generic(E) crc64(E) cdrom(E) ttm(E) hid_generic(E) usbhid(E) hid(E) crct10dif_pclmul(E) c>
[  +0.000013] CPU: 23 PID: 318 Comm: systemd-udevd Tainted: G            E     5.18.0-citadel #1
[  +0.000002] Hardware name: ASUS System Product Name/TUF GAMING B550-PLUS WIFI II, BIOS 2604 02/25/2022
[  +0.000000] RIP: 0010:memtype_reserve_io+0x115/0x130
[  +0.000002] Code: 89 ef e8 5e f7 ff ff e9 68 ff ff ff 8b 54 24 0c 4c 89 f6 48 89 ef e8 0a fe ff ff 85 c0 78 ca 8b 44 24 0c 89 03 e9 4a ff ff ff <0f> 0b e9 29 ff ff ff e8 af 18 90 00 66 66 2e 0f 1f 84 00 00 00 00
[  +0.000000] RSP: 0018:ffffb342c098fad0 EFLAGS: 00010286
[  +0.000001] RAX: 00000000ffffffff RBX: ffffb342c098fb24 RCX: 0000000000000000
[  +0.000001] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  +0.000000] RBP: 0000007800000000 R08: 0000000000000000 R09: 0000000000000000
[  +0.000001] R10: 0000000000000000 R11: 0000000000000000 R12: 0000007c00000000
[  +0.000000] R13: 0000000000000001 R14: 0000000400000000 R15: ffff8cfb918a6040
[  +0.000001] FS:  00007f2b807a38c0(0000) GS:ffff8d024edc0000(0000) knlGS:0000000000000000
[  +0.000001] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  +0.000000] CR2: 0000562bbd54b0f8 CR3: 000000013232a000 CR4: 0000000000750ee0
[  +0.000001] PKRU: 55555554
[  +0.000000] Call Trace:
[  +0.000001]  <TASK>
[  +0.000001]  ? _dev_info+0x6c/0x96
[  +0.000003]  arch_io_reserve_memtype_wc+0x2e/0x50
[  +0.000001]  amdgpu_bo_init+0x2b/0x70 [amdgpu]
[  +0.000111]  gmc_v10_0_sw_init+0x2ef/0x460 [amdgpu]
[  +0.000107]  amdgpu_device_init.cold+0x1434/0x1be2 [amdgpu]
[  +0.000131]  ? _raw_spin_unlock_irqrestore+0x23/0x50
[  +0.000002]  ? pci_bus_read_config_word+0x49/0x90
[  +0.000004]  amdgpu_driver_load_kms+0x15/0x120 [amdgpu]
[  +0.000105]  amdgpu_pci_probe+0x1cb/0x3e0 [amdgpu]
[  +0.000097]  local_pci_probe+0x45/0x90
[  +0.000002]  ? pci_match_device+0xdf/0x140
[  +0.000002]  pci_device_probe+0xc5/0x240
[  +0.000002]  really_probe+0x1aa/0x390
[  +0.000002]  __driver_probe_device+0x112/0x190
[  +0.000001]  driver_probe_device+0x1e/0x90
[  +0.000001]  __driver_attach+0xc8/0x1e0
[  +0.000001]  ? __device_attach_driver+0xf0/0xf0
[  +0.000001]  ? __device_attach_driver+0xf0/0xf0
[  +0.000001]  bus_for_each_dev+0x7c/0xd0
[  +0.000001]  bus_add_driver+0x154/0x200
[  +0.000001]  driver_register+0x8f/0xf0
[  +0.000001]  ? 0xffffffffc08c0000
[  +0.000001]  do_one_initcall+0x44/0x210
[  +0.000002]  ? kmem_cache_alloc_trace+0x177/0x2b0
[  +0.000002]  do_init_module+0x4c/0x260
[  +0.000002]  __do_sys_finit_module+0xb7/0x130
[  +0.000002]  do_syscall_64+0x3b/0x90
[  +0.000002]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[  +0.000002] RIP: 0033:0x7f2b80c5af79
[  +0.000001] Code: 48 8d 3d da db 0d 00 0f 05 eb a5 66 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c7 9e 0d 00 f7 d8 64 89 01 48
[  +0.000000] RSP: 002b:00007fffd1afde58 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[  +0.000001] RAX: ffffffffffffffda RBX: 0000562bbd54f130 RCX: 00007f2b80c5af79
[  +0.000001] RDX: 0000000000000000 RSI: 0000562bbd548890 RDI: 0000000000000018
[  +0.000000] RBP: 0000000000020000 R08: 0000000000000000 R09: 0000562bbd544e20
[  +0.000001] R10: 0000000000000018 R11: 0000000000000246 R12: 0000562bbd548890
[  +0.000000] R13: 0000000000000000 R14: 0000562bbd546d70 R15: 0000562bbd54f130
[  +0.000001]  </TASK>
[  +0.000001] ---[ end trace 0000000000000000 ]---
[  +0.000003] [drm] Detected VRAM RAM=12272M, BAR=16384M
[  +0.000001] [drm] RAM width 192bits GDDR6
[  +0.000021] resource sanity check: requesting [mem 0x7800000000-0x7afeffffff], which spans more than BOOTFB [mem 0x7800000000-0x78002fffff]
[  +0.000001] caller amdgpu_ttm_init+0x127/0x450 [amdgpu] mapping multiple BARs
[  +0.000105] [drm] amdgpu: 12272M of VRAM memory ready
[  +0.000001] [drm] amdgpu: 12272M of GTT memory ready.
[  +0.000007] [drm] GART: num cpu pages 131072, num gpu pages 131072
[  +0.000418] [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
[  +0.000120] amdgpu 0000:07:00.0: amdgpu: PSP runtime database doesn't exist
[  +0.152972] amdgpu 0000:07:00.0: amdgpu: STB initialized to 2048 entries
[  +0.000057] [drm] Loading DMUB firmware via PSP: version=0x0202000F
[  +0.000525] [drm] use_doorbell being set to: [true]
[  +0.000010] [drm] use_doorbell being set to: [true]
[  +0.000066] [drm] Found VCN firmware Version ENC: 1.20 DEC: 2 VEP: 0 Revision: 5
[  +0.000004] amdgpu 0000:07:00.0: amdgpu: Will use PSP to load VCN firmware
[  +0.007844] [drm] reserve 0xa00000 from 0x82fe000000 for PSP TMR
[  +0.040836] amdgpu 0000:07:00.0: amdgpu: RAS: optional ras ta ucode is not available
[  +0.014216] amdgpu 0000:07:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[  +0.000021] amdgpu 0000:07:00.0: amdgpu: smu driver if version = 0x0000000e, smu fw if version = 0x00000012, smu fw program = 0, version = 0x00413500 (65.53.0)
[  +0.000003] amdgpu 0000:07:00.0: amdgpu: SMU driver if version not matched
[  +0.000002] amdgpu 0000:07:00.0: amdgpu: use vbios provided pptable
[  +0.056540] amdgpu 0000:07:00.0: amdgpu: SMU is initialized successfully!
[  +0.000170] [drm] Display Core initialized with v3.2.177!
[  +0.001140] [drm] DMUB hardware initialized: version=0x0202000F
[  +0.248727] [drm] REG_WAIT timeout 1us * 100000 tries - mpc2_assert_idle_mpcc line:478
[  +0.233156] [drm] kiq ring mec 2 pipe 1 q 0
[  +0.004805] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[  +0.000306] [drm] JPEG decode initialized successfully.
[  +0.001675] kfd kfd: amdgpu: Allocated 3969056 bytes on gart
[  +0.000699] amdgpu: SRAT table not found
[  +0.000001] amdgpu: Virtual CRAT table created for GPU
[  +0.000134] amdgpu: Topology: Add dGPU node [0x73df:0x1002]
[  +0.000003] kfd kfd: amdgpu: added device 1002:73df
[  +0.000015] amdgpu 0000:07:00.0: amdgpu: SE 2, SH per SE 2, CU per SH 10, active_cu_number 40
[  +0.000036] amdgpu 0000:07:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
[  +0.000000] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
[  +0.000001] amdgpu 0000:07:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
[  +0.000768] amdgpu 0000:07:00.0: amdgpu: Using BACO for runtime pm
[  +0.000189] [drm] Initialized amdgpu 3.46.0 20150101 for 0000:07:00.0 on minor 0
[  +0.005771] fbcon: amdgpudrmfb (fb0) is primary device
[  +0.000001] fbcon: Deferring console take-over
[  +0.000001] amdgpu 0000:07:00.0: [drm] fb0: amdgpudrmfb frame buffer device
[  +0.110253] fbcon: Taking over console
[  +0.000074] [drm] DSC precompute is not needed.
[  +0.152161] [drm] REG_WAIT timeout 1us * 100000 tries - mpc2_assert_idle_mpcc line:478

Attachment: OpenPGP_signature
Description: OpenPGP digital signature


[Index of Archives]     [Linux DRI Users]     [Linux Intel Graphics]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [XFree86]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux