https://bugzilla.kernel.org/show_bug.cgi?id=196117

--- Comment #3 from Paul K. Gerke (paulkgerke@xxxxxxxxxxxx) ---
Ah, thank you!

Really quick @Michel Dänzer: "Sounds like maybe some files under /lib/firmware/amdgpu/ are missing in the initrd": I thought the same, so I baked the important firmware blobs right into the kernel, just in case there would be some issues. I will dig up the settings that I used tonight when I have some more time.

Anyway, the other tip allowed me to extract the (crash) logs using my serial console. The machine freezes up just as usual after modprobing. I did the following:

- Boot with: linux /vmlinuz... root=UUID=xxxx ro debug ignore_loglevel modprobe.blacklist=amdgpu console=ttyUSB0,115200
- (I disabled all X-server functionality before so that it would not screw with any graphics settings)
- I logged in and issued "sudo modprobe amdgpu", as suggested.
- I recorded the following logs on my second system.

[ 279.264565] [drm] amdgpu kernel modesetting enabled.
[ 279.271228] AMD IOMMUv2 driver by Joerg Roedel <jroedel@xxxxxxx>
[ 279.271248] AMD IOMMUv2 functionality not available on this system
[ 279.279814] CRAT table not found
[ 279.279837] Finished initializing topology ret=0
[ 279.279877] kfd kfd: Initialized module
[ 279.280153] checking generic (c0000000 760000) vs hw (c0000000 10000000)
[ 279.280163] fb: switching to amdgpudrmfb from VESA VGA
[ 279.280383] Console: switching to colour dummy device 80x25
[ 279.280941] [drm] initializing kernel modesetting (POLARIS10 0x1002:0x67DF 0x1043:0x0505 0xC7).
[ 279.280960] [drm] register mmio base: 0xDFFC0000
[ 279.280962] [drm] register mmio size: 262144
[ 279.280968] [drm] doorbell mmio base: 0xDEA00000
[ 279.280971] [drm] doorbell mmio size: 2097152
[ 279.280983] [drm] probing gen 2 caps for device 10de:778 = 313d02/0
[ 279.280989] [drm] probing mlw for device 10de:778 = 313d02
[ 279.280997] [drm] UVD is enabled in VM mode
[ 279.280999] [drm] VCE enabled in VM mode
[ 279.303168] [drm] BIOS signature incorrect 1 1
[ 279.303174] amdgpu 0000:02:00.0: Invalid PCI ROM header signature: expecting 0xaa55, got 0x0000
[ 279.303781] ATOM BIOS: 67DFHB.15.50.0.0.AS18
[ 279.303792] [drm] GPU post is not needed
[ 279.304306] amdgpu 0000:02:00.0: VRAM: 8192M 0x0000000000000000 - 0x00000001FFFFFFFF (8192M used)
[ 279.304310] amdgpu 0000:02:00.0: GTT: 8192M 0x0000000200000000 - 0x00000003FFFFFFFF
[ 279.304314] [drm] Detected VRAM RAM=8192M, BAR=256M
[ 279.304316] [drm] RAM width 256bits GDDR5
[ 279.304414] [TTM] Zone kernel: Available graphics memory: 8214392 kiB
[ 279.304416] [TTM] Zone dma32: Available graphics memory: 2097152 kiB
[ 279.304418] [TTM] Initializing pool allocator
[ 279.304423] [TTM] Initializing DMA pool allocator
[ 279.304457] [drm] amdgpu: 8192M of VRAM memory ready
[ 279.304459] [drm] amdgpu: 8192M of GTT memory ready.
[ 279.304468] [drm] GART: num cpu pages 2097152, num gpu pages 2097152
[ 279.305642] [drm] PCIE GART of 8192M enabled (table at 0x0000000000040000).
[ 279.305653] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[ 279.305654] [drm] Driver supports precise vblank timestamp query.
[ 279.305687] amdgpu 0000:02:00.0: amdgpu: using MSI.
[ 279.305705] [drm] amdgpu: irq initialized.
[ 279.305728] amdgpu: [powerplay] amdgpu: powerplay sw initialized
[ 279.306759] [drm] AMDGPU Display Connectors
[ 279.306762] [drm] Connector 0:
[ 279.306765] [drm] DP-1
[ 279.306766] [drm] HPD1
[ 279.306770] [drm] DDC: 0x486c 0x486c 0x486d 0x486d 0x486e 0x486e 0x486f 0x486f
[ 279.306772] [drm] Encoders:
[ 279.306774] [drm] DFP1: INTERNAL_UNIPHY1
[ 279.306776] [drm] Connector 1:
[ 279.306777] [drm] DP-2
[ 279.306780] [drm] HPD5
[ 279.306782] [drm] DDC: 0x4874 0x4874 0x4875 0x4875 0x4876 0x4876 0x4877 0x4877
[ 279.306783] [drm] Encoders:
[ 279.306785] [drm] DFP2: INTERNAL_UNIPHY1
[ 279.306786] [drm] Connector 2:
[ 279.306788] [drm] HDMI-A-1
[ 279.306791] [drm] HPD6
[ 279.306793] [drm] DDC: 0x4868 0x4868 0x4869 0x4869 0x486a 0x486a 0x486b 0x486b
[ 279.306795] [drm] Encoders:
[ 279.306797] [drm] DFP3: INTERNAL_UNIPHY2
[ 279.306798] [drm] Connector 3:
[ 279.306800] [drm] HDMI-A-2
[ 279.306802] [drm] HPD4
[ 279.306803] [drm] DDC: 0x4870 0x4870 0x4871 0x4871 0x4872 0x4872 0x4873 0x4873
[ 279.306804] [drm] Encoders:
[ 279.306806] [drm] DFP4: INTERNAL_UNIPHY2
[ 279.306809] [drm] Connector 4:
[ 279.306812] [drm] DVI-D-1
[ 279.306813] [drm] HPD3
[ 279.306816] [drm] DDC: 0x487c 0x487c 0x487d 0x487d 0x487e 0x487e 0x487f 0x487f
[ 279.306818] [drm] Encoders:
[ 279.306820] [drm] DFP5: INTERNAL_UNIPHY
[ 279.306870] amdgpu 0000:02:00.0: fence driver on ring 0 use gpu addr 0x0000000200000008, cpu addr 0xffff8a3867e70008
[ 279.307375] amdgpu 0000:02:00.0: fence driver on ring 1 use gpu addr 0x0000000200000018, cpu addr 0xffff8a3867e70018
[ 279.307435] amdgpu 0000:02:00.0: fence driver on ring 2 use gpu addr 0x0000000200000028, cpu addr 0xffff8a3867e70028
[ 279.307478] amdgpu 0000:02:00.0: fence driver on ring 3 use gpu addr 0x0000000200000038, cpu addr 0xffff8a3867e70038
[ 279.307513] amdgpu 0000:02:00.0: fence driver on ring 4 use gpu addr 0x0000000200000048, cpu addr 0xffff8a3867e70048
[ 279.307541] amdgpu 0000:02:00.0: fence driver on ring 5 use gpu addr 0x0000000200000058, cpu addr 0xffff8a3867e70058
[ 279.307727] amdgpu 0000:02:00.0: fence driver on ring 6 use gpu addr 0x0000000200000068, cpu addr 0xffff8a3867e70068
[ 279.308100] amdgpu 0000:02:00.0: fence driver on ring 7 use gpu addr 0x0000000200000078, cpu addr 0xffff8a3867e70078
[ 279.308152] amdgpu 0000:02:00.0: fence driver on ring 8 use gpu addr 0x0000000200000088, cpu addr 0xffff8a3867e70088
[ 279.308961] amdgpu 0000:02:00.0: fence driver on ring 9 use gpu addr 0x0000000200000098, cpu addr 0xffff8a3867e70098
[
[ 279.348966] amdgpu: [powerplay] [AVFS] Something is broken. See log!
[ 279.711657] amdgpu: [powerplay]
[ 279.711657] failed to send message 254 ret is 0
[ 279.711681] amdgpu: [powerplay] Can't find requested voltage id in vdd_dep_on_sclk table!
[ 279.854919] amdgpu: [powerplay] DPM is already running
[ 280.134978] clocksource: timekeeping watchdog on CPU2: Marking clocksource 'tsc' as unstable because the skew is too large:
[ 280.135000] clocksource: 'hpet' wd_now: a19f4644 wd_last: a0b4da18 mask: ffffffff
[ 280.135005] clocksource: 'tsc' cs_now: 1133a962472 cs_last: 112d67312e4 mask: ffffffffffffffff
[ 280.135011] sched_clock: Marking unstable (279910536262, 224436834)<-(280245936976, -110963880)
[ 280.135028] tsc: Marking TSC unstable due to clocksource watchdog
[ 283.495299] clocksource: Switched to clocksource hpet
[ 283.495410] amdgpu: [powerplay] SMC address must be 4 byte aligned.
[ 283.495417] amdgpu: [powerplay] Failed to initialize Graphics Level!
[ 283.495423] amdgpu: [powerplay] Failed to initialize SMC table!
[ 283.635476] amdgpu: [powerplay] Failed to enable VR hot GPIO interrupt!
[ 284.195615] amdgpu: [powerplay] Failed to enable ULV!
[ 284.335700] amdgpu: [powerplay] Attempt to enable Master Deep Sleep switch failed!
[ 284.335719] amdgpu: [powerplay] Failed to enable deep sleep master switch!

- I spot a "Something is broken", so something seems fishy.
- Note: I do not have time right now to process the log since I have to go to work.
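
For a first pass over a captured log like the one above, a small script along these lines might help pull out the suspicious powerplay messages. This is only a sketch: the function name find_powerplay_failures, the keyword list FAILURE_HINTS and the shortened SAMPLE text are assumptions made for illustration and are not part of the original report or of any kernel tooling.

```python
import re

# Matches a kernel log line such as "[ 279.348966] amdgpu: [powerplay] ...".
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s*(.*)$")

# Hypothetical keyword list, chosen by eye from the messages in the log.
FAILURE_HINTS = ("fail", "broken", "can't", "must be")


def find_powerplay_failures(log_text):
    """Return (timestamp, message) pairs for powerplay lines that look like errors."""
    hits = []
    for raw in log_text.splitlines():
        match = LINE_RE.match(raw.strip())
        if not match:
            continue  # not a timestamped kernel log line
        timestamp = float(match.group(1))
        message = match.group(2)
        lowered = message.lower()
        if "[powerplay]" in lowered and any(hint in lowered for hint in FAILURE_HINTS):
            hits.append((timestamp, message))
    return hits


# Shortened excerpt of the log, used only to demonstrate the filter.
SAMPLE = """\
[ 279.305728] amdgpu: [powerplay] amdgpu: powerplay sw initialized
[ 279.348966] amdgpu: [powerplay] [AVFS] Something is broken. See log!
[ 279.854919] amdgpu: [powerplay] DPM is already running
[ 283.495410] amdgpu: [powerplay] SMC address must be 4 byte aligned.
[ 283.495417] amdgpu: [powerplay] Failed to initialize Graphics Level!
"""

if __name__ == "__main__":
    for timestamp, message in find_powerplay_failures(SAMPLE):
        print(f"{timestamp:.6f}  {message}")
```

Note that a message split across two log lines (such as the "failed to send message 254" entry, where the "[powerplay]" tag sits on the preceding line) would be missed by this simple per-line filter.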
--
You are receiving this mail because:
You are watching the assignee of the bug.
_______________________________________________
dri-devel mailing list
dri-devel@xxxxxxxxxxxxxxxxxxxxx
https://lists.freedesktop.org/mailman/listinfo/dri-devel