[Bug 201957] amdgpu: ring gfx timeout

https://bugzilla.kernel.org/show_bug.cgi?id=201957

--- Comment #71 from Panagiotis Polychronis (panospolychronis@xxxxxxxxx) ---
(In reply to Martin von Wittich from comment #70)
> My Ubuntu 20.04 desktop is crashing several times per day due to this bug
> since I've upgraded my computer from an old Intel Xeon to an AMD Ryzen 9
> 5900X on a B550 mainboard. I've had the same AMD RX Vega 56 graphics card in
> both computers, so I assume this is probably more related to the
> mainboard/CPU than to the graphics card.
> 
> The crashes from today:
> 
> ```
> martin@martin ~ % grep amdgpu /var/log/syslog | grep ERROR | grep -v 'Failed to initialize parser'
> Jun 11 03:15:33 martin kernel: [21494.642889] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=1750601, emitted seq=1750603
> Jun 11 03:15:33 martin kernel: [21494.643055] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 5037 thread
> firefox:cs0 pid 5123
> Jun 11 03:15:50 martin kernel: [21511.795007] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=1750605, emitted seq=1750608
> Jun 11 03:15:50 martin kernel: [21511.795174] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 5037 thread
> firefox:cs0 pid 5123
> Jun 11 15:56:07 martin kernel: [ 1477.069969] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=216293, emitted seq=216295
> Jun 11 15:56:07 martin kernel: [ 1477.070140] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 5237 thread
> firefox:cs0 pid 5302
> Jun 11 15:56:22 martin kernel: [ 1492.174077] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=216297, emitted seq=216300
> Jun 11 15:56:22 martin kernel: [ 1492.174248] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
> Jun 11 16:03:28 martin kernel: [ 1918.161101] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=264406, emitted seq=264408
> Jun 11 16:03:28 martin kernel: [ 1918.161271] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 10569 thread
> firefox:cs0 pid 10633
> Jun 11 16:03:49 martin kernel: [ 1938.385307] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=264410, emitted seq=264413
> Jun 11 16:03:49 martin kernel: [ 1938.385479] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 10569 thread
> firefox:cs0 pid 10633
> Jun 11 23:28:12 martin kernel: [25491.854294] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=2390985, emitted seq=2390987
> Jun 11 23:28:12 martin kernel: [25491.854460] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 4922 thread
> firefox:cs0 pid 4989
> Jun 11 23:28:28 martin kernel: [25507.982446] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=2390989, emitted seq=2390992
> Jun 11 23:28:28 martin kernel: [25507.982613] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
> Jun 11 23:29:51 martin kernel: [25591.333483] amdgpu 0000:2d:00.0: amdgpu:  
> WALKER_ERROR: 0x0
> Jun 11 23:29:51 martin kernel: [25591.333485] amdgpu 0000:2d:00.0: amdgpu:  
> MAPPING_ERROR: 0x0
> Jun 11 23:30:01 martin kernel: [25601.412838] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring uvd_0 timeout, signaled seq=308, emitted seq=310
> Jun 11 23:30:01 martin kernel: [25601.413009] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process mpv pid 44110 thread mpv:cs0
> pid 44122
> Jun 11 23:30:16 martin kernel: [25616.014983] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* ring gfx timeout, signaled seq=2409182, emitted seq=2409185
> Jun 11 23:30:16 martin kernel: [25616.015151] [drm:amdgpu_job_timedout
> [amdgpu]] *ERROR* Process information: process firefox pid 42941 thread
> firefox:cs0 pid 43005
> ```
> 
> When I upgraded my computer at the end of 2021, I had to switch from the
> default Ubuntu 20.04 kernel `linux-image-generic` (5.4.0) to
> `linux-image-generic-hwe-20.04` (5.11.0) because of some hardware issues
> with the new computer (I don't remember what exactly didn't work, IIRC the
> network).
> 
> I'm not exactly sure when the crashes started, but I changed from
> `linux-image-generic-hwe-20.04` (5.14) to `linux-image-oem-20.04d` (5.14) on
> 2022-04-30 in the hopes that that might resolve the issue, but unfortunately
> it didn't help.
> 
> I tried the `amdgpu.runpm=0` workaround today which also didn't help.
> 
> I can also confirm that the attached video "5 second video clip that
> triggers a crash" successfully triggers the crash on my system.
> 
> The main other thing that seems to trigger the crash is to open new tabs in
> Firefox (in that not every new tab I open causes the crash, but when it
> crashes, it's usually when I was trying to open a new tab).

Have you tried the latest Linux kernel? I had a lot of GPU lockups like this.
Also try these kernel parameters: "amdgpu.ppfeaturemask=0xffffbffb
amdgpu.noretry=0 amdgpu.lockup_timeout=0 amdgpu.gpu_recovery=1 amdgpu.audio=0
amdgpu.deep_color=1 amd_iommu=on iommu=pt" (you might also try
amdgpu.ppfeaturemask=0xfffd7fff or amdgpu.ppfeaturemask=0xffffffff).
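
On Ubuntu, kernel parameters are normally added to GRUB_CMDLINE_LINUX_DEFAULT
in /etc/default/grub, followed by `sudo update-grub` and a reboot. After
rebooting it is worth confirming that the parameters actually reached the
kernel. The following is only a minimal sketch of such a check: the function
name `missing_params` and the variable `WANTED` are made up for illustration,
and the parameter list is simply the one suggested in this comment.

```shell
#!/bin/sh
# Parameters suggested in this comment; edit the list to match what you test.
WANTED="amdgpu.ppfeaturemask=0xffffbffb amdgpu.noretry=0 amdgpu.lockup_timeout=0 amdgpu.gpu_recovery=1 amdgpu.audio=0 amdgpu.deep_color=1 amd_iommu=on iommu=pt"

# missing_params CMDLINE
# Print, one per line, every parameter from WANTED that is not present
# as a whole word in CMDLINE. (Hypothetical helper, not part of any tool.)
missing_params() {
    for p in $WANTED; do
        case " $1 " in
            *" $p "*) ;;                  # found as a whole word, nothing to report
            *) printf '%s\n' "$p" ;;      # not on the command line
        esac
    done
}

# On a running system, compare against the command line the kernel booted with:
#   missing_params "$(cat /proc/cmdline)"
```

If the function prints nothing, every listed parameter is on the running
kernel's command line. Anything it prints was most likely lost to a typo in
/etc/default/grub or a missing `update-grub` run.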
