[Bug 201957] amdgpu: ring gfx timeout

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



https://bugzilla.kernel.org/show_bug.cgi?id=201957

Martin von Wittich (martin.von.wittich@xxxxxxxx) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |martin.von.wittich@xxxxxxxx

--- Comment #70 from Martin von Wittich (martin.von.wittich@xxxxxxxx) ---
My Ubuntu 20.04 desktop is crashing several times per day due to this bug since
I've upgraded my computer from an old Intel Xeon to an AMD Ryzen 9 5900X on a
B550 mainboard. I've had the same AMD RX Vega 56 graphics card in both
computers, so I assume this is probably more related to the mainboard/CPU than
to the graphics card.

The crashes from today:

```
martin@martin ~ % grep amdgpu /var/log/syslog | grep ERROR | grep -v 'Failed to
initialize parser'
Jun 11 03:15:33 martin kernel: [21494.642889] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=1750601, emitted seq=1750603
Jun 11 03:15:33 martin kernel: [21494.643055] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 5037 thread
firefox:cs0 pid 5123
Jun 11 03:15:50 martin kernel: [21511.795007] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=1750605, emitted seq=1750608
Jun 11 03:15:50 martin kernel: [21511.795174] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 5037 thread
firefox:cs0 pid 5123
Jun 11 15:56:07 martin kernel: [ 1477.069969] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=216293, emitted seq=216295
Jun 11 15:56:07 martin kernel: [ 1477.070140] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 5237 thread
firefox:cs0 pid 5302
Jun 11 15:56:22 martin kernel: [ 1492.174077] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=216297, emitted seq=216300
Jun 11 15:56:22 martin kernel: [ 1492.174248] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jun 11 16:03:28 martin kernel: [ 1918.161101] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=264406, emitted seq=264408
Jun 11 16:03:28 martin kernel: [ 1918.161271] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 10569 thread
firefox:cs0 pid 10633
Jun 11 16:03:49 martin kernel: [ 1938.385307] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=264410, emitted seq=264413
Jun 11 16:03:49 martin kernel: [ 1938.385479] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 10569 thread
firefox:cs0 pid 10633
Jun 11 23:28:12 martin kernel: [25491.854294] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=2390985, emitted seq=2390987
Jun 11 23:28:12 martin kernel: [25491.854460] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 4922 thread
firefox:cs0 pid 4989
Jun 11 23:28:28 martin kernel: [25507.982446] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=2390989, emitted seq=2390992
Jun 11 23:28:28 martin kernel: [25507.982613] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jun 11 23:29:51 martin kernel: [25591.333483] amdgpu 0000:2d:00.0: amdgpu:     
 WALKER_ERROR: 0x0
Jun 11 23:29:51 martin kernel: [25591.333485] amdgpu 0000:2d:00.0: amdgpu:     
 MAPPING_ERROR: 0x0
Jun 11 23:30:01 martin kernel: [25601.412838] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring uvd_0 timeout, signaled seq=308, emitted seq=310
Jun 11 23:30:01 martin kernel: [25601.413009] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process mpv pid 44110 thread mpv:cs0 pid
44122
Jun 11 23:30:16 martin kernel: [25616.014983] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* ring gfx timeout, signaled seq=2409182, emitted seq=2409185
Jun 11 23:30:16 martin kernel: [25616.015151] [drm:amdgpu_job_timedout
[amdgpu]] *ERROR* Process information: process firefox pid 42941 thread
firefox:cs0 pid 43005
```

When I upgraded my computer at the end of 2021, I had to switch from the
default Ubuntu 20.04 kernel `linux-image-generic` (5.4.0) to
`linux-image-generic-hwe-20.04` (5.11.0) because of some hardware issues with
the new computer (I don't remember what exactly didn't work, IIRC the network).

I'm not exactly sure when the crashes started, but I changed from
`linux-image-generic-hwe-20.04` (5.14) to `linux-image-oem-20.04d` (5.14) on
2022-04-30 in the hopes that that might resolve the issue, but unfortunately it
didn't help.

I tried the `amdgpu.runpm=0` workaround today which also didn't help.

I can also confirm that the attached video "5 second video clip that triggers a
crash" successfully triggers the crash on my system.

The main other thing that seems to trigger the crash is to open new tabs in
Firefox (in that not every new tab I open causes the crash, but when it
crashes, it's usually when I was trying to open a new tab).

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.



[Index of Archives]     [Linux DRI Users]     [Linux Intel Graphics]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [XFree86]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux