On Thu, Oct 6, 2022 at 3:48 PM Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx> wrote: > > On Thu, Oct 6, 2022 at 12:28 PM Alex Deucher <alexdeucher@xxxxxxxxx> wrote: > > > > Maybe you are seeing this which is an issue with GPU TLB flushes which > > is kind of sporadic: > > https://gitlab.freedesktop.org/drm/amd/-/issues/2113 > > Well, that seems to be 5.19, and while timing changes (or whatever > other software updates) could have made it start trigger, this machine > has been pretty solid otgerwise. > > > Are you seeing any GPU page faults in your kernel log? > > Nothing even remotely like that "no-retry page fault" in that issue > report. Of course, if it happens just before the whole thing locks > up... Your chip is too old to support retry faults so it's likely you could be just seeing a GPU page fault followed by a hang. Your chip also lacks a paging queue, so you would be affected by the TLB issue. Alex