On Thu, 16 May 2024 at 06:29, Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx> wrote:
>
> On Wed, 15 May 2024 at 13:24, Linus Torvalds
> <torvalds@xxxxxxxxxxxxxxxxxxxx> wrote:
> >
> > I have to revert both
> >
> > a68c7eaa7a8f ("drm/amdgpu: Enable clear page functionality")
> > e362b7c8f8c7 ("drm/amdgpu: Modify the contiguous flags behaviour")
> >
> > to make things build cleanly. Next step: see if it boots and fixes the
> > problem for me.
>
> Well, perhaps not surprisingly, the WARN_ON() no longer triggers with
> this, and everything looks fine.
>
> Let's see if the machine ends up being stable now. It took several
> hours for the "scary messages" state to turn into the "hung machine"
> state, so they *could* have been independent issues, but it seems a
> bit unlikely.

This worries me actually, it's possible this warn could cause a
problem, but I'm not convinced it should have machine ending
properties without some sort of different error at the end, so I'd
keep an eye open here.

Dave.