On Wed, 15 May 2024 at 16:51, Dave Airlie <airlied@xxxxxxxxx> wrote:
>
> > Let's see if the machine ends up being stable now. It took several
> > hours for the "scary messages" state to turn into the "hung machine"
> > state, so they *could* have been independent issues, but it seems a
> > bit unlikely.
>
> This worries me actually, it's possible this warn could cause a
> problem, but I'm not convinced it should have machine ending
> properties without some sort of different error at the end, so I'd
> keep an eye open here.

Well, since I'm a big believer in dogfooding, I always run my own
kernel even during the merge window. I don't reboot between each pull,
but I try to basically reboot daily.

And it's entirely possible that the eventual "bad page flags" error -
which is what I think triggered the eventual hang - is something else
that came in during this merge window.

I haven't actually gotten the -mm changes from Andrew yet, but it did
happen in the btrfs kworker, and I have merged the btrfs changes for
6.10. So maybe they are the cause.

I was blaming the DRM case mainly because it clearly *was* about some
kind of allocation management, and I got a *lot* of those warnings:

    $ journalctl -b -1 | grep 'WARNING: CPU' | wc -l
    16015

but let's see if it happens with my amdgpu reverts in place, and no
drm warnings.

It most definitely wouldn't be the first time we had multiple
independent bugs during the merge window ;/

              Linus