Dear Cal, Am 12.04.22 um 00:31 schrieb Cal Peake:
I wanted to put a capper on this just in case anyone was interested, or in case any future people find this thread, because I did find a resolution.
Yes, that is very much appreciated.
Turns out the way to stop the system from crashing was to disable Global C-State Control in the BIOS. Christian, Alex, you guys seem to have been on the right track in that it was something power related. I haven't yet been able to figure out what Global C-State Control exactly does. My best guess as to what was happening: once the GPU power management code was loaded and the GPU dropped into a very low power state, the CPU saw this and decided to match it, lowering its own power state to such a point that it hard resets the system. (Just a wild theory anyway :-) If anyone knows what this feature really does, or has any better theories as to why it doesn't play nice with AMD GPUs, please do share!
It might be related to bug report *Random Soft Lockup on new Ryzen build* [1], and the referenced issues there. It’d be great if you could post here or there a summary with your system details (especially the system and GPU firmware versions).
Me is still a little upset, how AMD has until now not been able to properly analyze and fix this (with the ODMs).
Kind regards, Paul [1]: https://bugzilla.kernel.org/show_bug.cgi?id=196683