Am 24.04.24 um 15:13 schrieb Li, Yunxiang (Teddy):
[Public]
We have the KFD, FLR, the per engine one in the scheduler and IIRC one more for the CP (illegal operation and register write).
I'm not sure about the CP one, but all others should be handled correctly with the V2 patch as far as I can see.
Where can I find the CP one? Nothing came up when I search for amdgpu_device_gpu_recover
I had to dig that up as well in the code since I haven't looked into it
in years.
Looks like that is handled by the scheduler work item now as well. See
function gfx_v9_0_fault() for an example.
Regards,
Christian.