Besides UMC, other blocks' UE interrupt callbacks can also enter
amdgpu_ras_reset_gpu, so the first patch moves the saving of bad pages
into the UMC error interrupt callback.

When a bad page error occurs, the bad page is usually still held by
some process, so the driver fails to reserve it. The third patch
reserves the bad page when it is freed, so the system has no chance to
allocate it to another process.

Dennis Li (3):
  drm/amdgpu: change to save bad pages in UMC error interrupt callback
  drm/amdgpu: remove redundant GPU reset
  drm/amdgpu: fix the issue of reserving bad pages failed

 drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c      | 155 +++++-------------
 drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h      |  17 +-
 .../gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c   |  16 --
 drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.h      |   4 +
 drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c      |   7 +-
 drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 115 +++++++++++++
 6 files changed, 164 insertions(+), 150 deletions(-)

--
2.17.1