Patch "drm/amdgpu: drop gfx_v11_0_cp_ecc_error_irq_funcs" has been added to the 6.1-stable tree

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



This is a note to let you know that I've just added the patch titled

    drm/amdgpu: drop gfx_v11_0_cp_ecc_error_irq_funcs

to the 6.1-stable tree which can be found at:
    http://www.kernel.org/git/?p=linux/kernel/git/stable/stable-queue.git;a=summary

The filename of the patch is:
     drm-amdgpu-drop-gfx_v11_0_cp_ecc_error_irq_funcs.patch
and it can be found in the queue-6.1 subdirectory.

If you, or anyone else, feels it should not be added to the stable tree,
please let <stable@xxxxxxxxxxxxxxx> know about it.



commit 59974466f06e09c1f1c9953dc3ddf6ed00773370
Author: Horatio Zhang <Hongkun.Zhang@xxxxxxx>
Date:   Thu May 4 01:46:12 2023 -0400

    drm/amdgpu: drop gfx_v11_0_cp_ecc_error_irq_funcs
    
    [ Upstream commit 720b47229a5b24061d1c2e29ddb6043a59178d79 ]
    
    The gfx.cp_ecc_error_irq is retired in gfx11. In gfx_v11_0_hw_fini still
    use amdgpu_irq_put to disable this interrupt, which caused the call trace
    in this function.
    
    [  102.873958] Call Trace:
    [  102.873959]  <TASK>
    [  102.873961]  gfx_v11_0_hw_fini+0x23/0x1e0 [amdgpu]
    [  102.874019]  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
    [  102.874072]  amdgpu_device_ip_suspend_phase2+0x240/0x460 [amdgpu]
    [  102.874122]  amdgpu_device_ip_suspend+0x3d/0x80 [amdgpu]
    [  102.874172]  amdgpu_device_pre_asic_reset+0xd9/0x490 [amdgpu]
    [  102.874223]  amdgpu_device_gpu_recover.cold+0x548/0xce6 [amdgpu]
    [  102.874321]  amdgpu_debugfs_reset_work+0x4c/0x70 [amdgpu]
    [  102.874375]  process_one_work+0x21f/0x3f0
    [  102.874377]  worker_thread+0x200/0x3e0
    [  102.874378]  ? process_one_work+0x3f0/0x3f0
    [  102.874379]  kthread+0xfd/0x130
    [  102.874380]  ? kthread_complete_and_exit+0x20/0x20
    [  102.874381]  ret_from_fork+0x22/0x30
    
    v2:
    - Handle umc and gfx ras cases in separated patch
    - Retired the gfx_v11_0_cp_ecc_error_irq_funcs in gfx11
    
    v3:
    - Improve the subject and code comments
    - Add judgment on gfx11 in the function of amdgpu_gfx_ras_late_init
    
    v4:
    - Drop the define of CP_ME1_PIPE_INST_ADDR_INTERVAL and
    SET_ECC_ME_PIPE_STATE which using in gfx_v11_0_set_cp_ecc_error_state
    - Check cp_ecc_error_irq.funcs rather than ip version for a more
    sustainable life
    
    v5:
    - Simplify judgment conditions
    
    Signed-off-by: Horatio Zhang <Hongkun.Zhang@xxxxxxx>
    Reviewed-by: Hawking Zhang <Hawking.Zhang@xxxxxxx>
    Acked-by: Christian König <christian.koenig@xxxxxxx>
    Reviewed-by: Guchun Chen <guchun.chen@xxxxxxx>
    Reviewed-by: Feifei Xu <Feifei.Xu@xxxxxxx>
    Signed-off-by: Alex Deucher <alexander.deucher@xxxxxxx>
    Cc: stable@xxxxxxxxxxxxxxx
    Signed-off-by: Sasha Levin <sashal@xxxxxxxxxx>

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c
index 3380daf42da8a..b803e785d3aff 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c
@@ -683,9 +683,11 @@ int amdgpu_gfx_ras_late_init(struct amdgpu_device *adev, struct ras_common_if *r
 		if (r)
 			return r;
 
-		r = amdgpu_irq_get(adev, &adev->gfx.cp_ecc_error_irq, 0);
-		if (r)
-			goto late_fini;
+		if (adev->gfx.cp_ecc_error_irq.funcs) {
+			r = amdgpu_irq_get(adev, &adev->gfx.cp_ecc_error_irq, 0);
+			if (r)
+				goto late_fini;
+		}
 	} else {
 		amdgpu_ras_feature_enable_on_boot(adev, ras_block, 0);
 	}



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux