Re: [PATCH 2/2] drm/amdgpu/mes: Add cleaner shader fence address handling in MES for GFX12

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, Feb 10, 2025 at 8:48 PM Liu, Shaoyun <Shaoyun.Liu@xxxxxxx> wrote:
>
> [AMD Official Use Only - AMD Internal Distribution Only]
>
> Why only free the mem when  unified mes is enabled , I remembered that set_hw_resource_1 will also be called in none-unified mes mode .

mes_v12_0_set_hw_resources_1() is currently only called when unified
mes mode is enabled so it makes sense to align the free side as well.

>
> Another thing is eventually we probably should allocate total 129 pages the same as GFX11  (128  for hw_resource_1 and  one  extra page for MES internal usage) .  But seems SRIOV still not use that 128 pages, It can  be  updated later when  I update the API for gfx12 to enable that .
>

yeah, we can add that later if we need it.  The current mes 12 packet
doesn't seem to support that feature yet.

Alex

> Regards
> Shaoyun.liu
>
> -----Original Message-----
> From: Deucher, Alexander <Alexander.Deucher@xxxxxxx>
> Sent: Monday, February 10, 2025 4:19 PM
> To: amd-gfx@xxxxxxxxxxxxxxxxxxxxx
> Cc: Deucher, Alexander <Alexander.Deucher@xxxxxxx>; Koenig, Christian <Christian.Koenig@xxxxxxx>; SHANMUGAM, SRINIVASAN <SRINIVASAN.SHANMUGAM@xxxxxxx>; Liu, Shaoyun <Shaoyun.Liu@xxxxxxx>
> Subject: [PATCH 2/2] drm/amdgpu/mes: Add cleaner shader fence address handling in MES for GFX12
>
> This commit introduces enhancements to the handling of the cleaner shader fence in the AMDGPU MES driver:
>
> - The MES (Microcode Execution Scheduler) now sends a PM4 packet to the
>   KIQ (Kernel Interface Queue) to request the cleaner shader, ensuring
>   that requests are handled in a controlled manner and avoiding the
>   race conditions.
> - The CP (Compute Processor) firmware has been updated to use a private
>   bus for accessing specific registers, avoiding unnecessary operations
>   that could lead to issues in VF (Virtual Function) mode.
> - The cleaner shader fence memory address is now set correctly in the
>   `mes_set_hw_res_pkt` structure, allowing for proper synchronization of
>   the cleaner shader execution.
>
> Cc: Christian König <christian.koenig@xxxxxxx>
> Cc: Srinivasan Shanmugam <srinivasan.shanmugam@xxxxxxx>
> Suggested-by: Shaoyun Liu <shaoyun.liu@xxxxxxx>
>
> Signed-off-by: Alex Deucher <alexander.deucher@xxxxxxx>
> ---
>  drivers/gpu/drm/amd/amdgpu/mes_v12_0.c | 22 ++++++++++++++++++++++
>  1 file changed, 22 insertions(+)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c b/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c
> index dcbc31279f29b..8dbab3834d82d 100644
> --- a/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c
> +++ b/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c
> @@ -678,6 +678,9 @@ static int mes_v12_0_misc_op(struct amdgpu_mes *mes,
>
>  static int mes_v12_0_set_hw_resources_1(struct amdgpu_mes *mes, int pipe)  {
> +       unsigned int alloc_size = AMDGPU_GPU_PAGE_SIZE;
> +       int ret = 0;
> +       struct amdgpu_device *adev = mes->adev;
>         union MESAPI_SET_HW_RESOURCES_1 mes_set_hw_res_1_pkt;
>
>         memset(&mes_set_hw_res_1_pkt, 0, sizeof(mes_set_hw_res_1_pkt)); @@ -687,6 +690,19 @@ static int mes_v12_0_set_hw_resources_1(struct amdgpu_mes *mes, int pipe)
>         mes_set_hw_res_1_pkt.header.dwsize = API_FRAME_SIZE_IN_DWORDS;
>         mes_set_hw_res_1_pkt.mes_kiq_unmap_timeout = 0xa;
>
> +       ret = amdgpu_bo_create_kernel(adev, alloc_size, PAGE_SIZE,
> +                               AMDGPU_GEM_DOMAIN_VRAM,
> +                               &mes->resource_1,
> +                               &mes->resource_1_gpu_addr,
> +                               &mes->resource_1_addr);
> +       if (ret) {
> +               dev_err(adev->dev, "(%d) failed to create mes resource_1 bo\n", ret);
> +               return ret;
> +       }
> +
> +       mes_set_hw_res_1_pkt.cleaner_shader_fence_mc_addr =
> +               mes->resource_1_gpu_addr;
> +
>         return mes_v12_0_submit_pkt_and_poll_completion(mes, pipe,
>                         &mes_set_hw_res_1_pkt, sizeof(mes_set_hw_res_1_pkt),
>                         offsetof(union MESAPI_SET_HW_RESOURCES_1, api_status)); @@ -1770,6 +1786,12 @@ static int mes_v12_0_hw_init(struct amdgpu_ip_block *ip_block)
>
>  static int mes_v12_0_hw_fini(struct amdgpu_ip_block *ip_block)  {
> +       struct amdgpu_device *adev = ip_block->adev;
> +
> +       if (adev->enable_uni_mes)
> +               amdgpu_bo_free_kernel(&adev->mes.resource_1,
> +                                     &adev->mes.resource_1_gpu_addr,
> +                                     &adev->mes.resource_1_addr);
>         return 0;
>  }
>
> --
> 2.48.1
>




[Index of Archives]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux