RE: [PATCH 17/29] drm/amdkfd: Add debug trap enabled flag to TMA

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



[AMD Official Use Only - General]

> -----Original Message-----
> From: Kuehling, Felix <Felix.Kuehling@xxxxxxx>
> Sent: November 24, 2022 11:24 AM
> To: Kim, Jonathan <Jonathan.Kim@xxxxxxx>; amd-
> gfx@xxxxxxxxxxxxxxxxxxxxx
> Subject: Re: [PATCH 17/29] drm/amdkfd: Add debug trap enabled flag to
> TMA
>
>
> Am 2022-11-24 um 09:51 schrieb Kim, Jonathan:
> > [Public]
> >
> >> -----Original Message-----
> >> From: Kuehling, Felix <Felix.Kuehling@xxxxxxx>
> >> Sent: November 22, 2022 7:45 PM
> >> To: Kim, Jonathan <Jonathan.Kim@xxxxxxx>; amd-
> >> gfx@xxxxxxxxxxxxxxxxxxxxx
> >> Subject: Re: [PATCH 17/29] drm/amdkfd: Add debug trap enabled flag to
> >> TMA
> >>
> >>
> >> On 2022-10-31 12:23, Jonathan Kim wrote:
> >>> From: Jay Cornwall <jay.cornwall@xxxxxxx>
> >>>
> >>> Trap handler behavior will differ when a debugger is attached.
> >>>
> >>> Make the debug trap flag available in the trap handler TMA.
> >>> Update it when the debug trap ioctl is invoked.
> >>>
> >>> v3: Rebase for upstream
> >>>
> >>> v2:
> >>> Add missing debug flag setup on APUs
> >>>
> >>> Signed-off-by: Jay Cornwall <jay.cornwall@xxxxxxx>
> >>> Reviewed-by: Felix Kuehling <Felix.Kuehling@xxxxxxx>
> >>> Signed-off-by: Jonathan Kim <jonathan.kim@xxxxxxx>
> >>> ---
> >>>    drivers/gpu/drm/amd/amdkfd/kfd_debug.c   |  4 ++++
> >>>    drivers/gpu/drm/amd/amdkfd/kfd_priv.h    |  2 ++
> >>>    drivers/gpu/drm/amd/amdkfd/kfd_process.c | 16
> ++++++++++++++++
> >>>    3 files changed, 22 insertions(+)
> >>>
> >>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >> b/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >>> index ae6e701a2656..d4f87f2adada 100644
> >>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >>> @@ -193,6 +193,8 @@ void kfd_dbg_trap_deactivate(struct
> kfd_process
> >> *target, bool unwind, int unwind
> >>>              if (unwind && count == unwind_count)
> >>>                      break;
> >>>
> >>> +           kfd_process_set_trap_debug_flag(&pdd->qpd, false);
> >>> +
> >>>              /* GFX off is already disabled by debug activate if not RLC
> >> restore supported. */
> >>>              if (kfd_dbg_is_rlc_restore_supported(pdd->dev))
> >>>                      amdgpu_gfx_off_ctrl(pdd->dev->adev, false);
> >>> @@ -278,6 +280,8 @@ int kfd_dbg_trap_activate(struct kfd_process
> >> *target)
> >>>              if (kfd_dbg_is_rlc_restore_supported(pdd->dev))
> >>>                      amdgpu_gfx_off_ctrl(pdd->dev->adev, true);
> >>>
> >>> +           kfd_process_set_trap_debug_flag(&pdd->qpd, true);
> >>> +
> >>>              r = debug_refresh_runlist(pdd->dev->dqm);
> >>>              if (r) {
> >>>                      target->runtime_info.runtime_state =
> >>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >> b/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >>> index 9690a2adb9ed..82b28588ab72 100644
> >>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >>> @@ -1101,6 +1101,8 @@ int kfd_init_apertures(struct kfd_process
> >> *process);
> >>>    void kfd_process_set_trap_handler(struct qcm_process_device *qpd,
> >>>                                uint64_t tba_addr,
> >>>                                uint64_t tma_addr);
> >>> +void kfd_process_set_trap_debug_flag(struct qcm_process_device
> >> *qpd,
> >>> +                                bool enabled);
> >>>
> >>>    /* CWSR initialization */
> >>>    int kfd_process_init_cwsr_apu(struct kfd_process *process, struct file
> >> *filep);
> >>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >> b/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >>> index 59c4c38833b6..d62e0c62df76 100644
> >>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >>> @@ -1252,6 +1252,8 @@ int kfd_process_init_cwsr_apu(struct
> >> kfd_process *p, struct file *filep)
> >>>              memcpy(qpd->cwsr_kaddr, dev->cwsr_isa, dev-
> >>> cwsr_isa_size);
> >>>
> >>> +           kfd_process_set_trap_debug_flag(qpd, p-
> >>> debug_trap_enabled);
> >>> +
> >>>              qpd->tma_addr = qpd->tba_addr +
> >> KFD_CWSR_TMA_OFFSET;
> >>>              pr_debug("set tba :0x%llx, tma:0x%llx, cwsr_kaddr:%p for
> >> pqm.\n",
> >>>                      qpd->tba_addr, qpd->tma_addr, qpd->cwsr_kaddr);
> >>> @@ -1288,6 +1290,9 @@ static int
> >> kfd_process_device_init_cwsr_dgpu(struct kfd_process_device *pdd)
> >>>      memcpy(qpd->cwsr_kaddr, dev->cwsr_isa, dev->cwsr_isa_size);
> >>>
> >>> +   kfd_process_set_trap_debug_flag(&pdd->qpd,
> >>> +                                   pdd->process-
> >>> debug_trap_enabled);
> >>> +
> >>>      qpd->tma_addr = qpd->tba_addr + KFD_CWSR_TMA_OFFSET;
> >>>      pr_debug("set tba :0x%llx, tma:0x%llx, cwsr_kaddr:%p for pqm.\n",
> >>>               qpd->tba_addr, qpd->tma_addr, qpd->cwsr_kaddr);
> >>> @@ -1374,6 +1379,17 @@ bool kfd_process_xnack_mode(struct
> >> kfd_process *p, bool supported)
> >>>      return true;
> >>>    }
> >>>
> >>> +void kfd_process_set_trap_debug_flag(struct qcm_process_device
> >> *qpd,
> >>> +                                bool enabled)
> >>> +{
> >>> +   /* If TMA doesn't exist then flag will be set during allocation. */
> >> I would expect a change to the TMA allocation function, but that isn't
> >> in this patch?
> > The TMA is allocated under kfd_process_init_cwsr_* and CWSR enabled is
> a pre-condition for the 1st level trap handler loading.
> > The lack of context in the patch for those functions may be hiding that fact.
> > Is the placement of this comment misleading?  Maybe it should go in
> kfd_dbg_trap_activate when kfd_process_set_trap_debug_flag is called?
> > Or should it just be removed since the combined calls within initialization of
> CWSR + debug enable seem complete for enablement?
>
> I think the comment is fine. I was sort of expecting to see the
> corresponding change in the TMA allocation in the same patch. So my
> question is just lack of context. If that change in the TMA allocation
> got squashed into another patch in the series, maybe it would make sense
> to move it into this patch instead.

The change to set flag on TMA allocation is done in this patch as kfd_process_set_trap_debug_flag is now called in kfd_process_init_cwsr_*.
To my knowledge, CWSR init and trap handler memory allocation should be atomic and that has been upstreamed for a while.

Did you mean the user trap handler assignment?  That should be independent from flagging.

Thanks,

Jon


>
> Regards,
>    Felix
>
>
> >
> > Thanks,
> >
> > Jon
> >
> >> Regards,
> >>     Felix
> >>
> >>> +   if (qpd->cwsr_kaddr) {
> >>> +           uint64_t *tma =
> >>> +                   (uint64_t *)(qpd->cwsr_kaddr +
> >> KFD_CWSR_TMA_OFFSET);
> >>> +           tma[2] = enabled;
> >>> +   }
> >>> +}
> >>> +
> >>>    /*
> >>>     * On return the kfd_process is fully operational and will be freed when
> >> the
> >>>     * mm is released

<<attachment: winmail.dat>>


[Index of Archives]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux