Re: [RFC][PATCH] drm/amdgpu/powerplay/smu10: Add custom profile

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, Sep 8, 2021 at 3:23 AM Daniel Gomez <daniel@xxxxxxxx> wrote:
>
> On Tue, 7 Sept 2021 at 19:23, Alex Deucher <alexdeucher@xxxxxxxxx> wrote:
> >
> > On Tue, Sep 7, 2021 at 4:53 AM Daniel Gomez <daniel@xxxxxxxx> wrote:
> > >
> > > Add custom power profile mode support on smu10.
> > > Update workload bit list.
> > > ---
> > >
> > > Hi,
> > >
> > > I'm trying to add custom profile for the Raven Ridge but not sure if
> > > I'd need a different parameter than PPSMC_MSG_SetCustomPolicy to
> > > configure the custom values. The code seemed to support CUSTOM for
> > > workload types but it didn't show up in the menu or accept any user
> > > input parameter. So far, I've added that part but a bit confusing to
> > > me what is the policy I need for setting these parameters or if it's
> > > maybe not possible at all.
> > >
> > > After applying the changes I'd configure the CUSTOM mode as follows:
> > >
> > > echo manual > /sys/class/drm/card0/device/hwmon/hwmon1/device/power_dpm_force_performance_level
> > > echo "6 70 90 0 0" > /sys/class/drm/card0/device/hwmon/hwmon1/device/pp_power_profile_mode
> > >
> > > Then, using Darren Powell script for testing modes I get the following
> > > output:
> > >
> > > 05:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Raven Ridge [Radeon Vega Series / Radeon Vega Mobile Series] [1002:15dd] (rev 83)
> > > === pp_dpm_sclk ===
> > > 0: 200Mhz
> > > 1: 400Mhz *
> > > 2: 1100Mhz
> > > === pp_dpm_mclk ===
> > > 0: 400Mhz
> > > 1: 933Mhz *
> > > 2: 1067Mhz
> > > 3: 1200Mhz
> > > === pp_power_profile_mode ===
> > > NUM        MODE_NAME BUSY_SET_POINT FPS USE_RLC_BUSY MIN_ACTIVE_LEVEL
> > >   0 BOOTUP_DEFAULT :             70  60          0              0
> > >   1 3D_FULL_SCREEN :             70  60          1              3
> > >   2   POWER_SAVING :             90  60          0              0
> > >   3          VIDEO :             70  60          0              0
> > >   4             VR :             70  90          0              0
> > >   5        COMPUTE :             30  60          0              6
> > >   6         CUSTOM*:             70  90          0              0
> > >
> > > As you can also see in my changes, I've also updated the workload bit
> > > table but I'm not completely sure about that change. With the tests
> > > I've done, using bit 5 for the WORKLOAD_PPLIB_CUSTOM_BIT makes the
> > > gpu sclk locked around ~36%. So, maybe I'm missing a clock limit
> > > configuraton table somewhere. Would you give me some hints to
> > > proceed with this?
> >
> > I don't think APUs support customizing the workloads the same way
> > dGPUs do.  I think they just support predefined profiles.
> >
> > Alex
>
>
> Thanks Alex for the quick response. Would it make sense then to remove
> the custom workload code (PP_SMC_POWER_PROFILE_CUSTOM) from the smu10?
> That workload was added in this commit:
> f6f75ebdc06c04d3cfcd100f1b10256a9cdca407 [1] and not use at all in the
> code as it's limited to PP_SMC_POWER_PROFILE_COMPUTE index. The
> smu10.h also includes the custom workload bit definition and that was
> a bit confusing for me to understand if it was half-supported or not
> possible to use at all as I understood from your comment.
>
> Perhaps could also be mentioned (if that's kind of standard) in the
> documentation[2] so, the custom pp_power_profile_mode is only
> supported in dGPUs.
>
> I can send the patches if it makes sense.

I guess I was thinking of another asic.  @Huang Rui, @changzhu, @Quan,
Evan can any of you comment on what is required for custom profiles on
APUs?

Alex


>
> [1]: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c?id=f6f75ebdc06c04d3cfcd100f1b10256a9cdca407
> [2]: https://www.kernel.org/doc/html/latest/gpu/amdgpu.html#pp-power-profile-mode
>
> Daniel
>
> >
> >
> > >
> > > Thanks in advance,
> > > Daniel
> > >
> > >
> > >  drivers/gpu/drm/amd/pm/inc/smu10.h            | 14 +++--
> > >  .../drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c  | 57 +++++++++++++++++--
> > >  .../drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.h  |  1 +
> > >  3 files changed, 61 insertions(+), 11 deletions(-)
> > >
> > > diff --git a/drivers/gpu/drm/amd/pm/inc/smu10.h b/drivers/gpu/drm/amd/pm/inc/smu10.h
> > > index 9e837a5014c5..b96520528240 100644
> > > --- a/drivers/gpu/drm/amd/pm/inc/smu10.h
> > > +++ b/drivers/gpu/drm/amd/pm/inc/smu10.h
> > > @@ -136,12 +136,14 @@
> > >  #define FEATURE_CORE_CSTATES_MASK     (1 << FEATURE_CORE_CSTATES_BIT)
> > >
> > >  /* Workload bits */
> > > -#define WORKLOAD_PPLIB_FULL_SCREEN_3D_BIT 0
> > > -#define WORKLOAD_PPLIB_VIDEO_BIT          2
> > > -#define WORKLOAD_PPLIB_VR_BIT             3
> > > -#define WORKLOAD_PPLIB_COMPUTE_BIT        4
> > > -#define WORKLOAD_PPLIB_CUSTOM_BIT         5
> > > -#define WORKLOAD_PPLIB_COUNT              6
> > > +#define WORKLOAD_DEFAULT_BIT              0
> > > +#define WORKLOAD_PPLIB_FULL_SCREEN_3D_BIT 1
> > > +#define WORKLOAD_PPLIB_POWER_SAVING_BIT   2
> > > +#define WORKLOAD_PPLIB_VIDEO_BIT          3
> > > +#define WORKLOAD_PPLIB_VR_BIT             4
> > > +#define WORKLOAD_PPLIB_COMPUTE_BIT        5
> > > +#define WORKLOAD_PPLIB_CUSTOM_BIT         6
> > > +#define WORKLOAD_PPLIB_COUNT              7
> > >
> > >  typedef struct {
> > >         /* MP1_EXT_SCRATCH0 */
> > > diff --git a/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c b/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c
> > > index 1de3ae77e03e..fef9f9ac1c56 100644
> > > --- a/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c
> > > +++ b/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c
> > > @@ -110,6 +110,11 @@ static int smu10_initialize_dpm_defaults(struct pp_hwmgr *hwmgr)
> > >         smu10_data->num_active_display = 0;
> > >         smu10_data->deep_sleep_dcefclk = 0;
> > >
> > > +       smu10_data->custom_profile_mode[0] = 0;
> > > +       smu10_data->custom_profile_mode[1] = 0;
> > > +       smu10_data->custom_profile_mode[2] = 0;
> > > +       smu10_data->custom_profile_mode[3] = 0;
> > > +
> > >         phm_cap_unset(hwmgr->platform_descriptor.platformCaps,
> > >                                         PHM_PlatformCaps_SclkDeepSleep);
> > >
> > > @@ -544,6 +549,10 @@ static int smu10_hwmgr_backend_init(struct pp_hwmgr *hwmgr)
> > >
> > >         hwmgr->backend = data;
> > >
> > > +       hwmgr->workload_mask = 1 << hwmgr->workload_prority[PP_SMC_POWER_PROFILE_BOOTUP_DEFAULT];
> > > +       hwmgr->power_profile_mode = PP_SMC_POWER_PROFILE_BOOTUP_DEFAULT;
> > > +       hwmgr->default_power_profile_mode = PP_SMC_POWER_PROFILE_BOOTUP_DEFAULT;
> > > +
> > >         result = smu10_initialize_dpm_defaults(hwmgr);
> > >         if (result != 0) {
> > >                 pr_err("smu10_initialize_dpm_defaults failed\n");
> > > @@ -1408,9 +1417,15 @@ static int conv_power_profile_to_pplib_workload(int power_profile)
> > >         int pplib_workload = 0;
> > >
> > >         switch (power_profile) {
> > > +       case PP_SMC_POWER_PROFILE_BOOTUP_DEFAULT:
> > > +               pplib_workload = WORKLOAD_DEFAULT_BIT;
> > > +               break;
> > >         case PP_SMC_POWER_PROFILE_FULLSCREEN3D:
> > >                 pplib_workload = WORKLOAD_PPLIB_FULL_SCREEN_3D_BIT;
> > >                 break;
> > > +       case PP_SMC_POWER_PROFILE_POWERSAVING:
> > > +               pplib_workload = WORKLOAD_PPLIB_POWER_SAVING_BIT;
> > > +               break;
> > >         case PP_SMC_POWER_PROFILE_VIDEO:
> > >                 pplib_workload = WORKLOAD_PPLIB_VIDEO_BIT;
> > >                 break;
> > > @@ -1430,22 +1445,24 @@ static int conv_power_profile_to_pplib_workload(int power_profile)
> > >
> > >  static int smu10_get_power_profile_mode(struct pp_hwmgr *hwmgr, char *buf)
> > >  {
> > > +       struct smu10_hwmgr *smu10_data = (struct smu10_hwmgr *)(hwmgr->backend);
> > >         uint32_t i, size = 0;
> > >         static const uint8_t
> > > -               profile_mode_setting[6][4] = {{70, 60, 0, 0,},
> > > +               profile_mode_setting[7][4] = {{70, 60, 0, 0,},
> > >                                                 {70, 60, 1, 3,},
> > >                                                 {90, 60, 0, 0,},
> > >                                                 {70, 60, 0, 0,},
> > >                                                 {70, 90, 0, 0,},
> > >                                                 {30, 60, 0, 6,},
> > >                                                 };
> > > -       static const char *profile_name[6] = {
> > > +       static const char *profile_name[7] = {
> > >                                         "BOOTUP_DEFAULT",
> > >                                         "3D_FULL_SCREEN",
> > >                                         "POWER_SAVING",
> > >                                         "VIDEO",
> > >                                         "VR",
> > > -                                       "COMPUTE"};
> > > +                                       "COMPUTE",
> > > +                                       "CUSTOM"};
> > >         static const char *title[6] = {"NUM",
> > >                         "MODE_NAME",
> > >                         "BUSY_SET_POINT",
> > > @@ -1459,11 +1476,15 @@ static int smu10_get_power_profile_mode(struct pp_hwmgr *hwmgr, char *buf)
> > >         size += sysfs_emit_at(buf, size, "%s %16s %s %s %s %s\n",title[0],
> > >                         title[1], title[2], title[3], title[4], title[5]);
> > >
> > > -       for (i = 0; i <= PP_SMC_POWER_PROFILE_COMPUTE; i++)
> > > +       for (i = 0; i < PP_SMC_POWER_PROFILE_CUSTOM; i++)
> > >                 size += sysfs_emit_at(buf, size, "%3d %14s%s: %14d %3d %10d %14d\n",
> > >                         i, profile_name[i], (i == hwmgr->power_profile_mode) ? "*" : " ",
> > >                         profile_mode_setting[i][0], profile_mode_setting[i][1],
> > >                         profile_mode_setting[i][2], profile_mode_setting[i][3]);
> > > +       size += sysfs_emit_at(buf, size, "%3d %14s%s: %14d %3d %10d %14d\n", i,
> > > +                       profile_name[i], (i == hwmgr->power_profile_mode) ? "*" : " ",
> > > +                       smu10_data->custom_profile_mode[0], smu10_data->custom_profile_mode[1],
> > > +                       smu10_data->custom_profile_mode[2], smu10_data->custom_profile_mode[3]);
> > >
> > >         return size;
> > >  }
> > > @@ -1480,16 +1501,42 @@ static bool smu10_is_raven1_refresh(struct pp_hwmgr *hwmgr)
> > >
> > >  static int smu10_set_power_profile_mode(struct pp_hwmgr *hwmgr, long *input, uint32_t size)
> > >  {
> > > +       struct smu10_hwmgr *smu10_data = (struct smu10_hwmgr *)(hwmgr->backend);
> > > +       uint8_t busy_set_point, FPS, use_rlc_busy, min_active_level;
> > > +       uint32_t power_profile_mode = input[size];
> > >         int workload_type = 0;
> > >         int result = 0;
> > >
> > > -       if (input[size] > PP_SMC_POWER_PROFILE_COMPUTE) {
> > > +       if (input[size] > PP_SMC_POWER_PROFILE_CUSTOM) {
> > >                 pr_err("Invalid power profile mode %ld\n", input[size]);
> > >                 return -EINVAL;
> > >         }
> > >         if (hwmgr->power_profile_mode == input[size])
> > >                 return 0;
> > >
> > > +       if (power_profile_mode == PP_SMC_POWER_PROFILE_CUSTOM) {
> > > +               if (size != 0 && size != 4)
> > > +                       return -EINVAL;
> > > +
> > > +               if (size == 0) {
> > > +                       if (smu10_data->custom_profile_mode[0] != 0)
> > > +                               goto out;
> > > +                       else
> > > +                               return -EINVAL;
> > > +               }
> > > +
> > > +               smu10_data->custom_profile_mode[0] = busy_set_point = input[0];
> > > +               smu10_data->custom_profile_mode[1] = FPS = input[1];
> > > +               smu10_data->custom_profile_mode[2] = use_rlc_busy = input[2];
> > > +               smu10_data->custom_profile_mode[3] = min_active_level = input[3];
> > > +               smum_send_msg_to_smc_with_parameter(hwmgr,
> > > +                                       PPSMC_MSG_SetCustomPolicy,
> > > +                                       busy_set_point | FPS<<8 |
> > > +                                       use_rlc_busy << 16 | min_active_level<<24,
> > > +                                       NULL);
> > > +       }
> > > +
> > > +out:
> > >         /* conv PP_SMC_POWER_PROFILE* to WORKLOAD_PPLIB_*_BIT */
> > >         workload_type =
> > >                 conv_power_profile_to_pplib_workload(input[size]);
> > > diff --git a/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.h b/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.h
> > > index 808e0ecbe1f0..4c4b2b1b510a 100644
> > > --- a/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.h
> > > +++ b/drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.h
> > > @@ -302,6 +302,7 @@ struct smu10_hwmgr {
> > >         uint32_t                             num_active_display;
> > >
> > >         bool                                                    fine_grain_enabled;
> > > +       uint8_t                              custom_profile_mode[4];
> > >  };
> > >
> > >  struct pp_hwmgr;
> > > --
> > > 2.30.2
> > >



[Index of Archives]     [Linux DRI Users]     [Linux Intel Graphics]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [XFree86]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux