Re: [PATCH v4 10/13] drm/msm/dpu: allow sharing SSPP between planes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 





On 6/12/2024 2:08 AM, Dmitry Baryshkov wrote:
On Wed, 12 Jun 2024 at 02:12, Abhinav Kumar <quic_abhinavk@xxxxxxxxxxx> wrote:



On 3/13/2024 5:02 PM, Dmitry Baryshkov wrote:
Since SmartDMA planes provide two rectangles, it is possible to use them
to drive two different DRM planes, first plane getting the rect_0,
another one using rect_1 of the same SSPP. The sharing algorithm is
pretty simple, it requires that each of the planes can be driven by the
single rectangle and only consequetive planes are considered.


consequetive - > consecutive

Can you please explain why only consecutive planes are considered for this?

So lets say we have 4 virtual planes : 0, 1, 2, 3

It will try 0-1, 1-2, 2-3

Because all planes are virtual, there are only 3 unique pairs to be
considered? Otherwise technically 6 pairs are possible.

An implementation that tries all 6 pairs taking the zpos and the
overlapping into account is appreciated. I cared for the simplest case
here. Yes, further optimizations can be implemented.


Ok got it. So you would like to build a better one on top of this.
But I see one case where this has an issue or is not optimal. Pls see below.



General request:

Patches 1-9 : Add support for using 2 SSPPs in one plane
Patches 10-12 : Add support for using two rectangles of the same SSPP as
two virtual planes
Patch 13 : Can be pushed along with the first set.

Can we break up this series in this way to make it easier to test and
land the bulk of it in this cycle?

Sure.


Thanks.


I have some doubts on patches 10-12 and would like to spend more time
reviewing and testing this. So I am trying to reduce the debt of patches
we have been carrying as this is a tricky feature to simulate and test
the cases.

Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@xxxxxxxxxx>
---
   drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c | 128 +++++++++++++++++++---
   1 file changed, 112 insertions(+), 16 deletions(-)

diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
index cde20c1fa90d..2e1c544efc4a 100644
--- a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
+++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
@@ -886,10 +886,9 @@ static int dpu_plane_atomic_check_nopipe(struct drm_plane *plane,
       return 0;
   }

-static int dpu_plane_is_multirect_parallel_capable(struct dpu_sw_pipe *pipe,
-                                                struct dpu_sw_pipe_cfg *pipe_cfg,
-                                                const struct dpu_format *fmt,
-                                                uint32_t max_linewidth)
+static int dpu_plane_is_multirect_capable(struct dpu_sw_pipe *pipe,
+                                       struct dpu_sw_pipe_cfg *pipe_cfg,
+                                       const struct dpu_format *fmt)
   {
       if (drm_rect_width(&pipe_cfg->src_rect) != drm_rect_width(&pipe_cfg->dst_rect) ||
           drm_rect_height(&pipe_cfg->src_rect) != drm_rect_height(&pipe_cfg->dst_rect))
@@ -901,6 +900,13 @@ static int dpu_plane_is_multirect_parallel_capable(struct dpu_sw_pipe *pipe,
       if (DPU_FORMAT_IS_YUV(fmt))
               return false;

+     return true;
+}
+
+static int dpu_plane_is_parallel_capable(struct dpu_sw_pipe_cfg *pipe_cfg,
+                                      const struct dpu_format *fmt,
+                                      uint32_t max_linewidth)
+{
       if (DPU_FORMAT_IS_UBWC(fmt) &&
           drm_rect_width(&pipe_cfg->src_rect) > max_linewidth / 2)
               return false;
@@ -908,6 +914,82 @@ static int dpu_plane_is_multirect_parallel_capable(struct dpu_sw_pipe *pipe,
       return true;
   }

+static int dpu_plane_is_multirect_parallel_capable(struct dpu_sw_pipe *pipe,
+                                                struct dpu_sw_pipe_cfg *pipe_cfg,
+                                                const struct dpu_format *fmt,
+                                                uint32_t max_linewidth)
+{
+     return dpu_plane_is_multirect_capable(pipe, pipe_cfg, fmt) &&
+             dpu_plane_is_parallel_capable(pipe_cfg, fmt, max_linewidth);
+}
+
+
+static int dpu_plane_try_multirect(struct dpu_plane_state *pstate,
+                                struct dpu_plane_state *prev_pstate,
+                                const struct dpu_format *fmt,
+                                uint32_t max_linewidth)
+{
+     struct dpu_sw_pipe *pipe = &pstate->pipe;
+     struct dpu_sw_pipe *r_pipe = &pstate->r_pipe;
+     struct dpu_sw_pipe_cfg *pipe_cfg = &pstate->pipe_cfg;
+     struct dpu_sw_pipe *prev_pipe = &prev_pstate->pipe;
+     struct dpu_sw_pipe_cfg *prev_pipe_cfg = &prev_pstate->pipe_cfg;
+     const struct dpu_format *prev_fmt =
+             to_dpu_format(msm_framebuffer_format(prev_pstate->base.fb));
+     u16 max_tile_height = 1;
+
+     if (prev_pstate->r_pipe.sspp != NULL ||
+         prev_pipe->multirect_mode != DPU_SSPP_MULTIRECT_NONE)
+             return false;
+
+     if (!dpu_plane_is_multirect_capable(pipe, pipe_cfg, fmt) ||
+         !dpu_plane_is_multirect_capable(prev_pipe, prev_pipe_cfg, prev_fmt) ||
+         !(test_bit(DPU_SSPP_SMART_DMA_V1, &prev_pipe->sspp->cap->features) ||
+           test_bit(DPU_SSPP_SMART_DMA_V2, &prev_pipe->sspp->cap->features)))

This test_bit check should be absorbed into
dpu_plane_is_multirect_capable()?

Yep.


+             return false;
+
+     if (DPU_FORMAT_IS_UBWC(fmt))
+             max_tile_height = max(max_tile_height, fmt->tile_height);
+
+     if (DPU_FORMAT_IS_UBWC(prev_fmt))
+             max_tile_height = max(max_tile_height, prev_fmt->tile_height);
+
+     r_pipe->multirect_index = DPU_SSPP_RECT_SOLO;
+     r_pipe->multirect_mode = DPU_SSPP_MULTIRECT_NONE;
+
+     r_pipe->sspp = NULL;
+
+     if (dpu_plane_is_parallel_capable(pipe_cfg, fmt, max_linewidth) &&
+         dpu_plane_is_parallel_capable(prev_pipe_cfg, prev_fmt, max_linewidth) &&
+         (pipe_cfg->dst_rect.x1 >= prev_pipe_cfg->dst_rect.x2 ||
+          prev_pipe_cfg->dst_rect.x1 >= pipe_cfg->dst_rect.x2)) {

Even if y1 > y2 or y2 > y1 but the separation is less than the  2 *
max_tile_height, it can qualify for parallel fetch.

So parallel fetch is possible not only in x direction but y direction as
well as it will be fetched by different SSPPs.

I think that's now what I see in the SDE driver.


hmm , okay, we can support that case once this one works without issues.


+             pipe->sspp = prev_pipe->sspp;
+
+             pipe->multirect_index = DPU_SSPP_RECT_1;
+             pipe->multirect_mode = DPU_SSPP_MULTIRECT_PARALLEL;
+
+             prev_pipe->multirect_index = DPU_SSPP_RECT_0;
+             prev_pipe->multirect_mode = DPU_SSPP_MULTIRECT_PARALLEL;
+
+             return true;
+     }
+
+     if (pipe_cfg->dst_rect.y1 >= prev_pipe_cfg->dst_rect.y2 + 2 * max_tile_height ||
+         prev_pipe_cfg->dst_rect.y1 >= pipe_cfg->dst_rect.y2 + 2 * max_tile_height) {
+             pipe->sspp = prev_pipe->sspp;
+
+             pipe->multirect_index = DPU_SSPP_RECT_1;
+             pipe->multirect_mode = DPU_SSPP_MULTIRECT_TIME_MX;
+
+             prev_pipe->multirect_index = DPU_SSPP_RECT_0;
+             prev_pipe->multirect_mode = DPU_SSPP_MULTIRECT_TIME_MX;
+
+             return true;
+     }
+
+     return false;
+}
+
   static int dpu_plane_atomic_check_pipes(struct drm_plane *plane,
                                       struct drm_atomic_state *state,
                                       const struct drm_crtc_state *crtc_state)
@@ -1098,13 +1180,14 @@ static int dpu_plane_virtual_atomic_check(struct drm_plane *plane,
   static int dpu_plane_virtual_assign_resources(struct drm_crtc *crtc,
                                             struct dpu_global_state *global_state,
                                             struct drm_atomic_state *state,
-                                           struct drm_plane_state *plane_state)
+                                           struct drm_plane_state *plane_state,
+                                           struct drm_plane_state *prev_plane_state)
   {
       const struct drm_crtc_state *crtc_state = NULL;
       struct drm_plane *plane = plane_state->plane;
       struct dpu_kms *dpu_kms = _dpu_plane_get_kms(plane);
       struct dpu_rm_sspp_requirements reqs;
-     struct dpu_plane_state *pstate;
+     struct dpu_plane_state *pstate, *prev_pstate;
       struct dpu_sw_pipe *pipe;
       struct dpu_sw_pipe *r_pipe;
       struct dpu_sw_pipe_cfg *pipe_cfg;
@@ -1117,6 +1200,7 @@ static int dpu_plane_virtual_assign_resources(struct drm_crtc *crtc,
                                                          plane_state->crtc);

       pstate = to_dpu_plane_state(plane_state);
+     prev_pstate = prev_plane_state ? to_dpu_plane_state(prev_plane_state) : NULL;
       pipe = &pstate->pipe;
       r_pipe = &pstate->r_pipe;
       pipe_cfg = &pstate->pipe_cfg;
@@ -1137,19 +1221,27 @@ static int dpu_plane_virtual_assign_resources(struct drm_crtc *crtc,

       max_linewidth = dpu_kms->catalog->caps->max_linewidth;

-     pipe->sspp = dpu_rm_reserve_sspp(&dpu_kms->rm, global_state, crtc, &reqs);
-     if (!pipe->sspp)
-             return -ENODEV;
-
       if (drm_rect_width(&r_pipe_cfg->src_rect) == 0) {
-             pipe->multirect_index = DPU_SSPP_RECT_SOLO;
-             pipe->multirect_mode = DPU_SSPP_MULTIRECT_NONE;
+             if (!prev_pstate ||
+                 !dpu_plane_try_multirect(pstate, prev_pstate, fmt, max_linewidth)) {

This is a bit confusing to check esp since i am unable to apply this
patch and check .... but...

It was posted several months ago. No surprise that the source code has
evolved. Getting the patches reviewed in time would have helped them
to be applicable.


Yes, part of the delays for virtual plane was purely because the CB setup was down (both due to internal IT issues and general sc7280 being down) and I want to make sure this series is compositor-tested and not just modetest tested.

But anyway, thats why I didnt request a rebase this time even though it was very hard to review the patch emails for this series.

dpu_plane_atomic_check_nopipe() will set r_pipe_cfg if we are going to
do multirect with two rectangles of the same sspp. Right?

No. It sets r_pipe_cfg in all the cases.


From what I see, we still have this check before a valid rectangle is set for the r_pipe_cfg

 	if ((drm_rect_width(&pipe_cfg->src_rect) > max_linewidth) ||
_dpu_plane_calc_clk(&crtc_state->adjusted_mode, pipe_cfg) > max_mdp_clk_rate) {



Which means r_pipe_cfg will be 0 if multirect is not possible with same
SSPP. Thats why the else case of this either uses two SSPPs.

No. It means that the plane can use a single rectangle of the SSPP.


OR that the plane does not need to use multirect because its rectangle width is < max_linewidth.


So why are we trying multirect with again with the two rectangles of the
same SSPP as different planes? The result will be same right?

No, if the width of r_pipe_cfg is 0, it means that this plane doesn't
need a second rectangle to be displayed. So we can try reusing the
SSPP from the previous plane.


Yes, agreed to this point that this plane doesnt need a second rectangle to be displayed as it will fit in one rectangle.

And I see what you mean now, if the current plane needs only one rectangle to be used, you are trying to use the prev plane's SSPP's other rect?

So lets say we have plane 1 and plane 2 in the list.

Plane 1 has only one rect used and plane 2 also needs only one rect.

Then you use plane 1's SSPP even for plane 2.

Cant you use an alternative check like !dpu_plane_is_wideplane_multirect() to make this condition clear?

Also dpu_plane_try_multirect() name is confusing then because you are trying multi-rect again to see if the SSPP can be shared and wide-plane multirect was not possible. So technically both are multirect, just dfferent applications.

That will make it clear that you are trying to use multi-rect for sharing SSPP.

So there are essentially two use-cases of multi-rect:

1) Wide plane multi-rect
2) SSPP sharing multi-rect

So this will make it clear.

Coming to the algorithm, I see one issue with this now.

Lets say we have this list of SSPPs.

DMA0 Vig0 Vig1

DMA0 has only rec0 used and rec1 is free.

Vig0 needs both recs used.

Vig1 needs only one rec.

Here it will notice that its previous plane has both rects used and will not try the DMA0 even though it has one rect free and will end up using a new SSPP.

Thats why considering only immediate pairs is not enough. All possible pairs will address this.



+                     pipe->sspp = dpu_rm_reserve_sspp(&dpu_kms->rm, global_state, crtc, &reqs);
+                     if (!pipe->sspp)
+                             return -ENODEV;

-             r_pipe->multirect_index = DPU_SSPP_RECT_SOLO;
-             r_pipe->multirect_mode = DPU_SSPP_MULTIRECT_NONE;
+                     r_pipe->sspp = NULL;
+
+                     pipe->multirect_index = DPU_SSPP_RECT_SOLO;
+                     pipe->multirect_mode = DPU_SSPP_MULTIRECT_NONE;
+
+                     r_pipe->multirect_index = DPU_SSPP_RECT_SOLO;
+                     r_pipe->multirect_mode = DPU_SSPP_MULTIRECT_NONE;
+             }

-             r_pipe->sspp = NULL;
       } else {
+             pipe->sspp = dpu_rm_reserve_sspp(&dpu_kms->rm, global_state, crtc, &reqs);
+             if (!pipe->sspp)
+                     return -ENODEV;
+

Unless I am missing something, you are assigning pipe->sspp in both if
and else cases, so why dont you keep the allocation if pipe->sspp
outside the conditionals.

You missed the conditional in the previous chunk. We need to reserve
SSPP if the plane uses two rectangles. We don't need to reserve an
SSPP if the old SSPP is going to be used.


Ack.


               if (dpu_plane_is_multirect_parallel_capable(pipe, pipe_cfg, fmt, max_linewidth) &&
                   dpu_plane_is_multirect_parallel_capable(r_pipe, r_pipe_cfg, fmt, max_linewidth) &&
                   (test_bit(DPU_SSPP_SMART_DMA_V1, &pipe->sspp->cap->features) ||
@@ -1186,6 +1278,7 @@ int dpu_assign_plane_resources(struct dpu_global_state *global_state,
   {
       unsigned int i;
       int ret;
+     struct drm_plane_state *prev_plane_state = NULL;


This naming is a bit confusing. prev_plane_state could mean the plane's
previous state but here you are implying the state of the previous plane
in the list of planes.

Maybe prev_adjacent_plane_state?

Ack.


       for (i = 0; i < num_planes; i++) {
               struct drm_plane_state *plane_state = states[i];
@@ -1195,9 +1288,12 @@ int dpu_assign_plane_resources(struct dpu_global_state *global_state,
                       continue;

               ret = dpu_plane_virtual_assign_resources(crtc, global_state,
-                                                      state, plane_state);
+                                                      state, plane_state,
+                                                      prev_plane_state);
               if (ret)
                       break;
+
+             prev_plane_state = plane_state;
       }

       return ret;






[Index of Archives]     [Linux DRI Users]     [Linux Intel Graphics]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [XFree86]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux