Re: [PATCH 06/10] media: uapi: h264: Cleanup DPB entry interface

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Jonas,

On Mon, 2020-07-27 at 23:39 +0000, Jonas Karlman wrote:
> Hi,
> 
> On 2020-07-24 21:08, Ezequiel Garcia wrote:
> > Hello Jonas,
> > 
> > On Wed, 2020-07-22 at 21:52 +0000, Jonas Karlman wrote:
> > > On 2020-07-15 22:22, Ezequiel Garcia wrote:
> > > > As discussed recently, the current interface for the
> > > > Decoded Picture Buffer is not enough to properly
> > > > support field coding.
> > > > 
> > > > This commit introduces enough semantics to support
> > > > frame and field coding, and to signal how DPB entries
> > > > are "used for reference".
> > > > 
> > > > Signed-off-by: Ezequiel Garcia <ezequiel@xxxxxxxxxxxxx>
> > > > ---
> > > >  .../media/v4l/ext-ctrls-codec.rst             | 46 ++++++++++++-------
> > > >  drivers/media/v4l2-core/v4l2-h264.c           |  4 +-
> > > >  drivers/staging/media/rkvdec/rkvdec-h264.c    |  8 ++--
> > > >  include/media/h264-ctrls.h                    |  8 +++-
> > > >  4 files changed, 42 insertions(+), 24 deletions(-)
> > > > 
> > > > diff --git a/Documentation/userspace-api/media/v4l/ext-ctrls-codec.rst b/Documentation/userspace-api/media/v4l/ext-ctrls-codec.rst
> > > > index dd8e5a2e8986..46d4c8c6ad47 100644
> > > > --- a/Documentation/userspace-api/media/v4l/ext-ctrls-codec.rst
> > > > +++ b/Documentation/userspace-api/media/v4l/ext-ctrls-codec.rst
> > > > @@ -2058,10 +2058,35 @@ enum v4l2_mpeg_video_h264_hierarchical_coding_type -
> > > >      * - __s32
> > > >        - ``bottom_field_order_cnt``
> > > >        -
> > > > +    * - enum :c:type:`v4l2_h264_dpb_reference`
> > > > +      - ``reference``
> > > > +      - Specifies how the DPB entry is referenced.
> > > >      * - __u32
> > > >        - ``flags``
> > > >        - See :ref:`DPB Entry Flags <h264_dpb_flags>`
> > > >  
> > > > +.. c:type:: v4l2_h264_dpb_reference
> > > > +
> > > > +.. cssclass:: longtable
> > > > +
> > > > +.. flat-table::
> > > > +    :header-rows:  0
> > > > +    :stub-columns: 0
> > > > +    :widths:       1 1 2
> > > > +
> > > > +    * - ``V4L2_H264_DPB_TOP_REF``
> > > > +      - 0x1
> > > > +      - The top field in field pair is used for
> > > > +        short-term reference.
> > > > +    * - ``V4L2_H264_DPB_BOTTOM_REF``
> > > > +      - 0x2
> > > > +      - The bottom field in field pair is used for
> > > > +        short-term reference.
> > > > +    * - ``V4L2_H264_DPB_FRAME_REF``
> > > > +      - 0x3
> > > > +      - The frame (or the top/bottom fields, if it's a field pair)
> > > > +        is used for short-term reference.
> > > > +
> > > >  .. _h264_dpb_flags:
> > > >  
> > > >  ``DPB Entries Flags``
> > > > @@ -2075,29 +2100,16 @@ enum v4l2_mpeg_video_h264_hierarchical_coding_type -
> > > >  
> > > >      * - ``V4L2_H264_DPB_ENTRY_FLAG_VALID``
> > > >        - 0x00000001
> > > > -      - The DPB entry is valid and should be considered
> > > > +      - The DPB entry is valid (non-empty) and should be considered.
> > > >      * - ``V4L2_H264_DPB_ENTRY_FLAG_ACTIVE``
> > > >        - 0x00000002
> > > > -      - The DPB entry is currently being used as a reference frame
> > > > +      - The DPB entry is used for reference.
> > > >      * - ``V4L2_H264_DPB_ENTRY_FLAG_LONG_TERM``
> > > >        - 0x00000004
> > > > -      - The DPB entry is a long term reference frame
> > > > +      - The DPB entry is used for long-term reference.
> > > >      * - ``V4L2_H264_DPB_ENTRY_FLAG_FIELD``
> > > >        - 0x00000008
> > > > -      - The DPB entry is a field reference, which means only one of the field
> > > > -        will be used when decoding the new frame/field. When not set the DPB
> > > > -        entry is a frame reference (both fields will be used). Note that this
> > > > -        flag does not say anything about the number of fields contained in the
> > > > -        reference frame, it just describes the one used to decode the new
> > > > -        field/frame
> > > > -    * - ``V4L2_H264_DPB_ENTRY_FLAG_BOTTOM_FIELD``
> > > > -      - 0x00000010
> > > > -      - The DPB entry is a bottom field reference (only the bottom field of the
> > > > -        reference frame is needed to decode the new frame/field). Only valid if
> > > > -        V4L2_H264_DPB_ENTRY_FLAG_FIELD is set. When
> > > > -        V4L2_H264_DPB_ENTRY_FLAG_FIELD is set but
> > > > -        V4L2_H264_DPB_ENTRY_FLAG_BOTTOM_FIELD is not, that means the
> > > > -        DPB entry is a top field reference
> > > > +      - The DPB entry is a single field or a complementary field pair.
> > > >  
> > > >  ``V4L2_CID_MPEG_VIDEO_H264_DECODE_MODE (enum)``
> > > >      Specifies the decoding mode to use. Currently exposes slice-based and
> > > > diff --git a/drivers/media/v4l2-core/v4l2-h264.c b/drivers/media/v4l2-core/v4l2-h264.c
> > > > index edf6225f0522..306a51683606 100644
> > > > --- a/drivers/media/v4l2-core/v4l2-h264.c
> > > > +++ b/drivers/media/v4l2-core/v4l2-h264.c
> > > > @@ -66,10 +66,10 @@ v4l2_h264_init_reflist_builder(struct v4l2_h264_reflist_builder *b,
> > > >  		else
> > > >  			b->refs[i].frame_num = dpb[i].frame_num;
> > > >  
> > > > -		if (!(dpb[i].flags & V4L2_H264_DPB_ENTRY_FLAG_FIELD))
> > > > +		if (dpb[i].reference & V4L2_H264_DPB_FRAME_REF)
> > > 
> > > This looks wrong, should probably use ==,
> > > 
> > > dpb[i].reference == V4L2_H264_DPB_FRAME_REF
> > > 
> > > else this would match any reference value.
> > > 
> > > >  			pic_order_count = min(dpb[i].top_field_order_cnt,
> > > >  					      dpb[i].bottom_field_order_cnt);
> > > > -		else if (dpb[i].flags & V4L2_H264_DPB_ENTRY_FLAG_BOTTOM_FIELD)
> > > > +		else if (dpb[i].reference & V4L2_H264_DPB_BOTTOM_REF)
> > > >  			pic_order_count = dpb[i].bottom_field_order_cnt;
> > > >  		else
> > > >  			pic_order_count = dpb[i].top_field_order_cnt;
> > > > diff --git a/drivers/staging/media/rkvdec/rkvdec-h264.c b/drivers/staging/media/rkvdec/rkvdec-h264.c
> > > > index 7b66e2743a4f..57539c630422 100644
> > > > --- a/drivers/staging/media/rkvdec/rkvdec-h264.c
> > > > +++ b/drivers/staging/media/rkvdec/rkvdec-h264.c
> > > > @@ -953,11 +953,11 @@ static void config_registers(struct rkvdec_ctx *ctx,
> > > >  			     RKVDEC_COLMV_USED_FLAG_REF;
> > > >  
> > > >  		if (!(dpb[i].flags & V4L2_H264_DPB_ENTRY_FLAG_FIELD))
> > > > -			refer_addr |= RKVDEC_TOPFIELD_USED_REF |
> > > > -				      RKVDEC_BOTFIELD_USED_REF;
> > > > -		else if (dpb[i].flags & V4L2_H264_DPB_ENTRY_FLAG_BOTTOM_FIELD)
> > > > +			refer_addr |= RKVDEC_FIELD_REF;
> > > > +
> > > > +		if (dpb[i].reference & V4L2_H264_DPB_TOP_REF)
> > > >  			refer_addr |= RKVDEC_BOTFIELD_USED_REF;
> > > > -		else
> > > > +		else if (dpb[i].reference & V4L2_H264_DPB_BOTTOM_REF)
> > > 
> > > This should probably be if and not else if, and BOTFIELD/TOPFIELD_USED_REF
> > > seems to be mixed up.
> > > 
> > > I have only taken a quick look so far, I will update ffmpeg and runtime test
> > > later this weekend, will get back with result and full review on Sunday evening.
> > > 
> > 
> > Thanks that would be useful.
> > 
> > However, keep in mind this series is specifically concerned
> > with the uAPI review.
> > 
> > This is not supposed to fix the field coded support, or anything
> > else in any driver.
> > 
> > IMO, at this stage, fixing drivers is somewhat lower priority
> > than discussing and stabilizing the uAPI.
> 
> I have now tested rkvdec on a RK3328 device and needed to do 3 fixups, see [1].
> Initial ffmpeg update using update h264 uapi is located at [2], the ffmpeg
> update still needs to be tested with cedrus and hantro.
> 
> So far I have not seen any issue with the uapi changes.
> 

Great, thanks for the test.

> Q: ffmpeg will not try to set SLICE_PARAMS or PRED_WEIGHT ctrls for
> DECODE_MODE_SLICE_BASED,

You mean it will not try to set those controls for
DECODE_MODE_FRAME_BASED?

I think that's correct, as we've discussed multiple times,
frame-based drivers shouldn't need to use those controls,
by definition.

>  should userspace check if ctrl exists or is using
> DECODE_MODE value okay?
> 

If a driver supporting DECODE_MODE_SLICE_BASED doesn't 
support SLICE_PARAMS, that can probably be considered
a failure from the application side.

If it doesn't support PRED_WEIGHTS controls, then
strictly speaking it won't be able to decode slices
that have a prediction weight table, i.e.:

((weighted_pred_flag && (slice_type == P || slice_type == SP)) || \
(weighted_bipred_idc == 1 && slice_type == B))

I doubt we'll ever counter such case, so probably
if the controls aren't supported, applications
can just safely fail (and e.g. fallback to software).

> I have also pushed an updated WIP branch at [3] containing high 10,
> field encoded and hevc work.
> 
> [1] https://github.com/Kwiboo/linux-rockchip/compare/b6b91f27c0cb33520e954e7bb2550e0e07ed4d85...b82b6e93feb9ca44d2c677f25416cf6345f0114d
> [2] https://github.com/Kwiboo/FFmpeg/commits/v4l2-request-hwaccel-4.3.1
> [3] https://github.com/Kwiboo/linux-rockchip/commits/linuxtv-rkvdec-work-in-progress
> 
> Best regards,
> Jonas
> 
> > Thanks,
> > Ezequiel
> > 
> > 





[Index of Archives]     [Linux Input]     [Video for Linux]     [Gstreamer Embedded]     [Mplayer Users]     [Linux USB Devel]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [Yosemite Backpacking]

  Powered by Linux