> From: Laurent Pinchart [mailto:laurent.pinchart@xxxxxxxxxxxxxxxx] > > Hi Kamil, Hi Laurent, > > Thanks for the RC. Here are some comments (more might come later when I'll > get > a better understanding of some MPEG4 and H264 concepts :-)). Thank you for your comments. I have answered them below. > On Tuesday 29 March 2011 11:48:03 Kamil Debski wrote: > > Hi, > > > > I would like to propose a new set of controls for codec devices. By codec > > devices I mean hardware that can encode raw video into compressed stream > > and can decode compressed streams. > > > > This topic has been discussed during the v4l2 brainstorming session and > > this email contains the propositions that have been made. > > > > The conclusion was that we should use the existing V4L2_CID_MPEG_CLASS for > > controls related do encoding and decoding data. I had to distinguish which > > controls should be common for all codecs and which should be specific to > > MFC 5.1 (I am working on the driver for this hardware). To do this I > > have analysed the options of ffmpeg and x264 software decoders. If the > > options was used only in MFC then it should be hardware specific. If it > > was also used in ffmpeg or x264 then it would qualify as a common control. > > I would welcome comments from other hw codec driver developers. > > > > I have another question - regarding v4l2_buffer flags. What do you think > > about adding a new flag - V4L2_BUF_FLAG_SKIPPED? It would be used in > > encoding to indicate that the frame to be encoded should be skipped and > > write information that the decoder should display the previous frame > > again. > > > > Please find the table with controls attached below: > > - the names have the "V4L2_CID_MPEG_" prefix truncated > > - letter in the second column means D for decoder, E for encoder and A for > > both > > - last column is MFC if the control should be hw specific, common > otherwise > > > > *** Controls for decoding *** > > 1) DECODER_MPEG4_LOOP_FILTER D MPEG4 MFC > > Deblocking filter on the decoder output for MPEG4. > > 2) DECODER_SLICE_INTERFACE D all common > > If set the decoder can accept separate picture slices on input, otherwise > > it requires a whole frame. > > Isn't that a property instead of a settable control ? I'm not sure to > understand why applications would need to tell the decoder what it can > accept. > Or is it supposed to be a read-only control ? I think I have not described this control properly. It is used by the application to set what kind of buffer it will send to the driver. A picture can consist of one or more slice. Our hardware can accept two kinds of input buffers and its behavior is determined by a register. So it is a read/write control. In the first case, a single input buffer contains a whole picture (one or more slices). In the second case, a single input buffer contains a single slice. A nice figure describing pictures and slices can be found here: http://goo.gl/info/bnFe2 > > 3) DECODER_H264_DISPLAY_DELAY D H264 MFC > > Programmable display delay for H264. MFC will return a decoded frame after > > the set number of frames (this may cause that frames are not returned in > > display order). > > 4) DECODER_H264_DISPLAY_DELAY_ENABLE D H264 MFC > > Enable display delay for H264. > > Can't display delay enable/disable be controlled through the > DECODER_H264_DISPLAY_DELAY control ? A zero value would mean no display > delay. Firstly I have had the same idea as you have proposed. But in this case zero value is meaningful. If display delay is disabled then it is up to the decoder to determine how many OUTPUT buffers will it process before a buffer on the CAPTURE is dequeued. This is influenced by the number of B frames and how many reference frames are used by P and B frames. This ensures that after the CAPTURE buffer is dequeued it won't be used by the hardware. If the display delay is enabled then the decoder has to return an CAPTURE buffer after processing a certain number of OUTPUT buffers. If this number is low, then it may result in buffers not being dequeued in display order. Even worse, I have just discovered that the buffer can still be used by the hardware as a reference after it has been dequeued in case of low display delay. So I think that this feature is very specific to our hardware (thus I think it should be hw specific control) and application has to be aware of this. > What is the display delay used for ? As described above - it is used to force hardware to return an CAPTURE buffer after a set number or OUTPUT buffers was processed. > > *** Controls for both decoding and encoding *** > > 5) H264_AR_VUI_IDC A H264 common > > VUI aspect ratio IDC for H.264 encoding. The value is defined in VUI > Table > > E-1 in the standard. > > 6) H264_EXT_SAR_WIDTH A H264 common > > Extended sample aspect ratio width for H.264 VUI encoding. > > 7) H264_EXT_SAR_HEIGHT A H264 common > > Extended sample aspect ratio height for H.264 VUI encoding > > What are those 3 controls used for when decoding ? While encoding it is used to set the aspect ratio indicator in the VUI. During decoding it is used to read such information. Maybe a better name would be H264_VUI_(AR_IDC, EXT_SAR_WIDTH, EXT_SAR_HEIGHT). > > 8) MIN_REQ_BUFS_OUT A all common > > This is the minimum number of buffers required for the output queue. This > > option may be useful if the encoding settings require a minimum number of > > buffers required as reference and the application would like to have N > more > > buffers. For example - the encoding options require 3 buffers and the > > application wants to have 2 more. One can read this value (3) and supply > > (3+2) to the reqbufs call. > > 9) MIN_REQ_BUFS_CAP A all common > > This is the minimum number of buffers required for the capture queue. This > > option may be useful if the decoder requires a minimum number of buffers > > required as reference and the application would like to have N more > > buffers. For example - the stream requires 3 buffers and the application > > wants to have 2 more. One can read this value (3) and supply (3+2) to the > > reqbufs call. > > Are these 2 read-only controls ? Yes they are read only. > > *** Controls for encoding *** > > 10) GOP_SIZE E all common > > The size of group of pictures. For H264 this is the IDR period. > > 11) H264_LEVEL E H264 common > > The level information for H264. > > 12) MPEG4_LEVEL E MPEG4 common > > The level information for MPEG4. > > Do those need to be separate controls ? I've been thinking hard about this for some time. My first thought was to have a single control. On second thought I have decided to have them separate. Level names are pretty similar though there are important differences. In case of H264 the level is a number consisting of two digits (such as 1.1 or 4.0) and level "1b". In case of MPEG4 we have a single digit 0, 1, 2, 3, 4, 5 and 0b and 3b levels. > > 13) H264_PROFILE E H264 common > > The profile information for H264. > > 14) MPEG4_PROFILE E MPEG4 common > > The profile information for MPEG4. > In case of profiles the naming differs quite a lot. For H264 we have a lot of profiles to choose from (http://en.wikipedia.org/wiki/H.264/MPEG-4_AVC#Profiles). For MPEG4 we have: Simple, Advanced Simple, Core, Simple Scalable and Advanced Coding Efficiency. > > > 15) B_FRAMES E H264, MPEG4 common > > The number of B-frames to use between P frames. > > 16) MAX_REF_PIC E H264 common > > The maximum number of reference pictures used for encoding. > > 17) NUM_REF_PIC_FOR_P E H264 MFC > > The number of reference pictures used for encoding a P picture. > > 18) MULTI_SLICE_MODE E H264, MPEG4 common > > Determines how multiple slices are handled. > > 19) MULTI_SLICE_MB E H264, MPEG4 common > > The upper limit of macroblocks for a slice. > > Maybe MULTI_SLICE_MAX_MB ? I agree. > > 20) MULTI_SLICE_BITS E H264, MPEG4 common > > The upper limit of size for a slice. > > MULTI_SLICE_MAX_BITS ? Ditto. > > 21) H264_LOOP_FILTER_MODE E H264 common > > Loop filter mode for H264. > > There's a DECODER_MPEG4_LOOP_FILTER control. Are they similar ? The difference is that H264_LOOP_FILTER is a control for encoding. It is used during encoding. The encoder set this and a decoder has to obey the setting used to encode that stream. In case of MPEG4 the filter is used for post processing. > > 22) H264_LOOP_FILTER_ALPHA E H264 common > > Loop filter alpha coefficient, defined in the standard. > > 23) H264_LOOP_FILTER_BETA E H264 common > > Loop filter alpha coefficient, defined in the standard. > > s/alpha/beta/ Thanks, it was a typo. > > 24) H264_SYMBOL_MODE E H264 common > > Symbol mode for H264 - CABAC/CAVALC. > > 25) INTERLACE E H264, MPEG4 common > > Enable interlace mode. > > 26) H264_8X8_TRANSFORM E H264 common > > Enable 8X8 transform for H264. > > 27) INTRA_REFRESH_MB E all common > > Period of random intra macroblock refresh. > > 28) PADDING_ENABLE E all MFC > > Padding enable - use a color instead of repeating border pixels. > > 29) PADDING_LUMA E all MFC > > 30) PADDING_CB E all MFC > > 31) PADDING_CR E all MFC > > Do we need 3 controls for that, or can we use a single one (similarly to > what > we did with the color key control) ? You mean like the chromakey field here: http://v4l2spec.bytesex.org/spec-single/v4l2.html#V4L2-WINDOW ? For V4L2_PIX_FMT_BGR24 it seems quite reasonable. What about V4L2_PIX_FMT_NV12MT? There are two planes which makes it less obvious. Something like 0xYYCbCr? > > > 32) FRAME_RC_ENABLE E all common > > Frame level rate control enable. > > 33) MB_RC_ENABLE E H264 common > > Macroblock level rate control enable. > > 34) FRAME_RATE E all common > > Frames per second in 1000x scale (e.g., 7500 stands for 7.5 frames/sec). > > What is this used for ? This is used for encoding rate control. If you want to control the bit rate per second then you need to know how many frames are there per second. > > 35) RC_BITRATE E all common > > Bitrate for rate control. > > 36) RC_REACTION_COEFF E all MFC > > Reaction coefficient for MFC rate control. > > 37) H264_ADAPTIVE_RC_DARK_DISABLE E H264 MFC > > Disable adaptive rate control for dark region. > > 38) H264_ADAPTIVE_RC_SMOOTH_DISABLE E H264 MFC > > Disable adaptive rate control for smooth region. > > 39) H264_ADAPTIVE_RC_STATIC_DISABLE E H264 MFC > > Disable adaptive rate control for static region. > > 40) H264_ADAPTIVE_RC_ACTIVITY_DISABLE E H264 MFC > > Disable adaptive rate control for region with activity. > > 41) MPEG4_QPEL_DISABLE E MPEG4 common > > Disable quarter pixel motion estimation for MPEG4. > > Should we remove the _DISABLE suffix and make the controls default to true ? I agree, thanks. > > 42) I_FRAME_QP E all common > > Quantization parameter for an I frame. > > 43) MIN_QP E all common > > Minimum quantization parameter. > > 44) MAX_QP E all common > > Maximum quantization parameter. > > 45) P_FRAME_QP E all common > > Quantization parameter for an P frame. > > 46) B_FRAME_QP E H264, MPEG4 common > > Quantization parameter for an B frame. > > 47) VBV_BUF_SIZE E H264, MPEG* common > > VBV buffer size. I think it is valid for all MPEG versions. > > 48) FRAME_SKIP_MODE E all MFC > > Mode of skipping frames for VBV compliance. > > 49) RC_FIXED_TARGET_BIT E all MFC > > 50) MPEG4_VOP_TIME_RES E MPEG4 MFC > > Used to compute vop_time_increment and modulo_time_base in MPEG4. > > 51) MPEG4_VOP_FRAME_DELTA E MPEG4 MFC > > Used to compute vop_time_increment and modulo_time_base in MPEG4. > > 52) H264_OPEN_GOP E H264 common > > Enable open GOP in H264. > > 53) H264_I_PERIOD E H264 common > > Period between I frames in open GOP for H264. > > 54) H264_AR_VUI_ENABLE E H264 MFC > > Enable writing aspect ratio in VUI. > > 55) HEADER_MODE E all common > > Determines whether the header is returned as the first buffer or is > > it returned together with the first frame. > > 56) FORCE_FRAME_TYPE E all MFC > > Force frame type on the encoder - either I-frame or skipped. > > I hope that buffer flags could be used instead of this. > > 57) FRAME_TAG E all MFC > > Frame tag is assigned to an input buffer passed to hardware, and > > the same frame tag is then assigned to the buffer that contains the > > result of processing that frame. > Best regards, -- Kamil Debski Linux Platform Group Samsung Poland R&D Center -- To unsubscribe from this list: send the line "unsubscribe linux-media" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html