Hello, On Friday 13 May 2016 12:52:42 Sakari Ailus wrote: > On Fri, May 13, 2016 at 11:26:22AM +0200, Hans Verkuil wrote: > > On 05/12/2016 02:17 AM, Laurent Pinchart wrote: > > > Hello, > > > > > > This RFC patch series is a second attempt at adding support for passing > > > statistics data to userspace using a standard API. > > > > > > The core requirements haven't changed. Statistics data capture requires > > > zero-copy and decoupling statistics buffers from images buffers, in > > > order to make statistics data available to userspace as soon as they're > > > captured. For those reasons the early consensus we have reached is to > > > use a video device node with a buffer queue to pass statistics buffers > > > using the V4L2 API, and this new RFC version doesn't challenge that. > > > > > > The major change compared to the previous version is how the first patch > > > has been split in two. Patch 1/4 now adds a new metadata buffer type > > > and format (including their support in videobuf2-v4l2), usable with > > > regular V4L2 video device nodes, while patch 2/4 adds the new metadata > > > video device type. Metadata buffer queues are thus usable on both the > > > regular V4L2 device nodes and the new metadata device nodes. > > > > > > This change was driven by the fact that an important category of use > > > cases doesn't differentiate between metadata and image data in hardware > > > at the DMA engine level. With such hardware (CSI-2 receivers in > > > particular, but other bus types could also fall into this category) a > > > stream containing both metadata and image data virtual streams is > > > transmitted over a single physical link. The receiver demultiplexes, > > > filters and routes the virtual streams to further hardware blocks, and > > > in many cases, directly to DMA engines that are part of the receiver. > > > Those DMA engines can capture a single virtual stream to memory, with as > > > many DMA engines physically present in the device as the number of > > > virtual streams that can be captured concurrently. All those DMA engines > > > are usually identical and don't care about the type of data they receive > > > and capture. For that reason limiting the metadata buffer type to > > > metadata device nodes would require creating two device nodes for each > > > DMA engine (and possibly more later if we need to capture other types > > > of data). Not only would this make the API more complex to use for > > > applications, it wouldn't bring any added value as the video and > > > metadata device nodes associated with a DMA engine couldn't be used > > > concurrently anyway, as they both correspond to the same hardware > > > resource. > > > > > > For this reason the ability to capture metadata on a video device node > > > is useful and desired, and is implemented patch 1/4 using a dedicated > > > video buffers queue. In the CSI-2 case a driver will create two buffer > > > queues internally for the same DMA engine, and can select which one to > > > use based on the buffer type passed for instance to the REQBUFS ioctl > > > (details still need to be discussed here). > > > > Not quite. It still has only one vb2_queue, you just change the type > > depending on what mode it is in (video or meta data). Similar to raw vs > > sliced VBI. > > > > In the latter case it is the VIDIOC_S_FMT call that changes the vb2_queue > > type depending on whether raw or sliced VBI is requested. That's probably > > where I would do this for video vs meta as well. > > > > There is one big thing missing here: how does userspace know in this case > > whether it will get metadata or video? Who decides which CSI virtual > > stream is routed > > My first impression would be to say by formats, so that's actually defined > by the user. The media bus formats do not have such separation between image > and metadata formats either. It should be configured by userspace, as ultimately it's userspace that knows which virtual channels it wants to capture. This is somehow analogous to the input selection API, a capture driver that supports multiple inputs can't decide on its own which input(s) to capture. We might be able to make an educated guess in the kernel if we have access to information about what the source produces, but that would just be a default configuration that userspace will override when needed (and I'm not even sure that would be a good idea, I'm brainstorming here). In order to configure virtual channel filtering/selection userspace would need to know what the source produces. This knowledge could be hardcoded in applications that deal with known hardware, but in the general case we probably need a new API to expose that information to userspace. The next question is how to configure filtering and routing. We also need a new API here, and it could make sense to discuss it along with the subdev routing API I've proposed a while back. I don't know whether the two problems could be solved by the same API, but I believe it's worth a try. > VIDIOC_ENUM_FMT should be amended with media bus code as well so that the > user can figure out which format corresponds to a given media bus code. > > > to which video node? > > I think that should be considered as a seprate problem, albeit it will > require a solution as well. And it's a much biffer problem than this one. I agree. We need a solution, but that's separate from how to capture metadata on the video node side. -- Regards, Laurent Pinchart -- To unsubscribe from this list: send the line "unsubscribe linux-media" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html