On Thu, Oct 19, 2023 at 04:53:43PM +0100, Matthew Wilcox wrote: > > RDMA core API could support struct biovec array arguments. The > > series compiles on x86, but I haven't tested it further. I'm posting > > early in hopes of starting further discussion. > > Good call, because I think patch 2/9 is a complete non-starter. > > The fundamental problem with scatterlist is that it is both input > and output for the mapping operation. You're replicating this mistake > in a different data structure. Agreed. > > My vision for the future is that we have phyr as our input structure. > That looks something like: > > struct phyr { > phys_addr_t start; > size_t len; > }; So my plan was always to turn the bio_vec into that structure, since before you came u wit hthe phyr name. But that's really a separate discussion as we might as well support multiple input formats if we really have to. > Our output structure can continue being called the scatterlist, but > it needs to go on a diet and look more like: > > struct scatterlist { > dma_addr_t dma_address; > size_t dma_length; > }; I called it a dma_vec in my years old proposal I can't find any more. > Getting to this point is going to be a huge amount of work, and I need > to finish folios first. Or somebody else can work on it ;-) Well, we can stage this. I wish I could find my old proposal about the dma_batch API (I remember Robin commented on it, my he is better at finding it than me). I think that mostly still stands, independent of the transformation of the input structure. The basic idea is that we add a dma batching API, where you start a batch with one call, and then add new physically discontiguous vectors to add it until it is full and finalized it. Very similar to how the iommu API works internally. We'd then only use this API if we actually have an iommu (or if we want to be fancy swiotlb that could do the same linearization), for the direct map we'd still do the equivalent of dma_map_page for each element as we need one output vector per input vector anyway. As Jason pointed out the only fancy implementation we need for now is the IOMMU API. arm32 and powerpc will need to do the work to convert to it or do their own work.