On Thu, Jun 06, 2019 at 06:28:47AM +0000, Yoshihiro Shimoda wrote: > > The problem is that we need a way to communicate to the block layer > > that more than a single segment is ok IFF the DMA API instance supports > > merging. And of course the answer will depend on futher parameters > > like the maximum merged segment size and alignment for the segement. > > I'm afraid but I don't understand why we need a way to communicate to > the block layer that more than a single segment is ok IFF the DMA API > instance supports merging. Assume a device (which I think is your case) that only supports a single segment in hardware. In that case we set max_segments to 1 if no IOMMU is present. But if we have a merge capable IOMMU we can set max_segments to unlimited (or some software limit for scatterlist allocation), as long as we set a virt_boundary matching what the IOMMU expects, and max_sectors_kb isn't larger than the max IOMMU mapping size. Now we could probably just open code this in the driver, but I'd feel much happier having a block layer like this: bool blk_can_use_iommu_merging(struct request_queue *q, struct device *dev) { if (!IOMMU_CAN_MERGE_SEGMENTS(dev)) return false; blk_queue_virt_boundary(q, IOMMU_PAGE_SIZE(dev)); blk_queue_max_segment_size(q, IOMMU_MAX_SEGMENT_SIZE(dev)); return true; } and the driver then does: if (blk_can_use_iommu_merging(q, dev)) { blk_queue_max_segments(q, MAX_SW_SEGMENTS); // initialize sg mempool, etc.. } Where the SCREAMING pseudo code calls are something we need to find a good API for. And thinking about it the backend doesn't need to be an iommu, swiotlb could handle this as well, which might be interesting for devices that need to boune buffer anyway. IIRC mmc actually has some code to copy multiple segments into a bounce buffer somewhere. > The block layer already has a limit "max_segment_size" for each device so that > regardless it can/cannot merge the segments, we can use the limit. > Is my understanding incorrect? Yes.