This is version 0.5 of the virtio-iommu specification, the paravirtualized IOMMU. This version addresses feedback from v0.4 and adds an event virtqueue. Please find the specification, LaTeX sources and pdf, at: git://linux-arm.org/virtio-iommu.git viommu/v0.5 http://linux-arm.org/git?p=virtio-iommu.git;a=blob;f=dist/v0.5/virtio-iommu-v0.5.pdf A detailed changelog since v0.4 follows. You can find the pdf diff at: http://linux-arm.org/git?p=virtio-iommu.git;a=blob;f=dist/diffs/virtio-iommu-pdf-diff-v0.4-v0.5.pdf * Add an event virtqueue for the device to report translation faults to the driver. For the moment only unrecoverable faults are available but future versions will extend it. * Simplify PROBE request by removing the ack part, and flattening RESV properties. * Rename "address space" to "domain". The change might seem futile but allows to introduce PASIDs and other features cleanly in the next versions. In the same vein, the few remaining "device" occurrences were replaced by "endpoint", to avoid any confusion with "the device" referring to the virtio device across the document. * Add implementation notes for RESV_MEM properties. * Update ACPI table definition. * Fix typos and clarify a few things. I will publish the Linux driver for v0.5 shortly. Then for next versions I'll focus on optimizations and adding support for hardware acceleration. Existing implementations are simple and can certainly be optimized, even without architectural changes. But the architecture itself can also be improved in a number of ways. Currently it is designed to work well with VFIO. However, having explicit MAP requests is less efficient* than page tables for emulated and PV endpoints, and the current architecture doesn't address this. Binding page tables is an obvious way to improve throughput in that case, but we can explore cleverer (and possibly simpler) ways to do it. So first we'll work on getting the base device and driver merged, then we'll analyze and compare several ideas for improving performance. Thanks, Jean * I have yet to study this behaviour, and would be interested in any prior art on the subject of analyzing devices DMA patterns (virtio and others)