Thanks David for your review! On 1/13/2025 6:54 PM, David Hildenbrand wrote: > On 08.01.25 11:56, Chenyi Qiang wrote: >> >> >> On 1/8/2025 12:48 PM, Alexey Kardashevskiy wrote: >>> On 13/12/24 18:08, Chenyi Qiang wrote: >>>> As the commit 852f0048f3 ("RAMBlock: make guest_memfd require >>>> uncoordinated discard") highlighted, some subsystems like VFIO might >>>> disable ram block discard. However, guest_memfd relies on the discard >>>> operation to perform page conversion between private and shared memory. >>>> This can lead to stale IOMMU mapping issue when assigning a hardware >>>> device to a confidential VM via shared memory (unprotected memory >>>> pages). Blocking shared page discard can solve this problem, but it >>>> could cause guests to consume twice the memory with VFIO, which is not >>>> acceptable in some cases. An alternative solution is to convey other >>>> systems like VFIO to refresh its outdated IOMMU mappings. >>>> >>>> RamDiscardManager is an existing concept (used by virtio-mem) to adjust >>>> VFIO mappings in relation to VM page assignment. Effectively page >>>> conversion is similar to hot-removing a page in one mode and adding it >>>> back in the other, so the similar work that needs to happen in response >>>> to virtio-mem changes needs to happen for page conversion events. >>>> Introduce the RamDiscardManager to guest_memfd to achieve it. >>>> >>>> However, guest_memfd is not an object so it cannot directly implement >>>> the RamDiscardManager interface. >>>> >>>> One solution is to implement the interface in HostMemoryBackend. Any >>> >>> This sounds about right. >>> >>>> guest_memfd-backed host memory backend can register itself in the >>>> target >>>> MemoryRegion. However, this solution doesn't cover the scenario where a >>>> guest_memfd MemoryRegion doesn't belong to the HostMemoryBackend, e.g. >>>> the virtual BIOS MemoryRegion. >>> >>> What is this virtual BIOS MemoryRegion exactly? What does it look like >>> in "info mtree -f"? Do we really want this memory to be DMAable? >> >> virtual BIOS shows in a separate region: >> >> Root memory region: system >> 0000000000000000-000000007fffffff (prio 0, ram): pc.ram KVM >> ... >> 00000000ffc00000-00000000ffffffff (prio 0, ram): pc.bios KVM >> 0000000100000000-000000017fffffff (prio 0, ram): pc.ram >> @0000000080000000 KVM >> >> We also consider to implement the interface in HostMemoryBackend, but >> maybe implement with guest_memfd region is more general. We don't know >> if any DMAable memory would belong to HostMemoryBackend although at >> present it is. >> >> If it is more appropriate to implement it with HostMemoryBackend, I can >> change to this way. > > Not sure that's the right place. Isn't it the (cc) machine that controls > the state? > > It's not really the memory backend, that's just the memory provider. Yes, the cc machine defines the require_guest_memfd. And besides the normal memory, there's also some other memory region requires the state control. See in another thread, For example, private mmio may require the state change notification but not belong to memory backend. So I think it is still better to define a specific object to control the state. >