On 2021-06-10 11:17 a.m., Christian König wrote: > Since we can't find a consensus on hot to move forward with the dma_resv object I concentrated on changing the approach for amdgpu first. > > This new approach changes how the driver stores the command submission fence in the dma_resv object in DMA-buf exported BOs. > > For exported BOs we now store the CS fence in a dma_fence_chain container and assign that one to the exclusive fences slot. > > During synchronization this dma_fence_chain container is unpacked again and the containing fences handled individually. > > This has a little bit more overhead than the old approach, but it allows for waiting for the exclusive slot for writes again. Nice! This seems to work as expected with https://gitlab.gnome.org/GNOME/mutter/-/merge_requests/1880: Some buffers now don't poll readable at first, until the GPU is done processing them. Unfortunately, as expected, without a high priority context for the compositor which can preempt client drawing, this isn't enough to prevent slow clients from slowing down the compositor as well. But it should already help for fullscreen apps where the compositor can directly scan out the client buffers at least. -- Earthling Michel Dänzer | https://redhat.com Libre software enthusiast | Mesa and X developer