On 7/25/23 19:06, Michel Dänzer wrote: > On 7/24/23 19:52, Philip Yang wrote: >> start_cpsch map queues when kfd_init_node have race condition with >> IOMMUv2 init, and cause the gfx ring test failed later. Remove it >> from start_cpsch because map queues will be done when creating queues >> and resume queues. >> >> Reported-by: Michel Dänzer <michel@xxxxxxxxxxx> >> Signed-off-by: Philip Yang <Philip.Yang@xxxxxxx> > > This patch doesn't help for any of the symptoms I've described I'm afraid. Actually, I failed to check one thing before: The patch fixed both IOMMU page faults. The IB tests on the compute rings still failed though. Interestingly, with iommu=pt there was still one IOMMU page fault, even with this patch: amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0x14105a380 flags=0x0070] -- Earthling Michel Dänzer | https://redhat.com Libre software enthusiast | Mesa and Xwayland developer