[AMD Official Use Only - AMD Internal Distribution Only] > One thing I could see going wrong is, that down_read_trylock(&dqm->dev- > >adev->reset_domain->sem) will not fail immediately when the reset is > scheduled. So there may be multipe attempts at HW access that detect an > error or time out, which may get the HW into a worse state or delay the actual > reset. I suppose we can always check amdgpu_in_reset first before we do down_read_trylock, this would prevent new readers from coming in while the reset thread is waiting on current readers to finish. With a the rwsem alone I suppose there's a chance that the writer would be starved? Teddy