Re: [PATCH] drm/i915/guc: Don't error on reset of banned context

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 1/6/2022 16:31, John.C.Harrison@xxxxxxxxx wrote:
From: John Harrison <John.C.Harrison@xxxxxxxxx>

There is a race (already documented in the code) whereby a context can
be (re-)queued for submission at the same time as it is being banned
due to a hang and reset. That leads to a hang/reset report from GuC
for a context which i915 thinks is already banned.

While the race is indented to be fixed in a future GuC update, there
is no actual harm beyond the wasted execution time of that new hang
detection period. The context has already been banned for bad
behaviour so a fresh hang is hardly surprising and certainly isn't
going to be losing any work that wouldn't already have been lost if
there was no race.

So don't treat this situation as an error. The error message is seen
by the CI system as something fatal and causes test failures. Instead,
just print an informational so the user at least knows a context reset
occurred (given that the error capture is being skipped).

Signed-off-by: John Harrison <John.C.Harrison@xxxxxxxxx>
PS: Meant to add:

Fixes: 249af7242af72 ("drm/i915/guc: Report error on invalid reset notification")




[Index of Archives]     [AMD Graphics]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux