-----Original Message----- From: Chris Wilson [mailto:chris@xxxxxxxxxxxxxxxxxx] Sent: Tuesday, September 19, 2017 5:19 PM To: intel-gfx@xxxxxxxxxxxxxxxxxxxxx Cc: Tahvanainen, Jari <jari.tahvanainen@xxxxxxxxx>; Mika Kuoppala <mika.kuoppala@xxxxxxxxxxxxxxx> Subject: Re: [PATCH] drm/i915/selftests: Try to recover from a wedged GPU during reset tests Quoting Chris Wilson (2017-09-15 14:09:29) > If we see the seqno stop progressing, we abandon the test for fear > that the GPU died following the reset. However, during test teardown > we still wait for the GPU to idle before continuing, but we have > already confirmed that the GPU is dead. Furthermore, since we are > inside a reset test, we have disabled the hangchecker, and so there is > no safety net and we wait indefinitely. Detect the stuck GPU and > declare it wedged as a state of emergency so we can escape. > > Signed-off-by: Chris Wilson <chris@xxxxxxxxxxxxxxxxxx> > Cc: Jari Tahvanainen <jari.tahvanainen@xxxxxxxxx> > Cc: Mika Kuoppala <mika.kuoppala@xxxxxxxxxxxxxxx> >Ping? Sorry Chris for late answer. Tried to get touch with you earlier through IRC. I merged the series on top of the drm-tip and executed it in HSW - no hang anymore - FAIL. (drv_selftest:6304) igt-kmod-CRITICAL: Test assertion failure function igt_kselftest_execute, file igt_kmod.c:513: (drv_selftest:6304) igt-kmod-CRITICAL: Failed assertion: err == 0 (drv_selftest:6304) igt-kmod-CRITICAL: kselftest "i915 igt__19__live_hangcheck=1 live_selftests=-1" failed: Input/output error [5] (drv_selftest:6304) igt-core-INFO: Stack trace: (drv_selftest:6304) igt-core-INFO: #0 [__igt_fail_assert+0x101] (drv_selftest:6304) igt-core-INFO: #1 [igt_kselftest_execute+0x296] (drv_selftest:6304) igt-core-INFO: #2 [igt_kselftests+0x295] (drv_selftest:6304) igt-core-INFO: #3 [main+0x5f] (drv_selftest:6304) igt-core-INFO: #4 [__libc_start_main+0xf1] (drv_selftest:6304) igt-core-INFO: #5 [_start+0x2a] (drv_selftest:6304) igt-core-INFO: #6 [<unknown>+0x2a] **** END **** Stack trace: #0 [__igt_fail_assert+0x101] #1 [igt_kselftest_execute+0x296] #2 [igt_kselftests+0x295] #3 [main+0x5f] #4 [__libc_start_main+0xf1] #5 [_start+0x2a] #6 [<unknown>+0x2a] Subtest live_hangcheck: FAIL (1.911s) _______________________________________________ Intel-gfx mailing list Intel-gfx@xxxxxxxxxxxxxxxxxxxxx https://lists.freedesktop.org/mailman/listinfo/intel-gfx