Hi Dave, Sima, Here is the main pull request for drm-intel-gt-next targeting 6.11. First is the new userspace API for allowing upload of custom context state used for replaying GPU hang error state captures. This will be used by Mesa (see https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27594) for debugging GPU hangs captured in the wild on real hardware. So far that was only possible under simulation and that via some hacks. Also, simulation in general has certain limitations to what hangs it can reproduce. As the UAPI it is intended for Mesa developers only, it is hidden behind a kconfig and runtime enablement switches. Then there are fixes for hangs on Meteorlake due incorrect reduced CCS configuration and a missing video engine workaround. Then fixes for a couple race conditions in multi GT and breadcrumb handling, and a more robust functional level reset by extending the timeout used. A couple tiny cleanups here and there and finally one back-merge which was required to land some display code base refactoring. Regards, Tvrtko drm-intel-gt-next-2024-06-12: UAPI Changes: - Support replaying GPU hangs with captured context image (Tvrtko Ursulin) Driver Changes: Fixes/improvements/new stuff: - Automate CCS Mode setting during engine resets [gt] (Andi Shyti) - Revert "drm/i915: Remove extra multi-gt pm-references" (Janusz Krzysztofik) - Fix HAS_REGION() usage in intel_gt_probe_lmem() (Ville Syrjälä) - Disarm breadcrumbs if engines are already idle [gt] (Chris Wilson) - Shadow default engine context image in the context (Tvrtko Ursulin) - Support replaying GPU hangs with captured context image (Tvrtko Ursulin) - avoid FIELD_PREP warning [guc] (Arnd Bergmann) - Fix CCS id's calculation for CCS mode setting [gt] (Andi Shyti) - Increase FLR timeout from 3s to 9s (Andi Shyti) - Update workaround 14018575942 [mtl] (Angus Chen) Future platform enablement: - Enable w/a 16021333562 for DG2, MTL and ARL [guc] (John Harrison) Miscellaneous: - Pass the region ID rather than a bitmask to HAS_REGION() (Ville Syrjälä) - Remove counter productive REGION_* wrappers (Ville Syrjälä) - Fix typo [gem/i915_gem_ttm_move] (Deming Wang) - Delete the live_hearbeat_fast selftest [gt] (Krzysztof Niemiec) The following changes since commit 431c590c3ab0469dfedad3a832fe73556396ee52: drm/tests: Add a unit test for range bias allocation (2024-05-16 12:50:14 +1000) are available in the Git repository at: https://gitlab.freedesktop.org/drm/i915/kernel.git tags/drm-intel-gt-next-2024-06-12 for you to fetch changes up to 79655e867ad6dfde2734c67c7704c0dd5bf1e777: drm/i915/mtl: Update workaround 14018575942 (2024-06-11 16:06:20 +0200) ---------------------------------------------------------------- UAPI Changes: - Support replaying GPU hangs with captured context image (Tvrtko Ursulin) Driver Changes: Fixes/improvements/new stuff: - Automate CCS Mode setting during engine resets [gt] (Andi Shyti) - Revert "drm/i915: Remove extra multi-gt pm-references" (Janusz Krzysztofik) - Fix HAS_REGION() usage in intel_gt_probe_lmem() (Ville Syrjälä) - Disarm breadcrumbs if engines are already idle [gt] (Chris Wilson) - Shadow default engine context image in the context (Tvrtko Ursulin) - Support replaying GPU hangs with captured context image (Tvrtko Ursulin) - avoid FIELD_PREP warning [guc] (Arnd Bergmann) - Fix CCS id's calculation for CCS mode setting [gt] (Andi Shyti) - Increase FLR timeout from 3s to 9s (Andi Shyti) - Update workaround 14018575942 [mtl] (Angus Chen) Future platform enablement: - Enable w/a 16021333562 for DG2, MTL and ARL [guc] (John Harrison) Miscellaneous: - Pass the region ID rather than a bitmask to HAS_REGION() (Ville Syrjälä) - Remove counter productive REGION_* wrappers (Ville Syrjälä) - Fix typo [gem/i915_gem_ttm_move] (Deming Wang) - Delete the live_hearbeat_fast selftest [gt] (Krzysztof Niemiec) ---------------------------------------------------------------- Andi Shyti (3): drm/i915/gt: Automate CCS Mode setting during engine resets drm/i915/gt: Fix CCS id's calculation for CCS mode setting drm/i915: Increase FLR timeout from 3s to 9s Angus Chen (1): drm/i915/mtl: Update workaround 14018575942 Arnd Bergmann (1): drm/i915/guc: avoid FIELD_PREP warning Chris Wilson (1): drm/i915/gt: Disarm breadcrumbs if engines are already idle Deming Wang (1): drm/i915/gem/i915_gem_ttm_move: Fix typo Janusz Krzysztofik (1): Revert "drm/i915: Remove extra multi-gt pm-references" John Harrison (1): drm/i915/guc: Enable w/a 16021333562 for DG2, MTL and ARL Niemiec, Krzysztof (1): drm/i915/gt: Delete the live_hearbeat_fast selftest Tvrtko Ursulin (3): Merge drm/drm-next into drm-intel-gt-next drm/i915: Shadow default engine context image in the context drm/i915: Support replaying GPU hangs with captured context image Ville Syrjälä (3): drm/i915: Fix HAS_REGION() usage in intel_gt_probe_lmem() drm/i915: Pass the region ID rather than a bitmask to HAS_REGION() drm/i915: Remove counter productive REGION_* wrappers drivers/gpu/drm/i915/Kconfig.debug | 17 ++++ drivers/gpu/drm/i915/gem/i915_gem_context.c | 113 +++++++++++++++++++++ drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c | 18 ++++ drivers/gpu/drm/i915/gem/i915_gem_ttm_move.c | 2 +- drivers/gpu/drm/i915/gt/intel_breadcrumbs.c | 15 ++- drivers/gpu/drm/i915/gt/intel_context.c | 2 + drivers/gpu/drm/i915/gt/intel_context.h | 22 ++++ drivers/gpu/drm/i915/gt/intel_context_types.h | 3 + drivers/gpu/drm/i915/gt/intel_engine_cs.c | 6 ++ drivers/gpu/drm/i915/gt/intel_gt_ccs_mode.c | 8 +- drivers/gpu/drm/i915/gt/intel_gt_ccs_mode.h | 2 +- drivers/gpu/drm/i915/gt/intel_gt_types.h | 8 ++ drivers/gpu/drm/i915/gt/intel_lrc.c | 8 +- drivers/gpu/drm/i915/gt/intel_ring_submission.c | 8 +- drivers/gpu/drm/i915/gt/intel_workarounds.c | 12 ++- .../gpu/drm/i915/gt/selftest_engine_heartbeat.c | 110 -------------------- drivers/gpu/drm/i915/gt/uc/abi/guc_klvs_abi.h | 7 +- drivers/gpu/drm/i915/gt/uc/intel_guc_ads.c | 32 +++--- drivers/gpu/drm/i915/i915_drv.h | 4 +- drivers/gpu/drm/i915/i915_params.c | 5 + drivers/gpu/drm/i915/i915_params.h | 3 +- drivers/gpu/drm/i915/i915_pci.c | 6 +- drivers/gpu/drm/i915/intel_memory_region.c | 2 +- drivers/gpu/drm/i915/intel_memory_region.h | 5 - drivers/gpu/drm/i915/intel_uncore.c | 9 +- drivers/gpu/drm/i915/selftests/mock_gem_device.c | 2 +- include/uapi/drm/i915_drm.h | 27 +++++ 27 files changed, 296 insertions(+), 160 deletions(-)