Hi Dave, Sima. Habanalabs pull request for 6.11. Adding support for Gaudi2-D product, minor debugfs uapi changes this time. Just minor features, improvements, code cleanups and bug fixes. In addition the maintainer change is included. Full details are in the signed tag. Thanks, Ofir The following changes since commit 1ddaaa244021aba8496536a6627b4ad2bc0f936a: Merge tag 'amd-drm-next-6.11-2024-06-07' of https://gitlab.freedesktop.org/agd5f/linux into drm-next (2024-06-11 14:01:55 +1000) are available in the Git repository at: https://github.com/HabanaAI/drivers.accel.habanalabs.kernel.git tags/drm-habanalabs-next-2024-06-20 for you to fetch changes up to 824a450c0d90a9922321842155f563087f8c30f7: accel/habanalabs: gradual sleep in polling memory macro (2024-06-19 09:49:11 +0300) ---------------------------------------------------------------- This tag contains habanalabs driver changes for v6.11. The notable changes are: - uAPI changes: - Use device-name directory in debugfs-driver-habanalabs. - Expose server type in debugfs. - New features and improvements: - Gradual sleep in polling memory macro. - Reduce Gaudi2 MSI-X interrupt count to 128. - Add Gaudi2-D revision support. - Firmware related changes: - Add timestamp to CPLD info. - Gaudi2: Assume hard-reset by firmware upon MC SEI severe error. - Align Gaudi2 interrupt names. - Check for errors after preboot is ready. - Bug fixes and code cleanups: - Move heartbeat work initialization to early init. - Fix a race when receiving events during reset. - Change the heartbeat scheduling point. - Maintainers: - Change habanalabs maintainer and git repo path. ---------------------------------------------------------------- Ariel Suller (1): accel/habanalabs/gaudi2: align interrupt names to table Dani Liberman (2): accel/habanalabs: add device name to error print accel/habanalabs: separate nonce from max_size in cpucp_packet struct Didi Freiman (1): accel/habanalabs: gradual sleep in polling memory macro Farah Kassabri (6): accel/habanalabs/gaudi2: update interrupts related headers accel/habanalabs: add heartbeat debug info accel/habanalabs: check for errors after preboot is ready accel/habanalabs/gaudi2: add GAUDI2D revision support accel/habanalabs: change the heartbeat scheduling point accel/habanalabs: add more info upon cpu pkt timeout Igal Zeltser (1): accel/habanalabs: use msg_header instead of desc_header Ilia Levi (1): accel/habanalabs: additional print in device-in-use info Oded Gabbay (1): MAINTAINERS: Change habanalabs maintainer and git repo path Ofir Bitton (3): accel/habanalabs/gaudi2: align embedded specs headers accel/habanalabs: remove timestamp registration debug prints accel/habanalbs/gaudi2: reduce interrupt count to 128 Ohad Sharabi (4): accel/habanalabs/gaudi2: use single function to compare FW versions accel/habanalabs: restructure function that checks heartbeat received accel/habanalabs: no CPUCP prints on heartbeat failure accel/habanalabs: add device name to invalidation failure msg Rakesh Ughreja (1): accel/habanalabs/gaudi2: unsecure edma max outstanding register Tal Cohen (1): accel/habanalabs: disable EQ interrupt after disabling pci Tal Risin (1): accel/habanalabs: expose server type in debugfs Tomer Tayar (11): accel/habanalabs: use parent device for trace events accel/habanalabs/gaudi2: revise return value handling in gaudi2_hbm_sei_handle_read_err() accel/habanalabs/gaudi2: assume hard-reset by FW upon MC SEI severe error accel/habanalabs: add an EQ size ASIC property accel/habanalabs: move hl_eq_heartbeat_event_handle() to common code accel/habanalabs: use device-name directory in debugfs-driver-habanalabs accel/habanalabs: add a common handler for clock change events accel/habanalabs: revise print on EQ heartbeat failure accel/habanalabs: dump the EQ entries headers on EQ heartbeat failure accel/habanalabs: print timestamp of last PQ heartbeat on EQ heartbeat failure accel/habanalabs: move heartbeat work initialization to early init Vitaly Margolin (1): accel/habanalabs: add cpld ts cpld_timestamp cpucp .../ABI/testing/debugfs-driver-habanalabs | 12 +- MAINTAINERS | 4 +- .../accel/habanalabs/common/command_submission.c | 13 -- drivers/accel/habanalabs/common/debugfs.c | 22 +- drivers/accel/habanalabs/common/device.c | 240 +++++++++++++++----- drivers/accel/habanalabs/common/firmware_if.c | 229 ++++++++++++------- drivers/accel/habanalabs/common/habanalabs.h | 76 +++++-- drivers/accel/habanalabs/common/habanalabs_drv.c | 5 +- drivers/accel/habanalabs/common/hwmon.c | 60 ++--- drivers/accel/habanalabs/common/irq.c | 33 ++- drivers/accel/habanalabs/common/memory_mgr.c | 37 +++- drivers/accel/habanalabs/common/mmu/mmu.c | 14 +- drivers/accel/habanalabs/common/pci/pci.c | 4 +- drivers/accel/habanalabs/common/sysfs.c | 8 +- drivers/accel/habanalabs/gaudi/gaudi.c | 12 +- drivers/accel/habanalabs/gaudi2/gaudi2.c | 88 ++------ drivers/accel/habanalabs/gaudi2/gaudi2P.h | 8 +- drivers/accel/habanalabs/gaudi2/gaudi2_security.c | 1 + drivers/accel/habanalabs/goya/goya.c | 13 +- drivers/accel/habanalabs/include/gaudi2/gaudi2.h | 4 +- .../include/gaudi2/gaudi2_async_ids_map_extended.h | 244 ++++++++++----------- .../accel/habanalabs/include/gaudi2/gaudi2_fw_if.h | 27 +-- .../habanalabs/include/gaudi2/gaudi2_reg_map.h | 8 + .../habanalabs/include/hw_ip/pci/pci_general.h | 3 +- include/linux/habanalabs/cpucp_if.h | 20 +- include/linux/habanalabs/hl_boot_if.h | 29 ++- 26 files changed, 725 insertions(+), 489 deletions(-)