Re: ASMedia 2464PD-based SSD enclosure hotplug failure

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Oct 31, 2023 at 12:20:08PM +0800, initramfs wrote:
> Hi all,
> 
> I own a ZikeDrive USB4 NVMe SSD enclosure and noticed that upon
> hotplugging the enclosure I receive the following kernel oops (6.6.0):
> 
>     [  138.597027] thunderbolt 0-0:1.1: new retimer found, vendor=0x8087
>     device=0x15ee
>     [  138.626031] thunderbolt 0-1: new device found, vendor=0x187
>     device=0xd666
>     [  138.626046] thunderbolt 0-1: Gopod Group Limited. USB4 SSD Drive
>     Enclosure
>     [  138.736546] thunderbolt 0000:00:0d.2: 1:1: failed to enable lane
>     bonding
>     [  138.736888] thunderbolt 0000:00:0d.2: 1: failed to enable CL states
>     [  138.737874] thunderbolt 0000:00:0d.2: 1: failed to enable TMU
>     [  138.739417] thunderbolt 0000:00:0d.2: 1:1: hop deactivation
>     failed for hop 1, index 8
>     [  138.739421] ------------[ cut here ]------------
>     [  138.739422] thunderbolt 0000:00:0d.2: path activation failed
>     [  138.739462] WARNING: CPU: 6 PID: 3845 at
>     drivers/thunderbolt/path.c:573 tb_path_activate+0x10f/0x480
>     [thunderbolt]
>     [  138.739495] Modules linked in: ccm rfcomm snd_seq_dummy
>     snd_hrtimer snd_seq snd_seq_device cmac algif_hash algif_skcipher
>     af_alg nft_ct nf_conntrack nf_defrag_ipv4 nf_defrag_ipv6 bnep
>     nf_tables nfnetlink ntfs3 snd_sof_pci_intel_tgl
>     snd_sof_intel_hda_common soundwire_intel snd_sof_intel_hda_mlink
>     soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp
>     snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core
>     snd_soc_acpi_intel_match snd_soc_acpi soundwire_generic_allocation
>     soundwire_bus iwlmvm intel_uncore_frequency snd_soc_core
>     intel_uncore_frequency_common intel_tcc_cooling snd_compress
>     snd_hda_codec_realtek ac97_bus x86_pkg_temp_thermal intel_powerclamp
>     snd_hda_codec_generic snd_hda_codec_hdmi snd_pcm_dmaengine coretemp
>     mac80211 snd_hda_intel hid_multitouch kvm_intel snd_intel_dspcfg
>     snd_intel_sdw_acpi libarc4 mei_pxp kvm btusb uvcvideo mei_hdcp
>     snd_hda_codec btrtl uvc processor_thermal_device_pci videobuf2_v4l2
>     btbcm irqbypass snd_hda_core iTCO_wdt processor_thermal_device
>     dm_zero btintel videodev btmtk
>     [  138.739535]  iTCO_vendor_support processor_thermal_rapl snd_hwdep
>     r8169 rapl ucsi_acpi intel_pmc_bxt intel_rapl_msr iwlwifi bluetooth
>     realtek videobuf2_vmalloc pmt_telemetry typec_ucsi snd_pcm spi_nor
>     intel_cstate vfat mei_me intel_rapl_common intel_lpss_pci
>     mdio_devres videobuf2_memops pmt_class snd_timer videobuf2_common
>     i2c_i801 intel_lpss intel_uncore wmi_bmof asus_nb_wmi pcspkr typec
>     ecdh_generic cfg80211 libphy mtd fat processor_thermal_rfim mc mei
>     idma64 i2c_smbus snd thunderbolt crc16 roles i2c_hid_acpi soundcore
>     intel_vsec processor_thermal_mbox igen6_edac i2c_hid int3403_thermal
>     int340x_thermal_zone int3400_thermal acpi_thermal_rel acpi_tad
>     mousedev acpi_pad joydev mac_hid pkcs8_key_parser dm_multipath
>     i2c_dev crypto_user fuse loop ip_tables x_tables btrfs
>     blake2b_generic libcrc32c crc32c_generic xor raid6_pq dm_crypt cbc
>     encrypted_keys trusted tee asn1_encoder hid_asus asus_wmi
>     ledtrig_audio sparse_keymap rfkill platform_profile usbhid i915
>     crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni
>     [  138.739592]  polyval_generic gf128mul serio_raw
>     ghash_clmulni_intel drm_buddy dm_mod sha512_ssse3 intel_gtt atkbd
>     ttm sdhci_pci vivaldi_fmap aesni_intel libps2 i2c_algo_bit cqhci
>     nvme sdhci crypto_simd drm_display_helper nvme_core xhci_pci cryptd
>     spi_intel_pci mmc_core xhci_pci_renesas nvme_common spi_intel cec
>     video i8042 serio wmi
>     [  138.739610] CPU: 6 PID: 3845 Comm: pool-/usr/lib/b Tainted: G
>     S                 6.6.0-1-mainline #1
>     a243c5e3c83e72fc2b2243b5ea465e514d6c24c1
>     [  138.739614] Hardware name: ASUSTeK COMPUTER INC. ROG Zephyrus M16
>     GU603ZW_GU603ZW/GU603ZW, BIOS GU603ZW.311 12/22/2022
>     [  138.739615] RIP: 0010:tb_path_activate+0x10f/0x480 [thunderbolt]
>     [  138.739637] Code: 01 00 00 48 85 ed 0f 84 d7 00 00 00 48 81 c7 c0
>     00 00 00 e8 93 24 7b f8 48 89 ea 48 c7 c7 20 95 db c0 48 89 c6 e8 f1
>     78 f4 f7 <0f> 0b 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 0f 85 54
>     03 00 00
>     [  138.739639] RSP: 0018:ffffc90005937d30 EFLAGS: 00010282
>     [  138.739641] RAX: 0000000000000000 RBX: 00000000ffffff95 RCX:
>     0000000000000027
>     [  138.739642] RDX: ffff88886b3a1708 RSI: 0000000000000001 RDI:
>     ffff88886b3a1700
>     [  138.739644] RBP: ffff88810310edc0 R08: 0000000000000000 R09:
>     ffffc90005937bb8
>     [  138.739644] R10: 0000000000000003 R11: ffff88888b7aac28 R12:
>     0000000000000028
>     [  138.739645] R13: ffff88813ebc9100 R14: ffff888103fe5800 R15:
>     ffff8881701d8fe0
>     [  138.739647] FS:  00007fd6231fe6c0(0000) GS:ffff88886b380000(0000)
>     knlGS:0000000000000000
>     [  138.739648] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>     [  138.739650] CR2: 000055d0bbfe6610 CR3: 0000000141210000 CR4:
>     0000000000f50ee0
>     [  138.739651] PKRU: 55555554
>     [  138.739652] Call Trace:
>     [  138.739654]  <TASK>
>     [  138.739655]  ? tb_path_activate+0x10f/0x480 [thunderbolt
>     c85481cfb2695e02347db36bb54248816593d261]
>     [  138.739676]  ? __warn+0x81/0x130
>     [  138.739681]  ? tb_path_activate+0x10f/0x480 [thunderbolt
>     c85481cfb2695e02347db36bb54248816593d261]
>     [  138.739702]  ? report_bug+0x171/0x1a0
>     [  138.739706]  ? console_unlock+0x78/0x120
>     [  138.739710]  ? handle_bug+0x3c/0x80
>     [  138.739718]  ? exc_invalid_op+0x17/0x70
>     [  138.739721]  ? asm_exc_invalid_op+0x1a/0x20
>     [  138.739725]  ? tb_path_activate+0x10f/0x480 [thunderbolt
>     c85481cfb2695e02347db36bb54248816593d261]
>     [  138.739746]  tb_tunnel_restart+0xb3/0x220 [thunderbolt
>     c85481cfb2695e02347db36bb54248816593d261]
>     [  138.739767]  tb_tunnel_pci+0x1b2/0x480 [thunderbolt
>     c85481cfb2695e02347db36bb54248816593d261]
>     [  138.739787]  authorized_store+0x289/0x2d0 [thunderbolt
>     c85481cfb2695e02347db36bb54248816593d261]
>     [  138.739806]  kernfs_fop_write_iter+0x133/0x1d0
>     [  138.739810]  vfs_write+0x23b/0x420
>     [  138.739815]  ksys_write+0x6f/0xf0
>     [  138.739818]  do_syscall_64+0x5d/0x90
>     [  138.739820]  ? syscall_exit_to_user_mode+0x2b/0x40
>     [  138.739824]  ? do_syscall_64+0x6c/0x90
>     [  138.739826]  ? do_syscall_64+0x6c/0x90
>     [  138.739828]  entry_SYSCALL_64_after_hwframe+0x6e/0xd8
>     [  138.739830] RIP: 0033:0x7fd628ec406f
>     [  138.739876] Code: 89 54 24 18 48 89 74 24 10 89 7c 24 08 e8 19 4d
>     f8 ff 48 8b 54 24 18 48 8b 74 24 10 41 89 c0 8b 7c 24 08 b8 01 00 00
>     00 0f 05 <48> 3d 00 f0 ff ff 77 31 44 89 c7 48 89 44 24 08 e8 6c 4d
>     f8 ff 48
>     [  138.739877] RSP: 002b:00007fd6231fdc90 EFLAGS: 00000293 ORIG_RAX:
>     0000000000000001
>     [  138.739880] RAX: ffffffffffffffda RBX: 00007fd614008c40 RCX:
>     00007fd628ec406f
>     [  138.739881] RDX: 0000000000000001 RSI: 00007fd6231fdd30 RDI:
>     000000000000000a
>     [  138.739882] RBP: 00007fd6231fdd70 R08: 0000000000000000 R09:
>     00007fd6231fdba0
>     [  138.739883] R10: 0000000000000000 R11: 0000000000000293 R12:
>     00007fd614002e20
>     [  138.739884] R13: 000000000000000a R14: 0000000000000000 R15:
>     00007fd6231fdd28
>     [  138.739886]  </TASK>
>     [  138.739887] ---[ end trace 0000000000000000 ]---
>     [  138.739888] thunderbolt 0000:00:0d.2: 0:8 <-> 1:3 (PCI):
>     activation failed
>     [  138.740586] thunderbolt 0000:00:0d.2: 1:3: PCIe tunnel activation
>     failed, aborting
>     [  138.740697] thunderbolt 0-0:1.1: retimer disconnected
>     [  138.740791] thunderbolt 0-1: device disconnected
> 
> the device is then rediscovered and the kernel oops reoccurs ad
> infinitum (device remaining non-functional the whole time).
> 
> If the device was plugged in during system boot, no such error occurs
> and the device is fully functional.
> If the device is then unplugged/replugged, the above kernel oops appears
> and the device remains unusable.
> 
> I have also tried enrolling the device prior to connect as well as
> enabling auto-authorization via udev rules as described in the "USB4 and
> Thunderbolt" admin guide, both to no avail.
> 
> The system under test is an Asus ROG Zephyrus M16 GU603ZW (Intel
> i9-12900H [Alder Lake]) loaded with Arch Linux (tested under
> 6.5.2-arch1-1, 6.5.3-arch1-1, and 6.6.0-1-mainline [vanilla]).
> 
> The external enclosure is fully functional with the same kernel on a
> system with a TB3-only port (Intel i7-8750H [Coffee Lake] + Intel® HM370
> chipset), including hotplugging support.
> The external enclosure is also fully functional on the test system
> described above booted into Windows 11 x64.
> 
> The bug was first reported to the Arch Linux forums with no response
> (https://bbs.archlinux.org/viewtopic.php?id=288731), more details can be
> found there too.
> 

Do you have this issue on kernel v6.1?

-- 
An old man doll... just what I always wanted! - Clara

Attachment: signature.asc
Description: PGP signature


[Index of Archives]     [Linux Media]     [Linux Input]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Old Linux USB Devel Archive]

  Powered by Linux