On Thu, 2019-02-14 at 06:40 +0500, Ivan Mironov wrote: > Unfortunately, everything broke again after yet another suspend/resume. > Currently I'm suspecting that my patch maybe only helps to survive the > short suspend, but not the long one. > > After this bad suspend/resume, card reader disappeared again. Debug > logging was not enabled this time, so not too many information in the > dmesg: > > [44013.429613] usb 2-4: Disable of device-initiated U1 failed. > [44018.549809] usb 2-4: Disable of device-initiated U2 failed. > [44024.182043] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44029.814239] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44030.022207] usb 2-4: device not accepting address 2, error -62 > [44035.446526] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44041.078732] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44041.286640] usb 2-4: device not accepting address 2, error -62 > [44046.710928] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44052.343184] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44052.551120] usb 2-4: device not accepting address 2, error -62 > [44057.975369] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44063.607605] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44063.815538] usb 2-4: device not accepting address 2, error -62 > [44063.882505] PM: resume devices took 55.895 seconds > [44063.882508] ------------[ cut here ]------------ > [44063.882511] Component: resume devices, time: 55895 > [44063.882530] WARNING: CPU: 1 PID: 10887 at kernel/power/suspend_test.c:55 suspend_test_finish+0x6b/0x70 > [44063.882531] Modules linked in: vfat fat rfcomm fuse xt_CHECKSUM ipt_MASQUERADE tun bridge stp llc devlink nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ebtable_nat ip6table_nat nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat_ipv4 nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables cmac bnep sunrpc nls_utf8 hfsplus joydev iTCO_wdt iTCO_vendor_support intel_rapl x86_pkg_temp_thermal btusb intel_powerclamp applesmc btrtl input_polldev brcmfmac btbcm coretemp btintel bluetooth kvm_intel brcmutil snd_hda_codec_cirrus snd_hda_codec_generic snd_hda_codec_hdmi intel_cstate snd_hda_intel intel_uncore cfg80211 snd_hda_codec ecdh_generic intel_rapl_perf snd_hda_core snd_hwdep bcm5974 snd_seq snd_seq_device snd_pcm mmc_core rfkill thunderbolt snd_timer snd mei_me mei soundcore i2c_i801 intel_pch_thermal > [44063.882614] lpc_ich sbs acpi_als kfifo_buf sbshc industrialio apple_gmux pcc_cpufreq apple_bl binfmt_misc xfs libcrc32c dm_crypt i915 kvmgt mdev vfio kvm irqbypass i2c_algo_bit drm_kms_helper crct10dif_pclmul crc32_pclmul crc32c_intel drm uas ghash_clmulni_intel usb_storage video hid_apple > [44063.882648] CPU: 1 PID: 10887 Comm: systemd-sleep Not tainted 4.20.7-200.ivan3.fc29.x86_64 #1 > [44063.882651] Hardware name: Apple Inc. MacBookPro11,4/Mac-06F11FD93F0323C5, BIOS MBP114.88Z.0184.B00.1806051659 06/05/2018 > [44063.882656] RIP: 0010:suspend_test_finish+0x6b/0x70 > [44063.882660] Code: 06 69 c2 e8 03 00 00 29 c1 e8 df a3 00 00 81 fd 10 27 00 00 77 03 5b 5d c3 89 ea 48 89 de 48 c7 c7 e9 bb 0c b3 e8 1f 56 fa ff <0f> 0b eb e8 90 0f 1f 44 00 00 0f b6 05 49 e5 88 01 c3 0f 1f 00 0f > [44063.882663] RSP: 0000:ffffbad682b0fd30 EFLAGS: 00010286 > [44063.882666] RAX: 0000000000000000 RBX: ffffffffb30cb9c2 RCX: 0000000000000006 > [44063.882669] RDX: 0000000000000007 RSI: 0000000000000082 RDI: ffff8f70af0568c0 > [44063.882671] RBP: 000000000000da57 R08: 0000000000000002 R09: 00000000000207c0 > [44063.882674] R10: 0000002394f2f376 R11: 000000000001cd94 R12: 0000000000000000 > [44063.882676] R13: ffffffffb3254210 R14: 0000000000000000 R15: ffffbad682b0fd60 > [44063.882681] FS: 00007fd487426940(0000) GS:ffff8f70af040000(0000) knlGS:0000000000000000 > [44063.882683] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [44063.882686] CR2: 0000000000000000 CR3: 00000003fcc92002 CR4: 00000000001606e0 > [44063.882688] Call Trace: > [44063.882699] suspend_devices_and_enter+0x248/0x7f0 > [44063.882706] pm_suspend.cold.5+0x33c/0x392 > [44063.882711] state_store+0x80/0xe0 > [44063.882718] kernfs_fop_write+0x116/0x190 > [44063.882728] __vfs_write+0x36/0x1a0 > [44063.882736] ? selinux_file_permission+0xf0/0x130 > [44063.882745] ? security_file_permission+0x2c/0xb0 > [44063.882751] vfs_write+0xa5/0x1a0 > [44063.882758] ksys_write+0x4f/0xb0 > [44063.882767] do_syscall_64+0x5b/0x160 > [44063.882776] entry_SYSCALL_64_after_hwframe+0x44/0xa9 > [44063.882781] RIP: 0033:0x7fd48816dff8 > [44063.882785] Code: 89 02 48 c7 c0 ff ff ff ff eb b3 0f 1f 80 00 00 00 00 f3 0f 1e fa 48 8d 05 25 77 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 58 c3 0f 1f 80 00 00 00 00 41 54 49 89 d4 55 > [44063.882788] RSP: 002b:00007fff49106d48 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 > [44063.882792] RAX: ffffffffffffffda RBX: 0000000000000004 RCX: 00007fd48816dff8 > [44063.882794] RDX: 0000000000000004 RSI: 000055a1d121afe0 RDI: 0000000000000004 > [44063.882796] RBP: 000055a1d121afe0 R08: 000055a1d1219390 R09: 00007fd487426940 > [44063.882799] R10: 000000000000000a R11: 0000000000000246 R12: 000055a1d12192b0 > [44063.882801] R13: 0000000000000004 R14: 00007fd48823c740 R15: 0000000000000004 > [44063.882805] ---[ end trace d28e1c30545cf612 ]--- > [44063.883214] OOM killer enabled. > [44063.883216] Restarting tasks ... > [44063.883683] usb 2-4: USB disconnect, device number 2 > [44063.895454] done. > > Then I tried to suspend/resume again, just to check what will happen. Suspend succeed, > but card reader did not reappear. Then I tried to unbind xhci_hcd driver and bind it > again (to "reset" the host controller) and the whole USB support died completely: > > [44171.570065] xhci_hcd 0000:00:14.0: USB bus 2 deregistered > [44171.570292] xhci_hcd 0000:00:14.0: remove, state 1 > [44171.570305] usb usb1: USB disconnect, device number 1 > [44171.570309] usb 1-8: USB disconnect, device number 2 > [44171.624668] usb 1-12: USB disconnect, device number 3 > [44171.665118] bcm5974 1-12:1.2: could not read from device > [44171.674339] xhci_hcd 0000:00:14.0: USB bus 1 deregistered > [44174.677453] xhci_hcd 0000:00:14.0: xHCI Host Controller > [44174.678738] xhci_hcd 0000:00:14.0: new USB bus registered, assigned bus number 1 > [44174.679880] xhci_hcd 0000:00:14.0: hcc params 0x200077c1 hci version 0x100 quirks 0x0000000000009810 > [44174.679892] xhci_hcd 0000:00:14.0: cache line size of 256 is not supported > [44174.680861] usb usb1: New USB device found, idVendor=1d6b, idProduct=0002, bcdDevice= 4.20 > [44174.680867] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1 > [44174.680871] usb usb1: Product: xHCI Host Controller > [44174.680875] usb usb1: Manufacturer: Linux 4.20.7-200.ivan3.fc29.x86_64 xhci-hcd > [44174.680879] usb usb1: SerialNumber: 0000:00:14.0 > [44174.681804] hub 1-0:1.0: USB hub found > [44174.681884] hub 1-0:1.0: 14 ports detected > [44174.683892] xhci_hcd 0000:00:14.0: xHCI Host Controller > [44174.684421] xhci_hcd 0000:00:14.0: new USB bus registered, assigned bus number 2 > [44174.684434] xhci_hcd 0000:00:14.0: Host supports USB 3.0 SuperSpeed > [44174.684643] usb usb2: New USB device found, idVendor=1d6b, idProduct=0003, bcdDevice= 4.20 > [44174.684652] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1 > [44174.684657] usb usb2: Product: xHCI Host Controller > [44174.684663] usb usb2: Manufacturer: Linux 4.20.7-200.ivan3.fc29.x86_64 xhci-hcd > [44174.684669] usb usb2: SerialNumber: 0000:00:14.0 > [44174.685224] hub 2-0:1.0: USB hub found > [44174.685244] hub 2-0:1.0: 6 ports detected > [44175.003318] usb 1-8: new full-speed USB device number 2 using xhci_hcd > [44175.132257] usb 1-8: New USB device found, idVendor=05ac, idProduct=8290, bcdDevice= 1.61 > [44175.132265] usb 1-8: New USB device strings: Mfr=1, Product=2, SerialNumber=0 > [44175.132269] usb 1-8: Product: Bluetooth USB Host Controller > [44175.132273] usb 1-8: Manufacturer: Broadcom Corp. > [44175.139274] input: Broadcom Corp. Bluetooth USB Host Controller as /devices/pci0000:00/0000:00:14.0/usb1/1-8/1-8:1.0/0003:05AC:8290.0007/input/input17 > [44175.192287] hid-generic 0003:05AC:8290.0007: input,hidraw0: USB HID v1.11 Keyboard [Broadcom Corp. Bluetooth USB Host Controller] on usb-0000:00:14.0-8/input0 > [44175.193365] input: Broadcom Corp. Bluetooth USB Host Controller as /devices/pci0000:00/0000:00:14.0/usb1/1-8/1-8:1.1/0003:05AC:8290.0008/input/input18 > [44175.194093] hid-generic 0003:05AC:8290.0008: input,hidraw1: USB HID v1.11 Mouse [Broadcom Corp. Bluetooth USB Host Controller] on usb-0000:00:14.0-8/input1 > [44175.306721] Bluetooth: hci0: BCM: chip id 102 build 0715 > [44175.307686] Bluetooth: hci0: BCM: product 05ac:8290 > [44175.308676] Bluetooth: hci0: BCM: features 0x2f > [44175.324659] Bluetooth: hci0: im-mac > [44180.370567] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44186.001831] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44186.209761] usb 2-4: device not accepting address 2, error -62 > [44186.345774] usb 1-12: new full-speed USB device number 3 using xhci_hcd > [44186.474418] usb 1-12: New USB device found, idVendor=05ac, idProduct=0274, bcdDevice= 6.24 > [44186.474426] usb 1-12: New USB device strings: Mfr=1, Product=2, SerialNumber=3 > [44186.474431] usb 1-12: Product: Apple Internal Keyboard / Trackpad > [44186.474435] usb 1-12: Manufacturer: Apple Inc. > [44186.474440] usb 1-12: SerialNumber: D3H61830CG1FTV4A16PF > [44186.480463] apple 0003:05AC:0274.0009: hiddev96,hidraw2: USB HID v1.10 Device [Apple Inc. Apple Internal Keyboard / Trackpad] on usb-0000:00:14.0-12/input0 > [44186.481998] input: Apple Inc. Apple Internal Keyboard / Trackpad as /devices/pci0000:00/0000:00:14.0/usb1/1-12/1-12:1.1/0003:05AC:0274.000A/input/input19 > [44186.534551] apple 0003:05AC:0274.000A: input,hiddev97,hidraw3: USB HID v1.10 Keyboard [Apple Inc. Apple Internal Keyboard / Trackpad] on usb-0000:00:14.0-12/input1 > [44186.535255] input: bcm5974 as /devices/pci0000:00/0000:00:14.0/usb1/1-12/1-12:1.2/input/input20 > [44186.536753] apple 0003:05AC:0274.000B: hiddev98,hidraw4: USB HID v1.10 Device [Apple Inc. Apple Internal Keyboard / Trackpad] on usb-0000:00:14.0-12/input3 > [44186.538025] apple 0003:05AC:0274.000C: hiddev99,hidraw5: USB HID v1.10 Device [Apple Inc. Apple Internal Keyboard / Trackpad] on usb-0000:00:14.0-12/input4 > [44192.145867] xhci_hcd 0000:00:14.0: xHCI host not responding to stop endpoint command. > [44192.145882] xhci_hcd 0000:00:14.0: xHCI host controller not responding, assume dead > [44192.145948] xhci_hcd 0000:00:14.0: Timeout while waiting for configure endpoint command > [44192.145955] xhci_hcd 0000:00:14.0: Timeout while waiting for setup device command > [44192.145960] usb 1-8: Not enough bandwidth for altsetting 0 > [44192.145966] Bluetooth: hci0: setting interface failed (62) > [44192.145979] Bluetooth: hci0: urb 00000000a285970d failed to resubmit (22) > [44192.145988] Bluetooth: hci0: urb 00000000e0abf065 failed to resubmit (22) > [44192.145993] Bluetooth: hci0: urb 00000000cf6ab8ae failed to resubmit (22) > [44192.146009] xhci_hcd 0000:00:14.0: HC died; cleaning up > [44192.146267] usb 1-8: USB disconnect, device number 2 > [44192.201413] usb 1-12: USB disconnect, device number 3 > [44192.251930] bcm5974 1-12:1.2: could not read from device > [44192.561908] usb 2-4: device not accepting address 3, error -108 > [44192.561958] usb usb2-port4: attempt power cycle > [44192.873872] usb usb2-port4: couldn't allocate usb_device > > Full dmesg is here: https://raw.githubusercontent.com/im-0/investigate-card-reader-suspend-problem-on-mbp11.4/master/test-16/dmesg > > Any ideas on how to proceed with this? > > P.S. I'll try to reproduce this again with debug enabled. > It seems that I was wrong and long suspend was not the cause of that failure. I tried to reproduce it again, and after 4 successful suspend/resume cycles, with last suspend lasted for ~10 hour, and everything was fine. No suspend failures, card reader is there and functioning. Complete dmesg: https://raw.githubusercontent.com/im-0/investigate-card-reader-suspend-problem-on-mbp11.4/master/test-17/dmesg Maybe I forgot to add some wait somewhere in the code for something?