Re: xhci_hcd crash on linux 4.7.0

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Greg,

I managed to reproduce with a untainted kernel, see dmesg paste below.
The stack seemed corrupted as well ?

I refered to it as a crash since after a couple of these issues, the
machine hard freezes - I set up a serial console via a USB cable, but
I don't get the kernel oops out of the machine. The network is also
dead before getting any data. I could not think of any other way to
get a console out of a Macbook - any ideas ?

There is a progressive level of deterioration going on below, this is
why I'm adding multiple pastes. See the obviously invalid pointer
0000000000000001 in 3rd paste below. Also, see the protection fault in
the last paste. To me, something is trampling all over memory, and it
is usb-related.

Any suggestions are welcome,
Alex

[ 6462.324835] ------------[ cut here ]------------
[ 6462.324844] WARNING: CPU: 2 PID: 6296 at
/home/kernel/COD/linux/lib/list_debug.c:59
handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
[ 6462.324846] list_del corruption. prev->next should be
ffff880260d5b780, but was ffff8801f6377000
[ 6462.324847] Modules linked in: tun msr cpufreq_stats mcs7830 usbnet
mii uas usb_storage fuse zram zsmalloc rfcomm lz4_compress bnep joydev
intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp nls_utf8
nls_cp437 vfat btusb btrtl btbcm btintel bluetooth kvm_intel kvm fat
applesmc input_polldev bcm5974 snd_hda_codec_hdmi irqbypass
crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hmac drbg ansi_cprng
snd_hda_codec_cirrus snd_hda_codec_generic efi_pstore snd_hda_intel
snd_hda_codec snd_hda_core snd_hwdep snd_pcm aesni_intel snd_seq_midi
aes_x86_64 snd_seq_midi_event lrw gf128mul glue_helper ablk_helper
cryptd intel_cstate snd_rawmidi intel_rapl_perf brcmfmac snd_seq
brcmutil snd_seq_device snd_timer cfg80211 lpc_ich sg
intel_pch_thermal mfd_core efivars mmc_core snd thunderbolt mei_me
rfkill mei
[ 6462.324876]  soundcore shpchp sbs acpi_als kfifo_buf sbshc
industrialio evdev ac battery apple_bl tpm_tis tpm binfmt_misc nfsd
auth_rpcgss nfs_acl lockd grace parport_pc ppdev lp parport sunrpc
efivarfs autofs4 ext4 crc16 jbd2 mbcache sd_mod hid_apple hid_generic
usbhid hid crc32c_intel ahci libahci libata i915 scsi_mod xhci_pci
xhci_hcd usbcore usb_common i2c_algo_bit drm_kms_helper drm fjes video
button
[ 6462.324895] CPU: 2 PID: 6296 Comm: chrome Not tainted
4.7.0-040700-generic #201607241632
[ 6462.324896] Hardware name: Apple Inc.
MacBookPro12,1/Mac-E43C1C25D4880AD6, BIOS
MBP121.88Z.0167.B17.1606231721 06/23/2016
[ 6462.324898]  0000000000000086 00000000a491e776 ffffffff81321405
ffff88026ec83d90
[ 6462.324899]  0000000000000000 ffffffff81078a3e ffff880260d5b790
ffff88026ec83de8
[ 6462.324901]  ffff880260238258 0000000000000000 ffff880260d5b780
ffff8802603dcb40
[ 6462.324902] Call Trace:
[ 6462.324903]  <IRQ>  [<ffffffff81321405>] ? dump_stack+0x5c/0x77
[ 6462.324909]  [<ffffffff81078a3e>] ? __warn+0xbe/0xe0
[ 6462.324910]  [<ffffffff81078abf>] ? warn_slowpath_fmt+0x5f/0x80
[ 6462.324914]  [<ffffffffc013b6ab>] ?
handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
[ 6462.324918]  [<ffffffffc013d896>] ? xhci_irq+0x326/0xb00 [xhci_hcd]
[ 6462.324920]  [<ffffffff810a14e3>] ? try_to_wake_up+0x53/0x390
[ 6462.324922]  [<ffffffff810cdf78>] ? handle_irq_event_percpu+0x78/0x1b0
[ 6462.324924]  [<ffffffff810ce0e9>] ? handle_irq_event+0x39/0x60
[ 6462.324926]  [<ffffffff810d159b>] ? handle_edge_irq+0x7b/0x140
[ 6462.324928]  [<ffffffff8102de49>] ? handle_irq+0x19/0x30
[ 6462.324930]  [<ffffffff815ee436>] ? do_IRQ+0x46/0xd0
[ 6462.324931]  [<ffffffff815ec582>] ? common_interrupt+0x82/0x82
[ 6462.324932]  <EOI>  [<ffffffff8128eba0>] ? cap_settime+0x20/0x20
[ 6462.324935]  [<ffffffff81291161>] ? security_capable+0x41/0x60
[ 6462.324937]  [<ffffffff81081dd5>] ? ns_capable+0x25/0x50
[ 6462.324955]  [<ffffffffc0215823>] ?
i915_get_reset_stats_ioctl+0x83/0xe0 [i915]
[ 6462.324965]  [<ffffffffc002a611>] ? drm_ioctl+0x131/0x4c0 [drm]
[ 6462.324979]  [<ffffffffc02157a0>] ? i915_reg_read_ioctl+0x100/0x100 [i915]
[ 6462.324981]  [<ffffffff81157a30>] ? bpf_prog_select_runtime+0xc0/0xc0
[ 6462.324983]  [<ffffffff81120f05>] ? seccomp_phase1+0xa5/0x2d0
[ 6462.324985]  [<ffffffff81209cfd>] ? do_vfs_ioctl+0x9d/0x5c0
[ 6462.324987]  [<ffffffff8100391f>] ? syscall_trace_enter_phase1+0x11f/0x150
[ 6462.324989]  [<ffffffff8120a294>] ? SyS_ioctl+0x74/0x80
[ 6462.324990]  [<ffffffff81003c39>] ? do_syscall_64+0x59/0xb0
[ 6462.324992]  [<ffffffff815ebce5>] ? entry_SYSCALL64_slow_path+0x25/0x25
[ 6462.324993] ---[ end trace affc0ce74aea10c3 ]---

[ 6882.389120] WARNING: CPU: 2 PID: 12607 at
/home/kernel/COD/linux/lib/list_debug.c:59
handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
[ 6882.389121] list_del corruption. prev->next should be
ffff8800891a9500, but was ffff88008116b510
[ 6882.389122] Modules linked in: tun msr cpufreq_stats mcs7830 usbnet
mii uas usb_storage fuse zram zsmalloc rfcomm lz4_compress bnep joydev
intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp nls_utf8
nls_cp437 vfat btusb btrtl btbcm btintel bluetooth kvm_intel kvm fat
applesmc input_polldev bcm5974 snd_hda_codec_hdmi irqbypass
crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hmac drbg ansi_cprng
snd_hda_codec_cirrus snd_hda_codec_generic efi_pstore snd_hda_intel
snd_hda_codec snd_hda_core snd_hwdep snd_pcm aesni_intel snd_seq_midi
aes_x86_64 snd_seq_midi_event lrw gf128mul glue_helper ablk_helper
cryptd intel_cstate snd_rawmidi intel_rapl_perf brcmfmac snd_seq
brcmutil snd_seq_device snd_timer cfg80211 lpc_ich sg
intel_pch_thermal mfd_core efivars mmc_core snd thunderbolt mei_me
rfkill mei
[ 6882.389151]  soundcore shpchp sbs acpi_als kfifo_buf sbshc
industrialio evdev ac battery apple_bl tpm_tis tpm binfmt_misc nfsd
auth_rpcgss nfs_acl lockd grace parport_pc ppdev lp parport sunrpc
efivarfs autofs4 ext4 crc16 jbd2 mbcache sd_mod hid_apple hid_generic
usbhid hid crc32c_intel ahci libahci libata i915 scsi_mod xhci_pci
xhci_hcd usbcore usb_common i2c_algo_bit drm_kms_helper drm fjes video
button
[ 6882.389170] CPU: 2 PID: 12607 Comm: chrome Tainted: G        W
 4.7.0-040700-generic #201607241632
[ 6882.389171] Hardware name: Apple Inc.
MacBookPro12,1/Mac-E43C1C25D4880AD6, BIOS
MBP121.88Z.0167.B17.1606231721 06/23/2016
[ 6882.389172]  0000000000000086 00000000bc19f1d9 ffffffff81321405
ffff88026ec83d90
[ 6882.389174]  0000000000000000 ffffffff81078a3e ffff8800891a9510
ffff88026ec83de8
[ 6882.389176]  ffff880260238258 0000000000000000 ffff8800891a9500
ffff8802603dcb40
[ 6882.389177] Call Trace:
[ 6882.389178]  <IRQ>  [<ffffffff81321405>] ? dump_stack+0x5c/0x77
[ 6882.389185]  [<ffffffff81078a3e>] ? __warn+0xbe/0xe0
[ 6882.389187]  [<ffffffff81078abf>] ? warn_slowpath_fmt+0x5f/0x80
[ 6882.389190]  [<ffffffffc013b6ab>] ?
handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
[ 6882.389194]  [<ffffffffc013d896>] ? xhci_irq+0x326/0xb00 [xhci_hcd]
[ 6882.389196]  [<ffffffff810cdf78>] ? handle_irq_event_percpu+0x78/0x1b0
[ 6882.389197]  [<ffffffff810ce0e9>] ? handle_irq_event+0x39/0x60
[ 6882.389199]  [<ffffffff810d159b>] ? handle_edge_irq+0x7b/0x140
[ 6882.389201]  [<ffffffff8102de49>] ? handle_irq+0x19/0x30
[ 6882.389204]  [<ffffffff815ee436>] ? do_IRQ+0x46/0xd0
[ 6882.389205]  [<ffffffff815ec582>] ? common_interrupt+0x82/0x82
[ 6882.389205]  <EOI>
[ 6882.389207] ---[ end trace affc0ce74aea10c4 ]---

[ 7803.280729] WARNING: CPU: 2 PID: 0 at
/home/kernel/COD/linux/lib/list_debug.c:59
handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
[ 7803.280732] list_del corruption. prev->next should be
ffff8801f61acf00, but was 0000000000000001
[ 7803.280733] Modules linked in: tun msr cpufreq_stats mcs7830 usbnet
mii uas usb_storage fuse zram zsmalloc rfcomm lz4_compress bnep joydev
intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp nls_utf8
nls_cp437 vfat btusb btrtl btbcm btintel bluetooth kvm_intel kvm fat
applesmc input_polldev bcm5974 snd_hda_codec_hdmi irqbypass
crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hmac drbg ansi_cprng
snd_hda_codec_cirrus snd_hda_codec_generic efi_pstore snd_hda_intel
snd_hda_codec snd_hda_core snd_hwdep snd_pcm aesni_intel snd_seq_midi
aes_x86_64 snd_seq_midi_event lrw gf128mul glue_helper ablk_helper
cryptd intel_cstate snd_rawmidi intel_rapl_perf brcmfmac snd_seq
brcmutil snd_seq_device snd_timer cfg80211 lpc_ich sg
intel_pch_thermal mfd_core efivars mmc_core snd thunderbolt mei_me
rfkill mei
[ 7803.280772]  soundcore shpchp sbs acpi_als kfifo_buf sbshc
industrialio evdev ac battery apple_bl tpm_tis tpm binfmt_misc nfsd
auth_rpcgss nfs_acl lockd grace parport_pc ppdev lp parport sunrpc
efivarfs autofs4 ext4 crc16 jbd2 mbcache sd_mod hid_apple hid_generic
usbhid hid crc32c_intel ahci libahci libata i915 scsi_mod xhci_pci
xhci_hcd usbcore usb_common i2c_algo_bit drm_kms_helper drm fjes video
button
[ 7803.280802] CPU: 2 PID: 0 Comm: swapper/2 Tainted: G        W
4.7.0-040700-generic #201607241632
[ 7803.280803] Hardware name: Apple Inc.
MacBookPro12,1/Mac-E43C1C25D4880AD6, BIOS
MBP121.88Z.0167.B17.1606231721 06/23/2016
[ 7803.280804]  0000000000000086 b54de8bf7ad1011b ffffffff81321405
ffff88026ec83d90
[ 7803.280806]  0000000000000000 ffffffff81078a3e ffff8801f61acf10
ffff88026ec83de8
[ 7803.280808]  ffff880260238258 0000000000000000 ffff8801f61acf00
ffff8802603dcb40
[ 7803.280810] Call Trace:
[ 7803.280811]  <IRQ>  [<ffffffff81321405>] ? dump_stack+0x5c/0x77
[ 7803.280819]  [<ffffffff81078a3e>] ? __warn+0xbe/0xe0
[ 7803.280820]  [<ffffffff81078abf>] ? warn_slowpath_fmt+0x5f/0x80
[ 7803.280825]  [<ffffffffc013b6ab>] ?
handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
[ 7803.280829]  [<ffffffffc013d896>] ? xhci_irq+0x326/0xb00 [xhci_hcd]
[ 7803.280832]  [<ffffffff810e2d76>] ? hrtimer_start_range_ns+0x1d6/0x3e0
[ 7803.280835]  [<ffffffff810cdf78>] ? handle_irq_event_percpu+0x78/0x1b0
[ 7803.280837]  [<ffffffff810ce0e9>] ? handle_irq_event+0x39/0x60
[ 7803.280839]  [<ffffffff810d159b>] ? handle_edge_irq+0x7b/0x140
[ 7803.280841]  [<ffffffff8102de49>] ? handle_irq+0x19/0x30
[ 7803.280844]  [<ffffffff815ee436>] ? do_IRQ+0x46/0xd0
[ 7803.280845]  [<ffffffff815ec582>] ? common_interrupt+0x82/0x82
[ 7803.280846]  <EOI>  [<ffffffff8109da57>] ? finish_task_switch+0x77/0x1f0
[ 7803.280850]  [<ffffffff815e783d>] ? __schedule+0x1ed/0x6b0
[ 7803.280852]  [<ffffffff815e7d31>] ? schedule+0x31/0x80
[ 7803.280854]  [<ffffffff815e7fba>] ? schedule_preempt_disabled+0xa/0x10
[ 7803.280856]  [<ffffffff810b92f5>] ? cpu_startup_entry+0x195/0x330
[ 7803.280859]  [<ffffffff8104ce21>] ? start_secondary+0x151/0x190
[ 7803.280861] ---[ end trace affc0ce74aea10c5 ]---

[10035.839462] general protection fault: 0000 [#1] SMP
[10035.839482] Modules linked in: tun msr cpufreq_stats mcs7830 usbnet
mii uas usb_storage fuse zram zsmalloc rfcomm lz4_compress bnep joydev
intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp nls_utf8
nls_cp437 vfat btusb btrtl btbcm btintel bluetooth kvm_intel kvm fat
applesmc input_polldev bcm5974 snd_hda_codec_hdmi irqbypass
crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hmac drbg ansi_cprng
snd_hda_codec_cirrus snd_hda_codec_generic efi_pstore snd_hda_intel
snd_hda_codec snd_hda_core snd_hwdep snd_pcm aesni_intel snd_seq_midi
aes_x86_64 snd_seq_midi_event lrw gf128mul glue_helper ablk_helper
cryptd intel_cstate snd_rawmidi intel_rapl_perf brcmfmac snd_seq
brcmutil snd_seq_device snd_timer cfg80211 lpc_ich sg
intel_pch_thermal mfd_core efivars mmc_core snd thunderbolt mei_me
rfkill mei
[10035.839746]  soundcore shpchp sbs acpi_als kfifo_buf sbshc
industrialio evdev ac battery apple_bl tpm_tis tpm binfmt_misc nfsd
auth_rpcgss nfs_acl lockd grace parport_pc ppdev lp parport sunrpc
efivarfs autofs4 ext4 crc16 jbd2 mbcache sd_mod hid_apple hid_generic
usbhid hid crc32c_intel ahci libahci libata i915 scsi_mod xhci_pci
xhci_hcd usbcore usb_common i2c_algo_bit drm_kms_helper drm fjes video
button
[10035.839885] CPU: 2 PID: 16070 Comm: colord-sane Tainted: G        W
      4.7.0-040700-generic #201607241632
[10035.839909] Hardware name: Apple Inc.
MacBookPro12,1/Mac-E43C1C25D4880AD6, BIOS
MBP121.88Z.0167.B17.1606231721 06/23/2016
[10035.839937] task: ffff880203f9cec0 ti: ffff8801cd570000 task.ti:
ffff8801cd570000
[10035.839956] RIP: 0010:[<ffffffff8132f3e6>]  [<ffffffff8132f3e6>]
memcpy_erms+0x6/0x10
[10035.839980] RSP: 0018:ffff8801cd573df8  EFLAGS: 00010202
[10035.839994] RAX: ffff880247019c12 RBX: 0000000000000400 RCX: 0000000000000029
[10035.840012] RDX: 0000000000000029 RSI: 09030003002a0f05 RDI: ffff880247019c12
[10035.840030] RBP: 0000000000000029 R08: 0000000000000000 R09: 0000000000000400
[10035.840048] R10: ffff88020fe03e00 R11: ffff8800893c2700 R12: ffff880203e5c0a8
[10035.840066] R13: 0000000000000001 R14: 00000000000003ee R15: ffff880247156d98
[10035.840085] FS:  00007f798744a880(0000) GS:ffff88026ec80000(0000)
knlGS:0000000000000000
[10035.840105] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[10035.840120] CR2: 000055921bae8748 CR3: 00000001f6366000 CR4: 00000000003406e0
[10035.840138] Stack:
[10035.840144]  ffffffffc00f02cf ffff880247156d80 0000000000000400
ffff880247019c00
[10035.840166]  ffff8801cd573f20 000055921bac6ed0 ffffffff812713d3
ffff88020fe03e00
[10035.840188]  ffff8801cd573f20 000055921bac6ed0 ffff8801cd573f20
0000000000000000
[10035.840211] Call Trace:
[10035.840225]  [<ffffffffc00f02cf>] ? read_descriptors+0x9f/0xe0 [usbcore]
[10035.840245]  [<ffffffff812713d3>] ? kernfs_fop_read+0xa3/0x170
[10035.840261]  [<ffffffff811f6273>] ? __vfs_read+0x33/0x150
[10035.840276]  [<ffffffff811f6740>] ? vfs_read+0x90/0x130
[10035.840291]  [<ffffffff811f7bb2>] ? SyS_read+0x52/0xc0
[10035.840306]  [<ffffffff815ebc36>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8
[10035.840323] Code: 90 90 90 90 90 eb 1e 0f 1f 00 48 89 f8 48 89 d1
48 c1 e9 03 83 e2 07 f3 48 a5 89 d1 f3 a4 c3 66 0f 1f 44 00 00 48 89
f8 48 89 d1 <f3> a4 c3 0f 1f 80 00 00 00 00 48 89 f8 48 83 fa 20 72 7e
40 38
[10035.840429] RIP  [<ffffffff8132f3e6>] memcpy_erms+0x6/0x10
[10035.840445]  RSP <ffff8801cd573df8>
[10035.856879] ---[ end trace affc0ce74aea10c7 ]---

On Thu, Jul 28, 2016 at 2:40 PM, Greg KH <greg@xxxxxxxxx> wrote:
> On Thu, Jul 28, 2016 at 11:29:41AM +0100, Alex Damian wrote:
>> Hello all,
>>
>> Hope this is the right place to report a bug; if not, please direct me
>> to where it should go.
>>
>> I am running a mainline kernel build by the Ubuntu mainline PPA, on a
>> 2015 Macbook. It is being tainted by use of the vboxdrv module. I am
>> repeatedly getting errors on xhci_hcd irq handler.
>>
>> There is detailed data below. I generally associate the error with
>> plugging in the external mouse, but I can't reproduce reliably.
>>
>>
>> The relevant trace in dmesg are listed below:
>>
>> [  699.935961] ------------[ cut here ]------------
>> [  699.935971] WARNING: CPU: 2 PID: 0 at
>> /home/kernel/COD/linux/lib/list_debug.c:59
>> handle_cmd_completion+0x4db/0xc60 [xhci_hcd]
>> [  699.935972] list_del corruption. prev->next should be
>> ffff8800829fc680, but was ffff8801c12ad5c0
>
> This isn't a "crash", but just a warning of some kernel debug options.
>
> If you can reproduce this without the horrid vboxdrv kernel modules,
> please let us know, otherwise you are going to have to ask the virtual
> box developers, as there's nothing we can do if their code is loaded in
> the kernel (and their code is really really really horrid, seriously, I
> don't even understand how it is semi-stable at all, it's the stuff of
> nightmares and there's good reasons why it is not included in the
> upstream kernel releases.)
>
> Best of luck,
>
> greg k-h
--
To unsubscribe from this list: send the line "unsubscribe linux-usb" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Media]     [Linux Input]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Old Linux USB Devel Archive]

  Powered by Linux