Re: Control message failures kill entire XHCI stack

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Mathias & Devin,

On Thu, Feb 19, 2015 at 3:18 PM, Mathias Nyman
<mathias.nyman@xxxxxxxxxxxxxxx> wrote:
>
> Got one more patch added to the for-usb-next-branch.
> It makes sure we allocate enough scratchpad memory for xhci.
>
> It's one possible cause.
> Patch will anyway go to 3.20, but you can try it out first to see if it helps

My apologies for my slow response, I've been in hospital for almost
two weeks having my gallbladder removed.

I tried recording with the Hauppauge USB Live2 using the following kernel:

* 3.19.0 with the following patches:
* xhci: Allocate correct amount of scratchpad buffers
* xhci: Don't touch TRBs memory if those are no longer on the endpoint ring
* xhci: fix invalid pointer in reset device debugging
* xhci: add debugging for reset device and stop endpoint commands
* xhci: add command ring stop and restart debug messages

Unfortunately it still fails with the xHCI host controller dying.
I've included an extract from syslog which will hopefully show you the
main activity.  The entire syslog (boot, test, shutdown) is available
from: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1412121/+attachment/4330313/+files/syslog

Feb 27 11:59:03 alistair-XPS13 kernel: [  154.081591] xhci_hcd
0000:00:14.0: Cached old ring, 26 rings cached
Feb 27 11:59:03 alistair-XPS13 kernel: [  154.081644] xhci_hcd
0000:00:14.0: Endpoint 0x83 ep reset callback called
Feb 27 11:59:03 alistair-XPS13 rtkit-daemon[2137]: Successfully made
thread 4191 of process 3296 (n/a) owned by '1000' RT at priority 5.
Feb 27 11:59:03 alistair-XPS13 rtkit-daemon[2137]: Supervising 5
threads of 1 processes of 1 users.
Feb 27 11:59:03 alistair-XPS13 kernel: [  154.087381] xhci_hcd
0000:00:14.0: ERROR no room on ep ring, try ring expansion
Feb 27 11:59:03 alistair-XPS13 kernel: [  154.087393] xhci_hcd
0000:00:14.0: ring expansion succeed, now has 4 segments
Feb 27 11:59:03 alistair-XPS13 kernel: [  154.087411] xhci_hcd
0000:00:14.0: ERROR no room on ep ring, try ring expansion
Feb 27 11:59:03 alistair-XPS13 kernel: [  154.087415] xhci_hcd
0000:00:14.0: ring expansion succeed, now has 8 segments
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090518] xhci_hcd
0000:00:14.0: Cancel URB ffff8801f949f000, dev 2, ep 0x83, starting at
offset 0x1ff9d03b0
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090531] xhci_hcd
0000:00:14.0: Starting stop cmd watchdog timer for slot  4 ep index 6.
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090544] xhci_hcd
0000:00:14.0: cmdring ctrl reg before ringing 0x8
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090547] xhci_hcd
0000:00:14.0: // Ding dong!
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090554] xhci_hcd
0000:00:14.0: cmdring ctrl reg after ringing 0x8
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090590] xhci_hcd
0000:00:14.0: Stopped on Transfer TRB
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090602] xhci_hcd
0000:00:14.0: Removing canceled TD starting at 0x1ff9d0690 (dma).
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090607] xhci_hcd
0000:00:14.0: Finding endpoint context
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090612] xhci_hcd
0000:00:14.0: Cycle state = 0x0
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090616] xhci_hcd
0000:00:14.0: New dequeue segment = ffff880214e25940 (virtual)
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090620] xhci_hcd
0000:00:14.0: New dequeue pointer = 0x1ff9d06a0 (DMA)
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090624] xhci_hcd
0000:00:14.0: Removing canceled TD starting at 0x1ff9d06a0 (dma).
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090628] xhci_hcd
0000:00:14.0: TRB to noop at offset 0x1ff9d06a0
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090631] xhci_hcd
0000:00:14.0: Removing canceled TD starting at 0x1ff9d06b0 (dma).
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090635] xhci_hcd
0000:00:14.0: TRB to noop at offset 0x1ff9d06b0

...

Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090828] xhci_hcd
0000:00:14.0: event_trb is a no-op TRB. Skip it
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090833] xhci_hcd
0000:00:14.0: xhci_drop_endpoint called for udev ffff880208e61000
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090835] xhci_hcd
0000:00:14.0: Removing canceled TD starting at 0x1ff9d07b0 (dma).
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090838] xhci_hcd
0000:00:14.0: TRB to noop at offset 0x1ff9d07b0
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090842] xhci_hcd
0000:00:14.0: Removing canceled TD starting at 0x1ff9d07c0 (dma).
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.090845] xhci_hcd
0000:00:14.0: TRB to noop at offset 0x1ff9d07c0

...

Feb 27 11:59:08 alistair-XPS13 kernel: [  159.094764] xhci_hcd
0000:00:14.0: Cached old ring, 27 rings cached
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.094789] xhci_hcd
0000:00:14.0: Cancel URB ffff8801f949e000, dev 2, ep 0x83, starting at
offset 0x1ff9cf3d0
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.094794] xhci_hcd
0000:00:14.0: Cancel URB NOT on current ring
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.094798] xhci_hcd
0000:00:14.0: Cancel URB NOT on current ring
Feb 27 11:59:08 alistair-XPS13 kernel: [  159.094801] xhci_hcd
0000:00:14.0: Cancel URB NOT on current ring

...

Feb 27 12:00:02 alistair-XPS13 kernel: [  212.739606] xhci_hcd
0000:00:14.0: Cached old ring, 28 rings cached
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.739669] xhci_hcd
0000:00:14.0: Endpoint 0x83 ep reset callback called
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.743305] xhci_hcd
0000:00:14.0: ERROR no room on ep ring, try ring expansion
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.743313] xhci_hcd
0000:00:14.0: ring expansion succeed, now has 4 segments
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.743344] xhci_hcd
0000:00:14.0: ERROR no room on ep ring, try ring expansion
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.743354] xhci_hcd
0000:00:14.0: ring expansion succeed, now has 8 segments
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.753037] xhci_hcd
0000:00:14.0: xhci_drop_endpoint called for udev ffff880208e61000
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.753046] xhci_hcd
0000:00:14.0: drop ep 0x84, slot id 4, new drop flags = 0x200, new add
flags = 0x0
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.753054] xhci_hcd
0000:00:14.0: add ep 0x84, slot id 4, new drop flags = 0x200, new add
flags = 0x200
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.753057] xhci_hcd
0000:00:14.0: xhci_check_bandwidth called for udev ffff880208e61000
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.753060] xhci_hcd
0000:00:14.0: New Input Control Context:

...

Feb 27 12:00:02 alistair-XPS13 kernel: [  212.754850] xhci_hcd
0000:00:14.0: Cached old ring, 29 rings cached
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.754965] xhci_hcd
0000:00:14.0: Endpoint 0x84 ep reset callback called
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.847078] xhci_hcd
0000:00:14.0: ERROR no room on ep ring, try ring expansion
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.847090] xhci_hcd
0000:00:14.0: ring expansion succeed, now has 4 segments
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.847107] xhci_hcd
0000:00:14.0: ERROR no room on ep ring, try ring expansion
Feb 27 12:00:02 alistair-XPS13 kernel: [  212.847110] xhci_hcd
0000:00:14.0: ring expansion succeed, now has 8 segments
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.626923] xhci_hcd
0000:00:14.0: Cancel URB ffff8801f949e000, dev 2, ep 0x83, starting at
offset 0xaff600f0
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.626930] xhci_hcd
0000:00:14.0: Starting stop cmd watchdog timer for slot  4 ep index 6.
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.626937] xhci_hcd
0000:00:14.0: cmdring ctrl reg before ringing 0x8
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.626939] xhci_hcd
0000:00:14.0: // Ding dong!
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.626945] xhci_hcd
0000:00:14.0: cmdring ctrl reg after ringing 0x8
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.878228] xhci_hcd
0000:00:14.0: Cancel URB ffff8801f8b68480, dev 2, ep 0x0, starting at
offset 0xbd839670
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.878234] xhci_hcd
0000:00:14.0: Starting stop cmd watchdog timer for slot  4 ep index 0.
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.878242] xhci_hcd
0000:00:14.0: cmdring ctrl reg before ringing 0x8
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.878243] xhci_hcd
0000:00:14.0: // Ding dong!
Feb 27 12:00:37 alistair-XPS13 kernel: [  247.878247] xhci_hcd
0000:00:14.0: cmdring ctrl reg after ringing 0x8
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623214] xhci_hcd
0000:00:14.0: xHCI host not responding to stop endpoint command.
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623224] xhci_hcd
0000:00:14.0: Assuming host is dying, halting host.
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623245] xhci_hcd
0000:00:14.0: // Halt the HC
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623255] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 1, ep index 0
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623260] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 1, ep index 2
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623272] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 1, ep index 3
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623278] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 2, ep index 0
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623283] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 2, ep index 2
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623296] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 2, ep index 6
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623310] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 3, ep index 0
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623319] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 3, ep index 2
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623323] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 3, ep index 3
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623327] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 3, ep index 4
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623331] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 3, ep index 5
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623335] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 3, ep index 6
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623341] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 0
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623374] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 6
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623410] usb 1-2: Not
enough bandwidth for altsetting 0
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623426] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 8
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623439] cx231xx 1-2:1.1:
can't change interface 2 alt no. to 0 (err=-19)
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623449] cx231xx 1-2:1.1:
failed to set alternate setting !
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623479] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 9
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623512] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 12
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623551] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 27
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623568] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 28
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623581] xhci_hcd
0000:00:14.0: Killing URBs for slot ID 4, ep index 30
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623600] xhci_hcd
0000:00:14.0: Calling usb_hc_died()
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623613] xhci_hcd
0000:00:14.0: HC died; cleaning up
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623621] cx231xx 1-2:1.1:
cx231xx_send_usb_command: failed with status --22
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623636] cx231xx 1-2:1.1:
cx231xx_send_usb_command: failed with status --22
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623659] xhci_hcd
0000:00:14.0: xHCI host controller is dead.
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623694] xhci_hcd
0000:00:14.0: Command timeout on stopped ring
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623708] xhci_hcd
0000:00:14.0: Turn aborted command ffff8800be631190 to no-op
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623720] xhci_hcd
0000:00:14.0: Not restarting Command ring after stop event
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.623783] usb 1-2: USB
disconnect, device number 5
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.624123] cx231xx 1-2:1.1:
device video1 is open! Deregistration and memory deallocation are
deferred on close.
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.662979] usb 1-2:1.1:
V4L2 device vbi0 deregistered
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.663117] usb 1-2:1.1:
V4L2 device video1 deregistered
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.663124] ------------[
cut here ]------------
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.663131] WARNING: CPU: 2
PID: 4250 at fs/sysfs/group.c:219 sysfs_remove_group+0xa1/0xb0()
Feb 27 12:00:42 alistair-XPS13 kernel: [  252.663134] sysfs group
ffffffff81cc06e0 not found for kobject 'video1'
...
--
To unsubscribe from this list: send the line "unsubscribe linux-usb" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Media]     [Linux Input]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Old Linux USB Devel Archive]

  Powered by Linux