On Thu, Jun 14, 2012 at 12:46 PM, Alan Stern <stern@xxxxxxxxxxxxxxxxxxx> wrote: > On Wed, 13 Jun 2012, Austin Schuh wrote: > >> 2.6.38.8 seems to work. I didn't run it overnight, but it should have >> failed by the time that I canceled it. >> >> 2.6.38.8 with ehci->need_io_watchdog = 0 for all NVIDIA devices fails. >> dumping async shows that the transfer is just sitting there just like >> on the other two machines. > > Odd. > > Here's something to try. In ehci-hcd.c:ehci_iaa_watchdog(), change the > ehci_vdbg() function call to ehci_warn(), leaving the arguments the > same. This will log a message for each of the lost IAA interrupts, so > we'll be able to see a little bit of what's happening. Here's your dump. For completeness, I had ehci->need_io_watchdog = 1 enabled, on the desktop machine, kernel 2.6.38.8. All the other entries look very similar to one of these two. I can give you more of them if you would like. Jun 14 13:41:35 aschuh kernel: [ 791.577643] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:35 aschuh kernel: [ 791.632975] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 13:41:35 aschuh kernel: [ 791.632978] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:35 aschuh kernel: [ 791.780619] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:35 aschuh kernel: [ 791.809158] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 13:41:35 aschuh kernel: [ 791.809160] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:35 aschuh kernel: [ 791.972396] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:36 aschuh kernel: [ 792.198165] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 13:41:36 aschuh kernel: [ 792.198170] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:36 aschuh kernel: [ 792.483748] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:36 aschuh kernel: [ 792.486446] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 13:41:36 aschuh kernel: [ 792.486451] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:36 aschuh kernel: [ 792.931196] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:36 aschuh kernel: [ 792.939975] ehci_hcd 0000:00:1a.7: IAA watchdog: status 8028 cmd 10021 Jun 14 13:41:36 aschuh kernel: [ 792.940055] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 13:41:36 aschuh kernel: [ 792.940060] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:37 aschuh kernel: [ 793.378652] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 13:41:37 aschuh kernel: [ 793.381087] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 13:41:37 aschuh kernel: [ 793.381092] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:23 aschuh kernel: [ 1977.761339] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:23 aschuh kernel: [ 1977.810854] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 14:01:23 aschuh kernel: [ 1977.810857] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:23 aschuh kernel: [ 1977.839189] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 14:01:23 aschuh kernel: [ 1977.988243] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:23 aschuh kernel: [ 1978.043241] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 14:01:23 aschuh kernel: [ 1978.043247] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:23 aschuh kernel: [ 1978.243937] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:23 aschuh kernel: [ 1978.361084] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 14:01:23 aschuh kernel: [ 1978.361089] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:24 aschuh kernel: [ 1978.691386] usb 1-4: unlink qh64-0001/ffff88059fb92800 start 1 [1/0 us] Jun 14 14:01:24 aschuh kernel: [ 1978.698693] ehci_hcd 0000:00:1a.7: reused qh ffff88059fb92800 schedule Jun 14 14:01:24 aschuh kernel: [ 1978.698699] usb 1-4: link qh64-0001/ffff88059fb92800 start 1 [1/0 us] cat /var/log/syslog | IAA Jun 14 13:35:10 aschuh kernel: [ 407.168318] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 13:41:36 aschuh kernel: [ 792.939975] ehci_hcd 0000:00:1a.7: IAA watchdog: status 8028 cmd 10021 Jun 14 13:41:54 aschuh kernel: [ 810.747858] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 13:45:40 aschuh kernel: [ 1036.705444] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 13:45:47 aschuh kernel: [ 1042.827822] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 13:52:09 aschuh kernel: [ 1424.787116] ehci_hcd 0000:00:1a.7: IAA watchdog: status 8028 cmd 10021 Jun 14 13:56:10 aschuh kernel: [ 1665.323853] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 13:57:38 aschuh kernel: [ 1753.762315] ehci_hcd 0000:00:1a.7: IAA watchdog: status 8028 cmd 10021 Jun 14 14:01:23 aschuh kernel: [ 1977.839189] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 14:04:01 aschuh kernel: [ 2135.979125] ehci_hcd 0000:00:1a.7: IAA watchdog: status c028 cmd 10031 Jun 14 14:09:31 aschuh kernel: [ 2465.372666] ehci_hcd 0000:00:1a.7: IAA watchdog: status 8028 cmd 10021 Jun 14 14:18:32 aschuh kernel: [ 3005.369987] ehci_hcd 0000:00:1a.7: IAA watchdog: status e028 cmd 10031 # cat /sys/kernel/debug/usb/ehci/0000\:00\:1a.7/registers bus pci, device 0000:00:1a.7 EHCI Host Controller EHCI 1.00, hcd state 4 ownership 00000001 SMI sts/enable 0xc0080000 structural params 0x00103206 capability params 0x00016871 status 3008 Recl Halt FLR command 0010000 (park)=0 ithresh=1 period=1024 HALT intrenable 37 IAA FATAL PCD ERR INT uframe 1eef port:1 status 503000 0 ACK POWER OWNER sig=se0 port:2 status 501000 0 ACK POWER sig=se0 port:3 status 501000 0 ACK POWER sig=se0 port:4 status 501000 0 ACK POWER sig=se0 port:5 status 501000 0 ACK POWER sig=se0 port:6 status 501000 0 ACK POWER sig=se0 irq normal 32991 err 0 reclaim 4863 (lost 19) complete 32993 unlink 3094 Austin Schuh -- To unsubscribe from this list: send the line "unsubscribe linux-usb" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html