Re: OHCI unplug kernel crash in kernel 4.3, 4.4 and 4.5

Please use Reply-To-All so that your message gets sent to the mailing 
list as well as to me.

On Mon, 29 Feb 2016, Stefani Seibold wrote:

> Hi,
> 
> this is the result with kernel 4.5.0-rc6, without the VirtualBox modules
> and the nvidia driver. The patch was applied. The bug still occurs.

The lines added by the patch were never executed.  This means that
dev->archdata.iommu is getting set to NULL some place other than
unlink_domain_info().  I have no idea where that would be.
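
For anyone who wants to check a captured log the same way, here is a
minimal user-space sketch (not kernel code). It only tests whether the
marker string printed by the diagnostic patch's dev_info() call shows up
before the first "kernel BUG at" line. The function name
marker_before_crash is made up for illustration, and the sketch assumes
the whole log has already been read into a single string.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper: return true if `marker` occurs in `log` before the
 * first occurrence of `crash`. If `crash` never appears, any occurrence
 * of `marker` counts. Returns false when `marker` is absent.
 */
bool marker_before_crash(const char *log, const char *marker,
                         const char *crash)
{
    assert(log != NULL && marker != NULL && crash != NULL);

    const char *hit = strstr(log, marker); /* first diagnostic message */
    const char *bug = strstr(log, crash);  /* first crash banner */

    if (hit == NULL)
        return false;                      /* patched lines never ran */
    return bug == NULL || hit < bug;
}
```

Called with "unlink_domain_info" as the marker and "kernel BUG at" as the
crash banner, a false result for the log quoted below is what "never
executed" means here.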

Joerg, can you suggest any possibilities?

Alan Stern


> pcieport 0000:08:00.0: BAR 13: failed to assign [io  size 0x1000]
> pcieport 0000:08:00.0: BAR 13: no space for [io  size 0x1000]
> pcieport 0000:08:00.0: BAR 13: failed to assign [io  size 0x1000]
> pcieport 0000:09:03.0: res[13]=[io  0x1000-0x0fff] res_to_dev_res add_size 1000 min_align 1000
> pcieport 0000:09:03.0: res[13]=[io  0x1000-0x1fff] res_to_dev_res add_size 1000 min_align 1000
> pcieport 0000:09:03.0: BAR 13: no space for [io  size 0x1000]
> pcieport 0000:09:03.0: BAR 13: failed to assign [io  size 0x1000]
> pcieport 0000:09:03.0: BAR 13: no space for [io  size 0x1000]
> pcieport 0000:09:03.0: BAR 13: failed to assign [io  size 0x1000]
> usb usb6: New USB device found, idVendor=1d6b, idProduct=0001
> usb usb6: New USB device strings: Mfr=3, Product=2, SerialNumber=1
> usb usb6: Product: OHCI PCI host controller
> usb usb6: Manufacturer: Linux 4.5.0-rc6-dirty ohci_hcd
> usb usb6: SerialNumber: 0000:0b:00.0
> hub 6-0:1.0: USB hub found
> hub 6-0:1.0: 3 ports detected
> ohci-pci 0000:0b:00.1: OHCI PCI host controller
> ohci-pci 0000:0b:00.1: new USB bus registered, assigned bus number 7
> ohci-pci 0000:0b:00.1: irq 19, io mem 0xd4001000
> usb usb7: New USB device found, idVendor=1d6b, idProduct=0001
> usb usb7: New USB device strings: Mfr=3, Product=2, SerialNumber=1
> usb usb7: Product: OHCI PCI host controller
> usb usb7: Manufacturer: Linux 4.5.0-rc6-dirty ohci_hcd
> usb usb7: SerialNumber: 0000:0b:00.1
> hub 7-0:1.0: USB hub found
> hub 7-0:1.0: 2 ports detected
> pciehp 0000:06:03.0:pcie24: Card not present on Slot(3)
> pciehp 0000:06:03.0:pcie24: slot(3): Link Down event
> pciehp 0000:06:03.0:pcie24: Link Down event ignored on slot(3): already powering off
> pciehp 0000:00:1c.4:pcie04: Card not present on Slot(4)
> pciehp 0000:00:1c.4:pcie04: slot(4): Link Down event
> pciehp 0000:06:03.0:pcie24: Cannot remove display device 0000:08:00.0
> pciehp 0000:00:1c.4:pcie04: Link Down event ignored on slot(4): already powering off
> pciehp 0000:06:06.0:pcie24: unloading service driver pciehp
> pciehp 0000:06:06.0:pcie24: Timeout on hotplug command 0x1038 (issued 26775 msec ago)
> pciehp 0000:06:06.0:pcie24: pcie_do_write_cmd: no response from device
> pciehp 0000:06:05.0:pcie24: unloading service driver pciehp
> pciehp 0000:06:05.0:pcie24: Timeout on hotplug command 0x1038 (issued 26776 msec ago)
> pciehp 0000:06:05.0:pcie24: pcie_do_write_cmd: no response from device
> pciehp 0000:06:04.0:pcie24: unloading service driver pciehp
> pciehp 0000:06:04.0:pcie24: Timeout on hotplug command 0x1038 (issued 26778 msec ago)
> pciehp 0000:06:04.0:pcie24: pcie_do_write_cmd: no response from device
> ehci-pci 0000:0b:00.2: HC died; cleaning up
> ehci-pci 0000:0b:00.2: remove, state 4
> usb usb5: USB disconnect, device number 1
> ehci-pci 0000:0b:00.2: USB bus 5 deregistered
> ohci-pci 0000:0b:00.1: HC died; cleaning up
> ohci-pci 0000:0b:00.1: remove, state 4
> usb usb7: USB disconnect, device number 1
> ohci-pci 0000:0b:00.1: USB bus 7 deregistered
> ohci-pci 0000:0b:00.0: HC died; cleaning up
> ohci-pci 0000:0b:00.0: remove, state 4
> usb usb6: USB disconnect, device number 1
> ------------[ cut here ]------------
> kernel BUG at drivers/iommu/intel-iommu.c:3595!
> invalid opcode: 0000 [#1] PREEMPT SMP 
> Modules linked in: ohci_pci ohci_hcd netconsole iwlmvm iwlwifi vhost_net tun vhost kvm_intel kvm irqbypass dell_smm_hwmon
> CPU: 0 PID: 4 Comm: kworker/0:0 Not tainted 4.5.0-rc6-dirty #10
> Hardware name: Dell Inc. XPS 2720/05R2TK  , BIOS A12 09/21/2015
> Workqueue: pciehp-4 pciehp_power_thread
> task: ffff88046d298080 ti: ffff88046d2a4000 task.ti: ffff88046d2a4000
> RIP: 0010:[<ffffffff80638834>]  [<ffffffff80638834>] intel_unmap+0x1c4/0x1d0
> RSP: 0018:ffff88046d2a7b48  EFLAGS: 00010246
> RAX: 0000000000000000 RBX: ffff8804675d2098 RCX: 00000000ffffc000
> RDX: 0000000000000000 RSI: 00000000ffffc000 RDI: ffff8804675d2098
> RBP: ffff88046d2a7b80 R08: 0000000000000000 R09: 0000000000000000
> R10: ffff88046cc000c0 R11: 0000000000000000 R12: 00000000ffffffff
> R13: 00000000ffffc000 R14: ffff88046bd27940 R15: ffff88042ea65c08
> FS:  0000000000000000(0000) GS:ffff88047f200000(0000) knlGS:0000000000000000
> CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> CR2: 00005623bb8f9000 CR3: 000000000120a000 CR4: 00000000001406f0
> Stack:
>  ffff88042ea6a6f8 ffff88046d2a7ba0 ffffea00108d2180 00000000ffffffff
>  ffffffff812a8540 ffff88046bd27940 ffff88042ea65c08 ffff88046d2a7ba0
>  ffffffff806388ca ffff8804646dc800 ffff8804675d2098 ffff88046d2a7bd8
> Call Trace:
>  [<ffffffff806388ca>] intel_free_coherent+0x5a/0xa0
>  [<ffffffffa00b4ce4>] ohci_stop+0x144/0x1c0 [ohci_hcd]
>  [<ffffffff80804134>] usb_remove_hcd+0xe4/0x1a0
>  [<ffffffff808145a3>] usb_hcd_pci_remove+0x63/0x130
>  [<ffffffff8058cca9>] pci_device_remove+0x39/0xc0
>  [<ffffffff8073432a>] __device_release_driver+0x9a/0x140
>  [<ffffffff807343f3>] device_release_driver+0x23/0x30
>  [<ffffffff8058666a>] pci_stop_bus_device+0x8a/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586782>] pci_stop_and_remove_bus_device+0x12/0x20
>  [<ffffffff8059ed1b>] pciehp_unconfigure_device+0x9b/0x180
>  [<ffffffff8059e823>] pciehp_disable_slot+0x43/0xb0
>  [<ffffffff8059e91d>] pciehp_power_thread+0x8d/0xb0
>  [<ffffffff8028e9e0>] process_one_work+0x140/0x3e0
>  [<ffffffff8028ecce>] worker_thread+0x4e/0x480
>  [<ffffffff8028ec80>] ? process_one_work+0x3e0/0x3e0
>  [<ffffffff802942e9>] kthread+0xc9/0xe0
>  [<ffffffff80294220>] ? kthread_create_on_node+0x180/0x180
>  [<ffffffff80b7bfef>] ret_from_fork+0x3f/0x70
>  [<ffffffff80294220>] ? kthread_create_on_node+0x180/0x180
> Code: f9 48 89 de e8 4e cc ff ff 4c 89 e6 4c 89 f7 e8 03 90 ff ff 4c 89 ef e8 5b cd ff ff 48 83 c4 10 5b 41 5c 41 5d 41 5e 41 5f 5d c3 <0f> 0b e8 95 cd ff ff e9 f3 fe ff ff 0f 1f 44 00 00 55 48 8b 76 
> RIP  [<ffffffff80638834>] intel_unmap+0x1c4/0x1d0
>  RSP <ffff88046d2a7b48>
> ---[ end trace 2827abf1d848627d ]---
> BUG: unable to handle kernel paging request at ffffffffffffffd8
> IP: [<ffffffff80294951>] kthread_data+0x11/0x20
> PGD 120b067 PUD 120d067 PMD 0 
> Oops: 0000 [#2] PREEMPT SMP 
> Modules linked in: ohci_pci ohci_hcd netconsole iwlmvm iwlwifi vhost_net tun vhost kvm_intel kvm irqbypass dell_smm_hwmon
> CPU: 0 PID: 4 Comm: kworker/0:0 Tainted: G      D         4.5.0-rc6-dirty #10
> Hardware name: Dell Inc. XPS 2720/05R2TK  , BIOS A12 09/21/2015
> task: ffff88046d298080 ti: ffff88046d2a4000 task.ti: ffff88046d2a4000
> RIP: 0010:[<ffffffff80294951>]  [<ffffffff80294951>] kthread_data+0x11/0x20
> RSP: 0018:ffff88046d2a7828  EFLAGS: 00010002
> RAX: 0000000000000000 RBX: 0000000000000000 RCX: ffffffff814c9900
> RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff88046d298080
> RBP: ffff88046d2a7838 R08: 0000000000000000 R09: ffff88046d1d1101
> R10: 000000000000c000 R11: 000000000000001a R12: 0000000000000000
> R13: 0000000000015500 R14: ffff88046d298080 R15: ffff88047f215500
> FS:  0000000000000000(0000) GS:ffff88047f200000(0000) knlGS:0000000000000000
> CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> CR2: 0000000000000028 CR3: 000000000120a000 CR4: 00000000001406f0
> Stack:
>  ffffffff8028f901 ffff88047f215500 ffff88046d2a7898 ffffffff80b78089
>  ffff88046d2a7858 ffffffff00000000 ffff88046d298080 ffffffff802792b1
>  0000000000000000 ffff88046d2a8000 ffff88046d2a78e0 ffff88046d2a73f0
> Call Trace:
>  [<ffffffff8028f901>] ? wq_worker_sleeping+0x11/0x90
>  [<ffffffff80b78089>] __schedule+0x629/0xb10
>  [<ffffffff802792b1>] ? release_task+0x311/0x3e0
>  [<ffffffff80b785ac>] schedule+0x3c/0x90
>  [<ffffffff8027a9fb>] do_exit+0x75b/0xaf0
>  [<ffffffff80207bb9>] oops_end+0x89/0xc0
>  [<ffffffff80207e1b>] die+0x4b/0x70
>  [<ffffffff80205024>] do_trap+0xb4/0x150
>  [<ffffffff80205137>] do_error_trap+0x77/0xe0
>  [<ffffffff80638834>] ? intel_unmap+0x1c4/0x1d0
>  [<ffffffff80633706>] ? qi_submit_sync+0x2d6/0x3e0
>  [<ffffffff80205890>] do_invalid_op+0x20/0x30
>  [<ffffffff80b7d648>] invalid_op+0x18/0x20
>  [<ffffffff80638834>] ? intel_unmap+0x1c4/0x1d0
>  [<ffffffff80638691>] ? intel_unmap+0x21/0x1d0
>  [<ffffffff806388ca>] intel_free_coherent+0x5a/0xa0
>  [<ffffffffa00b4ce4>] ohci_stop+0x144/0x1c0 [ohci_hcd]
>  [<ffffffff80804134>] usb_remove_hcd+0xe4/0x1a0
>  [<ffffffff808145a3>] usb_hcd_pci_remove+0x63/0x130
>  [<ffffffff8058cca9>] pci_device_remove+0x39/0xc0
>  [<ffffffff8073432a>] __device_release_driver+0x9a/0x140
>  [<ffffffff807343f3>] device_release_driver+0x23/0x30
>  [<ffffffff8058666a>] pci_stop_bus_device+0x8a/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586611>] pci_stop_bus_device+0x31/0xa0
>  [<ffffffff80586782>] pci_stop_and_remove_bus_device+0x12/0x20
>  [<ffffffff8059ed1b>] pciehp_unconfigure_device+0x9b/0x180
>  [<ffffffff8059e823>] pciehp_disable_slot+0x43/0xb0
>  [<ffffffff8059e91d>] pciehp_power_thread+0x8d/0xb0
>  [<ffffffff8028e9e0>] process_one_work+0x140/0x3e0
>  [<ffffffff8028ecce>] worker_thread+0x4e/0x480
>  [<ffffffff8028ec80>] ? process_one_work+0x3e0/0x3e0
>  [<ffffffff802942e9>] kthread+0xc9/0xe0
>  [<ffffffff80294220>] ? kthread_create_on_node+0x180/0x180
>  [<ffffffff80b7bfef>] ret_from_fork+0x3f/0x70
>  [<ffffffff80294220>] ? kthread_create_on_node+0x180/0x180
> Code: 02 00 00 00 e8 71 fd ff ff 5d c3 0f 1f 44 00 00 66 2e 0f 1f 84 00 00 00 
> 
> On Sunday, 28.02.2016, at 15:52 -0500, Alan Stern wrote:
> > On Sun, 28 Feb 2016, Greg KH wrote:
> > 
> > > On Sun, Feb 28, 2016 at 11:04:03AM -0800, Greg KH wrote:
> > > > On Sun, Feb 28, 2016 at 10:24:32AM +0100, Stefani Seibold wrote:
> > > > > I already reported this bug 6 weeks ago, and I have now checked it
> > > > > with the current kernel 4.5.0-rc5. The bug is still not fixed.
> > > > > 
> > > > > Unplugging a USB 1.0 OHCI controller express card results in a
> > > > > kernel crash. The express card is attached via Thunderbolt and a
> > > > > Sonnet express-card-to-Thunderbolt adapter. The computer hangs
> > > > > after the unplug; only a power-down fixes the situation.
> > > > > 
> > > > > This is the kernel log of a kernel 4.4 via netconsole:
> > > > 
> > > > Can you duplicate this without the nvidia driver loaded?
> > > 
> > > Oh, and without virtualbox, that is a horrid mess of total crap,
> > > NEVER trust that code to do anything correctly.  I'm totally amazed
> > > that it works at all, and it turns out, often times it really doesn't...
> > 
> > If the problem still occurs, please apply the diagnostic patch below
> > and post the resulting log messages.
> > 
> > Alan Stern
> > 
> > 
> > 
> > Index: usb-4.x/drivers/iommu/intel-iommu.c
> > ===================================================================
> > --- usb-4.x.orig/drivers/iommu/intel-iommu.c
> > +++ usb-4.x/drivers/iommu/intel-iommu.c
> > @@ -2270,8 +2270,11 @@ static inline void unlink_domain_info(st
> >         assert_spin_locked(&device_domain_lock);
> >         list_del(&info->link);
> >         list_del(&info->global);
> > -       if (info->dev)
> > +       if (info->dev) {
> > +               dev_info(info->dev, "unlink_domain_info\n");
> > +               dump_stack();
> >                 info->dev->archdata.iommu = NULL;
> > +       }
> >  }
> >  
> >  static void domain_remove_dev_info(struct dmar_domain *domain)
> > 
> 
> 

