> -----Ursprüngliche Nachricht----- > Von: Greg KH [mailto:gregkh@xxxxxxxxxxxxxxxxxxx] > Gesendet: Freitag, 9. August 2019 09:56 > An: Schmid, Carsten <Carsten_Schmid@xxxxxxxxxx> > Cc: Alan Stern <stern@xxxxxxxxxxxxxxxxxxx>; Andrey Konovalov > <andreyknvl@xxxxxxxxxx>; Oliver Neukum <oneukum@xxxxxxxx>; > syzkaller-bugs <syzkaller-bugs@xxxxxxxxxxxxxxxx>; syzbot > <syzbot+ef5de9c4f99c4edb4e49@xxxxxxxxxxxxxxxxxxxxxxxxx>; USB list > <linux-usb@xxxxxxxxxxxxxxx>; Hillf Danton <hdanton@xxxxxxxx> > Betreff: Re: KASAN: use-after-free Read in usbhid_power > > On Fri, Aug 09, 2019 at 07:35:32AM +0000, Schmid, Carsten wrote: > > Hi all having use-after-free issues in USB shutdowns: > > I hunted for a similar case in the intel_xhci_usb_sw driver. > > What i have found and proposed is (from yesterday): > > --- > > [PATCH] kernel/resource.c: invalidate parent when freed resource has > childs > > > > When a resource is freed and has children, the childrens are > > left without any hint that their parent is no more valid. > > This caused at least one use-after-free in the xhci-hcd using > > ext-caps driver when platform code released platform devices. > > > > Fix this by setting child's parent to zero and warn. > > > > Signed-off-by: Carsten Schmid <carsten_schmid@xxxxxxxxxx> > > --- > > Rationale: > > When hunting for the root cause of a crash on a 4.14.86 kernel, i > > have found the root cause and checked it being still present > > upstream. Our case: > > Having xhci-hcd and intel_xhci_usb_sw active we can see in > > /proc/meminfo: (exceirpt) > > b3c00000-b3c0ffff : 0000:00:15.0 > > b3c00000-b3c0ffff : xhci-hcd > > b3c08070-b3c0846f : intel_xhci_usb_sw > > intel_xhci_usb_sw being a child of xhci-hcd. > > > > Doing an unbind command > > echo 0000:00:15.0 > /sys/bus/pci/drivers/xhci_hcd/unbind > > leads to xhci-hcd being freed in __release_region. > > The intel_xhci_usb_sw resource is accessed in platform code > > in platform_device_del with > > for (i = 0; i < pdev->num_resources; i++) { > > struct resource *r = &pdev->resource[i]; > > if (r->parent) > > release_resource(r); > > } > > as the resource's parent has not been updated, the release_resource > > uses the parent: > > p = &old->parent->child; > > which is now invalid. > > Fix this by marking the parent invalid in the child and give a warning > > in dmesg. > > --- > > kernel/resource.c | 9 +++++++++ > > 1 file changed, 9 insertions(+) > > > > diff --git a/kernel/resource.c b/kernel/resource.c > > index 158f04ec1d4f..95340cb0b1c2 100644 > > --- a/kernel/resource.c > > +++ b/kernel/resource.c > > @@ -1200,6 +1200,15 @@ void __release_region(struct resource *parent, > resource_size_t start, > > write_unlock(&resource_lock); > > if (res->flags & IORESOURCE_MUXED) > > wake_up(&muxed_resource_wait); > > + > > + write_lock(&resource_lock); > > Nit, should't this be written so that you only drop/get the lock if the > above if statement is true? > What if some other async part invalidates the child while accessing it? I wanted to make sure that the res->child is valid through the time it is accessed. > > + if (res->child) { > > + printk(KERN_WARNING "__release_region: %s has child > %s," > > + "invalidating childs parent\n", > > + res->name, res->child->name); > > What can userspace/anyone do about this if it triggers? > At least a platform driver developer can see he did something wrong. I had a look at the code of Hans and did not see anything weird. (platform driver is in drivers/usb/host/xhci-ext-caps.c) The issue is very racy - what happens when the platform code accesses the resource mainly depends on if the freed resource memory already has been reused by someone. It was hard to find that, and only a single core dump enabled me to find it. A first attempt was to set resource count to 0 in Hans' driver in the unregister pdev before calling platform_device_unregister. That worked. But this is a dirty hack in my eyes. The framework should detect such issues, so i decided to find the best place where to insert the check - and found it is the place where the resource is freed and still has childrens. > Can't we fix the root problem in the xhci driver to properly order how > it tears things down? If it has intel_xhci_usb_driver as a "child" > shouldn't that be unbound/freed before the parent is? > Hans, any idea? > thanks, > > greg k-h