On Wed, Oct 2, 2013 at 2:41 PM, Russell King - ARM Linux <linux@xxxxxxxxxxxxxxxx> wrote: > On Mon, Sep 30, 2013 at 10:53:46PM -0700, Bjorn Helgaas wrote: >> I think the current kobject delayed release is too aggressive, > > I don't agree with that statement, but the rest of the sentence I do: I think you're right. I think we need to keep the delayed release the way it is, and pay more attention to the way we remove things from sysfs. >> in the >> sense that even after we've released all references, the object can >> still be in sysfs, which causes future creates to fail. E.g., this >> fails: >> >> kset = kset_create_and_add("kobj_test", NULL, NULL); >> kset_unregister(kset); >> kset = kset_create_and_add("kobj_test", NULL, NULL); // FAILS >> >> when I think it should succeed. We don't have a way for the caller to >> determine when it's safe to do the second kset_create_and_add(). > > The reason this happens is that for some reason I can't fathom, the > sysfs cleanup is done when the release function is called, not when > the object is unregistered. Right. I think we might want to have kset_unregister() call kobject_del() explicitly. That way the kset is removed from sysfs immediately even if the release is delayed, and it is safe to create another kset with the same name as soon as kset_register() returns. Part of my confusion has been that I didn't connect "kobject_del()" with the removal from sysfs (even though this is well-documented in Documentation/kobject.txt). I think using kobject_del()an important part of fixing not only this MSI issue, but also races in the PCI device removal path. A name like "kobject_sysfs_unlink()" might be more descriptive and help convey the idea that this is the point where we prevent any future references via sysfs. > I can see why that's done - it is so that when a kobject is unregistered, > its sysfs entry hangs around until all the children have gone (and hence > its reference count then hits zero.) > >> After we release all references, I think it's OK for the kobject >> itself to continue to exist, i.e., we can delay calling t->release(). >> But it should be impossible to find a kobject with refcount == 0 via >> sysfs, so we should be able to create a new one with the same name. >> >> In terms of code, I'm suggesting something like the following: > > I think I can give you an ack for this - it looks sensible enough, and > should still have the intended debugging behaviour. I haven't tested > it though. > > Acked-by: Russell King <rmk+kernel@xxxxxxxxxxxxxxxx> Thanks. I think I'm going to propose something like the patch below instead. It solves the kset_create_and_add(), kset_unregister(), kset_create_and_add() problem as well. I think we'll have to address the MSI issue by adding kobject_del() in the appropriate places. diff --git a/lib/kobject.c b/lib/kobject.c index 9621751..9098992 100644 --- a/lib/kobject.c +++ b/lib/kobject.c @@ -753,6 +753,7 @@ void kset_unregister(struct kset *k) { if (!k) return; + kobject_del(&k->kobj); kobject_put(&k->kobj); } -- To unsubscribe from this list: send the line "unsubscribe linux-pci" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html