On Sat, Feb 04, 2023 at 10:32:11PM +0900, Tetsuo Handa wrote: > Hello. > > There is a long-standing deadlock problem in driver core code caused by > "struct device"->mutex being marked as "do not apply lockdep checks". What exactly is the deadlock problem? Furthermore, how can skipping some lockdep checks _cause_ a problem? > We can make this deadlock visible by applying [1], and we can confirm that > there is a deadlock problem that I think needs to be addressed in core code [2]. I don't understand why you think there is a deadlock problem. The splat in [2] says "WARNING: possible recursive locking detected". This is only a warning; it doesn't mean there really is a problem. > Also, since driver developers are taking it for granted that driver callback > functions can behave as if dev->mutex is not held (because possibility of deadlock > was never reported), What? I have no idea what developers you're talking about. I have never heard of anyone taking this for granted. Certainly developers working on the USB subsystem's core are well aware of dev->mutex locking. > it would solve many deadlocks in driver code if you can update What deadlocks? If there are so many deadlocks floating around in driver code, why haven't we heard about them before now? > driver core code to avoid calling driver callback functions with dev->mutex held We most definitely cannot do that. The driver core relies on mutual exclusion. > (by e.g. replacing dev->mutex with dev->spinlock and dev->atomic_flags). > But I'm not familiar enough to propose such change... Such a change cannot be made. Consider this: Driver callbacks often need to sleep. But when a thread holds a spinlock, it is not allowed to sleep. Therefore driver callbacks must not be invoked while a spinlock is held. Besides, how will replacing a mutex with a spinlock prevent any deadlock problems? If the new locks get held at the same time as the old mutexes were held, won't the same deadlocks occur? Alan Stern > [1] https://lkml.kernel.org/r/8c3fc3d1-8fed-be22-e0e7-ef1e1ea723ce@xxxxxxxxxxxxxxxxxxx > [2] https://lkml.kernel.org/r/b7bc63c8-bb28-d21d-7c3f-97e4e79a9292@xxxxxxxxxxxxxxxxxxx