On Fri, Sep 09, 2016 at 10:21:39AM +0200, Peter Zijlstra wrote: > On Fri, Sep 09, 2016 at 11:06:01AM +1000, Dave Chinner wrote: > > On Thu, Sep 08, 2016 at 08:54:42AM +0200, Peter Zijlstra wrote: > > From my perspective, lockdep is a very poor replacement for > > architecting a robust locking model and then implementing code that > > directly asserts that the correct locks are held at the correct > > time. > > Very few people people architect locking :-/, And so lockdep was written to help catch such situations. Instead of encouraging people more careful about locking, it has become a crutch that people lean on: "if lockdep is silent, my locking must be good now"... Law of unintended consequences, eh? > but lockdep as the assert, > and I've been encouraging people to use that instead of comments like: > > /* this function should be called with foo lock held */ Yup, that's been a standard practice in XFS since day zero. I'm glad to see that locking development practices are slowly catching up with the state of the art from the early 90s. :P > Now, the problem is that lockdep also asserts the caller is the lock > owner, and not some other random thing. Yeah, it is does not handle the "lock follows object" model used by multi-process asynchronous execution engines (which is pretty much what the linux IO subsystem is). As XFS is likely to move more towards async execution in future, the need for non-owner support in semaphores is more important than ever... > And given the amount of locking fail caught by lockdep (still..) you > really cannot argue against it. Sure, but I'm not saying "remove lockdep". What I'm saying is that /lockdep does not define locking primitve behaviour/. Lockdep is *only a debugging tool*. IMO, it is wrong to restrict the semantics of a lock type because of the limitations of a debugging tool. All that does is encourage people to invent their own locking primitives that are hidden from the debugging tool and are more likely to be broken than not. We've done this several times in XFS, and we've even propagated such frakenstein infrastructure into the VFS to save others the pain of inventing their own non-owner exclusion mechanisms... > > > you should use > > > {down,up}_read_non_owner(). > > > I'm not sure we've got write_non_owner() variants for this. > > > > For the case Christoph reported, it's the write side context of the > > inode locks that is handed off to other threads. And no, we don't > > have annotations for that. > > So the xfs mrlock already uses rwsem, semantics have not changed. So the > case Christoph found should already conform to the owner semantics. Which, until there was "spin on owner" added to the rwsems, the rwsem did not track ownership of the lock. i.e. prior to 2014 (commit 4fc828e "locking/rwsem: Support optimistic spinning"), the rwsem contained: struct rw_semaphore { long count; raw_spinlock_t wait_lock; struct list_head wait_list; #ifdef CONFIG_DEBUG_LOCK_ALLOC struct lockdep_map dep_map; #endif }; i.e. historically the only "ownership dependency" a rwsem has ever had was added by lockdep, which then required annotations to tell lockdep the usage model of the lock. And even now the optimistic spinning code doesn't actually use the owner for checking ownership - the value is simply a special cookie to determine if the lock is held in read or write mode... > I've not looked at code, but if the worker is synchronized against the > critical section (it'd pretty much have to be to rely on its locking) > there's nothing wrong, its just that the lockdep_assert_held() thingies > cannot work as it. > > That is: > > task A task B > > down_write(&rwsem); > queue_work(&work); > worker() > lockdep_assert_held(&rwsem); > flush_work(&work); > up_write(&rwsem); > > > Doesn't work. Explicitly so in fact. Then lockdep behaviour is incorrect for rwsems. By definition, semaphores have explicit non-owner semantics - a semaphore with strict task ownership constraints is a mutex, not a semaphore.... > Does the worker have a pointer back to taskA ? Nope. it runs the work while task A sleeps on a completion. When it's done, task A is woken and it continues. And, FWIW, we can take locks on objects (e.g. buffers) in task B that are then released by task A, too. Unsurprisingly, those objects are also protected by semaphores, explicitly because of the non-owner behaviour those object locks have... > I could try something > like lockdep_assert_held_by(lock, task), just need to be careful, > the per-task lock state is just that, per-task, no serialization. Add a lockdep init flag for rwsems to say "don't track owners" and we'll set it on the rwsems we pass to other contexts. Then you can make sure that lockdep asserts don't fire when we pass those locked objects to other tasks and/or get locked/unlocked by different tasks... Cheers, Dave. -- Dave Chinner david@xxxxxxxxxxxxx _______________________________________________ xfs mailing list xfs@xxxxxxxxxxx http://oss.sgi.com/mailman/listinfo/xfs