Argh, forgot to cc the userfaultfd people. Sorry. On Thu, Dec 14, 2017 at 04:58:09AM -0800, Matthew Wilcox wrote: > On Wed, Dec 13, 2017 at 05:03:00PM -0800, Andrew Morton wrote: > > > sched/wait: assert the wait_queue_head lock is held in __wake_up_common > > > > > > Better ensure we actually hold the lock using lockdep than just commenting > > > on it. Due to the various exported _locked interfaces it is far too easy > > > to get the locking wrong. > > > > I'm probably sitting on an older version. I've dropped > > > > epoll: use the waitqueue lock to protect ep->wq > > sched/wait: assert the wait_queue_head lock is held in __wake_up_common > > Looks pretty clear to me that userfaultfd is also abusing the wake_up_locked > interfaces: > > spin_lock(&ctx->fault_pending_wqh.lock); > __wake_up_locked_key(&ctx->fault_pending_wqh, TASK_NORMAL, &range); > __wake_up_locked_key(&ctx->fault_wqh, TASK_NORMAL, &range); > spin_unlock(&ctx->fault_pending_wqh.lock); > > Sure, it's locked, but not by the lock you thought it was going to be. > > There doesn't actually appear to be a bug here; fault_wqh is always serialised > by fault_pending_wqh.lock, but lockdep can't know that. I think this patch > will solve the problem. > > diff --git a/fs/userfaultfd.c b/fs/userfaultfd.c > index ac9a4e65ca49..a39bc3237b68 100644 > --- a/fs/userfaultfd.c > +++ b/fs/userfaultfd.c > @@ -879,7 +879,7 @@ static int userfaultfd_release(struct inode *inode, struct file *file) > */ > spin_lock(&ctx->fault_pending_wqh.lock); > __wake_up_locked_key(&ctx->fault_pending_wqh, TASK_NORMAL, &range); > - __wake_up_locked_key(&ctx->fault_wqh, TASK_NORMAL, &range); > + __wake_up(&ctx->fault_wqh, TASK_NORMAL, 1, &range); > spin_unlock(&ctx->fault_pending_wqh.lock); > > /* Flush pending events that may still wait on event_wqh */ > @@ -1045,7 +1045,7 @@ static ssize_t userfaultfd_ctx_read(struct userfaultfd_ctx *ctx, int no_wait, > * anyway. > */ > list_del(&uwq->wq.entry); > - __add_wait_queue(&ctx->fault_wqh, &uwq->wq); > + add_wait_queue(&ctx->fault_wqh, &uwq->wq); > > write_seqcount_end(&ctx->refile_seq); > > @@ -1194,7 +1194,7 @@ static void __wake_userfault(struct userfaultfd_ctx *ctx, > __wake_up_locked_key(&ctx->fault_pending_wqh, TASK_NORMAL, > range); > if (waitqueue_active(&ctx->fault_wqh)) > - __wake_up_locked_key(&ctx->fault_wqh, TASK_NORMAL, range); > + __wake_up(&ctx->fault_wqh, TASK_NORMAL, 1, range); > spin_unlock(&ctx->fault_pending_wqh.lock); > } > > > -- > To unsubscribe, send a message with 'unsubscribe linux-mm' in > the body to majordomo@xxxxxxxxx. For more info on Linux MM, > see: http://www.linux-mm.org/ . > Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a> -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>