On 2020-11-02 13:23:32 [+0100], To Mike Galbraith wrote: > On 2020-11-01 06:23:50 [+0100], Mike Galbraith wrote: > > Apparently, timer ain't ever supposed to be running when you get there > > via flush_delayed_work(), but it was. > > The first hunk will always trigger the warning and not just if the > slow/wait path is really used. The second part should fix it. After writing a proper changelog I realized that it doesn't fly. So I made dis: ------->8----- Subject: [PATCH] timers: Don't block on ->expiry_lock for TIMER_IRQSAFE PREEMPT_RT does not spin and wait until a running timer completes its callback but instead it blocks on a sleeping lock to prevent a deadlock. This blocking can not be done for workqueue's IRQ_SAFE timer which will be canceled in an IRQ-off region. It has to happen to in IRQ-off region because changing the PENDING bit and clearing the timer must not be interrupted to avoid a busy-loop. For the IRQ_SAFE timer it is guaranteed that the timer callback can not be preempted on PREEMPT_RT so there is no need to synchronize on timer_base::expiry_lock. Add TIMER_IRQSAFE to flags on which the timer_base::expiry_lock must not the acquired. Add a lockdep annotation to ensure that this function is always invoked in preemptible context on PREEMPT_RT. Reported-by: Mike Galbraith <efault@xxxxxx> Signed-off-by: Sebastian Andrzej Siewior <bigeasy@xxxxxxxxxxxxx> --- kernel/time/timer.c | 9 ++++++++- 1 file changed, 8 insertions(+), 1 deletion(-) diff --git a/kernel/time/timer.c b/kernel/time/timer.c index 7edc9fba34bbd..1e2403126ccfe 100644 --- a/kernel/time/timer.c +++ b/kernel/time/timer.c @@ -1288,7 +1288,7 @@ static void del_timer_wait_running(struct timer_list *timer) u32 tf; tf = READ_ONCE(timer->flags); - if (!(tf & TIMER_MIGRATING)) { + if (!(tf & (TIMER_MIGRATING | TIMER_IRQSAFE))) { struct timer_base *base = get_timer_base(tf); /* @@ -1372,6 +1372,13 @@ int del_timer_sync(struct timer_list *timer) */ WARN_ON(in_irq() && !(timer->flags & TIMER_IRQSAFE)); + /* + * Must be able to sleep on PREEMPT_RT because of the slowpath in + * del_timer_wait_running(). + */ + if (IS_ENABLED(CONFIG_PREEMPT_RT) && !(timer->flags & TIMER_IRQSAFE)) + lockdep_assert_preemption_enabled(); + do { ret = try_to_del_timer_sync(timer); -- 2.29.1 Sebastian