On Wed, Apr 13 2022 at 15:11, Nicholas Piggin wrote: > So we traced the problem down to possibly a misunderstanding between > decrementer clock event device and core code. > > The decrementer is only oneshot*ish*. It actually needs to either be > reprogrammed or shut down otherwise it just continues to cause > interrupts. I always thought that PPC had sane timers. That's really disillusioning. > Before commit 35de589cb879, it was sort of two-shot. The initial > interrupt at the programmed time would set its internal next_tb variable > to ~0 and call the ->event_handler(). If that did not set_next_event or > stop the timer, the interrupt will fire again immediately, notice > next_tb is ~0, and only then stop the decrementer interrupt. > > So that was already kind of ugly, this patch just turned it into a hang. > > The problem happens when the tick is stopped with an event still > pending, then tick_nohz_handler() is called, but it bails out because > tick_stopped == 1 so the device never gets programmed again, and so it > keeps firing. > > How to fix it? Before commit a7cba02deced, powerpc's decrementer was > really oneshot, but we would like to avoid doing that because it requires > additional programming of the hardware on each timer interrupt. We have > the ONESHOT_STOPPED state which seems to be just about what we want. > > Did the ONESHOT_STOPPED patch just miss this case, or is there a reason > we don't stop it here? This patch seems to fix the hang (not heavily > tested though). This was definitely overlooked, but it's arguable it is is not required for real oneshot clockevent devices. This should only handle the case where the interrupt was already pending. The ONESHOT_STOPPED state was introduced to handle the case where the last timer gets canceled, so the already programmed event does not fire. It was not necessarily meant to "fix" clockevent devices which are pretending to be ONESHOT, but keep firing over and over. That, said. I'm fine with the change along with a big fat comment why this is required. Thanks, tglx