On Mon, Oct 02, 2023 at 03:50:24PM -0400, Liang, Kan wrote: > Now, the NMI watchdog is using a "CPU-pinned" event. But I think it can > be replaced by the buddy system, commit 1f423c905a6b > ("watchdog/hardlockup: detect hard lockups using secondary (buddy) CPUs") For some simple cases. I've had plenty experience with situations where that thing would be completely useless. That is, at some point the 'all CPUs hard locked up' scenario was something I ran into a lot (although I can't for the life of me remember wtf I was doing at the time). All that needs is a single spin_lock_irqsave() on a common lock (or group of locks, like the rq->lock). Before you know it, the whole machine is a brick. That said; if you augment this thing with a bunch of CPUs that have HPET-NMI and IPI-NMI for backtraces, it might actually be useful.