On Fri, Sep 25, 2015 at 08:28:07PM +0900, Hidehiro Kawai wrote: > --- a/arch/x86/kernel/reboot.c > +++ b/arch/x86/kernel/reboot.c > @@ -718,6 +718,7 @@ void machine_crash_shutdown(struct pt_regs *regs) > static nmi_shootdown_cb shootdown_callback; > > static atomic_t waiting_for_crash_ipi; > +static int crash_ipi_done; > > static int crash_nmi_callback(unsigned int val, struct pt_regs *regs) > { > @@ -779,6 +780,7 @@ void nmi_shootdown_cpus(nmi_shootdown_cb callback) > wmb(); > > smp_send_nmi_allbutself(); > + crash_ipi_done = 1; /* Kick cpus looping in nmi context */ I would suggest using WRITE_ONCE() for that, because without the volatile the compiler need not actually emit the store until after the whole waiting thing _IF_ it can inline the whole thing. Currently udelay() will end up being a function call and will therefore force the store to be emitted, but I'd rather not rely on that. > > msecs = 1000; /* Wait at most a second for the other cpus to stop */ > while ((atomic_read(&waiting_for_crash_ipi) > 0) && msecs) {