On Fri, Apr 27, 2018 at 04:35:05PM +0100, James Morse wrote: > Arm64 has multiple NMI-like notifications, but ghes.c only has one > in_nmi() path, risking deadlock if one NMI-like notification can > interrupt another. > > To support this we need a fixmap entry and lock for each notification > type. But ghes_probe() attempts to process each struct ghes at probe > time, to ensure any error that was notified before ghes_probe() was > called has been done, and the buffer released (and maybe acknowledge > to firmware) so that future errors can be delivered. > > This means NMI-like notifications need two fixmap entries and locks, > one for the ghes_probe() time call, and another for the actual NMI > that could interrupt ghes_probe(). > > Split this single path up by adding an NMI fixmap idx and lock into > the struct ghes. Any notification that can be called as an NMI can > use these to separate its resources from any other notification it > may interrupt. > > The majority of notifications occur in IRQ context, so unless its > called in_nmi(), ghes_copy_tofrom_phys() will use the FIX_APEI_GHES_IRQ > fixmap entry and the ghes_fixmap_lock_irq lock. This allows > NMI-notifications to be processed by ghes_probe(), and then taken > as an NMI. > > The double-underscore version of fix_to_virt() is used because the index > to be mapped can't be tested against the end of the enum at compile > time. > > Signed-off-by: James Morse <james.morse@xxxxxxx> > > --- > Changes since v1: > * Fixed for ghes_proc() always calling every notification in process context. > Now only NMI-like notifications need an additional fixmap-slot/lock. ... > @@ -986,6 +960,8 @@ int ghes_notify_sea(void) > > static void ghes_sea_add(struct ghes *ghes) > { > + ghes->nmi_fixmap_lock = &ghes_fixmap_lock_nmi; > + ghes->nmi_fixmap_idx = FIX_APEI_GHES_NMI; > ghes_estatus_queue_grow_pool(ghes); > > mutex_lock(&ghes_list_mutex); > @@ -1032,6 +1008,8 @@ static int ghes_notify_nmi(unsigned int cmd, struct pt_regs *regs) > > static void ghes_nmi_add(struct ghes *ghes) > { > + ghes->nmi_fixmap_lock = &ghes_fixmap_lock_nmi; Ewww, we're assigning the spinlock to a pointer which we'll take later? Yuck. Why? Do I see it correctly that one can have ACPI_HEST_NOTIFY_SEA and ACPI_HEST_NOTIFY_NMI coexist in parallel on a single system? If not, you can use a single spinlock. If yes, then I'd prefer to make it less ugly and do the notification type check ghes_probe() does: switch (generic->notify.type) and take the respective spinlock in ghes_copy_tofrom_phys(). This way it is a bit better than using a spinlock ptr. Thx. -- Regards/Gruss, Boris. Good mailing practices for 400: avoid top-posting and trim the reply.