Hi Borislav, On 05/05/18 13:27, Borislav Petkov wrote: > On Fri, Apr 27, 2018 at 04:35:05PM +0100, James Morse wrote: >> Arm64 has multiple NMI-like notifications, but ghes.c only has one >> in_nmi() path, risking deadlock if one NMI-like notification can >> interrupt another. >> >> To support this we need a fixmap entry and lock for each notification >> type. But ghes_probe() attempts to process each struct ghes at probe >> time, to ensure any error that was notified before ghes_probe() was >> called has been done, and the buffer released (and maybe acknowledge >> to firmware) so that future errors can be delivered. >> >> This means NMI-like notifications need two fixmap entries and locks, >> one for the ghes_probe() time call, and another for the actual NMI >> that could interrupt ghes_probe(). >> >> Split this single path up by adding an NMI fixmap idx and lock into >> the struct ghes. Any notification that can be called as an NMI can >> use these to separate its resources from any other notification it >> may interrupt. >> >> The majority of notifications occur in IRQ context, so unless its >> called in_nmi(), ghes_copy_tofrom_phys() will use the FIX_APEI_GHES_IRQ >> fixmap entry and the ghes_fixmap_lock_irq lock. This allows >> NMI-notifications to be processed by ghes_probe(), and then taken >> as an NMI. >> >> The double-underscore version of fix_to_virt() is used because the index >> to be mapped can't be tested against the end of the enum at compile >> time. >> @@ -986,6 +960,8 @@ int ghes_notify_sea(void) >> >> static void ghes_sea_add(struct ghes *ghes) >> { >> + ghes->nmi_fixmap_lock = &ghes_fixmap_lock_nmi; >> + ghes->nmi_fixmap_idx = FIX_APEI_GHES_NMI; >> ghes_estatus_queue_grow_pool(ghes); >> >> mutex_lock(&ghes_list_mutex); >> @@ -1032,6 +1008,8 @@ static int ghes_notify_nmi(unsigned int cmd, struct pt_regs *regs) >> >> static void ghes_nmi_add(struct ghes *ghes) >> { >> + ghes->nmi_fixmap_lock = &ghes_fixmap_lock_nmi; > > Ewww, we're assigning the spinlock to a pointer which we'll take later? > Yuck. > Why? So that APEI doesn't need to know which lock goes with which fixmap page, and how these notifications interact. > Do I see it correctly that one can have ACPI_HEST_NOTIFY_SEA and > ACPI_HEST_NOTIFY_NMI coexist in parallel on a single system? NOTIFY_NMI is x86's NMI, arm doesn't have anything that behaves in the same way, so doesn't use it. The equivalent notifications with NMI-like behaviour are: * SEA (synchronous external abort) * SEI (SError Interrupt) * SDEI (software delegated exception interface) > If not, you can use a single spinlock. Today we could, but once we have SDEI and SEI this won't work: SDEI behaves as two notifications, 'normal' and 'critical', a different fixmap page is needed for these as they can interrupt each other, and a different lock. SEA can interrupt SEI, so they need a different fixmap-pages and locks. We can always disable SEI when we're handling another NMI-like notification. I doubt anyone would implement all three, but if they did SEA can interrupt the lot. I'd like to avoid describing any of these interactions in ghes.c, I think it should be possible that any notification can interrupt any other notification without the risk of deadlock. > If yes, then I'd prefer to make it less ugly and do the notification > type check ghes_probe() does: > > switch (generic->notify.type) > > and take the respective spinlock in ghes_copy_tofrom_phys(). This way it > is a bit better than using a spinlock ptr. I wanted to avoid duplicating that list, some of the locks are #ifdef'd so it gets ugly quickly. (We would only need the NMI-like notifications though). I'd really like to avoid the GHES code having to know about normal/critical SDEI events. Alternatively, I can put the fixmap-page and spinlock in some 'struct ghes_notification' that only the NMI-like struct-ghes need. This is just moving the indirection up a level, but it does pair the lock with the thing it locks, and gets rid of assigning spinlock pointers. Thanks, James -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html