On Fri, May 19, 2017 at 02:32:02PM -0600, Tyler Baicar wrote: > When a memory error, CPU error, PCIe error, or other type of hardware error > that's covered by RAS occurs, firmware should populate the shared GHES memory > location with the proper GHES structures to notify the OS of the error. > For example, platforms that implement firmware first handling may implement > separate GHES sources for corrected errors and uncorrected errors. If the > error is an uncorrectable error, then the firmware will notify the OS > immediately since the error needs to be handled ASAP. The OS will then be able > to take the appropriate action needed such as offlining a page. If the error > is a corrected error, then the firmware will not interrupt the OS immediately. > Instead, the OS will see and report the error the next time it's GHES timer > expires. The kernel will first parse the GHES structures and report the errors > through the kernel logs and then notify the user space through RAS trace > events. This allows user space applications such as RAS Daemon to see the > errors and report them however the user desires. This patchset extends the > kernel functionality for RAS errors based on updates in the UEFI 2.6 and > ACPI 6.1 specifications. Thanks, I've pushed this out as: git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux.git for-next/ras-apei which I'll merge into for-next/core (and therefore linux-next) either the end of this week or the beginning of next week. Please take a look if you get a chance. Will