On Monday, August 29, 2022 2:39 PM, Borislav Petkov wrote: > On Mon, Aug 29, 2022 at 03:59:28PM +0000, Yazen Ghannam wrote: > > GHES can be used for more than just memory errors. There are platforms where > > memory errors are handled through the OS MCA, and PCIe AER errors are handled > > through the FW, for example. > > > > Is the HPE Server platform guaranteed to always provide memory errors through > > GHES regardless of CPU vendor/architecture? > > /me looks in the direction of HPE folks... The HPE platforms enabled by the platform check are guaranteed to be operating in FW First mode, which FW decides which error to report to the OS via GHES or other means. This may include multiple CPU vendors/architecture. On such platforms, for instance, FW does not report corrected errors to the OS since FW manages the threshold & FRU notification. Chipset-specific edac drivers, designed for OS First mode, is not necessary on such platforms. Disabling such OS First edac driver is achieved by enabling ghes_edac as well. OS MCA is still used for uncorrected errors, such as SRAR (software recoverable action required) which requires recovery action synchronous to the execution via MCE signalling. Toshi