Hi Jonathan, (Beware: this CCIX thing is new to me, some of this will be questions on the scope of CCIX) On 06/06/2019 13:36, Jonathan Cameron wrote: > UEFI 2.8 defines a new CPER record Appendix N for CCIX Protocol Error Records > (PER). www.uefi.org > These include Protocol Error Record logs which are defined in the > CCIX 1.0 Base Specification www.ccixconsortium.com. I can't find a public version of this spec, and I don't have an account to look at the private one. (It looks like I could get one but I'm guessing there is an NDA between me and the document text!) Will it ever be public? > Handling of coherency protocol errors is complex and how Linux does this > will take some time to evolve. For now, fatal errors are handled via the > usual means and everything else is reported. > There are 6 types of error defined, covering: > * Memory errors How does this interact with the vanilla Memory-Error CPER records? Do we always get them as a pair? (that struct definition looks familiar...) Is memory described like this never present in the UEFI memory map? (so we never accidentally use it as host memory, for processes etc) ~ include/linux/memremap.h has: | * Device memory that is cache coherent from device and CPU point of view. This | * is use on platform that have an advance system bus (like CAPI or CCIX). A | * driver can hotplug the device memory using ZONE_DEVICE and with that memory | * type. Any page of a process can be migrated to such memory. However no one | * should be allow to pin such memory so that it can always be evicted. Is this the same CCIX? If process memory can be migrated onto this stuff we need to kick off memory_failure() in response to memory errors, otherwise we don't mark the pages as poison, signal user-space etc. > * Cache errors > * Address translation unit errors > * CCIX port errors > * CCIX link errors It looks like this stuff operates 'over' PCIe[0]. How does this interact with AER? Will we always get a vanilla PCIE-AER CPER in addition to these two? (I see 'CHECK THE AER EQUIVALENT' comments in your series, are these TODO:?) > * Agent internal errors. > > The set includes tracepoints to report the errors to RAS Daemon and a patch > set for RAS Daemon will follow shortly. > > There are several open questions for this RFC. > 1. Reporting of vendor data. We have little choice but to do this via a > dynamic array as these blocks can take arbitrary size. I had hoped > no one would actually use these given the odd mismatch between a > standard error structure and non standard element, but there are > already designs out there that do use it. I think its okay to spit these blobs out of the trace points, but could we avoid printing them in the kernel log? > 2. The trade off between explicit tracepoint fields, on which we might > want to filter, and the simplicity of a blob. I have gone for having > the whole of the block specific to the PER error type in an opaque blob. > Perhaps this is not the right balance? (I suspect I don't understand this). The filtering can be done by user-space. Isn't that enough? I see the memory-error event format file has 'pa', which is all anyone is likely to care about. Do 'source/component' indicate the device? (what do I pull out of the machine to stop these happening?) > 3. Whether defining 6 new tracepoints is sensible. I think it is: > * They are all defined by the CCIX specification as independant error > classes. > * Many of them can only be generated by particular types of agent. > * The handling required will vary widely depending on types. > In the kernel some map cleanly onto existing handling. Keeping the > whole flow separate will aide this. They vary by a similar amount > in scope to the RAS errors found on an existing system which have > independent tracepoints. > * Separating them out allows for filtering on the tracepoints by > elements that are not shared between them. > * Muxing the lot into one record type can lead to ugly code both in > kernel and in userspace. Sold! > This patch is being distributed by the CCIX Consortium, Inc. (CCIX) to > you and other parties that are paticipating (the "participants") in the > Linux kernel with the understanding that the participants will use CCIX's > name and trademark only when this patch is used in association with the > Linux kernel and associated user space. > > CCIX is also distributing this patch to these participants with the > understanding that if any portion of the CCIX specification will be > used or referenced in the Linux kernel, the participants will not modify > the cited portion of the CCIX specification and will give CCIX propery > copyright attribution by including the following copyright notice with > the cited part of the CCIX specification: > "© 2019 CCIX CONSORTIUM, INC. ALL RIGHTS RESERVED." Is this text permission from the CCIX-Consortium to reproduce bits of their spec in the kernel? I'm guessing any one who can hold of the spec is also prevented from publishing it. (Nits: participating, proper) Thanks, James [0] https://en.wikichip.org/wiki/ccix