On Tue, 26 Oct 2010, Andi Kleen wrote: > > So please explain why your error reporting is so different from the > > above that it justifies a separate facility. And you better come up > > with a real good explanation other than we looked at EDAC and it did > > not fit our needs. > > Well it didn't fit the needs at all. Is that not enough, Mr Inquisitor? No, it's not enough without a reasonable explanation. > Really if you want to nack and maintain and design things you need > to do a bit more than just arguing from two lines of > high level description. If you want to shove a new facility into the kernel you need to do a bit more than sending that stuff to LKML without explaining why you can't use existing facilities and why it's impossible to extend those existing facilities. So the questions arise on the high level because you failed to provide a reasonable explanation in the first place. > EDAC enumerates hardware and exports some hardware > registers and decodes a few errors in a format into > the kernel log that is hard to impossible to post-process. You are well aware that EDAC folks are working on consolidating the interfaces and providing better error reporting, but it's out of your interest so you just ignore that effort instead of working together? > APEI does nothing of that, so it doesn't fit into EDAC. Thanks for this overly detailed explanation. APEI is about error detection and error reporting, nothing else. So it fits into EDAC, which is an existing Error Detection and Correction reporting facility by definition. If you see EDAC has shortcomings, the proper answer to that is to go there and work with those folks to make EDAC a fully integrated facility. It's not like there's some disagreement with them. But you did not even try to talk to them about this and went straight ahead and implemented your own EDAC facility. > The main interface in APEI right now is to manage fatal > errors after reboot. ... That's completely irrelevant. It does not matter at all when and wherefrom error data come and whether they were generated before or after reboot. That is the hardware/firmware dependend side of things. Nobody is saying that this isn't necessary. But it matters very much how we report those errors. And it's not a completely unreasonable request to avoid separate interfaces for the very same problem, especially separate user space ABIs. If an existing facility has shortcomings, then the usual way is to extend it for the sake of all existing users of that facility. We try to consolidate stuff all over the place and avoid stuff which is artificially separate, and that applies to that area as well. Of course that might be more work and might change the design a bit, but in the end it's a benefit for everyone. Thanks, tglx -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html