On Monday 23 July 2018 at 12:43 +0200, Oliver Freyermuth wrote:
> > There ARE chassis/BMC/IPMI level events, one of which is "CPU CATERR
> > Fault", with a timestamp matching the timestamps below, and no more
> > information.
>
> If this kind of failure (or a less severe one) also happens at
> runtime, mcelog should catch it.

I'll install mcelog ASAP, even though it probably wouldn't have added
much in that case.

> For CATERR errors, we also found that sometimes the web interface of
> the BMC shows more information for the event log entry
> than querying the event log via ipmitool - you may want to check
> this.

I got that from the web interface. ipmitool does not give more
information anyway (lots of "missing" and "unknown", and no
description...):

ipmitool> sel get 118
SEL Record ID          : 0076
 Record Type           : 02
 Timestamp             : 07/21/2018 01:58:48
 Generator ID          : 0020
 EvM Revision          : 04
 Sensor Type           : Unknown
 Sensor Number         : 76
 Event Type            : Sensor-specific Discrete
 Event Direction       : Assertion Event
 Event Data (RAW)      : 00ffff
 Event Interpretation  : Missing
 Description           :

Sensor ID              : CPU CATERR (0x76)
Entity ID              : 26.1
Sensor Type (Discrete): Unknown

-- 
Nicolas Huillard