Re: Unexpected system error Notifications

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



George N. White III
On 2016-03-01 6:24 PM, "Rick Stevens" <ricks@xxxxxxxxxxxxxx> wrote:
>
> On 03/01/2016 02:14 PM, CS DBA wrote:
>>
>>
>>
>> On 03/01/2016 03:12 PM, Rick Stevens wrote:
>>>
>>> On 03/01/2016 02:06 PM, CS DBA wrote:
>>>>
>>>> All;
>>>>
>>>> I'm seeing this notification multiple times a day (from 5 - 30 minutes
>>>> apart):
>>>>
>>>> Unexpected system error
>>>> The system has encountered a problem and recovered.
>>>>
>>>> Name:        kernel-core
>>>>
>>>> Version:     4.4.2.301.fc23.x86_64
>>>>
>>>> Detected:   3 minutes ago
>>>>
>>>> Reported:  cannot be reported
>>>>
>>>> The kernel log indicates that hardware errors were detected.
>>>> This is most likely not a software problem.
>>>>
>>>>
>>>> I'm running Fedora 23, KDE Plasma Spin with KDE and Cinnamon installed.
>>>> On a Lenove Thinkpad X1 Carbon (3rd Gen)
>>>>
>>>> I wonder if I should be doing something to trap & report?
>>>
>>>
>>> Have a look at the output of dmesg. It may be apparent there what the
>>> issue is. Most likely it's some bad memory or possibly some errant
>>> firmware--either in the BIOS or a device (e.g. a wifi card or
>>> something).

Or cooling issues -- see below.

>>>
>>> If it's not that, it could be your power supply starting to have issues.
>>> Wonky power supplies do lots of odd things.
>>> ----------------------------------------------------------------------
>>> - Rick Stevens, Systems Engineer, AllDigital ricks@xxxxxxxxxxxxxx -
>>> - AIM/Skype: therps2        ICQ: 226437340           Yahoo: origrps2 -
>>> - -
>>> -         "If you can't fix it...duct tape it!"  -- Tim Allen        -
>>> ----------------------------------------------------------------------
>>
>>
>>
>> I see lots of these:
>>
>> [ 8521.141831] mce: [Hardware Error]: Machine check events logged
>> [ 8521.141834] CPU0: Package temperature above threshold, cpu clock

A few years ago we had several ThinkPads that became unstable due to accumulated dust in the CPU cooling ducts, fins, and fans. Opening the up and liberal application of  canned air solved the problem until dust built up again.  These systems would slow down noticeably before they became unstable as the clock was slowed in an attempt to reduce CPU temps.

>> throttled (total events = 280)
>> [ 8521.141837] mce: [Hardware Error]: Machine check events logged
>> [ 8521.142829] CPU1: Core temperature/speed normal
>> [ 8521.142830] CPU2: Package temperature/speed normal
>> [ 8521.142831] CPU3: Package temperature/speed normal
>> [ 8521.142834] CPU0: Core temperature/speed normal
>> [ 8521.142836] CPU0: Package temperature/speed normal
>> [ 8521.142838] CPU1: Package temperature/speed normal
>>
>>
>> Should I be concerned?  Anything I can do?
>
>
> Well, yeah, it appears you have a CPU fan that's dying, or the heatsink
> is very dirty. I'd recommend opening up the box and looking at the fans
> (especially the one on the CPU) and verify they're turning at the
> correct speed and not binding up. Fans are cheap and they're generally
> built cheaply as well. I've had to replace dozens of them over the
> years.
>
> I'd also see if you can get a vacuum to clean the interior and a
> "can o' air" to blow the cruft out of all the heatsinks. When you do
> that, do it from the SIDE of the heatsinks parallel to the fins, not
> down on top of them and do it GENTLY or you may force dirt into the
> sockets--a "bad thing".
>
> I generally open my boxes and give them a real good cleaning at least
> once a year. I replace the fans every two years. Cheap insurance--I had
> a CPU cook itself once.
>
> ----------------------------------------------------------------------
> - Rick Stevens, Systems Engineer, AllDigital    ricks@xxxxxxxxxxxxxx -
> - AIM/Skype: therps2        ICQ: 226437340           Yahoo: origrps2 -
> -                                                                    -
> -         The world is coming to an end ... SAVE YOUR FILES!!!       -
> ----------------------------------------------------------------------
>
> --
> users mailing list
> users@xxxxxxxxxxxxxxxxxxxxxxx
> To unsubscribe or change subscription options:
> https://admin.fedoraproject.org/mailman/listinfo/users
> Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct
> Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines
> Have a question? Ask away: http://ask.fedoraproject.org

-- 
users mailing list
users@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe or change subscription options:
https://admin.fedoraproject.org/mailman/listinfo/users
Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct
Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines
Have a question? Ask away: http://ask.fedoraproject.org
[Index of Archives]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [EPEL Devel]     [Fedora Magazine]     [Fedora Summer Coding]     [Fedora Laptop]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Desktop]     [Fedora Fonts]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Yosemite News]     [Gnome Users]     [KDE Users]     [Fedora Art]     [Fedora Docs]     [Fedora Sparc]     [Libvirt Users]     [Fedora ARM]

  Powered by Linux