Re: died again

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]



On Mon, Nov 25, 2013 at 10:45 AM, Michael Hennebry
<hennebry@xxxxxxxxxxxxxxxxxxxxx> wrote:
> >>      Keep an eagle eye on dmesg and the logs. If you can, bring
>> machine down and run memtest86 for a few hours (say, when you go to
>
> I've run the memory test that comes with the Fedora 13 install disk.
> My computer's memory got a clean bill of health.

I've seen a machine where it took 3+ days of running memtest86 to
catch the error.  And then after replacing the RAM, the machine still
crashed occasionally.  Turned out the software RAID1 mirrors had
mismatching contents caused by the bad RAM and even though it would
check clean, sometimes the read would come from the other mirror.
After fixing that, the server has run for years.

But in general, I always suspect power supplies first for mysterious crashes.

-- 
   Les Mikesell
     lesmikesell@xxxxxxxxx
_______________________________________________
CentOS mailing list
CentOS@xxxxxxxxxx
http://lists.centos.org/mailman/listinfo/centos




[Index of Archives]     [CentOS]     [CentOS Announce]     [CentOS Development]     [CentOS ARM Devel]     [CentOS Docs]     [CentOS Virtualization]     [Carrier Grade Linux]     [Linux Media]     [Asterisk]     [DCCP]     [Netdev]     [Xorg]     [Linux USB]
  Powered by Linux