Re: raid 5 mismatch_cnt errors

On Thu, 20 May 2010 12:02:23 -0500
Trey Scarborough <treys@xxxxxxxxxxxxxx> wrote:

> I have a RAID 5 array with 9 disks and a mismatch_cnt that keeps
> growing. This is causing file corruption on the underlying file systems
> as well.  I can copy a group of 100 100 MB files and then do an md5sum on
> them, and 1-3 will be corrupt. If this is a bad drive, is there
> any way to run a report on the count per drive of where these mismatches
> occur? I have run smarttools tests and do not see one drive that stands
> out as causing errors. Could something else be causing these errors?


When RAID5 detects an inconsistency in a stripe, there is no way to know which
device was wrong.
SMART only detects some errors, not all.
I have had hard drives which appeared to have a single-bit error in their
internal buffer.  No error would be reported, but the data you read would
sometimes be wrong.
RAID5 cannot help you with this sort of error.
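
To illustrate why a parity mismatch cannot be pinned on one device, here is a
minimal Python sketch of single-parity XOR as RAID5 uses it. The block
contents and the helper names (xor_blocks, stripe_is_consistent) are made up
for illustration; this is not md's code.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, column by column."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def stripe_is_consistent(data_blocks, parity):
    """A stripe is consistent when the XOR of the data blocks equals the parity."""
    return xor_blocks(data_blocks) == parity


# Hypothetical 2-byte blocks standing in for three data devices.
data = [b"\x11\x22", b"\x33\x44", b"\x55\x66"]
parity = xor_blocks(data)

# Case A: one bit flips in the second data block.
corrupt_data = [data[0], bytes([data[1][0] ^ 0x01, data[1][1]]), data[2]]

# Case B: the same bit flips in the parity block instead.
corrupt_parity = bytes([parity[0] ^ 0x01, parity[1]])

# XOR over every block in the stripe (data plus parity) is all zeros when the
# stripe is consistent. Both corruption cases leave the same non-zero result,
# so the check says "something is wrong" but not "which block is wrong".
syndrome_a = xor_blocks(corrupt_data + [parity])
syndrome_b = xor_blocks(data + [corrupt_parity])
```

Both syndromes come out identical, which is all a single-parity check has to
work with.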

I would suggest backing up all your data (if it isn't already too late),
breaking the array, and testing each device individually,
e.g. create a filesystem on each device, then try copying data onto it and
reading it back.
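
One way to do that copy-and-read-back test is sketched below in Python. The
function names, file names, and default sizes are assumptions chosen for
illustration; the directory you pass in would be the mount point of the
filesystem created on the device under test.

```python
import hashlib
import os


def write_test_files(directory, count=100, size=1024 * 1024):
    """Write `count` files of random bytes; return {path: sha256 of what was written}."""
    expected = {}
    for i in range(count):
        payload = os.urandom(size)
        path = os.path.join(directory, f"testfile_{i:03d}.bin")
        with open(path, "wb") as f:
            f.write(payload)
            f.flush()
            os.fsync(f.fileno())  # ask the kernel to push the data to the device
        expected[path] = hashlib.sha256(payload).hexdigest()
    return expected


def verify_test_files(expected):
    """Re-read every file and return the paths whose contents no longer match."""
    bad = []
    for path, digest in expected.items():
        h = hashlib.sha256()
        with open(path, "rb") as f:
            for chunk in iter(lambda: f.read(1 << 20), b""):
                h.update(chunk)
        if h.hexdigest() != digest:
            bad.append(path)
    return bad
```

Call write_test_files() on the mount point, then verify_test_files() on the
returned dictionary. Note that a read straight after the write may be served
from the page cache rather than the disk, so unmount and remount the
filesystem (or otherwise drop caches) between the two steps, and repeat the
run several times, since this kind of error is intermittent.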

NeilBrown
