Re: Recent drive errors

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu 21 May 2015 09:58:48 AM Mikael Abrahamsson wrote:
> On Tue, 19 May 2015, Thomas Fjellstrom wrote:
> > How many UREs are considered "ok"? Tens, hundreds, thousands, tens of
> > thousands?
> 
> I will replace any drive that have developed UNC sectors a few times, so
> I'd say "less than 10".

In this case, it looked like 5 UNC errors for a single sector, and some weird 
latency patterns, till I ran badblocks -w on it, then it gave me > 10k 
relocated sectors and many thousands more uncorrectable sectors. Before the 
badblocks test, it "looked" ok, now It's most definitely dead.

> +1 on the "set kernel timeout to more than 120 seconds". I have this in
> /etc/rc.local:
> 
> for x in /sys/block/sd[a-z] ; do
>          echo 180  > $x/device/timeout
> done
> 
> echo 4096 > /sys/block/md0/md/stripe_cache_size

I presume it's ok to do that even if the drives do ERC/TLER? Just woke up, but 
my brain seems to be telling me it shouldn't break anything since the ERC 
drives should always return after 7s no matter what...

-- 
Thomas Fjellstrom
thomas@xxxxxxxxxxxxx
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux