Re: Strange Uncorrectable Section Count Produced By Fio

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 29 June 2017 at 18:39, Forrest, Jon <nobozo@xxxxxxxxx> wrote:
>
> Running the fio job shown below seems to generate the expected
> i/o load, but it also caused the following output on the console:
>
> [...]
>
> WARNING: Your hard drive is failing
> Device: /dev/sdb [SAT], 37009733189632 Offline uncorrectable sectors
> WARNING: Your hard drive is failing
> Device: /dev/sdc [SAT], 54275501719552 Offline uncorrectable sectors
> WARNING: Your hard drive is failing
> Device: /dev/sdd [SAT], 71987946848256 Offline uncorrectable sectors
>
> On 6/29/2017 10:21 AM, Todd Lawall wrote:
>>
>> Correct me if I'm wrong everybody, but those look suspiciously like
>> errors coming from the kernel/driver rather than FIO.
>
> I agree but see below.
>
>> John, you could confirm this by looking for these messages in the output
>> of the
>> 'dmesg' command.   If they're from the driver, then there's further
>> questions for the authors of that code that you'd need to ask.
>
> Much to my surprise, 'dmesg' doesn't include those messages.
>
> Also, I received 12 email messages sent to 'root' from 'root' on that
> system with the same error messages, one message per disk.
>
> I hadn't thought these messages came from 'fio' directly, but
> I don't know where they come from. Whatever is producing them
> seems to be confused because the number of errors are incorrect.

That output looks like it comes from SMART monitoring of your disks
(i.e. you have a script regularly monitoring for changes in smartctl
output and sending the results to logwatch or emailing you). Further,
it looks like in your case smartctl doesn't know how to correctly
interpret the "Offline uncorrectable sectors" value - perhaps because
you're disks are "new" you need a more recent smartctl?

However, I'd be somewhat worried that the "Offline uncorrectable
sectors" are changing at all. It is one of the few values that has a
strong predictive power of a spinning disk being faulty (see
https://www.backblaze.com/blog/what-smart-stats-indicate-hard-drive-failures/
) but since you're on an SSD I don't know if it's predictive power is
as good.

-- 
Sitsofe | http://sucs.org/~sits/
--
To unsubscribe from this list: send the line "unsubscribe fio" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Kernel]     [Linux SCSI]     [Linux IDE]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux SCSI]

  Powered by Linux