Re: HDD reports errors while completing RAID6 array check

On 10 June 2011 19:23, Mathias Burén <mathias.buren@xxxxxxxxx> wrote:
> On 10 June 2011 19:00, Roman Mamedov <rm@xxxxxxxxxx> wrote:
>> On Fri, 10 Jun 2011 18:37:06 +0100
>> Mathias Burén <mathias.buren@xxxxxxxxx> wrote:
>>
>>>   9 Power_On_Hours          0x0032   090   090   000    Old_age   Always       -       7781
>>
>>
>>> # 1  Extended offline    Completed without error       00%      6827         -
>>> # 2  Extended offline    Completed without error       00%      6550         -
>>> # 3  Extended offline    Completed without error       00%      6468         -
>>> # 4  Extended offline    Completed without error       00%      6329         -
>>> # 5  Extended offline    Completed without error       00%      6040         -
>>> # 6  Extended offline    Completed without error       00%      5584         -
>>> # 7  Extended offline    Completed without error       00%      5178         -
>>> # 8  Extended offline    Completed without error       00%      4761         -
>>> # 9  Short offline       Completed without error       00%      2285         -
>>> #10  Extended offline    Completed without error       00%      1514         -
>>
>> I suggest that you run another "smartctl -t long" on it. The latest one was
>> done almost 1000 hours ago, which is also much longer than the interval between
>> previous tests. Freezes on reads could be a symptom of a bad sector (unreadable,
>> or very slowly readable, which is worse); perhaps the SMART test could detect
>> it. Alternatively, do a full read of the drive directly (not through the
>> RAID), e.g. with "badblocks", and see if you get any I/O errors that way.
>>
>> --
>> With respect,
>> Roman
>>
>
> Thanks for the suggestions, I'll start the long selftest now.
>
> /M
>
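For reference, the "almost 1000 hours" figure can be checked from the numbers
quoted above. This is only a small sketch: selftest_gaps is a made-up helper
name, and the values are copied by hand from the quoted Power_On_Hours row and
the LifeTime(hours) column of the self-test log.

```shell
# Print the hours between the current Power_On_Hours and the newest self-test,
# then the gaps between each pair of older tests.
# Arguments: current Power_On_Hours, then LifeTime(hours) values, newest first.
selftest_gaps() {
    prev=$1
    shift
    for t in "$@"; do
        echo $((prev - t))
        prev=$t
    done
}

# 7781 is the quoted Power_On_Hours; the rest are the five newest logged tests.
selftest_gaps 7781 6827 6550 6468 6329 6040
```

The first number printed is the gap since the last extended test (954 hours);
the following ones (277, 82, 139, 289) are the gaps between the earlier tests.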

Things look OK after the test:

 $ sudo smartctl -a /dev/sdd
Password:
smartctl 5.40 2010-10-16 r3189 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Green (Adv. Format) family
Device Model:     WDC WD20EARS-00MVWB0
Serial Number:    WD-WMAZ20188479
Firmware Version: 50.0AB50
User Capacity:    2,000,398,934,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Sat Jun 11 10:48:05 2011 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (36000) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 255) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x3035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   176   162   021    Pre-fail  Always       -       6183
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       59
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   090   090   000    Old_age   Always       -       7797
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       53
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       32
193 Load_Cycle_Count        0x0032   162   162   000    Old_age   Always       -       114863
194 Temperature_Celsius     0x0022   109   102   000    Old_age   Always       -       41
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      7788         -
# 2  Extended offline    Completed without error       00%      6827         -
# 3  Extended offline    Completed without error       00%      6550         -
# 4  Extended offline    Completed without error       00%      6468         -
# 5  Extended offline    Completed without error       00%      6329         -
# 6  Extended offline    Completed without error       00%      6040         -
# 7  Extended offline    Completed without error       00%      5584         -
# 8  Extended offline    Completed without error       00%      5178         -
# 9  Extended offline    Completed without error       00%      4761         -
#10  Short offline       Completed without error       00%      2285         -
#11  Extended offline    Completed without error       00%      1514         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
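
By "OK" I mainly mean that the sector-related raw values are all zero. A rough
way to check that without reading the whole table, assuming the attribute rows
arrive one per line (as "smartctl -A" prints them before any mail wrapping) and
that the raw value is the last field, might look like the sketch below.
check_sector_attrs is just an illustrative name, not part of smartmontools.

```shell
# Print any sector-health attribute whose raw value is non-zero.
# Reads "smartctl -A"-style attribute rows on stdin, one attribute per line.
# Exit status is 0 if all watched attributes are zero, 1 otherwise.
check_sector_attrs() {
    awk '
        BEGIN { bad = 0 }
        # IDs 5, 196, 197, 198: reallocated, reallocation events,
        # pending, and offline-uncorrectable sectors.
        $1 == 5 || $1 == 196 || $1 == 197 || $1 == 198 {
            if ($NF + 0 != 0) { printf "%s raw=%s\n", $2, $NF; bad = 1 }
        }
        END { exit bad }
    '
}

# Sample rows copied from the output above, joined onto single lines.
sample='  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0'

if printf '%s\n' "$sample" | check_sector_attrs; then
    echo "no reallocated or pending sectors reported"
fi
```

On a live system the same function could be fed directly, e.g.
"sudo smartctl -A /dev/sdd | check_sector_attrs".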

I initiated a self-test on each of the other HDDs as well. It's time
to run badblocks then!
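
Roughly what I have in mind per drive, as a sketch only: plan_drive_checks is
a throwaway helper that prints the commands rather than running them, and the
device list is whatever you pass in (only /dev/sdd is named in this thread).
badblocks without -n or -w does a read-only scan, and it is pointed at the raw
drive rather than the md device, as Roman suggested.

```shell
# Print, for each drive given, the commands to confirm the self-test result
# and then do a read-only surface scan. Nothing is executed here.
plan_drive_checks() {
    for dev in "$@"; do
        echo "smartctl -l selftest $dev"   # show the self-test log for this drive
        echo "badblocks -sv $dev"          # read-only scan with progress, verbose
    done
}

plan_drive_checks /dev/sdd
```

The printed lines can be reviewed and then run as root once the long
self-tests have finished, so the scan does not compete with them for I/O.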

/M