Re: raid1 issue after disk failure: both disks of the array are still active

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



CHECK didn't help me, so I did a echo "repair > /sys/block/md0/md/sync_action". REPAIR didn't work too :(

Here is syslog of REPAIR:

Sep 15 19:34:10 asterisk mdadm[2117]: RebuildStarted event detected on md device /dev/md/0 Sep 15 19:34:10 asterisk kernel: [258470.152296] md: requested-resync of RAID array md0 Sep 15 19:34:10 asterisk kernel: [258470.152301] md: minimum _guaranteed_ speed: 1000 KB/sec/disk. Sep 15 19:34:10 asterisk kernel: [258470.152304] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for requested-resync. Sep 15 19:34:10 asterisk kernel: [258470.152310] md: using 128k window, over a total of 311619448k. Sep 15 19:34:11 asterisk kernel: [258471.165653] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Sep 15 19:34:11 asterisk kernel: [258471.167468] ata3.00: BMDMA stat 0x44
Sep 15 19:34:11 asterisk kernel: [258471.169912] ata3.00: failed command: READ DMA EXT Sep 15 19:34:11 asterisk kernel: [258471.172769] ata3.00: cmd 25/00:00:00:15:00/00:04:00:00:00/e0 tag 0 dma 524288 in Sep 15 19:34:11 asterisk kernel: [258471.172771] res 51/40:00:90:17:00/40:00:00:00:00/e0 Emask 0x9 (media error) Sep 15 19:34:11 asterisk kernel: [258471.176753] ata3.00: status: { DRDY ERR }
Sep 15 19:34:11 asterisk kernel: [258471.178605] ata3.00: error: { UNC }
Sep 15 19:34:12 asterisk kernel: [258472.148217] ata3.00: configured for UDMA/133
Sep 15 19:34:12 asterisk kernel: [258472.148232] ata3: EH complete
Sep 15 19:34:13 asterisk kernel: [258473.131054] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Sep 15 19:34:13 asterisk kernel: [258473.132881] ata3.00: BMDMA stat 0x44
Sep 15 19:34:13 asterisk kernel: [258473.134639] ata3.00: failed command: READ DMA EXT Sep 15 19:34:13 asterisk kernel: [258473.136413] ata3.00: cmd 25/00:00:00:15:00/00:04:00:00:00/e0 tag 0 dma 524288 in Sep 15 19:34:13 asterisk kernel: [258473.136415] res 51/40:00:90:17:00/40:00:00:00:00/e0 Emask 0x9 (media error) Sep 15 19:34:13 asterisk kernel: [258473.141768] ata3.00: status: { DRDY ERR }
Sep 15 19:34:13 asterisk kernel: [258473.144049] ata3.00: error: { UNC }
Sep 15 19:34:14 asterisk kernel: [258474.112209] ata3.00: configured for UDMA/133
Sep 15 19:34:14 asterisk kernel: [258474.112224] ata3: EH complete
Sep 15 19:34:15 asterisk kernel: [258475.071642] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Sep 15 19:34:15 asterisk kernel: [258475.073476] ata3.00: BMDMA stat 0x44
Sep 15 19:34:15 asterisk kernel: [258475.075240] ata3.00: failed command: READ DMA EXT Sep 15 19:34:15 asterisk kernel: [258475.077027] ata3.00: cmd 25/00:00:00:15:00/00:04:00:00:00/e0 tag 0 dma 524288 in Sep 15 19:34:15 asterisk kernel: [258475.077029] res 51/40:00:90:17:00/40:00:00:00:00/e0 Emask 0x9 (media error) Sep 15 19:34:15 asterisk kernel: [258475.080720] ata3.00: status: { DRDY ERR }
Sep 15 19:34:15 asterisk kernel: [258475.083512] ata3.00: error: { UNC }
Sep 15 19:34:16 asterisk kernel: [258476.100935] ata3.00: configured for UDMA/133
Sep 15 19:34:16 asterisk kernel: [258476.100960] ata3: EH complete
Sep 15 19:41:29 asterisk asterisk[3492]: rc_avpair_new: unknown attribute 1490026597 Sep 15 19:41:46 asterisk asterisk[3492]: rc_avpair_new: unknown attribute 1490026597 Sep 15 19:41:52 asterisk asterisk[3492]: rc_avpair_new: unknown attribute 1490026597 Sep 15 19:42:52 asterisk asterisk[3492]: rc_avpair_new: unknown attribute 1490026597 Sep 15 19:46:34 asterisk smartd[2581]: Device: /dev/sda [SAT], 2 Currently unreadable (pending) sectors Sep 15 19:46:34 asterisk smartd[2581]: Device: /dev/sda [SAT], 1 Offline uncorrectable sectors Sep 15 19:50:51 asterisk mdadm[2117]: Rebuild26 event detected on md device /dev/md/0 Sep 15 20:07:31 asterisk mdadm[2117]: Rebuild53 event detected on md device /dev/md/0 Sep 15 20:16:34 asterisk smartd[2581]: Device: /dev/sda [SAT], 2 Currently unreadable (pending) sectors Sep 15 20:16:34 asterisk smartd[2581]: Device: /dev/sda [SAT], 1 Offline uncorrectable sectors Sep 15 20:16:34 asterisk smartd[2581]: Device: /dev/sda [SAT], Temperature changed +4 Celsius to 42 Celsius (Min/Max 30/46) Sep 15 20:16:34 asterisk smartd[2581]: Device: /dev/sda [SAT], SMART Usage Attribute: 201 Soft_Read_Error_Rate changed from 99 to 100 Sep 15 20:16:34 asterisk smartd[2581]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 61 to 60 Sep 15 20:24:11 asterisk mdadm[2117]: Rebuild75 event detected on md device /dev/md/0 Sep 15 20:40:51 asterisk mdadm[2117]: Rebuild93 event detected on md device /dev/md/0 Sep 15 20:46:34 asterisk smartd[2581]: Device: /dev/sda [SAT], 2 Currently unreadable (pending) sectors Sep 15 20:46:34 asterisk smartd[2581]: Device: /dev/sda [SAT], 1 Offline uncorrectable sectors Sep 15 20:46:34 asterisk smartd[2581]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 61 to 60 Sep 15 20:47:24 asterisk kernel: [262863.781068] md: md0: requested-resync done. Sep 15 20:47:24 asterisk mdadm[2117]: RebuildFinished event detected on md device /dev/md/0



I still get:

Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Offline Completed: read failure 90% 8985 3912

and

197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 2 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 1


How is it possible? Next thing I will try is manually failing /dev/sda and filling it with zeros. I would like to do a *low level format* but I didn't find the utility for my disk :(

Disk is:

=== START OF INFORMATION SECTION ===
Model Family:     SAMSUNG SpinPoint F1 DT
Device Model:     SAMSUNG HD322HJ
Serial Number:    S17AJDWQ402689
LU WWN Device Id: 5 0000f0 003046298
Firmware Version: 1AC01110
User Capacity:    320,072,933,376 bytes [320 GB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 3b
Local Time is:    Sat Sep 15 21:02:36 2012 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===



root@asterisk:~# smartctl -a /dev/sda -P show
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.2.0-2-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Drive found in smartmontools Database.  Drive identity strings:
MODEL:              SAMSUNG HD322HJ
FIRMWARE:           1AC01110
match smartmontools Drive Database entry:
MODEL REGEXP: SAMSUNG HD(083G|16[12]G|25[12]H|32[12]H|50[12]I|642J|75[23]L|10[23]U)J
FIRMWARE REGEXP:    .*
MODEL FAMILY:       SAMSUNG SpinPoint F1 DT
ATTRIBUTE OPTIONS:  None preset; no -v options are required.


Thanks,
Niccolò
--
http://www.linuxsystems.it
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux