all of my drives are spares

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi, all --

After a surprise reboot the other day, I came home to find diskfarm's
RAID5 arrays all offline with all disks marked as spares.  wtf?!?

After some googling around I found

  https://ronhks.hu/2021/01/07/mdadm-raid-5-all-disk-became-spare/

(for recent example) that it has happened to others, and at least the
pieces are all there rather than completely destroyed, but before I try
stopping and reassembling each array I thought I should double check :-)

Below is the output of a big ol' debugging run.  I tried to dump only what
is interesting :-)  [The smartctl-disks-timeout.sh script is based on the
wiki site script to check and set as necessary.]  I'm not sure why sd[dbc]
show a missing device while sd[lkf] are happy on each array, and I wonder
what happened to md53 with the widely differing event counter that may
make assembly interesting (and why does md52 have such a low event count
when the six of these are linear striped into a big fat array?).

Soooooo ...  What do you guys suggest to get us back up and happy?  TIA


  diskfarm:~ # uname -a ; mdadm --version ; for D in sd{d,c,b,l,k,f} ; do mdadm -E /dev/$D ; smartctl -H -i /dev/$D | egrep 'Model|SMART' | sed -e 's/^/    /' ; done ; echo '' ; for A in 51 52 53 54 55 56 ; do egrep md$A /proc/mdstat ; mdadm -D /dev/md$A | egrep 'Version|State|Events|/dev' ; for D in sd{d,b,c,l,k,f} ; do echo $D$A ; mdadm -E /dev/$D$A | egrep 'Raid|State|Events' ; done ; echo '' ; done ; /usr/local/bin/smartctl-disks-timeout.sh

  Linux diskfarm 5.3.18-lp152.106-default #1 SMP Mon Nov 22 08:38:17 UTC 2021 (52078fe) x86_64 x86_64 x86_64 GNU/Linux
  mdadm - v4.1 - 2018-10-01
  /dev/sdd:
     MBR Magic : aa55
  Partition[0] :   4294967295 sectors at            1 (type ee)
  	Device Model:     TOSHIBA HDWR11A
  	SMART support is: Available - device has SMART capability.
  	SMART support is: Enabled
  	=== START OF READ SMART DATA SECTION ===
  	SMART overall-health self-assessment test result: PASSED
  /dev/sdc:
     MBR Magic : aa55
  Partition[0] :   4294967295 sectors at            1 (type ee)
  	Device Model:     TOSHIBA HDWR11A
  	SMART support is: Available - device has SMART capability.
  	SMART support is: Enabled
  	=== START OF READ SMART DATA SECTION ===
  	SMART overall-health self-assessment test result: PASSED
  /dev/sdb:
     MBR Magic : aa55
  Partition[0] :   4294967295 sectors at            1 (type ee)
  	Device Model:     TOSHIBA HDWR11A
  	SMART support is: Available - device has SMART capability.
  	SMART support is: Enabled
  	=== START OF READ SMART DATA SECTION ===
  	SMART overall-health self-assessment test result: PASSED
  /dev/sdl:
     MBR Magic : aa55
  Partition[0] :   4294967295 sectors at            1 (type ee)
  	Device Model:     TOSHIBA HDWR11A
  	SMART support is: Available - device has SMART capability.
  	SMART support is: Enabled
  	=== START OF READ SMART DATA SECTION ===
  	SMART overall-health self-assessment test result: PASSED
  /dev/sdk:
     MBR Magic : aa55
  Partition[0] :   4294967295 sectors at            1 (type ee)
  	Device Model:     ST20000NM007D-3DJ103
  	SMART support is: Available - device has SMART capability.
  	SMART support is: Enabled
  	=== START OF READ SMART DATA SECTION ===
  	SMART overall-health self-assessment test result: PASSED
  /dev/sdf:
     MBR Magic : aa55
  Partition[0] :   4294967295 sectors at            1 (type ee)
  	Device Model:     ST20000NM007D-3DJ103
  	SMART support is: Available - device has SMART capability.
  	SMART support is: Enabled
  	=== START OF READ SMART DATA SECTION ===
  	SMART overall-health self-assessment test result: PASSED
  
  md51 : inactive sdd51[3](S) sdb51[0](S) sdc51[1](S) sdl51[4](S) sdk51[6](S) sdf51[5](S)
  /dev/md51:
             Version : 1.2
               State : inactive
              Events : 46655
         -     259       39        -        /dev/sdl51
         -     259        9        -        /dev/sdb51
         -     259       31        -        /dev/sdk51
         -     259       16        -        /dev/sdd51
         -     259        2        -        /dev/sdc51
         -     259       23        -        /dev/sdf51
  sdd51
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 46670
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdb51
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 46670
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdc51
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 46670
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdl51
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 46655
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdk51
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 46655
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdf51
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 46655
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  
  md52 : inactive sdd52[3](S) sdb52[0](S) sdc52[1](S) sdl52[4](S) sdk52[6](S) sdf52[5](S)
  /dev/md52:
             Version : 1.2
               State : inactive
              Events : 16482
         -     259        3        -        /dev/sdc52
         -     259       24        -        /dev/sdf52
         -     259       40        -        /dev/sdl52
         -     259       10        -        /dev/sdb52
         -     259       32        -        /dev/sdk52
         -     259       17        -        /dev/sdd52
  sdd52
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 16482
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdb52
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 16482
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdc52
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 16482
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdl52
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 16478
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdk52
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 16478
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdf52
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 16478
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  
  md53 : inactive sdd53[3](S) sdc53[1](S) sdb53[0](S) sdl53[4](S) sdk53[6](S) sdf53[5](S)
  /dev/md53:
             Version : 1.2
               State : inactive
              Events : 41470
         -     259       33        -        /dev/sdk53
         -     259       18        -        /dev/sdd53
         -     259        4        -        /dev/sdc53
         -     259       25        -        /dev/sdf53
         -     259       41        -        /dev/sdl53
         -     259       11        -        /dev/sdb53
  sdd53
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 53337
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdb53
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 53337
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdc53
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 53337
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdl53
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 41470
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdk53
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 41470
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdf53
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 41470
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  
  md54 : inactive sdd54[3](S) sdc54[1](S) sdb54[0](S) sdl54[4](S) sdk54[6](S) sdf54[5](S)
  /dev/md54:
             Version : 1.2
               State : inactive
              Events : 37400
         -     259        5        -        /dev/sdc54
         -     259       26        -        /dev/sdf54
         -     259       42        -        /dev/sdl54
         -     259       12        -        /dev/sdb54
         -     259       34        -        /dev/sdk54
         -     259       19        -        /dev/sdd54
  sdd54
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 37400
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdb54
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 37400
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdc54
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 37400
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdl54
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 37377
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdk54
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 37377
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdf54
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 37377
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  
  md55 : inactive sdd55[3](S) sdc55[1](S) sdb55[0](S) sdl55[4](S) sdk55[6](S) sdf55[5](S)
  /dev/md55:
             Version : 1.2
               State : inactive
              Events : 42328
         -     259       35        -        /dev/sdk55
         -     259       20        -        /dev/sdd55
         -     259        6        -        /dev/sdc55
         -     259       27        -        /dev/sdf55
         -     259       43        -        /dev/sdl55
         -     259       13        -        /dev/sdb55
  sdd55
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 42332
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdb55
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 42332
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdc55
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 42332
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdl55
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 42328
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdk55
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 42328
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdf55
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 42328
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  
  md56 : inactive sdd56[3](S) sdb56[0](S) sdc56[1](S) sdl56[4](S) sdk56[6](S) sdf56[5](S)
  /dev/md56:
             Version : 1.2
               State : inactive
              Events : 43091
         -     259        7        -        /dev/sdc56
         -     259       28        -        /dev/sdf56
         -     259       44        -        /dev/sdl56
         -     259       14        -        /dev/sdb56
         -     259       36        -        /dev/sdk56
         -     259       21        -        /dev/sdd56
  sdd56
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 43091
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdb56
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 43091
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdc56
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 43091
     Array State : AAAA.A ('A' == active, '.' == missing, 'R' == replacing)
  sdl56
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 43087
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdk56
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 43087
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  sdf56
       Raid Level : raid5
     Raid Devices : 6
            State : clean
           Events : 43087
     Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing)
  
  Drive timeouts: sda Y ; sdb Y ; sdc Y ; sdd Y ; sde Y ; sdf Y ; sdg 180 ; sdh Y ; sdi Y ; sdj Y ; sdk Y ; sdl Y ; sdm Y ; 


:-D
-- 
David T-G
See http://justpickone.org/davidtg/email/
See http://justpickone.org/davidtg/tofu.txt




[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux