On 2 May 2013 14:29, Stefan Borggraefe <stefan@xxxxxxxxxxx> wrote:
> On Thursday, 2 May 2013, 14:17:04, Mathias Burén wrote:
>> On 2 May 2013 14:14, Stefan Borggraefe <stefan@xxxxxxxxxxx> wrote:
>> > On Thursday, 2 May 2013, 13:30:22, Mathias Burén wrote:
>> >> I won't scold you for using RAID5 instead of RAID6 with this number
>> >> of drives and especially the size of the drives.
>> >>
>> >> Could you please post the output of smartctl -a for each device? (from
>> >> smartmontools)
>> >>
>> >> That way we can verify which HDDs are broken, before proceeding.
>> >>
>> >> Mathias
>> >
>> > Hello Mathias,
>> >
>> > RAID6 would clearly have been the safer option, but we needed the
>> > extra space and only had this number of drives available.
>> >
>> > Here is the requested output:
>> >
>> > smartctl -a /dev/sdc
>> > smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.2.0-37-generic] (local build)
>> > Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net
>> >
>> > Vendor: Hitachi
>> > Product: HUS724040ALE640
>> > Revision: MJAO
>> > User Capacity: 4.000.787.030.016 bytes [4,00 TB]
>> > Logical block size: 512 bytes
>> > Logical Unit id: 0x5000cca22bd08a85
>> > Serial number: PK2331PAH5D0YT
>> > Device type: disk
>> > Local Time is: Thu May 2 15:09:16 2013 CEST
>> > Device supports SMART and is Enabled
>> > Temperature Warning Disabled or Not Supported
>> > SMART Health Status: OK
>> >
>> > [...]
>>
>> Hm, are these behind some controller of sorts? What about smartctl -x?
>
> We use an Adaptec 71605 controller. smartctl -x does not provide any more
> useful information, I suppose. This time I am only posting the output of
> one drive as an example. They all give a similar result.
>
> smartctl -x /dev/sdc
> smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.2.0-37-generic] (local build)
> Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net
>
> Vendor: Hitachi
> Product: HUS724040ALE640
> Revision: MJAO
> User Capacity: 4.000.787.030.016 bytes [4,00 TB]
> Logical block size: 512 bytes
> Logical Unit id: 0x5000cca22bd08a85
> Serial number: PK2331PAH5D0YT
> Device type: disk
> Local Time is: Thu May 2 15:20:55 2013 CEST
> Device supports SMART and is Enabled
> Temperature Warning Disabled or Not Supported
> SMART Health Status: OK
>
> Current Drive Temperature: <not available>
>
> Error Counter logging not supported
> Device does not support Self Test logging
> Device does not support Background scan results logging
> scsiPrintSasPhy Log Sense Failed [unsupported field in scsi command]
>
> What about the state of the software RAID5? It would be great if I
> could bring it back to a state where the filesystem on it is fully
> working again without having to copy the 20 TB of data to it again
> (copying this amount of data takes some time :( ).
> --
> Best regards,
> Stefan Borggraefe

Ugh, Adaptec, not my favourite controller. Do you have arcconf
installed? You could run arcconf getconfig 1 (or whatever your
controller number is) to grab some information about your
controller and the HDDs connected to it.

Do you have /dev/sg? devices? If so, smartctl -a might work on them.

Regarding the software RAID, I would double-check the health of your
HDDs before attempting anything.

Mathias
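
A minimal sketch of the /dev/sg check suggested above, assuming the drives
behind the controller show up as SCSI generic nodes named sg0, sg1, and so
on. The function name list_sg_commands and its optional directory argument
are hypothetical, made up for this example. It only prints the smartctl
commands; it does not run them, and whether smartctl can actually read the
drives through the Adaptec controller is not guaranteed.

```shell
#!/bin/sh
# Sketch only: print one "smartctl -a" command per SCSI generic node.
# list_sg_commands is a hypothetical helper, not part of smartmontools.

list_sg_commands() {
    # Directory to scan; defaults to /dev (the argument exists for testing).
    dev_dir="${1:-/dev}"
    for node in "$dev_dir"/sg[0-9]*; do
        # If no sg nodes exist the pattern stays unexpanded, so skip it.
        [ -e "$node" ] || continue
        printf 'smartctl -a %s\n' "$node"
    done
}
```

Calling list_sg_commands with no argument lists the commands for /dev so
they can be reviewed first; piping that output to sh as root would then run
them one by one.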