On Sun, 18 Feb 2007 07:13:28 -0500 (EST), Justin Piszcz wrote > On Sun, 18 Feb 2007, Marc Marais wrote: > > > On Sun, 18 Feb 2007 20:39:09 +1100, Neil Brown wrote > >> On Sunday February 18, marcm@xxxxxxxxxxxxxxxx wrote: > >>> Ok, I understand the risks which is why I did a full backup before doing > >>> this. I have subsequently recreated the array and restored my data from > >>> backup. > >> > >> Could you still please tell me exactly what kernel/mdadm version you > >> were using? > >> > >> Thanks, > >> NeilBrown > > > > 2.6.20 with the patch you supplied in response to the "md6_raid5 crash > > email" I posted in linux-raid a few days ago. Just as background, I replaced > > the failing drive and at the same time bought an additional drive in order > > to increase the array size. > > > > mdadm -V = v2.6 - 21 December 2006. Compiled under Debian (stable). > > > > Also, I've just noticed another drive failure with the new array with a > > similar error to what happened during the grow operation (although on a > > different drive) - I wonder if I should post this to linux-ide? > > > > Feb 18 00:58:10 xerces kernel: ata4: command timeout > > Feb 18 00:58:10 xerces kernel: ata4: no sense translation for status: 0x40 > > Feb 18 00:58:10 xerces kernel: ata4: translated ATA stat/err 0x40/00 to SCSI > > SK/ASC/ASCQ 0xb/00/00 > > Feb 18 00:58:10 xerces kernel: ata4: status=0x40 { DriveReady } > > Feb 18 00:58:10 xerces kernel: sd 4:0:0:0: SCSI error: return code = > > 0x08000002 > > Feb 18 00:58:10 xerces kernel: sdd: Current [descriptor]: sense key: Aborted > > Command > > Feb 18 00:58:10 xerces kernel: Additional sense: No additional sense > > information > > Feb 18 00:58:10 xerces kernel: Descriptor sense data with sense descriptors > > (in hex): > > Feb 18 00:58:10 xerces kernel: 72 0b 00 00 00 00 00 0c 00 0a 80 00 > > 00 00 00 00 > > Feb 18 00:58:10 xerces kernel: 00 00 00 00 > > Feb 18 00:58:10 xerces kernel: end_request: I/O error, dev sdd, sector > > 35666775 > > Feb 18 00:58:10 xerces kernel: raid5: Disk failure on sdd1, disabling > > device. Operation continuing on 3 devices > > > > Regards, > > Marc > > > > - > > To unsubscribe from this list: send the line "unsubscribe linux-raid" in > > the body of a message to majordomo@xxxxxxxxxxxxxxx > > More majordomo info at http://vger.kernel.org/majordomo-info.html > > > > Just out of curiosity: > > Feb 18 00:58:10 xerces kernel: end_request: I/O error, dev sdd, > sector 35666775 > > Can you run: > > smartctl -d ata -t short /dev/sdd > wait 5 min > smartctl -d ata -t long /dev/sdd > wait 2-3 hr > smartctl -d ata -a /dev/sdd > > And then e-mail that output to the list? > > Justin. I have smartmontools performing regular short and long scans but I will run the tests immediately and send the output of smartctl -a when done. Note I'm getting similar errors on sdc too (as in 5 minutes ago). Interestingly the SMART error logs for sdc and sdd show no errors at all. ata3: command timeout ata3: no sense translation for status: 0x40 ata3: translated ATA stat/err 0x40/00 to SCSI SK/ASC/ASCQ 0xb/00/00 ata4: status=0x40 { DriveReady } sd 3:0:0:0: SCSI error: return code = 0x08000002 sdd: Current [descriptor]: sense key: Aborted Command Additional sense: No additional sense information Descriptor sense data with sense descriptors (in hex): 72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00 00 00 00 00 end_request: I/O error, dev sdc, sector 260419647 raid5:md6: read error corrected (8 sectors at 260419584 on sdc1) Will post logs when done... Marc -- - To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html