On Tuesday July 18, blindcoder@xxxxxxxxxxxxxxxxxxxx wrote: > Jul 16 16:59:37 ceres kernel: ide: failed opcode was: unknown > Jul 16 16:59:37 ceres kernel: hdb: drive not ready for command > Jul 16 16:59:37 ceres kernel: ide0: reset: success > Jul 16 16:59:37 ceres kernel: hdb: status error: status=0x00 { } > Jul 16 16:59:37 ceres kernel: ide: failed opcode was: unknown > Jul 16 16:59:37 ceres kernel: end_request: I/O error, dev hdb, sector 488391932 > Jul 16 16:59:37 ceres kernel: hdb: drive not ready for command > Jul 16 16:59:37 ceres kernel: hdb: status error: status=0x10 { SeekComplete } > Jul 16 16:59:37 ceres kernel: ide: failed opcode was: 0xea > Jul 16 16:59:37 ceres kernel: raid5: Disk failure on hdb8, disabling device. Operation continuing on 2 devices > Jul 16 16:59:37 ceres kernel: hdb: drive not ready for command > Jul 16 16:59:37 ceres kernel: RAID5 conf printout: > Jul 16 16:59:37 ceres kernel: --- rd:3 wd:2 fd:1 > Jul 16 16:59:37 ceres kernel: disk 0, o:0, dev:hdb8 > Jul 16 16:59:37 ceres kernel: disk 1, o:1, dev:hda8 > Jul 16 16:59:37 ceres kernel: disk 2, o:1, dev:hdc8 > Jul 16 16:59:37 ceres kernel: RAID5 conf printout: > Jul 16 16:59:37 ceres kernel: --- rd:3 wd:2 fd:1 > Jul 16 16:59:37 ceres kernel: disk 1, o:1, dev:hda8 > Jul 16 16:59:37 ceres kernel: disk 2, o:1, dev:hdc8 > --- > > Now, is this a broken IDE controller or harddisk? Because smartctl claims > that everything is fine. > Ouch indeed. I've no idea whose 'fault' this is. Maybe ask on linux-ide. > > I don't have a script log or something, but here's what I did from an initrd > with init=/bin/bash > > # < mount /dev /proc /sys /tmp > > # < start udevd udevtrigger udevsettle > > while read a dev c ; do > [ "$a" != "ARRAY" ] && continue > [ -e /dev/md/${dev##*/} ] || /bin/mknod $dev b 9 ${dev##*/} > /sbin/mdadm -A ${dev} > done < /etc/mdadm.conf mdadm -As --auto=yes should be sufficient > > Personalities : [linear] [raid0] [raid1] [raid5] [raid4] > md5 : inactive raid5 hda8[1] hdc8[2] > 451426304 blocks level 5, 64k chunk, algorithm 2 [2/3] [_UU] Ahh, Ok, make that mdadm -As --force --auto=yes A crash combined with a drive failure can cause undetectable data corruption. You need to give --force to effectively acknowledge that.. I should get mdadm to explain what is happening so that I don't have to as much.... NeilBrown - To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html