Doug Ledford wrote:
And actually, with 1TB drives, your most likely culprit for this is the last item I just listed: reallocated drive sectors. Here's the deal. If the drive detects the bad sectors during a write, it reallocates and redoes the write to the new sectors, data saved. If, on the other hand, the sectors go bad after the write, then whether or not your data gets saved depends on a number of factors. For instance, if the sectors were going bad slowly and you also read those sectors on a regular basis so the drive firmware would have reason to know that they are going bad (it would start gettings reads with errors that it had to ECC correct before it went totally bad), then some drives will reallocate the sectors and move the data before it's totally lost. But, if they go bad suddenly, or if they went bad without having frequent enough intervening reads to pick it up that it was on its way to going bad, then the data is just lost. But, that's what RAID is for, so we can get it back. Anyway, that's my guess for the culprit of your situation. And, unfortunately, without getting in and looking at the mismatch to identify the correct data, a repair operation is just as likely (50-50 chance) to corrupt things as opposed to correct things. With Fedora 11 there should be the palimpsest program installed. Run it and it will allow you to see the SMART details on each drive. Take a look and see if you have any showing reallocated sectors. I happen to have 4 of 6 drives in my array that show reallocated sectors. I also happen to be lucky in that none of my weekly raid-checks have turned up a mismatch count on any devices, so the bad sectors must have been caught in time (or there was a read error sometime for the sectors and the raid subsystem corrected it, but if that happened I missed it in the kernel logs).
Are you saying or implying that this palimpsest program will show relocated sectors which the current tools (smartools) don't? If not, what does this tool do other than what's surrently done by most people using smartctl?
-- Bill Davidsen <davidsen@xxxxxxx> "We can't solve today's problems by using the same thinking we used in creating them." - Einstein -- To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html