Re: Re: question about software Raid 1

Les Mikesell <lesmikesell@xxxxxxxxx> · Tue, 30 Sep 2008 14:43:57 -0500

Kay Diederichs wrote:

Fact is that with CentOS-5 kernels (but not with CentOS-4, as this 
functionality became available in kernel 2.6.17) you could (or rather 
_should_ regularly)
   echo check > /sys/block/mdX/md/sync_action
to check agreement between the two (or more) copies. When this finishes, 
/sys/block/mdX/md/mismatch_cnt shows you the number of mismatches. You 
can fix these with
   echo repair > /sys/block/mdX/md/sync_action

This applies to at least RAID1 and RAID5.
At this point the question arises: how does the "repair job" know which 
copy is the correct one? I have no answer to this question.

Thanks for posting this.  I have a machine that periodically had 
filesystem errors on a RAID1 volume that I eventually found were caused 
by bad RAM but even after replacing it I'd still see filesystem problems 
 reappear every few weeks.  It turned out that there were quite a few 
mismatched blocks between the mirrors and the fsck passes must have 
sometimes seen the good copy but subsequently the still-bad alternate 
would be used.  Now I've done a repair and fsck and so far everything 
seems stable.  It's hard to tell with problems that only happen once or 
twice a month, though.  I suppose I have some files with corrupt 
contents on there but they are backups that will expire as more current 
ones are saved anyway.

BTW, there is - even with current kernels - no speed gain in using RAID1 
- see http://kernelnewbies.org/KernelProjects/Raid1ReadBalancing .

I don't think I believe that - you can see the reads alternating drives 
by watching the lights.

--
  Les Mikesell
   lesmikesell@xxxxxxxxx
_______________________________________________
CentOS mailing list
CentOS@xxxxxxxxxx
http://lists.centos.org/mailman/listinfo/centos