On Wed, 01 Aug 2012 08:02:38 -0400
matt <listy@xxxxxxxxxxx> wrote:

> Quick intro: Last year I was having problems with an md array
> continuously having a mismatch_cnt in the tens of thousands,
> inexplicably. After a week or two of hardware swapping and such, I
> narrowed it down to bad reads of the hard drive block devices. I used
> scripts that would repetitively do something like this on all my drives:
> dd if=/dev/sdk1 bs=1024 count=50000000 |md5sum -b
> Some devices would intermittently get different results. I ended up
> resolving (?) it by replacing the cheapo (Syba) SATA controller cards
> with other cheapo (Rosewill) ones. I've been fine for about a year
> since then.

Syba was mentioned in a bad context the last time we discussed this here,
somewhere down the thread: http://comments.gmane.org/gmane.linux.raid/33346
I don't remember whether you participated in that discussion; if not, have
an entertaining read.

Is your new Rosewill card also Silicon Image based?

> Is silent data corruption like this simply to be expected when using cheap
> commodity hardware?

I'd say no. There is known-good and known-bad hardware, and the easiest way
to tell the two apart is by the number (or absence) of problem reports you
hear about or can find. One way to check is to ask a search engine about
"<chip name> data corruption".

-- 
With respect,
Roman

~~~~~~~~~~~~~~~~~~~~~~~~~~~
"Stallman had a printer,
with code he could not see.
So he began to tinker,
and set the software free."
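
The repeated-read check quoted above (dd piped into md5sum, run several times per drive) could be scripted roughly as follows. This is only a sketch under assumptions: the function name check_reads and its arguments are made up for illustration and are not from matt's original scripts, and the default count simply mirrors the quoted command.

```shell
#!/bin/sh
# Hypothetical helper (name and arguments are illustrative, not from the thread):
# read the same region of a device several times and compare the checksums.
# usage: check_reads DEVICE [PASSES] [COUNT_KIB]
check_reads() {
    dev=$1
    passes=${2:-3}
    count=${3:-50000000}   # number of 1 KiB blocks, as in the quoted dd command
    first=""
    i=1
    while [ "$i" -le "$passes" ]; do
        # Hash the first COUNT_KIB KiB of the device; keep only the digest.
        sum=$(dd if="$dev" bs=1024 count="$count" 2>/dev/null | md5sum -b | cut -d' ' -f1)
        echo "pass $i: $sum"
        if [ -z "$first" ]; then
            first=$sum                      # remember the first pass as reference
        elif [ "$sum" != "$first" ]; then
            echo "MISMATCH on $dev" >&2     # a later pass read different data
            return 1
        fi
        i=$((i + 1))
    done
    return 0
}
```

Something like `check_reads /dev/sdk1 5` would then exit non-zero as soon as one pass disagrees with the first. If the region read is smaller than RAM, later passes may be served from the page cache instead of the drive, so a test like this probably wants either a large count (as in the quoted command) or GNU dd's iflag=direct.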