Re: Distributed RAID

Michael O'Sullivan wrote:

I have just read that GFS on mdadm does not work because mdadm is not cluster-aware. I had really hoped to build an n + 1 RAID from the disks I have presented to the RHCS nodes via iSCSI. I had a look at DDRAID, which is old and appears to support only 3, 5 or 9 disks in the distributed RAID. I currently have only two (multipathed) devices, but I want them to be active-active. If I put them into a mirrored logical volume in CLVM, will this do the trick? Or will I have to install DRBD? Are there any more up-to-date distributed RAID options for when I want to build a 2 + 1, 3 + 1, etc. storage array? Some posts say this may be available in CLVM soon, or that mdadm may become cluster-aware soon. Has there been any progress on either of these options?

You probably saw me asking these very same questions in the archives, without any response.

DDRAID is unmaintained, and IIRC the code was removed from the current development tree a while back. So don't count on it ever getting resurrected.

I rather doubt md will become cluster-aware any time soon. CLVM doesn't yet support even more fundamental features such as snapshotting, so I wouldn't count on it gaining anything more advanced.

For straight mirroring (which is all you could sensibly do with 2 nodes anyway), I can highly recommend DRBD. It "just works" and works well. I have a number of 2-node clusters deployed with it, running shared-root.
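
For what it's worth, a dual-primary (active-active) resource for GFS looks roughly like the sketch below in drbd.conf on DRBD 8.x. This is only a sketch: the hostnames, addresses and backing devices are placeholders rather than anything from a real deployment, so adjust to taste and check the DRBD documentation for the full option set:

    resource r0 {
      protocol C;                    # synchronous replication; required for dual-primary
      net {
        allow-two-primaries;         # lets both nodes be Primary so GFS can mount on both
      }
      on node1 {                     # section name must match `uname -n` on that node
        device    /dev/drbd0;
        disk      /dev/sdb1;         # placeholder backing device
        address   192.168.0.1:7788;  # placeholder replication link
        meta-disk internal;
      }
      on node2 {
        device    /dev/drbd0;
        disk      /dev/sdb1;
        address   192.168.0.2:7788;
        meta-disk internal;
      }
    }

After "drbdadm up r0" on both nodes and the initial sync, you promote both nodes to Primary and put GFS on /dev/drbd0.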

If you really want to look into larger-scale clustering with proper n+m redundancy, have a look at cleversafe.org. There's a thread I started on the forum there looking into exactly this sort of thing. I'll be testing it in the next month or so, once I get the hardware together, but it's looking plausible.

Another thing to note is that RAID5 is not really usable on today's big disks in arrays of more than about 6 drives. Think about the expected unrecoverable read error rate on modern disks: about 10^-14 per bit, which works out to roughly one uncorrectable error per 10-12TB read. So if you have a 6x1TB disk array and you lose a disk, you have to read 5TB of data to reconstruct onto a fresh disk. At one uncorrectable error per ~10TB, that gives you something like a 50/50 chance of hitting another error during the rebuild, dropping a second disk out of the array and losing all your data. These days higher RAID levels are really a necessity, not an optional extra, and considering that read error rates have stayed constant while disk sizes have exploded, at this rate even RAID6 won't last long.
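
To put a rough number on that, here's a back-of-the-envelope sketch (my own model, not a published figure; it treats read errors as independent, which is optimistic):

    import math

    URE_RATE = 1e-14     # unrecoverable read errors per bit, a typical consumer-drive spec
    BITS_PER_TB = 8e12   # 1 TB = 10^12 bytes = 8 x 10^12 bits

    def rebuild_failure_probability(surviving_disks, tb_per_disk):
        """Chance of hitting at least one unrecoverable read error while
        reading every surviving disk end-to-end during a RAID5 rebuild
        (Poisson model, independent errors assumed)."""
        bits_read = surviving_disks * tb_per_disk * BITS_PER_TB
        expected_errors = bits_read * URE_RATE
        return 1 - math.exp(-expected_errors)

    # 6x1TB RAID5: rebuilding means reading the 5 surviving disks in full.
    print(f"6x1TB:  {rebuild_failure_probability(5, 1):.0%}")   # ~33%
    print(f"12x2TB: {rebuild_failure_probability(11, 2):.0%}")  # ~83%

The exact figure for the 6x1TB case comes out nearer one in three than 50/50, but it's the same order of magnitude, and it climbs steeply with array size, which is the point.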

Gordan

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
