Re: Proposal: non-striping RAID4

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



James Lee wrote:
I have come across an unusual RAID configuration type which differs
from any of the standard RAID 0/1/4/5 levels currently available in
the md driver, and has a couple of very useful properties (see below).
 I think it would be useful to have this code included in the main
kernel, as it allows for some use cases that aren't well catered for
with the standard RAID levels.  I was wondering what people's thoughts
on this might be?

The RAID type has been named "unRAID" by it's author, and is basically
similar to RAID 4 but without data being striped across the drives in
the array.  In an n-drive array (where the drives need not have the
same capacity), n-1 of the drives appear as independent drives with
data written to them as with a single standalone drive, and the 1
remaining drive is a parity drive (this must be the largest capacity
drive), which stores the bitwise XOR of the data on the other n-1
drives (where the data being XORed is taken to be 0 if we're past the
end of that particular drive).  Data recovery then works as per normal
RAID 4/5 in the case of the failure of any one of the drives in the
array.

The advantages of this are:
- Drives need not be of the same size as each other; the only
requirement is that the parity drive must be the largest drive in the
array.  The available space of the array is the sum of the space of
all drives in the array, minus the size of the largest drive.
- Data protection is slightly better than with RAID 4/5 in that in the
event of multiple drive failures, only some data is lost (since the
data on any non-failed, non-parity drives is usable).

The disadvantages are:
- Performance:
    - As there is no striping, on a non-degraded array the read
performance will be identical to that of a single drive setup, and the
write performance will be comparable or somewhat worse than that of a
single-drive setup.
    - On a degraded arrays with many drives the read and write
performance could take further hits due to the PCI / PCI-E bus getting
saturated.

I personally feel that "this still looks like a bunch of little drives" should be listed first...
The company which has implemented this is "Lime technology" (website
here: http://www.lime-technology.com/); an overview of the technical
detail is given on their website here:
http://www.lime-technology.com/wordpress/?page_id=13.  The changes
made to the Linux md driver to support this have been released under
the GPL by the author - I've attached these to this email.

Now I'm guessing that the reason this hasn't been implemented before
is that in most cases the points above mean that this is a worse
option than RAID 5, however there is a strong use case for this
system.  For many home users who want data redundancy, the current
RAID levels are impractical because the user has many hard drives of
different sizes accumulated over the years.  Even for new setups, it

And over the years is just the problem. You have a bunch of tiny drives unsuited, or marginally suited, for the size of modern distributions, and using assorted old technology. There's a reason these are thrown out and available, they're pretty much useless. Also they're power-hungry, slow, and you would probably need more of these than would fit in a standard case to provide even minimal useful size. They're also probably PATA, meaning that many modern motherboards don't support them well (if at all).
is generally not cost-effective to buy multiple identical sized hard
drives, compared with incrementally adding storage of the capacity
which is at the best price per GB at the time.  The fact that there is
a need for this type of flexibility is evidenced, for example, by
various forum threads such as for example this thread containing over
1500 posts in a specialized audio / video forum:
http://www.avsforum.com/avs-vb/showthread.php?t=573986, as well as the
active community in the forums on the Lime technology website.

I can buy 500GB USB drives for $98+tax if I wait until Stapels or Office Max have a sale, $120 anytime, anywhere. I see 250GB PATA drives being flogged for $50-70 for lack of demand. I simply can't imagine any case where this would be useful other than as a proof of concept.

Note: you can do this with existing code by setting up a partitions of various sizes on multiple drives, first partitions size of smallest drive, next the remaining space on the next drive, etc. On every set of >2 drives make a raid-5, on every set of two drives make raid-10, and have a bunch of smaller redundant drives which are faster. You can then combine them all into one linear array if it pleases you. I have a crate if drives from 360MB (six) to about 4GB, and they are going to be sold by the pound because they are garbage.
Would there be interest in making this kind of addition to the md code?

I can't see that the cost of maintaining it is justified by the benefit, but not my decision. If you were to set up such a thing using FUSE, keeping it out of the kernel but still providing the functionality, it might be worth doing. On the other hand, setting up the partitions and creating the arrays could probably be done by a perl script which would take only a few hours to write.
PS: In case it wasn't clear, the attached code is simply the code the
author has released under GPL - it's intended just for reference, not
as proposed code for review.
Much as I generally like adding functionality, I *really* can't see much in this idea. It seems to me to be in the "clever but not useful" category.

--
bill davidsen <davidsen@xxxxxxx>
 CTO TMR Associates, Inc
 Doing interesting things with small computers since 1979

-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux