Re: What RAID type and why?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, Mar 6, 2010 at 3:17 PM, Guy Watkins <linux-raid@xxxxxxxxxxxxxxxx> wrote:
> } -----Original Message-----
> } From: linux-raid-owner@xxxxxxxxxxxxxxx [mailto:linux-raid-
> } owner@xxxxxxxxxxxxxxx] On Behalf Of Greg Freemyer
> } Sent: Saturday, March 06, 2010 5:33 PM
> } To: Mark Knecht
> } Cc: Linux-RAID
> } Subject: Re: What RAID type and why?
> }
> } On Sat, Mar 6, 2010 at 5:02 PM, Mark Knecht <markknecht@xxxxxxxxx> wrote:
> } > First post. I've never used RAID but am thinking about it and looking
> } > for newbie-level info. Thanks in advance.
> } >
> } > I'm thinking about building a machine for long term number crunching
> } > of stock market data. Highest end processor I can get, 16GB and at
> } > least reasonably fast drives. I've not done RAID before and don't know
> } > how to choose one RAID type over another for this sort of workload.
> } > All I know is I want the machine to run 24/7 computing 100% of the
> } > time and be reliable at least in the sense of not losing data if 1
> } > drive or possibly 2 go down.
> } >
> } > If a drive does go down I'm not overly worried about down time. I'll
> } > stock a couple of spares when I build the machine and power the box
> } > back up within an hour or two.
> } >
> } > What RAID type do I choose and why?
> } >
> } > Do I need a 5 physical drive RAID array to meet these requirements?
> } > Assume 1TB+ drives all around.
> } >
> } > How critical is it going forward with Linux RAID solutions to be able
> } > to get exactly the same drives in the future? 1TB today is 4TB a year
> } > from now, etc.
> } >
> } > With an 8 core processor (high-end Intel Core i7 probably) do I need
> } > to worry much about CPU usage doing RAID? I suspect not and I don't
> } > really want to get into hardware RAID controllers unless critically
> } > necessary which I suspect it isn't.
> } >
> } > Anyway, if there's a document around somewhere that helps a newbie
> } > like me I'd sure appreciate finding out about it.
> } >
> } > Thanks,
> } > Mark
> }
> } I'm not sure about a newbie doc, but here's some basics:
> }
> } You haven't said what kind of i/o rates you expect, nor how much
> } storage you need.
> }
> } At a minimum I would build a 3-disk raid 6.  raid 6 does a lot of i/o
> } which may be a problem.
>
> If he only needs 3 drives I would recommend RAID1.  Can still loose 2 drives
> and you don't have the RAID6 I/O overhead.
>
> Also, you said your data is important.  If so, you need a backup solution!
> 2 copies with 1 off-site.  Maybe alternate between the 2 each day or week.
>
> How much data per day?  How much data during the next 3 years?
>
> Guy
>
> }
> } Raid-5 is out of favor for me due to issues people are seeing with
> } discrete bad sectors with the remaining drives after you have a drive
> } failure.  raid-6 tolerates those much better.  Even raid 10 is not as
> } robust as raid 6 and with the current generation drives robustness in
> } the raid solution is more important than ever.
> }
> } But raid 6 uses 2 parity drives, so you'll only get 1TB of useable
> } space from a 3-disk raid 6 made from 1TB drives.
> }
> } mdraid just requires replacement disks be bigger than the old disk
> } you're replacing.
> }
> } You might consider layering LVM on top of mdraid to help you manage
> } the array as it grows.
> }
> } Greg
> } --
> } Greg Freemyer
> } Head of EDD Tape Extraction and Processing team
> } Litigation Triage Solutions Specialist
> } http://www.linkedin.com/in/gregfreemyer
> } Preservation and Forensic processing of Exchange Repositories White Paper
> } -
> } <http://www.norcrossgroup.com/forms/whitepapers/tng_whitepaper_fpe.html>
> }
> } The Norcross Group
> } The Intersection of Evidence & Technology
> } http://www.norcrossgroup.com
> } --
> } To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> } the body of a message to majordomo@xxxxxxxxxxxxxxx
> } More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>

More importantly, it sounds like his workload will be mostly /database/ driven.

As far as I'm aware, databases tend to produce many small operations;
which unfortunately pushes favor to the simple mirroring operations.
If two drives going bad is a concern then using 2 backup copies per
raid 1 mirror set would work.  Most modern consumer systems come with
6 SATA ports or more, so it should be possible to get 6 hard drives
installed and shared among two raid 1 sets of 3 drives each.  LVM with
striping could be used over the raid 1 sets.

On the other hand, he says that the system will have 16 GB of memory;
I'm not sure what size his working set is, but it sounds entirely
plausible that a well constructed database could live entirely within
the ram.  If that's the case it doesn't really matter what the precise
performance of the storage solution is.  Raid 6 would offer more
efficient drive use at similar rates of error tolerance at a cost
savings of 2 drives in the six drive case.

Update for new email: Just go with the raid 1 version; it sounds like
you aren't trying to store terabytes of data so the raid 1 solution
with even just 3 drives should be sufficient.  Put the saved resources
in to more, faster, or better memory.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux