Hi Mark,
I would say forget the SSD's - they are not ideal for VM files, and I
don't think they would be worth the cost. Raid 10 (any arrangement) is
likely to give the best speed for such files, and would do a lot better
than raid 6. Raid 10,f2 is probably a good choice - but you might want
to test things out a bit if that is possible.
I don't know how much ram you've got in the machine, but if you can
afford more, it will always help (especially if you make sure the VM's
use the host's cache rather than direct writes).
mvh.,
David
On 01/06/14 17:59, Mark Knecht wrote:
David,
You are correct and I'm sorry I didn't do that. I started this
question on a Gentoo list where I put a lot more information about the
machine/ When I came here I should have included more.
The machine is used 7 days a week. I'm self employed writing
software analyzing the stock & futures markets. Most of it is written
in R in Linux, some of it in proprietary languages in Windows. Some of
it is quite computational but mostly it's just looking at a _lot_ of
locally stored financial data. Almost all financial data is currently
stored on the machine in Linux in ext4. Over the past year this data
has been growing at around 30GB/month. With 100GB left on my current
RAID6 I don't have much time before I'm full.
When I'm actually trading in the market I have a few Virtualbox VMs
running Windows 7. They aren't overly large in terms of disk space.
(Currently about 150GB total.) The VMs are each stored in massive
single files which I suspect basically represent a hard drive to
Virtualbox. I have no idea what size any IO might be coming from the
VM. The financial data in the previous paragraph is available to these
Windows VMs as a network mount from the Windows perspective. Read &
write speeds of this data in Windows is not overly high.
These VMs are the area where my current RAID6 (5 drive, 16k chunk
size) seems to have been a bad decision. The machine is powered off
every night. Loading these VMs takes at least 10-15 minutes each
morning where I see disk activity lights just grinding away the whole
time. If I had a single _performance_ goal in upgrading the disks it
would be to improve this significantly. Craig's SSD RAID1 suggestion
would certainly help here but at 240GB there wouldn't be a lot of room
left. That may be OK though.
The last area is video storage. Write speed is unimportant, read
speeds are quite low. Over time I hope to migrate it off to a NAS box
but for now this is where it's stored. This is currently using about
1/2 the storage my RAID6 provides.
Most important to me is data safety. I currently do weekly
rotational backups to a couple of USB drives. I have no real-time
issues at all if the machine goes down. I have 2 other machines I can
do day-to-day work on while I fix this machine. What I am most
concerned about is not losing anything more than a couple of previous
days work. If I took a week to rebuild the machine after a failure
it's pretty much a non-issue to me.
Thanks,
Mark
On Sun, Jun 1, 2014 at 8:06 AM, David Brown <david.brown@xxxxxxxxxxxx> wrote:
Hi Mark,
What would be really useful here is a description of what you actually
/want/. What do you want to do with these drives? What sort of files are
they - big or small? Do you need fast access for large files? Do you need
fast access for many files in parallel? How important is the data? How
important is uptime? What sort of backups do you have? What will the
future be like - are you making one big system to last for the foreseeable
future, or do you need something that can easily be expanded? Are you
looking for "fun, interesting and modern" or "boring but well-tested"
solutions?
Then you need to make a list of the hardware you have, or the budget for new
hardware.
Without know at least roughly what you are looking for, it's easy to end up
with expensive SSDs because they are "cool", even though you might get more
speed for your money with a couple of slow rust disks and a bit more ram in
your system. It may be that there is no need for any sort of raid at all -
perhaps one big main disk is fine, and the rest of the money spent on a
backup disk (possibly external) with rsync'd copies of your data. This
would mean longer downtime if your main disk failed - but it also gives some
protection against user error.
And perhaps btrfs with raid1 would be the best choice.
A raid10,f2 is often the best choice for desktops or workstations with 2 or
3 hard disks, but it is not necessarily /the/ best choice.
mvh.,
David
On 01/06/14 16:25, Mark Knecht wrote:
Hi Craig,
Responding to both you and David Brown. Thanks for your ideas.
- Mark
On Sat, May 31, 2014 at 9:40 AM, Craig Curtin <craigc@xxxxxxxxxxxxx>
wrote:
It sounds like the op has additional data ports on his MOBO - wouldn't he
be
better off looking at a couple of SSDs in raid 1 for his OS, swap etc and
his VMs and then leave the rest for data as raid5 - By moving the things
from the existing drives he gets back space and only purchases a couple
of
good sized fast SSDs now
It's a possibility. I can get 240GB SSDs in the $120 range so that's
$240 for RAID1. If I take the five existing 500GB drives and
reconfigure for RAID5 that's 2TB. Overall it's not bad going from
1.4TB to about 2.2TB but being it's not all one big disk I'll likely
never use it all as efficiently. Still, it's an option.
I do in fact have extra ports:
c2RAID6 ~ # lspci | grep SATA
00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port
SATA IDE Controller #1
00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port
SATA IDE Controller #2
03:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9123 PCIe
SATA 6.0 Gb/s controller (rev 11)
06:00.0 SATA controller: JMicron Technology Corp. JMB363 SATA/IDE
Controller (rev 03)
06:00.1 IDE interface: JMicron Technology Corp. JMB363 SATA/IDE
Controller (rev 03)
c2RAID6 ~ #
Currently my 5-drive RAID6 uses 5 of the Intel ports. The 6th port
goes to the CD/DVD drive. Some time ago I bought the SATA3 Marvell
card and a smaller (120GB) SSD. I put Gentoo on it and played around a
bit but I've never really used it day-to-day. Part of my 2-drive RAID1
thinking was that I could build the new RAID1 on the SATA3 controller
not even touch the existing RAID6. If it works reliably on that
controller I'd be done and have 3TB.
I think David's RAID10 3-drive solution could possibly work if I buy 3
of the lower cost new WD drives. I'll need to think about that. Not
sure.
Thanks,
Mark
On Sat, May 31, 2014 at 9:40 AM, Craig Curtin <craigc@xxxxxxxxxxxxx>
wrote:
It sounds like the op has additional data ports on his MOBO - wouldn't he
be
better off looking at a couple of SSDs in raid 1 for his OS, swap etc and
his VMs and then leave the rest for data as raid5 - By moving the things
from the existing drives he gets back space and only purchases a couple
of
good sized fast SSDs now
Sent from my Samsung tablet
.
-------- Original message --------
From: David Brown
Date:31/05/2014 21:01 (GMT+10:00)
To: Mark Knecht ,"L.M.J"
Cc: Linux-RAID
Subject: Re: Home desktop/server RAID upgrade
On 30/05/14 22:14, Mark Knecht wrote:
On Fri, May 30, 2014 at 12:29 PM, L.M.J <linuxmasterjedi@xxxxxxx>
wrote:
Le Fri, 30 May 2014 12:04:07 -0700, Mark Knecht
<markknecht@xxxxxxxxx> a écrit :
In a RAID1 would a 3-drive Red RAID1 possibly be faster than the
2-drive Se RAID1 and at the same time give me more safety?
Just a question inside the question : how do you manager a RAID1
with 3 drives ? Maybe you're talking about RAID5 then ?
OK, I'm no RAID expert but RAID1 is just drives in parallel right. 2
drives, 3 drives, 4 drives, all holding exactly the same data. In
the case of a 3-drive RAID1 - if there is such a beast - I could
safely lose 2 drives. You ask a reasonable question though as maybe
the way this is actually done is 2 drives + a hot spare in the box
that gets sync'ed if and only if one drive fails. Not sure and maybe
I'm totally wrong about that.
A 3-drive RAID5 would be 2 drives in series - in this case making
6TB - and then the 3rd drive being the redundancy. In the case of a
3-drive RAID5 I could safely lose 1 drive.
In my case I don't need more than 3TB, so an option would be a
3-drive RAID5 made out of 2TB drives which would give me 4TB but I
don't need the space as much as I want the redundancy and I think
RAID5 is slower than RAID1. Additionally some more mdadm RAID
knowledgeable people on other lists say Linux mdadm RAID1 would be
faster as it will get data from more than one drive at a time. (Or
possibly get data from which ever drive returns it the fastest. Not
sure.)
I believe one good option if I wanted 4 physical drives would be
RAID10 but that's getting more complicated again which I didn't
really want to do.
So maybe it is just 2 drives and the 3 drive version isn't even a
possibility? Could be.
With 3 drives, you have several possibilities.
Raid5 makes "stripes" across the three drives, with 2 parts holding data
and one part holding parity to provide redundancy.
Raid1 is commonly called "mirroring", because you get the same data on
each disk. md raid has no problem making a 3-way mirror, so that each
disk is identical. This gives you excellent redundancy, and you can
make three different reads in parallel - but writes have to go to each
disk, which can be a little slower than using 2 disks. It's not often
that people need that level of redundancy.
Another option with md raid is the raid10 setups. For many uses, the
fastest arrangement is raid10,f2. This means there is two copies of all
your data (f3 would be three copies), with a "far" layout.
<http://en.wikipedia.org/wiki/Linux_MD_RAID_10#LINUX-MD-RAID-10>
With this arrangement, reads are striped across all three disks, which
is fast for large reads. Small reads can be handled in parallel. Most
reads while be handled from the outer half of the disk, which is faster
and needs less head movement - so reading is on average faster than a
raid0 on the same disks. Small writes are fast, but large writes
require quite a bit of head movement to get everything written twice to
different parts of the disks.
The "best" option always depends on your needs - how you want to access
your files. A layout geared to fast striped reads of large files will
be poorer for parallel small writes, and vice versa. raid10,f2 is often
the best choice for a desktop or small system - but it is not very
flexible if you later want to add new disks or replace the disks with
bigger ones.
md raid is flexible enough that it will even let you make a 3 disk raid6
array if you want - but a 3-way raid1 mirror will give you the same disk
space and much better performance.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html