On Sat, 1 Mar 2008, Craig James wrote:
So my question still stands: From a strictly performance point of view, would
it be better to separate the OS and the WAL onto two disks?
You're not getting a more useful answer here because you haven't mentioned
yet a) what the disk controller is or b) how much writing activity is
going on here. If you can cache writes, most of the advantages to having
a seperate WAL disk aren't important unless you've got an extremely high
write throughput (higher you can likely sustain with only 8 disks) so you
can put the WAL data just about anywhere.
This is a dedicated system and does nothing but Apache/Postgres, so the OS
should get very little traffic. But if that's the case, I guess you could
argue that your suggestion of combining OS and WAL on a 2-disk RAID 1 would
be the way to go, since the OS activity wouldn't affect the WAL very much.
The main thing to watch out for if the OS and WAL are on the same disk is
that some random process spewing logs files could fill the disk and now
the database is stalled.
I think there are two configurations that make sense for your situation:
8 disks RAID 1+0 Everything
This maximizes potential sequential and seek throughput for the database,
which is probably going to be your bottleneck unless you're writing lots
of simple data, while still allowing survival of any one disk. The crazy
log situation I mentioned above is less likely to be a problem because
having so much more disk space available to everything means it's more
likely you'll notice it before the disk actually fills.
6 disks RAID 0 Postgres data+WAL
2 disks RAID 1 Linux
This puts some redundancy on the base OS, so no single disk loss can
actually take down the system altogether. You get maximum throughput on
the database. If you lose a database disk, you replace it and rebuild the
whole database at that point.
I suppose the thing to do is get the system, and run bonnie on various
configurations. I've never run bonnie before -- can I get some useful
results without a huge learning curve?
I've collected some bonnie++ examples at
http://www.westnet.com/~gsmith/content/postgresql/pg-disktesting.htm you
may find useful. With only 8 disks you should be able to get useful
results without a learning curve; with significantly more it can be
necessary to run more than one bonnie at once to really saturate the disks
and that's trickier.
I don't think you're going to learn anything useful from that though
(other than figuring out if your disk+controller combination is
fundamentally fast or not). As you put more disks into the array,
sequential throughput and seeks/second will go up. This doesn't tell you
anything useful about whether the WAL is going to get enough traffic to be
a bottleneck such that it needs to be on a seperate disk. To figure that
out, you need to run some simulations of the real database and its
application, and doing that fairly is a more serious benchmarking project.
--
* Greg Smith gsmith@xxxxxxxxxxxxx http://www.gregsmith.com Baltimore, MD
---------------------------(end of broadcast)---------------------------
Do not post admin requests to the list. They will be ignored.
pgsql-performance mailing list (pgsql-performance@xxxxxxxxxxxxxx)
Help/Unsubscribe/Update your Subscription:
http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.org&extra=pgsql-performance