Re: UPDATES hang every 5 minutes

Greg Smith <gsmith@xxxxxxxxxxxxx> · Fri, 10 Aug 2007 11:48:28 -0400 (EDT)

On Fri, 10 Aug 2007, Marc Rossi wrote:

Thanks for the heads up.  The box in question is a dual cpu (xeon dual 
cores) with 8 gig & a pair of 10k 146gb raid 1 arrays.  I have the 
pg_xlog dir on one array (along with the OS) & the rest of the data on 
the other array by itself.

Yeah, that's kinda what I thought.  Here's a quick way to think about the 
background writer parameters.  Pages are 8KB, and the default 
bgwriter_delay will execute every 200ms or 5 times per second.  If you set 
one of the maxpages parameters to 100, that means that component of the 
background writer might write as much as 8KB*100*5=4MB/sec worth of data 
to the disk continuously.  The config you were copying had the maxpages 
parameters set to 200 and 600, which meant that in a heavy update 
situation the background writer might be writing out (2+6)*4=32MB/second 
*on top of* all the other reads ands writes going on.  That's a pretty 
substantial additional load to add to just two pairs of disks.

Also, anything written by the all-scan writer (which was by far the more 
aggressive one in that setup) has the potential to be a wasted write if 
that particular page gets changed again before the next checkpoint, which 
sounds highly likely given how you described your workload.  Making those 
parameters too high will actually make the system less efficient--and that 
can make checkpoints worse.

A couple of quick questions.  On the fly I can change these params and 
use 'pg_ctl reload" to put them in effect, correct?

I believe all of the ones you'll be touching can be adjusted that way. 
To confirm a change took, pop into psql and do "show <parameter>" to see 
the current value.

Also, I have my checkpoint_segments set to 10, if I were to lower this 
(say 5) would this possible have the effect of checkpointing a little 
more often with less data?  (right now I hit the checkpoint_timeout).

You're thinking in the right terms here.  It's possible that by forcing 
more frequent checkpoints, the impact of each of them will be reduced. 
However, the net will be a less efficient system, because checkpointing is 
intensive.  And having twice as many of them doubles how often you get 
this spike in response times.

The other approach is to increase the timeout and the segments, so maybe 
you're only running into this every 10 minutes or more which makes the 
problem less annoying, then try to keep the buffers clean between 
checkpoints using the background writer.

What's shared_buffers set to on this system?  A third possibility is to 
reduce that and rely more on the operating system to buffer the data for 
you.  If there's less data in the buffer cache, it will certainly take 
less time to write things out at checkpoint time; there's obviously 
downsides to that in terms of how fast regular queries execute.

It will take some experimentation here to get this right.  Checkpoint 
problems like you're running into aren't something you knock out in a day 
if you're new to this.  Try not to make too big a step at any time and 
expect you'll have to dig into this a bit.  I would recommend you start by 
intalling the contrib/pg_buffercache module against your database (read 
README.pg_buffercache for an intro), which lets you watch what's in the 
buffer cache at any time, so you can tell the balance of clean vs. dirty 
pages and what tables they're in.  That will let you monitor how effective 
the background writer is doing and estimate how bad the checkpoint is 
going to be before it happens.  Here's a sample query to get you started:

SELECT c.relname, isdirty, count(*) AS buffers
  FROM pg_class c INNER JOIN pg_buffercache b
  ON b.relfilenode = c.relfilenode INNER JOIN pg_database d
  ON (b.reldatabase = d.oid AND d.datname = current_database())
  GROUP BY c.relname,isdirty
  ORDER BY 2 DESC;

I have something I'm working on that covers a lot of this topic at 
http://developer.postgresql.org/index.php/Buffer_Cache%2C_Checkpoints%2C_and_the_BGW 
but that's probably a little too low-level for you to chew on usefully 
right now.

--
* Greg Smith gsmith@xxxxxxxxxxxxx http://www.gregsmith.com Baltimore, MD

---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings