standby questions

"Roberto Scattini" <roberto.scattini@xxxxxxxxx> · Sat, 9 Feb 2008 01:07:38 -0200

hi list:
im working in the setup of a warm standby server. im using
postgres-8.2.5 in the slave and 8.2.4 in master right now, in
production it will be only 8.2.5, everything in debian etch.

we decided this solution because we want an easy to
migrate/implement/adminstrate backup sever, and we have a few minutes
of data loss (if we can be precise with the "starting point in time"
of that loss).
by now, i have the warm standby server working fine (at least, it logs
that processes the archives...).
if i am undestanding, the "complete" (in broad strokes) process for
WAL archiving and warm standby, is something like this:
        1) the primary server processes some transactions and
generates some "WAL segment files" in the $DATA/pg_xlog/ dir of 16MB
each (by default).
        2) if archive_command is activated and working, the primary
server sends (preferably with rsync or some other "atomic tool") the
NEW WAL files to the standby server. Later, at some point, the primary
server will delete this files when considers that are not necessary
anymore.
        3) the standby server receives the archive files and processes
them somehow (almost everybody does this with pg_standby by now?) and
keep waiting for a new WAL file or the trigger file.

now, as you can see, im not a expert on database tech, maybe only a
"brave admin", so, reading the docs and seeing the warm standby
"working", i have a few questions (in any case, links to docs a
welcome!!):

-a "WAL segment file" is the same that a "log file segment"?

-what are the "log file segments" referenced by checkpoint_segments?
are the "binary logs" where postgres stores lasts transactions, the
ones in $DATA/pg_xlog dir? if this is true, then:
what means "Maximum distance between automatic WAL checkpoints"???
this is how often, in "log file segments", postgres will perform a
checkpoint, generating a special checkpoint record from which to start
the redo operation, dont?

-what is the "restartpoint" named in the "Warm Standby Servers for
High Availability" page?
(http://www.postgresql.org/docs/8.2/static/warm-standby.html) i cant
find a definition in any other doc...

-how often a new WAL file is generated? this depends on the server load?

-in one WAL file i could have one, more than one and even an
incomplete "transaction"?? (where is the doc i need to read?!)

-if i have incomplete transactions on a WAL, how the standby server
processes that? what if the rest of the transaction never reaches the
standby server?

-how do i know exactly at which point in time (in transactions) my
standby server is, if i have to switch to primary role?

-how many archive files is safe to keep in the standby server? right
now, i have "-k 100" in the pg_standby opts but in the pg_standby
README says:
"You should be wary against setting this number too low,
since this may mean you cannot restart the standby. This
is because the last restartpoint marked in the WAL files
may be many files in the past and can vary considerably.
This should be set to a value exceeding the number of WAL
files that can be recovered in 2*checkpoint_timeout seconds,
according to the value in the warm standby postgresql.conf.
It is wholly unrelated to the setting of checkpoint_segments
on either primary or standby."

i cant use the %r because im in 8.2 and not thinking in upgrade by now...
this is related to the restartpoint in my previous question... but i
dont know exactly what it is so i cant make a more detailed analysis.

finally: does anybody have a config like this one working with
heartbeat? is that recommendable?

wow! sorry for the huge post... i think i tied some loose ends just by
writing this email!! :D

thanks in advance!!

-- 
Roberto Scattini
 ___     _
 ))_) __ )L __
((__)(('(( ((_)

---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend