Re: WARNINGs after starting backup server created with PITR

Erik Jones <erik@xxxxxxxxxx> · Mon, 21 Jan 2008 10:19:34 -0600

On Jan 20, 2008, at 3:19 AM, Simon Riggs wrote:

On Sat, 2008-01-19 at 13:53 -0600, Erik Jones wrote:

Here's the python script I was using to ship to both servers, right
now I'm back to a direct rsync call for my archive_command to sb1.
What's really weird, is that for the two WALs that disappeared, or
didn't make it, even though the primary's log said they were
successfully archived, they weren't on either server.

Can you explain this some more? How do you know this? What messages  
are
received relating to those files?

In my archive directory on the standby, a sorted listing has in it  
this sequence:

000000010000059D00000021
000000010000059D00000023

I found this when I check the standby.log pg_standby is writing to  
and found it waiting on 000000010000059D00000022.  Checking the  
primary server's log I found this line:

2008-01-16 04:58:41 CST 7841 :LOG:  archived transaction log file  
"000000010000059D00000022"

The same file was missing on the sb1 server (the one with the local  
NFS mounted wal_archive directory).  Since that was the local, to NFS  
rsync, which I've never had any issues with, and which was preceded by

if res != 0:
	sys.exit(res)

wherein the value of res comes from a remotely execute command (test  
via ssh or rsync via ssh), something weird happened causing my  
transfer script to exit abnormally there (or before), but Postgres  
somehow saw a 0 exit status, which I understand is required in order  
to get that archived transaction log file entry.

Erik Jones

DBA | Emma®
erik@xxxxxxxxxx
800.595.4401 or 615.292.5888
615.292.0777 (fax)

Emma helps organizations everywhere communicate & market in style.
Visit us online at http://www.myemma.com

---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings