Re: pg_upgrade and frozen xids

Alvaro Herrera <alvherre@xxxxxxxxxxxxxxx> · Wed, 7 Mar 2018 16:48:11 -0300

bricklen wrote:

> We recently upgraded about 1500 Postgres clusters from 9.3 to 10.1 and we
> are seeing similar issues. Specifically:
> * An autovacuum worker stuck vacuuming regular tables for days until they
> are killed (I've seen "vacuuming indexes" phase), and they are showing 99%
> CPU in top. pg_cancel|terminate_backend does not usually kill the process,
> only a hard kill does. In the latter case, two (out of four) times it also
> crashed Postgres.
> * There have been about 10 examples of btree index corruption, across a
> handful of databases, in different data centres. In two of the cases, it's
> possible there were hardware issues as the hypervisor those two VM's were
> running on crashed.

I wonder if this may be related to changes in collation algorithm.  If
the indexes are all on text columns, I think you would be better served
by reindexing them all instead of trying to treat this situation as a
code bug.

-- 
Álvaro Herrera                https://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services