Bad Query Plans on 10.3 vs 9.6

Cory Tucker <cory.tucker@xxxxxxxxx> · Thu, 29 Mar 2018 05:26:24 +0000

Hello all.  I'm migrating a database from PG 9.6 to 10.3 and have noticed a particular query that is performing very badly compared to its 9.6 counterpart.  
The plan on 9.6 v 10.3 are effectively identical except in 9.6 the planner decides to use an index only scan on the primary key and in 10.3 it does a sequential scan.  The problem is the sequential scan is for a table of 75M rows and 25 columns so its quiet a lot of pages it has to traverse.

This is the query:

explain verbose
WITH removed AS (
  DELETE FROM match m
  WHERE
    NOT EXISTS (
        SELECT 1
        FROM build.household h  -- This is the table that has 70M rows and does a full table scan in 10.3
        WHERE h.household_id = m.household_id
    ) OR (
      m.property_id IS NOT NULL AND
      NOT EXISTS (
          SELECT 1
          FROM build.property p
          WHERE p.household_id = m.household_id AND p.property_id = m.property_id
      )
    )
  RETURNING *
)
INSERT INTO orphaned_matches (household_id, account_id, candidate_id, matched_at, full_name, first_name, last_name, match_reason, property_id, owner_id)
  SELECT
    removed.household_id,
    removed.account_id,
    removed.candidate_id,
    removed.created_at,
    removed.full_name,
    removed.first_name,
    removed.last_name,
    removed.match_reason,
    removed.property_id,
    removed.owner_id
  FROM removed;

What's worse is that in 10.3, the number of rows is actually much smaller than in 9.6 because I am doing this query on a partitioned table (table name "match") with a reduced data set.

Query plans for both are attached, plus the query.

thanks
--Cory
 Insert on public.orphaned_matches  (cost=204030825.83..204247350.03 rows=8660968 width=264)
   CTE removed
     ->  Delete on public.match m  (cost=0.00..204030825.83 rows=8660968 width=6)
           Output: m.id, m.created_at, m.modified_at, m.household_id, m.property_id, m.match_reason, m.full_name, m.first_name, m.middle_name, m.last_name, m.account_id, m.candidate_id, m.match_category, m.confidence, m.owner_id, m.match_resource
           ->  Seq Scan on public.match m  (cost=0.00..204030825.83 rows=8660968 width=6)
                 Output: m.ctid
                 Filter: ((NOT (SubPlan 1)) OR ((m.property_id IS NOT NULL) AND (NOT (SubPlan 2))))
                 SubPlan 1
                   ->  Index Only Scan using uq_household_id on build.household h  (cost=0.57..8.59 rows=1 width=0)
                         Index Cond: (h.household_id = (m.household_id)::text)
                 SubPlan 2
                   ->  Index Scan using property_property_id_idx on build.property p  (cost=0.57..8.59 rows=1 width=0)
                         Index Cond: (p.property_id = m.property_id)
                         Filter: (p.household_id = (m.household_id)::text)
   ->  CTE Scan on removed  (cost=0.00..216524.20 rows=8660968 width=264)
         Output: nextval('orphaned_matches_id_seq'::regclass), now(), removed.household_id, removed.account_id, removed.candidate_id, removed.created_at, removed.full_name, removed.first_name, removed.last_name, removed.match_reason, removed.property_id, removed.owner_id
(16 rows)
 Insert on match.orphaned_matches  (cost=1823761513653.51..1823761525043.91 rows=455616 width=380)
   CTE removed
     ->  Delete on match m  (cost=0.00..1823761513653.51 rows=455616 width=6)
           Output: m.id, m.created_at, m.modified_at, m.household_id, m.account_id, m.candidate_id, m.match_reason, m.property_id, m.full_name, m.first_name, m.middle_name, m.last_name, m.match_category, m.confidence, m.owner_id, m.match_resource
           ->  Seq Scan on match m  (cost=0.00..1823761513653.51 rows=455616 width=6)
                 Output: m.ctid
                 Filter: ((NOT (SubPlan 1)) OR ((m.property_id IS NOT NULL) AND (NOT (SubPlan 2))))
                 SubPlan 1
                   ->  Seq Scan on build.household h  (cost=0.00..2948996.80 rows=1 width=0)
                         Filter: (h.household_id = (m.household_id)::text)
                 SubPlan 2
                   ->  Index Scan using uq_idx_property_id on build.property p  (cost=0.57..2.59 rows=1 width=0)
                         Index Cond: (p.property_id = m.property_id)
                         Filter: (p.household_id = (m.household_id)::text)
   ->  CTE Scan on removed  (cost=0.00..11390.40 rows=455616 width=380)
         Output: nextval('orphaned_matches_id_seq'::regclass), now(), removed.household_id, removed.account_id, removed.candidate_id, removed.created_at, removed.full_name, removed.first_name, removed.last_name, removed.match_reason, removed.property_id, removed.owner_id
(16 rows)
Attachment:
query.sql

Description: Binary data