Hi all:
What confused me is that: When I select data using order by clause, I got the following execution plan:
postgres=# set session enable_indexscan=true;
SET
postgres=# explain SELECT * FROM pg_proc ORDER BY oid;
QUERY PLAN
----------------------------------------------------------------------------------------
Index Scan using pg_proc_oid_index on pg_proc (cost=0.00..321.60 rows=2490 width=552)
(1 row)
postgres=#
My Question is :
If I want to find record using the where clause which hold the id column, the index scan might be used.
But I just want to get all the records on sorted output format, Why index scan can be used here?
I can’t imagine that:
Step 1 Index is read into memory, then for each tuple in it,
Step 2 Then we got the address of related data block, and then access the data block .
Step 2 will be repeated for many times. I think it is not efficient.
But comparing with sort , I got that even index scan with all the entry , the cost is still lower than sort operation:
postgres=# set session enable_indexscan=false; |
||||||
SET |
||||||
postgres=# explain SELECT * FROM pg_proc ORDER BY oid; |
||||||
QUERY PLAN |
||||||
------------------------------------------------------------------- |
||||||
Sort (cost=843.36..849.59 rows=2490 width=552) |
||||||
Sort Key: oid |
||||||
-> Seq Scan on pg_proc (cost=0.00..86.90 rows=2490 width=552) |
||||||
(3 rows) |
||||||
postgres=# |
That is to say: cost of seq scan + sort > cost of index scan for every index entry + cost of access for every related data ?
Maybe the database system is clever enough to accumulate data access for same physical page, and reduce the times of physical page acess ?
And can somebody kindly give some more detailed information which help to know the execution plan calculation process?
Thanks in advance.