Slow index

Matthew Wakeling <matthew@xxxxxxxxxxx> · Thu, 25 Sep 2008 13:07:09 +0100 (BST)

Hi all. I'm having an interesting time performance-wise with a set of indexes. 
Any clues as to what is going on or tips to fix it would be appreciated.

My application runs lots of queries along the lines of:

SELECT * from table where field IN (.., .., ..);

There is always an index on the field in the table, but the table is not 
necessarily clustered on the index.

There are six different indexes which the application hits quite hard, that I 
have investigated. These are:

gene__key_primaryidentifier      (index size 20MB)  (table size 72MB)
gene__key_secondaryidentifier    (index size 20MB)  (table size 72MB)
ontologyterm__key_name_ontology  (index size 2.5MB) (table size 10MB)
protein__key_primaryacc          (index size 359MB) (table size 1.2GB)
publication__key_pubmed          (index size 12MB)  (table size 48MB)
synonym__key_synonym             (index size 3GB)   (table size 3.5GB)

These six indexes all perform very differently.

These are the results from a few thousand queries on each index, from our 
application logs. Generally, the same value is not used more than once in all 
the queries.

                                 (1)   (2)   (3)   (4)    (5)
gene__key_primaryidentifier      22    17    417   19     24.5
gene__key_secondaryidentifier    8.5   5.3   21    2.4    3.9
ontologyterm__key_name_ontology  6.5   6.5   9.4   1.4    1.4
protein__key_primaryacc          73    8.1   164   2.2    20
publication__key_pubmed          52    31    156   3.0    5.0
synonym__key_synonym             335   66    245   0.7    3.7

(1) - Average number of values in the IN list.
(2) - Average number of rows returned by the queries.
(3) - Average time taken to execute the query, in ms.
(4) - Average time per value in the IN lists.
(5) - Average time per row returned.

All the queries are answered with a bitmap index scan on the correct 
index.

I have also plotted all the log entries on an XY graph, with number of 
elements in the IN list against time taken, which is at 
http://wakeling.homeip.net/~mnw21/slow_index1.png. It is clear that the 
gene__key_primaryidentifier index runs a lot slower than some of the other 
indexes.

The thing is, the table and the index are both small. The machine has 16GB of 
RAM, and its disc subsystem is a RAID array of 16 15krpm drives with a BBU 
caching RAID controller. The entire table and index should be in the cache. Why 
it is taking 20 milliseconds per value is beyond me. Moreover, the synonym 
index is MUCH larger, has bigger queries, and performs better.

If we concentrate on just this index, it seems that some queries are 
answered very quickly indeed, while others are answered a lot slower. I 
have plotted just this one index on an XY graph, with two colours for 
values in the IN list and actual rows returned, which is at 
http://wakeling.homeip.net/~mnw21/slow_index2.png. It is clear that there 
is a gap in the graph between the slow queries and the fast queries.

Is there something else going on here which is slowing the system down? The 
table is not bloated. There is quite heavy simultaneous write traffic, but 
little other read traffic, and the 16 spindles and BBU cache should take care 
of that quite happily. I don't think it's slow parsing the query, as it seems 
to manage on other queries in a millisecond or less.

Any ideas welcome.

Also, the mailing list server doesn't seem to be able to cope with image 
attachments.

Matthew

--
import oz.wizards.Magic;
  if (Magic.guessRight())...           -- Computer Science Lecturer