Tim <elatllat@xxxxxxxxx> writes:
> My limited testing indicates tsvector size has an approximately linear
> (binomial) correlation with "number of unique words", and "word count".

Yes, it would be linear in the number of unique words (assuming the
average word length stays constant).  It's not linear in word count,
because tsvector only keeps track of distinct word positions for the
first 16K words in the text.  So for large documents only the number of
unique words is really going to matter, and that grows *much* less than
linearly with document size for real-world documents.

			regards, tom lane

-- 
Sent via pgsql-admin mailing list (pgsql-admin@xxxxxxxxxxxxxx)
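
To illustrate the point about word count versus unique words, here is a rough Python sketch. It is not how PostgreSQL computes anything: the function name word_stats, the simple regex tokenizer, and the MAX_POSITION value of 16383 (my reading of the "16K" above) are all assumptions made for illustration. It ignores stemming, stop words, and any other limits the server applies. It only shows that once positions are capped, the number of stored (word, position) pairs stops tracking the raw word count.

```python
import re

# Assumption: stand-in for the "first 16K words" position cap described above.
MAX_POSITION = 16383


def word_stats(text):
    """Return (word_count, unique_words, stored_positions) for a text.

    stored_positions counts distinct (word, position) pairs after every
    position past MAX_POSITION has been collapsed to MAX_POSITION.
    """
    # Hypothetical tokenizer: lowercase runs of letters and digits.
    words = re.findall(r"[a-z0-9]+", text.lower())
    positions = {}
    for pos, word in enumerate(words, start=1):
        # Positions beyond the cap all collapse to the same value,
        # so they add at most one extra entry per distinct word.
        positions.setdefault(word, set()).add(min(pos, MAX_POSITION))
    stored = sum(len(p) for p in positions.values())
    return len(words), len(positions), stored


if __name__ == "__main__":
    small = "the cat sat on the mat " * 100
    large = "the cat sat on the mat " * 10000
    for label, doc in (("small", small), ("large", large)):
        total, unique, stored = word_stats(doc)
        print(label, "words:", total, "unique:", unique, "stored:", stored)
```

In this toy model the stored count can never exceed (MAX_POSITION - 1) plus the number of unique words, however long the input gets, which is the shape of the behaviour described above.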