TSearch2 vs. Apache Lucene

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




Greetings all,

I'm going to do a performance comparison with DocMgr and PG81/TSearch2 on one end, and Apache Lucene on the other end.

In order to do this, I'm going to create a derivative of the docmgr-autoimport script so that I can specify one file to import at a time. I'll then create a Perl script which logs all details (such as timing, etc.) as the test progresses.

As test data, I have approximately 9,000 text files from Project Gutenberg ranging in size from a few hundred bytes to 4.5M.

I plan to test the speed of import of each file. Then, I plan to write a web-robot in Perl that will test the speed and number of results returned.

Can anyone think of a validation of this test, or how I should configure PG to maximise import and search speed? Can I maximise search speed and import speed, or are those things mutually exclusive? (Note that this will be run on limited hardware - 900MHz Athlon with 512M of ram)

Has anyone ever compared TSearch2 to Lucene, as far as performance is concerned?

Thanks,
-Josh


[Postgresql General]     [Postgresql PHP]     [PHP Users]     [PHP Home]     [PHP on Windows]     [Kernel Newbies]     [PHP Classes]     [PHP Books]     [PHP Databases]     [Yosemite]

  Powered by Linux