Search Postgresql Archives

tsearch2 for alphabetic character strings & codes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I'm looking for a way search for substrings strings within 
documents in a way very similar to tsearch2, but my strings
are not alphabetical codes so I'm having a tough time 
trying to use the current tsearch2 configurations with them.

For example, using tsearch to search for codes like 
  '31.03(e)(2)(A)'
in a set of documents is tricky because tsearch seems
to treat most of the punctuation as word separators.

  fli=# select
  fli-#      to_tsvector('default','31.03(e)(2)(A)'),
  fli-#      to_tsvector('simple','31.03(e)(2)(A)');
 
        to_tsvector      |         to_tsvector
  -----------------------+-----------------------------
   '2':3 'e':2 '31.03':1 | '2':3 'a':4 'e':2 '31.03':1
  (1 row)


I see that tsearch2 allows different "configurations"
that appaently differ in how they parse strings.

I guess what I'm looking for is a "configuration"
that's even simpler-than-simple, and only breaks
up strings on whitespace and doesn't use any natural
language dictionaries. I was hoping I could download 
or define such a configuration; but didn't see any
obvious documentation on how to set up my own 
configuration.

Does this sound like a good approach (and if so, could
someone please point me in the right direction), or 
are there other things I should be looking to.

   Ron

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

               http://archives.postgresql.org

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [Postgresql Jobs]     [Postgresql Admin]     [Postgresql Performance]     [Linux Clusters]     [PHP Home]     [PHP on Windows]     [Kernel Newbies]     [PHP Classes]     [PHP Books]     [PHP Databases]     [Postgresql & PHP]     [Yosemite]
  Powered by Linux