Rural Hunter wrote: >>>>> My db is in utf-8, I have a row in my table say tmp_article and I wanted >>>>> to generate ts_vector from the article content: >>>>> select to_tsvector(content) from tmp_article; >>>>> But I got this error: >>>>> ERROR: invalid byte sequence for encoding "UTF8": 0xf481 >> Do you use any nonstandard text search parsers or dictionaries? >> What is the text search configuration you use (parameter >> default_text_search_config)? > Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the > default default_text_search_config is 'chinesecfg'. I tested "select > to_tsvector('english',content) from tmp_article" and this works. So it's > the problem of nlpbamboo? That's probably the problem - it seems to emit something that is not proper UTF-8 sometimes. Do you get the error if you try Chinese settings without nlpbamboo? > btw, what's the meaning of "Please don't top post."? I > don't understand but sorry if I caused any problem. http://en.wikipedia.org/wiki/Posting_style Yours, Laurenz Albe -- Sent via pgsql-admin mailing list (pgsql-admin@xxxxxxxxxxxxxx) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-admin