Re: invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Rural Hunter wrote:
>>>>> My db is in utf-8, I have a row in my table say tmp_article and I wanted
>>>>> to generate ts_vector from the article content:
>>>>> select to_tsvector(content) from tmp_article;
>>>>> But I got this error:
>>>>> ERROR:  invalid byte sequence for encoding "UTF8": 0xf481

>> Do you use any nonstandard text search parsers or dictionaries?
>> What is the text search configuration you use (parameter
>> default_text_search_config)?

> Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the
> default default_text_search_config is 'chinesecfg'. I tested "select
> to_tsvector('english',content) from tmp_article" and this works. So it's
> the problem of nlpbamboo?

That's probably the problem - it seems to emit something that is
not proper UTF-8 sometimes.

Do you get the error if you try Chinese settings without nlpbamboo?

>              btw, what's the meaning of "Please don't top post."? I
> don't understand but sorry if I caused any problem.

http://en.wikipedia.org/wiki/Posting_style

Yours,
Laurenz Albe

-- 
Sent via pgsql-admin mailing list (pgsql-admin@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-admin



[Index of Archives]     [KVM ARM]     [KVM ia64]     [KVM ppc]     [Virtualization Tools]     [Spice Development]     [Libvirt]     [Libvirt Users]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite Questions]     [Linux Kernel]     [Linux SCSI]     [XFree86]

  Powered by Linux