Search Postgresql Archives

Re: Concerning about Unicode-aware string handling

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 05/21/12 2:09 AM, Vincas Dargis wrote:
We have problems (currently using 8.4, but also in latest 9.1.3) in
our application with Unicode word symbols in Lithuanian ('ąčęėįšųūž'),
Russian and of course potentially other languages.

For example, regex_replace('acząčž', E'\\W', '', 'g') removes ąčž.

lower() and ~* comparison works only with locale that is set (no
internationalization).

Could we expect Unciode support in near future? Or should we do quick
hacks by reimplementing regexp_replace(), lower(), upper() and other
string SQL functions using, for example, Qt libraries..? Or maybe are
there some kind simpler workarounds?

your database encoding is UTF8 ? the language or environment you're using to generate those strings such as 'acząčž' is also UTF8 ?

postgresql supports UTF-8 unicode just fine. It does not directly support the bastardized UTF-16 'unicode' implemented by Windows NT and derivatives (2000, XP, 2003, Vista, 2008, 7), but on those platforms it generally behaves fairly sanely as long as you realize UTF8 is its native tongue.

of course, the database has to be created as a UTF8 database, its possible to initialize the server cluster in "C"/"POSIX"/"SQLASCII" which says bytes-are-bytes and encodings are unknown, or in various 8 bit encodings like LATIN-1.




--
john r pierce                            N 37, W 122
santa cruz ca                         mid-left coast



--
Sent via pgsql-general mailing list (pgsql-general@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [Postgresql Jobs]     [Postgresql Admin]     [Postgresql Performance]     [Linux Clusters]     [PHP Home]     [PHP on Windows]     [Kernel Newbies]     [PHP Classes]     [PHP Books]     [PHP Databases]     [Postgresql & PHP]     [Yosemite]
  Powered by Linux