Search Postgresql Archives

how to get collation according to Unicode Collation Algorithm?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

I have a problem with proper collation of UTF-8 strings using PostgreSQL version 9.2.4 under Debian Linux 6.0 with de_DE.utf8 (but en_US behaves the same) locale:

CREATE TABLE test_collation ( q text );
INSERT INTO test_collation (q) VALUES ('aa'), ('ac'), ('a&b');
SELECT * FROM test_collation ORDER BY q COLLATE "de_DE";
  q
-----
 aa
 a&b
 ac

I need the "&" character to be sorted at the beginning or at the end of the alphabet, but it seems like it is simply ignored. The space character (" ") is treated the same way (just swap the ampersand in previous example with a space).

I made a test on ICU pages (http://site.icu-project.org/) and there I get proper collation: 1. a&b, 2. aa, 3. ac. Screenshot: http://software.eq.cz/icu_collation_de_DE.png

Is there a way to achieve this collation (note also the order of the characters with umlaut on the screenshot) with PostgreSQL? Or is it a glibc bug?

Thanks,

r.


--
Sent via pgsql-general mailing list (pgsql-general@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general




[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [Postgresql Jobs]     [Postgresql Admin]     [Postgresql Performance]     [Linux Clusters]     [PHP Home]     [PHP on Windows]     [Kernel Newbies]     [PHP Classes]     [PHP Books]     [PHP Databases]     [Postgresql & PHP]     [Yosemite]
  Powered by Linux