On 2/28/25 09:16, Laurenz Albe wrote:
> On Thu, 2025-02-27 at 16:54 +0300, Alexey Borschev wrote:
>> I see poor performance of text sorting of collate "en_US.utf8" in PG 17.4.
>
> I'd say that you would have to complain to the authors of the
> GNU C library, which provides this collation.

Yep -- glibc starting with version 2.21 has a massive performance
regression for certain cases and the glibc folks have basically said
they will not fix it. If you try the same thing on RHEL 7.x with glibc
2.17 it will perform about the same as ICU.
If you are using PG 17 you should consider using the new builtin
collation provider -- it will perform almost as well as the 'C' locale.
Something like:
--------
CREATE DATABASE builtincoll LOCALE_PROVIDER builtin
BUILTIN_LOCALE 'C.UTF-8' TEMPLATE template0;
--------
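
If you want to compare the providers before creating a new database,
a rough sketch along these lines may help. It assumes a UTF8-encoded
database on PG 17, where the predefined "pg_c_utf8" collation (builtin
provider, C.UTF-8 locale) should be available. The names "my_table"
and "my_col" are placeholders for your own table and text column, not
anything from the original report:
--------
-- my_table / my_col are hypothetical; substitute your own names.
-- glibc collation, as in the original report:
EXPLAIN (ANALYZE)
  SELECT my_col FROM my_table ORDER BY my_col COLLATE "en_US.utf8";

-- builtin provider, C.UTF-8 (predefined collation in PG 17):
EXPLAIN (ANALYZE)
  SELECT my_col FROM my_table ORDER BY my_col COLLATE pg_c_utf8;

-- plain 'C' locale, as a baseline:
EXPLAIN (ANALYZE)
  SELECT my_col FROM my_table ORDER BY my_col COLLATE "C";
--------
Keep in mind that both pg_c_utf8 and "C" sort by code point, so the
resulting order will differ from en_US.utf8; only the timings are
comparable.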
--
Joe Conway
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com