Regexps not unicode-compatible

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

When I use regexp from GNU tools, such as emacs or grep, which I think use the GNU libc, regexps support pretty well, depending on locale settings, all unicode, so with french locale [a-z] will match « ç », and with C locale, [а-ю] will match « д » (or any original cyrillic letter contained in all cyrillic languages, I guess). [[:alnum:]] will correctly base on unicode class to determine if a given codepoint is a letter or not, and will support all languages.

However, I noticed that in apache these last ones won’t work and will only support ascii. Why a such restriction? Couldn’t it support at least optionally unicode at least on GNU systems?

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@xxxxxxxxxxxxxxxx
For additional commands, e-mail: users-help@xxxxxxxxxxxxxxxx





[Index of Archives]     [Open SSH Users]     [Linux ACPI]     [Linux Kernel]     [Linux Laptop]     [Kernel Newbies]     [Security]     [Netfilter]     [Bugtraq]     [Squid]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Samba]     [Video 4 Linux]     [Device Mapper]

  Powered by Linux