On 20 Jan 2008, at 2:04, Linus Torvalds wrote:
On Sun, 20 Jan 2008, Wincent Colaiuta wrote:
For what it's worth, their choice wasn't entirely "insane", i.e. it did
have an element of rationality: that decomposed forms are a little bit
simpler to sort.
No they are *not*.
In many languages, 'ä' does *not* sort like 'a' at all, and if you think
it does, you'll sort at least Finnish and Swedish totally wrong (åäö are
real letters, and they sort at the *end* of the alphabet, they have
nothing what-so-ever to do with the letters 'a' or 'o').
The fact that in *some* languages the decomposed forms sort as the base
letter is immaterial. It's only true in some cases.
So no, sort order is not it. To sort right, you need to use a real
Unicode sort (and the decomposed form is *not* going to help you one
bit, quite the reverse).
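
To make the point above concrete, here is a minimal Python sketch. It is
an illustration only: the word list, the helper names, and the hard-coded
SWEDISH_ALPHABET table are assumptions made for this example, and the toy
alphabet key is a stand-in for a real Unicode collation library, not a
replacement for one. It compares a naive code-point sort of decomposed
(NFD) strings with an ordering that treats å, ä and ö as letters at the
end of the alphabet.

```python
import unicodedata

# Toy Swedish alphabet: å, ä, ö are separate letters that come after z.
# Hypothetical table for illustration, not a full Unicode collation.
SWEDISH_ALPHABET = unicodedata.normalize("NFC", "abcdefghijklmnopqrstuvwxyzåäö")
RANK = {ch: i for i, ch in enumerate(SWEDISH_ALPHABET)}


def nfd_codepoint_key(word):
    # Decompose (e.g. 'ä' becomes 'a' + U+0308) and compare raw code points.
    return unicodedata.normalize("NFD", word)


def swedish_key(word):
    # Compose first so 'ä' is one letter, then rank each letter by alphabet
    # position. Unknown characters sort after every known letter.
    composed = unicodedata.normalize("NFC", word.lower())
    return [RANK.get(ch, len(RANK)) for ch in composed]


words = ["zebra", "äpple", "apa", "öl", "ål", "bil"]

by_nfd = sorted(words, key=nfd_codepoint_key)
by_swedish = sorted(words, key=swedish_key)

print(by_nfd)      # ['apa', 'äpple', 'ål', 'bil', 'öl', 'zebra']
print(by_swedish)  # ['apa', 'bil', 'zebra', 'ål', 'äpple', 'öl']
```

The decomposed sort files 'äpple' and 'ål' among the 'a' words and 'öl'
among the 'o' words, which is wrong for Swedish. The second ordering only
comes out right because the key composes the strings again and consults a
language-specific table.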
That's what I get for believing Wikipedia ("This makes sorting far
simpler"):
http://en.wikipedia.org/wiki/UTF-8#Mac_OS_X
Cheers,
Wincent