Re: git on MacOSX and files with decomposed utf-8 file names

Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx> · Wed, 23 Jan 2008 09:32:11 -0800 (PST)

On Wed, 23 Jan 2008, Kevin Ballard wrote:
> 
> Well yes, any context in which a string is treated as Unicode instead of an
> opaque sequence of bytes will probably lead to normalization at some point
> (e.g. when searching text, I'm going to want Märchen and Märchen to be treated
> as the same string).

As pointed out (multiple times), this is only true if the programmer is a 
moron.

You do not need to - and *should* not - convert to a common normalization 
in order to compare to Uncode strings. You should just compare them with a 
Unicode-aware comparison routine. It will be faster, and it will avoid 
corrupting the input.

Sadly, stupid people are much too common.

		Linus
-
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html