Den Saturday 29 March 2008 08.22.03 skrev Jeff King: > On Sat, Mar 29, 2008 at 08:19:07AM +0100, Robin Rosenberg wrote: > > Den Friday 28 March 2008 22.29.01 skrev Jeff King: > > > We always use 'utf-8' as the encoding, since we currently > > > have no way of getting the information from the user. > > > > Don't set encoding to UTF-8 unless it actually looks like UTF-8. > > OK. Do you have an example function that guesses with high probability > whether a string is utf-8? If there are non-ascii characters but we > _don't_ guess utf-8, what should we do? Any test for valid UTF-8 will do that with a very high probability. The perl UTF-8 "api" is a mess. I couldn't find such a routine!?. Calling decode/encode and see if you get the original string works, but that is too clumsy, IMHO. -- robin -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html