Nicolas Pitre <nico@xxxxxxx> writes: > On Fri, 22 Dec 2006, Johannes Schindelin wrote: > >> Hi, >> >> On Thu, 21 Dec 2006, Junio C Hamano wrote: >> >> > (2) update commit-tree to reject non utf-8 log messages and >> > author/committer names when i18n.commitEncoding is _NOT_ >> > set, or set to utf-8. >> >> The problem is: you cannot easily recognize if it is UTF8 or not, >> programatically. There is a good indicator _against_ UTF8, namely the >> first byte can _only_ be 0xxxxxxx, 110xxxxx, 1110xxxx, 11110xxx. But there >> is no _positive_ sign that it is UTF8. For example, many umlauts and other >> special modifications to letters, stay in the range 0x7f-0xff. > > Still... that would be a good enough thing to have in the majority of > cases, wouldn't it? I think that would be very sane thing to do. - To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html