On Fri, 22 Dec 2006, Johannes Schindelin wrote:

> Hi,
>
> On Thu, 21 Dec 2006, Junio C Hamano wrote:
>
> > (2) update commit-tree to reject non utf-8 log messages and
> >     author/committer names when i18n.commitEncoding is _NOT_
> >     set, or set to utf-8.
>
> The problem is: you cannot easily recognize if it is UTF8 or not,
> programmatically. There is a good indicator _against_ UTF8, namely the
> first byte can _only_ be 0xxxxxxx, 110xxxxx, 1110xxxx, 11110xxx. But there
> is no _positive_ sign that it is UTF8. For example, many umlauts and other
> special modifications to letters, stay in the range 0x7f-0xff.

Still... that would be a good enough thing to have in the majority of
cases, wouldn't it?

Nicolas
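
To make the "indicator against UTF-8" concrete, here is a rough sketch in Python of the kind of negative check being discussed. It is not git's code; the names utf8_sequence_length and is_not_utf8 are made up for illustration. It only looks at the lead-byte patterns quoted above plus the 10xxxxxx continuation bytes that must follow, so it does not catch overlong encodings or surrogates the way a strict decoder would.

```python
def utf8_sequence_length(lead: int) -> int:
    """Return the sequence length implied by a lead byte, or 0 if the
    byte cannot start a UTF-8 sequence (hypothetical helper)."""
    if (lead & 0x80) == 0x00:  # 0xxxxxxx: plain ASCII
        return 1
    if (lead & 0xE0) == 0xC0:  # 110xxxxx: start of a 2-byte sequence
        return 2
    if (lead & 0xF0) == 0xE0:  # 1110xxxx: start of a 3-byte sequence
        return 3
    if (lead & 0xF8) == 0xF0:  # 11110xxx: start of a 4-byte sequence
        return 4
    return 0                   # 10xxxxxx or 11111xxx: not a valid lead byte


def is_not_utf8(data: bytes) -> bool:
    """Return True when data is structurally impossible as UTF-8.

    A False result is NOT proof of UTF-8: some byte strings in other
    encodings happen to have the right shape.
    """
    i = 0
    while i < len(data):
        n = utf8_sequence_length(data[i])
        if n == 0:
            return True
        tail = data[i + 1:i + n]
        # Every byte after the lead must be a continuation byte (10xxxxxx),
        # and the sequence must not be cut off by the end of the data.
        if len(tail) != n - 1 or any((b & 0xC0) != 0x80 for b in tail):
            return True
        i += n
    return False
```

Typical Latin-1 text trips the check, because an accented letter is usually followed by an ASCII byte rather than a continuation byte: is_not_utf8("café au lait".encode("latin-1")) is True. The lack of a positive sign shows up in the other direction: the two Latin-1 characters 0xC3 0xA9 have exactly the shape of one UTF-8 character, so is_not_utf8(b"\xc3\xa9") is False even though the bytes may not have been written as UTF-8.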