Hi, On Tue, 13 Feb 2007, Linus Torvalds wrote: > 0x00 - which is the only one mmfile_is_binart() uses - is arguably the > "most binary" one, of course, but it might be interesting to give > different weights to the whole range.. In particular, especially for > small files, the fact that there is no 0x00 byte in no way indicates > that it's not "binary". Last time I checked, the text files never had lines longer than 200 characters (I chose this intentionally large). So, it might be a good heuristic to check the maximal line length, and refuse to believe that it's text once a certain (configurable) threshold is reached. Ciao, Dscho - To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html