Hi, On Tue, 12 May 2009, Shawn O. Pearce wrote: > Esko Luontola <esko.luontola@xxxxxxxxx> wrote: > > Are there any plans on storing the encoding information of file names > > and commit messages in the Git repository? > > Commit messages already store their encoding in an optional "encoding" > header if the message isn't stored in UTF-8, or US-ASCII, which is a > strict subset of UTF-8. > > As for file names, no plans, its a sequence of bytes, but I think a > lot of people wind up using some subset of US-ASCII for their file > names, especially if their project is going to be cross platform. Some context: this issue cropped up in msysGit, of course. As to storing all file names in UTF-8, my point about Unicode being not necessarily appropriate for everyone still stands. UTF-8 _might_ be the de-facto standard for Linux filesystems, but IMHO we should not take away the freedom for everybody to decide what they want their file names to be encoded as. However, I see that there might be a need to be able to encode the file names differently, such as on Windows. IMHO the best solution would be a config variable controlling the reencoding of file names. For some time, it looked as if two people were interested in implementing something like that (Peter and Robin IIRC), but efforts have stalled. Ciao, Dscho -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html