Re: Path character encodings

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Paul Betts <paul@xxxxxxxxxxxxx> writes:

> On Mon, Mar 05, 2012 at 01:40:32PM -0800, Junio C Hamano wrote:
>> Hmph, I do not see a reason to make a huge molehill in this. The
>> pathnames are of unspecified encoding, and if a project declares
>> that they always use UTF-8, that would be great. 
>
> I would like to propose, that Git codifies-as-required the majority case
> today, that trees should *only* be encoded in UTF-8 going forward. 

I am afraid that that would be a hard sell.

As the pathnames are uninterpreted strings, an older project that
has been using 8859-1 (or EUC for various locales, or anything that
is superset of ASCII) has no good incentive or reason to follow such
a unilateral decision made outside their project, only to get their
participants inconvenienced.  Even giving a "warning" will trigger
"Yeah we know our paths are in latin-1 and everybody in our project
has happily been using that, thankyouverymuch!" and annoy them.

It is not an argument that their history can be re-coded to UTF-8
without losing information, if they still have to go through the
conversion process that their project do not benefit from.

Can we make this per-project opt-*in* somehow?

--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]