On Wed, Sep 14, 2011 at 12:55 AM, Luke Diamand <luke@xxxxxxxxxxx> wrote: > On 13/09/11 22:33, Chris Li wrote: >> The fix is simple, just ask perforce to print the depot >> file into a real file. This way perforce will not performe >> the utf16 to utf8 conversion. Git can import the exact same >> file as perforce checkout. > > Does this change do the right thing with RCS keywords in UTF16 files? I don't know what is the rules about the RCS keyword in UTF16 files. I look at the current git-p4, it does not do any keyword replacement in utf16 files. So this patch did not change that. It should be a separate issue. The way I see it, this patch is a straight enhancement compare to the current git-p4 because the current git-p4 *corrupts* the utf16 files. > > If p4CmdList() fails, e.g. due to running out of diskspace, will this just > happily import a truncated/corrupt file? Good point. I add the error check and attach the new patch. > (And I could be wrong about this, but does you patch have newline damage? It > didn't seem to apply for me). Gmail dmage the white space. I should always use the attachment. Does the attached patch work for you? Thanks Chris
From 06de9cfdcd89e8bfb6575f40d36fdfcefe1a1985 Mon Sep 17 00:00:00 2001 From: Chris Li <git@xxxxxxxxxxx> Date: Tue, 13 Sep 2011 13:57:31 -0700 Subject: [PATCH] git-p4: import utf16 file properly The current git-p4 does not handle utf16 files properly. The "p4 print" command, when output to stdout, convert the utf16 file into utf8. That effectively imported the utf16 file as utf8 for git. In other words, git-p4 import a different file compare to file check out by perforce. This breaks my windows build in the company project. The fix is simple, just ask perforce to print the depot file into a real file. This way perforce will not perform the utf16 to utf8 conversion. Git can import the exact same file as perforce checkout. --- contrib/fast-import/git-p4 | 8 ++++++++ 1 files changed, 8 insertions(+), 0 deletions(-) diff --git a/contrib/fast-import/git-p4 b/contrib/fast-import/git-p4 index 6b9de9e..c111cad 100755 --- a/contrib/fast-import/git-p4 +++ b/contrib/fast-import/git-p4 @@ -1239,6 +1239,14 @@ class P4Sync(Command, P4UserMap): contents = map(lambda text: re.sub(r'(?i)\$(Id|Header):[^$]*\$',r'$\1$', text), contents) elif file['type'] in ('text+k', 'ktext', 'kxtext', 'unicode+k', 'binary+k'): contents = map(lambda text: re.sub(r'\$(Id|Header|Author|Date|DateTime|Change|File|Revision):[^$\n]*\$',r'$\1$', text), contents) + elif file['type'] == 'utf16': + tmpFile = tempfile.NamedTemporaryFile() + cmd = "print -o %s %s"%(tmpFile.name, file['depotFile']) + result = p4Cmd(cmd) + if "p4ExitCode" in result: + die("Problems executing p4 %s"%cmd) + contents = [ open(tmpFile.name).read() ] + tmpFile.close() self.gitStream.write("M %s inline %s\n" % (mode, relPath)) -- 1.7.6