Re: cvs2git with modules?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Jun 17, 2008 at 11:02 AM, Geoffrey Irving <irving@xxxxxxx> wrote:
> On Tue, Jun 17, 2008 at 8:45 AM, Michael Haggerty <mhagger@xxxxxxxxxxxx> wrote:
>> [I see you sent essentially the same question to both the git and the
>> cvs2svn mailing lists.  I am replying on the git list with bcc to the
>> cvs2svn list.  Followups please to git@xxxxxxxxxxxxxxxx]
>>
>> Kelly F. Hickel wrote:
>>> I'm trying to use cvs2svn in cvs2git mode to convert a repo with a
>>> number of modules.  Can anyone tell me how to keep that module
>>> structure in the new git repo? So, if in cvs there are two modules,
>>> ModA and ModB, I want to see those two as top level directories in
>>> the git repo.
>>>
>>> I've tried putting adding the projects in my options file as below,
>>> but it puts the files ModA/* and ModB/* at the top level in the git repo.
>>>
>>> run_options.add_project(
>>>     r'/home/foo/cvsrepo/ModA,
>>>     trunk_path=ModA',
>>>     [...]
>>>     )
>>>
>>> run_options.add_project(
>>>     r'/home/foo/cvsrepo/ModB,
>>>     trunk_path=ModB',
>>>     [...]
>>>     )
>>
>> I assume that what you mean is that the CVS repository contains
>> directories like ModA/a, ModA/b, ModB/c, and ModB/d, but the resulting
>> git repository has only /a, /b, /c, and /d.  That is because cvs2git
>> completely ignores the trunk_path argument to add_project().
>>
>> It is strange that the code allows you to add multiple projects,
>> considering that the cvs2git documentation[1] states that cvs2git only
>> supports converting single projects at a time.  I guess I forgot to
>> build that check in.
>>
>> You can get the result you want by treating ModA and ModB not as two
>> separate projects, but simply as two separate subdirectories within a
>> bigger project; i.e.,
>>
>> run_options.add_project(
>>    r'/home/foo/cvsrepo,
>>    [...]
>>    )
>>
>> .  This is assuming that ModA and ModB are the only subdirectories
>> within /home/foo/cvsrepo/; otherwise, make a copy of your CVS repo and
>> remove the other subdirectories from the copy before the conversion.
>>
>> Please note that when cvs2git is run this way, it treats tags and
>> branches as being global.  If you tagged your projects simultaneously,
>> then this is probably what you want.  But if you tagged your projects
>> separately, then tag names that happen to be the same across projects
>> will be considered the same.
>>
>> It would be possible to add cvs2git support for multiproject
>> conversions, but I was under the impression that it doesn't make much
>> sense to put multiple projects into a single git repository.  But I'm a
>> novice git user, so I could very well be wrong about that.
>>
>> Michael
>
> I haven't touched cvs2git, but I did modify git-cvsimport to handle
> submodules.  The patch is below.  The idea is that you first convert
> the submodule parts from cvs to git, and then when you convert the
> parent project the submodules are continuously updated based on a
> submodule description file and the dates.
>
> To use it, write a file in the same format as .gitmodules and pass it
> to cvs-import via -E.
>
> I eventually decided that submodules were getting in the way far more
> than they were helping, so I stopped using this.  Perhaps someone else
> will find it useful.

Oops.  Copying and pasting the patch messed it up due to 80 character
wrapping.  Trying again...

Geoffrey
From 653fe2e8440893da06fa0eba9fd5c2c275435897 Mon Sep 17 00:00:00 2001
From: Geoffrey Irving <irving@xxxxxxx>
Date: Thu, 12 Jun 2008 07:38:49 -0700
Subject: [PATCH] cvsimport: add support for generating submodules

Add an -E <submodules> option to cvsimport to generate submodule links to
other git repositories and to update them after each commit based on date.
This is useful when one is splitting up a cvs repository into separate git
modules which are strongly related to each other.
---
 git-cvsimport.perl |   71 +++++++++++++++++++++++++++++++++++++++++++++++----
 1 files changed, 65 insertions(+), 6 deletions(-)

diff --git a/git-cvsimport.perl b/git-cvsimport.perl
index cacbfc0..9b0285a 100755
--- a/git-cvsimport.perl
+++ b/git-cvsimport.perl
@@ -29,8 +29,9 @@ use IPC::Open2;
 $SIG{'PIPE'}="IGNORE";
 $ENV{'TZ'}="UTC";
 
-our ($opt_h,$opt_o,$opt_v,$opt_k,$opt_u,$opt_d,$opt_p,$opt_C,$opt_z,$opt_i,$opt_P, $opt_s,$opt_m,@opt_M,$opt_A,$opt_S,$opt_L, $opt_a, $opt_r);
+our ($opt_h,$opt_o,$opt_v,$opt_k,$opt_u,$opt_d,$opt_p,$opt_C,$opt_z,$opt_i,$opt_P, $opt_s,$opt_m,@opt_M,$opt_A,$opt_S,$opt_E,$opt_L, $opt_a, $opt_r);
 my (%conv_author_name, %conv_author_email);
+my (%submodule_path, %submodule_url);
 
 sub usage(;$) {
 	my $msg = shift;
@@ -40,7 +41,7 @@ Usage: ${\basename $0}     # fetch/update GIT from CVS
        [-o branch-for-HEAD] [-h] [-v] [-d CVSROOT] [-A author-conv-file]
        [-p opts-for-cvsps] [-P file] [-C GIT_repository] [-z fuzz] [-i] [-k]
        [-u] [-s subst] [-a] [-m] [-M regex] [-S regex] [-L commitlimit]
-       [-r remote] [CVS_module]
+       [-E gitmodules-file] [-r remote] [CVS_module]
 END
 	exit(1);
 }
@@ -88,6 +89,23 @@ sub write_author_info($) {
 	close ($f);
 }
 
+sub read_submodule_info($) {
+	my ($file) = @_;
+	open(my $f, '-|', 'git-config', '--file', $file, '--list')
+		or die "Failed to read: $!\n";
+
+	while (<$f>) {
+		/^submodule\.(\w+)\.(path|url)=(.*)$/
+			or die "Unknown submodule config line: $_\n";
+		if ($2 eq 'path') {
+			$submodule_path{$1} = $3;
+		} else {
+			$submodule_url{$1} = $3;
+		}
+	}
+	close $f;
+}
+
 # convert getopts specs for use by git config
 sub read_repo_config {
     # Split the string between characters, unless there is a ':'
@@ -110,7 +128,7 @@ sub read_repo_config {
 	}
 }
 
-my $opts = "haivmkuo:d:p:r:C:z:s:M:P:A:S:L:";
+my $opts = "haivmkuo:d:p:r:C:z:s:M:P:A:S:L:E:";
 read_repo_config($opts);
 Getopt::Long::Configure( 'no_ignore_case', 'bundling' );
 
@@ -610,6 +628,10 @@ if ($opt_A) {
 	write_author_info("$git_dir/cvs-authors");
 }
 
+# read submodule info
+if ($opt_E) {
+	read_submodule_info($opt_E);
+}
 
 #
 # run cvsps into a file unless we are getting
@@ -664,16 +686,19 @@ open(CVS, "<$cvspsfile") or die $!;
 
 my $state = 0;
 
-sub update_index (\@\@) {
+sub update_index (\@\@\@) {
 	my $old = shift;
 	my $new = shift;
+	my $submodules = shift;
 	open(my $fh, '|-', qw(git-update-index -z --index-info))
 		or die "unable to open git-update-index: $!";
 	print $fh
 		(map { "0 0000000000000000000000000000000000000000\t$_\0" }
 			@$old),
 		(map { '100' . sprintf('%o', $_->[0]) . " $_->[1]\t$_->[2]\0" }
-			@$new)
+			@$new),
+		(map { "160000 commit $_->[0]\t$_->[1]\0" }
+			@$submodules)
 		or die "unable to write to git-update-index: $!";
 	close $fh
 		or die "unable to write to git-update-index: $!";
@@ -721,7 +746,41 @@ sub commit {
 	}
         $ENV{GIT_INDEX_FILE} = $index{$branch};
 
-	update_index(@old, @new);
+	# add .gitmodules and find submodule commits
+	my @submodules;
+	if ($opt_E) {
+		print "Update .gitmodules\n" if $opt_v;
+		my $pid = open(my $F, '-|');
+		die $! unless defined $pid;
+		if (!$pid) {
+		    exec("git-hash-object", "-w", $opt_E)
+				or die "Cannot create object: $!\n";
+		}
+		my $sha = <$F>;
+		chomp $sha;
+		close $F;
+		push(@new,[0644, $sha, ".gitmodules"]);
+
+		for my $m (keys %submodule_path) {
+			my $search_date = $date + (defined $opt_z ? $opt_z : 300);
+			$ENV{"GIT_DIR"} = $submodule_url{$m};
+			open(my $f, '-|', 'git-log', "--before=$search_date", '-1', '--pretty=format:%H %ct')
+				or die "Failed to extract submodule commit id: $!\n";
+			$ENV{"GIT_DIR"} = $git_dir;
+			my $line = <$f>;
+			close $f;
+			if (defined $line) {
+				$line =~ /^([\da-f]{40}) (\d+)$/ or die "Failed to parse submodule log: $line\n";
+				push @submodules, [$1, $submodule_path{$m}];
+				if ($opt_v) {
+					my $subdate = strftime("+0000 %Y-%m-%d %H:%M:%S",gmtime($2));
+					print "Updating submodule $m to $subdate\n";
+				}
+			}
+		}
+	}
+
+	update_index(@old, @new, @submodules);
 	@old = @new = ();
 	my $tree = write_tree();
 	my $parent = get_headref("$remote/$last_branch");
-- 
1.5.5.1.508.ga00f3


[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux