Markus Schiltknecht <markus@xxxxxxxxxx> wrote: > Shawn Pearce wrote: > >I don't know how the Monotone guys feel about it but I think Git > >is happy with the data in any order, just so long as the dependency > >chains aren't fed out of order. Which I think nearly all changeset > >based SCMs would have an issue with. So we should be just fine > >with the current chronological order produced by cvs2svn. > > I'd vote for splitting into file data (and delta / patches) import and > metadata import (author, changelog, DAG). > > Monotone would be happiest if the file data were sent one file after > another and (inside each file) in the order of each file's single > history. That guarantees good import performance for monotone. I imagine > it's about the same for git. And if you have to somehow cache the files > anyway, subversion will benefit, too. (Well, at least the cache will > thank us with good performance). > > After all file data has been delivered, the metadata can be delivered. > As neigther monotone nor git care much if they are chronological across > branches, I'd vote for doing it that way. Right. I think that one of the cvs2svn guys had the right idea here. Provide two hooks: one early during the RCS file parse which supplies a backend each full text file revision and another during the very last stage which includes the "file" in the metadata stream for commit. This would give Git and Monotone a way to grab the full text for each file and stream them out up front, then include only a "token" in the metadata stream which identifies the specific revision. Meanwhile SVN can either cache the file revision during the early part or ignore it, then dump out the full content during the metadata. As it happens Git doesn't care what order the file revisions come in. If we don't repack the imported data we would prefer to get the revisions in newest->oldest order so we can delta the older versions against the newer versions (like RCS). This is also happens to be the fastest way to extract the revision data from RCS. On the other hand from what I understand of Monotone it needs the revisions in oldest->newest order, as does SVN. Doing both orderings in cvs2noncvs is probably ugly. Doing just oldest->newest (since 2/3 backends want that) would be acceptable but would slow down Git imports as the RCS parsing overhead would be much higher. -- Shawn. - To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html