On Fri, Mar 30, 2007 at 07:01:27PM -0700, Steven Grimm wrote: > I've imported the full history of a large project from Subversion using > the latest git-svn. The resulting repo is huge, and I believe it's due > in large part to a series of big tar.gz files that got checked into the > Subversion repository by mistake early in the project's history. They > were subsequently removed from svn, but of course git-svn grabs them and > puts them in my local history. > > Is there any way to excise those files? They are of no interest to us > now -- they were data files for a third-party application we ended up > not using -- and they're making git look bad in the disk usage department. > > I believe this has been asked before in the context of removing > copyrighted content from public repositories. However, I have a twist > that may make it easier: nobody else has cloned this repository yet. I > am free to rewrite history with no risk of messing up any downstream > repositories, and I don't have to worry about propagating the deletions > out to anyone. I just don't know how to do it (assuming it's doable at all.) It's painful to rewrite history, since you end up needing to rewrite every single commit after the point where you've tampered with time to fix up the parent commit ID. Are you planning on doing a one-shot import, or are you hoping to be able to do bidirectional gatewaying between svn and git? If you want to do the latter, rewriting history is going to be very painful if you want the bidirectional gateway to work afterwards. If you just want to do a one-way import, it's probably going to be much easier to modify whatever importer you use to not import the big files in the first place. - Ted - To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html