Tim Ansell <mithro@xxxxxxxxxx> writes: > Last week at the GitTogether I lead some discussions about how we could > make Git better support large media repositories (which is one area > where Subversion still make sense). It was suggested that I post to this > list to get a discussion going. > > The general idea is that we always clone the complete meta-data (tags, > commits and trees) and then only clone blobs when they are needed (using > something like alternates). This allows us to support shallow, narrow > and sparse checkouts while still being able to perform operations such > as committing and merging. > > You can find a copy of the summary presentation at > http://www.thousandparsec.net/~tim/media+git.pdf > > I have started working on adapting git to check a remote http alternate > to provide a proof of concept. > > I appreciate any help or suggestions. Dana How (CC-ed) worked on better support for large files, but in corporate setting. The solution that was the result of all discussion and all patches (not all accpeted) was to create kept packfile for those large files, and share those packfiles (perhaps via alternates) using network filesystem, instead of keeping separate copies and trasferring them on fetch / push. >From what I remember there was one serious attempt (by serious I mean here with patches) to add 'lazy clone' / 'sparse clone' / 'remote alternates', using some kind of "stub" objects and trasferring objects lazily. This patch was fairly intrusive, and didn't get accepted. I think you can find it in archives. Unfortunately I haven't bookmarked this thread... The problem with lazy clone is that git assumes in many places that if it has some object, it has all its dependencies. Lazy clone (on-demand object loading) breaks this assumption... although in your case (only blobs of large size can be asked to be loaded lazily) it is migitated somehow. I also think that you would have to have 'sparse checkout' support. If you don't have blob in object repository (and don't want to have it there), you can not check it out. Fortunately this feature is quite alive, and worked on by Duy (pclouds), see "What's cooking..." (nd/narrow branch in 'pu'). HTH -- Jakub Narebski Poland ShadeHawk on #git -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html