I tried the patch but it doesn't appear to have helped :( Clone time with it was ~32m. Do you all by any chance have a tool to obfuscate a repository? Probably I still wouldn't be permitted to distribute it but might make the option slightly more palatable. Anything else that I can do to help debug this problem? On Thu, Nov 8, 2012 at 9:56 AM, Jeff King <peff@xxxxxxxx> wrote: > On Wed, Nov 07, 2012 at 11:32:37AM -0600, Uri Moszkowicz wrote: > >> #4 parse_object (sha1=0xb0ee98 >> "\017C\205Wj\001`\254\356\307Z\332\367\353\233.\375P}D") at >> object.c:212 >> #5 0x00000000004ae9ec in handle_one_ref (path=0xb0eec0 >> "refs/tags/<removed>", sha1=0xb0ee98 >> "\017C\205Wj\001`\254\356\307Z\332\367\353\233.\375P}D", flags=2, >> cb_data=<optimized out>) at pack-refs. >> >> [...] >> >> It looks like handle_one_ref() is called for each ref and most result >> in a call to read_sha1_file(). > > Right. When generating the packed-refs file, we include the "peeled" > reference for a tag (i.e., the commit that a tag object points to). So > we have to actually read any tag objects to get the value. > > The upload-pack program generates a similar list, and I recently added > some optimizations. This code path could benefit from some of them by > using "peel_ref" instead of hand-rolling the tag dereferencing. The main > optimization, though, is reusing peeled values that are already in > packed-refs; we would probably need some additional magic to reuse the > values from the source repository. > > However: > >> It only takes a second or so for each call but when you have thousands >> of them (one for each ref) it adds up. > > I am more concerned that it takes a second to read each tag. Even in my > pathological tests for optimizing upload-pack, peeling 50,000 refs took > only half a second. > >> Adding --single-branch --branch <branch> doesn't appear to help as >> it is implemented afterwards. I would like to debug this problem >> further but am not familiar enough with the implementation to know >> what the next step is. Can anyone offer some suggestions? I don't see >> why a clone should be dependent on an O(#refs) operations. > > Does this patch help? In a sample repo with 5000 annotated tags, it > drops my local clone time from 0.20s to 0.11s. Which is a big percentage > speedup, but this code isn't taking a long time in the first place for > me. > > --- > diff --git a/pack-refs.c b/pack-refs.c > index f09a054..3344749 100644 > --- a/pack-refs.c > +++ b/pack-refs.c > @@ -40,13 +40,9 @@ static int handle_one_ref(const char *path, const unsigned char *sha1, > > fprintf(cb->refs_file, "%s %s\n", sha1_to_hex(sha1), path); > if (is_tag_ref) { > - struct object *o = parse_object(sha1); > - if (o->type == OBJ_TAG) { > - o = deref_tag(o, path, 0); > - if (o) > - fprintf(cb->refs_file, "^%s\n", > - sha1_to_hex(o->sha1)); > - } > + unsigned char peeled[20]; > + if (!peel_ref(path, peeled)) > + fprintf(cb->refs_file, "^%s\n", sha1_to_hex(peeled)); > } > > if ((cb->flags & PACK_REFS_PRUNE) && !do_not_prune(flags)) { -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html