On 01/17/2012 04:07 PM, Michael Haggerty wrote: > On 12/14/2011 12:24 AM, Junio C Hamano wrote: >> Michael Haggerty <mhagger@xxxxxxxxxxxx> writes: >>> ... But there are so many calls to the >>> for_each_ref*() family of functions that I wasn't able to determine >>> exactly which should allow for extra refs and which shouldn't. >> >> Only the ones that follow add_extra_ref() in the control flow. >> >> builtin/clone.c adds them in setup_reference() to deal with --reference. >> The objects known to be complete in these repositories we borrow from >> need to be marked complete on our end (i.e. clone does not have to fetch) >> and transport_fetch_refs() that eventually goes to fetch_refs_via_pack() >> that calls fetch_pack() uses this information. All three for_each_ref() >> calls in builtin/fetch-pack.c are about "what are the objects that we know >> are complete?" and needs to pay attention to extra refs. >> >> Having said that, I have a slight suspicion that you might be able to >> eliminate this one in clone. setup_reference() adds the reference >> repository to the $GIT_DIR/objects/info/alternates, and the fetch logic >> already knows to account for the refs in alternate repositories via >> insert_alternate_refs() callchain. > > If I comment out the call from add_one_reference() to add_extra_ref() > then I get a single failure, in t5700: > > not ok - 8 fetched no objects > # ! grep "^want" "$U" > > So your suspicion does not seem to be borne out (at least not in the > naivest form). > > Still studying... I finally had some time to get back to this puzzle. It took me a while to narrow down the problem, and I still don't understand it entirely. It seems like Junio should be right that setup_reference() doesn't need to add the alternate references to extra_refs. Indeed, if I remove the call to add_extra_ref(), I see the alternate references being added to ref_list via insert_one_alternate_ref(). However, in the context of t5700, clone nevertheless sends "want" lines for one of those references and test 8 fails. Something is screwy. So how do the extra_refs prevent the "want" lines from being emitted? It turns out that when the alternate refs *are* added as extra_refs, then find_common() is not called at all. do_fetch_pack() calls everything_local(), which returns true, and do_fetch_pack() skips over the call to find_common(). So ISTM that there are two problems: First problem: everything_local() seems to be either broken or used incorrectly. I can't decide which because I don't know what its semantics are *supposed* to be. If everything_local() is trying to check that the references are all in the local repository itself, then it is incorrect for clone to enter alternates into extra_refs because everything_local() then mistakes them for local. If everything_local() is trying to check that the references are in the local repository plus alternates, then it is incorrect that everything_local() doesn't consider alternate references in its determination. My guess is that this is the case, and that something like the following might be the fix: > ----------------------------- builtin/fetch-pack.c ----------------------------- > index 9500f35..4257a8d 100644 > @@ -581,6 +581,11 @@ static void filter_refs(struct ref **refs, int nr_match, char **match) > *refs = newlist; > } > > +static void mark_alternate_complete(const struct ref *ref, void *unused) > +{ > + mark_complete(NULL, ref->old_sha1, 0, NULL); > +} > + > static int everything_local(struct ref **refs, int nr_match, char **match) > { > struct ref *ref; > @@ -609,6 +614,7 @@ static int everything_local(struct ref **refs, int nr_match, char **match) > > if (!args.depth) { > for_each_ref(mark_complete, NULL); > + for_each_alternate_ref(mark_alternate_complete, NULL); > if (cutoff) > mark_recent_complete_commits(cutoff); > } With this patch, then the full test suite passes even if I take out the code that adds the alternate refs to extra_refs. Second problem: it seems like the presence of alternate refs is not suppressing the "want" lines for those refs in all cases. Strangely, in the case of t5700, adding the two alternate refs to ref_list insert_one_alternate_ref() causes the "want" line for one of them to be suppressed, but not for the other. Specifically: (without the above patch) I commented out the call to add_extra_ref() in clone.c:add_one_reference(), then ran t5700 through step 8 then aborted. insert_one_alternate_ref() was called four times: insert_one_alternate_ref(ccc25a1f9655742174c93f48f616bea8ad0bc6ff) insert_one_alternate_ref(ccc25a1f9655742174c93f48f616bea8ad0bc6ff) insert_one_alternate_ref(5355551c5a927a2b6349505ada2da4bb702c0a49) insert_one_alternate_ref(5355551c5a927a2b6349505ada2da4bb702c0a49) (The duplication here seems strange.) However, UPLOAD_LOG still contains "want" lines (2 of them!) for one of the commits: #S #E #S #S #E want 5355551c5a927a2b6349505ada2da4bb702c0a49 multi_ack_detailed side-band-64k thin-pack ofs-delta want 5355551c5a927a2b6349505ada2da4bb702c0a49 #E The "5355551c" object corresponds to refs/remotes/origin/master in the alternate object store: $ (cd B; git for-each-ref) ccc25a1f9655742174c93f48f616bea8ad0bc6ff commit refs/heads/master 5355551c5a927a2b6349505ada2da4bb702c0a49 commit refs/remotes/origin/HEAD 5355551c5a927a2b6349505ada2da4bb702c0a49 commit refs/remotes/origin/master It seems to me that even in the absence of short-circuiting due to everything_local() returning true, the presence of the alternate refs should be suppressing the "want" lines for those references. I have some patches that seem to work (and get rid of extra_refs entirely, hurrah!), but I don't want to submit them until I understand what the correct behavior is *supposed* to be. Michael -- Michael Haggerty mhagger@xxxxxxxxxxxx http://softwareswirl.blogspot.com/ -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html