On 06/27, Jonathan Tan wrote: > During negotiation, fetch-pack eventually reports as "have" lines all > commits reachable from all refs. Allow the user to restrict the commits > sent in this way by providing a whitelist of tips; only the tips > themselves and their ancestors will be sent. > > This feature is only supported for protocols that support connect or > stateless-connect (such as HTTP with protocol v2). > > This will speed up negotiation when the repository has multiple > relatively independent branches (for example, when a repository > interacts with multiple repositories, such as with linux-next [1] and > torvalds/linux [2]), and the user knows which local branch is likely to > have commits in common with the upstream branch they are fetching. > > [1] https://kernel.googlesource.com/pub/scm/linux/kernel/git/next/linux-next/ > [2] https://kernel.googlesource.com/pub/scm/linux/kernel/git/torvalds/linux/ > > Signed-off-by: Jonathan Tan <jonathantanmy@xxxxxxxxxx> > --- > v2 is exactly the same as the original, except with user-facing > documentation in Documentation/fetch-options.txt. > > > What's the plan to expose this "feature" to end-users? There is no > > end-user facing documentation added by this patch, and in-code > > comments only talk about what (mechanical) effect the option has, > > but not when a user may want to use the feature, or how the user > > would best decide the set of commits to pass to this new option. > > Jonathan Nieder also mentioned this. Lack of documentation was an > oversight, sorry. I've added it in this version. > > > Would something like this > > > > git fetch $(git for-each-ref \ > > --format=--nego-tip="%(objectname)" \ > > refs/remotes/linux-next/) \ > > linux-next > > > > be an expected typical way to pull from one remote, exposing only > > the tips of refs we got from that remote and not the ones we > > obtained from other places? > > Yes, that is one way. Alternatively, if the user is only fetching one > branch, they may also want to specify a single branch. > --- > Documentation/fetch-options.txt | 12 +++++++ > builtin/fetch.c | 21 +++++++++++++ > fetch-pack.c | 19 ++++++++++-- > fetch-pack.h | 7 +++++ > t/t5510-fetch.sh | 55 +++++++++++++++++++++++++++++++++ > transport-helper.c | 3 ++ > transport.c | 1 + > transport.h | 10 ++++++ > 8 files changed, 126 insertions(+), 2 deletions(-) > > diff --git a/Documentation/fetch-options.txt b/Documentation/fetch-options.txt > index 97d3217df9..80c4c94595 100644 > --- a/Documentation/fetch-options.txt > +++ b/Documentation/fetch-options.txt > @@ -42,6 +42,18 @@ the current repository has the same history as the source repository. > .git/shallow. This option updates .git/shallow and accept such > refs. > > +--negotiation-tip:: > + By default, Git will report, to the server, commits reachable > + from all local refs to find common commits in an attempt to > + reduce the size of the to-be-received packfile. If specified, > + Git will only report commits reachable from the given commit. > + This is useful to speed up fetches when the user knows which > + local ref is likely to have commits in common with the > + upstream ref being fetched. This seems like a pretty difficult to use feature, requiring that I provide the actual OIDs. I think a much better UI would probably be to accept a number of different things ranging from exact OIDs to actual ref names or even better, allowing for ref-patterns which include globs. That way I can do the following: git fetch --negotiation-tip=refs/remotes/my-remote/* my-remote in order to easily limit the tips to all the refs I have from that particular remote. > ++ > +This option may be specified more than once; if so, Git will report > +commits reachable from any of the given commits. > + > ifndef::git-pull[] > --dry-run:: > Show what would be done, without making any changes. > diff --git a/builtin/fetch.c b/builtin/fetch.c > index ea5b9669ad..12daec0f3b 100644 > --- a/builtin/fetch.c > +++ b/builtin/fetch.c > @@ -63,6 +63,7 @@ static int shown_url = 0; > static struct refspec refmap = REFSPEC_INIT_FETCH; > static struct list_objects_filter_options filter_options; > static struct string_list server_options = STRING_LIST_INIT_DUP; > +static struct string_list negotiation_tip = STRING_LIST_INIT_NODUP; > > static int git_fetch_config(const char *k, const char *v, void *cb) > { > @@ -174,6 +175,8 @@ static struct option builtin_fetch_options[] = { > TRANSPORT_FAMILY_IPV4), > OPT_SET_INT('6', "ipv6", &family, N_("use IPv6 addresses only"), > TRANSPORT_FAMILY_IPV6), > + OPT_STRING_LIST(0, "negotiation-tip", &negotiation_tip, N_("revision"), > + N_("report that we have only objects reachable from this object")), > OPT_PARSE_LIST_OBJECTS_FILTER(&filter_options), > OPT_END() > }; > @@ -1075,6 +1078,24 @@ static struct transport *prepare_transport(struct remote *remote, int deepen) > filter_options.filter_spec); > set_option(transport, TRANS_OPT_FROM_PROMISOR, "1"); > } > + if (negotiation_tip.nr) { > + struct oid_array *oids; > + if (transport->smart_options) { > + int i; > + oids = xcalloc(1, sizeof(*oids)); > + for (i = 0; i < negotiation_tip.nr; i++) { > + struct object_id oid; > + if (get_oid(negotiation_tip.items[i].string, > + &oid)) > + die("%s is not a valid object", > + negotiation_tip.items[i].string); > + oid_array_append(oids, &oid); > + } > + transport->smart_options->negotiation_tips = oids; > + } else { > + warning("Ignoring --negotiation-tip because the protocol does not support it."); > + } > + } > return transport; > } > > diff --git a/fetch-pack.c b/fetch-pack.c > index ba12085c4a..c66bd49bd1 100644 > --- a/fetch-pack.c > +++ b/fetch-pack.c > @@ -213,6 +213,21 @@ static int next_flush(int stateless_rpc, int count) > return count; > } > > +static void mark_tips(struct fetch_negotiator *negotiator, > + const struct oid_array *negotiation_tips) > +{ > + int i; > + if (!negotiation_tips) { > + for_each_ref(rev_list_insert_ref_oid, negotiator); > + return; > + } > + > + for (i = 0; i < negotiation_tips->nr; i++) > + rev_list_insert_ref(negotiator, NULL, > + &negotiation_tips->oid[i]); > + return; > +} > + > static int find_common(struct fetch_negotiator *negotiator, > struct fetch_pack_args *args, > int fd[2], struct object_id *result_oid, > @@ -230,7 +245,7 @@ static int find_common(struct fetch_negotiator *negotiator, > if (args->stateless_rpc && multi_ack == 1) > die(_("--stateless-rpc requires multi_ack_detailed")); > > - for_each_ref(rev_list_insert_ref_oid, negotiator); > + mark_tips(negotiator, args->negotiation_tips); > for_each_cached_alternate(negotiator, insert_one_alternate_object); > > fetching = 0; > @@ -1295,7 +1310,7 @@ static struct ref *do_fetch_pack_v2(struct fetch_pack_args *args, > else > state = FETCH_SEND_REQUEST; > > - for_each_ref(rev_list_insert_ref_oid, &negotiator); > + mark_tips(&negotiator, args->negotiation_tips); > for_each_cached_alternate(&negotiator, > insert_one_alternate_object); > break; > diff --git a/fetch-pack.h b/fetch-pack.h > index bb45a366a8..1859ee9275 100644 > --- a/fetch-pack.h > +++ b/fetch-pack.h > @@ -16,6 +16,13 @@ struct fetch_pack_args { > const struct string_list *deepen_not; > struct list_objects_filter_options filter_options; > const struct string_list *server_options; > + > + /* > + * If not NULL, during packfile negotiation, fetch-pack will send "have" > + * lines only with these tips and their ancestors. > + */ > + const struct oid_array *negotiation_tips; > + > unsigned deepen_relative:1; > unsigned quiet:1; > unsigned keep_pack:1; > diff --git a/t/t5510-fetch.sh b/t/t5510-fetch.sh > index e402aee6a2..ea1b5e53c1 100755 > --- a/t/t5510-fetch.sh > +++ b/t/t5510-fetch.sh > @@ -865,4 +865,59 @@ test_expect_success C_LOCALE_OUTPUT 'fetch compact output' ' > test_cmp expect actual > ' > > +negotiator_tip () { > + SERVER="$1" > + URL="$2" > + USE_PROTOCOL_V2="$3" > + > + rm -rf "$SERVER" client && > + git init "$SERVER" && > + test_commit -C "$SERVER" alpha_1 && > + test_commit -C "$SERVER" alpha_2 && > + git -C "$SERVER" checkout --orphan beta && > + test_commit -C "$SERVER" beta_1 && > + test_commit -C "$SERVER" beta_2 && > + > + git clone "$URL" client && > + > + if [ "$USE_PROTOCOL_V2" -eq 1 ] > + then > + git -C "$SERVER" config protocol.version 2 > + git -C client config protocol.version 2 > + fi && > + > + test_commit -C "$SERVER" beta_s && > + git -C "$SERVER" checkout master && > + test_commit -C "$SERVER" alpha_s && > + git -C "$SERVER" tag -d alpha_1 alpha_2 beta_1 beta_2 && > + > + GIT_TRACE_PACKET="$(pwd)/trace" git -C client fetch \ > + --negotiation-tip=alpha_1 --negotiation-tip=beta_1 \ > + origin alpha_s beta_s && > + > + # Ensure that {alpha,beta}_1 are sent as "have", but not {alpha_beta}_2 > + ALPHA_1=$(git -C client rev-parse alpha_1) && > + grep "fetch> have $ALPHA_1" trace && > + BETA_1=$(git -C client rev-parse beta_1) && > + grep "fetch> have $BETA_1" trace && > + ALPHA_2=$(git -C client rev-parse alpha_2) && > + ! grep "fetch> have $ALPHA_2" trace && > + BETA_2=$(git -C client rev-parse beta_2) && > + ! grep "fetch> have $BETA_2" trace > +} > + > +test_expect_success '--negotiator-tip limits "have" lines sent' ' > + negotiator_tip server server 0 > +' > + > +. "$TEST_DIRECTORY"/lib-httpd.sh > +start_httpd > + > +test_expect_success 'fetch with --filter=blob:limit=0 and HTTP' ' > + negotiator_tip "$HTTPD_DOCUMENT_ROOT_PATH/server" \ > + "$HTTPD_URL/smart/server" 1 > +' > + > +stop_httpd > + > test_done > diff --git a/transport-helper.c b/transport-helper.c > index 1f8ff7e942..ad8f7c7726 100644 > --- a/transport-helper.c > +++ b/transport-helper.c > @@ -684,6 +684,9 @@ static int fetch(struct transport *transport, > transport, "filter", > data->transport_options.filter_options.filter_spec); > > + if (data->transport_options.negotiation_tips) > + warning("Ignoring --negotiation-tip because the protocol does not support it."); > + > if (data->fetch) > return fetch_with_fetch(transport, nr_heads, to_fetch); > > diff --git a/transport.c b/transport.c > index a32da30dee..9f10f8ad9f 100644 > --- a/transport.c > +++ b/transport.c > @@ -318,6 +318,7 @@ static int fetch_refs_via_pack(struct transport *transport, > args.filter_options = data->options.filter_options; > args.stateless_rpc = transport->stateless_rpc; > args.server_options = transport->server_options; > + args.negotiation_tips = data->options.negotiation_tips; > > if (!data->got_remote_heads) > refs_tmp = get_refs_via_connect(transport, 0, NULL); > diff --git a/transport.h b/transport.h > index 7792b08582..d31be5be63 100644 > --- a/transport.h > +++ b/transport.h > @@ -25,6 +25,16 @@ struct git_transport_options { > const char *receivepack; > struct push_cas_option *cas; > struct list_objects_filter_options filter_options; > + > + /* > + * This is only used during fetch. See the documentation of > + * negotiation_tips in struct fetch_pack_args. > + * > + * This field is only supported by transports that support connect or > + * stateless_connect. Set this field directly instead of using > + * transport_set_option(). > + */ > + struct oid_array *negotiation_tips; > }; > > enum transport_family { > -- > 2.18.0.rc2.346.g013aa6912e-goog > -- Brandon Williams