On Tue, Nov 5, 2019 at 3:04 PM Jonathan Tan <jonathantanmy@xxxxxxxxxx> wrote: > > First of all, thanks for taking a year-old patch and updating it. > Overall, this looks good. I have some minor comments, but it might be > best to wait until someone more experienced with bisect.c takes a look > too. > > Your commit message title should be of the form "<component>: <change>", > e.g.: > > rev-list: support --first-parent with --bisect* > > > Not all repository maintainers expect every commit to pass tests, only > > testing the merge commits. Currently bisection assumes every commit is > > of interest. The highly-requested --bisect --first-parent feature imbues > > git with the same indifference to minutiae when the option is set, so > > that it casually riffles through commits, throwing aside mountains of > > irrelevant data when looking for a breaking change. Further refinement > > of where breaks occurred can be gained by bisecting over the merge's > > range. > > I would be much more laconic (in particular, omitting subjective terms > like "minutiae" and "mountains of irrelevant data"), but perhaps that is > just a matter of subjective style. > > > Note, bisecting on --first-parent becomes part of findall's previously > > existing pass-through as an "option state" flag. > > I don't understand this part. > > > In order to limit possible obfuscation of bisect operations resulting > > from the addition of new flags, some extra documentation was folded in > > to the patch. > > What is being obfuscated, and what extra documentation was "folded"? > > Also, clarify in the commit message somewhere that this commit does not > change the behavior of "git bisect". > > As for the diff, besides my comments below, a change in the user-facing > documentation of "rev-list" is needed, since --bisect and --first-parent > now work together. Will do. > > > -static int count_interesting_parents(struct commit *commit) > > +static int count_interesting_parents(struct commit *commit, unsigned bisect_flags) > > { > > struct commit_list *p; > > int count; > > > > for (count = 0, p = commit->parents; p; p = p->next) { > > - if (p->item->object.flags & UNINTERESTING) > > - continue; > > - count++; > > + if (!(p->item->object.flags & UNINTERESTING)) > > + count++; > > + if (bisect_flags & BISECT_FIRST_PARENT) > > + break; > > } > > return count; > > } > > (Note that I'm writing my thoughts as I go along to aid future > reviewers, and to show the author (you) how I'm understanding the > patch.) > > We only take into account the first parent - straightforward enough. > I'll have to see how this function is used to ensure that this change is > correct. > > > static void show_list(const char *debug, int counted, int nr, > > - struct commit_list *list) > > + struct commit_list *list, unsigned bisect_flags) > > { > > struct commit_list *p; > > What is the purpose of this change? bisect_flags is never used anywhere > in show_list(). Insufficiently cleaned-up change cruft! Thanks for catching it. > This is also in a loop. As can be seen at the top of the diff ("if (0 <= > weight(p))"), this only operates on commits with negative weights. > > Originally, the inner loop advances until a non-UNINTERESTING parent > with a non-negative weight. If no such parent is found, at the end of > the loop, q is NULL. The added code effectively replicates what's going > on, but ignoring any parents after the first. > > A previous reviewer [1] wanted an explanation for this part, so thanks > for attempting to do that. But I don't understand the explanation - > firstly, it is not a question of "can" (optional) but of "will" > (mandatory), and it is not only UNINTERESTING that determines skipping, > but weight as well. > > I would write the entire section like this (remember to wrap the lines): > > if (first_parent) { > q = p->item->parents; > if (q && ((q->item->object.flags & UNINTERESTING) || weight(q) < 0)) > q = NULL; > } else { > /* > * Find an interesting parent with non-negative weight. > */ > for (...) { > } > } > "uninteresting" was meant in the colloquial sense rather than the CONSTANT, but fair, it's probably just confusing. > Looking at the rest of do_find_bisection(): > > - I don't see any other parts that would be affected by only calculating > weights based on the first parent, so that's fine. > > - There are some early returns that assume that "list" is generated by > iterating only over first parents. do_find_bisection() is called only > by find_bisection(), and the latter is called only by cmd_rev_list() > and bisect_next_all(). The former is fine, but I will discuss the > latter later. > > - I do see some unclear parts (in particular, counter might not reach nr > if any of the weights are 0 and if the "weight_set(p, weight(q));" > line is reached, potentially resulting in an infinite loop) but that > is unrelated to this patch, so don't worry about it. > > [1] https://public-inbox.org/git/nycvar.QRO.7.76.6.1808281512240.73@xxxxxxxxxxxxxxxxx/ > > > @@ -964,7 +981,12 @@ int bisect_next_all(struct repository *r, const char *prefix, int no_checkout) > > > > bisect_common(&revs); > > > > - find_bisection(&revs.commits, &reaches, &all, !!skipped_revs.nr); > > + if (skipped_revs.nr) > > + bisect_flags |= BISECT_FIND_ALL; > > + if (revs.first_parent_only) > > + bisect_flags |= BISECT_FIRST_PARENT; > > + > > + find_bisection(&revs.commits, &reaches, &all, bisect_flags); > > revs.commits = managed_skipped(revs.commits, &tried); > > > > if (!revs.commits) { > > I don't see how revs.first_parent_only is ever set in this function. If > it's never set, undo this change, since this code is never executed. In this function, we call bisect_rev_setup() using the revs struct we made, which then calls setup_revisions() on the revs, which appears to call handle_revision_opt() with that struct again,which finally is allowed to set revs->first_parent_only = 1; in revision.c. So unless I am horribly misreading something, we do set it. > > diff --git a/t/t6000-rev-list-misc.sh b/t/t6000-rev-list-misc.sh > > index b8cf82349b..95949e4ff1 100755 > > --- a/t/t6000-rev-list-misc.sh > > +++ b/t/t6000-rev-list-misc.sh > > @@ -122,8 +122,8 @@ test_expect_success 'rev-list can negate index objects' ' > > test_cmp expect actual > > ' > > > > -test_expect_success '--bisect and --first-parent can not be combined' ' > > - test_must_fail git rev-list --bisect --first-parent HEAD > > +test_expect_success '--bisect and --first-parent CAN be combined' ' > > + git rev-list --bisect --first-parent HEAD > > ' > > > > I think this test can just be deleted. It is tested in t6002. Sure, I'll drop it. > > diff --git a/t/t6002-rev-list-bisect.sh b/t/t6002-rev-list-bisect.sh > > index a661408038..6caf2af650 100755 > > --- a/t/t6002-rev-list-bisect.sh > > +++ b/t/t6002-rev-list-bisect.sh > > @@ -263,4 +263,58 @@ test_expect_success 'rev-parse --bisect can default to good/bad refs' ' > > test_cmp expect.sorted actual.sorted > > ' > > > > +# --first-parent tests > > + > > +# --bisect --first-parent should pluck out the middle. > > +printf "%s\n" e4 | > > +test_output_expect_success "--bisect --first-parent" ' > > + git rev-list --bisect --first-parent E ^F > > +' > > + > > +printf "%s\n" E e1 e2 e3 e4 e5 e6 e7 e8 | > > +test_output_expect_success "--first-parent" ' > > + git rev-list --first-parent E ^F > > +' > > + > > +test_output_expect_success '--bisect-vars --first-parent' ' > > + git rev-list --bisect-vars --first-parent E ^F > > +' <<-EOF > > + bisect_rev='e5' > > + bisect_nr=4 > > + bisect_good=4 > > + bisect_bad=3 > > + bisect_all=9 > > + bisect_steps=2 > > +EOF > > Looks good, except for the middle test - that should already be working > even before the current patch, right? If there's a reason for including > it anyway, mention it in the commit message. Also cutting, I think. > > +test_expect_success '--bisect-all --first-parent returns correct order' ' > > + git rev-list --bisect-all --first-parent E ^F >actual && > > + > > + # Make sure the entries are sorted in the dist order > > + sed -e "s/.*dist=\([0-9]\).*/\1/" actual >actual.dists && > > + sort -r -n actual.dists >actual.dists.sorted && > > + test_cmp actual.dists.sorted actual.dists > > +' > > + > > +# NEEDSWORK: this test could afford being hardened against other > > +# changes in the same file. > > +test_expect_success '--bisect-all --first-parent compares correctly' ' > > + cat >expect <<-EOF && > > + $(git rev-parse tags/e5) (tag: e5, dist=4) > > + $(git rev-parse tags/e4) (tag: e4, dist=4) > > + $(git rev-parse tags/e6) (tag: e6, dist=3) > > + $(git rev-parse tags/e3) (tag: e3, dist=3) > > + $(git rev-parse tags/e7) (tag: e7, dist=2) > > + $(git rev-parse tags/e2) (tag: e2, dist=2) > > + $(git rev-parse tags/e8) (tag: e8, dist=1) > > + $(git rev-parse tags/e1) (tag: e1, dist=1) > > + $(git rev-parse tags/E) (tag: E, dist=0) > > +EOF > > + > > +git rev-list --bisect-all --first-parent E ^F >actual && > > + sort actual >actual.sorted && > > + sort expect >expect.sorted && > > + test_cmp expect.sorted actual.sorted > > +' > > I think these 2 tests can be combined, since the latter also checks the > dists. Also, correct the indentation of the latter test. I understand they are similar tests, but... Is there a tangible reason for combining them? Especially when their logic can live and breathe completely separately, compacting tests reduces the resolution of the information we can extract from failure. I would rather simply drop one and preserve 1 test = 1 data point.