On Fri, Dec 03 2021, SZEDER Gábor wrote: > On Mon, Oct 25, 2021 at 02:38:04AM -0700, Junio C Hamano wrote: >> SZEDER Gábor <szeder.dev@xxxxxxxxx> writes: >> >> > I still very much dislike the idea of a BUG() in the progress code >> > that can trigger outside of the test suite, because the progress line >> > is only a UI gimmick and not a crucial part of any Git operation, and >> > even though a progress line might be buggy, the underlying Git >> > operation is not affected by it and would still finish successfully, >> > as was the case with the dozen of so progress line bugs in the past. >> >> I too recall that we have fixed numerous bugs in the past year in >> the area, but weren't they kind of obvious ones _once_ they are >> pointed out at you (e.g. progress never reaching to 100%)? Yet the >> developers have failed to catch them because their eyes would coast >> over without paying attention to them, exactly because the progress >> bar is merely a UI gimmick. >> >> I haven't formed a firm opinion on this yet, but I think the idea >> behind these BUG() is to help such problems be caught while they are >> still in the lab. You may not notice when your live progress bar >> behaved a bit funny, but if you hit a BUG(), that would be squarely >> in your face and you cannot ignore it. > > The "outside of the test suite" part is important. Running the test > suite with a GIT_TEST_CHECK_PROGRESS knob is sufficient to catch these > issues as my inital two patch series have shown at the very beginning > of this thread before Ævar hijacked them. FWIW I didn't hijack and adjust your patches to be BUG() instead of a test mode. We came up with more-or-less the same thing independently around the same time. You submitted yours on the 20th of June, but I hacked up mine on top af on already-WIP series I had ~5 days earlier in response to another discussion about progress.c[1]. You submitter yours first, and I then replied to the effect of "here's a similar thing I had locally, RFC"[2]. Or do you mean the smaller "fix bogus counting" fixed that are now landed[3]? I just wanted to push that forward since it seemed you weren't per[4]. Sorry if I stepped on any toes there... > Interested Git developers > can even enable it in their .profile if they wish; I did so with > GIT_TEST_SPLIT_INDEX for a while to expose some of my patches to more > real-world use. > > However, I think it's conceptually a bad idea to abort with a BUG() an > otherwise successful Git operation by default, when that bug happens > to be in such an ancillary component as the progress display, and find > the justifications given in the commit message unconvincing. I think it's important to get to the root of what we really disagree about here. We hard die on all sorts of trivial things in BUG() already, as paging through this shows: git grep -w -F 'BUG(' I.e. the general criteria for BUG() isn't whether we could hypothetically recover from the scenario. Which, if you adjust the code around many existing BUG()s to trigger the "bad thing" (as it were..) is the case. I haven't done any exhaustive checking, but I'd bet it's more common than not, or at least something like 1/4 or 1/8 of them. Rather it's a combination of: A. This should absolutely not happen B. We are sure (or are sure as we can be) that it doesn't C. We think if it does, we'll know/really want to know via tests/early integration testing. D. We mark it with a BUG() because we're as confident as we reasonably can be in A and B I took your earlier comments to mean that you agreed on "A", but you weren't so sure about "B". I agree that without confidence about "B" that having "D" would be overzealous. But here with your mention of "conceptually a bad idea to[...]" I'm not so sure, i.e. it seems like you're saying that you're categorically opposed to such use of BUG(). I think that's a fair stance to take if that's what you're going for, but per the above I also think it's true to say that a lot of our existing use of that API doesn't match that worldview. I.e. it's not an UNRECOVARABLE_BUG(), but SHOULD_NOT_ESCAPE_DEVELOPMENT_BUG(). I think that in this case we should be confident that we've got "B", as explained in detail in the upthread patch[5]. I would be exceedingly paranoid about more exhaustive checks, e.g. I don't think we've got enough coverage to have BUG() checks for some of the things you added test-only asserts (which would be good) for in your initial series. I.e. asserting whether we hit 99% and not 100%, whether display_progress() "misses" updates, or if we miss a call to stop_progress(), or whatever else one might assert. I've been running my personal git with many of those sorts of pedantic assertions for a while now, but I'm still not confident those changes are OK to submit for inclusion. I very much share your concerns in that area. But I think (and [5] argues) that it's *much* easier to be confident about the narrow BUG() assertion being added here. I.e. it's trivial to discover that almost all callers of progress.c are never going to start more than one progress bar ever, so we can discard those out of hand. For the rest they're rather easily auditable. But clearly you disagree, and I thinks it helps neither of us to just repeat our points, which is not what I'm trying to do here, but to get to the following: - Can you cite specifically what about [5] you think wouldn't catch any cases where we couldn't be certain about the BUG(). I.e. it outlines the sort of testing I did, which involved forcing all hidden progress bars to ignore their existing isatty() checks, adding more pedantic asserts to narrow down potentially affected callers etc. Do you think that testing could have missed something? How & why? - Related to that, are there cases where you would agree that we've got "B", as opposed to others where we don't? As I've argued I do think these patches are ready as-is, but one alternative way forward with them would be to add the BUG() to everything, but whitelist those current callers that have >1 progress bar. Then as a (slow) follow-up proceed to un-whitelist those one at a time. I'm not familiar with the GIT_TEST_SPLIT_INDEX changes you're mentioning, but I'd think that those cases would have been more complex (the code flow around index-related stuff) than the progress API users. 1. https://lore.kernel.org/git/87o8c8z105.fsf@xxxxxxxxxxxxxxxxxxx/ 2. https://lore.kernel.org/git/cover-00.25-00000000000-20210623T155626Z-avarab@xxxxxxxxx/ 3. https://lore.kernel.org/git/cover-v4-0.2-00000000000-20210909T010722Z-avarab@xxxxxxxxx/ 4. https://lore.kernel.org/git/cover-0.3-0000000000-20210722T121801Z-avarab@xxxxxxxxx/ 5. https://lore.kernel.org/git/patch-v6-8.8-bff919994b5-20211102T122507Z-avarab@xxxxxxxxx/