On Fri, Apr 2, 2021 at 2:01 PM Junio C Hamano <gitster@xxxxxxxxx> wrote: > > Phillip Wood <phillip.wood123@xxxxxxxxx> writes: > > > I think we would save a lot by only syncing the state to disk when we > > stop or run an exec command (the state needs to be synced so exec > > commands can alter the todo list). In those cases we need to write the > > index and possibly run an external process so writing a couple of > > files is probably insignificant. > > The optimization opportunity of this may be a lot smaller than you > would think---you must cater to not just exec but hook scripts that > are run while a new commit is made, which means every step you'd > need to write anyway. >From Documentation/git-rebase.txt, right after discussing how the backends disagree on which hooks are called and how they are called: ...In each case, the calling of these hooks was by accident of implementation rather than by design (both backends were originally implemented as shell scripts and happened to invoke other commands like 'git checkout' or 'git commit' that would call the hooks). Both backends should have the same behavior, though it is not entirely clear which, if any, is correct. We will likely make rebase stop calling either of these hooks in the future. Even if others now disagree with the above, I know I can get a huge speedup by changing sequencer to stop per-commit wasteful work (stop forking processes like "git commit", don't write control structures if the rebase hasn't ended or hit a conflict, don't update the working copy or index or reflog). It's enough of a speedup that if backward compatibility won't allow such a method to be used by default, I'd still make yet another backend that could be optionally used. And I'd have the default rebase and cherry-pick start printing annoying deprecation notices so that users become aware of a faster option. Should I go that route? t/helpers/test-fast-rebase.c already has a good start... > > Where I think we can usefully consolidate is the one-line files which > > store the options rather than state - these are read an written much > > less frequently so I don't think they have much of a performance hit > > but it would be much nicer to just serialize the options to a single > > file. > > Would that break external scripts, hooks, etc.? I am not sure if we > even have any rough consensus for allowing other people to peek into > the .git/rebase-*/ directories, but I am inclined to say that it > sounds more like a solution looking for a problem than a good idea > to solve some concrete problem. To be honest, my bigger complaint with the non-unified backend config is how far it bled into the code and how difficult it is to determine how things are controlled and which sections of code are relevant for which types of operations. And how much of a pain I'm worried it'd make the "allow rebasing and cherry-picking in a bare repository (or for an unchecked-out branch)" functionality. Perhaps the code can be cleaned up without changing these on-disk structures, in which case my concern for this point would lessen.