On 11/30/2022 1:30 PM, Han-Wen Nienhuys wrote: > On Wed, Nov 30, 2022 at 4:16 PM Derrick Stolee <derrickstolee@xxxxxxxxxx> wrote: >> (Note: there is a strategy that doesn't need this approach, but it's a bit >> complicated. It would involve rotating all replicas to new repositories >> that are configured to use reftable upon creation, getting the refs from >> other replicas via fetches. In my opinion, this is prohibitively >> expensive.) > > I'm not sure I understand the problem. Any deletion of a ref (that is > in packed-refs) today already requires rewriting the entire > packed-refs file ("all or nothing" operation). Whether you write a > packed-refs or reftable is roughly equally expensive. > > Are you looking for a way to upgrade a repo, while concurrent git > process may write updates into the repository during the update? That > may be hard to pull off, because you probably need to rename more than > one file atomically. If you accept momentarily failed writes, you > could do > > * rename refs/ to refs.old/ (loose ref writes will fail now) > * collect loose refs under refs.old/ , put into packed-refs > * populate the reftable/ dir > * set refFormat extension. > * rename refs.old/ to refs/ with a refs/heads a file (as described in > the reftable spec.) > > See also https://gerrit.googlesource.com/jgit/+/ca166a0c62af2ea87fdedf2728ac19cb59a12601/org.eclipse.jgit/src/org/eclipse/jgit/internal/storage/file/FileRepository.java#734 Yes, I would ideally like for the repository to "upgrade" its ref storage mechanism during routine maintenance in a non-blocking way while other writes and reads continue as normal. After discussing it a bit internally, we _could_ avoid the "rotate the replicas" solution if there was a "git upgrade-ref-format" command that could switch from one to another, but it would still involve pulling that replica out of the rotation and then having it catch up to the other replicas after that is complete. If I'm reading your draft correctly, that is not currently available in your work, but we could add it after the fact. Requiring pulling replicas out of rotation is still a bit heavy- handed for my liking, but it's much less expensive than moving all of the Git data. >> The reason to start with this step is that the benefits and risks are >> clearly understood, which can motivate us to establish the mechanism for >> changing the ref format by defining the extension. > > I believe that the v2 format is a safe change with performance > improvements, but it's a backward incompatible format change with only > modest payoff. I also don't understand how it will help you do a stack > of tables, > which you need for your primary goal (ie. transactions/deletions > writing only the delta, rather than rewriting the whole file?). The v2 format doesn't help me on its own, but it has other benefits in terms of size and speed, as well as the "ref count" functionality. The important thing is that the definition of extensions.refFormat that I'm proposing in this RFC establishes a way to make incremental progress on the ref format, allowing the stacked format to come in later with less friction. >> * The reftable is currently fundamentally different enough that it could >> not be used as a replacement for the packed-refs file underneath loose >> refs (primarily due to its integration with the reflog). Doing so would >> require significant work on top of your prototype. > > It could, but I don't see the point. My point is that we can upgrade repositories by replacing packed-refs with reftable during routine maintenance instead of the heavier approaches discussed earlier. * Step 1: replace packed-refs with reftable. * Step 2: stop writing loose refs, only update reftable (but still read loose refs). * Step 3: collapse all loose refs into reftable, stop reading or writing loose refs. >> I'm going to take the following actions on my end to better understand the >> situation: >> >> 1. I'll take your draft PR branch and do some performance evaluations on >> the speed of ref updates compared to loose refs and my prototype of a >> two-stack packed-ref where the second layer of the stack is only for >> deleted refs. > > (tangent) - wouldn't that design perform poorly once the number of > deletions gets large? You'd basically have to rewrite the > deleted-packed-refs file all the time. We have regular maintenance that is triggered by pushes that rewrites the packed-refs file frequently, anyway. The maintenance currently is blocked on the amount of time spent repacking object data, so a large number of ref updates can come in during this process. (That maintenance step would collapse the deleted-refs layer into the base layer.) I've tested a simple version of this stack that shows that rewriting the file with 1,000 deletions is still within 2x the cost of updating a loose ref, so it solves the immediate problem using a much simpler stack model, at least in the most-common case where ref deletions are less frequent than other updates. Even if the size outgrew the 2x cost limit, the deleted file is still going to be much smaller than the base packed-refs file, which is currently rewritten for every deletion, so it is still an improvement. The more complicated stack model would be required to funnel all ref updates into that structure and away from loose refs. Thanks, -Stolee