On 08/28, Martin Ågren wrote: > On Tue, 27 Aug 2019 at 12:14, Thomas Gummerer <t.gummerer@xxxxxxxxx> wrote: > > > > Getting the lock for the index, refreshing it and then writing it is a > > pattern that happens more than once throughout the codebase. Factor > > out the refresh_and_write_cache function from builtin/am.c to > > read-cache.c, so it can be re-used in other places in a subsequent > > commit. > > > +/* > > + * Refresh the index and write it to disk. > > + * > > + * Return 1 if refreshing the cache failed, -1 if writing the cache to > > + * disk failed, 0 on success. > > + */ > > Thank you for documenting. :-) Should we say something about how this > doesn't explicitly print any error in case refreshing fails (that is, we > leave it to `refresh_index()`), but that we *do* explicitly print an > error if writing the index fails? That caught me off-guard as I looked > at how you convert the callers. > > And do we actually want that asymmetry? Maybe we do. I think I needed the error for something while I went through a few iterations of how to best structure this function, but I don't remember for what exactly now. I think it might actually be better to just return -1 here, and let the caller distinguish and show the error message if they need to. That also avoids duplicating the error in case the caller wants to die on error. > Might be worth pointing out as you convert the callers how some (all?) > of them now emit different error messages from before, but that it > shouldn't matter(?) and it makes sense to unify those messages. Yeah, I don't think changing the error message should matter, but unifying them is not actually a goal of this series. So with what you pointed out above, I think I'll leave them as they are. > > +int repo_refresh_and_write_index(struct repository*, unsigned int refresh_flags, unsigned int write_flags, const struct pathspec *, char *seen, const char *header_msg); > > > +int repo_refresh_and_write_index(struct repository *repo, > > + unsigned int refresh_flags, > > + unsigned int write_flags, > > + const struct pathspec *pathspec, > > + char *seen, const char *header_msg) > > +{ > > + struct lock_file lock_file = LOCK_INIT; > > + > > + repo_hold_locked_index(repo, &lock_file, LOCK_DIE_ON_ERROR); > > + if (refresh_index(repo->index, refresh_flags, pathspec, seen, header_msg)) > > + return 1; > > + if (write_locked_index(repo->index, &lock_file, write_flags)) > > + return error(_("unable to write index file")); > > + return 0; > > +} > > If `flags` doesn't contain `COMMIT_LOCK`, the lockfile will be closed > "gently", meaning we still need to either commit it, or roll it back. Or > let the exit handler roll it back, which is what would happen here, no? > We lose our handle on the stack and there's no way for anyone to say > "ok, now I'm done, commit it please" (or "roll it back"). > > In short, I think calling this function without providing `COMMIT_LOCK` > would be useless at best. We should probably let this function provide > `COMMIT_LOCK | write_flags` or `COMMIT_LOCK | extra_write_flags` or > whatever. Most callers would just provide "0". Hm? > > Or, we could BUG if the COMMIT_LOCK bit isn't set, but that seems like a > less good choice to me. If we're so adamant about the bit being set -- > which we should be, IMHO -- we might as well set it ourselves. Yeah, you're right, making this function use `COMMIT_LOCK | write_flags` would probably be the best option. I'll change that, and document it as well. Thanks for your review! > > > Martin