Re: [PATCH 6/9] sha1-file: use an object_directory for the main object dir

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 11/12/2018 9:50 AM, Jeff King wrote:
Our handling of alternate object directories is needlessly different
from the main object directory. As a result, many places in the code
basically look like this:

   do_something(r->objects->objdir);

   for (odb = r->objects->alt_odb_list; odb; odb = odb->next)
         do_something(odb->path);

That gets annoying when do_something() is non-trivial, and we've
resorted to gross hacks like creating fake alternates (see
find_short_object_filename()).

Instead, let's give each raw_object_store a unified list of
object_directory structs. The first will be the main store, and
everything after is an alternate. Very few callers even care about the
distinction, and can just loop over the whole list (and those who care
can just treat the first element differently).

A few observations:

   - we don't need r->objects->objectdir anymore, and can just
     mechanically convert that to r->objects->odb->path

   - object_directory's path field needs to become a real pointer rather
     than a FLEX_ARRAY, in order to fill it with expand_base_dir()

   - we'll call prepare_alt_odb() earlier in many functions (i.e.,
     outside of the loop). This may result in us calling it even when our
     function would be satisfied looking only at the main odb.

     But this doesn't matter in practice. It's not a very expensive
     operation in the first place, and in the majority of cases it will
     be a noop. We call it already (and cache its results) in
     prepare_packed_git(), and we'll generally check packs before loose
     objects. So essentially every program is going to call it
     immediately once per program.

     Arguably we should just prepare_alt_odb() immediately upon setting
     up the repository's object directory, which would save us sprinkling
     calls throughout the code base (and forgetting to do so has been a
     source of subtle bugs in the past). But I've stopped short of that
     here, since there are already a lot of other moving parts in this
     patch.

   - Most call sites just get shorter. The check_and_freshen() functions
     are an exception, because they have entry points to handle local and
     nonlocal directories separately.

Signed-off-by: Jeff King <peff@xxxxxxxx>
---
If the "the first one is the main store, the rest are alternates" bit is
too subtle, we could mark each "struct object_directory" with a bit for
"is_local".

This is probably a good thing to do proactively. We have the equivalent in the packed_git struct, but that's also because they get out of order. At the moment, I can't think of a read-only action that needs to treat the local object directory more carefully. The closest I know about is 'git pack-objects --local', but that also writes a pack-file.

I assume that when we write a pack-file to the "default location" we use get_object_directory() instead of referring to the default object_directory?


  builtin/fsck.c |  21 ++-------
  builtin/grep.c |   2 +-
  commit-graph.c |   5 +-
  environment.c  |   4 +-
  object-store.h |  27 ++++++-----
  object.c       |  19 ++++----
  packfile.c     |  10 ++--
  path.c         |   2 +-
  repository.c   |   8 +++-
  sha1-file.c    | 122 ++++++++++++++++++-------------------------------
  sha1-name.c    |  17 ++-----
  11 files changed, 90 insertions(+), 147 deletions(-)

diff --git a/builtin/fsck.c b/builtin/fsck.c
index 55153cf92a..15338bd178 100644
--- a/builtin/fsck.c
+++ b/builtin/fsck.c
@@ -725,13 +725,8 @@ int cmd_fsck(int argc, const char **argv, const char *prefix)
  		for_each_loose_object(mark_loose_for_connectivity, NULL, 0);
  		for_each_packed_object(mark_packed_for_connectivity, NULL, 0);
  	} else {
-		struct object_directory *alt_odb_list;
-
-		fsck_object_dir(get_object_directory());
-
  		prepare_alt_odb(the_repository);
-		alt_odb_list = the_repository->objects->alt_odb_list;
-		for (odb = alt_odb_list; odb; odb = odb->next)
+		for (odb = the_repository->objects->odb; odb; odb = odb->next)
  			fsck_object_dir(odb->path);
if (check_full) {
@@ -834,13 +829,8 @@ int cmd_fsck(int argc, const char **argv, const char *prefix)
  		struct child_process commit_graph_verify = CHILD_PROCESS_INIT;
  		const char *verify_argv[] = { "commit-graph", "verify", NULL, NULL, NULL };
- commit_graph_verify.argv = verify_argv;
-		commit_graph_verify.git_cmd = 1;
-		if (run_command(&commit_graph_verify))
-			errors_found |= ERROR_COMMIT_GRAPH;
-
  		prepare_alt_odb(the_repository);
-		for (odb = the_repository->objects->alt_odb_list; odb; odb = odb->next) {
+		for (odb = the_repository->objects->odb; odb; odb = odb->next) {
  			child_process_init(&commit_graph_verify);
  			commit_graph_verify.argv = verify_argv;
  			commit_graph_verify.git_cmd = 1;
@@ -855,13 +845,8 @@ int cmd_fsck(int argc, const char **argv, const char *prefix)
  		struct child_process midx_verify = CHILD_PROCESS_INIT;
  		const char *midx_argv[] = { "multi-pack-index", "verify", NULL, NULL, NULL };
- midx_verify.argv = midx_argv;
-		midx_verify.git_cmd = 1;
-		if (run_command(&midx_verify))
-			errors_found |= ERROR_COMMIT_GRAPH;
-
  		prepare_alt_odb(the_repository);
-		for (odb = the_repository->objects->alt_odb_list; odb; odb = odb->next) {
+		for (odb = the_repository->objects->odb; odb; odb = odb->next) {
  			child_process_init(&midx_verify);
  			midx_verify.argv = midx_argv;
  			midx_verify.git_cmd = 1;
diff --git a/builtin/grep.c b/builtin/grep.c
index d8508ddf79..714c8d91ba 100644
--- a/builtin/grep.c
+++ b/builtin/grep.c
@@ -441,7 +441,7 @@ static int grep_submodule(struct grep_opt *opt, struct repository *superproject,
  	 * object.
  	 */
  	grep_read_lock();
-	add_to_alternates_memory(submodule.objects->objectdir);
+	add_to_alternates_memory(submodule.objects->odb->path);
  	grep_read_unlock();
if (oid) {
diff --git a/commit-graph.c b/commit-graph.c
index 5dd3f5b15c..99163c244b 100644
--- a/commit-graph.c
+++ b/commit-graph.c
@@ -231,7 +231,6 @@ static void prepare_commit_graph_one(struct repository *r, const char *obj_dir)
  static int prepare_commit_graph(struct repository *r)
  {
  	struct object_directory *odb;
-	char *obj_dir;
  	int config_value;
if (r->objects->commit_graph_attempted)
@@ -252,10 +251,8 @@ static int prepare_commit_graph(struct repository *r)
  	if (!commit_graph_compatible(r))
  		return 0;
- obj_dir = r->objects->objectdir;
-	prepare_commit_graph_one(r, obj_dir);
  	prepare_alt_odb(r);
-	for (odb = r->objects->alt_odb_list;
+	for (odb = r->objects->odb;
  	     !r->objects->commit_graph && odb;
  	     odb = odb->next)
  		prepare_commit_graph_one(r, odb->path);
diff --git a/environment.c b/environment.c
index 3f3c8746c2..441ce56690 100644
--- a/environment.c
+++ b/environment.c
@@ -274,9 +274,9 @@ const char *get_git_work_tree(void)
char *get_object_directory(void)
  {
-	if (!the_repository->objects->objectdir)
+	if (!the_repository->objects->odb)
  		BUG("git environment hasn't been setup");
-	return the_repository->objects->objectdir;
+	return the_repository->objects->odb->path;
  }
int odb_mkstemp(struct strbuf *temp_filename, const char *pattern)
diff --git a/object-store.h b/object-store.h
index b2fa0d0df0..30faf7b391 100644
--- a/object-store.h
+++ b/object-store.h
@@ -24,19 +24,14 @@ struct object_directory {
  	 * Path to the alternative object store. If this is a relative path,
  	 * it is relative to the current working directory.
  	 */
-	char path[FLEX_ARRAY];
+	char *path;
  };
+
  void prepare_alt_odb(struct repository *r);
  char *compute_alternate_path(const char *path, struct strbuf *err);
  typedef int alt_odb_fn(struct object_directory *, void *);
  int foreach_alt_odb(alt_odb_fn, void*);
-/*
- * Allocate a "struct alternate_object_database" but do _not_ actually
- * add it to the list of alternates.
- */
-struct object_directory *alloc_alt_odb(const char *dir);
-
  /*
   * Add the directory to the on-disk alternates file; the new entry will also
   * take effect in the current process.
@@ -80,17 +75,21 @@ struct multi_pack_index;
struct raw_object_store {
  	/*
-	 * Path to the repository's object store.
-	 * Cannot be NULL after initialization.
+	 * Set of all object directories; the main directory is first (and
+	 * cannot be NULL after initialization). Subsequent directories are
+	 * alternates.
  	 */
-	char *objectdir;
+	struct object_directory *odb;
+	struct object_directory **odb_tail;
+	int loaded_alternates;
- /* Path to extra alternate object database if not NULL */
+	/*
+	 * A list of alternate object directories loaded from the environment;
+	 * this should not generally need to be accessed directly, but will
+	 * populate the "odb" list when prepare_alt_odb() is run.
+	 */
  	char *alternate_db;
- struct object_directory *alt_odb_list;
-	struct object_directory **alt_odb_tail;
-
  	/*
  	 * Objects that should be substituted by other objects
  	 * (see git-replace(1)).
diff --git a/object.c b/object.c
index dd485ac629..79d636091c 100644
--- a/object.c
+++ b/object.c
@@ -482,26 +482,26 @@ struct raw_object_store *raw_object_store_new(void)
  	return o;
  }
-static void free_alt_odb(struct object_directory *odb)
+static void free_object_directory(struct object_directory *odb)
  {
+	free(odb->path);
  	oid_array_clear(&odb->loose_objects_cache);
  	free(odb);
  }
-static void free_alt_odbs(struct raw_object_store *o)
+static void free_object_directories(struct raw_object_store *o)
  {
-	while (o->alt_odb_list) {
+	while (o->odb) {
  		struct object_directory *next;
- next = o->alt_odb_list->next;
-		free_alt_odb(o->alt_odb_list);
-		o->alt_odb_list = next;
+		next = o->odb->next;
+		free_object_directory(o->odb);
+		o->odb = next;
  	}
  }
void raw_object_store_clear(struct raw_object_store *o)
  {
-	FREE_AND_NULL(o->objectdir);
  	FREE_AND_NULL(o->alternate_db);
oidmap_free(o->replace_map, 1);
@@ -511,8 +511,9 @@ void raw_object_store_clear(struct raw_object_store *o)
  	o->commit_graph = NULL;
  	o->commit_graph_attempted = 0;
- free_alt_odbs(o);
-	o->alt_odb_tail = NULL;
+	free_object_directories(o);
+	o->odb_tail = NULL;
+	o->loaded_alternates = 0;
INIT_LIST_HEAD(&o->packed_git_mru);
  	close_all_packs(o);
diff --git a/packfile.c b/packfile.c
index d6d511cfd2..1eda33247f 100644
--- a/packfile.c
+++ b/packfile.c
@@ -970,12 +970,12 @@ static void prepare_packed_git(struct repository *r)
if (r->objects->packed_git_initialized)
  		return;
-	prepare_multi_pack_index_one(r, r->objects->objectdir, 1);
-	prepare_packed_git_one(r, r->objects->objectdir, 1);
+
  	prepare_alt_odb(r);
-	for (odb = r->objects->alt_odb_list; odb; odb = odb->next) {
-		prepare_multi_pack_index_one(r, odb->path, 0);
-		prepare_packed_git_one(r, odb->path, 0);
+	for (odb = r->objects->odb; odb; odb = odb->next) {
+		int local = (odb == r->objects->odb);

Here seems to be a place where `odb->is_local` would help.

+		prepare_multi_pack_index_one(r, odb->path, local);
+		prepare_packed_git_one(r, odb->path, local);
  	}
  	rearrange_packed_git(r);

Thanks,
-Stolee



[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux