Re: [RFC PATCH 0/8] Get rid of "git --super-prefix"

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Ævar Arnfjörð Bjarmason <avarab@xxxxxxxxx> writes:

> As noted in the CL above I included this because I see you're keen to
> include it, but I'm personally a bit "meh" on it. I.e. it's just
> renaming an existing unrelated option, although being able to use
> OPT__SUPER_PREFIX() makes it slightly nicer.
>
> As post-cleanups go I think removing the "submodule_prefix" from the
> "struct repository" would make more sense, and maybe it's worth peeling
> off the 10/10 to include in such a post-cleanup series? I.e. the below
> on top of all of this works, and reduces allocations and cargo-culting
> around the submodule API.

As a first impression I'm not particularly keen on this, since it makes
perfect sense to me to have a repo->submodule_prefix, especially when
recursing into N-level deep submodules...

>
> -- >8 --
> Subject: [PATCH] repo & submodule API: stop carrying "repo->submodule_prefix"
>
> As this change shows the "submodule_prefix" field to "struct
> repository" added in 96dc883b3cd (repository: enable initialization of
> submodules, 2017-06-22) was only used by "ls-files" and "grep". Let's
> have those two carry forward the "super_prefix" instead.
>
> Having every user of "struct repository" needing to worry about this
> created a mismatch in the API where e.g. "grep" would re-compute a
> "name_base_len" which we knew before. Now we use a "struct strbuf" in
> the "struct grep_opt" there instead, so we'll know the length
> already. This simplifies "grep_cache()" and "grep_tree()".
>
> We're also deleting cargo-culted code that the previous API foisted
> upon us. In 605f0ec1350 (submodule: use submodule repos for object
> lookup, 2018-12-14) the "Mark it as a submodule" code was added to
> "open_submodule()", but the resulting xstrdup()'d "submodule_prefix"
> was never used by anything.

(As an aside, I think open_submodule() should have been replaced by
repo_submodule_init().)

In which case, yes it isn't used by anything in that code path, but
being meticulous about maintaining .super_prefix means that other
callers could use it if they wanted to, which might be crucial once we
start plumb "struct repository" deeper and deeper and...

>
> Still, removing this field might not be a good move, as the
> "super_prefix" might be a common enough case in the future, e.g. when
> eventually migrating the "submodule--helper" users[1] to run
> in-process.
>
> As the "grep" example demonstrates I don't think that's the
> case. There instead of xstrdup()-ing all the way down we're now
> carrying a single "super_prefix" in the form of a "struct strbuf". As
> we recurse we then append to it, and strbuf_setlen() it back when we
> we recurse out of that submodule. This is similar to how e.g. the
> "read_tree_at()" API works.

This technique might no longer be so appealing. We _could_ pass both
"struct repository" and "super_prefix", but that seems odd given that
the super prefix is tied to the repository.

But that's just a first impression anyway. I don't mind taking another
look if this gets a standalone review.

>
> Doing it that way means that we have just one allocation, which in the
> common case we might realloc() if we don't have enough room in the
> "struct strbuf".
>
> 1. https://lore.kernel.org/git/cover-v2-00.10-00000000000-20221114T100803Z-avarab@xxxxxxxxx/
> 2. https://github.com/avar/git/tree/avar/grep-post-drop-prefix-cleanup
>
> Signed-off-by: Ævar Arnfjörð Bjarmason <avarab@xxxxxxxxx>
> ---
>  builtin/grep.c     | 18 ++++++++----------
>  builtin/ls-files.c | 40 +++++++++++++++++++++++++---------------
>  grep.c             |  6 +++---
>  grep.h             |  2 ++
>  repository.c       |  7 -------
>  repository.h       |  7 -------
>  submodule.c        |  3 ---
>  7 files changed, 38 insertions(+), 45 deletions(-)
>
> diff --git a/builtin/grep.c b/builtin/grep.c
> index 5fa927d4e22..69826daba26 100644
> --- a/builtin/grep.c
> +++ b/builtin/grep.c
> @@ -437,6 +437,7 @@ static int grep_submodule(struct grep_opt *opt,
>  	struct repository *superproject = opt->repo;
>  	struct grep_opt subopt;
>  	int hit = 0;
> +	size_t oldlen = opt->super_prefix.len;
>  
>  	if (!is_submodule_active(superproject, path))
>  		return 0;
> @@ -497,6 +498,7 @@ static int grep_submodule(struct grep_opt *opt,
>  	add_submodule_odb_by_path(subrepo->objects->odb->path);
>  	obj_read_unlock();
>  
> +	strbuf_addf(&opt->super_prefix, "%s/", path);
>  	memcpy(&subopt, opt, sizeof(subopt));
>  	subopt.repo = subrepo;
>  
> @@ -527,6 +529,7 @@ static int grep_submodule(struct grep_opt *opt,
>  	} else {
>  		hit = grep_cache(&subopt, pathspec, cached);
>  	}
> +	strbuf_setlen(&opt->super_prefix, oldlen);
>  
>  	return hit;
>  }
> @@ -538,11 +541,8 @@ static int grep_cache(struct grep_opt *opt,
>  	int hit = 0;
>  	int nr;
>  	struct strbuf name = STRBUF_INIT;
> -	int name_base_len = 0;
> -	if (repo->submodule_prefix) {
> -		name_base_len = strlen(repo->submodule_prefix);
> -		strbuf_addstr(&name, repo->submodule_prefix);
> -	}
> +	size_t name_base_len = opt->super_prefix.len;
> +	strbuf_addbuf(&name, &opt->super_prefix);
>  
>  	if (repo_read_index(repo) < 0)
>  		die(_("index file corrupt"));
> @@ -618,11 +618,9 @@ static int grep_tree(struct grep_opt *opt, const struct pathspec *pathspec,
>  	struct name_entry entry;
>  	int old_baselen = base->len;
>  	struct strbuf name = STRBUF_INIT;
> -	int name_base_len = 0;
> -	if (repo->submodule_prefix) {
> -		strbuf_addstr(&name, repo->submodule_prefix);
> -		name_base_len = name.len;
> -	}
> +	size_t name_base_len = opt->super_prefix.len;
> +
> +	strbuf_addbuf(&name, &opt->super_prefix);
>  
>  	while (tree_entry(tree, &entry)) {
>  		int te_len = tree_entry_len(&entry);
> diff --git a/builtin/ls-files.c b/builtin/ls-files.c
> index 4cf8a236483..c76a6be2fbe 100644
> --- a/builtin/ls-files.c
> +++ b/builtin/ls-files.c
> @@ -216,10 +216,12 @@ static void show_killed_files(struct index_state *istate,
>  	}
>  }
>  
> -static void show_files(struct repository *repo, struct dir_struct *dir);
> +static void show_files(struct repository *repo, struct dir_struct *dir,
> +		       const char *super_prefix);
>  
>  static void show_submodule(struct repository *superproject,
> -			   struct dir_struct *dir, const char *path)
> +			   struct dir_struct *dir, const char *path,
> +			   const char *super_prefix)
>  {
>  	struct repository subrepo;
>  
> @@ -229,7 +231,7 @@ static void show_submodule(struct repository *superproject,
>  	if (repo_read_index(&subrepo) < 0)
>  		die("index file corrupt");
>  
> -	show_files(&subrepo, dir);
> +	show_files(&subrepo, dir, super_prefix);
>  
>  	repo_clear(&subrepo);
>  }
> @@ -303,14 +305,19 @@ static void show_ce_fmt(struct repository *repo, const struct cache_entry *ce,
>  
>  static void show_ce(struct repository *repo, struct dir_struct *dir,
>  		    const struct cache_entry *ce, const char *fullname,
> -		    const char *tag)
> +		    const char *tag, const char *super_prefix)
>  {
>  	if (max_prefix_len > strlen(fullname))
>  		die("git ls-files: internal error - cache entry not superset of prefix");
>  
>  	if (recurse_submodules && S_ISGITLINK(ce->ce_mode) &&
>  	    is_submodule_active(repo, ce->name)) {
> -		show_submodule(repo, dir, ce->name);
> +		struct strbuf sp = STRBUF_INIT;
> +
> +		strbuf_addf(&sp, "%s%s/", super_prefix ? super_prefix : "",
> +			    ce->name);
> +		show_submodule(repo, dir, ce->name, sp.buf);
> +		strbuf_release(&sp);
>  	} else if (match_pathspec(repo->index, &pathspec, fullname, strlen(fullname),
>  				  max_prefix_len, ps_matched,
>  				  S_ISDIR(ce->ce_mode) ||
> @@ -374,16 +381,17 @@ static int ce_excluded(struct dir_struct *dir, struct index_state *istate,
>  	return is_excluded(dir, istate, fullname, &dtype);
>  }
>  
> -static void construct_fullname(struct strbuf *out, const struct repository *repo,
> -			       const struct cache_entry *ce)
> +static void construct_fullname(struct strbuf *out, const struct cache_entry *ce,
> +			       const char *super_prefix)
>  {
>  	strbuf_reset(out);
> -	if (repo->submodule_prefix)
> -		strbuf_addstr(out, repo->submodule_prefix);
> +	if (super_prefix)
> +		strbuf_addstr(out, super_prefix);
>  	strbuf_addstr(out, ce->name);
>  }
>  
> -static void show_files(struct repository *repo, struct dir_struct *dir)
> +static void show_files(struct repository *repo, struct dir_struct *dir,
> +		       const char *super_prefix)
>  {
>  	int i;
>  	struct strbuf fullname = STRBUF_INIT;
> @@ -410,7 +418,7 @@ static void show_files(struct repository *repo, struct dir_struct *dir)
>  		struct stat st;
>  		int stat_err;
>  
> -		construct_fullname(&fullname, repo, ce);
> +		construct_fullname(&fullname, ce, super_prefix);
>  
>  		if ((dir->flags & DIR_SHOW_IGNORED) &&
>  			!ce_excluded(dir, repo->index, fullname.buf, ce))
> @@ -422,7 +430,7 @@ static void show_files(struct repository *repo, struct dir_struct *dir)
>  			show_ce(repo, dir, ce, fullname.buf,
>  				ce_stage(ce) ? tag_unmerged :
>  				(ce_skip_worktree(ce) ? tag_skip_worktree :
> -				 tag_cached));
> +				 tag_cached), super_prefix);
>  			if (skipping_duplicates)
>  				goto skip_to_next_name;
>  		}
> @@ -435,13 +443,15 @@ static void show_files(struct repository *repo, struct dir_struct *dir)
>  		if (stat_err && (errno != ENOENT && errno != ENOTDIR))
>  			error_errno("cannot lstat '%s'", fullname.buf);
>  		if (stat_err && show_deleted) {
> -			show_ce(repo, dir, ce, fullname.buf, tag_removed);
> +			show_ce(repo, dir, ce, fullname.buf, tag_removed,
> +				super_prefix);
>  			if (skipping_duplicates)
>  				goto skip_to_next_name;
>  		}
>  		if (show_modified &&
>  		    (stat_err || ie_modified(repo->index, ce, &st, 0))) {
> -			show_ce(repo, dir, ce, fullname.buf, tag_modified);
> +			show_ce(repo, dir, ce, fullname.buf, tag_modified,
> +				super_prefix);
>  			if (skipping_duplicates)
>  				goto skip_to_next_name;
>  		}
> @@ -874,7 +884,7 @@ int cmd_ls_files(int argc, const char **argv, const char *cmd_prefix)
>  		overlay_tree_on_index(the_repository->index, with_tree, max_prefix);
>  	}
>  
> -	show_files(the_repository, &dir);
> +	show_files(the_repository, &dir, NULL);
>  
>  	if (show_resolve_undo)
>  		show_ru_info(the_repository->index);
> diff --git a/grep.c b/grep.c
> index 06eed694936..10d52219229 100644
> --- a/grep.c
> +++ b/grep.c
> @@ -791,9 +791,9 @@ void free_grep_patterns(struct grep_opt *opt)
>  		free(p);
>  	}
>  
> -	if (!opt->pattern_expression)
> -		return;
> -	free_pattern_expr(opt->pattern_expression);
> +	if (opt->pattern_expression)
> +		free_pattern_expr(opt->pattern_expression);
> +	strbuf_release(&opt->super_prefix);
>  }
>  
>  static const char *end_of_line(const char *cp, unsigned long *left)
> diff --git a/grep.h b/grep.h
> index 6075f997e68..d353bfa21ce 100644
> --- a/grep.h
> +++ b/grep.h
> @@ -133,6 +133,7 @@ struct grep_opt {
>  	 * t7814-grep-recurse-submodules.sh for more information.
>  	 */
>  	struct repository *repo;
> +	struct strbuf super_prefix;
>  
>  	int linenum;
>  	int columnnum;
> @@ -178,6 +179,7 @@ struct grep_opt {
>  };
>  
>  #define GREP_OPT_INIT { \
> +	.super_prefix = STRBUF_INIT, \
>  	.relative = 1, \
>  	.pathname = 1, \
>  	.max_depth = -1, \
> diff --git a/repository.c b/repository.c
> index 5d166b692c8..2f8581c517d 100644
> --- a/repository.c
> +++ b/repository.c
> @@ -228,12 +228,6 @@ int repo_submodule_init(struct repository *subrepo,
>  			goto out;
>  		}
>  	}
> -
> -	subrepo->submodule_prefix = xstrfmt("%s%s/",
> -					    superproject->submodule_prefix ?
> -					    superproject->submodule_prefix :
> -					    "", path);
> -
>  out:
>  	strbuf_release(&gitdir);
>  	strbuf_release(&worktree);
> @@ -261,7 +255,6 @@ void repo_clear(struct repository *repo)
>  	FREE_AND_NULL(repo->graft_file);
>  	FREE_AND_NULL(repo->index_file);
>  	FREE_AND_NULL(repo->worktree);
> -	FREE_AND_NULL(repo->submodule_prefix);
>  
>  	raw_object_store_clear(repo->objects);
>  	FREE_AND_NULL(repo->objects);
> diff --git a/repository.h b/repository.h
> index 6c461c5b9de..a08da26133c 100644
> --- a/repository.h
> +++ b/repository.h
> @@ -120,13 +120,6 @@ struct repository {
>  	 */
>  	char *worktree;
>  
> -	/*
> -	 * Path from the root of the top-level superproject down to this
> -	 * repository.  This is only non-NULL if the repository is initialized
> -	 * as a submodule of another repository.
> -	 */
> -	char *submodule_prefix;
> -
>  	struct repo_settings settings;
>  
>  	/* Subsystems */
> diff --git a/submodule.c b/submodule.c
> index 1e4eee3492b..1c5ef904a03 100644
> --- a/submodule.c
> +++ b/submodule.c
> @@ -528,9 +528,6 @@ static struct repository *open_submodule(const char *path)
>  		return NULL;
>  	}
>  
> -	/* Mark it as a submodule */
> -	out->submodule_prefix = xstrdup(path);
> -
>  	strbuf_release(&sb);
>  	return out;
>  }
> -- 
> 2.38.0.1471.ge4d8947e7aa




[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux