Re: [PATCH v5] gc: save log from daemonized gc --auto and print it next time

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Nguyễn Thái Ngọc Duy  <pclouds@xxxxxxxxx> writes:

> While commit 9f673f9 (gc: config option for running --auto in
> background - 2014-02-08) helps reduce some complaints about 'gc
> --auto' hogging the terminal, it creates another set of problems.
>
> The latest in this set is, as the result of daemonizing, stderr is
> closed and all warnings are lost. This warning at the end of cmd_gc()
> is particularly important because it tells the user how to avoid "gc
> --auto" running repeatedly. Because stderr is closed, the user does
> not know, naturally they complain about 'gc --auto' wasting CPU.
>
> Daemonized gc now saves stderr to $GIT_DIR/gc.log. Following gc --auto
> will not run and gc.log printed out until the user removes gc.log.
>
> Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@xxxxxxxxx>
> ---
>  This is the lock-based version. I'm sending an alternative that does
>  not need atexit() with comparison between the two.

I think Michael deserves to be CC'ed.

I am wondering if we need an audit of what our signal handlers do in
the same light as what led to 507d7804 (pager: don't use unsafe
functions in signal handlers, 2015-09-04).  lockfile API already
does removal in remove_lock_files_on_signal(), which may or may not
need further fixing.  And that is why I really hate to see us adding
custom signal handler if we can avoid it.

What you are gaining by having a custom signal handler here is
twofold, I think:

 * You have a chance to remove an empty log message upon signal, but
   if you follow Michael's idea of equating an empty and missing log
   file, that would not be a huge plus either way.

 * You can keep a non-empty log message that may be incomplete when
   you are killed yourself.  If you reused the lockfile API and let
   its signal handler clean it up, you obviously will lose that, but
   you do not even know if that incomplete non-empty log message is
   something you want to base the "stop auto-gc until an existing
   problem is resolved" decision on.  Your subprocess may have left
   that message only because you are getting killed, no?

Neither looks like a compelling enough reason to have it to me.

Wouldn't the logic be vastly simplified and be still correct, if you
let the lockfile API do its usual "unless you explicitly commit,
lockfiles are removed upon signal or exit"?

>  builtin/gc.c | 56 +++++++++++++++++++++++++++++++++++++++++++++++++++++++-
>  1 file changed, 55 insertions(+), 1 deletion(-)
>
> diff --git a/builtin/gc.c b/builtin/gc.c
> index bcc75d9..4e738fa 100644
> --- a/builtin/gc.c
> +++ b/builtin/gc.c
> @@ -43,6 +43,7 @@ static struct argv_array prune_worktrees = ARGV_ARRAY_INIT;
>  static struct argv_array rerere = ARGV_ARRAY_INIT;
>  
>  static char *pidfile;
> +static struct lock_file log_lock;
>  
>  static void remove_pidfile(void)
>  {
> @@ -57,6 +58,28 @@ static void remove_pidfile_on_signal(int signo)
>  	raise(signo);
>  }
>  
> +static void process_log_file(void)
> +{
> +	struct stat st;
> +	if (!fstat(log_lock.fd, &st) && st.st_size)
> +		commit_lock_file(&log_lock);
> +	else
> +		rollback_lock_file(&log_lock);
> +}
> +
> +static void process_log_file_at_exit(void)
> +{
> +	fflush(stderr);
> +	process_log_file();
> +}
> +
> +static void process_log_file_on_signal(int signo)
> +{
> +	process_log_file();
> +	sigchain_pop(signo);
> +	raise(signo);
> +}
> +
>  static void git_config_date_string(const char *key, const char **output)
>  {
>  	if (git_config_get_string_const(key, output))
> @@ -253,6 +276,24 @@ static const char *lock_repo_for_gc(int force, pid_t* ret_pid)
>  	return NULL;
>  }
>  
> +static int report_last_gc_error(void)
> +{
> +	struct strbuf sb = STRBUF_INIT;
> +	int ret;
> +
> +	ret = strbuf_read_file(&sb, git_path("gc.log"), 0);
> +	if (ret > 0)
> +		return error(_("The last gc run reported the following. "
> +			       "Please correct the root cause\n"
> +			       "and remove %s.\n"
> +			       "Automatic cleanup will not be performed "
> +			       "until the file is removed.\n\n"
> +			       "%s"),
> +			     git_path("gc.log"), sb.buf);
> +	strbuf_release(&sb);
> +	return 0;
> +}
> +
>  static int gc_before_repack(void)
>  {
>  	if (pack_refs && run_command_v_opt(pack_refs_cmd.argv, RUN_GIT_CMD))
> @@ -274,6 +315,7 @@ int cmd_gc(int argc, const char **argv, const char *prefix)
>  	int force = 0;
>  	const char *name;
>  	pid_t pid;
> +	int daemonized = 0;
>  
>  	struct option builtin_gc_options[] = {
>  		OPT__QUIET(&quiet, N_("suppress progress reporting")),
> @@ -330,13 +372,16 @@ int cmd_gc(int argc, const char **argv, const char *prefix)
>  			fprintf(stderr, _("See \"git help gc\" for manual housekeeping.\n"));
>  		}
>  		if (detach_auto) {
> +			if (report_last_gc_error())
> +				return -1;
> +
>  			if (gc_before_repack())
>  				return -1;
>  			/*
>  			 * failure to daemonize is ok, we'll continue
>  			 * in foreground
>  			 */
> -			daemonize();
> +			daemonized = !daemonize();
>  		}
>  	} else
>  		add_repack_all_option();
> @@ -349,6 +394,15 @@ int cmd_gc(int argc, const char **argv, const char *prefix)
>  		    name, (uintmax_t)pid);
>  	}
>  
> +	if (daemonized) {
> +		hold_lock_file_for_update(&log_lock,
> +					  git_path("gc.log"),
> +					  LOCK_DIE_ON_ERROR);
> +		dup2(log_lock.fd, 2);
> +		sigchain_push_common(process_log_file_on_signal);
> +		atexit(process_log_file_at_exit);
> +	}
> +
>  	if (gc_before_repack())
>  		return -1;
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]