Re: [PATCH] Make git prune remove temporary packs that look like write failures

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Feb 6, 2008 7:31 PM, Nicolas Pitre <nico@xxxxxxx> wrote:
> On Wed, 6 Feb 2008, David Tweed wrote:
>
> > I guess the -n ought to be honoured. However, unless I'm missing
> > something, the case of expiring objects is different. The primary
> > reason is that objects can get orphaned by "semantic" decisions
> > (delete this branch, rewind, etc) so they contain valid content that
> > you might want to later rescue (using low-level command like git cat
> > if necessary).
>
> You can also get loose unconnected objects when fetching and the number
> of objects is lower than the transfer.unpackLimit value.

I didn't know that, but the key point is that these are still
well-formed objects, so rescuing data from them makes sense. In
contrast, I'm concerned with packs which are ill-formed from a
write-error, which I don't believe git will even read. (I've just
tried truncating a finished pack file and none of the git commands
I've tried will read any objects in it. So you'll have to do low-level
hacking to get data out of a write-error pack. WARNING: be aware of
how to do a non-hardlinked clone before trying this.)

> > In contrast, the only way to get a temporary pack when
> > the repository is quiescent is resulting from a _write error_ and thus
> > is a corrupt entity which it would take a great deal of work to
> > extract any valid data from.
>
> Or when a fetch is in progress, just like the case above, but with the
> number of objects greater than transfer.unpackLimit.

Being "in progress" contradicts the presumption of "being quiescent".

> This is uncommon to have a prune occurring at the same time as a fetch,
> but the --expire argument is there if for example you do a prune from a
> cron job but still want to be safe by giving a grace period to garbage
> files which might not be so after all.

Ah, I hadn't realised this was an intended usage of --expire. Since as
you note there's no way to tell an abandoned tmp pack from one that's
in the process of being written, following expire is probably
necessary for safety. I'll look at adding support for that.

Many thanks for the advice,

-- 
cheers, dave tweed__________________________
david.tweed@xxxxxxxxx
Rm 124, School of Systems Engineering, University of Reading.
"while having code so boring anyone can maintain it, use Python." --
attempted insult seen on slashdot
-
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux