Re: [RFC] Dynamic window size on repack?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 7/8/07, Brian Downing <bdowning@xxxxxxxxx> wrote:
I have a CVS repository which is mostly sane, but has an approximately
20MB RTF file that has two hundred revisions or so.  (Thank you, Windows
help.)

Now, since this is old history, I want to make it as small as possible.
The only problem is that when I use high --window values for repack,
it goes along swimmingly until it gets to this file, at which point
memory usage quickly rises to the point where I'm well into my swap file.

I think what I'd like is an extra option to repack to limit window
memory usage.  This would dynamically scale the window size down if it
can't fit within the limit, then scale it back up once you're off of the
nasty file.  This would let me repack my repository with --window=100
and have it actually finish someday on the machines I have access to.
The big file may not be as efficiently packed as possible, but I can
live with that.

My question is, is this sane?  Does the repack algorithm depend on having
a fixed window size to work?  I'd rather not look into implementing this
if it's silly on the face of it.

Sounds very sane to me.

It seems like you want something like this
(I've not referred to the code,  but there is a loop
that could be modifed to include something like this):
 /* build list of delta candidates */
 tot = 0;
 for (i = 0; i < window; ++i ) {
   obj = objects_sorted_for_delta[here + i];
   tot += SIZE(obj);
   if ( tot > window_limit )
     break;
   /* insert obj in list of things to delta, or just try it here */
   ...
 }
 if ( i <= 1 )
   break/return;

window_limit could be set automatically like the variables
for the mmap windows are (no new options).
SIZE() should be defined to return the actual bytes consumed
by the object (I think for this it's uncompressed and undeltified,
but as I said I haven't looked at the code).

It would be better if the current list of objects in the window
were a FIFO.  Before each deltification attempt,  add objects
from the sort list until #objects > window or tot > window_limit.
After each attempt, drop off the object we were trying to delta.

I like your idea and think you should look into implementing it.
--
Dana L. How  danahow@xxxxxxxxx  +1 650 804 5991 cell
-
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux