Ok, I've fixed things up a bit, and done a bit of benchmarking, let me
know if something/wrong.
--
Mark A Rada (ferrous26)
marada@xxxxxxxxxxxx
------------------->8-------------------
From: Mark Rada <marada@xxxxxxxxxxxx>
Date: Thu, 30 Jul 2009 08:56:42 -0400
Subject: [PATCH] Gitweb support for XZ compressed snapshots
The XZ compression format uses the LZMA compression algorithm, which
generally is capable of yielding higher compression ratios than both
GZip and BZip2 at the cost of using more CPU time and RAM (lots more).
It is relevant to note that while LZMA is the slowest of the mentioned
algorithms for compression, it is much faster than BZip2 for
decompression (but still slower than GZip).
You can enable XZ compressed snapshots by adding 'txz' to the list of
default options for snapshots in $GITWEB_CONFIG or adding txz to an
individual repository using the gitweb.snapshot variable for the config
file.
I did some simple benchmarks, starting with an already tarballed
archive of the repos listed below. Memory usage seemed to be consistent
for any given algorithm. All tests done at default compression level.
CPU: AMD Sempron 3400+ (1 core @ 1.8GHz with 256K L2 cache)
Virtual Memory
GZip: 4152K
BZip2: 13352K
XZ: 102M
Linux kernel 2.6 series (f5886c7f96f2542382d3a983c5f13e03d7fc5259)
gzip 23.70s user 0.47s system 99% cpu 24.227 total
gunzip 3.74s user 0.74s system 94% cpu 4.741 total
bzip2 130.96s user 0.53s system 99% cpu 2:11.97 total
bunzip2 31.05s user 1.02s system 99% cpu 32.355 total
xz 448.78s user 0.91s system 99% cpu 7:31.28 total
unxz 7.67s user 0.80s system 98% cpu 8.607 total
Git (0a53e9ddeaddad63ad106860237bbf53411d11a7)
gzip 0.77s user 0.03s system 99% cpu 0.792 total
gunzip 0.12s user 0.02s system 98% cpu 0.142 total
bzip2 3.42s user 0.02s system 99% cpu 3.454 total
bunzip2 0.95s user 0.03s system 99% cpu 0.984 total
xz 12.88s user 0.14s system 98% cpu 13.239 total
unxz 0.27s user 0.03s system 99% cpu 0.298 total
XZ (669413bb2db954bbfde3c4542fddbbab53891eb4)
xz 1.62s user 0.03s system 99% cpu 1.652 total
unxz 0.05s user 0.00s system 99% cpu 0.058 total
bzip2 1.28s user 0.01s system 99% cpu 1.298 total
bunzip2 0.15s user 0.01s system 100% cpu 0.157 total
gzip 0.12s user 0.00s system 95% cpu 0.132 total
gunzip 0.02s user 0.00s system 97% cpu 0.027 total
I don't think it should be the default format, at least not right now,
simply because the XZ format is still fairly new (the format was
declared stable about 6 months ago), and there have been no "stable"
releases of the utils yet.
Signed-off-by: Mark Rada <marada@xxxxxxxxxxxx>
---
gitweb/gitweb.perl | 8 ++++++++
1 files changed, 8 insertions(+), 0 deletions(-)
diff --git a/gitweb/gitweb.perl b/gitweb/gitweb.perl
index 7fbd5ff..3398163 100755
--- a/gitweb/gitweb.perl
+++ b/gitweb/gitweb.perl
@@ -176,6 +176,13 @@ our %known_snapshot_formats = (
'format' => 'tar',
'compressor' => ['bzip2']},
+ 'txz' => {
+ 'display' => 'tar.xz',
+ 'type' => 'application/x-xz',
+ 'suffix' => '.tar.xz',
+ 'format' => 'tar',
+ 'compressor' => ['xz']},
+
'zip' => {
'display' => 'zip',
'type' => 'application/x-zip',
@@ -188,6 +195,7 @@ our %known_snapshot_formats = (
our %known_snapshot_format_aliases = (
'gzip' => 'tgz',
'bzip2' => 'tbz2',
+ 'xz' => 'txz',
# backward compatibility: legacy gitweb config support
'x-gzip' => undef, 'gz' => undef,
--
1.6.4
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html