If a server sets allowtipsha1inwant (or allowreachablesha1inwant), a client can call "git fetch <server> <SHA-1>" where SHA-1 is the hash of a blob (reachable or unreachable) to obtain it. The test below (which passes) demonstrates that. I have bisected this, and this bug occurs at least as early as the introduction of allowreachablesha1inwant in commit 68ee628 ("upload-pack: optionally allow fetching reachable sha1", 2015-05-21). It may have occurred earlier, though - initially, I thought that this was due to allowreachablesha1inwant, so I did not investigate allowtipsha1inwant until later. I found this out while investigating the feasibility of using the existing fetch-pack/upload-pack protocol to support fetching of arbitrary blobs. This happens most likely because the "rev-list" call in "do_reachable_revlist" (called through "has_unreachable") is invoked without the "--objects" argument. "has_unreachable" determines that an object is unreachable if nothing is printed to stdout, which normally works, except that "rev-list" prints nothing when asked which commits are reachable from a blob (which makes sense). Adding "--objects" works, and all existing tests pass, except for the potential performance issue and the side effect that even fetching a reachable blob no longer works. This is due to a possible bug where a call like "git rev-list --objects $tree ^master" (where $tree is the tree object corresponding to master) prints out objects even though all objects reachable from the tree are also reachable from master. (There should be no issue with "get_reachable_list", the other invoker of "do_reachable_revlist", because non-commits in the command's stdout are skipped.) The other option, of course, is to whitelist object types (that is, only allow commits and tags - I haven't checked if this is sufficient, though). This might be a sufficient temporary fix, although I expect that we would still have to revisit this later (especially if we decide that we want to allow downloading blobs through this interface). I don't mind taking a look at either solution, but thought of putting this out first in case people have any opinion or insight into this problem and its possible solutions. (CC-ing some people who might have an interest in downloading arbitrary blobs.) Signed-off-by: Jonathan Tan <jonathantanmy@xxxxxxxxxx> --- t/t9999-mytests.sh | 61 ++++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 61 insertions(+) create mode 100644 t/t9999-mytests.sh diff --git a/t/t9999-mytests.sh b/t/t9999-mytests.sh new file mode 100644 index 000000000..e71cfbf48 --- /dev/null +++ b/t/t9999-mytests.sh @@ -0,0 +1,61 @@ +#!/bin/sh + +test_description='my tests' +. ./test-lib.sh +. "$TEST_DIRECTORY"/lib-httpd.sh +start_httpd + +test_expect_success 'setup' ' + server="$HTTPD_DOCUMENT_ROOT_PATH/blobs" && + git init "$server" && + ( + cd "$server" && + test_commit myfile + ) && + + # A reachable blob. + reachable=$( + git -C "$server" cat-file -p \ + $(git -C "$server" cat-file -p HEAD | grep "^tree " | cut -c6-) | \ + grep myfile | cut -c13-52) && + test -n "$reachable" && + + # 2 unreachable blobs. Only one will be fetched. (2 are included here + # to demonstrate that there is no whole-repo copying or anything like + # that.) + unreachable=$(echo abc123 | git -C "$server" hash-object -w --stdin) && + test -n "$unreachable" && + another_unreachable=$(echo def456 | git -C "$server" hash-object -w --stdin) && + test -n "$another_unreachable" && + + git init --bare client.git && + git -C client.git remote add origin "$HTTPD_URL/smart/blobs" && + + # These fetches are supposed to fail (and do) + test_must_fail git -C client.git fetch origin $reachable && + test_must_fail git -C client.git fetch origin $unreachable +' + +test_expect_success 'allowtipsha1inwant suddenly allows blobs' ' + test -n "$reachable" && + + git -C "$server" config uploadpack.allowtipsha1inwant 1 && + + # This fetch passes + git -C client.git fetch origin $reachable && + test "myfile" = $(git -C client.git cat-file -p $reachable) +' + +test_expect_success 'even unreachable ones' ' + test -n "$unreachable" && + + # This fetch is supposed to fail (for multiple reasons), but passes. + # Only the wanted blob is fetched. + git -C client.git fetch origin $unreachable && + git -C client.git cat-file -e $unreachable && + test_must_fail git -C client.git cat-file -e $another_unreachable && + test "abc123" = $(git -C client.git cat-file -p $unreachable) +' + +stop_httpd +test_done -- 2.11.0.483.g087da7b7c-goog