Re: Standardized escaping to store a .git in git?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, May 19, 2021 at 02:31:00PM -0700, Jonathan Nieder wrote:
> Josh Triplett wrote:
> > On rare occasions, a project may need to store and version a .git
> > directory in a git repository. For instance, a project that interacts
> > with git repositories may need test cases. Or, a project using git to
> > store backups may also want to back up git repositories. `.git` is the
> > only filename that git can't transparently store and version.
>
> My take on this might be a bit surprising, but it's probably worth
> spelling out anyway: Git is first and foremost a source code
> management tool, and ".git" directories are not a good interchange
> format, so while I have sympathy for this use case, I do _not_ think
> that Git should make changes that hurt other use cases in order to
> support it.

I absolutely agree that such changes would be entirely inappropriate if
they hurt other use cases. That's part of why I'm suggesting that I
don't think any *defaults* in git should change. My hope is more to have
some kind of guidance along the lines of "if you need to do escaping, do
it this way", to lead towards having one canonical way to do such
escaping rather than multiple incompatible ways.

Part of my motivation, here, is that I'm looking to implement one such
escaping mechanism (in a tool built atop libgit2 that needs to handle
and version arbitrary files), and rather than inventing something
bespoke I'd love to interoperate. And since I've seen various approaches
used in the wild, I didn't want to add Yet Another distinct approach
before starting a design conversation about it.

> Instead, I recommend doing one of the following, in order from most to
> least preferred:
> 
>  1. Make the test case run git commands to create a Git repository.
>     This makes it obvious what the test is trying to do, without
>     having to deal with unrelated details also recorded in ".git".
>     This is what Git's test suite does, for example.
> 
>  2. Check in a fast-import file and use "git fast-import" to make a
>     Git repository out of it.
> 
>  3. Check in a "git bundle" file and use "git clone" to make a Git
>     repository out of it.

For the test-case approach, these are potentially workable, though they
only work if you just need a git repo with a given set of semantics,
rather than a binary-identical test case.

For the storing-arbitrary-files case, these wouldn't apply.

>  4. Check in an archive file (e.g., tar) containing a .git directory.
>     (I consider this preferable over checking in a .git directory
>     directly because it prevents a user from accidentally "cd"-ing
>     into it and running git commands within the checked-in repository
>     that they intended to run in the top-level repository.  That seems
>     especially worth preventing because the checked-in repository can
>     contain git aliases and other settings such as core.pager that
>     cause automatic code execution, as you mentioned.)

Storing as an archive is an option, but that would then require tools
that want to track arbitrary files to distinguish between "tar file that
should be unpacked" and "tar file that was originally a tar file". It's
also a harder format to interoperate with.

To clarify, I don't think the default behavior of git should be to
un-escape this escaping mechanism. Rather, I think the default behavior
should be to treat the filenames as literal, and the user could opt in
to un-escaping on checkout and escaping on check-in.



[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux