Re: Simplifying work across multiple projects (while tracking relationships among commit histories)

Hi Yang,

Yang Zhang wrote:

> We're
> developing a large system consisting of several components (libraries,
> servers, applications, etc.).
[...]
> For now, we'll probably just have some simple scripts that basically
> do 'for i in $projects' loops for pulls, pushes, commits, etc.
> However, this loses a lot of information that should be tracked about
> the version/dependency information among the projects -- information
> that at the same time we're not interested in manually tracking. We're
> currently thinking of having a simple system that is initially set up
> with a dependency graph among projects, e.g.:
> 
>   a: no dependencies
>   b: depends on a
> 
> and whenever a commit is made to a project with dependencies (b), the
> commit (perhaps in the commit message) contains a reference to the
> particular versions of the dependent project(s) (a) that were checked
> out.

It sounds to me like submodules would be a better approach.  Because
it fits my great love of complaining (and I would like to hear what
solutions you come up with, if any), let me try to go through the
problems you would run into.

I am not a submodule developer or heavy user, so please check anything
I say before relying on it.

1. Suppose you are working on the program frobber and you notice it
   contains a usable sub-component veryfastregexp.  So you make a new
   repository for it, make sure it builds on its own, and publish.
   As always when starting a new project, there is a question of how
   much early history to preserve.  Probably best to start with a
   single commit, and provide a separate branch with
   ‘git filter-branch --subdirectory-filter’ output if you are feeling
   generous.

   In frobber, you remove the copy of veryfastregexp, add it back
   with ‘git submodule add git://someserver/path/to/veryfastregexp’,
   commit, and publish.

    - New clones must use ‘git clone --recursive’.  How do you
      advertise this?

    - Existing clones must use ‘git submodule update --init’ after
      they pull.  In fact, it seems to me it’s not a bad idea to
      always use ‘git submodule update --init --recursive’ after each
      pull.  How do you advertise this?

    - Incoming patches that touch both veryfastregexp and frobber
      have to be split into separate patches for the two projects.
      How?

    - Pull requests are even worse (or just as bad, depending on
      how you solved the previous problem).
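
   One partial answer to the “how do you advertise this?” questions
   is to ship a small setup script in frobber that each developer
   runs once.  Git does not copy hooks on clone, so this is a
   convention, not something enforced.  A minimal sketch, assuming
   the default hooks directory .git/hooks; the function name
   install_submodule_hook is made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper: install a post-merge hook that refreshes submodules.
# $1 is the hooks directory; .git/hooks is Git's default location.
install_submodule_hook() {
	hooks_dir=${1:-.git/hooks}
	hook=$hooks_dir/post-merge

	mkdir -p "$hooks_dir" || return 1
	# Refuse to clobber a hook the developer already has.
	if [ -e "$hook" ]; then
		echo "refusing to overwrite $hook" >&2
		return 1
	fi
	cat >"$hook" <<'EOF'
#!/bin/sh
# post-merge runs after a successful merge, including the one done by "git pull".
exec git submodule update --init --recursive
EOF
	chmod +x "$hook"
}
```

   Run ‘install_submodule_hook’ from the top of a frobber clone.  The
   hook fires after merges, so it does not cover a pull that rebases,
   and new clones still need ‘git clone --recursive’ or an explicit
   ‘git submodule update --init --recursive’.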

2. Some people like to use the latest stable version of all components
   they use, while other people like to avoid change wherever
   possible.  I’ll consider the latter sort of person in a moment.

   The developers of frobber want to use the latest version on the
   master branch for all components.  So they try the following:

	git submodule foreach '
		git checkout master &&
		git pull &&
		git submodule update --init --recursive
	'

   This checks out a branch tracking the upstream master branch
   for each submodule.

   Next they run ‘git add -u’ to mention all the updated submodule
   versions, test to make sure everything’s okay, and commit.

    - This does not bring sub-submodules to the latest version at the
      same time.  If the frobber developers wanted to do that, they
      might try

	git submodule foreach --recursive '
		git checkout master &&
		git pull &&
		git submodule update --init --recursive
	'

      Then they run ‘git add -u’, test, and run ‘git commit’.  But
      the editor informs them that submodules have unstaged changes.
      What happened, what are the consequences for others using this
      project, and can this be avoided?

      I’ll return to this in a moment.
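
   One way to see what happened, as far as I can tell: ‘git submodule
   status’ prefixes a line with ‘+’ when the commit checked out in a
   submodule differs from the one recorded by the repository that
   contains it.  After the recursive recipe, sub-submodules sit on
   their new master while the submodules containing them still record
   the old commit, and ‘git add -u’ at the toplevel cannot record
   that.  A sketch that lists such paths (the function name is made
   up, and paths are assumed to contain no whitespace):

```shell
# Read "git submodule status --recursive" output on stdin and print the
# paths whose checked-out commit differs from the commit recorded by the
# containing repository (status lines that start with "+").
# Assumption: submodule paths contain no whitespace.
out_of_sync_submodules() {
	awk '/^\+/ { print $2 }'
}
```

   Usage from the toplevel would be
   ‘git submodule status --recursive | out_of_sync_submodules’; any
   path it prints was tested in a state that the commit about to be
   made does not describe.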

3. Some people never upgrade until forced to.

   The veryfastregexp library is a resounding success, picked up
   by other people in the company, and rapidly developed.  After a
   particularly painful upgrade, the developers of frobber take a turn
   for the conservative.  From now on, their policy is “necessary
   fixes only”.  So they would like to maintain their own branch and
   cherry-pick from master.

   They put in a request for privileges to commit to their own branch,
   and wait.  In the meantime, an important fix comes up.  So they
   do the only thing they can do: publish a fork of veryfastregexp,
   update .gitmodules to point to it, run ‘git add -u’ to register
   the version they are using, test, commit, and push.

   New clones made with ‘git clone --recursive’ will use the
   project-specific version of veryfastregexp.

    - Users with existing clones must update the URL with
      ‘git remote set-url origin <new url>’ from the submodule
      or ‘git submodule sync’; otherwise, the next time they run
      ‘git submodule update’ there will be an ‘unable to checkout’
      error.  How do you advertise this?

    - Suppose a frobber developer tries the following from the
      frobber repository:

	cd veryfastregexp && git cherry-pick important-fix

      He then runs ‘git add -u’ from the toplevel, tests, commits, and
      pushes the result.  Of course, an important step is missing: he
      forgot to push to the veryfastregexp-frobber repository!
      Anyone who tries to pull this change and run ‘git submodule
      update’ will find the commit object missing and be unable to
      check out the new revision.

      An update hook could have prevented this, since from the
      server side it is obvious which objects a new clone will have
      access to.  Where can one find such a hook?
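
   I do not know of a ready-made hook, but a rough sketch of what one
   could look like follows.  It checks only the tip commit of each
   pushed ref, and the mapping from submodule path to server-side
   repository (submodule_repo, with /srv/git as a made-up location) is
   an assumption that each site would have to replace.  All function
   names are hypothetical.

```shell
# Print "<sha> <path>" for every submodule entry (mode 160000) found in
# "git ls-tree -r" output read from stdin.
# Assumption: paths contain no whitespace.
gitlinks() {
	awk '$1 == "160000" && $2 == "commit" { print $3, $4 }'
}

# Hypothetical mapping from a submodule path to the server-side
# repository expected to contain its commits; adjust per site.
submodule_repo() {
	echo "/srv/git/$1.git"
}

# Body of an update hook: $1 = ref name, $2 = old rev, $3 = new rev.
# Fails if the new tip records a submodule commit the server lacks.
check_push() {
	# An all-zero new rev means the ref is being deleted; nothing to check.
	case $3 in *[!0]*) ;; *) return 0 ;; esac

	git ls-tree -r "$3" | gitlinks | while read -r sha path; do
		git --git-dir="$(submodule_repo "$path")" cat-file -e "$sha^{commit}" || {
			echo "submodule $path: commit $sha has not been pushed" >&2
			exit 1
		}
	done
}
```

   An update hook would call ‘check_push "$@"’ and exit with its
   status.  Commits between the old and new tips are not inspected,
   so a stricter hook would walk them as well.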

   The frobber developers’ request for a branch in the veryfastregexp
   repository is granted.  So they switch .gitmodules back again
   and keep the submodule pointed to the for-frobber branch, updating
   as needed.

    - Now the old recipe

	git submodule foreach --recursive '
		git checkout master &&
		git pull &&
		git submodule update --init --recursive
	'

      does not work for them anymore, since it would switch
      veryfastregexp from the for-frobber branch back to master.
      What should they do to adjust?  How can they make it easy for
      new people on their team to get started, too?
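
   One possible adjustment is to record the branch each submodule
   should follow and have the recipe read it.  Newer Git releases
   recognize a submodule.<name>.branch setting in .gitmodules (used
   by ‘git submodule update --remote’); the sketch below assumes only
   that the key is present in .gitmodules and falls back to master
   when it is not.  The function names are made up.

```shell
# Print the branch a submodule should follow: the value of
# submodule.<name>.branch in the given .gitmodules file, or "master"
# when no branch is recorded there.  $1 = submodule name, $2 = file.
tracked_branch() {
	git config -f "$2" "submodule.$1.branch" || echo master
}

# Variant of the recipe that honours the recorded branch.  Run from the
# superproject's toplevel.  "git submodule foreach" runs its command in
# a separate shell, so the lookup is repeated inline there; $name and
# $toplevel are set by foreach.
update_to_tracked_branches() {
	git submodule foreach '
		branch=$(git config -f "$toplevel/.gitmodules" "submodule.$name.branch" || echo master) &&
		git checkout "$branch" &&
		git pull &&
		git submodule update --init --recursive
	'
}
```

   The frobber developers would record their choice once with
   ‘git config -f .gitmodules submodule.veryfastregexp.branch for-frobber’
   and commit .gitmodules, so new people on the team get the same
   behavior without extra setup.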

4. The developers who want all components to be aggressively updated
   (see #2 above) need to do something similar.  They first switch all
   components to point to repositories with branches they own, and
   then run aggressively-update, where aggressively-update is a
   script something like

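	#!/bin/sh
	# Assumes remotes "upstream" (the real project) and "origin" (a fork we own).
	git fetch upstream &&
	git reset --keep upstream/master &&
	# Recurse: each submodule runs this same script, which must be on $PATH.
	git submodule foreach aggressively-update &&
	git add -u &&
	make test &&
	git commit -v &&
	git push -f origin master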

    - Can this be modified to pick up new submodules?

Thoughts welcome.
Jonathan