Re: tracking submodules out of main directory.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Le samedi 30 juillet 2011 à 16:16 +0200, Jens Lehmann a écrit :
> Am 29.07.2011 11:39, schrieb henri GEIST:
> > Le jeudi 28 juillet 2011 à 18:48 +0200, Jens Lehmann a écrit :
> >> Am 28.07.2011 10:57, schrieb henri GEIST:
> > 
> > It is not a matter of disabling any control of git in its own
> > repository.
> > It is just a matter of adding inside the git repository a reference
> > (dependency) to an other git repository.
> 
> ... which you want to have *outside* of the containing repository!

yes

> That will then be registered in other git repositories too in your model,
> which gets rid of the "one file/submodule, one repo" assumption we now have
> and will introduce ambiguities which are *really* hard to handle.
> 

I am sorry, I am not a native English speaker. This sentence is to
complex for me. And google translator is of no help in this case.

> >>> This pach :
> >> <snip>
> >>>   - Still forbids to add anything else.
> >>
> >> Why? If you let submodules live outside the tree I don't see any reason why
> >> regular files shouldn't live there too (Disclaimer: I d not think that would
> >> be a good idea either ;-).
> > 
> > there is regular file in the submodule repository but they are controled
> > by the submodule itself. Then the main repository do not need to handle
> > them.
> > 
> > Just like you I think that regular file should not be touche outside of
> > the repository by git.
> 
> No, I think we should /either/ allow gitlinks *and* files to live outside of
> the repository /or/ *none* of them, as they have a lot in common (and I'm
> working on making them to behave as similar as possible). And in the use case
> you describe here it is totally irrelevant if you dependency consists of a
> directory tree in a submodule (= a bunch of files) or just a single file (say
> a header containing project wide definitions).
> 

Yes they have a lot in common but they still have a different nature.
git links pointer (not files on a hard-drive) to something which has a
special meaning for git, (a git repository).
Just like the .git directory which also have a special handling.

But I agree the step is really weak before enabling to put any regular
file outside of the directory.
I do not see any reasonable workflow (to my eyes) for it but' maybe some
day someone will came with a justifiable workflow which need it. we will
never know.

But in this case we need solve some questions :
  - Will we extend git status signaling untracked files out of the
    repository ?
  - What will do git-clean ? it is already dangerous inside the
    repository. and it will be worst if it can access outside of it.

May be some other points that I do not see for the moment.

> > Because in this case it is not just a reference that is managed but the
> > file itself. And this way there is a risk to overwrite some data not
> > under revision control outside of the repository.
> 
> You have the same risk when a gitlink points outside, as a submodule is a
> way of controlling a bunch of files through that reference. And the file
> would be under version control in the repository where it is registered, no?
> 

I agree on this point.

But they are still confined in an another git repository not
disseminated all over the file system.
And it never corrupt this pointed repository. just ask it to do by it's
own regular git commands.

In fact you can argue that it can disseminate some complete git
repository anywhere in the file system.
And you will be right. (nothing is perfect.)

I can do a second patch to prevent git submodule command to make clones
outside of the repository.
It will requires the user to do those clones manually.
In fact this is already what I do.
My only use of this is to track dependencies.
 
> > And in fact it is just what I want, it enable me if I decide to work on
> > an optional "BigProject" depending on both project 1 & 2.
> > 
> > Then If lib1 is in version M:
> >  - a git status in project2 will say nothing
> >  - a git status in project1 will say
> >    "modified:   ../lib1 (modified content)
> >  - a git status in BigProject will say
> >    "modified:   ../project1 (modified content)
> > 
> > Then I know that I need to update project1 to work with the last version
> > M of lib1.
> 
> Maybe no update for project1 is needed, because M only contains a bugfix
> which doesn't even need a recompilation of project1. But now you need to
> add a commit to project1 nonetheless with a message like "Updated lib1
> with a bugfix which is needed by project2" which makes your idea of
> independent submodules break down.
> 

In fact I work ni the world of "high integrity programming" then It is
just what I need.
If there is a bugfix in any library, used by the program it is no more
the same program.
I need the "SHA1" to correspond to the exact and complete source code
involved in my executable.

And this way the "SHA1" of the project sign the "SHA1" of the
libraries. 

> >> You are opening a can of worms by having two different repos point to the same
> >> submodule living in a third repo (which also happens to be their superproject
> >> and must somehow ignore it). You'll have two SHA1s for a single submodule;
> >> "git submodule foreach --recursive" will have interesting results too; and so
> >> on. Not good.
> > 
> > As I just said before it is my purpose to do it like that.
> 
> I understood that, but what are you proposing to do to solve all the
> problems your approach introduces? You can't just hand wave them away.
> 

There is some solutions :

  - First it is one more **feature** if it does not correspond to your
    work flow it does not prevent you to work exactly the way you did
    until now.

  - Second if you want to use the feature but not want to have the
    conflict **feature** (for me it is one), just put the independent
    project with there libs in different directory

      -+- foo -+- lib1     (in version N)
       |       +- project1
       |
       +- bar -+- lib1     (in version M)
               +- project2

  - Third if you really need to have project 1 & 2 in the same
    directory foo, that means they are needed by a third BigProject in
    the same directory foo depending on project 1 & 2.
    And then you really need git to declare a conflict.

> > Let say a concret exemple
> > 
> > 3 different teams work on libtiff, libpng, and libjpeg they are totally
> > unrelated.
> > 
> > One more team is working on the "gimp". And they need those 3 libs in
> > specific versions not necessarily there heads.
> > 
> > One other unrelated team is working on "gqview" and need the same libs
> > in other specifics versions (Why should they know what te gimp team
> > does)
> > 
> > Neither "gimp" and "gqview" project will contain directory with those
> > libs inside. They just depend on them.
> > 
> > And the last team work on the gnome project which need the "gimp" and
> > "gqview". It will be this team witch have to care about having both
> > "gimp" and "gqview" sharing the same libs version>
> > And has well the gnome project will not contain "gqview" and "gimp" in
> > its own tree.
> > It will also depend on them.
> 
> Cool, that is a real life example resembling what we have a my dayjob. But
> a "gimp" and "gqview" project will only have dependencies like "use libpng
> of version 1.2.3 or newer (because we need a feature/bugfix introduced
> there)" and won't be tied to a special version of that library. This means
> they need a dependency like "SHA1 or newer" instead of "exactly this SHA1".
> 

It is useful and simpler to work like this but could introduce some
bugs.

The "gimp" team has tested it with libpng 1.2.3 and maybe know that it
did not work with previous versions but if they do not have any crystal
ball they never know if newer versions will not break something.
In fact I doubt that the first version of gimp will work with the last
version of libpng.

> > It is just the same with aptitude on debian.
> > Each package know there dependency by themselves, does not contain there
> > dependencies, and do not need a bigger superpackage to tell them what
> > are there own dependencies.
> 
> And this is a very good point for the "version x.yy-z *or newer*" argument,
> they are /never/ tied to the /exact/ x.yy-z version, as that would make the
> dependencies pretty much unusable. They use a "newer than x.yy-z" scheme.
> 

It is an other feature that the one I need.
But it is a good idea.

Nothing prevent us to make a patch to add a new test in git status to
see if the current SHA1 in the libpng repository has the SHA1 of the
gitlink in the gimp in its ancestor.


> 
> > And Still I realy want to have every project knowing there own
> > dependency by themselves and not needing an external superproject to
> > tell them what they need.
> 
> I want to have that too! I'm just convinced using a gitlink to achieve that
> is wrong in so many ways. I'd rather prefer to express such dependencies in
> something like a config file, and I believe they should not be as strict as
> "I need exactly that version" but rather like "this version or newer (and by
> the way: we of course only tested that specific version ;-)". These
> dependencies could then be checked and displayed by git status.
> 

It effectively could be in a config file it seem good to me as well.

But if git handle this config file.
Update it on a "git add ../libpng && git commit"
And control the matching between the project and libraries on
"git status">

I can not see the difference with a gitlink.

	Henri GEIST


--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]