Re: how git pull works?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Nov 19, 2010 at 23:15:51 +0800, Dowlin Yang wrote:
> I sent an email earlier but it seems rejected. I resent again with plain
> text format. I am sorry for any inconvenience if you receive my messages
> twice, and please read the new one.

If it was HTML it was rejected by the list software and nobody ever saw it.
HTML is not accepted by any of the lists on vger.kernel.org.

> Suppose A had a branch b1 and B had a branch b2. They work on their own
> branch separately but b1 and b2 share the same file f1. Suppose A made
> a few changes to f1 on Nov 5th.

Dates are irrelevant. What is the most recent common ancestor?

Note, that most recent common ancestor is such commit, that is reachable from
both branches ("common ancestor") and it is not reachable from any other
common ancestor ("most recent"). No mention of dates anywhere -- just
parent-child relation between commits.

> Here are A's changes:
> [snip]
> A removed one line and added a few lines to f1 and then committed on Nov
> 5th.
>
> On the other hand, B made a few changes to f1 too on the next day Nov 6th.
> Here are B's changes:
> [snip] 
> B removed a few lines and then committed on Nov 6th.
>
> They kept updating other files in the following days. After a few days, we
> decide to merge A's branch b1 with B's branch b2. So A did git pull origin
> b2 in b1, and the expected resultant file is sth like this:
> [snip]

Now git -- and for that matter any other version control system out there
that ever had a merge command -- looks for the most recent common ancestor.
Let's call it 'a'(1).

Now git applied both changes from a to b1 and changes from a to b2. It uses
the 3-way merge algorithm, which is basically:
 - match up all lines that are the same in all three versions
 - for lines where a and b1 is the same, take b2
 - for lines where a and b2 is the same, take b1
 - for lines where b1 and b2 are the same, take b1 (== b2)
 - for lines where a, b1 and b2 all differ, declare conflict

You can equally think about this algorithm as applying diff from a to b1 to
b2 or applying diff from a to b2 to b1. All should give the same results(2).

For adding/removing files, just think of the tree as file listing what files
are included (where order is ignored, so the same name is always matched up).

The individual changes are not considered, ever. Only the sum of changes
since most recent common ancestor on one and the other side.

> Aren't newer changes supposed to be applied? Why older changes are the
> final results?

No. All changes since branch point or last merge are applied.

> B's b2 branch had com/category_bar.js added earlier than A's b1. A manually
> added the same changes to b1 on Nov 5th, but later B decided to remove
> com/category_bar.js from b2 on Nov 6h as I described.

b2 had category_bar.js added and removed again, so in the end it had no
changes in category_bar.js

b1 had category_bar.js added

No change (between a and b2) versus addition (between a and b1) is addition.
So category_bar.js is added in the result.

If you didn't add category_bar.js independently on b1, but instead pulled
from b2, the pulled revision would have been most recent common ancestor, so
b1 would see no further change and b2 would see deletion and result would be
deletion.

Footnotes:
~~~~~~~~~~
1) You can query most recent common ancestor with 'git merge-base b1 b2'.
   You can view/list all commits from common ancestor to b2 with 'b1..b2'
   refspec to gitk/git log. You can view/list all commits from common
   ancestor to both b1 and b2 with 'b1...b2' refspec to gitk/git log. You can
   get the cumulative diff from common ancestor to b2 with 'git diff
   b1...b2'.

2) Unless there are conflicts or a repeated text, in which case patch
   application (with limited context) would not have enough information on
   where to apply while 3-way merge would.

-- 
						 Jan 'Bulb' Hudec <bulb@xxxxxx>
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]