[StGit PATCH 1/2] Teach "stg assimilate" to repair patch reachability

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Rewrite "stg assimilate" so that it in addition to assimilating
commits made on top of the patch stack also will rewrite the
applied/unapplied files to reflect reality. A patch is considered to
be applied if it is reachable from HEAD without passing through a
merge, and unreachable otherwise.

Performance is adequate: On my laptop, it takes less than four seconds
to process the Linux tree (with hot caches).

Signed-off-by: Karl Hasselström <kha@xxxxxxxxxxx>

---

 stgit/commands/assimilate.py |  190 ++++++++++++++++++++++++++++++++----------
 stgit/commands/common.py     |    6 +
 stgit/stack.py               |    6 +
 t/t1301-assimilate.sh        |   12 +--
 4 files changed, 157 insertions(+), 57 deletions(-)


diff --git a/stgit/commands/assimilate.py b/stgit/commands/assimilate.py
index c5f4340..ab2264a 100644
--- a/stgit/commands/assimilate.py
+++ b/stgit/commands/assimilate.py
@@ -23,21 +23,80 @@ from optparse import OptionParser, make_option
 from stgit.commands.common import *
 from stgit.utils import *
 from stgit.out import *
+from stgit.run import *
 from stgit import stack, git
 
-help = 'StGIT-ify any GIT commits made on top of your StGIT stack'
+help = 'StGit-ify any git commits made on top of your StGit stack'
 usage = """%prog [options]
 
-If you have made GIT commits on top of your stack of StGIT patches,
-many StGIT commands will refuse to work. This command converts any
-such commits to StGIT patches, preserving their contents.
+"assimilate" will repair three kinds of inconsistencies in your StGit
+stack, all of them caused by using plain git commands on the branch:
 
-Only GIT commits with exactly one parent can be assimilated; in other
-words, if you have committed a merge on top of your stack, this
-command cannot help you."""
+  1. If you have made regular git commits on top of your stack of
+     StGit patches, "assimilate" converts them to StGit patches,
+     preserving their contents.
+
+  2. Merge commits cannot become patches; if you have committed a
+     merge on top of your stack, "assimilate" will simply mark all
+     patches below the merge unapplied, since they are no longer
+     reachable. If this is not what you want, use "git reset" to get
+     rid of the merge and run "assimilate" again.
+
+  3. The applied patches are supposed to be precisely those that are
+     reachable from the branch head. If you have used e.g. "git reset"
+     to move the head, some applied patches may no longer be
+     reachable, and some unapplied patches may have become reachable.
+     "assimilate" will correct the appliedness of such patches.
+
+Note that these are "inconsistencies", not "errors"; furthermore,
+"assimilate" will repair them reliably. As long as you are satisfied
+with the way "assimilate" handles them, you have no reason to avoid
+causing them in the first place if that is convenient for you."""
 
 options = []
 
+class Commit(object):
+    def __init__(self, id):
+        self.id = id
+        self.parents = set()
+        self.children = set()
+        self.patch = None
+        self.__commit = None
+    def __get_commit(self):
+        if not self.__commit:
+            self.__commit = git.get_commit(self.id)
+        return self.__commit
+    commit = property(__get_commit)
+    def __str__(self):
+        if self.patch:
+            return '%s (%s)' % (self.id, self.patch)
+        else:
+            return self.id
+    def __repr__(self):
+        return '<%s>' % str(self)
+
+def read_commit_dag(branch):
+    out.start('Reading commit DAG')
+    commits = {}
+    patches = set()
+    for line in Run('git-rev-list', '--parents', '--all').output_lines():
+        cs = line.split()
+        for id in cs:
+            if not id in commits:
+                commits[id] = Commit(id)
+        for id in cs[1:]:
+            commits[cs[0]].parents.add(commits[id])
+            commits[id].children.add(commits[cs[0]])
+    for line in Run('git-show-ref').output_lines():
+        id, ref = line.split()
+        m = re.match(r'^refs/patches/%s/(.+)$' % branch, ref)
+        if m:
+            c = commits[id]
+            c.patch = m.group(1)
+            patches.add(c)
+    out.done()
+    return commits, patches
+
 def func(parser, options, args):
     """Assimilate a number of patches.
     """
@@ -45,49 +104,86 @@ def func(parser, options, args):
     def nothing_to_do():
         out.info('No commits to assimilate')
 
-    top_patch = crt_series.get_current_patch()
-    if not top_patch:
-        return nothing_to_do()
+    orig_applied = crt_series.get_applied()
+    orig_unapplied = crt_series.get_unapplied()
 
-    victims = []
-    victim = git.get_commit(git.get_head())
-    while victim.get_id_hash() != top_patch.get_top():
-        victims.append(victim)
-        parents = victim.get_parents()
-        if not parents:
-            raise CmdException, 'Commit %s has no parents, aborting' % victim
-        elif len(parents) > 1:
-            raise CmdException, 'Commit %s is a merge, aborting' % victim
-        victim = git.get_commit(parents[0])
-
-    if not victims:
+    # If head == top, we're done.
+    head = git.get_commit(git.get_head()).get_id_hash()
+    top = crt_series.get_current_patch()
+    if top and head == top.get_top():
         return nothing_to_do()
 
     if crt_series.get_protected():
         raise CmdException(
-            'This branch is protected. Modification is not permitted')
-
-    patch2name = {}
-    name2patch = {}
-
+            'This branch is protected. Modification is not permitted.')
+
+    # Find commits to assimilate, and applied patches.
+    commits, patches = read_commit_dag(crt_series.get_name())
+    c = commits[head]
+    patchify = []
+    applied = []
+    while len(c.parents) == 1:
+        parent, = c.parents
+        if c.patch:
+            applied.append(c)
+        elif not applied:
+            patchify.append(c)
+        c = parent
+    applied.reverse()
+    patchify.reverse()
+
+    # Find patches hidden behind a merge.
+    merge = c
+    todo = set([c])
+    seen = set()
+    hidden = set()
+    while todo:
+        c = todo.pop()
+        seen.add(c)
+        todo |= c.parents - seen
+        if c.patch:
+            hidden.add(c)
+    if hidden:
+        out.warn(('%d patch%s are hidden below the merge commit'
+                  % (len(hidden), ['es', ''][len(hidden) == 1])),
+                 '%s,' % merge.id, 'and will be considered unapplied.')
+
+    # Assimilate any linear sequence of commits on top of a patch.
+    names = set(p.patch for p in patches)
     def name_taken(name):
-        return name in name2patch or crt_series.patch_exists(name)
-
-    for victim in victims:
-        patchname = make_patch_name(victim.get_log(), name_taken)
-        patch2name[victim] = patchname
-        name2patch[patchname] = victim
-
-    victims.reverse()
-    for victim in victims:
-        out.info('Creating patch "%s" from commit %s'
-                 % (patch2name[victim], victim))
-        aname, amail, adate = name_email_date(victim.get_author())
-        cname, cmail, cdate = name_email_date(victim.get_committer())
-        crt_series.new_patch(
-            patch2name[victim],
-            can_edit = False, before_existing = False, commit = False,
-            top = victim.get_id_hash(), bottom = victim.get_parent(),
-            message = victim.get_log(),
-            author_name = aname, author_email = amail, author_date = adate,
-            committer_name = cname, committer_email = cmail)
+        return name in names
+    if applied and patchify:
+        out.start('Creating %d new patch%s'
+                  % (len(patchify), ['es', ''][len(patchify) == 1]))
+        for p in patchify:
+            name = make_patch_name(p.commit.get_log(), name_taken)
+            out.info('Creating patch %s from commit %s' % (name, p.id))
+            aname, amail, adate = name_email_date(p.commit.get_author())
+            cname, cmail, cdate = name_email_date(p.commit.get_committer())
+            parent, = p.parents
+            crt_series.new_patch(
+                name, can_edit = False, commit = False,
+                top = p.id, bottom = parent.id, message = p.commit.get_log(),
+                author_name = aname, author_email = amail, author_date = adate,
+                committer_name = cname, committer_email = cmail)
+            p.patch = name
+            applied.append(p)
+            names.add(name)
+        out.done()
+
+    # Write the applied/unapplied files.
+    out.start('Checking patch appliedness')
+    applied_name_set = set(p.patch for p in applied)
+    unapplied_names = []
+    for name in orig_applied:
+        if not name in applied_name_set:
+            out.info('%s is now unapplied' % name)
+            unapplied_names.append(name)
+    for name in orig_unapplied:
+        if name in applied_name_set:
+            out.info('%s is now applied' % name)
+        else:
+            unapplied_names.append(name)
+    crt_series.set_applied(p.patch for p in applied)
+    crt_series.set_unapplied(unapplied_names)
+    out.done()
diff --git a/stgit/commands/common.py b/stgit/commands/common.py
index 8ed47ca..79576fa 100644
--- a/stgit/commands/common.py
+++ b/stgit/commands/common.py
@@ -114,9 +114,9 @@ def check_local_changes():
 def check_head_top_equal():
     if not crt_series.head_top_equal():
         raise CmdException(
-            'HEAD and top are not the same. You probably committed\n'
-            '  changes to the tree outside of StGIT. To bring them\n'
-            '  into StGIT, use the "assimilate" command')
+"""HEAD and top are not the same. This can happen if you
+   modify a branch with git. The "assimilate" command can
+   fix this situation.""")
 
 def check_conflicts():
     if os.path.exists(os.path.join(basedir.get(), 'conflicts')):
diff --git a/stgit/stack.py b/stgit/stack.py
index bd08b35..d889f37 100644
--- a/stgit/stack.py
+++ b/stgit/stack.py
@@ -509,11 +509,17 @@ class Series(PatchSet):
             raise StackException, 'Branch "%s" not initialised' % self.get_name()
         return read_strings(self.__applied_file)
 
+    def set_applied(self, applied):
+        write_strings(self.__applied_file, applied)
+
     def get_unapplied(self):
         if not os.path.isfile(self.__unapplied_file):
             raise StackException, 'Branch "%s" not initialised' % self.get_name()
         return read_strings(self.__unapplied_file)
 
+    def set_unapplied(self, unapplied):
+        write_strings(self.__unapplied_file, unapplied)
+
     def get_hidden(self):
         if not os.path.isfile(self.__hidden_file):
             return []
diff --git a/t/t1301-assimilate.sh b/t/t1301-assimilate.sh
index 906f5bb..7f47c31 100755
--- a/t/t1301-assimilate.sh
+++ b/t/t1301-assimilate.sh
@@ -5,7 +5,7 @@ test_description='Test the assimilate command.'
 
 test_expect_success \
     'Assimilate in a non-initialized repository' \
-    'stg assimilate'
+    '! stg assimilate'
 
 test_expect_success \
     'Initialize the StGIT repository' \
@@ -75,12 +75,10 @@ test_expect_success \
     git pull . br
     '
 
-test_expect_success \
-    'Try (and fail) to assimilate the merge commit' \
-    '
+test_expect_success 'Assimilate in the presence of a merge commit' '
     [ $(stg applied | wc -l) -eq 5 ] &&
-    ! stg assimilate &&
-    [ $(stg applied | wc -l) -eq 5 ]
-    '
+    stg assimilate &&
+    [ $(stg applied | wc -l) -eq 0 ]
+'
 
 test_done

-
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux