Re: pre-commit to reject EOL changes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



----- Original Message -----
> From: "Jakub Narebski" <jnareb@xxxxxxxxx>
> Sent: Saturday, June 4, 2011 3:52:06 AM
> Subject: Re: pre-commit to reject EOL changes
> 
> > I see that the pre-commit.sample reject introduction of lines with
> > trailing whitespaces. Is there a way to have it reject changes in 
> > EOL format (CRLF vs LF)?
> 
> Well, from description of relevant config variable, namely 
> `core.whitespace` it looks like by default `trailing-space` with 
> `cr-at-eol` not enabled (default) treats adding CR in LF -> CR LF 
> change as whitespace error.
> 
> I don't think there is anything *canned* (out of the box) for the
> change in reverse direction.

We recently had issues with flip-flopping line endings, so I wrote the following update hook for our central repo to stop major whitespace changes on push.  It's currently only lightly tested, and while the comments imply it'll work for merges I'm not absolutely convinced of that.  YMMV.

Please excuse the long lines, this was thrown together rather quickly...

HTH,
Stephen


#!/usr/bin/env python

import sys
from subprocess import Popen, PIPE

GIT_PATH = '/usr/local/bin/git'
MAX_DIFFS = 7

def call_git(*args):
    return Popen([GIT_PATH] + list(args), stdout=PIPE).communicate()[0]

if len(sys.argv) != 4:
    print 'Usage: %s refname oldsha newsha'%sys.argv[0]
    sys.exit(1)

refname = sys.argv[1]
oldsha = sys.argv[2]
newsha = sys.argv[3]

if oldsha == '0'*40 or newsha == '0'*40:
    # New or deleted ref, nothing to do
    # The deleted ref is obvious, but new ref is a bit tricky.  Arguably someone could create
    # a new branch, change some line endings, commit and push, at which point this hook will
    # ignore the change because there's no oldsha to compare to.  This script will catch 
    # merges that introduce EOL changes, but that's much later (the idea is to make commiters
    # as aware of EOL changes as possible as early as possible).  So for now just ignore this
    # case...
    sys.exit(0)

print 'Checking %s for EOL changes...'%refname
sys.stdout.flush()

# List all changed files between oldsha and newsha
changed_blobs = call_git('diff-tree', '-r', oldsha, newsha)
for change_line in changed_blobs.split('\n'):
    if len(change_line) > 0 and change_line[0] == ':':
        # Get old/new blob ID for each changed file
        (oldmode, newmode, oldblob, newblob, action, extra) = change_line.split(None,5)
        if action in ['A', 'D']:
            # Added or deleted file, nothing to diff
            continue

        # Diff the old and new blobs
        # Could use -z to terminate fields with '\0', but the add/del fields are still
        # tab separated, so that doesn't help much
        raw_diffstat = call_git('diff', '--numstat', oldblob, newblob)
        (raw_add,raw_del,raw_extra) = raw_diffstat.split('\t', 2)
        if raw_add == '-' and raw_del == '-':
            # numstat returns '-' for binary files, skip
            continue
        raw_add = int(raw_add)
        raw_del = int(raw_del)

        # Diff the old and new blobs ignoring EOL whitespace
        noeol_diffstat = call_git('diff', '--numstat', '--ignore-space-at-eol', 
                                  oldblob, newblob)
        (noeol_add,noeol_del,noeol_extra) = noeol_diffstat.split('\t', 2)
        noeol_add = int(noeol_add)
        noeol_del = int(noeol_del)

        # Calculate how different the two diffs are (should ideally be zero)
        diff_add = abs(raw_add - noeol_add)
        diff_del = abs(raw_del - noeol_del)

        if diff_add > MAX_DIFFS or diff_del > MAX_DIFFS:
            # Too much difference, probably screwing up the line endings
            # print 'Change to %s rejected (sha: %s, diff_add: %g, diff_del: %g)'%(refname, newsha, diff_add, diff_del)
            oldsha_short = call_git('rev-parse', '--short', oldsha).rstrip()
            newsha_short = call_git('rev-parse', '--short', newsha).rstrip()
            print 'Too many EOL changes detected (try \'git diff %s %s\')'%(oldsha_short, newsha_short)
            sys.stdout.flush()
            sys.exit(1)
#        else:
#            print 'Change accepted (diff_add: %g, diff_del: %g)'%(diff_add, diff_del)
#            sys.stdout.flush()

# Just to be sure
sys.exit(0)
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]