Re: Set up for better tree diff optimizations

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




On Wed, 21 Mar 2007, Junio C Hamano wrote:
> 
> I've already applied the patch, but I do not know how much this
> interface would help, as the only easy case the function
> tree_entry_interesting() can say "no subsequent entries will be
> either" without looking ahead is where no pathspecs match the
> base, but that is already prevented by the way you walk the tree
> (you do not descend into an uninteresting tree).

No. You miss the important case.

> If you do:
> 
> 	$ git log arch/i386/ include/asm-i386/

Right. Forget about the paths we're *interested* in. 

Look at the ones we're *not* interested in!

At the top level, what's the tree? It looks like this:

	100644 blob 060a71d41ad7ccc3214065a182e6f67568420071    .gitignore
	100644 blob bf62dbea88e613d8a22a6f9977289b4eb2eb968b    .mailmap
	100644 blob ca442d313d86dc67e0a2e5d584b465bd382cbf5c    COPYING
	100644 blob 6bd8ab86b5bd1b80f2df2ceaf0a4f8d7634639a0    CREDITS
	040000 tree 14499497d9d170b046e5cf9aafeec76647ece9a3    Documentation
	100644 blob 0451f69353bad4d07de34fd4658f40b805bd467a    Kbuild
	100644 blob 6d8d5b917d1fe0f4ff011d738ca9d13947521eb8    MAINTAINERS
	100644 blob 1c018c468e153ac85079f90a209a9bb3a09c5671    Makefile
	100644 blob 159912cf515549455ed5f579b8825d28962bd22e    README
	100644 blob ac02e42a2627f359575ae0e689afca911a75e0ff    REPORTING-BUGS
	040000 tree 46262adf3cfb89b4028a2d4beb17d9089e26902f    arch
	040000 tree 45d8a2724b4414f7fc3a2dac66929fdf0d032986    block
	040000 tree 21696299da054c8ad04c5fc87f21ecc4a7d0e464    crypto
	040000 tree ff75402a4a48101c95e9dc820c7c822767ca2ac3    drivers
	040000 tree 734d24c7dcfbe8d9b03045aebc2148f92c5f4607    fs
	040000 tree e3d0731913e852e7f256af5b6a3cd50bea8d4993    include
	040000 tree e39426d7179d961d96e27f7250fb16a483cc17db    init
	040000 tree 0438f8748d21ad402d305bf469b27f59bc1300b9    ipc
	040000 tree dc33b0bd31bdf49aaa04f3357f20bb1a189bac4a    kernel
	040000 tree 7cfb510f6854fe8c51f15fefbedda66773c0f23a    lib
	040000 tree cfb673c385e328bbdefb60c3ce59c8f3ba7f215e    mm
	040000 tree 30550e7a0676fe13a2e48d5f3c3872112e9634d2    net
	040000 tree 5194f3b70deaa7f98cadfeca300fd140b1086409    scripts
	040000 tree 164b706596d177b15750eb0da157758397195885    security
	040000 tree e5d9e19873759dcccd45dde4d76a0ad75db8fe5c    sound
	040000 tree 09109a71d77e211a4253fdde8afec7f119055e3b    usr

And with your pathspec, we can tell after we see "init", that we're not 
interested in any of the rest. In other words, we could consistently skip 
and not even parse almost 50% of the top-level tree.

And if you only have the first path-spec ("arch/i386") you could skip 
everything after you've seen "block", because you know that the rest of 
the tree simply isn't interesting any more.

Similarly, look at what happens (with your two-entry pathspec) when you 
enter "arch". Obviously the "include" thing will be totally uninteresting, 
so as far as that one is concerned, you can skip the *whole* thing, but 
because we have multiple path-specs we need to skip only when *both* 
decide that it's uninteresting. So look at the "arch" sub-tree, and we 
have

	alpha
	arm
	arm26
	avr32
	cris
	frv
	h8300
	i386
	ia64
	m32r
	m68k
	m68knommu
	mips
	parisc
	powerpc
	ppc
	s390
	sh
	sh64
	sparc
	sparc64
	um
	v850
	x86_64
	xtensa

and again, when we hit ia64, we *could* return -1, because "include" is 
totally uninteresting for all of it, and the "arch/i386/" pathspec can 
tell that anything below ia64 is uninteresting, because ia64 sorts after 
it, and the tree is alphabetically sorted.

So you could skip even parsing half of both the top-level tree _and_ more 
than half of the "arch" tree.

		Linus
-
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]