Re: [GSoC] Improving parallelism

On Sun, Mar 18, 2012 at 5:18 AM, Felipe Tanus <fotanus@xxxxxxxxx> wrote:
> I find the idea of the proposal straightforward, and I have no doubts
> except about which commands I can work on. The idea described
> in the wiki says that the commands "git grep --cached" and "git grep
> COMMIT" need this improvement, and most likely "git diff" and "git log

Note that if you improve the diff machinery, many commands will benefit
(add, apply, checkout, merge, status).

> -p" need it too. That is a good start, but if you already know of other
> commands that might benefit from this parallelism, please tell me so
> that I can include them in my proposal.

"git blame" (I think; I don't use this command much) and "git fsck".
"git index-pack" is getting multithreading support soon (you can search
the mail archive), but even then I think there's still room for further
improvement (e.g. parallelizing the hashing code in the first phase of
checking the pack).
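
To make the hashing idea a bit more concrete, here is a rough sketch in
Python (not Git's C code, and not how index-pack actually does it). It
assumes the objects have already been inflated into (type, data) pairs
and just computes their SHA-1 names on a thread pool. The names
object_id and hash_objects_parallel are made up for illustration.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def object_id(obj_type: str, data: bytes) -> str:
    """Return the SHA-1 name Git gives an object: sha1("<type> <size>\\0" + data)."""
    header = f"{obj_type} {len(data)}".encode() + b"\0"
    return hashlib.sha1(header + data).hexdigest()


def hash_objects_parallel(objects, workers=4):
    """Hash (type, data) pairs on a thread pool (hypothetical helper).

    pool.map keeps results in input order, so the caller can match
    each name back to the object it came from.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda obj: object_id(*obj), objects))


if __name__ == "__main__":
    # Toy input; a real pack would supply the inflated object contents.
    sample = [("blob", b""), ("blob", b"hello\n"), ("blob", b"x" * 100000)]
    for oid in hash_objects_parallel(sample):
        print(oid)
```

Each object is hashed independently, which is what makes this phase a
candidate for threads. Whether it pays off in practice depends on how
much time goes to hashing versus inflating and I/O, so treat this as the
shape of the idea rather than a benchmark.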

If that's not enough, you may want to investigate whether multithreading
can speed up "git rev-list --objects --all" without adding too
much complexity. Speeding this up can also be achieved by implementing
pack format version 4 (Git currently writes version 2 packs and also
reads version 3). But that's a bigger piece of work and may need more
time to land.
-- 
Duy