[GSoC][Project Idea] Refactor lazy-fetching in a partial clone

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



The term "object" below always means a blob or a tree since git doesn't
yet allow filters on commits or tags.

Whenever an object is missing in a partial clone and we check for it
or read it, we trigger a lazy-fetch depending on whether fetch_if_missing
is set to 1 or 0. Currently, this global is set to 1 everywhere and some
commands which do not want to lazy-fetch have it set to 0 internally (for
example index-pack, fetch-pack, rev-list).

The goal of this project is to look into all the commands where fetch_if_missing
is set to 1 (in which case, whenever an object is missing, a connection
is made, the object that is missing is fetched and we disconnect. This
is bad because we are fetching for each object individually and this
leads to huge performance loss) and make changes so that fetching of
objects is done in a batch. In this way, when all of commands know how
to nicely fetch objects, we can change fetch_if_missing to default to 0.

I think this can be implemented by looking for all the places in the code
where has_object_file*(), read_object_file() or really any other function
which uses oid_object_info_extended(), is used and where fetch_if_missing is
set to 1 and make the necessary and appropriate changes to either fetch
efficiently or to not fetch at all and also write the tests necessary
according to the family [1] they belong to.

The above idea is the result of the discussion on the patch

  https://lore.kernel.org/git/20230225052439.27096-1-five231003@xxxxxxxxx/

Please let me know if it is doable as a project.

Thanks

[1] https://lore.kernel.org/git/20230311025906.4170554-1-jonathantanmy@xxxxxxxxxx/



[Index of Archives]     [Linux Kernel Development]     [Gcc Help]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [V4L]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Fedora Users]

  Powered by Linux