Please don't send attachments. Copying the text:

Commit feature extraction
Commit feature extraction is finished and runs with multiprocessing. The commits are read from the CSV table. Features are derived from the patch (what changes in the commit), the code, the author, and so on. The output is saved in data/commits.json.
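For anyone reading along, here is a rough sketch of what a multiprocessing commit-feature pass of this kind could look like. It is not the actual script: the column names (`sha`, `author`, `patch`), the two patch features, and the function names `extract_features` and `extract_all` are all assumptions made for illustration, since the message does not give the CSV schema or the real feature set.

```python
import csv
import json
from multiprocessing import Pool


def extract_features(row):
    """Compute a few illustrative features for one commit row.

    The keys "sha", "author" and "patch" are hypothetical column names.
    """
    lines = row.get("patch", "").splitlines()
    # Count added/removed lines in a unified diff, skipping the
    # "+++" / "---" file header lines.
    added = sum(1 for ln in lines if ln.startswith("+") and not ln.startswith("+++"))
    removed = sum(1 for ln in lines if ln.startswith("-") and not ln.startswith("---"))
    return {
        "sha": row.get("sha", ""),
        "author": row.get("author", ""),
        "lines_added": added,
        "lines_removed": removed,
    }


def extract_all(csv_path, out_path="data/commits.json", workers=4):
    """Read commits from a CSV file, extract features in parallel, save JSON."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        rows = list(csv.DictReader(f))

    # Each worker process handles a chunk of rows; map() preserves row order.
    with Pool(processes=workers) as pool:
        features = pool.map(extract_features, rows, chunksize=64)

    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(features, f, indent=2)
    return features
```

`extract_all` should be called from under an `if __name__ == "__main__":` guard, because on platforms that start worker processes with "spawn" the module is re-imported in each worker.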
Unit test feature extraction
Unit test feature extraction is finished. It runs in a single thread, with a speed-up, and computes the unit test features from data/commits.json.