API Reference
Extraction
Disambiguation
- git2net.disambiguation.disambiguate_aliases_db(sqlite_db_file, method='gambit', **quargs)
Disambiguates author aliases in a given SQLite database mined with git2net. The disambiguation is performed using the Python package gambit. Internally, disambiguate_aliases_db calls the function gambit.disambiguate_aliases.
- Parameters:
sqlite_db_file (str) – path to SQLite database
method (str) – disambiguation method from {“gambit”, “bird”, “simple”}
**quargs – hyperparameters for the gambit and bird algorithms; gambit: thresh (float) – similarity threshold from interval 0 to 1, sim (str) – similarity measure from {‘lev’, ‘jw’}, bird: thresh (float) – similarity threshold from interval 0 to 1
- Returns:
creates new column with unique author_id in the commits table of the provided database