This script is intended to crawl license information of repositories through the GitHub API. Taking a csv file with requirements.txt format the script will return a csv with the associated license information.
Input file is expected to be a requirements.txt Expected format looks like this, for two exemplary repositories:
Output file will be generated on the fly, named licenses.csv and the columns depict:
| Repo name | Repo Url | License name | License Url |
[email protected] Obviously the Github API is way more powerful than what has been done here. Feel free to extend this code or preferably directly contribute here.