| |||||
| |||||
Search Irongeek.com:
Help Irongeek.com pay for bandwidth and research equipment: |
This presentation intends to cover the thought process and logistics behind building a better wordlist using github public repositories as its source. With an estimated 20,000,000 github projects to date, how would one store that amount of data? Would you even want or need to? After downloading approximately 2,000,000 repositories, storing 15TB; this will be a story of one computer, bandwidth, basic python and how to make the data useful.
15 most recent posts on Irongeek.com:
|
If you would like to republish one of the articles from this site on your
webpage or print journal please contact IronGeek.
Copyright 2020, IronGeek
Louisville / Kentuckiana Information Security Enthusiast