- Clone the repository
- Cd into the project directory
- Run `pip install -r .\requirements.txt`
- Run `main.py`. This should take around an hour for 500k addresses. Go grab a coffee! ☕ Or some boba tea ♨️, that's pretty good too.
- There is a dataclass in `api_wrapper.py` named `EntityParsedResults`. This class represents the result of a single API call to the address analysis endpoint. You can use this object to easily access the results of the API call without having to go through the documentation!
- In `main.py`, each batch of API calls generates a list of `EntityParsedResults`. An empty function `handle_results(results: list[EntityParsedResults])` is provided so that you can handle actions like saving the results to a dataframe, JSON, or SQL database. The implementation of this function is on you!
- `test_addresses.csv` is used for our benchmarking and is provided as an example. Simply provide a CSV with a single column called `addresses` as input and the script will handle the rest. (Ignore the ID column.)
- The main function automatically prints progress for you, with a basic ETA. No need to panic!
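
Since `handle_results` is left empty on purpose, here is one minimal sketch of how it could be filled in, assuming you want each batch appended to a JSON Lines file. The `EntityParsedResults` class below is a hypothetical stand-in with made-up fields (`address`, `label`) so the snippet runs on its own; the real class in `api_wrapper.py` will have different fields. The `out_path` parameter is also an assumption added for illustration and is not part of the provided signature.

```python
import json
from dataclasses import asdict, dataclass
from pathlib import Path


# Hypothetical stand-in for the real dataclass in api_wrapper.py.
# The field names here are assumptions made only for this example.
@dataclass
class EntityParsedResults:
    address: str
    label: str


def handle_results(results: list[EntityParsedResults],
                   out_path: str = "results.jsonl") -> None:
    """Append one batch of results to a JSON Lines file, one object per line."""
    # Open in append mode so every batch is added to the same file.
    with Path(out_path).open("a", encoding="utf-8") as f:
        for result in results:
            # asdict() works on any dataclass instance, so no field names
            # are hard-coded in this function.
            f.write(json.dumps(asdict(result), ensure_ascii=False) + "\n")
```

Because the file is opened in append mode, results from earlier batches survive if the run is interrupted partway through. If you prefer a dataframe, the resulting file can be loaded afterwards with `pandas.read_json("results.jsonl", lines=True)`.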