This project was to investigate the top keywords within various cities in the US required by data scientist to have in their resume. I generated n-grams(unigram & bigram) from a given text after which the results where visualized to compare the results which lead to the project's goal.
These instructions will get you a copy of the project up and running on your local machine.
What things you need to run the script.
- Python: for the data manipulation.
- Gitbash: For pushing the files your repo.
- AWS(S3): For hosting the extracted files.
- An IDE: To run and edit the codes
A step by step series of examples that tell you how to get a development env running
- Install python on your system(macOS, windows or Linux)
- Using pip and the requirement file, use the below command to install the required dependencies.
- Clone repo.
pip install -r requirement.txt