Code in Program Service Expense is jupyter notebook based analytics. We should pull dependencies into requirements.txt, migrate code to python3, fix issues (e.g. with paths), and document what each notebook is doing.
This script creates EINS_AND_HANDLES.csv, which contains webpages, twitter handles, and eins. This script should be run to be validated, then should be documented (along with package requirements and dependencies on other code to produce input csv files)
Get the Twitter Handles.ipynb produces Twitter handles by scraping the nonprofit webpage. However, the current code outputs garbage object information, instead of a twitter handle.
This seems to be because old code indexed into map objects, and the code change to enumerate over the map objects didn't work.
We should fix this and verify that what's dumped to TwitterHandleOutput.csv is correct. Then, the csv file should be added to the repo.
POWERBI_CLUSTERING/PowerBIDashBoard.pbix contains PowerBI visualizations for the clustering project. It should be documented (with screen shots) on what's there, and how it works.
Code in Operational Volatility Analysis-rev.ipynb seems to do financial analysis (is it related to Program Service Expense?), but one of the csv files seems missing.
We should either migrate and fix this, or kill it.