I followed the course Mining Massive Datasets by the University of Stanford.
You'll find in this repository my solutions to the different exercices.
It has been written using iPython Notebook.
- Clone repository
- Install dependencices (I recommend using a virtualenv):
pip install -r requirement.txt
- Run python notebook:
ipython notebook