Simple web crawler that fetches the show titles of all ATP podcast episodes (atp.fm)
50 lines of code on top of ~165,000 lines of dependencies (for Casey)
- request (HTTP client)
- domino (server-side DOM)
- zepto-node (jQuery-like library)
- fetch base URL of atp.fm
- download HTML
- build DOM object with Domino
- select show titles with Zepto
- select next page link with Zepto
- repeat with URL of next page
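
The steps above amount to a simple loop: fetch a page, collect its titles, then follow the next-page link until there is none. Below is a minimal sketch of that loop, not this project's actual source. To keep it runnable without network access or the real dependencies, the HTTP and DOM steps are replaced by stand-ins. The names crawl, extractWithRegex and fakeSite, and the class names "title" and "next", are all hypothetical; the real atp.fm markup may well differ.

```javascript
// Hypothetical in-memory "site" standing in for atp.fm (assumed markup).
const fakeSite = {
  '/': '<h2 class="title">Episode B</h2><a class="next" href="/page2">Next</a>',
  '/page2': '<h2 class="title">Episode A</h2>',
};

// Stand-in for the DOM step. The real crawler would build a DOM with domino
// and select elements with zepto-node; a regex is used here only so the
// sketch has no dependencies.
function extractWithRegex(html) {
  const titles = [...html.matchAll(/<h2 class="title">([^<]*)<\/h2>/g)]
    .map((match) => match[1]);
  const next = html.match(/<a class="next" href="([^"]*)"/);
  return { titles, nextUrl: next ? next[1] : null };
}

// The crawl loop itself: fetch, extract, follow the next link, repeat.
// fetchHtml(url) must return a Promise that resolves to the page's HTML.
async function crawl(startUrl, fetchHtml, extract) {
  const titles = [];
  const seen = new Set(); // guards against pagination links that loop back
  let url = startUrl;
  while (url && !seen.has(url)) {
    seen.add(url);
    const html = await fetchHtml(url);
    const page = extract(html);
    titles.push(...page.titles);
    url = page.nextUrl; // null on the last page, which ends the loop
  }
  return titles;
}

// Example run against the fake site.
crawl('/', async (url) => fakeSite[url], extractWithRegex)
  .then((titles) => console.log(titles));
```

In the real crawler, fetchHtml would presumably wrap the request library, and extract would build the DOM with domino and query it with zepto-node.
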
$ npm install
$ npm start
- make it fetch more metadata