I write a lot of random code around SEO and want to make more of it public in 2020.
If you have an idea, but not sure how to code it, hit me up @johnmurch on twitter
This script allows you to download all the URLs from a given domain.
cd archive.org && npm install
node fetch.js https://www.domain.com
This script reads URLs from urls.txt (One URL Per Line) and checks for redirects using Puppeteer
cd puppeteer-redirect && npm install
node puppeteer-redirect.js
This script fetches a webpage with Puppeteer and saves both the source and DOM of a webpage. Great for identifying some SEO issues. Be sure to change the URL in fetch.js (line 7)
cd domVsSource && npm install
node fetch.js
This script fetches a webpage with Puppeteer and loops through all links checking for a specific backlink. This script also pulls out the anchor text and rel attributes (e.g. nofollow)
cd backlink-checker && npm install
Update urls.txt
node checker.js
This script fetches a URL and parses the meta data using Web Auto Extractor which parses meta tags as well as Microdata, RDFa-lite and JSON-LD
cd meta-extract && npm install
node fetch.js https://www.domain.com/product
cat meta.json
หห
This script extracts phone numbers and emails from a webpage, including checking links mailto tags
cd puppeteer-extract && npm install
node puppeteer-extract.js
cat output.csv
This script checks a URL and captures the redirect chain
cd puppeteer-redirect && npm install
node puppeteer-redirect.js
cat output.csv
@TODO
- BigSitemap - Postgres + Puppeteer
- SEOCI - Automated SEO Testing
- ngram
- entities // Google NLP