Giter Site home page Giter Site logo

lowerdeez / supercharged-web-scraping-with-asyncio Goto Github PK

View Code? Open in Web Editor NEW

This project forked from codingforentrepreneurs/supercharged-web-scraping-with-asyncio

0.0 1.0 0.0 345 KB

Scrape websites asynchronously with Python 3.8+, Asyncio, & arsenic (aka Selenium for Async).

Home Page: https://www.codingforentrepreneurs.com/projects/supercharged-web-scraping-with-asyncio

Jupyter Notebook 99.35% Python 0.65%

supercharged-web-scraping-with-asyncio's Introduction

Superchaged Web Scraping with Asyncio Logo

Supercharged Web Scraping with Asyncio

Web scraping is simply automatically opening up any website and grabbing the data you find important on that website. It's fundamental to the internet, search engines, Data Science, automation, machine learning, and much more.

Opening websites and extracting data are only part of what makes web scraping great. It's the parsing of the data that's where the value is.

This project will cover:

  • Basic web scraping with Python
  • Web scraping with Selenium
  • Sync vs Async
  • Asynchronous Web scraping with Asyncio

Requirements:

To use this code:

1. Clone

git clone https://github.com/codingforentrepreneurs/Supercharged-Web-Scraping-with-Asyncio supercharged

2. Create Virtual Environment

cd supercharged
python3.6 -m venv .

3. Activate virtual environment and install requirements Mac/Linux

source bin/activate

Windows:

.\Scripts\activate

If using pipenv, run pipenv shell && pipenv install

Run jupyter

jupyter notebook

or

python -m jupyter notebook

If using pipenv, run pipenv run jupyter notebook

supercharged-web-scraping-with-asyncio's People

Contributors

codingforentrepreneurs avatar jmitchel3 avatar teamcfe avatar dependabot[bot] avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.