Mawdoo3 scrapper

📔 بالعربي (Arabic version) 📔

Introduction

Warning: This project is for study purposes only. Please don't re-share these articles under your name; all of these articles belong solely to Mawdoo3.

How to get 144k articles and more?

  • First, create an object of the class mawdoo3.
  • Second, call the function save_all_articles_title_into_file.
  • Finally, take the name of the file generated in the previous step and pass it to save_all_articles_into_file.
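
The three steps above can be sketched as follows. This README does not document the module name, the constructor arguments, or the return values of the real class, so the sketch uses a hypothetical stand-in class (Mawdoo3Sketch) that only borrows the two method names. It writes placeholder text to local files and makes no network requests; the file names and the assumption that each method returns the path it wrote are illustrative only.

```python
import tempfile
from pathlib import Path


class Mawdoo3Sketch:
    """Hypothetical stand-in for the repository's mawdoo3 class (no network access)."""

    def __init__(self, output_dir):
        self.output_dir = Path(output_dir)

    def save_all_articles_title_into_file(self):
        # Assumption: the real method collects article titles and produces a file.
        # Here two placeholder titles are written instead.
        titles_file = self.output_dir / "titles.txt"
        titles_file.write_text("title-one\ntitle-two\n", encoding="utf-8")
        return str(titles_file)

    def save_all_articles_into_file(self, titles_file):
        # Assumption: the real method reads the titles file and saves one
        # article per title. Here each article body is a placeholder string.
        titles = Path(titles_file).read_text(encoding="utf-8").split()
        articles_file = self.output_dir / "articles.txt"
        articles_file.write_text(
            "\n".join(f"{title}: <article text>" for title in titles),
            encoding="utf-8",
        )
        return str(articles_file)


def run_workflow(output_dir):
    scraper = Mawdoo3Sketch(output_dir)                          # step 1: create the object
    titles_file = scraper.save_all_articles_title_into_file()    # step 2: save the titles
    return scraper.save_all_articles_into_file(titles_file)      # step 3: save the articles


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        print(run_workflow(tmp))
```

The point of the two-phase design is that the titles file acts as a checkpoint: the article download step can be rerun from it without collecting the titles again.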

Installation

🔘 Cloning the repository

  1. On GitHub.com, navigate to the main page of the repository.
  2. Above the list of files, click Code.
  3. Copy the URL for the repository.
  4. Open Terminal.
  5. Change the current working directory to the location where you want the cloned directory.
  6. Type git clone, and then paste the URL you copied earlier:
git clone https://github.com/Faris-abukhader/mawdoo3-scrapper

  7. Press Enter to create your local clone:

git clone https://github.com/YOUR-USERNAME/YOUR-REPOSITORY
> Cloning into `mawdoo3-scrapper`...
> remote: Counting objects: 10, done.
> remote: Compressing objects: 100% (8/8), done.
> remote: Total 10 (delta 1), reused 10 (delta 1)
> Unpacking objects: 100% (10/10), done.

Development setup

To set up this project, you need Python installed on your machine. If you already have it, make sure it is the latest version.

🔘 Checking the Python version on macOS

python3 -V

🔘 Checking the Python version on Windows

python --version
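
On either platform, the same check can be made from inside Python, which is handy when several interpreters are installed. The sketch below is illustrative only: the helper names are made up for this example, and the "Python 3" threshold is an assumption, since this README does not state a minimum version.

```python
import sys


def python_version_string(info=sys.version_info):
    # Format the (major, minor, micro) part of a version tuple, e.g. "3.11.4".
    return f"{info[0]}.{info[1]}.{info[2]}"


def is_python3(info=sys.version_info):
    # Assumption: any Python 3 interpreter is acceptable for this project.
    return info[0] >= 3


if __name__ == "__main__":
    print(python_version_string(), is_python3())
```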

🔘 Downloading Python

For Windows

Download the Windows installer from the official Python website, and make sure you download the latest version of Python.

For Mac

  • You can install Python using the Homebrew CLI:
brew install python


🔘 Installing the packages

Go to the project directory where requirements.txt is located and type in the terminal:

pip install -r requirements.txt 
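
After installation, a quick way to confirm that the packages are importable is to look them up without actually importing them. This is a minimal sketch: the module list is an assumption based on the Packages section of this README (the contents of requirements.txt are not shown here), and note that Beautiful Soup is imported under the name bs4.

```python
from importlib.util import find_spec

# Assumed import names for the packages listed in this README.
# BeautifulSoup is imported as "bs4"; asyncio ships with Python itself.
REQUIRED_MODULES = ["bs4", "selenium", "aiohttp", "asyncio"]


def missing_modules(names=REQUIRED_MODULES):
    # find_spec returns None when a top-level module cannot be located.
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    print("All packages found" if not missing else f"Missing: {missing}")
```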


📦 Packages

Name            Description
BeautifulSoup   Beautiful Soup is a Python library for pulling data out of HTML and XML files.
selenium        The selenium package is used to automate web browser interaction from Python.
aiohttp         Asynchronous HTTP Client/Server for asyncio and Python.
asyncio         asyncio is a library to write concurrent code using the async/await syntax.
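
To illustrate what "concurrent code using the async/await syntax" looks like, here is a small standard-library sketch. It is not taken from the project: fake_fetch is a hypothetical stand-in that sleeps instead of making an HTTP request, so the example runs without network access or aiohttp.

```python
import asyncio


async def fake_fetch(title, delay=0.01):
    # Hypothetical placeholder for a real HTTP request: wait, then return fake HTML.
    await asyncio.sleep(delay)
    return f"<html>{title}</html>"


async def fetch_all(titles):
    # Run all the fake fetches concurrently; gather keeps results in input order.
    return await asyncio.gather(*(fake_fetch(title) for title in titles))


if __name__ == "__main__":
    pages = asyncio.run(fetch_all(["a", "b", "c"]))
    print(len(pages))
```

Because the waits overlap, the total time stays close to a single delay rather than the sum of all of them, which is the main reason to use this pattern for many page requests.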

📜 License

This software is licensed under the MIT License © FaRiS.
