Giter Site home page Giter Site logo

olenerollie924 / pydatascraper Goto Github PK

View Code? Open in Web Editor NEW

This project forked from tamilselvanarjun/pydatascraper

0.0 0.0 0.0 392 KB

pydatascraper is a Python application that provides web scraping capabilities, including fetching Google and Yelp reviews.

Home Page: https://pypi.org/project/pydatascraper/

License: MIT License

Python 100.00%

pydatascraper's Introduction

pydatascraper

pydatascraper is a Python application that provides web scraping capabilities, including fetching Google and Yelp reviews. The application has a user-friendly graphical user interface (GUI) for easy interaction.

Features

  • Web Scraping: Extract information from web pages based on user-provided URLs.
  • Google Reviews: Fetch reviews for a given business or location using Google Maps API.
  • Yelp Reviews: Retrieve reviews for a business using the Yelp API.
  • OpenStreetMap Data: Extract latitude, longitude, and additional information from OpenStreetMap.

Requirements

  • Python 3.x
  • Required Python packages (install using pip install -r requirements.txt):
    • requests
    • beautifulsoup4
    • pandas
    • openpyxl
    • nltk (for text processing)
    • tkinter (GUI toolkit)

Usage

  1. Clone the repository:

    git clone https://github.com/arjunlimat/pydatascraper.git
    
  2. Install the package directly:


pip install pydatascraper

  1. import the webscraper model:
from pydatascraper.pyscraper import main
  1. Run the application:
main()

The GUI will appear, allowing you to choose different services and perform web scraping tasks.

Services

Web Scraping

Enter a URL and click "Search" to explore available data types.

Choose the desired data type, enter a file name, and click "Download" to save the data.

Google Reviews

Select "Google reviews" from the services dropdown.

Enter the business or location name and address. Provide a file name and click "Download" to fetch and save Google reviews.

Yelp Reviews

Select "Yelp reviews" from the services dropdown. Enter the business name and address. Provide a file name and click "Download" to fetch and save Yelp reviews.

OpenStreetMap

Select "Open Street Map" from the services dropdown. Enter the map URL, provide a file name, and click "Download" to extract map data.

Contributing

Contributions are welcome! If you encounter issues or have ideas for improvement, please open an issue or submit a pull request.

License:

This project is licensed under the MIT License - see the LICENSE file for details.

pydatascraper's People

Contributors

tamilselvanarjun avatar andrewalbertand avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.