Giter Site home page Giter Site logo

data-scrapper's Introduction

Data Scraper GUI Application

Empower your web scraping needs by leveraging the versatility of this tool. While the provided code demonstrates scraping from a specific website, users can effortlessly adapt it to harvest data from their preferred web sources.

The Data Scraper GUI Application is a tool that allows users to fetch and scrape data from either an API or a web page. It provides a user-friendly interface to input the source, URL, and desired output format (JSON or CSV) for the scraped data. The application is built using the Python tkinter library for the graphical user interface, and it uses requests for making HTTP requests, BeautifulSoup for web scraping, and pandas for handling data.

Preview

Data Scrapped from Web (Based on Written Code from specific website)

from web

Scrapped Data in JSON:

JSON show case

Data fetched from API

from API

Table of Contents

Overview

The Data Scraper GUI Application simplifies the process of data scraping from web sources by providing an easy-to-use graphical interface. Users can choose between fetching data from an API or scraping data from a web page. The application supports saving the scraped data in either JSON or CSV format.

Features

  • Choose between API and web scraping as data sources.
  • Input the URL of the API or the web page to scrape.
  • Select the desired output format: JSON or CSV.
  • Fetch data from the specified source and URL.
  • Display a progress bar to indicate the scraping process.
  • Handle errors and display status messages.
  • Save the scraped data in the selected format.

Installation

  1. Clone this repository to your local machine:
    git clone https://github.com/Dev-Hooman/data-scraper.git
    cd data-scraper

Usage

  1. Run the application by executing the following command:
       python run.py
  2. The graphical user interface (GUI) will open. Here's how to use the application:
  • Choose the data source by selecting "API" or "Web".
  • Enter the URL of the API or web page you want to scrape.
  • Select the output format as either "JSON" or "CSV".
  • Click the "Scrap" button to initiate the scraping process.
  • The progress bar will show the progress of the scraping.
  • Once the scraping is complete, the status label will display the result.
  1. The scraped data will be saved in the selected format (JSON or CSV) in a location of your choice.

Customization

You can customize this application to suit your needs. Some possible improvements include:

  • Adding more data sources or web scraping targets.
  • Enhancing the user interface with more styling and design.
  • Adding options to customize the data scraping process further.
  • Handling additional errors and edge cases.
  • Implementing multi-threading for smoother user experience.

Contributing

  • Contributions are welcome! If you have any suggestions, bug fixes, or enhancements, feel free to open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

data-scrapper's People

Contributors

dev-hooman avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.