Giter Site home page Giter Site logo

macrusso / price_scraper Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 7 KB

Scrapy spider which gets prices form a web catalouge and saves them in Excel.

Python 100.00%
python selenium-webdriver openpyxl scrapy scrapy-spider automation spider price price-scraper web-catalouge

price_scraper's Introduction

Siemens Mall Price Scrapper

Small stand alone crawler script. Made to make my life easier, it goes through part numbers in a db and then goes to that parts pages and gets its prices.

How to use

Download both files. Start with db_seed.py, fill in part numbers you want to get prices for and then run the file. Then in prices.py insert your Siemens Mall credentials where USER_LOGIN and USER_PASSWORD are.

Check locations of your Selenium Webdiver and SQLite files and change the code if needed.

Spider automatically logs in with provided credentials then iterates through the db. Two prices are taken, one is a list price for the item and second is a price with your company discount. If the webpage for the given part is non existent or there is no prices for some reason, script inserts error message to the table.

Below you can find example table with two prices scrapped and one wrong part number and error notification.

Part number Discounted Price List Price
6ES7215-1HF40-0XB0 753.45 984.34
6ES7414-3XM07-0AB1 error error
6ES7221-1BF32-0XB0 72.94 96.90

Technologies used

  • Scrapy
  • Selenium
  • SQLite3

price_scraper's People

Contributors

dameg avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.