We will scrape data from an E-Commerce website, using Beautiful Soup.
Beautiful Soup is a Python package for parsing HTML and XML documents. It creates a parse tree for parsed pages that can be used to extract data from HTML, which is useful for web scraping.
For people who work with data it is important to be able to create own datasets. Often we rely on datasets from someone else. This course should show all data enthusiasts how to scrape and store data in Excel Files.
Web Scraping
Beautiful Soup
Data Extraction
Web Scraping for Data Science
Data Mining
Data Scraping & Data Cleaning
-web scraping
-data extraction
-beautiful soup
-requests library
I have extracted the data for top 10 pages only, as the data set was containing more that 41000 pages due to which it was taking around 4-5 hours to load the dataset in jupyter notebooks.