Giter Site home page Giter Site logo

imdb-movie-scraper's Introduction

IMDB Movie Scraper

Using BeautifulSoup and Requests library implemented scraper to scrape details of Top 250 Indian movies according to imdb.com and performed different analysis based on year, decade, genres, director, language and cast.

Instructions of usage

Step 1 :

Install important tools using command :

  • sudo apt-get install git
  • sudo apt install python3-pip

Clone this app using the command:

  • git clone https://github.com/vishalgaddam873/Imdb-Movie-Scraper.git

Step 2:

Install requirements of the project using command:

  • pip3 install -r requirement.txt

Step 3:

Run the scraper file using command:

  • cd Imdb-Movie-Scraper/
  • python3 scraper.py

This project is divided in different task, The task are as follows:

Task 1

Scrape the data of the top rated Indian Movies from the Web site of IMDB, with the names of 250 movies and their rank as well as the ratings and the number of movies that are released in that year.

Task 2

List the movies decade wise.

Task 3

List the movies year wise.

Task 4

Scrape the single movie details which include movie's name, director, bio, country, language, run-time and also create a cache memory.

Task 5

Return the 250 movies details

Task 6

List the movies by their languages.

Task 7

List the movies by their directors.

Task 8

Create a cache memory and fetch the data from cache memory.

Task 9

Use time module and random module it take nap of 2-3 seconds randomly.

Task 10

List the movie directors by languages in which they have worked in.

Task 11

List the movies by their geners

Task 12

Scrape movie cast details.

Task 13

Scrape the cast url

Task 14

List the movie cast by their co-actores.

Task 15

List the movie by their actors.

Bonus Task

List the other movies by same gener from movies like this link.

imdb-movie-scraper's People

Contributors

vishalgaddam873 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.