Using BeautifulSoup and Requests library implemented scraper to scrape details of Top 250 Indian movies according to imdb.com and performed different analysis based on year, decade, genres, director, language and cast.
Install important tools using command :
sudo apt-get install git
sudo apt install python3-pip
Clone this app using the command:
git clone https://github.com/vishalgaddam873/Imdb-Movie-Scraper.git
Install requirements of the project using command:
pip3 install -r requirement.txt
Run the scraper file using command:
cd Imdb-Movie-Scraper/
python3 scraper.py
Scrape the data of the top rated Indian Movies from the Web site of IMDB, with the names of 250 movies and their rank as well as the ratings and the number of movies that are released in that year.
List the movies decade wise.
List the movies year wise.
Scrape the single movie details which include movie's name, director, bio, country, language, run-time and also create a cache memory.
Return the 250 movies details
List the movies by their languages.
List the movies by their directors.
Create a cache memory and fetch the data from cache memory.
Use time module and random module it take nap of 2-3 seconds randomly.
List the movie directors by languages in which they have worked in.
List the movies by their geners
Scrape movie cast details.
Scrape the cast url
List the movie cast by their co-actores.
List the movie by their actors.
List the other movies by same gener from movies like this link.