In this project, there are four tabular datasets (Amazon Prim, Disney+, Hulu, Netflix) consisting of information of movies or TV shows available on these top-tier video streaming platforms. We first do data preprocessing like column convertion to clean the original data. Then, we do an overall analysis of the movies and TV shows from type, title, rating, country, release year and genres and understand the trends by visualization. Then, we analysis these four platforms from type, rating, duration by visualization and compare them with each other.
Media and Video streaming platforms are getting more and more popular in recent years and many companies are following this trend. In this project, we have 4 tabular datasets consisting of information of movies or TV shows available on 4 top-tier video streaming platforms, including Amazon Prim, Disney+, Hulu, Netflix. We will try to do an overall analysis of the movies and TV Shows and understand the trends. Also, we will analysis these four platforms using given datasets and compare them with each other.
The given datasets are contained in four CSV files, named in format of “company_titles.csv”. Except the first row, each row has information of one movie / TV show. Every CSV file has 12 columns separated by comma, whose name and details are shown below. 2 consecutive commas mean one empty cell in the middle.