Giter Site home page Giter Site logo

data-analysis-videogames's Introduction

Gamers don't die, they respawn

Objective

This repo is about simple data analysis of video game titles, platforms and their sales in various parts of the world. By leveraging the wonderful python libraries for data analysis we gain deep insights into the data.

Some areas worth exploring:

  • Titles which are available for more than one platform
  • Top contending platform
  • Which type of platform is popular?
  • Top selling genres
  • Top publishers by Global Sales

Dataset can be downloaded from here

Data Decription (as available on Kaggle)

Context

Motivated by Gregory Smith's web scrape of VGChartz Video Games Sales, this data set simply extends the number of variables with another web scrape from Metacritic. Unfortunately, there are missing observations as Metacritic only covers a subset of the platforms. Also, a game may not have all the observations of the additional variables discussed below. Complete cases are ~ 6,900

Content

Alongside the fields: Name, Platform, Year_of_Release, Genre, Publisher, NA_Sales, EU_Sales, JP_Sales, Other_Sales, Global_Sales, we have:-

  • Critic_score - Aggregate score compiled by Metacritic staff
  • Critic_count - The number of critics used in coming up with the Critic_score
  • User_score - Score by Metacritic's subscribers
  • User_count - Number of users who gave the user_score
  • Developer - Party responsible for creating the game
  • Rating - The ESRB ratings

Prerequiste

  1. Python 3
  2. Jupyter Notebook
  3. Pandas (for data analysis)
  4. Numpy
  5. Matplotlib (for visualization)
  6. Seaborn (for visualization)

View the Notebook

How to run the Notebook

  1. Download and install Anaconda. It contains all the relevant packages mentioned in Prerequiste section
  2. Download the dataset and this repository
  3. Open terminal/command prompt and navigate to the downloaded repository, then run "jupyter notebook" command in the terminal
  4. Jupyter file explorer will open in the browser. Click on the notebook then run Cell-->Run All from the menu.

Data view

Dataframe head

Plots from notebook

Most Number if Titles per Platform

Most Number of titles

Region Sales

TODO

  • Use predictive modelling to estimate future sales

data-analysis-videogames's People

Contributors

hpanwar08 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.