Giter Site home page Giter Site logo

netflix's Introduction

cs329e-netflix

This document provides the description of the pre-built caches to help with your project.

You can use any (or none if you want to make your own).

The purpose of the caches is to make it easier to access the data you need without having to read all the project's files (because they're rather large).

The idea is similar to if you're trying to pick a new car. You could spend some time and research every review ever made of a new car. However, that would probably take a very long time because there are so many reviews. Instead, what if you aggregated all the reviews and just looked at the average review. You'd save time by not having to read all the reviews and you'd get the information you need. You could then use this information to decide if you should buy the car.

Now, instead of aggregating car reviews for yourself, you're aggregating movie reviews and you're using these aggregations to predict a user's rating.

Available caches

cache-averageMovieRating.pickle

This is a dictionary with movie_id as keys (int) and average movie rating as values (float) .

cache-averageCustomerRating.pickle

This is a dictionary with customer_id (int) as keys (int) and their average rating as values (float).

cache-actualCustomerRating.pickle

This is a dictionary of (customer_id, movie_id) as keys (int, int) and their actual rating as values (int).

cache-customerAverageRatingByYear.pickle

This is a cache of (customerId, year) as keys (int, int) and then a float of their average rating for the year as the value (float).

cache-movieAverageByYear.pickle

This is a cache of (movieId, year) as keys (int, int) and it's average rating for that year as values (float).

cache-yearCustomerRatedMovie.pickle

This is a cache of (customerId, movieId ) as keys (int, int) and the year the movie was rated by that customer as a value (int).

Note: Values are rounded to 3 decimal places. Also, the actualCustomerRating cache contains only data corresponding to probe data

Variables used:

cID: customerID, type: int

mID: movieID, type: int

yr: year, type: int

rt: rating, type: int/float

cache-actualCustomerRating.pickle

a dictionary of elements below:

(cID, mID): rt

mID rated cID as rt.

cache-averageCustomerRating.pickle

a dictionary of elements below:

cID: rt

The avg rating of cID is rt.

cache-averageMovieRating.pickle

a dictionary of elements below:

mID: rt

The avg rating of mID is rt

cache-customerAverageRatingByYear.pickle

a dictionary of elements below:

(cID, yr): rt

The avg rating of cID in year yr is rt.

cache-movieAverageByYear.pickle

a dictionary of elements below:

(mID, yr): rt

The avg rating of mID in year yr is rt.

cache-yearCustomerRatedMovie.pickle

a dictionary of elements below:

(cID, mID): yr

cID rated mID in year yr.

netflix's People

Contributors

cloudashd avatar

Stargazers

Jesse  Tochukwu uzoma avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.