sousablde / recommendations-with-ibm

Analysis of the interactions that users have with articles on the IBM Watson Studio platform, with recommendations made to users based on model predictions.

Recommendations with IBM

Installation
Libraries Used
Project Motivation
Project Data
Project Outline
Licensing, Authors, and Acknowledgements

Installation

No additional installation is needed. The libraries used are available through the Anaconda package manager.

Libraries used:

NumPy
pandas

Project Motivation

This project provided an opportunity to gain exposure to building recommender systems. It deepens the understanding of rank-based filtering, collaborative filtering, and SVD models for recommendations.

Project Data

Provided by IBM in collaboration with Udacity.

Project Outline

There are four components in this project.

Exploratory Data Analysis
- Getting to know the data and developing data understanding.
- The distribution of how many articles a user interacts with in the dataset.
- The number of unique articles that have at least one interaction with a user.
- The number of unique articles in the dataset (whether they have any interactions or not).
- The number of unique users in the dataset (excluding null values).
- The number of user-article interactions in the dataset.
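
The counts listed above can be sketched with pandas as follows. This is a minimal illustration on a toy interaction log, not the notebook's actual code; the column names `user_id` and `article_id` and the helper `interaction_summary` are assumptions made for the example.

```python
import pandas as pd

# Toy interaction log. The column names `user_id` and `article_id`
# are assumptions for this sketch, not the real dataset's schema.
df = pd.DataFrame({
    "user_id":    [1, 1, 2, 2, 2, 3, None],
    "article_id": [10, 11, 10, 12, 12, 10, 13],
})


def interaction_summary(df):
    """Return basic counts describing a user-article interaction log."""
    # groupby drops rows whose user_id is null, so those are excluded here.
    per_user = df.groupby("user_id")["article_id"].count()
    return {
        "unique_articles_with_interactions": df["article_id"].nunique(),
        # nunique ignores null values by default.
        "unique_users": df["user_id"].nunique(),
        "total_interactions": len(df),
        "median_interactions_per_user": per_user.median(),
        "max_interactions_per_user": per_user.max(),
    }


summary = interaction_summary(df)
print(summary)
```

The number of unique articles in the full dataset (including those with no interactions) would come from a separate article catalogue, which this toy example does not model.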
Rank Based Recommendations
- Find the most popular articles based simply on the number of interactions.
- In the absence of ratings for any of the articles, assume the articles with the most interactions are the most popular.
- These are then the articles we might recommend to new users (or anyone depending on what we know about them).
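
As a rough sketch of the idea above, popularity can be measured by counting interactions per article. The function name `get_top_article_ids` and the column name `article_id` are assumptions for illustration.

```python
import pandas as pd


def get_top_article_ids(df, n=10):
    """Return the ids of the n articles with the most interactions."""
    # value_counts sorts by frequency, most frequent first.
    counts = df["article_id"].value_counts()
    return counts.head(n).index.tolist()


# Toy interaction log (hypothetical data).
df = pd.DataFrame({
    "user_id":    [1, 1, 2, 2, 3, 3, 3],
    "article_id": [10, 11, 10, 12, 10, 12, 13],
})

top_two = get_top_article_ids(df, n=2)
print(top_two)
```

Because this ranking ignores who the user is, it gives the same list to everyone, which is why it suits new users with no interaction history.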
User-User Based Collaborative Filtering
- In order to build better recommendations for the users of IBM's platform, we could look at users that are similar in terms of the items they have interacted with. These items could then be recommended to the similar users. This would be a step in the right direction towards more personal recommendations for the users.
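
One way to sketch this approach, under stated assumptions: build a binary user-item matrix, measure similarity as the number of articles two users share (a dot product), and recommend unseen articles from the most similar users first. The function names, column names, and the choice of dot-product similarity are assumptions for illustration, not necessarily what the notebook does.

```python
import pandas as pd


def create_user_item_matrix(df):
    """Binary matrix: rows are users, columns are articles, 1 = interacted."""
    # crosstab counts interactions; clip turns counts into 0/1 flags.
    return pd.crosstab(df["user_id"], df["article_id"]).clip(upper=1)


def find_similar_users(user_id, user_item):
    """Other users ordered by number of shared articles, most similar first."""
    similarity = user_item.dot(user_item.loc[user_id]).drop(user_id)
    return similarity.sort_values(ascending=False, kind="stable").index.tolist()


def user_user_recs(user_id, user_item, m=5):
    """Recommend up to m articles seen by similar users but not by user_id."""
    seen = set(user_item.columns[user_item.loc[user_id] == 1].tolist())
    recs = []
    for neighbor in find_similar_users(user_id, user_item):
        neighbor_articles = user_item.columns[user_item.loc[neighbor] == 1].tolist()
        for article in neighbor_articles:
            if article not in seen and article not in recs:
                recs.append(article)
            if len(recs) >= m:
                return recs
    return recs


# Toy interaction log (hypothetical data).
df = pd.DataFrame({
    "user_id":    [1, 1, 2, 2, 2, 3],
    "article_id": [10, 11, 10, 11, 12, 13],
})
user_item = create_user_item_matrix(df)
recs = user_user_recs(1, user_item, m=2)
print(recs)
```

In this sketch, users with no shared articles still appear at the end of the similarity ranking; a stricter version might drop neighbors whose similarity is zero.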
Matrix Factorization
- Using the user-item interactions, build a matrix decomposition (SVD). Use the decomposition to evaluate prediction performance.
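
A minimal sketch of the decomposition step, assuming a small dense binary user-item matrix with no missing values. It uses NumPy's SVD, keeps the top `k` latent features, and measures how well the rounded reconstruction matches the original matrix. The function name and the accuracy measure are assumptions for illustration; this checks reconstruction on the same matrix only and is not a train/test evaluation.

```python
import numpy as np


def svd_reconstruction_accuracy(user_item, k):
    """Share of entries recovered when keeping only k latent features."""
    u, s, vt = np.linalg.svd(user_item, full_matrices=False)
    # Rebuild the matrix from the k largest singular values.
    approx = u[:, :k] @ np.diag(s[:k]) @ vt[:k, :]
    predicted = np.around(approx)
    errors = np.abs(user_item - predicted).sum()
    return 1 - errors / user_item.size


# Toy binary user-item matrix (hypothetical data): rows are users,
# columns are articles, 1 means the user interacted with the article.
user_item = np.array([
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
], dtype=float)

full_rank_accuracy = svd_reconstruction_accuracy(user_item, k=4)
low_rank_accuracy = svd_reconstruction_accuracy(user_item, k=1)
print(full_rank_accuracy, low_rank_accuracy)
```

Keeping every latent feature reproduces the matrix exactly, so the interesting question is how accuracy changes as `k` shrinks.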