Giter Site home page Giter Site logo

outbrain-click-prediction's Introduction

Outbrain-Click-Prediction


screen shot 2017-04-28 at 11 06 49 pm

Deliverables


Project Proposal Video : https://www.youtube.com/watch?v=IeKPIaOxPJI

RShiny Deployed Link: https://outbrainproject.shinyapps.io/WebOutBrain/

Tableau Online link: https://us-east-1.online.tableau.com/#/site/ads_outbrain_finalproject/workbooks (Note: since data file was big tableau server was used. Please use your credentials to login in view the dashboard)

Final Report document: ADS_Final_Project_Report_Team8.docx

DockerHub Link : : https://hub.docker.com/r/sweta/outbrain_prediction/

Project Demo Link: https://www.youtube.com/watch?v=19ihFCw8RIM

Final Presentation Link :ADS_Final_Project_Team8.pptx

Tools used: R, R-shiny for web devolpment, Python, Plotly and Tableau for analysis, Docker, IBM Data Science, Microsoft Azure, AWS

Note

In Exploratory analysis we have used plotly in offline mode, hence the graph will not be visible unless the code is run. The Report has all the graph images along with the explanation.

PROBLEM STATEMENT:


Millions of user use social network for surfing, visiting countless websites and clicking on countless ads/recommendations on these website. Knowing what the users are interested in and what the users are using in real world would be of great significance for future recommendations used by marketing team to attract potential users as well as ad placements and real time bidding Predicting the likelihood of users clicking on a particular content Ranking the recommendations in each group by decreasing predicted likelihood of being clicked

We have made Kaggle submission of random forest model. The model is evaluated as shown in the image below

screen shot 2017-04-28 at 11 31 33 pm

and below is the model evaulation

whatsapp image 2017-04-28 at 11 48 48 pm

  • Below is a screenshot of rest API that will predict advertisement ids for an individual display id.

click probability

  • Below is the screenshot of rest API that will predict individual ad click probablity

whatsapp image 2017-04-28 at 11 51 04 pm

Exploratory Analysis

Analysis 1


screen shot 2017-04-28 at 11 16 21 pm

Observations

  • Maximum percentage of clicks were made through mobile phones, followed by desktop and then tablets. Mainly because:

  • App Availability: the app is available that is available on desktop is now present on model too.

  • Convenience: Games or social networking apps frequently serve as a way to pass the time while on the subway commuting home or in a cab or surfing net. This directly reflects the increase in uses of mobile devices

Analysis 2


clicks_by_hour_events_table

Observations

  • We observed that the click frequency is very high between 10-15 hrs. Around 10-11 a.m. time people are usually on their commute to work, school etc. and pass their time surfing net. Around 1-3 p.m. is usually lunch time where people get time to surf internet and that’s the frequency is high.

  • The frequency in the bucket 15-20 is again when people are traveling back home and around dinner time when they have time to surf internet.

Analysis 3


screen shot 2017-04-26 at 10 22 26 am

Observations

  • The number of clicks increase from 9.am. and reaches highest at around 10 a.m. and remains high till 12p.m. and then gradually decreases till it hits 3.p.m. It again rises from 3p.m.

Analysis 4


screen shot 2017-04-28 at 7 35 57 pm

Observations

From the dashboard we can conclude that USA that the maximum number of view rate. The frequency of clicks gradually increases from 6 am and was high around 10 a.m. to 12 p.m. The click was highest on Day 0 which is June 14th. It is a US national holiday

Analysis 5


screen shot 2017-04-28 at 7 36 39 pm

Observations

  • California state has the maximum number of views.
  • The maximum number of views were from mobile devices at around 11 a.m.

outbrain-click-prediction's People

Contributors

yogitaj508 avatar bajajsweta avatar antriksh22 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.