Giter Site home page Giter Site logo

challenge's People

Contributors

baldmer avatar omartrinidad avatar shinhokang avatar

Watchers

 avatar  avatar  avatar  avatar

challenge's Issues

To-dos to settle down

  • Create a Kaggle team for your own account and name it with your DSG team name
  • Decide which method to use - deep learning? or other way?
  • Decide which framework to use - e.g. tensorflow, theano, etc..
  • Do exploratory data analysis in the data set - here we can see some examples https://www.kaggle.com/kernels

Do preprocessing by columns

Shinho

  • Create 10 samples on CSV.

  • Write the options to use listen_type.

  • Functionalize preprocess modules

  • Add another column with a binary status weekday and weekend.

  • Correct the release date preprocessing.

Baldo

  • Plot a age and release year graph.
  • Plot a relation between age and platforms and is_listened column.

Omar

  • Send a naive solution with random numbers.

All

  • Clustering or kriging with columns: album, media, genre, media_duration, and artist

Modeling strategy

  1. Model A: train a model without user_id to find correlations between (user_gender, user_age, and the other features) to (song's cluster).

  2. Model B: train models for each user's history using Model A and get the result of test sample.
    Model B should be temporary and wrap Model A.

Usage of listen_type

Since there's no data with listen_type=0 in test set, the usage of listen_type is an issue.

  1. Option 1.
    Ignore listen_type

  2. Option 2.
    Use listen_type=1 only in the training

  3. Option 3.
    Use listen_type=0 as training set, listen_type=1 as validation set

  4. Option 4.
    Just Use It.

Translate Unix timestamp to date.

This is a nice observation. As the dataset is from one month, maybe we can have as a result only the hour and another column indicating the day.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.