Giter Site home page Giter Site logo

obrunet / my_data_science_portfolio Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 11.32 MB

An aggregation of all my Data Science challenges, studies, projects... from Kaggle / ENS Data Challenges / Zindi - various ML tasks : regression / classification / computer vision / N.L.P / recommendation engine

Python 0.02% Jupyter Notebook 99.98%
data-science zindi challenges kaggle machine-learning classification recommendation-engine regression

my_data_science_portfolio's Introduction

Here is a summary of all the Data Science projects i've done so far :

Those challenges come from:

I've achieved varisous tasks / used different types of ML models:

  • regression / classification
  • computer vision
  • natural language processing
  • recommendation engine
Title & notebook Year Source Description & Goals
Fashion MNIST 2019 Kaggle Classification task of 28x28 grayscale clothes' images with CNNs & MLPs in Tensorflow
Bike Sharing Demand 2019 Kaggle Forecast rentals of a city bikeshare system with datetime, weather, rentals number infos (supervised ML regression)
House Prices 2019 Kaggle Use case: predict sales prices (Supervised ML with feature engineering)
Dog or Cat 2019 ? Distinguish images of dogs from cats : binary classification / computer vision using CNN with Tensorflow
Adult Census Income 2019 Kaggle Predict whether income exceeds $50K/yr based on census defining people profiles (supervised ML binary classification)
Credit Default Risk 2019 Kaggle Prediction whether or not an applicant will be able to repay a loan based on previous credits, POS (point of sales), cash loans, repayment history...
Hybrid Recommendation Engine 2019 Kaggle Recommandation System with LightFM (collaborative & content-based filtering) trained on the MovieLens 100K dataset
Customer Segmentation 2019 Kaggle Segmentation : target customers with whom you can start marketing strategy (unsupervised ML - KMeans Clustering)
Fraud Detection 2019 Kaggle Anomaly detection among anonymized credit card transactions labeled as fraudulent or genuine (PCA than oversampling on highly imbalanced classes (SMOTe))
Real Estate Price 2019 Kaggle Prediction of house prices for California districts derived from a census (analysis of geospatial data)
Customer Churn 2019 Kaggle Predict behavior to retain your customers before they live
Historical consumption regression for electricity supply pricing 2020 E.N.S Predict the electricity consumption of 2 given sites during the next year using time series & deep learning models R.N.N / G.R.U.
Solar Power Generation in EU - Part 1 / 2 / 3 2020 Kaggle Countries segmentation depending on their generation profile, power plant's efficiency prediction accross time with R.N.N & Prophet.
Cirta - Particle Type Classification - Part 1 & 2 2020 Zindi Build a machine learning model to help physicists identify particles after a collision based on images.
Handwritten Digit Generation 2021 Kaggle Unsupervised deep learning with G.A.N - training of a Generator and a Discriminator with the famous MNIST dataset
Wind Power Generation in EU - Part 1 / 2 2021 Kaggle Countries segmentation depending on their generation profile, efficiency prediction analysis.
Fowl Escapades 2022 Zindi Classification of numerious classes of birds callings. Audio data augmentation, preprocessing, spectrograms & deep learning with CNNs
Pima Indians Diabetes - Analysis & Predictions 2022 Kaggle Analysis of patients observations, health measures and how to build a machine learning model to diagnostically predict whether or not a patient has diabetes
Treadmills users - Advanced data viz, recommendation) 2022 Kaggle Interactive & dynamic charts with Plotly with advanced stats, recommendation & clustering of mix data types with K-prototypes
Prediction of building's greenhouse gas emissions (GHG) - part 1 - EDA & 2 - Modelling 2023 Kaggle Geospatial analysis - regression & explainability
H&M Recommendation System for Personalized Fashion - part 1 - EDA & 2 - Modelling 2023 Kaggle Analysis of datasets with RecoSys specific caracteristics, and fine tuning of an hybrid lightfm model

my_data_science_portfolio's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.