Giter Site home page Giter Site logo

adishirsath / machine_learning_projects Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 1.0 12.88 MB

Projects on Classification and Regression

Jupyter Notebook 99.80% Python 0.10% SCSS 0.04% HTML 0.07%
classification regression exploratory-data-analysis feature-engineering feature-selection linear-models bagging-ensemble boosting-algorithms

machine_learning_projects's Introduction

Machine_learning_projects

Overview:-

  • This repository contains projects on ML Classification and Regression
  • Classification:- classification is the problem of identifying to which of a set of categories (sub-populations) a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known.
  • Ex. Prediction of heart disease( Yes or No)

classification

  • Regression:- In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome variable') and one or more independent variables (often called 'predictors', 'covariates', or 'features').
  • Ex. Prediction of number (price prediction)

Regression

Repository Guide:-

Here is list for every project in this repository and link to introduction of project


Classification:-

Regression:-

Tools And Workflow:-

machine-learning-life-cycle

  • Collect Data
  • Exploratory Data Analysis:- (with visualization)
  • Feature Engineering:-
  • Handle missing values and Categorical Features - sklearn
  • Outlier's (depending on which model to use)
  • Feature Scaling (depending on model)
  • Split data
  • Feature selection:-
  • Building model
  • Predict on test set
  • Build web-app Flask, HTML5, CSS3 or Streamlit
  • Deploy on cloud

Project Introduction's:-(for detail discription visit respective project)

Classification:-


Telco Churn Predictor

  • This project is to predict if customer will leave telco or not, using machine learning
  • In this we used XGBoost classifier to predict it and got 78.64 accuracy
  • by using gamma=0, learning_rate=0.1, max_depth=4, reg_lambda=10, scale_pos_weight=2, subsample=0.9, colsample_bytree=0.5, which we got by hyperparameter tunning XGBClassifier

Heart Disease Predictor

  • On basis of person's health records predict if person have heart disease or not
  • In this we used 4 different models to see initial accuracy (ie. SVM, RandomForestClassifier, Adaboost, KNN)
  • Out of this we we picked top two for hypertunning( RandomForest and Adaboost) and Adaboost got 91% accuracy_score

Titanic Survial Predictor

  • This project is to predict the passangers in titanic will survive or not on the basis of given data.
  • In this notebook we've used Random Forest,KNN and GradientBoost classifier and got better accuracy on these top 2 models i.e.GradientBoostClassifier and RandomForestlassifier.
  • After Hypertunning got an accuracy 78.73% for Gradient Boosting.
  • Hypertunnig parameters for GBC learning_rate= 0.01, max_depth= 4, max_features= 16, min_samples_leaf= 1, min_samples_split=20, n_estimators= 500.

Regression:-


Bulldozer Price Predictor

  • Prediction of sale price of bulldozer on basis of its specifications like year made, productSize and 50 more features
  • Here we used RandomForestRegressor and got pretty good score of 85% r^2
  • so after hypertuning we got best_params n_estimators=90, min_samples_leaf=1, min_samples_split=14,max_features=0.5, n_jobs=-1, max_samples=None
  • with this we got 95% and 88% r^2 on train and test set respectively
  • Get source code Visit

House Price Predictor

  • Prediction of House price on basis of features of house
  • Here we used Lasso, randomforest and XGBoost for initial r^2 score
  • Then we hypertuined XGB cause it had bit more r^2 and got gamma= 0, learning_rate= 0.05, max_depth =6, reg_lambda= 0, subsample=0.9, colsample_bytree=0.5
  • with this params I got 98% and 85% r^2 on train and validation set respectively
  • Get source code Visit

Flight Fare Prediction

  • Predicting fare of flight based on given information
  • Trained Lasso, Gradient Bosst, Random Forest on full data
  • Random forest had best 80 r2 so took this for hyperparameter tuning got 83 r2
  • Using Flask with help of html5-css3 built web-app and Deployed on Heroku
  • To view demo of web app visit

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.