codingbee77 / data_science_projects
Repository with my data science projects.

Jupyter Notebook 100.00%
data-science dimensionality-reduction multiclass-classification ensemble-learning classifiers webscraping

Data-Science_Projects

This repository holds some of my personal projects from the last few months. They are currently grouped into the following categories:

  1. Complex and original data science project covering web scraping, data cleaning, exploratory analysis, model building, and optimization: PEH_Classifier.

The aim of this project is to classify hair products into 3 categories according to PEH balance, based on the ingredient list.
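
As a rough illustration of how an ingredient list could be turned into model input, here is a minimal sketch that encodes each product as a binary vector over the ingredients seen in the data. The product strings, the function names, and the comma-separated format are assumptions for illustration; the notebook may prepare its features differently.

```python
# Hypothetical ingredient lists; not taken from the project's scraped data.
PRODUCTS = [
    "Aqua, Glycerin, Cetearyl Alcohol",
    "Aqua, Hydrolyzed Keratin, Glycerin",
]


def tokenize(ingredient_list):
    """Split a comma-separated ingredient list into lowercase ingredient names."""
    return [part.strip().lower() for part in ingredient_list.split(",") if part.strip()]


def build_vocabulary(products):
    """Map every distinct ingredient to a column index (alphabetical order)."""
    names = sorted({name for product in products for name in tokenize(product)})
    return {name: index for index, name in enumerate(names)}


def to_vector(ingredient_list, vocabulary):
    """Binary vector: 1 if the ingredient is present in the product, else 0."""
    vector = [0] * len(vocabulary)
    for name in tokenize(ingredient_list):
        if name in vocabulary:
            vector[vocabulary[name]] = 1
    return vector


vocabulary = build_vocabulary(PRODUCTS)
vectors = [to_vector(product, vocabulary) for product in PRODUCTS]
```

Vectors like these can be passed to any classifier that predicts one of the three categories; ingredients not present in the vocabulary are simply ignored in this sketch.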

  2. Web scraping and data cleaning project: Data_science_salary_predictions

The project contains data scraped from the Glassdoor website, including salary ranges. The dataset was cleaned and new features were extracted to prepare it for future predictions: minimum and maximum salary, company name, job state, and encoded skills. Everything is ready to start building an ML model.
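
The feature extraction described above could look roughly like the sketch below. The raw row layout, the salary string format, the skill list, and all function names are assumptions made for illustration; they are not taken from the notebook.

```python
import re

# Hypothetical raw rows in the style of a scraped job listing.
RAW_ROWS = [
    {"salary": "$53K-$91K (Glassdoor est.)", "location": "Albuquerque, NM",
     "description": "Experience with Python, SQL and Excel required."},
    {"salary": "$80K-$120K", "location": "New York, NY",
     "description": "We use Spark and Python on AWS."},
]

SKILLS = ["python", "sql", "excel", "spark", "aws"]  # assumed skill list


def parse_salary_range(text):
    """Return (min, max) salary in thousands of dollars, or None if unparseable."""
    numbers = re.findall(r"\$(\d+)K", text)
    if len(numbers) != 2:
        return None
    low, high = (int(n) for n in numbers)
    return low, high


def job_state(location):
    """Take the part after the last comma as the state code."""
    return location.rsplit(",", 1)[-1].strip()


def encode_skills(description, skills=SKILLS):
    """One 0/1 flag per skill, based on whole-word matches in the description."""
    words = set(re.findall(r"[a-z+#]+", description.lower()))
    return {f"skill_{skill}": int(skill in words) for skill in skills}


def clean_row(row):
    """Combine the extracted features into one flat record."""
    cleaned = {"job_state": job_state(row["location"])}
    salary = parse_salary_range(row["salary"])
    if salary is not None:
        cleaned["min_salary"], cleaned["max_salary"] = salary
    cleaned.update(encode_skills(row["description"]))
    return cleaned


cleaned_rows = [clean_row(row) for row in RAW_ROWS]
```

Rows whose salary string does not contain exactly two dollar amounts are kept without salary columns here, so the decision to drop or impute them stays explicit.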

  3. Comparison of dimensionality reduction techniques: Dimensionality_reduction

Used different classification models to find the most accurate classifier on a high-dimensional dataset. Investigated how dimensionality reduction algorithms affect accuracy and training time, and how they make the dataset more understandable to the business.
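
To make the reduction step concrete, here is a minimal sketch of one such technique: projecting data onto its first principal component, found by power iteration. It uses only the standard library and synthetic data, and it covers the reduction alone, not the accuracy or timing comparison. Which algorithms the notebook actually compares is not stated here, so treat the choice of PCA and every name below as an assumption.

```python
import random


def center(rows):
    """Subtract the per-column mean from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of already-centered rows."""
    n, d = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(d)]
            for i in range(d)]


def first_component(cov, iterations=200):
    """Power iteration: apply cov repeatedly and renormalize to approach the top eigenvector."""
    d = len(cov)
    v = [1.0 / d ** 0.5] * d
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return v


def project(rows, component):
    """Reduce each row to a single coordinate along the component."""
    return [sum(x * c for x, c in zip(r, component)) for r in rows]


# Synthetic 3-feature data: two strongly correlated features plus a little noise.
random.seed(0)
data = []
for _ in range(200):
    t = random.gauss(0, 1)
    data.append([t + random.gauss(0, 0.1), 2 * t + random.gauss(0, 0.1), random.gauss(0, 0.1)])

centered = center(data)
cov = covariance(centered)
component = first_component(cov)
reduced = project(centered, component)

# Share of the total variance that survives the 3 -> 1 reduction.
total_variance = sum(cov[i][i] for i in range(len(cov)))
kept_variance = sum(x * x for x in reduced) / (len(reduced) - 1)
explained = kept_variance / total_variance
print(f"variance kept by 1 of 3 dimensions: {explained:.1%}")
```

In a comparison like the one described, the reduced data would then be fed to each classifier, with accuracy and fit time recorded before and after the reduction.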

  4. Classification problem with different classifiers and ensemble learning: Ensemble_learning_with_mushroom_dataset

Use different classifiers and their ensembles to recognize poisonous and edible mushrooms by their attributes. Use different accuracy metrics and classification reports to assess classifier accuracy. Examine feature importance for different algorithms.
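
The simplest form of the ensembling described above is hard (majority) voting, sketched below with the standard library. The three rule-based "classifiers", the attribute names, and the sample values are invented for illustration only; they are not real identification rules and are not taken from the notebook.

```python
from collections import Counter


# Hypothetical single-attribute classifiers; each returns a class label.
def by_odor(mushroom):
    return "poisonous" if mushroom["odor"] in {"foul", "pungent"} else "edible"


def by_gill_size(mushroom):
    return "poisonous" if mushroom["gill_size"] == "narrow" else "edible"


def by_bruises(mushroom):
    return "edible" if mushroom["bruises"] else "poisonous"


def majority_vote(classifiers, sample):
    """Hard voting: return the label predicted by most classifiers."""
    votes = Counter(classifier(sample) for classifier in classifiers)
    return votes.most_common(1)[0][0]


ENSEMBLE = [by_odor, by_gill_size, by_bruises]

sample_a = {"odor": "foul", "gill_size": "narrow", "bruises": True}
sample_b = {"odor": "almond", "gill_size": "broad", "bruises": False}

prediction_a = majority_vote(ENSEMBLE, sample_a)
prediction_b = majority_vote(ENSEMBLE, sample_b)
```

With an odd number of voters and two classes there are no ties. With trained models, the same idea is usually applied through a library's voting or stacking ensemble rather than written by hand.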

The second notebook, "Ensemble_learning_mushroom_class_pyforest", tests importing ML and data science modules with the pyforest library.

  5. Classification problem with different accuracy metrics and learning curves: Mushroom_classification

Build a mushroom classifier with the highest precision and find the most indicative features.
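
Since precision is the target metric here, the sketch below shows how precision and recall are computed from true and predicted labels. The labels are made up, and treating "edible" as the positive class is an assumption; the notebook may define the positive class differently.

```python
def precision_recall(y_true, y_pred, positive):
    """Return (precision, recall) for the given positive class."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(t == positive and p == positive for t, p in pairs)
    false_pos = sum(t != positive and p == positive for t, p in pairs)
    false_neg = sum(t == positive and p != positive for t, p in pairs)
    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    return precision, recall


# Hypothetical labels for six mushrooms.
y_true = ["edible", "edible", "poisonous", "poisonous", "edible", "poisonous"]
y_pred = ["edible", "poisonous", "poisonous", "edible", "edible", "edible"]

precision, recall = precision_recall(y_true, y_pred, positive="edible")
```

With "edible" as the positive class, every false positive is a poisonous mushroom labeled edible, which is one reason to optimize precision rather than plain accuracy in this setting.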
