Giter Site home page Giter Site logo

early-stage-diabetes-prediction-using-machine-learning's Introduction

Early stage diabetes prediction using Machine Learning

thumbnail

Project Overview :

In this project I have predicted the chances of diabetes using Early stage diabetes risk prediction dataset.This has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh and approved by a doctor. This dataset contains the sign and symptpom data of newly diabetic or would be diabetic patient.

The datasets consists of several medical predictor variables and one target variable, class. Predictor variables includes the Age, gender, Polyuria,Polydipsia and so on. The dataset used is available at UCI Machine Learning repository

Installations :

This project requires Python 3.x and the following Python libraries should be installed to get the project started:

I also reccommend to install Anaconda, a pre-packaged Python distribution that contains all of the necessary libraries and software for this project which also include jupyter notebook to run and execute IPython Notebook.

Code :

Actual code to get started with the project is provided in two files one is,Early Stage Diabetes Prediction.ipynb

Run :

In a terminal or command window, navigate to the top-level project directory PIMA_Indian_Diabetes/ (that contains this README) and run one of the following commands:

ipython notebook Early Stage Diabetes Prediction.ipynb or

jupyter notebook Early Stage Diabetes Prediction.ipynb

This will open the Jupyter Notebook software and project file in your browser.

About Data

This dataset contains the sign and symptpom data of newly diabetic or would be diabetic patient.This has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh and approved by a doctor.

Features of the dataset

The dataset consist of total 16 features and one target variable named class.

1. Age: Age in years ranging from (20years to 65 years)
2. Gender: Male / Female
3. Polyuria: Yes / No
4. Polydipsia: Yes/ No
5. Sudden weight loss: Yes/ No
6. Weakness: Yes/ No
7. Polyphagia: Yes/ No
8. Genital Thrush: Yes/ No
9. Visual blurring: Yes/ No
10. Itching: Yes/ No
11. Irritability: Yes/No
12. Delayed healing: Yes/ No
13. Partial Paresis: Yes/ No
14. Muscle stiffness: yes/ No
15. Alopecia: Yes/ No
16. Obesity: Yes/ No

Class: Positive / Negative

Steps to be Followed :

Following steps I have taken to apply machine learning models:

  • Importing Essential Libraries.
  • Data Preparation & Data Cleaning.
  • Data Visualization (already done in early_Diabetes_Prediction_EDA.ipynb)
  • Feature Engineering to discover essential features in the process of applying machine learning.
  • Encoding Categorical Variables.
  • Train Test Split
  • Apply Machine Learning Algorithm
  • Cross Validation
  • Model Evaluation

Model Evaluation :

I have done model evaluation based on following sklearn metric.

Results :

The result of Logistic Regression and Random forest algorithms with or without feature selection is shown below.

Result

Feature Importance :

Result

ROC Curve :

Result

early-stage-diabetes-prediction-using-machine-learning's People

Contributors

sid321axn avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.