hajimohamedrufai / data_visualisation

This project forked from jasperzeroes/heart_disease_visualisation_analysis_using_powerbi_

This is a group project on Data visualisation and Exploratory Data Analysis

Data_visualisation Project

Personal Key Indicators of Heart Disease

This dataset contains data from the 2020 annual CDC survey of about 400,000 US adults on their health status, focusing on key indicators of heart disease. Heart disease is a leading cause of death in the US and affects people of most races. The dataset includes variables such as high blood pressure, high cholesterol, smoking, diabetic status, obesity, physical activity, and alcohol consumption.

Data Source:

The dataset is part of the Behavioral Risk Factor Surveillance System (BRFSS) of the Centers for Disease Control and Prevention (CDC). BRFSS conducts annual telephone surveys to gather data on the health status of US residents. The dataset has undergone cleaning to include only the most relevant variables, reducing the original nearly 300 variables to about 20.

Data Content:

The original 2020 BRFSS data consists of 401,958 rows and 279 columns, most of which are questions put to respondents about their health status. The cleaned dataset keeps only the relevant variables (about 20), including key indicators of heart disease such as high blood pressure, high cholesterol, smoking, diabetic status, obesity, physical activity, and alcohol consumption.

Data Use:

This dataset can be used for exploratory data analysis (EDA) and visualisation, as well as for machine learning methods such as logistic regression, SVM, and random forest that predict the likelihood of heart disease. The variable "HeartDisease" should be treated as a binary target ("Yes": respondent had heart disease; "No": respondent had no heart disease). Note that the classes are not balanced, so adjusting the class weights or undersampling the majority class is advisable for better results.
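As a rough illustration of the class-imbalance advice above, the sketch below encodes the "Yes"/"No" target as 1/0, computes class weights with the common `n_samples / (n_classes * class_count)` heuristic, and undersamples the majority class. The helper names and the toy labels are hypothetical and invented for this example; they are not part of the project, and other weighting schemes would be equally valid.

```python
import random
from collections import Counter


def encode_target(value):
    """Map the "Yes"/"No" labels of HeartDisease to 1/0."""
    return 1 if value == "Yes" else 0


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def undersample(rows, labels, seed=0):
    """Randomly keep as many rows per class as the smallest class has."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = min(len(group) for group in by_class.values())
    sampled = []
    for label, group in by_class.items():
        for row in rng.sample(group, target):
            sampled.append((row, label))
    rng.shuffle(sampled)
    return sampled


# Toy data, invented for illustration only (not taken from the dataset).
raw_labels = ["No"] * 9 + ["Yes"] * 1
labels = [encode_target(v) for v in raw_labels]
rows = list(range(len(labels)))  # stand-ins for feature rows

weights = balanced_class_weights(labels)
balanced = undersample(rows, labels)
```

With weights like these, the rare "Yes" class counts for more in the loss of a weighted classifier; undersampling instead discards majority-class rows, which is simpler but throws data away.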

Data Analysis:

The dataset can be used to investigate which variables are significantly associated with the likelihood of heart disease. Key indicators such as high blood pressure, high cholesterol, smoking, diabetic status, obesity, physical activity, and alcohol consumption can be analyzed to identify patterns and predictors of heart disease.
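One simple way to start this kind of analysis is to compare the share of respondents with heart disease across the levels of a single indicator. The sketch below does this with the standard library only. The column name `Smoking` and the inline sample rows are assumptions made for illustration; only the "HeartDisease" variable and its "Yes"/"No" values are described above.

```python
import csv
import io
from collections import defaultdict


def heart_disease_rate_by(rows, column):
    """Return the fraction of "Yes" HeartDisease answers per value of `column`."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for row in rows:
        group = row[column]
        totals[group] += 1
        if row["HeartDisease"] == "Yes":
            positives[group] += 1
    return {group: positives[group] / totals[group] for group in totals}


# Tiny invented sample; a real run would read the cleaned CSV file instead.
sample_csv = """HeartDisease,Smoking
Yes,Yes
No,Yes
No,No
No,No
Yes,No
No,No
"""

rows = list(csv.DictReader(io.StringIO(sample_csv)))
rates = heart_disease_rate_by(rows, "Smoking")
```

A gap between the group rates points to an association worth plotting or testing further; on its own it says nothing about cause.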

Disclaimer:

The data in this dataset is observational and should not be used to draw causal conclusions.

Contributors:

jasperzeroes
