Analyze the data set and identify most relevant heart disease related risk factors as well as predict the overall risk.
- Perform exploratory analysis on the data and describe your understanding of the data.
- Perform data wrangling / pre-processing a. E.g., missing data, normalization, discretization, etc.
- Apply any two feature selection engineering techniques
- Compare the two selected feature engineering techniques.
- Plot top 5, 6, and 8 features.
- Provide a high-level description of Machine Learning models โ Logistic regression and Decision tree, ANN to predict.
- Compare the performance of the two classifiers โ Logistic regression and Decision tree to predict.
Inferences : KNN algorithms best suited for this problem statement
For reference : https://towardsdatascience.com/predicting-presence-of-heart-diseases-using-machine-learning-36f00f3edb2c