This is an attempt to classify whether patients who suffered a heart attack survive for at least one year afterward.
I used a decision tree and a random forest algorithm that I implemented myself.
The data cleaning and handling were inspired by: Beaulieu-Jones BK, Lavage DR, Snyder JW, Moore JH, Pendergrass SA, Bauer CR. Characterizing and Managing Missing Structured Data in Electronic Health Records: Data Analysis. JMIR Med Inform. 2018 Feb 23;6(1):e11. doi: 10.2196/medinform.8960. PMID: 29475824; PMCID: PMC5845101.
and
Stuart EA, Azur M, Frangakis C, Leaf P. Multiple imputation with large data sets: a case study of the Children's Mental Health Initiative. American Journal of Epidemiology. 2009 May;169(9):1133-1139. DOI: 10.1093/aje/kwp026. PMID: 19318618; PMCID: PMC2727238.
There will be another commit with a more detailed view of the problem.
This data set comes from the UC Irvine (UCI) Machine Learning Repository, Echocardiogram Data Set (https://archive.ics.uci.edu/ml/datasets/Echocardiogram). The UCI page credits Steven Salzberg with providing the data set.
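
For readers unfamiliar with the approach, here is a rough sketch of how a random forest can be built from simple decision trees: each tree is trained on a bootstrap sample and looks at a random subset of features at every split, and the forest predicts by majority vote. This is not the code in this repository. All function names (`gini`, `best_split`, `build_tree`, `build_forest`, `predict_forest`) are hypothetical, and the toy rows are synthetic values made up for illustration, not taken from the echocardiogram data.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels, feature_ids):
    """Return the (feature, threshold) that lowers weighted Gini the most, or None."""
    best, best_score = None, gini(labels)
    n = len(rows)
    for f in feature_ids:
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score:
                best, best_score = (f, t), score
    return best

def build_tree(rows, labels, rng, max_depth=3, n_features=None):
    """Grow a tree. Leaves are class labels (assumed non-tuple, e.g. 0/1);
    internal nodes are (feature, threshold, left, right) tuples."""
    majority = Counter(labels).most_common(1)[0][0]
    if max_depth == 0 or len(set(labels)) == 1:
        return majority
    all_features = list(range(len(rows[0])))
    k = n_features or len(all_features)
    # Random feature subset at each split: the "random" part of the forest.
    split = best_split(rows, labels, rng.sample(all_features, k))
    if split is None:
        return majority
    f, t = split
    li = [i for i, r in enumerate(rows) if r[f] <= t]
    ri = [i for i, r in enumerate(rows) if r[f] > t]
    left = build_tree([rows[i] for i in li], [labels[i] for i in li],
                      rng, max_depth - 1, n_features)
    right = build_tree([rows[i] for i in ri], [labels[i] for i in ri],
                       rng, max_depth - 1, n_features)
    return (f, t, left, right)

def predict_tree(node, row):
    """Walk from the root to a leaf and return the leaf's class label."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def build_forest(rows, labels, n_trees=25, max_depth=3, seed=0):
    """Train n_trees trees, each on a bootstrap sample of the rows."""
    rng = random.Random(seed)
    n = len(rows)
    k = max(1, int(len(rows[0]) ** 0.5))  # common heuristic: sqrt(#features)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        forest.append(build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx], rng, max_depth, k))
    return forest

def predict_forest(forest, row):
    """Majority vote over the predictions of all trees."""
    votes = Counter(predict_tree(tree, row) for tree in forest)
    return votes.most_common(1)[0][0]

# Synthetic toy data (NOT from the echocardiogram data set): two numeric
# features, class 0 has low values and class 1 has high values.
rows = [[1.0, 2.0], [2.0, 1.0], [1.5, 2.5], [2.5, 1.5],
        [8.0, 7.0], [9.0, 8.5], [8.5, 9.0], [7.5, 8.0]]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

forest = build_forest(rows, labels)
print(predict_forest(forest, [0.5, 0.5]), predict_forest(forest, [9.5, 9.5]))
```

A single decision tree corresponds to calling `build_tree` once on the full data with all features; the forest trades that single tree for many noisier ones whose votes are averaged.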