A program that classifies whether a person is diabetic based on a set of medical measurements.
The program reads its training data from the file diabetes.csv in the data folder. diabetes.csv contains the training data for the k-nearest neighbors classifier. Its columns are Number of Pregnancies, Glucose value, Blood Pressure, Skin Thickness, Insulin value, BMI, Diabetes Pedigree Function, Age, and Outcome (the class variable). It is a labelled dataset: the last column (the output/class variable) is 1 for patients who have diabetes and 0 for those who do not. The program also reads input.in, which contains unlabelled data points.
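The classification step described above could look roughly like the sketch below. It is not the program's actual code: the function names `load_training` and `knn_predict`, the default `k=3`, the use of Euclidean distance, and the assumption that diabetes.csv starts with a header row are all illustrative choices.

```python
import csv
import math
from collections import Counter


def load_training(path):
    """Read a labelled CSV and return a list of (features, label) pairs."""
    samples = []
    with open(path, newline="") as f:
        reader = csv.reader(f)
        next(reader)  # assumption: the first row is a header
        for row in reader:
            if not row:
                continue  # skip blank lines
            *features, label = row  # last column is the Outcome label
            samples.append(([float(v) for v in features], int(label)))
    return samples


def knn_predict(training, point, k=3):
    """Return the majority label among the k training samples nearest to point."""
    # Euclidean distance is an assumption; the README does not name a metric.
    nearest = sorted(training, key=lambda s: math.dist(s[0], point))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

With this sketch, `knn_predict(load_training("data/diabetes.csv"), point)` would return 0 or 1 for a single list of eight feature values. The path shown is a guess at the folder layout. Because the columns have very different ranges, scaling the features before measuring distance may improve results.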
The dataset is taken from https://www.kaggle.com/mathchi/diabetes-data-set
The output of the program is a text file named output.txt, which contains the values from input.in together with their corresponding class labels.
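A minimal sketch of how such an output file might be written is shown below. The exact layout of output.txt is not specified here, so this assumes one comma-separated line per data point with the label appended as the last value. The helper names `format_row` and `write_output` are hypothetical.

```python
def format_row(features, label):
    """Join the feature values and the label into one comma-separated line."""
    return ",".join(str(v) for v in features) + "," + str(label)


def write_output(path, rows, labels):
    """Write each data point followed by its label, one per line."""
    with open(path, "w") as f:
        for features, label in zip(rows, labels):
            f.write(format_row(features, label) + "\n")
```

For example, `write_output("output.txt", rows, labels)` would produce one line per entry in `rows`, in the same order as the input.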