Here I try to determine which features affect students' subject-wise and overall performance. I look at the data distribution as a whole and also within clusters formed on the basis of scores. The features involved are:
- gender : sex of students
- race/ethnicity : ethnicity of students
- parental level of education : highest level of education completed by the parents
- lunch : type of lunch the student receives (standard or free/reduced)
- test preparation course : whether the student completed a course to prepare for the test
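
As a rough illustration of the kind of comparison described above, the sketch below computes the mean score per level of a feature and counts how many students of each level fall into a score group. Several things here are assumptions rather than part of the analysis itself: the score column names (`math score`, `reading score`, `writing score`), the helper names, the cutoff of 60, and the toy rows, which are made up for illustration only. The simple high/low banding is a stand-in for whatever clustering method is actually used.

```python
from collections import Counter, defaultdict
from statistics import mean

# Assumed score column names; adjust to match the actual data.
SCORE_COLS = ["math score", "reading score", "writing score"]

# Toy rows, made up for illustration only. They are NOT taken from the dataset.
ROWS = [
    {"lunch": "standard", "test preparation course": "completed",
     "math score": 70, "reading score": 80, "writing score": 78},
    {"lunch": "free/reduced", "test preparation course": "none",
     "math score": 50, "reading score": 55, "writing score": 51},
    {"lunch": "standard", "test preparation course": "none",
     "math score": 66, "reading score": 60, "writing score": 63},
    {"lunch": "free/reduced", "test preparation course": "completed",
     "math score": 58, "reading score": 65, "writing score": 66},
]


def mean_score_by(rows, feature, score_col):
    """Return the mean of score_col for each level of a categorical feature."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[feature]].append(row[score_col])
    return {level: mean(values) for level, values in groups.items()}


def score_band(row, cutoff=60):
    """Label a student 'high' or 'low' by the average of the three scores.

    This threshold banding is a hypothetical stand-in for real clustering.
    """
    overall = mean(row[col] for col in SCORE_COLS)
    return "high" if overall >= cutoff else "low"


def band_counts_by(rows, feature, cutoff=60):
    """Count how many students of each feature level fall into each band."""
    counts = defaultdict(Counter)
    for row in rows:
        counts[row[feature]][score_band(row, cutoff)] += 1
    return {level: dict(counter) for level, counter in counts.items()}


if __name__ == "__main__":
    print(mean_score_by(ROWS, "lunch", "math score"))
    print(band_counts_by(ROWS, "test preparation course"))
```

If the mean scores or the band counts differ noticeably between the levels of a feature, that feature is a candidate for influencing performance. With real data the same two summaries can be repeated for each of the five features and each subject.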