Breast Cancer Prediction for Data Analysis Project
Subject: Data Analytics
Team Name: DataDemystifiers
Team Members:
Name | SRN |
---|---|
Vaibhav Gupta | PES2201800093 |
Srujan Vasudevrao Deshpande | PES2201800105 |
Aditya M Shetty | PES2201800169 |
Safa Hurayn | PES2201800392 |
Breast Cancer Wisconsin Dataset from Kaggle https://www.kaggle.com/uciml/breast-cancer-wisconsin-data
The Exploratory Data Analysis for this project was done using R. The results can be found in the file EDA.Rmd
.
The model used in the project was a Support Vector Machine. The prediction was done using Python and the code can be found in the file Predict.ipynb
.
C=1.0 break_ties=False cache_size=200 class_weight=None coef0=0.0 decision_function_shape='ovr' degree=3 gamma='scale' kernel='rbf' max_iter=-1 probability=False random_state=None shrinking=True tol=0.001 verbose=False
SVC() cv=5 verbose=1 scoring="accuracy" max_features=10 n_population=50 crossover_proba=0.5 mutation_proba=0.2 n_generations=40 crossover_independent_proba=0.5 mutation_independent_proba=0.05 tournament_size=3 n_gen_no_change=10 caching=True n_jobs=1
- Dataset Download https://www.kaggle.com/uciml/breast-cancer-wisconsin-data/download
- Prediction Notebook https://colab.research.google.com/drive/11EVPPXdvC3P0ojUXZjiriFeFG62W1QAV?usp=sharing
- Video
- Report