This project demonstrates how to perform clustering on two different datasets: "Iris" and "Netflix". KMeans algorithms are used to group the data into clusters and the quality of the clusters is evaluated using the silhouette coefficient.
dataset: https://www.kaggle.com/datasets/shivamb/netflix-shows
- Use Dockerfile
- Use virtual enviroments and apply requirements.txt
conda create -n my_env python=3.10.12
pip install -r requirements.txt