Giter Site home page Giter Site logo

btk-datathon-2023's Introduction

BTK DATATHON 2023

  • This project focuses on prediction of customers segments according to given datasets. I made this project for DATATHON orginized by BTK Akademi.The project includes machine learning clustring,classification, exploratory data analysis (EDA), and data visualization techniques to gain insights into the dataset and understand its patterns. The project uses train.csv,test_x.csv datasets, which can be downloaded from Kaggle: https://www.kaggle.com/competitions/datathon2023/data

  • img

DATA CONTENT

  • train.csv - train set
  • test_x.csv - test set
  • sample_submission.csv - sample file upload format

Feature description

  • İndex
  • Cinsiyet
  • Yaş Grubu
  • Medeni Durum
  • Eğitim Düzeyi
  • İstihdam Durumu
  • Yıllık Ortalama Gelir
  • Yaşadığı Şehir
  • En Çok İlgilendiği Ürün Grubu
  • Yıllık Ortalama Satın Alım Miktarı
  • Yıllık Ortalama Sipariş Verilen Ürün Adedi
  • Eğitime Devam Etme Durumu
  • Yıllık Ortalama Sepete Atılan Ürün Adedi
  • Öbek İsmi - Tahmin edilecek olan etiket

Installation

The following tools were used for this analysis:

  • Python 3

  • Pandas

  • NumPy

  • Matplotlib

  • Seaborn

  • Plotly

  • Scipy

  • Sklearn

  • To run this project, you will need to have Python 3 installed on your machine. You can install the required libraries by running the following command:

  • pip install pandas matplotlib seaborn numpy plotly Sklearn

Usage

  • To run the analysis, simply execute the notebook. The script will generate several visualizations that help illustrate analysis of data.

Roadmap

  • IMPORTING LIBRARIES

  • LOADING DATASET

  • DATA DESCRIPTION

  • EXPLORATORY DATA ANALYSIS

  • MISSING VALUES

  • FEATURE ENGINEERING

  • DATA VISUALIZATION

  • OUTLIER DETECTION

  • DATA PREPROCESSING

  • CLUSTRING

  • PREPARING DATA FOR CLASSIFICATION MODEL

  • MODEL TRAINING AND EVALUATING

  • MODEL TUNING

  • SUBMISSION

  • CONCLUSION

  • The analysis includes visualizations using Matplotlib, Plotly and Seaborn.

Contributing

  • Contributions to this project are welcome. If you notice any errors or have ideas for additional analyses, please feel free to open an issue or submit a pull request.

Conclusion

  • After exploratory analysis and visualization of the data sets, I used clustering algorithms to set the target tags based on the data, then added the tags I found to the data and made predictions using the classification algorithms with the final data. As a result of these predictions, I chose the model that gave the best result and adjusted its hyperparameters. Finally, I made predictions on the test dataset and saved these predictions as a csv file. I entered the contest by uploading this file via kaggle. I got the best accuracy result with the logistic regression model. The result I obtained with the logistic regression model was 98.53%.

img

btk-datathon-2023's People

Contributors

sinanbertan avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.