Giter Site home page Giter Site logo

datasciencevishal / attrition_prediciton Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 22 KB

Employees Attrition prediction and identifying its main influencing features.

Jupyter Notebook 100.00%
data-preprocessing data-visualization eda feature-engineering gradient-boosting-classifier

attrition_prediciton's Introduction

Employee Attrition Prediction Project

Employee attrition, the phenomenon of employees leaving their jobs, is a significant challenge for organizations. This project utilizes a hypothetical dataset from IBM HR Analytics Employee Attrition & Performance to analyze factors influencing employee attrition and develop a predictive model.

Problem Statement

The objective of this project is to understand and determine how various factors relate to workforce attrition. By analyzing employee background and characteristics, we aim to predict whether an employee is likely to stay in the company or leave for another job.

Dataset Overview

  • Size: The dataset contains 1,470 data points (rows) and 35 features (columns).
  • Features: Each feature describes an employee's background and characteristics.
  • Label: The dataset is labeled (supervised learning) with whether the employee is still in the company or has left for another job.

Libraries Used

  • Pandas: For data manipulation and analysis.
  • NumPy: For numerical computations.
  • Seaborn and Matplotlib: For data visualization.
  • Plotly: For interactive visualizations.
  • Scikit-learn: For machine learning tasks such as model building, preprocessing, and evaluation.

Approach

  1. Data Preprocessing:
    • Cleaning the data, handling missing values, and encoding categorical variables.
  2. Exploratory Data Analysis (EDA):
    • Analyzing the relationships between various features and employee attrition.
  3. Feature Engineering:
    • Selecting relevant features and transforming data if necessary.
  4. Model Building:
    • Utilizing Gradient Boosting Classifier for predicting employee attrition.
  5. Model Evaluation:
    • Assessing model performance using metrics such as accuracy, log loss, and confusion matrix.

Findings

After analyzing the dataset and building the predictive model, it was observed that monthly income and daily rates were identified as the main influencing features causing employee attrition. The developed model achieved an accuracy score of 85%, indicating its effectiveness in predicting employee attrition.

Conclusion

Understanding the factors contributing to employee attrition is crucial for organizations to implement effective retention strategies. By leveraging machine learning techniques, such as the Gradient Boosting Classifier used in this project, organizations can proactively identify at-risk employees and take necessary actions to improve employee satisfaction and retention.

For detailed implementation and code, refer to the Jupyter notebook provided in this repository.

Feel free to explore further and contribute to the enhancement of the project!

attrition_prediciton's People

Contributors

datasciencevishal avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.