Giter Site home page Giter Site logo

employee-attrition-analysis's Introduction

employee-attrition-analysis

Introduction

The purpose of this report is to evaluate the underlying relationships and patterns between and within each of the observed variables in the given dataset. Clustering is used to group the data to better understand the relationships and patterns across groups. Multiple predictive models are used to discover which group of employees is more likely to resign from their jobs, and what factors contribute to it. We choose accuracy and AUC as the evaluation metrics.

Dataset Description

Created by IBM data scientists, the dataset contains 1471 employees’ information including demographic, job-specific, and personal factors. It has 35 variables that include both numerical and categorical data.

The graph shows the importance of variables by random forest model. importance

If use k means to cluster employees to different groups, we choose the number of centers by the total within sum of squares plot, according to the graph, two groups is appropriate.

importance

  • group 1: employees with less work experience and income
  • group 2: employees with more work experience and higher income

Conclusion

For the dataset, logistic regression works best among models without clustering. It has the highest accuracy and AUC. However, we need to consider that the data is imbalanced, it is likely that there are no enough positive samples and models fail to learn from the training set.

Factors that decrease the likelihood of employees’ attrition are

• Environment Satisfaction

• Job Involvement

• Job Satisfaction

• Years In Current Role

• Relationship Satisfaction

• Training Times Last Year

Factors that increase the likelihood of employees’ attrition are

• Business Travel Frequently

• Work over time

• Distance From Home

• Marital Status Single

• Years Since Last Promotion

employee-attrition-analysis's People

Contributors

hz2657 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.