Abstract
The key success of any organization lies on its employees and thus ability to monitor employee attrition efficiently becomes important. Our work aims to use machine learning models to accurately predict on whether an employee would decide to leave a company or not. Main algorithms used for this purpose include Logistic Regression, KNN and Weighted Decision tree. Another aspect that needs to be looked into is the amount of classification imbalance present in this problem statement. Our work has applied multiple data based and algorithmic approaches to resolve this imbalance. The use of SMOTE, Near-Miss, ADASYNC and other techniques provide an efficient and balanced prediction of employee attrition. From all the experiments we have done, we arrive at a proposed hybrid model that combines data based and algorithmic method to resolve imbalance that accurately predicts attrition of up to 98.4%, proves recall rate of 98.43%, shows precision of 97.69% and an excellent F1 score of 99.17%. Our analysis would help companies understand the important areas that needs to be focused on to avoid attrition and hence lead to growth and higher achievements.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alduayj, S.S., Rajpoot, K.: Predicting employee attrition using machine learning, pp. 93–98 (2018)
Ali, A., Shamsuddin, S.M.H., Ralescu, A.L.: Classification with class imbalance problem: a review (2015)
Dutta, S., Bandyopadhyay, S.: Employee attrition prediction using neural network cross validation method. Int. J. Comm. Manag. 6, 80–85 (2020)
Fallucchi, F., Coladangelo, M., Giuliano, R., William DeLuca, E.: Predicting employee attrition using machine learning techniques. Computers 9(4) (2020)
Fernandez, A., del Río, S., Chawla, N., Herrera, F.: An insight into imbalanced big data classification: outcomes and challenges”. Complex Intell. Syst. 3, 105–120 (2017)
Jain, P.K., Jain, M., Pamula, R.: “Explaining and predicting employees’ attrition a machine learning approach. SN Appl. Sci. 2, 757 (2020)
Jain, R., Nayyar, A.: Predicting employee attrition using XGBoost machine learning approach, pp. 113–120 (2018)
Karande, S., Shyamala, L.: Prediction of employee turnover using ensemble learning, pp. 319–327 (2019)
Mhatre, A., Mahalingam, A., Narayanan, M., Nair, A., Jaju, S.: Predicting employee attrition along with identifying high risk employees using big data and machine learning, pp. 269–276 (2020)
Patel, A., Pardeshi, N., Patil, S., Sutar, S., Sadafule, R., Bhat, S.: Employee attrition predictive model using machine learning 07(05) (2020)
Shankar, R.S., Rajanikanth, J., Sivaramaraju, V., Murthy, K.: Prediction of employee attrition using datamining, pp. 1–8 (2018)
Yahia, N.B., Hlel, J., Colomo, R.: From big data to deep data to support people analytics for employee attrition prediction. IEEE Access 9, 60447–60458 (2021)
Mahadevan, A., Arock, M.: A class imbalance-aware review rating prediction using hybrid sampling and ensemble learning. Multimed. Tools Appl. 80, 6911–6938 (2021)
Prathilothamai, M., Indra Kumar, V., Anjali Ragupathi, Aradhana, J.: Analysis of techniques to handle class imbalance in road traffic prediction. Int. J. Adv. Sci. Technol. 29(05), 7549–7567 (2020)
Garlapati, A., Krishna, D.R., Garlapati, K., Narayanan, G.: Predicting employees under stress for pre-emptive remediation using machine learning algorithm, pp. 315–319 (2020). https://doi.org/10.1109/RTEICT49044.2020.9315726
Jose, C., Gopakumar, G.: An improved random forest algorithm for classification in an imbalanced dataset. In: 2019 URSI Asia-Pacific Radio Science Conference (AP-RASC), pp. 1–4 (2019). https://doi.org/10.23919/URSIAPRASC.2019.8738232.
Ketha, S., Balakrishna, P., Ravi, V., Dr. Soman K.P.: Deep learning based frameworks for handling imbalance in DGA, Email, and URL data analysis. In: Communications in Computer and Information Science (2020)
Vinayakumar, R., Soman, K.P., Poornachandran, P.: DeepDGAMINet: cost-sensitive deep learning based framework for handling multiclass imbalanced DGA detection. In: Gupta, B., Perez, G., Agrawal, D., Gupta, D. (eds.) Handbook of Computer Networks and Cyber Security. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-22277-2-37
Mohammed Harun Babu, R., Vinayakumar, R., Soman, K.P.: CostSensitive long short-term memory for imbalanced DGA family categorization. In: Sengodan, T., Murugappan, M., Misra, S. (eds.) Advances in Electrical and Computer Technologies. Lecture Notes in Electrical Engineering, vol. 672. Springer, Singapore (2020). https://doi.org/10.1007/978- 981–15–5558–9–49.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Prathilothamai, M., Sudarshana, Sri Sakthi Maheswari, A., Chandravadhana, A., Goutham, R. (2022). Efficient Approach to Employee Attrition Prediction by Handling Class Imbalance. In: Singh, M., Tyagi, V., Gupta, P.K., Flusser, J., Ören, T. (eds) Advances in Computing and Data Sciences. ICACDS 2022. Communications in Computer and Information Science, vol 1614. Springer, Cham. https://doi.org/10.1007/978-3-031-12641-3_22
Download citation
DOI: https://doi.org/10.1007/978-3-031-12641-3_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-12640-6
Online ISBN: 978-3-031-12641-3
eBook Packages: Computer ScienceComputer Science (R0)