Absenteeism Prediction in Call Center Using Machine Learning Algorithms

de Oliveira, Evandro Lopes; Torres, José M.; Moreira, Rui S.; de Lima, Rafael Alexandre França

doi:10.1007/978-3-030-16181-1_90

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 930)

Included in the following conference series:

World Conference on Information Systems and Technologies

Abstract

Absenteeism is a major problem, particularly for companies with a large number of employees. Absenteeism prediction tools are therefore essential for companies that depend heavily on human resources. This paper focuses on using machine learning techniques to predict employee absences from work. More precisely, several prediction models were tuned and tested with 241 features extracted from a population of 13,805 employees. This target population was sampled from the help desk workforce of a major Brazilian phone company. The features were extracted from the profiles of the help desk agents and then filtered through correlation analysis and feature selection. The selected features were then used to compare the absenteeism predictions of different classification algorithms (Random Forest, Multilayer Perceptron, Support Vector Machine, Naive Bayes, XGBoost and Long Short-Term Memory). The parameterization of these ML models was also studied to find the classifier best suited to the prediction problem. These parameterizations were tuned using evolutionary algorithms, reaching considerable precision, the best being 72% (XGBoost) and 71% (Random Forest).
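The two processing steps named in the abstract, correlation-based feature filtering and evolutionary tuning of model parameters, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which correlation measure, threshold, evolutionary operators or search ranges were used. The Pearson threshold of 0.95, the parameter names n_estimators and max_depth with their ranges, and the toy_fitness function (a stand-in for a cross-validated precision score) are all assumptions made for illustration.

```python
import random
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return 0.0 if sx == 0 or sy == 0 else cov / (sx * sy)


def drop_correlated(columns, threshold=0.95):
    """Keep a feature only if it is not highly correlated with one already kept.

    `columns` maps feature name -> list of values; the threshold is an assumption.
    """
    kept = []
    for name in columns:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical integer search space for a tree-ensemble classifier.
SPACE = {"n_estimators": (50, 500), "max_depth": (2, 20)}


def random_individual(rng):
    """Draw one candidate parameter set uniformly from SPACE."""
    return {k: rng.randint(lo, hi) for k, (lo, hi) in SPACE.items()}


def crossover(a, b, rng):
    """Uniform crossover: each parameter comes from either parent."""
    return {k: (a[k] if rng.random() < 0.5 else b[k]) for k in SPACE}


def mutate(ind, rng, rate=0.3):
    """Resample each parameter with probability `rate`."""
    child = dict(ind)
    for k, (lo, hi) in SPACE.items():
        if rng.random() < rate:
            child[k] = rng.randint(lo, hi)
    return child


def evolve(fitness, generations=20, pop_size=12, elite=4, seed=0):
    """Simple elitist genetic algorithm; returns the fittest individual found."""
    rng = random.Random(seed)
    pop = [random_individual(rng) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[:elite]  # the best candidates survive unchanged
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - elite)
        ]
        pop = parents + children
    return max(pop, key=fitness)


def toy_fitness(ind):
    """Stand-in for a cross-validated precision score (assumption, not real data)."""
    return -abs(ind["n_estimators"] - 300) / 450 - abs(ind["max_depth"] - 8) / 18


if __name__ == "__main__":
    features = {"a": [1, 2, 3, 4], "b": [2, 4, 6, 8], "c": [4, 1, 3, 2]}
    print(drop_correlated(features))  # "b" duplicates "a" up to scale and is dropped
    print(evolve(toy_fitness))
```

In a real setting, toy_fitness would be replaced by a function that trains the chosen classifier with the candidate parameters and returns its validation precision; because the elite candidates are carried over unchanged, the best score in the population never decreases from one generation to the next.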

Acknowledgements

This work was partially funded by FCT (Fundação para a Ciência e Tecnologia) in the scope of the strategic project LIACC, Artificial Intelligence and Computer Science Laboratory (PEst-UID/CEC/00027/2013), and by Fundação Ensino e Cultura Fernando Pessoa.

Author information

Authors and Affiliations

ISUS Unit, University Fernando Pessoa, Porto, Portugal
Evandro Lopes de Oliveira, José M. Torres & Rui S. Moreira
LIACC, University of Porto, Porto, Portugal
José M. Torres & Rui S. Moreira
INESC-TEC, FEUP - University of Porto, Porto, Portugal
Rui S. Moreira
Federal University of Minas Gerais, Belo Horizonte, Brazil
Rafael Alexandre França de Lima

Corresponding author

Correspondence to Rui S. Moreira.

Editor information

Editors and Affiliations

Departamento de Engenharia Informática, Universidade de Coimbra, Coimbra, Portugal
Álvaro Rocha
The Ohio State University, Columbus, OH, USA
Hojjat Adeli
Faculdade de Engenharia/LIACC, Universidade do Porto, Porto, Portugal
Luís Paulo Reis
DIMES, Università della Calabria, Arcavacata di Rende, Italy
Sandra Costanzo

Cite this paper

de Oliveira, E.L., Torres, J.M., Moreira, R.S., de Lima, R.A.F. (2019). Absenteeism Prediction in Call Center Using Machine Learning Algorithms. In: Rocha, Á., Adeli, H., Reis, L., Costanzo, S. (eds) New Knowledge in Information Systems and Technologies. WorldCIST'19 2019. Advances in Intelligent Systems and Computing, vol 930. Springer, Cham. https://doi.org/10.1007/978-3-030-16181-1_90

DOI: https://doi.org/10.1007/978-3-030-16181-1_90
Published: 27 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16180-4
Online ISBN: 978-3-030-16181-1
eBook Packages: Intelligent Technologies and Robotics (R0)