Abstract
The objective of this work is to predict the severity loss value of an insurance claim using machine learning regression techniques. The high dimensional data used for this research work is obtained from Allstate insurance company which consists of 116 categorical and 14 continuous predictor variables. We implemented Linear regression, Random forest regression (RFR), Support vector regression (SVR) and Feed forward neural network (FFNN) for this problem. The performance and accuracy of the models are compared using mean squared error (MSE) value and coefficient of determination (Rsquare) value. We predicted the claim severity loss value with a MSE value of 0.390 and a Rsquare value 0.562 using bagged RFR model. In addition where applicable, the final loss value was also predicted with an error of 0.440 using FFNN regression model. We also demonstrate the use of lasso regularization to avoid over-fitting for some of the regression models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rudra, R., Biswas, A., Dutta, P., Aarthi, G.: Applying regression models to calculate the Q factor of multiplexed video signal based on optisystem. In: 2015 SAI Intelligent Systems Conference (IntelliSys), pp. 201–209 (2015)
Cummins, J.D., Griepentrog, G.: Forecasting automobile insurance paid claims using econometric and ARIMA models. Res. Gate J. 1(3), 203–215 (1985)
Nissan, P., Emil, J., Liu, D.: Applied machine learning project 4 prediction of real estate property prices in Montreal. Accessed 6 Dec 2016
Andrea, Z., Katarina, G., Miriam, C., Luke, S.: Energy cost forecasting for event venues. In: EPEC 2015 (2015)
Glenn, M.: On predictive modeling for claim severity. Sci. Direct J
Vatsal, H.S.: Machine learning techniques for stock prediction (2007)
Philipe, H., Glenn, G.M.: The calculation of aggregate loss distributions from claim severity and claim count distributions (1993)
Smola, A.J., Scholkopf, B.: A tutorial on support vector regression. Statist. Comput. 14(3), 199–222 (2004)
Ryu, H., Son, K., Kim, J.-M.: Loss prediction model for building construction projects using insurance claim payout. J. Asian Archit. Build. Eng. 15(3), 441–446 (2016)
Pak, R.J.: Estimating loss severity distribution convolution approach. J. Math. Statist. 10(3), 247–254 (2014)
Xue, Q., Chen, C.-C., Chen, K.-C.: Damage and loss assessment for the basic earthquake insurance claim of residential RC buildings in Taiwan. J. Build. Appraisal 6, 213–226 (2011)
Frees, E.W., Shi, P., Valdez, E.A.: Actuarial applications of hierarchical insurance claims model. Astin Bull. 39, 165–197 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
A Appendix
A Appendix
The dataset and implementation codes used for this project can be found on the link provided https://github.com/ruthogunnaike/lossPredictor.
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Ogunnaike, R.M., Si, D. (2017). Prediction of Insurance Claim Severity Loss Using Regression Models. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2017. Lecture Notes in Computer Science(), vol 10358. Springer, Cham. https://doi.org/10.1007/978-3-319-62416-7_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-62416-7_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62415-0
Online ISBN: 978-3-319-62416-7
eBook Packages: Computer ScienceComputer Science (R0)