Skip to main content

Prediction of Insurance Claim Severity Loss Using Regression Models

  • Conference paper
  • First Online:
Machine Learning and Data Mining in Pattern Recognition (MLDM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10358))

Abstract

The objective of this work is to predict the severity loss value of an insurance claim using machine learning regression techniques. The high dimensional data used for this research work is obtained from Allstate insurance company which consists of 116 categorical and 14 continuous predictor variables. We implemented Linear regression, Random forest regression (RFR), Support vector regression (SVR) and Feed forward neural network (FFNN) for this problem. The performance and accuracy of the models are compared using mean squared error (MSE) value and coefficient of determination (Rsquare) value. We predicted the claim severity loss value with a MSE value of 0.390 and a Rsquare value 0.562 using bagged RFR model. In addition where applicable, the final loss value was also predicted with an error of 0.440 using FFNN regression model. We also demonstrate the use of lasso regularization to avoid over-fitting for some of the regression models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Rudra, R., Biswas, A., Dutta, P., Aarthi, G.: Applying regression models to calculate the Q factor of multiplexed video signal based on optisystem. In: 2015 SAI Intelligent Systems Conference (IntelliSys), pp. 201–209 (2015)

    Google Scholar 

  2. Cummins, J.D., Griepentrog, G.: Forecasting automobile insurance paid claims using econometric and ARIMA models. Res. Gate J. 1(3), 203–215 (1985)

    Google Scholar 

  3. Nissan, P., Emil, J., Liu, D.: Applied machine learning project 4 prediction of real estate property prices in Montreal. Accessed 6 Dec 2016

    Google Scholar 

  4. Andrea, Z., Katarina, G., Miriam, C., Luke, S.: Energy cost forecasting for event venues. In: EPEC 2015 (2015)

    Google Scholar 

  5. Glenn, M.: On predictive modeling for claim severity. Sci. Direct J

    Google Scholar 

  6. Vatsal, H.S.: Machine learning techniques for stock prediction (2007)

    Google Scholar 

  7. Philipe, H., Glenn, G.M.: The calculation of aggregate loss distributions from claim severity and claim count distributions (1993)

    Google Scholar 

  8. Smola, A.J., Scholkopf, B.: A tutorial on support vector regression. Statist. Comput. 14(3), 199–222 (2004)

    Article  MathSciNet  Google Scholar 

  9. Ryu, H., Son, K., Kim, J.-M.: Loss prediction model for building construction projects using insurance claim payout. J. Asian Archit. Build. Eng. 15(3), 441–446 (2016)

    Article  Google Scholar 

  10. Pak, R.J.: Estimating loss severity distribution convolution approach. J. Math. Statist. 10(3), 247–254 (2014)

    Article  Google Scholar 

  11. Xue, Q., Chen, C.-C., Chen, K.-C.: Damage and loss assessment for the basic earthquake insurance claim of residential RC buildings in Taiwan. J. Build. Appraisal 6, 213–226 (2011)

    Article  Google Scholar 

  12. Frees, E.W., Shi, P., Valdez, E.A.: Actuarial applications of hierarchical insurance claims model. Astin Bull. 39, 165–197 (2009)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ruth M. Ogunnaike .

Editor information

Editors and Affiliations

A Appendix

A Appendix

The dataset and implementation codes used for this project can be found on the link provided https://github.com/ruthogunnaike/lossPredictor.

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Ogunnaike, R.M., Si, D. (2017). Prediction of Insurance Claim Severity Loss Using Regression Models. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2017. Lecture Notes in Computer Science(), vol 10358. Springer, Cham. https://doi.org/10.1007/978-3-319-62416-7_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62416-7_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62415-0

  • Online ISBN: 978-3-319-62416-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics