Skip to main content

Privacy-Preserving Machine Learning in Life Insurance Risk Prediction

  • Conference paper
  • First Online:
  • 573 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1753))

Abstract

The application of machine learning to insurance risk prediction requires learning from sensitive data. This raises multiple ethical and legal issues. One of the most relevant ones is privacy. However, privacy-preserving methods can potentially hinder the predictive potential of machine learning models. In this paper, we present preliminary experiments with life insurance data using two privacy-preserving techniques: discretization and encryption. Our objective with this work is to assess the impact of such privacy preservation techniques in the accuracy of ML models. We instantiate the problem in three general, but plausible Use Cases involving the prediction of insurance claims within a 1-year horizon. Our preliminary experiments suggest that discretization and encryption have negligible impact in the accuracy of ML models.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Narayanan, A., Shmatikov, V.: Robust de-anonymization of large sparse datasets. In: 2008 IEEE Symposium on Security and Privacy (SP 2008), pp. 111–125 (2008)

    Google Scholar 

  2. Dwork, C., Naor, M.: On the difficulties of disclosure prevention in statistical databases or the case for differential privacy. J. Priv. Confidentiality 2 (2010)

    Google Scholar 

  3. Boldyreva, A., Chenette, N., Lee, Y., O’Neill, A.: Order-preserving symmetric encryption. In: IACR Cryptol. ePrint Arch. (2012)

    Google Scholar 

  4. Boodhun, N., Jayabalan, M.: Risk prediction in life insurance industry using supervised learning algorithms. Complex Intell. Syst. 4(2), 145–154 (2018). https://doi.org/10.1007/s40747-018-0072-1

    Article  Google Scholar 

  5. Liu, Q., Li, P., Zhao, W., Cai, W., Yu, S., Leung, V.C.M.: A survey on security threats and defensive techniques of machine learning: a data driven view. IEEE Access 6, 12 103–12 117 (2018)

    Google Scholar 

  6. Papernot, N., Mcdaniel, P., Sinha, A., Wellman, M.P.: SOK: security and privacy in machine learning. In: 2018 IEEE European Symposium on Security and Privacy (EuroS &P), pp. 399–414 (2018)

    Google Scholar 

  7. Kenthapadi, K., Mironov, I., Thakurta, A.: Privacy-preserving data mining in industry. In: Companion Proceedings of the 2019 World Wide Web Conference (2019)

    Google Scholar 

  8. Maier, M.E., Carlotto, H., Sanchez, F., Balogun, S., Merritt, S.A.: Transforming underwriting in the life insurance industry. In: AAAI (2019)

    Google Scholar 

  9. Kaissis, G., Makowski, M.R., Rückert, D., Braren, R.F.: Secure, privacy-preserving and federated machine learning in medical imaging. Nat. Mach. Intell. 2, 305–311 (2020)

    Article  Google Scholar 

  10. Levantesi, S., Nigri, A., Piscopo, G.: Longevity risk management through machine learning: state of the art (2020)

    Google Scholar 

  11. Liu, B., Ding, M., Shaham, S., Rahayu, W., Farokhi, F., Lin, Z.: When machine learning meets privacy. ACM Comput. Surv. (CSUR) 54, 1–36 (2021)

    Google Scholar 

  12. Majeed, A., Lee, S.: Anonymization techniques for privacy preserving data publishing: a comprehensive survey. IEEE Access 9, 8512–8545 (2021)

    Article  Google Scholar 

  13. Xu, R., Baracaldo, N., Joshi, J.: Privacy-preserving machine learning: methods, challenges and directions, ArXiv, vol. abs/2108.04417 (2021)

    Google Scholar 

Download references

Acknowledgements

This work is financed by National Funds through the Portuguese funding agency, FCT - Fundação para a Ciência e a Tecnologia, within project LA/P/0063/2020, and by the ERDF - European Regional Development Fund through the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement within project SIS\(\hat{\,}\)1 (NORTE-01-0247-FEDER-45355).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Klismam Pereira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pereira, K., Vinagre, J., Alonso, A.N., Coelho, F., Carvalho, M. (2023). Privacy-Preserving Machine Learning in Life Insurance Risk Prediction. In: Koprinska, I., et al. Machine Learning and Principles and Practice of Knowledge Discovery in Databases. ECML PKDD 2022. Communications in Computer and Information Science, vol 1753. Springer, Cham. https://doi.org/10.1007/978-3-031-23633-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-23633-4_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-23632-7

  • Online ISBN: 978-3-031-23633-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics