DOI: 10.1145/3639856.3639870
research-article

Leveraging Uncertainty for Credit Risk Estimation and Reliable Predictions in lending decision making

Published: 17 May 2024

ABSTRACT

As point-prediction models such as deep neural networks and gradient boosting methods are increasingly adopted in critical decision-making systems, ensuring the reliability of their predictions has become a major concern. These deterministic models are often overconfident, even on out-of-distribution data, which can lead to costly errors in applications such as lending decisions and credit risk estimation. Accurate uncertainty quantification is therefore essential for practical and reliable deployment.

In this research, we address this issue by employing predictive uncertainty. Through extensive experiments, we show that probabilistic methods yield better results and more reliable predictions on in-shifted or out-of-distribution data. We also introduce two methods that use gradient boosting to estimate model uncertainty. These methods offer an alternative perspective on uncertainty estimation and add to the growing body of research in this area.

To validate the proposed methods, we conduct experiments on the Lending Club loan dataset and show their potential to improve decision-making in critical scenarios. Our study underscores the importance of uncertainty estimation in credit risk assessment and the practical benefits of incorporating probabilistic methods into deep neural networks and gradient-boosting-based models.
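As a rough illustration of what "predictive uncertainty" can mean in this setting, the sketch below applies the standard entropy-based decomposition used with ensembles: total uncertainty is the entropy of the averaged prediction, data (aleatoric) uncertainty is the average entropy of the individual members, and model (epistemic) uncertainty is the difference. This is not the paper's method, which the abstract does not describe. The function names and the probability values are hypothetical, and the ensemble members could be, for example, gradient-boosting models trained with different seeds.

```python
import numpy as np


def binary_entropy(p, eps=1e-12):
    """Entropy (in nats) of a Bernoulli distribution with success probability p."""
    p = np.clip(p, eps, 1.0 - eps)
    return -(p * np.log(p) + (1.0 - p) * np.log(1.0 - p))


def decompose_uncertainty(member_probs):
    """Split predictive uncertainty for a binary classifier ensemble.

    member_probs: array of shape (n_members, n_samples) holding each
    member's predicted probability of default (hypothetical input).
    """
    member_probs = np.asarray(member_probs, dtype=float)
    mean_prob = member_probs.mean(axis=0)                   # ensemble prediction
    total = binary_entropy(mean_prob)                       # entropy of the mean
    aleatoric = binary_entropy(member_probs).mean(axis=0)   # mean of member entropies
    epistemic = total - aleatoric                           # disagreement between members
    return mean_prob, total, aleatoric, epistemic


# Hypothetical predictions from three ensemble members for three applicants.
probs = np.array([
    [0.10, 0.50, 0.05],
    [0.12, 0.50, 0.95],
    [0.09, 0.50, 0.50],
])
mean_prob, total, aleatoric, epistemic = decompose_uncertainty(probs)
```

In this toy input, the second applicant is uncertain because every member predicts 0.5 (data uncertainty only), while the third is uncertain because the members disagree (model uncertainty). A lender might route cases with high model uncertainty to manual review, although that policy is an assumption here and not a claim from the paper.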


  • Published in

    AIMLSystems '23: Proceedings of the Third International Conference on AI-ML Systems
    October 2023
    381 pages
    ISBN:9798400716492
    DOI:10.1145/3639856

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Qualifiers

    • research-article
    • Research
    • Refereed limited