skip to main content
10.1145/3229345.3229390acmotherconferencesArticle/Chapter ViewAbstractPublication PagessbsiConference Proceedingsconference-collections
research-article

A Comparative Analysis of Loan Requests Classification Algorithms in a Peer-to-Peer Lending Platform

Published:04 June 2018Publication History

ABSTRACT

The Peer-to-Peer (P2P) on-line lending is an emerging lending modality that brings creditors and borrowers closer while enabling a significant reduction of bureaucracy in the lending process. Despite its appealing, the increase in the demand for this lending modality depends on a rigorous settlement of the risk assignment to the potential borrowers. Considering this issue, this article discusses an experimental analysis of classification methods for P2P on-line lending default prediction. The performed experiments were based on the application of the implemented classification algorithms over the data mass formed by borrowers' profiles and loan history records of the P2P Lending Club platform. As the main contribution, the study revealed that it is possible to obtain satisfactory prediction results with a set of attributes smaller than those that were used in studies previously presented in the literature. In addition, it could be verified that, since the algorithms based on decision trees have proved highly effective, the use of these methods is a feasible approach to support the development of lending negotiation tools.

References

  1. {n. d.}. FICO® | FICO Decisions. http://www.fico.com/br/. (Accessed on 03/04/2018).Google ScholarGoogle Scholar
  2. {n. d.}. Peer to Peer Lending & Alternative Investing | Invest with LendingClub. https://www.lendingclub.com/. (Accessed on 03/03/2018).Google ScholarGoogle Scholar
  3. Naomi S Altman. 1992. An introduction to kernel and nearest-neighbor nonparametric regression. The American Statistician 46, 3 (1992), 175--185.Google ScholarGoogle ScholarCross RefCross Ref
  4. A Ashta and D Assadi. 2009. Do social cause and social technology meet? Impact of web 2.0 technologies on peer-to-peer lending transactions in general and microfinance in particular. In First European Research Conference on Microfinance, Brussels. 2--4.Google ScholarGoogle Scholar
  5. Sven C Berger and Fabian Gleisner. 2010. Emergence of financial intermediaries in electronic markets: The case of online P2P lending. BuR: Business Research 2, 1 (2010), 39--65.Google ScholarGoogle ScholarCross RefCross Ref
  6. Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Ajay Byanjankar, Markku Heikkilä, and Jozsef Mezei. 2015. Predicting credit risk in peer-to-peer lending: A neural network approach. In Computational Intelligence, 2015 IEEE Symposium Series on. IEEE, 719--725.Google ScholarGoogle ScholarCross RefCross Ref
  8. Nitesh V Chawla, Kevin W Bowyer, Lawrence O Hall, and W Philip Kegelmeyer. 2002. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research 16 (2002), 321--357. Google ScholarGoogle ScholarCross RefCross Ref
  9. Tianqi Chen and Carlos Guestrin. 2016. XGBoost: Reliable Large-scale Tree Boosting System. arXiv 2016; 1--6. arXiv preprint arXiv:1603.02754 (2016).Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Michael Conlin. 1999. Peer group micro-lending programs in Canada and the United States. Journal of Development Economics 60, 1 (oct 1999), 249--269.Google ScholarGoogle ScholarCross RefCross Ref
  11. Corinna Cortes and Vladimir Vapnik. 1995. Support vector machine. Machine learning 20, 3 (1995), 273--297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. David R Cox. 1958. The regression analysis of binary sequences. Journal of the Royal Statistical Society. Series B (Methodological) (1958), 215--242.Google ScholarGoogle Scholar
  13. Riza Emekter, Yanbin Tu, Benjamas Jirasakuldech, and Min Lu. 2015. Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending. Applied Economics 47, 1 (2015), 54--70.Google ScholarGoogle ScholarCross RefCross Ref
  14. Jerome H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics (2001), 1189--1232.Google ScholarGoogle Scholar
  15. David W Hosmer Jr, Stanley Lemeshow, and Rodney X Sturdivant. 2013. Applied logistic regression. Vol. 398. John Wiley & Sons.Google ScholarGoogle ScholarCross RefCross Ref
  16. Michael K Hulme and Collette Wright. 2006. Internet based social lending: Past, present and future. Social Futures Observatory 115 (2006).Google ScholarGoogle Scholar
  17. Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An introduction to statistical learning. Vol. 112. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. David D Lewis. 1998. Naive (Bayes) at forty: The independence assumption in information retrieval. In European conference on machine learning. Springer, 4--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Zhiyong Li, Xiao Yao, Qing Wen, and Wei Yang. 2016. Prepayment and Default of Consumer Loans in Online Lending. (2016).Google ScholarGoogle Scholar
  20. Milad Malekipirbazari and Vural Aksakalli. 2015. Risk assessment in social lending via random forests. Expert Systems with Applications 42, 10 (2015), 4621--4631. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Nilas Möllenkamp. 2017. Determinants of Loan Performance in P2P Lending. B.S. thesis. University of Twente.Google ScholarGoogle Scholar
  22. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12 (2011), 2825--2830. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Michel Polena and Regner Tobias. 2016. Determinants of borrowers' default in P2P lending under consideration of the loanrisk class. Jena Economic Research Paper (2016), 1--30.Google ScholarGoogle Scholar
  24. Carlos Serrano-Cinca, Begoña Gutiérrez-Nieto, and Luz López-Palacios. 2015. Determinants of default in P2P lending. PloS one 10, 10 (2015), e0139427.Google ScholarGoogle ScholarCross RefCross Ref
  25. Vladimir Vapnik. 2013. The nature of statistical learning theory. Springer science & business media.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Strother H Walker and David B Duncan. 1967. Estimation of the probability of an event as a function of several independent variables. Biometrika 54, 1--2 (1967), 167--179.Google ScholarGoogle ScholarCross RefCross Ref
  27. Min Xu, Pakorn Watanachaturaporn, Pramod K Varshney, and Manoj K Arora. 2005. Decision tree regression for soft classification of remote sensing data. Remote Sensing of Environment 97, 3 (2005), 322--336.Google ScholarGoogle ScholarCross RefCross Ref
  28. Yuejin Zhang, Haifeng Li, Mo Hai, Jiaxuan Li, and Aihua Li. 2017. Determinants of loan funded successful in online P2P Lending. Procedia Computer Science 122 (2017), 896--901.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Comparative Analysis of Loan Requests Classification Algorithms in a Peer-to-Peer Lending Platform

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          SBSI '18: Proceedings of the XIV Brazilian Symposium on Information Systems
          June 2018
          578 pages
          ISBN:9781450365598
          DOI:10.1145/3229345

          Copyright © 2018 ACM

          © 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 June 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate181of557submissions,32%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader