Skip to main content

Machine Learning Model for Predicting Non-performing Agricultural Loans

  • Conference paper
  • First Online:
Book cover Proceedings of the International Conference on Artificial Intelligence and Computer Vision (AICV2020) (AICV 2020)

Abstract

Extending agricultural loans to individuals is essential to support the agriculture sector and markets. One of the most important risks affecting the banking sector is the concept of credit risk. Predicting the probability of non-performing loans for an individual is a vital and beneficial role for banks to decrease credit risk and make the right decisions. These decisions are based on credit study and in accordance with generally accepted standards, loan payment history, and demographic data of the clients. The subject paper here is proposing an ensemble-based model, to enhance classification accuracy. For the building model, the dataset was gathered from an agricultural bank in Egypt. Egyptian credit dataset involves 112907 instances and 17 features that are used in the current study. Variable selections were used to select important features for the classification. Cross-classification has also been used with ten subsets. Classification methods have been applied with Logistics Regression (LR), k-nearest neighbors (KNN), Support Vector Machine (SVM), Decision Tree (DT) and Meta-classifier methods for training and testing toward the dataset. The outcome of the specified experiments showed that the accuracy of the ensemble method is the highly recommended one for individuals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chen, W.H., Shih, J.Y.: A study of Taiwan’s issuer credit rating systems using support vector machines. Expert Syst. Appl. 30, 427–435 (2006)

    Article  Google Scholar 

  2. European Central Bank: What are non-performing loans? https://www.ecb.europa.eu/explainers/tell-me/html/npl.en.html. Accessed 11 Dec 2019

  3. Paireekreng, W., Choensawat, W.: An ensemble learning based model for real estate project classification. In: 6th International Conference on Applied Human Factors and Ergonomics and the Affiliated Conferences, vol. 3, pp. 3852–3859 (2015)

    Google Scholar 

  4. Zhang, Y., et al.: Predicting non-performing loan of business bank by multiple classifier fusion algorithms. J. Interdisc. Math. 19(4), 657–667 (2016)

    Article  Google Scholar 

  5. Goyal, A., Kaur, R.: Loan prediction using ensemble technique. Int. J. Adv. Res. Comput. Commun. Eng. 5(3), 523–526 (2016)

    Google Scholar 

  6. Okesola, O.J., et al.: An improved bank credit scoring model: a naïve Bayesian approach. In: 2017 International Conference on Computational Science and Computational Intelligence (CSCI), pp. 228–233 (2017)

    Google Scholar 

  7. Soni, P.M., Paul, V.: A novel optimized classifier for the loan repayment capability prediction system. In: 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), pp. 23–28 (2019)

    Google Scholar 

  8. Zhao, W.: Research on the deep learning of the small sample data based on transfer learning. In: AIP Conference Proceedings, vol. 1864, p. 020018 (2017)

    Google Scholar 

  9. Maheswari, J.P.: Breaking the curse of small datasets in Machine Learning: Part 1. https://towardsdatascience.com/breaking-the-curse-of-small-datasets-in-machine-learning-part-1-36f28b0c044d. Accessed 10 Dec 2019

  10. Hand, D.J., Vinciotti, V.: Choosing k for two-class nearest neighbour classifiers with unbalanced classes. Pattern Recogn. Lett. 24(9–10), 1555–1562 (2003)

    Article  MATH  Google Scholar 

  11. Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, pp. 41–46 (2001)

    Google Scholar 

  12. Patel, B., Rana, K.: A survey on decision tree algorithm for classification. Int. J. Eng. Dev. Res. (IJEDR) 2(1), 1–5 (2014)

    Google Scholar 

  13. Gestel, V., et al.: A support vector machine approach to credit scoring. Bank en Financiewezen 2, 73–82 (2003)

    Google Scholar 

  14. Wang, G., et al.: A comparative assessment of ensemble learning for credit scoring. Expert Syst. Appl. 38(1), 223–230 (2011)

    Article  Google Scholar 

  15. Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996). https://doi.org/10.1023/A:1018054314350

    Article  MATH  Google Scholar 

  16. Lin, W.-Z., et al.: iDNA-Prot: identification of DNA binding proteins using random forest with grey model. PLoS ONE 6, 9 (2011)

    Article  Google Scholar 

  17. Khalilia, M., et al.: Predicting disease risks from highly imbalanced data using random forest. BMC Med. Inf. Decis. Making 11(1), 51 (2011)

    Article  Google Scholar 

  18. Mohan, A., et al.: Automatic classification of protein structures using physicochemical parameters. Interdisc. Sci.: Comput. Life Sci. 6(3), 176–186 (2014)

    Article  Google Scholar 

  19. Seera, M., Lim, C.P.: A hybrid intelligent system for medical data classification. Expert Syst. Appl. 41(5), 2239–2249 (2014)

    Article  Google Scholar 

  20. Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324

    Article  MATH  Google Scholar 

  21. Zajkowski, A., et al.: Data Normalization. U.S. Patent US20030110250 (2003)

    Google Scholar 

  22. Zhou, Z.-H.: Ensemble Methods: Foundations and Algorithms. Chapman & Hall/CRC, Boca Raton (2012)

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamed Ahmed Elnaggar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Elnaggar, M.A., EL Azeem, M.A., Maghraby, F.A. (2020). Machine Learning Model for Predicting Non-performing Agricultural Loans. In: Hassanien, AE., Azar, A., Gaber, T., Oliva, D., Tolba, F. (eds) Proceedings of the International Conference on Artificial Intelligence and Computer Vision (AICV2020). AICV 2020. Advances in Intelligent Systems and Computing, vol 1153. Springer, Cham. https://doi.org/10.1007/978-3-030-44289-7_37

Download citation

Publish with us

Policies and ethics