Abstract
Extending agricultural loans to individuals is essential to support the agriculture sector and markets. One of the most important risks affecting the banking sector is the concept of credit risk. Predicting the probability of non-performing loans for an individual is a vital and beneficial role for banks to decrease credit risk and make the right decisions. These decisions are based on credit study and in accordance with generally accepted standards, loan payment history, and demographic data of the clients. The subject paper here is proposing an ensemble-based model, to enhance classification accuracy. For the building model, the dataset was gathered from an agricultural bank in Egypt. Egyptian credit dataset involves 112907 instances and 17 features that are used in the current study. Variable selections were used to select important features for the classification. Cross-classification has also been used with ten subsets. Classification methods have been applied with Logistics Regression (LR), k-nearest neighbors (KNN), Support Vector Machine (SVM), Decision Tree (DT) and Meta-classifier methods for training and testing toward the dataset. The outcome of the specified experiments showed that the accuracy of the ensemble method is the highly recommended one for individuals.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen, W.H., Shih, J.Y.: A study of Taiwan’s issuer credit rating systems using support vector machines. Expert Syst. Appl. 30, 427–435 (2006)
European Central Bank: What are non-performing loans? https://www.ecb.europa.eu/explainers/tell-me/html/npl.en.html. Accessed 11 Dec 2019
Paireekreng, W., Choensawat, W.: An ensemble learning based model for real estate project classification. In: 6th International Conference on Applied Human Factors and Ergonomics and the Affiliated Conferences, vol. 3, pp. 3852–3859 (2015)
Zhang, Y., et al.: Predicting non-performing loan of business bank by multiple classifier fusion algorithms. J. Interdisc. Math. 19(4), 657–667 (2016)
Goyal, A., Kaur, R.: Loan prediction using ensemble technique. Int. J. Adv. Res. Comput. Commun. Eng. 5(3), 523–526 (2016)
Okesola, O.J., et al.: An improved bank credit scoring model: a naïve Bayesian approach. In: 2017 International Conference on Computational Science and Computational Intelligence (CSCI), pp. 228–233 (2017)
Soni, P.M., Paul, V.: A novel optimized classifier for the loan repayment capability prediction system. In: 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), pp. 23–28 (2019)
Zhao, W.: Research on the deep learning of the small sample data based on transfer learning. In: AIP Conference Proceedings, vol. 1864, p. 020018 (2017)
Maheswari, J.P.: Breaking the curse of small datasets in Machine Learning: Part 1. https://towardsdatascience.com/breaking-the-curse-of-small-datasets-in-machine-learning-part-1-36f28b0c044d. Accessed 10 Dec 2019
Hand, D.J., Vinciotti, V.: Choosing k for two-class nearest neighbour classifiers with unbalanced classes. Pattern Recogn. Lett. 24(9–10), 1555–1562 (2003)
Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, pp. 41–46 (2001)
Patel, B., Rana, K.: A survey on decision tree algorithm for classification. Int. J. Eng. Dev. Res. (IJEDR) 2(1), 1–5 (2014)
Gestel, V., et al.: A support vector machine approach to credit scoring. Bank en Financiewezen 2, 73–82 (2003)
Wang, G., et al.: A comparative assessment of ensemble learning for credit scoring. Expert Syst. Appl. 38(1), 223–230 (2011)
Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996). https://doi.org/10.1023/A:1018054314350
Lin, W.-Z., et al.: iDNA-Prot: identification of DNA binding proteins using random forest with grey model. PLoS ONE 6, 9 (2011)
Khalilia, M., et al.: Predicting disease risks from highly imbalanced data using random forest. BMC Med. Inf. Decis. Making 11(1), 51 (2011)
Mohan, A., et al.: Automatic classification of protein structures using physicochemical parameters. Interdisc. Sci.: Comput. Life Sci. 6(3), 176–186 (2014)
Seera, M., Lim, C.P.: A hybrid intelligent system for medical data classification. Expert Syst. Appl. 41(5), 2239–2249 (2014)
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Zajkowski, A., et al.: Data Normalization. U.S. Patent US20030110250 (2003)
Zhou, Z.-H.: Ensemble Methods: Foundations and Algorithms. Chapman & Hall/CRC, Boca Raton (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Elnaggar, M.A., EL Azeem, M.A., Maghraby, F.A. (2020). Machine Learning Model for Predicting Non-performing Agricultural Loans. In: Hassanien, AE., Azar, A., Gaber, T., Oliva, D., Tolba, F. (eds) Proceedings of the International Conference on Artificial Intelligence and Computer Vision (AICV2020). AICV 2020. Advances in Intelligent Systems and Computing, vol 1153. Springer, Cham. https://doi.org/10.1007/978-3-030-44289-7_37
Download citation
DOI: https://doi.org/10.1007/978-3-030-44289-7_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-44288-0
Online ISBN: 978-3-030-44289-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)