Abstract
In social lending, it is hard to know whether borrowers will repay well or not. Most researchers use supervised learning for default prediction, but labeling data by hand is time-consuming. Moreover, labeling results of semi-supervised learning methods are not the same each other. In this paper, we propose a fusion method of label propagation and transductive SVM based on Dempster-Shafer theory for precisely labeling unlabeled data to improve the performance. We remove few unlabeled data with lower reliabilities in labeling results and fusion of the two results based on Dempster-Shafer theory. We have conducted experiments with supervised learning method trained with labeled unlabeled data. As a result, the proposed method produced the best accuracies, 6.15% higher than the result trained with labeled data only, and 1.3% higher than the conventional methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Davis, K., Murphy, J.: Peer to Peer lending: structures, risks and regulation. Finsia J. Appl. Finan. 3, 37–44 (2016)
Lending Club. https://www.lendingclub.com/info/download-data.action. Accessed 29 June 2017
Zhu, X.: Semi-Supervised Learning. Encyclopedia of Machine Learning. Springer, Heidelberg (2010)
Triguero, I., García, S., Herrera, F.: Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study. Knowl. Inf. Syst. 42(2), 245–284 (2015)
Chapelle, O., Scholkopf, B., Zien, A.: Semi-Supervised Learning. The MIT Press, Cambridge (2006)
Malekipirbazari, M., Aksakalli, V.: Risk assessment in social lending via random forests. Expert Syst. Appl. 42, 4621–4631 (2015)
Bayanjankar, A., Heikkilä, M., Mezei, J.: Predicting credit risk in peer-to-peer lending: a neural network approach. In: 2015 IEEE Symposium series on Computational Intelligence, pp. 719–725. IEEE (2015)
Guo, Y., Zhou, W., Luo, C., Liu, C., Xiong, H.: Instance-based credit risk assessment for investment decisions in P2P lending. Eur. J. Oper. Res. 249(2), 417–426 (2016)
Serrano-Cinca, C., Gutiérrez-Nieto, B.: The use of profit scoring as an alternative to credit scoring systems in peer-to-peer (P2P) lending. Decis. Support Syst. 89, 113–122 (2016)
Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation (2002)
Joachims, T.: Transductive inference for text classification using support vector machines. In: International Conference on Machine Learning, pp. 200–209. NIPS (1998)
Shafer, G.: A Mathematical Theory of Evidence. Princeton University Press, Princeton (1976)
Acknowledgements
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2017-2015-0-00369) supervised by the IITP (Institute for Information & communications Technology Promotion).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Kim, A., Cho, SB. (2017). Dempster-Shafer Fusion of Semi-supervised Learning Methods for Predicting Defaults in Social Lending. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science(), vol 10635. Springer, Cham. https://doi.org/10.1007/978-3-319-70096-0_87
Download citation
DOI: https://doi.org/10.1007/978-3-319-70096-0_87
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70095-3
Online ISBN: 978-3-319-70096-0
eBook Packages: Computer ScienceComputer Science (R0)