Credit Rating Analysis by the Decision-Tree Support Vector Machine with Ensemble Strategies

Pai, Ping-Feng; Tan, Yi-Shien; Hsu, Ming-Fu

doi:10.1007/s40815-015-0063-y

Credit Rating Analysis by the Decision-Tree Support Vector Machine with Ensemble Strategies

Published: 18 July 2015

Volume 17, pages 521–530, (2015)
Cite this article

International Journal of Fuzzy Systems Aims and scope Submit manuscript

Ping-Feng Pai¹,
Yi-Shien Tan¹ &
Ming-Fu Hsu²

510 Accesses
13 Citations
Explore all metrics

Abstract

The recent financial tsunami and subprime crisis both triggered an excessive global financial decline. Since then, the ability to ensure the creditworthiness of firms, the development of a suitable credit rating mechanism and appropriate rating policies have become critical issues for most financial institutions. Credit rating is a multi-classification task. The decision-tree support vector machine (DTSVM) has been one of the most powerful models in dealing with multi-classification problems. However, the determination of important data features significantly affects the computation time and classification accuracy of DTSVM models. In addition, the inability to provide rules for decision-makers limits the practical applications of DTSVM models. This study proposes an M-DTSVM-RST model which integrates the unique strength of multiple feature selection strategies in feature determination, DTSVM in multi-classification, and rough set theory (RST) in rule generation. The advantages of the designed M-DTSVM-RST model include the ensemble learning ability in selecting essential attributes, the capability in yielding rules for decision-makers in multi-classification cases by the DTSVM technique with RST. Experimental results show that the M-DTSVM-RST model is a promising alternative in analyzing the credit rating problem.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Credit Scoring Models Using Ensemble Learning and Classification Approaches: A Comprehensive Survey

Article 01 October 2021

The Most Effective Strategy for Incorporating Feature Selection into Credit Risk Assessment

Article 17 December 2022

Rough set-based feature selection for credit risk prediction using weight-adjusted boosting ensemble method

Article 30 July 2019

References

Abad, P., Robles, M.D.: Credit rating agencies and idiosyncratic risk: is there a linkage? Evidence from the Spanish market. Int. Rev. Econ. Finance 33, 152–171 (2014)
Article Google Scholar
Barakat, N., Diederich, J.: Eclectic rule-extraction from support vector machines. Int. J. Comput. Intell. 2, 59–62 (2005)
Google Scholar
Bose, I.: Deciding the financial health of dot-coms using rough sets. Inf. Manag. 43, 835–846 (2006)
Article Google Scholar
Bottou, l., Cortes, C., Denker, J., Drucker, H., Guyon, I., Jackel, L., LeCun, Y., Muller, U., Sackinger, E., Simard, P., Vapnik, V.: Comparison of classifier methods: a case study in hand writing digit recognition. In: Proceedings of the 12th International Conference on Pattern Recognition, Jerusalem, pp. 77–82 (1994)
Capotorti, A., Barbanera, E.: Credit scoring analysis using a fuzzy probabilistic rough set model. Comput. Stat. Data Anal. 56, 981–994 (2012)
Article MATH MathSciNet Google Scholar
Chang, F., Guo, C.Y., Lin, X.R., Lu, C.J.: Tree decomposition for large-scale SVM Problems. J. Mach. Learn. Res. 11, 2935–2972 (2010)
MATH MathSciNet Google Scholar
Chang, F., Liu, C.C.: Solving large-scale multi-label SVM problems with a tree decomposition approach. In: The 3rd Asian Conference on Machine Learning, Taoyuan, pp. 1–16 (2011)
Chaudhuri, A., De, K.: Fuzzy support vector machine for bankruptcy prediction. Appl. Soft Comput. 11, 2472–2486 (2011)
Article Google Scholar
Chen, Y.S., Cheng, C.H.: Hybrid models based on rough set classifiers for setting credit rating decision rules in the global banking industry. Knowl.-Based Syst. 39, 224–239 (2013)
Article Google Scholar
Chen, C.C., Li, S.T.: Credit rating with a monotonicity-constrained support vector machine model. Expert Syst. Appl. 41, 7235–7247 (2014)
Article Google Scholar
Cortes, C., Vapnik, V.: Support-vector network. Mach. Learn. 20, 273–297 (1995)
MATH Google Scholar
Deng, J.D., Simmermacher, C., Cranefield, S.: “A study on feature analysis for musical instrument classification”, IEEE Transactions on Systems, Man, and Cybernetics—Part B. Cybernetics 38, 429–438 (2008)
Google Scholar
Fung, G., Sandilya, S., Rao, R.: Rule extraction from linear support vector machines. In: Proceedings of the Eleventh SIGKDD International Conference on Knowledge Discovery and Data Mining, Illinois, pp. 32–40 (2005)
Friedman, J.: Another approach to polychotomous classification. Technical Report, Stanford University (1996)
Hajek, P., Michalak, K.: Feature selection in corporate credit rating prediction. Knowl.-Based Syst. 51, 72–84 (2013)
Article Google Scholar
Hanilçi, C., Ertaş, F.: Investigation of the effect of data duration and speaker gender on text-independent speaker recognition. Comput. Electr. Eng. 39, 441–452 (2013)
Article Google Scholar
Hou, S., Ramani, K.: Classifier combination for sketch-based 3D part retrieval. Comput. Graph. 31, 598–609 (2007)
Article Google Scholar
Hsu, M.F., Pai, P.F.: Incorporating support vector machines with multiple criteria decision making for financial crisis analysis. Qual. Quant. 47, 3481–3492 (2013)
Article Google Scholar
Huang, C.C., Tseng, T.L., Fan, Y.N., Hsu, C.H.: Alternative rule induction methods based on incremental object using rough set theory. Appl. Soft Comput. 13, 372–389 (2013)
Article Google Scholar
Huang, Z., Chen, H., Hsu, C.J., Chen, W.H., Wu, S.: Credit rating analysis with support vector machines and neural networks: a market comparative. Decis. Support Syst. 37, 543–558 (2004)
Article Google Scholar
Hwang, R.C., Chung, H., Chu, C.K.: Predicting issuer credit ratings using a semiparametric method. J. Empir. Finance 17, 120–137 (2010)
Article Google Scholar
Jia, J., Yang, N., Zhang, C., Yue, A., Yang, J., Zhu, D.: Object-oriented feature selection of high spatial resolution images using an improved Relief algorithm. Math. Comput. Model. 58, 619–626 (2013)
Article Google Scholar
Joachims, T.: Making large-scale support vector machine learning practical. In: Schlkopf, B., Burges, C.J., Smola, A.J. (eds.) Advances in Kernel Methods—Support Vector Learning, pp. 169–184. MIT Press, Cambridge (1999)
Google Scholar
Kabir, MdM, Islam, MdM, Murase, K.: A new wrapper feature selection approach using neural network. Neurocomputing 73, 3273–3283 (2010)
Article Google Scholar
Klautau, A., Jevtic, N., Orlitsky, A.: On nearest-neighbor error-correcting output codes with application to all-pairs multiclass support vector machines. J. Mach. Learn. Res. 4, 1–15 (2003)
MathSciNet Google Scholar
Kim, H.S., Sohn, S.Y.: Support vector machines for default prediction of SMEs based on technology. Eur. J. Oper. Res. 201, 838–846 (2010)
Article MATH Google Scholar
Kim, K., Ahn, H.: A corporate credit rating model using multi-class support vector machines with an ordinal pairwise partitioning approach. Comput. Oper. Res. 39, 1800–1811 (2012)
Article MATH MathSciNet Google Scholar
Kreßel, U.: Pairwise classification and support vector machines. In: Schölkopf, B., Burges, C.I., Smola, A.J. (eds.) Advances in Kernel Methods—Support Vector Learning, pp. 255–268. MIT Press, Cambridge (1999)
Google Scholar
Kullback, S.: An application of information theory to multivariate analysis. Ann. Math. Stat. 23, 88–102 (1952)
Article MATH MathSciNet Google Scholar
Kumar, M.A., Gopal, M.: A comparison study on multiple binary-class SVM methods for unilabel text categorization. Pattern Recogn. Lett. 31, 1437–1444 (2010)
Article Google Scholar
Liou, J.H., Tzeng, G.H.: A Dominance-based Rough Set Approach to customer behavior in the airline market. Inf. Sci. 180, 2230–2238 (2010)
Article Google Scholar
Li, F., Yin, Y., Lu, L.: (ϑ, T)-fuzzy rough approximation operators and the TL-fuzzy rough ideals on a ring. Inf. Sci. 177, 4711–4726 (2007)
Article MATH MathSciNet Google Scholar
Lin, K.P., Pai, P.F., Lu, Y.M., Chang, P.T.: Revenue forecasting using a least-squares support vector regression model in a fuzzy environment. Inf. Sci. 220, 196–209 (2013)
Article Google Scholar
Li, C.F., Yin, J.Y.: Variational Bayesian independent component analysis-support vector machine for remote sensing classification. Comput. Electr. Eng. 39, 717–726 (2013)
Article Google Scholar
Martens, D., Baesens, B., Gestel, T.V.: De-compositional rule extraction from support vector machines by active learning. IEEE Trans. Knowl. Data Eng. 21, 178–191 (2009)
Article Google Scholar
Nguyen, H.S., Nguyen, S.H.: Analysis of Stulong data by rough set exploration system (RSES). In: The 14th European Conference on Machine Learning (ECML) and the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases, Cavtat, pp. 71–82 (2003)
Öğüt, H., Doğanay, M.M., Ceylan, N.B., Aktaş, R.: Prediction of bank financial strength ratings: the case of Turkey. Econ. Model. 29, 632–640 (2012)
Article Google Scholar
Pawlak, Z.: Rough sets. Int. J. Inf. Comput. Sci. 11, 341–356 (1982)
Article MATH MathSciNet Google Scholar
Paleologo, G., Elisseeff, A., Antonini, G.: Subagging for credit scoring models. Eur. J. Oper. Res. 201, 490–499 (2010)
Article Google Scholar
Peng, Y., Wang, G., Kou, G., Shi, Y.: An empirical study of classification algorithm evaluation for financial risk prediction. Appl. Soft Comput. 11, 2906–2915 (2011)
Article Google Scholar
Platt, J.C., Cristianini, N., Shawe-Taylor, J.: Large margin DAGs for multiclass classification. In: Jordan, M., Keams, M., SoJla, S. (eds.) Advances in Neural Information Processing Systems, vol. 12, pp. 547–553. MIT Press, Cambridge (2000)
Google Scholar
Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Schlkopf, B., Burges, C.J., Smola, A.J. (eds.) Advances in Kernel Methods—Support Vector Learning, pp. 169–184. MIT Press, Cambridge (1998)
Google Scholar
Pudil, P., Novovicova, J., Kittler, J.: Floating search methods in feature selection. Pattern Recogn. Lett. 15, 1119–1125 (1994)
Article Google Scholar
Qian, H., Mao, Y., Xiang, W., Wang, Z.: Recognition of human activities using SVM multi-class classifier. Pattern Recognit. Lett. 31, 100–111 (2010)
Article Google Scholar
Ren, J.: ANN vs. SVM: which one performs better in classification of MCCs in mammogram imaging. Knowl.-Based Syst. 26, 144–153 (2012)
Article Google Scholar
Scherbart, A., Nattkemper, T.W.: Looking inside self-organizing map ensembles with resampling and negative correlation learning. Neural. Netw. 24, 130–141 (2011)
Article MATH Google Scholar
Sun, J., Li, H.: Financial distress prediction using support vector machines: ensemble vs. individual. Appl. Soft Comput. 12, 2254–2265 (2012)
Article Google Scholar
Smyth, P., Goodman, R.M.: Rule induction using information theory. In: Piatetsky-Shapiro, G., Frawley, W. (eds.) Knowledge Discovery in Databases, pp. 169–184. MIT Press, Cambridge (1991)
Google Scholar
Tsai, C.F., Hsiao, Y.C.: Combining multiple feature selection methods for stock prediction: union, intersection, and multi-intersection approaches. Decis. Support Syst. 50, 258–269 (2010)
Article Google Scholar
Uğuz, H.: A hybrid system based on information gain and principal component analysis for the classification of transcranial Doppler signals. Comput. Methods Programs Biomed. 107, 598–609 (2012)
Article Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer Verlag, New York (1995)
Book MATH Google Scholar
Vieira, S.M., Sousa, J.M.C., Kaymak, U.: Fuzzy criteria for feature selection. Fuzzy Sets Syst. 189, 1–18 (2012)
Article MathSciNet Google Scholar
Wang, G., Ma, J.: A hybrid ensemble approach for enterprise credit risk assessment based on Support Vector Machine. Expert Syst. Appl. 39, 5325–5331 (2012)
Article Google Scholar
Wang, G., Hao, J., Ma, J., Jiang, H.: A comparative assessment of ensemble learning for credit scoring. Expert Syst. Appl. 38, 223–230 (2011)
Article Google Scholar
Whitney, A.: A direct method of nonparametric measurement selection. IEEE Trans. Comput. 20, 1100–1103 (1971)
Article MATH MathSciNet Google Scholar
Witlox, F., Tindemans, H.: The application of rough sets analysis in activity-based modelling, opportunities and constraints. Expert Syst. Appl. 27, 585–592 (2004)
Article Google Scholar
Wang, A., Yuan, W., Liu, J., Yu, Z., Li, H.: A novel pattern recognition algorithm: combining ART network with SVM to reconstruct a multi-class classifier. Comput. Math Appl. 57, 1908–1914 (2009)
Article MATH Google Scholar
Xia, H.: Can investor-paid credit rating agencies improve the information quality of issuer-paid rating agencies? J. Financ. Econ. 111, 450–468 (2014)
Article Google Scholar
Xie, J., Wang, C.: Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases. Expert Syst. Appl. 38, 5809–5815 (2011)
Article Google Scholar
Yap, B.W., Ong, S.H., Husain, N.H.M.: Using data mining to improve assessment of credit worthiness via credit scoring models. Expert Syst. Appl. 38, 13274–13283 (2011)
Article Google Scholar
Zhong, H., Miao, C., Shen, Z., Feng, Y.: Comparing the learning effectiveness of BP, ELM, I-ELM, and SVM for corporate credit ratings. Neurocomputing 128, 285–295 (2014)
Article Google Scholar
Zhu, P., Hu, Q.: Rule extraction from support vector machines based on consistent region covering reduction. Knowl.-Based Syst. 42, 1–8 (2013)
Article Google Scholar
Zhu, D., Premkumar, G., Zhang, X., Chu, C.H.: Data mining for network intrusion detection: a comparison of alternative methods. Decis. Sci. 32, 1–26 (2001)
Article Google Scholar

Download references

Acknowledgments

The authors would like to thank Ministry of Science and Technology of the Republic of China, Taiwan, for financially supporting this research under Contract Numbers NSC 101-2410-H-260-005-MY2 and MOST 103-2410-H-260-020.

Author information

Authors and Affiliations

Department of Information Management, National Chi Nan University, 1, University Rd., Puli, Nantou, 545, Taiwan, ROC
Ping-Feng Pai & Yi-Shien Tan
Department of International Business Studies, National Chi Nan University, 1, University Rd., Puli, Nantou, 545, Taiwan, ROC
Ming-Fu Hsu

Authors

Ping-Feng Pai
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Shien Tan
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Fu Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ping-Feng Pai.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pai, PF., Tan, YS. & Hsu, MF. Credit Rating Analysis by the Decision-Tree Support Vector Machine with Ensemble Strategies. Int. J. Fuzzy Syst. 17, 521–530 (2015). https://doi.org/10.1007/s40815-015-0063-y

Download citation

Received: 03 July 2014
Revised: 03 May 2015
Accepted: 28 May 2015
Published: 18 July 2015
Issue Date: December 2015
DOI: https://doi.org/10.1007/s40815-015-0063-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Credit Rating Analysis by the Decision-Tree Support Vector Machine with Ensemble Strategies

Abstract

Access this article

Similar content being viewed by others

Credit Scoring Models Using Ensemble Learning and Classification Approaches: A Comprehensive Survey

The Most Effective Strategy for Incorporating Feature Selection into Credit Risk Assessment

Rough set-based feature selection for credit risk prediction using weight-adjusted boosting ensemble method

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Credit Rating Analysis by the Decision-Tree Support Vector Machine with Ensemble Strategies

Abstract

Access this article

Similar content being viewed by others

Credit Scoring Models Using Ensemble Learning and Classification Approaches: A Comprehensive Survey

The Most Effective Strategy for Incorporating Feature Selection into Credit Risk Assessment

Rough set-based feature selection for credit risk prediction using weight-adjusted boosting ensemble method

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation