Skip to main content
Log in

A novel classifier ensemble approach for financial distress prediction

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Financial distress prediction is very important to financial institutions who must be able to make critical decisions regarding customer loans. Bankruptcy prediction and credit scoring are the two main aspects considered in financial distress prediction. To assist in this determination, thereby lowering the risk borne by the financial institution, it is necessary to develop effective prediction models for prediction of the likelihood of bankruptcy and estimation of credit risk. A number of financial distress prediction models have been constructed, which utilize various machine learning techniques, such as single classifiers and classifier ensembles, but improving the prediction accuracy is the major research issue. In addition, aside from improving the prediction accuracy, there have been very few studies that specifically consider lowering the Type I error. In practice, Type I errors need to receive careful consideration during model construction because they can affect the cost to the financial institution. In this study, we introduce a classifier ensemble approach designed to reduce the misclassification cost. The outputs produced by multiple classifiers are combined by utilizing the unanimous voting (UV) method to find the final prediction result. Experimental results obtained based on four relevant datasets show that our UV ensemble approach outperforms the baseline single classifiers and classifier ensembles. Specifically, the UV ensemble not only provides relatively good prediction accuracy and minimizes Type I/II errors, but also produces the smallest misclassification cost.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. http://archive.ics.uci.edu/ml/datasets/Statlog+(Australian+Credit+Approval).

  2. http://archive.ics.uci.edu/ml/datasets/Statlog+(German+Credit+Data).

  3. http://www.tej.com.tw/twsite/.

References

  1. Altman EI (1968) Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. J Finance 23:589–609

    Article  Google Scholar 

  2. Balcaen S, Ooghe H (2006) 35 years of studies on business failure: an overview of the classic statistical methodologies and their related problems. Br Account Rev 38:63–93

    Article  Google Scholar 

  3. Beaver WH (1966) Financial ratios as predictors of failure. J Account Res 4:71–111

  4. Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press, Oxford

    MATH  Google Scholar 

  5. Boritz JE, Kennedy DB (1995) Effectiveness of neural network types for prediction of business failure. Expert Syst Appl 9:503–512

    Article  Google Scholar 

  6. Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140

    MathSciNet  MATH  Google Scholar 

  7. Clyde MA, Lee HK (2001) Bagging and the Bayesian bootstrap. In: International conference on artificial intelligence and statistics, pp 169–174

  8. Crook JN, Edelman DB, Thomas LC (2007) Recent developments in consumer credit risk assessment. Eur J Oper Res 183:1447–1465

    Article  MathSciNet  MATH  Google Scholar 

  9. Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MathSciNet  MATH  Google Scholar 

  10. Dietterich TG (1997) Machine-learning research. AI Mag 18:97

    Google Scholar 

  11. Ethem A (2004) Introduction to machine learning. MIT Press, Cambridge

    MATH  Google Scholar 

  12. Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27:861–874

    Article  Google Scholar 

  13. Fitzpartrick PJ (1932) A comparison of the ratios of successful industrial enterprises with those of failed companies. The Accountants Publishing Company.

  14. Freund Y, Schapire RE (July 1996) Experiments with a new boosting algorithm. In: Proceedings of the international conference on machine learning, Bari, Italy, pp 148–156

  15. Geng R, Bose I, Chen X (2015) Prediction of financial distress: an empirical study of listed Chinese companies using data mining. Eur J Oper Res 241:236–247

    Article  Google Scholar 

  16. Heo J, Yang JY (2014) AdaBoost based bankruptcy forecasting of Korean construction companies. Appl Soft Comput 24:494–499

    Article  Google Scholar 

  17. John GH Langley P (1995) Estimating continuous distributions in Bayesian classifiers. In: International conference on uncertainty in artificial intelligence, pp 338–345

  18. Kim H, Kim H, Moon H, Ahn H (2011) A weight-adjusted voting algorithm for ensembles of classifiers. J Korean Stat Soc 40:437–449

    Article  MathSciNet  MATH  Google Scholar 

  19. Kittler J, Hatef M, Duin RPW, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239

    Article  Google Scholar 

  20. Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International joint conference on artificial intelligence, pp 1137–1143

  21. Kumar PR, Ravi V (2007) Bankruptcy prediction in banks and firms via statistical and intelligent techniques—a review. Eur J Oper Res 180:1–28

    Article  MATH  Google Scholar 

  22. Lei S, Xinming M, Lei X, Xiaohong H (2010) Financial data mining based on support vector machines and ensemble learning. In: International conference on intelligent computation technology and automation, pp 313–314

  23. Li H, Sun J, Wu J (2010) Predicting business failure using classification and regression tree: an empirical comparison with popular classical statistical methods and top classification mining methods. Expert Syst Appl 37:5895–5904

    Article  Google Scholar 

  24. Lin W-Y, Hu Y-H, Tsai C-F (2012) Machine learning in financial crisis prediction: a survey. IEEE Trans Syst Man Cybern Part C Appl Rev 42(4):421–436

    Article  Google Scholar 

  25. Ohlson JA (1980) Financial ratios and the probabilistic prediction of bankruptcy. J Account Res 18:109–131

    Article  Google Scholar 

  26. Ozkan-Gunay EN, Ozkan M (2007) Prediction of bank failures in emerging financial markets: an ANN approach. J Risk Finance 8:465–480

    Article  Google Scholar 

  27. Paleologo G, Elisseeff A, Antonini G (2010) Subagging for credit scoring models. Eur J Oper Res 201:490–499

    Article  Google Scholar 

  28. Sexton RS, Sriram RS, Etheridge H (2003) Improving decision effectiveness of artificial neural networks: a modified genetic algorithm approach. Decis Sci 34:421–442

    Article  Google Scholar 

  29. Shi L, Xi L, Ma X, Hu X (2009) Bagging of artificial neural networks for bankruptcy prediction. In: International conference on information and financial engineering, pp 154–156

  30. Shin K-S, Lee TS, Kim H-J (2005) An application of support vector machines in bankruptcy prediction model. Expert Syst Appl 28:127–135

    Article  Google Scholar 

  31. Tam KY, Kiang MY (1992) Managerial applications of neural networks: the case of bank failure predictions. Manag Sci 38:926–947

    Article  MATH  Google Scholar 

  32. Tsai C-F (2009) Feature selection in bankruptcy prediction. Knowl Based Syst 22:120–127

    Article  Google Scholar 

  33. Verikas A, Kalsyte Z, Bacauskiene M, Gelzinis A (2010) Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: a survey. Soft Comput 14:995–1010

    Article  Google Scholar 

  34. Wang G, Hao J, Ma J, Jiang H (2011) A comparative assessment of ensemble learning for credit scoring. Expert Syst Appl 38:223–230

    Article  Google Scholar 

  35. Wang G, Ma J (2012) A hybrid ensemble approach for enterprise credit risk assessment based on support vector machine. Expert Syst Appl 39(5):5325–5331

    Article  Google Scholar 

  36. Wang S-J, Mathew A, Chen Y, Xi L-F, Ma L, Lee J (2009) Empirical analysis of support vector machine ensemble classifiers. Expert Syst Appl 36:6466–6476

    Article  Google Scholar 

  37. West D (2000) Neural network credit scoring models. Comput Oper Res 27:1131–1152

    Article  MATH  Google Scholar 

  38. West D, Dellana S, Qian J (2005) Neural network ensemble strategies for financial decision applications. Comput Oper Res 32:2543–2559

    Article  MATH  Google Scholar 

  39. Wolpert DH (1992) Stacked generalization. Neural Netw 5(2):241–259

    Article  Google Scholar 

  40. Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Yu PS, Zhou Z-H, Steinbach M, Hand DJ, Steinberg D (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14:1–37

    Article  Google Scholar 

  41. Xu X Frank E (2004) Logistic regression and boosting for labeled bags of instances. In: Dai H, Srikant R, Zhang C (eds) Proceedings 8th Pacific-Asia Conference, PAKDD 2004, Sydney, Australia, 26–28 May 2004. Springer, Berlin, pp. 272–281

  42. Yao P (2009) Credit scoring using ensemble machine learning. In: International conference on hybrid intelligent systems, pp 244–246

  43. Zhang D, Zhou X, Leung SCH, Zheng J (2010) Vertical bagging decision trees model for credit scoring. Expert Syst Appl 37:7838–7843

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chih-Fong Tsai.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liang, D., Tsai, CF., Dai, AJ. et al. A novel classifier ensemble approach for financial distress prediction. Knowl Inf Syst 54, 437–462 (2018). https://doi.org/10.1007/s10115-017-1061-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-017-1061-1

Keywords

Navigation