Skip to main content
Log in

A New Discrete Learning-Based Logistic Regression Classifier for Bankruptcy Prediction

  • Published:
Wireless Personal Communications Aims and scope Submit manuscript

Abstract

Credit scoring or predicting bankruptcy is among the most crucial techniques for identifying high-risk and low-risk credit situations. Accordingly, enhancing the accuracy of bankruptcy prediction methods decreases the risk of inappropriate financial decisions. Also, increasing the accuracy of credit scoring models brings significant benefits such as improved turnover, credit market growth, proper and efficient allocation of financial resources, and sustained improvement of the profits of banks, investors, funds, and governments. Various statistical classification methods have been developed in the literature with different features and characteristics for more accurate bankruptcy prediction. However, despite all appearance differences in statistical classification approaches, they all adhere to a common idea and concept in their training procedures. The basic operation logic in whole-developed statistical classification methods focuses on maximizing a continuous distance-based cost function to yield the highest performance. Despite it being a common and frequently used procedure for classification purposes, it is an unreasonable and inefficient manner to achieve maximum accuracy in a discrete classification field. In this paper, a new discrete direction-based Logistic Regression that is a common statistical classifier method for bankruptcy forecasting is proposed. In the proposed Logistic Regression, in contrast to all traditionally developed statistical classifiers, the compatibility of the cost function and the training procedure is considered. While it can be shown overall that the performance of the presented discrete direction-based classifier will not be inferior to its continuous counterpart, an evaluation of the suggested classifier is conducted to ascertain its superiority. For this purpose, three credit scoring datasets are considered to assess the classification rate of the presented classifier. Empirical outcomes demonstrate that, as pre-expected, in all cases, the model put forward can attain a superior performance compared to conventional alternatives. These findings clearly demonstrated the significant influence of the consistency between the cost function and the training process on the classification capability, a consideration absent in any of the traditional statistical classification procedures. Consequently, the presented Logistic Regression can be considered an efficient alternative for credit scoring purposes to achieve more accurate results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Data Availability

Data will be available from the corresponding author upon reasonable request.

References

  1. Kao, L. J., Chiu, C. C., & Chiu, F. Y. (2012). A Bayesian latent variable model with classification and regression tree approach for behavior and credit scoring. Knowledge-Based Systems, 36, 245–252. https://doi.org/10.1016/j.knosys.2012.07.004

    Article  Google Scholar 

  2. Akkoç, S. (2012). An empirical comparison of conventional techniques, neural networks and the three stage hybrid adaptive neuro fuzzy inference system (ANFIS) model for credit scoring analysis: The case of Turkish credit card data. European Journal of Operational Research, 222(1), 168–178. https://doi.org/10.1016/j.ejor.2012.04.009

    Article  Google Scholar 

  3. Brown, I., & Mues, C. (2012). An experimental comparison of classification algorithms for imbalanced credit scoring data sets. Expert Systems with Applications, 39(3), 3446–3453. https://doi.org/10.1016/j.eswa.2011.09.033

    Article  Google Scholar 

  4. Abdou, H. A. (2009). Genetic programming for credit scoring: The case of Egyptian public sector banks. Expert systems with applications, 36(9), 11402–11417. https://doi.org/10.1016/j.eswa.2009.01.076

    Article  Google Scholar 

  5. Silva, D. M., Pereira, G. H., & Magalhães, T. M. (2022). A class of categorization methods for credit scoring models. European Journal of Operational Research, 296(1), 323–331. https://doi.org/10.1016/j.ejor.2021.04.029

    Article  MathSciNet  Google Scholar 

  6. Ahelegbey, D. F., Giudici, P., & Hadji-Misheva, B. (2019). Latent factor models for credit scoring in P2P systems. Physica A: Statistical Mechanics and its Applications, 522, 112–121. https://doi.org/10.1016/j.physa.2019.01.130

    Article  Google Scholar 

  7. Filippopoulou, C., Galariotis, E., & Spyrou, S. (2020). An early warning system for predicting systemic banking crises in the Eurozone: A logit regression approach. Journal of Economic Behavior & Organization, 172, 344–363. https://doi.org/10.1016/j.jebo.2019.12.023

    Article  Google Scholar 

  8. Lee, S., & Jun, C. H. (2018). Fast incremental learning of logistic model tree using least angle regression. Expert Systems with Applications, 97, 137–145. https://doi.org/10.1016/j.eswa.2017.12.014

    Article  Google Scholar 

  9. Klieštik, T., Kočišová, K., & Mišanková, M. (2015). Logit and probit model used for prediction of financial health of company. Procedia economics and finance, 23, 850–855. https://doi.org/10.1016/S2212-5671(15)00485-2

    Article  Google Scholar 

  10. Dumitrescu, E., Hue, S., Hurlin, C., & Tokpavi, S. (2022). Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. European Journal of Operational Research. https://doi.org/10.1016/j.ejor.2021.06.053

    Article  MathSciNet  Google Scholar 

  11. Villuendas-Rey, Y., Rey-Benguría, C. F., Ferreira-Santiago, Á., Camacho-Nieto, O., & Yáñez-Márquez, C. (2017). The naïve associative classifier (NAC): A novel, simple, transparent, and accurate classification model evaluated on financial data. Neurocomputing, 265, 105–115. https://doi.org/10.1016/j.neucom.2017.03.085

    Article  Google Scholar 

  12. Sohn, S. Y., Kim, D. H., & Yoon, J. H. (2016). Technology credit scoring model with fuzzy logistic regression. Applied Soft Computing, 43, 150–158. https://doi.org/10.1016/j.asoc.2016.02.025

    Article  Google Scholar 

  13. Nikolic, N., Zarkic-Joksimovic, N., Stojanovski, D., & Joksimovic, I. (2013). The application of brute force logistic regression to corporate credit scoring models: Evidence from Serbian financial statements. Expert Systems with Applications, 40(15), 5932–5944. https://doi.org/10.1016/j.eswa.2013.05.022

    Article  Google Scholar 

  14. Vrontos, S. D., Galakis, J., & Vrontos, I. D. (2021). Modeling and predicting US recessions using machine learning techniques. International Journal of Forecasting, 37(2), 647–671. https://doi.org/10.1016/j.ijforecast.2020.08.005

    Article  Google Scholar 

  15. Teply, P., & Polena, M. (2020). Best classification algorithms in peer-to-peer lending. The North American Journal of Economics and Finance, 51, 100904. https://doi.org/10.1016/j.najef.2019.01.001

    Article  Google Scholar 

  16. Moscatelli, M., Parlapiano, F., Narizzano, S., & Viggiano, G. (2020). Corporate default forecasting with machine learning. Expert Systems with Applications, 161, 113567. https://doi.org/10.1016/j.eswa.2020.113567

    Article  Google Scholar 

  17. Wang, Y., Zhang, Y., Lu, Y., & Yu, X. (2020). A comparative assessment of credit risk model based on machine learning—a case study of bank loan data. Procedia Computer Science, 174, 141–149. https://doi.org/10.1016/j.procs.2020.06.069

    Article  Google Scholar 

  18. Barboza, F., Kimura, H., & Altman, E. (2017). Machine learning models and bankruptcy prediction. Expert Systems with Applications, 83, 405–417. https://doi.org/10.1016/j.eswa.2017.04.006

    Article  Google Scholar 

  19. Fitzpatrick, T., & Mues, C. (2016). An empirical comparison of classification algorithms for mortgage default prediction: Evidence from a distressed mortgage market. European Journal of Operational Research, 249(2), 427–439. https://doi.org/10.1016/j.ejor.2015.09.014

    Article  MathSciNet  Google Scholar 

  20. Abdou, H. A., Tsafack, M. D. D., Ntim, C. G., & Baker, R. D. (2016). Predicting creditworthiness in retail banking with limited scoring data. Knowledge-Based Systems, 103, 89–103. https://doi.org/10.1016/j.knosys.2016.03.023

    Article  Google Scholar 

  21. Gordini, N. (2014). A genetic algorithm approach for SMEs bankruptcy prediction: Empirical evidence from Italy. Expert systems with applications, 41(14), 6433–6445. https://doi.org/10.1016/j.eswa.2014.04.026

    Article  Google Scholar 

  22. Cubiles-De-La-Vega, M. D., Blanco-Oliver, A., Pino-Mejías, R., & Lara-Rubio, J. (2013). Improving the management of microfinance institutions by using credit scoring models based on statistical learning techniques. Expert systems with applications, 40(17), 6910–6917. https://doi.org/10.1016/j.eswa.2013.06.031

    Article  Google Scholar 

  23. Lin, T. H. (2009). A cross model study of corporate financial distress prediction in Taiwan: Multiple discriminant analysis, logit, probit and neural networks models. Neurocomputing, 72(16–18), 3507–3516. https://doi.org/10.1016/j.neucom.2009.02.018

    Article  Google Scholar 

  24. Etemadi, S., & Khashei, M. (2024). Etemadi regression in chemometrics: Reliability-based procedures for modeling and forecasting. Heliyon, 10(5), e26399. https://doi.org/10.1016/j.heliyon.2024.e26399

    Article  Google Scholar 

  25. Etemadi, S., Khashei, M., & Tamizi, S. (2023). Etemadi reliability-based multi-layer perceptrons for classification and forecasting. Information Sciences, 651, 119716. https://doi.org/10.1016/j.ins.2023.119716

    Article  Google Scholar 

  26. Liu, W., Fan, H., & Xia, M. (2021). Step-wise multi-grained augmented gradient boosting decision trees for credit scoring. Engineering Applications of Artificial Intelligence, 97, 104036. https://doi.org/10.1016/j.engappai.2020.104036

    Article  Google Scholar 

  27. Guo, S., He, H., & Huang, X. (2019). A multi-stage self-adaptive classifier ensemble model with application in credit scoring. IEEE Access, 7, 78549–78559. https://doi.org/10.1109/ACCESS.2019.2922676

    Article  Google Scholar 

  28. Zhang, H., He, H., & Zhang, W. (2018). Classifier selection and clustering with fuzzy assignment in ensemble model for credit scoring. Neurocomputing, 316, 210–221. https://doi.org/10.1016/j.neucom.2018.07.070

    Article  Google Scholar 

  29. Tripathi, D., Edla, D. R., Kuppili, V., Bablani, A., & Dharavath, R. (2018). Credit scoring model based on weighted voting and cluster based feature selection. Procedia computer science, 132, 22–31. https://doi.org/10.1016/j.procs.2018.05.055

    Article  Google Scholar 

  30. Khashei, M., Rezvan, M. T., Hamadani, A. Z., & Bijari, M. (2013). A bi-level neural-based fuzzy classification approach for credit scoring problems. Complexity, 18(6), 46–57. https://doi.org/10.1002/cplx.21458

    Article  MathSciNet  Google Scholar 

  31. Ala’raj, M., & Abbod, M. F. (2016). A new hybrid ensemble credit scoring model based on classifiers consensus system approach. Expert Systems with Applications, 64, 36–55. https://doi.org/10.1016/j.eswa.2016.07.017

    Article  Google Scholar 

  32. Tsai, C. F. (2014). Combining cluster analysis with classifier ensembles to predict financial distress. Information Fusion, 16, 46–58.

    Article  Google Scholar 

  33. Tripathi, D., Edla, D. R., Kuppili, V., & Bablani, A. (2020). Evolutionary extreme learning machine with novel activation function for credit scoring. Engineering Applications of Artificial Intelligence, 96, 103980. https://doi.org/10.1016/j.engappai.2020.103980

    Article  Google Scholar 

  34. Zhang, W., Yang, D., Zhang, S., Ablanedo-Rosas, J. H., Wu, X., & Lou, Y. (2021). A novel multi-stage ensemble model with enhanced outlier adaptation for credit scoring. Expert Systems with Applications, 165, 113872. https://doi.org/10.1016/j.eswa.2020.113872

    Article  Google Scholar 

Download references

Funding

There is no funding to report.

Author information

Authors and Affiliations

Authors

Contributions

All authors have the same contribution to preparing this manuscript.

Corresponding author

Correspondence to Mehdi Khashei.

Ethics declarations

Conflict of interest

The authors have no conflict of interest.

Ethical Approval

The authors confirm that ethical conduct has been respected.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khashei, M., Etemadi, S. & Bakhtiarvand, N. A New Discrete Learning-Based Logistic Regression Classifier for Bankruptcy Prediction. Wireless Pers Commun 134, 1075–1092 (2024). https://doi.org/10.1007/s11277-024-10961-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11277-024-10961-3

Keywords

Navigation