Skip to main content
Log in

An empirical application of a hybrid ANFIS model to predict household over-indebtedness

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

The increase in debt levels of families in different parts of the world has drawn the attention of organizations dedicated to the prevention of financial risk and has highlighted the need to develop early detection methods for over-indebtedness. In this paper, we propose a hybrid model of the adaptive neural fuzzy inference system (ANFIS) and Probit model for the prediction of household over-indebtedness. The proposed model is compared with Probit, artificial neural networks (ANN), classification and regression trees (CART), random forest (RF) and support vector machine (SVM) models. The most relevant parameters for the performance of each model are optimized, and we address data balance problems through the synthetic minority over-sampling technique (SMOTE). We use data obtained from the Financial Household Survey of the Central Bank of Chile. The results show that the proposed model performs significantly better than the reference models in terms of the correct classification of indebted individuals. Consequently, this model provides an innovative understanding of household over-indebtedness, which can be useful for different governmental entities focused on preventing excessive indebtedness and maintaining financial stability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. https://www.efhweb.cl/ES/EFH/EncuestaAnterior/2014.

  2. The information contained in the variables \({X}_{1},{X}_{2},{X}_{5},{X}_{6}{X}_{8}\) represent data of individuals designated as heads of household. More details about the data and variables used can be found in Appendix 1.

References

  1. Altman EI, Marco G, Varetto F (1994) Corporate distress diagnosis: comparisons using linear discriminant analysis and neural networks (the Italian experience). J Bank Finance 18(3):505–529. https://doi.org/10.1016/0378-4266(94)90007-8

    Article  Google Scholar 

  2. Anderloni L, Vandone D (2008) Households over-indebtedness in the economic literature. Universit’a Degli Studi Di Milano Working Paper, 46, 775

  3. Angel S, Heitzmann K (2015) Over-indebtedness in Europe: the relevance of country-level variables for the over-indebtedness of private households. J Eur Soc Policy 25(3):331–351. https://doi.org/10.1177/0958928715588711

    Article  Google Scholar 

  4. Athey S, Imbens GW (2019) Machine learning methods that economists should know about. Annual Rev Econom 11:685–725

    Article  Google Scholar 

  5. Azayite FZ, Achchab S (2016) Hybrid discriminant neural networks for bankruptcy prediction and risk scoring. Procedia Comput Sci 83:670–674. https://doi.org/10.1016/J.PROCS.2016.04.149

    Article  Google Scholar 

  6. Bahrammirzaee A (2010) A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems. Neur Comput Appl 19(8):1165–1195

    Article  Google Scholar 

  7. Betti G, Dourmashkin N, Rossi M, Ping Yin Y (2007) Consumer over-indebtedness in the EU: measurement and characteristics. J Econ Stud 34(2):136–156. https://doi.org/10.1108/01443580710745371

    Article  Google Scholar 

  8. Chandra DK, Ravi V, Bose I (2009) Failure prediction of dotcom companies using hybrid intelligent techniques. Expert Syst Appl 36(3):4830–4837. https://doi.org/10.1016/J.ESWA.2008.05.047

    Article  Google Scholar 

  9. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

    Article  Google Scholar 

  10. Charpentier A, Flachaire E, Ly A (2018) Econometrics and machine learning. Econ Stat 505(1):147–169

    Google Scholar 

  11. D’Alessio G, Iezzi S (2013) Household over-indebtedness: definition and measurement with Italian data. accessed from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.642.1533&rep=rep1&type=pdf#page=502

  12. Fedorova E, Gilenko E, Dovzhenko S (2013) Bankruptcy prediction for Russian companies: application of combined classifiers. Expert Syst Appl 40(18):7285–7293. https://doi.org/10.1016/J.ESWA.2013.07.032

    Article  Google Scholar 

  13. Gathergood J (2012) Self-control, financial literacy and consumer over-indebtedness. J Econ Psychol 33(3):590–602

    Article  Google Scholar 

  14. Gogas P, Papadimitriou T (2021) Machine Learning in Economics and Finance. Computat Econ. https://doi.org/10.1007/s10614-021-10094-w

    Article  Google Scholar 

  15. Gutierrez P, Gérardy JY (2017) Causal inference and uplift modelling: A review of the literature. In International conference on predictive applications and APIs (pp. 1–13). PMLR

  16. Henrique BM, Sobreiro VA, Kimura H (2019) Literature review: Machine learning techniques applied to financial market prediction. Expert Syst Appl 124:226–251

    Article  Google Scholar 

  17. Hojman DA, Miranda Á, Ruiz-Tagle J (2016) Debt trajectories and mental health. Soc Sci Med 167:54–62

    Article  Google Scholar 

  18. Hsu MW, Lessmann S, Sung MC, Ma T, Johnson JE (2016) Bridging the divide in financial market forecasting: machine learners vs. financial economists. Expert Syst Appl 61:215–234

    Article  Google Scholar 

  19. Khairalla M, AL-Jallad NT (2017) Hybrid forecasting scheme for financial time-series data using neural network and statistical methods. Int J Adv Comput Sci Appl 8(9):319–327

    Google Scholar 

  20. Ladas A, Garibaldi J, Scarpel R, Aickelin U (2104) Augmented neural networks for modelling consumer indebtedness, 2104 International joint conference on neural networks

  21. Lea SEG, Webley P, Walker CM (1995) Psychological factors in consumer debt: money management, economic socialization, and credit use. J Econ Psychol 16(4):681–701. https://doi.org/10.1016/0167-4870(95)00013-4

    Article  Google Scholar 

  22. Melnychenko O (2020) Is artificial intelligence ready to assess an enterprise’s financial security? J Risk Finan Manage 13(9):191

    Article  Google Scholar 

  23. Mi Y (2013) Imbalanced classification based on active learning SMOTE. Resear J Appl Sci Eng Tech 5:944–949

    Article  Google Scholar 

  24. Min JH, Lee Y-C (2005) Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters. Expert Syst Appl 28(4):603–614. https://doi.org/10.1016/J.ESWA.2004.12.008

    Article  Google Scholar 

  25. Mselmi N, Lahiani A, Hamza T (2017) Financial distress prediction: The case of French small and medium-sized firms. Int Rev Finance Anal 50:67–80. https://doi.org/10.1016/J.IRFA.2017.02.004

    Article  Google Scholar 

  26. Mullainathan S, Spiess J (2017) Machine learning: an applied econometric approach. J Econ Perspect 31(2):87–106

    Article  Google Scholar 

  27. Ntsalaze L, Ikhide S (2016) Household over-indebtedness: understanding its extent and characteristics of those affected. J Social Scien 48(1–2):79–93. https://doi.org/10.1080/09718923.2016.11893573

    Article  Google Scholar 

  28. Peng Y, Wang G, Kou G, Shi Y (2011) An empirical study of classification algorithm evaluation for financial risk prediction. Appl Soft Comput 11(2):2906–2915. https://doi.org/10.1016/J.ASOC.2010.11.028

    Article  Google Scholar 

  29. Ruiz-Tagle J, García L, Miranda A (2013) Proceso de endeudamiento y sobre endeudamiento de los hogares en chile. Banco Central de Chile, Documento de trabajo, 703

  30. Sarle WS (1997) Neural Network FAQ, periodic posting to the Usenet newsgroup comp. ai. neural-nets. URL:Ftp://Ftp.Sas.Com/Pub/Neural/FAQ.Html.

  31. Stone B, Maury RV (2006) Indicators of personal financial debt using a multi-disciplinary behavioral model. J Econ Psychol 27(4):543–556. https://doi.org/10.1016/J.JOEP.2005.11.002

    Article  Google Scholar 

  32. Syrgkanis V, Lewis G, Oprescu M, Hei M, Battocchi K, Dillon E, Lee JY (2021) Causal inference and machine learning in practice with econml and causalml: Industrial use cases at microsoft, tripadvisor, uber. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining (pp. 4072–4073)

  33. Tkáč M, Verner R (2016) Artificial neural networks in business: Two decades of research. Appl Soft Comput 38:788–804. https://doi.org/10.1016/J.ASOC.2015.09.040

    Article  Google Scholar 

  34. Varian HR (2014) Big data: New tricks for econometrics. J Econ Perspect 28(2):3–28

    Article  Google Scholar 

  35. Veganzones D, Séverin E (2018) An investigation of bankruptcy prediction in imbalanced datasets. Decis Support Syst 112:111–124. https://doi.org/10.1016/J.DSS.2018.06.011

    Article  Google Scholar 

  36. Zelenkov Y, Volodarskiy N (2021) Bankruptcy prediction on the base of the unbalanced data using multi-objective selection of classifiers. Expert Syst Appl 185:115559

    Article  Google Scholar 

  37. Zhang G, Hu YM, Eddy Patuwo B, Indro DC (1999) Artificial neural networks in bankruptcy prediction: general framework and cross-validation analysis. Eur J Oper Res 116(1):16–32. https://doi.org/10.1016/S0377-2217(98)00051-4

    Article  MATH  Google Scholar 

  38. Zoričák M, Gnip P, Drotár P, Gazda V (2020) Bankruptcy prediction for small-and medium-sized companies using severely imbalanced datasets. Econ Model 84:165–176

    Article  Google Scholar 

Download references

Acknowledgements

The authors are grateful for the financial support of the General Directorate for Research, Innovation, and Postgraduate Studies (DGIIP) of UTFSM Chile through the "Program of Incentives for Scientific Initiation" (PIIC).

Funding

The article was funded by Universidad Técnica Federico Santa María, PIIC, Nicole Astudillo

Author information

Authors and Affiliations

Authors

Contributions

Werner Kristjanpoller: Conceptualization, Methodology, Software, Methodology, Supervision, Data curation, Writing- Reviewing and Editing, Formal analysis, Validation. Nicole Astudillo: Conceptualization, Methodology, Software, Data curation, Writing Original draft preparation, Formal analysis. Josephine Olson: Methodology, Supervision, Writing- Reviewing and Editing, Formal analysis, Validation.

Corresponding author

Correspondence to Werner Kristjanpoller.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This paper does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

For this type of study formal consent is not required.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1

1.1 Variable description

The following table describes the variables used in this study to describe household financial over-indebtedness for the developed models.

Variables

Description

\({X}_{1}\) Gender

Dichotomous variable that describes the gender of the head of household where:1: Woman / 0: Man

\({X}_{2}\) Age

Integer variable that indicates the age of the head of household

\({X}_{3}\) Savings

Continuous variable that describes the total declared amount of savings recorded by the household in the last 12 months. Expressed in millions of Chilean pesos

\({X}_{4}\) Workers

Integer variable that describes the number of members of the household that are working

\({X}_{5}\) Card

Dichotomous variable that indicates if any members of the household have a credit card with:1: Has a card/ 0: Does not have a card

\({X}_{6}\) Marital status

Dichotomous variable that represents the marital status of the head of household with:1: Married or partner / 0: Other

\({X}_{7}\) Income

Continuous variable that indicates the total amount of income that the household receives monthly. Expressed in millions of Chilean pesos

\({X}_{8}\) Educational level

Integer variable that represents the years of schooling of the head of household. Chile's educational system has an average of 8 years for primary education, 4 years for secondary education, and 6 for higher education. The survey also considers postgraduate studies or second professions

\({X}_{9}\) Mortgage burden

Continuous variable that indicates the total amount that the household pays monthly for mortgage debt. Expressed in millions of Chilean pesos

\({X}_{10}\) Educational debt

Continuous variable which indicates the total amount owed by the household for the education of its members. Expressed in millions of Chilean pesos

Appendix 2

1.1 Performance measurement

To evaluate and compare all the models, we use the confusion matrices obtained from each evaluated model and calculate the accuracy metrics (ACC), positive (\(A{R}_{P}\)) and negative \((A{R}_{N})\) class accuracy and area under curve (AUC). These are defined in Table

Table 6 Confusion matrix and accuracy measure confusion matrix

6 as follows:

$$\mathrm{Accuracy }\left(ACC\right)= \frac{TP+TN}{TP+FP+TN+FN}$$
(1)
$$\mathrm{Positive class accuracy} \left(A{R}_{P}\right)=\frac{TP}{TP+FN}$$
(2)
$$\mathrm{Negative class accuracy }\left(A{R}_{N}\right)=\frac{TN}{TN+FP}$$
(3)
$$\mathrm{Area under Curve }\left(AUC\right)=\frac{A{R}_{P}+A{R}_{N}}{2}$$
(4)

In order to supplement the evaluation of over-indebtedness model performance, we consider the presence and significance of misclassification. Given a null hypothesis (over-indebted household), each time this hypothesis is rejected when it is true, a type I error is committed. On the other hand, when the null hypothesis is false (household is not over-indebted) and it is rejected, a type II error is committed. For the over-indebtedness models developed in this article, the relevance of a type I error is much greater than a type II error, since it impairs the usefulness and general applicability of the model. For this reason, the key performance measurement is the AUC.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kristjanpoller, W., Astudillo, N. & Olson, J.E. An empirical application of a hybrid ANFIS model to predict household over-indebtedness. Neural Comput & Applic 34, 17343–17353 (2022). https://doi.org/10.1007/s00521-022-07389-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-07389-w

Keywords

Navigation