Abstract
Recognizing potential defaulters is a crucial problem for financial institutions. Therefore, many credit scoring methods have been proposed in the past to address this issue. However, these methods rarely consider the interaction among customers such as bank transfer and remittance. With rapid growth in the number of customers adopting online banking services, such interaction information plays a significant role in assessing their credit score. In this paper, we propose a novel scalable credit scoring approach called CDGAT (Graph attention network for credit card defaulters) for predicting potential credit card defaulters. In CDGAT, a customer’s credit score is calculated based on transaction embedding and neighborhood embedding. To obtain the neighborhood embedding, CDGAT first utilizes the Amount-bias Sampling (AbS) strategy to extract a subgraph for each customer. Next, CDGAT directly aggregates neighbors’ features according to their influence weights. The experimental results on the dataset from Industrial and Commercial Bank of China (Macau) Limited (ICBC (Macau)) show that CDGAT significantly outperforms the baseline methods. Furthermore, experimental results reveal that the proposed method is also superior to several state-of-the-art Graph Convolutional Neural Network models in terms of scalability and performance.



Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Code Availability
The program developed during the current study are not publicly available.
References
Berg T, Burg V, Gombović A, Puri M (2019) On the rise of fintechs: credit scoring using digital footprints. Rev Financ Stud 33(7):2845–2897
Bellotti T, Crook J (2013) Forecasting and stress testing credit card default using dynamic models. Int J Forecast 29(4):563–574
Butaru F, Chen Q, Clark B, Das S, Lo AW, Siddique A (2016) Risk and risk management in the credit card industry. J B Financ 72:218–239
Björkegren D, Grissen D (2017) Behavior revealed in mobile phone usage predicts loan repayment. arXiv:1712:05840. 1–28
Bhattacharyya S, Jha S, Tharakunnel KK, Westland JC (2011) Data mining for credit card fraud: a comparative study. Decis Support Syst 50(3):602–613
Babaev D, Savchenko M, Tuzhilin A, Umerenkov D (2019) ET-RNN: applying deep learning to credit loan applications. In: Teredesai A, Kumar V, Li Y, Rosales R, Terzi E, Karypis G (eds) Proceedings of the 25th ACM SIGKDD International conference on knowledge discovery and data mining. ACM, Anchorage, pp 2183–2190
Chung J, Gülçehre C, KyungHyun C, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555, 1–9
Cao B, Mao M, Viidu S, Philip SYu (2017) Collective fraud detection capturing inter-transaction dependency. In: Anandakrishnan A, Kumar S, Statnikov AR, Faruquie TA, Xu D (eds) Proceedings of the 23th KDD workshop on anomaly detection. PMLR, Halifax, pp 66–75
Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. In: Lee DD, Sugiyama M, von Luxburg U, Guyon I, Garnett R (eds) Proceedings of the 29th advances in neural information processing systems. Curran Associates Inc, Barcelona, pp 3837–3845
Hand DJ, Henley WE (1997) Statistical classification methods in consumer credit scoring: a review. J R Stat Soc Ser A Stat Soc 160(3):523–541
Hu D (2019) An introductory survey on attention mechanisms in NLP problems. In: Bi Y, Bhatia R, Kapoor S (eds) Proceedings of the 5th intelligent systems conference. Springer, London, pp 432–448
Hamilton WL, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Proceedings of the 30th advances in neural information processing systems. Curran Associates Inc, Long Beach, pp 1024–1034
Ke G, Qi M, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y (2017) Lightgbm: a highly efficient gradient boosting decision tree. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Proceedings of the 30th advances in neural information processing systems. Curran Associates Inc, Long Beach, pp 3146–3154
Kvamme H, Sellereite N, Aas K, Sjursen S (2018) Predicting mortgage default using convolutional neural networks. Expert Syst Appl 102:207–217
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: Bengio Y, LeCun Y, Ranzato MA, Larochelle H, Vinyals O, Sainath T (eds) Proceedings of the 5th international conference on learning representations. OpenReview, Toulon, pp 1–14
LeCun Y, Bengio Y, Hinton GE (2015) Deep learning. Nat 521(7553):436–444
Lessmann S, Baesens B, Seow H-V, Thomas LC (2015) Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. Eur J Oper Res 247(1):124–136
Lin T-Y, Goyal P, Girshick RB, He K, Dollár P (2020) Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell 42(2):318–327
Masmoudi K, Abid L, Masmoudi A (2019) Credit risk modeling using bayesian network with a latent variable. Syst Expert Appl 127:157–166
Maldonado S, Bravo C, López J, Pérez J (2017) Integrated framework for profit-based feature selection and SVM classification in credit scoring. Decis Support Syst 104:113–121
Niu K, Zhang Z, Liu Y, Li R (2020) Resampling ensemble model based on data distribution for imbalanced credit risk evaluation in P2P lending. Inf Sci 536:120–134
Onay C, Ozturk E (2018) A review of credit scoring research in the age of big data. J Financial Regul Compliance 6(3):382–405
Plawiak P, Abdar M, Plawiak J, Makarenkov V, Rajendra Acharya U (2020) DGHNL: a new deep genetic hierarchical network of learners for prediction of credit scoring. Inf Sci 516:401–418
Sun J, Lang J, Fujita H, Li H (2018) Imbalanced enterprise credit evaluation with DTE-SBD: decision tree ensemble based on SMOTE and bagging with differentiated sampling rates. Inf Sci 425:76–91
Tobback E, Martens D (2019) Retail credit scoring using fine-grained payment data. J R Stat Soc Ser A Stat Soc 182(4):1227–1246
Velickovic P, Cucurull G, Casanova A, Romero A, Lió P., Bengio Y (2018) Graph attention networks. In: Bengio Y, LeCun Y, Sainath T, Murray I, Ranzato MA, Vinyals O (eds) Proceedings of the 6th international conference on learning representations. OpenReview, Vancouver, pp 1–12
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN , Kaiser L, Polosukhin I (2017) Attention is all you need. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Proceedings of the 30th advances in neural information processing systems. Curran Associates Inc, Long Beach, pp 5998–6008
Whitrow C, Hand DJ, Juszczak P, Weston DJ, Adams NM (2009) Transaction aggregation as a strategy for credit card fraud detection. Data Min Knowl Discov 18(1):30–55
Wang C, Han D, Liu Q, Luo S (2019) A deep learning approach for credit scoring of peer-to-peer lending using attention mechanism LSTM. IEEE Access 7:2161–2168
Wan’an L, Hong F, Min X (2021) Multi-grained and multi-layered gradient boosting decision tree for credit scoring. Appl Intell. https://doi.org/10.1007/s10489-021-02715-6
Wu F, Souza AH Jr, Zhang T, Fifty C, Tao Y u, Weinberger K. (2019) Simplifying graph convolutional networks. In: Chaudhuri K, Salakhutdinov R (eds) Proceedings of the 36th international conference on machine learning, Long Beach, California, USA, volume 97 of Proceedings of Machine Learning Research, pp 6861–6871. PMLR
Zonghan W u, Pan S, Chen F, Long G, Zhang C, Philip SYu (2019) A comprehensive survey on graph neural networks. arXiv:1901.00596, 1–22
Wang M, Lingfan Yu, Da Z, Gan Q, Gai Y u, Ye Z, Li M, Zhou J, Qi H, Ma C, Huang Z, Guo Q, Zhang H, Lin H, Zhao J, Li J, Smola AJ, Zhang Z (2019) Deep graph library: towards efficient and scalable deep learning on graphs. arXiv:1909.01315, 1–18
Keyulu X u, Weihua H u, Leskovec J, Jegelka S (2019) How powerful are graph neural networks? In: Sainath T, Rush A, Levine S, Livescu K, Mohamed S (eds) Proceedings of the 7th international conference on learning representations. OpenReview, New Orleans, pp 1–17
Zhou J, Cui G, Zhang Z, Yang C, Liu Z, Sun M (2018) Graph neural networks: a review of methods and applications. arXiv:1812.08434, 1–22
Funding
This research was funded by the University of Macau (File no. MYRG2019-00136-FST).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest/Competing interests
The authors declare that there are no conflicts of interest.
Additional information
Availability of data and material
The data are not publicly available due to them containing information that could compromise the customers’ privacy/consent.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wu, J., Zhao, X., Yuan, H. et al. CDGAT: a graph attention network method for credit card defaulters prediction. Appl Intell 53, 11538–11552 (2023). https://doi.org/10.1007/s10489-022-03996-1
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03996-1