Abstract
Establishing a reliable credit card fraud detection model has become a primary focus for academia and the financial industry. The existing anti-fraud methods face challenges related to low recall rates, inaccurate results, and insufficient causal modeling ability. This paper proposes a credit card fraud detection model based on counterfactual data enhancement of the triplet network. Firstly, we convert the problem of generating optimal counterfactual explanations (CFs) into a policy optimization of agents in the discrete–continuous mixed action space, thereby ensuring the stable generation of optimal CFs. The triplet network then utilizes the feature similarity and label difference of positive example samples and CFs to enhance the learning of the causal relationship between features and labels. Experimental results demonstrate that the proposed method improves the accuracy and robustness of the credit card fraud detection model, outperforming existing methods. The research outcomes are of significant value for both credit card anti-fraud research and practice while providing a novel approach to causal modeling issues across other fields.
Similar content being viewed by others
Data availability
Data will be made available on request.
References
Delamaire L, Abdou H, Pointon J (2009) Credit card fraud and detection techniques: a review. Banks Bank Syst 4(2):57–68
Song R, Huang L, Cui W, Oskarsdottir M, Vanthienen J (2020) Fraud detection of bulk cargo theft in port using Bayesian network models. Appl Sci 10(3):1056
Mishra KN, Pandey SC (2021) Fraud prediction in smart societies using logistic regression and k-fold machine learning techniques. Wireless Pers Commun 119:1341–1367
Jiang C, Lu W, Wang Z, Ding Y (2023) Benchmarking state-of-the-art imbalanced data learning approaches for credit scoring. Expert Syst Appl 213:118878
Butaru F, Chen Q, Clark B, Das S, Lo AW, Siddique A (2016) Risk and risk management in the credit card industry. J Bank Financ 72:218–239
Thabtah F, Hammoud S, Kamalov F, Gonsalves A (2020) Data imbalance in classification: experimental evaluation. Inf Sci 513:429–441
Awoyemi JO, Adetunmbi AO, Oluwadare SA (2017) Credit card fraud detection using machine learning techniques: a comparative analysis. In: 2017 international conference on computing networking and informatics (ICCNI), pp 1–9 . IEEE
Khine AA, Khin HW (2020) Credit card fraud detection using online boosting with extremely fast decision tree. In: 2020 IEEE conference on computer applications (ICCA), pp 1–4. IEEE
Agarwal R, Melnick L, Frosst N, Zhang X, Lengerich B, Caruana R, Hinton GE (2021) Neural additive models: interpretable machine learning with neural nets. Adv Neural Inf Process Syst 34:4699–4711
Bockel-Rickermann C, Verdonck T, Verbeke W (2023) Fraud analytics: a decade of research organizing challenges and solutions in the field. Expert Syst Appl 120605
Carta S, Fenu G, Recupero DR, Saia R (2019) Fraud detection for e-commerce transactions by employing a prudential multiple consensus model. J Inf Secur Appl 46:13–22
Fanai H, Abbasimehr H (2023) A novel combined approach based on deep autoencoder and deep classifiers for credit card fraud detection. Expert Syst Appl 217:119562
Aftabi SZ, Ahmadi A, Farzi S (2023) Fraud detection in financial statements using data mining and GAN models. Expert Syst Appl 227:120144
Settipalli L, Gangadharan G (2023) WMTDBC: an unsupervised multivariate analysis model for fraud detection in health insurance claims. Expert Syst Appl 215:119259
Mirtaheri M, Abu-El-Haija S, Morstatter F, Ver Steeg G, Galstyan A (2021) Identifying and analyzing cryptocurrency manipulations in social media. IEEE Trans Comput Soc Syst 8(3):607–617
Wang X, Cui P, Zhu W (2021) Out-of-distribution generalization and its applications for multimedia. In: Proceedings of the 29th ACM international conference on multimedia, pp 5681–5682
Cui P, Shen Z, Li S, Yao L, Li Y, Chu Z, Gao J (2020) Causal inference meets machine learning. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 3527–3528
Kuang K, Cui P, Athey S, Xiong R, Li B (2018) Stable prediction across unknown environments. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1617–1626
Cui P, Athey S (2022) Stable learning establishes some common ground between causal inference and machine learning. Nat Mach Intell 4(2):110–115
Mothilal RK, Sharma A, Tan C (2020) Explaining machine learning classifiers through diverse counterfactual explanations. In: Proceedings of the 2020 conference on fairness, accountability, and transparency, pp 607–617
Karimi AH, Barthe G, Balle B, Valera I (2020) Model-agnostic counterfactual explanations for consequential decisions. In: International conference on artificial intelligence and statistics, pp 895–905. PMLR
Chen Z, Silvestri F, Wang J, Zhu H, Ahn H, Tolomei G (2022) Relax: reinforcement learning agent explainer for arbitrary predictive models. In: Proceedings of the 31st ACM international conference on information & knowledge management, pp 252–261
Xiong J, Wang Q, Yang Z, Sun P, Han L, Zheng Y, Fu H, Zhang T, Liu J, Liu H (2018) Parametrized deep q-networks learning: reinforcement learning with discrete-continuous hybrid action space. arXiv preprint arXiv:1810.06394
Sailusha R, Gnaneswar V, Ramesh R, Rao GR (2020) Credit card fraud detection using machine learning. In: 2020 4th international conference on intelligent computing and control systems (ICICCS), pp 1264–1270 . IEEE
Bin Sulaiman R, Schetinin V, Sant P (2022) Review of machine learning approach on credit card fraud detection. Human-Centric Intell Syst 2(1–2):55–68
Saia R(2018) Unbalanced data classification in fraud detection by introducing a multidimensional space analysis. In: IoTBDS, pp 29–40
Salekshahrezaee Z, Leevy JL, Khoshgoftaar TM (2023) The effect of feature extraction and data sampling on credit card fraud detection. J Big Data 10(1):6
Harwani H, Jain J, Jadhav C, Hodavdekar M (2020) Credit card fraud detection technique using hybrid approach: an amalgamation of self organizing maps and neural networks. Int Res J Eng Technol (IRJET) 7(2020)
Voican O (2021) Credit card fraud detection using deep learning techniques. Inf Econ 25(1)
Nguyen TT, Tahir H, Abdelrazek M, Babar A (2020) Deep learning methods for credit card fraud detection. arXiv preprint arXiv:2012.03754
Van Belle R, Baesens B, De Weerdt J (2023) CATCHM: a novel network-based credit card fraud detection method using node representation learning. Decis Support Syst 164:113866
RamaKalyani K, UmaDevi D (2012) Fraud detection of credit card payment system by genetic algorithm. Int J Sci Eng Res 3(7):1–6
Jain Y, Tiwari N, Dubey S, Jain S (2019) A comparative analysis of various credit card fraud detection techniques. Int J Recent Technol Eng 7(5):402–407
Phua C, Lee V, Smith K, Gayler R (2010) A comprehensive survey of data mining-based fraud detection research. Artif Intell Revi 33(3):229–246
Pearl J (2000) Models reasoning and inference. vol 19, Cambridge University Press, Cambridge, p 3
Pearl J, Mackenzie D (2018) The book of why: the new science of cause and effect. Basic Books, New York
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning, pp 1597–1607. PMLR
Dal Pozzolo A, Caelen O, Johnson RA, Bontempi G (2015) Calibrating probability with undersampling for unbalanced classification. In: 2015 IEEE symposium series on computational intelligence, pp 159–166. IEEE
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710
Prusti D, Rath SK (2019) Fraudulent transaction detection in credit card by applying ensemble machine learning techniques. In: 2019 10th international conference on computing, communication and networking technologies (ICCCNT), pp 1–6. IEEE
Dheepa V, Dhanapal R (2012) Behavior based credit card fraud detection using support vector machines. ICTACT J Soft Comput 2(4):391–397
Sasank JS, Sahith GR, Abhinav K, Belwal M (2019) Credit card fraud detection using various classification and sampling techniques: a comparative study. In: 2019 international conference on communication and electronics systems (ICCES), pp 1713–1718. IEEE
Le T-T-H, Kim H, Kang H, Kim H (2022) Classification and explanation for intrusion detection system based on ensemble trees and shap method. Sensors 22(3):1154
Zhang K, Xu P, Zhang J(2020) Explainable AI in deep reinforcement learning models: a shap method applied in power system emergency control. In: 2020 IEEE 4th conference on energy internet and energy system integration (EI2), pp 711–716 . IEEE
Winter E (2002) The shapley value. Handbook of game theory with economic applications, vol 3, pp 2025–2054
Acknowledgements
This study was supported by the Natural Science Foundation of Hunan Province of China(grant number 2022JJ30673), Key R &D Program of Hunan Province(grant number 2023DK2003), Foundation of Department of Science and Technology of Hunan Province(grant number 2022GK3003), and the Graduate Innovation Project of Central South University(2023XQLH032, 2023ZZTS0304).
Author information
Authors and Affiliations
Contributions
MK contributed to conceptualization, methodology, software, validation, formal analysis, investigation, data curation, and writing an original draft. RL contributed to investigation, resources, and data curation. JW and WX performed data curation. XL and SJ contributed to software. MH contributed to resources, writing, and review editing. CC contributed to methodology, investigation, resources, data curation, writing, and review editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kong, M., Li, R., Wang, J. et al. CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation. Neural Comput & Applic 36, 8607–8623 (2024). https://doi.org/10.1007/s00521-024-09546-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-024-09546-9