CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation

Kong, Menglin; Li, Ruichen; Wang, Jia; Li, Xingquan; Jin, Shengzhong; Xie, Wanying; Hou, Muzhou; Cao, Cong

doi:10.1007/s00521-024-09546-9

CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation

Original Article
Published: 26 February 2024

Volume 36, pages 8607–8623, (2024)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Menglin Kong¹^na1,
Ruichen Li¹^na1,
Jia Wang²,
Xingquan Li³,
Shengzhong Jin¹,
Wanying Xie¹,
Muzhou Hou¹ &
…
Cong Cao ORCID: orcid.org/0000-0002-6853-6421¹

136 Accesses
Explore all metrics

Abstract

Establishing a reliable credit card fraud detection model has become a primary focus for academia and the financial industry. The existing anti-fraud methods face challenges related to low recall rates, inaccurate results, and insufficient causal modeling ability. This paper proposes a credit card fraud detection model based on counterfactual data enhancement of the triplet network. Firstly, we convert the problem of generating optimal counterfactual explanations (CFs) into a policy optimization of agents in the discrete–continuous mixed action space, thereby ensuring the stable generation of optimal CFs. The triplet network then utilizes the feature similarity and label difference of positive example samples and CFs to enhance the learning of the causal relationship between features and labels. Experimental results demonstrate that the proposed method improves the accuracy and robustness of the credit card fraud detection model, outperforming existing methods. The research outcomes are of significant value for both credit card anti-fraud research and practice while providing a novel approach to causal modeling issues across other fields.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence

Article Open access 24 August 2023

Artificial Intelligence and Fraud Detection

Explainable AI: A Brief Survey on History, Research Areas, Approaches and Challenges

Data availability

Data will be made available on request.

References

Delamaire L, Abdou H, Pointon J (2009) Credit card fraud and detection techniques: a review. Banks Bank Syst 4(2):57–68
Google Scholar
Song R, Huang L, Cui W, Oskarsdottir M, Vanthienen J (2020) Fraud detection of bulk cargo theft in port using Bayesian network models. Appl Sci 10(3):1056
Article Google Scholar
Mishra KN, Pandey SC (2021) Fraud prediction in smart societies using logistic regression and k-fold machine learning techniques. Wireless Pers Commun 119:1341–1367
Article Google Scholar
Jiang C, Lu W, Wang Z, Ding Y (2023) Benchmarking state-of-the-art imbalanced data learning approaches for credit scoring. Expert Syst Appl 213:118878
Article Google Scholar
Butaru F, Chen Q, Clark B, Das S, Lo AW, Siddique A (2016) Risk and risk management in the credit card industry. J Bank Financ 72:218–239
Article Google Scholar
Thabtah F, Hammoud S, Kamalov F, Gonsalves A (2020) Data imbalance in classification: experimental evaluation. Inf Sci 513:429–441
Article MathSciNet Google Scholar
Awoyemi JO, Adetunmbi AO, Oluwadare SA (2017) Credit card fraud detection using machine learning techniques: a comparative analysis. In: 2017 international conference on computing networking and informatics (ICCNI), pp 1–9 . IEEE
Khine AA, Khin HW (2020) Credit card fraud detection using online boosting with extremely fast decision tree. In: 2020 IEEE conference on computer applications (ICCA), pp 1–4. IEEE
Agarwal R, Melnick L, Frosst N, Zhang X, Lengerich B, Caruana R, Hinton GE (2021) Neural additive models: interpretable machine learning with neural nets. Adv Neural Inf Process Syst 34:4699–4711
Google Scholar
Bockel-Rickermann C, Verdonck T, Verbeke W (2023) Fraud analytics: a decade of research organizing challenges and solutions in the field. Expert Syst Appl 120605
Carta S, Fenu G, Recupero DR, Saia R (2019) Fraud detection for e-commerce transactions by employing a prudential multiple consensus model. J Inf Secur Appl 46:13–22
Google Scholar
Fanai H, Abbasimehr H (2023) A novel combined approach based on deep autoencoder and deep classifiers for credit card fraud detection. Expert Syst Appl 217:119562
Article Google Scholar
Aftabi SZ, Ahmadi A, Farzi S (2023) Fraud detection in financial statements using data mining and GAN models. Expert Syst Appl 227:120144
Article Google Scholar
Settipalli L, Gangadharan G (2023) WMTDBC: an unsupervised multivariate analysis model for fraud detection in health insurance claims. Expert Syst Appl 215:119259
Article Google Scholar
Mirtaheri M, Abu-El-Haija S, Morstatter F, Ver Steeg G, Galstyan A (2021) Identifying and analyzing cryptocurrency manipulations in social media. IEEE Trans Comput Soc Syst 8(3):607–617
Article Google Scholar
Wang X, Cui P, Zhu W (2021) Out-of-distribution generalization and its applications for multimedia. In: Proceedings of the 29th ACM international conference on multimedia, pp 5681–5682
Cui P, Shen Z, Li S, Yao L, Li Y, Chu Z, Gao J (2020) Causal inference meets machine learning. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 3527–3528
Kuang K, Cui P, Athey S, Xiong R, Li B (2018) Stable prediction across unknown environments. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1617–1626
Cui P, Athey S (2022) Stable learning establishes some common ground between causal inference and machine learning. Nat Mach Intell 4(2):110–115
Article Google Scholar
Mothilal RK, Sharma A, Tan C (2020) Explaining machine learning classifiers through diverse counterfactual explanations. In: Proceedings of the 2020 conference on fairness, accountability, and transparency, pp 607–617
Karimi AH, Barthe G, Balle B, Valera I (2020) Model-agnostic counterfactual explanations for consequential decisions. In: International conference on artificial intelligence and statistics, pp 895–905. PMLR
Chen Z, Silvestri F, Wang J, Zhu H, Ahn H, Tolomei G (2022) Relax: reinforcement learning agent explainer for arbitrary predictive models. In: Proceedings of the 31st ACM international conference on information & knowledge management, pp 252–261
Xiong J, Wang Q, Yang Z, Sun P, Han L, Zheng Y, Fu H, Zhang T, Liu J, Liu H (2018) Parametrized deep q-networks learning: reinforcement learning with discrete-continuous hybrid action space. arXiv preprint arXiv:1810.06394
Sailusha R, Gnaneswar V, Ramesh R, Rao GR (2020) Credit card fraud detection using machine learning. In: 2020 4th international conference on intelligent computing and control systems (ICICCS), pp 1264–1270 . IEEE
Bin Sulaiman R, Schetinin V, Sant P (2022) Review of machine learning approach on credit card fraud detection. Human-Centric Intell Syst 2(1–2):55–68
Article Google Scholar
Saia R(2018) Unbalanced data classification in fraud detection by introducing a multidimensional space analysis. In: IoTBDS, pp 29–40
Salekshahrezaee Z, Leevy JL, Khoshgoftaar TM (2023) The effect of feature extraction and data sampling on credit card fraud detection. J Big Data 10(1):6
Article Google Scholar
Harwani H, Jain J, Jadhav C, Hodavdekar M (2020) Credit card fraud detection technique using hybrid approach: an amalgamation of self organizing maps and neural networks. Int Res J Eng Technol (IRJET) 7(2020)
Voican O (2021) Credit card fraud detection using deep learning techniques. Inf Econ 25(1)
Nguyen TT, Tahir H, Abdelrazek M, Babar A (2020) Deep learning methods for credit card fraud detection. arXiv preprint arXiv:2012.03754
Van Belle R, Baesens B, De Weerdt J (2023) CATCHM: a novel network-based credit card fraud detection method using node representation learning. Decis Support Syst 164:113866
Article Google Scholar
RamaKalyani K, UmaDevi D (2012) Fraud detection of credit card payment system by genetic algorithm. Int J Sci Eng Res 3(7):1–6
Google Scholar
Jain Y, Tiwari N, Dubey S, Jain S (2019) A comparative analysis of various credit card fraud detection techniques. Int J Recent Technol Eng 7(5):402–407
Google Scholar
Phua C, Lee V, Smith K, Gayler R (2010) A comprehensive survey of data mining-based fraud detection research. Artif Intell Revi 33(3):229–246
Google Scholar
Pearl J (2000) Models reasoning and inference. vol 19, Cambridge University Press, Cambridge, p 3
Pearl J, Mackenzie D (2018) The book of why: the new science of cause and effect. Basic Books, New York
Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning, pp 1597–1607. PMLR
Dal Pozzolo A, Caelen O, Johnson RA, Bontempi G (2015) Calibrating probability with undersampling for unbalanced classification. In: 2015 IEEE symposium series on computational intelligence, pp 159–166. IEEE
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710
Prusti D, Rath SK (2019) Fraudulent transaction detection in credit card by applying ensemble machine learning techniques. In: 2019 10th international conference on computing, communication and networking technologies (ICCCNT), pp 1–6. IEEE
Dheepa V, Dhanapal R (2012) Behavior based credit card fraud detection using support vector machines. ICTACT J Soft Comput 2(4):391–397
Article Google Scholar
Sasank JS, Sahith GR, Abhinav K, Belwal M (2019) Credit card fraud detection using various classification and sampling techniques: a comparative study. In: 2019 international conference on communication and electronics systems (ICCES), pp 1713–1718. IEEE
Le T-T-H, Kim H, Kang H, Kim H (2022) Classification and explanation for intrusion detection system based on ensemble trees and shap method. Sensors 22(3):1154
Article Google Scholar
Zhang K, Xu P, Zhang J(2020) Explainable AI in deep reinforcement learning models: a shap method applied in power system emergency control. In: 2020 IEEE 4th conference on energy internet and energy system integration (EI2), pp 711–716 . IEEE
Winter E (2002) The shapley value. Handbook of game theory with economic applications, vol 3, pp 2025–2054

Download references

Acknowledgements

This study was supported by the Natural Science Foundation of Hunan Province of China(grant number 2022JJ30673), Key R &D Program of Hunan Province(grant number 2023DK2003), Foundation of Department of Science and Technology of Hunan Province(grant number 2022GK3003), and the Graduate Innovation Project of Central South University(2023XQLH032, 2023ZZTS0304).

Author information

Menglin Kong and Ruichen Li have contributed equally to this work.

Authors and Affiliations

School of Mathematics and Statistics, Central South University, Lushan Road, Changsha, 410083, Hunan, China
Menglin Kong, Ruichen Li, Shengzhong Jin, Wanying Xie, Muzhou Hou & Cong Cao
School of Advanced Technology, Xi’an Jiaotong-Liverpool University, Renai Road, Suzhou, 215123, Jiangsu, China
Jia Wang
Peng Cheng Laboratory, Xingke Road, Shenzhen, 518000, Guangdong, China
Xingquan Li

Authors

Menglin Kong
View author publications
You can also search for this author in PubMed Google Scholar
Ruichen Li
View author publications
You can also search for this author in PubMed Google Scholar
Jia Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xingquan Li
View author publications
You can also search for this author in PubMed Google Scholar
Shengzhong Jin
View author publications
You can also search for this author in PubMed Google Scholar
Wanying Xie
View author publications
You can also search for this author in PubMed Google Scholar
Muzhou Hou
View author publications
You can also search for this author in PubMed Google Scholar
Cong Cao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MK contributed to conceptualization, methodology, software, validation, formal analysis, investigation, data curation, and writing an original draft. RL contributed to investigation, resources, and data curation. JW and WX performed data curation. XL and SJ contributed to software. MH contributed to resources, writing, and review editing. CC contributed to methodology, investigation, resources, data curation, writing, and review editing.

Corresponding author

Correspondence to Cong Cao.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Kong, M., Li, R., Wang, J. et al. CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation. Neural Comput & Applic 36, 8607–8623 (2024). https://doi.org/10.1007/s00521-024-09546-9

Download citation

Received: 09 August 2023
Accepted: 22 January 2024
Published: 26 February 2024
Issue Date: May 2024
DOI: https://doi.org/10.1007/s00521-024-09546-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation

Abstract

Access this article

Similar content being viewed by others

Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence

Artificial Intelligence and Fraud Detection

Explainable AI: A Brief Survey on History, Research Areas, Approaches and Challenges

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation

Abstract

Access this article

Similar content being viewed by others

Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence

Artificial Intelligence and Fraud Detection

Explainable AI: A Brief Survey on History, Research Areas, Approaches and Challenges

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation