Skip to main content
Log in

CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Establishing a reliable credit card fraud detection model has become a primary focus for academia and the financial industry. The existing anti-fraud methods face challenges related to low recall rates, inaccurate results, and insufficient causal modeling ability. This paper proposes a credit card fraud detection model based on counterfactual data enhancement of the triplet network. Firstly, we convert the problem of generating optimal counterfactual explanations (CFs) into a policy optimization of agents in the discrete–continuous mixed action space, thereby ensuring the stable generation of optimal CFs. The triplet network then utilizes the feature similarity and label difference of positive example samples and CFs to enhance the learning of the causal relationship between features and labels. Experimental results demonstrate that the proposed method improves the accuracy and robustness of the credit card fraud detection model, outperforming existing methods. The research outcomes are of significant value for both credit card anti-fraud research and practice while providing a novel approach to causal modeling issues across other fields.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

Data will be made available on request.

References

  1. Delamaire L, Abdou H, Pointon J (2009) Credit card fraud and detection techniques: a review. Banks Bank Syst 4(2):57–68

    Google Scholar 

  2. Song R, Huang L, Cui W, Oskarsdottir M, Vanthienen J (2020) Fraud detection of bulk cargo theft in port using Bayesian network models. Appl Sci 10(3):1056

    Article  Google Scholar 

  3. Mishra KN, Pandey SC (2021) Fraud prediction in smart societies using logistic regression and k-fold machine learning techniques. Wireless Pers Commun 119:1341–1367

    Article  Google Scholar 

  4. Jiang C, Lu W, Wang Z, Ding Y (2023) Benchmarking state-of-the-art imbalanced data learning approaches for credit scoring. Expert Syst Appl 213:118878

    Article  Google Scholar 

  5. Butaru F, Chen Q, Clark B, Das S, Lo AW, Siddique A (2016) Risk and risk management in the credit card industry. J Bank Financ 72:218–239

    Article  Google Scholar 

  6. Thabtah F, Hammoud S, Kamalov F, Gonsalves A (2020) Data imbalance in classification: experimental evaluation. Inf Sci 513:429–441

    Article  MathSciNet  Google Scholar 

  7. Awoyemi JO, Adetunmbi AO, Oluwadare SA (2017) Credit card fraud detection using machine learning techniques: a comparative analysis. In: 2017 international conference on computing networking and informatics (ICCNI), pp 1–9 . IEEE

  8. Khine AA, Khin HW (2020) Credit card fraud detection using online boosting with extremely fast decision tree. In: 2020 IEEE conference on computer applications (ICCA), pp 1–4. IEEE

  9. Agarwal R, Melnick L, Frosst N, Zhang X, Lengerich B, Caruana R, Hinton GE (2021) Neural additive models: interpretable machine learning with neural nets. Adv Neural Inf Process Syst 34:4699–4711

    Google Scholar 

  10. Bockel-Rickermann C, Verdonck T, Verbeke W (2023) Fraud analytics: a decade of research organizing challenges and solutions in the field. Expert Syst Appl 120605

  11. Carta S, Fenu G, Recupero DR, Saia R (2019) Fraud detection for e-commerce transactions by employing a prudential multiple consensus model. J Inf Secur Appl 46:13–22

    Google Scholar 

  12. Fanai H, Abbasimehr H (2023) A novel combined approach based on deep autoencoder and deep classifiers for credit card fraud detection. Expert Syst Appl 217:119562

    Article  Google Scholar 

  13. Aftabi SZ, Ahmadi A, Farzi S (2023) Fraud detection in financial statements using data mining and GAN models. Expert Syst Appl 227:120144

    Article  Google Scholar 

  14. Settipalli L, Gangadharan G (2023) WMTDBC: an unsupervised multivariate analysis model for fraud detection in health insurance claims. Expert Syst Appl 215:119259

    Article  Google Scholar 

  15. Mirtaheri M, Abu-El-Haija S, Morstatter F, Ver Steeg G, Galstyan A (2021) Identifying and analyzing cryptocurrency manipulations in social media. IEEE Trans Comput Soc Syst 8(3):607–617

    Article  Google Scholar 

  16. Wang X, Cui P, Zhu W (2021) Out-of-distribution generalization and its applications for multimedia. In: Proceedings of the 29th ACM international conference on multimedia, pp 5681–5682

  17. Cui P, Shen Z, Li S, Yao L, Li Y, Chu Z, Gao J (2020) Causal inference meets machine learning. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 3527–3528

  18. Kuang K, Cui P, Athey S, Xiong R, Li B (2018) Stable prediction across unknown environments. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1617–1626

  19. Cui P, Athey S (2022) Stable learning establishes some common ground between causal inference and machine learning. Nat Mach Intell 4(2):110–115

    Article  Google Scholar 

  20. Mothilal RK, Sharma A, Tan C (2020) Explaining machine learning classifiers through diverse counterfactual explanations. In: Proceedings of the 2020 conference on fairness, accountability, and transparency, pp 607–617

  21. Karimi AH, Barthe G, Balle B, Valera I (2020) Model-agnostic counterfactual explanations for consequential decisions. In: International conference on artificial intelligence and statistics, pp 895–905. PMLR

  22. Chen Z, Silvestri F, Wang J, Zhu H, Ahn H, Tolomei G (2022) Relax: reinforcement learning agent explainer for arbitrary predictive models. In: Proceedings of the 31st ACM international conference on information & knowledge management, pp 252–261

  23. Xiong J, Wang Q, Yang Z, Sun P, Han L, Zheng Y, Fu H, Zhang T, Liu J, Liu H (2018) Parametrized deep q-networks learning: reinforcement learning with discrete-continuous hybrid action space. arXiv preprint arXiv:1810.06394

  24. Sailusha R, Gnaneswar V, Ramesh R, Rao GR (2020) Credit card fraud detection using machine learning. In: 2020 4th international conference on intelligent computing and control systems (ICICCS), pp 1264–1270 . IEEE

  25. Bin Sulaiman R, Schetinin V, Sant P (2022) Review of machine learning approach on credit card fraud detection. Human-Centric Intell Syst 2(1–2):55–68

    Article  Google Scholar 

  26. Saia R(2018) Unbalanced data classification in fraud detection by introducing a multidimensional space analysis. In: IoTBDS, pp 29–40

  27. Salekshahrezaee Z, Leevy JL, Khoshgoftaar TM (2023) The effect of feature extraction and data sampling on credit card fraud detection. J Big Data 10(1):6

    Article  Google Scholar 

  28. Harwani H, Jain J, Jadhav C, Hodavdekar M (2020) Credit card fraud detection technique using hybrid approach: an amalgamation of self organizing maps and neural networks. Int Res J Eng Technol (IRJET) 7(2020)

  29. Voican O (2021) Credit card fraud detection using deep learning techniques. Inf Econ 25(1)

  30. Nguyen TT, Tahir H, Abdelrazek M, Babar A (2020) Deep learning methods for credit card fraud detection. arXiv preprint arXiv:2012.03754

  31. Van Belle R, Baesens B, De Weerdt J (2023) CATCHM: a novel network-based credit card fraud detection method using node representation learning. Decis Support Syst 164:113866

    Article  Google Scholar 

  32. RamaKalyani K, UmaDevi D (2012) Fraud detection of credit card payment system by genetic algorithm. Int J Sci Eng Res 3(7):1–6

    Google Scholar 

  33. Jain Y, Tiwari N, Dubey S, Jain S (2019) A comparative analysis of various credit card fraud detection techniques. Int J Recent Technol Eng 7(5):402–407

    Google Scholar 

  34. Phua C, Lee V, Smith K, Gayler R (2010) A comprehensive survey of data mining-based fraud detection research. Artif Intell Revi 33(3):229–246

    Google Scholar 

  35. Pearl J (2000) Models reasoning and inference. vol 19, Cambridge University Press, Cambridge, p 3

  36. Pearl J, Mackenzie D (2018) The book of why: the new science of cause and effect. Basic Books, New York

    Google Scholar 

  37. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  38. Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning, pp 1597–1607. PMLR

  39. Dal Pozzolo A, Caelen O, Johnson RA, Bontempi G (2015) Calibrating probability with undersampling for unbalanced classification. In: 2015 IEEE symposium series on computational intelligence, pp 159–166. IEEE

  40. Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710

  41. Prusti D, Rath SK (2019) Fraudulent transaction detection in credit card by applying ensemble machine learning techniques. In: 2019 10th international conference on computing, communication and networking technologies (ICCCNT), pp 1–6. IEEE

  42. Dheepa V, Dhanapal R (2012) Behavior based credit card fraud detection using support vector machines. ICTACT J Soft Comput 2(4):391–397

    Article  Google Scholar 

  43. Sasank JS, Sahith GR, Abhinav K, Belwal M (2019) Credit card fraud detection using various classification and sampling techniques: a comparative study. In: 2019 international conference on communication and electronics systems (ICCES), pp 1713–1718. IEEE

  44. Le T-T-H, Kim H, Kang H, Kim H (2022) Classification and explanation for intrusion detection system based on ensemble trees and shap method. Sensors 22(3):1154

    Article  Google Scholar 

  45. Zhang K, Xu P, Zhang J(2020) Explainable AI in deep reinforcement learning models: a shap method applied in power system emergency control. In: 2020 IEEE 4th conference on energy internet and energy system integration (EI2), pp 711–716 . IEEE

  46. Winter E (2002) The shapley value. Handbook of game theory with economic applications, vol 3, pp 2025–2054

Download references

Acknowledgements

This study was supported by the Natural Science Foundation of Hunan Province of China(grant number 2022JJ30673), Key R &D Program of Hunan Province(grant number 2023DK2003), Foundation of Department of Science and Technology of Hunan Province(grant number 2022GK3003), and the Graduate Innovation Project of Central South University(2023XQLH032, 2023ZZTS0304).

Author information

Authors and Affiliations

Authors

Contributions

MK contributed to conceptualization, methodology, software, validation, formal analysis, investigation, data curation, and writing an original draft. RL contributed to investigation, resources, and data curation. JW and WX performed data curation. XL and SJ contributed to software. MH contributed to resources, writing, and review editing. CC contributed to methodology, investigation, resources, data curation, writing, and review editing.

Corresponding author

Correspondence to Cong Cao.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kong, M., Li, R., Wang, J. et al. CFTNet: a robust credit card fraud detection model enhanced by counterfactual data augmentation. Neural Comput & Applic 36, 8607–8623 (2024). https://doi.org/10.1007/s00521-024-09546-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-024-09546-9

Keywords

Navigation