Abstract
Representation learning in graphs has proven useful for many predictive tasks. In this paper we assess the feasibility of representation learning in a credit card fraud setting. Data analytics has been successful in predicting fraud in previous research. However, the research field has focused on techniques which require tedious and expensive hand-crafting of features. In addition, existing works often ignore information related to the network of transactions. Representation learning in graphs tackles both of these challenges. First, it provides the possibility to tap into the relational and structural aspects of the transaction network and leverage these in a predictive model. Second, it featurizes the graph without the need for manual feature engineering. This work contributes to the literature by being the first to explicitly and extensively show how fraud detection modeling can benefit from representation learning. We discern three different approaches in this paper: traditional network featurization, an inductive representation learning algorithm and a transductive representational learner. Through extensive experimental evaluation on a real-world dataset we show that state-of-the-art representation learning in graphs outperforms traditional graph featurization.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Akila, S., Reddy, U.S.: Cost-sensitive risk induced Bayesian inference bagging (RIBIB) for credit card fraud detection. J. Comput. Sci. 27, 247–254 (2018)
Aleskerov, E., Freisleben, B., Rao, B.: CARDWATCH: a neural network based database mining system for credit card fraud detection. In: Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr), pp. 220–226. IEEE (1997)
Bahnsen, A.C., Aouada, D., Stojanovic, A., Ottersten, B.: Feature engineering strategies for credit card fraud detection. Expert Syst. Appl. 51, 134–142 (2016)
Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)
Bhattacharyya, S., Jha, S., Tharakunnel, K., Westland, J.C.: Data mining for credit card fraud: a comparative study. Decis. Support Syst. 50(3), 602–613 (2011)
Bolton, R.J., Hand, D.J., et al.: Unsupervised profiling methods for fraud detection. Credit Scoring and Credit Control VII, 235–255 (2001)
Cao, S., Lu, W., Xu, Q.: GraRep: learning graph representations with global structural information. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 891–900. ACM (2015)
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7(Jan), 1–30 (2006)
Fabris, N.: Cashless society-the future of money or a Utopia? J. Cent. Bank. Theory Pract. 8(1), 53–66 (2019)
Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864. ACM (2016)
Guo, J., Xu, L., Chen, E.: Spine: structural identity preserved inductive network embedding. arXiv preprint arXiv:1802.03984 (2018)
Hamilton, W., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: Advances in Neural Information Processing Systems, pp. 1024–1034 (2017)
Hamilton, W.L., Ying, R., Leskovec, J.: Representation learning on graphs: methods and applications. CoRR abs/1709.05584 (2017)
Jiang, F., Zheng, L., Xu, J., Yu, P.: Fi-grl: Fast inductive graph representation learning via projection-cost preservation. In: 2018 IEEE International Conference on Data Mining (ICDM), pp. 1067–1072. IEEE (2018)
Jurgovsky, J., et al.: Sequence classification for credit-card fraud detection. Expert Syst. Appl. 100, 234–245 (2018)
Maes, S., Tuyls, K., Vanschoenwinkel, B., Manderick, B.: Credit card fraud detection using Bayesian and neural networks. In: Proceedings of the 1st International Naiso Congress on Neuro Fuzzy Technologies, pp. 261–270 (2002)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Mitrović, S., Baesens, B., Lemahieu, W., De Weerdt, J.: tcc2vec: RFM-informed representation learning on call graphs for churn prediction. Inf. Sci. (2019). https://doi.org/10.1016/j.ins.2019.02.044
Perozzi, B., Al-Rfou, R., Skiena, S.: DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)
Porwal, U., Mukund, S.: Credit card fraud detection in e-commerce: an outlier detection approach. arXiv preprint arXiv:1811.02196 (2018)
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pp. 45–50. ELRA (2010)
Rossi, R.A., Zhou, R., Ahmed, N.K.: Deep inductive network representation learning. In: Companion of the the Web Conference 2018 on the Web Conference 2018, pp. 953–960. International World Wide Web Conferences Steering Committee (2018)
Sánchez, D., Vila, M., Cerda, L., Serrano, J.M.: Association rules applied to credit card fraud detection. Expert Syst. Appl. 36(2), 3630–3640 (2009)
Sohony, I., Pratap, R., Nambiar, U.: Ensemble learning for credit card fraud detection. In: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, pp. 289–294. ACM (2018)
Somasundaram, A., Reddy, S.: Parallel and incremental credit card fraud detection model to handle concept drift and data imbalance. Neural Comput. Appl. 31, 1–12 (2018)
Srivastava, A., Kundu, A., Sural, S., Majumdar, A.: Credit card fraud detection using hidden Markov model. IEEE Trans. Dependable Secure Comput. 5(1), 37–48 (2008)
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077. International World Wide Web Conferences Steering Committee (2015)
Van Vlasselaer, V., et al.: APATE: a novel approach for automated credit card transaction fraud detection using network-based extensions. Decis. Support Syst. 75, 38–48 (2015)
Weston, D.J., Hand, D.J., Adams, N.M., Whitrow, C., Juszczak, P.: Plastic card fraud detection using peer group analysis. Adv. Data Anal. Classif. 2(1), 45–62 (2008)
Xu, C., Feng, Z., Chen, Y., Wang, M., Wei, T.: FeatNet: large-scale fraud device detection by network representation learning with rich features. In: Proceedings of the 11th ACM Workshop on Artificial Intelligence and Security, pp. 57–63. ACM (2018)
Xuan, S., Liu, G., Li, Z., Zheng, L., Wang, S., Jiang, C.: Random forest for credit card fraud detection. In: 2018 IEEE 15th International Conference on Networking, Sensing and Control (ICNSC), pp. 1–6. IEEE (2018)
Yu, W., Cheng, W., Aggarwal, C.C., Zhang, K., Chen, H., Wang, W.: NetWalk: a flexible deep embedding approach for anomaly detection in dynamic networks. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2672–2681. ACM (2018)
Acknowledgement
We acknowledge the support given by the Research Fund - Flanders (FWO) as Aspirant (Rafaël Van Belle).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Van Belle, R., Mitrović, S., De Weerdt, J. (2020). Representation Learning in Graphs for Credit Card Fraud Detection. In: Bitetta, V., Bordino, I., Ferretti, A., Gullo, F., Pascolutti, S., Ponti, G. (eds) Mining Data for Financial Applications. MIDAS 2019. Lecture Notes in Computer Science(), vol 11985. Springer, Cham. https://doi.org/10.1007/978-3-030-37720-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-37720-5_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37719-9
Online ISBN: 978-3-030-37720-5
eBook Packages: Computer ScienceComputer Science (R0)