Representation Learning in Graphs for Credit Card Fraud Detection

Van Belle, Rafaël; Mitrović, Sandra; De Weerdt, Jochen

doi:10.1007/978-3-030-37720-5_3

Conference paper
First Online: 03 January 2020

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11985)

Abstract

Representation learning in graphs has proven useful for many predictive tasks. In this paper, we assess its feasibility in a credit card fraud setting. Previous research has shown that data analytics can predict fraud successfully. However, the field has focused on techniques that require tedious and expensive hand-crafting of features, and existing work often ignores information related to the network of transactions. Representation learning in graphs tackles both challenges. First, it makes it possible to tap into the relational and structural aspects of the transaction network and leverage them in a predictive model. Second, it featurizes the graph without manual feature engineering. This work contributes to the literature by being the first to explicitly and extensively show how fraud detection modeling can benefit from representation learning. We distinguish three approaches: traditional network featurization, an inductive representation learning algorithm, and a transductive representation learner. Through extensive experimental evaluation on a real-world dataset, we show that state-of-the-art representation learning in graphs outperforms traditional graph featurization.
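
As a rough illustration of the contrast drawn in the abstract, the following Python sketch builds a toy cardholder-merchant graph, computes one hand-picked network feature (node degree), and generates the random walks that transductive learners in the DeepWalk family typically feed to a skip-gram model. The transactions, the function names, and the choice of degree and uniform walks are assumptions made for illustration; the abstract does not specify the graph construction or the algorithms evaluated in the paper.

```python
import random
from collections import defaultdict

# Hypothetical toy transactions (cardholder, merchant); not the paper's dataset.
TRANSACTIONS = [
    ("card_A", "shop_1"),
    ("card_A", "shop_2"),
    ("card_B", "shop_2"),
    ("card_B", "shop_3"),
    ("card_C", "shop_3"),
]


def build_graph(transactions):
    """Build an undirected bipartite adjacency list from transactions."""
    adjacency = defaultdict(set)
    for card, merchant in transactions:
        adjacency[card].add(merchant)
        adjacency[merchant].add(card)
    # Sort neighbours so that seeded walks are reproducible.
    return {node: sorted(neighbours) for node, neighbours in adjacency.items()}


def degree_features(graph):
    """Traditional featurization: one manually chosen statistic per node."""
    return {node: len(neighbours) for node, neighbours in graph.items()}


def random_walks(graph, walk_length=5, walks_per_node=2, seed=0):
    """Generate uniform random walks, the 'sentences' a skip-gram model would consume."""
    rng = random.Random(seed)
    walks = []
    for _ in range(walks_per_node):
        for start in sorted(graph):
            walk = [start]
            while len(walk) < walk_length:
                # Every node here has at least one neighbour, so choice() is safe.
                walk.append(rng.choice(graph[walk[-1]]))
            walks.append(walk)
    return walks


graph = build_graph(TRANSACTIONS)
features = degree_features(graph)
walks = random_walks(graph)
```

In a hand-crafted pipeline, `features` would go straight into a classifier. In a representation learning pipeline, `walks` would instead be used to train node embeddings, so the analyst does not have to decide in advance which graph statistics matter.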

Acknowledgement

We acknowledge the support given by the Research Fund - Flanders (FWO) as Aspirant (Rafaël Van Belle).

Author information

Authors and Affiliations

Research Center for Information Systems Engineering, KU Leuven, Naamsestraat 69, 3000, Leuven, Belgium
Rafaël Van Belle, Sandra Mitrović & Jochen De Weerdt

Corresponding author

Correspondence to Rafaël Van Belle.

Editor information

Editors and Affiliations

UniCredit, Milan, Italy
Valerio Bitetta
UniCredit, Rome, Italy
Ilaria Bordino
UniCredit, Milan, Italy
Andrea Ferretti
UniCredit, Rome, Italy
Francesco Gullo
UniCredit, Milan, Italy
Stefano Pascolutti
ENEA Portici Research Center, Portici, Italy
Giovanni Ponti

About this paper

Cite this paper

Van Belle, R., Mitrović, S., De Weerdt, J. (2020). Representation Learning in Graphs for Credit Card Fraud Detection. In: Bitetta, V., Bordino, I., Ferretti, A., Gullo, F., Pascolutti, S., Ponti, G. (eds) Mining Data for Financial Applications. MIDAS 2019. Lecture Notes in Computer Science, vol 11985. Springer, Cham. https://doi.org/10.1007/978-3-030-37720-5_3

DOI: https://doi.org/10.1007/978-3-030-37720-5_3
Published: 03 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37719-9
Online ISBN: 978-3-030-37720-5
eBook Packages: Computer Science, Computer Science (R0)
