research-article

Open access

Collaborative Fraud Detection on Large Scale Graph Using Secure Multi-Party Computation

Authors:

Wei XuAuthors Info & Claims

CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management

Pages 1473 - 1482

https://doi.org/10.1145/3627673.3679863

Published: 21 October 2024 Publication History

Abstract

Enabling various parties to share data enhances online fraud detection capabilities considering fraudsters tend to reuse resources attacking multiple platforms. Multi-party computation (MPC) techniques, such as secret sharing, offer potential privacy-preserving solutions but face efficiency challenges when handling large-scale data. This paper presents a novel approach, SecureFD (Secure Fraud Detector), aimed at detecting fraud in multi-party graph data, ensuring privacy, accuracy, and scalability. We propose a graph neural network EPR-GNN, which is MPC-friendly, as the base detector. Then we design a framework that allows multiple parties to train EPR-GNN collaboratively on secure sparse graphs in a privacy- preserving manner. The oblivious node embedding sharing protocol in the collaborative training procedure achieves up to a 45× speed-up, supporting over four million users compared to the naive solution. Additionally, we further reduce secure computation by locally pruning a significant number of non-suspicious users and selecting only the most valuable resources for sharing. Experiments on real datasets demonstrate that by securely integrating data from different parties, SecureFD achieves superior detection performance compared to state-of-the-art local detectors. And the local pruning greatly improves the scalability without compromising detection accuracies.

References

[1]

Panos Alexopoulos, Kostas Kafentzis, Xanthi Benetou, Tassos Tagaris, and Panos Georgolios. 2008. Towards a Generic Fraud Ontology in e-Government. In International Conference on E-Business. 269--276.

[2]

Toshinori Araki, Jun Furukawa, Yehuda Lindell, Ariel Nof, and Kazuma Ohara. 2016. High-throughput Semi-Honest Secure Three-Party Computation with an Honest Majority. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security. 805--817.

Digital Library

[3]

Muhammad Ajmal Azad, Samiran Bag, Shazia Tabassum, and Feng Hao. 2020. Privy: Privacy Preserving Collaboration Across Multiple Service Providers to Combat Telecom Spams. IEEE Transactions on Emerging Topics in Computing, Vol. 8, 2 (2020), 313--327. https://doi.org/10.1109/TETC.2017.2771251

[4]

Yikun Ban, Xin Liu, Yitao Duan, Xue Liu, and Wei Xu. 2019. No Place to Hide: Catching Fraudulent Entities in Tensors. In Proceedings of The Web Conference 2019.

[5]

Mihir Bellare, Viet Tung Hoang, and Phillip Rogaway. 2012. Foundations of Garbled Circuits. In Proceedings of the 2012 ACM conference on Computer and communications security. 784--796.

Digital Library

[6]

Alex Beutel, Wanhong Xu, Venkatesan Guruswami, Christopher Palow, and Christos Faloutsos. 2013. CopyCatch: Stopping Group Attacks by Spotting Lockstep Behavior In Social Networks. In WWW. 119--130.

[7]

Siddharth Bhatia, Mohit Wadhwa, Kenji Kawaguchi, Neil Shah, Philip S. Yu, and Bryan Hooi. 2023. Sketch-Based Anomaly Detection in Streaming Graphs. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 93--104.

Digital Library

[8]

Richard J. Bolton and David J. Hand. 2002. Statistical Fraud Detection: A Review. Statist. Sci., Vol. 17, 3 (2002), 235--249.

[9]

Elette Boyle, Niv Gilboa, and Yuval Ishai. 2016. Breaking the Circuit Size Barrier for Secure Computation Under DDH. In CRYPTO 2016.

[10]

Ran Canetti. 2001. Universally Composable Security: A New Paradigm for Cryptographic Protocols. In 42nd IEEE Symposium on Foundations of Computer Science. IEEE, 136--145.

[11]

Qiang Cao, Xiaowei Yang, Jieqi Yu, and Christopher Palow. 2014. Uncovering Large Groups of Active Malicious Accounts in Online Social Networks. In ACM SIGSAC Conference on Computer and Communications Security. ACM, 477--488.

[12]

Bo Chen, Calvin Hawkins, Kasra Yazdani, and Matthew Hale. 2021. Edge Differential Privacy for Algebraic Connectivity of Graphs. In 2021 60th IEEE Conference on Decision and Control (CDC). 2764--2769.

[13]

Chaochao Chen, Jun Zhou, L. xilinx Wang, Xibin Wu, Wenjing Fang, Jin Tan, Lei Wang, Xiaoxi Ji, Alex X. Liu, Hao Wang, and Cheng Hong. 2021. When Homomorphic Encryption Marries Secret Sharing: Secure Large-Scale Sparse Logistic Regression and Applications in Risk Control. In 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining.

[14]

Kewei Cheng, Tao Fan, Yilun Jin, Yang Liu, Tianjian Chen, Dimitrios Papadopoulos, and Qiang Yang. 2021. SecureBoost: A Lossless Federated Learning Framework. IEEE Intelligent Systems, Vol. 36, 6 (2021), 87--98.

Digital Library

[15]

Eli Chien, Jianhao Peng, Pan Li, and Olgica Milenkovic. 2021. Adaptive Universal Generalized PageRank Graph Neural Network. In International Conference on Learning Representations. https://openreview.net/forum?id=n6jl7fLxrP

[16]

Yingtong Dou, Zhiwei Liu, Li Sun, Yutong Deng, Hao Peng, and Philip S. Yu. 2020. Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters. Proceedings of the 29th ACM International Conference on Information & Knowledge Management (2020).

Digital Library

[17]

Yingtong Dou, Guixiang Ma, Philip S. Yu, and Sihong Xie. 2020. Robust Spammer Detection by Nash Reinforcement Learning. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 924--933.

Digital Library

[18]

Xiaoyu Fan, Kun Chen, Guosai Wang, Mingchun Zhuang, Yi Li, and Wei Xu. 2022. NFGen: Automatic Non-linear Function Evaluation Code Generator for General-purpose MPC Platforms. In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security (CCS '22).

Digital Library

[19]

Craig Gentry. 2009. Fully Homomorphic Encryption Using Ideal Lattices. In STOC '09.

[20]

Shai Halevi and Victor Shoup. 2014. Algorithms in HElib. IACR Cryptology ePrint Archive, Vol. 2014 (2014), 106.

[21]

Koki Hamada, Ryo Kikuchi, Dai Ikarashi, Koji Chida, and Katsumi Takahashi. 2012. Practically Efficient Multi-Party Sorting Protocols from Comparison Sort Algorithms. In 15th International Conference on Information Security and Cryptology. 202--216.

Digital Library

[22]

Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. In Advances in Neural Information Processing Systems. 1024--1034.

[23]

Bryan Hooi, Hyun Ah Song, Alex Beutel, Neil Shah, Kijung Shin, and Christos Faloutsos. 2016. FRAUDAR: Bounding Graph Fraud in the Face of Camouflage. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 895--904.

Digital Library

[24]

Chiraag Juvekar, Vinod Vaikuntanathan, and Anantha Chandrakasan. 2018. Gazelle: A Low Latency Framework for Secure Neural Network Inference. In IACR Cryptology ePrint Archive.

[25]

Thomas N. Kipf and Max Welling. 2007. Semi-Supervised Classification with Graph Convolutional Networks. In Proceedings of the International Conference on Learning Representations.

[26]

Peeter Laud. 2015. Parallel Oblivious Array Access for Secure Multiparty Computation and Privacy-Preserving Minimum Spanning Trees. Proceedings on Privacy Enhancing Technologies, Vol. 2015 (2015), 188 -- 205.

[27]

Pan Li, Eli Chien, and Olgica Milenkovic. 2019. Optimizing generalized PageRank methods for seed-expansion community detection. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. Curran Associates Inc., Red Hook, NY, USA, Article 1050, 12 pages.

Digital Library

[28]

Yi Li and Wei Xu. 2019. PrivPy: General and Scalable Privacy-Preserving Data Mining. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.

Digital Library

[29]

Yang Liu, Xiang Ao, Zidi Qin, Jianfeng Chi, Jinghua Feng, Hao Yang, and Qing He. 2021. Pick and Choose: A GNN-based Imbalanced Learning Approach for Fraud Detection. In Proceedings of the Web Conference 2021. 3168--3177.

Digital Library

[30]

Ziqi Liu, Chaochao Chen, Xinxing Yang, Jun Zhou, Xiaolong Li, and Le Song. 2018. Heterogeneous Graph Neural Networks for Malicious Account Detection. In CIKM. ACM, 2077--2085.

Digital Library

[31]

Wenjie Lu, Shohei Kawasaki, and Jun Sakuma. 2016. Using Fully Homomorphic Encryption for Statistical Analysis of Categorical, Ordinal and Numerical Data. IACR Cryptology ePrint Archive, Vol. 2016 (2016), 1163.

[32]

Payman Mohassel and Peter Rindal. 2018. ABY3: A Mixed Protocol Framework for Machine Learning. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security.

Digital Library

[33]

Jianmo Ni, Jiacheng Li, and Julian McAuley. 2019. Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects. In EMNLP.

[34]

U.S. Department of Justice Federal Bureau of Investigation. 2015. 2015 Internet Crime Report. https://pdf.ic3.gov/2015_IC3Report.pdf.

[35]

Sofya Raskhodnikova and Adam Smith. 2016. Differentially Private Analysis of Graphs. Springer New York, New York, NY, 543--547. https://doi.org/10.1007/978--1--4939--2864--4_549

[36]

Kijung Shin, Bryan Hooi, and Christo Faloutsos. 2018. Fast, Accurate, and Flexible Algorithms for Dense Subtensor Mining. TKDD, Vol. 12, 3 (2018), 28.

Digital Library

[37]

Kijung Shin, Bryan Hooi, Jisu Kim, and Christos Faloutsos. 2017. D-Cube: Dense-Block Detection in Terabyte-Scale Tensors. In WSDM. 681--689.

[38]

Erez Shmueli and Tamir Tassa. 2017. Secure Multi-Party Protocols for Item-Based Collaborative Filtering. In RecSys '17.

[39]

Kurt Thomas, Danny Huang, David Wang, Elie Bursztein, Chris Grier, Thomas J. Holt, Christopher Kruegel, Damon McCoy, Stefan Savage, and Giovanni Vigna. 2015. Framing Dependencies Introduced by Underground Commoditization. In Workshop on the Economics of Information Security.

[40]

Tian Tian, Jun Zhu, Fen Xia, Xin Zhuang, and Tong Zhang. 2015. Crowd fraud detection in internet advertising. In WWW. 1100--1110.

[41]

Sameer Wagh, Divya Gupta, and Nishanth Chandran. 2019. SecureNN: 3-Party Secure Computation for Neural Network Training. Proceedings on Privacy Enhancing Technologies, Vol. 2019 (2019), 26 -- 49.

[42]

Xueyu Wu, Zhuoran Ji, and Cho-Li Wang. 2023. Embedding Communication for Federated Graph Neural Networks with Privacy Guarantees. In 2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS). 305--315. https://doi.org/10.1109/ICDCS57875.2023.00029

[43]

Sheng Xiang, Mingzhi Zhu, Dawei Cheng, Enxia Li, Ruihui Zhao, Yi Ouyang, Ling Chen, and Yefeng Zheng. 2023. Semi-Supervised Credit Card Fraud Detection via Attribute-Driven Graph Representation. In The Annual AAAI Conference on Artificial Intelligence.

Digital Library

[44]

Han Xie, Jing Ma, Li Xiong, and Carl Yang. 2021. Federated Graph Classification over Non-IID Graphs. Advances in Neural Information Processing Systems, Vol. 34 (2021), 18839--18852.

[45]

Ibrahim Yakut and Huseyin Polat. 2012. Arbitrarily Distributed Data-Based Recommendations with Privacy. Data & Knowledge Engineering, Vol. 72 (2012), 239--256.

Digital Library

[46]

Chao Yang, Robert Harkreader, Jialong Zhang, Seungwon Shin, and Guofei Gu. 2012. Analyzing Spammers' Social Networks for Fun and Profit: A Case Study of Cyber Criminal Ecosystem on Twitter. In 21st International Conference on World Wide Web.

Digital Library

[47]

Han Zhang, Wenhao Zheng, Charley Chen, Kevin Gao, Yao Hu, Ling Huang, and Wei Xu. 2020. Modeling Heterogeneous Statistical Patterns in High-dimensional Data by Adversarial Distributions: An Unsupervised Generative Framework. In Proceedings of The Web Conference 2020 (Taipei, Taiwan) (WWW '20). 1389--1399. https://doi.org/10.1145/3366423.3380213

Digital Library

[48]

Ke Zhang, Carl Yang, Xiaoxiao Li, Lichao Sun, and Siu Ming Yiu. 2021. Subgraph Federated Learning with Missing Neighbor Generation. Advances in Neural Information Processing Systems, Vol. 34 (2021), 6671--6682.

[49]

Jun Zhou, Chaochao Chen, Longfei Zheng, Huiwen Wu, Jia Wu, Xiaolin Zheng, Bingzhe Wu, Ziqi Liu, and Li Wang. 2022. Vertically Federated Graph Neural Network for Privacy-Preserving Node Classification. In Proceedings of the International Joint Conference on Artificial Intelligence 2022.

Index Terms

Collaborative Fraud Detection on Large Scale Graph Using Secure Multi-Party Computation
1. Security and privacy
  1. Software and application security
    1. Domain-specific security and privacy architectures

Recommendations

An efficient fair UC-secure protocol for two-party computation

With the development of modern Internet and mobile networks, there is an increasing need for collaborative privacy-preserving applications. Secure multi-party computation SMPC gives a general solution to these applications and has become a hot topic. ...
Secure Multi-Party Computation without Agreement

It has recently been shown that authenticated Byzantine agreement, in which more than a third of the parties are corrupted, cannot be securely realized under concurrent or parallel (stateless) composition. This result puts into question any usage of ...
FAMC: Fair and Publicly Auditable Multi-Party Computation with Cheater Detection
Information and Communications Security
Abstract
Secure multi-party computation (MPC) protocols do not completely prevent malicious parties from cheating. Though numerous researches on cheating detection are proposed, most cheater detection works do not guarantee fairness of the protocol. That ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management

October 2024

5705 pages

ISBN:9798400704369

DOI:10.1145/3627673

General Chairs:
Edoardo Serra
Boise State University, USA
,
Francesca Spezzano
Boise State University, USA

Copyright © 2024 Owner/Author.

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike International 4.0 License.

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2024

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Key Research and Development Program of China

Conference

CIKM '24

Sponsor:

SIGIR

CIKM '24: The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

ID, Boise, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
266
Total Downloads

Downloads (Last 12 months)266
Downloads (Last 6 weeks)107

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten