Abstract
Graph Neural Networks (GNNs) have been widely used in graph-based anomaly detection, but these methods require a sufficient amount of labeled data to achieve satisfactory performance. The high cost of data annotation makes some well-designed algorithms impractical in real-world tasks. Active learning has been used to trade off labeling cost against model performance, yet few prior works apply it to anomaly detection. We therefore propose GADAL, a novel Active Learning framework for Graph Anomaly Detection, which employs a multi-aspect query strategy to achieve high performance within a limited budget. First, we design an abnormal-aware query strategy based on a scalable sliding window to enrich abnormal patterns and alleviate the class imbalance problem. Second, we design an inconsistency-aware query strategy based on the effective degree to capture the most distinctive nodes in information aggregation. We then provide a hybrid solution that combines the two query strategies. Empirical studies demonstrate that our query strategy significantly outperforms other strategies, and that GADAL achieves performance comparable to state-of-the-art anomaly detection methods using less than 3% of the budget.
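The idea of a hybrid query under a fixed labeling budget can be illustrated with a minimal sketch. It is not GADAL's actual procedure: the abstract does not say how the abnormal-aware and inconsistency-aware scores are computed or combined, so the per-node score dictionaries, the weight `alpha`, and the function names `hybrid_query` and `run_queries` are all hypothetical, and a plain weighted sum stands in for the hybrid solution.

```python
from typing import Dict, List, Set


def hybrid_query(
    abnormal_score: Dict[int, float],
    inconsistency_score: Dict[int, float],
    labeled: Set[int],
    k: int,
    alpha: float = 0.5,
) -> List[int]:
    """Return up to k unlabeled nodes ranked by a weighted sum of two scores.

    Assumption: both score dictionaries are given as inputs and alpha is a
    made-up mixing weight; the source does not specify either.
    """
    candidates = [n for n in abnormal_score if n not in labeled]
    ranked = sorted(
        candidates,
        key=lambda n: alpha * abnormal_score[n]
        + (1.0 - alpha) * inconsistency_score.get(n, 0.0),
        reverse=True,
    )
    return ranked[:k]


def run_queries(
    abnormal_score: Dict[int, float],
    inconsistency_score: Dict[int, float],
    budget: int,
    batch_size: int,
    alpha: float = 0.5,
) -> Set[int]:
    """Query nodes in batches until the labeling budget is spent."""
    labeled: Set[int] = set()
    while len(labeled) < budget:
        k = min(batch_size, budget - len(labeled))
        batch = hybrid_query(abnormal_score, inconsistency_score, labeled, k, alpha)
        if not batch:  # no unlabeled candidates remain
            break
        labeled.update(batch)
        # A real active learning loop would obtain labels for the batch,
        # retrain the detector, and recompute both scores here.
    return labeled
```

In this sketch the scores stay fixed between rounds, which keeps the example self-contained; in practice they would be refreshed after each retraining step.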
Acknowledgements
This research is supported by a grant from MOE Social Science Laboratory of Digital Economic Forecasts and Policy Simulation at UCAS and CAS 145 Informatization Project CAS-WX2022GC-0301.
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chang, W., Yu, J., Zhou, X. (2023). GADAL: An Active Learning Framework for Graph Anomaly Detection. In: Li, B., Yue, L., Tao, C., Han, X., Calvanese, D., Amagasa, T. (eds) Web and Big Data. APWeb-WAIM 2022. Lecture Notes in Computer Science, vol 13421. Springer, Cham. https://doi.org/10.1007/978-3-031-25158-0_35
DOI: https://doi.org/10.1007/978-3-031-25158-0_35
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25157-3
Online ISBN: 978-3-031-25158-0
eBook Packages: Computer Science (R0)