GADAL: An Active Learning Framework for Graph Anomaly Detection

  • Conference paper
Web and Big Data (APWeb-WAIM 2022)

Abstract

Graph Neural Networks (GNNs) are widely used for graph-based anomaly detection, but these methods need a sufficient amount of labeled data to perform well. The high cost of data annotation makes some well-designed algorithms impractical in real-world tasks. Active learning has been used to trade off labeling cost against model performance, yet few prior works apply it to anomaly detection. We therefore propose GADAL, a novel Active Learning framework for Graph Anomaly Detection, which employs a multi-aspect query strategy to achieve high performance within a limited budget. First, we design an abnormal-aware query strategy based on a scalable sliding window to enrich abnormal patterns and alleviate the class imbalance problem. Second, we design an inconsistency-aware query strategy based on the effective degree to capture the nodes that are most distinctive during information aggregation. We then provide a hybrid solution that combines the two query strategies. Empirical studies demonstrate that our query strategy significantly outperforms other strategies, and that GADAL achieves performance comparable to state-of-the-art anomaly detection methods with less than 3% of the budget.
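
To make the idea of a hybrid query strategy concrete, the following is a minimal sketch of one pool-based selection step. It is not the authors' method: the abstract does not specify the sliding window or the effective degree, so both are replaced here by simplified stand-ins. The predicted anomaly probability stands in for the abnormal-aware criterion, and the mean score gap between a node and its neighbors stands in for the inconsistency-aware criterion. The function names, the `alpha` mixing weight, and the toy graph are all hypothetical.

```python
def neighbor_inconsistency(node, scores, neighbors):
    """Mean absolute gap between a node's score and its neighbors' scores.

    Assumption: a simplified stand-in for an inconsistency-aware criterion,
    not the effective-degree measure named in the abstract.
    """
    nbrs = neighbors.get(node, [])
    if not nbrs:
        return 0.0
    return sum(abs(scores[node] - scores[n]) for n in nbrs) / len(nbrs)


def hybrid_query(scores, neighbors, labeled, budget, alpha=0.5):
    """Return up to `budget` unlabeled nodes ranked by a mixed score.

    scores:    node -> predicted anomaly probability from the current model
    neighbors: node -> list of adjacent nodes
    labeled:   set of nodes that already have labels
    alpha:     hypothetical weight between the two criteria
    """
    candidates = [n for n in scores if n not in labeled]

    def mixed(n):
        abnormal = scores[n]  # favors likely anomalies (the rare class)
        inconsistent = neighbor_inconsistency(n, scores, neighbors)
        return alpha * abnormal + (1 - alpha) * inconsistent

    return sorted(candidates, key=mixed, reverse=True)[:budget]


# Toy graph with made-up model scores.
scores = {"a": 0.9, "b": 0.1, "c": 0.2, "d": 0.8}
neighbors = {"a": ["b", "c"], "b": ["a"], "c": ["a", "d"], "d": ["c"]}
picked = hybrid_query(scores, neighbors, labeled={"d"}, budget=2)
print(picked)
```

In a full active learning loop, the selected nodes would be sent to an annotator, added to the labeled set, and the detector retrained before the next query round, until the labeling budget is exhausted.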

Acknowledgements

This research is supported by a grant from MOE Social Science Laboratory of Digital Economic Forecasts and Policy Simulation at UCAS and CAS 145 Informatization Project CAS-WX2022GC-0301.

Author information

Corresponding author

Correspondence to Jianjun Yu.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Chang, W., Yu, J., Zhou, X. (2023). GADAL: An Active Learning Framework for Graph Anomaly Detection. In: Li, B., Yue, L., Tao, C., Han, X., Calvanese, D., Amagasa, T. (eds) Web and Big Data. APWeb-WAIM 2022. Lecture Notes in Computer Science, vol 13421. Springer, Cham. https://doi.org/10.1007/978-3-031-25158-0_35

  • DOI: https://doi.org/10.1007/978-3-031-25158-0_35

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-25157-3

  • Online ISBN: 978-3-031-25158-0

  • eBook Packages: Computer Science, Computer Science (R0)