Abstract
Given a network with node descriptions or labels, how to identify anomalous nodes and anomalous links in it? Existing methods (e.g., non-negative matrix factorization) mostly focuses on structural anomalies, without taking node descriptions or labels seriously into account. However, such information is obviously valuable for detecting anomalous nodes and links. On the other hand, network representation learning aims to represent the nodes and links in a network as low-dimensional, real-valued and dense vectors, so that the resulting vectors have representation and reasoning ability. It is straightforward that the reconstruction errors between normal node representations are small, while anomaly ones are not. Therefore, we propose a novel Content-Aware Anomaly Detection (CAAD) method based on network representation learning and encoder-decoder. The CAAD method learns structural and content representations with convolutional neural networks. By using the learned low-dimensional node representations, an encoder-decoder model is trained to perform anomaly detection in terms of reconstruction errors. Experiments on two synthetic datasets and one real-world dataset demonstrate that CAAD consistently outperforms the existing baseline methods. For more information and source codes of this study please visit https://github.com/lizhong2613/CAAD.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yu, W., et al.: Netwalk: a flexible deep embedding approach for anomaly detection in dynamic networks. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2018)
Hu, R., et al.: An embedding approach to anomaly detection. In: IEEE 32nd International Conference on Data Engineering (ICDE). IEEE (2016)
Gao, J., et al.: A spectral framework for detecting inconsistency across multi-source object relationships. In: 2011 IEEE 11th International Conference on Data Mining. IEEE (2011)
Fischer, A., et al.: Virtual network embedding: a survey. IEEE Commun. Surv. Tutorials 15(4), 1888–1906 (2013)
Akoglu, L., McGlohon, M., Faloutsos, C.: Oddball: spotting anomalies in weighted graphs. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS (LNAI), vol. 6119, pp. 410–421. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13672-6_40
Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2014)
Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2016)
Tang, J., et al.: Line: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee (2015)
Chawla, N.V., et al.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Prakash, B.A., Sridharan, A., Seshadri, M., Machiraju, S., Faloutsos, C.: EigenSpokes: surprising patterns and scalable community chipping in large graphs. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS (LNAI), vol. 6119, pp. 435–448. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13672-6_42
Jiang, M., et al.: A general suspiciousness metric for dense blocks in multimodal data. In: IEEE International Conference on Data Mining. IEEE (2015)
Shin, K., Hooi, B., Faloutsos, C.: M-Zoom: fast dense-block detection in tensors with quality guarantees. In: Frasconi, P., Landwehr, N., Manco, G., Vreeken, J. (eds.) ECML PKDD 2016. LNCS (LNAI), vol. 9851, pp. 264–280. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46128-1_17
Shin, K., et al.: D-cube: dense-block detection in terabyte-scale tensors. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. ACM (2017)
Tong, H., Lin, C.-Y.: Non-negative residual matrix factorization with application to graph anomaly detection. In: Proceedings of the 2011 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics (2011)
Tu, C., et al.: Cane: context-aware network embedding for relation modeling. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2017)
Kagan, D., Elovichi, Y., Fire, M.: Generic anomalous vertices detection utilizing a link prediction algorithm. Social Netw. Anal. Min. 8(1), 27 (2018)
Kagan, D., Fire, M., Elovici, Y.: Unsupervised anomalous vertices detection utilizing link prediction algorithms. arXiv preprint arXiv:1610.07525 (2016)
Liu, F.T., Ting, K.M., Zhou, Z.-H.: Isolation forest. In: Eighth IEEE International Conference on Data Mining. IEEE (2008)
Acknowledgements
This work is supported by the National Key Research and Development Program of China, the National Natural Science Foundation of China under grants U1911401, 61772501, and U1836206, and the GF Innovative Research Program. We are grateful to the editor and anonymous reviewers for their constructive comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, Z., Jin, X., Zhuang, C., Sun, Z. (2020). Content-Aware Anomaly Detection with Network Representation Learning. In: Qiu, M. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2020. Lecture Notes in Computer Science(), vol 12454. Springer, Cham. https://doi.org/10.1007/978-3-030-60248-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-60248-2_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60247-5
Online ISBN: 978-3-030-60248-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)