Abstract
For smart devices such as smartphones and tablets, developing new software using open source software (OSS) is becoming mainstream. While OSS-based development can greatly increase project productivity, it is more difficult to identify the cause of software defects. In this paper, we propose a deep learning model that performs unsupervised learning based on the log data accumulated in the project and calculates the degree of abnormality per line for newly given logs. The proposed method is evaluated using open supercomputer system log data, Blue Gene/L, and the accuracy of the proposed method is compared with the conventional log anomaly detection method using LSTM AutoEncoder. As a result of the comparative experiment, it was found that the proposed method performed better than the conventional method in the two scores of AUROC and F1 Score at the cutoff point.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hirakawa, R., Tominaga, K., Nakatoh, Y.: Study on real-time log anomaly detection method using HTM algorithm. In: Proceedings of the Institute of Electronics. Information and Communication Engineers, Society Conference, vol. 2019, p. 74 (2019)
Du, M., Li, F., Zheng, G., Srikumar, V.: DeepLog: anomaly detection and diagnosis from system logs through deep learning, pp. 1285–1298 (2017). https://doi.org/10.1145/3133956.3134015
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT (2018)
Wu, Y. et al.: Google’s neural machine translation system: bridging the gap between human and machine translation (2016)
Chalapathy, R., Menon, A., Chawla, S.: Anomaly detection using one-class neural networks (2018)
Oliner, A.J., Stearley, J.: What supercomputers say: a study of five system logs. In: Proceedings of IEEE/IFIP International Conference on Dependable Systems and Networks (DSN) (2007)
Zhu, J., et al.: Tools and benchmarks for automated log parsing. In: International Conference on Software Engineering (ICSE) (2019)
Bert (Github). https://github.com/google-research/bert. Accessed 14 Mar 2020
Optuna (Github). https://github.com/optuna/optuna. Accessed 15 Mar 2020
text-autoencoders (Github). https://github.com/shentianxiao/text-autoencoders. Accessed 15 Mar 2020
Shen, T., Mueller, J., Barzilay, R., Jaakkola, T.: Educating text autoencoders: latent representation guidance via denoising. arXiv preprint arXiv:1905.12777 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Hirakawa, R., Tominaga, K., Nakatoh, Y. (2020). Software Log Anomaly Detection Through One Class Clustering of Transformer Encoder Representation. In: Stephanidis, C., Antona, M. (eds) HCI International 2020 - Posters. HCII 2020. Communications in Computer and Information Science, vol 1224. Springer, Cham. https://doi.org/10.1007/978-3-030-50726-8_85
Download citation
DOI: https://doi.org/10.1007/978-3-030-50726-8_85
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-50725-1
Online ISBN: 978-3-030-50726-8
eBook Packages: Computer ScienceComputer Science (R0)