Skip to main content

Software Log Anomaly Detection Through One Class Clustering of Transformer Encoder Representation

  • Conference paper
  • First Online:
HCI International 2020 - Posters (HCII 2020)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1224))

Included in the following conference series:

  • 2438 Accesses

Abstract

For smart devices such as smartphones and tablets, developing new software using open source software (OSS) is becoming mainstream. While OSS-based development can greatly increase project productivity, it is more difficult to identify the cause of software defects. In this paper, we propose a deep learning model that performs unsupervised learning based on the log data accumulated in the project and calculates the degree of abnormality per line for newly given logs. The proposed method is evaluated using open supercomputer system log data, Blue Gene/L, and the accuracy of the proposed method is compared with the conventional log anomaly detection method using LSTM AutoEncoder. As a result of the comparative experiment, it was found that the proposed method performed better than the conventional method in the two scores of AUROC and F1 Score at the cutoff point.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hirakawa, R., Tominaga, K., Nakatoh, Y.: Study on real-time log anomaly detection method using HTM algorithm. In: Proceedings of the Institute of Electronics. Information and Communication Engineers, Society Conference, vol. 2019, p. 74 (2019)

    Google Scholar 

  2. Du, M., Li, F., Zheng, G., Srikumar, V.: DeepLog: anomaly detection and diagnosis from system logs through deep learning, pp. 1285–1298 (2017). https://doi.org/10.1145/3133956.3134015

  3. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT (2018)

    Google Scholar 

  4. Wu, Y. et al.: Google’s neural machine translation system: bridging the gap between human and machine translation (2016)

    Google Scholar 

  5. Chalapathy, R., Menon, A., Chawla, S.: Anomaly detection using one-class neural networks (2018)

    Google Scholar 

  6. Oliner, A.J., Stearley, J.: What supercomputers say: a study of five system logs. In: Proceedings of IEEE/IFIP International Conference on Dependable Systems and Networks (DSN) (2007)

    Google Scholar 

  7. Zhu, J., et al.: Tools and benchmarks for automated log parsing. In: International Conference on Software Engineering (ICSE) (2019)

    Google Scholar 

  8. Bert (Github). https://github.com/google-research/bert. Accessed 14 Mar 2020

  9. Optuna (Github). https://github.com/optuna/optuna. Accessed 15 Mar 2020

  10. text-autoencoders (Github). https://github.com/shentianxiao/text-autoencoders. Accessed 15 Mar 2020

  11. Shen, T., Mueller, J., Barzilay, R., Jaakkola, T.: Educating text autoencoders: latent representation guidance via denoising. arXiv preprint arXiv:1905.12777 (2019)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yoshihisa Nakatoh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hirakawa, R., Tominaga, K., Nakatoh, Y. (2020). Software Log Anomaly Detection Through One Class Clustering of Transformer Encoder Representation. In: Stephanidis, C., Antona, M. (eds) HCI International 2020 - Posters. HCII 2020. Communications in Computer and Information Science, vol 1224. Springer, Cham. https://doi.org/10.1007/978-3-030-50726-8_85

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-50726-8_85

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-50725-1

  • Online ISBN: 978-3-030-50726-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics