Skip to main content

MicroDACP: Microservice Fault Diagnosis Method Based on Dual Attention Contrastive Learning and Graph Attention Networks

  • Conference paper
  • First Online:
Advanced Intelligent Computing Technology and Applications (ICIC 2024)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14878))

Included in the following conference series:

  • 653 Accesses

Abstract

With the wide application of microservice architecture, the fault diagnosis of microservice software system becomes difficult due to the complex dependencies between microservices. In order to diagnose the faults of microservices quickly and accurately, this paper proposes MicroDACP. First, a contrastive representation of the dual attention mechanism is used to learn to determine whether the microservice system is anomalous. Second, a graph attention network is used to learn the microservice invocation dependency graph, and the fault root cause scores are ranked using an improved PageRank algorithm. Then, a Sock-shop microservice system is built on a Kubernetes cluster to evaluate the performance of MicroDACP. We conduct extensive experiments on Sock-shop dataset, SMD dataset and AIOps 2020 dataset to compare and analyze our method with the baseline, and the results show that MicroDACP achieves improvements of up to about 0.13 in F1 score for anomaly detection and 0.32 in mean average precision for root cause localization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Li, B., et al.: Enjoy your observability: an industrial survey of microservice tracing and analysis. Empir. Softw. Eng. 27(1), 1–28 (2022)

    MathSciNet  Google Scholar 

  2. Caron, M., Misra, I., Mairal, J., Goyal, P., Bojanowski, P., Joulin, A.: Unsupervised learning of visual features by contrasting cluster assignments. In: Advances in Neural Information Processing Systems, vol. 33, pp. 9912–9924 (2020)

    Google Scholar 

  3. Liu, P., et al.: Unsupervised detection of microservice trace anomalies through service-level deep Bayesian networks. In: IEEE 31st International Symposium on Software Reliability Engineering, pp. 48–58 (2020)

    Google Scholar 

  4. Aggarwal, P., et al.: Localization of operational faults in cloud applications by mining causal dependencies in logs using golden signals. In: 18th International Conference on Service-Oriented Computing, vol. 12632, pp. 137–149 (2020)

    Google Scholar 

  5. Wu, L., Tordsson, J., Elmroth, E., Kao, O.: MicroRCA: root cause localization of performance issues in microservices. In: IEEE/IFIP Network Operations and Management Symposium, pp. 1–9 (2020)

    Google Scholar 

  6. Nedelkoski, S., Cardoso, J., Kao, O.: Anomaly detection from system tracing data using multimodal deep learning. In: 12th IEEE International Conference on Cloud Computing, pp. 179–186 (2019)

    Google Scholar 

  7. Zhang, S., Jin, P., Lin, Z., Sun, Y., Zhang, B., Xia, S., et al.: Robust failure diagnosis of microservice system through multimodal data. IEEE Trans. Serv. Comput. 16(6), 3851–3864 (2023)

    Google Scholar 

  8. Chen, Y., Xu, D., Chen, N., Wu, X.: FRL-MFPG: propagation-aware fault root cause location for microservice intelligent operation and maintenance. Inf. Softw. Technol. 153(107083) (2023)

    Google Scholar 

  9. Yang, Y., Zhang, C., Zhou, T., Wen, Q., Sun, L.: DCdetector: dual attention contrastive representation learning for time series anomaly detection. In: 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 3033–3045 (2023)

    Google Scholar 

  10. Brody, S., Uri, A., Eran, Y.: How attentive are graph attention networks? In: International Conference on Learning Representations (2021)

    Google Scholar 

  11. Lea, C., Vidal, R., Reiter, A., Hager, G.D.: Temporal convolutional networks: a unified approach to action segmentation. In: Hua, G., Jégou, H. (eds.) ECCV 2016. LNCS, vol. 9915, pp. 47–54. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49409-8_7

  12. Mariani, L., Monni, C., Pezzé, M., Riganelli, O., Xin, R.: Localizing faults in cloud systems. In: 11th IEEE International Conference on Software Testing, Verification and Validation, pp. 262–273 (2018)

    Google Scholar 

  13. Zhang, Z., Bing, L., Jian, W., Liu, Y.: AAMR: automated anomalous microservice ranking in cloud-native environment. In: International Conference on Software Engineering and Knowledge Engineering, pp. 86–91 (2021)

    Google Scholar 

  14. Lin, J.J., Chen, P.F., Zheng, Z.B.: Microscope: pinpoint performance issues with causal graphs in micro-service environments. In: 16th International Conference on Service-Oriented Computing, vol. 11236, pp. 3–20 (2018)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (No. 62162003), Guangxi Key Laboratory of Big Data in Finance and Economics (Grant No. FEDOP2022A02), and the Nanning Science and Technology project (No. 20221031).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ningjiang Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xu, D., Wu, X., Chen, N., Liu, C. (2024). MicroDACP: Microservice Fault Diagnosis Method Based on Dual Attention Contrastive Learning and Graph Attention Networks. In: Huang, DS., Si, Z., Zhang, C. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science(), vol 14878. Springer, Singapore. https://doi.org/10.1007/978-981-97-5672-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-5672-8_8

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-5671-1

  • Online ISBN: 978-981-97-5672-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics