Autonomic Service Operation for Cloud Applications: Safe Actuation and Risk Management

  • Conference paper
Dependable Computing - EDCC 2021 Workshops (EDCC 2021)

Abstract

Cloud-native applications consist of highly specialized and decoupled services that can be deployed, scaled and managed independently. Keeping such applications available is a complex task for operators, because software defects and other kinds of faults can be challenging to diagnose and repair quickly enough to resume operations. Autonomic service operation is therefore a promising approach. However, there are risks associated with guaranteeing safe autonomic actuation, which must be managed. This paper discusses the challenges identified during the development of a platform for autonomic service operation and describes the software architecture of the platform. Results show mean times to detect, diagnose and repair failures in the order of tens of seconds.

Acknowledgements

This work has been funded through the FCT - Foundation for Science and Technology, I.P., within the scope of project CISUC - UID/CEC/00326/2020, by the European Social Fund, through the Regional Operational Program Centro 2020, and by the AESOP project (P2020-31/SI/2017, No. 040004).

Author information

Corresponding author

Correspondence to Raul Barbosa.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Tomás, J. et al. (2021). Autonomic Service Operation for Cloud Applications: Safe Actuation and Risk Management. In: Adler, R., et al. Dependable Computing - EDCC 2021 Workshops. EDCC 2021. Communications in Computer and Information Science, vol 1462. Springer, Cham. https://doi.org/10.1007/978-3-030-86507-8_4

  • DOI: https://doi.org/10.1007/978-3-030-86507-8_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-86506-1

  • Online ISBN: 978-3-030-86507-8

  • eBook Packages: Computer Science, Computer Science (R0)
