skip to main content
10.1145/3555776.3578809acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

Defektor: An Extensible Tool for Fault Injection Campaign Management in Microservice Systems

Published:07 June 2023Publication History

ABSTRACT

To achieve dependability, system designers often resort to fault-tolerance mechanisms. The evaluation of these mechanisms requires the observation of failures, which typically are rare events. To increase the failure rate, practitioners use fault injection techniques, leading to an increased occurrence of failures and allowing the assessment of the system's dependability properties. While many fault injection tools exist for this end, they are usually limited in scope, applicability, and in their configuration abilities for microservice applications.

We propose a generalist and extensible tool named "Defektor" capable of controlling a fault injection campaign on different types of applications, especially suited for microservice-based applications, compatible with different container orchestration technologies and different fault injection tools. The Defektor configuration follows a high-level approach, based on an injection campaign plan specifying the instructions for the Defektor operation and the parameters of the fault injection campaign. Defektor automates the entire workflow, consisting of defining the campaign plan, generating a workload, specifying and injecting the faults, collecting data, aiding the experiment's repeatability, improving the consistency of results, and saving a considerable amount of time.

References

  1. Baptista, G., Bento, A., Correia, J.: Defektor, https://github.com/SysOBs/defektorGoogle ScholarGoogle Scholar
  2. Basiri, A., Behnam, N., De Rooij, R., Hochstein, L., Kosewski, L., Reynolds, J., Rosenthal, C.: Chaos engineering. IEEE Software 33(3), 35--41 (2016)Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Brown, S.: The C4 model for visualising software architecture, https://c4model.com/Google ScholarGoogle Scholar
  4. Carreira, J., Madeira, H., Silva, J.G.: Xception: A technique for the experimental evaluation of dependability in modern computers. IEEE Trans. Softw. Eng. 24(2), 125--136 (Feb 1998). https://doi.org/10.1109/32.666826 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. CNCF SURVEY 2019: Deployments are getting larger as cloud native adoption becomes mainstream (2019), https://www.cncf.io/wp-content/uploads/2020/08/CNCF_Survey_Report.pdfGoogle ScholarGoogle Scholar
  6. Durães, J. and Vieira, M. and Madeira H.: Dependability benchmarking of web-servers. In: Lecture Notes in Computer Science. SAFECOMP 2004, vol. 3219, pp. 297--310 (2004). Google ScholarGoogle ScholarCross RefCross Ref
  7. Heorhiadi, V., Rajagopalan, S., Jamjoom, H., Reiter, M.K., Sekar, V.: Gremlin: Systematic resilience testing of microservices. In: 2016 IEEE 36th International Conference on Distributed Computing Systems (ICDCS). pp. 57--66 (2016). Google ScholarGoogle ScholarCross RefCross Ref
  8. Instana: Robot shop: Sample microservice application (2021), https://github.com/instana/robot-shopGoogle ScholarGoogle Scholar
  9. Kanawati, G., Kanawati, N., Abraham, J.: Ferrari: a tool for the validation of system dependability properties. In: [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing. pp. 336--344 (1992). Google ScholarGoogle ScholarCross RefCross Ref
  10. What is Kubernetes?, https://kubernetes.ioGoogle ScholarGoogle Scholar
  11. Litmus: Chaos engineering for your kubernetes, https://docs.litmuschaos.io/docs/introduction/what-is-litmusGoogle ScholarGoogle Scholar
  12. Merkel, D.: Docker: Lightweight linux containers for consistent development and deployment. Linux J. 2014(239) (Mar 2014)Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Moraes, R. and Duraes, J. and Barbosa, R. and Martins, E. and Madeira, H.: Experimental risk assessment and comparison using software fault injection. In: 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07). pp. 512--521 (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Netflix: Simianarmy, https://github.com/Netflix/SimianArmyGoogle ScholarGoogle Scholar
  15. Vieira, Marco and Madeira, Henrique: A dependability benchmark for oltp application environments. In: Proceedings of the 29th International Conference on Very Large Data Bases - Volume 29. p. 742--753. VLDB '03, VLDB Endowment (2003)Google ScholarGoogle ScholarCross RefCross Ref
  16. Wu, N., Zuo, D., Zhang, Z.: An extensible fault tolerance testing framework for microservice-based cloud applications. In: Proceedings of the 4th International Conference on Communication and Information Processing. pp. 38--42 (2018)Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Defektor: An Extensible Tool for Fault Injection Campaign Management in Microservice Systems

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SAC '23: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing
          March 2023
          1932 pages
          ISBN:9781450395175
          DOI:10.1145/3555776

          Copyright © 2023 Owner/Author(s)

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 June 2023

          Check for updates

          Qualifiers

          • poster

          Acceptance Rates

          Overall Acceptance Rate1,650of6,669submissions,25%
        • Article Metrics

          • Downloads (Last 12 months)42
          • Downloads (Last 6 weeks)5

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader