Skip to main content

Variations and Evaluations of an Adaptive Accrual Failure Detector to Enable Self-healing Properties in Distributed Systems

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4415))

Abstract

The initiatives Organic Computing and Autonomic Computing introduced challenging visions for future computer systems. They address the growing complexity of these systems that demands for new ways to control them. Future systems should be able to adapt dynamically to the current conditions of their environment. They should be characterised by so-called self-x properties like self-configuring, self-healing, self-optimising, self-protecting, and context-aware. For the incorporation of self-healing capabilities into distributed systems the detection of failures is a crucial part. Recently we proposed a new failure detector that can be described as an adaptive accrual algorithm. It has been designed for flexible generic usability as a basis to realise self-healing of distributed systems. This paper introduces variations of the proposed basic algorithm to improve its performance and provides an evaluation of all algorithms using message delay and loss models of the internet.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bertier, M., Marin, O., Sens, P.: Implementation and performance evaluation of an adaptable failure detector. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, Washington, DC, USA, pp. 354–363. IEEE Computer Society Press, Los Alamitos (2002)

    Chapter  Google Scholar 

  2. Bolot, J.-C.: End-to-end packet delay and loss behavior in the internet. In: SIGCOMM, pp. 289–298 (1993), http://doi.acm.org/10.1145/166237.166265

  3. Chandra, T.D., Hadzilacos, V., Toueg, S.: The weakest failure detector for solving consensus. J. ACM 43(4), 685–722 (1996), doi:10.1145/234533.234549

    Article  MathSciNet  MATH  Google Scholar 

  4. Chandra, T.D., Toueg, S.: Unreliable failure detectors for reliable distributed systems. J. ACM 43(2), 225–267 (1996), doi:10.1145/226643.226647

    Article  MathSciNet  MATH  Google Scholar 

  5. Chen, W., Toueg, S., Aguilera, M.K.: On the quality of service of failure detectors. In: Proceedings of the International Conference on Dependable Systems and Networks (DSN 2000), New York, IEEE Computer Society Press, Los Alamitos (2000), citeseer.ist.psu.edu/chen00quality.html

    Google Scholar 

  6. Dam, K.K., Ni, L.M.: Design and implementation of a network emulator. Technical Report MSU-CPS-ACS-98-16, Department of Computer Science and Engineering, Michigan State University (May 1998)

    Google Scholar 

  7. Fetzer, C., Raynal, M., Tronel, F.: An adaptive failure detection protocol. In: PRDC ’01: Proceedings of the 2001 Pacific Rim International Symposium on Dependable Computing, Washington, DC, USA, p. 146. IEEE Computer Society Press, Los Alamitos (2001)

    Chapter  Google Scholar 

  8. Fischer, M.J., Lynch, N.A., Paterson, M.S.: Impossibility of distributed consensus with one faulty process. J. ACM 32(2), 374–382 (1985), doi:10.1145/3149.214121

    Article  MathSciNet  MATH  Google Scholar 

  9. Härdle, W.: Smoothing Techniques with Implementation in S. Springer, Berlin (1991)

    MATH  Google Scholar 

  10. Hayashibara, N., Défago, X., Yared, R., Katayama, T.: The f accrual failure detector. In: SRDS, pp. 66–78. IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  11. Horn, P.: Autonomic computing: Ibms perspective on the state of information technology (2001), http://www.research.ibm.com/autonomic/

  12. Horstmann, M., Kirtland, M.: Dcomarchitecture. Technical report (July 1997), http://msdn.microsoft.com/library/backgrnd/html/msdn_dcomarch.htm

  13. Jacobson, V.: Congestion avoidance and control. In: SIGCOMM ’88: Symposium proceedings on Communications architectures and protocols, Stanford, California, United States, pp. 314–329. ACM Press, New York (1988), doi:10.1145/52324.52356

    Chapter  Google Scholar 

  14. Kephart, J.O.: Research challenges of autonomic computing. In: ICSE ’05: Proceedings of the 27th international conference on Software engineering, St. Louis, MO, USA, pp. 15–22 (2005), doi:10.1145/1062455.1062464

    Google Scholar 

  15. Lynch, N.: A hundred impossibility proofs for distributed computing. In: PODC ’89: Proceedings of the eighth annual ACM Symposium on Principles of distributed computing, Edmonton, Alberta, Canada, pp. 1–28. ACM Press, New York (1989), doi:10.1145/72981.72982

    Chapter  Google Scholar 

  16. Mukherjee, A.: On the dynamics and significance of low frequency components of internet load. Technical Report MIS-CIS-92-83, University of Pennsylvania (December 1992)

    Google Scholar 

  17. Müller-Schloer, C., von der Malsburg, C., Würtz, R.P.: Organic computing. Informatik Spektrum 27(4), 332–336 (2004), doi:10.1007/s00287-004-0409-6

    Article  Google Scholar 

  18. Pietzowski, A., Trumler, W., Ungerer, T.: An artificial immune system and its integration into an organic middleware for self-protection. In: GECCO ’06: Proceedings of the 8th annual conference on Genetic and evolutionary computation, Seattle, Washington, USA, pp. 129–130. ACM Press, New York (2006), doi:10.1145/1143997.1144020

    Chapter  Google Scholar 

  19. Richter, U., Mnif, M., Branke, J., Müller-Schloer, C., Schmeck, H.: Towards a generic observer/controller architecture for organic computing. In: Hochberger, C., Liskowsky, R. (eds.) INFORMATIK 2006 – Informatik für Menschen, Dresden, Germany, sep 2006. GI-Edition – Lecture Notes in Informatics, vol. P-93, pp. 112–119. Köllen Verlag, Bonn (2006)

    Google Scholar 

  20. Sanghi, D., Agrawala, A.K., Gudmundsson, O., Jain, B.N.: Experimental assessment of end-to-end behavior on internet. In: INFOCOM, pp. 867–874 (1993)

    Google Scholar 

  21. Satzger, B., Pietzowski, A., Trumler, W., Ungerer, T.: A new adaptive accrual failure detector for dependable distributed systems. In: SAC ’07: Proceedings of the 2006 ACM symposium on Applied computing, Seoul, South Korea, ACM Press, New York (2007)

    Google Scholar 

  22. Schmeck, H.: Organic computing-vision and challenge for system design. In: Proceedings of the Parallel Computing in Electrical Engineering, International Conference on (PARELEC 2004), Washington, DC, USA, pp. 3–3. IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  23. Schmeck, H.: Organic computing. Künstliche Intelligenz 5(3), 68–69 (2005)

    Google Scholar 

  24. Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman & Hall/CRC, Boca Raton (April 1986)

    MATH  Google Scholar 

  25. Trumler, W., Bagci, F., Petzold, J., Ungerer, T.: Amun - autonomic middleware for ubiquitous environments applied to the smart doorplate. In: Advanced Engineering Informatics, vol. 19, Washington, DC, USA, pp. 243–252. Elsevier, Amsterdam (2005)

    Google Scholar 

  26. Trumler, W., Klaus, R., Ungerer, T.: Self-configuration via cooperative social behavior. In: Yang, L.T., Jin, H., Ma, J., Ungerer, T. (eds.) ATC 2006. LNCS, vol. 4158, pp. 90–99. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  27. Trumler, W., Thiemann, T., Ungerer, T.: An artificial hormone system for self-organization of networked nodes. In: IFIP Conference on Biologically Inspired Cooperative Computing, Santiago de Chile, August 2006, pp. 85–94. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Paul Lukowicz Lothar Thiele Gerhard Tröster

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Satzger, B., Pietzowski, A., Trumler, W., Ungerer, T. (2007). Variations and Evaluations of an Adaptive Accrual Failure Detector to Enable Self-healing Properties in Distributed Systems. In: Lukowicz, P., Thiele, L., Tröster, G. (eds) Architecture of Computing Systems - ARCS 2007. ARCS 2007. Lecture Notes in Computer Science, vol 4415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71270-1_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71270-1_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71267-1

  • Online ISBN: 978-3-540-71270-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics