Skip to main content

Adaptive Fault Tolerance in Multi-Computer Systems Using Data Compaction and Two-Level Voting

  • Conference paper
  • 73 Accesses

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 283))

Abstract

Multi-computer systems offer good conditions for utilization of various fault tolerance principles. In order to increase reliability with little expense, this paper proposes an Adaptive Two-Level Voting scheme for multicomputer systems. This scheme enables a process-oriented dynamically determined fault tolerance by fault masking. The basic principle of the first voting level is a decentralized 2-out-of-N Signature Voting of exchanged process signatures generated by data compaction of N identical process copies on N computer nodes. It can tolerate N-2 faulty computer nodes. On the second voting level the results of the first voting level are compared by V-Version Programming using diverse software. For Two-Level Voting (V,N), the number of involved program versions V and the number of involved program copies N can be chosen. This paper describes four variants of Adaptive Two-Level Voting, a sequential, a parallel, a comparison variant and a very efficient dynamic variant. An estimation of efficiency of the suggested variants is also given.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. Avizienis, “The N-Version Approach to Fault-Tolerant Software”, IEEE Transactions on Software Engineering Dec.1985 pp 1491–1501.

    Google Scholar 

  2. M. Dal Cin, “Zur explizit fehlertoleranten Programmierung von Parallelrechnern”, In: Tagungsband PARS-Workshop 10-12 April 1989 München Neu-Perlach S. 47–55.

    Google Scholar 

  3. K. Echtle, “Fehlermaskierung durch verteilte Systeme”, Informatik-Fachberichte Band 121 Springer-Verlag 1986.

    Google Scholar 

  4. K. Echtle, “Distance Agreement Protocols”, Proc. of FTCS-19, Chigago, 21.-23. 5. 1989.

    Google Scholar 

  5. 5/ J.P.J. Kelly, “Current Experiences with Fault-Tolerant Software Design: Dependability Through Diverse Formal Specifications”, In: Proceedings of the 4th Conf. Fault- tolerant Computing Systems Baden-Baden 1989 ed.by W.Görke and H.Sörensen. Informatik-Fachberichte 214. Sponger-Verlag 1989.

    Google Scholar 

  6. K.H. Kim, H.O. Welch, “Distributed Execution of Recovery Blocks: An Approach for Uniform Treatment of Hardware and Software Faults in Real-Time Applications”, IEEE Trans.on Computers Vol.39, No.5, may 1989. pp. 626–636.

    Google Scholar 

  7. J.C. Laprie, “Dependable Computing - Concepts and Terminology”, In: Proceedings of FTSD-87, Varna 1987, pp. 44–53.

    Google Scholar 

  8. J.C. Laprie, “Dependability: A Unifying Concept for Reliable Computing and Fault Tolerance”, In Dependability and Resilient Systems, T.Anderson ( Ed. ), Blackwell Scientific Publications (1989), pp. 1–28.

    Google Scholar 

  9. S. Miyamoto, et.al., “FMPA: A Fault-Tolerant Multi-Microprocessor System Based on Autonomous Decentralization Concept”, IEEE 1983. pp. 4–9.

    Google Scholar 

  10. 10/ D. Powell, et.al., “The Delta-4 Approach to Dependability in Open Distributed Computing Systems”, Proc. of FTCS-18, Tokyo, June 1988, pp. 246–251.

    Google Scholar 

  11. 11/ D.K. Pradhan, “Fault-Tolerant Computing - Theory and Techniques”, Prentice Hall, 1986.

    Google Scholar 

  12. A. Stopp, “A Signature Supervisor for Diagnosis in Hypercubes” In: Plander, I.(Ed.) Artificial Intelligence and Information-Control Systems of Robots - 87. North-Holland 1987, pp. 461–465.

    Google Scholar 

  13. A. Stopp, “Decentralized Diagnosis in Regular Structured Multiprocessor Systems”, In: Legendi,T., Parkinson,D., Vollmar,R., Wolf,G., (Eds.), Parallel Processing by Cellular Automatas ana Arrays, North-Holland, Amsterdam 1987, pp. 192–189.

    Google Scholar 

  14. J.H. Wensley, et.al., “SIFT Design and Analysis of a Fault-Tolerant Computer for Aircraft Control”, Proc. of IEEE, Vol. 66, No. 10 1978, pp. 1240–1255.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1991 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Stopp, A. (1991). Adaptive Fault Tolerance in Multi-Computer Systems Using Data Compaction and Two-Level Voting. In: Cin, M.D., Hohl, W. (eds) Fault-Tolerant Computing Systems. Informatik-Fachberichte, vol 283. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76930-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-76930-6_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-54545-3

  • Online ISBN: 978-3-642-76930-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics