Adaptive Fault Tolerance in Multi-Computer Systems Using Data Compaction and Two-Level Voting

Stopp, Andreas

doi:10.1007/978-3-642-76930-6_8

Conference paper

Part of the book series: Informatik-Fachberichte (INFORMATIK, volume 283)

Abstract

Multi-computer systems are well suited to applying a variety of fault tolerance principles. To increase reliability at low cost, this paper proposes an Adaptive Two-Level Voting scheme for multi-computer systems. The scheme provides process-oriented fault tolerance by fault masking, with the degree of fault tolerance determined dynamically. The first voting level is a decentralized 2-out-of-N Signature Voting: N identical copies of a process run on N computer nodes, each copy generates a process signature by data compaction, and the nodes exchange and vote on these signatures. This level can tolerate N-2 faulty computer nodes. On the second voting level, the results of the first voting level are compared by V-Version Programming using diverse software. For Two-Level Voting (V,N), the number of program versions V and the number of program copies N can be chosen. The paper describes four variants of Adaptive Two-Level Voting: a sequential variant, a parallel variant, a comparison variant, and a very efficient dynamic variant. An estimate of the efficiency of the proposed variants is also given.
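
The two voting levels described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not name the compaction function, so CRC-32 stands in for it here; the decentralized signature exchange is simulated inside one process; the second level is modelled as a strict majority over the V first-level results, which is an assumption; and the function names `signature`, `two_out_of_n` and `two_level_vote` are hypothetical. The sketch also assumes that faulty copies do not produce matching wrong signatures.

```python
import zlib
from collections import Counter
from typing import Optional, Sequence


def signature(data: bytes) -> int:
    """Compact a copy's output into a short signature (CRC-32 is an assumption)."""
    return zlib.crc32(data)


def two_out_of_n(outputs: Sequence[bytes]) -> Optional[bytes]:
    """First level: accept the first output whose signature is shared by >= 2 copies."""
    sigs = [signature(o) for o in outputs]
    counts = Counter(sigs)
    for out, sig in zip(outputs, sigs):
        if counts[sig] >= 2:
            return out
    return None  # no two copies agree, so nothing can be masked


def two_level_vote(versions: Sequence[Sequence[bytes]]) -> Optional[bytes]:
    """Second level (assumed rule): strict majority over the V first-level results."""
    results = [r for r in (two_out_of_n(c) for c in versions) if r is not None]
    if not results:
        return None
    value, votes = Counter(results).most_common(1)[0]
    return value if votes > len(versions) // 2 else None


# N = 4 copies, two of them faulty: N - 2 faults are masked.
first_level = two_out_of_n([b"42", b"17", b"42", b"99"])

# Two-Level Voting (V, N) = (3, 3): the third version has a software fault.
second_level = two_level_vote([
    [b"42", b"xx", b"42"],
    [b"42", b"42", b"yy"],
    [b"41", b"41", b"zz"],
])
```

In this sketch both `first_level` and `second_level` end up as `b"42"`. Comparing signatures rather than full outputs keeps the exchanged data small, at the cost of a small probability that two different outputs compact to the same signature.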

Author information

Authors and Affiliations

Central Institute for Cybernetics and Information Processes, Institute for Computer Architecture, Kurstr.33, O-1086, Berlin, Germany
Andreas Stopp

Editor information

Editors and Affiliations

Institut für Mathematische Maschinen und Datenverarbeitung III (Rechnerstrukturen), Universität Erlangen-Nürnberg, Martensstr. 3, W-8520, Erlangen, Germany
Mario Dal Cin & Wolfgang Hohl

Cite this paper

Stopp, A. (1991). Adaptive Fault Tolerance in Multi-Computer Systems Using Data Compaction and Two-Level Voting. In: Dal Cin, M., Hohl, W. (eds) Fault-Tolerant Computing Systems. Informatik-Fachberichte, vol 283. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76930-6_8

DOI: https://doi.org/10.1007/978-3-642-76930-6_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-54545-3
Online ISBN: 978-3-642-76930-6
eBook Packages: Springer Book Archive
