Modelle der Fehlertoleranz in Nachrichten-gekoppelten Parallelrechnern

Seidel, Winfried

doi:10.1007/978-3-642-74135-7_28

Winfried Seidel³

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 188))

55 Accesses

Übersicht

Parallelrechnerarchitekturen werden in Zukunft eine wesentliche Rolle bei der Realisierung von höherer Rechenleistung spielen. Mit diesem Artikel wird ein Konzept vorgestellt, das den Aspekt der Fehlertoleranz in dieser neu aufkommenden Rechnerklasse konkretisiert und Lösungsansitze zu hier bestehenden Problemen bietet. Den Ausgangspunkt bildet die Verwendung von lose gekoppelten Prozessorelementen, die durch ein Message-Passing-Betriebssystem verwaltet werden. Mit diesen Systemeigenschaften gelingt es, Verfahren zu implementieren, die nach dem Prinzip der skalierbaren modularen Redundanz operieren. Dieser Ansatz bietet zum einen die Möglichkeit, Prozessorelemente im Sinne von Fehlertoleranz als unabhöngige, diensterbringende Einheiten zu benutzen. Andererseits bietet ein Message-Passing-Betriebssystem geeignete Elementaroperationen an, um diese Autonomie in Form von kommunizierenden Prozessen für hahere Softwareschichten im Hinblick auf Gewährleistungsarchitekturevne verfügbar zu machen.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Literatur

R. Abramovitz, R. De Vaughn, R. Patrie: Testability in a RISC Environment, MIPS Computer Systems Inc, Sunnyvale CA, Proceedings, International Conference on Computer Design, Rye, N.Y. 1986 (ICCD 86).
Google Scholar
M. Acetta, R. Baron, W. Bolosky, D. Golub, R. Rashid, A. Tevanian, M. Young: Mach, A New Kernel Foundation for UNIX Development, Proceedings of the USENIX 1986 Summer Technical Conference (Atlanta, Ga., June 9–13). The Usenix Association, El Cerrito, California, 93–112, 1986.
Google Scholar
A. Avizienis: The N-Version Approach to Fault-Tolerant Software, IEEE Transactions on Software Engineering, Vol. SE-11, No. 12, Dec. 1985.
Google Scholar
D. Birell, B. J. Nelson: Implementing Remote Procedure Calls, ACM Transactions on Computer Systems, Vol. 2, No. 1, 39–59, 1984.
Article Google Scholar
M. Blaum, R. Goodman, R. McEliece: The Reliability of Single-Error Protected Computer Memories, IEEE Transactions on Computers, Vol. C-37, No.l, pp. 114–119, Jan. 1988.
Article Google Scholar
D. R. Cheriton, W. Zwaenepoel: The Distributed V Kernel and its Performance for Diskless Workstations, ACM Operating Systems Review, 17, 5, Proceedings of the Ninth ACM Symposium on Operating Systems Principles, Bretton Woods, New Hampshire, 1983.
Google Scholar
K. Echtle: Fehlermaskierung durch verteilte Systeme, Informatik Fachberichte Nr.121, Springer-Verlag, Berlin, Heidelberg, 1986.
Google Scholar
J. Gall: Systemantics; How Systems Work and Especially How They Fail, Kangaroo Pocket Books, New York, 1977.
Google Scholar
R. W. Hockney: MIMD Computing in the USA — 1984, Parallel Computing 2, S. 119–136, 1985.
Google Scholar
R. Kober, Ch. Müller-Schloer, E. Schmitter: Chancen für Parallelarchitekturen, in H. Schwärtzel (Hrsg.), Informatik in der Praxis, Springer—Verlag, Berlin, 1986.
Google Scholar
B W. Lampson: Hints for Computer System Design, ACM Operating Systems Review, 17, 5, Proceedings of the Ninth ACM Symposium on Operating Systems Principles, Bretton Woods, New Hampshire, 1983.
Google Scholar
L. Mancini: Modular Redundancy in a Message Passing System, IEEE Transactions on Software Engineering, Vol. SE-12, No. 1, Jan. 1986.
Google Scholar
R. M. Metcalfe, D. R. Boggs: Ethernet: Distributed Packet Switching for Local Computer Networks, Communications of the ACM, 19, 7, 395–404, 1976.
Article Google Scholar
S. J. Mullander, A. S. Tanenbaum, The Design of a Capability-Based Distributed Operating System, The Computer Journal, Vol. 29, No. 4, 1986.
Google Scholar
J. Nehmer: Experiences with Distributed Systems, Kaiserslautern, FRG, Sept. 1987. In Lecture Notes in Computer Science, J. Nehmer, Springer-Verlag.
Google Scholar
D. A. Rennels: On Implementing Fault-Tolerance in Binary Hypercubes, Research Report, Computer Science Dept., University of California, Los Angeles, California, USA, 2/1986.
Google Scholar
W. Schröder: Eine Familie von UNIX-ähnlichen Betriebssystemen — Anwendung von Prozessen und des Nachrichtenübermittlungskonzeptes beim strukturierten Betriebssystementwurf, Dissertation, TU Berlin, Fachbereich Informatik, 1986.
Google Scholar
W. Schröder: A Distributed Process Execution and Communication Environment for High-Performance Application Systems, Workshop on Experiences with Distributed Systems, Kaiserslautern, FRG, Sept. 1987. In Lecture Notes in Computer Science, J. Nehmer, Springer-Verlag.
Google Scholar
J. Stadler: Error Insurance: New Semiconductor Technology and Algorithms make Memory Error Detection and Correction more attractive, Digital Review, 69–73, Jan. 1986.
Google Scholar
Y. Tamir: Fault Tolerance in VLSI Multicomputers, Ph. D. thesis, CS Division Report No. UCB/CSD 86/256, Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, Aug. 1985.
Google Scholar
M. M. Theimer: Preemptable Remote Execution Facilities for Loosely-Coupled Distributed Systems, Ph. D. thesis, Technical Report STAN-CS-86–1098, Department of Computer Science, Stanford University, 1986.
Google Scholar

Download references

Author information

Authors and Affiliations

Zentralbereich Forschung und Technik, Siemens AG, Otto-Hahn-Ring 6, 8000, München 83, Deutschland
Winfried Seidel

Authors

Winfried Seidel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Fachbereich Informatik, Universität Hamburg, Rothenbaumchaussee 67/69, 2000, Hamburg 13, Deutschland
Rüdiger Valk

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Seidel, W. (1988). Modelle der Fehlertoleranz in Nachrichten-gekoppelten Parallelrechnern. In: Valk, R. (eds) GI — 18. Jahrestagung II. Informatik-Fachberichte, vol 188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-74135-7_28

Download citation

DOI: https://doi.org/10.1007/978-3-642-74135-7_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-50360-6
Online ISBN: 978-3-642-74135-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics