Fault tolerance in distributed UNIX

Borg, Anita; Blau, Wolfgang; Oberle, Wolfgang; Graetsch, Wolfgang

doi:10.1007/BFb0042339

Anita Borg¹,
Wolfgang Blau²,
Wolfgang Oberle² &
…
Wolfgang Graetsch²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 448))

128 Accesses

Abstract

An initial design for a fault tolerant, distributed version of UNIX was presented in an earlier paper [2]. That design left a number of open questions in two particular areas: Fault tolerance for server processes through which peripherals are accessed; recovery after a crash including the re-backup of processes. Since then, the fundamental design involving three-way message transmission has remained unchanged. However, server fault tolerance has been redesigned and is now more consistent with the fault tolerance of normal user processes. Recovery and re-backup have been completed in a more efficient manner than previously envisioned. In addition, important changes in the implementation have occurred. In this paper, we review the original design, borrowing heavily from the earlier paper in sections 1–3, and explain additions and modifications in later sections.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bartlett, J. A NonStop Kernel. Eighth Symposium on Operating Systems Principles, December, 1981.
Google Scholar
Borg, A., Baumbach, J., Glazer, S. A Message System Supporting Fault Tolerance. Ninth Symposium on Operating Systems Principles, October, 1983.
Google Scholar
Walter, B. A Robust and Efficient Protocol for Checking the Availability of Remote Sites. Sixth Workshop on Distributed Data Management and Computer Networks, December, 1982.
Google Scholar

Download references

Author information

Authors and Affiliations

Western Research Laboratory, Digital Equipment Corporation, Palo Alto, Ca.
Anita Borg
Nixdorf Computer, Paderborn, West Germany
Wolfgang Blau, Wolfgang Oberle & Wolfgang Graetsch

Authors

Anita Borg
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Blau
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Oberle
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Graetsch
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Barbara Simons Alfred Spector

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Borg, A., Blau, W., Oberle, W., Graetsch, W. (1990). Fault tolerance in distributed UNIX. In: Simons, B., Spector, A. (eds) Fault-Tolerant Distributed Computing. Lecture Notes in Computer Science, vol 448. Springer, New York, NY. https://doi.org/10.1007/BFb0042339

Download citation

DOI: https://doi.org/10.1007/BFb0042339
Published: 08 June 2005
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-97385-2
Online ISBN: 978-0-387-34812-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics