Abstract
The paper describes a system architecture, based on virtual machine layers, which can be used in the implementation of software fault tolerant systems. The top layer of such hierarchy details recovery actions; various policies such as conversations, N-Modularity, Programmer Transparent Coordination, Resilient Procedures and Optimistic Recovery are shown to be implementable within this framework.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
M. Ancona, A. Clematis, V. Gianuzzi, “Interfacing User Processes and Kernel in High Level Languages”, ACM Operating System Review, Jan. 1986.
M. Ancona, A. Clematis, G. Dodero, E.B. Fernandez, V. Gianuzzi, “Using Different Language Levels for Implementing Fault Tolerant Programs”, Euromicro 86 Symp., (to appear on ‘Microprocessing and Microprogramming’, The Euromicro Journal, Spring 1987).
T. Anderson and J.C. Knight, “A framework for software fault tolerance in real time systems”, IEEE Trans, on Software Eng., SE-9, 3 (May 1983), 335-364.
L. Chen and A. Avizienis, “N-Version Programming: A Fault tolerant approach to Reliability of Software Operation”, dig., 8th Annu. Int. Symp. Fault-Tolerant Comput., Madison, Wis., June 21–23, 1978.
K. Kant and A. Silberschatz, “Error propagation and recovery in concurrent environments”, The Computer Journal, 28, 5 (1985), 466–473.
K.H. Kim, “Approaches to mechanization of the conversation scheme based on monitors”, IEEE Trans. on Software Eng., SE-8, 3 (May 1982), 189–197.
K.H. Kim, “A Scheme for Coordinated Execution of Independently Designed Recoverable Distributed Processes”, Proc. FTCS-16, pp.130-135, 1986.
T.J. Le Blanc and S.A. Friedberg, “HPC: A Model of Structure and Change in Distributed Systems”, IEEE Trans. on Computers, Vol.C-34, No.12, Dec.1985.
K.J. Lin, “Resilient procedures-an approach to highly available system”, Proc. Intern. Conf. on Computer Languages, IEEE, 1986.
B. Randell, “System structure for software fault tolerance”, IEEE Trans. on Software Eng., SE-1, (June 1975), 221-232.
R.E. Strom and S. Yemini, “Optimistic recovery in distributed systems”, ACM Trans. On Computer Systems, Vol.3, no.3, Aug.1985, pp.204–226.
S. Thanawastien, R.S. Pamula and Y.L. Varol, “Evaluation of Global Checkpoint Rollback Strategies for Error Recovery in Concurrent Processing Systems”, Proc. FTCS-16, pp.246-251, 1986.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1987 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ancona, M., Clematis, A., Dodero, G., Fernandez, E.B., Gianuzzi, V. (1987). A System Architecture for Software Fault Tolerance. In: Belli, F., Görke, W. (eds) Fehlertolerierende Rechensysteme / Fault-Tolerant Computing Systems. Informatik-Fachberichte, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45628-2_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-45628-2_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-18294-8
Online ISBN: 978-3-642-45628-2
eBook Packages: Springer Book Archive