Abstract
Most of the mission critical real time systems are characterized by the cyclical execution of periodic processes with hard deadlines. Provision of fault tolerance for such systems, has been extensively studied. Most of these treat hardware faults as belonging to the permanent stuck-at category. In this paper we deal with hardware transient faults, which by their intrinsic nature manifest as software faults and thereby lend themselves to correction based on concepts used in the study of software fault tolerance. This scheme which has been proposed to be used in real time embedded computers used in India’s satellite launch vehicle programs is simple to implement and uses an underlying process structure similar to the one proposed by Anderson and Knight [1]. It takes advantage of the natural synchronization existing in such processes and also the fact that the inertia of the physical systems in which these controllers are embedded absorb minor temporary deviations in their outputs so long as subsequent outputs are correct.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Thomas Anderson and John C. Knight: A Framework for Software Fault Tolerance in Real Time Systems, IEEE Trans. Software Eng., Vol. SE-9, No.3, (1983), 355–364.
K.H. Kim and Howard O. Welch: Distributed Execution of Recovery Blocks: An Approach for Uniform Treatment of Hardware and Software Faults in Real Time Applications, IEEE Trans. Computers, Vol. 38, No.5, (1989), 626–636.
Jaynarayan H. Lala and Richard E. Harper: Architectural Principles for Safety— Critical Real Time Applications, Proceedings of the IEEE, Vol. 82, No. 1, (1994), 25–40.
Krishna Kant: Performance Analysis of Real Time Software Supporting Fault-Tolerant Operation, IEEE Trans. Computers, Vol. 39, No. 7, (1990), 906–918.
A. Avizienis: The N-Version Approach to Fault-Tolerant Software, IEEE Trans. Software Eng., Vol. SE-11, No. 12, (1985), 1491–1501.
Brian Randell: System Structure for Software Fault Tolerance, IEEE Trans. Software Eng., Vol. SE-1, No. 2, (1975), 220–232.
Jia Xu and David Lorge Parnas: On Satisfying Timing Constraints in Hard-Real Time Systems, IEEE Trans. Software Eng., Vol. SE-19, No. 1, (1993), 70–84.
C.L. Liu and J.W. Layland: Scheduling Algorithms for Multiprogramming in a Hard Real Time Environment, J. Ass. Comput. Mach., Vol. 20, (1973), 46–61.
Gerry Kane, Doug Hawkins, Lance Leventhal: 68000 Assembly Language Programming, OSBORNE/McGraw Hill, (1986).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Basu, D., Paramasivam, R. (2000). An Approach to Software Assisted Recovery from Hardware Transient Faults for Real Time Systems. In: Koornneef, F., van der Meulen, M. (eds) Computer Safety, Reliability and Security. SAFECOMP 2000. Lecture Notes in Computer Science, vol 1943. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40891-6_23
Download citation
DOI: https://doi.org/10.1007/3-540-40891-6_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41186-4
Online ISBN: 978-3-540-40891-8
eBook Packages: Springer Book Archive