Abstract
Real-time systems are one of the most important applications of computers, both in commercial terms and in terms of social impact. Increasingly, real-time computers are used to control life-critical applications and need to meet stringent reliability conditions. Since the reliability of a real-time system is related to the probability of meeting its hard deadlines, these reliability requirements translate to the need to meet critical task deadlines with a very high probability. We survey the problem of how to schedule tasks in such a way that deadlines continue to be met despite processor (permanent or transient) or software failure.
- K. Ahn, J. Kim, and S. Hong. 1997. Fault-Tolerant Real-Time Scheduling Using Passive Replicas. In Proceedings of the Pacific Rim International Symposium on Fault-Tolerance. 98--103. Google ScholarDigital Library
- R. Al-Omari, G. Manimaran, and A. K. Somani. 2001. An Efficient Backup-Overloading for Fault-Tolerant Scheduling of Real-Time Tasks. In Proceedings of the International Parallel Processing Symposium. 1291--1295. Google ScholarDigital Library
- A. A. Bertossi, L. V. Mancini, and A. Menapace. 2006. Scheduling Hard-Real-Time Tasks with Backup Phasing Delay. In Proceedings of the IEEE Symposium on Distributed Simulation and Real-Time Applications (DS-RT). Google ScholarDigital Library
- A. A. Bertossi, L. V. Mancini, and F. Rossini. 1999. Fault-Tolerant Rate-Monotonic First-Fit Scheduling in Hard-Real-Time Systems. IEEE Transactions on Parallel and Distributed Systems 10, 9(September 1999), 934--945. Google ScholarDigital Library
- A. Burns, R. Davis, and S. Punnekat. 1996. Feasibility Analysis of Fault-Tolerant Real-Time Task Sets. In Proceedings of the 8th Euromicro Workshop on Real-Time Systems (EUROWRTS). 29--33.Google Scholar
- M. Caccamo and M. Buttazzo. 1998. Optimal Scheduling for Fault-Tolerant and Firm Real-Time Systems. In Proceedings of the IEEE Conference on Real-Time Computing Systems and Applications (RTCSA). Google ScholarDigital Library
- A. Cheng. 2002. Real-Time Systems: Scheduling, Analysis and Verification. Wiley-Interscience. Google ScholarDigital Library
- T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. 2004. Introduction to Algorithms. MIT Press. Google ScholarDigital Library
- M. S. Garey and D. S. Johnson. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman. Google ScholarDigital Library
- S. Ghosh, R. Melhem, and D. Mosse. 1972. Fault-Tolerance Through Scheduling of Aperiodic Tasks in Hard Real-Time Multiprocessor Systems. IEEE Transactions on Parallel and Distributed Systems 8, 3 (March 1997), 272--283. Google ScholarDigital Library
- O. Gonzalez, H. Shrikumar, J. A. Stankovic, and K. Ramamritham. 1997. Adaptive Fault Tolerance and Graceful Degradation Under Dynamic Hard Real-time Scheduling. In Proceedings of the IEEE Real-Time Systems Symposium. 79--89. Google ScholarDigital Library
- R. L. Graham. 1969. Bounds on Multiprocessing Timing Anomalies. SIAM Journal of Applied Mathematics 17, 2 (March 1969), 416--429.Google ScholarCross Ref
- C.-C. Han, K. G. Shin, and J. Wu. 2003. A Fault-Tolerant Scheduling Algorithm for Real-Time Periodic Tasks with Possible Software Faults. IEEE Transactions on Computers 52, 3 (March 2003), 363--372. Google ScholarDigital Library
- F. S. Hillier and G. J. Lieberman. 2001. Introduction to Operations Research. McGraw-Hill. Google ScholarDigital Library
- B. Johnson. 1989. The Design and Analysis of Fault-Tolerant Digital Systems. Addison-Wesley.Google Scholar
- M. Joseph and P. Pandya. 1986. Finding Response Times in a Real-Time System. Computer Journal 29, 5 (October 1986), 390--395.Google ScholarCross Ref
- H. Kopetz. 1997. Real-Time Systems. Kluwer Academic Publishers.Google Scholar
- H. Kopetz and G. Bauer. 2003. The Time-Triggered Architecture. Proceedings of the IEEE 91, 1 (January 2003), 112--126.Google ScholarCross Ref
- H. Kopetz and D. Millinger. 1999. The Transparent Implementation of Fault Tolerance in the Time-Triggered Architecture. In Dependable Computing for Critical Applications, A. Avizienis, H. Kopetz, and J. C. Laprie (Eds.), 192--205. Google ScholarDigital Library
- H. Kopetz and W. Ochsenreiter. 1987. Clock Synchronization in Distributed Real-Time Systems. IEEE Transactions on Computers C-36, 933--940. Google ScholarDigital Library
- I. Koren and C. M. Krishna. 2007. Fault-Tolerant Systems. Morgan Kaufmann. Google ScholarDigital Library
- C. M. Krishna and K. G. Shin. 1986. Scheduling Tasks with a Quick Recovery from Failure. IEEE Transactions on Computers C-35, 5 (May 1986), 448--455. Google ScholarDigital Library
- C. M. Krishna and K. G. Shin. 1987. Performance Measures for Control Computers. IEEE Transactions on Automatic Control AC-32, 6, 467--473.Google ScholarCross Ref
- C. M. Krishna and K. G. Shin. 1997. Real-Time Systems. McGraw-Hill. Google ScholarDigital Library
- J. P. Lehoczky, L. Sha, and J. K. Strosnider. 1987. Enhanced Aperiodic Responsiveness in Hard Real-Time Environments. In Proceedings of the IEEE Real-Time Systems Symposium. 261--270.Google Scholar
- W. Liao, L. He, and K. M. Lepak. 2005. Temperature and Supply Voltage Aware Performance and Power Modeling at Microarchitecture Level. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 24, 7 (July 2005), 1042--1053. Google ScholarDigital Library
- F. Liberato, R. Melhem, and D. Mosse. 2000. Tolerance to Multiple Transient Faults in Hard Real-Time Systems. IEEE Transactions on Computers 49, 9 (September 2000), 906--914. Google ScholarDigital Library
- A. L. Liestman and R. H. Campbell. 1986. A Fault-Tolerant Scheduling Problem. IEEE Transactions on Software Engineering 12, 11 (November 1986), 1089--1095. Google ScholarDigital Library
- C. L. Liu and J. W. Layland. 1973. Scheduling Algorithms for Multiprogramming in a Hard Real-Time Environment. Journal of the ACM 20, 1 (January 1973), 40--61. Google ScholarDigital Library
- J. W. S. Liu. 2000. Real-Time Systems. Wiley.Google Scholar
- C. Siva Ram Murthy and G. Manimaran. 2001. Resource Management in Real-Time Systems and Networks. MIT Press. Google ScholarDigital Library
- G. Manimaran and C. Siva Ram Murthy. 1998. A Fault-Tolerant Dynamic Scheduling Algorithm for Multiprocessor Real-Time Systems and Its Analysis. IEEE Transactions on Parallel and Distributed Processing Systems 9, 11 (November 1998), 1137--1152. Google ScholarDigital Library
- M. Naedele. 1999. Fault-Tolerant Real-Time Scheduling Under Real-Time Constraints. In Proceedings of the International Workshop on Real-Time Computing Systems and Applications (RTCSA). 392--395. Google ScholarDigital Library
- N. Nissanke. 1997. Realtime Systems. Prentice Hall. Google ScholarDigital Library
- Y. Oh and S. H. Son. 1992. An Algorithm for Real-Time Fault-Tolerant Scheduling in Multiprocessor Systems. In Proceedings of the Euromicro Workshop on Real-Time Systems. 190--195.Google Scholar
- M. Pandya and M. Malek. 1998. Minimum Achievable Utilization for Fault-Tolerant Processing of Periodic Tasks. IEEE Transactions on Computers 47, 10 (October 1998), 1102--1112. Google ScholarDigital Library
- E. L. Petersen. 1997. Predictions and Observations of SEU Rates in Space. IEEE Transactions on Nuclear Science 44, 6 (December 1997), 2174--2187.Google ScholarCross Ref
- S. Poledna, A. Burns, A. Wellings, and P. Barretta. 2000. Replica Determinism and Flexible Scheduling in Hard Real-Time Dependable Systems. IEEE Transactions on Computers 49, 2 (February 2000), 100--111. Google ScholarDigital Library
- P. Pop, K. H. Poulsen, V. Izosimov, and P. Eles. 2007. Scheduling and Voltage Scaling for Energy/Reliability Trade-Offs in Fault-Tolerant Time-Triggered Embedded Systems. CODES+ISSS. 233--238. Google ScholarDigital Library
- D. K. Pradhan. 1996. Fault-Tolerant Computer System Design. Prentice Hall. Google ScholarDigital Library
- K. Ramamritham, J. A. Stankovic, and P.-F. Shiah. 1994. Efficient Scheduling Algorithms for Real-Time Multiprocessor Systems. Proceedings of the IEEE 82, 1 (January 1994), 55--67.Google ScholarCross Ref
- R. M. Santos, J. Santos, and J. D. Orozco. 2009. Power Saving and Fault-Tolerance in Real-Time Critical Embedded Systems. Journal of Systems Architecture 55, 90--101. Google ScholarDigital Library
- K. G. Shin and X. Cui. 1995. Computing Time Delay and Its Effects on Real-Time Control Systems. IEEE Transactions on Control Systems Technology 3, 2 (June 1995), 218--224.Google ScholarCross Ref
- K. G. Shin and C. M. Krishna. 1987. Performance Measures for Control Computers. IEEE Transactions on Automatic Control AC-32, 6 (June 1987), 467--473.Google Scholar
- D. Siewiorek and R. Swarz. 1999. Reliable Computer Systems: Design and Evaluation. A. K. Peters. Google ScholarDigital Library
- N. Speirs and P. Barrett. 1989. Using Passive Replicates in Delta-4 to Provide Dependable Distributed Computing. In Proceedings of the Fault-Tolerant Computing Symposium (FTCS-19). 184--190.Google Scholar
- J. A. Stankovic and K. Ramamritham. 1989. The Spring Kernel: A New Paradigm for Real-Time Operating Systems. ACM Operating Systems Review 23, 3 (July 1989), 54--71. Google ScholarDigital Library
- T. Tsuchiya, Y. Kakuda, and T. Kikuno. 1995a. Fault-Tolerant Scheduling Algorithm for Distributed Real-Time Systems. 1995. In Proceedings of the 3rd Workshop on Parallel and Distributed Real-Time Systems. 99--103. Google ScholarDigital Library
- T. Tsuchiya, Y. Kakuda, and T. Kikuno. 1995b. A New Fault-Tolerant Scheduling Technique for Real-Time Multiprocessor Systems. In Proceedings of the International Workshop on Real-Time Computing Systems and Applications (RTCSA). 197--202. Google ScholarDigital Library
- O. S. Unsal and I. Koren. 2003. System-Level Power-Aware Design Techniques in Real-Time Systems. Proceedings of the IEEE 91, 7 (July 2003), 1055--1069.Google ScholarCross Ref
- D. Zhu, R. Melhem, and D. Mosse. 2004. The Effects of Energy Management on Reliability in Real-Time Embedded Systems. In Proceedings of the International Conference on Computer-Aided Design (ICCAD). 35--40. Google ScholarDigital Library
Index Terms
- Fault-tolerant scheduling in homogeneous real-time systems
Recommendations
A new fault-tolerant scheduling technique for real-time multiprocessor systems
RTCSA '95: Proceedings of the 2nd International Workshop on Real-Time Computing Systems and ApplicationsIn multiprocessors used for implementing real-time applications, it is strongly required to ensure that tasks' deadlines are met in spite of faults. In this paper, we discuss fault-tolerant task scheduling techniques in real-time multiprocessor systems ...
Task-Level Probabilistic Scheduling Guarantees for Dependable Real-Time Systems - A Designer Centric Approach
ISORCW '11: Proceedings of the 2011 14th IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing WorkshopsDependable real-time systems typically consist of tasks of mixed-criticality levels with associated fault tolerance (FT) requirements and scheduling them in a fault-tolerant manner to efficiently satisfy these requirements is a challenging problem. From ...
A fault-tolerant time-triggered scheduling algorithm of mixed-criticality systems
AbstractReal-time and safety-critical systems are an integration of multiple functionalities onto a single computing platform. Some of the functionalities are safety-critical and subject to certification while the rest of the functionalities are nonsafety-...
Comments