Abstract
Building reliable real-time applications on top of commercial off-the-shelf (COTS) components is not a straightforward task. Thus, it is essential to provide a simple and transparent programming model, in order to abstract programmers from the low-level implementation details of distribution and replication. However, the recent trend for incorporating pre-emptive multitasking applications in reliable real-time systems inherently increases its complexity. It is therefore important to provide a transparent programming model, enabling pre-emptive multitasking applications to be implemented without resorting to simultaneously dealing with both system requirements and distribution and replication issues. The distributed embedded architecture using COTS components (DEAR-COTS) architecture has been previously proposed as an architecture to support real-time and reliable distributed computer-controlled systems (DCCS) using COTS components. Within the DEAR-COTS architecture, the hard real-time subsystem provides a framework for the development of reliable real-time applications, which are the core of DCCS applications. This paper presents the proposed framework, and demonstrates how it can be used to support the transparent replication of software components.
Similar content being viewed by others
References
Audsley, A. N., Burns, A., Richardson, M., Tindell, K., and Wellings, A. 1993. Applying new scheduling theory to static priority pre-emptive scheduling. Software Engineering Journal 8(5): 285-292.
Barrett, P. A., Burns, A., and Wellings, A. J. 1995. Models of replication for safety critical hard real-time systems. In Proceedings of the 20th IFAC/IFIP Workshop on Real-Time Programming. USA, pp. 181-188.
Bondavalli, A., Giandomenico, F. D., Grandoni, F., Powell, D., and Rabejac, C. 1998. State restoration in a COTS-based N-modular architecture. In Proceedings of the First International Symposium in Object-Oriented Real-Time Distributed Computing. Japan, pp. 174-183.
Burns, A. 1997. Session summary: Tasking profiles. In Proceedings of the 8th International Real-Time Ada Workshop, Ravenscar, England. Ada Letters, XVII(5): 5-7. ACM Press.
ISO 11898. 1993. Road Vehicle-Interchange of Digital Information-Controller Area Network (CAN) for High-Speed Communication, ISO.
ISO/IEC 8652. 1995. Information technology-Programming Languages-Ada. Ada Reference Manual, ISO/IEC.
Johnson, S., Jahanian, F., Ghosh, S., VanVoorst, B., and Weininger, N. 2000. Experiences with group communication middleware. In Proceedings of the International Conference on Dependable Systems and Networks. New York City, USA, pp. 37-42.
Keickhafer, R. M., Walter, C. J., Finn, A. M., and Thambidurai, P. M. 1988. The MAFT architecture for distributed fault tolerance. IEEE Transactions on Computers 37(4): 398-404.
Kopetz, H., Damm, A., Koza, C., Mulazzani, M., Schwabl, W., Senft, C., and Zainlinger, R. 1989. Distributed fault-tolerant real-time systems: The Mars approach. IEEE Micro 9(1): 25-41.
Laprie, J. L. 1992. Dependability: Basic Concepts and Terminology. Dependable Computing and Fault-Tolerant Systems, Vol. 5. Berlin: Springer Verlag.
Melliar-Smith, P. M., and Schwartz, R. L. 1982. Formal specification and mechanical verification of sift: A fault tolerance flight control system. IEEE Transactions on Computers 31(7): 616-630.
Pinho, L. 2001. A framework for the transparent replication of real-time applications. Ph.D. thesis. School of Engineering of the University of Porto, Portugal. Available at http://www.hurray.isep.ipp.pt
Pinho, L., and Vasques, F. 2001. Timing analysis of reliable real-time communication in CAN networks. In Proceedings of the 13th Euromicro Conference on Real-Time Systems. Delft, The Netherlands, pp. 103-112.
Poledna, S. 1994. Replica determinism in distributed real-time systems: A brief survey. Real-Time Systems 6(3): 289-316.
Poledna, S. 1998. Deterministic operation of dissimilar replicated task sets in fault-tolerant distributed real-time systems. In Proceedings of the dependable computing for critical applications 6. Grainau, Germany, pp. 103-119.
Poledna, S., Burns, A., Wellings, A., and Barret, P. 2000. Replica determinism and flexible scheduling in hard real-time dependable systems. IEEE Transactions on Computers 49(2): 100-111.
Powell, D. (ed.). 1991. Delta-4-A Generic Architecture for Dependable Distributed Computing. ESPRIT Research Reports. Berlin: Springer Verlag.
Powell, D. 1994. Distributed fault tolerance-lessons learnt from Delta-4. Hardware and software architectures for fault tolerance. In Banatre, M., and Lee P. A. (eds.), Experiences and Perspectives. Lecture Notes in Computer Science 774. Berlin: Springer Verlag, pp. 199-217.
Powell, D. (ed.) 2001. A Generic Fault-Tolerant Architecture for Real-Time Dependable Systems. Dordrecht: Kluwer Academic Publishers.
Pradhan, D. K. 1996. fault-tolerant Computer System Design. Prentice Hall.
Rufino, J., Veríssimo, P., Arroz, G., Almeida, C., and Rodrigues, L. 1998. Fault-tolerant broadcasts in CAN. In Proceedings of the Symposium on Fault-Tolerant Computing. Munich, Germany, pp. 150-159.
Rushby, J. 1996. Reconfiguration and transient recovery in state machines architectures. In Proceedings of the 26th Symposium on Fault-Tolerant Computing. Sendai, Japan, pp. 6-15.
Schneider, F. 1990. Implementing fault-tolerant services using the state machine approach: A tutorial. ACM Computing Surveys 22(4): 299-319.
Veríssimo, P., Casimiro, A., and Fetzer, C. 2000. The timely computing base: Timely actions in the presence of uncertain timeliness. In Proceedings of the International Conference on Dependable Systems and Networks. New York City, USA, pp. 533-542.
Veríssimo, P., Casimiro, A., Pinho, L. M., Vasques, F., Rodrigues, L., and Tovar, E. 2000. Distributed computer-controlled systems: The DEAR-COTS approach. In Proceedings of the 16th IFAC Workshop on Distributed Computer Control Systems. Sydney, Australia, pp. 128-135.
Wellings, A. J., Beus-Dukic, Lj., and Powell, D. 1998. Real-time scheduling in a generic fault-tolerant architecture. In Proceedings of the IEEE Real-Time Systems Symposium. Madrid, Spain, pp. 390-398.
Yeh, Y. 1995. Dependability of the 777 primary flight control system. In Proceedings of the Dependable Computing for Critical Applications 5. Urbana-Champaign, USA, pp. 1-13.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Pinho, L.M., Vasques, F. & Wellings, A. Replication Management in Reliable Real-Time Systems. Real-Time Systems 26, 261–296 (2004). https://doi.org/10.1023/B:TIME.0000018248.18519.46
Issue Date:
DOI: https://doi.org/10.1023/B:TIME.0000018248.18519.46