Skip to main content
Log in

Replication Management in Reliable Real-Time Systems

  • Published:
Real-Time Systems Aims and scope Submit manuscript

Abstract

Building reliable real-time applications on top of commercial off-the-shelf (COTS) components is not a straightforward task. Thus, it is essential to provide a simple and transparent programming model, in order to abstract programmers from the low-level implementation details of distribution and replication. However, the recent trend for incorporating pre-emptive multitasking applications in reliable real-time systems inherently increases its complexity. It is therefore important to provide a transparent programming model, enabling pre-emptive multitasking applications to be implemented without resorting to simultaneously dealing with both system requirements and distribution and replication issues. The distributed embedded architecture using COTS components (DEAR-COTS) architecture has been previously proposed as an architecture to support real-time and reliable distributed computer-controlled systems (DCCS) using COTS components. Within the DEAR-COTS architecture, the hard real-time subsystem provides a framework for the development of reliable real-time applications, which are the core of DCCS applications. This paper presents the proposed framework, and demonstrates how it can be used to support the transparent replication of software components.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Audsley, A. N., Burns, A., Richardson, M., Tindell, K., and Wellings, A. 1993. Applying new scheduling theory to static priority pre-emptive scheduling. Software Engineering Journal 8(5): 285-292.

    Google Scholar 

  • Barrett, P. A., Burns, A., and Wellings, A. J. 1995. Models of replication for safety critical hard real-time systems. In Proceedings of the 20th IFAC/IFIP Workshop on Real-Time Programming. USA, pp. 181-188.

  • Bondavalli, A., Giandomenico, F. D., Grandoni, F., Powell, D., and Rabejac, C. 1998. State restoration in a COTS-based N-modular architecture. In Proceedings of the First International Symposium in Object-Oriented Real-Time Distributed Computing. Japan, pp. 174-183.

  • Burns, A. 1997. Session summary: Tasking profiles. In Proceedings of the 8th International Real-Time Ada Workshop, Ravenscar, England. Ada Letters, XVII(5): 5-7. ACM Press.

    Google Scholar 

  • ISO 11898. 1993. Road Vehicle-Interchange of Digital Information-Controller Area Network (CAN) for High-Speed Communication, ISO.

  • ISO/IEC 8652. 1995. Information technology-Programming Languages-Ada. Ada Reference Manual, ISO/IEC.

  • Johnson, S., Jahanian, F., Ghosh, S., VanVoorst, B., and Weininger, N. 2000. Experiences with group communication middleware. In Proceedings of the International Conference on Dependable Systems and Networks. New York City, USA, pp. 37-42.

  • Keickhafer, R. M., Walter, C. J., Finn, A. M., and Thambidurai, P. M. 1988. The MAFT architecture for distributed fault tolerance. IEEE Transactions on Computers 37(4): 398-404.

    Google Scholar 

  • Kopetz, H., Damm, A., Koza, C., Mulazzani, M., Schwabl, W., Senft, C., and Zainlinger, R. 1989. Distributed fault-tolerant real-time systems: The Mars approach. IEEE Micro 9(1): 25-41.

    Google Scholar 

  • Laprie, J. L. 1992. Dependability: Basic Concepts and Terminology. Dependable Computing and Fault-Tolerant Systems, Vol. 5. Berlin: Springer Verlag.

    Google Scholar 

  • Melliar-Smith, P. M., and Schwartz, R. L. 1982. Formal specification and mechanical verification of sift: A fault tolerance flight control system. IEEE Transactions on Computers 31(7): 616-630.

    Google Scholar 

  • Pinho, L. 2001. A framework for the transparent replication of real-time applications. Ph.D. thesis. School of Engineering of the University of Porto, Portugal. Available at http://www.hurray.isep.ipp.pt

    Google Scholar 

  • Pinho, L., and Vasques, F. 2001. Timing analysis of reliable real-time communication in CAN networks. In Proceedings of the 13th Euromicro Conference on Real-Time Systems. Delft, The Netherlands, pp. 103-112.

    Google Scholar 

  • Poledna, S. 1994. Replica determinism in distributed real-time systems: A brief survey. Real-Time Systems 6(3): 289-316.

    Google Scholar 

  • Poledna, S. 1998. Deterministic operation of dissimilar replicated task sets in fault-tolerant distributed real-time systems. In Proceedings of the dependable computing for critical applications 6. Grainau, Germany, pp. 103-119.

    Google Scholar 

  • Poledna, S., Burns, A., Wellings, A., and Barret, P. 2000. Replica determinism and flexible scheduling in hard real-time dependable systems. IEEE Transactions on Computers 49(2): 100-111.

    Google Scholar 

  • Powell, D. (ed.). 1991. Delta-4-A Generic Architecture for Dependable Distributed Computing. ESPRIT Research Reports. Berlin: Springer Verlag.

    Google Scholar 

  • Powell, D. 1994. Distributed fault tolerance-lessons learnt from Delta-4. Hardware and software architectures for fault tolerance. In Banatre, M., and Lee P. A. (eds.), Experiences and Perspectives. Lecture Notes in Computer Science 774. Berlin: Springer Verlag, pp. 199-217.

    Google Scholar 

  • Powell, D. (ed.) 2001. A Generic Fault-Tolerant Architecture for Real-Time Dependable Systems. Dordrecht: Kluwer Academic Publishers.

    Google Scholar 

  • Pradhan, D. K. 1996. fault-tolerant Computer System Design. Prentice Hall.

  • Rufino, J., Veríssimo, P., Arroz, G., Almeida, C., and Rodrigues, L. 1998. Fault-tolerant broadcasts in CAN. In Proceedings of the Symposium on Fault-Tolerant Computing. Munich, Germany, pp. 150-159.

  • Rushby, J. 1996. Reconfiguration and transient recovery in state machines architectures. In Proceedings of the 26th Symposium on Fault-Tolerant Computing. Sendai, Japan, pp. 6-15.

  • Schneider, F. 1990. Implementing fault-tolerant services using the state machine approach: A tutorial. ACM Computing Surveys 22(4): 299-319.

    Google Scholar 

  • Veríssimo, P., Casimiro, A., and Fetzer, C. 2000. The timely computing base: Timely actions in the presence of uncertain timeliness. In Proceedings of the International Conference on Dependable Systems and Networks. New York City, USA, pp. 533-542.

  • Veríssimo, P., Casimiro, A., Pinho, L. M., Vasques, F., Rodrigues, L., and Tovar, E. 2000. Distributed computer-controlled systems: The DEAR-COTS approach. In Proceedings of the 16th IFAC Workshop on Distributed Computer Control Systems. Sydney, Australia, pp. 128-135.

  • Wellings, A. J., Beus-Dukic, Lj., and Powell, D. 1998. Real-time scheduling in a generic fault-tolerant architecture. In Proceedings of the IEEE Real-Time Systems Symposium. Madrid, Spain, pp. 390-398.

  • Yeh, Y. 1995. Dependability of the 777 primary flight control system. In Proceedings of the Dependable Computing for Critical Applications 5. Urbana-Champaign, USA, pp. 1-13.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pinho, L.M., Vasques, F. & Wellings, A. Replication Management in Reliable Real-Time Systems. Real-Time Systems 26, 261–296 (2004). https://doi.org/10.1023/B:TIME.0000018248.18519.46

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/B:TIME.0000018248.18519.46

Navigation