A Fault-Tolerant Distributed Legacy-Based System and Its Evaluation

  • Conference paper
Dependable Computing (LADC 2003)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2847)

Abstract

In this paper, we present a complete architecture for improving the dependability of complex COTS and legacy-based systems. For long-lived applications, such as most of those now being built by integrating legacy subsystems, fault treatment is a very important part of the fault tolerance strategy. The paper advocates careful diagnosis and damage assessment, together with precise and effective recovery actions tailored to the fault at hand and/or to the extent of the damage in the affected component. In our proposal, threshold-based mechanisms are exploited to trigger alternative actions. The design and implementation of the resulting solution are illustrated through a case study: a distributed architectural framework that handles replicated legacy-based subsystems, in which replication and voting are used for error detection and masking. An experimental prototype deployed over a COTS-based LAN is described; it has allowed a dependability analysis that combines direct measurements and analytical modeling.
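
The abstract names two mechanisms without detailing them: voting over replicated subsystems, and threshold-based triggering of alternative actions. The following is a minimal sketch of how the two could fit together, not the paper's implementation. The names `majority_vote` and `ThresholdDiagnoser`, the decaying error score, and the parameter values `decay=0.5` and `threshold=2.0` are all assumptions made for illustration.

```python
from collections import Counter


def majority_vote(replies):
    """Vote over replica replies given as {replica_id: value}.

    Returns (value, dissenters). value is None when no strict majority
    exists, in which case every replica is reported as a dissenter.
    """
    if not replies:
        return None, []
    value, votes = Counter(replies.values()).most_common(1)[0]
    if votes * 2 <= len(replies):
        return None, sorted(replies)
    return value, sorted(r for r, v in replies.items() if v != value)


class ThresholdDiagnoser:
    """Hypothetical count-and-threshold diagnosis (illustrative only).

    Each replica carries a score that grows by 1 on an erroneous reply
    and is multiplied by `decay` on a correct one. Crossing `threshold`
    suggests the fault is not transient and that a stronger recovery
    action may be warranted.
    """

    def __init__(self, decay=0.5, threshold=2.0):
        self.decay = decay
        self.threshold = threshold
        self.score = {}

    def observe(self, replica, erred):
        prev = self.score.get(replica, 0.0)
        self.score[replica] = prev + 1.0 if erred else prev * self.decay
        return self.score[replica] > self.threshold


def run(rounds, diagnoser):
    """Vote on each round and collect replicas that cross the threshold."""
    results = []
    for replies in rounds:
        value, dissenters = majority_vote(replies)
        flagged = [r for r in sorted(replies)
                   if diagnoser.observe(r, r in dissenters)]
        results.append((value, flagged))
    return results
```

With these assumed parameters, a replica that disagrees with the majority in three consecutive rounds is flagged on the third round, while an isolated disagreement decays away. The voted value still masks the error in every round where a strict majority exists.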



© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bondavalli, A., Chiaradonna, S., Cotroneo, D., Romano, L. (2003). A Fault-Tolerant Distributed Legacy-Based System and Its Evaluation. In: de Lemos, R., Weber, T.S., Camargo, J.B. (eds) Dependable Computing. LADC 2003. Lecture Notes in Computer Science, vol 2847. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45214-0_22

  • DOI: https://doi.org/10.1007/978-3-540-45214-0_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20224-0

  • Online ISBN: 978-3-540-45214-0

  • eBook Packages: Springer Book Archive
