Carrying the Crash-Only Software Concept to the Legacy Application Servers

  • Chapter
Making Grids Work

Abstract

In the last few years, high availability of Internet services has become a main goal for both academia and industry. Internet service systems are complex and heterogeneous, and they are prone to transient failures and even crashes. Because developing systems that are guaranteed never to crash and never to suffer transient or intermittent failures seems impractical and infeasible, there is a need for mechanisms that let a system handle crashes and transient failures as if they were clean shutdowns. Building on this idea, the creators of the crash-only software concept proposed a new design strategy for obtaining crash-safe, fast-recovering systems, defining a list of laws that must be followed to achieve that goal. However, their proposals focus on the design of new systems. For this reason, we discuss how to make legacy systems crash-safe and capable of fast, masked self-recovery by following the ideas behind the crash-only software concept. Our work focuses on legacy application servers because they are one of the more sensitive pieces of the Internet services puzzle.




Copyright information

© 2008 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Alonso, J., Torres, J., Silva, L. (2008). Carrying the Crash-Only Software Concept to the Legacy Application Servers. In: Making Grids Work. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78448-9_13


  • DOI: https://doi.org/10.1007/978-0-387-78448-9_13

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-78447-2

  • Online ISBN: 978-0-387-78448-9

  • eBook Packages: Computer Science, Computer Science (R0)
