Carrying the Crash-Only Software Concept to the Legacy Application Servers

Alonso, Javier; Torres, Jordi; Silva, Luis

doi:10.1007/978-0-387-78448-9_13

Abstract

In recent years, high availability of Internet services has become a major goal for both academia and industry. Internet service systems are complex and heterogeneous, and they are prone to transient failures and even crashes. Because building systems that are guaranteed never to crash and never to suffer transient or intermittent failures seems impractical, there is a need for mechanisms that let a system handle crashes and transient failures as if they were clean shutdowns. Following this idea, the creators of the crash-only software concept proposed a new design strategy for obtaining crash-safe, fast-recovering systems, defining a set of laws that a system must follow to achieve that goal. However, their proposals focus on the design of new systems. For this reason, we discuss how to build crash-safe legacy systems with masked, fast self-recovery, following the ideas behind the crash-only software concept. Our work focuses on legacy application servers because they are one of the more sensitive pieces of the Internet services puzzle.

Author information

Authors and Affiliations

Technical University of Catalonia, Barcelona Supercomputing Center, Barcelona, Spain
Javier Alonso & Jordi Torres
University of Coimbra CISUC, Portugal
Luis Silva

About this chapter

Cite this chapter

Alonso, J., Torres, J., Silva, L. (2008). Carrying the Crash-Only Software Concept to the Legacy Application Servers. In: Making Grids Work. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78448-9_13

DOI: https://doi.org/10.1007/978-0-387-78448-9_13
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-78447-2
Online ISBN: 978-0-387-78448-9
eBook Packages: Computer Science (R0)
