Part of the book series: Studies in Computational Intelligence (SCI, volume 149)

Summary

Unplanned computer system outages are more likely to result from software failures than from hardware failures. Moreover, software applications that execute continuously for long periods show degraded performance and/or an increased rate of hang/crash failures. This phenomenon is referred to as software aging. In this paper, we study virtualization technology and software rejuvenation as a proactive fault management approach to counteract software aging. We present a model to evaluate the effectiveness of the proactive fault management approach combined with virtualization technology in operational software systems, and we express the downtime, and the costs due to downtime during rejuvenation, in terms of the parameters of that model. We derive the model mathematically and use the SHARPE (Symbolic Hierarchical Automated Reliability and Performance Evaluator) tool to evaluate its feasibility. Our results show that the proposed approach can provide uninterrupted availability of the services.
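The chapter's own model and parameter values are not reproduced on this page, so the following is only an illustrative sketch of how downtime and downtime cost can be expressed in terms of model parameters. It assumes a generic four-state continuous-time Markov chain of the kind often used in the software rejuvenation literature (robust, failure-prone, failed, rejuvenating), not the authors' virtualization model. All function names, state names, rates, and cost figures are hypothetical.

```python
def steady_state(aging_rate, failure_rate, trigger_rate, repair_rate, rejuvenation_rate):
    """Steady-state probabilities of an assumed four-state rejuvenation chain.

    Assumed transitions (all rates per hour, hypothetical):
      robust        -> failure_prone  at aging_rate
      failure_prone -> failed         at failure_rate
      failure_prone -> rejuvenating   at trigger_rate
      failed        -> robust         at repair_rate
      rejuvenating  -> robust         at rejuvenation_rate
    """
    if repair_rate <= 0 or rejuvenation_rate <= 0:
        raise ValueError("repair_rate and rejuvenation_rate must be positive")
    if failure_rate + trigger_rate <= 0:
        raise ValueError("the failure-prone state needs at least one exit rate")

    # Solve the balance equations relative to the robust state (weight 1).
    w_robust = 1.0
    w_prone = aging_rate / (failure_rate + trigger_rate)
    w_failed = w_prone * failure_rate / repair_rate
    w_rejuv = w_prone * trigger_rate / rejuvenation_rate

    total = w_robust + w_prone + w_failed + w_rejuv
    return {
        "robust": w_robust / total,
        "failure_prone": w_prone / total,
        "failed": w_failed / total,
        "rejuvenating": w_rejuv / total,
    }


def downtime_and_cost(probs, interval_hours, cost_failure_per_hour, cost_rejuvenation_per_hour):
    """Expected downtime (hours) and downtime cost over an interval.

    Both the failed and the rejuvenating state are counted as down here,
    which is an assumption of this sketch.
    """
    downtime = (probs["failed"] + probs["rejuvenating"]) * interval_hours
    cost = (
        probs["failed"] * cost_failure_per_hour
        + probs["rejuvenating"] * cost_rejuvenation_per_hour
    ) * interval_hours
    return downtime, cost


if __name__ == "__main__":
    # Hypothetical parameter values chosen only to exercise the functions.
    probs = steady_state(
        aging_rate=1 / 240,
        failure_rate=1 / 2160,
        trigger_rate=1 / 336,
        repair_rate=2.0,
        rejuvenation_rate=6.0,
    )
    hours, cost = downtime_and_cost(probs, 8760, 5000.0, 50.0)
    print(f"expected downtime: {hours:.3f} h/year, expected cost: {cost:.2f}")
```

Setting `trigger_rate` to zero gives the same chain without rejuvenation, so the two configurations can be compared under identical failure assumptions.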

Editor information

Roger Lee

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Thein, T., Chi, SD., Park, J.S. (2008). Proactive Fault Management with Virtualization for Software Aging. In: Lee, R. (eds) Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing. Studies in Computational Intelligence, vol 149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70560-4_16

  • DOI: https://doi.org/10.1007/978-3-540-70560-4_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-70559-8

  • Online ISBN: 978-3-540-70560-4

  • eBook Packages: Engineering, Engineering (R0)
