Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 615))

Abstract

The success of ever-growing, warehouse-sized Cloud data centers, built to respond to the increasing demand for computing resources, depends on their ability to provide reliability and availability at scale. Providing dependable and secure systems and services requires security controls capable of avoiding, coping with, and recovering from failures. However, dependability and security of services at any cost is not a viable solution for Cloud providers. In this paper, two state-of-the-art fault-tolerance techniques are compared in terms of the availability of services to consumers and the energy costs to Cloud providers. The results show that the proactive fault-tolerance technique outperforms traditional redundancy in terms of cost to Cloud users, while still providing available compute environments and services to consumers.

Author information

Corresponding author

Correspondence to Altino M. Sampaio.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Sampaio, A.M., Barbosa, J.G. (2017). A Comparative Cost Study of Fault-Tolerant Techniques for Availability on the Cloud. In: De Paz, J., Julián, V., Villarrubia, G., Marreiros, G., Novais, P. (eds) Ambient Intelligence – Software and Applications – 8th International Symposium on Ambient Intelligence (ISAmI 2017). ISAmI 2017. Advances in Intelligent Systems and Computing, vol 615. Springer, Cham. https://doi.org/10.1007/978-3-319-61118-1_32

  • DOI: https://doi.org/10.1007/978-3-319-61118-1_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61117-4

  • Online ISBN: 978-3-319-61118-1

  • eBook Packages: Engineering, Engineering (R0)
