Abstract
The success of ever growing warehouse-sized Cloud data centers built to respond to the increasing demand for computing resources depends on the ability to provide reliability and availability at scale. In order to provide dependable and secure systems and services, one needs to implement security controls capable of avoiding, coping and recovering from failures. However, dependability and security of services at all cost is not a solution for Cloud providers. In this paper, two state-of-the-art fault-tolerance techniques are compared in terms of availability of services to consumers, and energy costs to Cloud providers. The results have shown that proactive fault-tolerance technique outperforms traditional redundancy in terms of cost to Cloud users, while providing available compute environments and services to consumers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Matt, B., et al.: Introduction to Computer Security. Pearson Education, India (2006)
Armstrong, D., Djemame, K.: Towards quality of service in the cloud. In: Proceedings of the 25th UK Performance Engineering Workshop (2009)
Schroeder, B., Gibson, G.A.: Understanding failures in petascale computers. J. Phys.: Conf. Ser. 78, 012022 (2007)
Egwutuoha, I.P., Levy, D., Selic, B., Chen, S.: A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems. J. Supercomput. 65(3), 1302–1326 (2013)
Ferreira, K., Stearley, J., Laros, J.H., Oldfield, R., Pedretti, K., Brightwell, R., Riesen, R., Bridges, P.G., Arnold, D.: Evaluating the viability of process replication reliability for exascale systems. In: 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp. 1–12. IEEE (2011)
Casanova, H., Vivien, F., Zaidouni, D.: Using replication for resilience on exascale systems. In: Fault-Tolerance Techniques for High-Performance Computing, pp. 229–278. Springer (2015)
Fu, S.: Failure-aware resource management for high-availability computing clusters with distributed virtual machines. J. Parallel Distrib. Comput. 70(4), 384–393 (2010)
Sampaio, A.M., Barbosa, J.G.: Towards high-available and energy-efficient virtual computing environments in the cloud. Future Gener. Comput. Syst. 40, 30–43 (2014)
Liu, J., Wang, S., Zhou, A., Kumar, S., Yang, F., Buyya, R.: Using proactive fault-tolerance approach to enhance cloud service reliability. IEEE Trans. Cloud Comput. (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Sampaio, A.M., Barbosa, J.G. (2017). A Comparative Cost Study of Fault-Tolerant Techniques for Availability on the Cloud. In: De Paz, J., Julián, V., Villarrubia, G., Marreiros, G., Novais, P. (eds) Ambient Intelligence– Software and Applications – 8th International Symposium on Ambient Intelligence (ISAmI 2017). ISAmI 2017. Advances in Intelligent Systems and Computing, vol 615. Springer, Cham. https://doi.org/10.1007/978-3-319-61118-1_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-61118-1_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61117-4
Online ISBN: 978-3-319-61118-1
eBook Packages: EngineeringEngineering (R0)