Abstract
Cloud computing provides scalable computing resources over the Internet. A growing number of application providers deploy their applications in the cloud to reduce infrastructure maintenance costs. Because a cloud data center contains a large number of nodes, the probability of node failure is nontrivial. To address this problem, virtual machine replication is widely adopted in cloud systems to enhance application and service reliability. K-fault tolerance is a typical replication strategy employed in the cloud. However, existing K-fault tolerance replication strategies cannot achieve the best effect because they ignore switch failures. In this paper, we design an (m, n)-fault tolerance virtual machine placement algorithm to solve this problem. First, we formulate the problem as an integer linear programming problem and prove that it is NP-hard. Second, we employ a differential evolution (DE) algorithm to solve the integer linear programming problem. Finally, we conduct experiments to study the effectiveness of our algorithm, and the simulation results demonstrate that it outperforms other algorithms in reliability enhancement.
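The idea of searching for a failure-aware placement with differential evolution can be illustrated with a small sketch. This is not the authors' formulation: the abstract does not define (m, n)-fault tolerance or the objective, so the code below assumes one possible reading (a placement is scored by how many replicas survive the worst-case failure of m hosts and n edge switches), a hypothetical two-level topology, and a textbook DE/rand/1/bin loop. All names and constants (`HOSTS_PER_SWITCH`, `surviving_replicas`, `decode`, the control parameters `f` and `cr`) are illustrative assumptions.

```python
import random
from itertools import combinations

# Hypothetical toy topology (an assumption, not from the paper):
# 4 edge switches, 2 hosts under each, 4 replicas of one VM to place.
HOSTS_PER_SWITCH = 2
NUM_SWITCHES = 4
NUM_HOSTS = HOSTS_PER_SWITCH * NUM_SWITCHES
NUM_REPLICAS = 4


def surviving_replicas(placement, m=1, n=1):
    """Replicas left after the worst-case failure of m hosts and n switches.

    placement[i] is the host index of replica i. A failed switch takes down
    every host attached to it. Brute force, so only suitable for toy sizes.
    """
    worst = len(placement)
    for dead_switches in combinations(range(NUM_SWITCHES), n):
        for dead_hosts in combinations(range(NUM_HOSTS), m):
            alive = sum(
                1
                for h in placement
                if h not in dead_hosts
                and h // HOSTS_PER_SWITCH not in dead_switches
            )
            worst = min(worst, alive)
    return worst


def decode(vector):
    """Map a real-valued DE vector to one valid host index per replica."""
    return [min(NUM_HOSTS - 1, max(0, int(x))) for x in vector]


def differential_evolution(pop_size=20, generations=60, f=0.5, cr=0.9, seed=0):
    """Textbook DE/rand/1/bin maximizing surviving_replicas."""
    rng = random.Random(seed)
    pop = [
        [rng.uniform(0, NUM_HOSTS) for _ in range(NUM_REPLICAS)]
        for _ in range(pop_size)
    ]
    scores = [surviving_replicas(decode(v)) for v in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Mutation: combine three distinct individuals other than i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            # Binomial crossover; j_rand guarantees one mutated component.
            j_rand = rng.randrange(NUM_REPLICAS)
            trial = [
                pop[a][k] + f * (pop[b][k] - pop[c][k])
                if (rng.random() < cr or k == j_rand)
                else pop[i][k]
                for k in range(NUM_REPLICAS)
            ]
            # Greedy selection: keep the trial if it is at least as good.
            trial_score = surviving_replicas(decode(trial))
            if trial_score >= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best = max(range(pop_size), key=scores.__getitem__)
    return decode(pop[best]), scores[best]
```

Calling `differential_evolution()` returns a placement (one host index per replica) and its worst-case survivor count. In this toy model, placing all replicas under one switch scores 0, while spreading them across distinct switches scores 2, which shows why a strategy that ignores switch failures can look reliable on paper and still lose every replica at once.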
Acknowledgements
The work presented in this study is supported by NSFC (61602054), Beijing Natural Science Foundation (4174100), and NSFC (61571066).
Cite this article
Zhou, A., Wang, S., Hsu, CH. et al. Virtual machine placement with (m, n)-fault tolerance in cloud data center. Cluster Comput 22 (Suppl 5), 11619–11631 (2019). https://doi.org/10.1007/s10586-017-1426-y
DOI: https://doi.org/10.1007/s10586-017-1426-y