Skip to main content
Log in

Virtual machine placement with (m, n)-fault tolerance in cloud data center

Cluster Computing Aims and scope Submit manuscript

Abstract

Scalable computing resources are provided via the Internet in the cloud computing environment. A growing number of application providers begin to deploy their applications in cloud to save the infrastructure maintaince cost. The probability of node failures cannot be nontrivial due to a great quantity of nodes in the cloud data center. To address the problem, the virtual machine replication technique is extensively adopted in the cloud system to enhance the application/service reliability. K-fault tolerance is a typical replication strategy employed in cloud. However, currently proposed K-fault tolerance replication strategies cannot achieve the best effect due to the ignorance of switch failure. In this paper, we study to design a (m, n)-fault tolerance virtual machine placement algorithm to solve the problem. Firstly, we formulate the problem as an integer linear programming problem, and prove that the problem is NP-hard. Secondly, we extensively employ differential evolution (DE) algorithm to solve the integer linear programming problem. Finally, experiments are conducted to study the effectiveness of our algorithm, and the simulation results demonstrate that our algorithm outperforms other algorithms in reliability enhancement.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I.: A view of cloud computing. Commun. ACM 53(4), 50–58 (2010)

    Article  Google Scholar 

  2. Buyya, R., Yeo, C.S., Venugopal, S., Broberg, J., Brandic, I.: Cloud computing and emerging IT platforms: vision, hype, and reality for delivering computing as the 5th utility. Future Gener. Comput. Syst. 25(6), 599–616 (2009)

    Article  Google Scholar 

  3. He, J., Dong, M., Ota, K., Fan, M., Wang, G.: NetSecCC: a scalable and fault-tolerant architecture for cloud computing security. Peer-to-Peer Netw. Appl. 9(1), 67–81 (2016)

    Article  Google Scholar 

  4. Luo, G., Qian, Z., Dong, M., Ota, K., and Lu, S.: Network-Aware Re-Scheduling: Towards Improving Network Performance of Virtual Machines in a Data Center. In Algorithms and Architectures for Parallel Processing: 14th International Conference, ICA3PP 2014, Dalian, China, August 24-27, 2014. Proceedings, Part I, Sun, X.-h., Qu, W., Stojmenovic, I. et al. (eds) pp. 255–269, Cham: Springer International Publishing (2014)

  5. Bauer, E., Adams, R.: Reliability and Availability of Cloud Computing. Wiley, Chichester (2012)

    Book  MATH  Google Scholar 

  6. Dai, Y.-S., Yang, B., Dongarra, J., and Zhang, G.: ‘Cloud service reliability: modeling and analysis. In 15th IEEE Pacific Rim International Symposium on Dependable Computing(PRDC), pp. 1–17

  7. Jhawar, R., Piuri, V., Santambrogio, M.: Fault tolerance management in cloud computing: a system-level perspective. Syst. J. IEEE 7(2), 288–297 (2013)

    Article  Google Scholar 

  8. Al-Fares, M., Loukissas, A., Vahdat, A.: A scalable, commodity data center network architecture. ACM SIGCOMM Comp. Commun. Rev. 38(4), 63–74 (2008)

    Article  Google Scholar 

  9. Dong, M., Lit, H., Ota, K., and Zhu, H.: HVSTO: efficient privacy preserving hybrid storage in cloud data center. In 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), pp. 529–534

  10. Lin, J., Chen, C., Chang, J.: QoS-aware data replication for data intensive applications in cloud computing systems. IEEE Trans. Cloud Comput. 1(1), 101–115 (2013)

    Article  Google Scholar 

  11. Machida, F., Kawato, M., Maeno, Y.: Redundant virtual machine placement for fault-tolerant consolidated server clusters. In Network Operations and Management Symposium (NOMS), IEEE, pp. 32–39. (2010)

  12. Shen, M., Ke, X., Li, F., Li, F., Zhu, L., and Guan, L.: Availability-Aware Virtual Network Embedding for Multi-tier Applications in Cloud Networks. In Proceedings of the 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conf on Embedded Software and Systems, pp. 1–6. (2015)

  13. Li, X., and Qian, C.: Traffic and failure aware VM placement for multi-tenant cloud computing. In 2015 IEEE 23rd International Symposium on Quality of Service (IWQoS), pp. 41–50. (2015)

  14. Nicolae, B., and Cappello, F.: BlobCR: efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, Seattle, Washington, pp. 1–12. (2011)

  15. Knauth, T., and Fetzer, C.: VeCycle: Recycling VM Checkpoints for Faster Migrations. In Proceedings of the 16th Annual Middleware Conference, Vancouver, pp. 210–221. (2015)

  16. Goiri, Í., Julia, F., Guitart, J., Torres, J.: Checkpoint-based fault-tolerant infrastructure for virtualized service providers. In Network Operations and Management Symposium (NOMS), 2010 IEEE, pp. 455–462. (2010)

  17. Limrungsi, N., Zhao, J., Xiang, Y., Lan, T., Huang, H. H., and Subramaniam, S.: Providing reliability as an elastic service in cloud computing. In Communications (ICC), 2012 IEEE International Conference on, pp. 2912–2917

  18. Zhou, A., Wang, S., Zheng, Z., Hsu, C., and Lyu, M.: On cloud service reliability enhancement with optimal resource usage. IEEE Transactions on Cloud Computing, vol. 99, pp. 1–15. (2014)

  19. Zheng, Z., Zhou, T.C., Lyu, M.R., King, I.: Component ranking for fault-tolerant cloud applications. Serv. Comput. IEEE Transac. 5(4), 540–550 (2012)

    Article  Google Scholar 

  20. Jayasinghe, D., Pu, C., Eilam, T., Steinder, M., Whally, I., and Snible, E.: Improving performance and availability of services hosted on IaaS clouds with structural constraint-aware virtual machine placement. In 2011 IEEE International Conference on Services Computing, pp. 72–79

  21. Xu, J., Tang, J., Kwiat, K., Zhang, W., and Xue, G.: Survivable virtual infrastructure mapping in virtualized data centers. In Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on, pp. 196–203

  22. Tanenbaum, A.S., Van Steen, M.: Distributed Systems: Principles and Paradigms. Prentice-Hall, Upper Saddle River (2007)

    MATH  Google Scholar 

  23. Beloglazov, A., Buyya, R.: Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in Cloud data centers. Concurr. Comput. 24(13), 1397–1420 (2012)

    Article  Google Scholar 

  24. Beloglazov, A., Abawajy, J., Buyya, R.: Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing. Future Gener. Comput. Syst. 28(5), 755–768 (2012)

    Article  Google Scholar 

  25. Chekuri, C., Khanna, S.: On multidimensional packing problems. SIAM J. Comput. 33(4), 837–851 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  26. Bansal, N., Caprara, A., and Sviridenko, M.: Improved approximation algorithms for multidimensional bin packing problems. In Proceedings of the 47th Annual IEEE Symposium on Foundations of Computer Science, pp. 697–708. (2006)

  27. Karp, R. M., Luby, M., and Marchetti-Spaccamela, A.: A probabilistic analysis of multidimensional bin packing problems. In Proceedings of the sixteenth annual ACM symposium on Theory of computing, pp. 289–298. (1984)

  28. Zhou, A., Wang, S., Sun, Q., Zou, H., and Yang, F.: FTCloudSim: a simulation tool for cloud service reliability enhancement mechanisms. In Proceedings Demo & Poster Track of ACM/IFIP/USENIX International Middleware Conference, pp. 2

Download references

Acknowledgements

The work presented in this study is supported by NSFC (61602054), Beijing Natural Science Foundation (4174100), and NSFC (61571066).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ching-Hsien Hsu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, A., Wang, S., Hsu, CH. et al. Virtual machine placement with (m, n)-fault tolerance in cloud data center. Cluster Comput 22 (Suppl 5), 11619–11631 (2019). https://doi.org/10.1007/s10586-017-1426-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-017-1426-y

Keywords

Navigation