Skip to main content

Advertisement

Log in

A power and thermal-aware virtual machine management framework based on machine learning

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Energy consumption in data centers grows rapidly in recent years. As a widely-applied energy-efficient method, workload consolidation also has its own limitations that may bring some negative effects, such as performance degradation, QoS violation, localized hotspots and so on, which is especially true when optimal objectives are inherently conflict. In this paper, we present a power and thermal-aware VM management framework called PTM-ML, which relies on machine learning technique to find optimal host configuration based on workload characteristics and cooling system’s working state. Based on such an optimal host configuration, it then makes VM migration and consolidation decisions by enforcing an efficient load-balancing policy, with aiming at achieving a better trade-off between energy efficiency and performance. The prototype of PTM-ML framework is deployed and evaluated in a real-world cloud data center. Extensive experiments are conducted by using different workload traces with distinctive characteristics, and the results are compared with four similar approaches in terms of total energy consumption, real-time power consumption, average latency and etc. Experimental results show that the proposed PTM-ML outperforms the existing approaches in terms of multiple metrics, and it also exhibits better robustness and adaptability in presence of dynamic workloads.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Singh, S., Chana, I.: Resource provisioning and scheduling in clouds: QoS perspective. J. Supercomput. 72(3), 926–960 (2016)

    Article  Google Scholar 

  2. Lu, Y., Sun, N.: An effective task scheduling algorithm based on dynamic energy management and efficient resource utilization in green cloud computing environment. Cluster Comput. 22(1), 513–520 (2019)

    Article  MathSciNet  Google Scholar 

  3. Abdessamia, F., Zhang, W.-Z., Tian, Y.-C.: Energy-efficiency virtual machine placement based on binary gravitational search algorithm. Cluster Comput. 23(3), 1577–1588 (2020)

    Article  Google Scholar 

  4. Sharma, Y., Si, W., Sun, D., et al.: Failure-aware energy-efficient VM consolidation in cloud computing systems. Future Gener. Comput. Syst. 94, 620–633 (2019)

    Article  Google Scholar 

  5. He, K., Li, Z., Deng, D., et al.: Energy-efficient framework for virtual machine consolidation in cloud data centers. China Commun. 14(10), 192–201 (2017)

    Article  Google Scholar 

  6. Arif, M., Mahmood, T.: Cloud computing and its environmental effects. Int. J. Grid Distrib. Comput. 8(1), 279–286 (2015)

    Article  Google Scholar 

  7. Shehabi, A., Smith, S., Sartor, D., et al.: United states data center energy usage report. Technical Report, Lawrence Berkeley National Laboratory. https://eta.lbl.gov/publications/united-states-data-center-energy (2016)

  8. Brown, R.: Report to congress on server and data center energy efficiency: public law, 109–431. Technical Report, Lawrence Berkeley National Laboratory. https://escholarship.org/uc/item/74g2r0vg (2008)

  9. Le, T.N., Liang, J., Liu, Z., et al.: Optimal energy procurement for geo-distributed data centers in multi-timescale electricity markets. SIGMETRICS 45(3), 185–197 (2018)

    Article  Google Scholar 

  10. Akhter, N., Othman, M.: Energy aware resource allocation of cloud data center: review and open issues. Cluster Comput. 19(3), 1163–1182 (2016)

    Article  Google Scholar 

  11. Aksanli, B.: Data center peak power management with energy storage devices. IEEE Internet Comput. 21(4), 26–33 (2017)

    Article  Google Scholar 

  12. Grange, L., Costa, G.D., Stolf, P.: Green IT scheduling for data center powered with renewable energy. Future Gener. Comput. Syst. 86, 99–120 (2018)

    Article  Google Scholar 

  13. Gaggero, M., Caviglione, L.: Predictive control for energy-aware consolidation in cloud datacenters. IEEE Trans. Control Syst. Technol. 24(2), 461–474 (2016)

    Google Scholar 

  14. Khoshkholghi, M.A., Derahman, M.N., Abdullah, A., et al.: Energy-efficient algorithms for dynamic virtual machine consolidation in cloud data centers. IEEE Access 5, 10709–10722 (2017)

    Article  Google Scholar 

  15. Teng, F., Yu, L., Li, T., et al.: Energy efficiency of VM consolidation in IaaS clouds. J. Supercomput. 73(2), 782–809 (2017)

    Article  Google Scholar 

  16. Yao, J., Guan, H., Luo, J., et al.: Adaptive power management through thermal aware workload balancing in internet data centers. IEEE Trans. Parallel Distrib. Syst. 26(9), 2400–2409 (2015)

    Article  Google Scholar 

  17. Skach, M., Arora, M., Hsu, C.H., et al.: Thermal time shifting: decreasing data center cooling costs with phase-change materials. IEEE Internet Comput. 21(4), 34–43 (2017)

    Article  Google Scholar 

  18. Ilager, S., Ramamohanarao, K., Buyya, R.: ETAS: energy and thermal-aware dynamic virtual machine consolidation in cloud data center with proactive hotspot mitigation. Concurr. Comput. Pract. Exp. 31(17), 1–15 (2019)

    Google Scholar 

  19. Li, X., Garraghan, P., Jiang, X., et al.: Holistic virtual machine scheduling in cloud datacenters towards minimizing total energy. IEEE Trans. Parallel Distrib. Syst. 29(6), 1317–1331 (2018)

    Article  Google Scholar 

  20. Goiri, I., Nguyen, T.D., Bianchini, R.: Temperature- and variation-aware management for free-cooled datacenters. Proceeding International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2015), pp. 1–12. ACM Press, Istanbul (2015)

    Google Scholar 

  21. Oxley, M.A., Jonardi, E., Pasricha, S., et al.: Rate-based thermal, power, and co-location aware resource management for heterogeneous data centers. J. Parallel Distrib Comput. 112, 126–139 (2018)

    Article  Google Scholar 

  22. Al-Qawasmeh, A.M., Pasricha, S., Maciejewski, A.A., et al.: Power and thermal-aware workload allocation in heterogeneous data centers. IEEE Trans. Comput. 64(2), 447–461 (2015)

    Article  MathSciNet  Google Scholar 

  23. Damme, T.V., Persis, C.D., Tesi, P.: Optimized thermal-aware job scheduling and control of data centers. IEEE Trans. Control Syst. Technol. 27(2), 760–771 (2019)

    Article  Google Scholar 

  24. Arianyan, E., Taheri, H., Sharifian, S.: Novel energy and SLA efficient resource management heuristics for consolidation of virtual machines in cloud data centers. Comput. Electr. Eng. 47, 222–240 (2015)

    Article  Google Scholar 

  25. Justafort, V.D., Beaubrun, R., Pierre, S.: On the carbon footprint optimization in an InterCloud environment. IEEE Trans. Cloud Comput. 6(3), 829–842 (2018)

    Article  Google Scholar 

  26. Farahnakian, F., Pahikkala, T., Liljeberg, P., et al.: Energy-aware VM consolidation in cloud data centers using utilization prediction model. IEEE Trans. Cloud Comput. 7(2), 524–536 (2019)

    Article  Google Scholar 

  27. Hussain, W., Hussain, F.K., Saberi, M., et al.: Comparing time series with machine learning-based prediction approaches for violation management in cloud SLAs. Future Gener. Comput. Syst. 89, 464–477 (2018)

    Article  Google Scholar 

  28. Zhang, W., Wulan, G., Zhai, J., et al.: An intelligent power distribution service architecture using cloud computing and deep learning techniques. J. Network Comput. Appl. 103, 239–248 (2018)

    Article  Google Scholar 

  29. Abusitta, A., Bellaiche, M., Dagenais, M., et al.: A deep learning approach for proactive multi-cloud cooperative intrusion detection system. Future Gener. Comput. Syst. 98, 308–318 (2019)

    Article  Google Scholar 

  30. Garg, S., Kaur, K., Kumar, N., et al.: A hybrid deep learning-based model for anomaly detection in cloud datacenter networks. IEEE Trans. Netw. Serv. Manag. 16(3), 924–935 (2019)

    Article  Google Scholar 

  31. Oh, J., Kim, Y.: Job placement using reinforcement learning in GPU virtualization environment. Cluster Comput. 23(3), 2219–2234 (2020)

    Article  Google Scholar 

  32. Gong, Y., Li, B., Liang, B., et al.: Chic: experience-driven scheduling in machine learning clusters. Proceeding of International Symposium on Quality of Service (IWQoS 2019), pp. 1–10. ACM Press, Phoenix (2019)

    Google Scholar 

  33. Madireddy, S., Balaprakash, P., Carns, P., et al.: Adaptive learning for concept drift in application performance modeling. Proceeding of International Conference on Parallel Processing (ICPP 2019), pp. 1–11. ACM Press, Kyoto (2019)

    Google Scholar 

  34. Gschwind, M., Kaldewey, T., Tam, D.K.: Optimizing the efficiency of deep learning through accelerator virtualization. IBM J. Res. Dev. 61(4/5), 1–11 (2017)

    Article  Google Scholar 

  35. Kim, Y., Choi, H., Lee, J., et al.: Towards an optimized distributed deep learning framework for a heterogeneous multi-GPU cluster. Cluster Comput. 23(3), 2287–2300 (2020)

    Article  Google Scholar 

  36. Liu, D.: Design and implementation of performance monitor and evaluation middleware in virtual clouds. Int. J. Networking Virtual Organ. 15(1), 48–64 (2015)

    Article  Google Scholar 

  37. Diao, L., Liu, Y., Nan, D., et al.: Units and layer’s effects on deep boltzman machines. Proceedings of the International Conference on Computer Science and Application Engineering (CASE 2019), pp. 1–5. ACM Press, Sanya (2019)

    Google Scholar 

  38. Beloglazov, A., Abawajy, J., Buyya, R.: Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Gener. Comput. Syst. 28(5), 755–768 (2012)

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61402163), and the Hunan Provincial Natural Science Foundation of China (No. 2020JJ6022).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peng Xiao.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiao, P., Ni, Z., Liu, D. et al. A power and thermal-aware virtual machine management framework based on machine learning. Cluster Comput 24, 2231–2248 (2021). https://doi.org/10.1007/s10586-020-03228-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-020-03228-6

Keywords

Navigation