Abstract
As the number of enterprises dispatching their workload to the cloud has increased significantly over the last decade, service level agreements (SLAs) becoming a key element to consider for maintaining the quality of service (QoS). In order to facilitate the perseverance of service quality at a satisfactory level, clouds perform load balancing through migration of virtual machines (VMs) from overloaded physical machines (PMs). However, there are several challenges in achieving effective and efficient load balancing. First, VMs in clouds use different resources to serve a variety of applications, which results in varying levels of resource overutilization in different PMs. Second, due to the application’s time-varying heterogeneous nature of resource requirements, the PM’s resource consumption vary over time, making the profiling of resources difficult. Migration decisions in previous load balancing techniques are mostly based on deterministic resource demand estimation, which treats each resource equally and leads to inefficient migrations, causing severe SLA violations in terms of performance degradation. To address this problem, we propose a SLA-aware stochastic load balancing scheme using VM migrations, namely SLA-LB. It provides probabilistic guarantee against resource overloading, while satisfying the SLA. As opposed to previous methods, SLA-LB dynamically assigns different weights to different resources based on PM’s overload probability and effectively addresses the multidimensional resource requirement with stochastic characterization. Experimental results using PlanetLab and Google Cluster trace show that SLA-LB outperforms previous load balancing methods, i.e., RIAL, Sandpiper and CloudScale in terms of performance degradation by an average margin of 10.8%, 23.53% and 33%, respectively.
Similar content being viewed by others
Data Availability
The Google Cloud and PlanetLab trace data have been analysed during this work. They are obtained from ”Borg cluster traces from Google” (https://github.com/google/cluster-datahttps://github.com/google/cluster-data) and ”A set of CPU utilization traces from PlanetLab VMs collected during 10 random days in March and April 2011” (https://github.com/beloglazov/planetlab-workload-traces/https://github.com/beloglazov/planetlab-workload-traces/), respectively.
References
Google cluster data. https://github.com/google/cluster-data/https://github.com/google/cluster-data/, Dec 2020
PlanetLab workload traces. https://github.com/beloglazov/planetlab-workload-traces/, Dec 2020
VMWare. http://www.vmware.com/, Dec 2020
Ardagna, D., Casale, G., Ciavotta, M., Pérez, J.F., Wang, W.: Quality-of-service in cloud computing: modeling techniques and their applications. J. Internet Serv. Appl. 5(1), 1–17 (2014)
Arzuaga, E., Kaeli, D.R.: Quantifying load imbalance on virtualized enterprise servers. In: Proceedings of the first joint WOSP/SIPEW international conference on Performance engineering, pp. 235–242 (2010)
Ashouraei, M., Khezr, S.N., Benlamri, R., Navimipour, N.J.: A new sla-aware load balancing method in the cloud using an improved parallel task scheduling algorithm. In: IEEE 6th international conference on future internet of things and cloud (FiCloud), pp. 71–76. IEEE (2018)
Banerjee, S., Roy, S., Khatua, S.: Efficient resource utilization using multi-step-ahead workload prediction technique in cloud. J. Supercomput., 1–28 (2021)
Barham, P., Dragovic, B., Fraser, K., Hand, S., Harris, T., Ho, A., Neugebauer, R., Pratt, I., Warfield, A.: Xen and the art of virtualization. SIGOPS Oper. Syst. Rev. 37 (5), 164–177 (2003)
Beloglazov, A., Buyya, R.: Managing overloaded hosts for dynamic consolidation of virtual machines in cloud data centers under quality of service constraints. IEEE Trans. Parallel Distrib. Syst. 24(7), 1366–1379 (2012)
Beloglazov, A., Buyya, R.: Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers. Concurr. Comput. Pract. Exper. 24(13), 1397–1420 (2012)
Bhattacherjee, S., Das, R., Khatua, S., Roy, S.: Energy-efficient migration techniques for cloud environment: a step toward green computing. J. Supercomput. 76(7), 5192–5220 (2020)
Bobroff, N., Kochut, A., Beaty, K.: Dynamic placement of virtual machines for managing sla violations. In: 2007 10th IFIP/IEEE International Symposium on Integrated Network Management, pp. 119–128. IEEE (2007)
Chandra, A., Gong, W., Shenoy. P.: Dynamic resource allocation for shared data centers using online measurements. In: International Workshop on Quality of Service, pp. 381–398. Springer (2003)
Chen, M., Zhang, H., Su, Y.-Y., Wang, X., Jiang, G., Yoshihira, K.: Effective vm sizing in virtualized data centers. In: 12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops, pp. 594–601. IEEE (2011)
Clark, C., Fraser, K., Hand, S., Hansen, J.G., Jul, E., Limpach, C., Pratt, I., Warfield, A.: Live migration of virtual machines. In: Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation-Volume. vol. 2, pp. 273–286 (2005)
Gao, Q., Tang, P., Deng, T., Wo, T.: Virtualrank: A prediction based load balancing technique in virtual computing environment. In: 2011 IEEE World Congress on Services, pp. 247–256. IEEE (2011)
Goel, A., Indyk, P.: Stochastic load balancing and related problems. In: 40th Annual Symposium on Foundations of Computer Science (Cat. No. 99CB37039), pp. 579–586. IEEE (1999)
Gong, Z., Gu, X., Wilkes, J.: Press: Predictive elastic resource scaling for cloud systems. In: 2010 International Conference on Network and Service Management, pp. 9–16. IEEE (2010)
Jin, H., Pan, D., Xu, J., Pissinou, N.: Efficient vm placement with multiple deterministic and stochastic resources in data centers. In: 2012 IEEE Global Communications Conference (GLOBECOM), pp. 2505–2510. IEEE (2012)
Khanna, G., Beaty, K.s, Kar, G., Kochut, A.: Application performance management in virtualized server environments. In: IEEE/IFIP Network Operations and Management Symposium NOMS 2006, pp. 373–381. IEEE (2006)
Kim, I.K., Wang, W., Qi, Y., Humphrey, M.: Forecasting cloud application workloads with cloudinsight for predictive resource management. IEEE Transactions on Cloud Computing (2020)
Lim, S.-H., Huh, J.-S., Kim, Y., Das, C.R.: Migration, assignment, and scheduling of jobs in virtualized environment. Migration 40, 45 (2011)
Liu, C., Li, K., Li, K.: A game approach to multi-servers load balancing with load-dependent server availability consideration. IEEE Transactions on Cloud Computing (2018)
Mishra, M., Das, A., Kulkarni, P., Sahoo, A.: Dynamic resource management using virtual machine migrations. IEEE Commun. Mag. 50(9), 34–40 (2012)
Noghin, V.D.: Linear scalarization in multi-criterion optimization. Sci. Tech. Inf. Process. 42(6), 463–469 (2015)
Park, K., Pai, V.S.: Comon: a mostly-scalable monitoring system for planetlab. ACM SIGOPS Oper. Syst. Rev. 40(1), 65–74 (2006)
Ray, B., Saha, A., Khatua, S., Roy, S.: Quality and profit assured trusted cloud federation formation: Game theory based approach. IEEE Transactions on Services Computing (2018)
Ray, B., Saha, A., Khatua, S, Roy, S.: Proactive fault-tolerance technique to enhance reliability of cloud service in cloud federation environment. IEEE Transactions on Cloud Computing (2020)
Ray, B.K., Saha, A., Khatua, S., Roy, S.: Toward maximization of profit and quality of cloud federation: solution to cloud federation formation problem. J. Supercomput. 75(2), 885–929 (2019)
Ray, B.K., Saha, A., Roy, S.: Migration cost and profit oriented cloud federation formation: hedonic coalition game based approach. Clust. Comput. 21(4), 1981–1999 (2018)
Shen, H., Chen, L.: A resource usage intensity aware load balancing method for virtual machine migration in cloud datacenters. IEEE Trans. Cloud Comput. 8(1), 17–31 (2017)
Shen, Z., Subbiah, S., Gu, X., Wilkes, J.: Cloudscale: elastic resource scaling for multi-tenant cloud systems. In: Proceedings of the 2nd ACM Symposium on Cloud Computing, pp. 1–14 (2011)
Shrivastava, V., Zerfos, P., Lee, K.-W., Jamjoom, H., Liu, Y.-H., Banerjee, S.: Application-aware virtual machine migration in data centers. In: 2011 Proceedings IEEE INFOCOM, pp. 66–70. IEEE (2011)
Tarafdar, A., Debnath, M., Khatua, S., Das, R.K.: Energy and quality of service-aware virtual machine consolidation in a cloud data center. J. Supercomput., 1–32 (2020)
Tarighi, M., Motamedi, SA, Sharifian, S: A new model for virtual machine migration in virtualized cluster server based on fuzzy decision making. arXiv:1002.3329 (2010)
Wang, A., Venkataraman, S., Alspaugh, S., Katz, R., Stoica, I.: Cake: enabling high-level slos on shared storage systems. In: Proceedings of the Third ACM Symposium on Cloud Computing, pp. 1–14 (2012)
Wood, T., Shenoy, P., Venkataramani, A., Yousif, M.: Sandpiper: Black-box and gray-box resource management for virtual machines. Comput. Netw. 53(17), 2923–2938 (2009)
Xiao, Z., Song, W., Qi, C.: Dynamic resource allocation using virtual machines for cloud computing environment. IEEE Trans. Parallel Distributed Syst. 24(6), 1107–1117 (2012)
Xu, F., Liu, F., Jin, H.: Heterogeneity and interference-aware virtual machine provisioning for predictable performance in the cloud. IEEE Trans. Comput. 65(8), 2470–2483 (2015)
Xu, F., Liu, F., Jin, H.s, Vasilakos, A.V.: Managing performance overhead of virtual machines in cloud computing A survey, state of the art, and future directions. Proc. IEEE 102(1), 11–31 (2013)
Acknowledgements
This research work of Sounak Banerjee is supported by UGC-NET Junior Research Fellowship (UGC-Ref. No.: 3709/(NET-DEC 2018)) provided by the University Grants Commission, Government of India.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Banerjee, S., Roy, S. & Khatua, S. SLA-aware Stochastic Load Balancing in Dynamic Cloud Environment. J Grid Computing 19, 49 (2021). https://doi.org/10.1007/s10723-021-09592-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10723-021-09592-w