Skip to main content

SLA-aware Stochastic Load Balancing in Dynamic Cloud Environment

  • Published:
Journal of Grid Computing Aims and scope Submit manuscript

Abstract

As the number of enterprises dispatching their workload to the cloud has increased significantly over the last decade, service level agreements (SLAs) becoming a key element to consider for maintaining the quality of service (QoS). In order to facilitate the perseverance of service quality at a satisfactory level, clouds perform load balancing through migration of virtual machines (VMs) from overloaded physical machines (PMs). However, there are several challenges in achieving effective and efficient load balancing. First, VMs in clouds use different resources to serve a variety of applications, which results in varying levels of resource overutilization in different PMs. Second, due to the application’s time-varying heterogeneous nature of resource requirements, the PM’s resource consumption vary over time, making the profiling of resources difficult. Migration decisions in previous load balancing techniques are mostly based on deterministic resource demand estimation, which treats each resource equally and leads to inefficient migrations, causing severe SLA violations in terms of performance degradation. To address this problem, we propose a SLA-aware stochastic load balancing scheme using VM migrations, namely SLA-LB. It provides probabilistic guarantee against resource overloading, while satisfying the SLA. As opposed to previous methods, SLA-LB dynamically assigns different weights to different resources based on PM’s overload probability and effectively addresses the multidimensional resource requirement with stochastic characterization. Experimental results using PlanetLab and Google Cluster trace show that SLA-LB outperforms previous load balancing methods, i.e., RIAL, Sandpiper and CloudScale in terms of performance degradation by an average margin of 10.8%, 23.53% and 33%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Data Availability

The Google Cloud and PlanetLab trace data have been analysed during this work. They are obtained from ”Borg cluster traces from Google” (https://github.com/google/cluster-datahttps://github.com/google/cluster-data) and ”A set of CPU utilization traces from PlanetLab VMs collected during 10 random days in March and April 2011” (https://github.com/beloglazov/planetlab-workload-traces/https://github.com/beloglazov/planetlab-workload-traces/), respectively.

References

  1. Google cluster data. https://github.com/google/cluster-data/https://github.com/google/cluster-data/, Dec 2020

  2. PlanetLab workload traces. https://github.com/beloglazov/planetlab-workload-traces/, Dec 2020

  3. VMWare. http://www.vmware.com/, Dec 2020

  4. Ardagna, D., Casale, G., Ciavotta, M., Pérez, J.F., Wang, W.: Quality-of-service in cloud computing: modeling techniques and their applications. J. Internet Serv. Appl. 5(1), 1–17 (2014)

    Article  Google Scholar 

  5. Arzuaga, E., Kaeli, D.R.: Quantifying load imbalance on virtualized enterprise servers. In: Proceedings of the first joint WOSP/SIPEW international conference on Performance engineering, pp. 235–242 (2010)

  6. Ashouraei, M., Khezr, S.N., Benlamri, R., Navimipour, N.J.: A new sla-aware load balancing method in the cloud using an improved parallel task scheduling algorithm. In: IEEE 6th international conference on future internet of things and cloud (FiCloud), pp. 71–76. IEEE (2018)

  7. Banerjee, S., Roy, S., Khatua, S.: Efficient resource utilization using multi-step-ahead workload prediction technique in cloud. J. Supercomput., 1–28 (2021)

  8. Barham, P., Dragovic, B., Fraser, K., Hand, S., Harris, T., Ho, A., Neugebauer, R., Pratt, I., Warfield, A.: Xen and the art of virtualization. SIGOPS Oper. Syst. Rev. 37 (5), 164–177 (2003)

    Article  Google Scholar 

  9. Beloglazov, A., Buyya, R.: Managing overloaded hosts for dynamic consolidation of virtual machines in cloud data centers under quality of service constraints. IEEE Trans. Parallel Distrib. Syst. 24(7), 1366–1379 (2012)

    Article  Google Scholar 

  10. Beloglazov, A., Buyya, R.: Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers. Concurr. Comput. Pract. Exper. 24(13), 1397–1420 (2012)

    Article  Google Scholar 

  11. Bhattacherjee, S., Das, R., Khatua, S., Roy, S.: Energy-efficient migration techniques for cloud environment: a step toward green computing. J. Supercomput. 76(7), 5192–5220 (2020)

    Article  Google Scholar 

  12. Bobroff, N., Kochut, A., Beaty, K.: Dynamic placement of virtual machines for managing sla violations. In: 2007 10th IFIP/IEEE International Symposium on Integrated Network Management, pp. 119–128. IEEE (2007)

  13. Chandra, A., Gong, W., Shenoy. P.: Dynamic resource allocation for shared data centers using online measurements. In: International Workshop on Quality of Service, pp. 381–398. Springer (2003)

  14. Chen, M., Zhang, H., Su, Y.-Y., Wang, X., Jiang, G., Yoshihira, K.: Effective vm sizing in virtualized data centers. In: 12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops, pp. 594–601. IEEE (2011)

  15. Clark, C., Fraser, K., Hand, S., Hansen, J.G., Jul, E., Limpach, C., Pratt, I., Warfield, A.: Live migration of virtual machines. In: Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation-Volume. vol. 2, pp. 273–286 (2005)

  16. Gao, Q., Tang, P., Deng, T., Wo, T.: Virtualrank: A prediction based load balancing technique in virtual computing environment. In: 2011 IEEE World Congress on Services, pp. 247–256. IEEE (2011)

  17. Goel, A., Indyk, P.: Stochastic load balancing and related problems. In: 40th Annual Symposium on Foundations of Computer Science (Cat. No. 99CB37039), pp. 579–586. IEEE (1999)

  18. Gong, Z., Gu, X., Wilkes, J.: Press: Predictive elastic resource scaling for cloud systems. In: 2010 International Conference on Network and Service Management, pp. 9–16. IEEE (2010)

  19. Jin, H., Pan, D., Xu, J., Pissinou, N.: Efficient vm placement with multiple deterministic and stochastic resources in data centers. In: 2012 IEEE Global Communications Conference (GLOBECOM), pp. 2505–2510. IEEE (2012)

  20. Khanna, G., Beaty, K.s, Kar, G., Kochut, A.: Application performance management in virtualized server environments. In: IEEE/IFIP Network Operations and Management Symposium NOMS 2006, pp. 373–381. IEEE (2006)

  21. Kim, I.K., Wang, W., Qi, Y., Humphrey, M.: Forecasting cloud application workloads with cloudinsight for predictive resource management. IEEE Transactions on Cloud Computing (2020)

  22. Lim, S.-H., Huh, J.-S., Kim, Y., Das, C.R.: Migration, assignment, and scheduling of jobs in virtualized environment. Migration 40, 45 (2011)

    Google Scholar 

  23. Liu, C., Li, K., Li, K.: A game approach to multi-servers load balancing with load-dependent server availability consideration. IEEE Transactions on Cloud Computing (2018)

  24. Mishra, M., Das, A., Kulkarni, P., Sahoo, A.: Dynamic resource management using virtual machine migrations. IEEE Commun. Mag. 50(9), 34–40 (2012)

    Article  Google Scholar 

  25. Noghin, V.D.: Linear scalarization in multi-criterion optimization. Sci. Tech. Inf. Process. 42(6), 463–469 (2015)

    Article  Google Scholar 

  26. Park, K., Pai, V.S.: Comon: a mostly-scalable monitoring system for planetlab. ACM SIGOPS Oper. Syst. Rev. 40(1), 65–74 (2006)

    Article  Google Scholar 

  27. Ray, B., Saha, A., Khatua, S., Roy, S.: Quality and profit assured trusted cloud federation formation: Game theory based approach. IEEE Transactions on Services Computing (2018)

  28. Ray, B., Saha, A., Khatua, S, Roy, S.: Proactive fault-tolerance technique to enhance reliability of cloud service in cloud federation environment. IEEE Transactions on Cloud Computing (2020)

  29. Ray, B.K., Saha, A., Khatua, S., Roy, S.: Toward maximization of profit and quality of cloud federation: solution to cloud federation formation problem. J. Supercomput. 75(2), 885–929 (2019)

    Article  Google Scholar 

  30. Ray, B.K., Saha, A., Roy, S.: Migration cost and profit oriented cloud federation formation: hedonic coalition game based approach. Clust. Comput. 21(4), 1981–1999 (2018)

    Article  Google Scholar 

  31. Shen, H., Chen, L.: A resource usage intensity aware load balancing method for virtual machine migration in cloud datacenters. IEEE Trans. Cloud Comput. 8(1), 17–31 (2017)

    Article  Google Scholar 

  32. Shen, Z., Subbiah, S., Gu, X., Wilkes, J.: Cloudscale: elastic resource scaling for multi-tenant cloud systems. In: Proceedings of the 2nd ACM Symposium on Cloud Computing, pp. 1–14 (2011)

  33. Shrivastava, V., Zerfos, P., Lee, K.-W., Jamjoom, H., Liu, Y.-H., Banerjee, S.: Application-aware virtual machine migration in data centers. In: 2011 Proceedings IEEE INFOCOM, pp. 66–70. IEEE (2011)

  34. Tarafdar, A., Debnath, M., Khatua, S., Das, R.K.: Energy and quality of service-aware virtual machine consolidation in a cloud data center. J. Supercomput., 1–32 (2020)

  35. Tarighi, M., Motamedi, SA, Sharifian, S: A new model for virtual machine migration in virtualized cluster server based on fuzzy decision making. arXiv:1002.3329 (2010)

  36. Wang, A., Venkataraman, S., Alspaugh, S., Katz, R., Stoica, I.: Cake: enabling high-level slos on shared storage systems. In: Proceedings of the Third ACM Symposium on Cloud Computing, pp. 1–14 (2012)

  37. Wood, T., Shenoy, P., Venkataramani, A., Yousif, M.: Sandpiper: Black-box and gray-box resource management for virtual machines. Comput. Netw. 53(17), 2923–2938 (2009)

    Article  Google Scholar 

  38. Xiao, Z., Song, W., Qi, C.: Dynamic resource allocation using virtual machines for cloud computing environment. IEEE Trans. Parallel Distributed Syst. 24(6), 1107–1117 (2012)

    Article  Google Scholar 

  39. Xu, F., Liu, F., Jin, H.: Heterogeneity and interference-aware virtual machine provisioning for predictable performance in the cloud. IEEE Trans. Comput. 65(8), 2470–2483 (2015)

    Article  MathSciNet  Google Scholar 

  40. Xu, F., Liu, F., Jin, H.s, Vasilakos, A.V.: Managing performance overhead of virtual machines in cloud computing A survey, state of the art, and future directions. Proc. IEEE 102(1), 11–31 (2013)

    Article  Google Scholar 

Download references

Acknowledgements

This research work of Sounak Banerjee is supported by UGC-NET Junior Research Fellowship (UGC-Ref. No.: 3709/(NET-DEC 2018)) provided by the University Grants Commission, Government of India.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sarbani Roy.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Banerjee, S., Roy, S. & Khatua, S. SLA-aware Stochastic Load Balancing in Dynamic Cloud Environment. J Grid Computing 19, 49 (2021). https://doi.org/10.1007/s10723-021-09592-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10723-021-09592-w

Keywords