Abstract
Load balancing is one of the most significant concerns for data center (DC) management, and the basic method is reassigning applications from overloaded servers to underloaded servers. However, to ensure the service availability, during the reassignment of an application, some resources (i.e., transient resources) are consumed simultaneously on its initial server and its target server, which imposes a challenge for load balancing. The latest research has proposed a concept called resource equivalence class (REC: a set of resource configurations such that a latency-critical (LC) application running with any one of them can meet the QoS target). In this paper, we use the REC to improve the load balancing for a DC where multiple LC applications have already been co-located on servers with the service availability and QoS requirements. We formulate the proposed load rebalancing problem as a multi-objective constrained programming model. To solve the proposed problem, we propose to use a machine learning-based classification model to construct the RECs for applications, and we develop a local search (LS) algorithm to approximate the optimal solution. We evaluate the proposed algorithm via simulated experiments using real LC applications. To our knowledge, it is the first time to use REC for improving load balancing.
This work is supported by State Key Lab of Computer Architecture, ICT, CAS, under Grant No. CARCHB202013; NSFC (U1833114, 61872201); Science and Technology Development Plan of Tianjin (18ZXZNGX00140, 18ZXZNGX00200, 20JCZDJC00610).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Elastic load balancing. https://aws.amazon.com/elasticloadbalancing/
XGBoost documentation. https://xgboost.readthedocs.io/en/latest/
Aggarwal, G., Motwani, R., Zhu, A.: The load rebalancing problem. In: SPAA 2003, pp. 258–265 (2003)
Azar, Y., Cohen, I.R., Panigrahi, D.: Randomized algorithms for online vector load balancing. In: ACM SIAM, pp. 980–991 (2018)
Belikovetsky, S., Tamir, T.: Load rebalancing games in dynamic systems with migration costs. TCS 622, 16–33 (2016)
Butelle, F., et al.: Fast machine reassignment. Ann. Oper. Res. 242(1), 133–160 (2015). https://doi.org/10.1007/s10479-015-2082-3
Chen, S., Delimitrou, C., MartÃnez, J.F.: Parties: QoS-aware resource partitioning for multiple interactive services. In: ASPLOS 2019, pp. 107–120 (2019)
Cybenko, G.: Dynamic load balancing for distributed memory multiprocessors. J. Parallel Distrib. Comput. 7(2), 279–301 (1989)
Dean, J., Barroso, L.A.: The tail at scale. Commun. ACM 56(2), 74–80 (2013)
Derr, S., Jackson, P., Lameter, C., Menage, P., Seto, H.: CPUSETS (2004). https://www.kernel.org/doc/Documentation/cgroup-v1/cpusets.txt
El-Sayed, N., Mukkara, A., Tsai, P.A., Kasture, H., Ma, X., et al.: KPart: a hybrid cache partitioning-sharing technique for commodity multicores. In: HPCA-24, pp. 104–117 (2018)
Fang, J., Zhang, R., Fu, T.Z.J., Zhang, Z., Zhou, A., et al.: Distributed stream rebalance for stateful operator under workload variance. TPDS 29(10), 2223–2240 (2018)
Fardbastani, M.A., Sharif, M.: Scalable complex event processing using adaptive load balancing. J. Syst. Softw. 149, 305–317 (2019)
Gabay, M., Zaourar, S.: Vector bin packing with heterogeneous bins: application to the machine reassignment problem. Ann. Oper. Res. 242(1), 161–194 (2016)
Gavranović, H., Buljubašić, M.: An efficient local search with noising strategy for google machine reassignment problem. Ann. Oper. Res. 242(1), 19–31 (2016)
Grosu, D., Chronopoulos, A.T.: Noncooperative load balancing in distributed systems. J. Parallel Distrib. Comput. 65(9), 1022–1034 (2005)
Gurobi Optimization, L.: Gurobi Optimization, LLC (2018). http://www.gurobi.com
Jaśkowski, W., Szubert, M., Gawron, P.: A hybrid MIP-based large neighborhood search heuristic for solving the machine reassignment problem. Ann. Oper. Res. 242(1), 33–62 (2015). https://doi.org/10.1007/s10479-014-1780-6
Jin, H., Yang, G., Yu, B., Yoo, C.: TALON: tenant throughput allocation through traffic load-balancing in virtualized software-defined networks. In: ICOIN, pp. 233–238 (2019)
Kapoor, R., Porter, G., Tewari, M., Voelker, G.M., Vahdat, A.: Chronos: predictable low latency for data center applications. In: SoCC 2012 (2012)
Kasture, H., Sanchez, D.: TailBench: a benchmark suite and evaluation methodology for latency-critical applications. In: IISWC, pp. 1–10 (2016)
Dhinesh, B.L., Krishna, P.V.: Honey bee behavior inspired load balancing of tasks in cloud computing environments. Appl. Soft Comput. 13(5), 2292–2303 (2013)
Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: improving resource efficiency at scale. In: ISCA 2015, pp. 450–462 (2015)
Mekala, M.S., Viswanathan, P.: Energy-efficient virtual machine selection based on resource ranking and utilization factor approach in cloud computing for IoT. Comput. Electr. Eng. 73, 227–244 (2019)
Mrad, M., Gharbi, A., Haouari, M., Kharbeche, M.: An optimization-based heuristic for the machine reassignment problem. Ann. Oper. Res. 242(1), 115–132 (2015). https://doi.org/10.1007/s10479-015-2002-6
Nashaat, H., Ashry, N., Rizk, R.: Smart elastic scheduling algorithm for virtual machine migration in cloud computing. J. Supercomput. 75(7), 3842–3865 (2019). https://doi.org/10.1007/s11227-019-02748-2
Nguyen, K.T.: Introduction to cache allocation technology in the Intel® Xeon® processor E5 v4 Family (2016). https://software.intel.com/content/www/us/en/develop/articles/introduction-to-cache-allocation-technology.html
Nishtala, R., Petrucci, V., Carpenter, P., Själander, M.: Twig: multi-agent task management for colocated latency-critical cloud services. In: HPCA-26, pp. 167–179 (2020)
Mahitha, O., Suma, V.: Deadlock avoidance through efficient load balancing to control disaster in cloud environment. In: ICCCNT, pp. 1–6 (2013)
Park, J., Park, S., Baek, W.: CoPart: coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers. In: EuroSys 2019, pp. 1–16 (2019)
Park, J., Park, S., Han, M., Hyun, J., Baek, W.: HyPart: a hybrid technique for practical memory bandwidth partitioning on commodity servers. In: PACT 2018, pp. 1–14 (2018)
Patel, T., Tiwari, D.: CLITE: efficient and QoS-aware co-location of multiple latency-critical jobs for warehouse scale computers. In: HPCA-26, pp. 193–206 (2020)
roadef.org: ROADEF/EURO challenge 2012: machine reassignment (2012). http://www.roadef.org/challenge/2012/en/
Sabar, N.R., Song, A., Zhang, M.: A variable local search based memetic algorithm for the load balancing problem in cloud computing. In: Squillero, G., Burelli, P. (eds.) EvoApplications 2016. LNCS, vol. 9597, pp. 267–282. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31204-0_18
Samal, P., Mishra, P.: Analysis of variants in round robin algorithms for load balancing in cloud computing. IJCSIT 4(3), 416–419 (2013)
Tantawi, A.N., Towsley, D.: Optimal static load balancing in distributed computer systems. J. ACM 32(2), 445–465 (1985)
Tian, W., Zhao, Y., Zhong, Y., Xu, M., Jing, C.: A dynamic and integrated load-balancing scheduling algorithm for cloud datacenters. In: CCIS, pp. 311–315 (2011)
Wang, X., Chen, S., Setter, J., MartÃnez, J.F.: SWAP: effective fine-grain management of shared last-level caches with minimum hardware support. In: HPCA-23, pp. 121–132 (2017)
Xiang, Y., Wang, X., Huang, Z., Wang, Z., Luo, Y., et al.: DCAPS: dynamic cache allocation with partial sharing. In: EuroSys 2018, pp. 1–15 (2018)
Zhu, H., Erez, M.: Dirigent: enforcing QoS for latency-critical tasks on shared multicore systems. In: ASPLOS 2016, pp. 33–47 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Duan, K., Li, Y., Marbach, T.G., Wang, G., Liu, X. (2021). Improving Load Balancing for Modern Data Centers Through Resource Equivalence Classes. In: Hacid, H., Kao, O., Mecella, M., Moha, N., Paik, Hy. (eds) Service-Oriented Computing. ICSOC 2021. Lecture Notes in Computer Science(), vol 13121. Springer, Cham. https://doi.org/10.1007/978-3-030-91431-8_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-91431-8_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91430-1
Online ISBN: 978-3-030-91431-8
eBook Packages: Computer ScienceComputer Science (R0)