Skip to main content

Improving Load Balancing for Modern Data Centers Through Resource Equivalence Classes

  • Conference paper
  • First Online:
Service-Oriented Computing (ICSOC 2021)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 13121))

Included in the following conference series:

  • 3032 Accesses

Abstract

Load balancing is one of the most significant concerns for data center (DC) management, and the basic method is reassigning applications from overloaded servers to underloaded servers. However, to ensure the service availability, during the reassignment of an application, some resources (i.e., transient resources) are consumed simultaneously on its initial server and its target server, which imposes a challenge for load balancing. The latest research has proposed a concept called resource equivalence class (REC: a set of resource configurations such that a latency-critical (LC) application running with any one of them can meet the QoS target). In this paper, we use the REC to improve the load balancing for a DC where multiple LC applications have already been co-located on servers with the service availability and QoS requirements. We formulate the proposed load rebalancing problem as a multi-objective constrained programming model. To solve the proposed problem, we propose to use a machine learning-based classification model to construct the RECs for applications, and we develop a local search (LS) algorithm to approximate the optimal solution. We evaluate the proposed algorithm via simulated experiments using real LC applications. To our knowledge, it is the first time to use REC for improving load balancing.

This work is supported by State Key Lab of Computer Architecture, ICT, CAS, under Grant No. CARCHB202013; NSFC (U1833114, 61872201); Science and Technology Development Plan of Tianjin (18ZXZNGX00140, 18ZXZNGX00200, 20JCZDJC00610).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/harisgavranovic/roadef-challenge2012-S41.

References

  1. Elastic load balancing. https://aws.amazon.com/elasticloadbalancing/

  2. XGBoost documentation. https://xgboost.readthedocs.io/en/latest/

  3. Aggarwal, G., Motwani, R., Zhu, A.: The load rebalancing problem. In: SPAA 2003, pp. 258–265 (2003)

    Google Scholar 

  4. Azar, Y., Cohen, I.R., Panigrahi, D.: Randomized algorithms for online vector load balancing. In: ACM SIAM, pp. 980–991 (2018)

    Google Scholar 

  5. Belikovetsky, S., Tamir, T.: Load rebalancing games in dynamic systems with migration costs. TCS 622, 16–33 (2016)

    Article  MathSciNet  Google Scholar 

  6. Butelle, F., et al.: Fast machine reassignment. Ann. Oper. Res. 242(1), 133–160 (2015). https://doi.org/10.1007/s10479-015-2082-3

    Article  MathSciNet  MATH  Google Scholar 

  7. Chen, S., Delimitrou, C., Martínez, J.F.: Parties: QoS-aware resource partitioning for multiple interactive services. In: ASPLOS 2019, pp. 107–120 (2019)

    Google Scholar 

  8. Cybenko, G.: Dynamic load balancing for distributed memory multiprocessors. J. Parallel Distrib. Comput. 7(2), 279–301 (1989)

    Article  Google Scholar 

  9. Dean, J., Barroso, L.A.: The tail at scale. Commun. ACM 56(2), 74–80 (2013)

    Article  Google Scholar 

  10. Derr, S., Jackson, P., Lameter, C., Menage, P., Seto, H.: CPUSETS (2004). https://www.kernel.org/doc/Documentation/cgroup-v1/cpusets.txt

  11. El-Sayed, N., Mukkara, A., Tsai, P.A., Kasture, H., Ma, X., et al.: KPart: a hybrid cache partitioning-sharing technique for commodity multicores. In: HPCA-24, pp. 104–117 (2018)

    Google Scholar 

  12. Fang, J., Zhang, R., Fu, T.Z.J., Zhang, Z., Zhou, A., et al.: Distributed stream rebalance for stateful operator under workload variance. TPDS 29(10), 2223–2240 (2018)

    Google Scholar 

  13. Fardbastani, M.A., Sharif, M.: Scalable complex event processing using adaptive load balancing. J. Syst. Softw. 149, 305–317 (2019)

    Article  Google Scholar 

  14. Gabay, M., Zaourar, S.: Vector bin packing with heterogeneous bins: application to the machine reassignment problem. Ann. Oper. Res. 242(1), 161–194 (2016)

    Article  MathSciNet  Google Scholar 

  15. Gavranović, H., Buljubašić, M.: An efficient local search with noising strategy for google machine reassignment problem. Ann. Oper. Res. 242(1), 19–31 (2016)

    Article  MathSciNet  Google Scholar 

  16. Grosu, D., Chronopoulos, A.T.: Noncooperative load balancing in distributed systems. J. Parallel Distrib. Comput. 65(9), 1022–1034 (2005)

    Article  Google Scholar 

  17. Gurobi Optimization, L.: Gurobi Optimization, LLC (2018). http://www.gurobi.com

  18. Jaśkowski, W., Szubert, M., Gawron, P.: A hybrid MIP-based large neighborhood search heuristic for solving the machine reassignment problem. Ann. Oper. Res. 242(1), 33–62 (2015). https://doi.org/10.1007/s10479-014-1780-6

    Article  MathSciNet  MATH  Google Scholar 

  19. Jin, H., Yang, G., Yu, B., Yoo, C.: TALON: tenant throughput allocation through traffic load-balancing in virtualized software-defined networks. In: ICOIN, pp. 233–238 (2019)

    Google Scholar 

  20. Kapoor, R., Porter, G., Tewari, M., Voelker, G.M., Vahdat, A.: Chronos: predictable low latency for data center applications. In: SoCC 2012 (2012)

    Google Scholar 

  21. Kasture, H., Sanchez, D.: TailBench: a benchmark suite and evaluation methodology for latency-critical applications. In: IISWC, pp. 1–10 (2016)

    Google Scholar 

  22. Dhinesh, B.L., Krishna, P.V.: Honey bee behavior inspired load balancing of tasks in cloud computing environments. Appl. Soft Comput. 13(5), 2292–2303 (2013)

    Article  Google Scholar 

  23. Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: improving resource efficiency at scale. In: ISCA 2015, pp. 450–462 (2015)

    Google Scholar 

  24. Mekala, M.S., Viswanathan, P.: Energy-efficient virtual machine selection based on resource ranking and utilization factor approach in cloud computing for IoT. Comput. Electr. Eng. 73, 227–244 (2019)

    Article  Google Scholar 

  25. Mrad, M., Gharbi, A., Haouari, M., Kharbeche, M.: An optimization-based heuristic for the machine reassignment problem. Ann. Oper. Res. 242(1), 115–132 (2015). https://doi.org/10.1007/s10479-015-2002-6

    Article  MathSciNet  MATH  Google Scholar 

  26. Nashaat, H., Ashry, N., Rizk, R.: Smart elastic scheduling algorithm for virtual machine migration in cloud computing. J. Supercomput. 75(7), 3842–3865 (2019). https://doi.org/10.1007/s11227-019-02748-2

    Article  Google Scholar 

  27. Nguyen, K.T.: Introduction to cache allocation technology in the Intel® Xeon® processor E5 v4 Family (2016). https://software.intel.com/content/www/us/en/develop/articles/introduction-to-cache-allocation-technology.html

  28. Nishtala, R., Petrucci, V., Carpenter, P., Själander, M.: Twig: multi-agent task management for colocated latency-critical cloud services. In: HPCA-26, pp. 167–179 (2020)

    Google Scholar 

  29. Mahitha, O., Suma, V.: Deadlock avoidance through efficient load balancing to control disaster in cloud environment. In: ICCCNT, pp. 1–6 (2013)

    Google Scholar 

  30. Park, J., Park, S., Baek, W.: CoPart: coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers. In: EuroSys 2019, pp. 1–16 (2019)

    Google Scholar 

  31. Park, J., Park, S., Han, M., Hyun, J., Baek, W.: HyPart: a hybrid technique for practical memory bandwidth partitioning on commodity servers. In: PACT 2018, pp. 1–14 (2018)

    Google Scholar 

  32. Patel, T., Tiwari, D.: CLITE: efficient and QoS-aware co-location of multiple latency-critical jobs for warehouse scale computers. In: HPCA-26, pp. 193–206 (2020)

    Google Scholar 

  33. roadef.org: ROADEF/EURO challenge 2012: machine reassignment (2012). http://www.roadef.org/challenge/2012/en/

  34. Sabar, N.R., Song, A., Zhang, M.: A variable local search based memetic algorithm for the load balancing problem in cloud computing. In: Squillero, G., Burelli, P. (eds.) EvoApplications 2016. LNCS, vol. 9597, pp. 267–282. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31204-0_18

    Chapter  Google Scholar 

  35. Samal, P., Mishra, P.: Analysis of variants in round robin algorithms for load balancing in cloud computing. IJCSIT 4(3), 416–419 (2013)

    Google Scholar 

  36. Tantawi, A.N., Towsley, D.: Optimal static load balancing in distributed computer systems. J. ACM 32(2), 445–465 (1985)

    Article  MathSciNet  Google Scholar 

  37. Tian, W., Zhao, Y., Zhong, Y., Xu, M., Jing, C.: A dynamic and integrated load-balancing scheduling algorithm for cloud datacenters. In: CCIS, pp. 311–315 (2011)

    Google Scholar 

  38. Wang, X., Chen, S., Setter, J., Martínez, J.F.: SWAP: effective fine-grain management of shared last-level caches with minimum hardware support. In: HPCA-23, pp. 121–132 (2017)

    Google Scholar 

  39. Xiang, Y., Wang, X., Huang, Z., Wang, Z., Luo, Y., et al.: DCAPS: dynamic cache allocation with partial sharing. In: EuroSys 2018, pp. 1–15 (2018)

    Google Scholar 

  40. Zhu, H., Erez, M.: Dirigent: enforcing QoS for latency-critical tasks on shared multicore systems. In: ASPLOS 2016, pp. 33–47 (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yusen Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Duan, K., Li, Y., Marbach, T.G., Wang, G., Liu, X. (2021). Improving Load Balancing for Modern Data Centers Through Resource Equivalence Classes. In: Hacid, H., Kao, O., Mecella, M., Moha, N., Paik, Hy. (eds) Service-Oriented Computing. ICSOC 2021. Lecture Notes in Computer Science(), vol 13121. Springer, Cham. https://doi.org/10.1007/978-3-030-91431-8_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-91431-8_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-91430-1

  • Online ISBN: 978-3-030-91431-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics