skip to main content
research-article

Mean Field Analysis of Join-Below-Threshold Load Balancing for Resource Sharing Servers

Published:17 December 2019Publication History
Skip Abstract Section

Abstract

Load balancing plays a crucial role in many large scale computer systems. Much prior work has focused on systems with First-Come-First-Served (FCFS) servers. However, servers in practical systems are more complicated. They serve multiple jobs at once, and their service rate can depend on the number of jobs in service. Motivated by this, we study load balancing for systems using Limited-Processor-Sharing (LPS). Our model has heterogeneous servers, meaning the service rate curve and multiprogramming level (limit on the number of jobs sharing the processor) differs between servers. We focus on a specific load balancing policy: Join-Below-Threshold (JBT), which associates a threshold with each server and, whenever possible, dispatches to a server which has fewer jobs than its threshold. Given this setup, we ask: how should we configure the system to optimize objectives such as mean response time? Configuring the system means choosing both a load balancing threshold and a multiprogramming level for each server. To make this question tractable, we study the many-server mean field regime. In this paper we provide a comprehensive study of JBT in the mean field regime. We begin by developing a mean field model for the case of exponentially distributed job sizes. The evolution of our model is described by a differential inclusion, which complicates its analysis. We prove that the sequence of stationary measures of the finite systems converges to the fixed point of the differential inclusion, provided a unique fixed point exists. We derive simple conditions on the service rate curves to guarantee the existence of a unique fixed point. We demonstrate that when these conditions are not satisfied, there may be multiple fixed points, meaning metastability may occur. Finally, we give a simple method for determining the optimal system configuration to minimize the mean response time and related metrics. While our theoretical results are proven for the special case of exponentially distributed job sizes, we provide evidence from simulation that the system becomes insensitive to the job size distribution in the mean field regime, suggesting our results are more generally applicable.

References

  1. J. Abate and W. Whitt. Numerical inversion of Laplace transforms of probability distributions. ORSA Journal on computing, 7(1):36--43, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  2. M. Bramson, Y. Lu, and B. Prabhakar. Randomized load balancing with general service time distributions. In ACM SIGMETRICS 2010, pages 275--286, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. L. Brumelle. A generalization of Erlang's loss system to state dependent arrival and service rates. Mathematics of Operations Research, 3(1):10--16, 1978.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Gamarnik, J. N. Tsitsiklis, and M. Zubeldia. Delay, memory, and messaging tradeoffs in distributed service systems. SIGMETRICS Perform. Eval. Rev., 44(1):1--12, June 2016.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Ganesh, S. Lilienthal, D. Manjunath, A. Proutiere, and F. Simatos. Load balancing via random local search in closed and open systems. SIGMETRICS Perform. Eval. Rev., 38(1):287--298, June 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. N. Gast and B. Gaujal. A mean field model of work stealing in large-scale systems. SIGMETRICS Perform. Eval. Rev., 38(1):13--24, June 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. N. Gast and B. Gaujal. Markov chains with discontinuous drifts have differential inclusion limits. Performance Evaluation, 69(12):623 -- 642, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. Gast and B. Van Houdt. A refined mean field approximation. Proc. ACM Meas. Anal. Comput. Syst., 1(2):33:1--33:28, December 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Grosof, Z. Scully, and M. Harchol-Balter. Load balancing guardrails: Keeping your heavy traffic on the road to low response times. Proc. ACM Meas. Anal. Comput. Syst., 3(2):42:1--42:31, June 2019.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Gupta and M. Harchol-Balter. Self-adaptive admission control policies for resource-sharing systems. SIGMETRICS Perform. Eval. Rev., 37(1):311--322, June 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M.W. Hirsch and H. Smith. Monotone dynamical systems. In Handbook of differential equations: ordinary differential equations, volume 2, pages 239--357. Elsevier, 2006.Google ScholarGoogle Scholar
  12. G. Horváth, I. Horváth, S. Al-Deen Almousa, and M. Telek. High order low variance matrix-exponential distributions. The Tenth International Conference on Matrix-Analytic Methods in Stochastic Models (MAM10), 02 2019.Google ScholarGoogle Scholar
  13. Y. Lu, Q. Xie, G. Kliot, A. Geller, J. R. Larus, and A. Greenberg. Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services. Perform. Eval., 68:1056--1071, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Mitzenmacher. The power of two choices in randomized load balancing. IEEE Trans. Parallel Distrib. Syst., 12:1094--1104, October 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Mitzenmacher. Analyzing distributed join-idle-queue: A fluid limit approach. In 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pages 312--318, Sept 2016.Google ScholarGoogle ScholarCross RefCross Ref
  16. G. Roth and W.H. Sandholm. Stochastic approximations with constant step size and differential inclusions. SIAM Journal on Control and Optimization, 51(1):525--555, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A.L. Stolyar. Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems, 80(4):341--361, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Telek and B. Van Houdt. Response time distribution of a class of limited processor sharing queues. SIGMETRICS Perform. Eval. Rev., 45(3):143--155, March 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. T. Vasantam, A. Mukhopadhyay, and R. R. Mazumdar. The mean-field behavior of processor sharing systems with general job lengths under the sq(d) policy. Performance Evaluation, 127--128:120 -- 153, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  20. N.D. Vvedenskaya, R.L. Dobrushin, and F.I. Karpelevich. Queueing system with selection of the shortest of two queues: an asymptotic approach. Problemy Peredachi Informatsii, 32:15--27, 1996.Google ScholarGoogle Scholar
  21. X. Zhou, J. Tan, and N. Shroff. Heavy-traffic delay optimality in pull-based load balancing systems: Necessary and sufficient conditions. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2(3):41, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Mean Field Analysis of Join-Below-Threshold Load Balancing for Resource Sharing Servers

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Proceedings of the ACM on Measurement and Analysis of Computing Systems
        Proceedings of the ACM on Measurement and Analysis of Computing Systems  Volume 3, Issue 3
        SIGMETRICS
        December 2019
        525 pages
        EISSN:2476-1249
        DOI:10.1145/3376928
        Issue’s Table of Contents

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 December 2019
        Published in pomacs Volume 3, Issue 3

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader