skip to main content
extended-abstract

Universality of Power-of-d Load Balancing Schemes

Published: 29 September 2016 Publication History

Abstract

We consider a system of N parallel queues with unit exponential service rates and a single dispatcher where tasks arrive as a Poisson process of rate λ(N). When a task arrives, the dispatcher assigns it to a server with the shortest queue among d(N) ≤ N randomly selected servers. This load balancing policy is referred to as a power-of-d(N) or JSQ(d(N)) scheme, and subsumes the Join-the-Shortest Queue (JSQ) policy as a crucial special case for d(N) = N.
We construct a coupling to bound the difference in the queue length processes between the JSQ policy and an arbitrary value of d(N). We use the coupling to derive the fluid limit in the regime where λ(N)/N → λ < 1 and d(N)→ ∞ as N → ∞, along with the corresponding fixed point. The fluid limit turns out not to depend on the exact growth rate of d(N), and in particular coincides with that for the JSQ policy. We further leverage the coupling to establish that the diffusion limit in the regime where (N--λ(N))/ √N → β > 0 and d(N)/ √ N logN → ∞ as N → ∞ corresponds to that for the JSQ policy. These results indicate that the stochastic optimality of the JSQ policy can be preserved at the fluid-level and diffusion-level while reducing the overhead by nearly a factor O(N) and O(√ N), respectively.

References

[1]
M. Bramson, Y. Lu, B. Prabhakar (2010). Randomized load balancing with general service time distributions. Performance Evaluation Review 38 (1) (Proc. ACM SIGMETRICS 2010), 275--286.
[2]
M. Bramson, Y. Lu, B. Prabhakar (2012). Asymptotic independence of queues under randomized load balancing. Queueing Systems 71 (3), 247--292.
[3]
P. Eschenfeldt, D. Gamarnik (2015). Join the shortest queue with many servers: The heavy traffic asymptotics. Preprint, arXiv:1502.00999.
[4]
D. Gamarnik, J.N. Tsitsiklis, M. Zubeldia (2016). Delay, memory and messaging tradeoffs in distributed service systems. In: Proc. ACM SIGMETRICS 2016 / Performance 2016.
[5]
V. Gupta, M. Harchol-Balter, K. Sigman, W. Whitt (2007). Analysis of join-the-shortest-queue routing for web server farms. Performance Evaluation 64 (9-12), 1062--1081.
[6]
S. Halfin, W. Whitt (1981). Heavy-traffic limits for queues with many exponential servers. Operations Research 29, 567--588.
[7]
M. Mitzenmacher (2001). The power of two choices in randomized load balancing. IEEE Transactions on Parallel and Distributed Systems 12 (10), 1094--1104.
[8]
A. Stolyar (2015). Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80 (4), 341--361.
[9]
N. Vvedenskaya, R. Dobrushin, F. Karpelevich (1996). Queueing system with selection of the shortest of two queues: An asymptotic approach. Problems of Information Transmission 32 (1), 20--34.
[10]
R.R. Weber (1978). On the optimal assignment of customers to parallel queues. Journal of Applied Probability 15, 406--413.
[11]
W. Winston (1977). Optimality of the shortest line discipline. Journal of Applied Probability 14, 181--189.
[12]
L. Ying, R. Srikant, X. Kang (2015). The power of slightly more than one sample in randomized load balancing. In: Proc. IEEE INFOCOM 2015.

Cited By

View all
  • (2022)k-Nearest Neighbor Queues with Delayed InformationInternational Journal of Bifurcation and Chaos10.1142/S021812742250174732:12Online publication date: 10-Oct-2022
  • (2022)Why (and when) do asymptotic methods work so well?Queueing Systems: Theory and Applications10.1007/s11134-022-09834-y100:3-4(297-299)Online publication date: 1-Apr-2022
  • (2019) Insensitivity of the mean field limit of loss systems under SQ( d ) routeing Advances in Applied Probability10.1017/apr.2019.4151:4(1027-1066)Online publication date: 15-Nov-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review  Volume 44, Issue 2
September 2016
98 pages
ISSN:0163-5999
DOI:10.1145/3003977
  • Editor:
  • Nidhi Hegde
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 September 2016
Published in SIGMETRICS Volume 44, Issue 2

Check for updates

Qualifiers

  • Extended-abstract

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)k-Nearest Neighbor Queues with Delayed InformationInternational Journal of Bifurcation and Chaos10.1142/S021812742250174732:12Online publication date: 10-Oct-2022
  • (2022)Why (and when) do asymptotic methods work so well?Queueing Systems: Theory and Applications10.1007/s11134-022-09834-y100:3-4(297-299)Online publication date: 1-Apr-2022
  • (2019) Insensitivity of the mean field limit of loss systems under SQ( d ) routeing Advances in Applied Probability10.1017/apr.2019.4151:4(1027-1066)Online publication date: 15-Nov-2019
  • (2018)Delay, Memory, and Messaging Tradeoffs in Distributed Service SystemsStochastic Systems10.1287/stsy.2017.00088:1(45-74)Online publication date: Mar-2018
  • (2018)On the Power-of-d-choices with Least Loaded Server SelectionProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/32244222:2(1-22)Online publication date: 13-Jun-2018
  • (2018)GB-PANDAS:ACM SIGMETRICS Performance Evaluation Review10.1145/3199524.319952845:3(2-14)Online publication date: 20-Mar-2018
  • (2018)Feedback control of server instances for right sizing in the cloud2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)10.1109/ALLERTON.2018.8635636(749-756)Online publication date: 2-Oct-2018
  • (2018)Delta probing policies for redundancyPerformance Evaluation10.1016/j.peva.2018.09.002Online publication date: Oct-2018
  • (2018)The Supermarket Model with Bounded Queue Lengths in EquilibriumJournal of Statistical Physics10.1007/s10955-018-2044-7173:3-4(1149-1194)Online publication date: 28-Apr-2018
  • (2017)The PDE Method for the Analysis of Randomized Load Balancing NetworksProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/31544971:2(1-28)Online publication date: 19-Dec-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media