Abstract
In many distributed computing systems, stochastically arriving jobs need to be assigned to servers with the objective of minimizing waiting times. Many existing dispatching algorithms fall within the SQ(d) framework: upon arrival of a job, \(d\ge 2\) servers are contacted uniformly at random to retrieve their state, and the job is then routed to a server in the best observed state. A practical issue with this type of algorithm is that server states may not be observable, depending on the underlying architecture. In this paper, we investigate the assignment problem in the open-loop setting, where no feedback information can flow dynamically from the queues back to the controller, i.e., the queues are unobservable. This problem is intractable in general, and the structure of an optimal policy is known only in particular cases. Under mild assumptions and in a heavy-traffic many-server limiting regime, our main result proves the optimality of a subset of deterministic and periodic policies within a wide set of (open-loop) policies that can be randomized or deterministic and can depend on the arrival process at the controller. The limiting value of the scaled stationary mean waiting time achieved by any policy in our subset provides a simple approximation for the optimal system performance.
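To make the distinction between feedback-based and open-loop dispatching concrete, the following is a minimal simulation sketch and not the model analyzed in the paper. It assumes homogeneous servers, Poisson arrivals, exponential service times with unit rate, and FCFS queues. The function name `simulate`, its parameters, and the use of unfinished workload as the "observed state" in SQ(d) are illustrative assumptions. Round-robin stands in here for a deterministic and periodic open-loop policy, and uniform random routing for a randomized one.

```python
import random

def simulate(policy, n_servers=10, load=0.8, n_jobs=50_000, d=2, seed=1):
    """Return the empirical mean waiting time over n_jobs jobs.

    Assumptions (not from the paper): homogeneous servers, unit-rate
    exponential service times, Poisson arrivals with rate load * n_servers.
    """
    rng = random.Random(seed)
    free_at = [0.0] * n_servers  # time at which each server clears its backlog
    t = 0.0
    total_wait = 0.0
    for j in range(n_jobs):
        t += rng.expovariate(load * n_servers)  # next arrival at the dispatcher
        if policy == "random":
            # Open-loop, randomized: ignores server state.
            s = rng.randrange(n_servers)
        elif policy == "round_robin":
            # Open-loop, deterministic and periodic: ignores server state.
            s = j % n_servers
        elif policy == "sq_d":
            # Feedback: sample d servers, pick the least unfinished workload.
            s = min(rng.sample(range(n_servers), d), key=lambda i: free_at[i])
        else:
            raise ValueError(f"unknown policy: {policy}")
        wait = max(0.0, free_at[s] - t)  # Lindley-type recursion for queue s
        total_wait += wait
        free_at[s] = t + wait + rng.expovariate(1.0)  # departure time of this job
    return total_wait / n_jobs

if __name__ == "__main__":
    for name in ("random", "round_robin", "sq_d"):
        print(name, round(simulate(name), 3))
```

Under these assumptions, the two open-loop policies never read `free_at` when choosing a server, whereas `sq_d` does. Comparing the printed values gives a rough sense of how much of the gap between random routing and a feedback policy a periodic open-loop policy can recover.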

Appendix: Proof of Proposition 3
Since \(p_r = \mu _r \bar{v}\) is a constant and \(\frac{ \lambda _k k p_r}{\mu _r\Vert p\Vert (k+f_k)}<1\), we can directly use (54) to obtain
for any \(\pi ^{(k)}\in \mathcal {A}_{\bar{v}\mu -TTRR}^{(k)}\). Thus,
On the other hand, using Lemma 1 when \(q\) is given by \(q_{r,k}=\mu _r/(\Vert \mu \Vert (k+f_k))\), we obtain
Thus, as desired, we obtain
Cite this article
Anselmi, J. Asymptotically optimal open-loop load balancing. Queueing Syst 87, 245–267 (2017). https://doi.org/10.1007/s11134-017-9547-9
DOI: https://doi.org/10.1007/s11134-017-9547-9