Abstract
Service systems often feature multiple classes of customers with different service needs and multiple pools of servers with different skillsets. How to efficiently match customers of different classes with servers of different skillsets is of great importance to the management of these systems. In this survey, we review works on skill-based routing in queues. We first summarize key insights on routing/scheduling policies developed in the literature. We then discuss complications brought by modern service operations management problems, particularly healthcare systems. These complications stimulate a growing body of literature on new modeling and analysis tools. Lastly, we provide additional numerical experiments to highlight the complex nature of a routing problem motivated from hospital patient-flow management, and provide some useful intuition to develop good skill-based routing policies in practice. Our goal is to provide a brief overview of the skill-based routing research landscape and to help generate interesting research ideas.
Similar content being viewed by others
References
Adan, I., Weiss, G.: A loss system with skill-based servers under assign to longest idle server policy. Probab. Eng. Inf. Sci. 26(3), 307–321 (2012)
Adan, I., Weiss, G.: A skill based parallel service system under FCFS-ALIS: steady state, overloads, and abandonments. Stoch. Syst. 4(1), 250–299 (2014)
Argon, N., Ziya, S.: Priority assignment under imperfect information on customer type identities. Manuf. Serv. Oper. Manag. 11(4), 674–693 (2009)
Armony, M.: Dynamic routing in large-scale service systems with heterogeneous servers. Queueing Syst. 51(3–4), 287–329 (2005)
Armony, M., Bambos, N.: Queueing dynamics and maximal throughput scheduling in switched processing systems. Queueing Syst. 44(3), 209–252 (2003)
Armony, M., Israelit, S., Mandelbaum, A., Marmor, Y.N., Tseytlin, Y., Yom-Tov, G.B.: On patient flow in hospitals: a data-based queueing-science perspective. Stochas. Syst. 5(1), 146–194 (2015)
Armony, M., Ward, A.R.: Fair dynamic routing in large-scale heterogeneous-server systems. Oper. Res. 58(3), 624–637 (2010)
Ata, B., Peng, X.: An optimal callback policy for general arrival processes: a pathwise analysis. Oper. Res. 68(2), 327–347 (2020)
Atar, R.: Scheduling control for queueing systems with many servers: asymptotic optimality in heavy traffic. Ann. Appl. Probab. 15(4), 2606–2650 (2005)
Atar, R.: A diffusion regime with nondegenerate slowdown. Oper. Res. 60(2), 490–500 (2012)
Atar, R., Mandelbaum, A., Reiman, M.I.: Scheduling a multi class queue with many exponential servers: asymptotic optimality in heavy traffic. Ann. Appl. Probab. 14(3), 1084–1134 (2004)
Balseiro, S., Brown, D.B., Chen, C.: Dynamic pricing of relocating resources in large networks. Manage. Sci. (forthcoming) (2020)
Bassamboo, A., Harrison, J.M., Zeevi, A.: Design and control of a large call center: asymptotic analysis of an lp-based method. Oper. Res. 54(3), 419–435 (2006)
Bassamboo, A., Randhawa, R.: Scheduling homogeneous impatient customers. Manag. Sci. 62(7), 2129–2147 (2016)
Bassamboo, A., Zeevi, A.: On a data-driven method for staffing large call centers. Oper. Res. 57(3), 714–726 (2009)
Bell, S.L., Williams, R.J.: Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: asymptotic optimality of a threshold policy. Ann. Appl. Probab. 11(3), 608–649 (2001)
Best, T.J., Sandçı, B., Eisenstein, D.D., Meltzer, D.O.: Managing hospital inpatient bed capacity through partitioning care into focused wings. Manuf. Serv. Oper. Manag. 17(2), 157–176 (2015)
Bonald, T., Proutiere, A.: Insensitive bandwidth sharing in data networks. Queueing Syst. 44(1), 69–100 (2003)
Borst, S., Mandelbaum, A., Reiman, M.I.: Dimensioning large call centers. Oper. Res. 52(1), 17–34 (2004)
Braverman, A., Gurvich, I., Huang, J.: On the Taylor expansion of value functions. Oper. Res. 68(2), 631–654 (2020)
Buyukkoc, C., Varaiya, P., Walrand, J.: The c\(\mu \) rule revisited. Adv. Appl. Probab. 17(1), 237–238 (1985)
Chalfin, D., Trzeciak, S., Likourezos, A., Baumann, B., Dellinger, R.: Impact of delayed transfer of critically ill patients from the emergency department to the intensive care unit. Crit. Care Med. 35, 1477–1483 (2007)
Chan, C.W., Farias, V.F., Bambos, N., Escobar, G.J.: Optimizing intensive care unit discharge decisions with patient readmissions. Oper. Res. 60(6), 1323–1341 (2012)
Chan, C.W., Farias, V.F., Escobar, G.J.: The impact of delays on service times in the intensive care unit. Manag. Sci. 63(7), 2049–2072 (2017)
Chan, C.W., Yom-Tov, G., Escobar, G.J.: When to use speedup: an examination of service systems with returns. Oper. Res. 62(2), 462–482 (2014)
Chen, Y., Dong, J.: Scheduling with service-time information: the power of two priority classes. Working paper (2019)
Cox, D.R., Smith, W.L.: Queues. Methuen, London (1961)
Dai, J., Shi, P.: Inpatient bed overflow: an approximate dynamic programming approach. Manuf. Serv. Oper. Manag. 21(4), 894–911 (2019)
Dai, J.G., Lin, W.: Maximum pressure policies in stochastic processing networks. Oper. Res. 53(2), 197–218 (2005)
Dai, J.G., Lin, W.: Asymptotic optimality of maximum pressure policies in stochastic processing networks. Ann. Appl. Probab. 18(6), 2239–2299 (2008)
Dai, J.G., Tezcan, T.: Optimal control of parallel server systems with many servers in heavy traffic. Queueing Syst. 59(2), 95 (2008)
Dai, J.G., Tezcan, T.: State space collapse in many-server diffusion limits of parallel server systems. Math. Oper. Res. 36(2), 271–320 (2011)
Daniel Adelman, A.J.M.: Relaxations of weakly coupled stochastic dynamic programs. Oper. Res. 56(3), 712–727 (2008)
Delana, K., Savva, N., Tezcan, T.: Proactive customer service: operational benefits and economic frictions. Manuf. Serv. Oper. Manag., Articles in Advance (2020)
Dong, J., Feldman, P., Yom-Tov, G.B.: Service systems with slowdowns: potential failures and proposed solutions. Oper. Res. 63(2), 305–324 (2015)
Dong, J., Perry, O.: Queueing models for patient-flow dynamics in inpatient wards. Oper. Res. 68(1), 250–275 (2020)
Dong, J., Shi, P., Zheng, F., Jin, X.: Off-service placement in inpatient ward network: resource pooling versus service slowdown. Working paper (2019)
Foley, R.D., McDonald, D.R.: Join the shortest queue: stability and exact asymptotics. Ann. Appl. Probab. 11(3), 569–607 (2001)
Foss, S., Chernova, N.: On the stability of a partially accessible multi-station queue with state-dependent routing. Queueing Syst. 29, 55–73 (1998)
Gans, N., Koole, G., Mandelbaum, A.: Telephone call centers: tutorial, review, and research prospects. Manuf. Serv. Oper. Manag. 5(2), 79–141 (2003)
Gardner, K., Righter, R.: Product forms for FCFS queueing models with arbitrary server-job compatibilities: an overview. In: Queueing Systems (2020)
Gardner, K., Zbarsky, S., Doroudi, S., Harchol-Balter, M., Hyytiä, E., Scheller-Wolf, A.: Queueing with redundant requests: exact analysis. Queueing Syst. 83(3), 227–259 (2016)
Garnett, O., Mandelbaum, A.: An introduction to skills-based routing and its operational complexities. Teaching Note, Technion, Haifa, Israel (2000)
Garnett, O., Mandelbaum, A., Reiman, M.: Designing a call center with impatient customers. Manuf. Serv. Oper. Manag. 4(3), 208–227 (2002)
Ghamami, S., Ward, A.R.: Dynamic scheduling of a two-server parallel server system with complete resource pooling and reneging in heavy traffic: Asymptotic optimality of a two-threshold policy. Math. Oper. Res. 38(4), 761–824 (2013)
Gurvich, I., Luedtke, J., Tezcan, T.: Staffing call centers with uncertain demand forecasts: a chance-constrained optimization approach. Manag. Sci. 56(7), 1093–1115 (2010)
Gurvich, I., Van Meghem, J.: Collaboration and multitasking in networks: prioritization and achievable capacity. Manag. Sci. 64(5), 2390–2406 (2018)
Gurvich, I., Ward, A.R.: On the dynamic control of matching queues. Stochas. Syst. 4(2), 479–523 (2014)
Gurvich, I., Whitt, W.: Scheduling flexible servers with convex delay costs in many-server service systems. Manuf. Serv. Oper. Manag. 11(2), 237–253 (2008)
Gurvich, I., Whitt, W.: Queue-and-idleness-ratio controls in many-server service systems. Math. Oper. Res. 34(2), 363–396 (2009)
Gurvich, I., Whitt, W.: Service-level differentiation in many-server service systems via queue-ratio routing. Oper. Res. 58(2), 316–328 (2010)
Halfin, S., Whitt, W.: Heavy-traffic limits for queues with many exponential servers. Oper. Res. 29(3), 567–588 (1981)
Harrison, J.M.: Heavy traffic analysis of a system with parallel servers: asymptotic optimality of discrete-review policies. Ann. Appl. Probab. 8(3), 822–848 (1998)
Harrison, J.M., López, M.J.: Heavy traffic resource pooling in parallel-server systems. Queueing Syst. 33(4), 339–368 (1999)
Harrison, J.M., Zeevi, A.: Dynamic scheduling of a multiclass queue in the Halfin–Whitt heavy traffic regime. Oper. Res. 52(2), 243–257 (2004)
Hu, Y., Chan, C.W., Dong, J.: Optimal scheduling of proactive service with customer deterioration and improvement. Manage. Sci. (forthcoming) (2020)
Huang, J., Carmeli, B., Mandelbaum, A.: Control of patient flow in emergency departments, or multiclass queues with deadlines and feedback. Oper. Res. 63(4), 892–908 (2015)
Iglehart, D.L., Whitt, W.: Multiple channel queues in heavy traffic. II: sequences, networks, and batches. Adv. Appl. Probab. 2(2), 355–369 (1970)
Jaeker, J.A.B., Tucker, A.L.: Past the point of speeding up: the negative effects of workload saturation on efficiency and patient severity. Manag. Sci. 63(4), 1042–1062 (2017)
Kc, D.S., Terwiesch, C.: An econometric analysis of patient flows in the cardiac intensive care unit. Manuf. Serv. Oper. Manag. 14(1), 50–65 (2012)
Kc, D.S., Terwiesch, C.: Impact of workload on service time and patient safety: an econometric analysis of hospital operations. Manag. Sci. 55(9), 1486–1498 (2009)
Krishnasamy, S., Sen, R., Johari, R., Shakkottai, S.: Learning unknown service rates in queues: a multi-armed bandit approach. Oper. Res. Articles in Advance (2020)
Lu, Y., Xie, Q., Kliot, G., Geller, A., Larus, J.R., Greenberg, A.: Join-idle-queue: a novel load balancing algorithm for dynamically scalable web services. Perform. Eval. 68(11), 1056–1071 (2011)
Luo, J., Zhang, J.: Staffing and control of instant messaging contact centers. Oper. Res. 61(2), 328–343 (2013)
Mandelbaum, A., Momcilovic, P.: Personalized queues: the customer view, via a fluid model of serving least-patient first. Queueing Syst. 87, 1–31 (2017)
Mandelbaum, A., Stolyar, A.L.: Scheduling flexible servers with convex delay costs: heavy-traffic optimality of the generalized c\(\mu \)-rule. Oper. Res. 52(6), 836–855 (2004)
van Mieghem, J.A.: Dynamic scheduling with convex delay costs: the generalized \(c\mu \) rule. Ann. Appl. Probab. 5(3), 809–833 (1995)
Mitzenmacher, M.: The power of two choices in randomized load balancing. IEEE Trans. Parallel Distrib. Syst. 12(10), 1094–1104 (2001)
Örmeci, E.L., Güneş, E.D., Kunduzcu, D.: A modeling framework for control of preventive services. Manuf. Serv. Oper. Manag. 18(2), 227–244 (2015)
Perry, O., Whitt, W.: Responding to unexpected overloads in large-scale service systems. Manage Sci. 55(8), 1353–1367 (2009)
Perry, O., Whitt, W.: A fluid approximation for service systems responding to unexpected overloads. Oper. Res. 59(5), 1159–1170 (2011)
Puha, A.L., Ward, A.R.: Tutorial paper: scheduling an overloaded multiclass many-server queue with impatient customers. In: Tutorials in Operations Research (2019)
Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley-Interscience, Hoboken (2005)
Reed, J.E., Ward, A.R.: Approximating the GI/GI/1+GI queue with a nonlinear drift diffusion: hazard rate scaling in heavy traffic. Math. Oper. Res. 33(3), 606–644 (2008)
Reiman, M.I.: Some diffusion approximations with state space collapse. In: Baccelli, F., Fayolle, G. (eds.) Modelling and Performance Evaluation Methodology. Lecture Notes in Control and Information Sciences, pp. 207–240. Springer, Berlin (1984)
Selen, J., Adan, I.J., Kulkarni, V.G., van Leeuwaarden, J.S.: The snowball effect of customer slowdown in critical many-server systems. Stochas. Models 32(3), 366–391 (2016)
Shi, P., Chou, M.C., Dai, J., Ding, D., Sim, J.: Models and insights for hospital inpatient operations: time-dependent ED boarding time. Manag. Sci. 62(1), 1–28 (2016)
Shi, P., Helm, J., Deglise-Hawkinson, J., Pan, J.: Timing it right: balancing inpatient congestion versus readmission risk at discharge. Oper. Res. (forthcoming) (2020)
Song, H., Tucker, A., Graue, R., Moravick, S., Yang, J.: Capacity pooling in hospitals: the hidden consequences of off-service placement. Manag. Sci. 66(9), 3825–3842 (2020)
Srikant, R., Ying, L.: Communication Networks: An Optimization, Control, and Stochastic Networks Perspective. Cambridge University Press, Cambridge (2013)
Stolyar, A.L.: MaxWeight scheduling in a generalized switch: state space collapse and workload minimization in heavy traffic. Ann. Appl. Probab. 14(1), 1–53 (2004)
Sun, Z., Argon, N., Ziya, S.: Patient triage and prioritization under austere conditions. Manag. Sci. 64(10), 4471–4489 (2018)
Tassiulas, L., Ephremides, A.: Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks. In: 29th IEEE Conference on Decision and Control, IEEE, pp. 2130–2132 (1990)
Tezcan, T., Dai, J.G.: Dynamic control of N-systems with many servers: asymptotic optimality of a static priority policy in heavy traffic. Oper. Res. 58(1), 94–110 (2010)
Ward, A.R., Armony, M.: Blind fair routing in large-scale service systems with heterogeneous customers and servers. Oper. Res. 61(1), 228–243 (2013)
Ward, A.R., Glynn, P.W.: A diffusion approximation for a Markovian queue with reneging. Queueing Syst. 43, 103–128 (2003)
Whitt, W.: How multiserver queues scale with growing congestion-dependent demand. Oper. Res. 51(4), 531–542 (2003)
Wierman, A., Harchol-Balter, M.: Classifying scheduling policies with respect to unfairness in an M/GI/1. In: Proceedings of the 2003 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, pp. 238–249 (2003)
Wierman, A., Nuyens, M.: Scheduling despite inexact job-size informaion. ACM SIGMETRICS Perform. Eval. Rev. 36, 25–36 (2008)
Woei, L., Kumar, P.: Optimal control of a queueing system with two heterogeneous servers. IEEE Trans. Autom. Control 29(8), 696–703 (1984)
Xu, K., Chan, C.: Using future information to reduce waiting times in the emergency department via diversion. Manuf. Serv. Oper. Manag. 18(3), 314–331 (2016)
Yom-Tov, G.B., Mandelbaum, A.: Erlang-r: a time-varying queue with reentrant customers, in support of healthcare staffing. Manuf. Serv. Oper. Manag. 16(2), 283–299 (2014)
Zhan, D., Ward, A.R.: Staffing, routing, and payment to trade off speed and quality in large service systems. Oper. Res. 67(6), 1738–1751 (2019)
Acknowledgements
The authors would like to thank Kristen Gardner, Yoni Nazarathy, and the referee for their insightful suggestions. Support from NSF Grant CMMI-1762544 is gratefully acknowledged by J. Dong.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, J., Dong, J. & Shi, P. A survey on skill-based routing with applications to service operations management. Queueing Syst 96, 53–82 (2020). https://doi.org/10.1007/s11134-020-09669-5
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11134-020-09669-5