research-article

Public Access

Load balancers need in-band feedback control

Authors:

Bhavana Vannarth Shobhana,

Srinivas Narayana,

Badri NathAuthors Info & Claims

HotNets '22: Proceedings of the 21st ACM Workshop on Hot Topics in Networks

Pages 76 - 84

https://doi.org/10.1145/3563766.3564094

Published: 14 November 2022 Publication History

Abstract

Server load balancers (LBs) are critical components of interactive services, routing client requests to servers in a pool. LBs improve service performance and increase availability by spreading the request load evenly across servers.

It is time to rethink what LBs can do for applications. As application compute becomes increasingly granular (e.g., microservices), request-processing latencies at servers will be ever more impacted by software and system variability at small time scales (e.g., 100μs-1ms). Beyond balancing load, we argue that LBs must actively optimize application response time, by adapting request-routing to quickly-varying server performance.

Specifically, we advocate for in-band feedback control: LBs should adapt the request-routing policy using purely local observations of server performance, derived from requests traversing the LB. A key challenge to designing such feedback controllers is that high-speed LBs only see the requests, not the responses. We present the design of an LB that adapts to a server latency inflation of 1 ms and reduces tail latencies in milliseconds, while observing only client-to-server traffic.

References

[1]

2006. Dynamic Feedback Load Balancing Scheduling. [Online, Retrieved Oct 14, 2022.] http://kb.linuxvirtualserver.org/wiki/Dynamic_Feedback_Load_Balancing_Scheduling. (2006).

[2]

2012. Send Hints to Dynamic Snitch when Compaction or repair is going on for a node. [Online, Retrieved Jun 12, 2022.] https://issues.apache.org/jira/browse/CASSANDRA-3722. (2012).

[3]

2013. HAProxy load balancer feedback agent check. [Online, Retrieved Oct 14, 2022.] https://www.loadbalancer.org/blog/open-source-windows-service-for-reporting-server-load-back-to-haproxy-load-balancer-feedback-agent/. (2013).

[4]

2015. Adopting Microservices at Netflix: Lessons for Architectural Design. [Online, Retrieved Jun 12, 2022.] https://www.nginx.com/blog/microservices-at-netflix-architectural-best-practices/. (2015).

[5]

2015. RFC 7540 HTTP/2: Streams and Multiplexing. [Online, Retrieved Jun 12, 2022.] https://www.rfc-editor.org/rfc/rfc7540.html#section-5. (2015).

[6]

2016. Autoscaling in Kubernetes. [Online, Retrieved Jun 12, 2022.] https://kubernetes.io/blog/2016/07/autoscaling-in-kubernetes/. (2016).

[7]

2017. gRPC load balancing. [Online, Retrieved Jun 12, 2022.] https://grpc.io/blog/grpc-load-balancing/. (2017).

[8]

2017. What's a service mesh and why do I need one? [Online, Retrieved Jun 12, 2022.] https://linkerd.io/2017/04/25/whats-a-service-mesh-and-why-do-i-need-one/. (2017).

[9]

2018. Introduction to HAProxy ACLs. [Online, Retrieved Jun 12, 2022.] https://www.haproxy.com/blog/introduction-to-haproxy-acls/. (2018).

[10]

2018. Open-sourcing Katran, a scalable load balancer. [Online, Retrieved Jun 12, 2022.] https://engineering.fb.com/2018/05/22/open-source/open-sourcing-katran-a-scalable-network-load-balancer/. (2018).

[11]

2019. Cilium: Socket-based load balancing. [Online, Retrieved Jun 12, 2022.] https://cilium.io/blog/2019/08/20/cilium-16#hostservices. (2019).

[12]

2019. Deploying load balancing. [Online, Retrieved Jun 12, 2022.] https://docs.microsoft.com/en-us/windows/win32/rpc/deploying-load-balancing. (2019).

[13]

2019. Managing Uber's data workflows at scale. [Online, Retrieved Jun 12, 2022.] https://eng.uber.com/managing-data-workflows-at-scale/. (2019).

[14]

2019. Microsoft RPC load balancing. [Online, Retrieved Jun 12, 2022.] https://docs.microsoft.com/en-us/windows/win32/rpc/rpc-load-balancing. (2019).

[15]

2020. Kube-proxy replacement with Direct Server Return. [Online, Retrieved Oct 14, 2022.] https://cilium.io/blog/2020/02/18/cilium-17/#kubeproxy-removal. (2020).

[16]

2020. Rebuilding Twitter's public API. [Online, Retrieved Jun 12, 2022.] https://blog.twitter.com/engineering/en_us/topics/infrastructure/2020/rebuild_twitter_public_api_2020. (2020).

[17]

2020. RFC 9000: QUIC: flow control. [Online, Retrieved Jun 12, 2022.] https://www.rfc-editor.org/rfc/rfc9000.html#flow-control. (2020).

[18]

2021. Comparing Kubernetes Container Network Interface (CNI) providers. [Online, Retrieved Jun 12, 2022.] https://kubevious.io/blog/post/comparing-kubernetes-container-network-interface-cni-providers. (2021).

[19]

2021. gRPC performance best practices. [Online, Retrieved Jun 12, 2022.] https://grpc.io/docs/guides/performance/. (2021).

[20]

2021. How eBPF will solve Service Mesh - Goodbye Sidecars. [Online, Retrieved Jun 12, 2022.] https://isovalent.com/blog/post/2021-12-08-ebpf-servicemesh/. (2021).

[21]

2021. Load balancing algorithms. [Online, Retrieved Oct 14, 2022.] https://docs.citrix.com/en-us/citrix-adc/current-release/load-balancing/load-balancing-customizing-algorithms.html. (2021).

[22]

2021. The Cost of Cloud, a Trillion Dollar Paradox. [Online, Retrieved Jun 12, 2022.] https://a16z.com/2021/05/27/cost-of-cloud-paradox-market-cap-cloud-lifecycle-scale-growth-repatriation-optimization/. (2021).

[23]

2021. The Human Side of Airbnb's Microservice Architecture. [Online, Retrieved Jun 12, 2022.] https://www.infoq.com/presentations/airbnb-culture-soa/. (2021).

[24]

2022. Automatic mTLS. [Online, Retrieved Jun 12, 2022.] https://linkerd.io/2.11/features/automatic-mtls/. (2022).

[25]

2022. AWS Lambda. [Online, Retrieved Jun 12, 2022.] https://aws.amazon.com/lambda/. (2022).

[26]

2022. Envoy: supported load balancers. [Online, Retrieved Jun 12, 2022.] https://www.envoyproxy.io/docs/envoy/latest/intro/arch_overview/upstream/load_balancing/load_balancers. (2022).

[27]

2022. Google cloud: Load balancing mode. [Online, Retrieved Jun 12, 2022.] https://cloud.google.com/load-balancing/docs/backend-service#balancing-mode. (2022).

[28]

2022. Google cloud: Traffic policies. [Online, Retrieved Jun 12, 2022.] https://cloud.google.com/load-balancing/docs/l7-internal/traffic-management#traffic_policies. (2022).

[29]

2022. Istio: Locality load balancing. [Online, Retrieved Jun 12, 2022.] https://istio.io/latest/docs/tasks/traffic-management/locality-load-balancing/. (2022).

[30]

2022. Kubernetes Networking: Load Balancer and Network Load Balancer. [Online, Retrieved Jun 12, 2022.] https://ibm.github.io/kubernetes-networking/services/loadbalancer/. (2022).

[31]

2022. Kubernetes scheduler. [Online, Retrieved Jun 12, 2022.] https://kubernetes.io/docs/concepts/scheduling-eviction/kube-scheduler/. (2022).

[32]

2022. Kubernetes without Kube-Proxy. [Online, Retrieved Jun 12, 2022.] https://docs.cilium.io/en/stable/gettingstarted/kubeproxy-free/. (2022).

[33]

2022. Linux container and virtualization tools. [Online, Retrieved Jun 12, 2022.] https://linuxcontainers.org/. (2022).

[34]

2022. memtier_benchmark. [Online, Retrieved Jun 12, 2022.] https://github.com/RedisLabs/memtier_benchmark/. (2022).

[35]

2022. Microservices and Microservices architecture. [Online, Retrieved Jun 12, 2022.] https://www.intel.com/content/www/us/en/cloud-computing/microservices.html. (2022).

[36]

2022. Next-generation mutual authentication with Cilium service mesh. [Online, Retrieved Jun 12, 2022.] https://isovalent.com/blog/post/2022-05-03-servicemesh-security/. (2022).

[37]

2022. NGINX Plus Feature: Load Balancing. [Online, Retrieved Jun 12, 2022.] https://www.nginx.com/products/nginx/load-balancing. (2022).

[38]

2022. Serverless on AWS. [Online, Retrieved Jun 12, 2022.] https://aws.amazon.com/serverless/. (2022).

[39]

2022. The Kubernetes network model. [Online, Retrieved Jun 12, 2022.] https://kubernetes.io/docs/concepts/services-networking/. (2022).

[40]

2022. The Kubernetes Networking Guide: NodePort. [Online, Retrieved Jun 12, 2022.] https://www.tkng.io/services/nodeport/. (2022).

[41]

2022. There are several Load Balancing Methods. Which one is best for your environment? [Online, Retrieved Oct 14, 2022.] https://support.f5.com/csp/article/K42275060. (2022).

[42]

2022. Use containers to Build, Share and Run your applications. [Online, Retrieved Jun 12, 2022.] https://www.docker.com/resources/what-container. (2022).

[43]

2022. Using nginx as HTTP load balancer. [Online, Retrieved Jun 12, 2022.] https://nginx.org/en/docs/http/load_balancing.html. (2022).

[44]

2022. What is a container? [Online, Retrieved Jun 12, 2022.] https://azure.microsoft.com/en-us/overview/what-is-a-container/#overview. (2022).

[45]

2022. ZeroMQ: Advanced request-reply patterns. [Online, Retrieved Jun 12, 2022.] https://zguide.zeromq.org/docs/chapter3/. (2022).

[46]

Mohammad Al-Fares, Sivasankar Radhakrishnan, Barath Raghavan, Nelson Huang, Amin Vahdat, et al. 2010. Hedera: dynamic flow scheduling for data center networks. In Nsdi, Vol. 10. San Jose, USA, 89--92.

[47]

Gianni Antichi and Gábor Rétvári. 2020. Full-stack SDN: The next big challenge?. In Proceedings of the Symposium on SDN Research. 48--54.

Digital Library

[48]

Maria Apostolaki, Ankit Singla, and Laurent Vanbever. 2021. Performance-Driven Internet Path Selection. Association for Computing Machinery, New York, NY, USA, 41--53.

Digital Library

[49]

João Taveira Araújo, Lorenzo Saino, Lennert Buytenhek, and Raul Landa. 2018. Balancing on the edge: Transport affinity without network state. In Usenix Symposium on Networked Systems Design and Implementation (NSDI).

[50]

Sachin Ashok, P Brighten Godfrey, and Radhika Mittal. 2021. Leveraging Service Meshes as a New Network Layer. In Proceedings of the Twentieth ACM Workshop on Hot Topics in Networks. 229--236.

Digital Library

[51]

Tom Barbette, Chen Tang, Haoran Yao, Dejan Kostić, Gerald Q Maguire Jr, Panagiotis Papadimitratos, and Marco Chiesa. 2020. A high-speed load-balancer design with guaranteed per-connection-consistency. In Usenix Symposium on Networked Systems Design and Implementation (NSDI).

[52]

Paul Barford and Mark Crovella. 2000. Critical path analysis of TCP transactions. In ACM SIGCOMM.

[53]

Brandon Williams. 2012. Dynamic snitching in Cassandra: past, present, and future. [Online, Retrieved Jun 12, 2022.] https://www.datastax.com/blog/dynamic-snitching-cassandra-past-present-and-future. (2012).

[54]

Daniel Bristot de Oliveira, Daniel Casini, Rômulo Oliveira, and Tommaso Cucinotta. 2020. Demystifying the Real-Time Linux Scheduling Latency. In ECRTS.

[55]

Brendan Burns, Brian Grant, David Oppenheimer, Eric Brewer, and John Wilkes. 2016. Borg, Omega, and Kubernetes: Lessons learned from three container-management systems over a decade. Queue 14, 1 (2016), 70--93.

Digital Library

[56]

Carson Anderson. 2017. Kubernetes deconstructed. [Online, Retrieved Jun 12, 2022.] https://vimeo.com/245778144/4d1d597c5e. (2017).

[57]

Daniel Borkmann. 2020. Kube-proxy replacement at the XDP layer. [Online, Retrieved Jun 12, 2022.] https://cilium.io/blog/2020/06/22/cilium-18#kubeproxy-removal. (2020).

[58]

Daniel Borkmann. 2022. Cilium & BPF: a fundamentally better dataplane. [Online, Retrieved Jun 12, 2022.] https://guild42.ch/wp-content/uploads/2021/12/Guild42.ch-BPF-Borkmann.pdf. (2022).

[59]

Daniel Borkmann and Martynas Pumputis. 2020. K8s Service Load Balancing with BPF & XDP. https://lpc.events/event/7/contributions/674/attachments/568/1002/plumbers_2020_cilium_load_balancer.pdf. In Linux Plumbers Conference.

[60]

Jeffrey Dean and Luiz André Barroso. 2013. The tail at scale. Commun. ACM 56, 2 (2013), 74--80.

Digital Library

[61]

Dmitry Duplyakin, Robert Ricci, Aleksander Maricq, Gary Wong, Jonathon Duerig, Eric Eide, Leigh Stoller, Mike Hibler, David Johnson, Kirk Webb, Aditya Akella, Kuangching Wang, Glenn Ricart, Larry Landweber, Chip Elliott, Michael Zink, Emmanuel hh0Cecchet, Snigdhaswin Kar, and Prabodh Mishra. 2019. The Design and Operation of CloudLab. In Proceedings of the USENIX Annual Technical Conference (ATC). https://www.flux.utah.edu/paper/duplyakin-atc19

[62]

Daniel E. Eisenbud, Cheng Yi, Carlo Contavalli, Cody Smith, Roman Kononov, Eric Mann-Hielscher, Ardas Cilingiroglu, Bin Cheyney, Wentao Shang, and Jinnah Dylan Hosein. 2016. Maglev: A Fast and Reliable Software Network Load Balancer. In USENIX Symposium on Networked Systems Design and Implementation (NSDI 16).

[63]

Anwar Elwalid, Cheng Jin, Steven Low, and Indra Widjaja. 2001. MATE: MPLS adaptive traffic engineering. In Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No. 01CH37213), Vol. 3. IEEE, 1300--1309.

[64]

Joshua Fried, Zhenyuan Ruan, Amy Ousterhout, and Adam Belay. 2020. Caladan: Mitigating interference at microsecond timescales. In Usenix Symposium on Operating Systems Design and Implementation (OSDI).

[65]

Yu Gan, Yanqi Zhang, Dailun Cheng, Ankitha Shetty, Priyal Rathi, Nayan Katarki, Ariana Bruno, Justin Hu, Brian Ritchken, Brendon Jackson, et al. 2019. An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems. In Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems. 3--18.

Digital Library

[66]

Yu Gan, Yanqi Zhang, Kelvin Hu, Dailun Cheng, Yuan He, Meghna Pancholi, and Christina Delimitrou. 2019. Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices. In Proceedings of the twenty-fourth international conference on architectural support for programming languages and operating systems. 19--33.

Digital Library

[67]

Rohan Gandhi, Hongqiang Harry Liu, Y. Charlie Hu, Guohan Lu, Jitendra Padhye, Lihua Yuan, and Ming Zhang. 2014. Duet: Cloud Scale Load Balancing with Hardware and Software. In Proceedings of the 2014 ACM Conference on SIGCOMM (SIGCOMM '14). Association for Computing Machinery, New York, NY, USA, 27--38.

Digital Library

[68]

Mojgan Ghasemi, Theophilus Benson, and Jennifer Rexford. 2017. Dapper: Data plane performance diagnosis of tcp. In Proceedings of the Symposium on SDN Research (SOSR).

Digital Library

[69]

Stephen Ibanez, Alex Mallery, Serhat Arslan, Theo Jepsen, Muhammad Shahbaz, Changhoon Kim, and Nick McKeown. 2021. The nanoPU: A Nanosecond Network Stack for Datacenters. In 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI 21). USENIX Association, 239--256. https://www.usenix.org/conference/osdi21/presentation/ibanez

[70]

Van Jacobson and Michael J. Karels. 1988. Congestion Avoidance and Control. In SIGCOMM 1988. Stanford, CA.

[71]

S. Jaiswal, G. Iannaccone, C. Diot, J. Kurose, and D. Towsley. 2004. Inferring TCP connection characteristics through passive measurements. In IEEE INFOCOM 2004, Vol. 3. 1582--1592 vol.3.

[72]

James Lewis and Martin Fowler. 2014. Microservices: a definition of this new architectural term. [Online, Retrieved Jun 12, 2022.] https://martinfowler.com/articles/microservices.html. (2014).

[73]

Hao Jiang and Constantinos Dovrolis. 2002. Passive Estimation of TCP Round-Trip Times. SIGCOMM Comput. Commun. Rev. 32 (2002), 75--88.

Digital Library

[74]

Kostis Kaffes, Timothy Chong, Jack Tigar Humphries, Adam Belay, David Mazières, and Christos Kozyrakis. 2019. Shinjuku: Preemptive Scheduling for {μsecond-scale} Tail Latency. In 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI 19). 345--360.

[75]

Jonathan Kaldor, Jonathan Mace, Michał Bejda, Edison Gao, Wiktor Kuropatwa, Joe O'Neill, Kian Win Ong, Bill Schaller, Pingjia Shan, Brendan Viscomi, et al. 2017. Canopy: An end-to-end performance tracing and analysis system. In Proceedings of the 26th symposium on operating systems principles. 34--50.

Digital Library

[76]

Srikanth Kandula, Dina Katabi, Bruce Davie, and Anna Charny. 2005. Walking the tightrope: Responsive yet stable traffic engineering. ACM SIGCOMM Computer Communication Review (2005).

[77]

Dina Katabi, Mark Handley, and Charlie Rohrs. 2002. Congestion control for high bandwidth-delay product networks. In Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications. 89--102.

Digital Library

[78]

Jeremy Kerr. 2003. Using Dynamic Feedback to Optimise Load Balancing Decisions. In Australian Linux Conference.

[79]

Marios Kogias, George Prekas, Adrien Ghosn, Jonas Fietz, and Edouard Bugnion. 2019. {R2P2}: Making {RPCs} first-class datacenter citizens. In 2019 USENIX Annual Technical Conference (USENIX ATC 19). 863--880.

[80]

Collin Lee and John Ousterhout. 2019. Granular computing. In Proceedings of the Workshop on Hot Topics in Operating Systems. 149--154.

Digital Library

[81]

Uichin Lee, Joon-Sang Park, MY Sanadidi, Mario Gerla, et al. 2005. Flowbased dynamic load balancing for passive network monitoring. In Communications and Computer Networks (CCN). 357--362.

[82]

Jialin Li, Naveen Kr. Sharma, Dan R. K. Ports, and Steven D. Gribble. 2014. Tales of the Tail: Hardware, OS, and Application-Level Sources of Tail Latency. In Proceedings of the ACM Symposium on Cloud Computing (SOCC '14). Association for Computing Machinery, New York, NY, USA, 1--14.

Digital Library

[83]

Yilong Li, Seo Jin Park, and John Ousterhout. 2021. MilliSort and MilliQuery: Large-Scale Data-Intensive Computing in Milliseconds. In 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21). USENIX Association, 593--611. https://www.usenix.org/conference/nsdi21/presentation/li-yilong

[84]

Zaoxing Liu, Samson Zhou, Ori Rottenstreich, Vladimir Braverman, and Jennifer Rexford. [n. d.]. Memory-Efficient Performance Monitoring on Programmable Switches with Lean Algorithms. 31--44. arXiv:https://epubs.siam.org/doi/pdf/10.1137/1.9781611976021.3

[85]

Guohan Lu and Xing Li. 2003. On the correspondency between TCP acknowledgment packet and data packet. In Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement (IMC).

Digital Library

[86]

Sarah McClure, Amy Ousterhout, Scott Shenker, and Sylvia Ratnasamy. 2022. Efficient Scheduling Policies for Microsecond-Scale Tasks. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). USENIX Association, Renton, WA, 1--18. https://www.usenix.org/conference/nsdi22/presentation/mcclure

[87]

Radhika Mittal, Vinh The Lam, Nandita Dukkipati, Emily Blem, Hassan Wassel, Monia Ghobadi, Amin Vahdat, Yaogong Wang, David Wetherall, and David Zats. 2015. TIMELY: RTT-based congestion control for the datacenter. ACM SIGCOMM Computer Communication Review 45, 4 (2015), 537--550.

Digital Library

[88]

Satoru Ohta and Ryuichi Andou. 2009. WWW server load balancing technique based on passive performance measurement. 884 -- 887.

[89]

Vladimir Olteanu, Alexandru Agache, Andrei Voinescu, and Costin Raiciu. 2018. Stateless datacenter load-balancing with beamer. In Usenix Symposium on Networked Systems Design and Implementation (NSDI).

[90]

Kay Ousterhout, Ryan Rasti, Sylvia Ratnasamy, Scott Shenker, and Byung-Gon Chun. 2015. Making sense of performance in data analytics frameworks. In Usenix Symposium on Networked Systems Design and Implementation (NSDI).

[91]

Kay Ousterhout, Patrick Wendell, Matei Zaharia, and Ion Stoica. 2013. Sparrow: Distributed, Low Latency Scheduling. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (SOSP '13). Association for Computing Machinery, New York, NY, USA, 69--84.

Digital Library

[92]

Jitendra Pahdye and Sally Floyd. 2001. On inferring TCP behavior. In ACM SIGCOMM.

[93]

Austin Parker, Daniel Spoonhower, Jonathan Mace, Ben Sigelman, and Rebecca Isaacs. 2020. Distributed tracing in practice: Instrumenting, analyzing, and debugging microservices. O'Reilly Media.

[94]

Parveen Patel, Deepak Bansal, Lihua Yuan, Ashwin Murthy, Albert Greenberg, David A. Maltz, Randy Kern, Hemant Kumar, Marios Zikos, Hongyu Wu, Changhoon Kim, and Naveen Karri. 2013. Ananta: Cloud Scale Load Balancing. SIGCOMM Comput. Commun. Rev. 43 (2013), 207--218.

Digital Library

[95]

Jonathan Perry, Amy Ousterhout, Hari Balakrishnan, Devavrat Shah, and Hans Fugal. 2014. Fastpass: A centralized" zero-queue" datacenter network. In Proceedings of the 2014 ACM conference on SIGCOMM. 307--318.

Digital Library

[96]

George Prekas, Marios Kogias, and Edouard Bugnion. 2017. ZygOS: Achieving Low Tail Latency for Microsecond-Scale Networked Tasks. In Proceedings of the 26th Symposium on Operating Systems Principles (SOSP '17). Association for Computing Machinery, New York, NY, USA, 325--341.

Digital Library

[97]

Hugo Sadok, Zhipeng Zhao, Valerie Choung, Nirav Atre, Daniel S Berger, James C Hoe, Aurojit Panda, and Justine Sherry. 2021. We need kernel interposition over the network dataplane. In Proceedings of the Workshop on Hot Topics in Operating Systems. 152--158.

Digital Library

[98]

Satadal Sengupta and Hyojoon Kim and Jennifer Rexford. 2022. Continuous In-Network Round-Trip Time Monitoring. In SIGCOMM 2022.

[99]

Natalie Serrino. 2021. Horizontal Pod Autoscaling with Custom Metrics in Kubernetes. [Online, Retrieved Jun 12, 2022.] https://blog.px.dev/autoscaling-custom-k8s-metric/. (2021).

[100]

Nikita V. Shirokov. 2018. XDP: 1.5 years in production. Evolution and lessons learned. http://vger.kernel.org/lpc_net2018_talks/LPC_XDP_Shirokov_v2.pdf. In Linux Plumbers Conference.

[101]

Benjamin H Sigelman, Luiz Andre Barroso, Mike Burrows, Pat Stephenson, Manoj Plakal, Donald Beaver, Saul Jaspan, and Chandan Shanbhag. 2010. Dapper, a large-scale distributed systems tracing infrastructure. (2010).

[102]

Shan Sinha, Srikanth Kandula, and Dina Katabi. 2004. Harnessing TCP's burstiness with flowlet switching. In Proc. 3rd ACM Workshop on Hot Topics in Networks (Hotnets-III).

[103]

Lalith Suresh, Marco Canini, Stefan Schmid, and Anja Feldmann. 2015. C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection. In Usenix Symposium on Networked Systems Design and Implementation (NSDI). 513--527.

[104]

Michal Szymaniak, David Presotto, Guillaume Pierre, and Maarten van Steen. 2008. Practical large-scale latency estimation. Computer Networks 52, 7 (2008), 1343--1364.

Digital Library

[105]

Erico Vanini, Rong Pan, Mohammad Alizadeh, Parvin Taheri, and Tom Edsall. 2017. Let it flow: Resilient asymmetric load balancing with flowlet switching. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). 407--420.

[106]

Bryan Veal, Kang Li, and David Lowenthal. 2005. New methods for passive estimation of TCP round-trip times. In International workshop on passive and active network measurement. Springer, 121--134.

Digital Library

[107]

Wenfei Wu, Guohui Wang, Aditya Akella, and Anees Shaikh. 2013. Virtual network diagnosis as a service. In Proceedings of the 4th annual Symposium on Cloud Computing. 1--15.

Digital Library

[108]

Yin Zhang, Lee Breslau, Vern Paxson, and Scott Shenker. 2002. On the characteristics and origins of internet flow rates. In Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications. 309--322.

Digital Library

[109]

Hang Zhu, Kostis Kaffes, Zixu Chen, Zhenming Liu, Christos Kozyrakis, Ion Stoica, and Xin Jin. 2020. RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers. In 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). USENIX Association.

[110]

Danyang Zhuo, Kaiyuan Zhang, Yibo Zhu, Hongqiang Harry Liu, Matthew Rockett, Arvind Krishnamurthy, and Thomas Anderson. 2019. Slim: OS kernel support for a low-overhead container overlay network. In 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI 19). 331--344.

Digital Library

Cited By

P S PK RSharma ATemura AShah RTammana P(2024) DL 3 : Adaptive Load Balancing for Latency-critical Edge Cloud Applications 2024 20th International Conference on Network and Service Management (CNSM)10.23919/CNSM62983.2024.10814405(1-5)Online publication date: 28-Oct-2024
https://doi.org/10.23919/CNSM62983.2024.10814405
Michaelis OSchmid SMostafaei HSchiavoni VEdinger JCao JJin Z(2024)L3: Latency-aware Load Balancing in Multi-Cluster Service MeshProceedings of the 25th International Middleware Conference10.1145/3652892.3654793(49-61)Online publication date: 2-Dec-2024
https://dl.acm.org/doi/10.1145/3652892.3654793
Huang CPetrunin ITsourdos A(2023)Distributed Trajectory Management for Urban Air Mobility Operations with Ground-based Edge Intelligence2023 IEEE/AIAA 42nd Digital Avionics Systems Conference (DASC)10.1109/DASC58513.2023.10311301(1-8)Online publication date: 1-Oct-2023
https://doi.org/10.1109/DASC58513.2023.10311301

Index Terms

Load balancers need in-band feedback control
1. Networks
  1. Network components
    1. Middle boxes / network appliances
  2. Network performance evaluation
    1. Network measurement

Recommendations

Elastic Load Balancing Classic Load Balancers
Elastic Load Balancing Application Load Balancers
Server-Based Dynamic Server Selection Algorithms
ICOIN '02: Revised Papers from the International Conference on Information Networking, Wireless Communications Technologies and Network Applications-Part II

Server selection is an important problem in replicated server systems distributed over the Internet. In this paper, we study two server selection algorithms under a server-based framework we have developed. These algorithms utilize server load and ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

HotNets '22: Proceedings of the 21st ACM Workshop on Hot Topics in Networks

November 2022

252 pages

ISBN:9781450398992

DOI:10.1145/3563766

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGCOMM: ACM Special Interest Group on Data Communication

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 November 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation

Conference

HotNets '22

Sponsor:

SIGCOMM

HotNets '22: The 21st ACM Workshop on Hot Topics in Networks

November 14 - 15, 2022

Texas, Austin

Acceptance Rates

Overall Acceptance Rate 110 of 460 submissions, 24%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
355
Total Downloads

Downloads (Last 12 months)154
Downloads (Last 6 weeks)19

Reflects downloads up to 13 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

P S PK RSharma ATemura AShah RTammana P(2024) DL 3 : Adaptive Load Balancing for Latency-critical Edge Cloud Applications 2024 20th International Conference on Network and Service Management (CNSM)10.23919/CNSM62983.2024.10814405(1-5)Online publication date: 28-Oct-2024
https://doi.org/10.23919/CNSM62983.2024.10814405
Michaelis OSchmid SMostafaei HSchiavoni VEdinger JCao JJin Z(2024)L3: Latency-aware Load Balancing in Multi-Cluster Service MeshProceedings of the 25th International Middleware Conference10.1145/3652892.3654793(49-61)Online publication date: 2-Dec-2024
https://dl.acm.org/doi/10.1145/3652892.3654793
Huang CPetrunin ITsourdos A(2023)Distributed Trajectory Management for Urban Air Mobility Operations with Ground-based Edge Intelligence2023 IEEE/AIAA 42nd Digital Avionics Systems Conference (DASC)10.1109/DASC58513.2023.10311301(1-8)Online publication date: 1-Oct-2023
https://doi.org/10.1109/DASC58513.2023.10311301

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten