skip to main content
10.1145/3627703.3650069acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article
Free Access

Astraea: Towards Fair and Efficient Learning-based Congestion Control

Published:22 April 2024Publication History

ABSTRACT

Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes. However, they fail to provide consistently good convergence properties, including fairness, fast convergence and stability, due to the mismatch between their objective functions and these properties. Despite being intuitive, integrating these properties into existing learning-based CC is challenging, because: 1) their training environments are designed for the performance optimization of single flow but incapable of cooperative multi-flow optimization, and 2) there is no directly measurable metric to represent these properties into the training objective function.

We present Astraea, a new learning-based congestion control that ensures fast convergence to fairness with stability. At the heart of Astraea is a multi-agent deep reinforcement learning framework that explicitly optimizes these convergence properties during the training process by enabling the learning of interactive policy between multiple competing flows, while maintaining high performance. We further build a faithful multi-flow environment that emulates the competing behaviors of concurrent flows, explicitly expressing convergence properties to enable their optimization during training. We have fully implemented Astraea and our comprehensive experiments show that Astraea can quickly converge to fairness point and exhibit better stability than its counterparts. For example, Astraea achieves near-optimal bandwidth sharing (i.e., fairness) when multiple flows compete for the same bottleneck, delivers up to 8.4× faster convergence speed and 2.8× smaller throughput deviation, while achieving comparable or even better performance over prior solutions.

References

  1. Linux tc. https://man7.org/linux/man-pages/man8/tc.8.html.Google ScholarGoogle Scholar
  2. Pantheon tunnel. https://github.com/StanfordSNR/pantheon-tunnel. Accessed: 2021-05-30.Google ScholarGoogle Scholar
  3. Soheil Abbasloo, Chen-Yu Yen, and H Jonathan Chao. Classic meets modern: a pragmatic learning-based congestion control for the internet. In Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication, pages 632--647, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Guido Appenzeller, Isaac Keslassy, and Nick McKeown. Sizing router buffers. ACM SIGCOMM Computer Communication Review, 34(4):281--292, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Venkat Arun and Hari Balakrishnan. Copa: Practical delay-based congestion control for the internet. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18), pages 329--342, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Lawrence S Brakmo, Sean W O'Malley, and Larry L Peterson. TCP Vegas: New techniques for congestion detection and avoidance. Number 4. ACM, 1994.Google ScholarGoogle Scholar
  7. Neal Cardwell, Yuchung Cheng, C Stephen Gunn, Soheil Hassas Yeganeh, and Van Jacobson. BBR: Congestion-based congestion control. Queue, 14(5):20--53, 2016.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Li Chen, Justinas Lingys, Kai Chen, and Feng Liu. Auto: Scaling deep reinforcement learning for datacenter-scale automatic traffic optimization. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, pages 191--205, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Inho Cho, Keon Jang, and Dongsu Han. Credit-scheduled delay-bounded congestion control for datacenters. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication, SIGCOMM '17, page 239-252. Association for Computing Machinery, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Mo Dong, Qingxi Li, Doron Zarchy, P Brighten Godfrey, and Michael Schapira. PCC: Re-architecting congestion control for consistent high performance. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15), pages 395--408, 2015.Google ScholarGoogle Scholar
  11. Mo Dong, Tong Meng, Doron Zarchy, Engin Arslan, Yossi Gilad, Brighten Godfrey, and Michael Schapira. PCC Vivace: Online-learning congestion control. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18), pages 343--356, Renton, WA, April 2018. USENIX Association.Google ScholarGoogle Scholar
  12. Sally Floyd, Tom Henderson, Andrei Gurtov, et al. The newreno modification to tcp's fast recovery algorithm. 1999.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Victor S Frost and Benjamin Melamed. Traffic modeling for telecommunications networks. IEEE Communications Magazine, 32(3):70--81, 1994.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Sangtae Ha, Injong Rhee, and Lisong Xu. Cubic: a new tcp-friendly high-speed tcp variant. ACM SIGOPS operating systems review, (5):64--74, 2008.Google ScholarGoogle Scholar
  15. Shariq Iqbal and Fei Sha. Actor-attention-critic for multi-agent reinforcement learning. In ICML, 2019.Google ScholarGoogle Scholar
  16. Van Jacobson. Congestion avoidance and control. ACM SIGCOMM computer communication review, 18(4):314--329, 1988.Google ScholarGoogle Scholar
  17. Raj Jain, Arjan Durresi, and Gojko Babic. Throughput fairness index: An explanation. In ATM Forum contribution, volume 99, 1999.Google ScholarGoogle Scholar
  18. Nathan Jay, Noga Rotman, Brighten Godfrey, Michael Schapira, and Aviv Tamar. A deep reinforcement learning perspective on internet congestion control. In International Conference on Machine Learning ICML, pages 3050--3059, 2019.Google ScholarGoogle Scholar
  19. Cheng Jin, David X Wei, and Steven H Low. Fast tcp: motivation, architecture, algorithms, performance. In IEEE INFOCOM 2004, volume 4, pages 2490--2501. IEEE, 2004.Google ScholarGoogle Scholar
  20. Dina Katabi, Mark Handley, and Charlie Rohrs. Congestion control for high bandwidth-delay product networks. In Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications, pages 89--102, 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Frank P Kelly, Aman K Maulloo, and David Kim Hong Tan. Rate control for communication networks: shadow prices, proportional fairness and stability. Journal of the Operational Research society, 49(3):237--252, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  22. Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, and Kai Chen. Towards fair and efficient learning-based congestion control. arXiv preprint arXiv:2403.01798, 2024.Google ScholarGoogle Scholar
  23. Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015.Google ScholarGoogle Scholar
  24. Michael L Littman. Markov games as a framework for multi-agent reinforcement learning. In Machine learning proceedings 1994, pages 157--163. Elsevier, 1994.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, and Igor Mordatch. Multi-agent actor-critic for mixed cooperative-competitive environments. arXiv preprint arXiv:1706.02275, 2017.Google ScholarGoogle Scholar
  26. Yiqing Ma, Han Tian, Xudong Liao, Junxue Zhang, Weiyan Wang, Kai Chen, and Xin Jin. Multi-objective congestion control. In Proceedings of the Seventeenth European Conference on Computer Systems, pages 218--235, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Hongzi Mao, Ravi Netravali, and Mohammad Alizadeh. Neural adaptive video streaming with pensieve. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication, pages 197--210, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Hongzi Mao, Malte Schwarzkopf, Shaileshh Bojja Venkatakrishnan, Zili Meng, and Mohammad Alizadeh. Learning scheduling algorithms for data processing clusters. In Proceedings of the ACM Special Interest Group on Data Communication, SIGCOMM '19, page 270--288, New York, NY, USA, 2019. Association for Computing Machinery.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Gustavo Marfia, Claudio Palazzi, Giovanni Pau, Mario Gerla, MY Sanadidi, and Marco Roccetti. Tcp libra: Exploring rtt-fairness for tcp. In International Conference on Research in Networking, pages 1005--1013. Springer, 2007.Google ScholarGoogle Scholar
  30. Ravi Netravali, Anirudh Sivaraman, Somak Das, Ameesh Goyal, Keith Winstein, James Mickens, and Hari Balakrishnan. Mahimahi: Accurate record-and-replay for HTTP. In 2015 USENIX Annual Technical Conference (USENIX ATC 15), pages 417--429, 2015.Google ScholarGoogle Scholar
  31. Tabish Rashid, Mikayel Samvelyan, C. S. D. Witt, Gregory Farquhar, Jakob N. Foerster, and Shimon Whiteson. Qmix: Monotonic value function factorisation for deep multi-agent reinforcement learning. ArXiv, abs/1803.11485, 2018.Google ScholarGoogle Scholar
  32. Tabish Rashid, Mikayel Samvelyan, C. S. D. Witt, Gregory Farquhar, Jakob N. Foerster, and Shimon Whiteson. Monotonic value function factorisation for deep multi-agent reinforcement learning. J. Mach. Learn. Res., 21:178:1-178:51, 2020.Google ScholarGoogle Scholar
  33. Alessio Sacco, Matteo Flocco, Flavio Esposito, and Guido Marchetto. Owl: congestion control with partially invisible networks via reinforcement learning. In IEEE INFOCOM 2021-IEEE Conference on Computer Communications, pages 1--10. IEEE, 2021.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Anirudh Sivaraman, Keith Winstein, Pratiksha Thaker, and Hari Balakrishnan. An experimental study of the learnability of congestion control. In Proceedings of the 2014 ACM Conference on SIGCOMM, SIGCOMM '14, page 479--490, New York, NY, USA, 2014. Association for Computing Machinery.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Richard S Sutton and Andrew G Barto. Reinforcement learning: An introduction. MIT press, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Kun Tan, Jingmin Song, Qian Zhang, and Murari Sridharan. A compound tcp approach for high-speed and long distance networks. In Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications, pages 1--12. IEEE, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  37. Han Tian, Xudong Liao, Chaoliang Zeng, Junxue Zhang, and Kai Chen. Spine: an efficient drl-based congestion control with ultra-low overhead. In Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies, pages 261--275, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Keith Winstein and Hari Balakrishnan. Tcp ex machina: Computer-generated congestion control. In Proceedings of the ACM SIGCOMM 2013 Conference on SIGCOMM, SIGCOMM '13, page 123--134, New York, NY, USA, 2013. Association for Computing Machinery.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Keith Winstein, Anirudh Sivaraman, and Hari Balakrishnan. Stochastic forecasts achieve high throughput and low delay over cellular networks. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13), pages 459--471, 2013.Google ScholarGoogle Scholar
  40. Zhengxu Xia, Yajie Zhou, Francis Y. Yan, and Junchen Jiang. Genet: Automatic curriculum generation for learning adaptation in networking. In Proceedings of the ACM SIGCOMM 2022 Conference, SIGCOMM '22, page 397--413, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Kaiqiang Xu, Xinchen Wan, Hao Wang, Zhenghang Ren, Xudong Liao, Decang Sun, Chaoliang Zeng, and Kai Chen. Tacc: A full-stack cloud computing infrastructure for machine learning tasks. arXiv preprint arXiv:2110.01556, 2021.Google ScholarGoogle Scholar
  42. Francis Y Yan, Jestin Ma, Greg D Hill, Deepti Raghavan, Riad S Wahby, Philip Levis, and Keith Winstein. Pantheon: the training ground for internet congestion-control research. In 2018 USENIX Annual Technical Conference (USENIX ATC 18), 2018.Google ScholarGoogle Scholar
  43. Junxue Zhang, Chaoliang Zeng, Hong Zhang, Shuihai Hu, and Kai Chen. Liteflow: towards high-performance adaptive neural networks for kernel datapath. In Proceedings of the ACM SIGCOMM 2022 Conference, pages 414--427, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Astraea: Towards Fair and Efficient Learning-based Congestion Control

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        EuroSys '24: Proceedings of the Nineteenth European Conference on Computer Systems
        April 2024
        1245 pages
        ISBN:9798400704376
        DOI:10.1145/3627703

        Copyright © 2024 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 April 2024

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate241of1,308submissions,18%
      • Article Metrics

        • Downloads (Last 12 months)100
        • Downloads (Last 6 weeks)100

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader