skip to main content
10.1145/3542637.3542648acmotherconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

OpenNetLab: Open Platform for RL-based Congestion Control for Real-Time Communications

Published:07 November 2023Publication History

ABSTRACT

With the growing importance of real-time communications (RTC), designing congestion control (CC) algorithms for RTC that achieve high network performance and QoE is gaining attention. Recently, data-driven, reinforcement learning (RL)-based CC algorithms for RTC have shown great potential, outperforming traditional rule-based counterparts. However, there are no open platforms tailored for training, evaluation, and validation of the algorithms that can facilitate this emerging research area.

We present OpenNetLab, an open platform for fast training, reproducible end-to-end evaluation, and performance validation of RL-based CC algorithms for RTC. Preliminary use cases confirm that OpenNetLab concretely aided the training of novel RL-based CC algorithms for RTC that outperform a well-established rule-based baseline in both network performance and QoE metrics.

References

  1. 2021. Grand Challenge on Bandwidth Estimation for Real-Time Communications. (2021). https://2021.acmmmsys.org/rtc_challenge.php.Google ScholarGoogle Scholar
  2. 2022. Google Meet.(2022). https://meet.google.com.Google ScholarGoogle Scholar
  3. 2022. iPerf - The ultimate speed test tool for TCP, UDP and SCTP. (2022). https://iperf.fr.Google ScholarGoogle Scholar
  4. 2022. Microsoft Teams.(2022). https://www.microsoft.com/en-us/microsoft-teams/group-chat-software.Google ScholarGoogle Scholar
  5. 2022. ns-3 Network Simulator.(2022). https://www.nsnam.org.Google ScholarGoogle Scholar
  6. 2022. OpenAI Gym.(2022). https://gym.openai.com/.Google ScholarGoogle Scholar
  7. 2022. Zoom.(2022). https://zoom.us.Google ScholarGoogle Scholar
  8. Venkat Arun and Hari Balakrishnan. 2018. Copa: Practical delay-based congestion control for the internet. In NSDI.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Per Block, Marion Hoffman, Isabel J Raabe, Jennifer Beam Dowd, Charles Rahal, Ridhi Kashyap, and Melinda C Mills. 2020. Social network-based distancing strategies to flatten the COVID-19 curve in a post-lockdown world. Nature Human Behaviour 4, 6 (2020), 588–596.Google ScholarGoogle ScholarCross RefCross Ref
  10. Niklas Blum, Serge Lachapelle, and Harald Alvestrand. 2021. WebRTC: real-time communication for the open web platform. Commun. ACM 64, 8 (2021), 50–54.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Francesca De Simone, Jie Li, Henrique Galvan Debarba, Abdallah El Ali, Simon NB Gunkel, and Pablo Cesar. 2019. Watching videos together in social virtual reality: An experimental study on user’s QoE. In 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR). IEEE, 890–891.Google ScholarGoogle ScholarCross RefCross Ref
  12. Mo Dong, Tong Meng, Doron Zarchy, Engin Arslan, Yossi Gilad, Brighten Godfrey, and Michael Schapira. 2018. {PCC} vivace: Online-learning congestion control. In NSDI.Google ScholarGoogle Scholar
  13. Piotr Gawłowicz and Anatolij Zubow. 2019. Ns-3 meets openai gym: The playground for machine learning in networking research. In Proceedings of the 22nd International ACM Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems. 113–120.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Simon NB Gunkel, Hans M Stokking, Martin J Prins, Nanda van der Stap, Frank B ter Haar, and Omar A Niamut. 2018. Virtual Reality Conferencing: Multi-user immersive VR experiences on the web. In Proceedings of the 9th ACM Multimedia Systems Conference. 498–501.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Sangtae Ha, Injong Rhee, and Lisong Xu. 2008. CUBIC: a new TCP-friendly high-speed TCP variant. ACM SIGOPS operating systems review 42, 5 (2008), 64–74.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Stefan Holmer, Henrik Lundin, Gaetano Carlucci, Luca De Cicco, and Saverio Mascolo. 2015. A Google Congestion Control Algorithm for Real-Time Communication. (2015). https://datatracker.ietf.org/doc/html/draft-alvestrand-rmcat-congestion-03Google ScholarGoogle Scholar
  17. Gazi Karam Illahi, Thomas Van Gemert, Matti Siekkinen, Enrico Masala, Antti Oulasvirta, and Antti Ylä-Jääski. 2020. Cloud gaming with foveated video encoding. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 16, 1 (2020), 1–24.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Nathan Jay, Noga Rotman, Brighten Godfrey, Michael Schapira, and Aviv Tamar. 2019. A deep reinforcement learning perspective on internet congestion control. In International Conference on Machine Learning. PMLR, 3050–3059.Google ScholarGoogle Scholar
  19. Tong Li, Kai Zheng, Ke Xu, Rahul Arvind Jadhav, Tao Xiong, Keith Winstein, and Kun Tan. 2020. Tack: Improving wireless transport performance by taming acknowledgments. In ACM SIGCOMM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Zhi Li, Christos Bampis, Julie Novak, Anne Aaron, Kyle Swanson, Anush Moorthy, and JD Cock. 2018. VMAF: The journey continues. Netflix Technology Blog 25 (2018).Google ScholarGoogle Scholar
  21. Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, and Ion Stoica. 2018. RLlib: Abstractions for distributed reinforcement learning. In International Conference on Machine Learning. PMLR, 3053–3062.Google ScholarGoogle Scholar
  22. Hongzi Mao, Ravi Netravali, and Mohammad Alizadeh. 2017. Neural adaptive video streaming with pensieve. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication. 197–210.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Shubham Pateria, Budhitama Subagdja, Ah-hwee Tan, and Chai Quek. 2021. Hierarchical reinforcement learning: A comprehensive survey. ACM Computing Surveys (CSUR) 54, 5 (2021), 1–35.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Chandan KA Reddy, Vishak Gopal, and Ross Cutler. 2021. Dnsmos: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6493–6497.Google ScholarGoogle ScholarCross RefCross Ref
  25. Saeed Shafiee Sabet, Steven Schmidt, Saman Zadtootaghaj, Carsten Griwodz, and Sebastian Möller. 2020. Delay sensitivity classification of cloud gaming content. In Proceedings of the 12th ACM International Workshop on Immersive Mixed and Virtual Environment Systems. 25–30.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Richard S Sutton, David McAllester, Satinder Singh, and Yishay Mansour. 1999. Policy gradient methods for reinforcement learning with function approximation. Advances in neural information processing systems 12 (1999).Google ScholarGoogle Scholar
  27. Bo Wang, Yuan Zhang, Size Qian, Zipeng Pan, and Yuhong Xie. 2021. A Hybrid Receiver-side Congestion Control Scheme for Web Real-time Communication. In ACM MMSys.Google ScholarGoogle Scholar
  28. Mowei Wang, Yong Cui, Xin Wang, Shihan Xiao, and Junchen Jiang. 2017. Machine learning for networking: Workflow, advances and opportunities. IEEE Network 32, 2 (2017), 92–99.Google ScholarGoogle ScholarCross RefCross Ref
  29. Francis Y Yan, Hudson Ayers, Chenzhi Zhu, Sadjad Fouladi, James Hong, Keyi Zhang, Philip Levis, and Keith Winstein. 2020. Learning in situ: a randomized experiment in video streaming. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20). 495–511.Google ScholarGoogle Scholar
  30. Francis Y Yan, Jestin Ma, Greg D Hill, Deepti Raghavan, Riad S Wahby, Philip Levis, and Keith Winstein. 2018. Pantheon: the training ground for Internet congestion-control research. In USENIX ATC.Google ScholarGoogle Scholar
  31. Huanhuan Zhang, Anfu Zhou, Yuhan Hu, Chaoyue Li, Guangping Wang, Xinyu Zhang, Huadong Ma, Leilei Wu, Aiyun Chen, and Changhui Wu. 2021. Loki: improving long tail performance of learning-based real-time video adaptation by fusing rule-based models. In MobiCom.Google ScholarGoogle Scholar
  32. Huanhuan Zhang, Anfu Zhou, Jiamin Lu, Ruoxuan Ma, Yuhan Hu, Cong Li, Xinyu Zhang, Huadong Ma, and Xiaojiang Chen. 2020. OnRL: improving mobile video telephony via online reinforcement learning. In MobiCom.Google ScholarGoogle Scholar
  33. Huanhuan Zhang, Anfu Zhou, Ruoxuan Ma, Jiamin Lu, and Huadong Ma. 2021. Arsenal: Understanding Learning-based Wireless Video Transport via In-depth Evaluation. IEEE Transactions on Vehicular Technology 70, 10 (2021), 10832–10844.Google ScholarGoogle ScholarCross RefCross Ref
  34. Zhengxu Xia, Yajie Zhou, Francis Y. Yan, Junchen Jiang. 2022. Automatic Curriculum Generation for Learning Adaptation in Networking. arXiv preprint arXiv: 2202.05940(2022).Google ScholarGoogle Scholar
  35. Anfu Zhou, Huanhuan Zhang, Guangyuan Su, Leilei Wu, Ruoxuan Ma, Zhen Meng, Xinyu Zhang, Xiufeng Xie, Huadong Ma, and Xiaojiang Chen. 2019. Learning to coordinate video codec with transport protocol for mobile video telephony. In MobiCom.Google ScholarGoogle Scholar

Index Terms

  1. OpenNetLab: Open Platform for RL-based Congestion Control for Real-Time Communications
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            APNet '22: Proceedings of the 6th Asia-Pacific Workshop on Networking
            July 2022
            110 pages
            ISBN:9781450397483
            DOI:10.1145/3542637

            Copyright © 2022 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 7 November 2023

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format