research-article

Continuous Bitrate & Latency Control with Deep Reinforcement Learning for Live Video Streaming

Authors:

Jing WangAuthors Info & Claims

MM '19: Proceedings of the 27th ACM International Conference on Multimedia

Pages 2637 - 2641

https://doi.org/10.1145/3343031.3356063

Published: 15 October 2019 Publication History

Abstract

In this paper, we introduce a continuous bitrate control and latency control model for the Live Video Streaming Challenge. Our model is based on Deep Deterministic Policy Gradient, popular on continuous control tasks. Simultaneously, it can take a fine-grained control through continuous control and does not need to discrete the continuous "latency limit", which is a buffer threshold to minimize end-to-end delay by frame skipping. In all considered live video scenarios, our model can provide a better quality of experience with improvements in average QoE of 3.6% than DQN which discrete the "latency limit". Additionally, challenge results show the effectiveness and applicability of the proposed model, which achieved top performance in 3 different networks that include high, low and oscillating throughput, and ranked the second place in the network with medium throughput.

References

[1]

Zahaib Akhtar, Yun Seong Nam, Ramesh Govindan, Sanjay Rao, Jessica Chen, Ethan Katz-Bassett, Bruno Ribeiro, Jibin Zhan, and Hui Zhang. 2018. Oboe: auto-tuning video ABR algorithms to network conditions. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication. ACM, 44--58.

Digital Library

[2]

Abdelhak Bentaleb, Bayan Taani, Ali C Begen, Christian Timmerer, and Roger Zimmermann. 2018. A survey on bitrate adaptation schemes for streaming media over http. IEEE Communications Surveys & Tutorials, Vol. 21, 1 (2018), 562--585.

[3]

VNI Cisco. 2018. Cisco Visual Networking Index: Forecast and Trends, 2017--2022. White Paper (2018).

[4]

Matteo Gadaleta, Federico Chiariotti, Michele Rossi, and Andrea Zanella. 2017. D-DASH: A deep Q-learning framework for DASH video streaming. IEEE Transactions on Cognitive Communications and Networking, Vol. 3, 4 (2017), 703--718.

[5]

Tianchi Huang, Rui-Xiao Zhang, Zhou Chao, and Lifeng Sun. 2018. QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning. (2018).

Digital Library

[6]

Te Yuan Huang, Ramesh Johari, Nick Mckeown, Matthew Trunnell, and Mark Watson. 2014. A Buffer-Based Approach to Rate Adaptation: Evidence from a Large Video Streaming Service. Acm Sigcomm Computer Communication Review, Vol. 44, 4 (2014), 187--198.

Digital Library

[7]

Junchen Jiang, Vyas Sekar, and Hui Zhang. 2014. Improving fairness, efficiency, and stability in http-based adaptive video streaming with festive. IEEE/ACM Transactions on Networking (ToN), Vol. 22, 1 (2014), 326--340.

Digital Library

[8]

Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).

[9]

Yuxi Li. 2017. Deep reinforcement learning: An overview. arXiv preprint arXiv:1701.07274 (2017).

[10]

Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015).

[11]

Hongzi Mao, Ravi Netravali, and Mohammad Alizadeh. 2017. Neural adaptive video streaming with pensieve. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication. ACM, 197--210.

Digital Library

[12]

Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International conference on machine learning. 1928--1937.

Digital Library

[13]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et almbox. 2015. Human-level control through deep reinforcement learning. Nature, Vol. 518, 7540 (2015), 529.

[14]

Kevin Spiteri, Rahul Urgaonkar, and Ramesh K Sitaraman. 2016. BOLA: Near-optimal bitrate adaptation for online videos. In IEEE INFOCOM 2016-The 35th Annual IEEE International Conference on Computer Communications. IEEE, 1--9.

Digital Library

[15]

Yi Sun, Xiaoqi Yin, Junchen Jiang, Vyas Sekar, Fuyuan Lin, Nanshu Wang, Tao Liu, and Bruno Sinopoli. 2016. CS2P: Improving video bitrate selection and adaptation with data-driven throughput prediction. In Proceedings of the 2016 ACM SIGCOMM Conference. ACM, 272--285.

Digital Library

[16]

Pawel Wawrzynski. 2015. Control policy with autocorrelated noise in reinforcement learning for robotics. International Journal of Machine Learning and Computing, Vol. 5, 2 (2015), 91.

[17]

Xiaoqi Yin, Abhishek Jindal, Vyas Sekar, and Bruno Sinopoli. 2015. A control-theoretic approach for dynamic adaptive video streaming over HTTP. In ACM SIGCOMM Computer Communication Review, Vol. 45. ACM, 325--338.

Digital Library

Cited By

Nguyen THua DHuong THoang VDao NCho S(2024)Intelligent QoE Management for IoMT Streaming Services in Multiuser Downlink RSMA NetworksIEEE Internet of Things Journal10.1109/JIOT.2023.333447311:7(12602-12618)Online publication date: 1-Apr-2024
https://doi.org/10.1109/JIOT.2023.3334473
Smirnov NTomforde S(2024)Real-time rate control of WebRTC video streams in 5G networks: Improving quality of experience with Deep Reinforcement LearningJournal of Systems Architecture10.1016/j.sysarc.2024.103066148(103066)Online publication date: Mar-2024
https://doi.org/10.1016/j.sysarc.2024.103066
Niu DCheng GChen Z(2023)TDS-KRFI: Reference Frame Identification for Live Web Streaming Toward HTTP Flash Video ProtocolIEEE Transactions on Network and Service Management10.1109/TNSM.2023.328256320:4(4198-4215)Online publication date: Dec-2023
https://doi.org/10.1109/TNSM.2023.3282563
Show More Cited By

Index Terms

Continuous Bitrate & Latency Control with Deep Reinforcement Learning for Live Video Streaming

Recommendations

Live Video Streaming Optimization Based on Deep Reinforcement Learning
ICMLC '20: Proceedings of the 2020 12th International Conference on Machine Learning and Computing

Video players employ adaptive bitrate algorithms in video-on-demand (VoD) scenarios to improve user-perceived quality of experience (QoE), whereas performance will obviously decline in live video streaming scenarios. To this end, we propose a novel deep ...
Multi-camera Live Video Streaming over Wireless Network
Advances in Mobile Computing and Multimedia Intelligence
Abstract
Due to the development of wireless communication technology, more and more streamers are using cameras mounted on mobile devices for live streaming in a wireless LAN environment. Conventional live streaming systems, which employ multiple images ...
An Intelligent Learning Approach to Achieve Near-Second Low-Latency Live Video Streaming under Highly Fluctuating Networks
MM '23: Proceedings of the 31st ACM International Conference on Multimedia

Fueled by the rapid advances in high-speed mobile networks, live video streaming has seen explosive growth in recent years and many DASH-based bitrate adaptive streaming algorithms were specifically proposed for low-latency video delivery. However, our ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '19: Proceedings of the 27th ACM International Conference on Multimedia

October 2019

2794 pages

ISBN:9781450368896

DOI:10.1145/3343031

General Chairs:
Laurent Amsaleg
CNRS-IRISA, France
,
Benoit Huet
EURECOM, France
,
Martha Larson
Radboud University and TU Delft (Netherlands)
,
Program Chairs:
Guillaume Gravier
CNRS-IRISA, France
,
Hayley Hung
Delft University of Technology Netherlands
,
Chong-Wah Ngo
City University of Hong Kong Hong Kong
,
Wei Tsang Ooi
National University of Singapore Singapore

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Beijing Municipal Natural Science Foundation
National Natural Science Foundation of China
Fundamental Research Funds for the Central Universities

Conference

MM '19

Sponsor:

SIGMM

MM '19: The 27th ACM International Conference on Multimedia

October 21 - 25, 2019

Nice, France

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

15
Total Citations
View Citations
774
Total Downloads

Downloads (Last 12 months)29
Downloads (Last 6 weeks)1

Reflects downloads up to 19 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Nguyen THua DHuong THoang VDao NCho S(2024)Intelligent QoE Management for IoMT Streaming Services in Multiuser Downlink RSMA NetworksIEEE Internet of Things Journal10.1109/JIOT.2023.333447311:7(12602-12618)Online publication date: 1-Apr-2024
https://doi.org/10.1109/JIOT.2023.3334473
Smirnov NTomforde S(2024)Real-time rate control of WebRTC video streams in 5G networks: Improving quality of experience with Deep Reinforcement LearningJournal of Systems Architecture10.1016/j.sysarc.2024.103066148(103066)Online publication date: Mar-2024
https://doi.org/10.1016/j.sysarc.2024.103066
Niu DCheng GChen Z(2023)TDS-KRFI: Reference Frame Identification for Live Web Streaming Toward HTTP Flash Video ProtocolIEEE Transactions on Network and Service Management10.1109/TNSM.2023.328256320:4(4198-4215)Online publication date: Dec-2023
https://doi.org/10.1109/TNSM.2023.3282563
Bentaleb AAkcay MLim MBegen AZimmermann R(2023)BoB: Bandwidth Prediction for Real-Time Communications Using Heuristic and Reinforcement LearningIEEE Transactions on Multimedia10.1109/TMM.2022.321645625(6930-6945)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TMM.2022.3216456
Wei XZhou MJia W(2023)Toward Low-Latency and High-Quality Adaptive 360$^\circ$ StreamingIEEE Transactions on Industrial Informatics10.1109/TII.2022.319239819:5(6326-6336)Online publication date: May-2023
https://doi.org/10.1109/TII.2022.3192398
Chen JLuo ZWang ZHu MWu D(2023)Live360: Viewport-Aware Transmission Optimization in Live 360-Degree Video StreamingIEEE Transactions on Broadcasting10.1109/TBC.2023.323440569:1(85-96)Online publication date: Mar-2023
https://doi.org/10.1109/TBC.2023.3234405
Liu CYin JXu Y(2023)Adaptive Live Streaming for Multi-user Access with Fairness Guarantee2023 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)10.1109/BMSB58369.2023.10211604(1-6)Online publication date: 14-Jun-2023
https://doi.org/10.1109/BMSB58369.2023.10211604
Wei XZhou MKwong SYuan HJia W(2022)A Hybrid Control Scheme for 360-Degree Dynamic Adaptive Video Streaming Over Mobile DevicesIEEE Transactions on Mobile Computing10.1109/TMC.2021.305809921:10(3428-3442)Online publication date: 1-Oct-2022
https://doi.org/10.1109/TMC.2021.3058099
Wang BXu MRen FZhou CWu J(2022)Cratus: A Lightweight and Robust Approach for Mobile Live StreamingIEEE Transactions on Mobile Computing10.1109/TMC.2020.304882621:8(2761-2775)Online publication date: 1-Aug-2022
https://doi.org/10.1109/TMC.2020.3048826
Xu YYin JYang QYang L(2022)Media Production Using Cloud and Edge Computing: Recent Progress and NBMP-Based ImplementationIEEE Transactions on Broadcasting10.1109/TBC.2022.314070468:2(545-558)Online publication date: Jun-2022
https://doi.org/10.1109/TBC.2022.3140704
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten