QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning

Real-time video streaming is now one of the main applications in all network environments. Due to the fluctuation of throughput under various network conditions, how to choose a proper bitrate adaptively has become an upcoming and interesting issue. To tackle this problem, most proposed rate control methods work for providing high video bitrates instead of video qualities. Nevertheless, we notice that there exists a trade-off between sending bitrate and video quality, which motivates us to focus on how to reach a balance between them. In this paper, we propose QARC (video Quality Aware Rate Control), a rate control algorithm that aims to obtain a higher perceptual video quality with possible lower sending rate and transmission latency. Starting from scratch, QARC uses deep reinforcement learning(DRL) algorithm to train a neural network to select future bitrates based on previously observed network status and past video frames. To overcome the "state explosion problem'', we design a neural network to predict future perceptual video quality as a vector for taking the place of the raw picture in the DRL's inputs. We evaluate QARC via trace-driven simulation, outperforming existing approach with improvements in average video quality of 18% - 25% and decreasing in average latency with 23% -45%. Meanwhile, comparing QARC with offline optimal high bitrate method on various network conditions, we find that QARC also yields a solid result.

[1]  Carsten Griwodz,et al.  Commute path bandwidth traces from 3G networks: analysis and applications , 2013, MMSys.

[2]  Hiroshi Yoshida,et al.  QoE maximizing bitrate control for live video streaming on a mobile uplink , 2017, 2017 14th International Conference on Telecommunications (ConTEL).

[3]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[4]  Martin T. Hagan,et al.  Neural network design , 1995 .

[5]  Federico Chiariotti,et al.  D-DASH: A Deep Q-Learning Framework for DASH Video Streaming , 2017, IEEE Transactions on Cognitive Communications and Networking.

[6]  Larry L. Peterson,et al.  TCP Vegas: End to End Congestion Avoidance on a Global Internet , 1995, IEEE J. Sel. Areas Commun..

[7]  C. Martin 2015 , 2015, Les 25 ans de l’OMC: Une rétrospective en photos.

[8]  Edmund M. Clarke,et al.  Model Checking and the State Explosion Problem , 2011, LASER Summer School.

[9]  Yong Liu,et al.  Real-time bandwidth prediction and rate adaptation for video calls over cellular networks , 2016, MMSys.

[10]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.

[11]  Luca De Cicco,et al.  Analysis and design of the google congestion control for web real-time communication (WebRTC) , 2016, MMSys.

[12]  Hari Balakrishnan,et al.  TCP ex machina: computer-generated congestion control , 2013, SIGCOMM.

[13]  Dario Rossi,et al.  LEDBAT: The New BitTorrent Congestion Control Protocol , 2010, 2010 Proceedings of 19th International Conference on Computer Communications and Networks.

[14]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[15]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[17]  E.W. Knightly,et al.  TCP-LP: low-priority service via end-point congestion control , 2006, IEEE/ACM Transactions on Networking.

[18]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[19]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[20]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[21]  Matteo Frigo,et al.  A fast Fourier transform compiler , 1999, SIGP.

[22]  Reza Rassool,et al.  VMAF reproducibility: Validating a perceptual practical video quality metric , 2017, 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[23]  Djemel Ziou,et al.  Image Quality Metrics: PSNR vs. SSIM , 2010, 2010 20th International Conference on Pattern Recognition.

[24]  Jörg Widmer,et al.  TCP Friendly Rate Control (TFRC): Protocol Specification , 2008, RFC.

[25]  Kozo Satoda,et al.  Experimental comparison of machine learning-based available bandwidth estimation methods over operational LTE networks , 2017, 2017 IEEE Symposium on Computers and Communications (ISCC).

[26]  Deborah Estrin,et al.  RAP: An end-to-end rate-based congestion control mechanism for realtime streams in the Internet , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[27]  Chao Zhou,et al.  Delay-constrained rate control for real-time video streaming over wireless networks , 2015, 2015 Visual Communications and Image Processing (VCIP).

[28]  Hari Balakrishnan,et al.  Stochastic Forecasts Achieve High Throughput and Low Delay over Cellular Networks , 2013, NSDI.

[29]  Cisco Visual Networking Index: Forecast and Methodology 2016-2021.(2017) http://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual- networking-index-vni/complete-white-paper-c11-481360.html. High Efficiency Video Coding (HEVC) Algorithms and Architectures https://jvet.hhi.fraunhofer. , 2017 .

[30]  Lifeng Sun,et al.  Delay-Constrained Rate Control for Real-Time Video Streaming with Bounded Neural Network , 2018, NOSSDAV.

[31]  Hari Balakrishnan,et al.  Mahimahi: Accurate Record-and-Replay for HTTP , 2015, USENIX Annual Technical Conference.