Quality-Aware Neural Adaptive Video Streaming With Lifelong Imitation Learning

Existing Adaptive Bitrate (ABR) algorithms pick future video chunks’ bitrates via fixed rules or offline trained models to ensure good quality of experience (QoE) for Internet video. Nevertheless, data analysis demonstrates that a good ABR algorithm is required to continually and fast update for adapting itself to time-varying network conditions. Therefore, we propose Comyco, a video quality-aware learning-based ABR approach that enormously improves recent schemes by i) picking the chunk with higher perceptual video qualities rather than video bitrates; ii) training the policy via imitating expert trajectories given by the expert strategy; iii) employing the lifelong learning method to continually train the model w.r.t the fresh trace collected by the users. To achieve this, we develop a complete quality-aware lifelong imitation learning-based ABR system, construct quality-based neural network architecture, collect a quality-driven video dataset, and estimate QoE metrics with video quality features. Using trace-driven and real-world experiments, we demonstrate Comyco reaches 1700-fold improvements in the number of samples required and 16-fold speedup in the training time compared with the prior work. Meanwhile, Comyco outperforms existing methods, with the improvements on average QoE of 7.5%–16.79%. Moreover, experimental results on continual training also illustrate that lifelong learning helps Comyco further improve the average QoE of 1.07%–9.81% in comparison to the offline trained model.

[1]  Pieter Abbeel,et al.  An Algorithmic Perspective on Imitation Learning , 2018, Found. Trends Robotics.

[2]  Andreas Schmidt,et al.  Cross-Layer Effects on Training Neural Algorithms for Video Streaming , 2018, NOSSDAV.

[3]  Krishna R. Pattipati,et al.  ABR streaming of VBR-encoded videos: characterization, challenges, and solutions , 2018, CoNEXT.

[4]  Ramesh K. Sitaraman,et al.  From theory to practice: improving bitrate adaptation in the DASH reference player , 2018, MMSys.

[5]  Yi Sun,et al.  CS2P: Improving Video Bitrate Selection and Adaptation with Data-Driven Throughput Prediction , 2016, SIGCOMM.

[6]  Ronald Kemker,et al.  Measuring Catastrophic Forgetting in Neural Networks , 2017, AAAI.

[7]  Bruno Ribeiro,et al.  Oboe: auto-tuning video ABR algorithms to network conditions , 2018, SIGCOMM.

[8]  Iraj Sodagar,et al.  The MPEG-DASH Standard for Multimedia Streaming Over the Internet , 2011, IEEE MultiMedia.

[9]  Wei Tsang Ooi,et al.  QUETRA: A Queuing Theory Approach to DASH Rate Adaptation , 2017, ACM Multimedia.

[10]  Gerhard Nahler,et al.  Pearson Correlation Coefficient , 2020, Definitions.

[11]  Lifeng Sun,et al.  Adversarial Feature Alignment: Avoid Catastrophic Forgetting in Incremental Task Lifelong Learning , 2019, Neural Computation.

[12]  Ramesh K. Sitaraman,et al.  BOLA: Near-Optimal Bitrate Adaptation for Online Videos , 2016, IEEE/ACM Transactions on Networking.

[13]  Ali C. Begen,et al.  Probe and Adapt: Rate Adaptation for HTTP Video Streaming At Scale , 2013, IEEE Journal on Selected Areas in Communications.

[14]  Ankit Singla,et al.  Understanding video streaming algorithms in the wild , 2020, PAM.

[15]  Yuan Tang,et al.  TF.Learn: TensorFlow's High-level Module for Distributed Machine Learning , 2016, ArXiv.

[16]  Carsten Griwodz,et al.  Commute path bandwidth traces from 3G networks: analysis and applications , 2013, MMSys.

[17]  Martin Wattenberg,et al.  TensorFlow.js: Machine Learning for the Web and Beyond , 2019, MLSys.

[18]  Hu Tuo,et al.  RESA: A Real-Time Evaluation System for ABR , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[19]  David M. Beazley,et al.  SWIG: An Easy to Use Tool for Integrating Scripting Languages with C and C++ , 1996, Tcl/Tk Workshop.

[20]  Ali C. Begen,et al.  Enhancing MPEG DASH performance via server and network assistance , 2017 .

[21]  Nei Kato,et al.  On Removing Routing Protocol from Future Wireless Networks: A Real-time Deep Learning Approach for Intelligent Traffic Control , 2018, IEEE Wireless Communications.

[22]  Reza Rassool,et al.  VMAF reproducibility: Validating a perceptual practical video quality metric , 2017, 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[23]  Zhengfang Duanmu,et al.  A Quality-of-Experience Database for Adaptive Video Streaming , 2018, IEEE Transactions on Broadcasting.

[24]  Djemel Ziou,et al.  Image Quality Metrics: PSNR vs. SSIM , 2010, 2010 20th International Conference on Pattern Recognition.

[25]  Te-Yuan Huang,et al.  Hindsight: evaluate video bitrate adaptation at scale , 2019, MMSys.

[26]  Lifeng Sun,et al.  QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning , 2018, ACM Multimedia.

[27]  Abdul Rehman,et al.  Perceptual experience of time-varying video quality , 2013, 2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX).

[28]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[29]  Wen Gao,et al.  A Knowledge-Driven Quality-of-Experience Model for Adaptive Streaming Videos , 2019, ArXiv.

[30]  Ali C. Begen,et al.  An experimental evaluation of rate-adaptation algorithms in adaptive streaming over HTTP , 2011, MMSys.

[31]  C.-C. Jay Kuo,et al.  Challenges in cloud based ingest and encoding for high quality streaming media , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[32]  Vyas Sekar,et al.  Improving fairness, efficiency, and stability in HTTP-based adaptive video streaming with FESTIVE , 2012, CoNEXT '12.

[33]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[34]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[35]  Timo Aila,et al.  Pruning Convolutional Neural Networks for Resource Efficient Inference , 2016, ICLR.

[36]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[37]  Marcus Rohrbach,et al.  Memory Aware Synapses: Learning what (not) to forget , 2017, ECCV.

[38]  Federico Chiariotti,et al.  D-DASH: A Deep Q-Learning Framework for DASH Video Streaming , 2017, IEEE Transactions on Cognitive Communications and Networking.

[39]  Pradipta De,et al.  HotDASH: Hotspot Aware Adaptive Video Streaming Using Deep Reinforcement Learning , 2018, 2018 IEEE 26th International Conference on Network Protocols (ICNP).

[40]  Kai Zeng,et al.  Display device-adapted video quality-of-experience assessment , 2015, Electronic Imaging.

[41]  Philip Levis,et al.  Learning in situ: a randomized experiment in video streaming , 2019, NSDI.

[42]  Hari Balakrishnan,et al.  Mahimahi: Accurate Record-and-Replay for HTTP , 2015, USENIX Annual Technical Conference.

[43]  Derek Hoiem,et al.  Learning without Forgetting , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Sergey Levine,et al.  Guided Meta-Policy Search , 2019, NeurIPS.

[45]  Asma Ben Letaifa,et al.  Machine learning based QoE prediction in SDN networks , 2017, 2017 13th International Wireless Communications and Mobile Computing Conference (IWCMC).

[46]  Marco Canini,et al.  Cracking Open the Black Box: What Observations Can Tell Us About Reinforcement Learning Agents , 2019, NetAI@SIGCOMM.

[47]  Mohamed Medhat Gaber,et al.  Imitation Learning , 2017, ACM Comput. Surv..

[48]  Ali C. Begen,et al.  SDNDASH: Improving QoE of HTTP Adaptive Streaming Using Software Defined Networking , 2016, ACM Multimedia.

[49]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[50]  Peter Stone,et al.  Recent Advances in Imitation Learning from Observation , 2019, IJCAI.

[51]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[52]  Yuandong Tian,et al.  Real-world Video Adaptation with Reinforcement Learning , 2019, ArXiv.

[53]  Te-Yuan Huang,et al.  A buffer-based approach to rate adaptation: evidence from a large video streaming service , 2015, SIGCOMM 2015.

[54]  Anca D. Dragan,et al.  DART: Noise Injection for Robust Imitation Learning , 2017, CoRL.

[55]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[56]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[57]  Philip Levis,et al.  Pantheon: the training ground for Internet congestion-control research , 2018, USENIX Annual Technical Conference.

[58]  Hongzi Mao,et al.  Variance Reduction for Reinforcement Learning in Input-Driven Environments , 2018, ICLR.

[59]  Lifeng Sun,et al.  Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning , 2019, ACM Multimedia.

[60]  Xiangyu Zhang,et al.  ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design , 2018, ECCV.

[61]  Zhengfang Duanmu,et al.  Quality-of-Experience of Adaptive Video Streaming: Exploring the Space of Adaptations , 2017, ACM Multimedia.

[62]  Ali C. Begen,et al.  Applications and deployments of server and network assisted DASH (SAND) , 2016 .

[63]  Lifeng Sun,et al.  Tiyuntsong: A Self-Play Reinforcement Learning Approach for ABR Video Streaming , 2018, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[64]  Bruno Sinopoli,et al.  A Control-Theoretic Approach for Dynamic Adaptive Video Streaming over HTTP , 2015, Comput. Commun. Rev..

[65]  Christian Timmerer,et al.  A Survey on Bitrate Adaptation Schemes for Streaming Media Over HTTP , 2019, IEEE Communications Surveys & Tutorials.

[66]  Mingwei Xu,et al.  PiTree: Practical Implementation of ABR Algorithms Using Decision Trees , 2019, ACM Multimedia.

[67]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.