Deep reinforcement learning-driven intelligent panoramic video bitrate adaptation

Online panoramic video has recently gained enormous popularity. Tile-based adaptive streaming is a promising method to deliver a panoramic video for the sake of bandwidth saving. However, it's challenging to estimate the user's field of view (FoV) and deliver the optimal bitrate due to the dynamic user behavior and time-varying network. In this paper, we propose a novel approach to delivering panoramic video. Specifically, a long short-term memory (LSTM) model is used to estimate the FoV in the next few seconds. Our quality adaptation policy is based on a deep reinforcement learning (DRL) agent, which is able to intelligently adapt its bitrate selection policy to different environments. We have implemented a prototype of this system, which outperforms other existing panoramic video streaming frameworks by 12% in quality of experience (QoE) after getting converged in a wide range of environment metrics, and achieves the best performance.

[1]  Antoine Coutrot,et al.  A dataset of head and eye movements for 360° videos , 2018, MMSys.

[2]  Vyas Sekar,et al.  Improving fairness, efficiency, and stability in HTTP-based adaptive video streaming with FESTIVE , 2012, CoNEXT '12.

[3]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[4]  Zhimin Xu,et al.  360ProbDASH: Improving QoE of 360 Video Streaming Using Tile-based HTTP Adaptive Streaming , 2017, ACM Multimedia.

[5]  Minhua Zhou,et al.  An Overview of Tiles in HEVC , 2013, IEEE Journal of Selected Topics in Signal Processing.

[6]  Cyril Concolato,et al.  MPEG DASH SRD: spatial relationship description , 2016, MMSys.

[7]  Mohammad Hosseini,et al.  Adaptive 360 VR Video Streaming Based on MPEG-DASH SRD , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[8]  Bruno Sinopoli,et al.  A Control-Theoretic Approach for Dynamic Adaptive Video Streaming over HTTP , 2015, Comput. Commun. Rev..

[9]  Shenghua Gao,et al.  Gaze Prediction in Dynamic 360° Immersive Videos , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[10]  Feng Li,et al.  Rubiks: Practical 360-Degree Streaming for Smartphones , 2018, MobiSys.

[11]  Ramesh K. Sitaraman,et al.  BOLA: Near-Optimal Bitrate Adaptation for Online Videos , 2016, IEEE/ACM Transactions on Networking.