Streaming 360-Degree Videos Using Super-Resolution

360° videos provide an immersive experience to users, but require considerably more bandwidth to stream compared to regular videos. State-of-the-art 360° video streaming systems use viewport prediction to reduce bandwidth requirement, that involves predicting which part of the video the user will view and only fetching that content. However, viewport prediction is error prone resulting in poor user Quality of Experience (QoE). We design PARSEC, a 360° video streaming system that reduces bandwidth requirement while improving video quality. PARSEC trades off bandwidth for additional client-side computation to achieve its goals. PARSEC uses an approach based on super-resolution, where the video is significantly compressed at the server and the client runs a deep learning model to enhance the video to a much higher quality. PARSEC addresses a set of challenges associated with using super-resolution for 360° video streaming: large deep learning models, slow inference rate, and variance in the quality of the enhanced videos. To this end, PAR-SEC trains small micro-models over shorter video segments, and then combines traditional video encoding with super-resolution techniques to overcome the challenges. We evaluate PARSEC on a real WiFi network, over a broadband network trace released by FCC, and over a 4G/LTE network trace. PARSEC significantly outperforms the state-of-art 360° video streaming systems while reducing the bandwidth requirement.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  Jinwoo Shin,et al.  Neural Adaptive Content-aware Internet Video Delivery , 2018, OSDI.

[3]  Yaron Silberberg,et al.  Super-resolution enhancement by quantum image scanning microscopy , 2018, Nature Photonics.

[4]  Pan Hu,et al.  Dejavu: Enhancing Videoconferencing with Prior Knowledge , 2019, HotMobile.

[5]  Songqing Chen,et al.  BAS-360°: Exploring Spatial and Temporal Adaptability in 360-degree Videos over HTTP/2 , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[6]  Anh Nguyen,et al.  Your Attention is Unique: Detecting 360-Degree Video Saliency in Head-Mounted Display for Head Movement Prediction , 2018, ACM Multimedia.

[7]  Xiaodong Yang,et al.  Recognizing actions using depth motion maps-based histograms of oriented gradients , 2012, ACM Multimedia.

[8]  K. K. Ramakrishnan,et al.  Characterization of 360-degree Videos , 2017, VR/AR Network@SIGCOMM.

[9]  Bruno Sinopoli,et al.  A Control-Theoretic Approach for Dynamic Adaptive Video Streaming over HTTP , 2015, Comput. Commun. Rev..

[10]  Frédo Durand,et al.  Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[12]  Zhenhua Li,et al.  A Measurement Study of Oculus 360 Degree Video Streaming , 2017, MMSys.

[13]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[14]  Bruno Ribeiro,et al.  Oboe: auto-tuning video ABR algorithms to network conditions , 2018, SIGCOMM.

[15]  Kyoung Mu Lee,et al.  Enhanced Deep Residual Networks for Single Image Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16]  Yong Liu,et al.  Very Long Term Field of View Prediction for 360-Degree Video Streaming , 2019, 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[17]  Songqing Chen,et al.  OpTile: Toward Optimal Tiling in 360-degree Video Streaming , 2017, ACM Multimedia.

[18]  Michael G. Strintzis,et al.  Optimized transmission of JPEG2000 streams over wireless channels , 2006, IEEE Transactions on Image Processing.

[19]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[20]  Christian Timmerer,et al.  Towards Bandwidth Efficient Adaptive Streaming of Omnidirectional Video over HTTP: Design, Implementation, and Evaluation , 2017, MMSys.

[21]  Xiaoou Tang,et al.  Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Bernd Girod,et al.  A Framework to Evaluate Omnidirectional Video Coding Schemes , 2015, 2015 IEEE International Symposium on Mixed and Augmented Reality.

[23]  Cheng-Hsin Hsu,et al.  Fixation Prediction for 360° Video Streaming in Head-Mounted Virtual Reality , 2017, NOSSDAV.

[24]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[25]  Moon Gi Kang,et al.  Super-resolution image reconstruction: a technical overview , 2003, IEEE Signal Process. Mag..

[26]  Daniel Rueckert,et al.  Cardiac Image Super-Resolution with Global Correspondence Using Multi-Atlas PatchMatch , 2013, MICCAI.

[27]  Cyril Concolato,et al.  GPAC: open source multimedia framework , 2007, ACM Multimedia.

[28]  Nicholas D. Lane,et al.  Poster: MobiSR -- Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors , 2019, MobiCom.

[29]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[30]  Samir R. Das,et al.  Advancing User Quality of Experience in 360-degree Video Streaming , 2019, 2019 IFIP Networking Conference (IFIP Networking).

[31]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Cornelius Hellge,et al.  Delay Impact on MPEG OMAF’s Tile-Based Viewport-Dependent 360° Video Streaming , 2019, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[33]  Zhimin Xu,et al.  360ProbDASH: Improving QoE of 360 Video Streaming Using Tile-based HTTP Adaptive Streaming , 2017, ACM Multimedia.

[34]  Nicholas D. Lane,et al.  MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors , 2019, MobiCom.

[35]  Ramesh K. Sitaraman,et al.  BOLA: Near-Optimal Bitrate Adaptation for Online Videos , 2016, IEEE/ACM Transactions on Networking.

[36]  Hari Balakrishnan,et al.  Mahimahi: Accurate Record-and-Replay for HTTP , 2015, USENIX Annual Technical Conference.

[37]  Ali Borji,et al.  Salient object detection: A survey , 2014, Computational Visual Media.

[38]  Feng Qian,et al.  Flare: Practical Viewport-Adaptive 360-Degree Video Streaming for Mobile Devices , 2018, MobiCom.

[39]  Cheng-Hsin Hsu,et al.  360° Video Viewing Dataset in Head-Mounted Virtual Reality , 2017, MMSys.

[40]  Iraj Sodagar,et al.  The MPEG-DASH Standard for Multimedia Streaming Over the Internet , 2011, IEEE MultiMedia.

[41]  Miska M. Hannuksela,et al.  HEVC-compliant Tile-based Streaming of Panoramic Video for Virtual Reality Applications , 2016, ACM Multimedia.

[42]  Pong C. Yuen,et al.  Very low resolution face recognition problem , 2010, 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS).