Viewport Adaptation-Based Immersive Video Streaming: Perceptual Modeling and Applications

Immersive video offers the freedom to navigate inside a virtualized environment. Instead of streaming the bulky immersive video in its entirety, viewport (also referred to as field of view, FoV) adaptive streaming is preferred: high-quality content is streamed within the current viewport, while the quality of the representation elsewhere is reduced to save network bandwidth. Considering that the quality can be refined once the user focuses on a new FoV, in this paper we model the perceptual impact of quality variations (introduced by adapting the quantization stepsize and spatial resolution) with respect to the refinement duration. The resulting model is a product of two closed-form exponential functions that explains well the joint impact of quantization and resolution on quality. The analytical model is cross-validated using another set of data, where both the Pearson and Spearman's rank correlation coefficients are close to 0.98. Our work is devised to optimize adaptive FoV streaming of immersive video under limited network resources. Numerical results show that the proposed model significantly improves users' quality of experience, with about 9.36% BD-Rate (Bjontegaard Delta Rate) improvement on average compared to other representative methods, particularly under limited bandwidth.
