Subjective and Objective Quality Assessment of 2D and 3D Foveated Video Compression in Virtual Reality

In Virtual Reality (VR), the requirements of much higher resolution and smooth viewing experiences under rapid and often real-time changes in viewing direction, leads to significant challenges in compression and communication. To reduce the stresses of very high bandwidth consumption, the concept of foveated video compression is being accorded renewed interest. By exploiting the space-variant property of retinal visual acuity, foveation has the potential to substantially reduce video resolution in the visual periphery, with hardly noticeable perceptual quality degradations. Accordingly, foveated image / video quality predictors are also becoming increasingly important, as a practical way to monitor and control future foveated compression algorithms. Towards advancing the development of foveated image / video quality assessment (FIQA / FVQA) algorithms, we have constructed 2D and (stereoscopic) 3D VR databases of foveated / compressed videos, and conducted a human study of perceptual quality on each database. Each database includes 10 reference videos and 180 foveated videos, which were processed by 3 levels of foveation on the reference videos. Foveation was applied by increasing compression with increased eccentricity. In the 2D study, each video was of resolution $7680\times 3840$ and was viewed and quality-rated by 36 subjects, while in the 3D study, each video was of resolution $5376\times 5376$ and rated by 34 subjects. Both studies were conducted on top of a foveated video player having low motion-to-photon latency (~50ms). We evaluated different objective image and video quality assessment algorithms, including both FIQA / FVQA algorithms and non-foveated algorithms, on our so called LIVE-Facebook Technologies Foveation-Compressed Virtual Reality (LIVE-FBT-FCVR) databases. We also present a statistical evaluation of the relative performances of these algorithms. The LIVE-FBT-FCVR databases have been made publicly available and can be accessed at https://live.ece.utexas.edu/research/LIVEFBTFCVR/index.html.

[1]  Xiongkuo Min,et al.  Perceptual Quality Assessment of Omnidirectional Images , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[2]  Alan C. Bovik,et al.  RRED indices: Reduced reference entropic differencing framework for image quality assessment , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Desney S. Tan,et al.  Foveated 3D graphics , 2012, ACM Trans. Graph..

[4]  A. Bovik,et al.  Study of 3D Virtual Reality Picture Quality , 2019, IEEE Journal of Selected Topics in Signal Processing.

[5]  Guangtao Zhai,et al.  Study of Subjective and Objective Quality Assessment of Audio-Visual Signals , 2020, IEEE Transactions on Image Processing.

[6]  Yong Man Ro,et al.  Deep Virtual Reality Image Quality Assessment With Human Perception Guider for Omnidirectional Image , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  John D. Villasenor,et al.  Visibility of wavelet quantization noise , 1997, IEEE Transactions on Image Processing.

[8]  Weisi Lin,et al.  CVIQD: Subjective quality evaluation of compressed virtual reality images , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[11]  Touradj Ebrahimi,et al.  Testbed for subjective evaluation of omnidirectional visual content , 2016, 2016 Picture Coding Symposium (PCS).

[12]  Minsu Choi,et al.  Eye tracking based foveated rendering for 360 VR tiled video , 2018, MMSys.

[13]  Snjezana Rimac-Drlje,et al.  Foveation-based content Adaptive Structural Similarity index , 2011, 2011 18th International Conference on Systems, Signals and Image Processing.

[14]  Sabine Süsstrunk,et al.  Measuring colorfulness in natural images , 2003, IS&T/SPIE Electronic Imaging.

[15]  Alan C. Bovik,et al.  Study of the effects of stalling events on the quality of experience of mobile streaming videos , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[16]  Zhou Wang,et al.  Embedded foveation image coding , 2001, IEEE Trans. Image Process..

[17]  Xuelong Li,et al.  Spatiotemporal Statistics for Video Quality Assessment , 2016, IEEE Transactions on Image Processing.

[18]  Gustavo de Veciana,et al.  Video Quality Assessment on Mobile Devices: Subjective, Behavioral and Objective Studies , 2012, IEEE Journal of Selected Topics in Signal Processing.

[19]  Alan Conrad Bovik,et al.  Study of Temporal Effects on Subjective Video Quality of Experience , 2017, IEEE Transactions on Image Processing.

[20]  Praful Gupta,et al.  SpEED-QA: Spatial Efficient Entropic Differencing for Image and Video Quality , 2017, IEEE Signal Processing Letters.

[21]  D. Ruderman The statistics of natural images , 1994 .

[22]  Zhou Wang,et al.  Foveation scalable video coding with automatic fixation selection , 2003, IEEE Trans. Image Process..

[23]  Xiongkuo Min,et al.  MC360IQA: The Multi-Channel CNN for Blind 360-Degree Image Quality Assessment , 2019, 2019 IEEE International Symposium on Circuits and Systems (ISCAS).

[24]  Alan C. Bovik,et al.  Visual pattern image sequence coding , 1993, IEEE Trans. Circuits Syst. Video Technol..

[25]  Alan C. Bovik,et al.  Subjective and Objective Quality Assessment of High Frame Rate Videos , 2020, IEEE Access.

[26]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Yize Jin,et al.  Study of 2D foveated video quality in virtual reality , 2020, Optical Engineering + Applications.

[28]  Alan C. Bovik,et al.  A Foveated Video Quality Assessment Model Using Space-Variant Natural Scene Statistics , 2021, 2021 IEEE International Conference on Image Processing (ICIP).

[29]  Fan Zhang,et al.  Image Quality Assessment by Separately Evaluating Detail Losses and Additive Impairments , 2011, IEEE Transactions on Multimedia.

[30]  Chen Li,et al.  A subjective visual quality assessment method of panoramic videos , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[31]  Alan C. Bovik,et al.  Image information and visual quality , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  Bernd Girod,et al.  A Framework to Evaluate Omnidirectional Video Coding Schemes , 2015, 2015 IEEE International Symposium on Mixed and Augmented Reality.

[33]  Alan C. Bovik,et al.  Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[34]  Eero P. Simoncelli,et al.  A model of neuronal responses in visual area MT , 1998, Vision Research.

[35]  K. R. Rao,et al.  The H.264 Video Coding Standard , 2014, IEEE Potentials.

[36]  Xiongkuo Min,et al.  Blind Quality Assessment Based on Pseudo-Reference Image , 2018, IEEE Transactions on Multimedia.

[37]  Wen-Huang Cheng,et al.  A practical foveation-based rate-shaping mechanism for MPEG videos , 2005, IEEE Trans. Circuits Syst. Video Technol..

[38]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[39]  Christophe Charrier,et al.  Blind Prediction of Natural Video Quality , 2014, IEEE Transactions on Image Processing.

[40]  Gregory J. Zelinsky,et al.  Design and evaluation of a foveated video streaming service for commodity client devices , 2016, MMSys.

[41]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[42]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[43]  Marios S. Pattichis,et al.  Foveated video quality assessment , 2002, IEEE Trans. Multim..

[44]  Frédéric Precioso,et al.  Foveated streaming of virtual reality videos , 2018, MMSys.

[45]  Yutao Liu,et al.  Blind Image Quality Estimation via Distortion Aggravation , 2018, IEEE Transactions on Broadcasting.

[46]  Wenhan Zhu,et al.  IVQAD 2017: An immersive video quality assessment database , 2017, 2017 International Conference on Systems, Signals and Image Processing (IWSSIP).

[47]  Margaret H. Pinson,et al.  The Consumer Digital Video Library , 2010 .

[48]  John A. Robinson,et al.  Adaptive foveation of MPEG video , 1997, MULTIMEDIA '96.

[49]  Damon M. Chandler,et al.  ViS3: an algorithm for video quality assessment via analysis of spatial and spatiotemporal slices , 2014, J. Electronic Imaging.

[50]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[51]  Zhou Wang,et al.  Spherical Structural Similarity Index for Objective Omnidirectional Video Quality Assessment , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[52]  David Zhang,et al.  FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[53]  Jari Korhonen,et al.  Two-Level Approach for No-Reference Consumer Video Quality Assessment , 2019, IEEE Transactions on Image Processing.

[54]  Zhou Wang,et al.  Foveated wavelet image quality index , 2001, Optics + Photonics.

[55]  Mikko Nuutinen,et al.  CVD2014—A Database for Evaluating No-Reference Video Quality Assessment Algorithms , 2016, IEEE Transactions on Image Processing.

[56]  Rajiv Soundararajan,et al.  Video Quality Assessment by Reduced Reference Spatio-Temporal Entropic Differencing , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[57]  Martin J. Wainwright,et al.  Scale Mixtures of Gaussians and the Statistics of Natural Images , 1999, NIPS.

[58]  Margaret H. Pinson,et al.  A new standardized method for objectively measuring video quality , 2004, IEEE Transactions on Broadcasting.

[59]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[60]  Alan C. Bovik,et al.  In-Capture Mobile Video Distortions: A Study of Subjective Behavior and Objective Algorithms , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[61]  Antti Oulasvirta,et al.  Cloud Gaming with Foveated Video Encoding , 2020, ACM Trans. Multim. Comput. Commun. Appl..

[62]  Zhenzhong Chen,et al.  Subjective Panoramic Video Quality Assessment Database for Coding Applications , 2018, IEEE Transactions on Broadcasting.

[63]  Rajiv Soundararajan,et al.  Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[64]  Marios S. Pattichis,et al.  Foveated video compression with optimal rate control , 2001, IEEE Trans. Image Process..

[65]  Joohwan Kim,et al.  Latency Requirements for Foveated Rendering in Virtual Reality , 2017, ACM Trans. Appl. Percept..

[66]  Dietmar Saupe,et al.  The Konstanz natural video database (KoNViD-1k) , 2017, 2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX).

[67]  Wilson S. Geisler,et al.  Real-time foveated multiresolution system for low-bandwidth video communication , 1998, Electronic Imaging.

[68]  ITU-T Rec. P.910 (04/2008) Subjective video quality assessment methods for multimedia applications , 2009 .

[69]  Wilson S. Geisler,et al.  Implementation of a foveated image coding system for image bandwidth reduction , 1996, Electronic Imaging.

[70]  Vladyslav Zakharchenko,et al.  Quality metric for spherical panoramic video , 2016, Optical Engineering + Applications.

[71]  Sugato Chakravarty,et al.  Methodology for the subjective assessment of the quality of television pictures , 1995 .

[72]  Phuoc Tran-Gia,et al.  Best Practices for QoE Crowdtesting: QoE Assessment With Crowdsourcing , 2014, IEEE Transactions on Multimedia.