Screen Content Video Quality Assessment: Subjective and Objective Study

In this article, we make the first attempt to study the subjective and objective quality assessment for the screen content videos (SCVs). For that, we construct the first large-scale video quality assessment (VQA) database specifically for the SCVs, called the screen content video database (SCVD). This SCVD provides 16 reference SCVs, 800 distorted SCVs, and their corresponding subjective scores, and it is made publicly available for research usage. The distorted SCVs are generated from each reference SCV with 10 distortion types and 5 degradation levels for each distortion type. Each distorted SCV is rated by at least 32 subjects in the subjective test. Furthermore, we propose the first full-reference VQA model for the SCVs, called the spatiotemporal Gabor feature tensor-based model (SGFTM), to objectively evaluate the perceptual quality of the distorted SCVs. This is motivated by the observation that 3D-Gabor filter can well stimulate the visual functions of the human visual system (HVS) on perceiving videos, being more sensitive to the edge and motion information that are often-encountered in the SCVs. Specifically, the proposed SGFTM exploits 3D-Gabor filter to individually extract the spatiotemporal Gabor feature tensors from the reference and distorted SCVs, followed by measuring their similarities and later combining them together through the developed spatiotemporal feature tensor pooling strategy to obtain the final SGFTM score. Experimental results on SCVD have shown that the proposed SGFTM yields a high consistency on the subjective perception of SCV quality and consistently outperforms multiple classical and state-of-the-art image/video quality assessment models.

[1]  Rajiv Soundararajan,et al.  Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[2]  J. Hegdé,et al.  Temporal dynamics of shape analysis in macaque visual area V2. , 2004, Journal of neurophysiology.

[3]  Ke Gu,et al.  Highly Efficient Picture-Based Prediction of PM2.5 Concentration , 2019, IEEE Transactions on Industrial Electronics.

[4]  J. Hegdé,et al.  Selectivity for Complex Shapes in Primate Visual Area V2 , 2000, The Journal of Neuroscience.

[5]  Wen Gao,et al.  Subjective and Objective Quality Assessment of Compressed Screen Content Images , 2016, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[6]  Jun Luo,et al.  No-Reference Quality Assessment for Screen Content Images Based on Hybrid Region Features Fusion , 2019, IEEE Transactions on Multimedia.

[7]  Sugato Chakravarty,et al.  Methodology for the subjective assessment of the quality of television pictures , 1995 .

[8]  Guangming Shi,et al.  Quality Assessment for Video With Degradation Along Salient Trajectories , 2019, IEEE Transactions on Multimedia.

[9]  Damon M. Chandler,et al.  A spatiotemporal most-apparent-distortion model for video quality assessment , 2011, 2011 18th IEEE International Conference on Image Processing.

[10]  Kai-Kuang Ma,et al.  Screen Content Image Quality Assessment Using Multi-Scale Difference of Gaussian , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Tao Lin,et al.  Overview of Screen Content Video Coding: Technologies, Standards, and Beyond , 2016, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[12]  Zhou Wang,et al.  Unified Blind Quality Assessment of Compressed Natural, Graphic, and Screen Content Images , 2017, IEEE Transactions on Image Processing.

[13]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[14]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[15]  Daniel Thalmann,et al.  Model-Based Referenceless Quality Metric of 3D Synthesized Images Using Local Image Description , 2018, IEEE Transactions on Image Processing.

[16]  Kai Zeng,et al.  Objective Quality Assessment and Perceptual Compression of Screen Content Images , 2018, IEEE Computer Graphics and Applications.

[17]  Ke Gu,et al.  Learning a No-Reference Quality Assessment Model of Enhanced Images With Big Data , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[18]  Guangming Shi,et al.  Visual Orientation Selectivity Based Structure Description , 2015, IEEE Transactions on Image Processing.

[19]  Lei Cao,et al.  Study of subjective and objective quality assessment for screen content images , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[20]  Guangtao Zhai,et al.  Perceptual image quality assessment: a survey , 2020, Science China Information Sciences.

[21]  Kai-Kuang Ma,et al.  A Gabor Feature-Based Quality Assessment Model for the Screen Content Images , 2018, IEEE Transactions on Image Processing.

[22]  Glenn Healey,et al.  Hyperspectral Region Classification Using a Three-Dimensional Gabor Filterbank , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[23]  E. Gilbert Capacity of a burst-noise channel , 1960 .

[24]  Gustavo de Veciana,et al.  An information fidelity criterion for image quality assessment using natural scene statistics , 2005, IEEE Transactions on Image Processing.

[25]  Jizheng Xu,et al.  Overview of the Emerging HEVC Screen Content Coding Extension , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[27]  Jean-Bernard Martens,et al.  Quality asessment of coded images using numerical category scaling , 1995, Other Conferences.

[28]  Yu Tian,et al.  Light Field Image Quality Assessment via the Light Field Coherence , 2020, IEEE Transactions on Image Processing.

[29]  Lei Zhang,et al.  Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index , 2013, IEEE Transactions on Image Processing.

[30]  Yutao Liu,et al.  Blind Image Quality Estimation via Distortion Aggravation , 2018, IEEE Transactions on Broadcasting.

[31]  Fan Zhang,et al.  BVI-HD: A Video Quality Database for HEVC Compressed and Texture Synthesized Content , 2018, IEEE Transactions on Multimedia.

[32]  Lu Xing,et al.  A multi-scale contrast-based image quality assessment model for multi-exposure image fusion , 2018, Signal Process..

[33]  Xiongkuo Min,et al.  Blind Quality Assessment Based on Pseudo-Reference Image , 2018, IEEE Transactions on Multimedia.

[34]  C. Baker,et al.  Spatio-temporal frequency separability in area 18 neurons of the cat , 1993, Vision Research.

[35]  Yui-Lam Chan,et al.  Online-Learning-Based Bayesian Decision Rule for Fast Intra Mode and CU Partitioning Algorithm in HEVC Screen Content Coding , 2020, IEEE Transactions on Image Processing.

[36]  Damon M. Chandler,et al.  ViS3: an algorithm for video quality assessment via analysis of spatial and spatiotemporal slices , 2014, J. Electronic Imaging.

[37]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[38]  Weisi Lin,et al.  Saliency-Guided Quality Assessment of Screen Content Images , 2016, IEEE Transactions on Multimedia.

[39]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[40]  Jianqing Zhu,et al.  A Light Field Image Quality Assessment Model Based on Symmetry and Depth Features , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[41]  Weisi Lin,et al.  Perceptual Quality Assessment of Screen Content Images , 2015, IEEE Transactions on Image Processing.

[42]  Kai-Kuang Ma,et al.  Gradient Direction for Screen Content Image Quality Assessment , 2016, IEEE Signal Processing Letters.

[43]  Xiongkuo Min,et al.  Quality Evaluation of Image Dehazing Methods Using Synthetic Hazy Images , 2019, IEEE Transactions on Multimedia.

[44]  Colin W. G. Clifford,et al.  Orientation anisotropies in human primary visual cortex depend on contrast , 2015, NeuroImage.

[45]  Mu Feng,et al.  Motion Estimation in the 3-D Gabor Domain , 2007, IEEE Transactions on Image Processing.

[46]  Praful Gupta,et al.  SpEED-QA: Spatial Efficient Entropic Differencing for Image and Video Quality , 2017, IEEE Signal Processing Letters.

[47]  Martin Reisslein,et al.  Objective Video Quality Assessment Methods: A Classification, Review, and Performance Comparison , 2011, IEEE Transactions on Broadcasting.

[48]  Rong Xie,et al.  SJTU 4K video subjective quality dataset for content adaptive bit rate estimation without encoding , 2016, 2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[49]  Ke Gu,et al.  No-Reference Quality Assessment of Screen Content Pictures , 2017, IEEE Transactions on Image Processing.

[50]  Xiongkuo Min,et al.  Saliency-induced reduced-reference quality index for natural scene and screen content images , 2018, Signal Process..

[51]  Xiongkuo Min,et al.  A Metric for Light Field Reconstruction, Compression, and Display Quality Evaluation , 2020, IEEE Transactions on Image Processing.

[52]  Weisi Lin,et al.  A Fast Reliable Image Quality Predictor by Fusing Micro- and Macro-Structures , 2017, IEEE Transactions on Industrial Electronics.

[53]  Yong Liu,et al.  Blind Image Quality Assessment Based on High Order Statistics Aggregation , 2016, IEEE Transactions on Image Processing.

[54]  N. Ranganathan,et al.  Gabor filter-based edge detection , 1992, Pattern Recognit..

[55]  Jiaying Liu,et al.  Objective Quality Assessment of Screen Content Images by Uncertainty Weighting , 2017, IEEE Transactions on Image Processing.

[56]  Jose Joskowicz,et al.  Advances in video quality estimation models: An Overview , 2013, IEEE Latin America Transactions.

[57]  Jing Chen,et al.  Perceptual feature guided rate distortion optimization for high efficiency video coding , 2017, Multidimens. Syst. Signal Process..

[58]  Jong-Seok Lee,et al.  Subjective and Objective Quality Assessment of Compressed 4K UHD Videos for Immersive Experience , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[59]  Daniel Thalmann,et al.  Evaluating Quality of Screen Content Images Via Structural Variation Analysis , 2018, IEEE Transactions on Visualization and Computer Graphics.

[60]  D. Chandler Seven Challenges in Image Quality Assessment: Past, Present, and Future Research , 2013 .

[61]  Todd R. Reed,et al.  On the Computation of Optical Flow using the 3-D Gabor Transform , 1998, Multidimens. Syst. Signal Process..

[62]  Kai-Kuang Ma,et al.  ESIM: Edge Similarity for Screen Content Image Quality Assessment , 2017, IEEE Transactions on Image Processing.

[63]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[64]  David Casasent,et al.  Neural net design of macro Gabor wavelet filters for distortion-invariant object detection in clutter , 1994 .