论文信息 - Predicting the Quality of Compressed Videos With Pre-Existing Distortions

Predicting the Quality of Compressed Videos With Pre-Existing Distortions

Over the past decade, the online video industry has greatly expanded the volume of visual data that is streamed and shared over the Internet. Moreover, because of the increasing ease of video capture, many millions of consumers create and upload large volumes of User-Generated-Content (UGC) videos. Unlike streaming television or cinematic content produced by professional videographers and cinemagraphers, UGC videos are most commonly captured by naive users having limited skills and imperfect technique, and often are afflicted by highly diverse and mixed in-capture distortions. These UGC videos are then often uploaded for sharing onto cloud servers, where they further compressed for storage and transmission. Our paper tackles the highly practical problem of predicting the quality of compressed videos (perhaps during the process of compression, to help guide it), with only (possibly severely) distorted UGC videos as references. To address this problem, we have developed a novel Video Quality Assessment (VQA) framework that we call 1stepVQA (to distinguish it from two-step methods that we discuss). 1stepVQA overcomes limitations of Full-Reference, Reduced-Reference and No-Reference VQA models by exploiting the statistical regularities of both natural videos and distorted videos. We show that 1stepVQA is able to more accurately predict the quality of compressed videos, given imperfect reference videos. We also describe a new dedicated video database which includes (typically distorted) UGC reference videos, and a large number of compressed versions of them. We show that the 1stepVQA model outperforms other VQA models in this scenario. We are providing the dedicated new database free of charge at this https URL

[1] Praful Gupta,et al. SpEED-QA: Spatial Efficient Entropic Differencing for Image and Video Quality , 2017, IEEE Signal Processing Letters.

[2] Xianming Liu,et al. Blind quality assessment of compressed images via pseudo structural similarity , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[3] Lei Zhang,et al. Deep Convolutional Neural Models for Picture-Quality Prediction: Challenges and Solutions to Data-Driven Image Quality Assessment , 2017, IEEE Signal Processing Magazine.

[4] Rajiv Soundararajan,et al. Video Quality Assessment by Reduced Reference Spatio-Temporal Entropic Differencing , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[5] Margaret H. Pinson,et al. A new standardized method for objectively measuring video quality , 2004, IEEE Transactions on Broadcasting.

[6] Praful Gupta,et al. From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Gustavo de Veciana,et al. Video Quality Assessment on Mobile Devices: Subjective, Behavioral and Objective Studies , 2012, IEEE Journal of Selected Topics in Signal Processing.

[8] Lei Zhang,et al. A Feature-Enriched Completely Blind Image Quality Evaluator , 2015, IEEE Transactions on Image Processing.

[9] Xin Jin,et al. VideoSet: A large-scale compressed video quality dataset based on JND measurement , 2017, J. Vis. Commun. Image Represent..

[10] Lei Zhang,et al. Blind Image Quality Assessment with a Probabilistic Quality Representation , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[11] Rajiv Soundararajan,et al. Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[12] Sheila S. Hemami,et al. VSNR: A Wavelet-Based Visual Signal-to-Noise Ratio for Natural Images , 2007, IEEE Transactions on Image Processing.

[13] Alan Conrad Bovik,et al. Large-Scale Study of Perceptual Video Quality , 2018, IEEE Transactions on Image Processing.

[14] Ke Gu,et al. Learning a No-Reference Quality Assessment Model of Enhanced Images With Big Data , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[15] Zhou Wang,et al. Information Content Weighting for Perceptual Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[16] C.-C. Jay Kuo,et al. MCL-V: A streaming video quality assessment database , 2015, J. Vis. Commun. Image Represent..

[17] Dietmar Saupe,et al. The Konstanz natural video database (KoNViD-1k) , 2017, 2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX).

[18] Xuelong Li,et al. Blind Image Quality Assessment via Deep Learning , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[19] Alan C. Bovik,et al. Image information and visual quality , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[20] Alan C. Bovik,et al. Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[21] Margaret H. Pinson,et al. The Consumer Digital Video Library [Best of the Web] , 2013, IEEE Signal Processing Magazine.

[22] Yutao Liu,et al. Blind Image Quality Estimation via Distortion Aggravation , 2018, IEEE Transactions on Broadcasting.

[23] Alan C. Bovik,et al. Spatio-Temporal Measures Of Naturalness , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[24] Margaret H. Pinson,et al. The Consumer Digital Video Library , 2010 .

[25] Jonathan Westley Peirce,et al. Neuroinformatics Original Research Article Generating Stimuli for Neuroscience Using Psychopy , 2022 .

[26] Cisco Visual Networking Index: Forecast and Methodology 2016-2021.(2017) http://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual- networking-index-vni/complete-white-paper-c11-481360.html. High Efficiency Video Coding (HEVC) Algorithms and Architectures https://jvet.hhi.fraunhofer. , 2017 .

[27] Ping Wang,et al. MCL-JCV: A JND-based H.264/AVC video quality assessment dataset , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[28] Alberto Leon-Garcia,et al. Estimation of shape parameter for generalized Gaussian distributions in subband decompositions of video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[29] Sugato Chakravarty,et al. Methodology for the subjective assessment of the quality of television pictures , 1995 .

[30] Lei Zhang,et al. Blind Image Quality Assessment Using Joint Statistics of Gradient Magnitude and Laplacian Features , 2014, IEEE Transactions on Image Processing.

[31] Alan C. Bovik,et al. Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[32] David Zhang,et al. FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[33] Jari Korhonen,et al. Two-Level Approach for No-Reference Consumer Video Quality Assessment , 2019, IEEE Transactions on Image Processing.

[34] F. Wilcoxon. Individual Comparisons by Ranking Methods , 1945 .

[35] Frank Tong,et al. Foundations of Vision , 2018 .

[36] Soo-Chang Pei,et al. Image Quality Assessment Using Human Visual DOG Model Fused With Random Forest , 2015, IEEE Transactions on Image Processing.

[37] Zhou Wang,et al. Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[38] Yue Wang,et al. UGC-VIDEO: Perceptual Quality Assessment of User-Generated Videos , 2019, 2020 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[39] Christophe Charrier,et al. Blind Prediction of Natural Video Quality , 2014, IEEE Transactions on Image Processing.

[40] Lei Zhang,et al. Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index , 2013, IEEE Transactions on Image Processing.

[41] Alan C. Bovik,et al. No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[42] Eero P. Simoncelli,et al. On Advances in Statistical Modeling of Natural Images , 2004, Journal of Mathematical Imaging and Vision.

[43] Olivier Verscheure,et al. Perceptual quality measure using a spatiotemporal model of the human visual system , 1996, Electronic Imaging.

[44] André Kaup,et al. Temporal Trajectory Aware Video Quality Measure , 2009, IEEE Journal of Selected Topics in Signal Processing.

[45] Balu Adsumilli,et al. YouTube UGC Dataset for Video Compression Research , 2019, 2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP).

[46] Alan C. Bovik,et al. Predicting the Quality of Images Compressed After Distortion in Two Steps , 2018, IEEE Transactions on Image Processing.

[47] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[48] Peyman Milanfar,et al. NIMA: Neural Image Assessment , 2017, IEEE Transactions on Image Processing.