论文信息 - Regression or classification? New methods to evaluate no-reference picture and video quality models

Regression or classification? New methods to evaluate no-reference picture and video quality models

Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus. However, recent efforts have shown that accurate quality score regression on real-world user-generated content (UGC) is a very challenging task. To make the problem more tractable, we propose two new methods - binary, and ordinal classification - as alternatives to evaluate and compare no-reference quality models at coarser levels. Moreover, the proposed new tasks convey more practical meaning on perceptually optimized UGC transcoding, or for preprocessing on media processing platforms. We conduct a comprehensive benchmark experiment of popular no-reference quality models on recent in-the-wild picture and video quality datasets, providing reliable baselines for both evaluation methods to support further studies. We hope this work promotes coarse-grained perceptual modeling and its applications to efficient UGC processing.

[1] Philip J. Corriveau,et al. Study of Rating Scales for Subjective Quality Assessment of High-Definition Video , 2011, IEEE Transactions on Broadcasting.

[2] Alan C. Bovik,et al. Temporal hysteresis model of time varying subjective video quality , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3] Alan C. Bovik,et al. Predicting the Quality of Compressed Videos With Pre-Existing Distortions , 2020, IEEE Transactions on Image Processing.

[4] Ming Jiang,et al. Quality Assessment of In-the-Wild Videos , 2019, ACM Multimedia.

[5] Yi Li,et al. Convolutional Neural Networks for No-Reference Image Quality Assessment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Alan C. Bovik,et al. ProxIQA: A Proxy Approach to Perceptual Optimization of Learned Image Compression , 2021, IEEE Transactions on Image Processing.

[7] David S. Doermann,et al. Unsupervised feature learning framework for no-reference image quality assessment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Alan C. Bovik,et al. A Statistical Evaluation of Recent Full Reference Image Quality Assessment Algorithms , 2006, IEEE Transactions on Image Processing.

[9] Sebastian Bosse,et al. Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment , 2016, IEEE Transactions on Image Processing.

[10] Yong Liu,et al. Blind Image Quality Assessment Based on High Order Statistics Aggregation , 2016, IEEE Transactions on Image Processing.

[11] Lei Zhang,et al. Blind Image Quality Assessment Using Joint Statistics of Gradient Magnitude and Laplacian Features , 2014, IEEE Transactions on Image Processing.

[12] Alan C. Bovik,et al. Predicting the Quality of Images Compressed After Distortion in Two Steps , 2018, IEEE Transactions on Image Processing.

[13] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[14] Alan C. Bovik,et al. Perceptual quality prediction on authentically distorted images using a bag of features approach , 2016, Journal of vision.

[15] Soo-Chang Pei,et al. Image Quality Assessment Using Human Visual DOG Model Fused With Random Forest , 2015, IEEE Transactions on Image Processing.

[16] Alan C. Bovik,et al. No-Reference Quality Assessment of Tone-Mapped HDR Pictures , 2017, IEEE Transactions on Image Processing.

[17] Christophe Charrier,et al. Blind Prediction of Natural Video Quality , 2014, IEEE Transactions on Image Processing.

[18] Kede Ma,et al. Perceptual Quality Assessment of Smartphone Photography , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Alan C. Bovik,et al. No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[20] Alan C. Bovik,et al. Massive Online Crowdsourced Study of Subjective and Objective Picture Quality , 2015, IEEE Transactions on Image Processing.

[21] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[22] Pedro Antonio Gutiérrez,et al. Ordinal Regression Methods: Survey and Experimental Study , 2016, IEEE Transactions on Knowledge and Data Engineering.

[23] Balu Adsumilli,et al. YouTube UGC Dataset for Video Compression Research , 2019, 2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP).

[24] Lei Zhang,et al. A Unified Probabilistic Formulation of Image Aesthetic Assessment , 2020, IEEE Transactions on Image Processing.

[25] Alan C. Bovik,et al. A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[26] Alan Conrad Bovik,et al. Large-Scale Study of Perceptual Video Quality , 2018, IEEE Transactions on Image Processing.

[27] Dietmar Saupe,et al. The Konstanz natural video database (KoNViD-1k) , 2017, 2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX).

[28] D. Saupe,et al. KonIQ-10k: An Ecologically Valid Database for Deep Learning of Blind Image Quality Assessment , 2019, IEEE Transactions on Image Processing.

[29] Alexei A. Efros,et al. Unbiased look at dataset bias , 2011, CVPR 2011.

[30] Alan C. Bovik,et al. Perceptual Video Quality Prediction Emphasizing Chroma Distortions , 2020, IEEE Transactions on Image Processing.

[31] Neil Birkbeck,et al. Video transcoding optimization based on input perceptual quality , 2020, Optical Engineering + Applications.

[32] Joshua Peter Ebenezer,et al. No-Reference Video Quality Assessment Using Space-Time Chips , 2020, 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP).

[33] C.-C. Jay Kuo,et al. Experimental design and analysis of JND test on coded image/video , 2015, SPIE Optical Engineering + Applications.

[34] Jari Korhonen,et al. Two-Level Approach for No-Reference Consumer Video Quality Assessment , 2019, IEEE Transactions on Image Processing.

[35] Xiaoou Tang,et al. Image Aesthetic Assessment: An experimental survey , 2016, IEEE Signal Processing Magazine.

[36] Alan C. Bovik,et al. BBAND INDEX: A NO-REFERENCE BANDING ARTIFACT PREDICTOR , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[37] Praful Gupta,et al. From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Alan C. Bovik,et al. UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content , 2020, IEEE Transactions on Image Processing.

[39] Naila Murray,et al. AVA: A large-scale database for aesthetic visual analysis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.