COME for No-Reference Video Quality Assessment

Nowadays, the issue of objective Video Quality Assessment (VQA) has been extensively studied. In this paper, we present an effective general-purpose VQA method named COnvolutional neural network and Multi-regression based Evaluation (COME). It requires no referred lossless video and is universal for non-specific types of distortion. A modified 2D convolutional neural network is introduced to learn the spatial features at frame level. At the same time, the motion information is extracted as temporal features at sequence level. And a multi-regression model is proposed to comprehensively assess the final video quality according to human’s psychological perception. The proposed method is tested on two commonly used databases with numerous kinds of distortions. The experimental results show that the proposed COME method is comparable with most popular full-reference VQA methods.

[1]  Chih-Jen Lin,et al.  Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..

[2]  Eric C. Larson,et al.  Most apparent distortion: full-reference image quality assessment and the role of strategy , 2010, J. Electronic Imaging.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Gustavo de Veciana,et al.  An information fidelity criterion for image quality assessment using natural scene statistics , 2005, IEEE Transactions on Image Processing.

[5]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[6]  Christophe Charrier,et al.  Blind Prediction of Natural Video Quality , 2014, IEEE Transactions on Image Processing.

[7]  Michael Wangenheim,et al.  Subjective assessment of hand-arm vibrations , 1986 .

[8]  Rajiv Soundararajan,et al.  Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[9]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[10]  Yaowu Chen,et al.  No-reference video quality assessment based on region of interest , 2012, 2012 2nd International Conference on Consumer Electronics, Communications and Networks (CECNet).

[11]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Damon M. Chandler,et al.  A spatiotemporal most-apparent-distortion model for video quality assessment , 2011, 2011 18th IEEE International Conference on Image Processing.

[13]  Alan C. Bovik,et al.  A subjective study to evaluate video quality assessment algorithms , 2010, Electronic Imaging.

[14]  Yi Li,et al.  Convolutional Neural Networks for No-Reference Image Quality Assessment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Bing Zeng,et al.  A new three-step search algorithm for block motion estimation , 1994, IEEE Trans. Circuits Syst. Video Technol..

[17]  Alan C. Bovik,et al.  Assessment of video naturalness using time-frequency statistics , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[18]  David S. Doermann,et al.  No-reference video quality assessment via feature learning , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[19]  Damon M. Chandler,et al.  ViS3: an algorithm for video quality assessment via analysis of spatial and spatiotemporal slices , 2014, J. Electronic Imaging.

[20]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  S. Tubaro,et al.  Subjective assessment of H.264/AVC video sequences transmitted over a noisy channel , 2009, 2009 International Workshop on Quality of Multimedia Experience.

[22]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[23]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[24]  Rajiv Soundararajan,et al.  Video Quality Assessment by Reduced Reference Spatio-Temporal Entropic Differencing , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Vijayan K. Asari,et al.  A no-reference video quality assessment based on Laplacian pyramids , 2013, 2013 IEEE International Conference on Image Processing.