Visual quality assessment for web videos

The advent of video-sharing sites such as YouTube has led to an unprecedented Internet delivery of community-contributed video content. However, most of these videos are not quality-controlled. This paper reports a first attempt towards assessing web videos in terms of visual quality with significant tests on 30k web videos. We regard the quality assessment as a two-class classification problem: features motivated from domain knowledge are extracted to be the visual representation while the overall quality is the two-class label. Observing that web videos are characterized by a much higher diversity of content, genres, capture devices, and skills than any other traditional video program, we propose to combine two types of domain knowledge to predict the perceived quality score. One of the domain knowledge types is the spatiotemporal factors affecting the overall perceived quality of web videos, including four spatial factors and two temporal factors. We study the effectiveness of various spatiotemporal factors and propose some novel spatial factors pertaining the characteristics of web videos. The other is the video editing style, including shot editing style, frame size, and black side ratio. Comprehensive experiments and evaluations over 30k web videos which add up to 1200h in total demonstrated the effectiveness of the proposed approach. We show some preliminary results for application to filtering and re-ranking of retrieved web videos.

[1]  Zhou Wang,et al.  No-reference perceptual quality assessment of JPEG compressed images , 2002, Proceedings. International Conference on Image Processing.

[2]  James Hu,et al.  DVQ: A digital video quality metric based on human vision , 2001 .

[3]  Tao Mei,et al.  Home Video Visual Quality Assessment With Spatiotemporal Factors , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Olivier Verscheure,et al.  Perceptual quality measure using a spatiotemporal model of the human visual system , 1996, Electronic Imaging.

[5]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[6]  Toru Yamada,et al.  No-reference video quality estimation based on error-concealment effectiveness , 2007, Packet Video 2007.

[7]  Si Wu,et al.  Video quality classification based home video segmentation , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[8]  Bing-Yu Chen,et al.  Blurred Image Detection and Classification , 2008, MMM.

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[10]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[11]  Michael G. Strintzis,et al.  Blocking artifact detection and reduction in compressed data , 2002, IEEE Trans. Circuits Syst. Video Technol..

[12]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[13]  Xian-Sheng Hua,et al.  Multi-modality web video categorization , 2007, MIR '07.

[14]  Wei-Ying Ma,et al.  Learning No-Reference Quality Metric by Examples , 2005, 11th International Multimedia Modelling Conference.

[15]  Hong Ren Wu,et al.  A novel objective no-reference metric for digital video quality assessment , 2005, IEEE Signal Processing Letters.

[16]  Deepak S. Turaga,et al.  No reference PSNR estimation for compressed pictures , 2002, Proceedings. International Conference on Image Processing.

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  Xiaoou Tang,et al.  Photo and Video Quality Evaluation: Focusing on the Subject , 2008, ECCV.

[19]  Alan C. Bovik,et al.  Video Quality Assessment , 2009 .

[20]  Borko Furht,et al.  Handbook of Video Databases: Design and Applications , 2003 .

[21]  Shih-Fu Chang,et al.  Video search reranking via information bottleneck principle , 2006, MM '06.