Residual Frame for Noisy Video Classification According to Perceptual Quality in Convolutional Neural Networks

Perceptual quality of a video describes the quality consistent with human perception. The growing popularity of short video sharing on mobile platforms such as Tik Tok and WeSee makes the video assessment system based on perceptual quality a necessity. In practice, short videos captured by mobile devices often contain different types of distortions incurred by sensor noise or compression noise, which potentially makes the videos visually unpleasing to users and may degrade the performance of deep neural networks when applied to these noisy videos. Thus, it is necessary to identify noisy videos based on video perceptual quality. However, traditional video/image noise estimation methods are designed to estimate the variance of homogeneously distributed synthetic noise, not real noise. In this paper, we propose a simple yet effective method to recognize the noisy videos using their residual frames. Since the original video frame contains rich content information, which may result in under-or over-estimation of the noise, we construct residual frames to reduce the influence of the content information while maintaining the main noise information in the video. We also create a new data set with more than 30 thousand images captured from videos with real noise. Experimental results demonstrate the effectiveness of our proposed method.

[1]  Eric Dubois,et al.  Fast and reliable structure-oriented video noise estimation , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  John Immerkær,et al.  Fast Noise Variance Estimation , 1996, Comput. Vis. Image Underst..

[3]  Léon Bottou,et al.  Wasserstein Generative Adversarial Networks , 2017, ICML.

[4]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  I. Jolliffe Principal Component Analysis , 2002 .

[6]  Thomas Wiegand,et al.  Multi-frame motion compensated prediction for video transmission , 2001 .

[7]  Xinhao Liu,et al.  Single-Image Noise Level Estimation for Blind Denoising , 2013, IEEE Transactions on Image Processing.

[8]  Seungjoon Yang,et al.  Block-based noise estimation using adaptive Gaussian filtering , 2005, 2005 Digest of Technical Papers. International Conference on Consumer Electronics, 2005. ICCE..

[9]  Hironobu Fujiyoshi,et al.  Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[10]  Shih-Ming Yang,et al.  A fast method for image noise estimation using Laplacian operator and adaptive edge detection , 2008, 2008 3rd International Symposium on Communications, Control and Signal Processing.

[11]  Basura Fernando,et al.  Learning End-to-end Video Classification with Rank-Pooling , 2016, ICML.

[12]  W. B. Collis,et al.  Training Methods for Image Noise Level Estimation on Wavelet Components , 2004, EURASIP J. Adv. Signal Process..

[13]  Daniel S. Weller,et al.  Comparison-Based Image Quality Assessment for Selecting Image Restoration Parameters , 2016, IEEE Transactions on Image Processing.

[14]  Lei Zheng,et al.  Image Noise Level Estimation by Principal Component Analysis , 2013, IEEE Transactions on Image Processing.

[15]  Guangyong Chen,et al.  An Efficient Statistical Method for Image Noise Level Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[16]  Ram M. Narayanan,et al.  Noise estimation in remote sensing imagery using data masking , 2003 .

[17]  Tianqiang Huang,et al.  Using similarity analysis to detect frame duplication forgery in videos , 2014, Multimedia Tools and Applications.

[18]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[19]  Soosan Beheshti,et al.  Adaptive Noise Variance Estimation in BayesShrink , 2010, IEEE Signal Processing Letters.

[20]  Weisi Lin,et al.  Image Quality Assessment Based Label Smoothing in Deep Neural Network Learning , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  Xiaoou Tang,et al.  Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[23]  Jiantao Zhou,et al.  Color Image Noise Covariance Estimation with Cross-Channel Image Noise Modeling , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[24]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Li Yang,et al.  An improved motion detection method for real-time surveillance , 2008 .