Paper Information - No Reference Quality Assessment of Stereo Video Based on Saliency and Sparsity

As video technology has become widespread, stereoscopic video quality assessment (SVQA) has become increasingly important. Existing SVQA methods do not achieve good performance because they do not fully exploit the information in the videos. In this paper, we consider the various kinds of information in the videos jointly and construct a simple model, based on saliency and sparsity, to combine and analyze the diverse features. First, we use the 3-D saliency map of the sum map, which retains the basic information of the stereoscopic video, as a tool for evaluating video quality. Second, we use sparse representation to decompose the sum map of 3-D saliency into coefficients, and then compute features from the sparse coefficients to obtain an effective representation of the video content. Next, to reduce the correlation between the features, we feed them into a stacked auto-encoder, which maps the vectors to a higher-dimensional space under a sparsity constraint, and then pass the result to a support vector machine to obtain the quality assessment scores. Throughout this process, we take advantage of saliency and sparsity to extract and simplify features. Experiments show that the proposed method fits the subjective scores well.
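The first two stages described in the abstract (a saliency map of the sum map, then sparse decomposition and coefficient-based features) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: a single-frame spectral-residual saliency stands in for the 3-D saliency model, a random unit-norm dictionary stands in for a learned one, orthogonal matching pursuit is used as the sparse coder, and the four summary statistics in the hypothetical `frame_features` helper are placeholders for the paper's actual features. The stacked auto-encoder and support vector machine stages are omitted.

```python
import numpy as np

def sum_map(left, right):
    """Pixel-wise sum of the left and right views."""
    return left.astype(np.float64) + right.astype(np.float64)

def spectral_residual_saliency(img, eps=1e-8):
    """Single-frame saliency from the spectral residual (stand-in for 3-D saliency)."""
    spectrum = np.fft.fft2(img)
    log_amp = np.log(np.abs(spectrum) + eps)
    phase = np.angle(spectrum)
    # 3x3 box average of the log-amplitude spectrum, with wrap-around borders
    avg = sum(np.roll(np.roll(log_amp, dy, 0), dx, 1)
              for dy in (-1, 0, 1) for dx in (-1, 0, 1)) / 9.0
    residual = log_amp - avg
    sal = np.abs(np.fft.ifft2(np.exp(residual + 1j * phase))) ** 2
    return sal / (sal.max() + eps)

def patches(img, size):
    """Non-overlapping size x size patches, one flattened patch per row."""
    h, w = img.shape
    return np.array([img[i:i + size, j:j + size].ravel()
                     for i in range(0, h - size + 1, size)
                     for j in range(0, w - size + 1, size)])

def omp(D, x, k):
    """Orthogonal matching pursuit: code x with at most k unit-norm atoms of D."""
    residual = x.copy()
    support = []
    coef = np.zeros(D.shape[1])
    for _ in range(k):
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break
        support.append(idx)
        # Refit all selected atoms jointly, then update the residual
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    if support:
        coef[support] = sol
    return coef

def frame_features(left, right, D, k=3):
    """Hypothetical feature vector: statistics of the sparse codes of the saliency map."""
    sal = spectral_residual_saliency(sum_map(left, right))
    size = int(np.sqrt(D.shape[0]))
    codes = np.array([omp(D, p, k) for p in patches(sal, size)])
    mag = np.abs(codes)
    return np.array([mag.mean(), mag.std(), mag.max(), (mag > 0).mean()])

# Synthetic demo data: random "views" and a random dictionary (assumptions)
rng = np.random.default_rng(0)
left = rng.random((32, 32))
right = rng.random((32, 32))
D = rng.standard_normal((64, 128))
D /= np.linalg.norm(D, axis=0)  # unit-norm atoms, 8x8 patches
features = frame_features(left, right, D)
```

In a fuller pipeline, per-frame vectors like `features` would be pooled over the video and passed on to the auto-encoder and regressor; how the paper pools them is not stated in the abstract.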
