Learning Local Quality-Aware Structures of Salient Regions for Stereoscopic Images via Deep Neural Networks

The perceptual quality of stereoscopic images plays an essential role in the human perception of visual information. However, most available stereoscopic image quality assessment (SIQA) methods evaluate the 3D visual experience using hand-crafted features or shallow architectures, which cannot model the visual properties of stereo images well. In this paper, we use convolutional neural networks (CNNs) to learn deeper local quality-aware structures for stereo images. Two CNN models, which differ in their inputs, are designed for no-reference (NR) SIQA tasks. The one-column CNN model takes the cyclopean view directly as its input, and the three-column CNN model jointly takes the cyclopean, left and right views as its inputs. The two SIQA frameworks share the same implementation approach. First, to overcome the limited size of SIQA datasets, we use image patches cropped from the corresponding stereopairs as inputs for local quality-sensitive feature extraction. Next, a local feature selection algorithm removes the features associated with non-salient patches, which could cause large prediction errors. Finally, the retained local visual structures of salient regions are aggregated into a final quality score in an end-to-end manner. Experimental results on three public SIQA databases demonstrate that our method outperforms most state-of-the-art NR SIQA methods. The results of a cross-database experiment also show the robustness and generality of the proposed method.
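The patch-based steps described above can be illustrated with a minimal NumPy sketch. It is not the implementation from the paper: the 32-pixel patch size, the top-fraction selection rule (`keep_ratio`), the function names, and the mean pooling of per-patch scores are all assumptions made for illustration. The abstract only states that patches are cropped, that features of non-salient patches are removed, and that the remaining features are aggregated end to end by the network. The saliency map is assumed to be given (for example, from any saliency model).

```python
import numpy as np


def crop_patches(img, size=32):
    """Split a 2-D image into non-overlapping size x size patches (row-major order).

    The patch size is an assumed value, not one taken from the paper.
    """
    h, w = img.shape
    rows, cols = h // size, w // size
    trimmed = img[:rows * size, :cols * size]  # drop the incomplete border
    return (trimmed.reshape(rows, size, cols, size)
                   .swapaxes(1, 2)
                   .reshape(rows * cols, size, size))


def select_salient(patches, saliency_patches, keep_ratio=0.5):
    """Keep the patches whose mean saliency lies in the top keep_ratio fraction.

    This ranking rule is a hypothetical stand-in for the paper's
    local feature selection algorithm.
    """
    mean_sal = saliency_patches.mean(axis=(1, 2))
    n_keep = max(1, int(round(keep_ratio * len(patches))))
    order = np.argsort(-mean_sal, kind="stable")  # most salient first
    keep_idx = np.sort(order[:n_keep])            # restore spatial order
    return patches[keep_idx], keep_idx


def pooled_score(patch_scores):
    """Average per-patch scores; a simple placeholder for learned aggregation."""
    return float(np.mean(patch_scores))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cyclopean = rng.random((64, 96))   # stand-in for a cyclopean view
    saliency = rng.random((64, 96))    # stand-in for a saliency map
    patches = crop_patches(cyclopean)
    sal_patches = crop_patches(saliency)
    kept, idx = select_salient(patches, sal_patches, keep_ratio=0.5)
    # A trained CNN would score each kept patch; the patch mean is a dummy score.
    dummy_scores = kept.mean(axis=(1, 2))
    print(len(patches), len(kept), pooled_score(dummy_scores))
```

In the three-column setting, the same patch indices would presumably be applied to the left and right views so that the three inputs stay spatially aligned; this is an inference from the abstract, not a documented detail.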
