No-Reference Stereoscopic Image Quality Assessment Using Convolutional Neural Network for Adaptive Feature Extraction

The pervasion of 3-D technologies over the years gives rise to the increasing demands of accurate and efficient stereoscopic image quality assessment (SIQA) methods, designed to automatically supervise and optimize 3-D image and video processing systems. Though 2-D IQA has attracted considerable attention, its 3-D counterpart is yet to be well explored. In this paper, a no-reference SIQA method using convolution neural network (CNN) for feature extraction is proposed. In the proposed method, a CNN model is trained from scratch to classify images according to their perceptual quality, with quality-aware monocular features extracted from a higher level layer of the network. Then, visual saliency models are utilized to fuse the captured monocular features. In the meanwhile, multi-scale statistical features are derived from the binocular disparity maps. Finally, the fused CNN features and the disparity features are synthesized by support vector regression into the objective quality score of the stereoscopic image. Experimental results on two public databases demonstrate the superior performance of the proposed method over other state-of-the-art methods, in terms of its accuracy in predicting stereoscopic image quality as well as its robustness across different databases and distortion types.

[1]  Alan C. Bovik,et al.  3D Visual Discomfort Predictor: Analysis of Disparity and Neural Activity Statistics , 2015, IEEE Transactions on Image Processing.

[2]  D V Meegan,et al.  Unequal weighting of monocular inputs in binocular combination: implications for the compression of stereoscopic imagery. , 2001, Journal of experimental psychology. Applied.

[3]  Weisi Lin,et al.  Image retargeting quality assessment based on support vector regression , 2015, Signal Process. Image Commun..

[4]  Ahmet M. Kondoz,et al.  Quality analysis for 3D video using 2D video quality models , 2008, IEEE Transactions on Consumer Electronics.

[5]  Sanghoon Lee,et al.  Blind Deep S3D Image Quality Evaluation via Local to Global Feature Aggregation , 2017, IEEE Transactions on Image Processing.

[6]  Qian Li,et al.  Saliency structure stereoscopic image quality assessment method , 2014 .

[7]  Fei Gao,et al.  DeepSim: Deep similarity for image quality assessment , 2017, Neurocomputing.

[8]  Do-Kyoung Kwon,et al.  Full-reference quality assessment of stereopairs accounting for rivalry , 2013, Signal Process. Image Commun..

[9]  Sumohana S. Channappayya,et al.  No- Reference Stereoscopic Image Quality Assessment , 2015 .

[10]  Ling-Yu Duan,et al.  Finding the Secret of Image Saliency in the Frequency Domain , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Ting Luo,et al.  Blind quality estimator for 3D images based on binocular combination and extreme learning machine , 2017, Pattern Recognit..

[12]  Weisi Lin,et al.  Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular Visual Characteristics , 2013, IEEE Transactions on Image Processing.

[13]  Paolo Napoletano,et al.  On the use of deep learning for blind image quality assessment , 2016, Signal Image Video Process..

[14]  Qionghai Dai,et al.  Learning Sparse Representation for No-Reference Quality Assessment of Multiply Distorted Stereoscopic Images , 2017, IEEE Transactions on Multimedia.

[15]  Lei Zhang,et al.  Learning without Human Scores for Blind Image Quality Assessment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Peter Kauff,et al.  Three-Dimensional Video Postproduction and Processing , 2011, Proceedings of the IEEE.

[17]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[18]  Weisi Lin,et al.  On Predicting Visual Comfort of Stereoscopic Images: A Learning to Rank Based Approach , 2016, IEEE Signal Processing Letters.

[19]  Chaminda T. E. R. Hewage,et al.  Reduced-reference quality assessment for 3D video compression and transmission , 2011, IEEE Transactions on Consumer Electronics.

[20]  Lin Ma,et al.  Learning structure of stereoscopic image for no-reference quality assessment with convolutional neural network , 2016, Pattern Recognit..

[21]  Ja-Ling Wu,et al.  Quality Assessment of Stereoscopic 3D Image Compression by Binocular Integration Behaviors , 2014, IEEE Transactions on Image Processing.

[22]  Yi Li,et al.  Convolutional Neural Networks for No-Reference Image Quality Assessment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Lin Ma,et al.  Multimodal learning for facial expression recognition , 2015, Pattern Recognit..

[24]  Jun Du,et al.  An Experimental Study on Speech Enhancement Based on Deep Neural Networks , 2014, IEEE Signal Processing Letters.

[25]  Alan C. Bovik,et al.  Subjective evaluation of stereoscopic image quality , 2013, Signal Process. Image Commun..

[26]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[28]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[29]  Yanqing Li,et al.  No-Reference Stereoscopic Image Quality Assessment Using Natural Scene Statistics , 2017, 2017 2nd International Conference on Multimedia and Image Processing (ICMIP).

[30]  Jiwen Lu,et al.  PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[31]  Aljoscha Smolic,et al.  Scene Representation Technologies for 3DTV—A Survey , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Havani,et al.  Quality Assessment of Stereoscopic 3 D Image Compression by Binocular Integration Behaviors , 2016 .

[33]  Alan C. Bovik,et al.  A Statistical Evaluation of Recent Full Reference Image Quality Assessment Algorithms , 2006, IEEE Transactions on Image Processing.

[34]  Alan C. Bovik,et al.  No-Reference Quality Assessment of Natural Stereopairs , 2013, IEEE Transactions on Image Processing.

[35]  Ting Luo,et al.  Utilizing binocular vision to facilitate completely blind 3D image quality measurement , 2016, Signal Process..

[36]  Hongyu Li,et al.  VSI: A Visual Saliency-Induced Index for Perceptual Image Quality Assessment , 2014, IEEE Transactions on Image Processing.

[37]  Weisi Lin,et al.  Using Binocular Feature Combination for Blind Quality Assessment of Stereoscopic Images , 2015, IEEE Signal Processing Letters.

[38]  Lei Zhang,et al.  A Feature-Enriched Completely Blind Image Quality Evaluator , 2015, IEEE Transactions on Image Processing.

[39]  Kai Zeng,et al.  Quality Prediction of Asymmetrically Distorted Stereoscopic 3D Images , 2015, IEEE Transactions on Image Processing.