Blind Deep S3D Image Quality Evaluation via Local to Global Feature Aggregation

Previously, no-reference (NR) stereoscopic 3D (S3D) image quality assessment (IQA) algorithms have been limited to the extraction of reliable hand-crafted features based on an understanding of the insufficiently revealed human visual system or natural scene statistics. Furthermore, compared with full-reference (FR) S3D IQA metrics, it is difficult to achieve competitive quality score predictions using the extracted features, which are not optimized with respect to human opinion. To cope with this limitation of the conventional approach, we introduce a novel deep learning scheme for NR S3D IQA in terms of local to global feature aggregation. A deep convolutional neural network (CNN) model is trained in a supervised manner through two-step regression. First, to overcome the lack of training data, local patch-based CNNs are modeled, and the FR S3D IQA metric is used to approximate a reference ground-truth for training the CNNs. The automatically extracted local abstractions are aggregated into global features by inserting an aggregation layer in the deep structure. The locally trained model parameters are then updated iteratively using supervised global labeling, i.e., subjective mean opinion score (MOS). In particular, the proposed deep NR S3D image quality evaluator does not estimate the depth from a pair of S3D images. The S3D image quality scores predicted by the proposed method represent a significant improvement over those of previous NR S3D IQA algorithms. Indeed, the accuracy of the proposed method is competitive with FR S3D IQA metrics, having ~ 91% correlation in terms of MOS.

[1]  Alan C. Bovik,et al.  Visual Importance Pooling for Image Quality Assessment , 2009, IEEE Journal of Selected Topics in Signal Processing.

[2]  Do-Kyoung Kwon,et al.  Full-reference quality assessment of stereopairs accounting for rivalry , 2013, Signal Process. Image Commun..

[3]  Kwanghoon Sohn,et al.  Visual Comfort Enhancement for Stereoscopic Video Based on Binocular Fusion Characteristics , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Alan C. Bovik,et al.  Stereoscopic 3D Visual Discomfort Prediction: A Dynamic Accommodation and Vergence Interaction Model , 2016, IEEE Transactions on Image Processing.

[5]  Nick Holliman,et al.  Stereoscopic image quality metrics and compression , 2008, Electronic Imaging.

[6]  Kwanghyun Lee,et al.  3D Perception Based Quality Pooling: Stereopsis, Binocular Rivalry, and Binocular Suppression , 2015, IEEE Journal of Selected Topics in Signal Processing.

[7]  Radomír Mech,et al.  Deep Multi-patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Alan C. Bovik,et al.  No-Reference Quality Assessment of Natural Stereopairs , 2013, IEEE Transactions on Image Processing.

[10]  Heiko Schwarz,et al.  3D High-Efficiency Video Coding for Multi-View Video and Depth Data , 2013, IEEE Transactions on Image Processing.

[11]  Christophe Charrier,et al.  Blind Image Quality Assessment: A Natural Scene Statistics Approach in the DCT Domain , 2012, IEEE Transactions on Image Processing.

[12]  Alan C. Bovik,et al.  3D Visual Discomfort Predictor: Analysis of Disparity and Neural Activity Statistics , 2015, IEEE Transactions on Image Processing.

[13]  Weisi Lin,et al.  Learning a blind quality evaluation engine of screen content images , 2016, Neurocomputing.

[14]  Alan C. Bovik,et al.  Video Quality Pooling Adaptive to Perceptual Distortion Severity , 2013, IEEE Transactions on Image Processing.

[15]  Nicolas Pinto,et al.  Why is Real-World Visual Object Recognition Hard? , 2008, PLoS Comput. Biol..

[16]  Jie Li,et al.  No-reference image quality assessment using Prewitt magnitude based on convolutional neural networks , 2016, Signal Image Video Process..

[17]  David Kane,et al.  The Limits of Human Stereopsis in Space and Time , 2014, The Journal of Neuroscience.

[18]  Sanghoon Lee,et al.  Transition of Visual Attention Assessment in Stereoscopic Images With Evaluation of Subjective Visual Quality and Discomfort , 2015, IEEE Transactions on Multimedia.

[19]  Aldo Maalouf,et al.  CYCLOP: A stereo color image quality assessment metric , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  Xuelong Li,et al.  Blind Image Quality Assessment via Deep Learning , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Wenjun Zhang,et al.  No-Reference Stereoscopic IQA Approach: From Nonlinear Effect to Parallax Compensation , 2012, J. Electr. Comput. Eng..

[23]  Wei Shen,et al.  No-reference quality assessment of enhanced images , 2016, China Communications.

[24]  Alan C. Bovik,et al.  3D Visual Discomfort Prediction: Vergence, Foveation, and the Physiological Optics of Accommodation , 2014, IEEE Journal of Selected Topics in Signal Processing.

[25]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[26]  Weisi Lin,et al.  Blind Image Quality Assessment for Stereoscopic Images Using Binocular Guided Quality Lookup and Visual Codebook , 2015, IEEE Transactions on Broadcasting.

[27]  Jianfei Cai,et al.  Three Dimensional Scalable Video Adaptation via User-End Perceptual Quality Assessment , 2008, IEEE Transactions on Broadcasting.

[28]  Wenjun Zhang,et al.  Using Free Energy Principle For Blind Image Quality Assessment , 2015, IEEE Transactions on Multimedia.

[29]  Alan C. Bovik,et al.  Subjective evaluation of stereoscopic image quality , 2013, Signal Process. Image Commun..

[30]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[31]  Weisi Lin,et al.  Saliency-Guided Quality Assessment of Screen Content Images , 2016, IEEE Transactions on Multimedia.

[32]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[33]  Eero P. Simoncelli,et al.  Nonlinear image representation using divisive normalization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Sanghoon Lee,et al.  Fully Deep Blind Image Quality Predictor , 2017, IEEE Journal of Selected Topics in Signal Processing.

[35]  Junyong You,et al.  PERCEPTUAL QUALITY ASSESSMENT FOR STEREOSCOPIC IMAGES BASED ON 2 D IMAGE QUALITY METRICS AND DISPARITY ANALYSIS , 2010 .

[36]  Alan C. Bovik,et al.  Oriented Correlation Models of Distorted Natural Images With Application to Natural Stereopair Quality Evaluation , 2015, IEEE Transactions on Image Processing.

[37]  Mylène C. Q. Farias,et al.  No-reference image quality assessment based on statistics of Local Ternary Pattern , 2016, 2016 Eighth International Conference on Quality of Multimedia Experience (QoMEX).

[38]  Alan C. Bovik,et al.  Feature maps driven no-reference image quality prediction of authentically distorted images , 2015, Electronic Imaging.

[39]  Qionghai Dai,et al.  Learning Receptive Fields and Quality Lookups for Blind Quality Assessment of Stereoscopic Images , 2016, IEEE Transactions on Cybernetics.

[40]  Yuukou Horita,et al.  Objective No-Reference Stereoscopic Image Quality Prediction Based on 2D Image Features and Relative Disparity , 2012, Adv. Multim..

[41]  Dimitris Kanellopoulos,et al.  Data Preprocessing for Supervised Leaning , 2007 .

[42]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[43]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[44]  Alan C. Bovik,et al.  3D Visual Activity Assessment Based on Natural Scene Statistics , 2014, IEEE Transactions on Image Processing.

[45]  Ja-Ling Wu,et al.  Quality Assessment of Stereoscopic 3D Image Compression by Binocular Integration Behaviors , 2014, IEEE Transactions on Image Processing.

[46]  Weisi Lin,et al.  Analysis of Distortion Distribution for Pooling in Image Quality Prediction , 2016, IEEE Transactions on Broadcasting.

[47]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[48]  Aljoscha Smolic,et al.  Automatic View Synthesis by Image-Domain-Warping , 2013, IEEE Transactions on Image Processing.

[49]  Min Zhang,et al.  Blind Image Quality Assessment Using the Joint Statistics of Generalized Local Binary Pattern , 2015, IEEE Signal Processing Letters.

[50]  Wenjun Zhang,et al.  Deep learning network for blind image quality assessment , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[51]  Lai-Man Po,et al.  No-reference image quality assessment with shearlet transform and deep neural networks , 2015, Neurocomputing.

[52]  Zhou Wang,et al.  No-reference image sharpness assessment based on local phase coherence measurement , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[53]  C. Schor,et al.  Binocular sensory fusion is limited by spatial resolution , 1984, Vision Research.

[54]  Weisi Lin,et al.  Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular Visual Characteristics , 2013, IEEE Transactions on Image Processing.

[55]  Wijnand A. IJsselsteijn,et al.  Evaluation of Stereoscopic Images: Beyond 2D Quality , 2011, IEEE Transactions on Broadcasting.

[56]  Yi Li,et al.  Convolutional Neural Networks for No-Reference Image Quality Assessment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[57]  Rajiv Soundararajan,et al.  Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[58]  C W Tyler,et al.  Stereoscopic Depth Movement: Two Eyes Less Sensitive than One , 1971, Science.

[59]  Yuan Zhou,et al.  Objective quality assessment method of stereo images , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[60]  Decebal Constantin Mocanu,et al.  Deep learning for objective quality assessment of 3D images , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[61]  David S. Doermann,et al.  No-Reference Image Quality Assessment Using Visual Codebooks , 2012, IEEE Transactions on Image Processing.

[62]  Patrick Le Callet,et al.  Quality Assessment of Stereoscopic Images , 2008, EURASIP J. Image Video Process..

[63]  Weisi Lin,et al.  No-Reference Image Sharpness Assessment in Autoregressive Parameter Space , 2015, IEEE Transactions on Image Processing.

[64]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[65]  Alan C. Bovik,et al.  Natural scene statistics of color and range , 2011, 2011 18th IEEE International Conference on Image Processing.

[66]  Ting Luo,et al.  Blind 3D image quality assessment based on self-similarity of binocular features , 2017, Neurocomputing.

[67]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[68]  Alan C. Bovik,et al.  Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[69]  Heeseok Oh,et al.  Visual Presence: Viewing Geometry Visual Information of UHD S3D Entertainment. , 2016, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[70]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[71]  Alan C. Bovik,et al.  A survey on 3D quality of experience and 3D quality assessment , 2013, Electronic Imaging.