Blind assessment for stereo images considering binocular characteristics and deep perception map based on deep belief network

Abstract In recent years, blind image quality assessment in the field of 2D image/video has gained the popularity, but its applications in 3D image/video are to be generalized. In this paper, we propose an effective blind metric evaluating stereo images via deep belief network (DBN). This method is based on wavelet transform with both 2D features from monocular images respectively as image content description and 3D features from a novel depth perception map (DPM) as depth perception description. In particular, the DPM is introduced to quantify longitudinal depth information to align with human stereo visual perception. More specifically, the 2D features are local histogram of oriented gradient (HoG) features from high frequency wavelet coefficients and global statistical features including magnitude, variance and entropy. Meanwhile, the global statistical features from the DPM are characterized as 3D features. Subsequently, considering binocular characteristics, an effective binocular weight model based on multiscale energy estimation of the left and right images is adopted to obtain the content quality. In the training and testing stages, three DBN models for the three types features separately are used to get the final score. Experimental results demonstrate that the proposed stereo image quality evaluation model has high superiority over existing methods and achieve higher consistency with subjective quality assessments.

[1]  Cheolkon Jung,et al.  Visual comfort assessment in stereoscopic 3D images using salient object disparity , 2015 .

[2]  Baihua Li,et al.  A no-reference optical flow-based quality evaluator for stereoscopic videos in curvelet domain , 2017, Inf. Sci..

[3]  Sanghoon Lee,et al.  Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Kai Li,et al.  No-reference stereo image quality assessment by learning gradient dictionary-based color visual characteristics , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[5]  Ja-Ling Wu,et al.  Quality Assessment of Stereoscopic 3D Image Compression by Binocular Integration Behaviors , 2014, IEEE Transactions on Image Processing.

[6]  David F. McAllister,et al.  Stereo and 3‐D Display Technologies , 2002 .

[7]  Jinhui Tang,et al.  Weakly Supervised Deep Matrix Factorization for Social Image Understanding , 2017, IEEE Transactions on Image Processing.

[8]  N. Qian Binocular Disparity and the Perception of Depth , 1997, Neuron.

[9]  Qionghai Dai,et al.  Full-Reference Quality Assessment of Stereoscopic Images by Learning Binocular Receptive Field Properties , 2015, IEEE Transactions on Image Processing.

[10]  Cui-Hua Li,et al.  Unifying visual saliency with HOG feature learning for traffic sign detection , 2009, 2009 IEEE Intelligent Vehicles Symposium.

[11]  Wen Gao,et al.  Reduced Reference Stereoscopic Image Quality Assessment Based on Binocular Perceptual Information , 2015, IEEE Transactions on Multimedia.

[12]  Baihua Li,et al.  Quality assessment metric of stereo images considering cyclopean integration and visual saliency , 2016, Inf. Sci..

[13]  Wujie Zhou,et al.  Local gradient patterns (LGP): An effective local-statistical-feature extraction scheme for no-reference image quality assessment , 2017, Inf. Sci..

[14]  Lin Ma,et al.  Learning structure of stereoscopic image for no-reference quality assessment with convolutional neural network , 2016, Pattern Recognit..

[15]  Qionghai Dai,et al.  Toward a Blind Deep Quality Evaluator for Stereoscopic Images Based on Monocular and Binocular Interactions , 2016, IEEE Transactions on Image Processing.

[16]  Lei Zhang,et al.  Blind Image Quality Assessment Using Joint Statistics of Gradient Magnitude and Laplacian Features , 2014, IEEE Transactions on Image Processing.

[17]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[18]  Qionghai Dai,et al.  Learning Receptive Fields and Quality Lookups for Blind Quality Assessment of Stereoscopic Images , 2016, IEEE Transactions on Cybernetics.

[19]  Alan C. Bovik,et al.  Blind Image Quality Assessment: From Natural Scene Statistics to Perceptual Quality , 2011, IEEE Transactions on Image Processing.

[20]  Jiachen Yang,et al.  A perceptual stereoscopic image quality assessment model accounting for binocular combination behavior , 2015, J. Vis. Commun. Image Represent..

[21]  Sumohana S. Channappayya,et al.  Full-Reference Stereo Image Quality Assessment Using Natural Stereo Scene Statistics , 2015, IEEE Signal Processing Letters.

[22]  Kwanghoon Sohn,et al.  No-Reference Quality Assessment for Stereoscopic Images Based on Binocular Quality Perception , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Alan C. Bovik,et al.  No-Reference Quality Assessment of Natural Stereopairs , 2013, IEEE Transactions on Image Processing.

[24]  Gordon Erlebacher,et al.  Hybrid No-Reference Natural Image Quality Assessment of Noisy, Blurry, JPEG2000, and JPEG Images , 2011, IEEE Transactions on Image Processing.

[25]  Xuelong Li,et al.  Sparse representation for blind image quality assessment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Kai Zeng,et al.  Quality Prediction of Asymmetrically Distorted Stereoscopic 3D Images , 2015, IEEE Transactions on Image Processing.

[27]  Wujie Zhou,et al.  Binocular Responses for No-Reference 3D Image Quality Assessment , 2016, IEEE Transactions on Multimedia.

[28]  R. Patterson,et al.  Factors that Affect Depth Perception in Stereoscopic Displays , 1992, Human factors.

[29]  Sumohana S. Channappayya,et al.  No-reference Stereoscopic Image Quality Assessment Using Natural Scene Statistics , 2016, Signal Process. Image Commun..

[30]  Patrick Le Callet,et al.  Quality Assessment of Stereoscopic Images , 2008, EURASIP J. Image Video Process..

[31]  Weisi Lin,et al.  Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular Visual Characteristics , 2013, IEEE Transactions on Image Processing.

[32]  Xin Yang,et al.  Image Quality Assessment Based on Multi-scale Geometric Analysis , 2009, ICIAP.

[33]  Pei-Yin Chen,et al.  An Efficient Hardware Implementation of HOG Feature Extraction for Human Detection , 2014, IEEE Transactions on Intelligent Transportation Systems.

[34]  Christophe Charrier,et al.  Blind Image Quality Assessment: A Natural Scene Statistics Approach in the DCT Domain , 2012, IEEE Transactions on Image Processing.

[35]  Robert S. Allison,et al.  The Effect of Crosstalk on the Perceived Depth From Disparity and Monocular Occlusions , 2011, IEEE Transactions on Broadcasting.

[36]  Mei Yu,et al.  No-reference Stereoscopic Image Quality Assessment Using Binocular Self-similarity and Deep Neural Network , 2016, Signal Process. Image Commun..

[37]  Mei Yu,et al.  Local and global sparse representation for no-reference quality assessment of stereoscopic images , 2018, Inf. Sci..

[38]  Mohamed-Chaker Larabi,et al.  A perceptual metric for stereoscopic image quality assessment based on the binocular energy , 2013, Multidimens. Syst. Signal Process..

[39]  Alan C. Bovik,et al.  Subjective evaluation of stereoscopic image quality , 2013, Signal Process. Image Commun..

[40]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[41]  Ichiro Fujita,et al.  Representation of stereoscopic depth based on relative disparity in macaque area V4. , 2007, Journal of neurophysiology.

[42]  Stéphane Coulombe,et al.  A novel discrete wavelet transform framework for full reference image quality assessment , 2013, Signal Image Video Process..

[43]  Xuelong Li,et al.  Image Quality Assessment Based on Multiscale Geometric Analysis , 2009, IEEE Transactions on Image Processing.

[44]  Jinhui Tang,et al.  Weakly Supervised Deep Metric Learning for Community-Contributed Image Retrieval , 2015, IEEE Transactions on Multimedia.

[45]  Do-Kyoung Kwon,et al.  Full-reference quality assessment of stereopairs accounting for rivalry , 2013, Signal Process. Image Commun..

[46]  Armin B. Cremers,et al.  Efficient Pedestrian Detection via Rectangular Features Based on a Statistical Shape Model , 2015, IEEE Transactions on Intelligent Transportation Systems.

[47]  Jinhui Tang,et al.  Unsupervised Feature Selection via Nonnegative Spectral Analysis and Redundancy Control , 2015, IEEE Transactions on Image Processing.

[48]  Alan C. Bovik,et al.  Image information and visual quality , 2006, IEEE Trans. Image Process..