论文信息 - No-reference synthetic image quality assessment with convolutional neural network and local image saliency

No-reference synthetic image quality assessment with convolutional neural network and local image saliency

Depth-image-based rendering (DIBR) is widely used in 3DTV, free-viewpoint video, and interactive 3D graphics applications. Typically, synthetic images generated by DIBR-based systems incorporate various distortions, particularly geometric distortions induced by object dis-occlusion. Ensuring the quality of synthetic images is critical to maintaining adequate system service. However, traditional 2D image quality metrics are ineffective for evaluating synthetic images as they are not sensitive to geometric distortion. In this paper, we propose a novel no-reference image quality assessment method for synthetic images based on convolutional neural networks, introducing local image saliency as prediction weights. Due to the lack of existing training data, we construct a new DIBR synthetic image dataset as part of our contribution. Experiments were conducted on both the public benchmark IRCCyN/IVC DIBR image dataset and our own dataset. Results demonstrate that our proposed metric outperforms traditional 2D image quality metrics and state-of-the-art DIBR-related metrics.

[1] Klara Nahrstedt,et al. A real-time remote rendering system for interactive mobile graphics , 2012, TOMCCAP.

[2] Stefan Winkler,et al. Analysis of Public Image and Video Databases for Quality Assessment , 2012, IEEE Journal of Selected Topics in Signal Processing.

[3] Kwan-Yee Lin,et al. Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Aljoscha Smolic,et al. 3D video and free viewpoint video - From capture to display , 2011, Pattern Recognit..

[5] Shi-Min Hu,et al. Detecting and Removing Visual Distractors for Video Aesthetic Enhancement , 2018, IEEE Transactions on Multimedia.

[6] Weisi Lin,et al. Quality assessment of 3D synthesized images via disoccluded region discovery , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[7] Hideaki Kimata,et al. Free-viewpoint Video Communication Using Multi-view Video Coding , 2004 .

[8] Y. Naruse. IP-in-IPv6 Overlay Networking Technology for a Terabit-class Super-network , 2004 .

[9] Nikolay N. Ponomarenko,et al. TID2008 – A database for evaluation of full-reference visual quality assessment metrics , 2004 .

[10] Jian Sun,et al. Saliency Optimization from Robust Background Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Susu Yao,et al. Just noticeable distortion model and its applications in video coding , 2005, Signal Process. Image Commun..

[12] Sebastian Bosse,et al. A deep neural network for image quality assessment , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[13] William R. Mark,et al. Post-Rendering 3D Image Warping: Visibility, Reconstruction, and Performance for Depth-Image Warping , 1999 .

[14] Yi Li,et al. Convolutional Neural Networks for No-Reference Image Quality Assessment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15] Bo Yan,et al. An accurate deep convolutional neural networks model for no-reference image quality assessment , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[16] Christoph Fehn,et al. Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[17] Patrick Le Callet,et al. DIBR synthesized image quality assessment based on morphological wavelets , 2015, 2015 Seventh International Workshop on Quality of Multimedia Experience (QoMEX).

[18] Aljoscha Smolic,et al. An overview of available and emerging 3D video formats and depth enhanced stereo as efficient generic solution , 2009, 2009 Picture Coding Symposium.

[19] Heiko Hirschmüller,et al. Evaluation of Cost Functions for Stereo Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[20] Richard Szeliski,et al. High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[21] Patrick Le Callet,et al. DIBR-synthesized image quality assessment based on morphological multi-scale approach , 2017, EURASIP J. Image Video Process..

[22] Alberto Leon-Garcia,et al. Estimation of shape parameter for generalized Gaussian distributions in subband decompositions of video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[23] Chong Luo,et al. Multiple Level Feature-Based Universal Blind Image Quality Assessment Model , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[24] Patrick Le Callet,et al. Objective image quality assessment of 3D synthesized views , 2015, Signal Process. Image Commun..

[25] Nikolay N. Ponomarenko,et al. Image database TID2013: Peculiarities, results and perspectives , 2015, Signal Process. Image Commun..

[26] Olivier Déforges,et al. NIQSV+: A No-Reference Synthesized View Quality Assessment Metric , 2018, IEEE Transactions on Image Processing.

[27] Tingting Jiang,et al. From image quality to patch quality: An Image-Patch Model for No-Reference image quality assessment , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28] Paul Bao,et al. A framework for remote rendering of 3-D scenes on limited mobile devices , 2006, IEEE Transactions on Multimedia.

[29] Patrick Le Callet,et al. Towards a New Quality Metric for 3-D Synthesized View Assessment , 2011, IEEE Journal of Selected Topics in Signal Processing.

[30] Frederick W. B. Li,et al. Scalable Remote Rendering Using Synthesized Image Quality Assessment , 2018, IEEE Access.

[31] Pierre-Henri Conze,et al. Objective view synthesis quality assessment , 2012, Electronic Imaging.

[32] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[33] Alan C. Bovik,et al. Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[34] D. Chandler,et al. Supplement to “ VSNR : A Visual Signal-to-Noise Ratio for Natural Images Based on Near-Threshold and Suprathreshold Vision ” , 2007 .

[35] C.-C. Jay Kuo,et al. MCL-3D: A Database for Stereoscopic Image Quality Assessment using 2D-Image-Plus-Depth Source , 2014, J. Inf. Sci. Eng..

[36] Luce Morin,et al. Perceived quality of DIBR-based synthesized views , 2011, Optical Engineering + Applications.

[37] Alan C. Bovik,et al. No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[38] Daniel Thalmann,et al. Model-Based Referenceless Quality Metric of 3D Synthesized Images Using Local Image Description , 2018, IEEE Transactions on Image Processing.

[39] David Zhang,et al. FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[40] Sugato Chakravarty,et al. Methodology for the subjective assessment of the quality of television pictures , 1995 .

[41] Hua Huang,et al. No-reference image quality assessment based on spatial and spectral entropies , 2014, Signal Process. Image Commun..

[42] P. Bao,et al. Low bandwidth remote rendering using 3D image warping , 2003 .

[43] Thomas Brox,et al. A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).