Re-Visiting Discriminator for Blind Free-Viewpoint Image Quality Assessment

Accurate measurement of perceptual quality is important for various immersive multimedia, which demand real-time quality control or quality-based bench-marking for relevant algorithms. For instance, virtual views rendering in Free-Viewpoint (FV) navigation scenarios is a typical case that introduces challenging distortions, particularly the ones around dis-occluded regions. Existing quality metrics, most of which are targeting for impairments caused by compression or network condition, fail to quantify such non-uniform structure-related distortions. Moreover, the lack of quality databases for such distortions makes it even more challenging to develop robust quality metrics. In this work, a Generative Adversarial Networks based No-Reference (NR) quality Metric, namely GANs-NRM, is proposed. We first present an approach to create masks mimicking dis-occlusions/textureless regions, which is applicable on large-scale 2D image databases publicly available in the computer vision domain. Using these synthetic data, we then train a GANs-based context renderer with the capability of rendering those masked regions. Since the naturalness of the rendered dis-occluded regions strongly relates to the perceptual quality, we assume that the discriminator of the trained GANs has an intrinsic ability for quality assessment. We thus use the features extracted from the discriminator to learn a Bag-of-Distortion-Word (BDW) codebook. We show that a quality predictor can be then well trained using only a small amount of subjective quality data for the FV views rendering. Moreover, in the proposed framework, the discriminator is also adapted as a distortion-detector to locate possible distorted regions. According to the experimental results, the proposed model outperforms significantly the state-of-the-art quality metrics. The corresponding context renderer also shows appealing visualized results over other rendering algorithms.

[1]  Zhibo Chen,et al.  Tensor Oriented No-Reference Light Field Image Quality Assessment , 2019, IEEE Transactions on Image Processing.

[2]  Patrick Le Callet,et al.  Fast Blind Quality Assessment of DIBR-Synthesized Video Based on High-High Wavelet Subband , 2019, IEEE Transactions on Image Processing.

[3]  Olivier Déforges,et al.  A Benchmark of DIBR Synthesized View Quality Assessment Metrics on a New Database for Immersive Media Applications , 2019, IEEE Transactions on Multimedia.

[4]  Xiongkuo Min,et al.  Quality Assessment of Free-Viewpoint Videos by Quantifying the Elastic Changes of Multi-Scale Motion Trajectories , 2019, IEEE Transactions on Image Processing.

[5]  Wei Zhou,et al.  Dual-Stream Interactive Networks for No-Reference Stereoscopic Image Quality Assessment , 2019, IEEE Transactions on Image Processing.

[6]  Jan-Michael Frahm,et al.  Deep blending for free-viewpoint image-based rendering , 2018, ACM Trans. Graph..

[7]  Hans-Peter Seidel,et al.  Dataset and Metrics for Predicting Local Visible Differences , 2018, ACM Trans. Graph..

[8]  Ke Gu,et al.  Prediction of the Influence of Navigation Scan-Path on Perceived Quality of Free-Viewpoint Videos , 2018, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[9]  Patrick Le Callet,et al.  How to Learn the Effect of Non-Uniform Distortion on Perceived Visual Quality? Case Study Using Convolutional Sparse Coding for Quality Assessment of Synthesized Views , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[10]  Patrick Le Callet,et al.  No-Reference Quality Assessment for Stitched Panoramic Images Using Convolutional Sparse Coding and Compound Feature Selection , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[11]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  2018 - 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON) , 2018 .

[13]  Kwan-Yee Lin,et al.  Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Olivier Déforges,et al.  NIQSV+: A No-Reference Synthesized View Quality Assessment Metric , 2018, IEEE Transactions on Image Processing.

[15]  Ke Gu,et al.  Quality Assessment of DIBR-Synthesized Images by Measuring Local Geometric Distortions and Global Sharpness , 2018, IEEE Transactions on Multimedia.

[16]  Bo Yang,et al.  Dense 3D Object Reconstruction from a Single Depth View , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Shiguang Shan,et al.  Shift-Net: Image Inpainting via Deep Feature Rearrangement , 2018, ECCV.

[18]  Thomas S. Huang,et al.  Generative Image Inpainting with Contextual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[19]  Chao Yang,et al.  Shape Inpainting Using 3D Generative Adversarial Network and Recurrent Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[20]  Xiaogang Wang,et al.  StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Ming Jiang,et al.  Exploiting High-Level Semantics for No-Reference Image Quality Assessment of Realistic Blur Images , 2017, ACM Multimedia.

[22]  Patrick Le Callet,et al.  Image Quality Assessment for DIBR Synthesized Views using Elastic Metric , 2017, ACM Multimedia.

[23]  Patrick Le Callet,et al.  Quality assessment for synthesized view based on variable-length context tree , 2017, 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).

[24]  Patrick Le Callet,et al.  Image quality assessment for free viewpoint video based on mid-level contours feature , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[25]  Dacheng Tao,et al.  Perceptual Adversarial Networks for Image-to-Image Transformation , 2017, IEEE Transactions on Image Processing.

[26]  Hans-Peter Seidel,et al.  Towards a Quality Metric for Dense Light Fields , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Olivier Déforges,et al.  NIQSV: A no reference image quality assessment metric for 3D synthesized views , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28]  Sebastian Bosse,et al.  Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment , 2016, IEEE Transactions on Image Processing.

[29]  Joohwan Kim,et al.  Towards foveated rendering for gaze-tracked virtual reality , 2016, ACM Trans. Graph..

[30]  Sebastian Bosse,et al.  A deep neural network for image quality assessment , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[31]  Minh N. Do,et al.  Semantic Image Inpainting with Deep Generative Models , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  P. Callet,et al.  DIBR-synthesized image quality assessment based on morphological multi-scale approach , 2016, EURASIP J. Image Video Process..

[33]  Yuesheng Zhu,et al.  A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Alexei A. Efros,et al.  Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[36]  Shuai Li,et al.  Depth Image Based View Synthesis: New Insights and Perspectives on Hole Generation and Filling , 2016, IEEE Transactions on Broadcasting.

[37]  Paolo Napoletano,et al.  On the use of deep learning for blind image quality assessment , 2016, Signal Image Video Process..

[38]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[39]  Patrick Le Callet,et al.  DIBR synthesized image quality assessment based on morphological pyramids , 2015, 2015 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[40]  Patrick Le Callet,et al.  DIBR synthesized image quality assessment based on morphological wavelets , 2015, 2015 Seventh International Workshop on Quality of Multimedia Experience (QoMEX).

[41]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[42]  Yi Li,et al.  Convolutional Neural Networks for No-Reference Image Quality Assessment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Aaron C. Courville,et al.  Generative Adversarial Nets , 2014, NIPS.

[44]  David G. Lowe,et al.  Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Changick Kim,et al.  A Novel Depth-Based Virtual View Synthesis Method for Free Viewpoint Video , 2013, IEEE Transactions on Broadcasting.

[46]  Judith Redi,et al.  Supporting visual quality assessment with machine learning , 2013, EURASIP Journal on Image and Video Processing.

[47]  D. Chandler Seven Challenges in Image Quality Assessment: Past, Present, and Future Research , 2013 .

[48]  Ghassan Al-Regib,et al.  Hierarchical Hole-Filling For Depth-Based View Synthesis in FTV and 3D Video , 2012, IEEE Journal of Selected Topics in Signal Processing.

[49]  Pierre-Henri Conze,et al.  Objective view synthesis quality assessment , 2012, Electronic Imaging.

[50]  Christos Boutsidis,et al.  Randomized Dimensionality Reduction for $k$ -Means Clustering , 2011, IEEE Transactions on Information Theory.

[51]  Patrick Le Callet,et al.  Towards a New Quality Metric for 3-D Synthesized View Assessment , 2011, IEEE Journal of Selected Topics in Signal Processing.

[52]  Yong Man Ro,et al.  Quantitative measurement of binocular color fusion limit for non-spectral colors. , 2011, Optics express.

[53]  Thomas Wiegand,et al.  Temporally consistent handling of disocclusions with texture synthesis for depth-image-based rendering , 2010, 2010 IEEE International Conference on Image Processing.

[54]  Thomas Wiegand,et al.  Depth Image-Based Rendering With Advanced Texture Synthesis for 3-D Video , 2010, IEEE Transactions on Multimedia.

[55]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[56]  Mohammed Ghanbari,et al.  Scope of validity of PSNR in image/video quality assessment , 2008 .

[57]  Toshiaki Fujii,et al.  View generation with 3D warping using depth information for FTV , 2009, Signal Process. Image Commun..

[58]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[59]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[60]  Alexandru Telea,et al.  An Image Inpainting Technique Based on the Fast Marching Method , 2004, J. Graphics, GPU, & Game Tools.

[61]  Ke Gu,et al.  Blind Quality Metric of DIBR-Synthesized Images in the Discrete Wavelet Transform Domain , 2020, IEEE Transactions on Image Processing.

[62]  Daniel Thalmann,et al.  Model-Based Referenceless Quality Metric of 3D Synthesized Images Using Local Image Description , 2018, IEEE Transactions on Image Processing.

[63]  Bin Chen,et al.  Light Field Synthesis from a Single Image using Improved Wasserstein Generative Adversarial Network , 2018, Eurographics.

[64]  Patrick Le Callet,et al.  Objective image quality assessment of 3D synthesized views , 2015, Signal Process. Image Commun..

[65]  Christine Guillemot,et al.  Author manuscript, published in "IEEE International Conference on Image Processing (ICIP), Brussels: Belgium (2011)" OBJECT-BASED LAYERED DEPTH IMAGES FOR IMPROVED VIRTUAL VIEW SYNTHESIS IN RATE-CONSTRAINED CONTEXT , 2011 .

[66]  Toshiaki Fujii,et al.  Free-Viewpoint TV , 2011, IEEE Signal Processing Magazine.

[67]  S. Süsstrunk,et al.  SLIC Superpixels ? , 2010 .

[68]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[69]  Aljoscha Smolic,et al.  View Synthesis for Advanced 3D Video Systems , 2008, EURASIP J. Image Video Process..

[70]  C. Fehn A 3 DTV Approach Using Depth-Image-Based Rendering ( DIBR ) , 2003 .