Sparse Representation-Based Video Quality Assessment for Synthesized 3D Videos

The temporal flicker distortion is one of the most annoying noises in synthesized virtual view videos when they are rendered by compressed multi-view video plus depth in Three Dimensional (3D) video system. To assess the synthesized view video quality and further optimize the compression techniques in 3D video system, objective video quality assessment which can accurately measure the flicker distortion is highly needed. In this paper, we propose a full reference sparse representation-based video quality assessment method toward synthesized 3D videos. First, a synthesized video, treated as a 3D volume data with spatial (X-Y) and temporal (T) domains, is reformed and decomposed as a number of spatially neighboring temporal layers, i.e., X-T or Y-T planes. Gradient features in temporal layers of the synthesized video and strong edges of depth maps are used as key features in detecting the location of flicker distortions. Second, the dictionary learning and sparse representation for the temporal layers are then derived and applied to effectively represent the temporal flicker distortion. Third, a rank pooling method is used to pool all the temporal layer scores and obtain the score for the flicker distortion. Finally, the temporal flicker distortion measurement is combined with the conventional spatial distortion measurement to assess the quality of synthesized 3D videos. Experimental results on synthesized video quality database demonstrate our proposed method is significantly superior to the other state-of-the-art methods, especially on the view synthesis distortions induced from depth videos.

[1]  Y. C. Pati,et al.  Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[2]  Sumohana S. Channappayya,et al.  Modeling sparse spatio-temporal representations for no-reference video quality assessment , 2017, 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[3]  Wilson S. Geisler,et al.  Image quality assessment based on a degradation model , 2000, IEEE Trans. Image Process..

[4]  Margaret H. Pinson,et al.  A new standardized method for objectively measuring video quality , 2004, IEEE Transactions on Broadcasting.

[5]  Yu Zhou,et al.  No-reference quality assessment of DIBR-synthesized videos by measuring temporal flickering , 2018, J. Vis. Commun. Image Represent..

[6]  Lu Yu,et al.  A perceptual metric for evaluating quality of synthesized sequences in 3DV system , 2010, Visual Communications and Image Processing.

[7]  Ke Gu,et al.  Quality Assessment of DIBR-Synthesized Images by Measuring Local Geometric Distortions and Global Sharpness , 2018, IEEE Transactions on Multimedia.

[8]  Patrick Le Callet,et al.  DIBR-synthesized image quality assessment based on morphological multi-scale approach , 2017, EURASIP J. Image Video Process..

[9]  Weisi Lin,et al.  Analysis of Distortion Distribution for Pooling in Image Quality Prediction , 2016, IEEE Transactions on Broadcasting.

[10]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[11]  Fan Zhang,et al.  Exploring V1 by modeling the perceptual quality of images. , 2014, Journal of vision.

[12]  Patrick Le Callet,et al.  Objective image quality assessment of 3D synthesized views , 2015, Signal Process. Image Commun..

[13]  C.-C. Jay Kuo,et al.  Efficient Multiview Depth Coding Optimization Based on Allowable Depth Distortion in View Synthesis , 2014, IEEE Transactions on Image Processing.

[14]  Chang Wen Chen,et al.  Sparse Spatio-Temporal Representation With Adaptive Regularized Dictionary Learning for Low Bit-Rate Video Coding , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Wen Gao,et al.  Reduced-Reference Image Quality Assessment in Free-Energy Principle and Sparse Representation , 2017, IEEE Transactions on Multimedia.

[16]  Michael Elad,et al.  Image Sequence Denoising via Sparse and Redundant Representations , 2009, IEEE Transactions on Image Processing.

[17]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[18]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[19]  Patrick Le Callet,et al.  DIBR synthesized image quality assessment based on morphological wavelets , 2015, 2015 Seventh International Workshop on Quality of Multimedia Experience (QoMEX).

[20]  Ghassan Al-Regib,et al.  3VQM: A vision-based quality measure for DIBR-based 3D videos , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[21]  Weisi Lin,et al.  Multiscale Natural Scene Statistical Analysis for No-Reference Quality Evaluation of DIBR-Synthesized Views , 2020, IEEE Transactions on Broadcasting.

[22]  Zhou Wang,et al.  Information Content Weighting for Perceptual Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[23]  Weisi Lin,et al.  A Highly Efficient Blind Image Quality Assessment Metric of 3-D Synthesized Images Using Outlier Detection , 2019, IEEE Transactions on Industrial Informatics.

[24]  Weisi Lin,et al.  Efficient Image Deblocking Based on Postfiltering in Shifted Windows , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[26]  张云 Regional bit allocation and rate distortion optimization for multiview depth video coding with View synthesis distortion model , 2013 .

[27]  Denis Pellerin,et al.  Dictionary of gray-level 3D patches for action recognition , 2014, 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP).

[28]  Qionghai Dai,et al.  Full-Reference Quality Assessment of Stereoscopic Images by Learning Binocular Receptive Field Properties , 2015, IEEE Transactions on Image Processing.

[29]  Daniel Thalmann,et al.  Model-Based Referenceless Quality Metric of 3D Synthesized Images Using Local Image Description , 2018, IEEE Transactions on Image Processing.

[30]  C.-C. Jay Kuo,et al.  Subjective and Objective Video Quality Assessment of 3D Synthesized Views With Texture/Depth Compression Distortion , 2015, IEEE Transactions on Image Processing.

[31]  Erhan Ekmekcioglu,et al.  Depth Based Perceptual Quality Assessment for Synthesised Camera Viewpoints , 2010, UCMedia.

[32]  Marco Grangetto,et al.  Evaluating virtual image quality using the side-views information fusion and depth maps , 2018, Inf. Fusion.

[33]  Gangyi Jiang,et al.  Learning Sparse Representation for Objective Image Retargeting Quality Assessment , 2018, IEEE Transactions on Cybernetics.

[34]  Weisi Lin,et al.  No-Reference View Synthesis Quality Prediction for 3-D Videos Based on Color–Depth Interactions , 2018, IEEE Transactions on Multimedia.

[35]  Xuelong Li,et al.  Sparse representation for blind image quality assessment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Dragan D. Kukolj,et al.  Multi–Scale Synthesized View Assessment Based on Morphological Pyramids , 2016 .

[37]  Yun Zhang,et al.  High-Efficiency 3D Depth Coding Based on Perceptual Quality of Synthesized Video. , 2016, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[38]  Touradj Ebrahimi,et al.  A quality assessment protocol for free-viewpoint video sequences synthesized from decompressed depth data , 2013, 2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX).

[39]  Dong Tian,et al.  Boundary Artifact Reduction in View Synthesis of 3D Video: From Perspective of Texture-Depth Alignment , 2011, IEEE Transactions on Broadcasting.

[40]  Hsueh-Ming Hang,et al.  Quality assessment of synthesized 3D video with distorted depth map , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[41]  Weisi Lin,et al.  Image Sharpness Assessment by Sparse Representation , 2016, IEEE Transactions on Multimedia.

[42]  Chunping Hou,et al.  Combining Local and Global Measures for DIBR-Synthesized Image Quality Evaluation , 2019, IEEE Transactions on Image Processing.

[43]  Sunil Prasad Jaiswal,et al.  A Prediction Backed Model for Quality Assessment of Screen Content and 3-D Synthesized Images , 2018, IEEE Transactions on Industrial Informatics.

[44]  Patrick Le Callet,et al.  Towards a New Quality Metric for 3-D Synthesized View Assessment , 2011, IEEE Journal of Selected Topics in Signal Processing.

[45]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[46]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .