WLDISR: Weighted Local Sparse Representation-Based Depth Image Super-Resolution for 3D Video System

In this paper, we propose a Weighted Local sparse representation based Depth Image Super-Resolution (WLDISR) schemes aiming at improving the Virtual View Image (VVI) quality of 3D video system. Different from color images, depth images are mainly used to provide geometrical information in synthesizing VVI. Due to the view synthesis characteristics difference between textural structures and smooth regions of depth images, we divide the depth images into edge and smooth patches and learn two local dictionaries, respectively. Meanwhile, the weight term is derived and incorporated explicitly in the cost function to denote different importance of edge structures and smooth regions to the VVI quality. Then, local sparse representation and weighted sparse representation are jointly used in both dictionary learning and reconstruction phases in depth image super-resolution. Based on different optimizations on learning and reconstruction modules, three WLDISR schemes, WLDISR-D, WLDISR-R, and WLDISR-ALL, are proposed. Experimental results on 3D sequences demonstrate that the proposed WLDISR-D, WLDISR-R, and WLDISR-ALL schemes can achieve more than 1.9-, 2.03-, and 2.16-dB gains on average, respectively, in terms of the VVIs’ quality, as compared with the state-of-the-art schemes. In addition, the visual quality of VVIs is also improved.

[1]  Xiangtao Zheng,et al.  Hyperspectral Image Superresolution by Transfer Learning , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Nianyi Li,et al.  A weighted sparse coding framework for saliency detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[4]  Truong Q. Nguyen,et al.  Depth Reconstruction From Sparse Samples: Representation, Algorithm, and Sampling , 2014, IEEE Transactions on Image Processing.

[5]  Srimanta Mandal,et al.  Depth Map Restoration From Undersampled Data , 2017, IEEE Transactions on Image Processing.

[6]  Chunhong Pan,et al.  Manifold Regularized Local Sparse Representation for Face Recognition , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Jin Young Lee,et al.  Efficient Synthesis-Based Depth Map Coding in AVC-Compatible 3D Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Luc Van Gool,et al.  A+: Adjusted Anchored Neighborhood Regression for Fast Super-Resolution , 2014, ACCV.

[9]  Qionghai Dai,et al.  CCR: Clustering and Collaborative Representation for Fast Single Image Super-Resolution , 2016, IEEE Transactions on Multimedia.

[10]  Rogério Schmidt Feris,et al.  Joint Super Resolution and Denoising From a Single Depth Image , 2015, IEEE Transactions on Multimedia.

[11]  Gangyi Jiang,et al.  Regional Bit Allocation and Rate Distortion Optimization for Multiview Depth Video Coding With View Synthesis Distortion Model , 2013, IEEE Transactions on Image Processing.

[12]  Jianjun Lei,et al.  Depth Map Super-Resolution Considering View Synthesis Quality , 2017, IEEE Transactions on Image Processing.

[13]  In-So Kweon,et al.  Light-Field Image Super-Resolution Using Convolutional Neural Network , 2017, IEEE Signal Processing Letters.

[14]  Qionghai Dai,et al.  Synthesis-guided depth super resolution , 2014, 2014 IEEE Visual Communications and Image Processing Conference.

[15]  Michael Elad,et al.  On Single Image Scale-Up Using Sparse-Representations , 2010, Curves and Surfaces.

[16]  Michal Joachimiak,et al.  Upsampled-view distortion optimization for mixed resolution 3D Video Coding , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[17]  Srimanta Mandal,et al.  Noise adaptive super-resolution from single image via non-local mean and sparse representation , 2017, Signal Process..

[18]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Qiang Wu,et al.  Explicit modeling on depth-color inconsistency for color-guided depth up-sampling , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[20]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[21]  Luc Van Gool,et al.  Anchored Neighborhood Regression for Fast Example-Based Super-Resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[22]  C.-C. Jay Kuo,et al.  Efficient Multiview Depth Coding Optimization Based on Allowable Depth Distortion in View Synthesis , 2014, IEEE Transactions on Image Processing.

[23]  Shang-Hong Lai,et al.  Face Verification With Local Sparse Representation , 2013, IEEE Signal Processing Letters.

[24]  Feng Wu,et al.  Estimation of Virtual View Synthesis Distortion Toward Virtual View Position , 2016, IEEE Transactions on Image Processing.

[25]  Michal Joachimiak,et al.  View synthesis quality mapping for depth-based super resolution on mixed resolution 3D video , 2014, 2014 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[26]  Horst Bischof,et al.  Variational Depth Superresolution Using Example-Based Edge Representations , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27]  Shuai Yang,et al.  Joint-Feature Guided Depth Map Super-Resolution With Face Priors , 2018, IEEE Transactions on Cybernetics.

[28]  Oscar C. Au,et al.  Depth Map Super-Resolution Using Synthesized View Matching for Depth-Image-Based Rendering , 2012, 2012 IEEE International Conference on Multimedia and Expo Workshops.

[29]  Gabriel J. Brostow,et al.  Patch Based Synthesis for Single Depth Image Super-Resolution , 2012, ECCV.

[30]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[31]  Rogério Schmidt Feris,et al.  Edge guided single depth image super resolution , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[32]  Ahmet M. Kondoz,et al.  Edge adaptive upsampling of depth map videos for enhanced free-viewpoint video quality , 2009 .

[33]  Yuan Yan Tang,et al.  Weighted Joint Sparse Representation for Removing Mixed Noise in Image , 2017, IEEE Transactions on Cybernetics.

[34]  Hamid R. Rabiee,et al.  When Pixels Team up: Spatially Weighted Sparse Coding for Hyperspectral Image Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[35]  Yusheng Ji,et al.  Multiple Description Coding & Recovery of Free Viewpoint Video for Wireless MultiPath Streaming , 2014 .

[36]  Yun Zhang,et al.  A Virtual View PSNR Estimation Method for 3-D Videos , 2016, IEEE Transactions on Broadcasting.