Depth-image-based rendering with spatial and temporal texture synthesis for 3DTV

This article presents a depth-image-based rendering (DIBR) method with spatial and temporal texture synthesis. In theory, DIBR can generate arbitrary virtual views of a scene in a three-dimensional television (3DTV) system. However, disoccluded areas, which are hidden in the original views and become visible in the virtual views, make it very difficult to obtain high image quality in extrapolated views. The proposed view synthesis method fills these areas by combining temporally stationary scene information extracted from the input video with spatial texture from the current frame. First, the current texture image and a stationary scene image extracted from the input video are warped to the same virtual viewpoint by DIBR. Then, the two virtual images are merged to reduce the hole regions and maintain temporal consistency in these areas. Finally, an oriented exemplar-based inpainting method eliminates the remaining holes. Experimental results demonstrate the performance and advantages of the proposed method compared with other view synthesis methods.
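The warp-and-merge steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes rectified views, so that warping reduces to a per-pixel horizontal shift, grayscale images, and integer disparities where a larger value means a nearer pixel. The names `HOLE`, `warp_row_shift`, and `merge_views` are hypothetical, and the final oriented exemplar-based inpainting step is left out.

```python
import numpy as np

HOLE = -1  # hypothetical marker for disoccluded (unfilled) pixels


def warp_row_shift(texture, disparity):
    """Forward-warp a grayscale image by a per-pixel horizontal disparity.

    Assumption: rectified cameras, so each pixel moves only along its row.
    When two source pixels land on the same target, the one with the larger
    disparity (the nearer one) wins, as in a simple z-buffer test.
    """
    h, w = texture.shape
    warped = np.full((h, w), HOLE, dtype=np.int32)
    best_disp = np.full((h, w), -1, dtype=np.int32)
    for y in range(h):
        for x in range(w):
            d = int(disparity[y, x])
            xt = x - d  # target column in the virtual view
            if 0 <= xt < w and d > best_disp[y, xt]:
                warped[y, xt] = texture[y, x]
                best_disp[y, xt] = d
    return warped


def merge_views(warped_current, warped_stationary):
    """Fill holes in the warped current frame from the warped stationary image."""
    merged = warped_current.copy()
    fill = (warped_current == HOLE) & (warped_stationary != HOLE)
    merged[fill] = warped_stationary[fill]
    return merged


# Current frame: a foreground object (200, 201) in front of the background.
current = np.array([[10, 11, 200, 201, 14, 15]], dtype=np.int32)
current_disp = np.array([[0, 0, 2, 2, 0, 0]], dtype=np.int32)

# Stationary scene image: background only, as if accumulated over time.
stationary = np.array([[10, 11, 12, 13, 14, 15]], dtype=np.int32)
stationary_disp = np.zeros_like(stationary)

warped_current = warp_row_shift(current, current_disp)
warped_stationary = warp_row_shift(stationary, stationary_disp)
merged = merge_views(warped_current, warped_stationary)

# Pixels still equal to HOLE after merging would be passed to inpainting.
remaining_holes = int(np.count_nonzero(merged == HOLE))
```

In this toy case the foreground shifts two pixels to the left and exposes two background pixels, which the stationary scene image fills, so no holes remain for inpainting. In real sequences the stationary image covers only part of the disoccluded area, which is why the final inpainting step is needed.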
