Adaptive Image Warping for Hole Prevention in 3D View Synthesis

Increasing popularity of 3D videos calls for new methods to ease the conversion process of existing monocular video to stereoscopic or multi-view video. A popular way to convert video is given by depth image-based rendering methods, in which a depth map that is associated with an image frame is used to generate a virtual view. Because of the lack of knowledge about the 3D structure of a scene and its corresponding texture, the conversion of 2D video, inevitably, however, leads to holes in the resulting 3D image as a result of newly-exposed areas. The conversion process can be altered such that no holes become visible in the resulting 3D view by superimposing a regular grid over the depth map and deforming it. In this paper, an adaptive image warping approach as an improvement to the regular approach is proposed. The new algorithm exploits the smoothness of a typical depth map to reduce the complexity of the underlying optimization problem that is necessary to find the deformation, which is required to prevent holes. This is achieved by splitting a depth map into blocks of homogeneous depth using quadtrees and running the optimization on the resulting adaptive grid. The results show that this approach leads to a considerable reduction of the computational complexity while maintaining the visual quality of the synthesized views.

[1]  T. Chan,et al.  Variational image inpainting , 2005 .

[2]  Kwanghoon Sohn,et al.  Visual fatigue modeling and analysis for stereoscopic video , 2012 .

[3]  Matthias Zwicker,et al.  3 Ideal Resampling 3 . 1 Sampling and Aliasing , 2022 .

[4]  I. B. Fidaner A Survey on Variational Image Inpainting , Texture Synthesis and Image Completion , 2007 .

[5]  Peter Kauff,et al.  Three-Dimensional Video Postproduction and Processing , 2011, Proceedings of the IEEE.

[6]  Guillermo Sapiro,et al.  Video SnapCut: robust video object cutout using localized classifiers , 2009, SIGGRAPH 2009.

[7]  Markus H. Gross,et al.  StereoBrush: interactive 2D to 3D conversion using discontinuous warps , 2011, SBIM '11.

[8]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[9]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[10]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[11]  Guillermo Sapiro,et al.  Navier-stokes, fluid dynamics, and image and video inpainting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Carlos Vázquez,et al.  3D-TV Content Creation: Automatic 2D-to-3D Video Conversion , 2011, IEEE Transactions on Broadcasting.

[13]  Touradj Ebrahimi,et al.  Quality assessment of a stereo pair formed from decoded and synthesized views using objective metrics , 2012, 2012 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[14]  Aseem Agarwala,et al.  Efficient gradient-domain compositing using quadtrees , 2007, ACM Trans. Graph..

[15]  Hanan Samet,et al.  The Quadtree and Related Hierarchical Data Structures , 1984, CSUR.

[16]  Meng Wang,et al.  2D-to-3D image conversion by learning depth from examples , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[17]  Stephen Gould,et al.  Single image depth estimation from predicted semantic labels , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  Timothy K. Shih,et al.  Digital Inpainting - Survey and Multilayer Image Inpainting Algorithms (Keynote Paper) , 2005 .

[19]  Marcus Barkowsky,et al.  Video quality assessment: From 2D to 3D — Challenges and future trends , 2010, 2010 IEEE International Conference on Image Processing.

[20]  Hans-Peter Seidel,et al.  Adaptive Image-space Stereo View Synthesis , 2010, VMV.

[21]  Jian Shi,et al.  Image Retargeting Using Mesh Parametrization , 2009, IEEE Transactions on Multimedia.

[22]  Doug Moore The cost of balancing generalized quadtrees , 1995, SMA '95.

[23]  Randall J. LeVeque,et al.  Modeling and simulating tsunamis with an eye to hazard mitigation , 2011 .

[24]  Alexandru Telea,et al.  An Image Inpainting Technique Based on the Fast Marching Method , 2004, J. Graphics, GPU, & Game Tools.

[25]  Guillermo Sapiro,et al.  Image inpainting , 2000, SIGGRAPH.

[26]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[27]  Ze-Nian Li,et al.  Review and Preview: Disocclusion by Inpainting for Image-Based Rendering , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[28]  Charles R. Dyer,et al.  The space efficiency of quadtrees , 1982, Comput. Graph. Image Process..

[29]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[30]  Maneesh Agrawala,et al.  Interactive video cutout , 2005, ACM Trans. Graph..

[31]  Lihi Zelnik-Manor,et al.  Context-aware saliency detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32]  Reinhard Koch,et al.  Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[33]  Bart Gerard Bernard Barenbrug Declipse 2: multi-layer image and depth with transparency made practical , 2009, Electronic Imaging.

[34]  V. Caselles,et al.  Exemplar-based Image Inpainting and Applications , 2011 .

[35]  M. Gross,et al.  Nonlinear disparity mapping for stereoscopic 3D , 2010, ACM Trans. Graph..

[36]  Dong Tian,et al.  View synthesis techniques for 3D video , 2009, Optical Engineering + Applications.

[37]  Shang-Hong Lai,et al.  Spatio-Temporally Consistent Novel View Synthesis Algorithm From Video-Plus-Depth Sequences for Autostereoscopic Displays , 2011, IEEE Transactions on Broadcasting.

[38]  Olga Sorkine-Hornung,et al.  Optimized scale-and-stretch for image resizing , 2008, SIGGRAPH Asia '08.

[39]  Stefan Heldmann,et al.  Adaptive Mesh Refinement for Nonparametric Image Registration , 2008, SIAM J. Sci. Comput..

[40]  Markus H. Gross,et al.  A system for retargeting of streaming video , 2009, ACM Trans. Graph..

[41]  Andriy Myronenko,et al.  Global active contour-based image segmentation via probability alignment , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Richard Szeliski,et al.  Layered depth images , 1998, SIGGRAPH.