High-Quality Depth Estimation Using an Exemplar 3D Model for Stereo Conversion

High-quality depth painting for each object in a scene is a challenging task in 2D to 3D stereo conversion. One way to accurately estimate the varying depth within the object in an image is to utilize existing 3D models. Automatic pose estimation approaches based on 2D-3D feature correspondences have been proposed to obtain depth from a given 3D model. However, when the 3D model is not identical to the target object, previous methods often produce erroneous depth in the vicinity of the silhouette of the object. This paper introduces a novel 3D model-based depth estimation method that effectively produces high-quality depth information for rigid objects in a stereo conversion workflow. Given an exemplar 3D model and user correspondences, our method generates detailed depth of an object by optimizing the initial depth obtained by the application of structural fitting and silhouette matching in the image domain. The final depth is accurate up to the given 3D model, while consistent with the image. Our method was applied to various image sequences containing objects with different appearances and varying poses. The experiments show that our method can generate plausible depth information that can be utilized for high-quality 2D to 3D stereo conversion.

[1]  Bodo Rosenhahn,et al.  Three-Dimensional Shape Knowledge for Joint Image Segmentation and Pose Tracking , 2007, International Journal of Computer Vision.

[2]  Alla Sheffer,et al.  Modeling from contour drawings , 2009, SBIM '09.

[3]  Marc Alexa,et al.  A sketch-based interface for detail-preserving mesh editing , 2005, SIGGRAPH 2005.

[4]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[5]  M. Gross,et al.  Nonlinear disparity mapping for stereoscopic 3D , 2010, ACM Trans. Graph..

[6]  Sylvain Lefebvre,et al.  State of the Art in Example-based Texture Synthesis , 2009, Eurographics.

[7]  Wojciech Matusik,et al.  Articulated mesh animation from multi-view silhouettes , 2008, ACM Trans. Graph..

[8]  Richard M. Murray,et al.  A Mathematical Introduction to Robotic Manipulation , 1994 .

[9]  Christian Rössl,et al.  Laplacian surface editing , 2004, SGP '04.

[10]  Simon Baker,et al.  2D vs. 3D Deformable Face Models: Representational Power, Construction, and Real-Time Fitting , 2007, International Journal of Computer Vision.

[11]  Bodo Rosenhahn,et al.  Pose Estimation of Free-Form Objects , 2004, ECCV.

[12]  Kyunghan Lee,et al.  Depth manipulation using disparity histogram analysis for stereoscopic 3D , 2013, The Visual Computer.

[13]  Antonio Susín,et al.  Transferring the Rig and Animations from a Character to Different Face Models , 2008, Comput. Graph. Forum.

[14]  Younghui Kim,et al.  Video Panorama for 2D to 3D Conversion , 2012, Comput. Graph. Forum.

[15]  Frédo Durand,et al.  A gentle introduction to bilateral filtering and its applications , 2007, SIGGRAPH Courses.

[16]  Daniel Cohen-Or,et al.  Semi-automatic stereo extraction from video footage , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17]  Michael J. Black,et al.  Estimating human shape and pose from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[18]  Jun-yong Noh,et al.  Terrain Geometry from Monocular Image Sequences , 2008, J. Comput. Sci. Eng..

[19]  Sing Bing Kang,et al.  Depth Director: A System for Adding Depth to Movies , 2011, IEEE Computer Graphics and Applications.

[20]  Liang Zhang,et al.  Stereoscopic image generation based on depth images for 3D TV , 2005, IEEE Transactions on Broadcasting.

[21]  Matthew Turk,et al.  A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[22]  Andrew P. Van Pernis,et al.  Dimensionalization: converting 2D films to 3D , 2008, Electronic Imaging.

[23]  Peter Kauff,et al.  Three-Dimensional Video Postproduction and Processing , 2011, Proceedings of the IEEE.

[24]  Yeongho Seol,et al.  Artist friendly facial animation retargeting , 2011, ACM Trans. Graph..

[25]  Hao Zhang,et al.  Photo-inspired model-driven 3D object modeling , 2011, SIGGRAPH 2011.

[26]  Marc Alexa,et al.  A sketch-based interface for detail-preserving mesh editing , 2007, SIGGRAPH Courses.

[27]  Patrick Pérez,et al.  Object removal by exemplar-based inpainting , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[28]  Michael F. Cohen,et al.  Optimized Color Sampling for Robust Matting , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Yaser Sheikh,et al.  3D object manipulation in a single photograph using stock 3D models , 2014, ACM Trans. Graph..

[30]  Joseph L. Mundy,et al.  Predicting high resolution image edges with a generic, adaptive, 3-D vehicle model , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Markus H. Gross,et al.  StereoBrush: interactive 2D to 3D conversion using discontinuous warps , 2011, SBIM '11.

[32]  Kyehyun Kim,et al.  Stereoscopic image generation of background terrain scenes , 2011, Comput. Animat. Virtual Worlds.

[33]  Yun Zhang,et al.  StereoPasting: Interactive Composition in Stereoscopic Images , 2013, IEEE Transactions on Visualization and Computer Graphics.

[34]  Dani Lischinski,et al.  Colorization using optimization , 2004, ACM Trans. Graph..