Bayesian reconstruction of 3D shapes and scenes from a single image

Human vision routinely perceives full 3D shapes and scenes from a single 2D image, with the occluded parts "filled in" by prior visual knowledge. We represent prior knowledge of 3D shapes and scenes by probabilistic models at two levels, both defined on graphs. The first-level model is built on a graph representation of single objects; it is a mixture model covering both man-made block objects and natural objects such as trees and grasses. It assumes surface and boundary smoothness, 3D angle symmetry, etc. The second-level model is built on the relation graph of all objects in a scene. It assumes that objects should be supported for maximum stability, with global bounding surfaces such as the ground, sky and walls. Given an input image, we extract the geometric and photometric structures through image segmentation and sketching, and represent them in one large graph. We then partition the graph into subgraphs, each corresponding to an object; infer the 3D shape and recover the occluded surfaces, edges and vertices in each subgraph; and infer the scene structure relating the recovered 3D subgraphs. The inference algorithm samples from the prior model under the constraint that it reproduces the observed image/sketch under projective geometry.
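The idea of sampling from a shape prior while keeping the projection fixed can be illustrated with a toy sketch. It is not the algorithm of this paper; it makes several simplifying assumptions: orthographic rather than projective geometry (so each hypothesized 3D vertex keeps its observed image coordinates and only its depth is sampled), a hypothetical prior energy that favors right angles at listed corners, and a plain Metropolis sampler. The names `angle_energy` and `sample_depths` are illustrative only.

```python
import math
import random


def angle_energy(points2d, depths, corners):
    """Hypothetical prior energy: sum of squared cosines at the listed corners.

    Each corner (a, b, c) is the angle at vertex b between edges b->a and b->c.
    The energy is zero when every listed angle is a right angle in 3D.
    """
    pts = [(x, y, z) for (x, y), z in zip(points2d, depths)]
    energy = 0.0
    for a, b, c in corners:
        u = [pts[a][k] - pts[b][k] for k in range(3)]
        v = [pts[c][k] - pts[b][k] for k in range(3)]
        dot = sum(ui * vi for ui, vi in zip(u, v))
        norm_u = math.sqrt(sum(ui * ui for ui in u))
        norm_v = math.sqrt(sum(vi * vi for vi in v))
        energy += (dot / (norm_u * norm_v)) ** 2
    return energy


def sample_depths(points2d, corners, steps=5000, step_size=0.2,
                  temperature=0.05, seed=0):
    """Metropolis sampling over vertex depths; image coordinates never change,
    so every sample reproduces the observed 2D sketch (orthographic assumption)."""
    rng = random.Random(seed)
    depths = [0.0] * len(points2d)
    energy = angle_energy(points2d, depths, corners)
    best_depths, best_energy = list(depths), energy
    for _ in range(steps):
        # Vertex 0 stays at depth 0 to remove the global depth offset.
        i = rng.randrange(1, len(depths))
        proposal = list(depths)
        proposal[i] += rng.gauss(0.0, step_size)
        new_energy = angle_energy(points2d, proposal, corners)
        accept = (new_energy <= energy or
                  rng.random() < math.exp((energy - new_energy) / temperature))
        if accept:
            depths, energy = proposal, new_energy
            if energy < best_energy:
                best_depths, best_energy = list(depths), energy
    return best_depths, best_energy


# A parallelogram in the image, interpreted as a possibly tilted rectangle.
points2d = [(0.0, 0.0), (2.0, 0.0), (3.0, 1.0), (1.0, 1.0)]
corners = [(3, 0, 1), (0, 1, 2), (1, 2, 3), (2, 3, 0)]
best_depths, best_energy = sample_depths(points2d, corners)
```

Because the depths are the only free variables, the projection constraint holds by construction; a full treatment along the lines of the abstract would also propose changes to the graph structure (for example, adding occluded vertices and edges), which this sketch omits.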
