Modelling and Interpretation of Architecture from Several Images

This paper describes the automatic acquisition of three dimensional architectural models from short image sequences. The approach is Bayesian and model based. Bayesian methods necessitate the formulation of a prior distribution; however designing a generative model for buildings is a difficult task. In order to overcome this a building is described as a set of walls together with a ‘Lego’ kit of parameterised primitives, such as doors or windows. A prior on wall layout, and a prior on the parameters of each primitive can then be defined. Part of this prior is learnt from training data and part comes from expert architects. The validity of the prior is tested by generating example buildings using MCMC and verifying that plausible buildings are generated under varying conditions. The same MCMC machinery can also be used for optimising the structure recovery, this time generating a range of possible solutions from the posterior. The fact that a range of solutions can be presented allows the user to select the best when the structure recovery is ambiguous.

[1]  N. Metropolis,et al.  The Monte Carlo method. , 1949 .

[2]  R. Bellman Dynamic programming. , 1957, Science.

[3]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[4]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[5]  William J. Mitchell,et al.  The Palladian Grammar , 1978 .

[6]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[7]  William J. Mitchell,et al.  Counting Palladian Plans , 1978 .

[8]  George Stiny,et al.  An Evaluation of Palladian Plans , 1978 .

[9]  G. Stiny Introduction to Shape and Shape Grammars , 1980 .

[10]  Soren W. Henriksen,et al.  Manual of photogrammetry , 1980 .

[11]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[12]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[13]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[14]  T. J. Lauterborn American Society Of Photogrammetry , 1984 .

[15]  Irving Biederman,et al.  Human image understanding: Recent research and a theory , 1985, Comput. Vis. Graph. Image Process..

[16]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Christopher G. Harris,et al.  Determination of Ego-Motion from Matched Points , 1987, Alvey Vision Conference.

[18]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[19]  Christopher G. Harris,et al.  3D positional integration from image sequences , 1988, Image Vis. Comput..

[20]  Robert B. Fisher From surfaces to objects - computer vision and three dimensional scene analysis , 1989 .

[21]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[22]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Richard I. Hartley,et al.  Estimation of Relative Camera Positions for Uncalibrated Cameras , 1992, ECCV.

[24]  Katsushi Ikeuchi,et al.  Why aspect graphs are not (yet) practical for computer vision , 1991, [1991 Proceedings] Workshop on Directions in Automated CAD-Based Vision.

[25]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[26]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[27]  Azriel Rosenfeld,et al.  From volumes to views: An approach to 3-D object recognition , 1992, CVGIP Image Underst..

[28]  Michal Irani,et al.  Motion Analysis for Image Enhancement: Resolution, Occlusion, and Transparency , 1993, J. Vis. Commun. Image Represent..

[29]  Roberto Cipolla,et al.  Robust structure from motion using motion parallax , 1993, 1993 (4th) International Conference on Computer Vision.

[30]  Edward H. Adelson,et al.  Layered representation for motion analysis , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Richard I. Hartley,et al.  Euclidean Reconstruction from Uncalibrated Views , 1993, Applications of Invariance in Computer Vision.

[32]  Michael J. Black,et al.  Mixture models for optical flow computation , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Robert T. Collins,et al.  Projective reconstruction of approximately planar scenes , 1993, Other Conferences.

[34]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[35]  Harpreet S. Sawhney Simplifying motion and structure analysis using planar parallax and image warping , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[36]  Michael I. Miller,et al.  REPRESENTATIONS OF KNOWLEDGE IN COMPLEX SYSTEMS , 1994 .

[37]  Arthur R. Pope Model-Based Object Recognition - A Survey of Recent Research , 1994 .

[38]  Peter Cliiord,et al.  A Metropolis Sampler for Polygonal Image ReconstructionPeter , 1995 .

[39]  David W. Murray,et al.  A unifying framework for structure and motion recovery from image sequences , 1995, Proceedings of IEEE International Conference on Computer Vision.

[40]  Geoffrey D. Sullivan,et al.  A Generic Deformable Model for Vehicle Recognition , 1995, BMVC.

[41]  Richard I. Hartley,et al.  In defence of the 8-point algorithm , 1995, Proceedings of IEEE International Conference on Computer Vision.

[42]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[43]  Rachid Deriche,et al.  A Robust Technique for Matching two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry , 1995, Artif. Intell..

[44]  Harpreet S. Sawhney,et al.  Layered representation of motion video using robust maximum-likelihood estimation of mixture models and MDL encoding , 1995, Proceedings of IEEE International Conference on Computer Vision.

[45]  Jitendra Malik,et al.  Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[46]  Truong Q. Nguyen,et al.  Wavelets and filter banks , 1996 .

[47]  Robert B. Fisher,et al.  Recognition of Geons by Parametric Deformable Contour Models , 1996, ECCV.

[48]  Paul A. Beardsley,et al.  3D Model Acquisition from Extended Image Sequences , 1996, ECCV.

[49]  Edward H. Adelson,et al.  A unified mixture framework for motion segmentation: incorporating spatial coherence and estimating the number of models , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[50]  Ingemar J. Cox,et al.  A Maximum Likelihood Stereo Algorithm , 1996, Comput. Vis. Image Underst..

[51]  Bill Triggs,et al.  Factorization methods for projective structure and motion , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[52]  Maurizio Pilu,et al.  Recovery of Generic Solid Parts by Parametrically Deformable Aspects , 1996, ECCV 1996.

[53]  John Skilling,et al.  Data analysis : a Bayesian tutorial , 1996 .

[54]  Robert B. Fisher,et al.  Class-based recognition of 3D objects represented by volumetric primitives , 1997, Image Vis. Comput..

[55]  Bill Triggs,et al.  Autocalibration and the absolute quadric , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[56]  Sylvia Richardson,et al.  Markov Chain Monte Carlo in Practice , 1997 .

[57]  Lutz Falkenhagen Hierarchical Block-Based Disparity Estimation Considering Neighbourhood Constraints , 1997 .

[58]  Sven J. Dickinson,et al.  Panel report: the potential of geons for generic 3-D object recognition , 1997, Image Vis. Comput..

[59]  R. Hartley Triangulation, Computer Vision and Image Understanding , 1997 .

[60]  Michal Irani,et al.  Recovery of Ego-Motion Using Region Alignment , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[61]  Cordelia Schmid,et al.  Automatic line matching across views , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[62]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[63]  Robert B. Fisher,et al.  Improving model shape acquisition by incorporating geometric constraints , 1997, BMVC.

[64]  Mei Han,et al.  Interactive construction of 3D models from panoramic mosaics , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[65]  P. Green,et al.  Boundary detection through dynamic polygons , 1998 .

[66]  Reinhard Koch,et al.  Metric 3D Surface Reconstruction from Uncalibrated Image Sequences , 1998, SMILE.

[67]  Richard Szeliski,et al.  A layered approach to stereo reconstruction , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[68]  Daphna Weinshall,et al.  From Reference Frames to Reference Planes: Multi-View Parallax Geometry and Applications , 1998, ECCV.

[69]  Heung-Yeung Shum,et al.  Panoramic Image Mosaics , 1998 .

[70]  Andrew Zisserman,et al.  Metric rectification for perspective images of planes , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[71]  Reinhard Koch,et al.  Self-Calibration and Metric Reconstruction Inspite of Varying and Unknown Intrinsic Camera Parameters , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[72]  Andrew W. Fitzgibbon,et al.  Automatic Camera Recovery for Closed or Open Image Sequences , 1998, ECCV.

[73]  Luc Van Gool,et al.  Automatic Modeling and 3D Reconstruction of Urban House Roofs from High Resolution Aerial Imagery , 1998, ECCV.

[74]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[75]  David A. Forsyth,et al.  Bayesian structure from motion , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[76]  Ian H. Jermyn,et al.  Globally optimal regions and boundaries , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[77]  Stan Birchfield,et al.  Depth and motion discontinuities , 1999 .

[78]  Richard Szeliski,et al.  An integrated Bayesian approach to layer extraction from image sequences , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[79]  Andrew Zisserman,et al.  Automatic reconstruction of piecewise planar models from multiple views , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[80]  Stephen J. Maybank,et al.  A Method for Interactive 3D Reconstruction of Piecewise Planar Objects from Single Images , 1999, BMVC.

[81]  Michael Isard,et al.  Object localization by Bayesian correlation , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[82]  Cordelia Schmid,et al.  AUTOMATIC LINE MATCHING AND 3D RECONSTRUCTION OF BUILDINGS FROM MULTIPLE VIEWS , 1999 .

[83]  S. Pinker How the Mind Works , 1999, Annals of the New York Academy of Sciences.

[84]  Alexei A. Efros,et al.  Texture synthesis by non-parametric sampling , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[85]  Roberto Cipolla,et al.  PhotoBuilder-3D models of architectural scenes from uncalibrated images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[86]  John Oliensis,et al.  An Experimental Study of Projective Structure From Motion , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[87]  Paulo R. S. Mendonça,et al.  A simple technique for self-calibration , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[88]  Antonio Criminisi,et al.  Creating Architectural Models from Images , 1999, Comput. Graph. Forum.

[89]  Philip F. Mclauchlan,et al.  The Variable State Dimension Filter applied to Surface-Based Structure from Motion , 1999 .

[90]  P. Anandan,et al.  Factorization with Uncertainty , 2000, ECCV.

[91]  Roberto Cipolla,et al.  Automatic 3D Modelling of Architecture , 2000, BMVC.

[92]  John Oliensis,et al.  A Critique of Structure-from-Motion Algorithms , 2000, Comput. Vis. Image Underst..

[93]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[94]  P. Torr,et al.  Layer Extraction with a Bayesian Model of Shapes , 2000, ECCV.

[95]  Frank Dellaert,et al.  Structure from motion without correspondence , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[96]  Bill Triggs,et al.  Plane+Parallax, Tensors and Factorization , 2000, ECCV.

[97]  David Nistér,et al.  Frame Decimation for Structure and Motion , 2000, SMILE.

[98]  Andrew Blake,et al.  Statistical Foreground Modelling for Object Localisation , 2000, ECCV.

[99]  T. Kanade,et al.  SUPER-RESOLUTION: RECONSTRUCTION OR RECOGNITION? , 2001 .

[100]  Andrew Zisserman,et al.  Automated architectural acquisition from a camera undergoing planar motion , 2001 .

[101]  Peter Sturm Critical motion sequences for the self-calibration of cameras and stereo systems with variable focal length , 2002, Image Vis. Comput..

[102]  Larry H. Matthies,et al.  Kalman filter-based algorithms for estimating depth from image sequences , 1989, International Journal of Computer Vision.

[103]  Aaron F. Bobick,et al.  Large Occlusion Stereo , 1999, International Journal of Computer Vision.

[104]  B. Caprile,et al.  Using vanishing points for camera calibration , 1990, International Journal of Computer Vision.

[105]  J. Koenderink,et al.  The internal representation of solid shape with respect to vision , 1979, Biological Cybernetics.

[106]  Eero P. Simoncelli,et al.  A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients , 2000, International Journal of Computer Vision.

[107]  Paul A. Beardsley,et al.  Sequential Updating of Projective and Affine Structure from Motion , 1997, International Journal of Computer Vision.

[108]  Andrew W. Fitzgibbon,et al.  The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences , 1999, International Journal of Computer Vision.

[109]  Quang-Tuan Luong,et al.  Self-Calibration of a Moving Camera from Point Correspondences and Fundamental Matrices , 1997, International Journal of Computer Vision.

[110]  Gérard G. Medioni,et al.  Perceptual grouping for generic recognition , 2004, International Journal of Computer Vision.

[111]  Cordelia Schmid,et al.  The Geometry and Matching of Lines and Curves Over Multiple Views , 2000, International Journal of Computer Vision.

[112]  Peter N. Belhumeur,et al.  A Bayesian approach to binocular steropsis , 1996, International Journal of Computer Vision.

[113]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[114]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[115]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[116]  An Luo,et al.  An intensity-based cooperative bidirectional stereo matching with simultaneous detection of discontinuities and occlusions , 1995, International Journal of Computer Vision.

[117]  Alan L. Yuille,et al.  Occlusions and binocular stereo , 1992, International Journal of Computer Vision.

[118]  Alexandru Tupan,et al.  Triangulation , 1997, Comput. Vis. Image Underst..

[119]  E. H. Thompson A RATIONAL ALGEBRAIC FORMULATION OF THE PROBLEM OF RELATIVE ORIENTATION , 2006 .