Incremental Acquisition of a Three-Dimensional Scene Model from Images

We describe the current state of the 3-D Mosaic project, whose goal is to incrementally acquire a 3-D model of a complex urban scene from images. The notion of incremental acquisition arises from the observations that 1) single images contain only parfial information about a scene, 2) complex images are difficult to fully interpret, and 3) different features of a given scene tend to be easier to extract in different images because of differences in viewpoint and lighting conditions. In our approach, multiple images of the scene are sequentially analyzed so as to incrementaly construct the model. Each new image provides information which refines the model. We describe some experiments toward this end. Our method of extracting 3-D shape information from the images is stereo analysis. Because we are dealing with urban scenes, a junction-based matching technique proves very useful. This technique produces rather sparse wire-frame descriptions of the scene. A reasoning system that relies on task-specific knowledge generates an approximate model of the scene from the stereo output. Gray scale information is also acquired for the faces in the model. Finally, we describe an experiment in combining two views of the scene to obtain a rermed model.

[1]  A.V. Oppenheim,et al.  The importance of phase in signals , 1980, Proceedings of the IEEE.

[2]  Charles M. Eastman,et al.  Geometric modelling: a survey , 1979 .

[3]  Jon Doyle,et al.  A Truth Maintenance System , 1979, Artif. Intell..

[4]  Takeo Kanade,et al.  Using shadows in finding surface orientations , 1983, Comput. Vis. Graph. Image Process..

[5]  Paul S. Schenker,et al.  New sensor geometries for image processing: Computer vision in the polar exponential grid , 1981, ICASSP.

[6]  K. Ramesh Babu,et al.  Linear Feature Extraction and Description , 1979, IJCAI.

[7]  Marsha Jo Hannah,et al.  Computer matching of areas in stereo images. , 1974 .

[8]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[9]  Ruzena Bajcsy,et al.  Texture gradient as a depth cue , 1976 .

[10]  Steven Michael Rubin,et al.  The argos image understanding system. , 1978 .

[11]  David R. Smith,et al.  A Threshold Logic Network for Shape Invariance , 1967, IEEE Trans. Electron. Comput..

[12]  Robert L. Henderson,et al.  Automatic Stereo Reconstruction Of Man-Made Targets , 1979, Other Conferences.

[13]  Robert C. Bolles,et al.  Experiments in Map-Guided Photo Interpretation , 1977, IJCAI.

[14]  Robert N. Devich,et al.  Image Perspective Transformations , 1980, Optics & Photonics.

[15]  William B. Thompson,et al.  Disparity Analysis of Images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Sidney Liebes,et al.  Geometric Constraints For Interpreting Images Of Common Structural Elements: Orthogonal Trihedral Vertices , 1981, Other Conferences.

[17]  Steven M. Rubin,et al.  Natural scene recognition using locus search , 1980 .

[18]  Mj Martin Bastiaans The Wigner distribution function applied to optical signals and systems , 1978 .

[19]  Charles M. Eastman A unified view of solid shape modeling based on consistency verification , 1982 .

[20]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[21]  A. Lohmann,et al.  The wigner distribution function and its optical production , 1980 .

[22]  Thomas O. Binford,et al.  Depth from Edge and Intensity Based Stereo , 1981, IJCAI.

[23]  D Marr,et al.  A computational theory of human stereo vision. , 1979, Proceedings of the Royal Society of London. Series B, Biological sciences.