Using Perceptual Organization to Extract 3-D Structures

The authors describe an approach to perceptual grouping for detecting and describing 3-D objects in complex images and apply it to the task of detecting and describing complex buildings in aerial images. They argue that representations of structural relationships in the arrangements of primitive image features, as detected by the perceptual organization process, are essential for analyzing complex imagery. They term these representations collated features. The choice of collated features is determined by the generic shape of the desired objects in the scene. The detection process for collated features is more robust than the local operations for region segmentation and contour tracing. The important structural information encoded in collated features aids various visual tasks such as object segmentation, correspondence processes, and shape description. The proposed method initially detects all reasonable feature groupings. A constraint satisfaction network is then used to model the complex interactions between the collations and select the promising ones. Stereo matching is performed on the collations to obtain height information. This aids in further reasoning on the collated features and results in the 3-D description of the desired objects. >

[1]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[2]  R. Kelly,et al.  The Gestalt Photomapping System , 1977 .

[3]  M. Arbib Cooperative computation as a concept for brain theory , 1979 .

[4]  K. Ramesh Babu,et al.  Linear Feature Extraction and Description , 1979, IJCAI.

[5]  D. Katz Gestalt Psychology: Its Nature and Significance , 1979 .

[6]  Jerome A. Feldman,et al.  Connectionist Models and Their Properties , 1982, Cogn. Sci..

[7]  Thomas O. Binford,et al.  Survey of Model-Based Image Analysis Systems , 1982 .

[8]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Shun-ichi Amari,et al.  Competitive and Cooperative Aspects in Dynamics of Neural Excitation and Self-Organization , 1982 .

[10]  A. Treisman Perceptual grouping and attention in visual search for features and for objects. , 1982, Journal of experimental psychology. Human perception and performance.

[11]  Steven W. Zucker,et al.  On the Foundations of Relaxation Labeling Processes , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Ramakant Nevatia,et al.  Detection of Buildings in Aerial Images Using Shape and Shadows , 1983, IJCAI.

[13]  Thomas O. Binford,et al.  Perceptual Organization as a Basis for Visual Recognition , 1983, AAAI.

[14]  S. Zucker Computational and Psychophysical Experiments in Grouping: Early Orientation Selection , 1983 .

[15]  Geoffrey E. Hinton,et al.  Massively Parallel Architectures for AI: NETL, Thistle, and Boltzmann Machines , 1983, AAAI.

[16]  Geoffrey E. Hinton,et al.  Parallel visual computation , 1983, Nature.

[17]  S. Palmer The Psychology of Perceptual Organization: A Transformational Approach , 1983 .

[18]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[19]  A. Witkin,et al.  On the Role of Structure in Vision , 1983 .

[20]  J J Hopfield,et al.  Neurons with graded response have collective computational properties like those of two-state neurons. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Takashi Matsuyama,et al.  SIGMA: A Framework for Image Understanding - Integration of Bottom-Up and Top-Down Analysis , 1985, IJCAI.

[23]  John P. McDermott,et al.  Rule-Based Interpretation of Aerial Imagery , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  M. Arbib Brain theory and cooperative computation. , 1985, Human neurobiology.

[25]  Ramakant Nevatia,et al.  Segment-based stereo matching , 1985, Comput. Vis. Graph. Image Process..

[26]  Takeo Kanade,et al.  Stereo by Intra- and Inter-Scanline Search Using Dynamic Programming , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  J. Hopfield,et al.  Computing with neural circuits: a model. , 1986, Science.

[28]  Robert C. Bolles,et al.  Perceptual Organization and Curve Partitioning , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[30]  Pascal Fua,et al.  Using Generic Geometric Models for Intelligent Shape Extraction , 1987, AAAI.

[31]  T. Kanade,et al.  The 3D MOSAIC scene understanding system: incremental reconstruction of 3D scenes for complex images , 1987 .

[32]  Geoffrey E. Hinton,et al.  Connectionist Architectures for Artificial Intelligence , 1990, Computer.

[33]  Ramakant Nevatia,et al.  Detecting buildings in aerial images , 1988, Comput. Vis. Graph. Image Process..

[34]  Ramakant Nevatia,et al.  Stereo Error Detection, Correction, and Evaluation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Ramakant Nevatia,et al.  Segmentation and description based on perceptual organization , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .