Perceptual Organization as a Basis for Visual Recognition

Evidence is presented showing that bottom-up grouping of image features is usually prerequisite to the recognition and interpretation of images. We describe three functions of these groupings: 1) segmentation, 2) three-dimensional interpretation, and 3) stable descriptions for accessing object models. Several principles are hypothesized for determining which image relations should he formed: relations are significant to the extent that they are unlikely to have arisen by accident from the surrounding distribution of features, relations can only be formed where there are few alternatives within the same proximity, and relations must be based on properties which are invariant over a range of imaging conditions. Using these principles we develop an algorithm for curve segmentation which detects significant structure at multiple resolutions, including the linking of segments on the basis of curvilinearity. The algorithm is able to detect structures which no single-resolution algorithm could detect. Its performance is demonstrated on synthetic and natural image data.

[1]  R. Leeper A Study of a Neglected Portion of the Field of Learning—the Development of Sensory Organization , 1935 .

[2]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[3]  Ramakant Nevatia,et al.  Description and Recognition of Curved Objects , 1977, Artif. Intell..

[4]  Michael L. Baird Structural Pattern Recognition , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[6]  Thomas O. Binford,et al.  The Interpretation of Three-Dimensional Structure from Image Curves , 1981, IJCAI.

[7]  Thomas O. Binford,et al.  Inferring Surfaces from Images , 1981, Artif. Intell..

[8]  A. Witkin,et al.  On the Role of Structure in Vision , 1983 .