Intermediate level picture interpretation using complete two-dimensional models

Abstract An intermediate level of computer vision is presented which is not general purpose as is the case with low level picture segmentation, nor is it based on semantic information processing as is usually prescribed for high level so-called cognitive processors. The input data to this stage is a partially segmented picture in that the given regions do not necessarily correspond to recognizable objects. In addition it is assumed that a complete two-dimensional picture model is available in a knowledge database containing information describing the objects appearing in the image, as well as their spatial structure. The picture interpretation involves two stages of processing, one local, and the other global. The first stage invokes local template matching whereby heuristic search is used to compare and match groups of adjacent regions against all stored object prototypes. This is followed by the second stage, an optimal search achieved by dynamic programming. Here the knowledge of the topological constraints is used to eliminate incorrect interpretations, resolving the competition between possible local object matches and the permissible global structure. The output of this intermediate level of processing is then an ordered list of possible symbolic interpretations for the input partially segmented regions. A confidence level is associated with each interpretation. Examples of face interpretation and object recognition are discussed.

[1]  Jerome A. Feldman,et al.  Decision Theory and Artificial Intelligence: I. A Semantics-Based Region Analyzer , 1974, Artif. Intell..

[2]  Samir I. Shaheen,et al.  A Modular Computer Vision System for Picture Segmentation and Interpretation , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[4]  Dana H. Ballard,et al.  An Approach to Knowledge-Directed Image Analysis , 1977, IJCAI.

[5]  Ugo Montanari,et al.  Optimization Methods in Image Processing , 1974, IFIP Congress.

[6]  Franco P. Preparata,et al.  An approach to artificial nonsymbolic cognition , 1972, Inf. Sci..

[7]  Michael A. Arbib,et al.  Computational Techniques in the Visual Segmentation of Static Scenes. , 1977 .

[8]  Franz L. Alt,et al.  Digital Pattern Recognition by Moments , 1962, JACM.

[9]  David Marr,et al.  Representing Visual Information , 1977 .

[10]  Claude L. Fennema,et al.  Scene Analysis Using Regions , 1970, Artif. Intell..

[11]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[12]  Harry G. Barrow,et al.  Experiments in Interpretation-Guided Segmentation , 1977, Artificial Intelligence.

[13]  John Freeman,et al.  The modelling of spatial relations , 1975 .

[14]  Nils J. Nilsson,et al.  Problem-solving methods in artificial intelligence , 1971, McGraw-Hill computer science series.

[15]  Gösta H. Granlund,et al.  Fourier Preprocessing for Hand Print Character Recognition , 1972, IEEE Transactions on Computers.