Cooperative Spatial Reasoning for Image Understanding

Spatial Reasoning, reasoning about spatial information (i.e. shape and spatial relations), is a crucial function of image understanding and computer vision systems. This paper proposes a novel spatial reasoning scheme for image understanding and demonstrates its utility and effectiveness in two different systems: region segmentation and aerial image understanding systems. The scheme is designed based on a so-called Multi-Agent/Cooperative Distributed Problem Solving Paradigm, where a group of intelligent agents cooperate with each other to fulfill a complicated task. The first part of the paper describes a cooperative distributed region segmentation system, where each region in an image is regarded as an agent. Starting from seed regions given at the initial stage, region agents deform their shapes dynamically so that the image is partitioned into mutually disjoint regions. The deformation of each individual region agent is realized by the snake algorithm14 and neighboring region agents cooperate with each other to find common region boundaries between them. In the latter part of the paper, we first give a brief description of the cooperative spatial reasoning method used in our aerial image understanding system SIGMA. In SIGMA, each recognized object such as a house and a road is regarded as an agent. Each agent generates hypotheses about its neighboring objects to establish spatial relations and to detect missing objects. Then, we compare its reasoning method with that used in the region segmentation system. We conclude the paper by showing further utilities of the Multi-gent/Cooperative Distributed Problem Solving Paradigm for image understanding.

[1]  Makoto Nagao,et al.  A Structural Analysis of Complex Aerial Photographs , 1980, Advanced Applications in Pattern Recognition.

[2]  Laurent D. Cohen,et al.  On active contour models and balloons , 1991, CVGIP Image Underst..

[3]  John P. McDermott,et al.  Rule-Based Interpretation of Aerial Imagery , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Naokazu Yokoya,et al.  Image segmentation schema for low-level computer vision , 1981, Pattern Recognit..

[5]  Yuichi Ohta,et al.  Cooperative integration of multiple stereo algorithms , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[6]  Robert C. Bolles,et al.  3DPO: A Three- Dimensional Part Orientation System , 1986, IJCAI.

[7]  Saburo Tsuji,et al.  Minimal region extraction using expanding active contours , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol. III. Conference C: Image, Speech and Signal Analysis,.

[8]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[9]  Ramakant Nevatia,et al.  Using Perceptual Organization to Extract 3-D Structures , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Thomas O. Binford,et al.  Survey of Model-Based Image Analysis Systems , 1982 .

[11]  Keith E. Price,et al.  Relaxation Matching Techniques-A Comparison , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  P. M. Narendra,et al.  Image Segmentation with Directed Trees , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Tai Sing Lee,et al.  Region competition: unifying snakes, region growing, energy/Bayes/MDL for multi-band image segmentation , 1995, Proceedings of IEEE International Conference on Computer Vision.

[14]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Trans. Syst. Man Cybern..

[15]  G.B. Coleman,et al.  Image segmentation by clustering , 1979, Proceedings of the IEEE.

[16]  Reid G. Smith,et al.  The Contract Net Protocol: High-Level Communication and Control in a Distributed Problem Solver , 1980, IEEE Transactions on Computers.

[17]  Theodosios Pavlidis,et al.  Picture Segmentation by a Tree Traversal Algorithm , 1976, JACM.