Determi ling Linear Shape Change: Toward Automatic Ger Object Recognition Programs*

A 3D object localization task may be divided into two steps. First, one object appearance will be classified into one of the topologically equivalent classes of the 3D object appearances, referred to as aspects of the object (aspect classification). Then, the precise attitude and position of the object will be determined within one aspect (linear shape change determination (LSCD)). We have been working on designing a compiler which automatically generates an object localization program from a given object and sensor model; the compiler adheres to the above two-step strategy of aspect classification and linear change determination. For the first step, the compiler generates a decision tree structure program. Each branch node of the tree represents a neceSSary decision, such as comparing the area size of a visible face, to classify one object appearance into a smaller number of aspect groups. Along this tree, an object appearance is to be classified into one particular aspect at a leaf node of the tree. This paper will investigate the design of the compiler component to generate the second step. The compiler extends each leaf node of the tree and connects several nodes so that it performs the LSCD. The compiler chooses the largest 3D face as the primal face among several visible faces at the aspect corresponding to a leaf node. The compiler has priority rules which will select one particular method out of several possible ones; this rule defines a face coordinate system on a primal face. By using these rules, it analyzes the primal face, defines the face coordinate system on it, and registers the defining method to a node connected to the leaf node. The compiler also embodies the transformation from the face to the body coordinate system at the node. In order to increase the accuracy of the attitude and p i tion, the compiler further puts two more nodes at each branch of the program. The first node establishes correspondences between model edges and image edges. The second node iteratively solves the transformation equation to determine the object’s attitude and position using these correspondences. We have prepared a program library, which is a collection of prototypical objects to perform tasks mentioned before. In compile mode, the compiler

[1]  Katsushi Ikeuchi,et al.  Determining a Depth Map Using a Dual Photometric Stereo , 1987 .

[2]  Chris Goad,et al.  Special purpose automatic programming for 3D model-based vision , 1987 .

[3]  Berthold K. P. Horn Extended Gaussian images , 1984, Proceedings of the IEEE.

[4]  Katsushi Ikeuchi Recognition of 3-D Objects Using the Extended Gaussian Image , 1981, IJCAI.

[5]  Martial Hebert,et al.  Outdoor scene analysis using range data , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[6]  Katsushi Ikeuchi,et al.  Generating an interpretation tree from a CAD model for 3D-object recognition in bin-picking tasks , 1987, International Journal of Computer Vision.

[7]  Linda G. Shapiro A CAD-Model-Based System For Object Localization , 1988, Defense, Security, and Sensing.

[8]  Thomas C. Henderson,et al.  CAGD-Based Computer Vision , 1988, Defense, Security, and Sensing.

[9]  Precompiling a Geometrical Model into an Interpretation Tree for Object Recognition in Bin-picking Tasks , 1987 .

[10]  Takeo Kanade,et al.  Automatic generation of object recognition programs , 1988, Proc. IEEE.

[11]  Takeo Kanade,et al.  Model-Based Vision By Cooperative Processing Of Evidence And Hypotheses Using Configuration Spaces , 1988, Defense, Security, and Sensing.

[12]  Katsushi Ikeuchi,et al.  Minimum cost aspect classification: a module of a vision algorithm compiler , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[13]  Takeo Kanade,et al.  Model-Based Vision System by Object-Oriented Programming , 1988 .

[14]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[15]  Robert C. Bolles,et al.  3DPO: A Three- Dimensional Part Orientation System , 1986, IJCAI.

[16]  Takeo Kanade,et al.  Modeling sensor performance for model-based vision , 1988 .

[17]  Thomas C. Henderson,et al.  CAGD-Based Computer Vision , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Takeo Kanade,et al.  Towards automatic generation of object recognition programs , 1988 .

[19]  Kim L. Boyer,et al.  Robotic Manipulation Experiments Using Structural Stereopsis for 3D Vision , 1986, IEEE Expert.