Planning multiple observations for object recognition

Most computer vision systems perform object recognition on the basis of the features extracted from a single image of the object. The problem with this approach is that it implicitly assumes that the available features are sufficient to determine the identity and pose of the object uniquely. If this assumption is not met, then the feature set is insufficient, and ambiguity results. Consequently, much research in computer vision has gone toward finding sets of features that are sufficient for specific tasks, with the result that each system has its own associated set of features. A single, general feature set would be desirable. However, research in automatic generation of object recognition programs has demonstrated that predetermined, fixed feature sets are often incapable of providing enough information to unambiguously determine either object identity or pose. One approach to overcoming the inadequacy of any feature set is to utilize multiple sensor observations obtained from different viewpoints, and combine them with knowledge of the 3-D structure of the object to perform unambiguous object recognition. This article presents initial results toward performing object recognition by using multiple observations to resolve ambiguities. Starting from the premise that sensor motions should be planned in advance, the difficulties involved in planning with ambiguous information are discussed. A representation for planning that combines geometric information with viewpoint uncertainty is presented. A sensor planner utilizing the representation was implemented, and the results of pose-determination experiments performed with the planner are discussed.

[1]  Avinash C. Kak,et al.  Planning sensing strategies in a robot work cell with multi-sensor capabilities , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[2]  Allen M. Waxman,et al.  Adaptive 3-D Object Recognition from Multiple Views , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  C. H. Liu,et al.  3-D curved object recognition from multiple 2-D camera views , 1990 .

[4]  Takeo Kanade,et al.  Model based recognition of specular objects using sensor models , 1992, CVGIP Image Underst..

[5]  Peter Kovesi,et al.  Automatic Sensor Placement from Vision Task Requirements , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Ruzena Bajcsy,et al.  How to Decide From the First View Where to Look Next , 1990 .

[7]  Robert M. Haralick,et al.  PREMIO: an overview (object recognition) , 1991, [1991 Proceedings] Workshop on Directions in Automated CAD-Based Vision.

[8]  Robert M. Haralick,et al.  Automatic sensor and light source positioning for machine vision , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[9]  Chris Goad,et al.  Special purpose automatic programming for 3D model-based vision , 1987 .

[10]  Kenneth Y. Goldberg,et al.  Bayesian grasping , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[11]  Cheng-Hsiung Liu,et al.  3D Curved Object Recognition from Multiple 2D Camera Views , 1990, Comput. Vis. Graph. Image Process..

[12]  Katsushi Ikeuchi,et al.  Generating an interpretation tree from a CAD model for 3D-object recognition in bin-picking tasks , 1987, International Journal of Computer Vision.

[13]  Robert J. Safranek,et al.  Evidence accumulation using binary frames of discernment for verification vision , 1990, IEEE Trans. Robotics Autom..

[14]  J. Koenderink,et al.  The internal representation of solid shape with respect to vision , 1979, Biological Cybernetics.

[15]  Ming Tan,et al.  CSL: a cost-sensitive learning system for sensing and grasping objects , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[16]  Robert C. Bolles,et al.  3DPO: A Three- Dimensional Part Orientation System , 1986, IJCAI.

[17]  J. K. Aggarwal,et al.  Automatic generation of recognition strategies using CAD models , 1991, [1991 Proceedings] Workshop on Directions in Automated CAD-Based Vision.

[18]  Anil K. Jain,et al.  BONSAI: 3D Object Recognition Using Constrained Search , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[20]  Katsushi Ikeuchi,et al.  Determining linear shape change: Toward automatic generation of object recognition programs, , 1991, CVGIP Image Underst..

[21]  Takeo Kanade,et al.  Automatic generation of object recognition programs , 1988, Proc. IEEE.

[22]  Randy C. Brost,et al.  Automatic Grasp Planning in the Presence of Uncertainty , 1988, Int. J. Robotics Res..

[23]  Thomas C. Henderson,et al.  CAGD-Based Computer Vision , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Katsushi Ikeuchi,et al.  Appearance-Based Vision and the Automatic Generation of Object Recognition Programs , 1992 .

[25]  Katsushi Ikeuchi,et al.  Minimum cost aspect classification: a module of a vision algorithm compiler , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[26]  Charles R. Dyer,et al.  Model-based recognition in robot vision , 1986, CSUR.

[27]  Katsushi Ikeuchi,et al.  Appearance Simulator for Computer Vision Research. , 1991 .