A comparison of four fast vision based object recognition methods for programming by demonstration applications

Service robots require interactive programming interfaces that allow users without programming experience to easily instruct the robots. Systems following the programming-by-demonstration (PbD) paradigm are getting closer to this goal. Visual observation of the user and environment is one important aspect for reasoning about goals and actions. With respect to the automatic generation of adaptive programs, a PbD-System should detect, classify and determine the pose of manipulable objects in a fast and stable way. This paper presents a comparison of four established methods proposing a new object classification approach that combines these methods. This gives a means for setting up a world model of a manipulations scene automatically and initializes active contour parameters in order to trade motions of these objects.

[1]  R. Dillmann,et al.  An efficient image processing algorithm for high-level skill acquisition , 1999, Proceedings of the 1999 IEEE International Symposium on Assembly and Task Planning (ISATP'99) (Cat. No.99TH8470).

[2]  Masayuki Inaba,et al.  Learning by watching: extracting reusable task knowledge from visual observation of human performance , 1994, IEEE Trans. Robotics Autom..

[3]  Clark F. Olson,et al.  View-Based Recognition Using an Eigenspace Approximation to the Hausdorff Measure , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Katsushi Ikeuchi,et al.  Detectability, Uniqueness, and Reliability of Eigen Windows for Stable Verification of Partially Occluded Objects , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Olivier Faugeras,et al.  Three-Dimensional Computer Vision , 1993 .

[6]  Carl F. R. Weiman,et al.  Fast mapping using the log-Hough transformation , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[7]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[8]  John Kru Object Detection with Vector Quantized Binary Features , 1997 .

[9]  Rüdiger Dillmann,et al.  A sensor fusion approach for PbD , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[10]  Peter Steinhaus,et al.  A multisensor system for observation of user actions in programming by demonstration , 1999, Proceedings. 1999 IEEE/SICE/RSJ. International Conference on Multisensor Fusion and Integration for Intelligent Systems. MFI'99 (Cat. No.99TH8480).

[11]  Ales Ude,et al.  Object Localization Using Perceptual Organization and Structural Stereopsis , 1994 .

[12]  Berthold K. P. Horn Robot vision , 1986, MIT electrical engineering and computer science series.

[13]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[14]  R. Y. Tsai,et al.  An Efficient and Accurate Camera Calibration Technique for 3D Machine Vision , 1986, CVPR 1986.