Towards autonomous object reconstruction for visual search by the humanoid robot HRP-2

This paper deals with the problem of object reconstruction for visual search by a humanoid robot. Three problems necessary to achieve the behavior autonomously are considered: full-body motion generation according to a camera pose, general object representation for visual recognition and pose estimation, and far-away visual detection of an object. First we deal with the problem of generating full body motion for a HRP-2 humanoid robot to achieve camera pose given by a Next Best View algorithm. We use an optimization based approach including self-collision avoidance. This is made possible by a body to body distance function having a continuous gradient. The second problem has received a lot of attention for several decades, and we present a solution based on 3D vision together with SIFTs descriptor, making use of the information available from the robot. It is shown in this paper that one of the major limitation of this model is the perception distance. Thus a new approach based on a generative object model is presented to cope with more difficult situations. It relies on a local representation which allows handling occlusion as well as large scale and pose variations.

[1]  Jian L. Zhou,et al.  User's Guide for CFSQP Version 2.0: A C Code for Solving (Large Scale) Constrained Nonlinear (Minimax) Optimization Problems, Generating Iterates Satisfying All Inequality Constraints , 1994 .

[2]  Gordon Cheng,et al.  Support vector machines and Gabor kernels for object recognition on a humanoid with active foveated vision , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[3]  Yoshihiko Nakamura,et al.  Whole-body Cooperative Balancing of Humanoid Robot using COG Jacobian , 2002 .

[4]  Cordelia Schmid,et al.  Coloring Local Feature Extraction , 2006, ECCV.

[5]  Gordon Cheng,et al.  Learning feature representations for an object recognition system , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[6]  Olivier Stasse,et al.  Real-time 3D SLAM for Humanoid Robot considering Pattern Generator Information , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Giulio Sandini,et al.  Learning about objects through action - initial steps towards artificial cognition , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[8]  Stefano Soatto,et al.  Local Features, All Grown Up , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Frédéric Jurie,et al.  Category Level Object Segmentation , 2007 .

[10]  Sylvain Miossec,et al.  Planning support contact-points for humanoid robots and experiments on HRP-2 , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  M. Escobar,et al.  Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[12]  Yoshihiro Kawai,et al.  3D Object Recognition in Cluttered Environments by Segment-Based Stereo Vision , 2004, International Journal of Computer Vision.

[13]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[14]  Olivier Stasse,et al.  A Visual Attention Framework for Search Behavior by a Humanoid Robot , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[15]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[16]  Y. Nakarnura Whole-body Cooperative Balancing of Humanoid Robot using COG Jacobian , 2002 .

[17]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[18]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.