Learning spatial object localization from vision on a humanoid robot

We present a combined machine learning and computer vision approach for robots to localize objects. It allows our iCub humanoid to quickly learn to provide accurate 3D position estimates (in the centimetre range) of objects seen. Biologically inspired approaches, such as Artificial Neural Networks (ANN) and Genetic Programming (GP), are trained to provide these position estimates using the two cameras and the joint encoder readings. No camera calibration or explicit knowledge of the robot's kinematic model is needed. We find that ANN and GP are not just faster and have lower complexity than traditional techniques, but also learn without the need for extensive calibration procedures. In addition, the approach is localizing objects robustly, when placed in the robot's workspace at arbitrary positions, even while the robot is moving its torso, head and eyes.

[1]  Jürgen Schmidhuber,et al.  Reflexive Collision Response with Virtual Skin - Roadmap Planning Meets Reinforcement Learning , 2016, ICAART.

[2]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[3]  R. A. Brooks,et al.  Intelligence without Representation , 1991, Artif. Intell..

[4]  Hod Lipson,et al.  Resilient Machines Through Continuous Self-Modeling , 2006, Science.

[5]  Jürgen Leitner,et al.  The Modular Behavioral Environment for Humanoids and other Robots (MoBeE) , 2012, ICINCO.

[6]  Giorgio Metta,et al.  YARP: Yet Another Robot Platform , 2006 .

[7]  Jürgen Leitner,et al.  An Integrated, Modular Framework for Computer Vision and Cognitive Robotics Research (icVision) , 2012, BICA.

[8]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[9]  Jürgen Leitner,et al.  Cartesian Genetic Programming for Image Processing , 2013 .

[10]  Nikolaos G. Tsagarakis,et al.  iCub: the design and realization of an open humanoid platform for cognitive and neuroscience research , 2007, Adv. Robotics.

[11]  Hod Lipson,et al.  Distilling Free-Form Natural Laws from Experimental Data , 2009, Science.

[12]  Giulio Sandini,et al.  The iCub humanoid robot: An open-systems platform for research in cognitive development , 2010, Neural Networks.

[13]  William B. Langdon,et al.  Evolving Hand-Eye Coordination for a Humanoid Robot with Machine Code Genetic Programming , 2001, EuroGP.

[14]  Stefan Schaal,et al.  Learning inverse kinematics , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[15]  Hod Lipson,et al.  Data-Mining Dynamical Systems: Automated Symbolic System Identification for Exploratory Analysis , 2008 .

[16]  I. Gordon Theories of Visual Perception , 1989 .

[17]  Alexandre Bernardino,et al.  A review of log-polar imaging for visual perception in robotics , 2010, Robotics and Autonomous Systems.

[18]  Alexander Gloye,et al.  Reinforcing the Driving Quality of Soccer Playing Robots by Anticipation (Verbesserung der Fahreigenschaften von fußballspielenden Robotern durch Antizipation) , 2005, it Inf. Technol..

[19]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[20]  Alexandre Bernardino,et al.  Pose Estimation for Grasping Preparation from Stereo Ellipses , 2008 .

[21]  Gregory D. Hager,et al.  Robot hand-eye coordination based on stereo vision , 1995 .

[22]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[23]  David Vernon,et al.  Machine vision - automated visual inspection and robot vision , 1991 .

[24]  Gary R. Bradski,et al.  REIN - A fast, robust, scalable REcognition INfrastructure , 2011, 2011 IEEE International Conference on Robotics and Automation.

[25]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.