Learning Dextrous Manipulation Skills for Multifingered Robot Hands Using the Evolution Strategy

We present a method for autonomous learning of dextrous manipulation skills with multifingered robot hands. We use heuristics derived from observations made on human hands to reduce the degrees of freedom of the task and make learning tractable. Our approach consists of learning and storing a few basic manipulation primitives for a few prototypical objects and then using an associative memory to obtain the required parameters for new objects and/or manipulations. The parameter space of the robot is searched using a modified version of the evolution strategy, which is robust to the noise normally present in real-world complex robotic tasks. Given the difficulty of modeling and simulating accurately the interactions of multiple fingers and an object, and to ensure that the learned skills are applicable in the real world, our system does not rely on simulation; all the experimentation is performed by a physical robot, in this case the 16-degree-of-freedom Utah/MIT hand. Experimental results show that accurate dextrous manipulation skills can be learned by the robot in a short period of time. We also show the application of the learned primitives to perform an assembly task and how the primitives generalize to objects that are different from those used during the learning phase.

[1]  Alan D. Christiansen,et al.  Learning reliable manipulation strategies without initial physical models , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[2]  Ingo Rechenberg,et al.  Evolutionsstrategie : Optimierung technischer Systeme nach Prinzipien der biologischen Evolution , 1973 .

[3]  Thea Iberall,et al.  The nature of human prehension: Three dextrous hands in one , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[4]  Emile H. L. Aarts,et al.  Simulated annealing and Boltzmann machines - a stochastic approach to combinatorial optimization and neural computing , 1990, Wiley-Interscience series in discrete mathematics and optimization.

[5]  W. Vent,et al.  Rechenberg, Ingo, Evolutionsstrategie — Optimierung technischer Systeme nach Prinzipien der biologischen Evolution. 170 S. mit 36 Abb. Frommann‐Holzboog‐Verlag. Stuttgart 1973. Broschiert , 1975 .

[6]  Hans-Paul Schwefel,et al.  Numerical Optimization of Computer Models , 1982 .

[7]  Rodney A. Brooks,et al.  Learning to Coordinate Behaviors , 1990, AAAI.

[8]  A. E. Eiben,et al.  Global Convergence of Genetic Algorithms: A Markov Chain Analysis , 1990, PPSN.

[9]  Maja J. Mataric,et al.  Reward Functions for Accelerated Learning , 1994, ICML.

[10]  R.C. Nelson,et al.  Experiments on dextrous manipulation without prior object models , 1996, Proceedings of the 1996 IEEE International Symposium on Intelligent Control.

[11]  Olac Fuentes,et al.  The virtual tool approach to dextrous telemanipulation , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[12]  Sundar Narasimhan,et al.  Dexterous Robotic Hands: Kinematics and Control , 1988 .

[13]  Shimon Edelman,et al.  Learning to grasp using visual information , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[14]  Olac Fuentes,et al.  Acquiring Visual-Motor Models for Precision Manipulation with Robot Hands , 1996, ECCV.

[15]  Olac Fuentes,et al.  Virtual Tools: A Framework for Simplifying Sensory-Motor Control in Complex Robotic Systems , 1995 .

[16]  Jakub Segen,et al.  Automatic discovery of robotic grasp configurations , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[17]  Peter K. Allen,et al.  Compliant manipulation with a dextrous robot hand , 1993, [1993] Proceedings IEEE International Conference on Robotics and Automation.

[18]  Olac Fuentes,et al.  Morphing Hands and Virtual Tools (or What Good is an Extra Degree of Freedom , 1994 .

[19]  Thomas H. Speeter Primitive based control of the Utah/MIT dextrous hand , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[20]  Maja J. Mataric,et al.  Learning social behavior , 1997, Robotics Auton. Syst..

[21]  M. Arbib Coordinated control programs for movements of the hand , 1985 .

[22]  Proceedings of the 1986 IEEE International Conference on Robotics and Automation, San Francisco, California, USA, April 7-10, 1986 , 1986, ICRA.