Compliant manipulation for peg-in-hole: Is passive compliance a key to learn contact motion?

We examine the usefulness of passive compliance in a manipulator that learns contact motion. Based on the notice that humans outperforms robots with the contact motion, we follow two aspects of human manipulation: passive compliance and learning. As imitation of human's arm and learning, we use a robot arm with passive compliant joints and it learns a policy for peg in hole by the proposed gradient descending method. We present that the passive compliance provides with quick and stable learning as well as a slow control sampling time.

[1]  Matthew M. Williamson,et al.  Series elastic actuators , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[2]  H. Sebastian Seung,et al.  Stochastic policy gradient reinforcement learning on a simple 3D biped , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[3]  H. Kimura,et al.  Acquiring hand-action models in task and behavior levels by a learning robot through observing human demonstrations , 2000 .

[4]  Geir Hovland,et al.  Skill acquisition from human demonstration using a hidden Markov model , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[5]  Russell H. Taylor,et al.  Automatic Synthesis of Fine-Motion Strategies for Robots , 1984 .

[6]  Yangmin Li,et al.  Hybrid control approach to the peg-in hole problem , 1997, IEEE Robotics Autom. Mag..

[7]  Sungchul Kang,et al.  Input preshaping control of the safe arm with MR-based passive compliant joints , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[8]  John Kenneth Salisbury,et al.  Playing it safe [human-friendly robots] , 2004, IEEE Robotics & Automation Magazine.

[9]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10]  W. R. Southern,et al.  The study of a passive accommodation device in robotic insertion processes , 2002 .

[11]  Shigeki Sugano,et al.  Development and evaluation of seven DOF MIA ARM , 1997, Proceedings of International Conference on Robotics and Automation.

[12]  Andrew W. Moore,et al.  Barycentric Interpolators for Continuous Space and Time Reinforcement Learning , 1998, NIPS.

[13]  S. Lee,et al.  Learning expert systems for robot fine motion control , 1988, Proceedings IEEE International Symposium on Intelligent Control 1988.

[14]  Roderic A. Grupen,et al.  Learning admittance mappings for force-guided assembly , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[15]  Michael S. Branicky,et al.  Search strategies for peg-in-hole assemblies with position uncertainty , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[16]  Michael A. Peshkin,et al.  Programmed compliance for error corrective assembly , 1990, IEEE Trans. Robotics Autom..

[17]  Pradeep K. Khosla,et al.  Manipulation task primitives for composing robot skills , 1997, Proceedings of International Conference on Robotics and Automation.

[18]  K. Maycock,et al.  A passive compliant wrist for chamferless peg-in-hole assembly operation from vertical and horizontal directions , 1998 .