Optimization of a Compact Model for the Compliant Humanoid Robot COMAN Using Reinforcement Learning

Abstract COMAN is a compliant humanoid robot. The introduction of passive compliance in some of its joints affects the dynamics of the whole system. Unlike traditional stiff robots, there is a deflection of the joint angle with respect to the desired one whenever an external torque is applied. Following a bottom up approach, the dynamic equations of the joints are defined first. Then, a new model which combines the inverted pendulum approach with a three-dimensional (Cartesian) compliant model at the level of the center of mass is proposed. This compact model is based on some assumptions that reduce the complexity but at the same time affect the precision. To address this problem, additional parameters are inserted in the model equation and an optimization procedure is performed using reinforcement learning. The optimized model is experimentally validated on the COMAN robot using several ZMP-based walking gaits.

[1]  Nikolaos G. Tsagarakis,et al.  The mechanical design of the new lower body for the child humanoid robot ‘iCub’ , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Jan Peters,et al.  Learning motor primitives for robotics , 2009, 2009 IEEE International Conference on Robotics and Automation.

[3]  Nikolaos G. Tsagarakis,et al.  Lower Body Design of the `iCub' a Human-baby like Crawling Robot , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[4]  Atsuo Takanishi,et al.  Development of a new humanoid robot WABIAN-2 , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[5]  Atsuo Takanishi,et al.  Development of a bipedal humanoid robot-control method of whole body cooperative dynamic biped walking , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[6]  M. Hirose,et al.  Development of Humanoid Robot ASIMO , 2001 .

[7]  Friedrich Pfeiffer,et al.  Modular joint design for performance enhanced humanoid robot LOLA , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[8]  Nikolaos G. Tsagarakis,et al.  A compact soft actuator unit for small scale human friendly robots , 2009, 2009 IEEE International Conference on Robotics and Automation.

[9]  Kazuhito Yokoi,et al.  Biped walking pattern generation by using preview control of zero-moment point , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[10]  G. Cox,et al.  ~ " " " ' l I ~ " " -" . : -· " J , 2006 .

[11]  M. Spong Modeling and Control of Elastic Joint Robots , 1987 .

[12]  Nikolaos G. Tsagarakis,et al.  iCub: the design and realization of an open humanoid platform for cognitive and neuroscience research , 2007, Adv. Robotics.

[13]  N. G. Tsagarakis,et al.  A compact model for the compliant humanoid robot COMAN , 2012, 2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob).

[14]  Fumio Kanehiro,et al.  Development of Humanoid Robot “HRP-3” , 2008 .

[15]  Kazuhito Yokoi,et al.  Biped walking stabilization based on linear inverted pendulum tracking , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  T. Takenaka,et al.  The development of Honda humanoid robot , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[17]  Nikolaos G. Tsagarakis,et al.  Lower body realization of the baby humanoid - ‘iCub’ , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Nikolaos G. Tsagarakis,et al.  The design of the lower body of the compliant humanoid robot “cCub” , 2011, 2011 IEEE International Conference on Robotics and Automation.

[19]  Jun-Ho Oh,et al.  Mechanical design of the humanoid robot platform, HUBO , 2007, Adv. Robotics.

[20]  Imin Kao,et al.  Simulation of conservative congruence transformation. Conservative properties in the joint and Cartesian spaces , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[21]  Nikolaos G. Tsagarakis,et al.  Bipedal walking energy minimization by reinforcement learning with evolving policy parameterization , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.