Morphology Independent Learning in Modular Robots

Hand-coding locomotion controllers for modular robots is difficult due to their polymorphic nature. Instead, we propose to use a simple and distributed reinforcement learning strategy. ATRON modules with identical controllers can be assembled in any configuration. To optimize the robot’s locomotion speed its modules independently and in parallel adjust their behavior based on a single global reward signal. In simulation, we study the learning strategy’s performance on different robot configurations. On the physical platform, we perform learning experiments with ATRON robots learning to move as fast as possible. We conclude that the learning strategy is effective and may be a practical approach to design gaits.

[1]  Maja J. Mataric,et al.  Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[2]  Hod Lipson,et al.  Resilient Machines Through Continuous Self-Modeling , 2006, Science.

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  David Johan Christensen,et al.  A unified simulator for Self-Reconfigurable Robots , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  H. Kurokawa,et al.  Automatic locomotion design and experiments for a Modular robotic system , 2005, IEEE/ASME Transactions on Mechatronics.

[6]  W. Oechel,et al.  Automatic design and manufacture of robotic lifeforms , 2022 .

[7]  Auke Jan Ijspeert,et al.  Learning to Move in Modular Robots using Central Pattern Generators and Online Optimization , 2008, Int. J. Robotics Res..

[8]  Karl Sims,et al.  Evolving 3d morphology and behavior by competition , 1994 .

[9]  山田 祐,et al.  Open Dynamics Engine を用いたスノーボードロボットシミュレータの開発 , 2007 .

[10]  Rodney A. Brooks,et al.  Learning to Coordinate Behaviors , 1990, AAAI.

[11]  Henrik Hautop Lund,et al.  Design of the ATRON lattice-based self-reconfigurable robot , 2006, Auton. Robots.

[12]  Daniel Marbach,et al.  Co-evolution of Configuration and Control for Homogenous Modular Robots , 2004 .

[13]  A.J. Ijspeert,et al.  Online optimization of modular robot locomotion , 2005, IEEE International Conference Mechatronics and Automation, 2005.