论文信息 - A good body is all you need: avoiding catastrophic interference via agent architecture search

A good body is all you need: avoiding catastrophic interference via agent architecture search

In robotics, catastrophic interference continues to restrain policy training across environments. Efforts to combat catastrophic interference to date focus on novel neural architectures or training methods, with a recent emphasis on policies with good initial settings that facilitate training in new environments. However, none of these methods to date have taken into account how the physical architecture of the robot can obstruct or facilitate catastrophic interference, just as the choice of neural architecture can. In previous work we have shown how aspects of a robot’s physical structure (specifically, sensor placement) can facilitate policy learning by increasing the fraction of optimal policies for a given physical structure. Here we show for the first time that this proxy measure of catastrophic interference correlates with sample efficiency across several search methods, proving that favorable loss landscapes can be induced by the correct choice of physical structure. We show that such structures can be found via co-optimization—optimization of a robot’s structure and control policy simultaneously—yielding catastrophic interference resistant robot structures and policies, and that this is more efficient than control policy optimization alone. Finally, we show that such structures exhibit sensor homeostasis across environments and introduce this as the mechanism by which certain robots overcome catastrophic interference.

[1] R. French. Catastrophic forgetting in connectionist networks , 1999, Trends in Cognitive Sciences.

[2] Nicolas Y. Masse,et al. Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization , 2018, Proceedings of the National Academy of Sciences.

[3] Yee Whye Teh,et al. Functional Regularisation for Continual Learning , 2019 .

[4] Tamara G. Kolda,et al. Optimization by Direct Search: New Perspectives on Some Classical and Modern Methods , 2003, SIAM Rev..

[5] Xu He,et al. Overcoming Catastrophic Interference using Conceptor-Aided Backpropagation , 2018, ICLR.

[6] Josh Bongard,et al. The effects of morphology and fitness on catastrophic interference , 2018, ALIFE.

[7] Josh Bongard,et al. Morphological change in machines accelerates the evolution of robust behavior , 2011, Proceedings of the National Academy of Sciences.

[8] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[9] Josh C. Bongard,et al. Automated shapeshifting for function recovery in damaged robots , 2019, Robotics: Science and Systems.

[10] V. Braitenberg. Vehicles, Experiments in Synthetic Psychology , 1984 .

[11] Tom Schaul,et al. Natural Evolution Strategies , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[12] Nikos Komodakis,et al. Dynamic Few-Shot Visual Learning Without Forgetting , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13] Anthony V. Robins,et al. Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[14] P. Eggenberger,et al. Evolving the morphology of a compound eye on a robot , 1999, 1999 Third European Workshop on Advanced Mobile Robots (Eurobot'99). Proceedings (Cat. No.99EX355).

[15] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[16] Robert M. French,et al. Using Semi-Distributed Representations to Overcome Catastrophic Forgetting in Connectionist Networks , 1991 .

[17] Todd D. Murphey,et al. Algorithmic materials: Embedding computation within material properties for autonomy , 2019 .

[18] Yoshua Bengio,et al. An Empirical Investigation of Catastrophic Forgeting in Gradient-Based Neural Networks , 2013, ICLR.

[19] Josh C. Bongard,et al. Combating catastrophic forgetting with developmental compression , 2018, GECCO.

[20] Robert J. Wood,et al. Wearable tactile keypad with stretchable artificial skin , 2011, 2011 IEEE International Conference on Robotics and Automation.

[21] Hiroshi Yokoi,et al. An active artificial whisker array for texture discrimination , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[22] Frédo Durand,et al. DiffTaichi: Differentiable Programming for Physical Simulation , 2020, ICLR.

[23] Heinrich M. Jaeger,et al. Universal robotic gripper based on the jamming of granular material , 2010, Proceedings of the National Academy of Sciences.

[24] Leon Kester,et al. Augmented Utilitarianism for AGI Safety , 2019, AGI.

[25] R. Storn,et al. Differential Evolution , 2004 .

[26] Ezequiel A. Di Paolo,et al. The evolution of control and adaptation in a 3D powered passive dynamic walker , 2004 .

[27] Toni Giorgino,et al. Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package , 2009 .

[28] Patrick M. Reed,et al. Borg: An Auto-Adaptive Many-Objective Evolutionary Computing Framework , 2013, Evolutionary Computation.

[29] Jordan B. Pollack,et al. Automatic design and manufacture of robotic lifeforms , 2000, Nature.

[30] Hod Lipson,et al. Scalable co-optimization of morphology and control in embodied machines , 2017, Journal of The Royal Society Interface.

[31] Yee Whye Teh,et al. Progress & Compress: A scalable framework for continual learning , 2018, ICML.

[32] Yashraj S. Narang,et al. Transforming the Dynamic Response of Robotic Structures and Systems Through Laminar Jamming , 2018, IEEE Robotics and Automation Letters.

[33] J. Clune,et al. The Surprising Creativity of Digital Evolution , 2018, ALIFE.

[34] David Ha,et al. Reinforcement Learning for Improving Agent Design , 2018, Artificial Life.