Paper information - A Theory of Cheap Control in Embodied Systems

A Theory of Cheap Control in Embodied Systems

We present a framework for designing cheap control architectures of embodied agents. Our derivation is guided by the classical problem of universal approximation, whereby we explore the possibility of exploiting the agent’s embodiment for a new and more efficient universal approximation of behaviors generated by sensorimotor control. This embodied universal approximation is compared with the classical non-embodied universal approximation. To exemplify our approach, we present a detailed quantitative case study for policy models defined in terms of conditional restricted Boltzmann machines. In contrast to non-embodied universal approximation, which requires an exponential number of parameters, in the embodied setting we are able to generate all possible behaviors with a drastically smaller model, thus obtaining cheap universal approximation. We test and corroborate the theory experimentally with a six-legged walking machine. The experiments indicate that the controller complexity predicted by our theory is close to the minimal sufficient value, which means that the theory has direct practical implications.
