Mutual Information As a Task-Independent Utility Function for Evolutionary Robotics

The design of the control system for a swarm of robots is not a trivial enterprise. Above all, it is difficult to define which are the individual rules that produce a desired swarm behaviour without an a priori knowledge of the system features. For this reason, evolutionary or learning processes have been widely used to automatically synthesise group behaviours (see, for instance, Mataric 1997; Quinn et al. 2003; Baldassarre et al. 2007). In this paper, we investigate the use of information-theoretic concepts such as entropy and mutual information as task-independent utility functions for mobile robots, which adapt on the basis of an evolutionary or learning process. We believe that the use of implicit and general purpose utility functions—fitness functions or reward/error measures—can allow evolution or learning to explore the search space more freely, without being constrained by an explicit description of the desired solution. In this way, it is possible to discover behavioural and cognitive skills that play useful functionalities, and that might be hard to identify beforehand by the experimenter without an a priori knowledge of the system under study. Such task-independent utility functions can be conceived as universal intrinsic drives toward the development of useful behaviours in adaptive embodied agents.

[1]  Stefano Nolfi,et al.  Evolutionary Robotics: The Biology, Intelligence, and Technology of Self-Organizing Machines , 2000 .

[2]  David McMahon A Brief Introduction to Information Theory , 2008 .

[3]  H. Van Dyke Parunak,et al.  Entropy and self-organization in multi-agent systems , 2001, AGENTS '01.

[4]  Serge Kernbach,et al.  Handbook of Collective Robotics: Fundamentals and Challenges , 2013 .

[5]  Albert Y. Zomaya,et al.  Emergence of Glider-like Structures in a Modular Robotic System , 2008, ALIFE.

[6]  S. Nolfi,et al.  Self-Organising Sync in a Robotic Swarm , 2007 .

[7]  Olaf Sporns,et al.  Connectivity and complexity: the relationship between neuroanatomy and brain dynamics , 2000, Neural Networks.

[8]  Stephen Marshall,et al.  Connectivity and complexity , 2004 .

[9]  Chrystopher L. Nehaniv,et al.  Sensor adaptation and development in robots by entropy maximization of sensory data , 2005, 2005 International Symposium on Computational Intelligence in Robotics and Automation.

[10]  G. Edelman,et al.  Complexity and coherency: integrating information in the brain , 1998, Trends in Cognitive Sciences.

[11]  Danesh Tarapore,et al.  Fingerprinting Agent-Environment Interaction via Information Theory , 2004 .

[12]  Chrystopher L. Nehaniv,et al.  All Else Being Equal Be Empowered , 2005, ECAL.

[13]  Stefano Nolfi,et al.  Self-organised path formation in a swarm of robots , 2011, Swarm Intelligence.

[14]  Takeo Kanade,et al.  Intelligent Autonomous Systems , 1991, Robotics Auton. Syst..

[15]  John Hallam,et al.  From Animals to Animats 10 , 2008 .

[16]  Philippe Capdepuy,et al.  Maximization of Potential Information Flow as a Universal Utility for Collective Behaviour , 2007, 2007 IEEE Symposium on Artificial Life.

[17]  Kathrin Klamroth,et al.  Introduction and Applications , 2000 .

[18]  Stefano Nolfi,et al.  Evolving coordinated group behaviours through maximisation of mean mutual information , 2008, Swarm Intelligence.

[19]  Olaf Sporns,et al.  Methods for quantifying the informational structure of sensory and motor data , 2007, Neuroinformatics.

[20]  Chrystopher L. Nehaniv,et al.  Empowerment: a universal agent-centric measure of control , 2005, 2005 IEEE Congress on Evolutionary Computation.

[21]  Olaf Sporns,et al.  Evolving Coordinated Behavior by Maximizing Information Structure , 2006 .

[22]  Alcherio Martinoli,et al.  Communication in a Swarm of Miniature Robots: The e-Puck as an Educational Tool for Swarm Robotics , 2006, Swarm Robotics.

[23]  Wolfgang Banzhaf,et al.  Advances in Artificial Life , 2003, Lecture Notes in Computer Science.

[24]  Chrystopher L. Nehaniv,et al.  Keep Your Options Open: An Information-Based Driving Principle for Sensorimotor Systems , 2008, PloS one.

[25]  William Bialek,et al.  Adaptive Rescaling Maximizes Information Transmission , 2000, Neuron.

[26]  M. Lungarella,et al.  Information Self-Structuring: Key Principle for Learning and Development , 2005, Proceedings. The 4nd International Conference on Development and Learning, 2005..

[27]  S. Nolfi,et al.  Evolving collective control , cooperation and distributed cognition , 2013 .

[28]  Maja J. Mataric,et al.  Learning social behavior , 1997, Robotics Auton. Syst..

[29]  G Tononi,et al.  A complexity measure for selective matching of signals by the brain. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Stefano Nolfi,et al.  Evolving mobile robots in simulated and real environments , 1995 .

[31]  Nick Jakobi,et al.  Evolutionary Robotics and the Radical Envelope-of-Noise Hypothesis , 1997, Adapt. Behav..

[32]  Lincoln Smith,et al.  Evolving controllers for a homogeneous system of physical robots: structured cooperation with minimal sensors , 2003, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[33]  Mikhail Prokopenko,et al.  Evaluating Team Performance at the Edge of Chaos , 2003, RoboCup.

[34]  Ralf Der,et al.  Higher Coordination With Less Control—A Result of Information Maximization in the Sensorimotor Loop , 2009, Adapt. Behav..

[35]  X. Rosalind Wang,et al.  Measuring information storage and transfer in swarms , 2011, ECAL.

[36]  Danesh Tarapore,et al.  Quantifying patterns of agent-environment interaction , 2006, Robotics Auton. Syst..

[37]  Marco Dorigo,et al.  Self-Organized Coordinated Motion in Groups of Physically Connected Robots , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[38]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[39]  Christoph Salge,et al.  Local information maximisation creates emergent flocking behavior , 2011, ECAL.

[40]  Peter Stone,et al.  Empowerment for continuous agent—environment systems , 2011, Adapt. Behav..

[41]  Phil Husbands,et al.  Artificial Life IX: Proceedings of the Ninth International Conference on the Simulation and Synthesis of Living Systems , 2004 .

[42]  Chrystopher L. Nehaniv,et al.  Two Agents Acting as One , 2010, ALIFE.

[43]  Chrystopher L. Nehaniv,et al.  Think globally, sense locally: From local information to global features , 2011, 2011 IEEE Symposium on Artificial Life (ALIFE).

[44]  Marco Dorigo,et al.  Evolution, Self-organization and Swarm Robotics , 2008, Swarm Intelligence.

[45]  Stefano Nolfi,et al.  Self-Organizing Sync in a Robotic Swarm: A Dynamical System View , 2009, IEEE Transactions on Evolutionary Computation.

[46]  Haitao Zhang,et al.  Predicting Fitness Effects of Beneficial Mutations in Digital Organisms , 2007, 2007 IEEE Symposium on Artificial Life.

[47]  Max Lungarella,et al.  Robots As Cognitive Tools Information theoretic analysis of sensory-motor data , 2001 .

[48]  G. Edelman,et al.  A measure for brain complexity: relating functional segregation and integration in the nervous system. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Mikhail Prokopenko,et al.  Evolving Spatiotemporal Coordination in a Modular Robotic System , 2006, SAB.