Spiking neural controllers in multi-agent competitive systems for adaptive targeted motor learning

Abstract The proposed work introduces a neural control strategy for guiding adaptation in spiking neural structures acting as nonlinear controllers in a group of bio-inspired robots which compete in reaching targets in a virtual environment. The neural structures embedded into each agent are inspired by a specific part of the insect brain, namely Central Complex, devoted to detect, learn and memorize visual features for targeted motor control. A reduced-order model of a spiking neuron is used as the basic building block for the neural controller. The control methodology employs bio-inspired, correlation based learning mechanisms like Spike timing dependent plasticity with the addition of a reward/punishment-based method experimentally found in insects. The reference signal for the overall multi-agent control system is imposed by a global reward, which guides motor learning to direct each agent towards specific visual targets. The neural controllers within the agents start from identical conditions: the learning strategy induces each robot to show anticipated targeting actions upon specific visual stimuli. The whole control structure also contributes to make the robots refractory or more sensitive to specific visual stimuli, showing distinct preferences in future choices. This leads to an environmentally induced, targeted motor control, even without a direct communication among the agents, giving robots, while running, the ability to perform adaptation in real-time. Experiments, carried out in a dynamic simulation environment, show the suitability of the proposed approach. Specific performance indexes, like Shannon׳s Entropy, are adopted to quantitatively analyze diversity and specialization within the group.

[1]  P. Arena,et al.  Implementation of a drosophila-inspired orientation model on the Eye-Ris platform , 2010, 2010 12th International Workshop on Cellular Nanoscale Networks and their Applications (CNNA 2010).

[2]  Roland Strauss,et al.  Visual Targeting of Motor Actions in Climbing Drosophila , 2010, Current Biology.

[3]  Wolfgang Maass,et al.  Networks of Spiking Neurons: The Third Generation of Neural Network Models , 1996, Electron. Colloquium Comput. Complex..

[4]  D. Nettle The evolution of personality variation in humans and other animals. , 2006, The American psychologist.

[5]  Luigi Fortuna,et al.  Learning Anticipation via Spiking Networks: Application to Navigation Control , 2009, IEEE Transactions on Neural Networks.

[6]  Ben J. A. Kröse,et al.  Distributed adaptive control: The self-organization of structured behavior , 1992, Robotics Auton. Syst..

[7]  Roger D. Quinn,et al.  Comparing cockroach and Whegs robot body motions , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[8]  P. Arena,et al.  Drosophila-inspired visual orientation model on the Eye-RIS platform: experiments on a roving robot , 2011, Microtechnologies.

[9]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[10]  M. Heisenberg,et al.  An engram found? Evaluating the evidence from fruit flies , 2004, Current Opinion in Neurobiology.

[11]  D. Tank,et al.  In Vivo Ca2+ Dynamics in a Cricket Auditory Neuron: An Example of Chemical Computation , 1994, Science.

[12]  F. Heitger,et al.  The functional role of contrast adaptation , 1988, Vision Research.

[13]  A. Dornhaus,et al.  Adaptation, Genetic Drift, Pleiotropy, and History in the Evolution of Bee Foraging Behavior , 2006 .

[14]  L. Abbott,et al.  Competitive Hebbian learning through spike-timing-dependent synaptic plasticity , 2000, Nature Neuroscience.

[15]  Paolo Arena,et al.  SPARKRS4CS: a software/hardware framework for cognitive architectures , 2011, Microtechnologies.

[16]  S. G. Ponnambalam,et al.  Swarm Robotics: An Extensive Research Review , 2010 .

[17]  S. Peron,et al.  Spike frequency adaptation mediates looming stimulus selectivity in a collision-detecting neuron , 2009, Nature Neuroscience.

[18]  G. Robinson Regulation of division of labor in insect societies. , 1992, Annual review of entomology.

[19]  Xiao-Jing Wang,et al.  Spike-Frequency Adaptation of a Generalized Leaky Integrate-and-Fire Model Neuron , 2004, Journal of Computational Neuroscience.

[20]  P. Meyer,et al.  THE USA TODAY INDEX OF ETHNIC DIVERSITY , 1992 .

[21]  Roland Strauss,et al.  Modelling the insect Mushroom Bodies: Application to sequence learning , 2015, Neural Networks.

[22]  Paolo Arena,et al.  Spatial Temporal Patterns for Action-Oriented Perception in Roving Robots II, An Insect Brain Computational Model , 2014, Cognitive Systems Monographs.

[23]  ArenaPaolo,et al.  Learning anticipation via spiking networks , 2009 .

[24]  L. Abbott,et al.  Cortical Development and Remapping through Spike Timing-Dependent Plasticity , 2001, Neuron.

[25]  André Longtin,et al.  Linear versus nonlinear signal transmission in neuron models with adaptation currents or dynamic thresholds. , 2010, Journal of neurophysiology.

[26]  I. Dean,et al.  Neural population coding of sound level adapts to stimulus statistics , 2005, Nature Neuroscience.

[27]  Paolo Arena,et al.  Embedding the AnaFocus' Eye-RIS vision system in roving robots to enhance the action-oriented perception , 2009, Microtechnologies.

[28]  L. Abbott,et al.  Extending the effects of spike-timing-dependent plasticity to behavioral timescales. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[29]  E. Izhikevich Solving the distal reward problem through linkage of STDP and dopamine signaling , 2007, BMC Neuroscience.

[30]  Roland Strauss,et al.  Modeling the insect mushroom bodies: application to a delayed match-to-sample task. , 2013, Neural networks : the official journal of the International Neural Network Society.

[31]  Alemdar Bayraktar,et al.  Determination of water level effects on the dynamic characteristics of a prototype arch dam model using ambient vibration testing , 2012, Experimental Techniques.

[32]  Marco Dorigo,et al.  Evolving Homogeneous Neurocontrollers for a Group of Heterogeneous Robots: Coordinated Motion, Cooperation, and Acoustic Communication , 2008, Artificial Life.

[33]  J A Garcia-Lazaro,et al.  Shifting and scaling adaptation to dynamic stimuli in somatosensory cortex , 2011, The European journal of neuroscience.

[34]  Paolo Arena,et al.  Efficient hexapodal locomotion control based on flow-invariant subspaces , 2011 .

[35]  Andreas V. M. Herz,et al.  A Universal Model for Spike-Frequency Adaptation , 2003, Neural Computation.

[36]  Paolo Arena,et al.  Modeling attentional loop in the insect Mushroom Bodies , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[37]  Eugene M. Izhikevich,et al.  Which model to use for cortical spiking neurons? , 2004, IEEE Transactions on Neural Networks.

[38]  H. Roitblat,et al.  Categorization , Representations , and the Dynamics of System-Environment Interaction : a case study in autonomous systems , 1992 .

[39]  J. Niven,et al.  Are Bigger Brains Better? , 2009, Current Biology.

[40]  P. Arena,et al.  STDP-based behavior learning on the TriBot robot , 2009, Microtechnologies.

[41]  Mitchell A. Potter,et al.  EVOLVING NEURAL NETWORKS WITH COLLABORATIVE SPECIES , 2006 .

[42]  Lynne E. Parker,et al.  Robot Teams: From Diversity to Polymorphism , 2002 .

[43]  Erol Şahin,et al.  A review of studies in swarm robotics , 2007 .

[44]  Eugene M. Izhikevich,et al.  Simple model of spiking neurons , 2003, IEEE Trans. Neural Networks.

[45]  Ling Li,et al.  Emergent Specialization in Swarm Systems , 2002, IDEAL.

[46]  James Phillips-Portillo,et al.  The central complex of the flesh fly, Neobellieria bullata: Recordings and morphologies of protocerebral inputs and small‐field neurons , 2012, The Journal of comparative neurology.

[47]  M. Sur,et al.  Adaptation-Induced Plasticity of Orientation Tuning in Adult Visual Cortex , 2000, Neuron.

[48]  Marco Dorigo,et al.  Swarm intelligence: from natural to artificial systems , 1999 .

[49]  Wofgang Maas,et al.  Networks of spiking neurons: the third generation of neural network models , 1997 .

[50]  B. Tabashnik,et al.  Development time and resistance to Bt crops , 1999, Nature.

[51]  P. Arena,et al.  Implementation and experimental validation of an autonomous hexapod robot , 2005 .

[52]  Paolo Arena,et al.  Learning expectation in insects: A recurrent spiking neural model for spatio-temporal representation , 2012, Neural Networks.

[53]  Wolfgang Maass,et al.  Noisy Spiking Neurons with Temporal Coding have more Computational Power than Sigmoidal Neurons , 1996, NIPS.

[54]  M. Fishman,et al.  Prevention of vertebrate neuronal death by the crmA gene. , 1994, Science.

[55]  W. Bossert,et al.  The Measurement of Diversity , 2001 .

[56]  Anna Dornhaus,et al.  Individual and collective cognition in ants and other insects ( Hymenoptera : Formicidae ) , 2008 .

[57]  Li Liu,et al.  Context generalization in Drosophila visual learning requires the mushroom bodies , 1999, Nature.

[58]  J. Schnupp,et al.  Shifting and scaling adaptation to dynamic stimuli in somatosensory cortex , 2007 .

[59]  Dario Floreano,et al.  Evolution of Spiking Neural Controllers for Autonomous Vision-Based Robots , 2001, EvoRobots.

[60]  Roland Strauss,et al.  Software/Hardware Issues in Modelling Insect Brain Architecture , 2011, ICIRA.

[61]  Ling Li,et al.  Learning and Measuring Specialization in Collaborative Swarm Systems , 2004, Adapt. Behav..