Insect inspired unsupervised learning for tactic and phobic behavior enhancement in a hybrid robot

In this paper the implementation of a correlation-based navigation algorithm, based on an unsupervised learning paradigm for spiking neural networks, called Spike Timing Dependent Plasticity (STDP), is presented. The main characteristic of the learning technique implemented is that it allows the robot to learn high-level sensor features, based on a set of basic reflexes, depending on some low-level sensor inputs. The goal is to allow the robot to autonomously learn how to navigate in an unknown environment, avoiding obstacles and heading toward or avoiding the targets (on the basis of the rewarded action).

[1]  Naoyuki Kubota,et al.  A spiking neural network for behavior learning of a mobile robot in a dynamic environment , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[2]  B. Babkin Conditioned Reflexes; an Investigation of the Physiological Activity of the Cerebral Cortex. , 1929 .

[3]  L. Abbott,et al.  Cortical Development and Remapping through Spike Timing-Dependent Plasticity , 2001, Neuron.

[4]  I. Pavlov Conditioned Reflexes: An Investigation of the Physiological Activity of the Cerebral Cortex , 1929 .

[5]  Ben J. A. Kröse,et al.  Distributed adaptive control: The self-organization of structured behavior , 1992, Robotics Auton. Syst..

[6]  Paul F. M. J. Verschure,et al.  Categorization, representations, and the dynamics of system-environment interaction: a case study in autonomous systems , 1993 .

[7]  E. Izhikevich Solving the distal reward problem through linkage of STDP and dopamine signaling , 2007, BMC Neuroscience.

[8]  V. Braitenberg Vehicles, Experiments in Synthetic Psychology , 1984 .

[9]  Kazuyuki Murase,et al.  Self-Organization of Spiking Neural Network Generating Autonomous Behavior in a Real Mobile Robot , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[10]  G. Bi,et al.  Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type , 1998, The Journal of Neuroscience.

[11]  Eugene M. Izhikevich,et al.  Simple model of spiking neurons , 2003, IEEE Trans. Neural Networks.

[12]  Eugene M. Izhikevich,et al.  Which model to use for cortical spiking neurons? , 2004, IEEE Transactions on Neural Networks.

[13]  Jan Wessnitzer,et al.  Multimodal sensory integration in insects—towards insect brain control architectures , 2006, Bioinspiration & biomimetics.

[14]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[15]  Luigi Fortuna,et al.  Learning Anticipation via Spiking Networks: Application to Navigation Control , 2009, IEEE Transactions on Neural Networks.

[16]  M. Heisenberg,et al.  An engram found? Evaluating the evidence from fruit flies , 2004, Current Opinion in Neurobiology.

[17]  L. Abbott,et al.  Competitive Hebbian learning through spike-timing-dependent synaptic plasticity , 2000, Nature Neuroscience.

[18]  Hani Hagras,et al.  Evolving spiking neural network controllers for autonomous robots , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[19]  P. Arena,et al.  STDP-based behavior learning on the TriBot robot , 2009, Microtechnologies.

[20]  Dean V. Buonomano,et al.  Mechanisms and significance of spike-timing dependent plasticity , 2002, Biological Cybernetics.