STDP-based behavior learning on the TriBot robot

This paper describes a correlation-based navigation algorithm built on Spike Timing Dependent Plasticity (STDP), an unsupervised learning paradigm for spiking neural networks. The algorithm was implemented on a new bio-inspired hybrid mini-robot, called TriBot, so that the robot can learn and extend its behavioral capabilities. Correlation-based algorithms have been found to explain many basic behaviors in simple animals. The most interesting consequence of STDP is that the system is able to learn high-level sensor features, building on a set of basic reflexes that depend on low-level sensor inputs. TriBot is composed of three modules. The first two are identical and inspired by the Whegs hybrid robot; their distinctive characteristic is the innovative shape of the three-spoke appendages, which increases the stability of the structure. The last module is composed of two standard legs with 3 degrees of freedom each. Thanks to the cooperation among these modules, TriBot is able to cope with irregular terrain while overcoming potential deadlock situations, to climb obstacles that are high compared to its size, and to manipulate objects. Robot experiments are reported to demonstrate the potential and the effectiveness of the approach.
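
To illustrate the kind of learning rule STDP refers to, the following is a minimal sketch of the commonly used pair-based STDP update with exponential windows. It is not taken from this paper: the function names (`stdp_delta_w`, `update_weight`), the amplitudes, the time constants, and the weight bounds are all hypothetical values chosen for illustration. The idea is that a synapse is strengthened when the presynaptic spike (for example, from a high-level sensor) precedes the postsynaptic spike (for example, from a neuron driven by a basic reflex), and weakened when the order is reversed.

```python
import math


def stdp_delta_w(dt_ms, a_plus=0.02, a_minus=0.02, tau_plus=20.0, tau_minus=10.0):
    """Weight change for one pre/post spike pair (illustrative parameters).

    dt_ms is t_post - t_pre in milliseconds.
    Positive dt_ms (pre before post) gives potentiation,
    negative dt_ms (post before pre) gives depression.
    """
    if dt_ms > 0:
        return a_plus * math.exp(-dt_ms / tau_plus)
    if dt_ms < 0:
        return -a_minus * math.exp(dt_ms / tau_minus)
    return 0.0


def update_weight(w, dt_ms, w_min=0.0, w_max=1.0):
    """Apply the STDP change and clip the weight to assumed bounds."""
    return min(w_max, max(w_min, w + stdp_delta_w(dt_ms)))


if __name__ == "__main__":
    w = 0.5
    # Pre-synaptic spike 5 ms before the post-synaptic spike: weight grows.
    w = update_weight(w, dt_ms=5.0)
    # Reversed order: weight shrinks.
    w = update_weight(w, dt_ms=-5.0)
    print(w)
```

Under this assumed rule, a sensor input that repeatedly fires shortly before a reflex is triggered would see its synapse potentiated, so that it could eventually elicit the response on its own.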
