Indirect training of a spiking neural network for flight control via spike-timing-dependent synaptic plasticity

Recently, spiking neural networks (SNNs) have been shown capable of approximating the dynamics of biological neuronal networks, and of being trainable by biologically-plausible learning mechanisms, such as spike-timing-dependent synaptic plasticity. Numerical simulations also support the possibility that they may possess universal function approximation abilities. However the effectiveness of training algorithms to date is far inferior to those of other artificial neural networks. Moreover, they rely on directly manipulating the SNN weights, which may not be feasible in a number of their potential applications. This paper presents a novel indirect training approach to modulate spike-timing-dependent plasticity (STDP) in an action SNN that serves as a flight controller without directly manipulating its weights. A critic SNN is directly trained with a reward-based Hebbian approach to send spike trains to the action SNN, which in turn controls the aircraft and learns via STDP. The approach is demonstrated by training the action SNN to act as a flight controller for stability augmentation. Its performance and dynamics are analyzed before and after training through numerical simulations and Poincaré maps.

[1]  P. De Camilli,et al.  The distribution of synapsin I and synaptophysin in hippocampal neurons developing in culture , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[2]  Harald Burgsteiner,et al.  Imitation learning with spiking neural networks and real-world devices , 2006, Eng. Appl. Artif. Intell..

[3]  Simei Gomes Wysoski,et al.  Adaptive Learning Procedure for a Network of Spiking Neurons and Visual Pattern Recognition , 2006, ACIVS.

[4]  Robert C. Nelson,et al.  Flight Stability and Automatic Control , 1989 .

[5]  Wolfgang Maass,et al.  Noisy Spiking Neurons with Temporal Coding have more Computational Power than Sigmoidal Neurons , 1996, NIPS.

[6]  C. Pennartz Reinforcement learning by Hebbian synapses with adaptive thresholds , 1997, Neuroscience.

[7]  D. Perrett,et al.  Visual neurones responsive to faces in the monkey temporal cortex , 2004, Experimental Brain Research.

[8]  T. Sejnowski,et al.  Correlated neuronal activity and the flow of neural information , 2001, Nature Reviews Neuroscience.

[9]  Razvan V. Florian,et al.  Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity , 2007, Neural Computation.

[10]  Jean-Pascal Pfister,et al.  Optimal Spike-Timing-Dependent Plasticity for Precise Action Potential Firing in Supervised Learning , 2005, Neural Computation.

[11]  Silvia Ferrari,et al.  Biologically realizable reward-modulated hebbian training for spiking neural networks , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[12]  C. Nicholson Electric current flow in excitable cells J. J. B. Jack, D. Noble &R. W. Tsien Clarendon Press, Oxford (1975). 502 pp., £18.00 , 1976, Neuroscience.

[13]  T. Sejnowski,et al.  Reliability of spike timing in neocortical neurons. , 1995, Science.

[14]  Robert A. Legenstein,et al.  What Can a Neuron Learn with Spike-Timing-Dependent Plasticity? , 2005, Neural Computation.

[15]  A. Hodgkin,et al.  A quantitative description of membrane current and its application to conduction and excitation in nerve , 1952, The Journal of physiology.

[16]  Miguel A. L. Nicolelis,et al.  A simulator for the analysis of neuronal ensemble activity: application to reaching tasks , 2002, Neurocomputing.