Approximating distributions in stochastic learning

On-line machine learning algorithms, many biological spike-timing-dependent plasticity (STDP) learning rules, and stochastic neural dynamics evolve by Markov processes. A complete description of such systems gives the probability densities for the variables. The evolution and equilibrium state of these densities are given by a Chapman-Kolmogorov equation in discrete time, or a master equation in continuous time. These formulations are analytically intractable for most cases of interest, and to make progress a nonlinear Fokker-Planck equation (FPE) is often used in their place. The FPE is limited, and some argue that its application to describe jump processes (such as in these problems) is fundamentally flawed. We develop a well-grounded perturbation expansion that provides approximations for both the density and its moments. The approach is based on the system size expansion in statistical physics (which does not give approximations for the density), but our simple development makes the methods accessible and invites application to diverse problems. We apply the method to calculate the equilibrium distributions for two biologically-observed STDP learning rules and for a simple nonlinear machine-learning problem. In all three examples, we show that our perturbation series provides good agreement with Monte-Carlo simulations in regimes where the FPE breaks down.

[1]  N. Kampen,et al.  Stochastic processes in physics and chemistry , 1981 .

[2]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[3]  Charles F Stevens,et al.  Synaptic plasticity , 1998, Current Biology.

[4]  G. Radons On stochastic dynamics of supervised learning , 1993 .

[5]  S. Orszag,et al.  Advanced Mathematical Methods For Scientists And Engineers , 1979 .

[6]  D. Bedeaux,et al.  On the Relation between Master Equations and Random Walks and Their Solutions , 1971 .

[7]  C. Bell,et al.  The generation and subtraction of sensory expectations within cerebellum-like structures. , 1997, Brain, behavior and evolution.

[8]  H. Risken Fokker-Planck Equation , 1984 .

[9]  John P. Boyd,et al.  The Devil's Invention: Asymptotic, Superasymptotic and Hyperasymptotic Series , 1999 .

[10]  C. Bell,et al.  Properties of a modifiable efference copy in an electric fish. , 1982, Journal of neurophysiology.

[11]  Daniel D. Lee,et al.  Equilibrium properties of temporally asymmetric Hebbian plasticity. , 2000, Physical review letters.

[12]  Ying-Cheng Lai,et al.  Cooperation of spike timing-dependent and heterosynaptic plasticities in neural networks: a Fokker-Planck approach. , 2006, Chaos.

[13]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[14]  Mark C. W. van Rossum,et al.  Stable Hebbian Learning from Spike Timing-Dependent Plasticity , 2000, The Journal of Neuroscience.

[15]  Baktash Babadi,et al.  Intrinsic Stability of Temporally Shifted Spike-Timing Dependent Plasticity , 2010, PLoS Comput. Biol..

[16]  P. Zweifel Advanced Mathematical Methods for Scientists and Engineers , 1980 .

[17]  Patrick D. Roberts,et al.  Computational Consequences of Temporally Asymmetric Learning Rules: II. Sensory Image Cancellation , 2000, Journal of Computational Neuroscience.

[18]  Kazuyuki Aihara,et al.  Self-Organizing Dual Coding Based on , 2004 .

[19]  P. Roberts,et al.  Modeling inhibitory plasticity in the electrosensory system of mormyrid electric fish. , 2000, Journal of neurophysiology.

[20]  H. Markram,et al.  Regulation of Synaptic Efficacy by Coincidence of Postsynaptic APs and EPSPs , 1997, Science.

[21]  S. Blinnikov,et al.  Expansions for nearly Gaussian distributions , 1997 .

[22]  Simon Haykin,et al.  Adaptive filter theory (2nd ed.) , 1991 .

[23]  S. Orszag,et al.  Advanced mathematical methods for scientists and engineers I: asymptotic methods and perturbation theory. , 1999 .

[24]  Todd K. Leen,et al.  Stochastic Perturbation Methods for Spike-Timing-Dependent Plasticity , 2012, Neural Computation.

[25]  D. Sherrington Stochastic Processes in Physics and Chemistry , 1983 .

[26]  Patrick D Roberts,et al.  Stability of negative-image equilibria in spike-timing-dependent plasticity. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Tomoki Fukai,et al.  A Stochastic Method to Predict the Consequence of Arbitrary Forms of Spike-Timing-Dependent Plasticity , 2003, Neural Computation.

[28]  V. Han,et al.  Synaptic plasticity in a cerebellum-like structure depends on temporal order , 1997, Nature.

[29]  S. Sharma,et al.  The Fokker-Planck Equation , 2010 .

[30]  P. Holmes,et al.  Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields , 1983, Applied Mathematical Sciences.

[31]  Michael A. Buice,et al.  Systematic Fluctuation Expansion for Neural Network Activity Equations , 2009, Neural Computation.

[32]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[33]  Frank A. Adrian,et al.  Exact ensemble dynamics for spike-timing-dependent plasticity , 2008 .

[34]  Thomas Villmann,et al.  Dynamics of Self-Organized Feature Mapping , 1993, IWANN.

[35]  Lars Kai Hansen,et al.  Stochastic linear learning: Exact test and training error averages , 1993, Neural Networks.

[36]  D. Johnston,et al.  Regulation of Synaptic Efficacy by Coincidence of Postsynaptic APs and EPSPs , 1997 .

[37]  W. Gerstner,et al.  Triplets of Spikes in a Model of Spike Timing-Dependent Plasticity , 2006, The Journal of Neuroscience.

[38]  S.Blinnikov,et al.  Expansions for nearly Gaussian distributions , 1997 .

[39]  Xueliang Li,et al.  On a Relation Between , 2012 .

[40]  Jesper Tegnér,et al.  Spike-timing-dependent plasticity: common themes and divergent vistas , 2002, Biological Cybernetics.

[41]  N. Kampen,et al.  a Power Series Expansion of the Master Equation , 1961 .

[42]  Todd K. Leen,et al.  Perturbation theory for stochastic learning dynamics , 2011, The 2011 International Joint Conference on Neural Networks.

[43]  G. Bi,et al.  Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type , 1998, The Journal of Neuroscience.

[44]  W. Ebeling Stochastic Processes in Physics and Chemistry , 1995 .

[45]  Todd K. Leen,et al.  Weight Space Probability Densities in Stochastic Learning: II. Transients and Basin Hopping Times , 1992, NIPS.

[46]  Patrick D Roberts,et al.  Random walks for spike-timing-dependent plasticity. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Li I. Zhang,et al.  A critical window for cooperation and competition among developing retinotectal synapses , 1998, Nature.

[48]  C. Gardiner Stochastic Methods: A Handbook for the Natural and Social Sciences , 2009 .