Synaptic weight decay with selective consolidation enables fast learning without catastrophic forgetting

Learning can interfere with pre-existing memories that in classical neural networks may lead to catastrophic forgetting. Different from these networks, biological synapses show an early decay of long-term potentiation, combined with a tag & capture mechanism for selective consolidation. We present a 2-component synaptic plasticity model that, by an early decay and a tag & capture mechanism, enables context-dependent fast learning without catastrophic forgetting. For reinforcement learning in multiple contexts, the fast learning combined with an early weight decay maximizes the expected reward while minimizing interferences between subsequent contexts. Fast learning, enabled by a highly plastic weight component, improves performance for a given context. Between contexts this plastic component decays to prevent interference, but selective consolidation into a stable component protects old memories. As a downside of these mechanisms, learning is hampered when consolidation is triggered prematurely by interleaving easy and difficult tasks, consistent with human psychophysical experiments.

[1]  U. Frey,et al.  Synaptic tagging and long-term potentiation , 1997, Nature.

[2]  L. Squire,et al.  Retrograde amnesia and memory consolidation: a neurobiological perspective , 1995, Current Opinion in Neurobiology.

[3]  R. Stickgold Sleep-dependent memory consolidation , 2005, Nature.

[4]  Nick Chater,et al.  Economic irrationality is optimal during noisy decision making , 2016, Proceedings of the National Academy of Sciences.

[5]  Stephen Grossberg,et al.  The ART of adaptive pattern recognition by a self-organizing neural network , 1987, Computer.

[6]  Michael H. Herzog,et al.  Perceptual learning and roving: Stimulus types and overlapping neural populations , 2009, Vision Research.

[7]  Mark C. W. van Rossum,et al.  State Based Model of Long-Term Potentiation and Synaptic Tagging and Capture , 2009, PLoS Comput. Biol..

[8]  Kevan A. C. Martin,et al.  Control of Neuronal Output by Inhibition at the Axon Initial Segment , 1990, Neural Computation.

[9]  S. Sajikumar,et al.  Late-associativity, synaptic tagging, and the role of dopamine during LTP and LTD , 2004, Neurobiology of Learning and Memory.

[10]  Murray Shanahan,et al.  Continual Reinforcement Learning with Complex Synapses , 2018, ICML.

[11]  Christopher Summerfield,et al.  Selective overweighting of larger magnitudes during noisy numerical comparison , 2017, Nature Human Behaviour.

[12]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[13]  W. Gerstner,et al.  Synaptic Consolidation: From Synapses to Behavioral Modeling , 2015, The Journal of Neuroscience.

[14]  Michael H. Herzog,et al.  Human Perceptual Learning by Mental Imagery , 2009, Current Biology.

[15]  Stefano Fusi,et al.  Computational principles of synaptic memory consolidation , 2016, Nature Neuroscience.

[16]  Johanni Brea,et al.  A Normative Theory of Forgetting: Lessons from the Fruit Fly , 2014, PLoS Comput. Biol..

[17]  Henning Sprekeler,et al.  Perceptual learning, roving and the unsupervised bias , 2012, Vision Research.

[18]  W. Abraham,et al.  Memory retention – the synaptic stability versus plasticity dilemma , 2005, Trends in Neurosciences.

[19]  James L. McClelland,et al.  Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. , 1995, Psychological review.

[20]  Isabel Gauthier,et al.  Deleterious effects of roving on learned tasks , 2014, Vision Research.

[21]  Wulfram Gerstner,et al.  Tag-Trigger-Consolidation: A Model of Early and Late Long-Term-Potentiation and Depression , 2008, PLoS Comput. Biol..

[22]  Mriganka Sur,et al.  Structural and Molecular Remodeling of Dendritic Spine Substructures during Long-Term Potentiation , 2014, Neuron.

[23]  R. Morris,et al.  Synaptic tagging and capture in the living rat , 2012, Nature Communications.

[24]  Christopher Summerfield,et al.  Comparing continual task learning in minds and machines , 2018, Proceedings of the National Academy of Sciences.

[25]  Thomas U. Otto,et al.  Perceptual learning with spatial uncertainties , 2006, Vision Research.

[26]  Stefano Fusi,et al.  Computational principles of biological memory , 2015, 1507.07580.

[27]  Gianluigi Mongillo,et al.  Online Learning with Hidden Markov Models , 2008, Neural Computation.

[28]  Marcin J. Skwark,et al.  Improving Contact Prediction along Three Dimensions , 2014, PLoS Comput. Biol..

[29]  R. French Catastrophic forgetting in connectionist networks , 1999, Trends in Cognitive Sciences.

[30]  Marcus Rohrbach,et al.  Memory Aware Synapses: Learning what (not) to forget , 2017, ECCV.

[31]  James L. McClelland,et al.  What Learning Systems do Intelligent Agents Need? Complementary Learning Systems Theory Updated , 2016, Trends in Cognitive Sciences.

[32]  U. Frey,et al.  Weak before strong: dissociating synaptic tagging and plasticity-factor accounts of late-LTP , 1998, Neuropharmacology.

[33]  Surya Ganguli,et al.  Continual Learning Through Synaptic Intelligence , 2017, ICML.

[34]  Sreedharan Sajikumar,et al.  Anisomycin inhibits the late maintenance of long-term depression in rat hippocampal slices in vitro , 2003, Neuroscience Letters.

[35]  L. Abbott,et al.  Cascade Models of Synaptically Stored Memories , 2005, Neuron.

[36]  R. Morris,et al.  Elements of a neurobiological theory of hippocampal function: the role of synaptic plasticity, synaptic tagging and schemas , 2006, The European journal of neuroscience.

[37]  P. Latham,et al.  Confidence matching in group decision-making , 2017, Nature Human Behaviour.

[38]  Xu He,et al.  Overcoming Catastrophic Interference using Conceptor-Aided Backpropagation , 2018, ICLR.

[39]  K. Shapiro,et al.  The contingent negative variation (CNV) event-related potential (ERP) predicts the attentional blink , 2008 .

[40]  Thomas U. Otto,et al.  Perceptual learning of bisection stimuli under roving: slow and largely specific. , 2008, Journal of vision.

[41]  P. Somogyi,et al.  A new type of specific interneuron in the monkey hippocampus forming synapses exclusively with the axon initial segments of pyramidal cells , 1983, Brain Research.

[42]  R. Morris,et al.  Making memories last: the synaptic tagging and capture hypothesis , 2010, Nature Reviews Neuroscience.

[43]  Stefano Fusi,et al.  Efficient Partitioning of Memory Systems and Its Importance for Memory Consolidation , 2013, PLoS Comput. Biol..