Replay in minds and machines

7 Experience-related brain activity patterns have been found to reactivate during sleep, wakeful 8 rest, and brief pauses from active behavior. In parallel, machine learning research has found that 9 experience replay can lead to substantial performance improvements in artificial agents. Together, 10 these lines of research have significantly expanded our understanding of the potential computa11 tional benefits replay may provide to biological and artificial agents alike. We provide an overview 12 of findings in replay research from neuroscience and machine learning and summarize the compu13 tational benefits an agent can gain from replay that cannot be achieved through direct interactions 14 with the world itself. These benefits include faster learning and data efficiency, less forgetting, 15 prioritizing important experiences, as well as improved planning and generalization. In addition 16 to the benefits of replay for improving an agent’s decision-making policy, we highlight the less-well 17 studied aspect of replay in representation learning, wherein replay could provide a mechanism to 18 learn the structure and relevant aspects of the environment. Thus, replay might help the agent to 19 build task-appropriate state representations. 20

[1]  B. McNaughton,et al.  Preferential Reactivation of Motivationally Relevant Information in the Ventral Striatum , 2008, The Journal of Neuroscience.

[2]  Brad E. Pfeiffer,et al.  Hippocampal place cell sequences depict future paths to remembered goals , 2013, Nature.

[3]  W. Scoville,et al.  LOSS OF RECENT MEMORY AFTER BILATERAL HIPPOCAMPAL LESIONS , 1957, Journal of neurology, neurosurgery, and psychiatry.

[4]  Mattias P. Karlsson,et al.  Awake replay of remote experiences in the hippocampus , 2009, Nature Neuroscience.

[5]  Andrew W. Moore,et al.  Prioritized sweeping: Reinforcement learning with less data and less time , 2004, Machine Learning.

[6]  Peter Dayan,et al.  Improving Generalization for Temporal Difference Learning: The Successor Representation , 1993, Neural Computation.

[7]  Albert K. Lee,et al.  Memory of Sequential Experience in the Hippocampus during Slow Wave Sleep , 2002, Neuron.

[8]  D. Hassabis,et al.  Deconstructing episodic memory with construction , 2007, Trends in Cognitive Sciences.

[9]  James L. McClelland,et al.  Hippocampal conjunctive encoding, storage, and recall: Avoiding a trade‐off , 1994, Hippocampus.

[10]  J. R.,et al.  Quantitative analysis , 1892, Nature.

[11]  Long Ji Lin,et al.  Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[12]  Jeffrey M. Zacks,et al.  Constructing Experience: Event Models from Perception to Action , 2017, Trends in Cognitive Sciences.

[13]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[14]  E. Tolman The determiners of behavior at a choice point. , 1938 .

[15]  V. Sterpenich,et al.  A nap to recap or how reward regulates hippocampal-prefrontal memory networks during daytime sleep in humans , 2015, eLife.

[16]  Matthew A. Wilson,et al.  Hippocampal Replay of Extended Experience , 2009, Neuron.

[17]  L. Squire "Memory and the hippocampus: A synthesis from findings with rats, monkeys, and humans": Correction. , 1992 .

[18]  A. Redish Beyond the Cognitive Map: From Place Cells to Episodic Memory , 1999 .

[19]  Min Whan Jung,et al.  Distinct effects of reward and navigation history on hippocampal forward and reverse replays , 2019, Proceedings of the National Academy of Sciences.

[20]  Delia Silva,et al.  Trajectory events across hippocampal place-cells require previous experience , 2015, Nature Neuroscience.

[21]  B. McNaughton,et al.  Replay of Neuronal Firing Sequences in Rat Hippocampus During Sleep Following Spatial Experience , 1996, Science.

[22]  David S. Touretzky,et al.  The Role of the Hippocampus in Solving the Morris Water Maze , 1998, Neural Computation.

[23]  R. Muller,et al.  Head-direction cells recorded from the postsubiculum in freely moving rats. II. Effects of environmental manipulations , 1990, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[24]  Andreas S. Tolias,et al.  Generative replay with feedback connections as a general strategy for continual learning , 2018, ArXiv.

[25]  Y. Niv Learning task-state representations , 2019, Nature Neuroscience.

[26]  David J. Foster Replay Comes of Age. , 2017, Annual review of neuroscience.

[27]  Rich Sutton,et al.  A Deeper Look at Planning as Learning from Replay , 2015, ICML.

[28]  G. Dragoi,et al.  Preplay of future place cell sequences by hippocampal cellular assemblies , 2011, Nature.

[29]  James L. McClelland,et al.  What Learning Systems do Intelligent Agents Need? Complementary Learning Systems Theory Updated , 2016, Trends in Cognitive Sciences.

[30]  Neil Burgess,et al.  Coordinated hippocampal-entorhinal replay as structural inference , 2019, NeurIPS.

[31]  Raymond J Dolan,et al.  A map of abstract relational knowledge in the human hippocampal–entorhinal cortex , 2017, eLife.

[32]  Dmitriy Aronov,et al.  Mapping of a non-spatial dimension by the hippocampal/entorhinal circuit , 2017, Nature.

[33]  Lila Davachi,et al.  Persistence of hippocampal multivoxel patterns into postencoding rest is related to memory , 2013, Proceedings of the National Academy of Sciences.

[34]  Richard S. Sutton,et al.  Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.

[35]  Jai Y. Yu,et al.  Hippocampal–cortical interaction in decision making , 2015, Neurobiology of Learning and Memory.

[36]  Peter Dayan,et al.  Hippocampal Contributions to Control: The Third Way , 2007, NIPS.

[37]  H. Spiers The Hippocampal Cognitive Map: One Space or Many? , 2020, Trends in Cognitive Sciences.

[38]  D. Shohamy,et al.  Memory states influence value-based decisions. , 2016, Journal of experimental psychology. General.

[39]  Zeb Kurth-Nelson,et al.  What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior , 2018, Neuron.

[40]  Yael Niv,et al.  Does mental context drift or shift? , 2017, Current Opinion in Behavioral Sciences.

[41]  G. Buzsáki Hippocampal sharp wave‐ripple: A cognitive biomarker for episodic memory and planning , 2015, Hippocampus.

[42]  Shantanu P. Jadhav,et al.  Sharp-wave ripples as a signature of hippocampal-prefrontal reactivation for memory during sleep and waking states , 2019, Neurobiology of Learning and Memory.

[43]  J. O'Keefe,et al.  The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat. , 1971, Brain research.

[44]  Nicholas A. Ketz,et al.  Enhanced Brain Correlations during Rest Are Related to Memory for Recent Experiences , 2010, Neuron.

[45]  Anna C. Schapiro,et al.  Active and effective replay: systems consolidation reconsidered again , 2019, Nature Reviews Neuroscience.

[46]  D. Ji,et al.  Hippocampal awake replay in fear memory retrieval , 2017, Nature Neuroscience.

[47]  A. Bornstein,et al.  Mixing memory and desire: How memory reactivation supports deliberative decision-making. , 2020, Wiley interdisciplinary reviews. Cognitive science.

[48]  K. Paller,et al.  Upgrading the sleeping brain with targeted memory reactivation , 2013, Trends in Cognitive Sciences.

[49]  R. French Catastrophic forgetting in connectionist networks , 1999, Trends in Cognitive Sciences.

[50]  A W Toga,et al.  Maps of the Brain , 2001, The Anatomical record.

[51]  Adam Johnson,et al.  Neural Ensembles in CA3 Transiently Encode Paths Forward of the Animal at a Decision Point , 2007, The Journal of Neuroscience.

[52]  Yeon Soon Shin,et al.  Structuring Memory Through Inference-Based Event Segmentation , 2020, Top. Cogn. Sci..

[53]  Nicolas W. Schuck,et al.  Dynamics of fMRI patterns reflect sub-second activation sequences and reveal replay in human visual cortex , 2021, Nature Communications.

[54]  M. Eckardt The Hippocampus as a Cognitive Map , 1980 .

[55]  Loren M. Frank,et al.  The hippocampal sharp wave–ripple in memory retrieval for immediate use and consolidation , 2018, Nature Reviews Neuroscience.

[56]  Jan Born,et al.  Sleep's function in the spontaneous recovery and consolidation of memories. , 2007, Journal of experimental psychology. General.

[57]  Christian F. Doeller,et al.  The Role of Mental Maps in Decision-Making , 2017, Trends in Neurosciences.

[58]  Long Ji Lin,et al.  Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.

[59]  Günther Knoblich,et al.  How Memory Replay in Sleep Boosts Creative Problem-Solving , 2018, Trends in Cognitive Sciences.

[60]  L. Frank,et al.  New Experiences Enhance Coordinated Neural Activity in the Hippocampus , 2008, Neuron.

[61]  Robert M. French,et al.  Catastrophic interference in connectionist networks , 2003 .

[62]  J. Hodges Memory, Amnesia and the Hippocampal System , 1995 .

[63]  Dharshan Kumaran,et al.  What representations and computations underpin the contribution of the hippocampus to generalization and inference? , 2012, Front. Hum. Neurosci..

[64]  L. Davachi,et al.  Awake Reactivation of Prior Experiences Consolidates Memories and Biases Cognition , 2019, Trends in Cognitive Sciences.

[65]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[66]  Tom Schaul,et al.  Prioritized Experience Replay , 2015, ICLR.

[67]  Daniel Bendor,et al.  Biasing the content of hippocampal replay during sleep , 2012, Nature Neuroscience.

[68]  Christian Leibold A model for navigation in unknown environments based on a reservoir of hippocampal sequences , 2020, Neural Networks.

[69]  Shinsuke Shimojo,et al.  Neural Computations Mediating One-Shot Learning in the Human Brain , 2013, PLoS biology.

[70]  Peter Dayan,et al.  The roles of online and offline replay in planning , 2020, bioRxiv.

[71]  B. McNaughton,et al.  Memory trace reactivation in hippocampal and neocortical neuronal ensembles , 2000, Current Opinion in Neurobiology.

[72]  David J. Foster,et al.  Sequence learning and the role of the hippocampus in rodent navigation , 2012, Current Opinion in Neurobiology.

[73]  Jing Peng,et al.  Efficient Learning and Planning Within the Dyna Framework , 1993, Adapt. Behav..

[74]  Howard Eichenbaum,et al.  Does the hippocampus preplay memories? , 2015, Nature Neuroscience.

[75]  Ida Momennejad Learning Structures: Predictive Representations, Replay, and Generalization , 2020, Current Opinion in Behavioral Sciences.

[76]  R. Buckner The role of the hippocampus in prediction and imagination. , 2010, Annual review of psychology.

[77]  Jan Born,et al.  A Backup of Hippocampal Spatial Code outside the Hippocampus? New Light on Systems Memory Consolidation , 2020, Neuron.

[78]  J. G. Taylor,et al.  Vicarious trial and error. , 1951, Psychological review.

[79]  Kenji Doya,et al.  What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? , 1999, Neural Networks.

[80]  Y. Niv,et al.  Learning latent structure: carving nature at its joints , 2010, Current Opinion in Neurobiology.

[81]  R Ratcliff,et al.  Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. , 1990, Psychological review.

[82]  Nitzan Censor,et al.  Modulation of Learning and Memory: A Shared Framework for Interference and Generalization , 2018, Neuroscience.

[83]  Marcelo G Mattar,et al.  Prioritized memory access explains planning and hippocampal replay , 2017, Nature Neuroscience.

[84]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[85]  Andrew M. Wikenheiser,et al.  The balance of forward and backward hippocampal sequences shifts across behavioral states , 2013, Hippocampus.

[86]  A. Redish,et al.  Disrupting the medial prefrontal cortex alters hippocampal sequences during deliberative decision making. , 2019, Journal of neurophysiology.

[87]  Kevin J Miller,et al.  Multi-step planning in the brain , 2021, Current Opinion in Behavioral Sciences.

[88]  Eric L. Denovellis,et al.  Hippocampal replay of experience at real-world speeds , 2020, bioRxiv.

[89]  L. Davachi,et al.  Transcending time in the brain: How event memories are constructed from experience , 2019, Hippocampus.

[90]  Yael Niv,et al.  A State Representation for Reinforcement Learning and Decision-Making in the Orbitofrontal Cortex , 2017, bioRxiv.

[91]  J. Born,et al.  The memory function of sleep , 2010, Nature Reviews Neuroscience.

[92]  Andrew M. Wikenheiser,et al.  Over the river, through the woods: cognitive maps in the hippocampus and orbitofrontal cortex , 2016, Nature Reviews Neuroscience.

[93]  Nicolas W. Schuck,et al.  Sequential replay of nonspatial task states in the human hippocampus , 2018, Science.

[94]  George Dragoi,et al.  Distinct preplay of multiple novel spatial experiences in the rat , 2013, Proceedings of the National Academy of Sciences.

[95]  B. McNaughton,et al.  The Ventral Striatum in Off-Line Processing: Ensemble Reactivation during Sleep and Modulation by Hippocampal Ripples , 2004, The Journal of Neuroscience.

[96]  The Continuity of Context: A Role for the Hippocampus , 2021, Trends in Cognitive Sciences.

[97]  Lorena Deuker,et al.  Replay in Humans—First Evidence and Open Questions , 2017 .

[98]  L. Frank,et al.  Rewarded Outcomes Enhance Reactivation of Experience in the Hippocampus , 2009, Neuron.

[99]  Robert T Knight,et al.  Bidirectional prefrontal-hippocampal dynamics organize information transfer during sleep in humans , 2019, Nature Communications.

[100]  Roddy M. Grieves,et al.  The representation of space in the brain , 2017, Behavioural Processes.

[101]  Tomoki Fukai,et al.  Recurrent network model for learning goal-directed sequences through reverse replay , 2017, bioRxiv.

[102]  F. Fröhlich High-Frequency Oscillations , 2016 .

[103]  Long-Ji Lin,et al.  Reinforcement learning for robots using neural networks , 1992 .

[104]  T. Bliss,et al.  A synaptic model of memory: long-term potentiation in the hippocampus , 1993, Nature.

[105]  Russell A. Epstein,et al.  The cognitive map in humans: spatial navigation and beyond , 2017, Nature Neuroscience.

[106]  J. Fell,et al.  Ripples in the medial temporal lobe are relevant for human memory consolidation. , 2008, Brain : a journal of neurology.

[107]  Jiwon Kim,et al.  Continual Learning with Deep Generative Replay , 2017, NIPS.

[108]  E. Tolman A behavioristic theory of ideas. , 1926 .

[109]  J. Born,et al.  Maintaining memories by reactivation , 2007, Current Opinion in Neurobiology.

[110]  Rebecca M. C. Spencer,et al.  REM-dependent repair of competitive memory suppression , 2010, Experimental Brain Research.

[111]  M. Wilson,et al.  Coordinated memory replay in the visual cortex and hippocampus during sleep , 2007, Nature Neuroscience.

[112]  E. Tolman Cognitive maps in rats and men. , 1948, Psychological review.

[113]  Sridhar Mahadevan,et al.  Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes , 2007, J. Mach. Learn. Res..

[114]  Christopher D. Harvey,et al.  Choice-specific sequences in parietal cortex during a virtual-navigation decision task , 2012, Nature.

[115]  Richard S. Sutton,et al.  A Deeper Look at Experience Replay , 2017, ArXiv.

[116]  C. Buechel,et al.  Learning of distant state predictions by the orbitofrontal cortex in humans , 2018, bioRxiv.

[117]  C. Pavlides,et al.  Influences of hippocampal place cell firing in the awake state on the activity of these cells during subsequent sleep episodes , 1989, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[118]  W. Brogden Sensory pre-conditioning. , 1939 .

[119]  C. Büchel,et al.  Reactivation of single-episode pain patterns in the hippocampus and decision making , 2020, bioRxiv.

[120]  Brice A. Kuhl,et al.  Transforming the Concept of Memory Reactivation , 2020, Trends in Neurosciences.

[121]  J. O’Keefe,et al.  Do hippocampal pyramidal cells respond to nonspatial stimuli? , 2021, Physiological reviews.

[122]  K. Norman,et al.  Reinstated episodic context guides sampling-based decisions for reward , 2017, Nature Neuroscience.

[123]  Sen Cheng,et al.  Reactivation, Replay, and Preplay: How It Might All Fit Together , 2011, Neural plasticity.

[124]  Caswell Barry,et al.  Coordinated grid and place cell replay during rest , 2016, Nature Neuroscience.

[125]  Andrew M. Wikenheiser,et al.  Hippocampal theta sequences reflect current goals , 2015, Nature Neuroscience.

[126]  A. Heller,et al.  Is Hippocampal Replay a Mechanism for Anxiety and Depression? , 2020, JAMA psychiatry.

[127]  K. F. Muenzinger,et al.  Motivation in learning. VI. Escape from electric shock compared with hunger-food tension in the visual discrimination habit. , 1936 .

[128]  A. Redish,et al.  Manipulating Decisiveness in Decision Making: Effects of Clonidine on Hippocampal Search Strategies , 2016, The Journal of Neuroscience.

[129]  Maneesh Sahani,et al.  A neurally plausible model learns successor representations in partially observable environments , 2019, NeurIPS.

[130]  C. Bird How do we remember events? , 2020, Current Opinion in Behavioral Sciences.

[131]  J. O’Keefe,et al.  Hippocampal place units in the freely moving rat: Why they fire where they fire , 1978, Experimental Brain Research.

[132]  C. Barry,et al.  The Role of Hippocampal Replay in Memory and Planning , 2018, Current Biology.

[133]  R. Stickgold,et al.  Sleep-dependent learning and motor-skill complexity. , 2004, Learning & memory.

[134]  B L McNaughton,et al.  Coordinated Reactivation of Distributed Memory Traces in Primate Neocortex , 2002, Science.

[135]  L. Nadel,et al.  Decay happens: the role of active forgetting in memory , 2013, Trends in Cognitive Sciences.

[136]  Marvin Minsky,et al.  Steps toward Artificial Intelligence , 1995, Proceedings of the IRE.

[137]  M. Wilson,et al.  Temporally Structured Replay of Awake Hippocampal Ensemble Activity during Rapid Eye Movement Sleep , 2001, Neuron.

[138]  Uğur M Erdem,et al.  A goal‐directed spatial navigation model using forward trajectory planning based on grid cells , 2012, The European journal of neuroscience.

[139]  David Andre,et al.  Generalized Prioritized Sweeping , 1997, NIPS.

[140]  G. Buzsáki,et al.  Forward and reverse hippocampal place-cell sequences during ripples , 2007, Nature Neuroscience.

[141]  Shantanu P. Jadhav,et al.  The role of replay and theta sequences in mediating hippocampal‐prefrontal interactions for memory and cognition , 2020, Hippocampus.

[142]  David J. Foster,et al.  Reverse replay of behavioural sequences in hippocampal place cells during the awake state , 2006, Nature.

[143]  Arne D. Ekstrom,et al.  Cellular networks underlying human spatial navigation , 2003, Nature.

[144]  Andrew M. Wikenheiser,et al.  Decoding the cognitive map: ensemble hippocampal sequences and decision making , 2015, Current Opinion in Neurobiology.