The hippocampal formation as a hierarchical generative model supporting generative replay and continual learning

We advance a novel computational theory of the hippocampal formation as a hierarchical generative model that organizes sequential experiences, such as rodent trajectories during spatial navigation, into coherent spatiotemporal contexts. We propose that to make this possible, the hippocampal generative model is endowed with strong inductive biases to pattern-separate individual items of experience (at the first hierarchical layer), organize them into sequences (at the second layer) and then cluster them into maps (at the third layer). This theory entails a novel characterization of hippocampal reactivations as generative replay: the offline resampling of fictive sequences from the generative model, which supports the continual learning of multiple sequential experiences. Our experiments show that the hierarchical model using generative replay is able to learn and retain efficiently multiple spatial navigation trajectories, organizing them into separate spatial maps. Furthermore, it reproduces flexible and prospective aspects of hippocampal dynamics that have been challenging to explain within existing frameworks. This theory reconciles multiple roles of the hippocampal formation in map-based navigation, episodic memory and imagination.

[1]  Bradley C. Love,et al.  A non-spatial account of place and grid cells based on clustering models of concept learning , 2018, Nature Communications.

[2]  G. Buzsáki,et al.  Memory, navigation and theta rhythm in the hippocampal-entorhinal system , 2013, Nature Neuroscience.

[3]  Giovanni Pezzulo,et al.  Planning at decision time and in the background during spatial navigation , 2019, Current Opinion in Behavioral Sciences.

[4]  Dileep George,et al.  Learning cognitive maps as structured graphs for vicarious evaluation , 2019, bioRxiv.

[5]  Matthijs A. A. van der Meer,et al.  Information Processing in Decision-Making Systems , 2012, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[6]  Kefei Liu,et al.  Generative Predictive Codes by Multiplexed Hippocampal Neuronal Tuplets , 2018, Neuron.

[7]  A. Borst Seeing smells: imaging olfactory learning in bees , 1999, Nature Neuroscience.

[8]  G. Dragoi Cell assemblies, sequences and temporal coding in the hippocampus , 2020, Current Opinion in Neurobiology.

[9]  Lisa M. Giocomo,et al.  Experience-dependent contextual codes in the hippocampus , 2019, Nature Neuroscience.

[10]  Brad E. Pfeiffer,et al.  Reverse Replay of Hippocampal Place Cells Is Uniquely Modulated by Changing Reward , 2016, Neuron.

[11]  James L. McClelland,et al.  What Learning Systems do Intelligent Agents Need? Complementary Learning Systems Theory Updated , 2016, Trends in Cognitive Sciences.

[12]  Adam Johnson,et al.  Neural Ensembles in CA3 Transiently Encode Paths Forward of the Animal at a Decision Point , 2007, The Journal of Neuroscience.

[13]  Lisa M. Giocomo,et al.  Remembered reward locations restructure entorhinal spatial maps , 2019, Science.

[14]  Roddy M. Grieves,et al.  Place cells on a maze encode routes rather than destinations , 2016, eLife.

[15]  G. Buzsáki Hippocampal sharp wave‐ripple: A cognitive biomarker for episodic memory and planning , 2015, Hippocampus.

[16]  Laura A. Atherton,et al.  Memory trace replay: the shaping of memory consolidation by neuromodulation , 2015, Trends in Neurosciences.

[17]  L. Frank,et al.  Awake Hippocampal Sharp-Wave Ripples Support Spatial Memory , 2012, Science.

[18]  Babak Shahbaba,et al.  Hippocampal ensembles represent sequential relationships among discrete nonspatial events , 2019, bioRxiv.

[19]  Giovanni Pezzulo,et al.  Divide et impera: subgoaling reduces the complexity of probabilistic inference and problem solving , 2015, Journal of The Royal Society Interface.

[20]  Hideaki Shimazaki Neurons as an Information-theoretic Engine , 2015, 1512.07855.

[21]  Cyriel M A Pennartz,et al.  Model-based spatial navigation in the hippocampus-ventral striatum circuit: A computational analysis , 2018, PLoS Comput. Biol..

[22]  Timothy E. J. Behrens,et al.  Organizing conceptual knowledge in humans with a gridlike code , 2016, Science.

[23]  Zeb Kurth-Nelson,et al.  What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior , 2018, Neuron.

[24]  Matthijs A. A. van der Meer,et al.  Hippocampal Replay Is Not a Simple Function of Experience , 2010, Neuron.

[25]  G. Pezzulo,et al.  Internally generated hippocampal sequences as a vantage point to probe future‐oriented cognition , 2017, Annals of the New York Academy of Sciences.

[26]  Caswell Barry,et al.  The Tolman-Eichenbaum Machine: Unifying Space and Relational Memory through Generalization in the Hippocampal Formation , 2019, Cell.

[27]  Richard S. Sutton,et al.  Integrated Modeling and Control Based on Reinforcement Learning and Dynamic Programming , 1990, NIPS 1990.

[28]  Andrew M. Wikenheiser,et al.  Hippocampal theta sequences reflect current goals , 2015, Nature Neuroscience.

[29]  Magdalene I. Schlesiger,et al.  Hippocampal CA1 replay becomes less prominent but more rigid without inputs from medial entorhinal cortex , 2019, Nature Communications.

[30]  James L. McClelland,et al.  Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. , 1995, Psychological review.

[31]  Yuri Dabaghian,et al.  Reconceiving the hippocampal map as a topological template , 2014, eLife.

[32]  J. G. Taylor,et al.  Vicarious trial and error. , 1951, Psychological review.

[33]  M. Wilson,et al.  Oscillations, neural computations and learning during wake and sleep , 2017, Current Opinion in Neurobiology.

[34]  Marco Idiart,et al.  Grid Cells and Place Cells: An Integrated View of their Navigational and Memory Function , 2015, Trends in Neurosciences.

[35]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[36]  D. Hassabis,et al.  Hippocampal place cells construct reward related sequences through unexplored space , 2015, eLife.

[37]  David J. Foster Replay Comes of Age. , 2017, Annual review of neuroscience.

[38]  Min Whan Jung,et al.  Distinct effects of reward and navigation history on hippocampal forward and reverse replays , 2019, Proceedings of the National Academy of Sciences.

[39]  A. Treves,et al.  Theta-paced flickering between place-cell maps in the hippocampus , 2011, Nature.

[40]  M. Moser,et al.  Understanding memory through hippocampal remapping , 2008, Trends in Neurosciences.

[41]  Giovanni Pezzulo,et al.  Problem Solving as Probabilistic Inference with Subgoaling: Explaining Human Successes and Pitfalls in the Tower of Hanoi , 2016, PLoS Comput. Biol..

[42]  L. Nadel,et al.  The Hippocampus as a Cognitive Map , 1978 .

[43]  Hugo J. Spiers,et al.  Place Field Repetition and Purely Local Remapping in a Multicompartment Environment , 2013, Cerebral cortex.

[44]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .

[45]  Brad E. Pfeiffer,et al.  Hippocampal place cell sequences depict future paths to remembered goals , 2013, Nature.

[46]  Dmitriy Aronov,et al.  Mapping of a non-spatial dimension by the hippocampal/entorhinal circuit , 2017, Nature.

[47]  Jiwon Kim,et al.  Continual Learning with Deep Generative Replay , 2017, NIPS.

[48]  Yuki Aoki,et al.  The Integration of Goal-Directed Signals onto Spatial Maps of Hippocampal Place Cells. , 2019, Cell reports.

[49]  G. Dragoi,et al.  Preplay of future place cell sequences by hippocampal cellular assemblies , 2011, Nature.

[50]  E. Tolman Cognitive maps in rats and men. , 1948, Psychological review.

[51]  David S. Touretzky,et al.  Context Learning in the Rodent Hippocampus , 2007, Neural Computation.

[52]  Jozsef Csicsvari,et al.  Hippocampal Reactivation of Random Trajectories Resembling Brownian Diffusion , 2019, Neuron.

[53]  Kevin J. Miller,et al.  Dorsal hippocampus contributes to model-based planning , 2017, Nature Neuroscience.

[54]  Jozsef Csicsvari,et al.  The entorhinal cognitive map is attracted to goals , 2019, Science.

[56]  R. N. Spreng,et al.  The Future of Memory: Remembering, Imagining, and the Brain , 2012, Neuron.

[57]  Giovanni Pezzulo,et al.  The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation , 2013, Front. Psychol..

[58]  P. Sachs,et al.  SMARCAD1 ATPase activity is required to silence endogenous retroviruses in embryonic stem cells , 2019, Nature Communications.

[59]  Giovanni Pezzulo,et al.  Model-Based Approaches to Active Perception and Control , 2017, Entropy.

[60]  Neil Burgess,et al.  Forward and Backward Inference in Spatial Cognition , 2013, PLoS Comput. Biol..

[61]  Timothy Edward John Behrens,et al.  Generalisation of structural knowledge in the Hippocampal-Entorhinal system , 2018, NeurIPS.

[62]  Hava T. Siegelmann,et al.  Brain-inspired replay for continual learning with artificial neural networks , 2020, Nature Communications.

[63]  Samuel J Gershman,et al.  The Successor Representation: Its Computational Logic and Neural Substrates , 2018, The Journal of Neuroscience.

[64]  Matthew A. Wilson,et al.  Hippocampal remapping as hidden state inference , 2019, bioRxiv.

[65]  H. Eichenbaum A cortical–hippocampal system for declarative memory , 2000, Nature Reviews Neuroscience.

[66]  George Dragoi,et al.  Distinct preplay of multiple novel spatial experiences in the rat , 2013, Proceedings of the National Academy of Sciences.

[67]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[68]  Miguel Lazaro-Gredilla,et al.  Learning cognitive maps for vicarious evaluation , 2019 .

[69]  Roddy M. Grieves,et al.  Place field repetition and spatial learning in a multicompartment environment , 2015, Hippocampus.

[70]  Samuel J. Gershman,et al.  A Tutorial on Bayesian Nonparametric Models , 2011, 1106.2697.

[71]  Stefano Fusi,et al.  Context-dependent representations of objects and space in the primate hippocampus during virtual navigation , 2019, Nature Neuroscience.

[72]  Giovanni Pezzulo,et al.  Nonparametric Problem-Space Clustering: Learning Efficient Codes for Cognitive Control Tasks , 2016, Entropy.

[73]  Shigeyoshi Fujisawa,et al.  Temporal and Rate Coding for Discrete Event Sequences in the Hippocampus , 2017, Neuron.

[74]  Wenbo Tang,et al.  Dynamics of Awake Hippocampal-Prefrontal Replay for Spatial Learning and Memory-Guided Decision Making , 2019, Neuron.

[75]  E. Tulving Episodic memory: from mind to brain. , 2002, Annual review of psychology.

[76]  B. McNaughton,et al.  Hippocampus Leads Ventral Striatum in Replay of Place-Reward Information , 2009, PLoS biology.

[77]  Michael D. Howard,et al.  Complementary Learning Systems , 2014, Cogn. Sci..

[78]  Peter Gärdenfors,et al.  Navigating cognition: Spatial codes for human thinking , 2018, Science.

[79]  Andreas S. Tolias,et al.  Generative replay with feedback connections as a general strategy for continual learning , 2018, ArXiv.

[80]  Giovanni Pezzulo,et al.  Prefrontal Goal Codes Emerge as Latent States in Probabilistic Value Learning , 2016, Journal of Cognitive Neuroscience.

[81]  David J. Foster,et al.  Reverse replay of behavioural sequences in hippocampal place cells during the awake state , 2006, Nature.

[82]  G. Dragoi,et al.  Preconfigured patterns are the primary driver of offline multi‐neuronal sequence replay , 2018, Hippocampus.

[83]  Timothy E. J. Behrens,et al.  Human Replay Spontaneously Reorganizes Experience , 2019, Cell.

[84]  G. Buzsáki,et al.  Space and Time: The Hippocampus as a Sequence Generator , 2018, Trends in Cognitive Sciences.

[85]  Andrew W. Moore,et al.  Prioritized sweeping: Reinforcement learning with less data and less time , 2004, Machine Learning.

[86]  A D Redish,et al.  Prediction, sequences and the hippocampus , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[87]  David J. Foster,et al.  Hippocampal theta sequences , 2007, Hippocampus.

[88]  Eric Shea-Brown,et al.  Signatures and mechanisms of low-dimensional neural predictive manifolds , 2018, bioRxiv.

[89]  Eric Eaton,et al.  Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data , 2016, ArXiv.

[90]  Mattias P. Karlsson,et al.  Constant Sub-second Cycling between Representations of Possible Futures in the Hippocampus , 2019, Cell.

[91]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[92]  R. Buckner The role of the hippocampus in prediction and imagination. , 2010, Annual review of psychology.

[93]  Nicolas W. Schuck,et al.  Sequential replay of nonspatial task states in the human hippocampus , 2018, Science.

[94]  Matthijs A. A. van der Meer,et al.  Internally generated sequences in learning and executing goal-directed behavior , 2014, Trends in Cognitive Sciences.

[95]  Marcelo G Mattar,et al.  Prioritized memory access explains planning and hippocampal replay , 2017, Nature Neuroscience.