Continual Lifelong Learning with Neural Networks: A Review

Humans and animals have the ability to continually acquire, fine-tune, and transfer knowledge and skills throughout their lifespan. This ability, referred to as lifelong learning, is mediated by a rich set of neurocognitive mechanisms that together contribute to the development and specialization of our sensorimotor skills as well as to long-term memory consolidation and retrieval. Consequently, lifelong learning capabilities are crucial for computational learning systems and autonomous agents interacting in the real world and processing continuous streams of information. However, lifelong learning remains a long-standing challenge for machine learning and neural network models since the continual acquisition of incrementally available information from non-stationary data distributions generally leads to catastrophic forgetting or interference. This limitation represents a major drawback for state-of-the-art deep neural network models that typically learn representations from stationary batches of training data, thus without accounting for situations in which information becomes incrementally available over time. In this review, we critically summarize the main challenges linked to lifelong learning for artificial learning systems and compare existing neural network approaches that alleviate, to different extents, catastrophic forgetting. Although significant advances have been made in domain-specific learning with neural networks, extensive research efforts are required for the development of robust lifelong learning on autonomous agents and robots. We discuss well-established and emerging research motivated by lifelong learning factors in biological systems such as structural plasticity, memory replay, curriculum and transfer learning, intrinsic motivation, and multisensory integration.

[1]  Martial Mermillod,et al.  The stability-plasticity dilemma: investigating the continuum from catastrophic forgetting to age-limited learning effects , 2013, Front. Psychol..

[2]  Günther Palm,et al.  Biomimetic Neural Learning for Intelligent Robots - Intelligent Systems, Cognitive Robotics, and Neuroscience , 2005, Biomimetic Neural Learning for Intelligent Robots.

[3]  Anna Chadwick The Scientist in the Crib -- Minds, Brains, and How Children Learn , 2001 .

[4]  T. Stanford,et al.  Development of multisensory integration from the perspective of the individual neuron , 2014, Nature Reviews Neuroscience.

[5]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[6]  Sebastian Risi,et al.  Born to Learn: the Inspiration, Progress, and Future of Evolved Plastic Artificial Neural Networks , 2017, Neural Networks.

[7]  Jürgen Schmidhuber,et al.  Curious model-building control systems , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[8]  C. Koch,et al.  Recurrent excitation in neocortical circuits , 1995, Science.

[9]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10]  L. Abbott,et al.  Cascade Models of Synaptically Stored Memories , 2005, Neuron.

[11]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[12]  Michael S. Lew,et al.  Deep learning for visual understanding: A review , 2016, Neurocomputing.

[13]  Gregory Ditzler,et al.  Learning in Nonstationary Environments: A Survey , 2015, IEEE Computational Intelligence Magazine.

[14]  Shie Mannor,et al.  A Deep Hierarchical Approach to Lifelong Learning in Minecraft , 2016, AAAI.

[15]  J. Elman Learning and development in neural networks: the importance of starting small , 1993, Cognition.

[16]  D. Hubel,et al.  Plasticity of ocular dominance columns in monkey striate cortex. , 1977, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[17]  R. O’Reilly,et al.  Computational principles of learning in the neocortex and hippocampus , 2000, Hippocampus.

[18]  I. Fried,et al.  Internally Generated Reactivation of Single Neurons in Human Hippocampus During Free Recall , 2008, Science.

[19]  Ronald Kemker,et al.  FearNet: Brain-Inspired Model for Incremental Learning , 2017, ICLR.

[20]  Yan Liu,et al.  Deep Generative Dual Memory Network for Continual Learning , 2017, ArXiv.

[21]  F. Gage,et al.  Neurogenesis in the adult human hippocampus , 1998, Nature Medicine.

[22]  Stefan Wermter,et al.  Lifelong Learning of Spatiotemporal Representations With Dual-Memory Recurrent Self-Organization , 2018, Front. Neurorobot..

[23]  W. Gerstner,et al.  The temporal paradox of Hebbian learning and homeostatic plasticity , 2017, bioRxiv.

[24]  Christoph H. Lampert,et al.  iCaRL: Incremental Classifier and Representation Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Stefan Wermter,et al.  A self-organizing neural network architecture for learning human-object interactions , 2017, Neurocomputing.

[26]  M. Sur,et al.  Development and plasticity of cortical areas and networks , 2001, Nature Reviews Neuroscience.

[27]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[28]  B. Bontempi,et al.  Time-dependent reorganization of brain circuitry underlying long-term memory storage , 1999, Nature.

[29]  Karl J. Friston,et al.  Active inference and epistemic value , 2015, Cognitive neuroscience.

[30]  Marc'Aurelio Ranzato,et al.  Gradient Episodic Memory for Continual Learning , 2017, NIPS.

[31]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[32]  H. Cameron,et al.  Differentiation of newly born neurons and glia in the dentate gyrus of the adult rat , 1993, Neuroscience.

[33]  W. Gan,et al.  Branch-specific dendritic Ca2+ spikes cause persistent synaptic plasticity , 2015, Nature.

[34]  D. N. Spinelli,et al.  Visual Experience Modifies Distribution of Horizontally and Vertically Oriented Receptive Fields in Cats , 1970, Science.

[35]  Pietro Perona,et al.  The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[36]  A. Knoblauch Impact of Structural Plasticity on Memory Formation and Decline , 2017 .

[37]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[38]  Karl J. Friston,et al.  Active Inference, Predictive Coding and Cortical Architecture , 2015 .

[39]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[40]  Takashi Kitamura,et al.  Engrams and circuits crucial for systems consolidation of a memory , 2017, Science.

[41]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[42]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .

[43]  B. Stein,et al.  The Merging of the Senses , 1993 .

[44]  W. Abraham,et al.  Memory retention – the synaptic stability versus plasticity dilemma , 2005, Trends in Neurosciences.

[45]  Geoffrey E. Hinton Using fast weights to deblur old memories , 1987 .

[46]  J. O’Neill,et al.  Play it again: reactivation of waking experience and memory , 2010, Trends in Neurosciences.

[47]  M. Stryker,et al.  Local GABA circuit control of experience-dependent plasticity in developing visual cortex. , 1998, Science.

[48]  Razvan Pascanu,et al.  Sim-to-Real Robot Learning from Pixels with Progressive Nets , 2016, CoRL.

[49]  Michael S. C. Thomas,et al.  Critical periods and catastrophic interference effects in the development of self-organizing feature maps. , 2008, Developmental science.

[50]  F. Gage,et al.  Adult neurogenesis and neural stem cells of the central nervous system in mammals , 2002, Journal of neuroscience research.

[51]  Chrisantha Fernando,et al.  PathNet: Evolution Channels Gradient Descent in Super Neural Networks , 2017, ArXiv.

[52]  K. Miller,et al.  Ocular dominance column development: analysis and simulation. , 1989, Science.

[53]  G. Davis Homeostatic control of neural activity: from phenomenology to molecular design. , 2006, Annual review of neuroscience.

[54]  Stephen R. Marsland,et al.  A self-organising network that grows when required , 2002, Neural Networks.

[55]  C. Nelson CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE 43 the effects of maternal stress on fetal brain development , 2022 .

[56]  Andrew G. Barto,et al.  Intrinsic Motivation and Reinforcement Learning , 2013, Intrinsically Motivated Learning in Natural and Artificial Systems.

[57]  Jiwon Kim,et al.  Continual Learning with Deep Generative Replay , 2017, NIPS.

[58]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[59]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[60]  Stanislas Dehaene,et al.  Networks of Formal Neurons and Memory Palimpsests , 1986 .

[61]  Davide Maltoni,et al.  CORe50: a New Dataset and Benchmark for Continuous Object Recognition , 2017, CoRL.

[62]  Ronald Kemker,et al.  Measuring Catastrophic Forgetting in Neural Networks , 2017, AAAI.

[63]  G. Turrigiano Too many cooks? Intrinsic and synaptic homeostatic mechanisms in cortical circuit refinement. , 2011, Annual review of neuroscience.

[64]  Sung Ju Hwang,et al.  Lifelong Learning with Dynamically Expandable Networks , 2017, ICLR.

[65]  D. Hubel,et al.  Cortical and callosal connections concerned with the vertical meridian of visual fields in the cat. , 1967, Journal of neurophysiology.

[66]  Stefan Wermter,et al.  Lifelong learning of human actions with deep neural network self-organization , 2017, Neural Networks.

[67]  T. Kiyota Neurogenesis and Brain Repair , 2017 .

[68]  E. Lenneberg Biological Foundations of Language , 1967 .

[69]  E. Chang,et al.  Human hippocampal neurogenesis drops sharply in children to undetectable levels in adults , 2018, Nature.

[70]  H. Bülthoff,et al.  Merging the senses into a robust percept , 2004, Trends in Cognitive Sciences.

[71]  Stefano Fusi,et al.  Computational principles of synaptic memory consolidation , 2016, Nature Neuroscience.

[72]  Pietro Perona,et al.  A Bayesian approach to unsupervised one-shot learning of object categories , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[73]  Mark H. Johnson,et al.  The computational modelling of sensitive periods , 2006 .

[74]  Honglak Lee,et al.  Online Incremental Feature Learning with Denoising Autoencoders , 2012, AISTATS.

[75]  P. Hagoort,et al.  Development of the Human Cortex and the Concept of “Critical” or “Sensitive” Periods , 2006 .

[76]  A. Cangelosi,et al.  Developmental Robotics: From Babies to Robots , 2015 .

[77]  Stephen Grossberg,et al.  Adaptive Resonance Theory: How a brain learns to consciously attend, learn, and recognize a changing world , 2013, Neural Networks.

[78]  Anthony V. Robins,et al.  Catastrophic forgetting in neural networks: the role of rehearsal mechanisms , 1993, Proceedings 1993 The First New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems.

[79]  M. Hoagland,et al.  Feedback Systems An Introduction for Scientists and Engineers SECOND EDITION , 2015 .

[80]  F. Gage,et al.  New neurons and new memories: how does adult hippocampal neurogenesis affect learning and memory? , 2010, Nature Reviews Neuroscience.

[81]  J B Poline,et al.  Brain imaging of language plasticity in adopted adults: can a second language replace the first? , 2001, NeuroImage.

[82]  P. Dayan,et al.  Flexible shaping: How learning in small steps helps , 2009, Cognition.

[83]  Manuel Schabus,et al.  Sleep transforms the cerebral trace of declarative memories , 2007, Proceedings of the National Academy of Sciences.

[84]  Janet Wiles,et al.  Computational Influence of Adult Neurogenesis on Memory Encoding , 2009, Neuron.

[85]  Nando de Freitas,et al.  Neural Programmer-Interpreters , 2015, ICLR.

[86]  Sergio Gomez Colmenarejo,et al.  Hybrid computing using a neural network with dynamic external memory , 2016, Nature.

[87]  Yoshua Bengio,et al.  An Empirical Investigation of Catastrophic Forgeting in Gradient-Based Neural Networks , 2013, ICLR.

[88]  Andreas Knoblauch,et al.  Structural Synaptic Plasticity Has High Memory Capacity and Can Explain Graded Amnesia, Catastrophic Forgetting, and the Spacing Effect , 2014, PloS one.

[89]  D. Hassabis,et al.  Neuroscience-Inspired Artificial Intelligence , 2017, Neuron.

[90]  C. Nelson Neural plasticity and human development: the role of early experience in sculpting memory systems , 2000 .

[91]  Alex Graves,et al.  Automated Curriculum Learning for Neural Networks , 2017, ICML.

[92]  Vasant Honavar,et al.  Learn++: an incremental learning algorithm for supervised neural networks , 2001, IEEE Trans. Syst. Man Cybern. Part C.

[93]  Dorothy Tse,et al.  Schema-Dependent Gene Activation and Memory Encoding in Neocortex , 2011, Science.

[94]  J B Poline,et al.  Brain imaging of language plasticity in adopted adults: can a second language replace the first? , 2001, NeuroImage.

[95]  Oliver Lemon,et al.  Incremental online learning of objects for robots operating in real environments , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[96]  V. Marchman Constraints on Plasticity in a Connectionist Model of the English Past Tense , 1993, Journal of Cognitive Neuroscience.

[97]  Oriol Vinyals,et al.  Matching Networks for One Shot Learning , 2016, NIPS.

[98]  James L. McClelland,et al.  Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. , 1995, Psychological review.

[99]  C. Spence,et al.  Assessing the Role of the ‘Unity Assumption’ on Multisensory Integration: A Review , 2017, Front. Psychol..

[100]  Haohan Wang,et al.  Multimodal Transfer Deep Learning with Applications in Audio-Visual Recognition , 2014 .

[101]  D. Hubel,et al.  The period of susceptibility to the physiological effects of unilateral eye closure in kittens , 1970, The Journal of physiology.

[102]  Kenneth D. Miller,et al.  The Role of Constraints in Hebbian Learning , 1994, Neural Computation.

[103]  Andrew Y. Ng,et al.  Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[104]  Ping Li,et al.  Early lexical development in a self-organizing neural network , 2004, Neural Networks.

[105]  M. A. Moore,et al.  Neural network models of list learning , 1991 .

[106]  Janet Wiles,et al.  Potential role for adult neurogenesis in the encoding of time in new memories , 2006, Nature Neuroscience.

[107]  Aren Jansen,et al.  Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[108]  F. Gage,et al.  Mechanisms and Functional Implications of Adult Neurogenesis , 2008, Cell.

[109]  M. Boldrini,et al.  Human Hippocampal Neurogenesis Persists throughout Aging. , 2018, Cell stem cell.

[110]  Geoffrey E. Hinton,et al.  Zero-shot Learning with Semantic Output Codes , 2009, NIPS.

[111]  Michael W. Spratling,et al.  Neuroconstructivism - I: How the Brain Constructs Cognition , 2007 .

[112]  Alexander Gepperth,et al.  A Bio-Inspired Incremental Learning Architecture for Applied Perceptual Problems , 2016, Cognitive Computation.

[113]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[114]  Stefan Wermter,et al.  Expectation Learning for Adaptive Crossmodal Stimuli Association , 2018, ArXiv.

[115]  Mehryar Mohri,et al.  AdaNet: Adaptive Structural Learning of Artificial Neural Networks , 2016, ICML.

[116]  S. Risi,et al.  Continual Learning through Evolvable Neural Turing Machines , 2016 .

[117]  Terence D. Sanger,et al.  Neural network learning control of robot manipulators using gradually increasing task difficulty , 1994, IEEE Trans. Robotics Autom..

[118]  Andrea Soltoggio,et al.  Short-term plasticity as cause–effect hypothesis testing in distal reward learning , 2014, Biological Cybernetics.

[119]  Stefan Wermter,et al.  A Neurorobotic Experiment for Crossmodal Conflict Resolution in Complex Environments * , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[120]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[121]  T. Hensch Critical period regulation. , 2004, Annual review of neuroscience.

[122]  Yuxin Peng,et al.  Error-Driven Incremental Learning in Deep Convolutional Neural Network for Large-Scale Image Classification , 2014, ACM Multimedia.

[123]  D. Heeger,et al.  A Hierarchy of Temporal Receptive Windows in Human Cortex , 2008, The Journal of Neuroscience.

[124]  Junmo Kim,et al.  Less-forgetting Learning in Deep Neural Networks , 2016, ArXiv.

[125]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[126]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[127]  F. Gage,et al.  Mammalian neural stem cells. , 2000, Science.

[128]  Surya Ganguli,et al.  Continual Learning Through Synaptic Intelligence , 2017, ICML.

[129]  M. L. Lambon Ralph,et al.  Age of acquisition effects depend on the mapping between representations and the frequency of occurrence: Empirical and computational evidence , 2006 .

[130]  N. Doidge,et al.  Book Review: The Brain That Changes Itself: Stories of Personal Triumph from the Frontiers of Brain Science , 2008 .

[131]  J. Tani Exploring Robotic Minds: Actions, Symbols, and Consciousness as Self-Organizing Dynamic Phenomena , 2016 .

[132]  Pierre-Yves Oudeyer,et al.  Intrinsic motivation, curiosity, and learning: Theory and applications in educational technologies. , 2016, Progress in brain research.

[133]  Conrad D. James,et al.  Neurogenesis Deep Learning , 2016, ISNN 2017.

[134]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[135]  David J Heeger,et al.  Theory of cortical function , 2017, Proceedings of the National Academy of Sciences.

[136]  Joshua B. Tenenbaum,et al.  Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation , 2016, NIPS.

[137]  Davide Maltoni,et al.  Continuous Learning in Single-Incremental-Task Scenarios , 2018, Neural Networks.

[138]  C. Spence Crossmodal spatial attention , 2010, Annals of the New York Academy of Sciences.

[139]  Michael S C Thomas,et al.  The computational modeling of sensitive periods. , 2006, Developmental psychobiology.

[140]  James L. McClelland,et al.  What Learning Systems do Intelligent Agents Need? Complementary Learning Systems Theory Updated , 2016, Trends in Cognitive Sciences.

[141]  Pierre-Yves Oudeyer,et al.  Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning , 2017, J. Mach. Learn. Res..

[142]  R. Miikkulainen Dyslexic and Category-Specific Aphasic Impairments in a Self-Organizing Feature Map Model of the Lexicon , 1997, Brain and Language.

[143]  Xu He,et al.  Overcoming Catastrophic Interference using Conceptor-Aided Backpropagation , 2018, ICLR.

[144]  L. Abbott,et al.  Competitive Hebbian learning through spike-timing-dependent synaptic plasticity , 2000, Nature Neuroscience.

[145]  Leonidas A A Doumas,et al.  A theory of the discovery and predication of relational concepts. , 2008, Psychological review.

[146]  Giorgia Quadrato,et al.  Adult neurogenesis in brain repair: cellular plasticity vs. cellular replacement , 2014, Front. Neurosci..

[147]  D. Lewkowicz Early experience and multisensory perceptual narrowing. , 2014, Developmental psychobiology.

[148]  Stefan Wermter,et al.  An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction , 2017, IEEE Transactions on Cognitive and Developmental Systems.

[149]  Robert M. French,et al.  Pseudo-recurrent Connectionist Networks: An Approach to the 'Sensitivity-Stability' Dilemma , 1997, Connect. Sci..

[150]  R. O’Reilly The Division of Labor Between the Neocortex and Hippocampus , 2010 .

[151]  C. Spence,et al.  The Handbook of Multisensory Processing , 2004 .

[152]  Jan Peters,et al.  Online Learning with Stochastic Recurrent Neural Networks using Intrinsic Motivation Signals , 2017, CoRL.

[153]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[154]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[155]  Oliver Lemon Incremental On-Line Learning of Object Classes using a Combination of Self-Organizing Incremental Neural Networks and Deep Convolutional Neural Networks* , 2016 .

[156]  S. Grossberg How does a brain build a cognitive code , 1980 .

[157]  H T Siegelmann,et al.  The global landscape of cognition: hierarchical aggregation as an organizational principle of human cortical networks and functions , 2015, Scientific Reports.

[158]  D. Lewkowicz,et al.  The multisensory approach to development , 2012 .

[159]  Anthony V. Robins,et al.  Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[160]  Alexei A. Efros,et al.  Curiosity-Driven Exploration by Self-Supervised Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[161]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[162]  D. Grodner,et al.  Curiosity-Driven Development of Tool Use Precursors : a Computational Model , 2019 .

[163]  Stefan Wermter,et al.  Human motion assessment in real time using recurrent self-organization , 2016, 2016 25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN).

[164]  Hongzhi Wang,et al.  Life-long learning based on dynamic combination model , 2017, Appl. Soft Comput..

[165]  C. Shatz Emergence of order in visual system development. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[166]  Mark H. Johnson,et al.  Deviations in the emergence of representations: a neuroconstructivist framework for analysing developmental disorders , 2000 .

[167]  D. Lewkowicz,et al.  Multisensory Processes: A Balancing Act across the Lifespan , 2016, Trends in Neurosciences.

[168]  Marco Mirolli,et al.  Intrinsically Motivated Learning in Natural and Artificial Systems , 2013 .

[169]  Pierre-Yves Oudeyer,et al.  Active learning of inverse models with intrinsically motivated goal exploration in robots , 2013, Robotics Auton. Syst..

[170]  Jonathan D. Power,et al.  Neural plasticity across the lifespan , 2017, Wiley interdisciplinary reviews. Developmental biology.

[171]  Itamar Arel,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 1 Ensemble Learning in Fixed Expansion Layer Network , 2022 .

[172]  Thomas Martinetz,et al.  'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[173]  Mark B. Ring Child: A First Step Towards Continual Learning , 1998, Learning to Learn.

[174]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[175]  Mark S. Seidenberg Connectionist Models in Developmental Cognitive Neuroscience : Critical Periods and the Paradox of Success , 2005 .

[176]  C. Malsburg,et al.  How patterned neural connections can be set up by self-organization , 1976, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[177]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[178]  R Ratcliff,et al.  Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. , 1990, Psychological review.

[179]  Stefan Wermter,et al.  A computational model of crossmodal processing for conflict resolution , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[180]  Laurent Itti,et al.  Biologically plausible learning in neural networks with modulatory feedback , 2017, Neural Networks.

[181]  R. O’Reilly,et al.  Opinion TRENDS in Cognitive Sciences Vol.6 No.12 December 2002 , 2022 .

[182]  R. French Catastrophic Forgetting in Connectionist Networks , 2006 .

[183]  L. Abbott,et al.  Synaptic plasticity: taming the beast , 2000, Nature Neuroscience.

[184]  Alexandros Karatzoglou,et al.  Overcoming Catastrophic Forgetting with Hard Attention to the Task , 2018 .

[185]  Xu Ji,et al.  On the role of neurogenesis in overcoming catastrophic forgetting , 2018, ArXiv.

[186]  Susan M. Barnett,et al.  When and where do we apply what we learn? A taxonomy for far transfer. , 2002, Psychological bulletin.

[187]  G. Ming,et al.  Adult Neurogenesis in the Mammalian Brain: Significant Answers and Significant Questions , 2011, Neuron.

[188]  Derek Hoiem,et al.  Learning without Forgetting , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[189]  Pierre-Yves Oudeyer,et al.  Information-seeking, curiosity, and attention: computational and neural mechanisms , 2013, Trends in Cognitive Sciences.

[190]  A. Senghas,et al.  Children Creating Core Properties of Language: Evidence from an Emerging Sign Language in Nicaragua , 2004, Science.

[191]  Stefan Wermter,et al.  Emotion Recognition from Body Expressions with a Neural Network Architecture , 2017, HAI.

[192]  K. Holyoak,et al.  The analogical mind. , 1997, The American psychologist.

[193]  James L. McClelland,et al.  Generalization Through the Recurrent Interaction of Episodic Memories , 2012, Psychological review.

[194]  C. Braun,et al.  Dynamic organization of the somatosensory cortex induced by motor activity. , 2001, Brain : a journal of neurology.

[195]  A. Borst Seeing smells: imaging olfactory learning in bees , 1999, Nature Neuroscience.

[196]  Tom Schaul,et al.  Unifying Count-Based Exploration and Intrinsic Motivation , 2016, NIPS.

[197]  J. Altman Autoradiographic investigation of cell proliferation in the brains of rats and cats , 1963, The Anatomical record.

[198]  Stefano Soatto,et al.  Critical Learning Periods in Deep Neural Networks , 2017, ArXiv.

[199]  S. Lewandowsky,et al.  10 – Catastrophic interference in neural networks: Causes, solutions, and data , 1995 .

[200]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[201]  E. Capaldi,et al.  The organization of behavior. , 1992, Journal of applied behavior analysis.

[202]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[203]  E. Bienenstock,et al.  Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex , 1982, The Journal of neuroscience : the official journal of the Society for Neuroscience.