The Next Big Thing(s) in Unsupervised Machine Learning: Five Lessons from Infant Learning

After a surge in popularity of supervised Deep Learning, the desire to reduce the dependence on curated, labelled data sets and to leverage the vast quantities of unlabelled data available recently triggered renewed interest in unsupervised learning algorithms. Despite a significantly improved performance due to approaches such as the identification of disentangled latent representations, contrastive learning, and clustering optimisations, the performance of unsupervised machine learning still falls short of its hypothesised potential. Machine learning has previously taken inspiration from neuroscience and cognitive science with great success. However, this has mostly been based on adult learners with access to labels and a vast amount of prior knowledge. In order to push unsupervised machine learning forward, we argue that developmental science of infant cognition might hold the key to unlocking the next generation of unsupervised learning approaches. Conceptually, human infant learning is the closest biological parallel to artificial unsupervised learning, as infants too must learn useful representations from unlabelled data. In contrast to machine learning, these new representations are learned rapidly and from relatively few examples. Moreover, infants learn robust representations that can be used flexibly and efficiently in a number of different tasks and contexts. We identify five crucial factors enabling infants' quality and speed of learning, assess the extent to which these have already been exploited in machine learning, and propose how further adoption of these factors can give rise to previously unseen performance levels in unsupervised learning.

[1]  Alexei A. Efros,et al.  What makes ImageNet good for transfer learning? , 2016, ArXiv.

[2]  Manos Tsakiris,et al.  Neurobehavioral evidence of interoceptive sensitivity in early infancy , 2017, eLife.

[3]  Katherine S White,et al.  Read my lips: Visual speech influences word processing in infants , 2017, Cognition.

[4]  C. Spence,et al.  Spatial localization of touch in the first year of life: early influence of a visual spatial code and the development of remapping across changes in limb position. , 2008, Journal of experimental psychology. General.

[5]  A. Henik,et al.  The Itsy Bitsy Spider , 2019 .

[6]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[7]  M. Paulus How and why do infants imitate? An ideomotor approach to social and imitative learning in infancy (and beyond) , 2014, Psychonomic bulletin & review.

[8]  Dietrich Klakow,et al.  Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods , 2019, J. Artif. Intell. Res..

[9]  Sabine Hunnius,et al.  Sensitivity to structure in action sequences: An infant event-related potential study , 2017, Neuropsychologia.

[10]  Benjamin Recht,et al.  Do ImageNet Classifiers Generalize to ImageNet? , 2019, ICML.

[11]  Tarek R. Besold,et al.  A match does not make a sense: on the sufficiency of the comparator model for explaining the sense of agency , 2019, Neuroscience of consciousness.

[12]  Brenden M. Lake,et al.  Learning Inductive Biases with Simple Neural Networks , 2018, CogSci.

[13]  Linda B. Smith Article with Peer Commentaries and Response Do Infants Possess Innate Knowledge Structures? the Con Side the Nativist's Argument , 2022 .

[14]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[15]  Conrad D. James,et al.  Neurogenesis deep learning: Extending deep networks to accommodate new classes , 2016, 2017 International Joint Conference on Neural Networks (IJCNN).

[16]  Ryszard S. Michalski,et al.  A theory and methodology of inductive learning , 1993 .

[17]  Yasuhiro Kanakogi,et al.  The integration of audio−tactile information is modulated by multimodal social interaction with physical contact in infancy , 2017, Developmental Cognitive Neuroscience.

[18]  Pontus Loviken,et al.  Prerequisites for an Artificial Self , 2020, Frontiers in Neurorobotics.

[19]  A. Meltzoff,et al.  Explaining Facial Imitation: A Theoretical Model. , 1997, Early development & parenting.

[20]  Alex Graves,et al.  Automated Curriculum Learning for Neural Networks , 2017, ICML.

[21]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[22]  Stefano Soatto,et al.  Critical Learning Periods in Deep Networks , 2018, ICLR.

[23]  Detection of sensorimotor contingencies in infants before the age of 1 year: A comprehensive review. , 2020, Developmental psychology.

[24]  Karen E Adolph,et al.  Transition from crawling to walking and infants' actions with objects and people. , 2011, Child development.

[25]  Wei Ji Ma,et al.  A neural network walks into a lab: towards using deep nets as models for human behavior , 2020, ArXiv.

[26]  Geoffrey E. Hinton Using fast weights to deblur old memories , 1987 .

[27]  D. Lewkowicz,et al.  The development of intersensory temporal perception: an epigenetic systems/limitations view. , 2000, Psychological bulletin.

[28]  Dare A. Baldwin,et al.  Evidence for ‘motionese’: modifications in mothers’ infant-directed action , 2002 .

[29]  E. Kymissis,et al.  Generalized vocal imitation in infants. , 1991, Journal of experimental child psychology.

[30]  Alexander Gepperth,et al.  Active learning of local predictable representations with artificial curiosity , 2015, 2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[31]  S. Sokol,et al.  Measurement of infant visual acuity from pattern reversal evoked potentials , 1978, Vision Research.

[32]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  A. Bigelow,et al.  The role of joint attention in the development of infants' play with objects. , 2004, Developmental science.

[34]  A. Meltzoff,et al.  What imitation tells us about social cognition: a rapprochement between developmental psychology and cognitive neuroscience. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[35]  S. Pauen,et al.  Do infants associate spiders and snakes with fearful facial expressions , 2017 .

[36]  D. Lewkowicz,et al.  Three‐month‐old infants learn arbitrary auditory–visual pairings between voices and faces , 2001 .

[37]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[38]  Re-evaluating the neonatal imitation hypothesis. , 2018, Developmental science.

[39]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[40]  Jessica Maye,et al.  Infant sensitivity to distributional information can affect phonetic discrimination , 2002, Cognition.

[41]  J. DiCarlo,et al.  Using goal-driven deep learning models to understand sensory cortex , 2016, Nature Neuroscience.

[42]  Tom M. Mitchell,et al.  The Need for Biases in Learning Generalizations , 2007 .

[43]  Katharina J. Rohlfing,et al.  How can multimodal cues from child-directed interaction reduce learning complexity in robots? , 2006, Adv. Robotics.

[44]  H. I. Day,et al.  Curiosity and the Interested Explorer. , 1982 .

[45]  Karen E. Smith,et al.  Responsive parenting: establishing early foundations for social, communication, and independent problem-solving skills. , 2006, Developmental psychology.

[46]  Kasey C. Soska,et al.  Head-mounted eye tracking: a new method to describe infant looking. , 2011, Child development.

[47]  Daniel L. K. Yamins,et al.  Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation , 2018, CogSci.

[48]  J. Mehler,et al.  Sounds and silence: An optical topography study of language recognition at birth , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[49]  M. Tomasello The Role of Joint Attentional Processes in Early Language Development. , 1988 .

[50]  Pierre Baldi,et al.  Autoencoders, Unsupervised Learning, and Deep Architectures , 2011, ICML Unsupervised and Transfer Learning.

[51]  J. Guillemot,et al.  Temporary Deafness Can Impair Multisensory Integration , 2013, Psychological science.

[52]  E. Newport MOTHERESE: THE SPEECH OF MOTHERS TO YOUNG CHILDREN. , 1975 .

[53]  Pierre-Yves Oudeyer,et al.  Object Learning Through Active Exploration , 2014, IEEE Transactions on Autonomous Mental Development.

[54]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[55]  S. Landry,et al.  Early maternal and child influences on children's later independent cognitive and social functioning. , 2000, Child development.

[56]  Linda B. Smith,et al.  What's in View for Toddlers? Using a Head Camera to Study Visual Experience. , 2008, Infancy : the official journal of the International Society on Infant Studies.

[57]  Scott P. Johnson,et al.  Visual statistical learning in infancy: evidence for a domain general learning mechanism , 2002, Cognition.

[58]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[59]  Linda B. Smith,et al.  From faces to hands: Changing visual input in the first two years , 2016, Cognition.

[60]  Stephen Grossberg,et al.  The ART of adaptive pattern recognition by a self-organizing neural network , 1988, Computer.

[61]  D. A. Caruso Dimensions of quality in infants' exploratory behavior: Relationships to problem-solving ability , 1993 .

[62]  Guillaume Desjardins,et al.  Understanding disentangling in $\beta$-VAE , 2018, 1804.03599.

[63]  Bria Long,et al.  Neural dynamics of prediction and surprise in infants , 2015, Nature Communications.

[64]  Sebastian Ruder,et al.  An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.

[65]  Joshua B. Tenenbaum,et al.  Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[66]  D. Sobel,et al.  Infants Track the Reliability of Potential Informants , 2014, Psychological science.

[67]  Lara Bardi,et al.  The processing of social stimuli in early infancy: from faces to biological motion perception. , 2011, Progress in brain research.

[68]  Anthony M. Zador,et al.  A critique of pure learning and what artificial neural networks can learn from animal brains , 2019, Nature Communications.

[69]  Gunilla Stenberg Why do Infants Look at and Use Positive Information from Some Informants Rather Than Others in Ambiguous Situations? , 2012, Infancy : the official journal of the International Society on Infant Studies.

[70]  Paavo Alku,et al.  Statistical language learning in neonates revealed by event-related brain potentials , 2009, BMC Neuroscience.

[71]  Lorijn Zaadnoordijk,et al.  From movement to action: An EEG study into the emerging sense of agency in early infancy , 2020, Developmental Cognitive Neuroscience.

[72]  Kristinn R. Thórisson,et al.  The Pedagogical Pentagon: A Conceptual Framework for Artificial Pedagogy , 2017, AGI.

[73]  L. S. Vygotskiĭ,et al.  Mind in society : the development of higher psychological processes , 1978 .

[74]  Matthew B. Blaschko,et al.  Encoder Based Lifelong Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[75]  R. Cusack,et al.  Why does language not emerge until the second year? , 2018, Hearing Research.

[76]  E. W. Ames,et al.  A multifactor model of infant preferences for novel and familiar stimuli. , 1988 .

[77]  E. Spelke Perceiving Bimodally Specified Events in Infancy , 1979 .

[78]  Yoshua Bengio,et al.  Deep Learning of Representations for Unsupervised and Transfer Learning , 2011, ICML Unsupervised and Transfer Learning.

[79]  Katharina J. Rohlfing,et al.  Computational Analysis of Motionese Toward Scaffolding Robot Action Learning , 2009, IEEE Transactions on Autonomous Mental Development.

[80]  Pierre-Yves Oudeyer,et al.  Towards a neuroscience of active sampling and curiosity , 2018, Nature Reviews Neuroscience.

[81]  G. Gredebäck,et al.  Infants Distinguish Between Two Events Based on Their Relative Likelihood. , 2018, Child development.

[82]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[83]  S. Sloman,et al.  Learning Causal Structure , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[84]  H. Bekkering,et al.  Nine-month-old infants update their predictive models of a changing environment , 2019, Developmental Cognitive Neuroscience.

[85]  J. Belsky,et al.  Maternal stimulation and infant exploratory competence: cross-sectional, correlational, and experimental analyses. , 1980, Child development.

[86]  Diane Poulin-Dubois,et al.  Infants prefer to imitate a reliable person. , 2011, Infant behavior & development.

[87]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[88]  R Devon Hjelm,et al.  Learning Representations by Maximizing Mutual Information Across Views , 2019, NeurIPS.

[89]  R. Gómez,et al.  The Developmental Trajectory of Nonadjacent Dependency Learning. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[90]  Yukie Nagai,et al.  Learning to grasp with parental scaffolding , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[91]  Erik D. Thiessen,et al.  Infant-Directed Speech Facilitates Word Segmentation. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[92]  Martha White,et al.  Meta-Learning Representations for Continual Learning , 2019, NeurIPS.

[93]  R. Aslin,et al.  Preference for infant-directed speech in the first month after birth. , 1990, Child development.

[94]  Ronald Kemker,et al.  Measuring Catastrophic Forgetting in Neural Networks , 2017, AAAI.

[95]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[96]  Kasey C. Soska,et al.  Postural position constrains multimodal object exploration in infants. , 2014, Infancy : the official journal of the International Society on Infant Studies.

[97]  Two‐ to three‐month‐old infants prefer moving face patterns to moving top‐heavy patterns , 2013 .

[98]  Jürgen Schmidhuber,et al.  Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes , 2008, ABiALS.

[99]  V. D. de Sa Category learning through multimodality sensing. , 1998, Neural computation.

[100]  Jean-Francois Mangin,et al.  Hearing Faces: How the Infant Brain Matches the Face It Sees with the Speech It Hears , 2009, Journal of Cognitive Neuroscience.

[101]  Yang Feng,et al.  Unsupervised Image Captioning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[102]  Qiang Yang,et al.  An Overview of Multi-task Learning , 2018 .

[103]  Harold Bekkering,et al.  What are you doing? How active and observational experience shape infants' action understanding , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[104]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[105]  Pierre-Yves Oudeyer,et al.  Computational Theories of Curiosity-Driven Learning , 2018, ArXiv.

[106]  Albert Yonas,et al.  Potential downside of high initial visual acuity , 2018, Proceedings of the National Academy of Sciences.

[107]  J. Bruner,et al.  The role of tutoring in problem solving. , 1976, Journal of child psychology and psychiatry, and allied disciplines.

[108]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[109]  Daniel D. Dilks,et al.  Organization of high-level visual cortex in human infants , 2017, Nature Communications.

[110]  George Trigeorgis,et al.  End-to-End Multimodal Emotion Recognition Using Deep Neural Networks , 2017, IEEE Journal of Selected Topics in Signal Processing.

[111]  Moritz M. Daum,et al.  Detection of visual–tactile contingency in the first year after birth , 2011, Cognition.

[112]  R. French Catastrophic forgetting in connectionist networks , 1999, Trends in Cognitive Sciences.

[113]  Yoshua Bengio,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[114]  J. Elman Learning and development in neural networks: the importance of starting small , 1993, Cognition.

[115]  Petros Maragos,et al.  Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention , 2013, IEEE Transactions on Multimedia.

[116]  Richard N Aslin,et al.  The Goldilocks effect in infant auditory attention. , 2014, Child development.

[117]  R. Cusack,et al.  Anatomical correlates of category-selective visual regions have distinctive signatures of connectivity in neonates , 2019, Developmental Cognitive Neuroscience.

[118]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[119]  Andrew M. Saxe,et al.  If deep learning is the answer, then what is the question? , 2020, 2004.07580.

[120]  F. Turkheimer,et al.  Emergence of resting state networks in the preterm human brain , 2010, Proceedings of the National Academy of Sciences.

[121]  Herbert Jaeger,et al.  The''echo state''approach to analysing and training recurrent neural networks , 2001 .

[122]  Gentaro Taga,et al.  Initial-state dependency of learning in young infants. , 2011, Human movement science.

[123]  Linda B. Smith,et al.  The Developing Infant Creates a Curriculum for Statistical Learning , 2018, Trends in Cognitive Sciences.

[124]  Anthony V. Robins,et al.  Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[125]  Patrick Shafto,et al.  Computational models of development, social influences , 2016, Current Opinion in Behavioral Sciences.

[126]  Gordon Cheng,et al.  Sensorimotor learning for artificial body perception , 2018, IROS 2018.

[127]  Tillman Weyde,et al.  Towards a Multimodal Time-Based Empathy Prediction System , 2019, 2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019).

[128]  M. Bedny Evidence from Blindness for a Cognitively Pluripotent Cortex , 2017, Trends in Cognitive Sciences.

[129]  E. Spelke Initial knowledge: six suggestions , 1994, Cognition.

[130]  Velma Dobson,et al.  Visual acuity in human infants: A review and comparison of behavioral and electrophysiological studies , 1978, Vision Research.

[131]  D. Berlyne Conflict, arousal, and curiosity , 2014 .

[132]  Phillip Isola,et al.  Contrastive Multiview Coding , 2019, ECCV.

[133]  R. Aslin What's in a look? , 2007, Developmental science.

[134]  Monica Gori,et al.  Auditory and proprioceptive spatial impairments in blind children and adults. , 2017, Developmental science.

[135]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[136]  Katherine E. Twomey,et al.  Curiosity‐based learning in infants: a neurocomputational approach , 2017, Developmental science.

[137]  P. Mundy,et al.  CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE Attention, Joint Attention, and Social Cognition , 2022 .

[138]  Yoshua Bengio,et al.  Unsupervised and Transfer Learning Challenge: a Deep Learning Approach , 2011, ICML Unsupervised and Transfer Learning.

[139]  Joëlle Provasi,et al.  Do 9- and 12-month-olds learn means-ends relation by observing? , 2001 .

[140]  L. Singh,et al.  Influences of Infant-Directed Speech on Early Word Recognition. , 2009, Infancy : the official journal of the International Society on Infant Studies.

[141]  B. Elsner Infants' imitation of goal-directed actions: the role of movements and action effects. , 2007, Acta psychologica.

[142]  Tomoaki Nakamura,et al.  Multimodal object categorization by a robot , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[143]  Richard N. Aslin,et al.  The Goldilocks Effect: Human Infants Allocate Attention to Visual Sequences That Are Neither Too Simple Nor Too Complex , 2012, PloS one.

[144]  Chrystopher L. Nehaniv,et al.  Teaching robot companions: the role of scaffolding and event structuring , 2008, Connect. Sci..

[145]  Lisa M. Oakes,et al.  Manual object exploration and learning about object features in human infants , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[146]  J. Tenenbaum,et al.  Infants consider both the sample and the sampling process in inductive generalization , 2010, Proceedings of the National Academy of Sciences.

[147]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[148]  E. Kymissis,et al.  The history of imitation in learning theory: the language acquisition process. , 1990, Journal of the experimental analysis of behavior.

[149]  S. Chien No more top-heavy bias: infants and adults prefer upright faces but not top-heavy geometric or face-like patterns. , 2011, Journal of vision.

[150]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[151]  Ryan A. Stevenson,et al.  Multisensory Integration in Cochlear Implant Recipients , 2017, Ear and hearing.

[152]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[153]  Victor K. Han,et al.  Disruption to functional networks in neonates with perinatal brain injury predicts motor skills at 8 months☆ , 2017, NeuroImage: Clinical.

[154]  H. Bekkering,et al.  Infants differentially update their internal models of a dynamic environment , 2019, Cognition.

[155]  N. Kirkham,et al.  Learning to look: probabilistic variation and noise guide infants' eye movements. , 2013, Developmental science.

[156]  S. Hunnius,et al.  Motion tracking of parents’ infant‐ versus adult‐directed actions reveals general and action‐specific modulations , 2019, Developmental science.

[157]  C. Heyes,et al.  Imitation in infancy: the wealth of the stimulus. , 2011, Developmental science.

[158]  A. Bremner Developing body representations in early life: combining somatosensation and vision to perceive the interface between the body and the world , 2016, Developmental medicine and child neurology.

[159]  Kopparti Radha,et al.  Weight Priors for Learning Identity Relations , 2020, ArXiv.

[160]  M. Lobo,et al.  Not just playing around: infants' behaviors with objects reflect ability, constraints, and object properties. , 2014, Infant behavior & development.

[161]  M. Botvinick,et al.  Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective , 2009, Cognition.

[162]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[163]  J. Werker,et al.  Listening to language at birth: evidence for a bias for speech in neonates. , 2007, Developmental science.

[164]  Matthias Bethge,et al.  Engineering a Less Artificial Intelligence , 2019, Neuron.

[165]  Stefanie Hoehl,et al.  Itsy Bitsy Spider…: Infants React with Increased Arousal to Spiders and Snakes , 2017, Front. Psychol..

[166]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[167]  Nico Blodow,et al.  Combined 2D–3D categorization and classification for multimodal perception systems , 2011, Int. J. Robotics Res..

[168]  Tillman Weyde,et al.  Modelling Identity Rules with Neural Networks , 2018, FLAP.

[169]  Kathleen H. Corriveau,et al.  The Theoretical and Methodological Opportunities Afforded by Guided Play With Young Children , 2018, Front. Psychol..

[170]  Peter M. Vishton,et al.  Rule learning by seven-month-old infants. , 1999, Science.

[171]  Ronald Kemker,et al.  FearNet: Brain-Inspired Model for Incremental Learning , 2017, ICLR.

[172]  Linda B. Smith,et al.  The Faces in Infant-Perspective Scenes Change over the First Year of Life , 2015, PloS one.

[173]  F. Simion,et al.  Can a Nonspecific Bias Toward Top-Heavy Patterns Explain Newborns' Face Preference? , 2004, Psychological science.

[174]  P. Rochat,et al.  Perceived self in infancy , 2000 .

[175]  Guillaume Desjardins,et al.  Understanding disentangling in β-VAE , 2018, ArXiv.

[176]  Olivier Sigaud,et al.  Deep unsupervised network for multimodal perception, representation and classification , 2015, Robotics Auton. Syst..

[177]  D. Berlyne,et al.  An experimental study of human curiosity. , 1954, British journal of psychology.

[178]  Andrew Zisserman,et al.  Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[179]  Harold Bekkering,et al.  The early development of object knowledge: a study of infants' visual anticipations during action observation. , 2010, Developmental psychology.

[180]  Daniel D. Dilks,et al.  Connectivity at the origins of domain specificity in the cortical face and place networks , 2020, Proceedings of the National Academy of Sciences.

[181]  Willem H. Zuidema,et al.  Pre-Wiring and Pre-Training: What Does a Neural Network Need to Learn Truly General Identity Rules? , 2018, CoCo@NIPS.

[182]  M. Haith Who put the cog in infant cognition ? Is rich interpretation too costly ? , 1998 .

[183]  Shimon Ullman,et al.  From simple innate biases to complex visual concepts , 2012, Proceedings of the National Academy of Sciences.

[184]  Kaiming He,et al.  Rethinking ImageNet Pre-Training , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[185]  Elissa L. Newport,et al.  Critical thinking about critical periods: Perspectives on a critical period for language acquisition. , 2001 .

[186]  Burr Settles,et al.  From Theories to Queries: Active Learning in Practice , 2011 .