Computational and Robotic Models of Early Language Development: A Review

We review computational and robotics models of early language learning and development. We first explain why and how these models are used to understand better how children learn language. We argue that they provide concrete theories of language learning as a complex dynamic system, complementing traditional methods in psychology and linguistics. We review different modeling formalisms, grounded in techniques from machine learning and artificial intelligence such as Bayesian and neural network approaches. We then discuss their role in understanding several key mechanisms of language development: cross-situational statistical learning, embodiment, situated social interaction, intrinsically motivated learning, and cultural evolution. We conclude by discussing future challenges for research, including modeling of large-scale empirical data about language acquisition in real-world environments.

[1]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[2]  L. Festinger,et al.  A Theory of Cognitive Dissonance , 2017 .

[3]  R. W. White Motivation reconsidered: the concept of competence. , 1959, Psychological review.

[4]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[5]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[6]  N. A. Bernshteĭn The co-ordination and regulation of movements , 1967 .

[7]  C. Snow Mothers' Speech to Children Learning Language. , 1972 .

[8]  P. Lightbown,et al.  Imitation in language development: If, when, and why , 1974 .

[9]  Susan Carey,et al.  Acquiring a Single New Word , 1978 .

[10]  S. Carey The child as word learner , 1978 .

[11]  S. Pinker Formal models of language learning , 1979, Cognition.

[12]  Elliot Saltzman,et al.  The dynamical perspectives on speech production: Data and theory , 1986 .

[13]  Eve V. Clark,et al.  The principle of contrast: A constraint on language acquisition. , 1987 .

[14]  M. Tomasello The Role of Joint Attentional Processes in Early Language Development. , 1988 .

[15]  E. Markman,et al.  Children's use of mutual exclusivity to constrain the meanings of words , 1988, Cognitive Psychology.

[16]  S. Pinker Learnability and Cognition: The Acquisition of Argument Structure , 1989 .

[17]  W. Merriman,et al.  The mutual exclusivity bias in children's word learning. , 1989, Monographs of the Society for Research in Child Development.

[18]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[19]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[20]  A. Bryk,et al.  Early vocabulary growth: Relation to language input and gender. , 1991 .

[21]  E. Markman Constraints on word learning: Speculations about their nature, origins, and domain specificity. , 1992 .

[22]  Dare A. Baldwin,et al.  Infants' ability to consult the speaker for clues to word reference , 1993, Journal of Child Language.

[23]  Lois Bloom,et al.  Language and Interaction. (Book Reviews: The Transition from Infancy to Language. Acquiring the Power of Expression.) , 1995 .

[24]  C. Mervis,et al.  Early object labels: the case for a developmental lexical principles framework , 1994, Journal of Child Language.

[25]  C. Mervis,et al.  Acquisition of the novel name--nameless category (N3C) principle. , 1994, Child development.

[26]  T. Gelder,et al.  Mind as Motion: Explorations in the Dynamics of Cognition , 1995 .

[27]  J. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings , 1996, Cognition.

[28]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[29]  Jeffrey L. Elman,et al.  Language as a dynamical system , 1996 .

[30]  R. Siegler Emerging Minds: The Process of Change in Children's Thinking , 1996 .

[31]  E. Newport,et al.  WORD SEGMENTATION : THE ROLE OF DISTRIBUTIONAL CUES , 1996 .

[32]  J. Schwartz,et al.  The Dispersion-Focalization Theory of vowel systems , 1997 .

[33]  Luc Steels,et al.  The synthetic modeling of language origins , 1997 .

[34]  Rebecca J. Panagos Meaningful Differences in the Everyday Experience of Young American Children , 1998 .

[35]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[36]  B. Scassellati Imitation and mechanisms of joint attention: a developmental structure for building social skills on a humanoid robot , 1999 .

[37]  N. Akhtar,et al.  Early lexical acquisition: the role of cross-situational learning , 1999 .

[38]  P M Todd,et al.  Précis of Simple heuristics that make us smart , 2000, Behavioral and Brain Sciences.

[39]  Luc Steels,et al.  Aibo''s first words. the social learning of language and meaning. Evolution of Communication , 2002 .

[40]  P. Kuhl A new view of language acquisition. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Luc Steels,et al.  Language games for autonomous robots , 2001 .

[42]  Bart de Boer,et al.  The Origins of Vowel Systems , 2001 .

[43]  P. Niyogi,et al.  Computational and evolutionary aspects of language , 2002, Nature.

[44]  Linda B. Smith,et al.  Development as a dynamic system , 1992, Trends in Cognitive Sciences.

[45]  Ping Li,et al.  Early lexical development in a self-organizing neural network , 2004, Neural Networks.

[46]  Terry Regier,et al.  The Emergence of Words: Attentional Learning in Form and Meaning , 2005, Cogn. Sci..

[47]  S. Suter Meaningful differences in the everyday experience of young American children , 2005, European Journal of Pediatrics.

[48]  Vittorio Loreto,et al.  Journal of Statistical Mechanics: An IOP and SISSA journal Theory and Experiment Sharp transition towardsshared vocabularies in multi-agent systems , 2006 .

[49]  Jun Tani,et al.  Learning Semantic Combinatoriality from the Interaction between Linguistic and Behavioral Processes , 2005, Adapt. Behav..

[50]  L. Steels,et al.  coordinating perceptually grounded categories through language: a case study for colour , 2005, Behavioral and Brain Sciences.

[51]  Erik D. Thiessen,et al.  Infant-Directed Speech Facilitates Word Segmentation. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[52]  D. Roy Grounding words in perception and action: computational insights , 2005, Trends in Cognitive Sciences.

[53]  Katharina J. Rohlfing,et al.  How can multimodal cues from child-directed interaction reduce learning complexity in robots? , 2006, Adv. Robotics.

[54]  Pierre-Yves Oudeyer,et al.  Discovering communication , 2006, Connect. Sci..

[55]  Pierre-Yves Oudeyer,et al.  Self-Organization in the Evolution of Speech , 2006, Oxford Studies in the Evolution of Language.

[56]  Christopher D. Manning,et al.  Probabilistic models of language processing and acquisition , 2006, Trends in Cognitive Sciences.

[57]  G. Dell,et al.  Becoming syntactic. , 2006, Psychological review.

[58]  Lisa Gershkoff-Stowe,et al.  Fast mapping skills in the developing lexicon. , 2007, Journal of speech, language, and hearing research : JSLHR.

[59]  R. Pfeifer,et al.  Self-Organization, Embodiment, and Biologically Inspired Robotics , 2007, Science.

[60]  Fernand Gobet,et al.  Modeling the Developmental Patterning of Finiteness Marking in English, Dutch, German, and Spanish Using MOSAIC , 2007, Cogn. Sci..

[61]  Linda B. Smith,et al.  Rapid Word Learning Under Uncertainty via Cross-Situational Statistics , 2007, Psychological science.

[62]  Marina Nespor,et al.  An interaction between prosody and statistics in the segmentation of fluent speech , 2007, Cognitive Psychology.

[63]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[64]  Pierre-Yves Oudeyer,et al.  In Search of the Neural Circuits of Intrinsic Motivation , 2007, Front. Neurosci..

[65]  Vittorio Loreto,et al.  Cultural route to the emergence of linguistic categories , 2007, Proceedings of the National Academy of Sciences.

[66]  Morten H. Christiansen,et al.  Integration of multiple probabilistic cues in syntax acquisition , 2008 .

[67]  Linda B. Smith,et al.  Infants rapidly learn word-referent mappings via cross-situational statistics , 2008, Cognition.

[68]  W. Bruce Croft,et al.  Language Is a Complex Adaptive System: Position Paper , 2009 .

[69]  Michael C. Frank,et al.  PSYCHOLOGICAL SCIENCE Research Article Using Speakers ’ Referential Intentions to Model Early Cross-Situational Word Learning , 2022 .

[70]  Louis ten Bosch,et al.  Adaptive non-negative matrix factorization in a computational model of language acquisition , 2009, INTERSPEECH.

[71]  Toyoaki Nishida,et al.  Unsupervised simultaneous learning of gestures, actions and their associations for Human-Robot Interaction , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[72]  Pierre-Yves Oudeyer,et al.  On the Impact of Robotics in Behavioral and Cognitive Sciences: From Insect Navigation to Human Cognitive Development , 2010, IEEE Transactions on Autonomous Mental Development.

[73]  Thomas T. Hills,et al.  The Associative Structure of Language: Contextual Diversity in Early Word Learning. , 2010, Journal of memory and language.

[74]  Vittorio Loreto,et al.  Modeling the emergence of universality in color naming patterns , 2009, Proceedings of the National Academy of Sciences.

[75]  Kenny Smith,et al.  Learning Times for Large Lexicons Through Cross-Situational Learning , 2010, Cogn. Sci..

[76]  Afsaneh Fazly,et al.  A Probabilistic Computational Model of Cross-Situational Word Learning , 2010, Cogn. Sci..

[77]  Carey K. Morewedge,et al.  Associative processes in intuitive judgment , 2010, Trends in Cognitive Sciences.

[78]  Giulio Sandini,et al.  In Press, Ieee Transactions on Autonomous Mental Development , 2010 .

[79]  P. Kay The World Color Survey , 2011 .

[80]  Linda B. Smith,et al.  Grounding Word Learning in Space , 2011, PloS one.

[81]  Vittorio Loreto,et al.  Statistical physics of language dynamics , 2011 .

[82]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[83]  Linda B. Smith,et al.  Not your mother's view: the dynamics of toddler visual experience. , 2011, Developmental science.

[84]  L. Gleitman,et al.  How words can and cannot be learned by observation , 2011, Proceedings of the National Academy of Sciences.

[85]  Luc Steels,et al.  Emergent functional grammar for space , 2012 .

[86]  Michael C. Frank,et al.  Exploiting Social Information in Grounded Language Learning via Grammatical Reduction , 2012, ACL.

[87]  Remi van Trijp The evolution of case systems for marking event structure , 2012 .

[88]  Vittorio Loreto,et al.  On the origin of the hierarchy of color names , 2012, Proceedings of the National Academy of Sciences.

[89]  Charles Yang,et al.  Computational models of syntactic acquisition. , 2012, Wiley interdisciplinary reviews. Cognitive science.

[90]  Chen Yu,et al.  Modeling cross-situational word-referent learning: prior questions. , 2012, Psychological review.

[91]  Michael Spranger The co-evolution of basic spatial terms and categories , 2012 .

[92]  Laura L. Namy,et al.  Detailed Behavioral Analysis as a Window Into Cross-Situational Word Learning , 2012, Cogn. Sci..

[93]  R. Shiffrin,et al.  An associative model of adaptive inference for learning word–referent mappings , 2012, Psychonomic bulletin & review.

[94]  E. Newport,et al.  Science Current Directions in Psychological Statistical Learning : from Acquiring Specific Items to Forming General Rules on Behalf Of: Association for Psychological Science , 2022 .

[95]  George Kachergis,et al.  Learning Nouns with Domain-General Associative Learning Mechanisms , 2012, CogSci.

[96]  Larissa K. Samuelson,et al.  Word learning emerges from the interaction of online referent selection and slow associative learning. , 2012, Psychological review.

[97]  Paul Vogt Exploring the Robustness of Cross-Situational Learning Under Zipfian Distributions , 2012, Cogn. Sci..

[98]  Matthieu Geist,et al.  A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization , 2012, IEEE Journal of Selected Topics in Signal Processing.

[99]  Charles Kemp,et al.  Kinship Categories Across Languages Reflect General Communicative Principles , 2012, Science.

[100]  S. Hidaka A Computational Model Associating Learning Process, Word Attributes, and Age of Acquisition , 2013, PloS one.

[101]  Chen Yu,et al.  Actively Learning Object Names Across Ambiguous Situations , 2013, Top. Cogn. Sci..

[102]  Gert Westermann,et al.  Prespeech motor learning in a neural network using reinforcement , 2013, Neural Networks.

[103]  Sharon Goldwater,et al.  A role for the developing lexicon in phonetic category acquisition. , 2013, Psychological review.

[104]  Jochen Triesch,et al.  Imitation learning based on an intrinsic motivation mechanism for efficient coding , 2013, Front. Psychol..

[105]  L. Gleitman,et al.  Propose but verify: Fast mapping meets cross-situational word learning , 2013, Cognitive Psychology.

[106]  Michael C. Frank,et al.  Social and Discourse Contributions to the Determination of Reference in Cross-Situational Word Learning , 2013 .

[107]  Marco Mirolli,et al.  Intrinsically Motivated Learning in Natural and Artificial Systems , 2013 .

[108]  Pierre-Yves Oudeyer,et al.  From Language to Motor Gavagai: Unified Imitation Learning of Multiple Linguistic and Nonlinguistic Sensorimotor Skills , 2013, IEEE Transactions on Autonomous Mental Development.

[109]  Anne S. Warlaumont,et al.  Salience-based reinforcement of a spiking neural network leads to increased syllable production , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[110]  Michael S. C. Thomas,et al.  Modelling mechanisms of persisting and resolving delay in language development , 2012 .

[111]  Angelo Cangelosi,et al.  Making fingers and words count in a cognitive robot , 2013, Front. Behav. Neurosci..

[112]  Gert Westermann,et al.  From perceptual to language-mediated categorization , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[113]  S. Kirby,et al.  Iterated learning and the evolution of language , 2014, Current Opinion in Neurobiology.

[114]  S. Piantadosi Zipf’s word frequency law in natural language: A critical review and future directions , 2014, Psychonomic Bulletin & Review.

[115]  Morten H. Christiansen,et al.  Prospects for usage-based computational models of grammatical development: argument structure and semantic roles. , 2014, Wiley interdisciplinary reviews. Cognitive science.

[116]  Pierre-Yves Oudeyer,et al.  Self-organization of early vocal development in infants and machines: the role of intrinsic motivation , 2014, Front. Psychol..

[117]  Pierre-Yves Oudeyer,et al.  MCA-NMF: Multimodal Concept Acquisition with Non-Negative Matrix Factorization , 2015, PloS one.

[118]  Angelo Cangelosi,et al.  Posture Affects How Robots and Infants Map Words to Objects , 2015, PloS one.

[119]  Ramon Ferrer-i-Cancho,et al.  Kauffman's adjacent possible in word order evolution , 2015, ArXiv.

[120]  Angelo Cangelosi,et al.  Embodied language and number learning in developmental robots , 2015 .

[121]  庭野 賀津子 Infant-directed speechの研究の動向と展望 , 2015 .

[122]  S. Kirby,et al.  Compression and communication in the cultural evolution of linguistic structure , 2015, Cognition.

[123]  Michael C. Frank,et al.  An integrative account of constraints on cross-situational learning , 2015, Cognition.

[124]  Kenny Smith,et al.  Word learning under infinite uncertainty , 2014, Cognition.

[125]  Michael S. C. Thomas,et al.  Multiscale Modeling of Gene-Behavior Associations in an Artificial Neural Network Model of Cognitive Development , 2016, Cogn. Sci..

[126]  Pierre-Yves Oudeyer,et al.  An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames , 2016, Front. Psychol..

[127]  William Schueller,et al.  Active Control of Complexity Growth in Naming Games: Hearer's Choice , 2016 .

[128]  Mike Frank,et al.  From uh-oh to tomorrow: Predicting age of acquisition for early words across languages , 2016, CogSci.

[129]  Michael C. Frank,et al.  Wordbank: an open repository for developmental vocabulary data* , 2016, Journal of Child Language.

[130]  Angelo Cangelosi,et al.  Children’s referent selection and word learning:insights from a developmental robotic system , 2016 .

[131]  Alejandrina Cristia,et al.  HomeBank: An Online Repository of Daylong Child-Centered Audio Recordings , 2016, Seminars in Speech and Language.

[132]  Pierre-Yves Oudeyer,et al.  How Evolution May Work Through Curiosity-Driven Developmental Process , 2016, Top. Cogn. Sci..

[133]  Katherine Twomey,et al.  Computational models of word learning , 2017 .

[134]  Pierre-Yves Oudeyer,et al.  A Unified Model of Speech and Tool Use Early Development , 2017, CogSci.

[135]  Nathaniel J. Smith,et al.  Bootstrapping language acquisition , 2017, Cognition.

[136]  George Kachergis,et al.  Quantifying the impact of active choice in word learning , 2017, CogSci.

[137]  Peter Bossaerts,et al.  Computational Complexity and Human Decision-Making , 2017, Trends in Cognitive Sciences.

[138]  Peter Ford Dominey,et al.  Narrative Constructions for the Organization of Self Experience: Proof of Concept via Embodied Robotics , 2017, Front. Psychol..

[139]  Francis Mollica,et al.  How Data Drive Early Word Learning: A Cross-Linguistic Waiting Time Analysis , 2017, Open Mind.

[140]  Caroline F. Rowland,et al.  Diversity not quantity in caregiver speech: Using computational modeling to isolate the effects of the quantity and the diversity of the input on vocabulary growth , 2017, Cognitive Psychology.

[141]  John P Spencer,et al.  Moving Word Learning to a Novel Space: A Dynamic Systems View of Referent Selection and Retention. , 2017, Cognitive science.

[142]  Karl J. Friston,et al.  Active Inference, Curiosity and Insight , 2017, Neural Computation.

[143]  Pierre-Yves Oudeyer,et al.  Computational Theories of Curiosity-Driven Learning , 2018, ArXiv.

[144]  Chen Yu,et al.  Observing and Modeling Developing Knowledge and Uncertainty During Cross-Situational Word Learning , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[145]  Anna-Lisa Vollmer,et al.  On Studying Human Teaching Behavior with Robots: a Review , 2018 .

[146]  Linda B. Smith,et al.  The Developing Infant Creates a Curriculum for Statistical Learning , 2018, Trends in Cognitive Sciences.

[147]  A. Cangelosi,et al.  From Babies to Robots: The Contribution of Developmental Robotics to Developmental Psychology , 2018 .

[148]  David Filliat,et al.  Comparison Studies on Active Cross-Situational Object-Word Learning Using Non-Negative Matrix Factorization and Latent Dirichlet Allocation , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[149]  Emmanuel Dupoux,et al.  Cognitive science in the era of artificial intelligence: A roadmap for reverse-engineering the infant language-learner , 2016, Cognition.