A Computational Model of Infant Speech Development

Almost all theories of child speech development assume that an infant learns speech sounds by direct imitation, acoustically matching adult output to his own speech. Some theories also postulate an innate link between perception and production. We present a computer model which does not require acoustic matching on the part of the infant and which treats speech production and perception as separate processes with no innate link. Instead, we propose that the infant initially explores his speech apparatus and reinforces his own actions on the basis of sensory salience, developing vocal motor schemes [1]. As the infant’s production develops, he will start to generate utterances which are sufficiently speech-like to provoke a linguistic response from his mother. Such interactions are particularly important, because she is better qualified than he is to judge the quality of his speech. Her response to his vocal output is beneficial in a number of ways. Because she is an experienced speaker, her perceptual system can effectively evaluate the infant’s output within the phonological system of the ambient language (L1). Simply generating a salient response will tend to encourage the infant’s production of a given utterance. More significantly, during imitative exchanges in which the mother reformulates the infant’s speech, the infant can use simple associative mechanisms to learn equivalence relations between his motor activity and his mother’s acoustic output, and can thus solve the correspondence problem. Notice that the infant does not learn equivalence relations between his own acoustic output and that of his mother based on acoustic similarity. Any similarity-based matching need only be performed by his mother. We present the results of preliminary experiments and demonstrate that the model is able to progress through two distinct stages of speech development: it begins by generating simple sounds and ends up producing word-like utterances.
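
The associative step described above can be illustrated with a toy sketch. It is not the model's implementation: the motor scheme names, the adult categories, the one-dimensional "acoustic" values and the noise level are all hypothetical, and the infant is assumed to recognise which adult category he hears. What it shows is only that the mother performs the similarity-based matching, while the infant simply counts which of his motor schemes co-occurs with which adult response.

```python
import random
from collections import defaultdict

# Hypothetical toy inventory: each infant motor scheme yields one number
# standing in for a real acoustic feature vector.
MOTOR_SCHEMES = {"m1": 0.9, "m2": 2.1, "m3": 3.2}

# Hypothetical adult categories of the ambient language, with prototype values.
ADULT_CATEGORIES = {"ba": 1.0, "da": 2.0, "ga": 3.0}


def mother_reformulates(infant_acoustics):
    """The mother, not the infant, does the similarity-based matching."""
    return min(ADULT_CATEGORIES,
               key=lambda c: abs(ADULT_CATEGORIES[c] - infant_acoustics))


def learn_associations(n_exchanges=200, seed=0):
    """Count co-occurrences of the infant's motor scheme and the mother's reply."""
    rng = random.Random(seed)
    weights = defaultdict(lambda: defaultdict(float))
    for _ in range(n_exchanges):
        scheme = rng.choice(list(MOTOR_SCHEMES))            # infant acts
        infant_sound = MOTOR_SCHEMES[scheme] + rng.gauss(0.0, 0.1)
        heard = mother_reformulates(infant_sound)           # mother replies
        weights[heard][scheme] += 1.0                       # associative update
    return weights


def imitate(weights, adult_sound):
    """Pick the motor scheme most strongly associated with a heard adult sound."""
    candidates = weights[adult_sound]
    return max(candidates, key=candidates.get)


if __name__ == "__main__":
    w = learn_associations()
    for sound in ADULT_CATEGORIES:
        print(sound, "->", imitate(w, sound))
```

Note that the infant never compares his own acoustic output with his mother's; the only quantities he stores are pairings between his motor schemes and her responses.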

[1] Satrajit S. Ghosh, et al. Neural modeling and imaging of the cortical interactions underlying syllable production, 2006, Brain and Language.

[2] J. Locke, et al. Learning to speak, 1993.

[3] Marilyn M. Vihman, et al. Vocal Motor Schemes, 1987.

[4] K. Markey. The sensorimotor foundations of phonology: a computational model of early childhood articulatory and phonetic development, 1995.

[5] E. Todorov. Direct cortical control of muscle activation in voluntary arm movements: a model, 2000, Nature Neuroscience.

[6] Minoru Asada, et al. A constructivist approach to infants' vowel acquisition through mother–infant interaction, 2003, Connect. Sci.

[7] Ian S. Howard, et al. Learning to control an articulatory synthesizer by imitating real speech, 2005.

[8] Lise Menn, et al. Connectionist Modeling and the Microstructure of Phonological Development: A Progress Report, 1993.

[9] P. Messum. The role of imitation in learning to pronounce, 2008.

[10] Shinji Maeda, et al. Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model, 1990.

[11] D. Wolpert, et al. Central cancellation of self-produced tickle sensation, 1998, Nature Neuroscience.

[12] D. Wolpert, et al. Two Eyes for an Eye: The Neuroscience of Force Escalation, 2003, Science.

[13] P. Kuhl. A new view of language acquisition, 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14] Ian S. Howard, et al. Training a Vocal Tract Synthesiser to imitate speech using Distal Supervised Learning, 2005.

[15] A. Marchal, et al. Speech production and speech modelling, 1990.

[16] Gérard Bailly, et al. Learning to speak. Sensori-motor control of speech movements, 1997, Speech Commun.

[17] A. Meltzoff, et al. Infant vocalizations in response to speech: vocal imitation and developmental change, 1996, The Journal of the Acoustical Society of America.

[18] G. Westermann, et al. A new model of sensorimotor coupling in the development of speech, 2004, Brain and Language.