Prespeech motor learning in a neural network using reinforcement

Vocal motor development in infancy provides a crucial foundation for language development. Some significant early accomplishments include learning to control the process of phonation (the production of sound at the larynx) and learning to produce the sounds of one's language. Previous work has shown that social reinforcement shapes the kinds of vocalizations infants produce. We present a neural network model that provides an account of how vocal learning may be guided by reinforcement. The model consists of a self-organizing map that outputs to muscles of a realistic vocalization synthesizer. Vocalizations are spontaneously produced by the network. If a vocalization meets certain acoustic criteria, it is reinforced, and the weights are updated to make similar muscle activations increasingly likely to recur. We ran simulations of the model under various reinforcement criteria and tested the types of vocalizations it produced after learning in the different conditions. When reinforcement was contingent on the production of phonated (i.e. voiced) sounds, the network's post-learning productions were almost always phonated, whereas when reinforcement was not contingent on phonation, the network's post-learning productions were almost always not phonated. When reinforcement was contingent on both phonation and proximity to English vowels as opposed to Korean vowels, the model's post-learning productions were more likely to resemble the English vowels and vice versa.

[1]  H. Papoušek,et al.  Forms and functions of vocal matching in interactions between mothers and their precanonical infants , 1989 .

[2]  Pascal Perrier,et al.  A theory of speech motor control and supporting data from speakers with normal hearing and with profound hearing loss , 2000, J. Phonetics.

[3]  M. Yano,et al.  On the effectiveness of whole spectral shape for vowel perception. , 2001, The Journal of the Acoustical Society of America.

[4]  M. Domjan The principles of learning and behavior , 1982 .

[5]  O. Hikosaka Models of information processing in the basal Ganglia edited by James C. Houk, Joel L. Davis and David G. Beiser, The MIT Press, 1995. $60.00 (400 pp) ISBN 0 262 08234 9 , 1995, Trends in Neurosciences.

[6]  Ingo R. Titze,et al.  Principles of voice production , 1994 .

[7]  Mark H Johnson,et al.  The development of the social brain in human infancy , 2007, The European journal of neuroscience.

[8]  A. Barto,et al.  Adaptive Critics and the Basal Ganglia , 1994 .

[9]  Bernard Harmegnies,et al.  A study of style-induced vowel variability: Laboratory versus spontaneous speech in Spanish , 1992, Speech Commun..

[10]  B. de Boysson-Bardies,et al.  A crosslinguistic investigation of vowel formants in babbling , 1989, Journal of Child Language.

[11]  P. Denes On the Motor Theory of Speech Perception , 1965 .

[12]  B. McMurray,et al.  Speaker variability augments phonological processing in early word learning. , 2009, Developmental science.

[13]  I. Titze Nonlinear source-filter coupling in phonation: theory. , 2008, The Journal of the Acoustical Society of America.

[14]  Frank H. Guenther,et al.  Unstable or Insufficiently Activated Internal Models and Feedback-Biased Motor Control as Sources of Dysfluency: A Theoretical Model of Stuttering , 2004 .

[15]  Satrajit S. Ghosh,et al.  Neural modeling and imaging of the cortical interactions underlying syllable production , 2006, Brain and Language.

[16]  E. Izhikevich Solving the distal reward problem through linkage of STDP and dopamine signaling , 2007, BMC Neuroscience.

[17]  Rachel E. Stark STAGES OF SPEECH DEVELOPMENT IN THE FIRST YEAR OF LIFE 1 1This work was supported by NINCDS Grant NS.09628 and by NICHD Grant HD. 11970. , 1980 .

[18]  A. Meltzoff,et al.  Infant vocalizations in response to speech: vocal imitation and developmental change. , 1996, The Journal of the Acoustical Society of America.

[19]  Matthias J. Sjerps,et al.  Speaker Normalization in Speech Perception , 2008, The Handbook of Speech Perception.

[20]  S. Zahorian,et al.  Spectral-shape features versus formants as acoustic correlates for vowels. , 1993, The Journal of the Acoustical Society of America.

[21]  Eric Fosler-Lussier,et al.  Evaluating parameters for mapping adult vowels to imitative babbling , 2009, INTERSPEECH.

[22]  Minoru Asada,et al.  A constructivist approach to infants' vowel acquisition through mother–infant interaction , 2003, Connect. Sci..

[23]  Ruben van de Vijver,et al.  Pisoni, D., Remez, R. (eds.), The handbook of speech perception; Oxford, Blackwell, 2005 , 2009 .

[24]  Sarah-Jayne Blakemore,et al.  The Developing Social Brain: Implications for Education , 2010, Neuron.

[25]  Raymond D. Kent,et al.  Acoustic features of infant vocalic utterances at 3, 6, and 9 months. , 1982, The Journal of the Acoustical Society of America.

[26]  M. Goldstein,et al.  Social Feedback to Infants' Babbling Facilitates Rapid Phonological Learning , 2008, Psychological science.

[27]  Andrew P. King,et al.  Mothers provide differential feedback to infants' prelinguistic sounds , 2006 .

[28]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[29]  Byunggon Yang A comparative study of American English and Korean vowels produced by male and female speakers , 1996 .

[30]  F. Guenther,et al.  A theoretical investigation of reference frames for the planning of speech movements. , 1998, Psychological review.

[31]  Gert Westermann,et al.  Self-production facilitates and adult input interferes in a neural network model of infant vowel imitation. , 2011 .

[32]  Katerina Nicolaidis Acoustic Variability of Vowels in Greek Spontaneous Speech , 2003 .

[33]  Jochen Triesch,et al.  Building a Model of Infant Social Interaction , 2010 .

[34]  Raymond D. Kent,et al.  An auditory-feedback-based neural network model of speech production that is robust to developmental changes in the size and shape of the articulatory system. , 2000, Journal of speech, language, and hearing research : JSLHR.

[35]  Tetsuya Ogata,et al.  Continuous vocal imitation with self-organized vowel spaces in Recurrent Neural Network , 2009, 2009 IEEE International Conference on Robotics and Automation.

[36]  M. Goldstein,et al.  Social interaction shapes babbling: Testing parallels between birdsong and speech , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Kentaro Ishizuka,et al.  Longitudinal developmental changes in spectral peaks of vowels produced by Japanese infants. , 2007, The Journal of the Acoustical Society of America.

[38]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[39]  Pierre-Yves Oudeyer,et al.  The Self-Organization of Speech Sounds , 2005, Journal of theoretical biology.

[40]  P. Boersma Praat : doing phonetics by computer (version 5.1.05) , 2009 .

[41]  M. Vihman Variable paths to early word production , 1993 .

[42]  D. Oller The emergence of the speech capacity , 2000 .

[43]  G. Westermann,et al.  A new model of sensorimotor coupling in the development of speech , 2004, Brain and Language.

[44]  K. Molina-Luna,et al.  Dopamine in Motor Cortex Is Necessary for Skill Learning and Synaptic Plasticity , 2009, PloS one.

[45]  B. de Boysson-Bardies,et al.  Adaptation to language: Evidence from babbling and first words in four languages , 1991 .

[46]  J. Iverson,et al.  The Influence of Mouthing on Infant Vocalization. , 2007, Infancy : the official journal of the International Society on Infant Studies.

[47]  Florien J. Koopmans-van Beinum,et al.  Early Stages in the Development of Speech Movements , 1986 .

[48]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[49]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[50]  D. Oller,et al.  Vibratory regime classification of infant phonation. , 2008, Journal of voice : official journal of the Voice Foundation.