The emergence of phonology from the interplay of speech comprehension and production ; A distributed connectionist approach

How do infantslearnto understandandproducespoken language?Despitedecadesof intensi ve investigation,the answerto this questionremainslargely a mystery. This is, in part, because, althoughthe use of languageseemsstraightforward to adultnative speakers,speechrecognitionandproductionpresentthe infantwith numerousdifficult computational problems(seeLively, Pisoni,& Goldinger , 1994).First of all, processingspeechis difficult bothbecauseit is extendedin time andbecauseit is subjectto considerable variability acrossspeakersandcontexts. Moreover, even with an accuraterepresentationof the underlying phoneticcontentof a heardutterance,mappingthis representationontoits meaningis difficult because the relationshipof spoken wordsto their meaningsis essentiallyarbitrary. On the outputside,the infant mustlearnto producecomprehensiblespeechin theabsenceof any directfeedbackfrom caretakersor the environmentasto whatarticulatorymovementsarerequiredto produceparticularsoundpatterns.Finally, theprocessesof learningto understandspeechandlearningto produceit mustbecloselyrelated(although certainly not synchronized;Benedict,1979) to ensurethat they eventually settleon mutually consistent solutions. In the currentwork, we formulatea generalframework for understandinghow the infant surmounts thesechallenges, andwepresentacomputational simulationof theframework thatlearnsto understandand producespoken wordsin the absenceof explicit articulatoryfeedback.Our initial focusis on addressing the relevant computationalissues;we postponeconsiderationof how the approachaccountsfor specific empiricalphenomenauntil theGeneralDiscussion. The framework, depictedin abstractform in Figure 1, is basedon connectionist/parall e distributed

[1]  L. Menn,et al.  Phonological development : models, research, implications , 1994 .

[2]  David B. Pisoni,et al.  Spoken word recognition: Research and theory. , 1994 .

[3]  P. Kuhl Perception of auditory equivalence classes for speech in early infancy , 1983 .

[4]  Linda B. Smith,et al.  A Dynamic Systems Approach to the Development of Cognition and Action , 2007, Journal of Cognitive Neuroscience.

[5]  H. Benedict,et al.  Early lexical development: comprehension and production , 1979, Journal of Child Language.

[6]  Peter F. MacNeilage,et al.  Acquisition of Speech Production: The Achievement of Segmental Independence , 1990 .

[7]  P. Ladefoged A course in phonetics , 1975 .

[8]  L. Roug,et al.  Phonetic development in early infancy: a study of four Swedish children during the first eighteen months of life , 1989, Journal of Child Language.

[9]  Raymond D. Kent,et al.  Evidence and Role of Rhythmic Organization in Early Vocal Development in Human Infants , 1991 .

[10]  Michael I. Jordan Constrained supervised learning , 1992 .

[11]  C. A. Ferguson,et al.  WORDS AND SOUNDS IN EARLY LANGUAGE ACQUISITION , 1975 .

[12]  P. MacNeilage,et al.  The articulatory basis of babbling. , 1995, Journal of speech and hearing research.

[13]  J. Perkell,et al.  Invariance and variability in speech processes , 1987 .

[14]  Barak A. Pearlmutter Learning State Space Trajectories in Recurrent Neural Networks , 1989, Neural Computation.

[15]  I Bretherton,et al.  Content and context in early lexical development , 1981, Journal of Child Language.

[16]  Shelley L. Velleman,et al.  Phonological Reorganization: A Case Study , 1989 .

[17]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[18]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[19]  J. Huttenlocher The origins of language comprehension. , 1974 .

[20]  D. Ingram Phonological Disability in Children , 1976 .

[21]  Michael I. Jordan Attractor dynamics and parallelism in a connectionist sequential machine , 1990 .

[22]  K. Markey The sensorimotor foundations of phonology: a computational model of early childhood articulatory and phonetic development , 1995 .

[23]  K R Wagner,et al.  How much do children say in a day? , 1985, Journal of Child Language.

[24]  B. de Boysson-Bardies,et al.  Adaptation to language: Evidence from babbling and first words in four languages , 1991 .

[25]  Julie C. Sedivy,et al.  Eye movements as a window into real-time spoken language comprehension in natural contexts , 1995, Journal of psycholinguistic research.

[26]  Brian Butterworth,et al.  Explanations for Language Universals , 1984 .

[27]  James L. McClelland,et al.  An interactive activation model of context effects in letter perception: I. An account of basic findings. , 1981 .

[28]  Geoffrey E. Hinton Learning and Applying Contextual Constraints in Sentence Comprehension , 1991 .

[29]  A. Marchal,et al.  Speech production and speech modelling , 1990 .

[30]  B. Atal,et al.  Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique. , 1978, The Journal of the Acoustical Society of America.

[31]  Michael I. Jordan,et al.  Sensorimotor adaptation in speech production. , 1998, Science.

[32]  Kenneth C. Hill,et al.  The genesis of language , 1982 .

[33]  David Caplan Speaking. From Intention to Articulation by Willem J. M. Levelt, MIT Press, 1989. £35.95 (xiv + 566 pages) ISBN 0 262 12137 9 , 1990, Trends in Neurosciences.

[34]  M. Vihman Variable paths to early word production , 1993 .

[35]  B. Dodd Children's Understanding of their Own Phonological Forms , 1975, The Quarterly journal of experimental psychology.

[36]  J. Mullennix,et al.  Some consequences of stimulus variability on speech processing by 2-month-old infants , 1992, Cognition.

[37]  P. Fikkert Acquisition of phonology , 1995 .

[38]  James L. McClelland,et al.  Sentence comprehension: A parallel distributed processing approach , 1989, Language and Cognitive Processes.

[39]  M. Garman,et al.  Language acquisition : studies in first language development , 1982 .

[40]  Michael I. Jordan,et al.  Goal-based speech motor control: A theoretical framework and some preliminary data , 1995 .

[41]  G. Stone,et al.  Word identification in reading and the promise of subsymbolic psycholinguistics. , 1990, Psychological review.

[42]  D. Oller THE EMERGENCE OF THE SOUNDS OF SPEECH IN INFANCY , 1980 .

[43]  J. Steven Reznick,et al.  Rapid change in lexical development in comprehension and production , 1992 .

[44]  Patrick Suppes,et al.  Approaches to Natural Language , 1973 .

[45]  James L. McClelland,et al.  Understanding normal and impaired word reading: computational principles in quasi-regular domains. , 1996, Psychological review.

[46]  Julia S. Falk,et al.  The emergent lexicon : the child's development of a linguistic vocabulary , 1990 .

[47]  P. Jusczyk The discovery of spoken language , 1997 .

[48]  Bruce F. Pennington,et al.  Word identification in reading and the promise of subsymbolic psycholinguistics. , 1990, Psychological review.

[49]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[50]  C. Best,et al.  Examination of perceptual reorganization for nonnative speech contrasts: Zulu click discrimination by English-speaking adults and infants. , 1988, Journal of experimental psychology. Human perception and performance.

[51]  R Treiman,et al.  Children's sensitivity to syllables, onsets, rimes, and phonemes. , 1996, Journal of experimental child psychology.

[52]  Marlys A. Macken,et al.  From Babbling to Speech: A Re-Assessment of the Continuity Issue , 1985 .

[53]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[54]  Geoffrey E. Hinton Connectionist Learning Procedures , 1989, Artif. Intell..

[55]  Jing Peng,et al.  An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.

[56]  M. Studdert-Kennedy,et al.  Self-organizing processes and the explanation of phonological universals , 1983 .

[57]  Michael Studdert-Kennedy,et al.  Discovering phonetic function , 1993 .

[58]  P. MacNeilage,et al.  The frame/content theory of evolution of speech production , 1998, Behavioral and Brain Sciences.

[59]  Philip Lieberman,et al.  Speech Physiology, Speech Perception, and Acoustic Phonetics , 1988 .

[60]  John J. L. Morton,et al.  Interaction of information in word recognition. , 1969 .

[61]  Harold R. Bauer,et al.  The ethologic model of phonetic development: I. Phonetic contrast estimators , 1988 .

[62]  James L. McClelland,et al.  An interactive activation model of context effects in letter perception: part 1.: an account of basic findings , 1988 .

[63]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[64]  Amanda C. Walley,et al.  The Role of Vocabulary Development in Children′s Spoken Word Recognition and Segmentation Ability , 1993 .

[65]  Marilyn M. Vihman,et al.  Phonological Development , 2014 .

[66]  Arlene I. Moskowitz,et al.  The Acquisition of Phonology and Syntax: A Preliminary Study , 1973 .

[67]  David C. Plaut,et al.  Semantic and Associative Priming in a Distributed Attractor Network , 1995 .

[68]  S Nittrouer,et al.  The emergence of mature gestural patterns is not uniform: evidence from an acoustic study. , 1993, Journal of speech and hearing research.

[69]  R. S. McGowan,et al.  The emergence of phonetic segments: evidence from the spectral structure of fricative-vowel syllables spoken by children and adults. , 1989, Journal of speech and hearing research.

[70]  Esther Thelen,et al.  Rhythmical behavior in infancy: An ethological perspective. , 1981 .

[71]  Michael I. Jordan,et al.  Forward dynamic models in human motor control: Psychophysical evidence , 1994, NIPS.

[72]  David C. Plaut,et al.  Structure and Function in the Lexical System: Insights from Distributed Models of Word Reading and Lexical Decision , 1997 .

[73]  W. Marslen-Wilson Functional parallelism in spoken word-recognition , 1987, Cognition.

[74]  Julie C. Sedivy,et al.  Subject Terms: Linguistics Language Eyes & eyesight Cognition & reasoning , 1995 .

[75]  G S Dell,et al.  A spreading-activation theory of retrieval in sentence production. , 1986, Psychological review.

[76]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[77]  Alvin M. Liberman,et al.  Speech: A Special Code , 1996 .

[78]  A. Meltzoff,et al.  Imitation of Facial and Manual Gestures by Human Neonates , 1977, Science.

[79]  Wolfgang Prinz,et al.  Language perception and production , 1987 .

[80]  John L. Locke Development of the Capacity for Spoken Language , 2019, The Handbook of Child Language.

[81]  J. Werker,et al.  Cross-language speech perception: Evidence for perceptual reorganization during the first year of life , 1984 .

[82]  G. C. Orden,et al.  Interdependence of form and function in cognitive systems explains perception of printed words. , 1994, Journal of experimental psychology. Human perception and performance.

[83]  W. Zonneveld Syllables and segments : Alan Bell and Joan B. Hooper (eds.), North-Holland Linguistic Series 40. Papers from the Symposium on Segment Organization and the Syllable, Boulder, Colorado, October 21-23, 1977. North_Holland Publ. Co., Amsterdam, 1978 , 1980 .

[84]  C. Stoel-Gammon,et al.  Patterns of early lexical and phonological development , 1984, Journal of Child Language.

[85]  P. Menyuk,et al.  Language acquisition: Early strategies for the perception and production of words and sounds , 1986 .

[86]  Heinz Werner,et al.  Symbol formation : an organismic-developmental approach to language and the expression of thought , 1964 .

[87]  W. Levelt Speaking: From Intention to Articulation , 1990 .