A role for the developing lexicon in phonetic category acquisition.

Infants segment words from fluent speech during the same period when they are learning phonetic categories, yet accounts of phonetic category acquisition typically ignore information about the words in which sounds appear. We use a Bayesian model to illustrate how feedback from segmented words might constrain phonetic category learning by providing information about which sounds occur together in words. Simulations demonstrate that word-level information can successfully disambiguate overlapping English vowel categories. Learning patterns in the model are shown to parallel human behavior from artificial language learning tasks. These findings point to a central role for the developing lexicon in phonetic category acquisition and provide a framework for incorporating top-down constraints into models of category learning.
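
The top-down idea in the abstract can be illustrated with a toy calculation. The sketch below is not the model used in the simulations; it only shows, via Bayes' rule, how knowing which word a sound occurred in could shift the category assignment of an acoustically ambiguous token. The two categories, their means and standard deviations, the token value, and the word-based prior are all hypothetical numbers chosen for illustration.

```python
import math


def gaussian_pdf(x, mean, sd):
    """Density of a 1-D Gaussian with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


# Hypothetical overlapping vowel categories along one acoustic dimension
# (e.g. a formant value in Hz): name -> (mean, standard deviation).
CATEGORIES = {"A": (500.0, 60.0), "B": (580.0, 60.0)}


def posterior(x, prior):
    """Posterior over categories for token x under the given prior."""
    joint = {c: prior[c] * gaussian_pdf(x, *CATEGORIES[c]) for c in CATEGORIES}
    total = sum(joint.values())
    return {c: v / total for c, v in joint.items()}


# Without word-level information the learner has no reason to prefer
# either category before hearing the token.
flat_prior = {"A": 0.5, "B": 0.5}

# Assumed for illustration: the token was heard in a word whose earlier
# tokens were mostly assigned to category A.
word_prior = {"A": 0.9, "B": 0.1}

token = 540.0  # acoustically ambiguous: midway between the two means

acoustic_only = posterior(token, flat_prior)
with_word = posterior(token, word_prior)

print("acoustics only:", acoustic_only)
print("with word context:", with_word)
```

Because the token lies exactly between the two category means and the variances are equal, the acoustic likelihoods cancel. The posterior is therefore uninformative under the flat prior and is driven entirely by the word-based prior once word context is available.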
