Zipfian frequency distributions facilitate word segmentation in context

Word frequencies in natural language follow a highly skewed Zipfian distribution, but the consequences of this distribution for language acquisition are only beginning to be understood. Typically, learning experiments that are meant to simulate language acquisition use uniform word frequency distributions. We examine the effects of Zipfian distributions using two artificial language paradigms-a standard forced-choice task and a new orthographic segmentation task in which participants click on the boundaries between words in contexts. Our data show that learners can identify word forms robustly across widely varying frequency distributions. In addition, although performance in recognizing individual words is predicted best by their frequency, a Zipfian distribution facilitates word segmentation in context: the presence of high-frequency words creates more chances for learners to apply their knowledge in processing new sentences. We find that computational models that implement "chunking" are more effective than "transition finding" models at reproducing this pattern of performance.

[1]  J. Morgan,et al.  Mommy and Me , 2005, Psychological science.

[2]  Erik D. Thiessen,et al.  When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. , 2003, Developmental psychology.

[3]  Casey Lew-Williams,et al.  Isolated words enhance statistical language learning in infancy. , 2011, Developmental science.

[4]  Morten H. Christiansen,et al.  Words in puddles of sound: modelling psycholinguistic effects in speech segmentation. , 2010, Journal of child language.

[5]  Virginia Valian,et al.  Anchor points in language learning: The role of marker frequency ☆ , 1988 .

[6]  Richard N Aslin,et al.  Prosody guides the rapid mapping of auditory word forms onto visual objects in 6-mo-old infants , 2011, Proceedings of the National Academy of Sciences.

[7]  M. Hauser,et al.  Segmentation of the speech stream in a non-human primate: statistical learning in cotton-top tamarins , 2001, Cognition.

[8]  J. Morgan,et al.  SIGNAL TO SYNTAX : Bootstrapping From Speech to Grammar in Early Acquisition , 2008 .

[9]  Michael C. Frank,et al.  PSYCHOLOGICAL SCIENCE Research Article Using Speakers ’ Referential Intentions to Model Early Cross-Situational Word Learning , 2022 .

[10]  Elizabeth K. Johnson,et al.  Word Segmentation by 8-Month-Olds: When Speech Cues Count More Than Statistics , 2001 .

[11]  T. Jaeger,et al.  Categorical Data Analysis: Away from ANOVAs (transformation or not) and towards Logit Mixed Models. , 2008, Journal of memory and language.

[12]  P. Jusczyk,et al.  Phonotactic cues for segmentation of fluent speech by infants , 2001, Cognition.

[13]  A. Vinter,et al.  PARSER: A Model for Word Segmentation , 1998 .

[14]  T. Griffiths,et al.  A Bayesian framework for word segmentation: Exploring the effects of context , 2009, Cognition.

[15]  Denis Mareschal,et al.  TRACX: a recognition-based connectionist framework for sequence segmentation and chunk extraction. , 2011, Psychological review.

[16]  József Fiser,et al.  The other kind of perceptual learning , 2009 .

[17]  M. Brent,et al.  How Infants Use the Words They Know to Learn New Words , 2002 .

[18]  Jeffrey Heinz,et al.  Modeling the contribution of phonotactic cues to the problem of word segmentation. , 2010, Journal of child language.

[19]  T. A. Cartwright,et al.  Distributional regularity and phonotactic constraints are useful for segmentation , 1996, Cognition.

[20]  Mark Johnson,et al.  A Particle Filter algorithm for Bayesian Wordsegmentation , 2011, ALTA.

[21]  Barbara Tillmann,et al.  Exploiting Multiple Sources of Information in Learning an Artificial Language: Human Data and Modeling , 2009 .

[22]  Matthew Brook O'Donnell,et al.  Robust Language Acquisition - an Emergent Consequence of Language as a Complex Adaptive System , 2011, CogSci.

[23]  P. Jusczyk,et al.  Infants’ sensitivity to allophonic cues for word segmentation , 1999, Perception & psychophysics.

[24]  Michael R. Brent,et al.  An Efficient, Probabilistically Sound Algorithm for Segmentation and Word Discovery , 1999, Machine Learning.

[25]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[26]  Michael C. Frank,et al.  Modeling human performance in statistical word segmentation , 2010, Cognition.

[27]  Elizabeth K. Johnson,et al.  Testing the limits of statistical learning for word segmentation. , 2010, Developmental science.

[28]  G. Zipf,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology. , 1949 .

[29]  Jessica F. Hay,et al.  Statistical learning in a natural language by 8-month-old infants. , 2009, Child development.

[30]  Jacques Mehler,et al.  Word frequency as a cue for identifying function words in infancy , 2010, Cognition.

[31]  Julia L. Evans,et al.  Can Infants Map Meaning to Newly Segmented Words? , 2007, Psychological science.

[32]  Joost van de Weijer,et al.  The importance of single-word utterances for early word recognition. , 2001 .

[33]  Charles D. Yang Universal Grammar, statistics or both? , 2004, Trends in Cognitive Sciences.

[34]  Michael C. Frank,et al.  Zipfian word frequencies support statistical word segmentation , 2011, CogSci.

[35]  A. Rodríguez-Fornells,et al.  Words as anchors: known words facilitate statistical learning. , 2010, Experimental psychology.

[36]  Scott P. Johnson,et al.  Visual statistical learning in infancy: evidence for a domain general learning mechanism , 2002, Cognition.

[37]  Christopher M. Conway,et al.  Implicit statistical learning in language processing: Word predictability is the key , 2010, Cognition.

[38]  Thierry Dutoit,et al.  The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[39]  Richard N Aslin,et al.  Statistical learning of new visual feature combinations by infants , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Richard N Aslin,et al.  Bayesian learning of visual chunks by human observers , 2008, Proceedings of the National Academy of Sciences.

[41]  M. Brent,et al.  The role of exposure to isolated words in early vocabulary development , 2001, Cognition.

[42]  M. Brent,et al.  On the discovery of novel wordlike units from utterances: an artificial-language study with implications for native-language acquisition. , 1999, Journal of experimental psychology. General.

[43]  Michael C. Frank,et al.  Learning and Long-Term Retention of Large-Scale Artificial Languages , 2013, PloS one.

[44]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[45]  Bob McMurray,et al.  On Leveraged Learning in Lexical Acquisition and Its Relationship to Acceleration , 2009, Cogn. Sci..

[46]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[47]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[48]  Richard N. Aslin,et al.  Models of Word Segmentation in Fluent Maternal Speech to Infants , 2014 .

[49]  Michael C. Frank,et al.  Beyond Transitional Probabilities: Human Learners Impose a Parsimony Bias in Statistical Word Segmentation , 2010 .

[50]  B. Schölkopf,et al.  Edinburgh Research Explorer Interpolating between types and tokens by estimating power-law generators , 2006 .

[51]  Michael C. Frank,et al.  Overcoming Memory Limitations in Rule Learning , 2011 .

[52]  Wentian Li,et al.  Random texts exhibit Zipf's-law-like word frequency distribution , 1992, IEEE Trans. Inf. Theory.

[53]  E. Newport,et al.  WORD SEGMENTATION : THE ROLE OF DISTRIBUTIONAL CUES , 1996 .

[54]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[55]  T. Mexia,et al.  Author ' s personal copy , 2009 .

[56]  Daniel Swingley,et al.  Statistical clustering and the contents of the infant vocabulary , 2005, Cognitive Psychology.