Mark My Words: High Frequency Marker Words Impact Early Stages of Language Learning

High frequency words have been suggested to benefit both speech segmentation and grammatical categorization of the words around them. Despite utilizing similar information, these tasks are usually investigated separately in studies examining learning. We determined whether including high frequency words in continuous speech could support categorization when words are being segmented for the first time. We familiarized learners with continuous artificial speech comprising repetitions of target words, which were preceded by high-frequency marker words. Crucially, marker words distinguished targets into 2 distributionally defined categories. We measured learning with segmentation and categorization tests and compared performance against a control group that heard the artificial speech without these marker words (i.e., just the targets, with no cues for categorization). Participants segmented the target words from speech in both conditions, but critically when the marker words were present, they influenced acquisition of word-referent mappings in a subsequent transfer task, with participants demonstrating better early learning for mappings that were consistent (rather than inconsistent) with the distributional categories. We propose that high-frequency words may assist early grammatical categorization, while speech segmentation is still being learned.

[1]  Jill Lany,et al.  The Role of Prior Experience in Language Acquisition , 2007, Cogn. Sci..

[2]  Padraic Monaghan,et al.  Domain-General Mechanisms for Speech Segmentation: The Role of Duration Information in Language Learning , 2016, Journal of experimental psychology. Human perception and performance.

[3]  P. Monaghan,et al.  Integrating constraints for learning word–referent mappings , 2012, Cognition.

[4]  George R. Kiss,et al.  Grammatical Word Classes: A Learning Process and its Simulation , 1973 .

[5]  Nick Chater,et al.  Phonology impacts segmentation in online speech processing , 2005 .

[6]  A. Rodríguez-Fornells,et al.  Headstart for speech segmentation: a neural signature for the anchor word effect , 2016, Neuropsychologia.

[7]  Morten H. Christiansen,et al.  Words in puddles of sound: modelling psycholinguistic effects in speech segmentation. , 2010, Journal of child language.

[8]  Morten H. Christiansen,et al.  Learning grammatical categories from distributional cues: Flexible frames for language acquisition , 2010, Cognition.

[9]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[10]  Stefanie Shattuck-Hufnagel,et al.  Word-boundary-related duration patterns in English , 2000, J. Phonetics.

[11]  Marina Nespor,et al.  Signal-Driven Computations in Speech Processing , 2002, Science.

[12]  R. Baayen,et al.  Mixed-effects modeling with crossed random effects for subjects and items , 2008 .

[13]  Laurence White,et al.  Integration of multiple speech segmentation cues: a hierarchical framework. , 2005, Journal of experimental psychology. General.

[14]  G. Zipf,et al.  The Psycho-Biology of Language , 1936 .

[15]  Janet L. McDonald,et al.  Properties of Phonological Markers That Affect the Acquisition of Gender-Like Subclasses☆☆☆★ , 1998 .

[16]  Richard N. Aslin,et al.  Models of Word Segmentation in Fluent Maternal Speech to Infants , 2014 .

[17]  Morten H. Christiansen,et al.  Developmental Changes in Cross-Situational Word Learning: The Inverse Effect of Initial Accuracy. , 2017, Cognitive science.

[18]  Nick Chater,et al.  The differential contribution of phonological and distributional cues in grammatical categorisation. , 2005 .

[19]  Richard N. Aslin,et al.  From shared contexts to syntactic categories: The role of distributional information in learning linguistic form-classes , 2013, Cognitive Psychology.

[20]  W. Quine,et al.  Word and object: An inquiry into the linguistic mechanisms of objective reference. , 1960 .

[21]  Elizabeth K. Johnson,et al.  Testing the limits of statistical learning for word segmentation. , 2010, Developmental science.

[22]  Virginia Valian,et al.  Anchor points in language learning: The role of marker frequency ☆ , 1988 .

[23]  Padraic Monaghan,et al.  Gavagai Is as Gavagai Does: Learning Nouns and Verbs From Cross-Situational Statistics , 2015, Cogn. Sci..

[24]  S. Pinker Learnability and Cognition: The Acquisition of Argument Structure , 1989 .

[25]  Peter M. Vishton,et al.  Rule learning by seven-month-old infants. , 1999, Science.

[26]  LouAnn Gerken,et al.  Infants use rational decision criteria for choosing among models of their input , 2010, Cognition.

[27]  Morten H. Christiansen,et al.  Stress changes the representational landscape: evidence from word segmentation , 2005, Cognition.

[28]  Padraic Monaghan,et al.  Disambiguating durational cues for speech segmentation. , 2013, The Journal of the Acoustical Society of America.

[29]  Christopher M. Conway,et al.  Implicit statistical learning in language processing: Word predictability is the key , 2010, Cognition.

[30]  Chih-Yi Wu,et al.  Tracking Multiple Statistics: Simultaneous Learning of Object Names and Categories in English and Mandarin Speakers , 2017, Cogn. Sci..

[31]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[32]  Morten H. Christiansen,et al.  Looking in the Wrong Direction Correlates With More Accurate Word Learning , 2011, Cogn. Sci..

[33]  Marc Brysbaert,et al.  Power Analysis and Effect Size in Mixed Effects Models: A Tutorial , 2018, Journal of cognition.

[34]  E. Newport,et al.  PSYCHOLOGICAL SCIENCE Research Article INCIDENTAL LANGUAGE LEARNING: Ustening (and Learning) out of the Comer of Your Ear , 2022 .

[35]  Morten H. Christiansen,et al.  Implicit Statistical Learning: A Tale of Two Literatures , 2019, Top. Cogn. Sci..

[36]  R. Gómez,et al.  Twelve-Month-Old Infants Benefit From Prior Experience in Statistical Learning , 2008, Psychological science.

[37]  Erik D. Thiessen,et al.  When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. , 2003, Developmental psychology.

[38]  Antoni Rodríguez-Fornells,et al.  Bridging the gap between speech segmentation and word-to-world mappings: Evidence from an audiovisual statistical learning task , 2010 .

[39]  J. Morgan,et al.  Mommy and Me , 2005, Psychological science.

[40]  Morten H. Christiansen,et al.  The phonological-distributional coherence hypothesis: Cross-linguistic evidence in language acquisition , 2007, Cognitive Psychology.

[41]  Padraic Monaghan,et al.  Relationships Between Language Structure and Language Learning: The Suffixing Preference and Grammatical Categorization , 2009, Cogn. Sci..

[42]  Patricia J. Brooks,et al.  Exploring language acquisition in children with a miniature artificial language: Effects of item and pattern frequency, arbitrary subclasses, and correction☆ , 1990 .

[43]  Linda B. Smith,et al.  Rapid Word Learning Under Uncertainty via Cross-Situational Statistics , 2007, Psychological science.

[44]  R. Gómez Variability and Detection of Invariant Structure , 2002, Psychological science.

[45]  A. Rodríguez-Fornells,et al.  Words as anchors: known words facilitate statistical learning. , 2010, Experimental psychology.

[46]  Paavo Alku,et al.  Statistical language learning in neonates revealed by event-related brain potentials , 2009, BMC Neuroscience.

[47]  Jacques Mehler,et al.  Primitive computations in speech processing , 2009, Quarterly journal of experimental psychology.

[48]  Nick Chater,et al.  Distributional Information: A Powerful Cue for Acquiring Syntactic Categories , 1998, Cogn. Sci..

[49]  Julia L. Evans,et al.  Can Infants Map Meaning to Newly Segmented Words? , 2007, Psychological science.

[50]  A. Vinter,et al.  PARSER: A Model for Word Segmentation , 1998 .

[51]  Jutta L. Mueller,et al.  Learnability of Embedded Syntactic Structures Depends on Prosodic Cues , 2010, Cogn. Sci..

[52]  Morten H. Christiansen,et al.  Using Statistics to Learn Words and Grammatical Categories: How High Frequency Words Assist Language Acquisition , 2016, CogSci.

[53]  Padraic Monaghan,et al.  Simultaneous segmentation and generalisation of non-adjacent dependencies from continuous speech , 2016, Cognition.

[54]  Peter Green,et al.  SIMR: an R package for power analysis of generalized linear mixed models by simulation , 2016 .

[55]  Nivedita Mani,et al.  Word-form familiarity bootstraps infant speech segmentation. , 2013, Developmental science.

[56]  Kenny Smith,et al.  Cross-Situational Learning: An Experimental Study of Word-Learning Mechanisms , 2011, Cogn. Sci..

[57]  Michael C. Frank,et al.  Zipfian frequency distributions facilitate word segmentation in context , 2013, Cognition.

[58]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[59]  L. Gerken,et al.  Infants can use distributional cues to form syntactic categories , 2005, Journal of Child Language.

[60]  J. Morgan,et al.  SIGNAL TO SYNTAX : Bootstrapping From Speech to Grammar in Early Acquisition , 2008 .

[61]  Jill Lany,et al.  Judging words by their covers and the company they keep: probabilistic cues support word learning. , 2014, Child development.

[62]  Toben H. Mintz Frequent frames as a cue for grammatical categories in child directed speech , 2003, Cognition.

[63]  Jutta L. Mueller,et al.  The Role of Pause Cues in Language Learning: The Emergence of Event-related Potentials Related to Sequence Processing , 2008, Journal of Cognitive Neuroscience.

[64]  Morten H. Christiansen,et al.  The differential role of phonological and distributional cues in grammatical categorisation , 2005, Cognition.

[65]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[66]  Jenny R. Saffran,et al.  Linking sounds to meanings: Infant statistical learning in a natural language , 2011, Cognitive Psychology.

[67]  Jessica S. Horst,et al.  The role of competition in word learning via referent selection. , 2010, Developmental science.

[68]  Linda B. Smith,et al.  Infants rapidly learn word-referent mappings via cross-situational statistics , 2008, Cognition.

[69]  Thierry Nazzi,et al.  When Mommy Comes to the Rescue of Statistics: Infants Combine Top-Down and Bottom-Up Cues to Segment Speech , 2012 .

[70]  L. Gleitman The Structural Sources of Verb Meanings , 2020, Sentence First, Arguments Afterward.

[71]  Jessica F. Hay,et al.  Statistical learning in a natural language by 8-month-old infants. , 2009, Child development.

[72]  Toben H. Mintz Category induction from distributional cues in an artificial language , 2002, Memory & cognition.

[73]  A. Endress,et al.  Rapid learning of syllable classes from a perceptually continuous speech stream , 2007, Cognition.

[74]  Michael Ramscar,et al.  Suffixing, prefixing, and the functional order of regularities in meaningful strings , 2013 .

[75]  R. Aslin,et al.  Statistical learning of higher-order temporal structure from visual shape sequences. , 2002, Journal of experimental psychology. Learning, memory, and cognition.

[76]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[77]  D. Barr,et al.  Random effects structure for confirmatory hypothesis testing: Keep it maximal. , 2013, Journal of memory and language.

[78]  Paul Taylor,et al.  Festival Speech Synthesis System , 1998 .