Statistically Induced Chunking Recall: A Memory-Based Approach to Statistical Learning

The computations involved in statistical learning have long been debated. Here, we build on work suggesting that a basic memory process, chunking, may account for the processing of statistical regularities into larger units. Drawing on methods from the memory literature, we developed a novel paradigm to test statistical learning by leveraging a robust phenomenon observed in serial recall tasks: that short-term memory is fundamentally shaped by long-term distributional learning. In the statistically induced chunking recall (SICR) task, participants are exposed to an artificial language, using a standard statistical learning exposure phase. Afterward, they recall strings of syllables that either follow the statistics of the artificial language or comprise the same syllables presented in a random order. We hypothesized that if individuals had chunked the artificial language into word-like units, then the statistically structured items would be more accurately recalled relative to the random controls. Our results demonstrate that SICR effectively captures learning in both the auditory and visual modalities, with participants displaying significantly improved recall of the statistically structured items, and even recall specific trigram chunks from the input. SICR also exhibits greater test-retest reliability in the auditory modality and sensitivity to individual differences in both modalities than the standard two-alternative forced-choice task. These results thereby provide key empirical support to the chunking account of statistical learning and contribute a valuable new tool to the literature.

[1]  B. Scholl,et al.  The Automaticity of Visual Statistical Learning Statistical Learning , 2005 .

[2]  M. Tomasello First Steps toward a Usage-Based Theory of Language Acquisition , 2001 .

[3]  Scott Sinnett,et al.  Speech segmentation by statistical learning depends on attention , 2005, Cognition.

[4]  Ralph Grishman,et al.  The American National Corpus: A Standardized Resource for American English , 2000, LREC.

[5]  R. Aslin,et al.  PSYCHOLOGICAL SCIENCE Research Article UNSUPERVISED STATISTICAL LEARNING OF HIGHER-ORDER SPATIAL STRUCTURES FROM VISUAL SCENES , 2022 .

[6]  Marina Nespor,et al.  Signal-Driven Computations in Speech Processing , 2002, Science.

[7]  Morten H. Christiansen,et al.  Chunking Ability Shapes Sentence Processing at Multiple Levels of Abstraction , 2017, CogSci.

[8]  Morten H. Christiansen,et al.  Acquiring formulaic language , 2014, The Mental Lexicon.

[9]  Bill Macken,et al.  Long-term associative learning predicts verbal short-term memory performance , 2018, Memory & cognition.

[10]  Morten H. Christiansen,et al.  Statistical learning research: A critical review and possible new directions. , 2019, Psychological bulletin.

[11]  Noam Siegelman,et al.  Redefining "Learning" in Statistical Learning: What Does an Online Measure Reveal About the Assimilation of Visual Regularities? , 2018, Cognitive science.

[12]  John C. Caruso,et al.  A Comparison of the Reliabilities of Four Types of Difference Scores for Five Cognitive Assessment Batteries , 2004 .

[13]  J. Pine,et al.  Chunking mechanisms in human learning , 2001, Trends in Cognitive Sciences.

[14]  S. Gathercole Is nonword repetition a test of phonological memory or long-term knowledge? It all depends on the nonwords , 1995, Memory & cognition.

[15]  Joanne Arciuli,et al.  Statistical Learning Is Related to Reading Ability in Children and Adults , 2012, Cogn. Sci..

[16]  Alan D. Baddeley,et al.  The children's test of non-word repetition , 1996 .

[17]  Fernand Gobet,et al.  Modeling the Development of Children's Use of Optional Infinitives in Dutch and English Using MOSAIC , 2006, Cogn. Sci..

[18]  J. Willett Chapter 9: Questions and Answers in the Measurement of Change , 1988 .

[19]  L R Squire,et al.  The information acquired during artificial grammar learning. , 1994, Journal of experimental psychology. Learning, memory, and cognition.

[20]  S. Gathercole,et al.  Nonword repetition: a comparison of tests. , 2006, Journal of speech, language, and hearing research : JSLHR.

[21]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[22]  David Miller,et al.  The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text , 2004, LREC.

[23]  P. Perruchet,et al.  Implicit learning and statistical learning: one phenomenon, two approaches , 2006, Trends in Cognitive Sciences.

[24]  Viljo Kohonen,et al.  Is the relation between phonological memory and foreign language learning accounted for by vocabulary acquisition? , 1995, Applied Psycholinguistics.

[25]  Fernand Gobet,et al.  Why computational models are better than verbal theories: the case of nonword repetition. , 2014, Developmental science.

[26]  E. Service Phonology, Working Memory, and Foreign-language Learning , 1992, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[27]  J. Pine,et al.  Lexically-based learning and early grammatical development , 1997, Journal of Child Language.

[28]  Morten H. Christiansen,et al.  Learning Simple Statistics for Language Comprehension and Production: The CAPPUCCINO Model , 2011, CogSci.

[29]  S. Levinson Turn-taking in Human Communication – Origins and Implications for Language Processing , 2016, Trends in Cognitive Sciences.

[30]  Pierre Perruchet,et al.  What Mechanisms Underlie Implicit Statistical Learning? Transitional Probabilities Versus Chunks in Language Learning , 2018, Top. Cogn. Sci..

[31]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[32]  David H. Uttal,et al.  Statistical Language Learning: Mechanisms and Constraints Learning the Sounds of Words Statistical Learning and Syntax , 2022 .

[33]  Denis Mareschal,et al.  TRACX: a recognition-based connectionist framework for sequence segmentation and chunk extraction. , 2011, Psychological review.

[34]  G. Dell,et al.  Speech errors, phonotactic constraints, and implicit learning: a study of the role of experience in language production. , 2000, Journal of experimental psychology. Learning, memory, and cognition.

[35]  N. Cowan The magical number 4 in short-term memory: A reconsideration of mental storage capacity , 2001, Behavioral and Brain Sciences.

[36]  Bill Macken,et al.  Questioning short-term memory and its measurement: Why digit span measures long-term associative learning , 2015, Cognition.

[37]  Christopher M. Conway,et al.  Implicit statistical learning in language processing: Word predictability is the key , 2010, Cognition.

[38]  Morten H. Christiansen,et al.  Domain generality versus modality specificity: the paradox of statistical learning , 2015, Trends in Cognitive Sciences.

[39]  Gary S Dell,et al.  The role of consolidation in learning context-dependent phonotactic patterns in speech and digital sequence production , 2018, Proceedings of the National Academy of Sciences.

[40]  S. Gathercole Nonword repetition and word learning: The nature of the relationship , 2006, Applied Psycholinguistics.

[41]  Morten H. Christiansen,et al.  Statistical Learning and Language: An Individual Differences Study , 2012 .

[42]  Annchen R. Knodt,et al.  The reliability paradox: Why robust cognitive tasks do not produce reliable individual differences , 2017, Behavior Research Methods.

[43]  T. Campbell,et al.  Nonword repetition and child language impairment. , 1998, Journal of speech, language, and hearing research : JSLHR.

[44]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[45]  Linda B. Smith,et al.  Rapid Word Learning Under Uncertainty via Cross-Situational Statistics , 2007, Psychological science.

[46]  H. Simon,et al.  Perception in chess , 1973 .

[47]  Jacques Mehler,et al.  The surprising power of statistical learning: When fragment knowledge leads to false memories of unheard words , 2009 .

[48]  A. Goldberg Constructions at Work: The Nature of Generalization in Language , 2006 .

[49]  Morten H. Christiansen,et al.  Testing Statistical Learning Implicitly: A Novel Chunk-based Measure of Statistical Learning , 2017, CogSci.

[50]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[51]  M. Tomasello,et al.  Modeling children's early grammatical knowledge , 2009, Proceedings of the National Academy of Sciences.

[52]  A. Baddeley Immediate Memory and the “Perception” of Letter Sequences , 1964 .

[53]  Philip I. Pavlik,et al.  iMinerva: A Mathematical Model of Distributional Statistical Learning , 2013, Cogn. Sci..

[54]  Gary Jones,et al.  The influence of children’s exposure to language from two to six years: The case of nonword repetition , 2016, Cognition.

[55]  M. Botvinick Effects of domain-specific knowledge on memory for serial order , 2005, Cognition.

[56]  R. Frost,et al.  Statistical learning as an individual ability: Theoretical perspectives and empirical evidence. , 2015, Journal of memory and language.

[57]  Michael P. Kaschak,et al.  Individual Differences in Statistical Learning: Conceptual and Measurement Issues , 2016 .

[58]  Gerald Tehan,et al.  Phonological Similarity and Trace Degradation in the Serial Recall Task: When CAT helps RAT, but not MAN , 1999 .

[59]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[60]  Noam Siegelman,et al.  Measuring individual differences in statistical learning: Current pitfalls and possible solutions , 2017, Behavior research methods.

[61]  Janne von Koss Torkildsen,et al.  Individual differences in statistical learning predict children's reading ability in a semi-transparent orthography , 2019, Learning and Individual Differences.

[62]  Alexa R. Romberg,et al.  Statistical learning and language acquisition. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[63]  Morten H. Christiansen,et al.  The long road of statistical learning research: past, present and future , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[64]  Barbara Tillmann,et al.  New evidence for chunk-based models in word segmentation. , 2014, Acta psychologica.

[65]  Thierry Dutoit,et al.  The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[66]  Joanne Arciuli,et al.  The multi-component nature of statistical learning , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[67]  Morten H. Christiansen,et al.  Implicit Statistical Learning: A Tale of Two Literatures , 2019, Top. Cogn. Sci..

[68]  Gary S Dell,et al.  Limits on learning phonotactic constraints from recent production experience. , 2008, Journal of experimental psychology. Learning, memory, and cognition.

[69]  Roger Ratcliff,et al.  A Theory of Memory Retrieval. , 1978 .

[70]  Jacques Mehler,et al.  The word segmentation process as revealed by click detection , 2011 .

[71]  H A Simon,et al.  How Big Is a Chunk? , 1974, Science.

[72]  Lauren M. Bylsma,et al.  Regularization in short-term memory for serial order. , 2005, Journal of experimental psychology. Learning, memory, and cognition.

[73]  Nelson Cowan,et al.  Constant Capacity in an Immediate Serial-Recall Task , 2004, Psychological science.

[74]  Nick Chater,et al.  The Now-or-Never bottleneck: A fundamental constraint on language , 2015, Behavioral and Brain Sciences.

[75]  Morten H. Christiansen,et al.  Individual Differences in Language Acquisition and Processing , 2018, Trends in Cognitive Sciences.

[76]  D. W. Zimmerman,et al.  Reliability of gain scores under realistic assumptions about properties of pre-test and post-test scores , 1998 .

[77]  Prahlad Gupta,et al.  Examining the Relationship between word Learning, Nonword Repetition, and Immediate Serial Recall in Adults , 2003, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[78]  A D Baddeley,et al.  The Children's Test of Nonword Repetition: a test of phonological working memory. , 1994, Memory.

[79]  Vinciane Gaillard,et al.  Visual statistical learning in children and young adults: how implicit? , 2015, Front. Psychol..

[80]  A. Baddeley,et al.  The phonological loop as a language learning device. , 1998, Psychological review.

[81]  Fernand Gobet,et al.  Linking working memory and long-term memory: a computational model of the learning of new words. , 2007, Developmental science.

[82]  Michael Tomasello Introduction: Some surprises for Psychologists , 2003 .

[83]  Elena Lieven,et al.  Usage-based approaches to language development: Where do we go from here? , 2016, Language and Cognition.

[84]  Steve Majerus,et al.  Verbal short-term memory reflects the sublexical organization of the phonological language network: Evidence from an incidental phonotactic learning paradigm , 2004 .

[85]  John B. Willett,et al.  Questions and Answers in the Measurement of Change , 1988 .

[86]  Scott P. Johnson,et al.  Statistical and Chunking Processes in Adults' Visual Sequence Learning , 2015, CogSci.

[87]  K. Paller,et al.  Implicit and explicit contributions to statistical learning. , 2015, Journal of memory and language.

[88]  Axel Cleeremans,et al.  Rapid Serial Auditory Presentation: A New Measure of Statistical Learning in Speech Segmentation. , 2015, Experimental psychology.

[89]  Julia L. Evans,et al.  Statistical learning in children with specific language impairment. , 2009, Journal of speech, language, and hearing research : JSLHR.

[90]  Morten H. Christiansen,et al.  Language Learning as Language Use: A Cross-Linguistic Model of Child Language Development , 2019, Psychological review.

[91]  Aaron R. Seitz,et al.  Testing assumptions of statistical learning: Is it long-term and implicit? , 2009, Neuroscience Letters.

[92]  Gary S Dell,et al.  Speech errors reflect newly learned phonotactic constraints. , 2006, Journal of experimental psychology. Learning, memory, and cognition.

[93]  J B Tomblin,et al.  Nonword repetition performance in school-age children with and without language impairment. , 2000, Journal of speech, language, and hearing research : JSLHR.

[94]  Morten H. Christiansen,et al.  On-Line Individual Differences in Statistical Learning Predict Language Processing , 2010, Front. Psychology.

[95]  Morten H. Christiansen,et al.  Towards a theory of individual differences in statistical learning , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[96]  Morten H. Christiansen,et al.  Language acquisition as skill learning , 2018, Current Opinion in Behavioral Sciences.

[97]  B. MacWhinney Models of the emergence of language. , 1998, Annual review of psychology.

[98]  Inbal Arnon,et al.  Do current statistical learning tasks capture stable individual differences in children? An investigation of task reliability across modality , 2019, Behavior Research Methods.

[99]  Morten H. Christiansen,et al.  Bridging artificial and natural language learning: Comparing processing- and reflection-based measures of learning , 2018, CogSci.

[100]  M. Tomasello First Verbs: A Case Study of Early Grammatical Development , 1994 .

[101]  Gary Jones,et al.  Why Chunking Should be Considered as an Explanation for Developmental Change before Short-Term Memory Capacity and Processing Speed , 2012, Front. Psychology.

[102]  Noam Siegelman,et al.  What Predicts Successful Literacy Acquisition in a Second Language? , 2013, Psychological science.

[103]  A. Baddeley,et al.  Evaluation of the role of phonological STM in the development of vocabulary in children: A longitudinal study , 1989 .

[104]  Elisabeth A. Karuza,et al.  On-line Measures of Prediction in a Self-Paced Statistical Learning Task , 2014, CogSci.

[105]  Scott P. Johnson,et al.  When learning goes beyond statistics: Infants represent visual sequences in terms of chunks , 2018, Cognition.

[106]  E. Newport,et al.  WORD SEGMENTATION : THE ROLE OF DISTRIBUTIONAL CUES , 1996 .

[107]  Shula Chiat,et al.  Nonword repetition depends on the frequency of sublexical representations at different grain sizes: Evidence from a multi-factorial analysis , 2018, Cognition.

[108]  Padraic Monaghan,et al.  Simultaneous segmentation and generalisation of non-adjacent dependencies from continuous speech , 2016, Cognition.

[109]  A. Vinter,et al.  PARSER: A Model for Word Segmentation , 1998 .

[110]  John D. E. Gabrieli,et al.  Hearing Matters More Than Seeing: A Cross-Modality Study of Statistical Learning and Reading Ability , 2018, Scientific Studies of Reading.

[111]  Noam Siegelman,et al.  What exactly is learned in visual statistical learning? Insights from Bayesian modeling , 2019, Cognition.