Probabilistic language models in cognitive neuroscience: promises and pitfalls

Cognitive neuroscientists of language comprehension study how neural computations relate to cognitive computations during comprehension. On the cognitive side of this equation, it is important that the computations and their processing complexity are explicitly defined. Probabilistic language models can provide a computationally explicit account of language complexity during comprehension. Such models have so far been evaluated predominantly against behavioral data; only recently have they been used to explain neurobiological signals. Measures derived from these models emphasize the probabilistic, information-processing view of language understanding and provide a set of tools for testing neural hypotheses about language comprehension. Here, we provide a cursory review of the theoretical foundations and of example neuroimaging studies that employ probabilistic language models. We highlight the advantages and potential pitfalls of this approach and indicate avenues for future research.
