Using surprisal and fMRI to map the neural bases of broad and local contextual prediction during natural language comprehension

Context guides comprehenders’ expectations during language processing, and informationtheoretic surprisal is commonly used as an index of cognitive processing effort. However, prior work using surprisal has considered only within-sentence context, using n-grams, neural language models, or syntactic structure as conditioning context. In this paper, we extend the surprisal approach to use broader topical context, investigating the influence of local and topical context on processing via an analysis of fMRI time courses collected during naturalistic listening. Lexical surprisal calculated from ngram and LSTM language models is used to capture effects of local context; to capture the effects of broader context a new metric based on topic models, topical surprisal, is introduced. We identify distinct patterns of neural activation for lexical surprisal and topical surprisal. These differing neuro-anatomical correlates suggest that local and broad contextual cues during sentence processing recruit different brain regions and that those regions of the language network functionally contribute to processing different dimensions of contextual information during comprehension. More generally, our approach adds to a growing literature using methods from computational linguistics to operationalize and test hypotheses about neuro-cognitive mechanisms in sentence processing.

[1]  Petr Sojka,et al.  Gensim -- Statistical Semantics in Python , 2011 .

[2]  Edouard Grave,et al.  Colorless Green Recurrent Networks Dream Hierarchically , 2018, NAACL.

[3]  Benjamin K. Bergen,et al.  How well does surprisal explain N400 amplitude under different experimental conditions? , 2020, CONLL.

[4]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[5]  John T Hale,et al.  Hierarchical structure guides rapid linguistic predictions during naturalistic listening , 2019, PloS one.

[6]  Antal van den Bosch,et al.  Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain , 2017, PloS one.

[7]  R. C. Oldfield The assessment and analysis of handedness: the Edinburgh inventory. , 1971, Neuropsychologia.

[8]  C. Honey,et al.  Topographic Mapping of a Hierarchy of Temporal Receptive Windows Using a Narrated Story , 2011, The Journal of Neuroscience.

[9]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[10]  Andrea Bergmann,et al.  Statistical Parametric Mapping The Analysis Of Functional Brain Images , 2016 .

[11]  Liberty S. Hamilton,et al.  The revolution will not be controlled: natural stimuli in speech neuroscience , 2018, Language, cognition and neuroscience.

[12]  Jonathan W. Peirce,et al.  PsychoPy—Psychophysics software in Python , 2007, Journal of Neuroscience Methods.

[13]  E. Bullmore,et al.  Methods for diagnosis and treatment of stimulus‐correlated motion in generic brain activation studies using fMRI , 1999, Human brain mapping.

[14]  Jeffrey R. Binder,et al.  Neural correlates of implicit and explicit combinatorial semantic processing , 2010, NeuroImage.

[15]  Thomas E. Nichols,et al.  Non-white noise in fMRI: Does modelling have an impact? , 2006, NeuroImage.

[16]  Philipp Koehn,et al.  Scalable Modified Kneser-Ney Language Model Estimation , 2013, ACL.

[17]  Mark Steedman,et al.  Modeling Incremental Language Comprehension in the Brain with Combinatory Categorial Grammar , 2021, CMCL.

[18]  Antal van den Bosch,et al.  Prediction During Natural Language Comprehension. , 2016, Cerebral cortex.

[19]  E. Maguire,et al.  The functional neuroanatomy of comprehension and memory: the importance of prior knowledge. , 1999, Brain : a journal of neurology.

[20]  R. Levy Expectation-based syntactic comprehension , 2008, Cognition.

[21]  John Hale,et al.  Localising memory retrieval and syntactic composition: an fMRI study of naturalistic language comprehension , 2018, Language, Cognition and Neuroscience.

[22]  John M. Henderson,et al.  Language structure in the brain: A fixation-related fMRI study of syntactic surprisal in reading , 2016, NeuroImage.

[23]  Peter Hagoort,et al.  Retrieval and unification of syntactic structure in sentence comprehension: an FMRI study using word-category ambiguity. , 2009, Cerebral cortex.

[24]  Tilo Kircher,et al.  Neural correlates of narrative shifts during auditory story comprehension , 2009, NeuroImage.

[25]  Leila Wehbe,et al.  Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain) , 2019, NeurIPS.

[26]  Ana Raposo,et al.  The contribution of fronto-parietal regions to sentence comprehension: Insights from the Moses illusion , 2013, NeuroImage.

[27]  S. Dehaene,et al.  Cortical representation of the constituent structure of sentences , 2011, Proceedings of the National Academy of Sciences.

[28]  William W. Graves,et al.  Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. , 2009, Cerebral cortex.

[29]  Roger Levy,et al.  Neural language models as psycholinguistic subjects: Representations of syntactic state , 2019, NAACL.

[30]  Benedikt A. Poser,et al.  Multi-echo fMRI: A review of applications in fMRI denoising and analysis of BOLD signals , 2017, NeuroImage.

[31]  John Hale,et al.  LSTMs Can Learn Syntax-Sensitive Dependencies Well, But Modeling Structure Makes Them Better , 2018, ACL.

[32]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[33]  R W Cox,et al.  AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. , 1996, Computers and biomedical research, an international journal.

[34]  Richard Futrell,et al.  The Natural Stories corpus: a reading-time corpus of English texts containing rare syntactic constructions , 2020, Language Resources and Evaluation.

[35]  Allyson Ettinger,et al.  What BERT Is Not: Lessons from a New Suite of Psycholinguistic Diagnostics for Language Models , 2019, TACL.

[36]  Jiang Xu,et al.  Language in context: emergent features of word, sentence, and narrative comprehension , 2005, NeuroImage.

[37]  Alexander G. Huth,et al.  Incorporating Context into Language Encoding Models for fMRI , 2018, bioRxiv.

[38]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[39]  Allyson Ettinger,et al.  Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming , 2020, EMNLP.

[40]  Eleanor A. Maguire,et al.  Studying the freely-behaving brain with fMRI , 2012, NeuroImage.

[41]  Mark Davies The Corpus of Contemporary American English (COCA) , 2012 .

[42]  John Hale,et al.  A Probabilistic Earley Parser as a Psycholinguistic Model , 2001, NAACL.

[43]  Peter Hagoort,et al.  The neurobiology of language beyond single words. , 2014, Annual review of neuroscience.

[44]  Ina Bornkessel-Schlesewsky,et al.  From story comprehension to the neurobiology of language , 2019, Language, Cognition and Neuroscience.

[45]  Jonathan Brennan,et al.  Naturalistic Sentence Comprehension in the Brain , 2016, Lang. Linguistics Compass.

[46]  Emmanuel Dupoux,et al.  Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies , 2016, TACL.

[47]  Allyson Ettinger,et al.  Modeling N400 amplitude using vector space models of word representation , 2016, CogSci.

[48]  Jayaram Chandrashekar,et al.  Sequential Processing of Lexical, Grammatical, and Phonological Information Within Broca's Area , 2009 .

[49]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[50]  John Hale,et al.  Processing MWEs: Neurocognitive Bases of Verbal MWEs and Lexical Cohesiveness within MWEs , 2018, LAW-MWE-CxG@COLING.

[51]  D. Kemmerer Cognitive Neuroscience of Language , 2014 .

[52]  Wen-Ming Luh,et al.  Differentiating BOLD and non-BOLD signals in fMRI time series using multi-echo EPI , 2012, NeuroImage.

[53]  William Schuler,et al.  fMRI reveals language-specific predictive coding during naturalistic sentence comprehension , 2020, Neuropsychologia.

[54]  W. Luh,et al.  Abstract linguistic structure correlates with temporal activity during naturalistic comprehension , 2016, Brain and Language.

[55]  John Hale,et al.  The Role of Syntax During Pronoun Resolution: Evidence from fMRI , 2018 .

[56]  Thomas E. Nichols,et al.  Handbook of Functional MRI Data Analysis: Index , 2011 .

[57]  R. Malach,et al.  Syntactic structure building in the anterior temporal lobe during natural story listening , 2012, Brain and Language.

[58]  Tal Linzen,et al.  A Neural Model of Adaptation in Reading , 2018, EMNLP.

[59]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[60]  L. Elliot Hong,et al.  Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech , 2018, Current Biology.

[61]  Aya Meltzer-Asscher,et al.  Neurocognitive mechanisms of verb argument structure processing , 2014 .

[62]  Tom M. Mitchell,et al.  Aligning context-based statistical models of language with brain activity during reading , 2014, EMNLP.