Expectation-based Comprehension: Modeling the Interaction of World Knowledge and Linguistic Experience

ABSTRACT The processing difficulty of each word we encounter in a sentence is affected by both our prior linguistic experience and our general knowledge about the world. Computational models of incremental language processing have, however, been limited in accounting for the influence of world knowledge. We develop an incremental model of language comprehension that constructs—on a word-by-word basis—rich, probabilistic situation model representations. To quantify linguistic processing effort, we adopt Surprisal Theory, which asserts that the processing difficulty incurred by a word is inversely proportional to its expectancy (Hale, 2001; Levy, 2008). In contrast with typical language model implementations of surprisal, the proposed model instantiates a novel comprehension-centric metric of surprisal that reflects the likelihood of the unfolding utterance meaning as established after processing each word. Simulations are presented that demonstrate that linguistic experience and world knowledge are integrated in the model at the level of interpretation and combine in determining online expectations.

[1]  John T. Hale,et al.  What a Rational Parser Would Do , 2011, Cogn. Sci..

[2]  P. Hagoort,et al.  The interaction of discourse context and world knowledge in online sentence comprehension. Evidence from the N400 , 2007, Brain Research.

[3]  E. Gibson The dependency locality theory: A distance-based theory of linguistic complexity. , 2000 .

[4]  P. Johnson-Laird,et al.  Mental Models: Towards a Cognitive Science of Language, Inference, and Consciousness , 1985 .

[5]  D. Rumelhart NOTES ON A SCHEMA FOR STORIES , 1975 .

[6]  Jeffrey L. Elman,et al.  A Model of Event Knowledge , 2019, CogSci.

[7]  Colin M. Brown,et al.  When and how do listeners relate a sentence to the wider discourse? Evidence from the N400 effect. , 2003, Brain research. Cognitive brain research.

[8]  John C J Hoeks,et al.  Seeing words in context: the interaction of lexical and sentence level information during reading. , 2004, Brain research. Cognitive brain research.

[9]  Julie C. Sedivy,et al.  Subject Terms: Linguistics Language Eyes & eyesight Cognition & reasoning , 1995 .

[10]  Gina R. Kuperberg,et al.  Neural mechanisms of language comprehension: Challenges to syntax , 2007, Brain Research.

[11]  Mathieu Koppen,et al.  Modeling knowledge-based inferences in story comprehension , 2003, Cogn. Sci..

[12]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[13]  Gabriella Vigliocco,et al.  Sentence Comprehension as Mental Simulation: An Information-Theoretic Perspective , 2011, Inf..

[14]  G. Altmann,et al.  Incremental interpretation at verbs: restricting the domain of subsequent reference , 1999, Cognition.

[15]  Murray Singer,et al.  Exploring Individual Differences in Language Validation , 2014 .

[16]  Simon Garrod,et al.  The Contribution of Lexical and Situational Knowledge to Resolving Discourse Roles: Bonding and Resolution , 2000 .

[17]  John R. Anderson The Adaptive Character of Thought , 1990 .

[18]  I. Rooij,et al.  Connectionist semantic systematicity , 2009, Cognition.

[19]  Walter Kintsch,et al.  Predication , 2001, Cogn. Sci..

[20]  Colin M. Brown,et al.  Anticipating upcoming words in discourse: evidence from ERPs and reading times. , 2005, Journal of experimental psychology. Learning, memory, and cognition.

[21]  Daniel Jurafsky,et al.  A Probabilistic Model of Lexical and Syntactic Access and Disambiguation , 1996, Cogn. Sci..

[22]  Matthew W. Crocker,et al.  A Neurocomputational Model of the N400 and the P600 in Language Processing , 2016, Cognitive science.

[23]  Walter Kintsch,et al.  Comprehension: A Paradigm for Cognition , 1998 .

[24]  M. Crocker,et al.  On the Proper Treatment of the N400 and P600 in Language Comprehension , 2017, Front. Psychol..

[25]  Tobias Richter,et al.  Does Validation During Language Comprehension Depend on an Evaluative Mindset? , 2014 .

[26]  T. Trabasso,et al.  Modeling causal integration and availability of information during comprehension of narrative texts. , 1999 .

[27]  G. McKoon,et al.  The readiness is all: The functionality of memory‐based text processing , 1998 .

[28]  Katherine A. DeLong,et al.  Probabilistic word pre-activation during language comprehension inferred from electrical brain activity , 2005, Nature Neuroscience.

[29]  Matthew W. Crocker,et al.  The influence of the immediate visual context on incremental thematic role-assignment: evidence from eye-movements in depicted events , 2005, Cognition.

[30]  John Hale,et al.  Uncertainty About the Rest of the Sentence , 2006, Cogn. Sci..

[31]  Ellen F. Lau,et al.  A cortical network for semantics: (de)constructing the N400 , 2008, Nature Reviews Neuroscience.

[32]  Roger Levy,et al.  Cloze but no cigar: The complex relationship between cloze, corpus, and subjective probabilities in language processing , 2011, CogSci.

[33]  Robin K. Morris,et al.  Lexical and message-level sentence context effects on fixation times in reading. , 1994, Journal of experimental psychology. Learning, memory, and cognition.

[34]  Edward J. O'Brien,et al.  Knowledge Activation, Integration, and Validation During Narrative Text Comprehension , 2014 .

[35]  Jason E. Albrecht,et al.  Comprehension strategies in the development of a mental model. , 1992, Journal of experimental psychology. Learning, memory, and cognition.

[36]  Murray Singer,et al.  Verification of Text Ideas during Reading. , 2006 .

[37]  D. Plaut,et al.  A neurally plausible Parallel Distributed Processing model of Event-Related Potential word reading data , 2012, Brain and Language.

[38]  Roger C. Schank,et al.  Scripts, plans, goals and understanding: an inquiry into human knowledge structures , 1978 .

[39]  R. Levy Expectation-based syntactic comprehension , 2008, Cognition.

[40]  Thomas F Münte,et al.  Cerebral Cortex Advance Access published July 21, 2007 Visual Scenes Trigger Immediate Syntactic Reanalysis: Evidence from ERPs during Situated Spoken Comprehension , 2022 .

[41]  Marcus Nyström,et al.  Semantic override of low-level features in image viewing - both initially and overall , 2008 .

[42]  Hartmut Fitz,et al.  Getting real about Semantic Illusions: Rethinking the functional role of the P600 in language comprehension , 2012, Brain Research.

[43]  Marte Otten,et al.  Discourse-Based Word Anticipation During Language Processing: Prediction or Priming? , 2008 .

[44]  David J. Hess,et al.  Effects of global and local context on lexical processing during language comprehension , 1995 .

[45]  Van Berkum,et al.  The neuropragmatics of 'simple' utterance comprehension: An ERP review , 2009 .

[46]  S. Frank,et al.  The ERP response to the amount of information conveyed by words in sentences , 2015, Brain and Language.

[47]  Brian Roark,et al.  Deriving lexical and syntactic expectation-based measures for psycholinguistic modeling via incremental top-down parsing , 2009, EMNLP.

[48]  Kara D. Federmeier,et al.  Electrophysiology reveals semantic memory use in language comprehension , 2000, Trends in Cognitive Sciences.

[49]  Johan Bos,et al.  The Groningen Meaning Bank , 2013, JSSP.

[50]  David L. Davidson,et al.  The Logical Form of Action Sentences , 2001 .

[51]  Jason E. Albrecht,et al.  Updating a mental model: maintaining both local and global coherence , 1993 .

[52]  Ellen F. Lau,et al.  Comprehenders Rationally Adapt Semantic Predictions to the Statistics of the Local Environment: a Bayesian Model of Trial-by-Trial N400 Amplitudes , 2017, CogSci.

[53]  D. Bobrow,et al.  Representation and Understanding: Studies in Cognitive Science , 1975 .

[54]  Mark F. St. John,et al.  The Story Gestalt: A Model of Knowledge-Intensive Processes in Text Comprehension , 1992, Cogn. Sci..

[55]  R. Ratcliff,et al.  A retrieval theory of priming in memory. , 1988, Psychological review.

[56]  James L. McClelland,et al.  Learning and Applying Contextual Constraints in Sentence Comprehension , 1990, Artif. Intell..

[57]  Marshall R. Mayberry,et al.  Situated sentence processing: The coordinated interplay account and a neurobehavioral model , 2010, Brain and Language.

[58]  J. Elman,et al.  Generalized event knowledge activation during online sentence comprehension. , 2012, Journal of memory and language.

[59]  David E. Rumelhart,et al.  Toward an interactive model of reading. , 1994 .

[60]  Terence Parsons,et al.  Events in the Semantics of English: A Study in Subatomic Semantics , 1990 .

[61]  P. Gordon,et al.  The interplay of discourse congruence and lexical association during sentence processing: Evidence from ERPs and eye tracking. , 2007, Journal of memory and language.

[62]  W. Kintsch The role of knowledge in discourse comprehension: a construction-integration model. , 1988, Psychological review.

[63]  John Hoeks,et al.  Modeling the Noun Phrase versus Sentence Coordination Ambiguity in Dutch: Evidence from Surprisal Theory , 2010, CMCL@ACL.

[64]  Murray Singer,et al.  Validation in Reading Comprehension , 2013 .

[65]  John C. J. Hoeks,et al.  A time and place for language comprehension: mapping the N400 and the P600 to a minimal cortical network , 2013, Front. Hum. Neurosci..

[66]  Steven A. Hillyard,et al.  Word Expectancy and Event-Related Brain Potentials During Sentence Processing , 2019, Preparatory States & Processes.

[67]  Martin Paczynski,et al.  Establishing Causal Coherence across Sentences: An ERP Study , 2011, Journal of Cognitive Neuroscience.

[68]  Edward J. O'Brien,et al.  Coherence Threshold and the Continuity of Processing: The RI-Val Model of Comprehension , 2016 .

[69]  M. Crocker,et al.  Teasing apart coercion and surprisal: Evidence from eye-movements and ERPs , 2017, Cognition.

[70]  Matthias Schlesewsky,et al.  An alternative perspective on “semantic P600” effects in language comprehension , 2008, Brain Research Reviews.

[71]  M. Tanenhaus,et al.  Modeling the Influence of Thematic Fit (and Other Constraints) in On-line Sentence Comprehension , 1998 .

[72]  Richard M. Golden,et al.  A parallel distributed processing model of story comprehension and recall , 1993 .

[73]  John Hale,et al.  A Probabilistic Earley Parser as a Psycholinguistic Model , 2001, NAACL.

[74]  Rolf A. Zwaan,et al.  Situation models in language comprehension and memory. , 1998, Psychological bulletin.

[75]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[76]  M. Crocker,et al.  On the predictability of event boundaries in discourse: An ERP investigation , 2017, Memory & Cognition.

[77]  Richard J. Gerrig,et al.  The Scope of Memory-Based Processing , 2005 .

[78]  Wietske Vonk,et al.  World Knowledge in Computational Models of Discourse Comprehension , 2007 .

[79]  Tobias Richter,et al.  Validation and Comprehension of Text Information: Two Sides of the Same Coin , 2015 .

[80]  P. Hagoort,et al.  Integration of Word Meaning and World Knowledge in Language Comprehension , 2004, Science.

[81]  Nathaniel J. Smith,et al.  Optimal Processing Times in Reading: A Formal Model and Empirical Investigation , 2008 .

[82]  Frank Keller,et al.  Data from eye-tracking corpora as evidence for theories of syntactic processing complexity , 2008, Cognition.

[83]  Stefan L. Frank,et al.  Surprisal-based comparison between a symbolic and a connectionist model of sentence processing , 2009 .

[84]  Jerome L. Myers,et al.  Processing discourse roles in scripted narratives: The influences of context and world knowledge , 2004 .

[85]  Harm Brouwer,et al.  The electrophysiology of language comprehension A neurocomputational model , 2010 .

[86]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[87]  Van Berkum,et al.  The electrophysiology of discourse and conversation , 2012 .

[88]  E. Gibson Linguistic complexity: locality of syntactic dependencies , 1998, Cognition.

[89]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[90]  Mante S. Nieuwland,et al.  When Peanuts Fall in Love: N400 Evidence for the Power of Discourse , 2005, Journal of Cognitive Neuroscience.

[91]  David C. Plaut,et al.  A connectionist model of sentence comprehension and production , 2002 .

[92]  Jerome L. Myers,et al.  Accessing the discourse representation during reading , 1998 .

[93]  Reinhold Kliegl,et al.  Parsing costs as predictors of reading difficulty: An evaluation using the Potsdam Sentence Corpus , 2008, Journal of Eye Movement Research.

[94]  Colin M. Brown,et al.  Semantic Integration in Sentences and Discourse: Evidence from the N400 , 1999, Journal of Cognitive Neuroscience.