Coreference information guides human expectations during natural reading

Models of human sentence processing effort tend to focus on costs associated with retrieving structures and discourse referents from memory (memory-based) and/or on costs associated with anticipating upcoming words and structures based on contextual cues (expectation-based) (Levy, 2008). Although evidence suggests that expectation and memory may play separable roles in language comprehension (Levy et al., 2013), theories of coreference processing have largely focused on memory: how comprehenders identify likely referents of linguistic expressions. In this study, we hypothesize that coreference tracking also informs human expectations about upcoming words, and we test this hypothesis by evaluating the degree to which incremental surprisal measures generated by a novel coreference-aware semantic parser explain human response times in a naturalistic self-paced reading experiment. Results indicate (1) that coreference information indeed guides human expectations and (2) that coreference effects on memory retrieval may exist independently of coreference effects on expectations. Together, these findings suggest that the language processing system exploits coreference information both to retrieve referents from memory and to anticipate upcoming material.
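As an illustration of the surprisal measure mentioned above, the following minimal sketch computes word-by-word surprisal as the negative log probability of each word given its preceding context. The probabilities are hypothetical placeholders, not output from the coreference-aware parser described here, and the function name `surprisal` is an assumption made for this example.

```python
import math


def surprisal(prob: float) -> float:
    """Return the surprisal, in bits, of a word with conditional probability `prob`."""
    if not 0.0 < prob <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(prob)


# Hypothetical conditional probabilities P(word | preceding context).
# In an actual study these would come from an incremental language model or parser.
toy_probs = [("she", 0.5), ("reads", 0.25), ("quickly", 0.125)]

# Less probable words receive higher surprisal.
surprisals = [(word, surprisal(p)) for word, p in toy_probs]
```

In an expectation-based analysis, per-word values like these would serve as predictors of reading times in a regression model.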

[1]  William Schuler,et al.  Accurate Unbounded Dependency Recovery using Generalized Categorial Grammars , 2012, COLING.

[2]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[3]  Eric Bach,et al.  Discontinuous constituents in generalized categorial grammars , 1981 .

[4]  Frank Keller,et al.  Data from eye-tracking corpora as evidence for theories of syntactic processing complexity , 2008, Cognition.

[5]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[6]  Ian Cunnings,et al.  Variable Binding and Coreference in Sentence Comprehension: Evidence from Eye-Movements. , 2014 .

[7]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[8]  Luke S. Zettlemoyer,et al.  End-to-end Neural Coreference Resolution , 2017, EMNLP.

[9]  Yejin Choi,et al.  Dynamic Entity Representations in Neural Language Models , 2017, EMNLP.

[10]  Evelina Fedorenko,et al.  The syntactic complexity of Russian relative clauses , 2012, Journal of memory and language.

[11]  William Schuler,et al.  Left-Corner Parsing With Distributed Associative Memory Produces Surprisal and Locality Effects. , 2018, Cognitive science.

[12]  Richard Futrell,et al.  Lossy‐Context Surprisal: An Information‐Theoretic Model of Memory Effects in Sentence Processing , 2020, Cogn. Sci..

[13]  Barbara Di Eugenio,et al.  A Corpus-based Evaluation of Centering Theory , 2002 .

[14]  Philipp Koehn,et al.  Scalable Modified Kneser-Ney Language Model Estimation , 2013, ACL.

[15]  Alexander M. Rush,et al.  Learning Global Features for Coreference Resolution , 2016, NAACL.

[16]  A. Almor.  Noun-phrase anaphora and focus: The informational load hypothesis , 1999 .

[17]  Omer Levy,et al.  What Does BERT Look at? An Analysis of BERT’s Attention , 2019, BlackboxNLP@ACL.

[18]  Richard Futrell,et al.  The Natural Stories Corpus , 2017, LREC.

[19]  Martin Kay,et al.  Syntactic Process , 1979, ACL.

[20]  James R. Curran,et al.  Limited memory incremental coreference resolution , 2014, COLING.

[21]  Joel Nothman,et al.  Using mention accessibility to improve coreference resolution , 2016, ACL.

[22]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[23]  S. Frank,et al.  Insensitivity of the Human Sentence-Processing System to Hierarchical Structure , 2011, Psychological science.

[24]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[25]  Lee H. Wurm,et al.  What residualizing predictors in regression analyses does (and what it does not do) , 2014 .

[26]  Evan Jaffe,et al.  Coreference and Focus in Reading Times , 2018, CMCL.

[27]  John Hale,et al.  Finding syntax in human encephalography with beam search , 2018, ACL.

[28]  Randall Hendrick,et al.  The Representation and Processing of Coreference in Discourse , 1998, Cogn. Sci..

[29]  Mira Ariel.  Accessibility theory: An overview , 2001 .

[30]  Ronald Christensen,et al.  GRAPHICAL VIEWS OF SUPPRESSION AND MULTICOLLINEARITY IN MULTIPLE LINEAR REGRESSION , 2006 .

[31]  Brian Roark,et al.  Probabilistic Top-Down Parsing and Language Modeling , 2001, CL.

[32]  J. V. Berkum,et al.  On the use of verb-based implicit causality in sentence comprehension: Evidence from self-paced reading and eye tracking , 2006 .

[33]  Hannah Rohde,et al.  Pronominal reference and pragmatic enrichment: a Bayesian analysis , 2015 .

[34]  John Hale,et al.  A Probabilistic Earley Parser as a Psycholinguistic Model , 2001, NAACL.

[35]  William Schuler,et al.  Memory-bounded Neural Incremental Parsing for Psycholinguistic Prediction , 2020, IWPT.

[36]  Nathaniel J. Smith,et al.  The effect of word predictability on reading time is logarithmic , 2013, Cognition.

[37]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[38]  R. Levy.  Expectation-based syntactic comprehension , 2008, Cognition.

[39]  Omer Levy,et al.  Dependency-Based Word Embeddings , 2014, ACL.

[40]  A. Garnham,et al.  The Locus of Implicit Causality Effects in Comprehension. , 1996, Journal of memory and language.

[41]  Joshua K. Hartshorne.  What is implicit causality? , 2014 .

[42]  Noah A. Smith,et al.  Recurrent Neural Network Grammars , 2016, NAACL.

[43]  Martin J. Pickering,et al.  The Time Course of the Influence of Implicit Causality Information: Focusing versus Integration Accounts , 2000 .

[44]  R. Ratcliff,et al.  Pronoun resolution and discourse models. , 1992, Journal of experimental psychology. Learning, memory, and cognition.

[45]  Lyn Frazier,et al.  The interaction of syntax and semantics during sentence processing: eye movements in the analysis of semantically biased sentences , 1983 .

[46]  P. Johnson-Laird,et al.  Mental Models: Towards a Cognitive Science of Language, Inference, and Consciousness , 1985 .

[47]  William Schuler,et al.  A Model of Language Processing as Hierarchic Sequential Prediction , 2013, Top. Cogn. Sci..

[48]  A computational model of human coreference judgements , 2022 .

[49]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[50]  Reinhold Kliegl,et al.  The cave of Shadows. Addressing the human factor with generalized additive mixed models , 2015, 1511.03120.