Incremental, Predictive Parsing with Psycholinguistically Motivated Tree-Adjoining Grammar

Psycholinguistic research shows that key properties of the human sentence processor are incrementality, connectedness (partial structures contain no unattached nodes), and prediction (upcoming syntactic structure is anticipated). However, no broad-coverage parsing model currently has all three properties. In this article, we present the first broad-coverage probabilistic parser for PLTAG, a variant of tree-adjoining grammar (TAG) that supports all three requirements. We train our parser on a TAG-transformed version of the Penn Treebank and show that it achieves performance comparable to existing TAG parsers that are incremental but not predictive. We also use our PLTAG model to predict human reading times, demonstrating a better fit on the Dundee eye-tracking corpus than a standard surprisal model.
