Quantifying Parsing Complexity as a Function of Grammar

Recent research has used measures of syntactic complexity generated by computational models to predict human processing effort in behavioral and neurophysiological experiments. These models often rely on simplifying grammatical assumptions and the dominant view in the literature has been that such simplifications are adequate approximations for processing accounts. However, the effect of chosen grammar on processing complexity has not been empirically investigated in a natural text. We compared the impact of grammar on estimated processing complexity for a 12 minute English narrative. One grammar was based on the Penn Treebank 2 schema, and the second was based in the Minimalist Grammar framework and represented as an X-bar schema. The impact of grammar was contrasted against the impact of parsing strategy using two stack-based strategies and complexity was measured in terms of the number of nodes built and the stack depth word-by-word. The impact of grammar choice matched or exceeded the impact of parsing strategy. Thus even for a single, fixed corpus and a fixed parsing strategy, the grammar matters.

[1]  Edward P. Stabler,et al.  Computational Perspectives on Minimalism , 2011 .

[2]  Yosef Grodzinsky,et al.  Neuroimaging of syntax and syntactic processing , 2006, Current Opinion in Neurobiology.

[3]  R. Larson On the double object construction , 1988 .

[4]  Lenhart Schubert,et al.  Guiding a Linguistically Well-Founded Parser with Head Patterns , 2001 .

[5]  A. Friederici,et al.  Auditory Language Comprehension: An Event-Related fMRI Study on the Processing of Syntactic and Lexical Information , 2000, Brain and Language.

[6]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[7]  John A. Hawkins,et al.  A Performance Theory of Order and Constituency , 1995 .

[8]  Mark Johnson,et al.  Memory requirements and local ambiguities of parsing strategies , 1991 .

[9]  Victor H. Yngve,et al.  A model and an hypothesis for language structure , 1960 .

[10]  Richard S. Kayne The Antisymmetry of Syntax , 1994 .

[11]  Liina Pylkkänen,et al.  The time-course and spatial distribution of brain activity associated with sentence processing , 2012, NeuroImage.

[12]  Mark C. Baker,et al.  Thematic Roles and Syntactic Structure , 1997 .

[13]  John Hale,et al.  Uncertainty About the Rest of the Sentence , 2006, Cogn. Sci..

[14]  Edward P. Stabler,et al.  An Introduction To Syntactic Analysis And Theory , 2016 .

[15]  Evelyn C. Ferstl,et al.  The extended language network: A meta‐analysis of neuroimaging studies on text comprehension , 2008, Human brain mapping.

[16]  Daniel M. Bikel,et al.  Intricacies of Collins’ Parsing Model , 2004, CL.

[17]  Johan Bos,et al.  Rebanking CCGbank for Improved NP Interpretation , 2010, ACL.

[18]  M. Baker,et al.  Passive Arguments Raised , 2018, Diachronic and Comparative Syntax.

[19]  Edwin Martin,et al.  Grammatical factors in sentence retention , 1966 .

[20]  A. Syrota,et al.  The Cortical Representation of Speech , 1993, Journal of Cognitive Neuroscience.

[21]  Reinhold Kliegl,et al.  Parsing costs as predictors of reading difficulty: An evaluation using the Potsdam Sentence Corpus , 2008, Journal of Eye Movement Research.

[22]  R. Malach,et al.  Syntactic structure building in the anterior temporal lobe during natural story listening , 2012, Brain and Language.

[23]  A. Bachrach Imaging neural correlates of syntactic complexity in a naturalistic context , 2008 .

[24]  Liliane Haegeman,et al.  Introduction to Government and Binding Theory , 1991 .

[25]  Carlo Cecchetto,et al.  Introduction to Government and Binding Theory , 1996 .

[26]  Rocky Ross,et al.  Mental models , 2004, SIGA.

[27]  Barry K. Rosen,et al.  Syntactic Complexity , 1974, Inf. Control..

[28]  Frank Keller,et al.  Data from eye-tracking corpora as evidence for theories of syntactic processing complexity , 2008, Cognition.

[29]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[30]  N. Dronkers,et al.  Lesion analysis of the brain areas involved in language comprehension , 2004, Cognition.

[31]  Daniel M. Bikel,et al.  Design of a multi-lingual, parallel-processing statistical parsing engine , 2002 .

[32]  Brian Roark,et al.  Deriving lexical and syntactic expectation-based measures for psycholinguistic modeling via incremental top-down parsing , 2009, EMNLP.

[33]  Vera Demberg-Winterfors,et al.  Broad-coverage model of prediction in human sentence processing , 2010 .

[34]  E. Gibson Linguistic complexity: locality of syntactic dependencies , 1998, Cognition.

[35]  G. A. Miller,et al.  Finitary models of language users , 1963 .

[36]  L. Stowe,et al.  Rethinking the neurological basis of language , 2005 .

[37]  Philip Resnik,et al.  Left-Corner Parsing and Psychological Plausibility , 1992, COLING.

[38]  Christopher D. Manning,et al.  Probabilistic models of word order and syntactic discontinuity , 2005 .