Deep Dependencies from Context-Free Statistical Parsers: Correcting the Surface Dependency Approximation

We present a linguistically motivated algorithm for reconstructing nonlocal dependencies in broad-coverage context-free parse trees derived from treebanks. The algorithm uses loglinear classifiers to augment and reshape context-free trees so as to reintroduce underlying nonlocal dependencies lost in the context-free approximation. We find that our algorithm compares favorably with prior work on English using an existing evaluation metric, and we also introduce and argue for a new dependency-based evaluation metric. By this new metric, our algorithm achieves 60% error reduction on gold-standard input trees and 5% error reduction on state-of-the-art machine-parsed input trees, compared with the best previous work. We also present the first results on nonlocal dependency reconstruction for a language other than English, comparing performance on English and German. Our new evaluation metric quantitatively corroborates the intuition that in a language with freer word order, the surface dependencies in context-free parse trees are a poorer approximation to underlying dependency structure.
