Stochastic HPSG Parse Disambiguation using the Redwoods Corpus

This article details our experiments on HPSG parse disambiguation, based on the Redwoods treebank. Using existing and novel stochastic models, we evaluate the usefulness of different information sources for disambiguation – lexical, syntactic, and semantic. We perform careful comparisons of generative and discriminative models using equivalent features and show the consistent advantage of discriminatively trained models. Our best system performs at over 76% sentence exact match accuracy.

[1]  T. E. Harris,et al.  The Theory of Branching Processes. , 1963 .

[2]  T. E. Harris,et al.  The Theory of Branching Processes. , 1963 .

[3]  M. Baltin,et al.  The Mental representation of grammatical relations , 1985 .

[4]  Ian H. Witten,et al.  The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.

[5]  Mats Rooth,et al.  Structural Ambiguity and Lexical Relations , 1991, ACL.

[6]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[7]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[8]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[9]  Maryellen C. MacDonald,et al.  Probabilistic constraints and syntactic ambiguity resolution , 1994 .

[10]  Glenn Carroll,et al.  Context-Sensitive Statistics For Improved Grammatical Language Models , 1994, AAAI.

[11]  David M. Magerman Statistical Decision-Tree Models for Parsing , 1995, ACL.

[12]  Nir Friedman,et al.  Learning Bayesian Networks with Local Structure , 1996, UAI.

[13]  J. Trueswell THE ROLE OF LEXICAL FREQUENCY IN SYNTACTIC AMBIGUITY RESOLUTION , 1996 .

[14]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[15]  Eugene Charniak,et al.  Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[16]  Steven P. Abney Stochastic Attribute-Value Grammars , 1996, CL.

[17]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[19]  Mark Johnson,et al.  PCFG Models of Linguistic Tree Representations , 1998, CL.

[20]  M. Marciniak,et al.  Journée ATALA, 18–19 juin 1999, Corpus annotés pour la syntaxe CONSTRUCTION OF AN HPSG TREEBANK FOR POLISH , 1999 .

[21]  Stanley F. Chen,et al.  A Gaussian Prior for Smoothing Maximum Entropy Models , 1999 .

[22]  Mark Johnson,et al.  Estimators for Stochastic “Unification-Based” Grammars , 1999, ACL.

[23]  Srinivas Bangalore,et al.  Supertagging: An Approach to Almost Parsing , 1999, CL.

[24]  Patrick M. Farrell Syntactic Theory: A Formal Introduction (review) , 2001 .

[25]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[26]  Krassimira Ivanova,et al.  Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank , 2002, LREC.

[27]  Christopher D. Manning,et al.  Feature Selection for a Rich HPSG Grammar Using Decision Trees , 2002, CoNLL.

[28]  Christopher D. Manning,et al.  LinGO Redwoods A Rich and Dynamic Treebank for HPSG , 2002 .

[29]  Dan Klein,et al.  Conditional Structure versus Conditional Estimation in NLP Models , 2002, EMNLP.

[30]  Thorsten Brants,et al.  The LinGO Redwoods Treebank: Motivation and Preliminary Applications , 2002, COLING.

[31]  Mark Johnson,et al.  Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques , 2002, ACL.

[32]  Stephan Oepen,et al.  Parse Disambiguation for a Rich HPSG Grammar , 2002 .

[33]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[34]  Stephan Oepen,et al.  Parse Selection on the Redwoods Corpus: 3rd Growth Results , 2003 .

[35]  Christopher D. Manning,et al.  Optimizing Local Probability Models for Statistical Parsing , 2003, ECML.

[36]  P. Kantor Foundations of Statistical Natural Language Processing , 2001, Information Retrieval.

[37]  Ronald M. Kaplan,et al.  Lexical Functional Grammar A Formal System for Grammatical Representation , 2004 .