Parsing with the Shortest Derivation

Common wisdom has it that the bias of stochastic grammars in favor of shorter derivations of a sentence is harmful and should be redressed. We show that the common wisdom is wrong for stochastic grammars that use elementary trees instead of context-free rules, such as Stochastic Tree-Substitution Grammars used by Data-Oriented Parsing models. For such grammars a non-probabilistic metric based on the shortest derivation outperforms a probabilistic metric on the ATIS and OVIS corpora, while it obtains competitive results on the Wall Street Journal (WSJ) corpus. This paper also contains the first published experiments with DOP on the WSJ.
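As a rough illustration of the contrast drawn above, the sketch below ranks two candidate derivations of the same sentence in two ways: by the product of their fragment probabilities, and by the number of fragments used. The fragment names, the probabilities, and the helper functions are all hypothetical and are not taken from the paper. The probabilistic ranking shown is a simplified "most probable derivation" stand-in, not the full DOP model.

```python
from math import prod

# Hypothetical candidate derivations of one sentence. Each derivation is a
# list of (fragment_name, probability) pairs; the values are invented.
derivations = {
    "d1": [("t1", 0.5), ("t2", 0.5), ("t3", 0.5)],  # three fragments
    "d2": [("t4", 0.1), ("t5", 0.9)],               # two fragments
}


def derivation_probability(steps):
    """Product of the fragment probabilities in one derivation."""
    return prod(p for _, p in steps)


def most_probable(candidates):
    """Probabilistic metric (simplified): the highest-probability derivation."""
    return max(candidates, key=lambda name: derivation_probability(candidates[name]))


def shortest(candidates):
    """Non-probabilistic metric: the derivation with the fewest fragments."""
    return min(candidates, key=lambda name: len(candidates[name]))


print(most_probable(derivations))
print(shortest(derivations))
```

With these invented numbers the two metrics pick different derivations: the three-fragment derivation has the higher probability, while the two-fragment derivation is the shortest.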
