论文信息 - Combining Constituent Parsers

Combining Constituent Parsers

Combining the 1-best output of multiple parsers via parse selection or parse hybridization improves f-score over the best individual parser (Henderson and Brill, 1999; Sagae and Lavie, 2006). We propose three ways to improve upon existing methods for parser combination. First, we propose a method of parse hybridization that recombines context-free productions instead of constituents, thereby preserving the structure of the output of the individual parsers to a greater extent. Second, we propose an efficient linear-time algorithm for computing expected f-score using Minimum Bayes Risk parse selection. Third, we extend these parser combination methods from multiple 1-best outputs to multiple n-best outputs. We present results on WSJ section 23 and also on the English side of a Chinese-English parallel corpus.

Kevin Knight | Victoria Fossum | Victoria Fossum | Kevin Knight

[1] Mirella Lapata,et al. Proceedings of EMNLP 2004 , 2004 .

[2] Eugene Charniak,et al. Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[3] Dan Klein,et al. Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[4] Chris Quirk,et al. The impact of parse quality on syntactically-informed statistical machine translation , 2006, EMNLP.

[5] Daniel M. Bikel,et al. Design of a multi-lingual, parallel-processing statistical parsing engine , 2002 .

[6] Dan Klein,et al. Accurate Unlexicalized Parsing , 2003, ACL.

[7] Alon Lavie,et al. Parser Combination by Reparsing , 2006, NAACL.

[8] Hopkins UniversityBaltimore. Exploiting Diversity in Natural Language Processing: Combining Parsers , 1999 .

[9] Michael Collins,et al. Discriminative Reranking for Natural Language Parsing , 2000, CL.