论文信息 - Asynchronous Binarization for Synchronous Grammars

Asynchronous Binarization for Synchronous Grammars

Binarization of n-ary rules is critical for the efficiency of syntactic machine translation decoding. Because the target side of a rule will generally reorder the source side, it is complex (and sometimes impossible) to find synchronous rule binarizations. However, we show that synchronous binarizations are not necessary in a two-stage decoder. Instead, the grammar can be binarized one way for the parsing stage, then rebinarized in a different way for the reranking stage. Each individual binarization considers only one monolingual projection of the grammar, entirely avoiding the constraints of synchronous binarization and allowing binarizations that are separately optimized for each stage. Compared to n-ary forest reranking, even simple target-side binarization schemes improve overall decoding accuracy.

John DeNero | Dan Klein | Adam Pauls

[1] Daniel Gildea,et al. Synchronous Binarization for Machine Translation , 2006, NAACL.

[2] David Chiang,et al. Forest Rescoring: Faster Decoding with Integrated Language Models , 2007, ACL.

[3] Daniel Gildea,et al. Binarization of Synchronous Context-Free Grammars , 2009, CL.

[4] Dekai Wu,et al. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[5] Liang Huang. Binarization, Synchronous Binarization, and Target-side Binarization , 2007, SSST@HLT-NAACL.

[6] Joshua Goodman,et al. Parsing Algorithms and Metrics , 1996, ACL.

[7] Chin-Yew Lin,et al. Better Binarization for the CKY Parsing , 2008, EMNLP.

[8] Dan Klein,et al. Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[9] John DeNero,et al. Efficient Parsing for Transducer Grammars , 2009, HLT-NAACL.

[10] Daniel Marcu,et al. Scalable Inference and Training of Context-Rich Syntactic Translation Models , 2006, ACL.