Paper information - Feature-Rich Translation by Quasi-Synchronous Lattice Parsing - 字舞流文

Feature-Rich Translation by Quasi-Synchronous Lattice Parsing

We present a machine translation framework that can incorporate arbitrary features of both input and output sentences. The core of the approach is a novel decoder based on lattice parsing with quasi-synchronous grammar (Smith and Eisner, 2006), a syntactic formalism that does not require source and target trees to be isomorphic. Using generic approximate dynamic programming techniques, this decoder can handle "non-local" features. Similar approximate inference techniques support efficient parameter estimation with hidden variables. We use the decoder to conduct controlled experiments on a German-to-English translation task, to compare lexical phrase, syntax, and combined models, and to measure effects of various restrictions on non-isomorphism.

Noah A. Smith | Kevin Gimpel

[1] David Chiang,et al. Hierarchical Phrase-Based Translation , 2007, CL.

[2] David A. Smith,et al. Quasi-Synchronous Grammars: Alignment by Soft Projection of Syntactic Dependencies , 2006, WMT@HLT-NAACL.

[3] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[4] Yanjun Ma,et al. Using Supertags as Source Language Context in SMT , 2009, EAMT.

[5] Michael Collins,et al. Hidden-Variable Models for Discriminative Reranking , 2005, HLT.

[6] Dan Klein,et al. Corpus-Based Induction of Syntactic Structure: Models of Dependency and Constituency , 2004, ACL.

[7] Philip Resnik,et al. Online Large-Margin Training of Syntactic and Structural Translation Features , 2008, EMNLP.

[8] Noah A. Smith,et al. Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings , 2009, EACL.

[9] Wolfgang Macherey,et al. Lattice-based Minimum Error Rate Training for Statistical Machine Translation , 2008, EMNLP.

[10] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[11] David A. Smith,et al. Parser Adaptation and Projection with Quasi-Synchronous Grammar Features , 2009, EMNLP.

[12] Adam Lopez,et al. Translation as Weighted Deduction , 2009, EACL.

[13] Srinivas Bangalore,et al. Learning Dependency Translation Models as Collections of Finite-State Head Transducers , 2000, CL.

[14] J. Besag. Statistical Analysis of Non-Lattice Data , 1975.

[15] Michael Collins,et al. Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[16] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[17] Qun Liu,et al. Forest-Based Translation , 2008, ACL.

[18] Xu Sun,et al. Sequential Labeling with Latent Variables: An Exact Inference Algorithm and its Efficient Approximation , 2009, EACL.

[19] Stanley F. Chen,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[20] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[21] Yuan Ding,et al. Machine Translation Using Probabilistic Synchronous Dependency Insertion Grammars , 2005, ACL.

[22] Salim Roukos,et al. Direct Translation Model 2 , 2007, HLT-NAACL.

[23] Daniel Marcu,et al. Scalable Inference and Training of Context-Rich Syntactic Translation Models , 2006, ACL.

[24] Dan Klein,et al. Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[25] Salim Roukos,et al. Feature-based language understanding , 1997, EUROSPEECH.

[26] David Chiang,et al. A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[27] Phil Blunsom,et al. A Discriminative Latent Variable Model for Statistical Machine Translation , 2008, ACL.

[28] Jason Eisner. Bilexical Grammars and a Cubic-time Probabilistic Parser , 1997, IWPT.

[29] Noah A. Smith,et al. Compiling Comp Ling: Weighted Dynamic Programming and the Dyna Language , 2005, HLT.

[30] Daniel Marcu,et al. SPMT: Statistical Machine Translation with Syntactified Target Language Phrases , 2006, EMNLP.

[31] Hermann Ney,et al. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation , 2002, ACL.

[32] David Chiang,et al. Forest Rescoring: Faster Decoding with Integrated Language Models , 2007, ACL.

[33] Ben Taskar,et al. An End-to-End Discriminative Approach to Machine Translation , 2006, ACL.

[34] Kevin Knight,et al. A Syntax-based Statistical Translation Model , 2001, ACL.

[35] Phil Blunsom,et al. Probabilistic Inference for Machine Translation , 2008, EMNLP.

[36] Noah A. Smith,et al. Rich Source-Side Context for Statistical Machine Translation , 2008, WMT@ACL.

[37] Noah A. Smith,et al. What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA , 2007, EMNLP.

[38] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[39] Fernando Pereira,et al. Inside-Outside Reestimation From Partially Bracketed Corpora , 1992, HLT.

[40] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[41] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[42] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[43] Haitao Mi,et al. Forest-based Translation Rule Extraction , 2008, EMNLP.

[44] Jinxi Xu,et al. A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model , 2008, ACL.

[45] Chris Quirk,et al. Dependency Treelet Translation: Syntactically Informed Phrasal SMT , 2005, ACL.