Towards Efficient Large-Scale Feature-Rich Statistical Machine Translation

We present the system we developed to provide efficient large-scale feature-rich discriminative training for machine translation. We describe how we integrate with MapReduce using Hadoop streaming to allow arbitrarily scaling the tuning set and utilizing a sparse feature set. We report our findings on German-English and Russian-English translation, and discuss benefits, as well as obstacles, to tuning on larger development sets drawn from the parallel training data.

Jimmy J. Lin | Vladimir Eidelman | Philip Resnik | Ferhan Türe | Ke Wu

[1] Stanley F. Chen,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[2] Jaime G. Carbonell,et al. Large-Scale Discriminative Training for Statistical Machine Translation Using Held-Out Line Search , 2013, HLT-NAACL.

[3] Ralph Weischedel,et al. A Study of Translation Error Rate with Targeted Human Annotation , 2005.

[4] Philip Resnik,et al. Online Large-Margin Training of Syntactic and Structural Translation Features , 2008, EMNLP.

[5] Jimmy J. Lin,et al. Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce , 2013, ACL.

[6] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[7] Koby Crammer,et al. Adaptive regularization of weight vectors , 2009, Machine Learning.

[8] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[9] Mark Hopkins,et al. Tuning as Ranking , 2011, EMNLP.

[10] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res.

[11] Adam Lopez,et al. Hierarchical Phrase-Based Translation with Suffix Arrays , 2007, EMNLP.

[12] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[13] Vladimir Eidelman,et al. Optimization Strategies for Online Large-Margin Learning in Machine Translation , 2012, WMT@NAACL-HLT.

[14] David Chiang,et al. Forest Rescoring: Faster Decoding with Integrated Language Models , 2007, ACL.

[15] Vladimir Eidelman,et al. cdec: A Decoder, Alignment, and Learning Framework for Finite-State and Context-Free Translation Models , 2010, ACL.

[16] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[17] Christopher D. Manning,et al. Fast and Adaptive Online Training of Feature-Rich Translation Models , 2013, ACL.

[18] Chris Dyer,et al. Using a maximum entropy model to build segmentation lattices for MT , 2009, NAACL.

[19] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[20] David Chiang,et al. Hope and Fear for Discriminative Training of Statistical Translation Models , 2012, J. Mach. Learn. Res.

[21] George F. Foster,et al. Batch Tuning Strategies for Statistical Machine Translation , 2012, NAACL.

[22] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[23] Matthew G. Snover,et al. A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[24] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.

[25] Chris Dyer,et al. Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT , 2012, ACL.

[26] Kevin Knight,et al. 11,001 New Features for Statistical Machine Translation , 2009, NAACL.