论文信息 - Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation - 字舞流文

Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation

Satoshi Nakamura | Graham Neubig | Akiva Miura | Michael Paul | Graham Neubig | Satoshi Nakamura | Michael Paul | Akiva Miura

[1] Nello Cristianini,et al. Learning Performance of a Machine Translation System: a Statistical and Computational Analysis , 2008, WMT@ACL.

[2] Rohit Prasad,et al. A Semi-Supervised Batch-Mode Active Learning Strategy for Improved Statistical Machine Translation , 2010, CoNLL.

[3] Jun'ichi Tsujii,et al. Text Categorization with All Substring Features , 2009, SDM.

[4] David Chiang,et al. Hierarchical Phrase-Based Translation , 2007, CL.

[5] Satoshi Nakamura,et al. Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff , 2014, TACL.

[6] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[7] Udo Hahn,et al. Semi-Supervised Active Learning for Sequence Labeling , 2009, ACL.

[8] Gholamreza Haffari,et al. Active Learning for Statistical Phrase-based Machine Translation , 2009, NAACL.

[9] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[10] Mark Craven,et al. An Analysis of Active Learning Strategies for Sequence Labeling Tasks , 2008, EMNLP.

[11] Rohit Prasad,et al. Discriminative Sample Selection for Statistical Machine Translation , 2010, EMNLP.

[12] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[13] Toshiaki Nakazawa,et al. ASPEC: Asian Scientific Paper Excerpt Corpus , 2016, LREC.

[14] Stefan Riezler,et al. Analyzing Parallelism and Domain Similarities in the MAREC Patent Corpus , 2012, IRFC.

[15] Philipp Koehn,et al. Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[16] Francisco Casacuberta,et al. Active learning for interactive machine translation , 2012, EACL.

[17] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.

[18] Alexander H. Waibel,et al. Low Cost Portability for Statistical Machine Translation based on N-gram Frequency and TF-IDF , 2005, IWSLT.

[19] Kenneth Ward Church,et al. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus , 2001, Computational Linguistics.

[20] Tomoki Toda,et al. Ckylark: A More Robust PCFG-LA Parser , 2015, HLT-NAACL.

[21] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[22] Ulrich Germann. Dynamic Phrase Tables for Machine Translation in an Interactive Post-editing Scenario , 2014 .

[23] Philipp Koehn,et al. Statistical Significance Tests for Machine Translation Evaluation , 2004, EMNLP.

[24] Satoshi Nakamura,et al. Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation , 2016, NAACL.

[25] Gholamreza Haffari,et al. Active Learning for Multilingual Statistical Machine Translation , 2009, ACL.

[26] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[27] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[28] Chris Callison-Burch,et al. Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation , 2010, ACL.

[29] Hal Daumé,et al. Domain Adaptation for Machine Translation by Mining Unseen Words , 2011, ACL.

[30] Hiroki Arimura,et al. Linear-Time Longest-Common-Prefix Computation in Suffix Arrays and Its Applications , 2001, CPM.

[31] Graham Neubig,et al. Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis , 2011, ACL.

[32] Jörg Tiedemann,et al. News from OPUS — A collection of multilingual parallel corpora with tools and interfaces , 2009 .