Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation

[1]  Nello Cristianini,et al.  Learning Performance of a Machine Translation System: a Statistical and Computational Analysis , 2008, WMT@ACL.

[2]  Rohit Prasad,et al.  A Semi-Supervised Batch-Mode Active Learning Strategy for Improved Statistical Machine Translation , 2010, CoNLL.

[3]  Jun'ichi Tsujii,et al.  Text Categorization with All Substring Features , 2009, SDM.

[4]  David Chiang,et al.  Hierarchical Phrase-Based Translation , 2007, CL.

[5]  Satoshi Nakamura,et al.  Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff , 2014, TACL.

[6]  Franz Josef Och,et al.  Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[7]  Udo Hahn,et al.  Semi-Supervised Active Learning for Sequence Labeling , 2009, ACL.

[8]  Gholamreza Haffari,et al.  Active Learning for Statistical Phrase-based Machine Translation , 2009, NAACL.

[9]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[10]  Mark Craven,et al.  An Analysis of Active Learning Strategies for Sequence Labeling Tasks , 2008, EMNLP.

[11]  Rohit Prasad,et al.  Discriminative Sample Selection for Statistical Machine Translation , 2010, EMNLP.

[12]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[13]  Toshiaki Nakazawa,et al.  ASPEC: Asian Scientific Paper Excerpt Corpus , 2016, LREC.

[14]  Stefan Riezler,et al.  Analyzing Parallelism and Domain Similarities in the MAREC Patent Corpus , 2012, IRFC.

[15]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[16]  Francisco Casacuberta,et al.  Active learning for interactive machine translation , 2012, EACL.

[17]  Kenneth Heafield,et al.  KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.

[18]  Alexander H. Waibel,et al.  Low Cost Portability for Statistical Machine Translation based on N-gram Frequency and TF-IDF , 2005, IWSLT.

[19]  Kenneth Ward Church,et al.  Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus , 2001, Computational Linguistics.

[20]  Tomoki Toda,et al.  Ckylark: A More Robust PCFG-LA Parser , 2015, HLT-NAACL.

[21]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[22]  Ulrich Germann.  Dynamic Phrase Tables for Machine Translation in an Interactive Post-editing Scenario , 2014.

[23]  Philipp Koehn,et al.  Statistical Significance Tests for Machine Translation Evaluation , 2004, EMNLP.

[24]  Satoshi Nakamura,et al.  Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation , 2016, NAACL.

[25]  Gholamreza Haffari,et al.  Active Learning for Multilingual Statistical Machine Translation , 2009, ACL.

[26]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[27]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[28]  Chris Callison-Burch,et al.  Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation , 2010, ACL.

[29]  Hal Daumé,et al.  Domain Adaptation for Machine Translation by Mining Unseen Words , 2011, ACL.

[30]  Hiroki Arimura,et al.  Linear-Time Longest-Common-Prefix Computation in Suffix Arrays and Its Applications , 2001, CPM.

[31]  Graham Neubig,et al.  Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis , 2011, ACL.

[32]  Jörg Tiedemann,et al.  News from OPUS — A collection of multilingual parallel corpora with tools and interfaces , 2009.