Alibaba Submission to the WMT20 Parallel Corpus Filtering Task
暂无分享,去创建一个
[1] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[2] Noah A. Smith,et al. The Web as a Parallel Corpus , 2003, CL.
[3] Hermann Ney,et al. Automatic Filtering of Bilingual Corpora for Statistical Machine Translation , 2005, NLDB.
[4] Xiaoyi Ma,et al. Champollion: A Robust Parallel Text Sentence Aligner , 2006, LREC.
[5] Chih-Jen Lin,et al. A Practical Guide to Support Vector Classication , 2008 .
[6] Eiichiro Sumita,et al. Method of Selecting Training Data to Build a Compact and Efficient Translation Model , 2008, IJCNLP.
[7] William D. Lewis,et al. Intelligent Selection of Language Model Training Data , 2010, ACL.
[8] Shahram Khadivi,et al. A discriminative approach to filter out noisy sentence pairs from bilingual corpora , 2010, 2010 5th International Symposium on Telecommunications.
[9] Deniz Yuret,et al. Instance Selection for Machine Translation using Feature Decay Algorithms , 2011, WMT@EMNLP.
[10] Jaime G. Carbonell,et al. Active learning and crowdsourcing for machine translation in low resource scenarios , 2012 .
[11] Alon Lavie,et al. The CMU-Avenue French-English Translation System , 2012, WMT@NAACL-HLT.
[12] Philipp Koehn,et al. Scalable Modified Kneser-Ney Language Model Estimation , 2013, ACL.
[13] Noah A. Smith,et al. A Simple, Fast, and Effective Reparameterization of IBM Model 2 , 2013, NAACL.
[14] Mari Ostendorf,et al. Data Selection With Fewer Words , 2015, WMT@EMNLP.
[15] George F. Foster,et al. Bilingual Methods for Adaptive Training Data Selection for Machine Translation , 2016, AMTA.
[16] Philipp Koehn,et al. Six Challenges for Neural Machine Translation , 2017, NMT@ACL.
[17] Boxing Chen,et al. Alibaba Submission to the WMT20 Parallel Corpus Filtering Task , 2020, WMT@EMNLP.
[18] Marcin Junczys-Dowmunt,et al. Dual Conditional Cross-Entropy Filtering of Noisy Parallel Corpora , 2018, WMT.
[19] Huda Khayrallah,et al. On the Impact of Various Types of Noise on Neural Machine Translation , 2018, NMT@ACL.
[20] Guillaume Lample,et al. Unsupervised Machine Translation Using Monolingual Corpora Only , 2017, ICLR.
[21] Philipp Koehn,et al. Low-Resource Corpus Filtering Using Multilingual Sentence Embeddings , 2019, WMT.
[22] R'emi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[23] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .
[24] Guillaume Lample,et al. Cross-lingual Language Model Pretraining , 2019, NeurIPS.