Effective Parallel Corpus Mining using Bilingual Sentence Embeddings
暂无分享,去创建一个
Keith Stevens | Ray Kurzweil | Gustavo Hernández Ábrego | Noah Constant | Yinfei Yang | Yun-Hsuan Sung | Brian Strope | Daniel Cer | Mandy Guo | Qinlan Shen | Heming Ge | Noah Constant | Mandy Guo | Daniel Matthew Cer | Yinfei Yang | Yun-Hsuan Sung | B. Strope | R. Kurzweil | Heming Ge | K. Stevens | Qinlan Shen
[1] Alexandra Antonova,et al. Building a Web-Based Parallel Corpus and Filtering Out Machine-Translated Text , 2011, BUCC@ACL.
[2] MarcuDaniel,et al. Improving Machine Translation Performance by Exploiting Non-Parallel Corpora , 2005 .
[3] Lei Shi,et al. A DOM Tree Alignment Model for Mining Parallel Data from the Web , 2006, ACL.
[4] Nan Hua,et al. Universal Sentence Encoder , 2018, ArXiv.
[5] Christopher C. Yang,et al. Mining English/Chinese Parallel Documents from the World Wide Web , 2002 .
[6] Marcin Junczys-Dowmunt,et al. The United Nations Parallel Corpus v1.0 , 2016, LREC.
[7] Dragos Stefan Munteanu,et al. Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora , 2006, ACL.
[8] Jakob Uszkoreit,et al. Large Scale Parallel Document Mining for Machine Translation , 2010, COLING.
[9] Ray Kurzweil,et al. Learning Semantic Textual Similarity from Conversations , 2018, Rep4NLP@ACL.
[10] Jeff Johnson,et al. Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.
[11] Noah A. Smith,et al. The Web as a Parallel Corpus , 2003, CL.
[12] Matthew Henderson,et al. Efficient Natural Language Response Suggestion for Smart Reply , 2017, ArXiv.
[13] Jian-Yun Nie,et al. Parallel Web text mining for cross-language IR , 2000, RIAO.
[14] Sanjiv Kumar,et al. Nearest Neighbor Search in Google Correlate , 2013 .
[15] Mike Schuster,et al. Japanese and Korean voice search , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Philipp Koehn,et al. Zipporah: a Fast and Scalable Data Cleaning System for Noisy Web-Crawled Parallel Corpora , 2017, EMNLP.
[17] Laurent Besacier,et al. Mining a Comparable Text Corpus for a Vietnamese-French Statistical Machine Translation System , 2009, WMT@EACL.
[18] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[19] Bowen Zhou,et al. ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs , 2015, TACL.
[20] Philipp Koehn,et al. Findings of the 2014 Workshop on Statistical Machine Translation , 2014, WMT@ACL.
[21] Dragos Stefan Munteanu,et al. Improving Machine Translation Performance by Exploiting Non-Parallel Corpora , 2005, CL.
[22] Philip Resnik,et al. Mining the Web for Bilingual Text , 1999, ACL.
[23] Lijun Wu,et al. Achieving Human Parity on Automatic Chinese to English News Translation , 2018, ArXiv.
[24] Eneko Agirre,et al. SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity , 2012, *SEMEVAL.
[25] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[26] Hal Daumé,et al. Deep Unordered Composition Rivals Syntactic Methods for Text Classification , 2015, ACL.
[27] Philipp Koehn,et al. Findings of the 2013 Workshop on Statistical Machine Translation , 2013, WMT@ACL.
[28] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.
[29] Houda Bouamor,et al. H2@BUCC18: Parallel Sentence Extraction from Comparable Corpora Using Multilingual Sentence Embeddings , 2018, BUCC@LREC.
[30] Philippe Langlais,et al. A Deep Neural Network Approach To Parallel Sentence Extraction , 2017, ArXiv.
[31] Hermann Ney,et al. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation , 2002, ACL.
[32] Maria Leonor Pacheco,et al. of the Association for Computational Linguistics: , 2001 .
[33] Holger Schwenk,et al. Filtering and Mining Parallel Data in a Joint Multilingual Space , 2018, ACL.