暂无分享,去创建一个
Rico Sennrich | Ivan Titov | Barry Haddow | Biao Zhang | Rico Sennrich | B. Haddow | Ivan Titov | Biao Zhang
[1] Tanja Schultz,et al. Using word latice information for a tighter coupling in speech translation systems , 2004, INTERSPEECH.
[2] David Chiang,et al. Tied Multitask Learning for Neural Speech Translation , 2018, NAACL.
[3] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[4] Gholamreza Haffari,et al. Neural Speech Translation using Lattice Transformations and Graph Networks , 2019, EMNLP.
[5] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.
[6] Morgan Sonderegger,et al. Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi , 2017, INTERSPEECH.
[7] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[8] Kai Fan,et al. Lattice Transformer for Speech Translation , 2019, ACL.
[9] Hermann Ney,et al. Speech translation: coupling of recognition and translation , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[10] Shinji Watanabe,et al. ESPnet: End-to-End Speech Processing Toolkit , 2018, INTERSPEECH.
[11] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[12] William J. Byrne,et al. Statistical Phrase-Based Speech Translation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[13] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[14] Zhenglu Yang,et al. Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation , 2020, AAAI.
[15] Wu Guo,et al. Learning Adaptive Downsampling Encoding for Online End-to-End Speech Recognition , 2019, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
[16] Ilya Sutskever,et al. Generating Long Sequences with Sparse Transformers , 2019, ArXiv.
[17] Matthias Sperber,et al. Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation , 2019, TACL.
[18] Xiaofei Wang,et al. A Comparative Study on Transformer vs RNN in Speech Applications , 2019, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[19] Ivan Titov,et al. Interpretable Neural Predictions with Differentiable Binary Variables , 2019, ACL.
[20] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[21] Elizabeth Salesky,et al. Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation , 2019, ACL.
[22] Arya D. McCarthy,et al. Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade , 2019, IWSLT.
[23] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[24] Lin-Shan Lee,et al. Towards End-to-end Speech-to-text Translation with Two-pass Decoding , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[25] Rico Sennrich,et al. On Sparsifying Encoder Outputs in Sequence-to-Sequence Models , 2020, FINDINGS.
[26] Max Welling,et al. Learning Sparse Neural Networks through L0 Regularization , 2017, ICLR.
[27] Ali Can Kocabiyikoglu,et al. Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation , 2018, LREC.
[28] André F. T. Martins,et al. Adaptively Sparse Transformers , 2019, EMNLP.
[29] Yuan Cao,et al. Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[30] Chanwoo Kim,et al. Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning , 2019, ArXiv.
[31] Yang Liu,et al. Towards Robust Neural Machine Translation , 2018, ACL.
[32] Olivier Pietquin,et al. End-to-End Automatic Speech Translation of Audiobooks , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Florian Metze,et al. Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation , 2014, EACL.
[34] Zhenglu Yang,et al. Curriculum Pre-training for End-to-End Speech Translation , 2020, ACL.
[35] Satoshi Nakamura,et al. Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation , 2017, INTERSPEECH.
[36] Mattia Antonino Di Gangi,et al. MuST-C: a Multilingual Speech Translation Corpus , 2019, NAACL.
[37] Jiajun Zhang,et al. End-to-End Speech Translation with Knowledge Distillation , 2019, INTERSPEECH.
[38] Adam Lopez,et al. Low-Resource Speech-to-Text Translation , 2018, INTERSPEECH.
[39] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[40] Navdeep Jaitly,et al. Sequence-to-Sequence Models Can Directly Translate Foreign Speech , 2017, INTERSPEECH.
[41] Jiajun Zhang,et al. Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding , 2019, AAAI.
[42] Steve Renals,et al. Trainable Dynamic Subsampling for End-to-End Speech Recognition , 2019, INTERSPEECH.
[43] David Chiang,et al. An Attentional Model for Speech Translation Without Transcription , 2016, NAACL.
[44] Matt Post,et al. A Call for Clarity in Reporting BLEU Scores , 2018, WMT.
[45] Olivier Pietquin,et al. Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation , 2016, NIPS 2016.
[46] Adam Lopez,et al. Pre-training on high-resource speech recognition improves low-resource speech-to-text translation , 2018, NAACL.
[47] Matteo Negri,et al. Adapting Transformer to End-to-End Spoken Language Translation , 2019, INTERSPEECH.
[48] Lucia Specia,et al. The IWSLT 2019 Evaluation Campaign , 2019, IWSLT.
[49] Sharon Goldwater,et al. Analyzing ASR Pretraining for Low-Resource Speech-to-Text Translation , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[50] Steve Renals,et al. A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition , 2015, INTERSPEECH.